BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 038885
         (406 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|224053575|ref|XP_002297880.1| predicted protein [Populus trichocarpa]
 gi|222845138|gb|EEE82685.1| predicted protein [Populus trichocarpa]
          Length = 412

 Score =  780 bits (2015), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 373/408 (91%), Positives = 387/408 (94%), Gaps = 2/408 (0%)

Query: 1   MWVTAIALLLAFVSADNTPKI--ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLK 58
           +WV A+ L+  FV      ++  ERISGSAGDVL+DDPVG+LKVYVYELPSKYNKKLL K
Sbjct: 5   LWVFAVVLVFGFVDGKEIERLRTERISGSAGDVLDDDPVGRLKVYVYELPSKYNKKLLQK 64

Query: 59  DPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMM 118
           DPRCLTHMFAAEIFMHRFLLSSPVRTLNP+EADWFY+PIYPTCDLTP GLPLPFKSPRMM
Sbjct: 65  DPRCLTHMFAAEIFMHRFLLSSPVRTLNPDEADWFYSPIYPTCDLTPMGLPLPFKSPRMM 124

Query: 119 RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG 178
           RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQR+TLVQTFG
Sbjct: 125 RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRSTLVQTFG 184

Query: 179 QRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGA 238
           QRNHVCLNEGSITIPP+APPQKMQAHQIPPD PRSIFVYFRGLFYDVNNDPEGGYYARGA
Sbjct: 185 QRNHVCLNEGSITIPPFAPPQKMQAHQIPPDIPRSIFVYFRGLFYDVNNDPEGGYYARGA 244

Query: 239 RAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 298
           RAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII
Sbjct: 245 RAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 304

Query: 299 ADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFP 358
           ADDIVLPFADAIPWEEIG+FVAEEDVP LDTILTSIPPEVILRKQRLLANPSMKRAMLFP
Sbjct: 305 ADDIVLPFADAIPWEEIGVFVAEEDVPNLDTILTSIPPEVILRKQRLLANPSMKRAMLFP 364

Query: 359 QPAQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           QPAQPGDAFHQILNGLARKLPHD+S YLK G   LNWT+GPVGDLKPW
Sbjct: 365 QPAQPGDAFHQILNGLARKLPHDRSVYLKSGQNILNWTAGPVGDLKPW 412


>gi|60657598|gb|AAX33320.1| secondary cell wall-related glycosyltransferase family 47 [Populus
           tremula x Populus tremuloides]
          Length = 412

 Score =  775 bits (2000), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 370/408 (90%), Positives = 386/408 (94%), Gaps = 2/408 (0%)

Query: 1   MWVTAIALLLAFVSADNTPKI--ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLK 58
           +WV A+ L+  FV      ++  ERISGSAGDVL+DDPVG+LKVYVYELPSKYNKKLL K
Sbjct: 5   LWVFALVLVFGFVDGKKIERLRTERISGSAGDVLDDDPVGRLKVYVYELPSKYNKKLLQK 64

Query: 59  DPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMM 118
           DPRCLTHMFAAEIFMHRFLLSSPVRTLNP+EADWFY+PIYPTCDLTP GLPLPFKSPRMM
Sbjct: 65  DPRCLTHMFAAEIFMHRFLLSSPVRTLNPDEADWFYSPIYPTCDLTPMGLPLPFKSPRMM 124

Query: 119 RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG 178
           RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFH QEEKA+ERGILPLLQR+TLVQTFG
Sbjct: 125 RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHCQEEKAVERGILPLLQRSTLVQTFG 184

Query: 179 QRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGA 238
           +RNHVCLNEGSITIPP+APPQKMQAHQIPPD PRSIFVYFRGLFYDVNNDPEGGYYARGA
Sbjct: 185 RRNHVCLNEGSITIPPFAPPQKMQAHQIPPDIPRSIFVYFRGLFYDVNNDPEGGYYARGA 244

Query: 239 RAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 298
           RAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII
Sbjct: 245 RAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 304

Query: 299 ADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFP 358
           ADDIVLPFADAIPWEEIG+FVAEEDVP LDTILTSIPPEVILRKQRLLANPSMKRAMLFP
Sbjct: 305 ADDIVLPFADAIPWEEIGVFVAEEDVPNLDTILTSIPPEVILRKQRLLANPSMKRAMLFP 364

Query: 359 QPAQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           QPAQPGDAFHQILNGLARKLPHD+S YLK G   LNWT+GPVGDLKPW
Sbjct: 365 QPAQPGDAFHQILNGLARKLPHDRSVYLKSGQNILNWTAGPVGDLKPW 412


>gi|15223522|ref|NP_174064.1| putative glycosyltransferase [Arabidopsis thaliana]
 gi|75272002|sp|Q9FZJ1.1|IRX10_ARATH RecName: Full=Probable beta-1,4-xylosyltransferase IRX10; AltName:
           Full=Glucuronoxylan glucuronosyltransferase 1;
           Short=AtGUT1; AltName: Full=Glucuronoxylan
           glucuronosyltransferase 2; AltName: Full=Protein
           IRREGULAR XYLEM 10; AltName: Full=Xylan
           xylosyltransferase IRX10
 gi|9802541|gb|AAF99743.1|AC004557_22 F17L21.23 [Arabidopsis thaliana]
 gi|63003872|gb|AAY25465.1| At1g27440 [Arabidopsis thaliana]
 gi|98960979|gb|ABF58973.1| At1g27440 [Arabidopsis thaliana]
 gi|332192709|gb|AEE30830.1| putative glycosyltransferase [Arabidopsis thaliana]
          Length = 412

 Score =  773 bits (1996), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 374/406 (92%), Positives = 385/406 (94%), Gaps = 2/406 (0%)

Query: 3   VTAIALLLAFV--SADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDP 60
           ++AI L L F   SA    + ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLL KDP
Sbjct: 7   LSAILLFLFFSASSAKQNVRTERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLQKDP 66

Query: 61  RCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRS 120
           RCLTHMFAAEIFMHRFLLSSPVRT NP+EADWFYTPIYPTCDLTPTGLPLPFKSPRMMRS
Sbjct: 67  RCLTHMFAAEIFMHRFLLSSPVRTRNPDEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRS 126

Query: 121 AIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR 180
           +IQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR
Sbjct: 127 SIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR 186

Query: 181 NHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARA 240
           NHVCL+EGSITIPP+APPQKMQAH IPPD PRSIFVYFRGLFYDVNNDPEGGYYARGARA
Sbjct: 187 NHVCLDEGSITIPPFAPPQKMQAHFIPPDIPRSIFVYFRGLFYDVNNDPEGGYYARGARA 246

Query: 241 AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 300
           AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD
Sbjct: 247 AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 306

Query: 301 DIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQP 360
           DIVLPFADAIPWEEIG+FVAE+DVP+LDTILTSIP EVILRKQRLLANPSMKRAMLFPQP
Sbjct: 307 DIVLPFADAIPWEEIGVFVAEKDVPELDTILTSIPTEVILRKQRLLANPSMKRAMLFPQP 366

Query: 361 AQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           AQPGDAFHQILNGLARKLPHDKS YLK G K LNWT+GPV DLKPW
Sbjct: 367 AQPGDAFHQILNGLARKLPHDKSIYLKTGEKALNWTAGPVADLKPW 412


>gi|224075447|ref|XP_002304637.1| predicted protein [Populus trichocarpa]
 gi|222842069|gb|EEE79616.1| predicted protein [Populus trichocarpa]
          Length = 413

 Score =  772 bits (1994), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 369/393 (93%), Positives = 380/393 (96%)

Query: 14  SADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFM 73
           +AD   + ERISGSAGDVLED+PVG+LKV+VYELPSKYNKKLL KDPRCLTHMFAAEIFM
Sbjct: 21  NADQRLRTERISGSAGDVLEDNPVGRLKVFVYELPSKYNKKLLQKDPRCLTHMFAAEIFM 80

Query: 74  HRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWN 133
           HRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPF SPRMMRSAIQL+SSNWPYWN
Sbjct: 81  HRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFNSPRMMRSAIQLLSSNWPYWN 140

Query: 134 RTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIP 193
           RTEGADHFFVVPHDFGACFHYQEEKAIERGILPLL+RATLVQTFGQRNHVCLNEGSITIP
Sbjct: 141 RTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQRNHVCLNEGSITIP 200

Query: 194 PYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFD 253
           PYAPPQKMQAHQIP DTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFD
Sbjct: 201 PYAPPQKMQAHQIPLDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFD 260

Query: 254 ISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE 313
           ISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE
Sbjct: 261 ISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE 320

Query: 314 EIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNG 373
           EIG+FVAEEDVP LDT LTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNG
Sbjct: 321 EIGVFVAEEDVPHLDTFLTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNG 380

Query: 374 LARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           LARKLPHD++ YLKPG K LNWT+GP GDLKPW
Sbjct: 381 LARKLPHDRTVYLKPGQKVLNWTAGPAGDLKPW 413


>gi|297851174|ref|XP_002893468.1| hypothetical protein ARALYDRAFT_472948 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297339310|gb|EFH69727.1| hypothetical protein ARALYDRAFT_472948 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 412

 Score =  772 bits (1994), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 373/406 (91%), Positives = 387/406 (95%), Gaps = 2/406 (0%)

Query: 3   VTAIALLLAFV--SADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDP 60
           ++AI L L F   S++   + ERISGSAGDVLED+PVGKLKV+VYELPSKYNKKLL KDP
Sbjct: 7   LSAILLFLFFSASSSEQNVRTERISGSAGDVLEDNPVGKLKVFVYELPSKYNKKLLQKDP 66

Query: 61  RCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRS 120
           RCLTHMFAAEIFMHRFLLSSPVRT NP+EADWFYTPIYPTCDLTPTGLPLPFKSPRMMRS
Sbjct: 67  RCLTHMFAAEIFMHRFLLSSPVRTRNPDEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRS 126

Query: 121 AIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR 180
           AIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR
Sbjct: 127 AIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR 186

Query: 181 NHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARA 240
           NHVCL+EGSITIPP+APPQKMQAH IPPD PRSIFVYFRGLFYDVNNDPEGGYYARGARA
Sbjct: 187 NHVCLDEGSITIPPFAPPQKMQAHFIPPDIPRSIFVYFRGLFYDVNNDPEGGYYARGARA 246

Query: 241 AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 300
           AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD
Sbjct: 247 AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 306

Query: 301 DIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQP 360
           DIVLPFADAIPWEEIG+FVAE+DVP+LDTILTSIP EVILRKQRLLANPSMKRAMLFPQP
Sbjct: 307 DIVLPFADAIPWEEIGVFVAEKDVPELDTILTSIPTEVILRKQRLLANPSMKRAMLFPQP 366

Query: 361 AQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           AQPGDAFHQILNGLARKLPHDKS YLK G K LNWT+GPVGDLKPW
Sbjct: 367 AQPGDAFHQILNGLARKLPHDKSIYLKAGEKELNWTAGPVGDLKPW 412


>gi|359473830|ref|XP_003631364.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like [Vitis
           vinifera]
          Length = 412

 Score =  759 bits (1960), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 365/408 (89%), Positives = 380/408 (93%), Gaps = 2/408 (0%)

Query: 1   MWVTAIALL--LAFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLK 58
           +WV     +  +     D  P  ERISGSAGDVLEDDPVG+LKV+VYELPSKYNKK+L K
Sbjct: 5   IWVLVFGFVWGIGAHGLDRHPHTERISGSAGDVLEDDPVGRLKVFVYELPSKYNKKILQK 64

Query: 59  DPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMM 118
           DPRCL HMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY TCDLTP GLPLPFKSPRMM
Sbjct: 65  DPRCLNHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYTTCDLTPNGLPLPFKSPRMM 124

Query: 119 RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG 178
           RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG
Sbjct: 125 RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG 184

Query: 179 QRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGA 238
           QRNHVCLNEGSITIPPYAPPQKMQAH IP +TPRSIFVYFRGLFYDVNNDPEGGYYARGA
Sbjct: 185 QRNHVCLNEGSITIPPYAPPQKMQAHLIPQETPRSIFVYFRGLFYDVNNDPEGGYYARGA 244

Query: 239 RAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 298
           RAAVWENFK+NPLFDIST+HPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAV+FGCIPVII
Sbjct: 245 RAAVWENFKDNPLFDISTEHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVIFGCIPVII 304

Query: 299 ADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFP 358
           ADDIVLPFADAIPWEEIG+FVAEEDVP LDTILTSIPPEVILRKQRLLANPSMK+AMLFP
Sbjct: 305 ADDIVLPFADAIPWEEIGVFVAEEDVPNLDTILTSIPPEVILRKQRLLANPSMKQAMLFP 364

Query: 359 QPAQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           QPAQ GDAFHQILNGLARKLPH+K+ YLKPG   LNWT+GPVGDLKPW
Sbjct: 365 QPAQSGDAFHQILNGLARKLPHNKNVYLKPGENALNWTAGPVGDLKPW 412


>gi|255540029|ref|XP_002511079.1| catalytic, putative [Ricinus communis]
 gi|223550194|gb|EEF51681.1| catalytic, putative [Ricinus communis]
          Length = 417

 Score =  755 bits (1949), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 360/403 (89%), Positives = 379/403 (94%), Gaps = 3/403 (0%)

Query: 7   ALLLAFVSA---DNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCL 63
           A  L+  SA    ++ + ERISGSAGDVLEDDPVG+LKV+VYELPSKYNKK+L KDPRCL
Sbjct: 15  AFFLSSSSAYKMGSSQRTERISGSAGDVLEDDPVGRLKVFVYELPSKYNKKILQKDPRCL 74

Query: 64  THMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQ 123
           THMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP+Y TCDLTP GLPLPFKSPRMMRSAIQ
Sbjct: 75  THMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQ 134

Query: 124 LISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHV 183
           LISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILP+LQRATLVQTFGQRNHV
Sbjct: 135 LISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPMLQRATLVQTFGQRNHV 194

Query: 184 CLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVW 243
           CL EGSITIPPYAPPQKMQAH IP  TPRSIFVYFRGLFYDV NDPEGGYYARGARAAVW
Sbjct: 195 CLKEGSITIPPYAPPQKMQAHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVW 254

Query: 244 ENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 303
           ENFK+NPLFDIST+HPTTYYEDMQRA+FCLCPLGWAPWSPRLVEAV+FGCIPVIIADDIV
Sbjct: 255 ENFKDNPLFDISTEHPTTYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIV 314

Query: 304 LPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP 363
           LPFADAIPWEEIG+FV E+DVP LDTILTSIPPEVILRKQRLLANPSMK+AMLFPQPAQ 
Sbjct: 315 LPFADAIPWEEIGVFVDEKDVPNLDTILTSIPPEVILRKQRLLANPSMKQAMLFPQPAQA 374

Query: 364 GDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           GDAFHQ+LNGLARKLPHD+S YLKPG K LNWT+GPVGDLKPW
Sbjct: 375 GDAFHQVLNGLARKLPHDRSVYLKPGEKILNWTAGPVGDLKPW 417


>gi|449484890|ref|XP_004157009.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10-like [Cucumis
           sativus]
          Length = 417

 Score =  754 bits (1948), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 360/406 (88%), Positives = 378/406 (93%)

Query: 1   MWVTAIALLLAFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDP 60
           +WV AI L+    +  N    ERISGSAGDVLED+PVG+LKVYVY+LPSKYNKK L KDP
Sbjct: 12  LWVFAIFLVSRIAAVPNAVPTERISGSAGDVLEDNPVGRLKVYVYDLPSKYNKKTLQKDP 71

Query: 61  RCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRS 120
           RCLTHMFAAEI+MHRFLL+SPVRTLNP+EADWFYTPIY TCDLTP GLPLPFKSPRMMRS
Sbjct: 72  RCLTHMFAAEIYMHRFLLNSPVRTLNPDEADWFYTPIYVTCDLTPNGLPLPFKSPRMMRS 131

Query: 121 AIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR 180
           AIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAI+RGILPLLQRATLVQTFGQR
Sbjct: 132 AIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIDRGILPLLQRATLVQTFGQR 191

Query: 181 NHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARA 240
           NHVCLNEGSITIPPY PPQKM+ H IP +TPRSIFVYFRGLFYDVNNDPEGGYYARGARA
Sbjct: 192 NHVCLNEGSITIPPYCPPQKMKTHLIPSETPRSIFVYFRGLFYDVNNDPEGGYYARGARA 251

Query: 241 AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 300
           AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD
Sbjct: 252 AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 311

Query: 301 DIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQP 360
           DIVLPFADAIPWEEIG+FV E+DV  LDTILTSIPP+VILRKQRLLANPSMKRAM+FPQP
Sbjct: 312 DIVLPFADAIPWEEIGVFVDEKDVSNLDTILTSIPPDVILRKQRLLANPSMKRAMMFPQP 371

Query: 361 AQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           AQ GDAFHQILNGLARKLPHDK  YLKPG + LNWT+GPVGDLKPW
Sbjct: 372 AQSGDAFHQILNGLARKLPHDKGVYLKPGERILNWTAGPVGDLKPW 417


>gi|225456511|ref|XP_002284685.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L [Vitis
           vinifera]
 gi|147860849|emb|CAN83159.1| hypothetical protein VITISV_022554 [Vitis vinifera]
 gi|297734521|emb|CBI15768.3| unnamed protein product [Vitis vinifera]
          Length = 416

 Score =  754 bits (1947), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 363/411 (88%), Positives = 382/411 (92%), Gaps = 6/411 (1%)

Query: 2   WVTAIALLLAF---VSADNT---PKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKL 55
           WV  + LL+A    V A+      + ERISGSAGDVLED+PVG+LKV+VYELPSKYNKK+
Sbjct: 6   WVFCVLLLIASSLRVGAEQLRRGQRTERISGSAGDVLEDNPVGRLKVFVYELPSKYNKKI 65

Query: 56  LLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSP 115
           L KDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP+Y TCDLTP GLPLPFKSP
Sbjct: 66  LQKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSP 125

Query: 116 RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ 175
           RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGIL LLQRATLVQ
Sbjct: 126 RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILHLLQRATLVQ 185

Query: 176 TFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYA 235
           TFGQRNHVCL EGSITIPPYAPPQKMQ+H IPPDTPRSIFVYFRGLFYDV NDPEGGYYA
Sbjct: 186 TFGQRNHVCLKEGSITIPPYAPPQKMQSHLIPPDTPRSIFVYFRGLFYDVGNDPEGGYYA 245

Query: 236 RGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIP 295
           RGARAAVWENFK+NPLFDIST+HP TYYEDMQRA+FCLCPLGWAPWSPRLVE V+FGCIP
Sbjct: 246 RGARAAVWENFKDNPLFDISTEHPMTYYEDMQRAVFCLCPLGWAPWSPRLVEGVIFGCIP 305

Query: 296 VIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAM 355
           VIIADDIVLPFADAIPWEEIG+FVAEEDVP LDTILTSIPPEVILRKQRLLANPSMK+AM
Sbjct: 306 VIIADDIVLPFADAIPWEEIGVFVAEEDVPNLDTILTSIPPEVILRKQRLLANPSMKQAM 365

Query: 356 LFPQPAQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           LFPQPAQPGDAFHQILNGLARKLPH ++ YLKPG K LNWT+GPVGDLKPW
Sbjct: 366 LFPQPAQPGDAFHQILNGLARKLPHGRNIYLKPGEKILNWTAGPVGDLKPW 416


>gi|356508876|ref|XP_003523179.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
           [Glycine max]
          Length = 416

 Score =  749 bits (1934), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 357/406 (87%), Positives = 374/406 (92%)

Query: 1   MWVTAIALLLAFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDP 60
           +   A A  +  V        ERISGSAGDVLEDDPVG+LKV+VYELPSKYNKK+L KDP
Sbjct: 11  LLCAASAFAIGAVELGRHQPTERISGSAGDVLEDDPVGRLKVFVYELPSKYNKKILQKDP 70

Query: 61  RCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRS 120
           RCL HMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP+Y TCDLTP GLPLPFKSPRMMRS
Sbjct: 71  RCLNHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRS 130

Query: 121 AIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR 180
           AIQLISSNWPYWNRTEGADHFFV PHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR
Sbjct: 131 AIQLISSNWPYWNRTEGADHFFVTPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR 190

Query: 181 NHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARA 240
           NHVCL EGSITIPPYAPPQKM  H IP  TPRSIFVYFRGLFYDV NDPEGGYYARGARA
Sbjct: 191 NHVCLKEGSITIPPYAPPQKMHTHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARA 250

Query: 241 AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 300
           AVWENFK+NPLFDIST+HPTTYYEDMQRA+FCLCPLGWAPWSPRLVEAVVFGCIPVIIAD
Sbjct: 251 AVWENFKDNPLFDISTEHPTTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 310

Query: 301 DIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQP 360
           DIVLPFADAIPWEEIG+FV E+DVP+LDTILTSIPPEVILRKQRLLANPSMK+AMLFPQP
Sbjct: 311 DIVLPFADAIPWEEIGVFVDEKDVPQLDTILTSIPPEVILRKQRLLANPSMKQAMLFPQP 370

Query: 361 AQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           AQPGDAFHQ+LNGLARKLPHD+S +LKPG K LNWT+GPVGDLKPW
Sbjct: 371 AQPGDAFHQVLNGLARKLPHDRSVFLKPGEKALNWTAGPVGDLKPW 416


>gi|224119858|ref|XP_002318180.1| predicted protein [Populus trichocarpa]
 gi|224122230|ref|XP_002318783.1| predicted protein [Populus trichocarpa]
 gi|222858853|gb|EEE96400.1| predicted protein [Populus trichocarpa]
 gi|222859456|gb|EEE97003.1| predicted protein [Populus trichocarpa]
          Length = 417

 Score =  749 bits (1933), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 353/406 (86%), Positives = 373/406 (91%)

Query: 1   MWVTAIALLLAFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDP 60
           +W     L    V    +   ERISGSAGDVLEDDPVG+LKV+VYELP KYNKK+L KDP
Sbjct: 12  LWYAGFVLSTGAVELGRSQHTERISGSAGDVLEDDPVGRLKVFVYELPRKYNKKILQKDP 71

Query: 61  RCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRS 120
           RCLTHMFAAEIFMHRFLLSSPVRTLNP+EADWFYTP+Y TCDLT  GLPLPFKSPRMMRS
Sbjct: 72  RCLTHMFAAEIFMHRFLLSSPVRTLNPDEADWFYTPVYTTCDLTTNGLPLPFKSPRMMRS 131

Query: 121 AIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR 180
           AIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR
Sbjct: 132 AIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR 191

Query: 181 NHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARA 240
           NHVCL +GSIT+PPYAPPQKMQ H IP  TPRSIFVYFRGLFYDV NDPEGGYYARGARA
Sbjct: 192 NHVCLKDGSITVPPYAPPQKMQTHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARA 251

Query: 241 AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 300
           AVWENFK+NPLFDIST+HP TYYEDMQRA+FCLCPLGWAPWSPRLVEAV+FGCIPVIIAD
Sbjct: 252 AVWENFKDNPLFDISTEHPATYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGCIPVIIAD 311

Query: 301 DIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQP 360
           DIVLPFADAIPWEEIG++V EEDVP LDTILTSIPPEVILRKQRLLANPSMK+AMLFPQP
Sbjct: 312 DIVLPFADAIPWEEIGVYVDEEDVPNLDTILTSIPPEVILRKQRLLANPSMKQAMLFPQP 371

Query: 361 AQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           AQPGDAFHQ+LNGLARKLPHD+S YL+PG K LNWT+GPVGDLKPW
Sbjct: 372 AQPGDAFHQVLNGLARKLPHDRSVYLRPGEKILNWTAGPVGDLKPW 417


>gi|356516468|ref|XP_003526916.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
           [Glycine max]
          Length = 416

 Score =  748 bits (1930), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 354/385 (91%), Positives = 369/385 (95%)

Query: 22  ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP 81
           ERISGSAGDVLEDDPVG+LKV+VYELPSKYNKK+L KDPRCL HMFAAEIFMHRFLLSSP
Sbjct: 32  ERISGSAGDVLEDDPVGRLKVFVYELPSKYNKKILQKDPRCLNHMFAAEIFMHRFLLSSP 91

Query: 82  VRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 141
           VRTLNPEEADWFYTP+Y TCDLTP GLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF
Sbjct: 92  VRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 151

Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
           FV PHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCL EGSITIPPYAPPQKM
Sbjct: 152 FVTPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLKEGSITIPPYAPPQKM 211

Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
             H IP  TPRSIFVYFRGLFYDV NDPEGGYYARGARAAVWENFK+NPLFDIST+HPTT
Sbjct: 212 HTHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENFKDNPLFDISTEHPTT 271

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           YYEDMQRA+FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG+FV E
Sbjct: 272 YYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVDE 331

Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHD 381
           +DVP+LDTILTSIPPEVILRKQRLLANPSMK+AMLFPQPAQPGDAFHQ+LNGLARKLPHD
Sbjct: 332 KDVPQLDTILTSIPPEVILRKQRLLANPSMKQAMLFPQPAQPGDAFHQVLNGLARKLPHD 391

Query: 382 KSTYLKPGGKFLNWTSGPVGDLKPW 406
           +S +LKPG K LNWT+GPVGDLKPW
Sbjct: 392 RSIFLKPGEKALNWTAGPVGDLKPW 416


>gi|317106607|dbj|BAJ53114.1| JHL07K02.4 [Jatropha curcas]
          Length = 417

 Score =  747 bits (1928), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 358/409 (87%), Positives = 377/409 (92%), Gaps = 6/409 (1%)

Query: 4   TAIALLLAFVSADNTPKI------ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLL 57
             I L  A VS+    K+      ERISGSAGDVLEDDPVG+LKV+VYELPSKYNKK+L 
Sbjct: 9   VGILLCAATVSSIGAVKLGRSQPTERISGSAGDVLEDDPVGRLKVFVYELPSKYNKKILQ 68

Query: 58  KDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRM 117
           KD RCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP+Y TCDLTP GLPLPFKSPRM
Sbjct: 69  KDSRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRM 128

Query: 118 MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTF 177
           MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTF
Sbjct: 129 MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTF 188

Query: 178 GQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARG 237
           GQRNHVCL +GSITIPPYAPPQKMQAH IP  TPRSIFVYFRGLFYDV NDPEGGYYARG
Sbjct: 189 GQRNHVCLKDGSITIPPYAPPQKMQAHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARG 248

Query: 238 ARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVI 297
           ARAAVWENFK+NPLFDIST+HP+TYYEDMQRA+FCLCPLGWAPWSPRLVEAV+FGCIPVI
Sbjct: 249 ARAAVWENFKDNPLFDISTEHPSTYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGCIPVI 308

Query: 298 IADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLF 357
           IADDIVLPFADAIPWE+IG+FV E+DVP LD ILTSIPPEVILRKQRLLANPSMK+AMLF
Sbjct: 309 IADDIVLPFADAIPWEQIGVFVDEKDVPNLDAILTSIPPEVILRKQRLLANPSMKQAMLF 368

Query: 358 PQPAQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           PQP QPGDAFHQ+LNGLARKLPHD+S YLKPG K LNWT+GPVGDLKPW
Sbjct: 369 PQPVQPGDAFHQVLNGLARKLPHDRSIYLKPGEKILNWTAGPVGDLKPW 417


>gi|307135997|gb|ADN33853.1| glucuronoxylan glucuronosyltransferase [Cucumis melo subsp. melo]
          Length = 416

 Score =  745 bits (1923), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 353/411 (85%), Positives = 377/411 (91%), Gaps = 6/411 (1%)

Query: 2   WVTAIALLLAFVSA------DNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKL 55
           W   + L  AF++       + + + ERISGSAGDVLED+PVG+LKV+VYELPSKYNKK+
Sbjct: 6   WAIFVPLFAAFLTTTDAIVRERSQRTERISGSAGDVLEDNPVGRLKVFVYELPSKYNKKI 65

Query: 56  LLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSP 115
           L KDPRCL HMFAAEIFMHRFLL+SPVRTLNPEEADWFYTP+Y TCDLTP GLPLPFKSP
Sbjct: 66  LQKDPRCLNHMFAAEIFMHRFLLTSPVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSP 125

Query: 116 RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ 175
           RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ
Sbjct: 126 RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ 185

Query: 176 TFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYA 235
           TFGQRNHVCL EGSITIPPYAPPQKM AH IP  TPRSIFVYFRGLFYDV NDPEGGYYA
Sbjct: 186 TFGQRNHVCLKEGSITIPPYAPPQKMHAHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYA 245

Query: 236 RGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIP 295
           RGARAAVWENFK+NPLFDIST+HPTTYYEDMQRA+FCLCPLGWAPWSPRLVEAV+FGCIP
Sbjct: 246 RGARAAVWENFKDNPLFDISTEHPTTYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGCIP 305

Query: 296 VIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAM 355
           VIIADDIVLPFADAIPWEEIG+F+ E+DV  LDTILTSIP E+ILRKQRLLANPSMK+AM
Sbjct: 306 VIIADDIVLPFADAIPWEEIGVFLDEKDVANLDTILTSIPLEMILRKQRLLANPSMKQAM 365

Query: 356 LFPQPAQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           LFPQPAQPGDAFHQ+LNGLARKLPHD+S YLK G K LNWT+GPVGDLKPW
Sbjct: 366 LFPQPAQPGDAFHQVLNGLARKLPHDRSVYLKAGEKILNWTAGPVGDLKPW 416


>gi|449440484|ref|XP_004138014.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
           [Cucumis sativus]
 gi|449528307|ref|XP_004171146.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
           [Cucumis sativus]
          Length = 416

 Score =  744 bits (1922), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 352/411 (85%), Positives = 377/411 (91%), Gaps = 6/411 (1%)

Query: 2   WVTAIALLLAFVSA------DNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKL 55
           W   + L  AF++       + + + ERISGSAGDVLED+PVG+LKV+VYELPSKYNKK+
Sbjct: 6   WAIFVPLFAAFLTTTDAIVRERSQRTERISGSAGDVLEDNPVGRLKVFVYELPSKYNKKI 65

Query: 56  LLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSP 115
           L KDPRCL HMFAAEIFMHRFLL+SPVRTLNPEEADWFYTP+Y TCDLTP GLPLPFKSP
Sbjct: 66  LQKDPRCLNHMFAAEIFMHRFLLTSPVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSP 125

Query: 116 RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ 175
           RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ
Sbjct: 126 RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ 185

Query: 176 TFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYA 235
           TFGQRNHVCL EGSITIPPYAPPQKM AH IP  TPRSIFVYFRGLFYDV NDPEGGYYA
Sbjct: 186 TFGQRNHVCLKEGSITIPPYAPPQKMHAHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYA 245

Query: 236 RGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIP 295
           RGARAAVWENFK+NPLFDIST+HPTTYYEDMQRA+FCLCPLGWAPWSPRLVEAV+FGCIP
Sbjct: 246 RGARAAVWENFKDNPLFDISTEHPTTYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGCIP 305

Query: 296 VIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAM 355
           VIIADDIVLPFADAIPWEEIG+F+ E+DV  LDTILTSIP E+ILRKQRLLANPSMK+AM
Sbjct: 306 VIIADDIVLPFADAIPWEEIGVFLDEKDVANLDTILTSIPLEMILRKQRLLANPSMKQAM 365

Query: 356 LFPQPAQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           LFPQPAQPGDAFHQ+LNGLARKLPHD+S Y+K G K LNWT+GPVGDLKPW
Sbjct: 366 LFPQPAQPGDAFHQVLNGLARKLPHDRSVYIKAGEKILNWTAGPVGDLKPW 416


>gi|356512936|ref|XP_003525170.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
           [Glycine max]
          Length = 416

 Score =  744 bits (1921), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 352/385 (91%), Positives = 367/385 (95%)

Query: 22  ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP 81
           ERISGSAGDVLEDDPVG+LKV+VYELPSKYNKK+L KDPRCL HMFAAEIFMHRFLLSSP
Sbjct: 32  ERISGSAGDVLEDDPVGRLKVFVYELPSKYNKKILQKDPRCLNHMFAAEIFMHRFLLSSP 91

Query: 82  VRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 141
           VRTLNPEEADWFYTP+Y TCDLTP GLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF
Sbjct: 92  VRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 151

Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
           FVVPHDFGACFHYQEEKAIERGIL LL+RATLVQTFGQRNHVCL EGSITIPPYAPPQKM
Sbjct: 152 FVVPHDFGACFHYQEEKAIERGILTLLRRATLVQTFGQRNHVCLKEGSITIPPYAPPQKM 211

Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
             H IP  TPRSIFVYFRGLFYDV NDPEGGYYARGARAAVWENFK+N LFDIST+HPTT
Sbjct: 212 HTHLIPDKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENFKDNLLFDISTEHPTT 271

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           YYEDMQRA+FCLCPLGWAPWSPRLVEAV+FGCIPVIIADDIVLPFADAIPWEEIG+FV E
Sbjct: 272 YYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPFADAIPWEEIGVFVDE 331

Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHD 381
           EDVPKLDTILTSIPPEVILRKQRLLANPSMK+AMLFPQPAQPGDAFHQ+LNGLARKLPHD
Sbjct: 332 EDVPKLDTILTSIPPEVILRKQRLLANPSMKQAMLFPQPAQPGDAFHQVLNGLARKLPHD 391

Query: 382 KSTYLKPGGKFLNWTSGPVGDLKPW 406
            + +LKPG K LNWT+GPVGDLKPW
Sbjct: 392 NTVFLKPGEKILNWTAGPVGDLKPW 416


>gi|18424516|ref|NP_568941.1| Exostosin family protein [Arabidopsis thaliana]
 gi|75163931|sp|Q940Q8.1|IX10L_ARATH RecName: Full=Probable beta-1,4-xylosyltransferase IRX10L; AltName:
           Full=Glucuronoxylan glucuronosyltransferase 1;
           Short=AtGUT1; AltName: Full=Glucuronoxylan
           glucuronosyltransferase 2; Short=AtGUT2; AltName:
           Full=Protein IRREGULAR XYLEM 10-like; AltName:
           Full=Xylan xylosyltransferase IRX10L
 gi|15809826|gb|AAL06841.1| AT5g61840/mac9_140 [Arabidopsis thaliana]
 gi|17978869|gb|AAL47406.1| AT5g61840/mac9_140 [Arabidopsis thaliana]
 gi|332010141|gb|AED97524.1| Exostosin family protein [Arabidopsis thaliana]
          Length = 415

 Score =  739 bits (1907), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 347/385 (90%), Positives = 367/385 (95%)

Query: 22  ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP 81
           ERISGSAGDVLEDDPVG+LKV+VYELPSKYNKK+L KDPRCL HMFAAEI+M RFLLSSP
Sbjct: 31  ERISGSAGDVLEDDPVGRLKVFVYELPSKYNKKILQKDPRCLNHMFAAEIYMQRFLLSSP 90

Query: 82  VRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 141
           VRTLNPEEADWFY P+Y TCDLTP GLPLPFKSPRMMRSAIQLI+SNWPYWNRTEGADHF
Sbjct: 91  VRTLNPEEADWFYVPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIASNWPYWNRTEGADHF 150

Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
           FVVPHDFGACFHYQEEKAI RGILPLLQRATLVQTFGQRNHVCL EGSIT+PPYAPPQKM
Sbjct: 151 FVVPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQRNHVCLKEGSITVPPYAPPQKM 210

Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
           Q+H IP  TPRSIFVYFRGLFYDV NDPEGGYYARGARAAVWENFK+NPLFDIST+HPTT
Sbjct: 211 QSHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENFKDNPLFDISTEHPTT 270

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           YYEDMQRAIFCLCPLGWAPWSPRLVEAV+FGCIPVIIADDIVLPFADAIPWE+IG+FV E
Sbjct: 271 YYEDMQRAIFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPFADAIPWEDIGVFVDE 330

Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHD 381
           +DVP LDTILTSIPPEVILRKQRLLANPSMK+AMLFPQPAQPGDAFHQ+LNGLARKLPH+
Sbjct: 331 KDVPYLDTILTSIPPEVILRKQRLLANPSMKQAMLFPQPAQPGDAFHQVLNGLARKLPHE 390

Query: 382 KSTYLKPGGKFLNWTSGPVGDLKPW 406
           +S YL+PG K LNWT+GPV DLKPW
Sbjct: 391 RSVYLRPGEKLLNWTAGPVADLKPW 415


>gi|297797147|ref|XP_002866458.1| hypothetical protein ARALYDRAFT_496352 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297312293|gb|EFH42717.1| hypothetical protein ARALYDRAFT_496352 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 417

 Score =  738 bits (1906), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 350/404 (86%), Positives = 377/404 (93%), Gaps = 1/404 (0%)

Query: 3   VTAIALLLAFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRC 62
           +T ++ + AF  + + P  ERISGSAGDVLED+PVG+LKV+VYELPSKYNKK+L KDPRC
Sbjct: 15  LTFLSSISAFRLSRSQPT-ERISGSAGDVLEDNPVGRLKVFVYELPSKYNKKILQKDPRC 73

Query: 63  LTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAI 122
           L HMFAAEI+M RFLLSSPVRTLNPEEADWFY P+Y TCDLTP GLPLPFKSPRMMRSAI
Sbjct: 74  LNHMFAAEIYMQRFLLSSPVRTLNPEEADWFYVPVYTTCDLTPNGLPLPFKSPRMMRSAI 133

Query: 123 QLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNH 182
           QLI+SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAI RGILPLLQRATLVQTFGQRNH
Sbjct: 134 QLIASNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQRNH 193

Query: 183 VCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAV 242
           VCL EGSIT+PPYAPPQKMQ+H IP  TPRSIFVYFRGLFYDV NDPEGGYYARGARAAV
Sbjct: 194 VCLKEGSITVPPYAPPQKMQSHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAV 253

Query: 243 WENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 302
           WENFK+NPLFDIST+HPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAV+FGCIPVIIADDI
Sbjct: 254 WENFKDNPLFDISTEHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDI 313

Query: 303 VLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQ 362
           VLPFADAIPWE+IG+FV E+DVP LDTILTSIPPEVILRKQRLLANPSMK+AMLFPQPAQ
Sbjct: 314 VLPFADAIPWEDIGVFVDEKDVPYLDTILTSIPPEVILRKQRLLANPSMKQAMLFPQPAQ 373

Query: 363 PGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           PGDAFHQ+LNGLARKLPH++S YL+PG K LNWT+GPV DLKPW
Sbjct: 374 PGDAFHQVLNGLARKLPHERSVYLRPGEKLLNWTAGPVADLKPW 417


>gi|21592991|gb|AAM64940.1| unknown [Arabidopsis thaliana]
          Length = 415

 Score =  736 bits (1900), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 346/385 (89%), Positives = 366/385 (95%)

Query: 22  ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP 81
           ERISGSAGDVLEDDPVG+LKV+VYELPSKYNKK+L KDPRCL HMFAAEI+M RFLLSSP
Sbjct: 31  ERISGSAGDVLEDDPVGRLKVFVYELPSKYNKKILQKDPRCLNHMFAAEIYMQRFLLSSP 90

Query: 82  VRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 141
           VRTLNPEEADWFY P+Y TCDLTP GLPLPFKSPRMMRSAIQLI+SNWPYWNRTEGADHF
Sbjct: 91  VRTLNPEEADWFYVPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIASNWPYWNRTEGADHF 150

Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
           FVVPHDF ACFHYQEEKAI RGILPLLQRATLVQTFGQRNHVCL EGSIT+PPYAPPQKM
Sbjct: 151 FVVPHDFRACFHYQEEKAIGRGILPLLQRATLVQTFGQRNHVCLKEGSITVPPYAPPQKM 210

Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
           Q+H IP  TPRSIFVYFRGLFYDV NDPEGGYYARGARAAVWENFK+NPLFDIST+HPTT
Sbjct: 211 QSHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENFKDNPLFDISTEHPTT 270

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           YYEDMQRAIFCLCPLGWAPWSPRLVEAV+FGCIPVIIADDIVLPFADAIPWE+IG+FV E
Sbjct: 271 YYEDMQRAIFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPFADAIPWEDIGVFVDE 330

Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHD 381
           +DVP LDTILTSIPPEVILRKQRLLANPSMK+AMLFPQPAQPGDAFHQ+LNGLARKLPH+
Sbjct: 331 KDVPYLDTILTSIPPEVILRKQRLLANPSMKQAMLFPQPAQPGDAFHQVLNGLARKLPHE 390

Query: 382 KSTYLKPGGKFLNWTSGPVGDLKPW 406
           +S YL+PG K LNWT+GPV DLKPW
Sbjct: 391 RSVYLRPGEKLLNWTAGPVADLKPW 415


>gi|116793780|gb|ABK26875.1| unknown [Picea sitchensis]
          Length = 420

 Score =  735 bits (1898), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 346/387 (89%), Positives = 366/387 (94%)

Query: 20  KIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLS 79
           + ERI+GSAGDVLEDDPVG+LKV++YELPSKYNKK+L KDPRCLTHMFA EIFM+RFLL 
Sbjct: 34  RSERIAGSAGDVLEDDPVGRLKVFIYELPSKYNKKILAKDPRCLTHMFATEIFMNRFLLG 93

Query: 80  SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGAD 139
           SPVRTLNPEEADWFYTP+Y TCDLTP GLPLPFKSPRMMRSAIQ IS+NWPYWNRTEGAD
Sbjct: 94  SPVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQYISTNWPYWNRTEGAD 153

Query: 140 HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQ 199
           HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR HVCL +GSIT+PPYAPPQ
Sbjct: 154 HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRYHVCLKKGSITVPPYAPPQ 213

Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
           KMQAH IPP TPRSIFVYFRGLFYDV NDPEGGYYARGARA+VWENFK+NPLFDIST+HP
Sbjct: 214 KMQAHLIPPSTPRSIFVYFRGLFYDVGNDPEGGYYARGARASVWENFKDNPLFDISTEHP 273

Query: 260 TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV 319
            TYYEDMQRAIFCLCPLGWAPWSPRLVE V+FGCIPVIIADDIVLPFADAIPWEEIG+FV
Sbjct: 274 ATYYEDMQRAIFCLCPLGWAPWSPRLVEGVIFGCIPVIIADDIVLPFADAIPWEEIGVFV 333

Query: 320 AEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
           AE+DVP LDTILTSIPPEVILRKQRLLANP+MK+AMLFPQPAQPGDAFHQILNGLARKLP
Sbjct: 334 AEKDVPNLDTILTSIPPEVILRKQRLLANPAMKQAMLFPQPAQPGDAFHQILNGLARKLP 393

Query: 380 HDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           HD S YLKP  K LNWT+GP GDLKPW
Sbjct: 394 HDSSIYLKPNQKILNWTAGPPGDLKPW 420


>gi|75159221|sp|Q8S1X7.1|GT15_ORYSJ RecName: Full=Probable glucuronosyltransferase Os01g0926700
 gi|20160728|dbj|BAB89670.1| P0482D04.17 [Oryza sativa Japonica Group]
 gi|20805226|dbj|BAB92893.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
 gi|125528942|gb|EAY77056.1| hypothetical protein OsI_05014 [Oryza sativa Indica Group]
 gi|125573178|gb|EAZ14693.1| hypothetical protein OsJ_04618 [Oryza sativa Japonica Group]
          Length = 417

 Score =  734 bits (1894), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 345/387 (89%), Positives = 369/387 (95%)

Query: 20  KIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLS 79
           + ERISGSAGDVLEDDPVG+LKVYVY+LPSKYNKKLL KDPRCL HMFAAEIFMHRFLLS
Sbjct: 31  QTERISGSAGDVLEDDPVGRLKVYVYDLPSKYNKKLLKKDPRCLNHMFAAEIFMHRFLLS 90

Query: 80  SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGAD 139
           S VRT NPEEADWFYTP+Y TCDLTP+GLPLPFKSPRMMRSAI+LI++NWPYWNR+EGAD
Sbjct: 91  SAVRTFNPEEADWFYTPVYTTCDLTPSGLPLPFKSPRMMRSAIELIATNWPYWNRSEGAD 150

Query: 140 HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQ 199
           HFFV PHDFGACFHYQEEKAI RGILPLLQRATLVQTFGQ+NHVCL +GSITIPPYAPPQ
Sbjct: 151 HFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLKDGSITIPPYAPPQ 210

Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
           KMQAH IPPDTPRSIFVYFRGLFYD +NDPEGGYYARGARA+VWENFKNNPLFDISTDHP
Sbjct: 211 KMQAHLIPPDTPRSIFVYFRGLFYDTSNDPEGGYYARGARASVWENFKNNPLFDISTDHP 270

Query: 260 TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV 319
            TYYEDMQR++FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG+FV
Sbjct: 271 PTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFV 330

Query: 320 AEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
           AEEDVPKLD+ILTSIP +VILRKQRLLANPSMK+AMLFPQPAQ GDAFHQILNGLARKLP
Sbjct: 331 AEEDVPKLDSILTSIPTDVILRKQRLLANPSMKQAMLFPQPAQAGDAFHQILNGLARKLP 390

Query: 380 HDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           H ++ +LKPG + LNWT+GPVGDLKPW
Sbjct: 391 HGENVFLKPGERALNWTAGPVGDLKPW 417


>gi|212276027|ref|NP_001130448.1| uncharacterized protein LOC100191546 precursor [Zea mays]
 gi|194689154|gb|ACF78661.1| unknown [Zea mays]
 gi|413946366|gb|AFW79015.1| putative Secondary cell wall glycosyltransferase family 47 [Zea
           mays]
          Length = 415

 Score =  729 bits (1883), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 345/397 (86%), Positives = 369/397 (92%), Gaps = 3/397 (0%)

Query: 10  LAFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAA 69
           +A V+A  T   ERI GSAGDVLEDDPVG+LKVYVY+LPSKYNKKLL KDPRCL HMFAA
Sbjct: 22  VAVVTAQET---ERIEGSAGDVLEDDPVGRLKVYVYDLPSKYNKKLLKKDPRCLNHMFAA 78

Query: 70  EIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNW 129
           EIFMHRFLLSS VRT NPEEADWFYTP+Y TCDLTP GLPLPFKSPRMMRSAIQLI++NW
Sbjct: 79  EIFMHRFLLSSAVRTFNPEEADWFYTPVYTTCDLTPKGLPLPFKSPRMMRSAIQLIATNW 138

Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGS 189
           PYWNR+EGADHFFV PHDFGACFHYQEEKAI RGILPLLQRATLVQTFGQ+NHVCL +GS
Sbjct: 139 PYWNRSEGADHFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLKDGS 198

Query: 190 ITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN 249
           ITIPPYAPPQKMQ H IPPDTPRSIFVYFRGLFYD  NDPEGGYYARGARA+VWENFKNN
Sbjct: 199 ITIPPYAPPQKMQTHLIPPDTPRSIFVYFRGLFYDTGNDPEGGYYARGARASVWENFKNN 258

Query: 250 PLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 309
           PLFDISTDHP TYYEDMQR++FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA
Sbjct: 259 PLFDISTDHPPTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 318

Query: 310 IPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQ 369
           IPWEEIG+FVAE+DVP+LD+ILTSIP +V+LRKQRLLANPSMK+AMLFPQPAQPGDAFHQ
Sbjct: 319 IPWEEIGVFVAEDDVPQLDSILTSIPTDVVLRKQRLLANPSMKQAMLFPQPAQPGDAFHQ 378

Query: 370 ILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           ILNGLARKLPH  + +LKPG + LNWT+GP GDLKPW
Sbjct: 379 ILNGLARKLPHGSNVFLKPGERALNWTAGPPGDLKPW 415


>gi|242088703|ref|XP_002440184.1| hypothetical protein SORBIDRAFT_09g027440 [Sorghum bicolor]
 gi|241945469|gb|EES18614.1| hypothetical protein SORBIDRAFT_09g027440 [Sorghum bicolor]
          Length = 415

 Score =  728 bits (1880), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 344/397 (86%), Positives = 368/397 (92%), Gaps = 3/397 (0%)

Query: 10  LAFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAA 69
           +A V+A  T   ERI GSAGDVLED+PVG+LKVYVY+LPSKYNKKLL KDPRCL HMFAA
Sbjct: 22  IAVVTAQET---ERIEGSAGDVLEDNPVGRLKVYVYDLPSKYNKKLLKKDPRCLNHMFAA 78

Query: 70  EIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNW 129
           EIFMHRFLLSS VRT NPEEADWFYTP+Y TCDLTP GLPLPFKSPRMMRSAIQLI++NW
Sbjct: 79  EIFMHRFLLSSAVRTFNPEEADWFYTPVYTTCDLTPKGLPLPFKSPRMMRSAIQLIATNW 138

Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGS 189
           PYWNR+EGADHFFV PHDFGACFHYQEEKAI RGILPLLQRATLVQTFGQ+NHVCL +GS
Sbjct: 139 PYWNRSEGADHFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLKDGS 198

Query: 190 ITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN 249
           ITIPPYAPPQKMQ H IP DTPRSIFVYFRGLFYD  NDPEGGYYARGARA+VWENFKNN
Sbjct: 199 ITIPPYAPPQKMQTHLIPADTPRSIFVYFRGLFYDTGNDPEGGYYARGARASVWENFKNN 258

Query: 250 PLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 309
           PLFDISTDHP TYYEDMQR++FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA
Sbjct: 259 PLFDISTDHPPTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 318

Query: 310 IPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQ 369
           IPWEEIG+FVAEEDVP+LD+ILTSIP +V+LRKQRLLANPSMK+AMLFPQPAQPGDAFHQ
Sbjct: 319 IPWEEIGVFVAEEDVPQLDSILTSIPTDVVLRKQRLLANPSMKQAMLFPQPAQPGDAFHQ 378

Query: 370 ILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           ILNGLARKLPH  + +LKPG + LNWT+GP GDLKPW
Sbjct: 379 ILNGLARKLPHGSNVFLKPGERVLNWTAGPPGDLKPW 415


>gi|242059743|ref|XP_002459017.1| hypothetical protein SORBIDRAFT_03g044530 [Sorghum bicolor]
 gi|241930992|gb|EES04137.1| hypothetical protein SORBIDRAFT_03g044530 [Sorghum bicolor]
          Length = 420

 Score =  726 bits (1874), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 341/387 (88%), Positives = 366/387 (94%)

Query: 20  KIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLS 79
           + ERISGSAGDVLED+PVG+LKVYVY+LPSKYNKKLL KDPRCL HMFAAEIFMHRFLLS
Sbjct: 34  QTERISGSAGDVLEDNPVGRLKVYVYDLPSKYNKKLLKKDPRCLNHMFAAEIFMHRFLLS 93

Query: 80  SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGAD 139
           S VRT NPEEADWFYTP+Y TCDLTP+GLPLPFKSPRMMRSAI+LI++NWPYWNR+EGAD
Sbjct: 94  SAVRTFNPEEADWFYTPVYTTCDLTPSGLPLPFKSPRMMRSAIELIATNWPYWNRSEGAD 153

Query: 140 HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQ 199
           HFFV PHDFGACFHYQEEKAI RGILPLLQRATLVQTFGQ+NHVCL +GSITIPP+APPQ
Sbjct: 154 HFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLKDGSITIPPFAPPQ 213

Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
           KMQAH IP DTPRSIFVYFRGLFYD +NDPEGGYYARGARA+VWENFKNNPLFDISTDHP
Sbjct: 214 KMQAHLIPADTPRSIFVYFRGLFYDTSNDPEGGYYARGARASVWENFKNNPLFDISTDHP 273

Query: 260 TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV 319
            TYYEDMQR++FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG+FV
Sbjct: 274 PTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFV 333

Query: 320 AEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
           AEEDVPKLD+ILTSIP +VILRKQRLLANPSMK+AMLFPQPAQ GDAFHQILNGLARKLP
Sbjct: 334 AEEDVPKLDSILTSIPTDVILRKQRLLANPSMKQAMLFPQPAQAGDAFHQILNGLARKLP 393

Query: 380 HDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           H  + +LKPG + LNWT+GP GDLKPW
Sbjct: 394 HGDNVFLKPGERVLNWTAGPPGDLKPW 420


>gi|226499780|ref|NP_001150738.1| secondary cell wall-related glycosyltransferase family 47 precursor
           [Zea mays]
 gi|195641374|gb|ACG40155.1| secondary cell wall-related glycosyltransferase family 47 [Zea
           mays]
 gi|223974207|gb|ACN31291.1| unknown [Zea mays]
          Length = 415

 Score =  724 bits (1868), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 339/394 (86%), Positives = 365/394 (92%)

Query: 13  VSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIF 72
           ++A    + ERI GSAGDVLED+PVG+LKV+VY+LPSKYNKKLL KDPRCL HMFAAEIF
Sbjct: 22  IAAVTAQETERIEGSAGDVLEDNPVGRLKVFVYDLPSKYNKKLLKKDPRCLNHMFAAEIF 81

Query: 73  MHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYW 132
           MHRFLLSS VRT NPEEADWFYTP+Y TCDLTP GLPLPFKSPRMMRSAIQLI++NWPYW
Sbjct: 82  MHRFLLSSAVRTFNPEEADWFYTPVYTTCDLTPKGLPLPFKSPRMMRSAIQLIATNWPYW 141

Query: 133 NRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITI 192
           NR+EGADHFFV PHDFGACFHYQEEKAI RGILPLLQRATLVQTFGQ+NHVCL +GSITI
Sbjct: 142 NRSEGADHFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLKDGSITI 201

Query: 193 PPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLF 252
           PPYAPPQKMQ H IP DTPRSIFVYFRGLFYD  NDPEGGYYARGARA+VWENFKNNPLF
Sbjct: 202 PPYAPPQKMQTHLIPADTPRSIFVYFRGLFYDTGNDPEGGYYARGARASVWENFKNNPLF 261

Query: 253 DISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 312
           DISTDHP TYYEDMQR++FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW
Sbjct: 262 DISTDHPPTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 321

Query: 313 EEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILN 372
           EEIG+FVAEEDVP+LD+ILTSIP +V+LRKQRLLANPSMK+AMLFPQPAQ GDAFHQILN
Sbjct: 322 EEIGVFVAEEDVPRLDSILTSIPTDVVLRKQRLLANPSMKQAMLFPQPAQAGDAFHQILN 381

Query: 373 GLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           GLARKLPH  + +LKPG + LNWT+GP GDLKPW
Sbjct: 382 GLARKLPHGSNVFLKPGERALNWTAGPPGDLKPW 415


>gi|357126596|ref|XP_003564973.1| PREDICTED: probable glucuronosyltransferase Os01g0926600-like
           [Brachypodium distachyon]
          Length = 411

 Score =  723 bits (1865), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 343/394 (87%), Positives = 366/394 (92%), Gaps = 3/394 (0%)

Query: 13  VSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIF 72
           VSA +T   ERI GSAGDVLEDDPVG+LKVYVYELP+KYNKK++ KD RCL+HMFAAEIF
Sbjct: 21  VSAQDT---ERIEGSAGDVLEDDPVGRLKVYVYELPTKYNKKMVAKDSRCLSHMFAAEIF 77

Query: 73  MHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYW 132
           MHRFLLSS +RT+NPEEADWFYTP+Y TCDLTP G PLPFKSPR+MRSAIQ ISS+WPYW
Sbjct: 78  MHRFLLSSAIRTMNPEEADWFYTPVYTTCDLTPWGHPLPFKSPRIMRSAIQFISSHWPYW 137

Query: 133 NRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITI 192
           NRT GADHFFVVPHDFGACFHYQEEKAIERGILPLL+RATLVQTFGQ++HVCL EGSI I
Sbjct: 138 NRTAGADHFFVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQKDHVCLKEGSINI 197

Query: 193 PPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLF 252
           PPYAPPQKM+ H +PP+TPRSIFVYFRGLFYD  NDPEGGYYARGARA+VWENFKNNPLF
Sbjct: 198 PPYAPPQKMKTHLVPPETPRSIFVYFRGLFYDTANDPEGGYYARGARASVWENFKNNPLF 257

Query: 253 DISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 312
           DISTDHP TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW
Sbjct: 258 DISTDHPPTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 317

Query: 313 EEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILN 372
           +EIG+FVAE+DVPKLDTILTSIP EVILRKQRLLANPSMK+AMLFPQPAQPGDAFHQILN
Sbjct: 318 DEIGVFVAEDDVPKLDTILTSIPMEVILRKQRLLANPSMKQAMLFPQPAQPGDAFHQILN 377

Query: 373 GLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           GLARKLPH+K  YL PG K LNWT GP GDLKPW
Sbjct: 378 GLARKLPHNKGVYLTPGQKALNWTQGPKGDLKPW 411


>gi|115441967|ref|NP_001045263.1| Os01g0926600 [Oryza sativa Japonica Group]
 gi|75159222|sp|Q8S1X8.1|GT14_ORYSJ RecName: Full=Probable glucuronosyltransferase Os01g0926600
 gi|20160727|dbj|BAB89669.1| P0482D04.16 [Oryza sativa Japonica Group]
 gi|20805225|dbj|BAB92892.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
 gi|113534794|dbj|BAF07177.1| Os01g0926600 [Oryza sativa Japonica Group]
 gi|125528941|gb|EAY77055.1| hypothetical protein OsI_05013 [Oryza sativa Indica Group]
 gi|125573177|gb|EAZ14692.1| hypothetical protein OsJ_04617 [Oryza sativa Japonica Group]
 gi|215687152|dbj|BAG90922.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 415

 Score =  722 bits (1863), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 340/385 (88%), Positives = 363/385 (94%)

Query: 22  ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP 81
           ERI GSAGDVLEDDPVG+LKVYVYELP+KYNKK++ KD RCL+HMFAAEIFMHRFLLSS 
Sbjct: 31  ERIEGSAGDVLEDDPVGRLKVYVYELPTKYNKKMVAKDSRCLSHMFAAEIFMHRFLLSSA 90

Query: 82  VRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 141
           +RTLNPEEADWFYTP+Y TCDLTP G PLPFKSPR+MRSAIQ ISS+WPYWNRT+GADHF
Sbjct: 91  IRTLNPEEADWFYTPVYTTCDLTPWGHPLPFKSPRIMRSAIQFISSHWPYWNRTDGADHF 150

Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
           FVVPHDFGACFHYQEEKAIERGILPLL+RATLVQTFGQ++HVCL EGSITIPPYAPPQKM
Sbjct: 151 FVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQKDHVCLKEGSITIPPYAPPQKM 210

Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
           + H +PP+TPRSIFVYFRGLFYD  NDPEGGYYARGARA+VWENFKNNPLFDISTDHP T
Sbjct: 211 KTHLVPPETPRSIFVYFRGLFYDTANDPEGGYYARGARASVWENFKNNPLFDISTDHPPT 270

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           YYEDMQR+IFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW+EIG+FVAE
Sbjct: 271 YYEDMQRSIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWDEIGVFVAE 330

Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHD 381
           +DVPKLDTILTSIP +VILRKQRLLANPSMK+AMLFPQPAQPGDAFHQILNGL RKLPH 
Sbjct: 331 DDVPKLDTILTSIPMDVILRKQRLLANPSMKQAMLFPQPAQPGDAFHQILNGLGRKLPHP 390

Query: 382 KSTYLKPGGKFLNWTSGPVGDLKPW 406
           KS YL PG K LNWT GPVGDLKPW
Sbjct: 391 KSVYLDPGQKVLNWTQGPVGDLKPW 415


>gi|413951430|gb|AFW84079.1| secondary cell wall glycosyltransferase family 47 [Zea mays]
          Length = 421

 Score =  721 bits (1862), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 339/387 (87%), Positives = 365/387 (94%)

Query: 20  KIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLS 79
           + ERISGSAGDVLED+PVG+LKVYVY+LPSKYNKKL+ KDPRCL HMFAAEIFMHRFLLS
Sbjct: 35  QTERISGSAGDVLEDNPVGRLKVYVYDLPSKYNKKLVKKDPRCLNHMFAAEIFMHRFLLS 94

Query: 80  SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGAD 139
           S VRT NPEEADWFYTP+Y TCDLTP+GLPLPFKSPRMMRSAI+LI++NWPYWNR+EGAD
Sbjct: 95  SAVRTFNPEEADWFYTPVYATCDLTPSGLPLPFKSPRMMRSAIELIATNWPYWNRSEGAD 154

Query: 140 HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQ 199
           HFFV PHDFGACFHYQEEKAI RGILPLLQRATLVQTFGQ+NHVCL  GSITIPP+APPQ
Sbjct: 155 HFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLKGGSITIPPFAPPQ 214

Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
           KMQAH IP DTPRSIFVYFRGLFYD +NDPEGGYYARGARA+VWENFKNNPLFDISTDHP
Sbjct: 215 KMQAHLIPLDTPRSIFVYFRGLFYDTSNDPEGGYYARGARASVWENFKNNPLFDISTDHP 274

Query: 260 TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV 319
            TYYEDMQR++FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG+FV
Sbjct: 275 PTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFV 334

Query: 320 AEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
           AEEDVPKLD+ILTSIP +VILRKQRLLANP+MK+AMLFPQPAQ GDAFHQILNGLARKLP
Sbjct: 335 AEEDVPKLDSILTSIPTDVILRKQRLLANPAMKQAMLFPQPAQAGDAFHQILNGLARKLP 394

Query: 380 HDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           H  + +LKPG + LNWT+GP GDLKPW
Sbjct: 395 HGDNVFLKPGERVLNWTAGPPGDLKPW 421


>gi|301072484|gb|ADK56172.1| glycosyltransferase 47 [Triticum aestivum]
          Length = 415

 Score =  721 bits (1861), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 339/385 (88%), Positives = 361/385 (93%)

Query: 22  ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP 81
           ERI GSAGDVLEDDPVG+LKVYVYELP+KYNKK++ KD RCL+HMFAAEIFMHRFLLSS 
Sbjct: 31  ERIEGSAGDVLEDDPVGRLKVYVYELPTKYNKKMVAKDSRCLSHMFAAEIFMHRFLLSSA 90

Query: 82  VRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 141
           +RT+NPEEADWFYTP+Y TCDLTP G PLPFKSPR+MRSA+Q ISS+WPYWNRT GADHF
Sbjct: 91  IRTMNPEEADWFYTPVYTTCDLTPWGHPLPFKSPRIMRSAVQFISSHWPYWNRTAGADHF 150

Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
           FVVPHDFGACFHYQEEKAIERGILPLL+RATLVQTFGQ++HVCL EGSI IPPYAPPQKM
Sbjct: 151 FVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQKDHVCLKEGSINIPPYAPPQKM 210

Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
           + H +PP+TPRSIFVYFRGLFYD  NDPEGGYYARGARA+VWENFKNNPLFDISTDHP T
Sbjct: 211 KTHLVPPETPRSIFVYFRGLFYDTANDPEGGYYARGARASVWENFKNNPLFDISTDHPPT 270

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW+EIG+FVAE
Sbjct: 271 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWDEIGVFVAE 330

Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHD 381
           +DVPKLDTILTSIP EVILRKQRLLANPSMK+AMLFPQPAQPGDAFHQILNGLARKLPH 
Sbjct: 331 DDVPKLDTILTSIPMEVILRKQRLLANPSMKQAMLFPQPAQPGDAFHQILNGLARKLPHS 390

Query: 382 KSTYLKPGGKFLNWTSGPVGDLKPW 406
           KS YL PG K LNWT GP GDLKPW
Sbjct: 391 KSVYLTPGQKALNWTQGPKGDLKPW 415


>gi|195619204|gb|ACG31432.1| secondary cell wall-related glycosyltransferase family 47 [Zea
           mays]
          Length = 419

 Score =  721 bits (1860), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 341/404 (84%), Positives = 367/404 (90%), Gaps = 2/404 (0%)

Query: 5   AIALLL--AFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRC 62
           A++LL     V+       ERI GSAGDVLEDDPVG+LKVYVYELP+KYNKK++ KD RC
Sbjct: 16  AVSLLACSGAVAVARAQDTERIEGSAGDVLEDDPVGRLKVYVYELPTKYNKKMVAKDSRC 75

Query: 63  LTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAI 122
           L+HMFAAEIFMHRFLLSS +RTLNPEEADWFYTP+Y TCDLTP G PLPFKSPR+MRSAI
Sbjct: 76  LSHMFAAEIFMHRFLLSSAIRTLNPEEADWFYTPVYTTCDLTPWGHPLPFKSPRIMRSAI 135

Query: 123 QLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNH 182
           Q IS+ WPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERG+LPLL+RATLVQTFGQ++H
Sbjct: 136 QFISNRWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGVLPLLRRATLVQTFGQKDH 195

Query: 183 VCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAV 242
           VCL EGSITIPPYAPPQKM+ H +PP TPRSIFVYFRGLFYD  NDPEGGYYARGARA+V
Sbjct: 196 VCLKEGSITIPPYAPPQKMKTHLVPPGTPRSIFVYFRGLFYDTANDPEGGYYARGARASV 255

Query: 243 WENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 302
           WENFKNNPLFDISTDHP TYYEDMQRA+FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI
Sbjct: 256 WENFKNNPLFDISTDHPPTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 315

Query: 303 VLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQ 362
           VLPFADAIPWEEIG+FVAE+DVP+LDTILTSIP EVILRKQRLLANPSMK+AMLFPQPAQ
Sbjct: 316 VLPFADAIPWEEIGVFVAEDDVPRLDTILTSIPMEVILRKQRLLANPSMKQAMLFPQPAQ 375

Query: 363 PGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
            GDAFHQILNGLARKLPH +  +LKPG K LNWT GP GDLKPW
Sbjct: 376 AGDAFHQILNGLARKLPHGEDVFLKPGQKVLNWTEGPRGDLKPW 419


>gi|326511078|dbj|BAJ91886.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 415

 Score =  720 bits (1859), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 339/385 (88%), Positives = 360/385 (93%)

Query: 22  ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP 81
           ERI GSAGDVLEDDPVG+LKVYVYELP+KYNKK++ KD RCL+HMFAAEIFMHRFLLSS 
Sbjct: 31  ERIEGSAGDVLEDDPVGRLKVYVYELPTKYNKKMVAKDSRCLSHMFAAEIFMHRFLLSSA 90

Query: 82  VRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 141
           +RT+NPEEADWFYTP+Y TCDLTP G PLPFKSPR+MRSA+Q ISS+WPYWNRT GADHF
Sbjct: 91  IRTMNPEEADWFYTPVYTTCDLTPWGHPLPFKSPRIMRSAVQFISSHWPYWNRTAGADHF 150

Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
           FVVPHDFGACFHYQEEKAIERGILPLL+RATLVQTFGQ++HVCL EGSI IPPYAPPQKM
Sbjct: 151 FVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQKDHVCLKEGSINIPPYAPPQKM 210

Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
           + H +PP+TPRSIFVYFRGLFYD  NDPEGGYYARGARA+VWENFKNNPLFDISTDHP T
Sbjct: 211 KTHLVPPETPRSIFVYFRGLFYDTANDPEGGYYARGARASVWENFKNNPLFDISTDHPPT 270

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW+EIGMFVAE
Sbjct: 271 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWDEIGMFVAE 330

Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHD 381
           +DVPKLDTILTSIP EVILRKQRLLANPSMK+AMLFPQPAQ GDAFHQILNGLARKLPH 
Sbjct: 331 DDVPKLDTILTSIPMEVILRKQRLLANPSMKQAMLFPQPAQAGDAFHQILNGLARKLPHS 390

Query: 382 KSTYLKPGGKFLNWTSGPVGDLKPW 406
           KS YL PG K LNWT GP GDLKPW
Sbjct: 391 KSVYLTPGQKALNWTQGPKGDLKPW 415


>gi|242059741|ref|XP_002459016.1| hypothetical protein SORBIDRAFT_03g044520 [Sorghum bicolor]
 gi|241930991|gb|EES04136.1| hypothetical protein SORBIDRAFT_03g044520 [Sorghum bicolor]
          Length = 417

 Score =  720 bits (1859), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 340/394 (86%), Positives = 363/394 (92%)

Query: 13  VSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIF 72
           V+       ERI GSAGDVLEDDPVG+LKVYVYELP+KYNKK++ KD RCL+HMFAAEIF
Sbjct: 24  VAVARAQDTERIEGSAGDVLEDDPVGRLKVYVYELPTKYNKKMVAKDSRCLSHMFAAEIF 83

Query: 73  MHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYW 132
           MHRFLLSS +RTLNPEEADWFYTP+Y TCDLTP G PLPFKSPR+MRSAIQ IS+ WPYW
Sbjct: 84  MHRFLLSSAIRTLNPEEADWFYTPVYTTCDLTPWGHPLPFKSPRIMRSAIQFISNRWPYW 143

Query: 133 NRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITI 192
           NRTEGADHFFVVPHDFGACFHYQEEKAIERG+LPLL+RATLVQTFGQ++HVCL EGSITI
Sbjct: 144 NRTEGADHFFVVPHDFGACFHYQEEKAIERGVLPLLRRATLVQTFGQKDHVCLKEGSITI 203

Query: 193 PPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLF 252
           PPYAPPQKM+ H +PP TPRSIFVYFRGLFYD  NDPEGGYYARGARA+VWENFKNNPLF
Sbjct: 204 PPYAPPQKMKTHLVPPGTPRSIFVYFRGLFYDTANDPEGGYYARGARASVWENFKNNPLF 263

Query: 253 DISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 312
           DISTDHP TYYEDMQRA+FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW
Sbjct: 264 DISTDHPPTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 323

Query: 313 EEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILN 372
           EEIG+FVAE+DVPKLDTILTSIP EVILRKQRLLANPSMK+AMLFPQPAQPGDAFHQILN
Sbjct: 324 EEIGVFVAEDDVPKLDTILTSIPMEVILRKQRLLANPSMKQAMLFPQPAQPGDAFHQILN 383

Query: 373 GLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           GLARKLPH +  +LKPG K LNWT GP GDLKPW
Sbjct: 384 GLARKLPHGEDVFLKPGLKVLNWTEGPRGDLKPW 417


>gi|363543255|ref|NP_001241842.1| uncharacterized protein LOC100857042 precursor [Zea mays]
 gi|194704652|gb|ACF86410.1| unknown [Zea mays]
 gi|224034207|gb|ACN36179.1| unknown [Zea mays]
 gi|414878896|tpg|DAA56027.1| TPA: hypothetical protein ZEAMMB73_615997 [Zea mays]
          Length = 418

 Score =  720 bits (1859), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 337/388 (86%), Positives = 366/388 (94%), Gaps = 1/388 (0%)

Query: 20  KIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLS 79
           + ERISGSAGDVLED+PVG+LKVYVY+LPSKYNKKLL KDPRCL+HMFAAEIFMHRFLLS
Sbjct: 31  QTERISGSAGDVLEDNPVGRLKVYVYDLPSKYNKKLLKKDPRCLSHMFAAEIFMHRFLLS 90

Query: 80  SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGAD 139
           S VRT NPEEADWFYTP+Y TCDLTP+GLPLPFKSPRMMRSAI+L+++NWPYWNR+EGAD
Sbjct: 91  SAVRTFNPEEADWFYTPVYTTCDLTPSGLPLPFKSPRMMRSAIELVATNWPYWNRSEGAD 150

Query: 140 HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQ 199
           HFFV PHDFGACFHYQEEKAI RGILPLLQRATLVQTFGQ+NHVCL +GSITIPP+APPQ
Sbjct: 151 HFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLKDGSITIPPFAPPQ 210

Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
           KMQAH IP DTPRSIFVYFRGLFYD +NDPEGGYYARGARA+VWENFKNNPLFDISTDHP
Sbjct: 211 KMQAHLIPADTPRSIFVYFRGLFYDTSNDPEGGYYARGARASVWENFKNNPLFDISTDHP 270

Query: 260 TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV 319
            TYYEDMQR++FCLCPLGWAPWSPRLVEAVVFGCIPVI+ADDIVLPFADAIPWE+IG+FV
Sbjct: 271 ATYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIVADDIVLPFADAIPWEDIGVFV 330

Query: 320 AEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
           AEEDVP+LDTILTSIP +V+LRKQRLLANPSMK+AMLFPQPAQ GDAFHQILNGLARKLP
Sbjct: 331 AEEDVPRLDTILTSIPTDVVLRKQRLLANPSMKQAMLFPQPAQAGDAFHQILNGLARKLP 390

Query: 380 HDKSTYLKPGG-KFLNWTSGPVGDLKPW 406
           H    +LKPG  + LNWT+GP GDLKPW
Sbjct: 391 HGDGVFLKPGDERVLNWTAGPPGDLKPW 418


>gi|194697530|gb|ACF82849.1| unknown [Zea mays]
          Length = 418

 Score =  719 bits (1857), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 337/388 (86%), Positives = 365/388 (94%), Gaps = 1/388 (0%)

Query: 20  KIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLS 79
           + ERISGSAGDVLED+PVG+LKVYVY+LPSKYNKKLL KDPRCL HMFAAEIFMHRFLLS
Sbjct: 31  QTERISGSAGDVLEDNPVGRLKVYVYDLPSKYNKKLLKKDPRCLNHMFAAEIFMHRFLLS 90

Query: 80  SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGAD 139
           S VRT NPEEADWFYTP+Y TCDLTP+GLPLPFKSPRMMRSAI+L+++NWPYWNR+EGAD
Sbjct: 91  SAVRTFNPEEADWFYTPVYTTCDLTPSGLPLPFKSPRMMRSAIELVATNWPYWNRSEGAD 150

Query: 140 HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQ 199
           HFFV PHDFGACFHYQEEKAI RGILPLLQRATLVQTFGQ+NHVCL +GSITIPP+APPQ
Sbjct: 151 HFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLKDGSITIPPFAPPQ 210

Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
           KMQAH IP DTPRSIFVYFRGLFYD +NDPEGGYYARGARA+VWENFKNNPLFDISTDHP
Sbjct: 211 KMQAHLIPADTPRSIFVYFRGLFYDTSNDPEGGYYARGARASVWENFKNNPLFDISTDHP 270

Query: 260 TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV 319
            TYYEDMQR++FCLCPLGWAPWSPRLVEAVVFGCIPVI+ADDIVLPFADAIPWE+IG+FV
Sbjct: 271 ATYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIVADDIVLPFADAIPWEDIGVFV 330

Query: 320 AEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
           AEEDVP+LDTILTSIP +V+LRKQRLLANPSMK+AMLFPQPAQ GDAFHQILNGLARKLP
Sbjct: 331 AEEDVPRLDTILTSIPTDVVLRKQRLLANPSMKQAMLFPQPAQAGDAFHQILNGLARKLP 390

Query: 380 HDKSTYLKPGG-KFLNWTSGPVGDLKPW 406
           H    +LKPG  + LNWT+GP GDLKPW
Sbjct: 391 HGDGVFLKPGDERVLNWTAGPPGDLKPW 418


>gi|226503833|ref|NP_001149319.1| secondary cell wall-related glycosyltransferase family 47 precursor
           [Zea mays]
 gi|195626366|gb|ACG35013.1| secondary cell wall-related glycosyltransferase family 47 [Zea
           mays]
          Length = 421

 Score =  719 bits (1856), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 338/387 (87%), Positives = 364/387 (94%)

Query: 20  KIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLS 79
           + ERISGSAGDVLED+PVG+LKVYVY+LPSKYNKKL+ KDPRCL HMFAAEIFMHRFLLS
Sbjct: 35  QTERISGSAGDVLEDNPVGRLKVYVYDLPSKYNKKLVKKDPRCLNHMFAAEIFMHRFLLS 94

Query: 80  SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGAD 139
           S VRT NPEEADWFYTP+Y TCDLTP+GLPLPFKSPRMMRSAI+LI++NWPYWNR+EGAD
Sbjct: 95  SAVRTFNPEEADWFYTPVYATCDLTPSGLPLPFKSPRMMRSAIELIATNWPYWNRSEGAD 154

Query: 140 HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQ 199
           HFFV PHDFGACFHYQEEKAI RGILPLLQRATLVQTFGQ+NHVCL  GSI IPP+APPQ
Sbjct: 155 HFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLKGGSIXIPPFAPPQ 214

Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
           KMQAH IP DTPRSIFVYFRGLFYD +NDPEGGYYARGARA+VWENFKNNPLFDISTDHP
Sbjct: 215 KMQAHLIPLDTPRSIFVYFRGLFYDTSNDPEGGYYARGARASVWENFKNNPLFDISTDHP 274

Query: 260 TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV 319
            TYYEDMQR++FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG+FV
Sbjct: 275 PTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFV 334

Query: 320 AEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
           AEEDVPKLD+ILTSIP +VILRKQRLLANP+MK+AMLFPQPAQ GDAFHQILNGLARKLP
Sbjct: 335 AEEDVPKLDSILTSIPTDVILRKQRLLANPAMKQAMLFPQPAQAGDAFHQILNGLARKLP 394

Query: 380 HDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           H  + +LKPG + LNWT+GP GDLKPW
Sbjct: 395 HGDNVFLKPGERVLNWTAGPPGDLKPW 421


>gi|326509565|dbj|BAJ86998.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326509967|dbj|BAJ87200.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 415

 Score =  718 bits (1854), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 338/385 (87%), Positives = 360/385 (93%)

Query: 22  ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP 81
           ERI GSAGDVLEDDPVG+LKVYVYELP+KYNKK++ KD RCL+HMFAAEIFMHRFLLSS 
Sbjct: 31  ERIEGSAGDVLEDDPVGRLKVYVYELPTKYNKKMVAKDSRCLSHMFAAEIFMHRFLLSSA 90

Query: 82  VRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 141
           +RT+NPEEADWFYTP+Y TCDLTP G PLPFKSPR+MRSA+Q ISS+WPYWNRT GADHF
Sbjct: 91  IRTMNPEEADWFYTPVYTTCDLTPWGHPLPFKSPRIMRSAVQFISSHWPYWNRTAGADHF 150

Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
           FVVPHDFGACFHYQEEKAIERGILPLL+RATLVQTFGQ++HVCL EGSI IPPYAPPQKM
Sbjct: 151 FVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQKDHVCLKEGSINIPPYAPPQKM 210

Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
           + H +PP+TPRSIFVYFRGLFYD  NDPEGGYYARGARA+VWENFKNNPLFDISTDHP T
Sbjct: 211 KTHLVPPETPRSIFVYFRGLFYDTANDPEGGYYARGARASVWENFKNNPLFDISTDHPPT 270

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW+EIG+FVAE
Sbjct: 271 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWDEIGVFVAE 330

Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHD 381
           +DVPKLDTILTSIP EVILRKQRLLANPSMK+AMLFPQPAQ GDAFHQILNGLARKLPH 
Sbjct: 331 DDVPKLDTILTSIPMEVILRKQRLLANPSMKQAMLFPQPAQAGDAFHQILNGLARKLPHS 390

Query: 382 KSTYLKPGGKFLNWTSGPVGDLKPW 406
           KS YL PG K LNWT GP GDLKPW
Sbjct: 391 KSVYLTPGQKALNWTQGPKGDLKPW 415


>gi|414878899|tpg|DAA56030.1| TPA: secondary cell wall glycosyltransferase family 47 isoform 1
           [Zea mays]
 gi|414878900|tpg|DAA56031.1| TPA: secondary cell wall glycosyltransferase family 47 isoform 2
           [Zea mays]
          Length = 419

 Score =  718 bits (1853), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 340/404 (84%), Positives = 366/404 (90%), Gaps = 2/404 (0%)

Query: 5   AIALLL--AFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRC 62
           A++LL     V+       ERI GSAGDVLEDDPVG+LKVYVYELP+KYNKK++ KD RC
Sbjct: 16  AVSLLACSGAVAVARAQDTERIEGSAGDVLEDDPVGRLKVYVYELPTKYNKKMVAKDSRC 75

Query: 63  LTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAI 122
           L+HMFAAEIFMHRFLLSS +RTLNPE ADWFYTP+Y TCDLTP G PLPFKSPR+MRSAI
Sbjct: 76  LSHMFAAEIFMHRFLLSSAIRTLNPEVADWFYTPVYTTCDLTPWGHPLPFKSPRIMRSAI 135

Query: 123 QLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNH 182
           Q IS+ WPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERG+LPLL+RATLVQTFGQ++H
Sbjct: 136 QFISNRWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGVLPLLRRATLVQTFGQKDH 195

Query: 183 VCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAV 242
           VCL EGSITIPPYAPPQKM+ H +PP TPRSIFVYFRGLFYD  NDPEGGYYARGARA+V
Sbjct: 196 VCLKEGSITIPPYAPPQKMKTHLVPPGTPRSIFVYFRGLFYDTANDPEGGYYARGARASV 255

Query: 243 WENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 302
           WENFKNNPLFDISTDHP TYYEDMQRA+FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI
Sbjct: 256 WENFKNNPLFDISTDHPPTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 315

Query: 303 VLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQ 362
           VLPFADAIPWEEIG+FVAE+DVP+LDTILTSIP EVILRKQRLLANPSMK+AMLFPQPAQ
Sbjct: 316 VLPFADAIPWEEIGVFVAEDDVPRLDTILTSIPMEVILRKQRLLANPSMKQAMLFPQPAQ 375

Query: 363 PGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
            GDAFHQILNGLARKLPH +  +LKPG K LNWT GP GDLKPW
Sbjct: 376 AGDAFHQILNGLARKLPHGEDVFLKPGQKVLNWTEGPRGDLKPW 419


>gi|226499284|ref|NP_001142343.1| uncharacterized protein LOC100274514 precursor [Zea mays]
 gi|194708308|gb|ACF88238.1| unknown [Zea mays]
          Length = 419

 Score =  717 bits (1850), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 339/404 (83%), Positives = 366/404 (90%), Gaps = 2/404 (0%)

Query: 5   AIALLL--AFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRC 62
           A++LL     V+       ERI GSAGDVLEDDPVG+LKVYVYELP+KYNKK++ KD RC
Sbjct: 16  AVSLLACSGAVAVARAQDTERIEGSAGDVLEDDPVGRLKVYVYELPTKYNKKMVAKDSRC 75

Query: 63  LTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAI 122
           L+HMFAAEIFMHRFLLSS +RTLNPE ADWFYTP+Y TCDLTP G PLPF+SPR+MRSAI
Sbjct: 76  LSHMFAAEIFMHRFLLSSAIRTLNPEVADWFYTPVYTTCDLTPWGHPLPFRSPRIMRSAI 135

Query: 123 QLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNH 182
           Q IS+ WPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERG+LPLL+RATLVQTFGQ++H
Sbjct: 136 QFISNRWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGVLPLLRRATLVQTFGQKDH 195

Query: 183 VCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAV 242
           VCL EGSITIPPYAPPQKM+ H +PP TPRSIFVYFRGLFYD  NDPEGGYYARGARA+V
Sbjct: 196 VCLKEGSITIPPYAPPQKMKTHLVPPGTPRSIFVYFRGLFYDTANDPEGGYYARGARASV 255

Query: 243 WENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 302
           WENFKNNPLFDISTDHP TYYEDMQRA+FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI
Sbjct: 256 WENFKNNPLFDISTDHPPTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 315

Query: 303 VLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQ 362
           VLPFADAIPWEEIG+FVAE+DVP+LDTILTSIP EVILRKQRLLANPSMK+AMLFPQPAQ
Sbjct: 316 VLPFADAIPWEEIGVFVAEDDVPRLDTILTSIPMEVILRKQRLLANPSMKQAMLFPQPAQ 375

Query: 363 PGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
            GDAFHQILNGLARKLPH +  +LKPG K LNWT GP GDLKPW
Sbjct: 376 AGDAFHQILNGLARKLPHGEDVFLKPGQKVLNWTEGPRGDLKPW 419


>gi|357126610|ref|XP_003564980.1| PREDICTED: probable glucuronosyltransferase Os01g0926700-like
           [Brachypodium distachyon]
          Length = 418

 Score =  716 bits (1848), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 335/387 (86%), Positives = 364/387 (94%)

Query: 20  KIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLS 79
           + ERISGSAGDVL+D+PVG+LKVY+Y+LP KYNKKLL KDPRCL HMFAAEIFMHRFLLS
Sbjct: 32  QTERISGSAGDVLDDNPVGRLKVYIYDLPGKYNKKLLKKDPRCLNHMFAAEIFMHRFLLS 91

Query: 80  SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGAD 139
           S VRT NPEEADWFYTP+Y TCDLTP+GLPLPFKSPRM+RSAIQLI+  WPYWNR+EGAD
Sbjct: 92  SAVRTFNPEEADWFYTPVYATCDLTPSGLPLPFKSPRMVRSAIQLIAEKWPYWNRSEGAD 151

Query: 140 HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQ 199
           HFFV PHDFGACFHYQEEKAI RGILPLLQRATLVQTFGQ+NHVCL +GSITIPPYAPPQ
Sbjct: 152 HFFVAPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLKDGSITIPPYAPPQ 211

Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
           KMQ H IP DTPRSIFVYFRGLFYD +NDPEGGYYARGARA+VWENFKNNPLFDISTDHP
Sbjct: 212 KMQNHLIPGDTPRSIFVYFRGLFYDTSNDPEGGYYARGARASVWENFKNNPLFDISTDHP 271

Query: 260 TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV 319
            TYYEDMQR++FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE+IG+FV
Sbjct: 272 PTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEDIGVFV 331

Query: 320 AEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
           +E+DVPKLD+ILTSIP +VILRKQRLLANPSMK+AMLFPQPA+ GDAFHQILNGLARKLP
Sbjct: 332 SEDDVPKLDSILTSIPTDVILRKQRLLANPSMKQAMLFPQPAEAGDAFHQILNGLARKLP 391

Query: 380 HDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           H ++ +LKPG K LNWT+GPVGDLKPW
Sbjct: 392 HGENVFLKPGQKVLNWTAGPVGDLKPW 418


>gi|326488903|dbj|BAJ98063.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 421

 Score =  716 bits (1847), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 336/387 (86%), Positives = 362/387 (93%)

Query: 20  KIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLS 79
           + ERISGSAGDVL+DDPVG+LKV++Y+LP KYNKKLL KDPRCL HMFAAEIFMHRFLLS
Sbjct: 35  QTERISGSAGDVLDDDPVGRLKVFIYDLPGKYNKKLLKKDPRCLNHMFAAEIFMHRFLLS 94

Query: 80  SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGAD 139
           S VRT NPEEADWFYTP+YPTCDLTP+GLPLPFKSPRMMRSAI+LI++ WPYWNR+EGAD
Sbjct: 95  SAVRTTNPEEADWFYTPVYPTCDLTPSGLPLPFKSPRMMRSAIELIATKWPYWNRSEGAD 154

Query: 140 HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQ 199
           HFFV PHDFGACFHYQEEKAI RGILPLLQRATLVQTFGQ+NHVCL +GSITIPP+APPQ
Sbjct: 155 HFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLKDGSITIPPFAPPQ 214

Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
           KMQ H IP +TPRSIFVYFRGLFYD  NDPEGGYYARGARA+VWENFKNNPLFDISTDHP
Sbjct: 215 KMQNHLIPGETPRSIFVYFRGLFYDTGNDPEGGYYARGARASVWENFKNNPLFDISTDHP 274

Query: 260 TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV 319
            TYYEDMQR++FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG+FV
Sbjct: 275 PTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFV 334

Query: 320 AEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
            EEDVP+LD+ILTSIP E ILRKQRLLANPSMK+AMLFPQPAQ GDAFHQILNGLARKLP
Sbjct: 335 PEEDVPRLDSILTSIPTEDILRKQRLLANPSMKQAMLFPQPAQAGDAFHQILNGLARKLP 394

Query: 380 HDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           H +S +LKPG   LNWT+GPVGDLKPW
Sbjct: 395 HGESVFLKPGQARLNWTAGPVGDLKPW 421


>gi|242061766|ref|XP_002452172.1| hypothetical protein SORBIDRAFT_04g021180 [Sorghum bicolor]
 gi|241932003|gb|EES05148.1| hypothetical protein SORBIDRAFT_04g021180 [Sorghum bicolor]
          Length = 432

 Score =  716 bits (1847), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 333/385 (86%), Positives = 362/385 (94%)

Query: 22  ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP 81
           ERI+GSAGDVLED+P+G+LKV++Y+LPSKYNK+++ KDPRCL HMFAAEIFMHRFLLSS 
Sbjct: 48  ERIAGSAGDVLEDNPIGRLKVFIYDLPSKYNKRIVTKDPRCLHHMFAAEIFMHRFLLSSA 107

Query: 82  VRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 141
           VRTLNPEEADWFYTP+Y TCDLTP GLPLPFKSPRMMRSAI+ IS+ WP+WNRT+GADHF
Sbjct: 108 VRTLNPEEADWFYTPVYTTCDLTPAGLPLPFKSPRMMRSAIKFISNKWPFWNRTDGADHF 167

Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
           FVVPHDFGACFHYQEEKA ERGILP+L+RATLVQTFGQ+NHVCL EGSI IPPYAPPQKM
Sbjct: 168 FVVPHDFGACFHYQEEKATERGILPMLRRATLVQTFGQKNHVCLKEGSIIIPPYAPPQKM 227

Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
           QAH +PPDTPRSIFVYFRGLFYD  NDPEGGYYARGARA++WENFKNNPLFDISTDHP T
Sbjct: 228 QAHLVPPDTPRSIFVYFRGLFYDNGNDPEGGYYARGARASLWENFKNNPLFDISTDHPVT 287

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           YYEDMQRA+FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE+IG+FV E
Sbjct: 288 YYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEDIGVFVDE 347

Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHD 381
           EDVPKLD+ILTSIP E ILRKQRLLANPSMK+AMLFPQPAQP DAFHQILNGLARKLPH 
Sbjct: 348 EDVPKLDSILTSIPIENILRKQRLLANPSMKKAMLFPQPAQPRDAFHQILNGLARKLPHM 407

Query: 382 KSTYLKPGGKFLNWTSGPVGDLKPW 406
           +S YL+PG K LNWT+GP GDLKPW
Sbjct: 408 QSVYLQPGEKHLNWTAGPAGDLKPW 432


>gi|301072486|gb|ADK56173.1| glycosyltransferase 47 [Triticum aestivum]
          Length = 422

 Score =  715 bits (1845), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 336/387 (86%), Positives = 361/387 (93%)

Query: 20  KIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLS 79
           + ERISGSAGDVL+DDPVG+LKV++Y+LP KYNKKLL KDPRCL HMFAAEIFMHRFLLS
Sbjct: 36  QTERISGSAGDVLDDDPVGRLKVFIYDLPGKYNKKLLKKDPRCLNHMFAAEIFMHRFLLS 95

Query: 80  SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGAD 139
           S VRT NPEEADWFYTP+YPTCDLTP+GLPLPFKSPRMMRSAI+LI++ WPYWNR+EGAD
Sbjct: 96  SAVRTTNPEEADWFYTPVYPTCDLTPSGLPLPFKSPRMMRSAIELIATKWPYWNRSEGAD 155

Query: 140 HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQ 199
           HFFV PHDFGACFHYQEEKAI RGILPLLQRATLVQTFGQ+NHVCL EGSITIPP+APPQ
Sbjct: 156 HFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLKEGSITIPPFAPPQ 215

Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
           KMQ H IP +TPRSIFVYFRGLFYD  NDPEGGYYARGARA+VWENFKNNPLFDISTDHP
Sbjct: 216 KMQNHLIPGETPRSIFVYFRGLFYDTGNDPEGGYYARGARASVWENFKNNPLFDISTDHP 275

Query: 260 TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV 319
            TYYEDMQR++FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG+FV
Sbjct: 276 PTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFV 335

Query: 320 AEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
            EEDVP+LD+ILTSIP E ILRKQRLLANPSMK+AMLFPQPAQ GDAFHQILNGLARKLP
Sbjct: 336 PEEDVPRLDSILTSIPTEDILRKQRLLANPSMKQAMLFPQPAQAGDAFHQILNGLARKLP 395

Query: 380 HDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           H +  +LKPG   LNWT+GPVGDLKPW
Sbjct: 396 HGEDVFLKPGQARLNWTAGPVGDLKPW 422


>gi|212275360|ref|NP_001130059.1| uncharacterized protein LOC100191151 precursor [Zea mays]
 gi|194688192|gb|ACF78180.1| unknown [Zea mays]
          Length = 418

 Score =  714 bits (1844), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 335/388 (86%), Positives = 364/388 (93%), Gaps = 1/388 (0%)

Query: 20  KIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLS 79
           + ERISGSAGDVLED+PVG+LKVYVY+LPSKYNKKLL KDPRCL+HMFAAEIFMHRFLLS
Sbjct: 31  QTERISGSAGDVLEDNPVGRLKVYVYDLPSKYNKKLLKKDPRCLSHMFAAEIFMHRFLLS 90

Query: 80  SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGAD 139
           S VRT NPEEADWFYTP+Y TCDLTP+GLPLPFKSPRMMRSAI+ +++NWPYWNR+EGAD
Sbjct: 91  SAVRTFNPEEADWFYTPVYTTCDLTPSGLPLPFKSPRMMRSAIERVATNWPYWNRSEGAD 150

Query: 140 HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQ 199
           HFFV PHDFGACFHYQEEKAI RGI PLLQRATLVQTFGQ+NHVCL +GSITIPP+APPQ
Sbjct: 151 HFFVTPHDFGACFHYQEEKAIGRGIPPLLQRATLVQTFGQKNHVCLKDGSITIPPFAPPQ 210

Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
           KMQAH IP DTPRSIFVYFRGLFYD +NDPEGGYYARGARA+VWENFKNNPLFDISTDHP
Sbjct: 211 KMQAHLIPADTPRSIFVYFRGLFYDTSNDPEGGYYARGARASVWENFKNNPLFDISTDHP 270

Query: 260 TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV 319
            TYYEDMQR++FCLCPLGWAPWSPRLVEAVVFGCIPVI+ADDIVLPFADAIPWE+IG+FV
Sbjct: 271 ATYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIVADDIVLPFADAIPWEDIGVFV 330

Query: 320 AEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
           AEEDVP+LDTILTSIP +V+LRKQRLLANPSMK+AMLFPQPAQ GDAFHQILNGLARKLP
Sbjct: 331 AEEDVPRLDTILTSIPTDVVLRKQRLLANPSMKQAMLFPQPAQAGDAFHQILNGLARKLP 390

Query: 380 HDKSTYLKPGG-KFLNWTSGPVGDLKPW 406
           H    +LKPG  + LNWT+GP GDLKPW
Sbjct: 391 HGDGVFLKPGDERVLNWTAGPPGDLKPW 418


>gi|150246869|emb|CAM91796.1| putative glycosyltransferase family 47 [Triticum aestivum]
          Length = 420

 Score =  714 bits (1843), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 336/385 (87%), Positives = 360/385 (93%)

Query: 22  ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP 81
           ERISGSAGDVL+DDPVG+LKV++Y+LP KYNKKLL KDPRCL HMFAAEIFMHRFLLSS 
Sbjct: 36  ERISGSAGDVLDDDPVGRLKVFIYDLPGKYNKKLLKKDPRCLNHMFAAEIFMHRFLLSSA 95

Query: 82  VRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 141
           VRT NPEEADWFYTP+YPTCDLTP+GLPLPFKSPRMMRSAI+LI++ WPYWNR+EGADHF
Sbjct: 96  VRTTNPEEADWFYTPVYPTCDLTPSGLPLPFKSPRMMRSAIELIATKWPYWNRSEGADHF 155

Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
           FV PHDFGACFHYQEEKAI RGILPLLQRATLVQTFGQ+NHVCL EGSITIPP+APPQKM
Sbjct: 156 FVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLKEGSITIPPFAPPQKM 215

Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
           Q H IP +TPRSIFVYFRGLFYD  NDPEGGYYARGARA+VWENFKNNPLFDISTDHP T
Sbjct: 216 QNHLIPGETPRSIFVYFRGLFYDTGNDPEGGYYARGARASVWENFKNNPLFDISTDHPPT 275

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           YYEDMQR++FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG+FV E
Sbjct: 276 YYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVPE 335

Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHD 381
           EDVP+LD+ILTSIP E ILRKQRLLANPSMK+AMLFPQPAQ GDAFHQILNGLARKLPH 
Sbjct: 336 EDVPRLDSILTSIPTEDILRKQRLLANPSMKQAMLFPQPAQAGDAFHQILNGLARKLPHG 395

Query: 382 KSTYLKPGGKFLNWTSGPVGDLKPW 406
           +  +LKPG   LNWT+GPVGDLKPW
Sbjct: 396 EDVFLKPGQVRLNWTAGPVGDLKPW 420


>gi|115458146|ref|NP_001052673.1| Os04g0398600 [Oryza sativa Japonica Group]
 gi|75143148|sp|Q7XLG3.2|GT42_ORYSJ RecName: Full=Probable glucuronosyltransferase Os04g0398600
 gi|38347167|emb|CAE05157.2| OSJNBa0039C07.13 [Oryza sativa Japonica Group]
 gi|113564244|dbj|BAF14587.1| Os04g0398600 [Oryza sativa Japonica Group]
 gi|116309465|emb|CAH66536.1| H0209H04.3 [Oryza sativa Indica Group]
 gi|125548129|gb|EAY93951.1| hypothetical protein OsI_15724 [Oryza sativa Indica Group]
 gi|125590243|gb|EAZ30593.1| hypothetical protein OsJ_14642 [Oryza sativa Japonica Group]
 gi|215706942|dbj|BAG93402.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215741563|dbj|BAG98058.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 420

 Score =  711 bits (1835), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 336/385 (87%), Positives = 359/385 (93%)

Query: 22  ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP 81
           ERISGSAGDVLED+PVG+LKV++Y+LP KYNKK++ KDPRCL HMFAAEIFMHRFLLSS 
Sbjct: 36  ERISGSAGDVLEDNPVGRLKVFIYDLPRKYNKKMVNKDPRCLNHMFAAEIFMHRFLLSSA 95

Query: 82  VRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 141
           VRTLNP+EADWFYTP+Y TCDLTP GLPLPFKSPR+MRSAIQ IS  WP+WNRT+GADHF
Sbjct: 96  VRTLNPKEADWFYTPVYTTCDLTPAGLPLPFKSPRVMRSAIQYISHKWPFWNRTDGADHF 155

Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
           FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQ NHVCL EGSITIPPYAPPQKM
Sbjct: 156 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQENHVCLKEGSITIPPYAPPQKM 215

Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
           QAH IPPDTPRSIFVYFRGLFYD  NDPEGGYYARGARA++WENFKNNPLFDISTDHP T
Sbjct: 216 QAHLIPPDTPRSIFVYFRGLFYDTGNDPEGGYYARGARASLWENFKNNPLFDISTDHPPT 275

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           YYEDMQRA+FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG+FV E
Sbjct: 276 YYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVEE 335

Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHD 381
           +DVPKLDTILTS+P + ILRKQRLLANPSMK+AMLFPQPAQP DAFHQILNGLARKLPH 
Sbjct: 336 KDVPKLDTILTSMPIDDILRKQRLLANPSMKQAMLFPQPAQPRDAFHQILNGLARKLPHP 395

Query: 382 KSTYLKPGGKFLNWTSGPVGDLKPW 406
           +  YL+P  K LNWT+GPVGDLK W
Sbjct: 396 EGVYLQPSDKRLNWTAGPVGDLKAW 420


>gi|125582313|gb|EAZ23244.1| hypothetical protein OsJ_06939 [Oryza sativa Japonica Group]
          Length = 434

 Score =  711 bits (1835), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 334/385 (86%), Positives = 360/385 (93%)

Query: 22  ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP 81
           ERISGSAGDVLED+PVG+LKV+VY+LPSKYNK+++ KDPRCL HMFAAEIFMHRFLLSS 
Sbjct: 49  ERISGSAGDVLEDNPVGRLKVFVYDLPSKYNKRIVAKDPRCLNHMFAAEIFMHRFLLSSA 108

Query: 82  VRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 141
           VRTLNPE+ADWFY P+Y TCDLT  GLPLPFKSPRMMRSAIQ +S  WP+WNRT+GADHF
Sbjct: 109 VRTLNPEQADWFYAPVYTTCDLTHAGLPLPFKSPRMMRSAIQFLSRKWPFWNRTDGADHF 168

Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
           FVVPHDFGACFHYQEEKAIERGILPLL+RATLVQTFGQ+NHVCL EGSITIPPYAPPQKM
Sbjct: 169 FVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQKNHVCLKEGSITIPPYAPPQKM 228

Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
           QAH IPPDTPRSIFVYFRGLFYD  NDPEGGYYARGARA++WENFKNNPLFDIST+HP T
Sbjct: 229 QAHLIPPDTPRSIFVYFRGLFYDNGNDPEGGYYARGARASLWENFKNNPLFDISTEHPAT 288

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           YYEDMQR++FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW+EIG+FV E
Sbjct: 289 YYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWDEIGVFVDE 348

Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHD 381
           EDVP+LD+ILTSIP + ILRKQRLLANPSMK+AMLFPQPAQP DAFHQILNGLARKLPH 
Sbjct: 349 EDVPRLDSILTSIPIDDILRKQRLLANPSMKQAMLFPQPAQPRDAFHQILNGLARKLPHP 408

Query: 382 KSTYLKPGGKFLNWTSGPVGDLKPW 406
            S YLKPG K LNWT+GPV DLKPW
Sbjct: 409 DSVYLKPGEKHLNWTAGPVADLKPW 433


>gi|75122905|sp|Q6H4N0.1|GT21_ORYSJ RecName: Full=Probable glucuronosyltransferase Os02g0520750
 gi|49389081|dbj|BAD26319.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
 gi|215769122|dbj|BAH01351.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 434

 Score =  710 bits (1833), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 334/385 (86%), Positives = 360/385 (93%)

Query: 22  ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP 81
           ERISGSAGDVLED+PVG+LKV+VY+LPSKYNK+++ KDPRCL HMFAAEIFMHRFLLSS 
Sbjct: 49  ERISGSAGDVLEDNPVGRLKVFVYDLPSKYNKRIVAKDPRCLNHMFAAEIFMHRFLLSSA 108

Query: 82  VRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 141
           VRTLNPE+ADWFY P+Y TCDLT  GLPLPFKSPRMMRSAIQ +S  WP+WNRT+GADHF
Sbjct: 109 VRTLNPEQADWFYAPVYTTCDLTHAGLPLPFKSPRMMRSAIQFLSRKWPFWNRTDGADHF 168

Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
           FVVPHDFGACFHYQEEKAIERGILPLL+RATLVQTFGQ+NHVCL EGSITIPPYAPPQKM
Sbjct: 169 FVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQKNHVCLKEGSITIPPYAPPQKM 228

Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
           QAH IPPDTPRSIFVYFRGLFYD  NDPEGGYYARGARA++WENFKNNPLFDIST+HP T
Sbjct: 229 QAHLIPPDTPRSIFVYFRGLFYDNGNDPEGGYYARGARASLWENFKNNPLFDISTEHPAT 288

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           YYEDMQR++FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW+EIG+FV E
Sbjct: 289 YYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWDEIGVFVDE 348

Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHD 381
           EDVP+LD+ILTSIP + ILRKQRLLANPSMK+AMLFPQPAQP DAFHQILNGLARKLPH 
Sbjct: 349 EDVPRLDSILTSIPIDDILRKQRLLANPSMKQAMLFPQPAQPRDAFHQILNGLARKLPHP 408

Query: 382 KSTYLKPGGKFLNWTSGPVGDLKPW 406
            S YLKPG K LNWT+GPV DLKPW
Sbjct: 409 DSVYLKPGEKHLNWTAGPVADLKPW 433


>gi|302823891|ref|XP_002993593.1| glycosyltransferase-like protein [Selaginella moellendorffii]
 gi|300138521|gb|EFJ05285.1| glycosyltransferase-like protein [Selaginella moellendorffii]
          Length = 437

 Score =  709 bits (1829), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 336/384 (87%), Positives = 354/384 (92%)

Query: 22  ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP 81
           ERISGSAGDVLEDDPVG+LKV+VYELP KYN+K+L KDPRCLTHMFA EIF+H FLL S 
Sbjct: 25  ERISGSAGDVLEDDPVGRLKVFVYELPVKYNRKVLEKDPRCLTHMFATEIFVHEFLLGSA 84

Query: 82  VRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 141
           VRTLNPEEADWFYTP+Y TCDLTP GLPLPFKSPRMMRS IQ IS+ WPYWNRTEGADHF
Sbjct: 85  VRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSVIQYISNQWPYWNRTEGADHF 144

Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
           FVVPHDFGACFHYQEEKAIERGIL LLQRATLVQTFGQR HVCL EGSI +PPY PPQKM
Sbjct: 145 FVVPHDFGACFHYQEEKAIERGILHLLQRATLVQTFGQRYHVCLKEGSIVVPPYCPPQKM 204

Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
           QAH IPP  PRSIFVYFRGLFYD  NDPEGGYYARGARAAVWENFK+NPLFDIST+HP T
Sbjct: 205 QAHLIPPSIPRSIFVYFRGLFYDYGNDPEGGYYARGARAAVWENFKDNPLFDISTEHPIT 264

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           YYEDMQRAIFCLCPLGWAPWSPRLVE V+FGCIPVIIADDIVLPFADAIPWEEIG+FVAE
Sbjct: 265 YYEDMQRAIFCLCPLGWAPWSPRLVEGVIFGCIPVIIADDIVLPFADAIPWEEIGVFVAE 324

Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHD 381
           +DVPKLDTILTSIPPEVIL+KQRLLA P+MK+AMLFPQPAQPGDAFHQILNGLARKLPHD
Sbjct: 325 KDVPKLDTILTSIPPEVILKKQRLLATPAMKQAMLFPQPAQPGDAFHQILNGLARKLPHD 384

Query: 382 KSTYLKPGGKFLNWTSGPVGDLKP 405
           +  YL PG K LNW+SGP GDLKP
Sbjct: 385 RRIYLNPGEKALNWSSGPPGDLKP 408


>gi|357163140|ref|XP_003579636.1| PREDICTED: probable glucuronosyltransferase Os04g0398600-like
           [Brachypodium distachyon]
          Length = 429

 Score =  707 bits (1824), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 332/385 (86%), Positives = 358/385 (92%)

Query: 22  ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP 81
           ERISGSAGDVLED+PVG+LKV++Y+LP KYNKK++ KDPRCL HMFAAEIFMHRFLLSS 
Sbjct: 45  ERISGSAGDVLEDNPVGRLKVFIYDLPRKYNKKMVAKDPRCLNHMFAAEIFMHRFLLSSA 104

Query: 82  VRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 141
           VRTLNP+EADWFY P+Y TCDLTP GLPLPFKSPR+MRSAIQ IS+ WP+WN+T+GADHF
Sbjct: 105 VRTLNPKEADWFYAPVYTTCDLTPAGLPLPFKSPRVMRSAIQYISNKWPFWNKTDGADHF 164

Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
           FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQ NHVCL EGSI IPP+APPQKM
Sbjct: 165 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQENHVCLKEGSIIIPPFAPPQKM 224

Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
           QAH IPPDTPRSIFVYFRGLFYD  NDPEGGYYARGARA++WENFKNNPLFDIST+HP T
Sbjct: 225 QAHLIPPDTPRSIFVYFRGLFYDTGNDPEGGYYARGARASLWENFKNNPLFDISTEHPAT 284

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           YYEDMQRA+FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG+F+ E
Sbjct: 285 YYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFIEE 344

Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHD 381
           +DVPKLDTILTS+P E ILRKQRLLANPSMK+AMLFPQPAQ  DAFHQILNGLARKLPH 
Sbjct: 345 KDVPKLDTILTSMPIEDILRKQRLLANPSMKQAMLFPQPAQARDAFHQILNGLARKLPHP 404

Query: 382 KSTYLKPGGKFLNWTSGPVGDLKPW 406
           +  YL PG K LNWT+GPVGDLKPW
Sbjct: 405 EGVYLPPGEKHLNWTAGPVGDLKPW 429


>gi|302783378|ref|XP_002973462.1| glycosyltransferase-like protein [Selaginella moellendorffii]
 gi|300159215|gb|EFJ25836.1| glycosyltransferase-like protein [Selaginella moellendorffii]
          Length = 437

 Score =  706 bits (1822), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 335/384 (87%), Positives = 353/384 (91%)

Query: 22  ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP 81
           ERISGSAGDVLEDDPVG+LKV+VYELP KYN+K+L KDPRCLTHMFA EIF+H FLL S 
Sbjct: 25  ERISGSAGDVLEDDPVGRLKVFVYELPVKYNRKVLEKDPRCLTHMFATEIFVHEFLLGSA 84

Query: 82  VRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 141
           VRTLNPEEADWFYTP+Y TCDLT  GLPLPFKSPRMMRS IQ IS+ WPYWNRTEGADHF
Sbjct: 85  VRTLNPEEADWFYTPVYTTCDLTRNGLPLPFKSPRMMRSVIQYISNQWPYWNRTEGADHF 144

Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
           FVVPHDFGACFHYQEEKAIERGIL LLQRATLVQTFGQR HVCL EGSI +PPY PPQKM
Sbjct: 145 FVVPHDFGACFHYQEEKAIERGILHLLQRATLVQTFGQRYHVCLKEGSIVVPPYCPPQKM 204

Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
           QAH IPP  PRSIFVYFRGLFYD  NDPEGGYYARGARAAVWENFK+NPLFDIST+HP T
Sbjct: 205 QAHLIPPSIPRSIFVYFRGLFYDYGNDPEGGYYARGARAAVWENFKDNPLFDISTEHPIT 264

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           YYEDMQRAIFCLCPLGWAPWSPRLVE V+FGCIPVIIADDIVLPFADAIPWEEIG+FVAE
Sbjct: 265 YYEDMQRAIFCLCPLGWAPWSPRLVEGVIFGCIPVIIADDIVLPFADAIPWEEIGVFVAE 324

Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHD 381
           +DVPKLDTILTSIPPEVIL+KQRLLA P+MK+AMLFPQPAQPGDAFHQILNGLARKLPHD
Sbjct: 325 KDVPKLDTILTSIPPEVILKKQRLLATPAMKQAMLFPQPAQPGDAFHQILNGLARKLPHD 384

Query: 382 KSTYLKPGGKFLNWTSGPVGDLKP 405
           +  YL PG K LNW+SGP GDLKP
Sbjct: 385 RRIYLNPGEKALNWSSGPPGDLKP 408


>gi|242072854|ref|XP_002446363.1| hypothetical protein SORBIDRAFT_06g014730 [Sorghum bicolor]
 gi|241937546|gb|EES10691.1| hypothetical protein SORBIDRAFT_06g014730 [Sorghum bicolor]
          Length = 430

 Score =  705 bits (1819), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 331/385 (85%), Positives = 360/385 (93%)

Query: 22  ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP 81
           ERISGSAGDVLED+PVG+LKV++Y+LP KYNKK++ KDPRCL+HMFAAEIFMHRFLLSS 
Sbjct: 46  ERISGSAGDVLEDNPVGRLKVFIYDLPRKYNKKMVTKDPRCLSHMFAAEIFMHRFLLSSA 105

Query: 82  VRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 141
           VRTLNP+EADWFYTP+Y TCDLT  GLPLPFKSPR+MRSAIQ IS+ WP+WNRT+GADHF
Sbjct: 106 VRTLNPKEADWFYTPVYTTCDLTNAGLPLPFKSPRVMRSAIQYISNKWPFWNRTDGADHF 165

Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
           FVVPHDF ACFHYQEEKAIERGILPLL+RATLVQTFGQ NHVCL EGSI IPP+APPQKM
Sbjct: 166 FVVPHDFAACFHYQEEKAIERGILPLLRRATLVQTFGQENHVCLKEGSIIIPPFAPPQKM 225

Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
           QAH I PDTPRSIFVYFRGLFYD  NDPEGGYYARGARA++WENFK+NPLFDISTDHP T
Sbjct: 226 QAHLISPDTPRSIFVYFRGLFYDTGNDPEGGYYARGARASLWENFKSNPLFDISTDHPAT 285

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           YYEDMQRA+FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG+FV E
Sbjct: 286 YYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVEE 345

Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHD 381
           +DVPKLDTILTS+P + ILRKQRLLANPSMK+AMLFPQPAQP DAFHQILNGLARKLPH 
Sbjct: 346 KDVPKLDTILTSMPIDDILRKQRLLANPSMKQAMLFPQPAQPRDAFHQILNGLARKLPHP 405

Query: 382 KSTYLKPGGKFLNWTSGPVGDLKPW 406
           + TYL+PG + LNWT+GPVGDLKPW
Sbjct: 406 EGTYLQPGDQHLNWTAGPVGDLKPW 430


>gi|357149222|ref|XP_003575040.1| PREDICTED: probable glucuronosyltransferase Os02g0520750-like
           [Brachypodium distachyon]
          Length = 428

 Score =  704 bits (1818), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 331/391 (84%), Positives = 359/391 (91%)

Query: 16  DNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHR 75
           D     ERISGSAGDVLED+PVGKLKV+VY+LPSKYNK ++ KDPRCL+HMFAAEIFMHR
Sbjct: 38  DAQRNTERISGSAGDVLEDNPVGKLKVFVYDLPSKYNKMIVTKDPRCLSHMFAAEIFMHR 97

Query: 76  FLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRT 135
           FL SS VRT+NPEEADWFYTP+Y TCDLT  GLPLPFKSPRMMRSAIQ IS+ WP+WNRT
Sbjct: 98  FLFSSAVRTVNPEEADWFYTPVYTTCDLTRAGLPLPFKSPRMMRSAIQFISNKWPFWNRT 157

Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
           +G DHFFVVPHDF ACFHYQEE AI RGILPLL+ ATLVQTFGQ+NHVCL +GSITIPPY
Sbjct: 158 DGGDHFFVVPHDFAACFHYQEENAIARGILPLLRHATLVQTFGQKNHVCLKDGSITIPPY 217

Query: 196 APPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDIS 255
           APPQKMQAH IPPDTPRSIFVYFRGLFYD  NDPEGGYYARGARA++WENFKNNPLFDIS
Sbjct: 218 APPQKMQAHLIPPDTPRSIFVYFRGLFYDNGNDPEGGYYARGARASLWENFKNNPLFDIS 277

Query: 256 TDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEI 315
           T+HP TYYEDMQR++FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW+EI
Sbjct: 278 TEHPATYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWDEI 337

Query: 316 GMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLA 375
           G+FV EEDVPKLD+ILTSIP + ILRKQRLLANPSMK+AMLFPQPAQP DAFHQILNGLA
Sbjct: 338 GVFVDEEDVPKLDSILTSIPIDDILRKQRLLANPSMKKAMLFPQPAQPRDAFHQILNGLA 397

Query: 376 RKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           RKLPH +S YL+PG K +NWT+GPVGDLKPW
Sbjct: 398 RKLPHPESVYLQPGEKHVNWTAGPVGDLKPW 428


>gi|226499262|ref|NP_001149267.1| secondary cell wall-related glycosyltransferase family 47 precursor
           [Zea mays]
 gi|195625922|gb|ACG34791.1| secondary cell wall-related glycosyltransferase family 47 [Zea
           mays]
          Length = 427

 Score =  702 bits (1812), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 330/385 (85%), Positives = 359/385 (93%)

Query: 22  ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP 81
           ERISGSAGDVLED+PVG+LKV++Y+LP KYNKK++ KD RCL+HMFAAEIFMHRFLLSS 
Sbjct: 43  ERISGSAGDVLEDNPVGRLKVFIYDLPRKYNKKMVTKDSRCLSHMFAAEIFMHRFLLSSA 102

Query: 82  VRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 141
           VRTLNP+EADWFYTP+Y TCDLT  GLPLPFKSPR+MRSAIQ IS+ WP+WNRT+GADHF
Sbjct: 103 VRTLNPKEADWFYTPVYTTCDLTNAGLPLPFKSPRVMRSAIQYISNKWPFWNRTDGADHF 162

Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
           FVVPHDF ACFHYQEEKAIERGILPLL+RATLVQTFGQ NHVCL EGSI IPPYAPPQKM
Sbjct: 163 FVVPHDFAACFHYQEEKAIERGILPLLRRATLVQTFGQENHVCLKEGSIIIPPYAPPQKM 222

Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
           QAH I PDTPRSIFVYFRGLFYD  NDPEGGYYARGARA++WENFK+NPLFDISTDHP T
Sbjct: 223 QAHLISPDTPRSIFVYFRGLFYDTGNDPEGGYYARGARASLWENFKSNPLFDISTDHPAT 282

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           YYEDMQRA+FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG+FV E
Sbjct: 283 YYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVEE 342

Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHD 381
           +DVPKLDTILTS+P + ILRKQRLLANPSMK+AMLFPQPAQP DAFHQILNGLARKLPH 
Sbjct: 343 KDVPKLDTILTSMPIDDILRKQRLLANPSMKQAMLFPQPAQPRDAFHQILNGLARKLPHP 402

Query: 382 KSTYLKPGGKFLNWTSGPVGDLKPW 406
           + T+L+PG + LNWT+GPVGDLKPW
Sbjct: 403 EGTFLQPGDQRLNWTAGPVGDLKPW 427


>gi|194700342|gb|ACF84255.1| unknown [Zea mays]
 gi|414587472|tpg|DAA38043.1| TPA: secondary cell wall glycosyltransferase family 47 [Zea mays]
          Length = 427

 Score =  702 bits (1811), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 330/385 (85%), Positives = 359/385 (93%)

Query: 22  ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP 81
           ERISGSAGDVLED+PVG+LKV++Y+LP KYNKK++ KD RCL+HMFAAEIFMHRFLLSS 
Sbjct: 43  ERISGSAGDVLEDNPVGRLKVFIYDLPRKYNKKMVTKDSRCLSHMFAAEIFMHRFLLSSA 102

Query: 82  VRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 141
           VRTLNP+EADWFYTP+Y TCDLT  GLPLPFKSPR+MRSAIQ IS+ WP+WNRT+GADHF
Sbjct: 103 VRTLNPKEADWFYTPVYTTCDLTNAGLPLPFKSPRVMRSAIQYISNKWPFWNRTDGADHF 162

Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
           FVVPHDF ACFHYQEEKAIERGILPLL+RATLVQTFGQ NHVCL EGSI IPPYAPPQKM
Sbjct: 163 FVVPHDFAACFHYQEEKAIERGILPLLRRATLVQTFGQENHVCLKEGSIIIPPYAPPQKM 222

Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
           QAH I PDTPRSIFVYFRGLFYD  NDPEGGYYARGARA++WENFK+NPLFDISTDHP T
Sbjct: 223 QAHLISPDTPRSIFVYFRGLFYDTGNDPEGGYYARGARASLWENFKSNPLFDISTDHPAT 282

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           YYEDMQRA+FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG+FV E
Sbjct: 283 YYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVEE 342

Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHD 381
           +DVPKLDTILTS+P + ILRKQRLLANPSMK+AMLFPQPAQP DAFHQILNGLARKLPH 
Sbjct: 343 KDVPKLDTILTSMPIDDILRKQRLLANPSMKQAMLFPQPAQPRDAFHQILNGLARKLPHP 402

Query: 382 KSTYLKPGGKFLNWTSGPVGDLKPW 406
           + T+L+PG + LNWT+GPVGDLKPW
Sbjct: 403 EGTFLQPGDQRLNWTAGPVGDLKPW 427


>gi|296085534|emb|CBI29266.3| unnamed protein product [Vitis vinifera]
          Length = 390

 Score =  702 bits (1811), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 344/408 (84%), Positives = 358/408 (87%), Gaps = 24/408 (5%)

Query: 1   MWVTAIALL--LAFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLK 58
           +WV     +  +     D  P  ERISGSAGDVLEDDPVG+LKV+VYELPSKYNKK+L K
Sbjct: 5   IWVLVFGFVWGIGAHGLDRHPHTERISGSAGDVLEDDPVGRLKVFVYELPSKYNKKILQK 64

Query: 59  DPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMM 118
           DPRCL HMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY TCDLTP GLPLPFKSPRMM
Sbjct: 65  DPRCLNHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYTTCDLTPNGLPLPFKSPRMM 124

Query: 119 RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG 178
           RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG
Sbjct: 125 RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG 184

Query: 179 QRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGA 238
           QRNHVCLNEGSITIPPYAPPQKMQAH IP +TPRSIFVYFRGLFYDVNNDPEGGYYARGA
Sbjct: 185 QRNHVCLNEGSITIPPYAPPQKMQAHLIPQETPRSIFVYFRGLFYDVNNDPEGGYYARGA 244

Query: 239 RAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 298
           RAAVWENFK+NPLFDIST+HPTTYYEDMQRAIFCLCPL                      
Sbjct: 245 RAAVWENFKDNPLFDISTEHPTTYYEDMQRAIFCLCPL---------------------- 282

Query: 299 ADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFP 358
           ADDIVLPFADAIPWEEIG+FVAEEDVP LDTILTSIPPEVILRKQRLLANPSMK+AMLFP
Sbjct: 283 ADDIVLPFADAIPWEEIGVFVAEEDVPNLDTILTSIPPEVILRKQRLLANPSMKQAMLFP 342

Query: 359 QPAQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           QPAQ GDAFHQILNGLARKLPH+K+ YLKPG   LNWT+GPVGDLKPW
Sbjct: 343 QPAQSGDAFHQILNGLARKLPHNKNVYLKPGENALNWTAGPVGDLKPW 390


>gi|413932967|gb|AFW67518.1| hypothetical protein ZEAMMB73_420245 [Zea mays]
          Length = 386

 Score =  701 bits (1808), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 329/384 (85%), Positives = 359/384 (93%), Gaps = 1/384 (0%)

Query: 23  RISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPV 82
           ++ GSAGDVLED+PVG+LKVYV++LPSKYNKKL+ KDPRCL HMFAAEIFMHR LLSS V
Sbjct: 4   KVEGSAGDVLEDNPVGRLKVYVHDLPSKYNKKLVKKDPRCLNHMFAAEIFMHRLLLSSAV 63

Query: 83  RTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFF 142
           RT NPEEADWFYTP+Y TCDLTP+GLPLPFKSPRMM SAI+LI++NWPYWNR+EGADHFF
Sbjct: 64  RTFNPEEADWFYTPVYATCDLTPSGLPLPFKSPRMMLSAIELIATNWPYWNRSEGADHFF 123

Query: 143 VVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQ 202
           V PHDFGACFHYQ+EKAI RGILPLLQ ATLVQTFGQ+NHVCL  GSITIPP+APPQKMQ
Sbjct: 124 VTPHDFGACFHYQDEKAIGRGILPLLQHATLVQTFGQKNHVCLKGGSITIPPFAPPQKMQ 183

Query: 203 AHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTY 262
           AH IP DTPRSIFVYFRGLFYD +NDPEGGYYARGARA+VWENFKNNPLFDISTDHP+TY
Sbjct: 184 AHLIPADTPRSIFVYFRGLFYDTSNDPEGGYYARGARASVWENFKNNPLFDISTDHPSTY 243

Query: 263 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
           YEDM+R++FCLCPLGWAPWSPRLVEAVVFGCIP+IIA DIVLPFADAIPWEEIG+FVAEE
Sbjct: 244 YEDMERSVFCLCPLGWAPWSPRLVEAVVFGCIPLIIA-DIVLPFADAIPWEEIGVFVAEE 302

Query: 323 DVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDK 382
           DVPKLD+ILTSIP +VILRKQRLLANPSMK+AMLFPQPAQ GDAFHQILNGLARKLPH  
Sbjct: 303 DVPKLDSILTSIPTDVILRKQRLLANPSMKQAMLFPQPAQAGDAFHQILNGLARKLPHGD 362

Query: 383 STYLKPGGKFLNWTSGPVGDLKPW 406
           S +LKPG + LNWT+GP GDLKPW
Sbjct: 363 SVFLKPGERVLNWTAGPPGDLKPW 386


>gi|242088705|ref|XP_002440185.1| hypothetical protein SORBIDRAFT_09g027450 [Sorghum bicolor]
 gi|241945470|gb|EES18615.1| hypothetical protein SORBIDRAFT_09g027450 [Sorghum bicolor]
          Length = 416

 Score =  681 bits (1758), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 324/397 (81%), Positives = 352/397 (88%), Gaps = 3/397 (0%)

Query: 10  LAFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAA 69
           +A V+A  T   ERI GSAGDVLEDDPVG+LKVYVYELP KYNK +L KD RCL+HMFA 
Sbjct: 23  VAVVAAQET---ERIQGSAGDVLEDDPVGRLKVYVYELPPKYNKNILAKDSRCLSHMFAT 79

Query: 70  EIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNW 129
           EIFMHRFLL+S VRTLNP+EADWFYTP+Y TCDLTP G PL  KSPRMMRSAIQ IS  W
Sbjct: 80  EIFMHRFLLTSAVRTLNPDEADWFYTPVYTTCDLTPWGHPLTTKSPRMMRSAIQYISKRW 139

Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGS 189
           PYWNRTEGADHFFV PHDFGACF++QEE AI+RG+LP+L+RATLVQTFGQ++HVCL EGS
Sbjct: 140 PYWNRTEGADHFFVTPHDFGACFYFQEETAIQRGVLPVLRRATLVQTFGQKHHVCLKEGS 199

Query: 190 ITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN 249
           ITIPPYAPP K++ H +PP+TPRSIFVYFRGLFYD  NDPEGGYYARGARA+VWENFKNN
Sbjct: 200 ITIPPYAPPHKIRTHIVPPETPRSIFVYFRGLFYDTANDPEGGYYARGARASVWENFKNN 259

Query: 250 PLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 309
            LFDIST+HP TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA
Sbjct: 260 ALFDISTEHPPTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 319

Query: 310 IPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQ 369
           IPWEEI +FVAE+DV KLDTILTSIP E ILRKQRLLANPSMK+AMLFPQPA+P DAFHQ
Sbjct: 320 IPWEEIAVFVAEDDVLKLDTILTSIPMEEILRKQRLLANPSMKQAMLFPQPAEPRDAFHQ 379

Query: 370 ILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           +LNGLARKLPH K  +LKPG K LNWT G   DLKPW
Sbjct: 380 VLNGLARKLPHGKGVFLKPGQKVLNWTEGTREDLKPW 416


>gi|363543513|ref|NP_001241766.1| secondary cell wall-related glycosyltransferase family 47 precursor
           [Zea mays]
 gi|195634597|gb|ACG36767.1| secondary cell wall-related glycosyltransferase family 47 [Zea
           mays]
          Length = 417

 Score =  680 bits (1755), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 322/397 (81%), Positives = 354/397 (89%), Gaps = 3/397 (0%)

Query: 10  LAFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAA 69
           +A V+A +   IERI GSAGDVLEDDPVG+LKVYVYELP KYNK ++ KD RCL+HMFA 
Sbjct: 24  VAVVTAQD---IERIQGSAGDVLEDDPVGRLKVYVYELPPKYNKNVVAKDSRCLSHMFAT 80

Query: 70  EIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNW 129
           EIFMHRFLL+S VRTLNP+EADWFYTP+Y TCDLTP G PL  KSPRMMRSAIQ +S  W
Sbjct: 81  EIFMHRFLLASAVRTLNPDEADWFYTPVYTTCDLTPWGHPLTVKSPRMMRSAIQYVSKRW 140

Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGS 189
           PYWNRTEGADHFFV PHDFGACF++QEEKAI+RG+LP+L+RATLVQTFGQ+NHVCL EGS
Sbjct: 141 PYWNRTEGADHFFVTPHDFGACFYFQEEKAIQRGVLPVLRRATLVQTFGQKNHVCLREGS 200

Query: 190 ITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN 249
           ITIPPYAPP K++AH +PP+TPRSIFVYFRGLFYD  NDPEGGYYARGARA+VWENFKNN
Sbjct: 201 ITIPPYAPPHKIRAHIVPPETPRSIFVYFRGLFYDTANDPEGGYYARGARASVWENFKNN 260

Query: 250 PLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 309
            LFDIST+HP TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA
Sbjct: 261 ALFDISTEHPPTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 320

Query: 310 IPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQ 369
           IPWEEI +FV E+DV +LDTILTSIP + ILRKQRLLANPSMK+AMLFPQPA+P DAFHQ
Sbjct: 321 IPWEEIAVFVPEDDVLRLDTILTSIPMDEILRKQRLLANPSMKQAMLFPQPAEPRDAFHQ 380

Query: 370 ILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           +LNGLARKLPH K  +LKPG K LNWT G   DLKPW
Sbjct: 381 VLNGLARKLPHAKDVFLKPGQKVLNWTEGTQDDLKPW 417


>gi|226529361|ref|NP_001147714.1| LOC100281324 precursor [Zea mays]
 gi|195613258|gb|ACG28459.1| secondary cell wall-related glycosyltransferase family 47 [Zea
           mays]
          Length = 418

 Score =  680 bits (1754), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 322/397 (81%), Positives = 354/397 (89%), Gaps = 3/397 (0%)

Query: 10  LAFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAA 69
           +A V+A +   IERI GSAGDVLEDDPVG+LKVYVYELP KYNK ++ KD RCL+HMFA 
Sbjct: 25  VAVVTAQD---IERIQGSAGDVLEDDPVGRLKVYVYELPPKYNKNVVAKDSRCLSHMFAT 81

Query: 70  EIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNW 129
           EIFMHRFLL+S VRTLNP+EADWFYTP+Y TCDLTP G PL  KSPRMMRSAIQ +S  W
Sbjct: 82  EIFMHRFLLASAVRTLNPDEADWFYTPVYTTCDLTPWGHPLTVKSPRMMRSAIQYVSKRW 141

Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGS 189
           PYWNRTEGADHFFV PHDFGACF++QEEKAI+RG+LP+L+RATLVQTFGQ+NHVCL EGS
Sbjct: 142 PYWNRTEGADHFFVTPHDFGACFYFQEEKAIQRGVLPVLRRATLVQTFGQKNHVCLREGS 201

Query: 190 ITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN 249
           ITIPPYAPP K++AH +PP+TPRSIFVYFRGLFYD  NDPEGGYYARGARA+VWENFKNN
Sbjct: 202 ITIPPYAPPHKIRAHIVPPETPRSIFVYFRGLFYDTANDPEGGYYARGARASVWENFKNN 261

Query: 250 PLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 309
            LFDIST+HP TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA
Sbjct: 262 ALFDISTEHPPTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 321

Query: 310 IPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQ 369
           IPWEEI +FV E+DV +LDTILTSIP + ILRKQRLLANPSMK+AMLFPQPA+P DAFHQ
Sbjct: 322 IPWEEIAVFVPEDDVLRLDTILTSIPMDEILRKQRLLANPSMKQAMLFPQPAEPRDAFHQ 381

Query: 370 ILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           +LNGLARKLPH K  +LKPG K LNWT G   DLKPW
Sbjct: 382 VLNGLARKLPHAKDVFLKPGQKVLNWTEGTQDDLKPW 418


>gi|194706890|gb|ACF87529.1| unknown [Zea mays]
          Length = 418

 Score =  679 bits (1753), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 322/397 (81%), Positives = 354/397 (89%), Gaps = 3/397 (0%)

Query: 10  LAFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAA 69
           +A V+A +   IERI GSAGDVLEDDPVG+LKVYVYELP KYNK ++ KD RCL+HMFA 
Sbjct: 25  VAVVTAQD---IERIQGSAGDVLEDDPVGRLKVYVYELPPKYNKNVVAKDSRCLSHMFAT 81

Query: 70  EIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNW 129
           EIFMHRFLL+S VRTLNP+EADWFYTP+Y TCDLTP G PL  KSPRMMRSAIQ +S  W
Sbjct: 82  EIFMHRFLLASAVRTLNPDEADWFYTPVYTTCDLTPWGHPLTVKSPRMMRSAIQYVSKRW 141

Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGS 189
           PYWNRTEGADHFFV PHDFGACF++QEEKAI+RG+LP+L+RATLVQTFGQ+NHVCL EGS
Sbjct: 142 PYWNRTEGADHFFVTPHDFGACFYFQEEKAIQRGVLPVLRRATLVQTFGQKNHVCLREGS 201

Query: 190 ITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN 249
           ITIPPYAPP K++AH +PP+TPRSIFVYFRGLFYD  NDPEGGYYARGARA+VWENFKNN
Sbjct: 202 ITIPPYAPPHKIRAHIVPPETPRSIFVYFRGLFYDTANDPEGGYYARGARASVWENFKNN 261

Query: 250 PLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 309
            LFDIST+HP TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA
Sbjct: 262 ALFDISTEHPPTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 321

Query: 310 IPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQ 369
           IPWEEI +FV E+DV +LDTILTSIP + ILRKQRLLANPSMK+AMLFPQPA+P DAFHQ
Sbjct: 322 IPWEEIAVFVPEDDVLRLDTILTSIPMDEILRKQRLLANPSMKQAMLFPQPAEPRDAFHQ 381

Query: 370 ILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           +LNGLARKLPH K  +LKPG K LNWT G   DLKPW
Sbjct: 382 VLNGLARKLPHPKDVFLKPGQKVLNWTEGAQDDLKPW 418


>gi|449469226|ref|XP_004152322.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10-like [Cucumis
           sativus]
          Length = 388

 Score =  679 bits (1753), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 331/406 (81%), Positives = 349/406 (85%), Gaps = 29/406 (7%)

Query: 1   MWVTAIALLLAFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDP 60
           +WV AI L+    +  N    ERISGSAGDVLED+PVG+LKVYVY+LPSKYNKK L KDP
Sbjct: 12  LWVFAIFLVSRIAAVPNAVPTERISGSAGDVLEDNPVGRLKVYVYDLPSKYNKKTLQKDP 71

Query: 61  RCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRS 120
           RCLTHMFAAEI+MHRFLL+SPVRTLNP+EADWFYTPIY TCDLTP GLPLPFKSPRMMRS
Sbjct: 72  RCLTHMFAAEIYMHRFLLNSPVRTLNPDEADWFYTPIYVTCDLTPNGLPLPFKSPRMMRS 131

Query: 121 AIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR 180
           AIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAI+RGI                
Sbjct: 132 AIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIDRGI---------------- 175

Query: 181 NHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARA 240
                        PY PPQKM+ H IP +TPRSIFVYFRGLFYDVNNDPEGGYYARGARA
Sbjct: 176 -------------PYCPPQKMKTHLIPSETPRSIFVYFRGLFYDVNNDPEGGYYARGARA 222

Query: 241 AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 300
           AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD
Sbjct: 223 AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 282

Query: 301 DIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQP 360
           DIVLPFADAIPWEEIG+FV E+DV  LDTILTSIPP+VILRKQRLLANPSMKRAM+FPQP
Sbjct: 283 DIVLPFADAIPWEEIGVFVDEKDVSNLDTILTSIPPDVILRKQRLLANPSMKRAMMFPQP 342

Query: 361 AQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           AQ GDAFHQILNGLARKLPHDK  YLKPG + LNWT+GPVGDLKPW
Sbjct: 343 AQSGDAFHQILNGLARKLPHDKGVYLKPGERILNWTAGPVGDLKPW 388


>gi|413946367|gb|AFW79016.1| secondary cell wall glycosyltransferase family 47 [Zea mays]
          Length = 418

 Score =  679 bits (1752), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 322/397 (81%), Positives = 354/397 (89%), Gaps = 3/397 (0%)

Query: 10  LAFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAA 69
           +A V+A +   IERI GSAGDVLEDDPVG+LKVYVYELP KYNK ++ KD RCL+HMFA 
Sbjct: 25  VAVVTAQD---IERIQGSAGDVLEDDPVGRLKVYVYELPPKYNKNVVAKDSRCLSHMFAT 81

Query: 70  EIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNW 129
           EIFMHRFLL+S VRTLNP+EADWFYTP+Y TCDLTP G PL  KSPRMMRSAIQ +S  W
Sbjct: 82  EIFMHRFLLASAVRTLNPDEADWFYTPVYTTCDLTPWGHPLTVKSPRMMRSAIQYVSKRW 141

Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGS 189
           PYWNRTEGADHFFV PHDFGACF++QEEKAI+RG+LP+L+RATLVQTFGQ+NHVCL EGS
Sbjct: 142 PYWNRTEGADHFFVTPHDFGACFYFQEEKAIQRGVLPVLRRATLVQTFGQKNHVCLREGS 201

Query: 190 ITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN 249
           ITIPPYAPP K++AH +PP+TPRSIFVYFRGLFYD  NDPEGGYYARGARA+VWENFKNN
Sbjct: 202 ITIPPYAPPHKIRAHIVPPETPRSIFVYFRGLFYDTANDPEGGYYARGARASVWENFKNN 261

Query: 250 PLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 309
            LFDIST+HP TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA
Sbjct: 262 ALFDISTEHPPTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 321

Query: 310 IPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQ 369
           IPWEEI +FV E+DV +LDTILTSIP + ILRKQRLLANPSMK+AMLFPQPA+P DAFHQ
Sbjct: 322 IPWEEIAVFVPEDDVLRLDTILTSIPMDEILRKQRLLANPSMKQAMLFPQPAEPRDAFHQ 381

Query: 370 ILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           +LNGLARKLPH K  +LKPG K LNWT G   DLKPW
Sbjct: 382 VLNGLARKLPHPKDVFLKPGQKVLNWTEGAQDDLKPW 418


>gi|242090769|ref|XP_002441217.1| hypothetical protein SORBIDRAFT_09g022500 [Sorghum bicolor]
 gi|241946502|gb|EES19647.1| hypothetical protein SORBIDRAFT_09g022500 [Sorghum bicolor]
          Length = 422

 Score =  671 bits (1731), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 315/387 (81%), Positives = 350/387 (90%), Gaps = 2/387 (0%)

Query: 22  ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP 81
           ERISGS GDVLEDDPVGKLKV+VY+LP+KYN K + KDPRCLTHMFA EIF+HR LLSS 
Sbjct: 36  ERISGSGGDVLEDDPVGKLKVFVYDLPAKYNTKPVEKDPRCLTHMFATEIFVHRSLLSSA 95

Query: 82  VRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 141
           VRTL+PEEADWFYTP+Y TCDLT +G P+PF SPRMMRSAI+LI+  WPYWNR+EGADHF
Sbjct: 96  VRTLDPEEADWFYTPVYTTCDLTASGHPMPFDSPRMMRSAIRLIADRWPYWNRSEGADHF 155

Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG--SITIPPYAPPQ 199
           FV PHDFGACFH+QEEKA+ RGILP+L+RATLVQTFGQRNHVCL +G  SITIPPYAPP 
Sbjct: 156 FVTPHDFGACFHFQEEKAMARGILPVLRRATLVQTFGQRNHVCLKDGGGSITIPPYAPPW 215

Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
           KM+A  +PP TPRSIFVYFRGLFYD  NDPEGGYYARGARA+VWENFK+NPLFDIST HP
Sbjct: 216 KMEAQLLPPATPRSIFVYFRGLFYDAGNDPEGGYYARGARASVWENFKSNPLFDISTAHP 275

Query: 260 TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV 319
           TTYY+DMQRA+FCLCPLGWAPWSPRLVEAVVFGCIPV+IADDIVLPFADAIPW +IG+FV
Sbjct: 276 TTYYQDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVVIADDIVLPFADAIPWADIGVFV 335

Query: 320 AEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
           AE+DVP+LDTILTSIP EV+LRKQRLLA+P+MKRA+LFPQPAQPGDAFHQILNGLARKLP
Sbjct: 336 AEDDVPRLDTILTSIPVEVVLRKQRLLASPAMKRAVLFPQPAQPGDAFHQILNGLARKLP 395

Query: 380 HDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           H    +L+ G   LNWT+GP  DLKPW
Sbjct: 396 HGDDAFLRNGQTVLNWTAGPPRDLKPW 422


>gi|23821292|dbj|BAC20928.1| pectin-glucuronyltransferase [Nicotiana plumbaginifolia]
          Length = 341

 Score =  666 bits (1718), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 314/341 (92%), Positives = 328/341 (96%)

Query: 66  MFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLI 125
           MFAAEI+MHRFLLSS VRT NPEEADWFYTP+Y TCDLTP GLPLPFKSPRMMRSAIQLI
Sbjct: 1   MFAAEIYMHRFLLSSAVRTFNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLI 60

Query: 126 SSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCL 185
           +SNWPYWNRTEGADHFF+ PHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCL
Sbjct: 61  ASNWPYWNRTEGADHFFITPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCL 120

Query: 186 NEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWEN 245
            +GSITIPPYAPPQKMQ+H IPPDTPRSIFVYFRGLFYDV NDPEGGYYARGARAAVWEN
Sbjct: 121 KDGSITIPPYAPPQKMQSHLIPPDTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWEN 180

Query: 246 FKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
           FK+NPLFDIST+HPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAV+FGCIPVIIADDIVLP
Sbjct: 181 FKDNPLFDISTEHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLP 240

Query: 306 FADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
           FADAIPWE+IG+FVAE+DVP LDTILTSIPPE ILRKQRLLANPSMK+AMLFPQPAQ GD
Sbjct: 241 FADAIPWEDIGVFVAEKDVPNLDTILTSIPPEEILRKQRLLANPSMKQAMLFPQPAQSGD 300

Query: 366 AFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           AFHQILNGLARKLPHDKSTYLKPG  FLNWT+GPVGDLKPW
Sbjct: 301 AFHQILNGLARKLPHDKSTYLKPGENFLNWTAGPVGDLKPW 341


>gi|168000965|ref|XP_001753186.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162695885|gb|EDQ82227.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 449

 Score =  664 bits (1714), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 316/408 (77%), Positives = 354/408 (86%), Gaps = 4/408 (0%)

Query: 3   VTAIALLLAFV--SADNTPKI--ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLK 58
           + ++ LLL F+    D T ++  +RISGSAGDVLED+PVG+LKV++Y++PSKYN   L K
Sbjct: 38  LRSVVLLLLFIYFVQDVTAEMGHQRISGSAGDVLEDNPVGRLKVFIYDIPSKYNTDWLKK 97

Query: 59  DPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMM 118
           DPRCLTHMFA E ++H FL  SPVRTLNPEEADWFYTP+Y TCDLTP GLPLPFKSPR+M
Sbjct: 98  DPRCLTHMFAVEEYLHDFLTESPVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRVM 157

Query: 119 RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG 178
           RSAI  ISS+WPYWNRT+GADHFFVVPHDF ACFHYQEEKAIERGILPLL+RATL+QTFG
Sbjct: 158 RSAISYISSHWPYWNRTDGADHFFVVPHDFAACFHYQEEKAIERGILPLLKRATLIQTFG 217

Query: 179 QRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGA 238
           Q +HVCL E SI IPPYAPP++MQ    PP TPRSIF YFRGLFYD  NDPEGGYYARGA
Sbjct: 218 QNHHVCLKEDSIVIPPYAPPERMQTRLNPPSTPRSIFAYFRGLFYDPGNDPEGGYYARGA 277

Query: 239 RAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 298
           RAA+WENFK+NPLFDIST+HP TYYEDMQRAIFCLCPLGWAPWSPRLVE V+FGCIPVII
Sbjct: 278 RAAIWENFKDNPLFDISTEHPATYYEDMQRAIFCLCPLGWAPWSPRLVEGVIFGCIPVII 337

Query: 299 ADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFP 358
           ADDIVLPFADAIPWE+IG+FV E+DVP LD IL +I  E +L KQRLLANP+MK+AMLFP
Sbjct: 338 ADDIVLPFADAIPWEKIGVFVEEKDVPILDKILCTINHEEVLEKQRLLANPAMKQAMLFP 397

Query: 359 QPAQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           +PA+PGDAFHQILNGLARKLPHD S YL+PG  FLNWT GP GDL PW
Sbjct: 398 RPAKPGDAFHQILNGLARKLPHDPSIYLQPGQSFLNWTEGPPGDLYPW 445


>gi|326490197|dbj|BAJ94172.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326526653|dbj|BAK00715.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 422

 Score =  661 bits (1706), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 308/384 (80%), Positives = 345/384 (89%)

Query: 23  RISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPV 82
           RISG+AGDVLEDDPVGKLKV+VYE+P KYN  LL KD RCL HMFAAEIFMH+FLL+S V
Sbjct: 39  RISGNAGDVLEDDPVGKLKVFVYEMPRKYNHYLLEKDNRCLYHMFAAEIFMHQFLLASAV 98

Query: 83  RTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFF 142
           RT NPEEADWFYTP+Y TCDLT  G PLPF++PR+MRSAIQ I++ WPYWNRTEGADHFF
Sbjct: 99  RTKNPEEADWFYTPVYVTCDLTQQGFPLPFRAPRIMRSAIQYIAATWPYWNRTEGADHFF 158

Query: 143 VVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQ 202
           + PHDFGACFHYQEE+AIERGILPLL+RATLVQTFGQRNHVC+ EGSITIPPYA PQKMQ
Sbjct: 159 LAPHDFGACFHYQEERAIERGILPLLRRATLVQTFGQRNHVCMQEGSITIPPYANPQKMQ 218

Query: 203 AHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTY 262
           AH I P TPRSIF YFRGLFYD+ NDPEGGYYARGARA+VWENFK+NPLFD+S++HP+TY
Sbjct: 219 AHLISPGTPRSIFAYFRGLFYDMGNDPEGGYYARGARASVWENFKDNPLFDMSSEHPSTY 278

Query: 263 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
           YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE+I +FVAE 
Sbjct: 279 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEQISVFVAEA 338

Query: 323 DVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDK 382
           DVP+LD+IL S+ PE +LRKQRLLA+P+MK+A+LF QPA+PGDAF Q+LNGLARKLPH  
Sbjct: 339 DVPRLDSILASVAPEDVLRKQRLLASPAMKQAVLFHQPARPGDAFDQVLNGLARKLPHPD 398

Query: 383 STYLKPGGKFLNWTSGPVGDLKPW 406
             +LKPG K L+W +G   DLKPW
Sbjct: 399 GAFLKPGQKVLDWNAGLDSDLKPW 422


>gi|357126594|ref|XP_003564972.1| PREDICTED: probable glucuronosyltransferase Os01g0926400-like
           [Brachypodium distachyon]
          Length = 423

 Score =  660 bits (1704), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 303/384 (78%), Positives = 347/384 (90%)

Query: 23  RISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPV 82
           RISG+AGDVLED+PVGKLKV+VYE+P KYN+ LL KD RCL HMFAAEIFMH+FLL+S V
Sbjct: 40  RISGNAGDVLEDNPVGKLKVFVYEMPRKYNQYLLEKDNRCLYHMFAAEIFMHQFLLASAV 99

Query: 83  RTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFF 142
           RT++PEEADWFYTP+Y TCDLT  G PLPF++PRMMRSA+Q I++ WPYWNRTEGADHFF
Sbjct: 100 RTMDPEEADWFYTPVYTTCDLTQQGFPLPFRAPRMMRSAVQYIAATWPYWNRTEGADHFF 159

Query: 143 VVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQ 202
           + PHDFGACFHYQEE+AIERGILP+L+RATLVQTFGQRNHVC+ EGSITIPPYA PQKMQ
Sbjct: 160 LAPHDFGACFHYQEERAIERGILPVLRRATLVQTFGQRNHVCMQEGSITIPPYANPQKMQ 219

Query: 203 AHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTY 262
           AH I P TPRSIF YFRGLFYD+ NDPEGGYYARGARA+VWENFK+NPLFD+ST+HP+TY
Sbjct: 220 AHLISPGTPRSIFAYFRGLFYDMGNDPEGGYYARGARASVWENFKDNPLFDMSTEHPSTY 279

Query: 263 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
           YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW++I +FV E 
Sbjct: 280 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWDQISVFVPEA 339

Query: 323 DVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDK 382
           DVP+LD+IL S+PPE +LRKQRLLA+P+MK+A+LF QPAQP DAF Q+LNGLARKLPH +
Sbjct: 340 DVPRLDSILASVPPEDVLRKQRLLASPAMKQAVLFHQPAQPRDAFDQVLNGLARKLPHRE 399

Query: 383 STYLKPGGKFLNWTSGPVGDLKPW 406
             +L+PG K ++W +G   DLKPW
Sbjct: 400 GVFLRPGQKVIDWNAGQYDDLKPW 423


>gi|115441965|ref|NP_001045262.1| Os01g0926400 [Oryza sativa Japonica Group]
 gi|75159223|sp|Q8S1X9.1|GT13_ORYSJ RecName: Full=Probable glucuronosyltransferase Os01g0926400;
           AltName: Full=OsGT47D
 gi|20160726|dbj|BAB89668.1| P0482D04.15 [Oryza sativa Japonica Group]
 gi|20805224|dbj|BAB92891.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
 gi|113534793|dbj|BAF07176.1| Os01g0926400 [Oryza sativa Japonica Group]
 gi|215686656|dbj|BAG88909.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215707279|dbj|BAG93739.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 422

 Score =  657 bits (1696), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 305/394 (77%), Positives = 348/394 (88%)

Query: 13  VSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIF 72
            +A  T K  R+SG  GDVLEDDP GKLKV+VYE+P KYN  LL KD RCL HMFAAEIF
Sbjct: 29  AAATTTQKHGRMSGKGGDVLEDDPTGKLKVFVYEMPRKYNLNLLAKDSRCLQHMFAAEIF 88

Query: 73  MHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYW 132
           MH+FLLSSPVRTL+PEEADWFYTP Y TCDLTP G PLPF++PR+MRSA++ +++ WPYW
Sbjct: 89  MHQFLLSSPVRTLDPEEADWFYTPAYTTCDLTPQGFPLPFRAPRIMRSAVRYVAATWPYW 148

Query: 133 NRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITI 192
           NRT+GADHFF+ PHDFGACFHYQEE+AIERGILP+L+RATLVQTFGQR+H CL  GSIT+
Sbjct: 149 NRTDGADHFFLAPHDFGACFHYQEERAIERGILPVLRRATLVQTFGQRHHPCLQPGSITV 208

Query: 193 PPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLF 252
           PPYA P+KM+AH+I P TPRSIFVYFRGLFYD+ NDPEGGYYARGARA+VWENFK+NPLF
Sbjct: 209 PPYADPRKMEAHRISPATPRSIFVYFRGLFYDMGNDPEGGYYARGARASVWENFKDNPLF 268

Query: 253 DISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 312
           DIST+HP TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW
Sbjct: 269 DISTEHPATYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 328

Query: 313 EEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILN 372
            EI +FVAEEDVP+LDTIL S+P + ++RKQRLLA+P+MK+A+LF QPA+PGDAFHQILN
Sbjct: 329 GEISVFVAEEDVPRLDTILASVPLDEVIRKQRLLASPAMKQAVLFHQPARPGDAFHQILN 388

Query: 373 GLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           GLARKLPH K  +L+PG K ++W  G   DLKPW
Sbjct: 389 GLARKLPHPKGVFLEPGEKGIDWDQGLENDLKPW 422


>gi|21671946|gb|AAM74308.1|AC083944_26 Unknown protein similar to exostosin-2 [Oryza sativa Japonica
           Group]
 gi|110288700|gb|ABB46946.2| exostosin family protein, putative, expressed [Oryza sativa
           Japonica Group]
          Length = 401

 Score =  656 bits (1693), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 310/406 (76%), Positives = 346/406 (85%), Gaps = 12/406 (2%)

Query: 6   IALLLAF-----VSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDP 60
           +A+LL F     V+A     IERI        EDDPVGKLKVYVYELP KYNK ++ KD 
Sbjct: 3   LAVLLLFACSSGVAAAAAQGIERIK-------EDDPVGKLKVYVYELPPKYNKNIVAKDS 55

Query: 61  RCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRS 120
           RCL+HMFA EIFMHRFLLSS +RT NP+EADWFYTP+Y TCDLTP G PL  KSPRMMRS
Sbjct: 56  RCLSHMFATEIFMHRFLLSSAIRTSNPDEADWFYTPVYTTCDLTPWGHPLTTKSPRMMRS 115

Query: 121 AIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR 180
           AI+ IS  WPYWNRTEGADHFFVVPHDF ACF++QE KAIERGILP+L+RATLVQTFGQ+
Sbjct: 116 AIKFISKYWPYWNRTEGADHFFVVPHDFAACFYFQEAKAIERGILPVLRRATLVQTFGQK 175

Query: 181 NHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARA 240
           NH CL +GSIT+PPY P  K++AH +PP+TPRSIFVYFRGLFYD +NDPEGGYYARGARA
Sbjct: 176 NHACLKDGSITVPPYTPAHKIRAHLVPPETPRSIFVYFRGLFYDTSNDPEGGYYARGARA 235

Query: 241 AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 300
           +VWENFKNNP+FDISTDHP TYYEDMQRA+FCLCPLGWAPWSPRLVEAVVFGCIPVIIAD
Sbjct: 236 SVWENFKNNPMFDISTDHPQTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 295

Query: 301 DIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQP 360
           DIVLPF+DAIPWEEI +FVAE+DVP+LDTILTSIP EVILRKQ +LA PSMK+ MLFPQP
Sbjct: 296 DIVLPFSDAIPWEEIAVFVAEDDVPQLDTILTSIPTEVILRKQAMLAEPSMKQTMLFPQP 355

Query: 361 AQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           A+PGD FHQ++N LARKLPH +  +LKPG K LNWT G   DLKPW
Sbjct: 356 AEPGDGFHQVMNALARKLPHGRDVFLKPGQKVLNWTEGTREDLKPW 401


>gi|125528940|gb|EAY77054.1| hypothetical protein OsI_05011 [Oryza sativa Indica Group]
          Length = 422

 Score =  655 bits (1690), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 304/394 (77%), Positives = 347/394 (88%)

Query: 13  VSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIF 72
            +A  T K   +SG  GDVLEDDP GKLKV+VYE+P KYN  LL KD RCL HMFAAEIF
Sbjct: 29  AAATTTQKHGSMSGKGGDVLEDDPTGKLKVFVYEMPRKYNLNLLAKDSRCLQHMFAAEIF 88

Query: 73  MHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYW 132
           MH+FLLSSPVRTL+PEEADWFYTP Y TCDLTP G PLPF++PR+MRSA++ +++ WPYW
Sbjct: 89  MHQFLLSSPVRTLDPEEADWFYTPAYTTCDLTPQGFPLPFRAPRIMRSAVRYVAATWPYW 148

Query: 133 NRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITI 192
           NRT+GADHFF+ PHDFGACFHYQEE+AIERGILP+L+RATLVQTFGQR+H CL  GSIT+
Sbjct: 149 NRTDGADHFFLAPHDFGACFHYQEERAIERGILPVLRRATLVQTFGQRHHPCLQPGSITV 208

Query: 193 PPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLF 252
           PPYA P+KM+AH+I P TPRSIFVYFRGLFYD+ NDPEGGYYARGARA+VWENFK+NPLF
Sbjct: 209 PPYADPRKMEAHRISPATPRSIFVYFRGLFYDMGNDPEGGYYARGARASVWENFKDNPLF 268

Query: 253 DISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 312
           DIST+HP TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW
Sbjct: 269 DISTEHPATYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 328

Query: 313 EEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILN 372
            EI +FVAEEDVP+LDTIL S+P + ++RKQRLLA+P+MK+A+LF QPA+PGDAFHQILN
Sbjct: 329 GEISVFVAEEDVPRLDTILASVPLDEVIRKQRLLASPAMKQAVLFHQPARPGDAFHQILN 388

Query: 373 GLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           GLARKLPH K  +L+PG K ++W  G   DLKPW
Sbjct: 389 GLARKLPHPKGVFLEPGEKGIDWDQGLENDLKPW 422


>gi|218184216|gb|EEC66643.1| hypothetical protein OsI_32904 [Oryza sativa Indica Group]
 gi|222612524|gb|EEE50656.1| hypothetical protein OsJ_30886 [Oryza sativa Japonica Group]
          Length = 400

 Score =  655 bits (1689), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 309/406 (76%), Positives = 346/406 (85%), Gaps = 13/406 (3%)

Query: 6   IALLLAF-----VSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDP 60
           +A+LL F     V+A     IERI        +DDPVGKLKVYVYELP KYNK ++ KD 
Sbjct: 3   LAVLLLFACSSGVAAAAAQGIERI--------KDDPVGKLKVYVYELPPKYNKNIVAKDS 54

Query: 61  RCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRS 120
           RCL+HMFA EIFMHRFLLSS +RT NP+EADWFYTP+Y TCDLTP G PL  KSPRMMRS
Sbjct: 55  RCLSHMFATEIFMHRFLLSSAIRTSNPDEADWFYTPVYTTCDLTPWGHPLTTKSPRMMRS 114

Query: 121 AIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR 180
           AI+ IS  WPYWNRTEGADHFFVVPHDF ACF++QE KAIERGILP+L+RATLVQTFGQ+
Sbjct: 115 AIKFISKYWPYWNRTEGADHFFVVPHDFAACFYFQEAKAIERGILPVLRRATLVQTFGQK 174

Query: 181 NHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARA 240
           NH CL +GSIT+PPY P  K++AH +PP+TPRSIFVYFRGLFYD +NDPEGGYYARGARA
Sbjct: 175 NHACLKDGSITVPPYTPAHKIRAHLVPPETPRSIFVYFRGLFYDTSNDPEGGYYARGARA 234

Query: 241 AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 300
           +VWENFKNNP+FDISTDHP TYYEDMQRA+FCLCPLGWAPWSPRLVEAVVFGCIPVIIAD
Sbjct: 235 SVWENFKNNPMFDISTDHPQTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 294

Query: 301 DIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQP 360
           DIVLPF+DAIPWEEI +FVAE+DVP+LDTILTSIP EVILRKQ +LA PSMK+ MLFPQP
Sbjct: 295 DIVLPFSDAIPWEEIAVFVAEDDVPQLDTILTSIPTEVILRKQAMLAEPSMKQTMLFPQP 354

Query: 361 AQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           A+PGD FHQ++N LARKLPH +  +LKPG K LNWT G   DLKPW
Sbjct: 355 AEPGDGFHQVMNALARKLPHGRDVFLKPGQKVLNWTEGTREDLKPW 400


>gi|10176877|dbj|BAB10084.1| unnamed protein product [Arabidopsis thaliana]
 gi|23821294|dbj|BAC20929.1| NpGUT1 homolog [Arabidopsis thaliana]
          Length = 341

 Score =  654 bits (1687), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 308/341 (90%), Positives = 325/341 (95%)

Query: 66  MFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLI 125
           MFAAEI+M RFLLSSPVRTLNPEEADWFY P+Y TCDLTP GLPLPFKSPRMMRSAIQLI
Sbjct: 1   MFAAEIYMQRFLLSSPVRTLNPEEADWFYVPVYTTCDLTPNGLPLPFKSPRMMRSAIQLI 60

Query: 126 SSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCL 185
           +SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAI RGILPLLQRATLVQTFGQRNHVCL
Sbjct: 61  ASNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQRNHVCL 120

Query: 186 NEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWEN 245
            EGSIT+PPYAPPQKMQ+H IP  TPRSIFVYFRGLFYDV NDPEGGYYARGARAAVWEN
Sbjct: 121 KEGSITVPPYAPPQKMQSHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWEN 180

Query: 246 FKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
           FK+NPLFDIST+HPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAV+FGCIPVIIADDIVLP
Sbjct: 181 FKDNPLFDISTEHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLP 240

Query: 306 FADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
           FADAIPWE+IG+FV E+DVP LDTILTSIPPEVILRKQRLLANPSMK+AMLFPQPAQPGD
Sbjct: 241 FADAIPWEDIGVFVDEKDVPYLDTILTSIPPEVILRKQRLLANPSMKQAMLFPQPAQPGD 300

Query: 366 AFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           AFHQ+LNGLARKLPH++S YL+PG K LNWT+GPV DLKPW
Sbjct: 301 AFHQVLNGLARKLPHERSVYLRPGEKLLNWTAGPVADLKPW 341


>gi|115481310|ref|NP_001064248.1| Os10g0180000 [Oryza sativa Japonica Group]
 gi|122212636|sp|Q33AH8.2|GT101_ORYSJ RecName: Full=Probable glucuronosyltransferase GUT1; AltName:
           Full=Glucuronoxylan glucuronosyltransferase 1;
           Short=OsGUT1
 gi|110288701|gb|ABB46947.2| exostosin family protein, putative, expressed [Oryza sativa
           Japonica Group]
 gi|113638857|dbj|BAF26162.1| Os10g0180000 [Oryza sativa Japonica Group]
          Length = 417

 Score =  654 bits (1687), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 309/406 (76%), Positives = 346/406 (85%), Gaps = 13/406 (3%)

Query: 6   IALLLAF-----VSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDP 60
           +A+LL F     V+A     IERI        +DDPVGKLKVYVYELP KYNK ++ KD 
Sbjct: 20  LAVLLLFACSSGVAAAAAQGIERI--------KDDPVGKLKVYVYELPPKYNKNIVAKDS 71

Query: 61  RCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRS 120
           RCL+HMFA EIFMHRFLLSS +RT NP+EADWFYTP+Y TCDLTP G PL  KSPRMMRS
Sbjct: 72  RCLSHMFATEIFMHRFLLSSAIRTSNPDEADWFYTPVYTTCDLTPWGHPLTTKSPRMMRS 131

Query: 121 AIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR 180
           AI+ IS  WPYWNRTEGADHFFVVPHDF ACF++QE KAIERGILP+L+RATLVQTFGQ+
Sbjct: 132 AIKFISKYWPYWNRTEGADHFFVVPHDFAACFYFQEAKAIERGILPVLRRATLVQTFGQK 191

Query: 181 NHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARA 240
           NH CL +GSIT+PPY P  K++AH +PP+TPRSIFVYFRGLFYD +NDPEGGYYARGARA
Sbjct: 192 NHACLKDGSITVPPYTPAHKIRAHLVPPETPRSIFVYFRGLFYDTSNDPEGGYYARGARA 251

Query: 241 AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 300
           +VWENFKNNP+FDISTDHP TYYEDMQRA+FCLCPLGWAPWSPRLVEAVVFGCIPVIIAD
Sbjct: 252 SVWENFKNNPMFDISTDHPQTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 311

Query: 301 DIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQP 360
           DIVLPF+DAIPWEEI +FVAE+DVP+LDTILTSIP EVILRKQ +LA PSMK+ MLFPQP
Sbjct: 312 DIVLPFSDAIPWEEIAVFVAEDDVPQLDTILTSIPTEVILRKQAMLAEPSMKQTMLFPQP 371

Query: 361 AQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           A+PGD FHQ++N LARKLPH +  +LKPG K LNWT G   DLKPW
Sbjct: 372 AEPGDGFHQVMNALARKLPHGRDVFLKPGQKVLNWTEGTREDLKPW 417


>gi|242059739|ref|XP_002459015.1| hypothetical protein SORBIDRAFT_03g044510 [Sorghum bicolor]
 gi|241930990|gb|EES04135.1| hypothetical protein SORBIDRAFT_03g044510 [Sorghum bicolor]
          Length = 415

 Score =  652 bits (1681), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 310/408 (75%), Positives = 352/408 (86%), Gaps = 4/408 (0%)

Query: 3   VTAIALLLAFVSA----DNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLK 58
           V AI L+   V+A        K  RISG+A  VL+DDPVGKLKV+VYE+P KYNK LL K
Sbjct: 8   VVAILLVATCVAAAPPQQQPRKPGRISGNAVGVLDDDPVGKLKVFVYEMPRKYNKMLLDK 67

Query: 59  DPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMM 118
           D RCL HMFAAEIFMH+FLLSS VRTL+PEEADWFYTP+Y TCDLTP G PLPF++PR+M
Sbjct: 68  DSRCLHHMFAAEIFMHQFLLSSAVRTLDPEEADWFYTPVYTTCDLTPQGFPLPFRAPRIM 127

Query: 119 RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG 178
           RSAI+ +++ WP+WNRT+GADHFF+ PHDFGACFHYQEE+A+ERGILPLL+RATLVQTFG
Sbjct: 128 RSAIRYVATTWPFWNRTDGADHFFLTPHDFGACFHYQEERAMERGILPLLRRATLVQTFG 187

Query: 179 QRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGA 238
           QRNHVCL +GSIT+PPYA P KMQAH I P TPRSIFVYFRGLFYD+ NDPEGGYYARGA
Sbjct: 188 QRNHVCLQDGSITVPPYADPHKMQAHLISPGTPRSIFVYFRGLFYDMGNDPEGGYYARGA 247

Query: 239 RAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 298
           RA+VWENFK+NPLFDIST+HP TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII
Sbjct: 248 RASVWENFKDNPLFDISTEHPYTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 307

Query: 299 ADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFP 358
           ADDIVLPFADAIPWE+I +FVAE DVP+LD+ILTSIP   ILR+QRLLA  S+K+A+LF 
Sbjct: 308 ADDIVLPFADAIPWEDISVFVAERDVPRLDSILTSIPLADILRRQRLLARESVKQALLFH 367

Query: 359 QPAQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           QPA+ GDAFHQ+LNGLARKLPH    +L+PG K L+W +G   DLKPW
Sbjct: 368 QPARTGDAFHQVLNGLARKLPHGDGVFLQPGEKVLDWDAGLDSDLKPW 415


>gi|18642697|gb|AAL76189.1|AC092173_1 Unknown protein [Oryza sativa Japonica Group]
 gi|23821296|dbj|BAC20930.1| NpGUT1 homolog [Oryza sativa Japonica Group]
          Length = 401

 Score =  652 bits (1681), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 308/406 (75%), Positives = 344/406 (84%), Gaps = 12/406 (2%)

Query: 6   IALLLAF-----VSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDP 60
           +A+LL F     V+A     IERI        EDDPVGKLKVYVYELP KYNK ++ KD 
Sbjct: 3   LAVLLLFACSSGVAAAAAQGIERIK-------EDDPVGKLKVYVYELPPKYNKNIVAKDS 55

Query: 61  RCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRS 120
           RCL+HMFA EIFMHRFLLSS +RT NP+EADWFYTP+Y TCDLTP G PL  KSPRMMRS
Sbjct: 56  RCLSHMFATEIFMHRFLLSSAIRTSNPDEADWFYTPVYTTCDLTPWGHPLTTKSPRMMRS 115

Query: 121 AIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR 180
           AI+ IS  WPYWNRTEGADHFFVVPHDF ACF++QE KAIERGILP+L+RATLVQTFGQ+
Sbjct: 116 AIKFISKYWPYWNRTEGADHFFVVPHDFAACFYFQEAKAIERGILPVLRRATLVQTFGQK 175

Query: 181 NHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARA 240
           NH CL +GSIT+PPY P  K++AH +PP+TPRSIFVYFRGLFYD +NDPEGGYYARGARA
Sbjct: 176 NHACLKDGSITVPPYTPAHKIRAHLVPPETPRSIFVYFRGLFYDTSNDPEGGYYARGARA 235

Query: 241 AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 300
           +VWENFKNNP+FDISTDHP TYYEDMQRA+FCLCPLGWAPWSPRLVEAVVFGCIPVIIAD
Sbjct: 236 SVWENFKNNPMFDISTDHPQTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 295

Query: 301 DIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQP 360
           DI LP +DAIPWEEI +FVAE+DVP+LDTILTSIP EVILRKQ +LA PSMK+ MLFPQP
Sbjct: 296 DIDLPLSDAIPWEEIAVFVAEDDVPQLDTILTSIPTEVILRKQAMLAEPSMKQTMLFPQP 355

Query: 361 AQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           A+PGD FHQ++N LARKLPH +  +LKPG K LNWT G   DLKPW
Sbjct: 356 AEPGDGFHQVMNALARKLPHGRDVFLKPGQKVLNWTEGTREDLKPW 401


>gi|195648072|gb|ACG43504.1| secondary cell wall-related glycosyltransferase family 47 [Zea
           mays]
          Length = 420

 Score =  651 bits (1680), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 303/384 (78%), Positives = 345/384 (89%)

Query: 23  RISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPV 82
           RISG A  VL+DDPVGKLKV+VYE+P KYN+ LL KD RCL HMFAAEIFMH+FLLSS V
Sbjct: 37  RISGDAVGVLDDDPVGKLKVFVYEMPRKYNRMLLDKDSRCLHHMFAAEIFMHQFLLSSAV 96

Query: 83  RTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFF 142
           RTL+PEEADWFYTP Y TCDLTP G PLPF++PR+MRSAI+ +++ WP+WNRT+GADHFF
Sbjct: 97  RTLDPEEADWFYTPAYTTCDLTPQGFPLPFRAPRIMRSAIRYVATTWPFWNRTDGADHFF 156

Query: 143 VVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQ 202
           + PHDFGACFHYQEE+A+ERGILPLL+RATLVQTFGQRNHVCL +GSIT+PPYA P+KMQ
Sbjct: 157 LTPHDFGACFHYQEERAMERGILPLLRRATLVQTFGQRNHVCLQDGSITVPPYADPRKMQ 216

Query: 203 AHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTY 262
           AH I P TPRSIFVYFRGLFYD+ NDPEGGYYARGARA+VWENFK+NPLFDIST+HP+TY
Sbjct: 217 AHLISPGTPRSIFVYFRGLFYDMGNDPEGGYYARGARASVWENFKDNPLFDISTEHPSTY 276

Query: 263 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
           YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE+I +FVAE 
Sbjct: 277 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEDISVFVAER 336

Query: 323 DVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDK 382
           DVP+LD+ILTSIP   ILR+QRLLA  S+K+A+LF QPA+PGDAFHQ+LNGLARKLPH +
Sbjct: 337 DVPRLDSILTSIPLPDILRRQRLLARDSVKQALLFHQPARPGDAFHQVLNGLARKLPHGE 396

Query: 383 STYLKPGGKFLNWTSGPVGDLKPW 406
             +L+PG K L+W +G   DLKPW
Sbjct: 397 GVFLQPGEKALDWDAGLQSDLKPW 420


>gi|414878903|tpg|DAA56034.1| TPA: hypothetical protein ZEAMMB73_301237 [Zea mays]
          Length = 393

 Score =  650 bits (1678), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 305/388 (78%), Positives = 346/388 (89%), Gaps = 3/388 (0%)

Query: 22  ERISGSAGD---VLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLL 78
           ERI G AGD   VL+DDPVGKLKV+VYE+P KYN+ LL KD RCL HMFAAEIFMH+FLL
Sbjct: 6   ERIDGFAGDAVGVLDDDPVGKLKVFVYEMPRKYNRMLLDKDSRCLHHMFAAEIFMHQFLL 65

Query: 79  SSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGA 138
           SS VRTL+PEEADWFYTP Y TCDLTP G PLPF++PR+MRSAI+ +++ WP+WNRT+GA
Sbjct: 66  SSAVRTLDPEEADWFYTPAYTTCDLTPQGFPLPFRAPRIMRSAIRYVATTWPFWNRTDGA 125

Query: 139 DHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPP 198
           DHFF+ PHDFGACFHYQEE+A+ERGILPLL+RATLVQTFGQRNHVCL +GSIT+PPYA P
Sbjct: 126 DHFFLTPHDFGACFHYQEERAMERGILPLLRRATLVQTFGQRNHVCLQDGSITVPPYADP 185

Query: 199 QKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDH 258
            KMQAH I P TPRSIFVYFRGLFYD+ NDPEGGYYARGARA+VWENFK+NPLFDIST+H
Sbjct: 186 GKMQAHLISPGTPRSIFVYFRGLFYDMGNDPEGGYYARGARASVWENFKDNPLFDISTEH 245

Query: 259 PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
           P+TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE+I +F
Sbjct: 246 PSTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEDISVF 305

Query: 319 VAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKL 378
           VAE DVP+LD+ILTSIP   ILR+QRLLA  S+K+A+LF QPA+PGDAFHQ+LNGLARKL
Sbjct: 306 VAERDVPRLDSILTSIPLPDILRRQRLLARDSVKQALLFHQPARPGDAFHQVLNGLARKL 365

Query: 379 PHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           PH +  +L+PG K L+W +G   DLKPW
Sbjct: 366 PHGEGVFLQPGEKALDWDAGLQSDLKPW 393


>gi|414878901|tpg|DAA56032.1| TPA: secondary cell wall glycosyltransferase family 47 [Zea mays]
          Length = 417

 Score =  649 bits (1675), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 303/384 (78%), Positives = 344/384 (89%)

Query: 23  RISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPV 82
           RISG A  VL+DDPVGKLKV+VYE+P KYN+ LL KD RCL HMFAAEIFMH+FLLSS V
Sbjct: 34  RISGDAVGVLDDDPVGKLKVFVYEMPRKYNRMLLDKDSRCLHHMFAAEIFMHQFLLSSAV 93

Query: 83  RTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFF 142
           RTL+PEEADWFYTP Y TCDLTP G PLPF++PR+MRSAI+ +++ WP+WNRT+GADHFF
Sbjct: 94  RTLDPEEADWFYTPAYTTCDLTPQGFPLPFRAPRIMRSAIRYVATTWPFWNRTDGADHFF 153

Query: 143 VVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQ 202
           + PHDFGACFHYQEE+A+ERGILPLL+RATLVQTFGQRNHVCL +GSIT+PPYA P KMQ
Sbjct: 154 LTPHDFGACFHYQEERAMERGILPLLRRATLVQTFGQRNHVCLQDGSITVPPYADPGKMQ 213

Query: 203 AHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTY 262
           AH I P TPRSIFVYFRGLFYD+ NDPEGGYYARGARA+VWENFK+NPLFDIST+HP+TY
Sbjct: 214 AHLISPGTPRSIFVYFRGLFYDMGNDPEGGYYARGARASVWENFKDNPLFDISTEHPSTY 273

Query: 263 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
           YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE+I +FVAE 
Sbjct: 274 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEDISVFVAER 333

Query: 323 DVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDK 382
           DVP+LD+ILTSIP   ILR+QRLLA  S+K+A+LF QPA+PGDAFHQ+LNGLARKLPH +
Sbjct: 334 DVPRLDSILTSIPLPDILRRQRLLARDSVKQALLFHQPARPGDAFHQVLNGLARKLPHGE 393

Query: 383 STYLKPGGKFLNWTSGPVGDLKPW 406
             +L+PG K L+W +G   DLKPW
Sbjct: 394 GVFLQPGEKALDWDAGLQSDLKPW 417


>gi|293336613|ref|NP_001170701.1| uncharacterized protein LOC100384778 precursor [Zea mays]
 gi|238007038|gb|ACR34554.1| unknown [Zea mays]
 gi|413951433|gb|AFW84082.1| hypothetical protein ZEAMMB73_582447 [Zea mays]
          Length = 412

 Score =  649 bits (1674), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 299/384 (77%), Positives = 343/384 (89%)

Query: 23  RISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPV 82
           R SG+A  VL+DDPVGKLKV+VYE+P KYNK L+ KD RCL HMFAAEIFMH+FLLSS V
Sbjct: 29  RTSGNAVGVLDDDPVGKLKVFVYEMPRKYNKMLVDKDSRCLQHMFAAEIFMHQFLLSSAV 88

Query: 83  RTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFF 142
           RTL+PE ADWFYTP+Y TCDLTP G PLPF++PRMMRSAI+ +++ WP+WNRT+GADHFF
Sbjct: 89  RTLDPEAADWFYTPVYTTCDLTPQGFPLPFRAPRMMRSAIRYVAATWPFWNRTDGADHFF 148

Query: 143 VVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQ 202
           + PHDFGACFHYQEE+A+ERGILPLL+RATLVQTFGQRNHVCL +GSIT+PPYA P ++Q
Sbjct: 149 LTPHDFGACFHYQEERAVERGILPLLRRATLVQTFGQRNHVCLQDGSITVPPYASPHRLQ 208

Query: 203 AHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTY 262
           AH + P TPRSIFVYFRGLFYD+ NDPEGGYYARGARA+VWENFK+NPLFDIST+HP TY
Sbjct: 209 AHLVGPGTPRSIFVYFRGLFYDMGNDPEGGYYARGARASVWENFKDNPLFDISTEHPATY 268

Query: 263 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
           YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE++ +FVAE 
Sbjct: 269 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEDMSVFVAER 328

Query: 323 DVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDK 382
           DVP+LD+ILTSIP   ILR+QRLLA  S+KRA+LF QPA+PGDAFHQ+LNGLARKLPH +
Sbjct: 329 DVPRLDSILTSIPLPDILRRQRLLARDSVKRALLFHQPARPGDAFHQVLNGLARKLPHRE 388

Query: 383 STYLKPGGKFLNWTSGPVGDLKPW 406
             +L+PG K L+W +G   DLKPW
Sbjct: 389 GVFLQPGEKALDWDAGLESDLKPW 412


>gi|223974777|gb|ACN31576.1| unknown [Zea mays]
          Length = 412

 Score =  646 bits (1667), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 298/384 (77%), Positives = 341/384 (88%)

Query: 23  RISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPV 82
           R SG A  VL+DDPVGKLKV+VYE+P KYNK L+  D RCL HMFAAEIFMH+FLLSS V
Sbjct: 29  RTSGDAVGVLDDDPVGKLKVFVYEMPRKYNKMLVDNDSRCLQHMFAAEIFMHQFLLSSAV 88

Query: 83  RTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFF 142
           RTL+PE ADWFYTP+Y TCDLTP G PLPF++PRMMRSAI+ +++ WP+WNRT+GADHFF
Sbjct: 89  RTLDPEAADWFYTPVYTTCDLTPQGFPLPFRAPRMMRSAIRYVAATWPFWNRTDGADHFF 148

Query: 143 VVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQ 202
           + PHDFGACFHYQEE+A+ERGILPLL+RATLVQTFGQRNHVCL +GSIT+PPYA P ++Q
Sbjct: 149 LTPHDFGACFHYQEERAVERGILPLLRRATLVQTFGQRNHVCLQDGSITVPPYASPHRLQ 208

Query: 203 AHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTY 262
           AH + P TPRSIFVYFRGLFYD+ NDPEGGYYARGARA+VWENFK+NPLFDIST+HP TY
Sbjct: 209 AHLVGPGTPRSIFVYFRGLFYDMGNDPEGGYYARGARASVWENFKDNPLFDISTEHPATY 268

Query: 263 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
           YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE++ +FVAE 
Sbjct: 269 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEDMSVFVAER 328

Query: 323 DVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDK 382
           DVP+LD+ILTSIP   ILR+QRLLA  S+KRA+LF QPA+PGDAFHQ+LNGLARKLPH +
Sbjct: 329 DVPRLDSILTSIPLPDILRRQRLLARDSVKRALLFHQPARPGDAFHQVLNGLARKLPHGE 388

Query: 383 STYLKPGGKFLNWTSGPVGDLKPW 406
             +L+PG K L+W +G   DLKPW
Sbjct: 389 DVFLQPGEKALDWDAGLESDLKPW 412


>gi|414878902|tpg|DAA56033.1| TPA: hypothetical protein ZEAMMB73_301237 [Zea mays]
          Length = 387

 Score =  642 bits (1657), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 299/380 (78%), Positives = 340/380 (89%)

Query: 27  SAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLN 86
            A  VL+DDPVGKLKV+VYE+P KYN+ LL KD RCL HMFAAEIFMH+FLLSS VRTL+
Sbjct: 8   DAVGVLDDDPVGKLKVFVYEMPRKYNRMLLDKDSRCLHHMFAAEIFMHQFLLSSAVRTLD 67

Query: 87  PEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPH 146
           PEEADWFYTP Y TCDLTP G PLPF++PR+MRSAI+ +++ WP+WNRT+GADHFF+ PH
Sbjct: 68  PEEADWFYTPAYTTCDLTPQGFPLPFRAPRIMRSAIRYVATTWPFWNRTDGADHFFLTPH 127

Query: 147 DFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQI 206
           DFGACFHYQEE+A+ERGILPLL+RATLVQTFGQRNHVCL +GSIT+PPYA P KMQAH I
Sbjct: 128 DFGACFHYQEERAMERGILPLLRRATLVQTFGQRNHVCLQDGSITVPPYADPGKMQAHLI 187

Query: 207 PPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM 266
            P TPRSIFVYFRGLFYD+ NDPEGGYYARGARA+VWENFK+NPLFDIST+HP+TYYEDM
Sbjct: 188 SPGTPRSIFVYFRGLFYDMGNDPEGGYYARGARASVWENFKDNPLFDISTEHPSTYYEDM 247

Query: 267 QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK 326
           QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE+I +FVAE DVP+
Sbjct: 248 QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEDISVFVAERDVPR 307

Query: 327 LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDKSTYL 386
           LD+ILTSIP   ILR+QRLLA  S+K+A+LF QPA+PGDAFHQ+LNGLARKLPH +  +L
Sbjct: 308 LDSILTSIPLPDILRRQRLLARDSVKQALLFHQPARPGDAFHQVLNGLARKLPHGEGVFL 367

Query: 387 KPGGKFLNWTSGPVGDLKPW 406
           +PG K L+W +G   DLKPW
Sbjct: 368 QPGEKALDWDAGLQSDLKPW 387


>gi|125539672|gb|EAY86067.1| hypothetical protein OsI_07437 [Oryza sativa Indica Group]
          Length = 342

 Score =  633 bits (1633), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 299/341 (87%), Positives = 318/341 (93%)

Query: 66  MFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLI 125
           MFAAEIFMHRFLLSS VRTLNPE+ADWFY P+Y TCDLT  GLPLPFKSPRMMRSAIQ +
Sbjct: 1   MFAAEIFMHRFLLSSAVRTLNPEQADWFYAPVYTTCDLTHAGLPLPFKSPRMMRSAIQFL 60

Query: 126 SSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCL 185
           S  WP+WNRT+GADHFFVVPHDFGACFHYQEEKAIERGILPLL+RATLVQTFGQ+NHVCL
Sbjct: 61  SRKWPFWNRTDGADHFFVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQKNHVCL 120

Query: 186 NEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWEN 245
            EGSITIPPYAPPQKMQAH IPPDTPRSIFVYFRGLFYD  NDPEGGYYARGARA++WEN
Sbjct: 121 KEGSITIPPYAPPQKMQAHLIPPDTPRSIFVYFRGLFYDNGNDPEGGYYARGARASLWEN 180

Query: 246 FKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
           FKNNPLFDIST+HP TYYEDMQR++FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP
Sbjct: 181 FKNNPLFDISTEHPATYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 240

Query: 306 FADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
           FADAIPW+EIG+FV EEDVP+LD+ILTSIP + ILRKQRLLANPSMK+AMLFPQPAQP D
Sbjct: 241 FADAIPWDEIGVFVDEEDVPRLDSILTSIPIDDILRKQRLLANPSMKQAMLFPQPAQPRD 300

Query: 366 AFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           AFHQILNGLARKLPH  S YLKPG K LNWT+GPV DLKPW
Sbjct: 301 AFHQILNGLARKLPHPDSVYLKPGEKHLNWTAGPVADLKPW 341


>gi|386576430|gb|AFJ12119.1| glycosyltransferase, partial [Nicotiana tabacum]
          Length = 317

 Score =  621 bits (1602), Expect = e-175,   Method: Compositional matrix adjust.
 Identities = 293/317 (92%), Positives = 306/317 (96%)

Query: 66  MFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLI 125
           MFAAEI+MHRFLLSS VRT NPEEADWFYTP+Y TCDLTP GLPLPFKSPRMMRSAIQLI
Sbjct: 1   MFAAEIYMHRFLLSSAVRTFNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLI 60

Query: 126 SSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCL 185
           +SNWPYWNRTEGADHFF+ PHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCL
Sbjct: 61  ASNWPYWNRTEGADHFFITPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCL 120

Query: 186 NEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWEN 245
            +GSITIPPYAPPQKMQ+H IPPDTPRSIFVYFRGLFYDV NDPEGGYYARGARAAVWEN
Sbjct: 121 KDGSITIPPYAPPQKMQSHLIPPDTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWEN 180

Query: 246 FKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
           FK+NPLFDIST+HPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAV+FGCIPVIIADDIVLP
Sbjct: 181 FKDNPLFDISTEHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLP 240

Query: 306 FADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
           FADAIPWE+IG+FVAE+DVP LDTILTSIPPE ILRKQRLLANPSMK+AMLFPQPAQ GD
Sbjct: 241 FADAIPWEDIGVFVAEKDVPNLDTILTSIPPEEILRKQRLLANPSMKQAMLFPQPAQSGD 300

Query: 366 AFHQILNGLARKLPHDK 382
           AFHQILNGLARKLPHDK
Sbjct: 301 AFHQILNGLARKLPHDK 317


>gi|39725600|dbj|BAD04923.1| pectin-glucuronyltransferase [Nicotiana tabacum]
          Length = 317

 Score =  612 bits (1579), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 290/317 (91%), Positives = 303/317 (95%)

Query: 66  MFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLI 125
           MFAAEI+MHRFLLSS VRT NPEEADWFYTP+Y TCDLTP GLPLPFKSPRMMRSAIQLI
Sbjct: 1   MFAAEIYMHRFLLSSAVRTFNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLI 60

Query: 126 SSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCL 185
           +SNWPYWNRTEGADHFF+ PHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCL
Sbjct: 61  ASNWPYWNRTEGADHFFITPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCL 120

Query: 186 NEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWEN 245
            +GSITIPPYAPPQKMQ+H IPPDTPRSIFVYFRGLFYDV NDPEGGYYARGARAAVWEN
Sbjct: 121 KDGSITIPPYAPPQKMQSHLIPPDTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWEN 180

Query: 246 FKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
           FK+NP FDIST+HPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAV+FGCIPVIIADDIVLP
Sbjct: 181 FKDNPPFDISTEHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLP 240

Query: 306 FADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
           FADAIPWE+IG+FVAE+DVP LDTILTSIPPE ILRKQRLLA PSMK+AML PQPAQ GD
Sbjct: 241 FADAIPWEDIGVFVAEKDVPTLDTILTSIPPEEILRKQRLLAKPSMKQAMLSPQPAQSGD 300

Query: 366 AFHQILNGLARKLPHDK 382
           AFHQILNGLARKLPHDK
Sbjct: 301 AFHQILNGLARKLPHDK 317


>gi|414878904|tpg|DAA56035.1| TPA: hypothetical protein ZEAMMB73_301237 [Zea mays]
          Length = 360

 Score =  612 bits (1578), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 284/360 (78%), Positives = 323/360 (89%)

Query: 47  LPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPT 106
           +P KYN+ LL KD RCL HMFAAEIFMH+FLLSS VRTL+PEEADWFYTP Y TCDLTP 
Sbjct: 1   MPRKYNRMLLDKDSRCLHHMFAAEIFMHQFLLSSAVRTLDPEEADWFYTPAYTTCDLTPQ 60

Query: 107 GLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILP 166
           G PLPF++PR+MRSAI+ +++ WP+WNRT+GADHFF+ PHDFGACFHYQEE+A+ERGILP
Sbjct: 61  GFPLPFRAPRIMRSAIRYVATTWPFWNRTDGADHFFLTPHDFGACFHYQEERAMERGILP 120

Query: 167 LLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVN 226
           LL+RATLVQTFGQRNHVCL +GSIT+PPYA P KMQAH I P TPRSIFVYFRGLFYD+ 
Sbjct: 121 LLRRATLVQTFGQRNHVCLQDGSITVPPYADPGKMQAHLISPGTPRSIFVYFRGLFYDMG 180

Query: 227 NDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLV 286
           NDPEGGYYARGARA+VWENFK+NPLFDIST+HP+TYYEDMQRAIFCLCPLGWAPWSPRLV
Sbjct: 181 NDPEGGYYARGARASVWENFKDNPLFDISTEHPSTYYEDMQRAIFCLCPLGWAPWSPRLV 240

Query: 287 EAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLL 346
           EAVVFGCIPVIIADDIVLPFADAIPWE+I +FVAE DVP+LD+ILTSIP   ILR+QRLL
Sbjct: 241 EAVVFGCIPVIIADDIVLPFADAIPWEDISVFVAERDVPRLDSILTSIPLPDILRRQRLL 300

Query: 347 ANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           A  S+K+A+LF QPA+PGDAFHQ+LNGLARKLPH +  +L+PG K L+W +G   DLKPW
Sbjct: 301 ARDSVKQALLFHQPARPGDAFHQVLNGLARKLPHGEGVFLQPGEKALDWDAGLQSDLKPW 360


>gi|222619798|gb|EEE55930.1| hypothetical protein OsJ_04615 [Oryza sativa Japonica Group]
          Length = 401

 Score =  598 bits (1541), Expect = e-168,   Method: Compositional matrix adjust.
 Identities = 284/394 (72%), Positives = 327/394 (82%), Gaps = 21/394 (5%)

Query: 13  VSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIF 72
            +A  T K  R+SG  GDVLEDDP GKLKV+VYE+P KYN  LL KD RCL HMFAAEIF
Sbjct: 29  AAATTTQKHGRMSGKGGDVLEDDPTGKLKVFVYEMPRKYNLNLLAKDSRCLQHMFAAEIF 88

Query: 73  MHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYW 132
           MH+FLLSSPVRTL+PEEADWFYTP Y TCDLTP G PLPF++PR+MRSA++ +++ WPYW
Sbjct: 89  MHQFLLSSPVRTLDPEEADWFYTPAYTTCDLTPQGFPLPFRAPRIMRSAVRYVAATWPYW 148

Query: 133 NRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITI 192
           NRT+GADHFF+ PHDFGACFHYQEE+AIERGILP+L+RATLVQTFGQR+H CL  GSIT+
Sbjct: 149 NRTDGADHFFLAPHDFGACFHYQEERAIERGILPVLRRATLVQTFGQRHHPCLQPGSITV 208

Query: 193 PPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLF 252
           PPYA P+KM+AH+I P TPRSIFVYFRGLFYD+ NDPEGGYYARGARA+VWENFK+NPLF
Sbjct: 209 PPYADPRKMEAHRISPATPRSIFVYFRGLFYDMGNDPEGGYYARGARASVWENFKDNPLF 268

Query: 253 DISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 312
           DIST+HP T                     PRLVEAVVFGCIPVIIADDIVLPFADAIPW
Sbjct: 269 DISTEHPAT---------------------PRLVEAVVFGCIPVIIADDIVLPFADAIPW 307

Query: 313 EEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILN 372
            EI +FVAEEDVP+LDTIL S+P + ++RKQRLLA+P+MK+A+LF QPA+PGDAFHQILN
Sbjct: 308 GEISVFVAEEDVPRLDTILASVPLDEVIRKQRLLASPAMKQAVLFHQPARPGDAFHQILN 367

Query: 373 GLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           GLARKLPH K  +L+PG K ++W  G   DLKPW
Sbjct: 368 GLARKLPHPKGVFLEPGEKGIDWDQGLENDLKPW 401


>gi|110288702|gb|ABB46945.2| exostosin family protein, putative, expressed [Oryza sativa
           Japonica Group]
 gi|215766610|dbj|BAG98714.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 341

 Score =  595 bits (1535), Expect = e-168,   Method: Compositional matrix adjust.
 Identities = 274/341 (80%), Positives = 304/341 (89%)

Query: 66  MFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLI 125
           MFA EIFMHRFLLSS +RT NP+EADWFYTP+Y TCDLTP G PL  KSPRMMRSAI+ I
Sbjct: 1   MFATEIFMHRFLLSSAIRTSNPDEADWFYTPVYTTCDLTPWGHPLTTKSPRMMRSAIKFI 60

Query: 126 SSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCL 185
           S  WPYWNRTEGADHFFVVPHDF ACF++QE KAIERGILP+L+RATLVQTFGQ+NH CL
Sbjct: 61  SKYWPYWNRTEGADHFFVVPHDFAACFYFQEAKAIERGILPVLRRATLVQTFGQKNHACL 120

Query: 186 NEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWEN 245
            +GSIT+PPY P  K++AH +PP+TPRSIFVYFRGLFYD +NDPEGGYYARGARA+VWEN
Sbjct: 121 KDGSITVPPYTPAHKIRAHLVPPETPRSIFVYFRGLFYDTSNDPEGGYYARGARASVWEN 180

Query: 246 FKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
           FKNNP+FDISTDHP TYYEDMQRA+FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP
Sbjct: 181 FKNNPMFDISTDHPQTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 240

Query: 306 FADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
           F+DAIPWEEI +FVAE+DVP+LDTILTSIP EVILRKQ +LA PSMK+ MLFPQPA+PGD
Sbjct: 241 FSDAIPWEEIAVFVAEDDVPQLDTILTSIPTEVILRKQAMLAEPSMKQTMLFPQPAEPGD 300

Query: 366 AFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
            FHQ++N LARKLPH +  +LKPG K LNWT G   DLKPW
Sbjct: 301 GFHQVMNALARKLPHGRDVFLKPGQKVLNWTEGTREDLKPW 341


>gi|386576428|gb|AFJ12118.1| glycosyltransferase, partial [Nicotiana tabacum]
          Length = 294

 Score =  574 bits (1480), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 271/294 (92%), Positives = 284/294 (96%)

Query: 68  AAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISS 127
           AAEI+MHRFLLSS VRT NPEEADWFYTP+Y TCDLTP GLPLPFKSPRMMRSAIQLI+S
Sbjct: 1   AAEIYMHRFLLSSAVRTFNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIAS 60

Query: 128 NWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNE 187
           NWPYWNRTEGADHFF+ PHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCL +
Sbjct: 61  NWPYWNRTEGADHFFITPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLKD 120

Query: 188 GSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFK 247
           GSITIPPYAPPQKMQ+H IPPDTPRSIFVYFRGLFYDV NDPEGGYYARGARAAVWENFK
Sbjct: 121 GSITIPPYAPPQKMQSHLIPPDTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENFK 180

Query: 248 NNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFA 307
           +NPLFDIST+HPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAV+FGCIPVIIADDIVLPFA
Sbjct: 181 DNPLFDISTEHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPFA 240

Query: 308 DAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPA 361
           DAIPWE+IG+FVAE+DVP LDTILTSIPPE ILRKQRLLANPSMK+AMLFPQPA
Sbjct: 241 DAIPWEDIGVFVAEKDVPNLDTILTSIPPEEILRKQRLLANPSMKQAMLFPQPA 294


>gi|224136520|ref|XP_002322350.1| predicted protein [Populus trichocarpa]
 gi|222869346|gb|EEF06477.1| predicted protein [Populus trichocarpa]
          Length = 343

 Score =  544 bits (1402), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 255/294 (86%), Positives = 272/294 (92%)

Query: 112 FKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRA 171
           F+SPRM+RSA QLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQ A
Sbjct: 50  FQSPRMIRSATQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQHA 109

Query: 172 TLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEG 231
           TLVQTFGQRNHVCL +GSIT+P YAPPQKMQ H IP  TPRSIFVYFRGLFYDV NDPEG
Sbjct: 110 TLVQTFGQRNHVCLKDGSITVPSYAPPQKMQTHLIPEKTPRSIFVYFRGLFYDVGNDPEG 169

Query: 232 GYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVF 291
           GYYARGARAAVWENFK+NPLFDIST+HPTTYYEDMQ+A+FCLCPL WAPWSPRLVEA++F
Sbjct: 170 GYYARGARAAVWENFKDNPLFDISTEHPTTYYEDMQQAVFCLCPLSWAPWSPRLVEALIF 229

Query: 292 GCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSM 351
           G IPVII DDIVLPFADAIPWEEIG+FV E+DVP LDTILTSIPPEVILRKQRLLANPSM
Sbjct: 230 GYIPVIIVDDIVLPFADAIPWEEIGVFVDEKDVPNLDTILTSIPPEVILRKQRLLANPSM 289

Query: 352 KRAMLFPQPAQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKP 405
           K+AMLFPQ AQ GDAFHQ+LNGLARKL HD++ YLK G K LNWT+G VGDLKP
Sbjct: 290 KQAMLFPQLAQAGDAFHQVLNGLARKLSHDRNVYLKSGEKILNWTAGLVGDLKP 343


>gi|326499682|dbj|BAJ86152.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 304

 Score =  536 bits (1382), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 253/283 (89%), Positives = 266/283 (93%)

Query: 124 LISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHV 183
           +ISS+WPYWNRT GADHFFVVPHDFGACFHYQEEKAIERGILPLL+RATLVQTFGQ++HV
Sbjct: 22  VISSHWPYWNRTAGADHFFVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQKDHV 81

Query: 184 CLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVW 243
           CL EGSI IPPYAPPQKM+ H +PP+TPRSIFVYFRGLFYD  NDPEGGYYARGARA+VW
Sbjct: 82  CLKEGSINIPPYAPPQKMKTHLVPPETPRSIFVYFRGLFYDTANDPEGGYYARGARASVW 141

Query: 244 ENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 303
           ENFKNNPLFDISTDHP TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV
Sbjct: 142 ENFKNNPLFDISTDHPPTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 201

Query: 304 LPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP 363
           LPFADAIPW+EIG+FVAE+DVPKLDTILTSIP EVILRKQRLLANPSMK+AMLFPQPAQ 
Sbjct: 202 LPFADAIPWDEIGVFVAEDDVPKLDTILTSIPMEVILRKQRLLANPSMKQAMLFPQPAQA 261

Query: 364 GDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           GDAFHQILNGLARKLPH KS YL PG K LNWT GP GDLKPW
Sbjct: 262 GDAFHQILNGLARKLPHSKSVYLTPGQKALNWTQGPKGDLKPW 304


>gi|255543228|ref|XP_002512677.1| catalytic, putative [Ricinus communis]
 gi|223548638|gb|EEF50129.1| catalytic, putative [Ricinus communis]
          Length = 253

 Score =  493 bits (1270), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 236/252 (93%), Positives = 242/252 (96%)

Query: 155 QEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSI 214
           QEEKAIERGI PLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQA QIPP+TPRSI
Sbjct: 2   QEEKAIERGIFPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQARQIPPETPRSI 61

Query: 215 FVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLC 274
           FVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDIST+HPTTYYEDMQRAIFCLC
Sbjct: 62  FVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTEHPTTYYEDMQRAIFCLC 121

Query: 275 PLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSI 334
           PLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG+FVAEEDVP LDTILTSI
Sbjct: 122 PLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVAEEDVPNLDTILTSI 181

Query: 335 PPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLN 394
           P +V+LRKQRLLANPSMKRAMLFPQPAQ GDAFHQILNGLARKLPHD S YLKPG K LN
Sbjct: 182 PTQVVLRKQRLLANPSMKRAMLFPQPAQSGDAFHQILNGLARKLPHDNSIYLKPGEKVLN 241

Query: 395 WTSGPVGDLKPW 406
           WT+GPV DLKPW
Sbjct: 242 WTAGPVKDLKPW 253


>gi|414866628|tpg|DAA45185.1| TPA: hypothetical protein ZEAMMB73_313698 [Zea mays]
          Length = 588

 Score =  459 bits (1182), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 218/252 (86%), Positives = 232/252 (92%)

Query: 155 QEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSI 214
           +EEKAI RGILPLLQRATLVQTFGQ+NHVCL  GSITIPP+APPQKMQAH IP DTPRSI
Sbjct: 337 KEEKAIGRGILPLLQRATLVQTFGQKNHVCLKGGSITIPPFAPPQKMQAHLIPVDTPRSI 396

Query: 215 FVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLC 274
           FVYFRGLFYD +NDPEGGYYARGA A+VWENFKNNPLFDISTDHP TYYEDMQR++FCLC
Sbjct: 397 FVYFRGLFYDTSNDPEGGYYARGACASVWENFKNNPLFDISTDHPPTYYEDMQRSVFCLC 456

Query: 275 PLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSI 334
           PLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG+FVAEEDVPKLD+IL SI
Sbjct: 457 PLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVAEEDVPKLDSILMSI 516

Query: 335 PPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLN 394
           P +VILRKQRLLANPSMK+AMLFPQPAQ GDAFHQILNGLARKL H    +LKPG + L 
Sbjct: 517 PTDVILRKQRLLANPSMKQAMLFPQPAQAGDAFHQILNGLARKLLHGDGVFLKPGERVLK 576

Query: 395 WTSGPVGDLKPW 406
           WT+GP GDLKPW
Sbjct: 577 WTAGPPGDLKPW 588


>gi|289166870|gb|ADC84485.1| glycosyltransferase family 47A [Salix sachalinensis]
          Length = 215

 Score =  436 bits (1121), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 207/215 (96%), Positives = 212/215 (98%)

Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
           FVVPHDFGACFHYQEEKA+ERGILPLLQR+TLVQTFGQRNHVCLNEGSI IPP+APPQKM
Sbjct: 1   FVVPHDFGACFHYQEEKAVERGILPLLQRSTLVQTFGQRNHVCLNEGSIAIPPFAPPQKM 60

Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
           QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT
Sbjct: 61  QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 120

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCI VIIADDIVLPFADAIPWEEIG+FVAE
Sbjct: 121 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCISVIIADDIVLPFADAIPWEEIGVFVAE 180

Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAML 356
           EDVP LDTILT+IPPEVILRKQRLLANPSMKRAML
Sbjct: 181 EDVPNLDTILTTIPPEVILRKQRLLANPSMKRAML 215


>gi|289166872|gb|ADC84486.1| glycosyltransferase family 47A [Salix miyabeana]
          Length = 215

 Score =  416 bits (1069), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 197/215 (91%), Positives = 206/215 (95%)

Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
           FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCL +GSITIPPYAPPQKM
Sbjct: 1   FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLKDGSITIPPYAPPQKM 60

Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
           Q H IP  TPRSIFVYFRGLFYDV NDPEGGYYARGARAAVWENFK+NPLFDIST+HP T
Sbjct: 61  QTHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENFKDNPLFDISTEHPAT 120

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           YYEDMQRA+FCLCPLGWAPWSPRLVEAV+FGCIPVIIADDIVLPFADAIPWE+IG++V E
Sbjct: 121 YYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPFADAIPWEDIGVYVDE 180

Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAML 356
           EDVP LDTILTSIPPEVILRKQRLLANPSMK+AML
Sbjct: 181 EDVPNLDTILTSIPPEVILRKQRLLANPSMKQAML 215


>gi|414866629|tpg|DAA45186.1| TPA: hypothetical protein ZEAMMB73_313698 [Zea mays]
          Length = 206

 Score =  373 bits (957), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 178/206 (86%), Positives = 189/206 (91%)

Query: 201 MQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPT 260
           MQAH IP DTPRSIFVYFRGLFYD +NDPEGGYYARGA A+VWENFKNNPLFDISTDHP 
Sbjct: 1   MQAHLIPVDTPRSIFVYFRGLFYDTSNDPEGGYYARGACASVWENFKNNPLFDISTDHPP 60

Query: 261 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
           TYYEDMQR++FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG+FVA
Sbjct: 61  TYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVA 120

Query: 321 EEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPH 380
           EEDVPKLD+IL SIP +VILRKQRLLANPSMK+AMLFPQPAQ GDAFHQILNGLARKL H
Sbjct: 121 EEDVPKLDSILMSIPTDVILRKQRLLANPSMKQAMLFPQPAQAGDAFHQILNGLARKLLH 180

Query: 381 DKSTYLKPGGKFLNWTSGPVGDLKPW 406
               +LKPG + L WT+GP GDLKPW
Sbjct: 181 GDGVFLKPGERVLKWTAGPPGDLKPW 206


>gi|224093354|ref|XP_002334839.1| predicted protein [Populus trichocarpa]
 gi|222875141|gb|EEF12272.1| predicted protein [Populus trichocarpa]
          Length = 175

 Score =  328 bits (841), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 153/171 (89%), Positives = 164/171 (95%)

Query: 236 RGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIP 295
           RGARAAVWENFK+NPLFDIST+HP TYYEDMQRA+FCLCPLGWAPWSPRLVEAV+FGCIP
Sbjct: 5   RGARAAVWENFKDNPLFDISTEHPATYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGCIP 64

Query: 296 VIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAM 355
           VIIADDIVLPFADAIPWEEIG++V EEDVP LDTILTSIPPEVILRKQRLLANPSMK+AM
Sbjct: 65  VIIADDIVLPFADAIPWEEIGVYVDEEDVPNLDTILTSIPPEVILRKQRLLANPSMKQAM 124

Query: 356 LFPQPAQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           LFPQPAQPGDAFHQ+LNGLARKLPHD+S YL+P  K LNWT+GPVGDLKPW
Sbjct: 125 LFPQPAQPGDAFHQVLNGLARKLPHDRSVYLRPSEKILNWTAGPVGDLKPW 175


>gi|224106838|ref|XP_002314302.1| predicted protein [Populus trichocarpa]
 gi|222850710|gb|EEE88257.1| predicted protein [Populus trichocarpa]
          Length = 462

 Score =  315 bits (808), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 160/384 (41%), Positives = 233/384 (60%), Gaps = 11/384 (2%)

Query: 11  AFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAE 70
           A   + N+  I++       +L+D     LKVY+YELPSKYN   L  + RC  H+FA+E
Sbjct: 84  ALFESSNSTFIQQSKNINRGLLKD-----LKVYIYELPSKYNTDWL-ANERCSNHLFASE 137

Query: 71  IFMHRFLLSS-PVRTLNPEEADWFYTPIYPTCDL-TPTGLPLPFKSPRMMRSAIQLISSN 128
           + +H+ L +S  +RT +P EAD+F+ P+Y +C+  T  G P    +  ++ SA+QLISSN
Sbjct: 138 VAIHKALSNSLDIRTFDPYEADFFFVPVYVSCNFSTVNGFPAIGHARSLLSSAVQLISSN 197

Query: 129 WPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR-NHVCLNE 187
           +P+WNR++G+DH FV  HD+GACFH  EE+A+E GI   L+R+ ++QTFG + NH C + 
Sbjct: 198 YPFWNRSQGSDHVFVASHDYGACFHAMEERAMEDGIPEFLKRSIILQTFGVKFNHPCQDV 257

Query: 188 GSITIPPYAPPQKMQA--HQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWEN 245
            ++ IPPY  P++++      P +  R I+ +FRG       +  G YY++  R  +W  
Sbjct: 258 ENVVIPPYISPERVRTTLENYPLNGRRDIWAFFRGKMEVHPKNISGRYYSKKVRTVIWRK 317

Query: 246 FKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
           +  +  F +       Y  ++ R++FCLCPLGWAPWSPRLVE+V  GC+PVIIAD I LP
Sbjct: 318 YSGDRRFYLQRHRFAGYQSEIVRSVFCLCPLGWAPWSPRLVESVALGCVPVIIADGIRLP 377

Query: 306 FADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
           F  A+ W EI + VAE+DV  L T+L  +    +   Q+ L +P ++RA+LF    Q GD
Sbjct: 378 FPTAVRWSEISLTVAEKDVANLGTLLDHVAATNLSAIQKNLWDPDVRRALLFNDRVQEGD 437

Query: 366 AFHQILNGLARKLPHDKSTYLKPG 389
           A  Q+L  LARKL     T   P 
Sbjct: 438 ATWQVLYALARKLDRSYRTVRLPN 461


>gi|42568020|ref|NP_197685.2| FRA8-like protein [Arabidopsis thaliana]
 gi|75127070|sp|Q6NMM8.1|F8H_ARATH RecName: Full=Probable glucuronoxylan glucuronosyltransferase F8H;
           AltName: Full=FRA8 homolog; AltName: Full=Protein
           FRAGILE FIBER 8 homolog
 gi|44681390|gb|AAS47635.1| At5g22940 [Arabidopsis thaliana]
 gi|48958521|gb|AAT47813.1| At5g22940 [Arabidopsis thaliana]
 gi|332005716|gb|AED93099.1| FRA8-like protein [Arabidopsis thaliana]
          Length = 469

 Score =  315 bits (807), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 157/388 (40%), Positives = 231/388 (59%), Gaps = 6/388 (1%)

Query: 4   TAIALLLAFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCL 63
           ++I LL    ++ + P    I  SA           +K+YVY+LP+ YN   +    RC 
Sbjct: 73  SSIRLLSNHKTSSSLPSRALIESSAIKTTSIGLFTGMKIYVYDLPASYNDDWVTASDRCA 132

Query: 64  THMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDL-TPTGLPLPFKSPRMMRSAI 122
           +H+FAAE+ +HR LLSS VRTL+P+EAD+F+ P+Y +C+  T  G P    +  ++ SA+
Sbjct: 133 SHLFAAEVAIHRALLSSDVRTLDPDEADYFFVPVYVSCNFSTSNGFPSLSHARSLLSSAV 192

Query: 123 QLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR-N 181
             +S ++P+WNR++G+DH FV  HDFGACFH  E+ AIE GI   ++R+ ++QTFG +  
Sbjct: 193 DFLSDHYPFWNRSQGSDHVFVASHDFGACFHAMEDMAIEEGIPKFMKRSIILQTFGVKYK 252

Query: 182 HVCLNEGSITIPPYAPPQKMQA--HQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGAR 239
           H C     + IPPY PP+ +Q    + P +  R I+ +FRG       +  G +Y++G R
Sbjct: 253 HPCQEVEHVVIPPYIPPESVQKAIEKAPVNGRRDIWAFFRGKMEVNPKNISGRFYSKGVR 312

Query: 240 AAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIA 299
            A+ + F     F ++      Y  ++ R++FCLCPLGWAPWSPRLVE+ V GC+PV+IA
Sbjct: 313 TAILKKFGGRRRFYLNRHRFAGYRSEIVRSVFCLCPLGWAPWSPRLVESAVLGCVPVVIA 372

Query: 300 DDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQ 359
           D I LPF++ + W EI + VAE+DV  L  +L  +    +   QR L  P  KRA+L+  
Sbjct: 373 DGIQLPFSETVQWPEISLTVAEKDVRNLRKVLEHVAATNLSAIQRNLHEPVFKRALLYNV 432

Query: 360 PAQPGDAFHQILNGLARKLPHDKSTYLK 387
           P + GDA   IL  L RKL  D  +Y +
Sbjct: 433 PMKEGDATWHILESLWRKL--DDRSYRR 458


>gi|60657602|gb|AAX33322.1| secondary cell wall-related glycosyltransferase family 47 [Populus
           tremula x Populus tremuloides]
          Length = 442

 Score =  315 bits (807), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 161/379 (42%), Positives = 231/379 (60%), Gaps = 11/379 (2%)

Query: 11  AFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAE 70
           A   + N+  I++       +L+D     LKVY+YELPSKYN   L  + RC  H+FA+E
Sbjct: 64  ALFESSNSTFIQQSRNINRGLLKD-----LKVYIYELPSKYNTDWL-ANERCSNHLFASE 117

Query: 71  IFMHRFLLSS-PVRTLNPEEADWFYTPIYPTCDL-TPTGLPLPFKSPRMMRSAIQLISSN 128
           + +H+ L SS  VRT +P EAD+F+ P+Y +C+  T  G P    +  ++ SA+QLISSN
Sbjct: 118 VAIHKALSSSLDVRTFDPYEADFFFVPVYVSCNFSTVNGFPAIGHARSLLSSAVQLISSN 177

Query: 129 WPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR-NHVCLNE 187
           +P+WNR++G+DH FV  HD+GACFH  EE+A+E GI   L+R+ ++QTFG + NH C + 
Sbjct: 178 YPFWNRSQGSDHVFVASHDYGACFHAMEERAMEDGIPEFLKRSIILQTFGVKFNHPCQDV 237

Query: 188 GSITIPPYAPPQKMQA--HQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWEN 245
            ++ IPPY  P  ++    + P    R I+ +FRG       +  G YY++  R  +W  
Sbjct: 238 ENVVIPPYISPGSVRTTLEKYPLTGRRDIWAFFRGKMEVHPKNISGRYYSKKVRTVIWRK 297

Query: 246 FKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
           +  +  F +       Y  ++ R++FCLCPLGWAPWSPRLVE+V  GC+PVIIAD I LP
Sbjct: 298 YSGDRRFYLQRHRFAGYQSEIVRSVFCLCPLGWAPWSPRLVESVALGCVPVIIADGIRLP 357

Query: 306 FADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
           F  A+ W EI + VAE+DV  L T+L  +    +   Q+ L +P ++RA+LF  P Q GD
Sbjct: 358 FPTAVRWSEISLTVAEKDVANLGTLLDQVAATNLSAIQKNLWDPDVRRALLFNDPVQGGD 417

Query: 366 AFHQILNGLARKLPHDKST 384
           A  Q+L  LA+KL     T
Sbjct: 418 ATWQVLYALAQKLDRSYRT 436


>gi|297808283|ref|XP_002872025.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297317862|gb|EFH48284.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 465

 Score =  315 bits (807), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 159/388 (40%), Positives = 231/388 (59%), Gaps = 6/388 (1%)

Query: 4   TAIALLLAFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCL 63
           ++I LL    ++ + P    I  SA           +K+YVY+LP+ +N   +    RC 
Sbjct: 74  SSIRLLSNHKTSSSLPSRALIESSAIKTNSFGLFSGMKIYVYDLPASFNDDWVTASDRCA 133

Query: 64  THMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDL-TPTGLPLPFKSPRMMRSAI 122
           TH+FAAE+ +HR LLSS VRTL+PEEAD+F+ P+Y +C+  T  G P    +  ++ SA+
Sbjct: 134 THLFAAEVAIHRALLSSDVRTLDPEEADFFFVPVYVSCNFSTSNGFPSLSHARSLLSSAV 193

Query: 123 QLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR-N 181
             +S ++P+WNRT+G+DH FV  HDFGACFH  E+ AIE GI   ++++ ++QTFG +  
Sbjct: 194 DFLSDHYPFWNRTQGSDHVFVASHDFGACFHAMEDMAIEEGIPEFMKKSIILQTFGVKYK 253

Query: 182 HVCLNEGSITIPPYAPPQKMQA--HQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGAR 239
           H C     + IPPY PP+ +Q    + P +  R I+ +FRG       +  G +Y++G R
Sbjct: 254 HPCQEVEHVVIPPYIPPESVQRAIEKAPANGRRDIWAFFRGKMEVNPKNISGRFYSKGVR 313

Query: 240 AAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIA 299
            A+ + F     F ++      Y  ++ R++FCLCPLGWAPWSPRLVE+ V GC+PV+IA
Sbjct: 314 TAILKKFGGRRRFYLNRHRFAGYRSEIVRSVFCLCPLGWAPWSPRLVESAVLGCVPVVIA 373

Query: 300 DDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQ 359
           D I LPF++ + W EI + VAE+DV  L  IL  +    +   QR L  P  KRA+L+  
Sbjct: 374 DGIKLPFSETVRWPEISLTVAEKDVRSLRKILEHVAATNLSVIQRNLHGPVFKRALLYNV 433

Query: 360 PAQPGDAFHQILNGLARKLPHDKSTYLK 387
           P + GDA   IL  L RKL  D  +Y +
Sbjct: 434 PMKEGDATWHILESLWRKL--DDRSYRR 459


>gi|297721259|ref|NP_001172992.1| Os02g0520750 [Oryza sativa Japonica Group]
 gi|255670949|dbj|BAH91721.1| Os02g0520750 [Oryza sativa Japonica Group]
          Length = 213

 Score =  314 bits (804), Expect = 6e-83,   Method: Compositional matrix adjust.
 Identities = 148/171 (86%), Positives = 160/171 (93%)

Query: 236 RGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIP 295
           RGARA++WENFKNNPLFDIST+HP TYYEDMQR++FCLCPLGWAPWSPRLVEAVVFGCIP
Sbjct: 42  RGARASLWENFKNNPLFDISTEHPATYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIP 101

Query: 296 VIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAM 355
           VIIADDIVLPFADAIPW+EIG+FV EEDVP+LD+ILTSIP + ILRKQRLLANPSMK+AM
Sbjct: 102 VIIADDIVLPFADAIPWDEIGVFVDEEDVPRLDSILTSIPIDDILRKQRLLANPSMKQAM 161

Query: 356 LFPQPAQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           LFPQPAQP DAFHQILNGLARKLPH  S YLKPG K LNWT+GPV DLKPW
Sbjct: 162 LFPQPAQPRDAFHQILNGLARKLPHPDSVYLKPGEKHLNWTAGPVADLKPW 212


>gi|114325715|gb|ABI64067.1| glycosyltransferase GT47C [Populus tremula x Populus alba]
          Length = 442

 Score =  313 bits (802), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 161/379 (42%), Positives = 232/379 (61%), Gaps = 11/379 (2%)

Query: 11  AFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAE 70
           A   + N+  I++       +L+D     LKVY+YELPSKYN   L ++ RC  H+FA+E
Sbjct: 64  ALFESSNSTFIQQSRNINRGLLKD-----LKVYIYELPSKYNTDWL-ENERCSNHLFASE 117

Query: 71  IFMHRFLLSS-PVRTLNPEEADWFYTPIYPTCDL-TPTGLPLPFKSPRMMRSAIQLISSN 128
           + +H+ L SS  VRT +P EAD+F+ P+Y +C+  T  G P    +  ++ SA+QLISSN
Sbjct: 118 VAIHKALSSSLDVRTFDPYEADFFFVPVYVSCNFSTVNGFPAIGHARSLLSSAVQLISSN 177

Query: 129 WPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR-NHVCLNE 187
           +P+WNR++G+DH FV  HD+GACFH  EE+A+E GI   L+R+ ++QTFG + NH C + 
Sbjct: 178 YPFWNRSQGSDHVFVASHDYGACFHAMEERAMEDGIPEFLKRSIILQTFGVKFNHPCQDV 237

Query: 188 GSITIPPYAPPQKMQA--HQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWEN 245
            ++ IPPY  P  ++A   + P    R I+ +FRG       +  G YY++  R  +   
Sbjct: 238 ENVVIPPYISPGSVRATLEKYPLTGRRDIWAFFRGKMEVHPKNISGRYYSKKVRTVILRK 297

Query: 246 FKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
           +  +  F +       Y  ++ R++FCLCPLGWAPWSPRLVE+V  GC+PVIIAD I LP
Sbjct: 298 YSGDRRFYLQRHRFAGYQSEIVRSVFCLCPLGWAPWSPRLVESVALGCVPVIIADGIRLP 357

Query: 306 FADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
           F  A+ W EI + VAE+DV  L T+L  +    +   Q+ L +P ++RA+LF  P Q GD
Sbjct: 358 FPTAVRWSEISLTVAEKDVANLGTLLDQVAATNLSAIQKNLWDPDVRRALLFNDPVQGGD 417

Query: 366 AFHQILNGLARKLPHDKST 384
           A  Q+L  LA+KL     T
Sbjct: 418 ATWQVLYALAQKLDRSYRT 436


>gi|10177241|dbj|BAB10615.1| unnamed protein product [Arabidopsis thaliana]
          Length = 498

 Score =  313 bits (801), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 150/352 (42%), Positives = 219/352 (62%), Gaps = 6/352 (1%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
           +K+YVY+LP+ YN   +    RC +H+FAAE+ +HR LLSS VRTL+P+EAD+F+ P+Y 
Sbjct: 138 MKIYVYDLPASYNDDWVTASDRCASHLFAAEVAIHRALLSSDVRTLDPDEADYFFVPVYV 197

Query: 100 TCDL-TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
           +C+  T  G P    +  ++ SA+  +S ++P+WNR++G+DH FV  HDFGACFH  E+ 
Sbjct: 198 SCNFSTSNGFPSLSHARSLLSSAVDFLSDHYPFWNRSQGSDHVFVASHDFGACFHAMEDM 257

Query: 159 AIERGILPLLQRATLVQTFGQR-NHVCLNEGSITIPPYAPPQKMQA--HQIPPDTPRSIF 215
           AIE GI   ++R+ ++QTFG +  H C     + IPPY PP+ +Q    + P +  R I+
Sbjct: 258 AIEEGIPKFMKRSIILQTFGVKYKHPCQEVEHVVIPPYIPPESVQKAIEKAPVNGRRDIW 317

Query: 216 VYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCP 275
            +FRG       +  G +Y++G R A+ + F     F ++      Y  ++ R++FCLCP
Sbjct: 318 AFFRGKMEVNPKNISGRFYSKGVRTAILKKFGGRRRFYLNRHRFAGYRSEIVRSVFCLCP 377

Query: 276 LGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIP 335
           LGWAPWSPRLVE+ V GC+PV+IAD I LPF++ + W EI + VAE+DV  L  +L  + 
Sbjct: 378 LGWAPWSPRLVESAVLGCVPVVIADGIQLPFSETVQWPEISLTVAEKDVRNLRKVLEHVA 437

Query: 336 PEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDKSTYLK 387
              +   QR L  P  KRA+L+  P + GDA   IL  L RKL  D  +Y +
Sbjct: 438 ATNLSAIQRNLHEPVFKRALLYNVPMKEGDATWHILESLWRKL--DDRSYRR 487


>gi|225436482|ref|XP_002275679.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase IRX7
           [Vitis vinifera]
 gi|297734915|emb|CBI17149.3| unnamed protein product [Vitis vinifera]
          Length = 460

 Score =  308 bits (790), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 156/350 (44%), Positives = 218/350 (62%), Gaps = 5/350 (1%)

Query: 33  EDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADW 92
             D +  LKVY+Y+LPSKYN   L  + RC  H+FA+E+ +H+ L  S VRT +P EAD+
Sbjct: 92  HQDLLKDLKVYIYDLPSKYNVDWL-SNERCSNHLFASEVALHKALQESDVRTFDPWEADF 150

Query: 93  FYTPIYPTCDL-TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGAC 151
           F+ P+Y +C+  T  G P    +  ++ SAIQ IS+  P+WNR+ GADH FV  HD+GAC
Sbjct: 151 FFVPVYVSCNFSTVNGFPAIGHARPLLASAIQHISTQLPFWNRSLGADHVFVASHDYGAC 210

Query: 152 FHYQEEKAIERGILPLLQRATLVQTFG-QRNHVCLNEGSITIPPYAPPQKMQA--HQIPP 208
           FH  E+ A   GI   L+++ ++QTFG +  H C +  ++ IPPY  P+K+Q+     P 
Sbjct: 211 FHAMEDVARADGIPEFLKKSIILQTFGVKHQHPCQDVENVLIPPYVSPEKVQSTLDSAPA 270

Query: 209 DTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQR 268
           +  R I+V+FRG       +  G +Y++  R A+W+ +  N  F +       Y  ++ R
Sbjct: 271 NGQRDIWVFFRGKMEVHPKNISGRFYSKAVRTAIWQKYGGNRKFYLKRHRFAGYQSEIVR 330

Query: 269 AIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLD 328
           ++FCLCPLGWAPWSPRLVE+VV GC+PVIIAD I LPF++AI W EI + VAE+DV KL 
Sbjct: 331 SVFCLCPLGWAPWSPRLVESVVLGCVPVIIADGIRLPFSEAIRWPEISLTVAEKDVGKLG 390

Query: 329 TILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKL 378
            IL  +    +   Q+ L +P  KRA+LF    Q GDA  Q+LN L  KL
Sbjct: 391 MILEDVAATNLSTIQKNLWDPENKRALLFNNQVQEGDATWQVLNALWHKL 440


>gi|302786830|ref|XP_002975186.1| glycosyltransferase-like protein [Selaginella moellendorffii]
 gi|300157345|gb|EFJ23971.1| glycosyltransferase-like protein [Selaginella moellendorffii]
          Length = 405

 Score =  308 bits (789), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 150/342 (43%), Positives = 216/342 (63%), Gaps = 5/342 (1%)

Query: 38  GKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPI 97
             +KVY+Y+LPS YN   L+ D RC +H+FAAE+ +H+ LL SPVRTL+P+EAD+F+ P+
Sbjct: 53  SNIKVYIYDLPSSYNTDWLV-DSRCSSHLFAAEVAIHQNLLRSPVRTLDPDEADFFFMPV 111

Query: 98  YPTCDLTP-TGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQE 156
           Y +C+ T  +G P  F +  ++++A+ L+S N P+W+R +G DH FV  HDFGACFH  E
Sbjct: 112 YVSCNFTSRSGFPTLFHASDILQAAVGLVSRNMPFWDRHQGRDHVFVATHDFGACFHAME 171

Query: 157 EKAIERGILPLLQRATLVQTFGQRN-HVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIF 215
           + A+  GI   L+ + ++QTFG++N H C N   I IPPY  P K      P    R I 
Sbjct: 172 DLAVTMGIPQFLRNSIILQTFGEKNKHPCQNVDHIQIPPYVVPAKKLPD--PRSQRRKIL 229

Query: 216 VYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCP 275
            +FRG       +  G  Y+RG R  +W  F ++  F I       Y  +M R++FCLCP
Sbjct: 230 AFFRGKMEIHPKNVSGHMYSRGVRTTIWRRFSHDRRFFIKRKRSDNYKAEMLRSVFCLCP 289

Query: 276 LGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIP 335
           LGWAPWSPR+VE+V+ GCIPVIIAD+I LP++  I W +I + VAE DV KLD IL+ + 
Sbjct: 290 LGWAPWSPRIVESVIQGCIPVIIADNIQLPYSHVIDWRKISVTVAERDVHKLDRILSRVA 349

Query: 336 PEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARK 377
              +   Q  L    +++A+++ QP   GDA  Q+L+ L+++
Sbjct: 350 ATNVSMIQANLWRDEVRQALVYNQPLVRGDATWQVLDLLSKR 391


>gi|302791649|ref|XP_002977591.1| glycosyltransferase-like protein [Selaginella moellendorffii]
 gi|300154961|gb|EFJ21595.1| glycosyltransferase-like protein [Selaginella moellendorffii]
          Length = 345

 Score =  308 bits (788), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 150/342 (43%), Positives = 216/342 (63%), Gaps = 5/342 (1%)

Query: 38  GKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPI 97
             +KVY+Y+LPS YN   L+ D RC +H+FAAE+ +H+ LL SPVRTL+P+EAD+F+ P+
Sbjct: 4   SNIKVYIYDLPSSYNTDWLV-DSRCSSHLFAAEVAIHQNLLRSPVRTLDPDEADFFFMPV 62

Query: 98  YPTCDLTP-TGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQE 156
           Y +C+ T  +G P  F +  ++++A+ L+S N P+W+R +G DH FV  HDFGACFH  E
Sbjct: 63  YVSCNFTSRSGFPTLFHASDILQAAVGLVSRNMPFWDRHQGRDHVFVATHDFGACFHAME 122

Query: 157 EKAIERGILPLLQRATLVQTFGQRN-HVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIF 215
           + A+  GI   L+ + ++QTFG++N H C N   I IPPY  P K      P    R I 
Sbjct: 123 DLAVAMGIPQFLRNSIILQTFGEKNKHPCQNVDHIQIPPYVVPAKKLPD--PRGQRRKIL 180

Query: 216 VYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCP 275
            +FRG       +  G  Y+RG R  +W  F ++  F I       Y  +M R++FCLCP
Sbjct: 181 AFFRGKMEIHPKNVSGHMYSRGVRTTIWRRFSHDRRFFIKRKRSDNYKAEMLRSVFCLCP 240

Query: 276 LGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIP 335
           LGWAPWSPR+VE+V+ GCIPVIIAD+I LP++  I W +I + VAE DV KLD IL+ + 
Sbjct: 241 LGWAPWSPRIVESVIQGCIPVIIADNIQLPYSHVIDWRKISVTVAERDVHKLDRILSKVA 300

Query: 336 PEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARK 377
              +   Q  L    +++A+++ QP   GDA  Q+L+ L+++
Sbjct: 301 ATNVSMIQANLWRDEVRQALVYNQPLVRGDATWQVLDLLSKR 342


>gi|357472125|ref|XP_003606347.1| Secondary cell wall-related glycosyltransferase family [Medicago
           truncatula]
 gi|355507402|gb|AES88544.1| Secondary cell wall-related glycosyltransferase family [Medicago
           truncatula]
          Length = 427

 Score =  304 bits (779), Expect = 5e-80,   Method: Compositional matrix adjust.
 Identities = 146/358 (40%), Positives = 224/358 (62%), Gaps = 5/358 (1%)

Query: 25  SGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRT 84
           + S+   L+   +  LK++VY+LP KYNK  L K+PRC TH+FA+E+ +HR LL+S VRT
Sbjct: 54  TTSSPQTLQKTSLKNLKIFVYDLPPKYNKNWL-KNPRCKTHLFASEVAIHRALLTSDVRT 112

Query: 85  LNPEEADWFYTPIYPTCDL-TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFV 143
            +P EAD+F+ P+Y +C+  T  G P    +  ++ SA++LIS+ +P+WNR+ G+DH FV
Sbjct: 113 FDPYEADFFFVPVYVSCNFSTVNGFPAIGHARSLISSAVKLISTEYPFWNRSTGSDHVFV 172

Query: 144 VPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG-QRNHVCLNEGSITIPPYAPPQKMQ 202
             HDFG+CFH  E+ A++ G+  +++ + ++QTFG   +H C     + IPP+  P+ ++
Sbjct: 173 ASHDFGSCFHTLEDVAMKDGVPEIMKNSIVLQTFGVTYDHPCQKVEHVVIPPFVSPESVR 232

Query: 203 A--HQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPT 260
                 P +  R I+V+FRG       +  G +Y++  R  +W+ F  +  F +      
Sbjct: 233 NTLENFPVNGRRDIWVFFRGKMEVHPKNVSGRFYSKKVRTVIWKKFNGDRRFYLRRHRFA 292

Query: 261 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
            Y  ++ R++FCLCPLGWAPWSPRLVE+V  GC+PVIIAD I LPF+ A+ W EI + VA
Sbjct: 293 GYQSEIARSVFCLCPLGWAPWSPRLVESVALGCVPVIIADSIRLPFSSAVNWPEISVTVA 352

Query: 321 EEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKL 378
           E+DV +L  IL  +    +   QR L +P  ++A+LF      GDA  Q+L+ L+ K+
Sbjct: 353 EKDVWRLGEILEKVAATNLSIIQRNLWDPRTRKALLFNSRVHEGDATWQVLHSLSEKV 410


>gi|42570324|ref|NP_850113.2| exostosin-like protein [Arabidopsis thaliana]
 gi|75216857|sp|Q9ZUV3.1|IRX7_ARATH RecName: Full=Probable glucuronoxylan glucuronosyltransferase IRX7;
           AltName: Full=Protein FRAGILE FIBER 8; AltName:
           Full=Protein IRREGULAR XYLEM 7
 gi|4063747|gb|AAC98455.1| hypothetical protein [Arabidopsis thaliana]
 gi|77022037|gb|ABA60868.1| putative glucuronyltransferase [Arabidopsis thaliana]
 gi|77022039|gb|ABA60869.1| putative glucuronyltransferase [Arabidopsis thaliana]
 gi|330252987|gb|AEC08081.1| exostosin-like protein [Arabidopsis thaliana]
          Length = 448

 Score =  303 bits (775), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 149/353 (42%), Positives = 217/353 (61%), Gaps = 7/353 (1%)

Query: 32  LEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLS--SPVRTLNPEE 89
           ++ D    LK+YVY+LPSK+NK  L  D RC  H+FAAE+ +H+  LS    VRT +P E
Sbjct: 86  IKTDVFNNLKIYVYDLPSKFNKDWLAND-RCTNHLFAAEVALHKAFLSLEGDVRTEDPYE 144

Query: 90  ADWFYTPIYPTCDL-TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDF 148
           AD+F+ P+Y +C+  T  G P    +  ++  AI+L+S+ +P+WNRT G+DH F   HDF
Sbjct: 145 ADFFFVPVYVSCNFSTINGFPAIGHARSLINDAIKLVSTQYPFWNRTSGSDHVFTATHDF 204

Query: 149 GACFHYQEEKAIERGILPLLQRATLVQTFGQR-NHVCLNEGSITIPPYAPPQKMQAHQ-- 205
           G+CFH  E++AI  G+   L+ + ++QTFG   NH C    ++ IPPY  P+ +   Q  
Sbjct: 205 GSCFHTMEDRAIADGVPIFLRNSIILQTFGVTFNHPCQEVENVVIPPYISPESLHKTQKN 264

Query: 206 IPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYED 265
           IP    R I+V+FRG       +  G +Y++  R  +W ++  +  F +       Y  +
Sbjct: 265 IPVTKERDIWVFFRGKMELHPKNISGRFYSKRVRTNIWRSYGGDRRFYLQRQRFAGYQSE 324

Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
           + R++FCLCPLGWAPWSPRLVE+V  GC+PVIIAD I LPF   + W +I + VAE DV 
Sbjct: 325 IARSVFCLCPLGWAPWSPRLVESVALGCVPVIIADGIRLPFPSTVRWPDISLTVAERDVG 384

Query: 326 KLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKL 378
           KL  IL  +    +   QR L +PS++RA++F  P++ GDA  Q+L  L++KL
Sbjct: 385 KLGDILEHVAATNLSVIQRNLEDPSVRRALMFNVPSREGDATWQVLEALSKKL 437


>gi|255565439|ref|XP_002523710.1| transferase, putative [Ricinus communis]
 gi|223537014|gb|EEF38650.1| transferase, putative [Ricinus communis]
          Length = 461

 Score =  301 bits (771), Expect = 4e-79,   Method: Compositional matrix adjust.
 Identities = 144/350 (41%), Positives = 220/350 (62%), Gaps = 6/350 (1%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP-VRTLNPEEADWFYTPIY 98
           LK+Y+YELPSKYN+  L  + RC  H+FA+E+ +H+ + +S  +RT +P EAD+F+ P+Y
Sbjct: 107 LKIYIYELPSKYNRDWL-SNKRCSNHLFASEVAIHKAISNSDDIRTFDPYEADFFFVPVY 165

Query: 99  PTCDL-TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEE 157
            +C+  T  G P    +  ++ SA+  IS+N+P+WNR++GADH FV  HDFG+CFH  EE
Sbjct: 166 VSCNFSTINGFPAIGHARSLLSSAVTFISTNYPFWNRSQGADHVFVASHDFGSCFHTLEE 225

Query: 158 KAIERGILPLLQRATLVQTFGQR-NHVCLNEGSITIPPYAPPQKMQA--HQIPPDTPRSI 214
           +A++ G+   L+++ ++QTFG + +H C    ++ IPPY  P  +++   + P    R I
Sbjct: 226 RAMQDGVPEFLKKSIILQTFGVKYDHPCQQVENVVIPPYISPVSVRSTLKKAPLTGRRDI 285

Query: 215 FVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLC 274
           +V+FRG       +  G +Y++  R  +W  F  +  F +       Y  ++ R++FCLC
Sbjct: 286 WVFFRGKMEVHPKNVSGRFYSKKVRTEIWRRFNGDRRFYLQRHRFAGYQSEIARSVFCLC 345

Query: 275 PLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSI 334
           PLGWAPWSPRLVE+V  GC+PVIIAD I LPF  A+PW  I + VAE+DV KL  IL  +
Sbjct: 346 PLGWAPWSPRLVESVALGCVPVIIADGIRLPFPSAVPWPAISLTVAEKDVAKLGRILEDV 405

Query: 335 PPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDKST 384
               +   Q+ + +P+++RA+LF    + GDA  Q+L  L +KL   + T
Sbjct: 406 AATNLTLIQKNIWDPTVRRALLFNDQIEEGDATWQVLYALTKKLDRSRRT 455


>gi|356539378|ref|XP_003538175.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
           IRX7-like [Glycine max]
          Length = 460

 Score =  298 bits (763), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 145/348 (41%), Positives = 213/348 (61%), Gaps = 5/348 (1%)

Query: 35  DPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFY 94
           + + KLKV+VY+LP KYN   L  + RC  H+FA+E+ +HR LL+S VRT +P +AD+F+
Sbjct: 97  ESLKKLKVFVYDLPPKYNTDWLTNE-RCSKHLFASEVAIHRALLTSEVRTFDPYDADFFF 155

Query: 95  TPIYPTCDL-TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFH 153
            P+Y +C+  T  G P    +  ++ SA+ LISS +P+WNR+ G+DH FV  HDFG+CFH
Sbjct: 156 VPVYVSCNFSTVNGFPAIGHARSLIASAVNLISSEYPFWNRSRGSDHVFVASHDFGSCFH 215

Query: 154 YQEEKAIERGILPLLQRATLVQTFGQR-NHVCLNEGSITIPPYAPPQKMQ--AHQIPPDT 210
             E+ A+  G+  +++ + ++QTFG   +H C     + IPPY  P+ ++      P D 
Sbjct: 216 TLEDVAMADGVPEIVRNSIVLQTFGVVFDHPCQKVEHVVIPPYVSPESVRDTMENFPVDG 275

Query: 211 PRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAI 270
            R I+ +FRG       +  G +Y++  R  +W  F  +  F +       Y  ++ R++
Sbjct: 276 RRDIWAFFRGKMEVHPKNVSGRFYSKEVRTVIWRKFNGDRRFYLQRHRFAGYQSEIARSV 335

Query: 271 FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTI 330
           FCLCPLGWAPWSPRLVE+V  GC+PV+IAD I LPF  A+ W EI + VAE+DV +L  I
Sbjct: 336 FCLCPLGWAPWSPRLVESVALGCVPVVIADGIRLPFVSAVKWSEISVTVAEKDVGRLAEI 395

Query: 331 LTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKL 378
           L  +    +   QR L +P+ +RA+LF    Q GDA  Q+L  L+ KL
Sbjct: 396 LERVAATNLSTIQRNLWDPATRRALLFNSQVQVGDATWQVLRALSEKL 443


>gi|297822463|ref|XP_002879114.1| hypothetical protein ARALYDRAFT_481698 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297324953|gb|EFH55373.1| hypothetical protein ARALYDRAFT_481698 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 452

 Score =  297 bits (760), Expect = 8e-78,   Method: Compositional matrix adjust.
 Identities = 145/353 (41%), Positives = 216/353 (61%), Gaps = 7/353 (1%)

Query: 32  LEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLS--SPVRTLNPEE 89
           +  D    LK+YVY+LPSK+NK  L  D RC  H+FAAE+ +H+  LS    +RT +P E
Sbjct: 90  IRTDVFNNLKIYVYDLPSKFNKDWLAND-RCSNHLFAAEVALHKAFLSLEGDIRTEDPYE 148

Query: 90  ADWFYTPIYPTCDL-TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDF 148
           AD+F+ P+Y +C+  T  G P    +  ++  AI+ +S+ +P+WNR  G+DH F   HDF
Sbjct: 149 ADFFFVPVYVSCNFSTINGFPAIGHARTLINDAIKFVSTQYPFWNRNNGSDHVFTATHDF 208

Query: 149 GACFHYQEEKAIERGILPLLQRATLVQTFGQR-NHVCLNEGSITIPPYAPPQKMQA--HQ 205
           G+CFH  E++AI  G+  +L+ + ++QTFG   NH C    ++ IPPY  P+ +      
Sbjct: 209 GSCFHTMEDRAIADGVPKILRSSIVLQTFGVTFNHPCQEVENVVIPPYISPESLHKTLKN 268

Query: 206 IPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYED 265
           IP +  R I+ +FRG       +  G +Y++  R  +W ++  +  F +     + Y  +
Sbjct: 269 IPVNKERDIWAFFRGKMELHPKNISGRFYSKRVRTKIWRSYGGDRRFYLQRQRFSGYQLE 328

Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
           + R++FCLCPLGWAPWSPRLVE+V  GC+PVIIAD I LPF  A+ W +I + VAE DV 
Sbjct: 329 IARSVFCLCPLGWAPWSPRLVESVALGCVPVIIADGIRLPFPSAVRWPDISLTVAERDVG 388

Query: 326 KLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKL 378
           KL  IL  +    +   QR L +PS++RA++F  P++ GDA  Q+L  L++KL
Sbjct: 389 KLGDILEHVVATNLSVIQRNLEDPSVRRALMFNVPSREGDATWQVLEALSKKL 441


>gi|449459136|ref|XP_004147302.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase F8H-like
           [Cucumis sativus]
          Length = 447

 Score =  296 bits (758), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 151/342 (44%), Positives = 213/342 (62%), Gaps = 6/342 (1%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
           LKV+VY+LP ++N   L  D RC  H+FA+E+ +H+ LL+S VRTL+P EAD+F+ P+Y 
Sbjct: 97  LKVFVYDLPPEFNANWL-SDARCGGHLFASEVAIHKALLTSHVRTLDPSEADFFFVPVYV 155

Query: 100 TCDLTP-TGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
           +C+ +   G P    +P ++ SA+ +IS  +P+WNR+ G DH FV  HD+GACFH  E+ 
Sbjct: 156 SCNFSSFNGFPAIAHAPSLLASAVDVISGQFPFWNRSRGFDHVFVASHDYGACFHSLEDM 215

Query: 159 AIERGILPLLQRATLVQTFGQR-NHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVY 217
           AI  GI   L+ + ++QTFG +  H C +  +I IPPY  P+ M+   +     R IF +
Sbjct: 216 AIANGIPEFLKNSIILQTFGVKYKHPCQDVENILIPPYISPEFMEP-AVVDGRRRDIFAF 274

Query: 218 FRGLFYDVNNDPEGG-YYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPL 276
           FRG   +VN    GG +Y +  R  +W+ F  +  F +       Y  ++ R++FCLCPL
Sbjct: 275 FRGKM-EVNPKNVGGRFYGKRVRTTIWKKFHRDRRFYLRRHRFAGYRSEIARSVFCLCPL 333

Query: 277 GWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPP 336
           GWAPWSPRLVE+V  GC+PVIIAD I LPF  A+ W  I + VAE+DV KL  IL  +  
Sbjct: 334 GWAPWSPRLVESVALGCVPVIIADGIRLPFPSAVDWPGISLTVAEKDVGKLRKILERVAA 393

Query: 337 EVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKL 378
             +   Q+ L +P  +RA+LF  P QP DA  Q+L+ LA KL
Sbjct: 394 TNLTAIQKNLWDPKNRRALLFHNPTQPQDATWQVLSALAEKL 435


>gi|302826405|ref|XP_002994684.1| glycosyltransferase-like protein [Selaginella moellendorffii]
 gi|300137154|gb|EFJ04250.1| glycosyltransferase-like protein [Selaginella moellendorffii]
          Length = 332

 Score =  295 bits (755), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 146/342 (42%), Positives = 212/342 (61%), Gaps = 18/342 (5%)

Query: 38  GKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPI 97
             +KVY+Y+LPS YN   L+ D RC +H+FAAE+ +H+ LL SPVRTL+P+EAD+F+ P+
Sbjct: 4   SNIKVYIYDLPSSYNTDWLV-DSRCSSHLFAAEVAIHQNLLRSPVRTLDPDEADFFFMPV 62

Query: 98  YPTCDLTP-TGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQE 156
           Y +C+ T  +G P  F +  ++++A+ L+S N P+W+R +G DH FV  HDFGACFH  +
Sbjct: 63  YVSCNFTSRSGFPTLFHASDILQAAVGLVSRNMPFWDRHQGRDHVFVATHDFGACFHAMD 122

Query: 157 EKAIERGILPLLQRATLVQTFGQRN-HVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIF 215
             A+  GI   L+ + ++QTFG++N H C N   I IPPY                R I 
Sbjct: 123 -LAVTMGIPQFLRNSIILQTFGEKNKHPCQNVDHIQIPPY--------------VRRKIL 167

Query: 216 VYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCP 275
            +FRG       +  G  Y+RG R  +W  F ++  F I       Y  +M R++FCLCP
Sbjct: 168 AFFRGKMEIHPKNVSGHMYSRGVRTTIWRRFSHDRRFFIKRKRSDNYKAEMLRSVFCLCP 227

Query: 276 LGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIP 335
           LGWAPWSPR+VE+V+ GCIPVIIAD+I LP++  I W +I + VAE DV KLD IL+ + 
Sbjct: 228 LGWAPWSPRIVESVIQGCIPVIIADNIQLPYSHVIDWRKISVTVAERDVHKLDRILSRVA 287

Query: 336 PEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARK 377
              +   Q  L    +++A+++ QP   GDA  Q+L+ L+++
Sbjct: 288 ATNVSMIQANLWRDEVRQALVYNQPLVRGDATWQVLDLLSKR 329


>gi|168003473|ref|XP_001754437.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162694539|gb|EDQ80887.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 353

 Score =  294 bits (752), Expect = 7e-77,   Method: Compositional matrix adjust.
 Identities = 147/343 (42%), Positives = 213/343 (62%), Gaps = 5/343 (1%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
           +++YVY+LP+K+N+  L  D RC  H+FAAE+ +H+ L++SP+RTL+P EAD+F+ P+Y 
Sbjct: 5   IRIYVYDLPAKFNEDWL-ADERCSNHLFAAEVAIHKVLMTSPIRTLDPCEADFFFIPVYV 63

Query: 100 TCDLTP-TGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
           +C  TP TG P   ++ + M +A+  +S+   +WNR+ G DH FV  HD+GACFH  E +
Sbjct: 64  SCKFTPKTGFPWLGQARKFMEAAVNHVSTRMEFWNRSGGRDHIFVASHDYGACFHTLETE 123

Query: 159 AIERGILPLLQRATLVQTFG-QRNHVCLNEGSITIPPYAPPQKMQAH-QIPPD-TPRSIF 215
           AI  GI   ++++ ++QTFG Q  H C     I IPPY  P    ++ + PP+   R+IF
Sbjct: 124 AIAHGIPEFMRKSLILQTFGVQDFHPCQAAEHIQIPPYVSPSVAASYIKDPPERQKRNIF 183

Query: 216 VYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCP 275
            +FRG       +  G  Y+RG R  +++ F +N  F +       Y  +M R+ FCLCP
Sbjct: 184 AFFRGKMEINPKNVSGLVYSRGVRTVLYKKFSHNRRFLLKRHRTDNYQLEMLRSTFCLCP 243

Query: 276 LGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIP 335
           +GWAPWSPR+VEAVV GC+PVIIAD+I LP++ AI W  I + V E DVPKLD IL ++ 
Sbjct: 244 VGWAPWSPRIVEAVVHGCVPVIIADNISLPYSHAIDWTGISLSVREHDVPKLDKILLNVA 303

Query: 336 PEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKL 378
              +   Q  L     +RA+LF  P   GDA   + + L+ KL
Sbjct: 304 ATNLSTIQHNLWKEENRRALLFTDPLVKGDATWHVFDRLSTKL 346


>gi|168059925|ref|XP_001781950.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666596|gb|EDQ53246.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 356

 Score =  293 bits (751), Expect = 8e-77,   Method: Compositional matrix adjust.
 Identities = 145/343 (42%), Positives = 212/343 (61%), Gaps = 5/343 (1%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
           ++VYVY+LP K+NK  L+ D RC  H+FA+E+ +H+ LLSSP++TLNP EAD+F+ P+Y 
Sbjct: 5   IRVYVYDLPQKFNKDWLV-DERCSNHLFASEVAIHKILLSSPIKTLNPYEADFFFMPVYF 63

Query: 100 TCDLTP-TGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
           +C  +  TG P    +P++M  A+  +SS   +WNR+ G DH FV  HDFGACFH  E +
Sbjct: 64  SCKFSSKTGFPRLGHAPKLMEDAVNHVSSMMEFWNRSGGKDHVFVAAHDFGACFHSLESE 123

Query: 159 AIERGILPLLQRATLVQTFGQRN-HVCLNEGSITIPPYAPPQKMQAH--QIPPDTPRSIF 215
           AI  GI  ++Q + ++QTFG    H C    +I IPPY  P  + ++  + P +  R+IF
Sbjct: 124 AIAHGIPEIVQSSLILQTFGVHGFHPCQAAENIQIPPYISPSTVFSYVKKPPEEQRRNIF 183

Query: 216 VYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCP 275
            +FRG       +  G  Y+RG R  +++ F  N  F +       Y  D+ R+ FCLCP
Sbjct: 184 AFFRGKMEINPKNVSGLVYSRGVRTYIYKKFSRNRRFFLKRHRADNYQLDLLRSTFCLCP 243

Query: 276 LGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIP 335
           LGWAPWSPR+VEAV +GC+PVIIAD+I LP++ AI W  + + + E DV KL  IL ++ 
Sbjct: 244 LGWAPWSPRIVEAVAYGCVPVIIADNIRLPYSHAIDWSNMSLNIREHDVHKLYKILLNVA 303

Query: 336 PEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKL 378
            + +   Q+ L     +R +LF +P   GDA   +L+ L+ K+
Sbjct: 304 AKNLSSIQKNLWKEENRRVLLFMEPLAKGDATWHMLDRLSTKV 346


>gi|356542668|ref|XP_003539788.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
           IRX7-like [Glycine max]
          Length = 461

 Score =  293 bits (750), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 143/346 (41%), Positives = 210/346 (60%), Gaps = 5/346 (1%)

Query: 37  VGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP 96
           +  LKV+VY+LP KYN   L  + RC  H+FA+E+ +HR LL+S VRT +P +AD+F+ P
Sbjct: 100 LKNLKVFVYDLPQKYNTDWL-SNERCSKHLFASEVAIHRALLTSEVRTFDPYDADFFFVP 158

Query: 97  IYPTCDL-TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQ 155
           +Y +C+  T  G P    +  ++ SA+ L+SS +P+WNR+ G+DH FV  HDFG+CFH  
Sbjct: 159 VYVSCNFSTVNGFPAIGHARSLIASAVSLVSSEYPFWNRSRGSDHVFVASHDFGSCFHTL 218

Query: 156 EEKAIERGILPLLQRATLVQTFGQ-RNHVCLNEGSITIPPYAPPQKMQ--AHQIPPDTPR 212
           E+ A+  G+  +++ + ++QTFG   +H C +   + IPPY  P+ ++      P +  R
Sbjct: 219 EDVAMADGVPEIMRNSIVLQTFGVVYDHPCQSVEHVVIPPYVSPESVRDTMENFPVNGRR 278

Query: 213 SIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFC 272
            I+ +FRG       +  G +Y++  R  +W  F  +  F +       Y  ++ R++FC
Sbjct: 279 DIWAFFRGKMELHPKNVSGRFYSKKVRTVIWRKFNGDRRFYLQRQRFAGYQSEIARSVFC 338

Query: 273 LCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILT 332
           LCPLGWAPWSPRLVE+V  GC+PVIIAD I LPF  A+ W EI + VAE+DV +L  IL 
Sbjct: 339 LCPLGWAPWSPRLVESVALGCVPVIIADGIRLPFISAVKWPEISITVAEKDVGRLAEILE 398

Query: 333 SIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKL 378
            +    +   QR L +P  + A+LF    Q GDA  QIL  L+ KL
Sbjct: 399 RVAATNLSTIQRNLWDPVTRSALLFNSQVQKGDATWQILRALSEKL 444


>gi|414864293|tpg|DAA42850.1| TPA: hypothetical protein ZEAMMB73_024068 [Zea mays]
          Length = 434

 Score =  293 bits (749), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 145/356 (40%), Positives = 212/356 (59%), Gaps = 4/356 (1%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
           ++++YVY+LP+++N+     D RC  H+FAAE+ +H  LL+   R   PE+AD F+ P+Y
Sbjct: 77  RVRIYVYDLPARFNRDWAAADARCSRHLFAAEVAVHEALLAYAGRAARPEDADLFFVPVY 136

Query: 99  PTCDL-TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEE 157
            +C+  TP G P    +  ++  A+ L+ +  PYWNR+ GADH FV  HDFGACFH  E+
Sbjct: 137 VSCNFSTPNGFPSLSHARGLLADAVDLVRARMPYWNRSAGADHVFVASHDFGACFHPMED 196

Query: 158 KAIERGILPLLQRATLVQTFG-QRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFV 216
            AI  GI   L+R+ L+QTFG Q +HVC     + IPP+ PP+       P    R IF 
Sbjct: 197 VAIADGIPEFLKRSILLQTFGVQGHHVCQEVEHVVIPPHVPPEVAHELPEPEKAQRDIFA 256

Query: 217 YFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPL 276
           +FRG       +  G +Y++  R  + +++  N  F +       Y  +M R++FCLCPL
Sbjct: 257 FFRGKMEVHPKNISGRFYSKKVRTELLQHYGRNRKFYLKRKRFDNYRSEMARSLFCLCPL 316

Query: 277 GWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPP 336
           GWAPWSPRLVE+V+ GCIPVIIADDI LPF   + W+EI + VAE+D+  L  +L  +  
Sbjct: 317 GWAPWSPRLVESVLLGCIPVIIADDIRLPFPPVLQWQEISLQVAEKDIASLGMVLDHVVA 376

Query: 337 EVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDKSTYLKPGGKF 392
             +   Q+ L +P  +RA++F +P + GDA  Q+L  L   L  D+S   +  G +
Sbjct: 377 TNLTVIQKNLWDPVKRRALVFNRPMEAGDATWQVLRELEVLL--DQSRRRRSVGSY 430


>gi|356547155|ref|XP_003541982.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
           IRX7-like [Glycine max]
          Length = 458

 Score =  291 bits (746), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 147/364 (40%), Positives = 223/364 (61%), Gaps = 13/364 (3%)

Query: 21  IERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSS 80
           +E  S + G VL++     +KV+VYELP KYN   L  + RC  H+FA+E+ +HR LL+S
Sbjct: 85  VESTSNTLG-VLKN-----MKVFVYELPPKYNTDWLANE-RCSNHLFASEVAIHRALLTS 137

Query: 81  PVRTLNPEEADWFYTPIYPTCDLTP-TGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGAD 139
            VRT +P EAD+F+ P+Y +C+ +   G P    +  ++ SA+ L+S+ +P+WNR+ G+D
Sbjct: 138 EVRTFDPYEADFFFVPVYVSCNFSAVNGFPAIGHARTLISSAVNLVSTEYPFWNRSRGSD 197

Query: 140 HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRN-HVCLNEGSITIPPYAPP 198
           H FV  HDFGACFH  E+ A+  GI  +L+ + ++QTFG  + H C +  ++ IPPY  P
Sbjct: 198 HVFVASHDFGACFHTLEDVAMADGIPKILKNSIVLQTFGVIHPHPCQDVENVVIPPYVAP 257

Query: 199 QKMQA--HQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDIST 256
           + +++   + P +  R I+ +FRG       +  G +Y++  R  +W  F  +  F +  
Sbjct: 258 ESVRSTLEKFPVNGRRDIWAFFRGKMEVHPKNVSGQFYSKRVRTEIWRKFNGDRRFYLQR 317

Query: 257 DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG 316
                Y  ++ R++FCLCPLGWAPWSPRLVE+V  GC+PV+IAD I LPF+ A+ W EI 
Sbjct: 318 RRFAGYQLEIARSVFCLCPLGWAPWSPRLVESVALGCVPVVIADGIRLPFSSAVRWSEIS 377

Query: 317 MFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLF--PQPAQPGDAFHQILNGL 374
           + VAE DV KL  IL  +    +   Q+ L +P  +RA+LF   +  + GDA  Q++  L
Sbjct: 378 LTVAERDVGKLGKILERVAATNLSVIQKSLWDPGTRRALLFNNNKKVEEGDATWQVMVSL 437

Query: 375 ARKL 378
           + KL
Sbjct: 438 SEKL 441


>gi|356541948|ref|XP_003539434.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
           IRX7-like [Glycine max]
          Length = 459

 Score =  290 bits (743), Expect = 8e-76,   Method: Compositional matrix adjust.
 Identities = 144/345 (41%), Positives = 209/345 (60%), Gaps = 6/345 (1%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
            +KV+VYELP KYN   L  + RC +H+FA+E+ +HR LL+S VRT +P EAD+F+ P+Y
Sbjct: 99  NMKVFVYELPPKYNTDWLANE-RCSSHLFASEVAIHRALLTSEVRTFDPYEADFFFVPVY 157

Query: 99  PTCDLTPTG-LPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEE 157
            +C+ +     P    +  ++ SA+ L+S+ +P+WNR+ G+DH FV  HDFGACFH  E+
Sbjct: 158 VSCNFSAVNDFPAIGHARTLISSAVNLVSTEYPFWNRSRGSDHVFVASHDFGACFHTLED 217

Query: 158 KAIERGILPLLQRATLVQTFGQ-RNHVCLNEGSITIPPYAPPQKMQA--HQIPPDTPRSI 214
            A+  GI  +L+ + ++QTFG    H C    ++ IPPY  P+ +++   + P    R I
Sbjct: 218 VAMADGIPIILKNSIVLQTFGVIHQHPCQEVENVVIPPYVSPESVRSTLEKFPVTGRRDI 277

Query: 215 FVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLC 274
           F +FRG       +    +Y++  R  +W  F  +  F +       Y  ++ R++FCLC
Sbjct: 278 FAFFRGKMEVHPKNVSRRFYSKRVRTEIWRKFNGDRRFYLQRHRFAGYQLEIARSVFCLC 337

Query: 275 PLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSI 334
           PLGWAPWSPRLVE+V  GC+PV+IAD I LPF+ A+ W EI + VAE DV KL  IL  +
Sbjct: 338 PLGWAPWSPRLVESVALGCVPVVIADGIQLPFSSAVRWSEISLSVAERDVGKLGKILERV 397

Query: 335 PPEVILRKQRLLANPSMKRAMLF-PQPAQPGDAFHQILNGLARKL 378
               +   QR L +P  +RA+LF     Q GDA  Q+L  L+ KL
Sbjct: 398 AATNLSVIQRNLWDPRTRRALLFNNNKVQEGDATWQVLVSLSEKL 442


>gi|168049543|ref|XP_001777222.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162671450|gb|EDQ58002.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 368

 Score =  290 bits (741), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 145/343 (42%), Positives = 208/343 (60%), Gaps = 5/343 (1%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
           +++YVY+LP K+N+  L+ D RC  H+FA+E+ +H+ LL+SP+RTL+P EAD+F+ P+Y 
Sbjct: 20  IRIYVYDLPPKFNEDWLV-DERCSNHLFASEVAIHKILLTSPIRTLDPYEADFFFMPVYV 78

Query: 100 TCDLTP-TGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
           +C  +P TG P    +P++M++A+  +S+   +WNR+ G DH FV  HD+GACFH  E +
Sbjct: 79  SCKFSPKTGFPWLGHAPKLMQAAVNHVSTKMEFWNRSWGRDHIFVAAHDYGACFHTLETQ 138

Query: 159 AIERGILPLLQRATLVQTFGQRN-HVCLNEGSITIPPYAPPQKMQAHQIPP--DTPRSIF 215
           AI +GI   ++ + ++QTFG +  H C     I IPPY  P    ++   P     R IF
Sbjct: 139 AIAQGIPQFMRNSLILQTFGVKGFHPCQAAEHIQIPPYISPSVAVSYVKDPLEHQQRDIF 198

Query: 216 VYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCP 275
            YFRG       +  G  Y++G R  +++ F  N  F +         ++M R+ FCLCP
Sbjct: 199 AYFRGKMEINPKNVSGLLYSKGIRTVLYKRFSRNKRFVLKRHRVDNSQQEMLRSTFCLCP 258

Query: 276 LGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIP 335
           LGWAPWSPR+VEAV +GCIPVIIAD+I LP++  I W  I + V E DVPKLD IL  + 
Sbjct: 259 LGWAPWSPRIVEAVTYGCIPVIIADNISLPYSHTIDWSSISLTVPEHDVPKLDKILIGVA 318

Query: 336 PEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKL 378
              +   QR L     +RA+LF  P   GDA   I + L+  L
Sbjct: 319 VTNLTAIQRNLWRGENRRALLFTDPLVQGDATWHIFDLLSSTL 361


>gi|293334733|ref|NP_001169191.1| uncharacterized protein LOC100383044 [Zea mays]
 gi|223975431|gb|ACN31903.1| unknown [Zea mays]
 gi|413957212|gb|AFW89861.1| hypothetical protein ZEAMMB73_311893 [Zea mays]
          Length = 428

 Score =  290 bits (741), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 144/355 (40%), Positives = 212/355 (59%), Gaps = 4/355 (1%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
           +++YVY+LP+++N+   + D RC  H+FAAE+ +H  LL+   R   PE+AD F+ P+Y 
Sbjct: 74  VRIYVYDLPARFNRDWAVADARCARHLFAAEVAVHEALLAYTGRAARPEDADLFFVPVYV 133

Query: 100 TCDL-TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
           +C+  TP G P    +  M+  A+ L+ +  PYWNR+ GADH FV  HDFGACFH  E+ 
Sbjct: 134 SCNFSTPNGFPSLSHARGMLADAVDLVQAGMPYWNRSAGADHVFVASHDFGACFHPMEDV 193

Query: 159 AIERGILPLLQRATLVQTFG-QRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVY 217
           AI  GI   L+R+ L+QTFG Q +H C     + IPP+ PP+       P    R IF +
Sbjct: 194 AIADGIPEFLKRSILLQTFGVQGHHTCQEVEHVVIPPHVPPEVEHELPEPEKAQRDIFAF 253

Query: 218 FRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLG 277
           FRG       +  G +Y++  R  + +++  N  F +       Y  +M R++FCLCPLG
Sbjct: 254 FRGKMEVHPKNISGRFYSKKVRTELLQHYGRNRKFYLKRKRFDNYRSEMARSLFCLCPLG 313

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
           WAPWSPRLVE+V+ GCIPVIIAD+I +PF   + W EI + VAE+DV  L+ +L  +   
Sbjct: 314 WAPWSPRLVESVLLGCIPVIIADNIRMPFPSVLQWPEISLQVAEKDVANLEVVLDHVVAT 373

Query: 338 VILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDKSTYLKPGGKF 392
            +   Q+ L +P  ++A++F +P + GDA  Q+L  L   L  D+S   + GG +
Sbjct: 374 NLSVIQKNLWDPVKRKALVFNRPMEVGDATWQVLRELEVLL--DQSQRRRYGGSW 426


>gi|449456052|ref|XP_004145764.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
           IRX7-like [Cucumis sativus]
          Length = 459

 Score =  288 bits (738), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 147/372 (39%), Positives = 219/372 (58%), Gaps = 11/372 (2%)

Query: 11  AFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAE 70
           A +   NT  + ++  + G +LE+     +KV+VY+LP KYN + L  + RC  H+FA+E
Sbjct: 83  ALIETSNTTFLRQVQQNQG-LLEE-----VKVFVYDLPPKYNVEWL-SNERCSNHLFASE 135

Query: 71  IFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDL-TPTGLPLPFKSPRMMRSAIQLISSNW 129
           + +HR LL+S  RT +P EAD+F+ P+Y +C+  T  G P    +  ++ SA+  ISS++
Sbjct: 136 VAIHRALLNSDYRTFDPLEADFFFVPVYVSCNFSTVNGFPAIGHARSLISSAVSHISSHY 195

Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR-NHVCLNEG 188
            +WNRT G+DH FV  HDF +CFH  E  AI  G+   L+ + ++QTFG +  H C +  
Sbjct: 196 SFWNRTNGSDHVFVASHDFASCFHTMEHVAIADGVPSFLKNSIILQTFGVKYKHPCQDVE 255

Query: 189 SITIPPYAPPQKMQA--HQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENF 246
            + IPPY  P+ ++    + P    R IF +FRG       +  G +Y++  R  +W  F
Sbjct: 256 HVVIPPYISPESIENTLERSPVTGRRDIFAFFRGKMEMNPKNVSGRFYSKKVRTMIWRKF 315

Query: 247 KNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 306
             +  F +       Y  ++ R++FCLCPLGWAPWSPRLVE+V  GC+PVIIAD I LPF
Sbjct: 316 NGDRRFYLQRHRFPGYQSEIVRSVFCLCPLGWAPWSPRLVESVALGCVPVIIADGIRLPF 375

Query: 307 ADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDA 366
             A+ W EI + VAE+D+ KL  IL  +    +   Q+ L +P  +RA+LF    + GDA
Sbjct: 376 PSAVNWPEISITVAEKDIGKLGRILDHVAASNLTTIQKNLWDPRNRRALLFHNQVEDGDA 435

Query: 367 FHQILNGLARKL 378
             Q++  L+ KL
Sbjct: 436 TWQVIGALSEKL 447


>gi|115450193|ref|NP_001048697.1| Os03g0107900 [Oryza sativa Japonica Group]
 gi|122247627|sp|Q10SX7.1|GT31_ORYSJ RecName: Full=Probable glucuronosyltransferase Os03g0107900
 gi|108705764|gb|ABF93559.1| exostosin family protein, putative, expressed [Oryza sativa
           Japonica Group]
 gi|113547168|dbj|BAF10611.1| Os03g0107900 [Oryza sativa Japonica Group]
 gi|125542077|gb|EAY88216.1| hypothetical protein OsI_09667 [Oryza sativa Indica Group]
 gi|215766485|dbj|BAG98793.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222624042|gb|EEE58174.1| hypothetical protein OsJ_09104 [Oryza sativa Japonica Group]
          Length = 427

 Score =  288 bits (737), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 140/341 (41%), Positives = 203/341 (59%), Gaps = 2/341 (0%)

Query: 36  PVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYT 95
           P   L++YVY+LP+++N+  +  D RC TH+FAAE+ +H  LL+   R   P++A  F+ 
Sbjct: 70  PRPALRIYVYDLPARFNRHWVAADARCATHLFAAEVALHEALLAYAGRAARPDDATLFFV 129

Query: 96  PIYPTCDL-TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHY 154
           P+Y +C+  T  G P    +  ++  A+ L+ +  PYWNR+ GADH FV  HDFGACFH 
Sbjct: 130 PVYVSCNFSTDNGFPSLSHARALLADAVDLVRAQMPYWNRSAGADHVFVASHDFGACFHP 189

Query: 155 QEEKAIERGILPLLQRATLVQTFG-QRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRS 213
            E+ AI  GI   L+R+ L+QTFG Q  HVC     + IPP+ PP+       P    R 
Sbjct: 190 MEDVAIADGIPEFLKRSILLQTFGVQGTHVCQEADHVVIPPHVPPEVALELPEPEKAQRD 249

Query: 214 IFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCL 273
           IF +FRG       +  G +Y++  R  + + +  N  F +       Y  +M R++FCL
Sbjct: 250 IFAFFRGKMEVHPKNISGRFYSKKVRTELLQKYGRNRKFYLKRKRYGNYRSEMARSLFCL 309

Query: 274 CPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTS 333
           CPLGWAPWSPRLVE+V+ GCIPVIIADDI LPF   + W +I + VAE+DV  L+ +L  
Sbjct: 310 CPLGWAPWSPRLVESVLLGCIPVIIADDIRLPFPSVLQWLDISLQVAEKDVASLEMVLDH 369

Query: 334 IPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           +    +   Q+ L +P  ++A++F +P + GDA  Q+L  L
Sbjct: 370 VVATNLTVIQKNLWDPVKRKALVFNRPMEEGDATWQVLREL 410


>gi|449496198|ref|XP_004160070.1| PREDICTED: LOW QUALITY PROTEIN: probable glucuronoxylan
           glucuronosyltransferase IRX7-like [Cucumis sativus]
          Length = 459

 Score =  288 bits (737), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 147/372 (39%), Positives = 220/372 (59%), Gaps = 11/372 (2%)

Query: 11  AFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAE 70
           A +   NT  + ++  +  ++LE+     +KV+VY+LP KYN + L  + RC  H+FA+E
Sbjct: 83  ALIETSNTTFLRQVQQNQ-ELLEE-----VKVFVYDLPPKYNVEWL-SNERCSNHLFASE 135

Query: 71  IFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDL-TPTGLPLPFKSPRMMRSAIQLISSNW 129
           + +HR LL+S  RT +P EAD+F+ P+Y +C+  T  G P    +  ++ SA+  ISS++
Sbjct: 136 VAIHRALLNSHYRTFDPLEADFFFVPVYVSCNFSTVNGFPAIGHARSLISSAVSHISSHY 195

Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR-NHVCLNEG 188
            +WNRT G+DH FV  HDF +CFH  E  AI  G+   L+ + ++QTFG +  H C +  
Sbjct: 196 SFWNRTNGSDHVFVASHDFASCFHTMEHVAIADGVPSFLKNSIILQTFGVKYKHPCQDVE 255

Query: 189 SITIPPYAPPQKMQA--HQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENF 246
            + IPPY PP+ ++    + P    R IF +FRG       +  G +Y++  R  +W  F
Sbjct: 256 HVVIPPYIPPESIENTLERSPVTGRRDIFAFFRGKMEMNPKNVSGRFYSKKVRTMIWRKF 315

Query: 247 KNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 306
             +  F +       Y  ++ R++FCLCPLGWAPWSPRLVE+V  GC+PVIIAD I LPF
Sbjct: 316 NGDRRFYLQRHRFPGYQSEIVRSVFCLCPLGWAPWSPRLVESVALGCVPVIIADGIRLPF 375

Query: 307 ADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDA 366
             A+ W EI + VAE+D+ KL  IL  +    +   Q+ L +P  +RA+LF    + GDA
Sbjct: 376 PSAVNWPEISITVAEKDIGKLGRILDHVAGSNLTTIQKNLWDPRNRRALLFHNQVEDGDA 435

Query: 367 FHQILNGLARKL 378
             Q++  L+ KL
Sbjct: 436 TWQVIGALSEKL 447


>gi|242042569|ref|XP_002468679.1| hypothetical protein SORBIDRAFT_01g050110 [Sorghum bicolor]
 gi|241922533|gb|EER95677.1| hypothetical protein SORBIDRAFT_01g050110 [Sorghum bicolor]
          Length = 429

 Score =  287 bits (735), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 139/337 (41%), Positives = 203/337 (60%), Gaps = 2/337 (0%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
           +++YVY+LP+++N+     D RC  H+FAAE+ +H  LL+   R   PE+AD F+ P+Y 
Sbjct: 75  VRIYVYDLPARFNRDWAAADARCARHLFAAEVAVHEALLAYAGRAARPEDADLFFVPVYV 134

Query: 100 TCDL-TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
           +C+  TP G P    +  ++  A+ L+    PYWNR+ GADH FV  HDFGACFH  E+ 
Sbjct: 135 SCNFSTPNGFPSLSHARGLLAEAVDLVRVRMPYWNRSAGADHVFVASHDFGACFHPMEDV 194

Query: 159 AIERGILPLLQRATLVQTFG-QRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVY 217
           AI  GI   L+R+ L+QTFG Q +HVC     + IPP+ PP+       P    R IF +
Sbjct: 195 AIADGIPEFLKRSILLQTFGVQGHHVCQEVEHVVIPPHVPPEVAHELPEPEKAQRDIFAF 254

Query: 218 FRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLG 277
           FRG       +  G +Y++  R  + +++  N  F +       Y  +M R++FCLCPLG
Sbjct: 255 FRGKMEVHPKNISGRFYSKKVRTELLQHYGRNRKFYLKRKRFDNYRSEMARSLFCLCPLG 314

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
           WAPWSPRLVE+V+ GCIPVIIAD+I LPF   + W EI + VAE+D+  L+ +L  +   
Sbjct: 315 WAPWSPRLVESVLLGCIPVIIADNIRLPFPSVLQWPEISLQVAEKDIANLEMVLDHVVAT 374

Query: 338 VILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
            +   Q+ L +P  ++A++F +P + GDA  Q+L  L
Sbjct: 375 NLTMIQKNLWDPVKRKALVFNRPMEVGDATWQVLREL 411


>gi|357121010|ref|XP_003562215.1| PREDICTED: probable glucuronosyltransferase Os03g0107900-like
           [Brachypodium distachyon]
          Length = 441

 Score =  283 bits (724), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 142/349 (40%), Positives = 206/349 (59%), Gaps = 3/349 (0%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
           +++YVY+LP ++N+  L  DPRC  H+FAAE+ +H  LL      + PE+AD F+ P+Y 
Sbjct: 89  VRIYVYDLPRRFNRDWLAADPRCARHLFAAEVALHEALLGH-YSAVRPEDADLFFVPVYV 147

Query: 100 TCDL-TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
           +C+  TP G P    +  ++  A+ L+    PYWNR+ GADH FV  HDFGACFH  E+ 
Sbjct: 148 SCNFSTPNGFPSLSHARGLLADAVDLVRREAPYWNRSAGADHVFVASHDFGACFHPMEDV 207

Query: 159 AIERGILPLLQRATLVQTFG-QRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVY 217
           AI  GI   L+R+ L+QTFG Q  HVC     + IPP+ PP+          T R IF +
Sbjct: 208 AIADGIPDFLKRSILLQTFGVQGPHVCQEAEHVVIPPHVPPEVALEILELEKTRRDIFAF 267

Query: 218 FRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLG 277
           FRG       +  G +Y++  R  + + +  N  F +       Y  +M R++FCLCPLG
Sbjct: 268 FRGKMEVHPKNISGRFYSKKVRTELLQRYGRNSKFYLKRKRYDNYRSEMARSLFCLCPLG 327

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
           WAPWSPRLVE+V+ GCIPVIIAD+I LPF   + W +I + VAE+DV  L+ +L  +   
Sbjct: 328 WAPWSPRLVESVLLGCIPVIIADNIRLPFPSVLRWSDISLQVAEKDVASLEKVLDHVVAT 387

Query: 338 VILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDKSTYL 386
            +   Q+ L +P  ++A++F +  + GDA  Q+L  L   L   +S+Y+
Sbjct: 388 NLTVIQKNLWDPVKRKALVFNRRLEEGDATWQVLRELEVLLDRSQSSYV 436


>gi|357141002|ref|XP_003572039.1| PREDICTED: probable glucuronosyltransferase Os03g0107900-like
           [Brachypodium distachyon]
          Length = 465

 Score =  282 bits (721), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 142/343 (41%), Positives = 204/343 (59%), Gaps = 3/343 (0%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
           +++YVY+LP ++N+  +  D RC  H+FAAE+ +H  LL   +R   PEEAD F  P+Y 
Sbjct: 111 VRIYVYDLPPRFNRAWVSADARCGRHLFAAEVAVHEALLRRHLRA-RPEEADLFLVPVYV 169

Query: 100 TCDL-TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
           +C+  TPTGLP    +  ++  A++L+  + PYWNR+ G DH FV  HDFGACFH  E+ 
Sbjct: 170 SCNFSTPTGLPSLKHARGLLAEAVELVRRDMPYWNRSAGTDHVFVASHDFGACFHAMEDV 229

Query: 159 AIERGILPLLQRATLVQTFG-QRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVY 217
           AI  GI   L+R+ L+QTFG Q  H C     + IPP+  P+  +    P  + R IF +
Sbjct: 230 AIAGGIPEFLKRSILLQTFGVQGRHTCQEVEHVVIPPHVLPEVARELPEPEKSHRDIFAF 289

Query: 218 FRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLG 277
           FRG       +  G +Y +  R  + + + +N  F +       Y  +M R++FCLCPLG
Sbjct: 290 FRGKMEVHPKNMSGRFYGKKVRTKLLQLYGHNRKFYLKRKQHDGYRLEMARSLFCLCPLG 349

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
           WAPWSPRLVE+V+ GCIPVIIAD+I LPF   + W +I + VAE D+  L+ +L  +   
Sbjct: 350 WAPWSPRLVESVLLGCIPVIIADNIRLPFPGVLRWPDISLQVAERDIANLEAMLDHVAST 409

Query: 338 VILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPH 380
            +   Q  L +P  ++A++F QP + GDA  Q+L  L  KL H
Sbjct: 410 NLTTIQGNLWDPVKRKALVFNQPMEEGDATWQVLKELEAKLGH 452


>gi|24476038|gb|AAN62780.1| Unknown protein [Oryza sativa Japonica Group]
          Length = 449

 Score =  278 bits (712), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 140/359 (38%), Positives = 203/359 (56%), Gaps = 24/359 (6%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
           L++YVY+LP+++N+  +  D RC TH+FAAE+ +H  LL+   R   P++A  F+ P+Y 
Sbjct: 74  LRIYVYDLPARFNRHWVAADARCATHLFAAEVALHEALLAYAGRAARPDDATLFFVPVYV 133

Query: 100 TCDL-TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFH----- 153
           +C+  T  G P    +  ++  A+ L+ +  PYWNR+ GADH FV  HDFGACFH     
Sbjct: 134 SCNFSTDNGFPSLSHARALLADAVDLVRAQMPYWNRSAGADHVFVASHDFGACFHPMELF 193

Query: 154 -----------------YQEEKAIERGILPLLQRATLVQTFG-QRNHVCLNEGSITIPPY 195
                             QE+ AI  GI   L+R+ L+QTFG Q  HVC     + IPP+
Sbjct: 194 VIIHFELGVNAKSNLALGQEDVAIADGIPEFLKRSILLQTFGVQGTHVCQEADHVVIPPH 253

Query: 196 APPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDIS 255
            PP+       P    R IF +FRG       +  G +Y++  R  + + +  N  F + 
Sbjct: 254 VPPEVALELPEPEKAQRDIFAFFRGKMEVHPKNISGRFYSKKVRTELLQKYGRNRKFYLK 313

Query: 256 TDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEI 315
                 Y  +M R++FCLCPLGWAPWSPRLVE+V+ GCIPVIIADDI LPF   + W +I
Sbjct: 314 RKRYGNYRSEMARSLFCLCPLGWAPWSPRLVESVLLGCIPVIIADDIRLPFPSVLQWLDI 373

Query: 316 GMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
            + VAE+DV  L+ +L  +    +   Q+ L +P  ++A++F +P + GDA  Q+L  L
Sbjct: 374 SLQVAEKDVASLEMVLDHVVATNLTVIQKNLWDPVKRKALVFNRPMEEGDATWQVLREL 432


>gi|326514612|dbj|BAJ96293.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 428

 Score =  273 bits (698), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 142/349 (40%), Positives = 202/349 (57%), Gaps = 4/349 (1%)

Query: 34  DDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRT-LNPEEADW 92
            +P   +++Y Y+LP ++N++    D RC  H+FAAE+ +H  LL    R  L PEEAD 
Sbjct: 67  SEPRPGVRIYAYDLPPRFNRRWAAADARCSRHLFAAEVAVHEALLLRQRRAGLRPEEADL 126

Query: 93  FYTPIYPTCDL-TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGAC 151
           F  P+Y  C+  TPTGLP    +  ++  A+ L+ +  P+WNR+ GADH FV  HDFGAC
Sbjct: 127 FLVPVYACCNFSTPTGLPSLAHARGLLADAVGLVRAQMPFWNRSAGADHVFVASHDFGAC 186

Query: 152 FHYQEEKAIERGILPLLQRATLVQTFG-QRNHVCLNEGSITIPPYAPPQKMQAHQIPPDT 210
           FH  E+ A+  GI   L+ + L+QTFG Q  H C +   + IPPY PP+        P+ 
Sbjct: 187 FHPMEDVAMAAGIPEFLKGSILLQTFGVQGRHPCQDVEHVVIPPYVPPELAPRELPEPEK 246

Query: 211 P-RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRA 269
             R IF +FRG       +  G +Y+R  R  +   +  N  F +       Y  +M R+
Sbjct: 247 AHRDIFAFFRGKMEVHPKNISGHFYSRKVRTELLRLYGRNRKFYLKRKRNDGYRSEMARS 306

Query: 270 IFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDT 329
           +FC+CPLGWAPWSPRLVE+V+ GCIPV+IADDI LPF   + W +I + VAE DV  L+ 
Sbjct: 307 LFCICPLGWAPWSPRLVESVLLGCIPVVIADDIRLPFPGVLRWPDISLQVAERDVAGLEA 366

Query: 330 ILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKL 378
           +L  +    +   Q  L +P  ++A++F +P + GDA  Q+L  L  KL
Sbjct: 367 VLDHVAATNLTTIQGNLWDPVKRKALVFNRPMEEGDATWQVLKELEAKL 415


>gi|388503508|gb|AFK39820.1| unknown [Lotus japonicus]
          Length = 141

 Score =  268 bits (686), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 126/141 (89%), Positives = 136/141 (96%)

Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
           MQRA+FCLCPLGWAPWSPRLVEAVVFGCIPVI+ADDIVLPFADAIPWE+IG+FV EEDVP
Sbjct: 1   MQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIVADDIVLPFADAIPWEDIGVFVDEEDVP 60

Query: 326 KLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDKSTY 385
           KLDTILTSIPPE+ILRKQRLLANPSMK+AMLFPQPAQPGDAFHQ+LNGLARKLPHD+S +
Sbjct: 61  KLDTILTSIPPEIILRKQRLLANPSMKQAMLFPQPAQPGDAFHQVLNGLARKLPHDRSVF 120

Query: 386 LKPGGKFLNWTSGPVGDLKPW 406
           LK G K LNWT+GPVGDLKPW
Sbjct: 121 LKAGDKILNWTAGPVGDLKPW 141


>gi|388510202|gb|AFK43167.1| unknown [Medicago truncatula]
          Length = 216

 Score =  266 bits (679), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 125/141 (88%), Positives = 131/141 (92%)

Query: 22  ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP 81
           ERISGSAGDVLEDDPVG+LKV+VYELPSKYNKK+L KDPRCLTHMFAAEIFMHRFLLSS 
Sbjct: 32  ERISGSAGDVLEDDPVGRLKVFVYELPSKYNKKILQKDPRCLTHMFAAEIFMHRFLLSSA 91

Query: 82  VRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 141
           VRTLNPEEADWFYTP+Y TCDLTP GLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF
Sbjct: 92  VRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 151

Query: 142 FVVPHDFGACFHYQEEKAIER 162
           FVVPHDFGACF  + E   +R
Sbjct: 152 FVVPHDFGACFLPRREGNRKR 172


>gi|255633862|gb|ACU17292.1| unknown [Glycine max]
          Length = 141

 Score =  265 bits (678), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 126/141 (89%), Positives = 136/141 (96%)

Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
           MQRA+FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG+FV E+DVP
Sbjct: 1   MQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVDEKDVP 60

Query: 326 KLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDKSTY 385
           +LDTILTSIPPEVILRKQRLLANP MK+AMLFPQPAQPGDAFHQ+LNGLARKLPHD+S +
Sbjct: 61  QLDTILTSIPPEVILRKQRLLANPFMKQAMLFPQPAQPGDAFHQVLNGLARKLPHDRSVF 120

Query: 386 LKPGGKFLNWTSGPVGDLKPW 406
           LKPG + LNWT+GPVGDLKPW
Sbjct: 121 LKPGEEALNWTAGPVGDLKPW 141


>gi|255543226|ref|XP_002512676.1| conserved hypothetical protein [Ricinus communis]
 gi|223548637|gb|EEF50128.1| conserved hypothetical protein [Ricinus communis]
          Length = 145

 Score =  241 bits (614), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 116/138 (84%), Positives = 124/138 (89%), Gaps = 3/138 (2%)

Query: 2   WVTAIALLLAFVS---ADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLK 58
           W+  + L+  F++   A   P+ ERISGSAGDVLEDDPVG+LKVYVYELPSKYNKKLL K
Sbjct: 6   WIFLLLLVSCFINRTFAHQKPRTERISGSAGDVLEDDPVGRLKVYVYELPSKYNKKLLQK 65

Query: 59  DPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMM 118
           DPRCLTHMFAAEIFMHRFLLSSPVRTLNP+EADWFYTPIY TCDLTPTGLPLPFKSPRMM
Sbjct: 66  DPRCLTHMFAAEIFMHRFLLSSPVRTLNPDEADWFYTPIYTTCDLTPTGLPLPFKSPRMM 125

Query: 119 RSAIQLISSNWPYWNRTE 136
           RSAIQLISSNWPYWNRTE
Sbjct: 126 RSAIQLISSNWPYWNRTE 143


>gi|289166880|gb|ADC84490.1| glycosyltransferase family 47C [Salix miyabeana]
          Length = 252

 Score =  223 bits (569), Expect = 9e-56,   Method: Compositional matrix adjust.
 Identities = 105/244 (43%), Positives = 153/244 (62%), Gaps = 3/244 (1%)

Query: 117 MMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQT 176
           ++ SA+ LISSN P+WNR+ G+DH FV  HD+GACFH  EE+A E GI   L+R+ ++QT
Sbjct: 5   LLSSAVNLISSNHPFWNRSRGSDHVFVASHDYGACFHAMEERAAEDGIPEFLKRSIILQT 64

Query: 177 FGQR-NHVCLNEGSITIPPYAPPQKMQA--HQIPPDTPRSIFVYFRGLFYDVNNDPEGGY 233
           FG + +H C +  ++ IPP+  P+ +Q    + P    R I+V+FRG       +  G Y
Sbjct: 65  FGVKFDHPCQDVENVVIPPFITPESVQTTLEKYPLTGRRDIWVFFRGKMEVHPKNISGRY 124

Query: 234 YARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGC 293
           Y++  R  +W  +  +P F +       Y  ++ R++FCLCPLGWAPWSPRLVE++  GC
Sbjct: 125 YSKKVRTVIWRKYSGDPRFYLRRHRFAGYQSEIARSVFCLCPLGWAPWSPRLVESIALGC 184

Query: 294 IPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKR 353
           +PVIIAD I LPF  A+ W +I + VAE+DV  L T+L  +    +   Q+ L  P ++R
Sbjct: 185 VPVIIADGIRLPFPAAVRWSDISLTVAEKDVADLGTLLDHVAASNLSAIQKNLWAPDVRR 244

Query: 354 AMLF 357
           A+LF
Sbjct: 245 ALLF 248


>gi|289166878|gb|ADC84489.1| glycosyltransferase family 47C [Salix sachalinensis]
          Length = 252

 Score =  223 bits (568), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 105/244 (43%), Positives = 153/244 (62%), Gaps = 3/244 (1%)

Query: 117 MMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQT 176
           ++ SA+ LISSN P+WNR+ G+DH FV  HD+GACFH  EE+A E GI   L+R+ ++QT
Sbjct: 5   LLSSAVNLISSNHPFWNRSRGSDHVFVASHDYGACFHAMEERAAEDGIPEFLKRSIILQT 64

Query: 177 FGQR-NHVCLNEGSITIPPYAPPQKMQA--HQIPPDTPRSIFVYFRGLFYDVNNDPEGGY 233
           FG + +H C +  ++ IPP+  P+ +Q    + P    R I+V+FRG       +  G Y
Sbjct: 65  FGVKFDHPCQDVENVVIPPFITPESVQTTLEKYPLTGRRDIWVFFRGKMEVHPKNISGRY 124

Query: 234 YARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGC 293
           Y++  R  +W  +  +P F +       Y  ++ R++FCLCPLGWAPWSPRLVE++  GC
Sbjct: 125 YSKKVRTVIWRKYSGDPRFYLRRHRFAGYQSEIARSVFCLCPLGWAPWSPRLVESIALGC 184

Query: 294 IPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKR 353
           +PVIIAD I LPF  A+ W +I + VAE+DV  L T+L  +    +   Q+ L  P ++R
Sbjct: 185 VPVIIADGIRLPFPAAVRWSDISLTVAEKDVADLRTLLDHVAASNLSAIQKNLWAPDVRR 244

Query: 354 AMLF 357
           A+LF
Sbjct: 245 ALLF 248


>gi|62319307|dbj|BAD94554.1| hypothetical protein [Arabidopsis thaliana]
          Length = 114

 Score =  208 bits (530), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 99/114 (86%), Positives = 108/114 (94%)

Query: 293 CIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMK 352
           CIPVIIADDIVLPFADAIPWE+IG+FV E+DVP LDTILTSIPPEVILRKQRLLANPSMK
Sbjct: 1   CIPVIIADDIVLPFADAIPWEDIGVFVDEKDVPYLDTILTSIPPEVILRKQRLLANPSMK 60

Query: 353 RAMLFPQPAQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
           +AMLFPQPAQPGDAFHQ+LNGLARKLPH++S YL+PG K LNWT+GPV DLKPW
Sbjct: 61  QAMLFPQPAQPGDAFHQVLNGLARKLPHERSVYLRPGEKLLNWTAGPVADLKPW 114


>gi|118481039|gb|ABK92473.1| unknown [Populus trichocarpa]
          Length = 239

 Score =  189 bits (479), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 96/237 (40%), Positives = 138/237 (58%), Gaps = 3/237 (1%)

Query: 156 EEKAIERGILPLLQRATLVQTFGQR-NHVCLNEGSITIPPYAPPQKMQA--HQIPPDTPR 212
           EE+A+E GI   L+R+ ++QTFG + NH C +  ++ IPPY  P++++      P +  R
Sbjct: 2   EERAMEDGIPEFLKRSIILQTFGVKFNHPCQDVENVVIPPYISPERVRTTLENYPLNGRR 61

Query: 213 SIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFC 272
            I+ +FRG       +  G YY++  R  +W  +  +  F +       Y  ++ R++FC
Sbjct: 62  DIWAFFRGKMEVHPKNISGRYYSKKVRTVIWRKYSGDRRFYLQRHRFAGYQSEIVRSVFC 121

Query: 273 LCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILT 332
           LCPLGWAPWSPRLVE+V  GC+PVIIAD I LPF  A+ W EI + VAE+DV  L T+L 
Sbjct: 122 LCPLGWAPWSPRLVESVALGCVPVIIADGIRLPFPTAVRWSEISLTVAEKDVANLGTLLD 181

Query: 333 SIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDKSTYLKPG 389
            +    +   Q+ L +P ++RA+LF    Q GDA  Q+L  LARKL     T   P 
Sbjct: 182 HVAATNLSAIQKNLWDPDVRRALLFNDRVQEGDATWQVLYALARKLDRSYRTVRLPN 238


>gi|412991334|emb|CCO16179.1| predicted protein [Bathycoccus prasinos]
          Length = 558

 Score =  180 bits (457), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 113/361 (31%), Positives = 185/361 (51%), Gaps = 32/361 (8%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
            K++VY LP K++ ++L K+ RC+T  +  EI +H  ++ S + TL+P EA++FY P+Y 
Sbjct: 208 FKIFVYNLPPKFHVEMLKKNKRCVTDQYGTEIRIHANIMQSKMYTLDPLEAEFFYVPVYG 267

Query: 100 TCDLTPTGLPLPFKSPRMMRSA-----IQLISSNWPYWNRTEGADHFFVVPHDFGACFHY 154
            C L      L  K      +A     ++L++  +P+WNRT+G DH F      G     
Sbjct: 268 ECKLFENIATLGAKKGLQETNAWWLEAMKLVTDQYPFWNRTQGRDHVFTFAGARGPHIFK 327

Query: 155 QEEKAIERGIL--PLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQI--PPDT 210
             ++ I++ I   P   R +L + F            I IP   P +   +  +    + 
Sbjct: 328 DWKRHIKKSIFLTPEGDR-SLSEQFNTWK-------DIVIPGLEPEKAFWSGSLRKQKEV 379

Query: 211 PRS-IFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP----TTYYED 265
            R+  F YFRG       +  G  Y++G R  + E FK+      +  H     T Y E+
Sbjct: 380 KRAKTFAYFRGTI----ANKLGKQYSKGIRIKMKEAFKDIKDVVFTEQHSSCDKTCYREE 435

Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
           M+ + FCLCP GW+PW+ R  +A++ GCIPVIIAD+I  P+ ++  W ++ + + E+   
Sbjct: 436 MRASTFCLCPRGWSPWTLRAYQALMVGCIPVIIADEIEFPYENSFDWRQVSIKIPEKRHL 495

Query: 326 KLDTILTSIPPEVILRKQRLLAN--PSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDKS 383
           +   IL S+P +V+ RK++ +A   PS    + + +PA   DAFH ++  L RK    KS
Sbjct: 496 ETIDILRSVPDDVVERKRKAMAKFWPS----VAWKKPAADDDAFHLVMKELERKKRGFKS 551

Query: 384 T 384
           +
Sbjct: 552 S 552


>gi|293336592|ref|NP_001167834.1| uncharacterized protein LOC100381534 [Zea mays]
 gi|223944319|gb|ACN26243.1| unknown [Zea mays]
          Length = 241

 Score =  177 bits (449), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 94/238 (39%), Positives = 136/238 (57%), Gaps = 3/238 (1%)

Query: 156 EEKAIERGILPLLQRATLVQTFG-QRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSI 214
           E+ AI  GI   L+R+ L+QTFG Q +HVC     + IPP+ PP+       P    R I
Sbjct: 2   EDVAIADGIPEFLKRSILLQTFGVQGHHVCQEVEHVVIPPHVPPEVAHELPEPEKAQRDI 61

Query: 215 FVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLC 274
           F +FRG       +  G +Y++  R  + +++  N  F +       Y  +M R++FCLC
Sbjct: 62  FAFFRGKMEVHPKNISGRFYSKKVRTELLQHYGRNRKFYLKRKRFDNYRSEMARSLFCLC 121

Query: 275 PLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSI 334
           PLGWAPWSPRLVE+V+ GCIPVIIADDI LPF   + W+EI + VAE+D+  L  +L  +
Sbjct: 122 PLGWAPWSPRLVESVLLGCIPVIIADDIRLPFPPVLQWQEISLQVAEKDIASLGMVLDHV 181

Query: 335 PPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDKSTYLKPGGKF 392
               +   Q+ L +P  +RA++F +P + GDA  Q+L  L   L  D+S   +  G +
Sbjct: 182 VATNLTVIQKNLWDPVKRRALVFNRPMEAGDATWQVLRELEVLL--DQSRRRRSVGSY 237


>gi|388496634|gb|AFK36383.1| unknown [Medicago truncatula]
          Length = 316

 Score =  172 bits (435), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 84/217 (38%), Positives = 136/217 (62%), Gaps = 5/217 (2%)

Query: 23  RISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPV 82
           + + S+   L+   +  LK++VY+LP KYNK  L K+PRC TH+FA+E+ +HR LL+S V
Sbjct: 90  KKTTSSPQTLQKTSLKNLKIFVYDLPPKYNKNWL-KNPRCKTHLFASEVAIHRALLTSDV 148

Query: 83  RTLNPEEADWFYTPIYPTCDL-TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 141
           RT +P EAD+F+ P+Y +C+  T  G P    +  ++ SA++LIS+ +P+WNR+ G+DH 
Sbjct: 149 RTFDPYEADFFFVPVYVSCNFSTVNGFPAIGHARSLISSAVKLISTEYPFWNRSTGSDHV 208

Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG-QRNHVCLNEGSITIPPYAPPQK 200
           FV  HDFG+CFH  E+ A++ G+  + + + ++QTFG   +H C     + IPP+  P+ 
Sbjct: 209 FVASHDFGSCFHTLEDVAMKDGVPEITKNSIVLQTFGVTYDHPCQKVEHVVIPPFVSPES 268

Query: 201 MQA--HQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYA 235
           ++      P +  R I+V+FRG       +  G +Y+
Sbjct: 269 VRNTLENFPVNGRRDIWVFFRGKMEVHPKNVSGRFYS 305


>gi|412993472|emb|CCO13983.1| predicted protein [Bathycoccus prasinos]
          Length = 614

 Score =  170 bits (431), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 110/352 (31%), Positives = 175/352 (49%), Gaps = 25/352 (7%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
            K++VY+L  ++N  L    PRC T  +  EI  H  LL   V T +PEEA++F+ PIY 
Sbjct: 260 FKIFVYDLKPEFNADLARDQPRCRTDQYGTEIRFHENLLHHSVLTNDPEEAEFFFVPIYG 319

Query: 100 TC----DLTPTGLPLPFKSPRM-MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHY 154
            C    +   +G     K   +  R A++ I + +PYWNRT+G DH +  P   G     
Sbjct: 320 ECYLFRETQNSGTNNAMKVTNLWYRDALKTIQTEYPYWNRTDGRDHVWSFPGARGPHIFR 379

Query: 155 QEEKAIERGIL--PLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTP- 211
             +K I++ I   P   R     +FG++ +   +   I IP   P  +    ++   +  
Sbjct: 380 DWKKLIKKSIFLTPEGDR-----SFGEQFNTWKD---IVIPGLEPDSEFIDGKLRKQSSL 431

Query: 212 -RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT----YYEDM 266
            + IF +FRG    + N      Y+RG R  +   FK +     + + P+     Y +++
Sbjct: 432 KKDIFAFFRGT---ILNKAGILAYSRGIRPKMEAAFKKHKDVIFTEEIPSCDRDCYRKEL 488

Query: 267 QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK 326
           +++ FCLCP GW+PW+ R  +A++ GCIPVIIAD+I LP+ +++ W ++ + +AE D  K
Sbjct: 489 RKSTFCLCPRGWSPWTLRAYQAMMVGCIPVIIADEIELPYENSLDWTKLSVKIAEVDAEK 548

Query: 327 LDTILTSIPPEVILRKQRLLANPSMKRAM-LFPQPAQPGDAFHQILNGLARK 377
              IL  I    I  KQ+ +       A    P+   P DA   +L+ L RK
Sbjct: 549 TIDILKQISKSEIRNKQKAIEKVWKSVAWGSNPKKLDPMDAMECVLHELGRK 600


>gi|297598226|ref|NP_001045264.2| Os01g0926700 [Oryza sativa Japonica Group]
 gi|255674019|dbj|BAF07178.2| Os01g0926700, partial [Oryza sativa Japonica Group]
          Length = 113

 Score =  161 bits (407), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 75/83 (90%), Positives = 79/83 (95%)

Query: 154 YQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRS 213
           +QEEKAI RGILPLLQRATLVQTFGQ+NHVCL +GSITIPPYAPPQKMQAH IPPDTPRS
Sbjct: 31  FQEEKAIGRGILPLLQRATLVQTFGQKNHVCLKDGSITIPPYAPPQKMQAHLIPPDTPRS 90

Query: 214 IFVYFRGLFYDVNNDPEGGYYAR 236
           IFVYFRGLFYD +NDPEGGYYAR
Sbjct: 91  IFVYFRGLFYDTSNDPEGGYYAR 113


>gi|428181360|gb|EKX50224.1| hypothetical protein GUITHDRAFT_104038 [Guillardia theta CCMP2712]
          Length = 723

 Score =  150 bits (380), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 111/385 (28%), Positives = 189/385 (49%), Gaps = 67/385 (17%)

Query: 42  VYVYELPSKYNKKLL-LKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPT 100
           V+VY++  ++N  +L L   +C   ++AAE+++H  LL S   TL+P  AD F+ P+Y  
Sbjct: 346 VFVYDIHPQWNSAMLSLNMQQCRNSIYAAEVYIHEQLLLSDSLTLDPGAADLFFIPLYAA 405

Query: 101 CDLTPTGL-PLPF---------KSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGA 150
           C L+   + P P          K+ + ++  ++ +   +P+++R+ GADH  V+  D+G+
Sbjct: 406 CFLSSHFVRPGPGWPDNDVDIGKTYQAVQLVLEHVRQTYPFFDRSAGADHVLVLSSDWGS 465

Query: 151 CFHYQEEKAIERGILPLLQRATLVQTFGQR-------------NHVCLNE---------- 187
           C          +G    L  + L+ T G R             +H+  +E          
Sbjct: 466 C----------QGPFLELHNSILLVTSGDRTLVRPAWYAARAADHMGSSEEFAVRSRLPC 515

Query: 188 ----GSITIPPYAPPQKMQAHQIPPDT-PRSIFVYFRG--------LFYDVNNDPEGGYY 234
                 + IPP  P   + A  +   T  R I VYFRG        L Y  N D     Y
Sbjct: 516 FQLFKDVVIPPLVPHPALTASYMGERTRGRDILVYFRGTAAGSVKALLY--NKD-----Y 568

Query: 235 ARGARAAVWENFKNNPLFDISTD-HPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGC 293
           + G R  +   +     + +S   + ++Y++++ R++FCL P GW  WS R  EA++ GC
Sbjct: 569 SLGIRQLLLRRYSRVRGWVVSDRINSSSYHDELLRSVFCLAPAGWELWSVRFFEAILLGC 628

Query: 294 IPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKR 353
           IPV++ DD+ LPF   + +    + V +  + +L++IL+SI   VI RKQ  L    + +
Sbjct: 629 IPVLLTDDVQLPFQQRLDYSRFTVKVEQRRILELESILSSINETVIRRKQEGL--KEVWK 686

Query: 354 AMLFPQPAQPGDAFHQILNGLARKL 378
            M + +P + GDAF  I++ LAR++
Sbjct: 687 RMTYQRPPEDGDAFTGIMDELARRV 711


>gi|361069335|gb|AEW08979.1| Pinus taeda anonymous locus CL2637Contig1_04 genomic sequence
 gi|361069337|gb|AEW08980.1| Pinus taeda anonymous locus CL2637Contig1_04 genomic sequence
 gi|376339236|gb|AFB34142.1| hypothetical protein CL2637Contig1_04, partial [Pinus cembra]
 gi|376339238|gb|AFB34143.1| hypothetical protein CL2637Contig1_04, partial [Pinus cembra]
 gi|376339240|gb|AFB34144.1| hypothetical protein CL2637Contig1_04, partial [Pinus cembra]
 gi|376339242|gb|AFB34145.1| hypothetical protein CL2637Contig1_04, partial [Pinus cembra]
 gi|376339244|gb|AFB34146.1| hypothetical protein CL2637Contig1_04, partial [Pinus mugo]
 gi|376339246|gb|AFB34147.1| hypothetical protein CL2637Contig1_04, partial [Pinus mugo]
 gi|376339248|gb|AFB34148.1| hypothetical protein CL2637Contig1_04, partial [Pinus mugo]
 gi|376339250|gb|AFB34149.1| hypothetical protein CL2637Contig1_04, partial [Pinus mugo]
 gi|383157350|gb|AFG60998.1| Pinus taeda anonymous locus CL2637Contig1_04 genomic sequence
 gi|383157351|gb|AFG60999.1| Pinus taeda anonymous locus CL2637Contig1_04 genomic sequence
 gi|383157352|gb|AFG61000.1| Pinus taeda anonymous locus CL2637Contig1_04 genomic sequence
 gi|383157353|gb|AFG61001.1| Pinus taeda anonymous locus CL2637Contig1_04 genomic sequence
 gi|383157354|gb|AFG61002.1| Pinus taeda anonymous locus CL2637Contig1_04 genomic sequence
 gi|383157355|gb|AFG61003.1| Pinus taeda anonymous locus CL2637Contig1_04 genomic sequence
 gi|383157356|gb|AFG61004.1| Pinus taeda anonymous locus CL2637Contig1_04 genomic sequence
 gi|383157357|gb|AFG61005.1| Pinus taeda anonymous locus CL2637Contig1_04 genomic sequence
 gi|383157358|gb|AFG61006.1| Pinus taeda anonymous locus CL2637Contig1_04 genomic sequence
 gi|383157359|gb|AFG61007.1| Pinus taeda anonymous locus CL2637Contig1_04 genomic sequence
 gi|383157360|gb|AFG61008.1| Pinus taeda anonymous locus CL2637Contig1_04 genomic sequence
 gi|383157361|gb|AFG61009.1| Pinus taeda anonymous locus CL2637Contig1_04 genomic sequence
 gi|383157362|gb|AFG61010.1| Pinus taeda anonymous locus CL2637Contig1_04 genomic sequence
 gi|383157363|gb|AFG61011.1| Pinus taeda anonymous locus CL2637Contig1_04 genomic sequence
 gi|383157364|gb|AFG61012.1| Pinus taeda anonymous locus CL2637Contig1_04 genomic sequence
 gi|383157365|gb|AFG61013.1| Pinus taeda anonymous locus CL2637Contig1_04 genomic sequence
 gi|383157366|gb|AFG61014.1| Pinus taeda anonymous locus CL2637Contig1_04 genomic sequence
 gi|383157367|gb|AFG61015.1| Pinus taeda anonymous locus CL2637Contig1_04 genomic sequence
          Length = 79

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 68/79 (86%), Positives = 75/79 (94%)

Query: 37  VGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP 96
           VG+LKV++YELPSKYN+K+L KDPRCLTHMFA EIFM+RFLLSSPVRTLNPEEADWFYTP
Sbjct: 1   VGRLKVFIYELPSKYNRKILAKDPRCLTHMFATEIFMNRFLLSSPVRTLNPEEADWFYTP 60

Query: 97  IYPTCDLTPTGLPLPFKSP 115
           +Y TCDLTP GLPLPFKSP
Sbjct: 61  VYTTCDLTPNGLPLPFKSP 79


>gi|376339230|gb|AFB34139.1| hypothetical protein CL2637Contig1_04, partial [Abies alba]
 gi|376339232|gb|AFB34140.1| hypothetical protein CL2637Contig1_04, partial [Abies alba]
 gi|376339234|gb|AFB34141.1| hypothetical protein CL2637Contig1_04, partial [Abies alba]
          Length = 79

 Score =  148 bits (373), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 67/79 (84%), Positives = 75/79 (94%)

Query: 37  VGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP 96
           VG+LKV++YELPSKYN+K+L KDPRCLTHMFA EIFM+RFLL+SPVRTLNPEEADWFYTP
Sbjct: 1   VGRLKVFIYELPSKYNRKILAKDPRCLTHMFATEIFMNRFLLASPVRTLNPEEADWFYTP 60

Query: 97  IYPTCDLTPTGLPLPFKSP 115
           +Y TCDLTP GLPLPFKSP
Sbjct: 61  VYTTCDLTPNGLPLPFKSP 79


>gi|412994079|emb|CCO14590.1| predicted protein [Bathycoccus prasinos]
          Length = 553

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 103/364 (28%), Positives = 166/364 (45%), Gaps = 22/364 (6%)

Query: 29  GDVLEDDPVGKL-KVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP--VRTL 85
            +V+E++      KV+VY++P +   +L  +  RC    +  EI+ HR        VRT+
Sbjct: 186 ANVIEEEESSSFPKVFVYDVPKQLTSELAKRYGRCERDQYGTEIWFHRNFRDDKNGVRTM 245

Query: 86  NPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSA-------IQLISSNWPYWNRTEGA 138
           NPEEAD F+ P Y  C L    +       + M          +  +    PY+NRT+G 
Sbjct: 246 NPEEADLFFVPQYGECFLWSREMLRHENQGQAMEETNEYFLEVLSHVKGKLPYFNRTDGR 305

Query: 139 DHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPP 198
           DH FV     G       +K I   I    +    +  F     + +         Y   
Sbjct: 306 DHIFVFAGARGPTIFRDWQKEIPHSIYLTPEGDRTLPQFDTWKDIVIPGLEYDKRMYLEE 365

Query: 199 QKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN--NPLFD--I 254
            + +    PP   R I   FRG      + P G  Y++G R  + + F+N  + ++D  I
Sbjct: 366 HRNELVTNPP--KRKILAMFRGTI----DHPAGFAYSKGLRPKLKKIFQNATDVIYDTKI 419

Query: 255 STDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEE 314
                  Y  +M  ++FCL PLGW PW+ R  +AV+  CIP+IIAD+I  PF   I + E
Sbjct: 420 KDCDRDCYVREMTESVFCLNPLGWTPWTLRFYQAVMTRCIPIIIADNIEFPFESEINYSE 479

Query: 315 IGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
             + + E+DV  +   +  +P E   R++R +    + +   + +PA+ GDA++  +  L
Sbjct: 480 FALKIPEKDVSDILETMRHMPEEERERRRRYM--DKIWKQFTYQRPAEIGDAYYSTVKEL 537

Query: 375 ARKL 378
           ARK+
Sbjct: 538 ARKV 541


>gi|224142563|ref|XP_002324625.1| predicted protein [Populus trichocarpa]
 gi|222866059|gb|EEF03190.1| predicted protein [Populus trichocarpa]
          Length = 336

 Score =  134 bits (337), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 92/309 (29%), Positives = 151/309 (48%), Gaps = 29/309 (9%)

Query: 79  SSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKS----PRMMRSAIQLISSNWPYWNR 134
            SP    +P+EA  F  PI     ++    P+   S     R+++  + +++  +PYWNR
Sbjct: 41  KSPFAASHPDEAHMFLLPISVAYIISYVYKPIVTYSRDELQRLVQDYVGVVADKYPYWNR 100

Query: 135 TEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPP 194
           ++GADHF V  HD+           + +  + +L  A   + F  R  V + E +I    
Sbjct: 101 SKGADHFLVSCHDWAPDIS-GANPDLYKNFIRVLCNANTSERFEPRRDVSIPEINIPNGK 159

Query: 195 YAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN----P 250
             PP K     +PP + RSIF +F G  +        GY     R  + EN+K+      
Sbjct: 160 LGPPHK----GLPP-SKRSIFAFFAGGAH--------GY----IRKVLLENWKDKDDEIQ 202

Query: 251 LFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAI 310
           + +      T Y+E M ++ FCLCP G+   SPR+V A+  GC+PV I+D+  LPF+D +
Sbjct: 203 VHEYLDKKGTDYFELMGQSKFCLCPSGYEVASPRVVTAIQLGCVPVTISDNYTLPFSDVL 262

Query: 311 PWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQI 370
            W +  + +  E +P++ TIL  I P+  L  Q  +    ++R     +PA+P D  H +
Sbjct: 263 DWSKFSVHIPSEKIPEIKTILKKISPQRYLMMQMRVIQ--VQRHFELNRPARPYDLLHML 320

Query: 371 LNGL-ARKL 378
           L+ +  R+L
Sbjct: 321 LHSVWVRRL 329


>gi|224120362|ref|XP_002331029.1| predicted protein [Populus trichocarpa]
 gi|222872959|gb|EEF10090.1| predicted protein [Populus trichocarpa]
          Length = 336

 Score =  133 bits (335), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 94/305 (30%), Positives = 145/305 (47%), Gaps = 29/305 (9%)

Query: 79  SSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKS----PRMMRSAIQLISSNWPYWNR 134
            SP    +P+EA  F  PI     +     PL   S     R+++  + +++  +PYWNR
Sbjct: 41  KSPFAASHPDEAHTFLLPISVAYIIHYIYRPLVTFSRVELQRLVQDYVTVVAGKYPYWNR 100

Query: 135 TEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPP 194
           TEGADHF V  HD+         + + +  + +L  A   + F  R  V + E +I    
Sbjct: 101 TEGADHFLVSCHDWAPDISRANPR-LYKNFIRVLCNANTSERFEPRRDVSIPEINIPFGK 159

Query: 195 YAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGY-YARGARAAVWENFKNNPLFD 253
           + PP K     +PP + RSIF +F G          G + Y R      W++  +     
Sbjct: 160 FGPPGK----GLPP-SKRSIFAFFAG----------GAHGYIRKLLLEHWKDKDDEIQVH 204

Query: 254 ISTDH--PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
              DH     Y++ M ++ FCLCP G+   SPR+V A+  GCIPV I+D+  LPF+D + 
Sbjct: 205 EYLDHNKKNDYFKLMGQSKFCLCPSGYEVASPRVVTAIQSGCIPVTISDNYTLPFSDVLD 264

Query: 312 WEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMK--RAMLFPQPAQPGDAFHQ 369
           W +  + +  E +P++ TIL  I      R+  +L    +K  R     +PAQP D  H 
Sbjct: 265 WSKFSVNIPSEKIPEIKTILKKIS----FRRYLILQGRVIKIRRHFKLNRPAQPYDMLHM 320

Query: 370 ILNGL 374
           IL+ +
Sbjct: 321 ILHSI 325


>gi|224143183|ref|XP_002324873.1| predicted protein [Populus trichocarpa]
 gi|222866307|gb|EEF03438.1| predicted protein [Populus trichocarpa]
          Length = 335

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 92/299 (30%), Positives = 137/299 (45%), Gaps = 20/299 (6%)

Query: 80  SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSA----IQLISSNWPYWNRT 135
           SP    NP+EA  F+ P+     L     P    S + ++      I LISS +PYWNR+
Sbjct: 42  SPFSAKNPDEALAFFIPVSIASILHFIYRPYVTYSRKQIQDIAEDYIGLISSKYPYWNRS 101

Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
            GADHF +  HD+           + R  + +L  A   + F     V L E  +     
Sbjct: 102 SGADHFMISCHDWAPDVS-AANPDLYRNFIRVLCNANSSEGFKPARDVSLPEFKLPRGKL 160

Query: 196 APPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDIS 255
            P   +Q    P D  RSI  +F G          G    R      W+   N+      
Sbjct: 161 EPEHILQ----PCDNNRSILAFFAG-------GSHGS--VRKILFKHWKEKDNDIQVYKY 207

Query: 256 TDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEI 315
                 Y E M ++ +CLCP GW   SPR+VEA+  GC+PVII+D  VLPF+D + W + 
Sbjct: 208 LPETLNYTEQMSKSRYCLCPSGWEVASPRVVEAIYSGCVPVIISDYYVLPFSDVLDWIKF 267

Query: 316 GMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
            + +    +P++ TIL SIP E  L KQ+ +    +++     +PA+P D  H +++ +
Sbjct: 268 SVHIPVSGIPEIKTILQSIPVEEYLEKQKRVLQ--VQQHFKLHRPAKPFDVVHMVMHSV 324


>gi|356565368|ref|XP_003550913.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
           max]
          Length = 534

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 107/351 (30%), Positives = 171/351 (48%), Gaps = 43/351 (12%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAE-IFMHRFLLSSPVRTLNPEEADWFYTP-- 96
           LKVY+Y   S+    +  K P  L  ++A+E  FM     +    T +PE+A  FY P  
Sbjct: 198 LKVYIYRDGSR---PIFHKPP--LKGIYASEGWFMKLMEENKQFVTKDPEKAHLFYLPYS 252

Query: 97  ---------IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHD 147
                    +  + DL P  +         +R  +  I++ +P+WNRT+G+DHF V  HD
Sbjct: 253 ARQMGLTLYVPGSHDLKPLSI--------FLRDYVNKIAAKYPFWNRTQGSDHFLVACHD 304

Query: 148 FGACFHYQEEKAIERGILPLLQRATLVQ-TFGQRNHVCLNEGSITIPPYAPPQKMQAHQI 206
           +G  +     + ++R  +  L  A L +  F     V L E +I   P  P + +  +++
Sbjct: 305 WGP-YTVTGHEELKRNTIKALCNADLSEGVFVAGRDVSLPETTIR-APRRPLRYLGGNRV 362

Query: 207 PPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM 266
              + R I  +F G  +     P    Y  G +    + +K  PL         TY + M
Sbjct: 363 ---SLRPILAFFAGSMHG-RVRPTLLTYWGGGKDEDMKIYKRLPL---RVSQRMTYIQHM 415

Query: 267 QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK 326
           + + +C+CP+G+   SPR+VEA+ + C+PVIIAD+ VLPF++ + W    + VAE+D+P+
Sbjct: 416 KSSKYCVCPMGFEVNSPRIVEAIYYECVPVIIADNFVLPFSEVLDWSAFSVVVAEKDIPR 475

Query: 327 LDTILTSIPPEVILRKQRLLANP---SMKRAMLFPQPAQPGDAFHQILNGL 374
           L  IL SIP    LRK   + N      K  +  P+P +  D FH IL+ +
Sbjct: 476 LKEILLSIP----LRKYLTMQNNVKMVQKHFLWNPRPIR-YDLFHMILHSI 521


>gi|357476811|ref|XP_003608691.1| hypothetical protein MTR_4g100730 [Medicago truncatula]
 gi|355509746|gb|AES90888.1| hypothetical protein MTR_4g100730 [Medicago truncatula]
          Length = 535

 Score =  131 bits (330), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 107/351 (30%), Positives = 167/351 (47%), Gaps = 43/351 (12%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAE-IFMHRFLLSSPVRTLNPEEADWFYTPIY 98
           LKVY+Y   S+     +  +P  L  ++A+E  FM     +    T +PE A  FY P  
Sbjct: 204 LKVYIYRDGSR----PIFHNP-SLKGIYASEGWFMKLMQENKQFVTKDPERAHLFYLPYS 258

Query: 99  P-----------TCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHD 147
                       + DL P  +         +R  +  I++ +P+WNRT G+DHF V  HD
Sbjct: 259 ARQMEVTLYVPGSHDLKPLSI--------FLRDYVNKIAAKYPFWNRTHGSDHFLVACHD 310

Query: 148 FGACFHYQEEKAIERGILPLLQRATLVQ-TFGQRNHVCLNEGSITIPPYAPPQKMQAHQI 206
           +G  +   E + + R  L  L  A L +  F +   V L E +I   P  P + +  ++ 
Sbjct: 311 WGP-YTVTEHEELARNTLKALCNADLSERIFIEGRDVSLPETTIR-APRRPLRYLGGNRA 368

Query: 207 PPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM 266
              + R I  +F G  +     P    Y  G +    + +K  PL         TY + M
Sbjct: 369 ---SLRPILAFFAGSMHG-RVRPTLLKYWGGEKYEDMKIYKRLPL---RVSKKMTYIQHM 421

Query: 267 QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK 326
           + + +CLCP+G+   SPR+VEA+ + C+PVIIAD+ VLP ++ + W    + VAE+D+P+
Sbjct: 422 KSSKYCLCPMGFEVNSPRIVEAIYYECVPVIIADNFVLPLSEVLDWSAFSVVVAEKDIPR 481

Query: 327 LDTILTSIPPEVILRKQRLLANP---SMKRAMLFPQPAQPGDAFHQILNGL 374
           L  IL SIP    +RK   + N      K  +  P+P +  D FH IL+ +
Sbjct: 482 LKDILLSIP----MRKYVAMQNNVKMVQKHFLWNPKPIR-YDLFHMILHSI 527


>gi|302829555|ref|XP_002946344.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
 gi|300268090|gb|EFJ52271.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
          Length = 785

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 99/374 (26%), Positives = 163/374 (43%), Gaps = 73/374 (19%)

Query: 42  VYVYELPSKYNKKLL----------------LKDPRCLTHMFAAEIFMHRFLLSSPVRTL 85
           +YVY++P  Y+ ++L                  D   L+  ++ E+++H  +L S  RT 
Sbjct: 342 IYVYDMPPAYHSRMLQYRIGSDACMWRRFSEANDTYLLSMTYSVEVYLHEMMLQSEHRTF 401

Query: 86  NPEEADWFYTPIYPTCDL-------------TPTGLPLPFKSPRMMRSAIQLISSNWPYW 132
           +PEEAD+FY P+Y TC +              P     P     M+  A + +S+ +P+W
Sbjct: 402 DPEEADFFYVPMYITCFMWPVMGWADFPWWHAPLAHTRPMHVSNMILEAYEWLSTTFPWW 461

Query: 133 NRTEGADHFFVVPHDFGACF--------------------HYQEEKAIERGI----LPLL 168
           NR  G DH +++  D GAC+                     ++   A ++ I    LP+ 
Sbjct: 462 NRRGGRDHIWLMAPDEGACYMPTVVYNSSIILTHWGRMDPDHKSGSAFDQDIYDKDLPVA 521

Query: 169 QRA---TLVQTFGQRNHVCLN-EGSITIPPYAPPQKMQAHQI--PPDTPRSIFVYFRGLF 222
           Q      L      R H+C N E  + IP +  P   Q   +   P   R I +YFRG  
Sbjct: 522 QFKGWRGLDWMEKSRPHLCYNPEKDLVIPAFKSPDHFQESPLLGAPPLERDILLYFRGDV 581

Query: 223 YDVNNDPEGGYYARGARA-----AVWENFKNNPLFDISTDHPT--TYYEDMQRAIFCLCP 275
            +   D    +Y+RG R      A W  +       I T      +Y E + R+ FCL  
Sbjct: 582 GEGRRD----HYSRGIRQKLFQFAHWGKWAEKYKIYIGTGETIGGSYSEHLARSKFCLVA 637

Query: 276 LGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED--VPKLDTILTS 333
            G   WS R  +A++ GC+P+++ D +   F   + W+   + + E++  +  +  +LT+
Sbjct: 638 PG-DGWSARAEDAILHGCVPLVVMDGVHAVFESILDWDSFSIRIREDNQALQAIPELLTA 696

Query: 334 IPPEVILRKQRLLA 347
           I PE + + QR LA
Sbjct: 697 ISPERLAKMQRNLA 710


>gi|320164161|gb|EFW41060.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 409

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 92/327 (28%), Positives = 146/327 (44%), Gaps = 61/327 (18%)

Query: 39  KLKVYVYELPSKYNKKLL---------LKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEE 89
            L+++VY LP+++N +++         ++DP C T  +++E   H+ LL SP+RTLNP +
Sbjct: 24  SLRIFVYPLPAEFNTRVIEHNLAHPPDMRDPVCTTSFYSSEWAFHQLLLDSPLRTLNPRD 83

Query: 90  ADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSN---------WPY------WNR 134
           AD++Y P+Y TC         P  S  +  +A+  I+S+         W Y      WN 
Sbjct: 84  ADYYYVPVYGTCHGFNRMAVQPNASAELFSAALDWITSHGSIPRDTLPWRYDPYSPDWNS 143

Query: 135 --------TEG---------ADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTF 177
                   T G          DH ++     GA            G    ++ A  +   
Sbjct: 144 LGTIEQVATRGEYPPFPAFAQDHLWLFSQGHGAKLF---------GDYSRIKNAVFLTAN 194

Query: 178 GQRNHVCLN-EGSITIPP----YAPP-----QKMQAHQIPPDTPRSIFVYFRGLFYDV-N 226
           GQ +         +TIPP    Y P      + +   ++     R     F G       
Sbjct: 195 GQLSAAEFTLAKDVTIPPRLTHYVPTPIYANKSVDELEVILTGQRPTLACFGGTKLPCFV 254

Query: 227 NDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLV 286
           ND  G  ++RG R  + E F  +P F I     + Y + ++ + FCLCP GW  W+PR+ 
Sbjct: 255 NDARGSCHSRGVRPYLKETFSKHPDFRILGIRSSGYEKALRSSTFCLCPEGWHAWTPRVF 314

Query: 287 EAVVFGCIPVIIADDIVLPFADAIPWE 313
           EA++ GCIPV+I+DD+ LPF   I ++
Sbjct: 315 EAILSGCIPVLISDDLALPFESLIDYD 341


>gi|102140005|gb|ABF70140.1| exostosin family protein [Musa balbisiana]
          Length = 533

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 102/344 (29%), Positives = 167/344 (48%), Gaps = 30/344 (8%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLN-PEEADWFYTPIY 98
           LKVY+YE       K L   P  L  ++A+E +  + +  +    +N P +A  FY P Y
Sbjct: 201 LKVYIYE----DGPKPLCHTPH-LDGIYASEGWFMKLMEENTQFVVNDPNKAHLFYLP-Y 254

Query: 99  PTCDLTPTGLPLPFKSPR----MMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHY 154
            +  L         +S R     +R  +  IS+ +P+WNRT GADHF V  HD+ A +  
Sbjct: 255 SSRQLRTHLYVAGSRSMRPLSIFLRDYVNSISAKYPFWNRTRGADHFLVACHDW-ATYTT 313

Query: 155 QEEKAIERGILPLLQRATLVQ-TFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRS 213
              + + +  + ++  A + +  F +   V L E  +  P    P+K    +  P + RS
Sbjct: 314 NLHEELRKNTIKVVCNADVSEGVFVRGKDVSLAETYVRTP--NSPRKAIGGR--PASRRS 369

Query: 214 IFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDH---PTTYYEDMQRAI 270
           I  +F G  +     P    Y RG         ++  ++++  D      +Y E M+ + 
Sbjct: 370 ILAFFAGQMHG-RVRPILLRYWRGRD-------RDMRIYEVLPDEIAAKMSYIEHMKSSK 421

Query: 271 FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTI 330
           FC+CP+G+   SPR+VEA+ + C+PVIIA++ VLPF + + W    + VAE+D+PKL  I
Sbjct: 422 FCICPMGYEVNSPRIVEAIYYDCVPVIIANNFVLPFEEVLDWGAFSVVVAEKDIPKLKQI 481

Query: 331 LTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           L  I     +R QR +    +++  L+       D FH IL+ +
Sbjct: 482 LLGISGRRYVRMQRNVRR--LRKHFLWNDKPVKYDLFHMILHSI 523


>gi|359484704|ref|XP_002264922.2| PREDICTED: probable glycosyltransferase At5g20260-like [Vitis
           vinifera]
          Length = 467

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 91/302 (30%), Positives = 141/302 (46%), Gaps = 39/302 (12%)

Query: 86  NPEEADWFYTPIY----------PTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRT 135
           +P+EA  FY PI           P  D +   L       R++   I ++S  +PYWNR+
Sbjct: 181 HPDEAHVFYIPISVTRIAHYIYSPPVDYSGHMLQ------RLVTDYIYVVSDKYPYWNRS 234

Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
            GADHF V  HD+           + +  + +L  A   + F     + L E +I     
Sbjct: 235 NGADHFLVSCHDWAPEISIVTPD-LYKHFIRVLCNANTSERFQPIRDISLPEVNIPKGKL 293

Query: 196 APPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFK-NNPLFDI 254
            PP     H   P   R I  +F G         E GY     R  ++ ++K N+    +
Sbjct: 294 GPP-----HLDKPPNQRHILAFFAGR--------ESGY----MRTLLFRSWKENDDEVQV 336

Query: 255 STDHPTT--YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 312
               P+   Y + M  + FCLCP GW   SPR+VEA+  GC+PVII D  VLPF++ + W
Sbjct: 337 YEHLPSNRDYAKSMGDSKFCLCPSGWEVASPRVVEAIAAGCVPVIICDYYVLPFSEVLVW 396

Query: 313 EEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILN 372
            +  + +  + +P++  IL ++P E  LR Q+ +    ++R  +  +PAQP D  H IL+
Sbjct: 397 SKFSINITSDKIPEIKKILKAVPNERYLRMQKRVKQ--VQRHFVINRPAQPYDMLHMILH 454

Query: 373 GL 374
            +
Sbjct: 455 SV 456


>gi|79326862|ref|NP_001031828.1| putative glycosyltransferase [Arabidopsis thaliana]
 gi|292630778|sp|Q9FFN2.2|GLYT3_ARATH RecName: Full=Probable glycosyltransferase At5g03795
 gi|332003272|gb|AED90655.1| putative glycosyltransferase [Arabidopsis thaliana]
          Length = 518

 Score =  128 bits (322), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 105/359 (29%), Positives = 154/359 (42%), Gaps = 55/359 (15%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP-- 96
           + K+YVY    K  +  L  D  C +       F++     +  RT NP++A  FY P  
Sbjct: 189 QFKIYVY----KEGEPPLFHDGPCKSIYSMEGSFIYEIETDTRFRTNNPDKAHVFYLPFS 244

Query: 97  -------IYP--TCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHD 147
                  +Y   + D +P            ++  I L+   +PYWNR+ GADHF +  HD
Sbjct: 245 VVKMVRYVYERNSRDFSPIR--------NTVKDYINLVGDKYPYWNRSIGADHFILSCHD 296

Query: 148 FGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNE-----GSITIPPYAPPQKMQ 202
           +G    +         I  L   A   + F  R  V + E     GS+T           
Sbjct: 297 WGPEASFSHPHLGHNSIRALCN-ANTSERFKPRKDVSIPEINLRTGSLT----------G 345

Query: 203 AHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYA--RGARAAVWENFKNNPLFDISTDHPT 260
               P  + R I  +F            GG +   R      WEN  N+          T
Sbjct: 346 LVGGPSPSSRPILAFF-----------AGGVHGPVRPVLLQHWENKDNDIRVHKYLPRGT 394

Query: 261 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
           +Y + M+ + FC+CP G+   SPR+VEA+  GC+PV+I    V PF+D + W    + V+
Sbjct: 395 SYSDMMRNSKFCICPSGYEVASPRIVEALYSGCVPVLINSGYVPPFSDVLNWRSFSVIVS 454

Query: 321 EEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL-ARKL 378
            ED+P L TILTSI P   LR  R +    ++R      PA+  D FH IL+ +  R+L
Sbjct: 455 VEDIPNLKTILTSISPRQYLRMYRRVLK--VRRHFEVNSPAKRFDVFHMILHSIWVRRL 511


>gi|297810461|ref|XP_002873114.1| hypothetical protein ARALYDRAFT_487158 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297318951|gb|EFH49373.1| hypothetical protein ARALYDRAFT_487158 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 519

 Score =  128 bits (322), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 104/351 (29%), Positives = 151/351 (43%), Gaps = 39/351 (11%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP-- 96
           + K+YVY    K  +  L  D  C +       F++     +  RT NP++A  FY P  
Sbjct: 190 QFKIYVY----KEGEPPLFHDGPCKSIYSMEGSFIYEMETDTRFRTNNPDKAHAFYLPFS 245

Query: 97  IYPTCDLTPTGLPLPFKSPR-MMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQ 155
           +              F   R  +R  I L+   +PYWNR+ GADHF +  HD+G    + 
Sbjct: 246 VVKMVRYVYERNSRDFSPIRNTVRDYINLVGDKYPYWNRSIGADHFILSCHDWGPEASFS 305

Query: 156 EEKAIERGILPLLQRATLVQTFGQRNHVCLNE-----GSITIPPYAPPQKMQAHQIPPDT 210
                   I  L   A   + F  R  V + E     GS+T     P            +
Sbjct: 306 HPHLGHNSIRALCN-ANTSEKFKPRKDVSIPEINLRTGSLTGLVGGPSP----------S 354

Query: 211 PRSIFVYFRGLFYDVNNDPEGGYYA--RGARAAVWENFKNNPLFDISTDHPTTYYEDMQR 268
            R I  +F            GG +   R      WEN  N+          T+Y + M+ 
Sbjct: 355 SRPILAFF-----------AGGVHGPVRPVLLEHWENKDNDIRVHKYLPRGTSYSDMMRN 403

Query: 269 AIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLD 328
           + FC+CP G+   SPR+VEA+  GC+PV+I    V PF+D + W    + V+ ED+P L 
Sbjct: 404 SKFCICPSGYEVASPRIVEALYSGCVPVLINSGYVPPFSDVLNWRSFSVIVSVEDIPNLK 463

Query: 329 TILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL-ARKL 378
           TILT+I P   LR  R +    ++R      PA+  D FH IL+ +  R+L
Sbjct: 464 TILTAISPRQYLRMYRRVLK--VRRHFEVNSPAKRFDVFHMILHSIWVRRL 512


>gi|359484708|ref|XP_002264880.2| PREDICTED: probable glycosyltransferase At5g20260-like [Vitis
           vinifera]
          Length = 467

 Score =  128 bits (322), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 81/262 (30%), Positives = 129/262 (49%), Gaps = 23/262 (8%)

Query: 116 RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ 175
           R++   I ++S+ +PYWNR+ GADHF V  HD+           + +  + +L  A   +
Sbjct: 215 RLVTDYIYVVSNKYPYWNRSNGADHFLVSCHDWAPEISIVTPD-LYKHFIRVLCNANTSE 273

Query: 176 TFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYA 235
            F     + L E +I      PP     H   P   R I  +F G         E GY  
Sbjct: 274 RFQPIRDISLPEVNIPKGKLGPP-----HLDKPPNQRHILAFFSGR--------ESGY-- 318

Query: 236 RGARAAVWENFK-NNPLFDISTDHPTT--YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFG 292
              R  ++ ++K N+    +    P+   Y + M  + FCLCP GW   SPR+VEA+  G
Sbjct: 319 --MRTLLFRSWKENDDEVQVYEHLPSNRDYAKSMVDSKFCLCPSGWEVASPRVVEAIAAG 376

Query: 293 CIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMK 352
           C+PVII D  VLPF++ + W +  + +  + +P++  IL ++P E  LR Q+ +    ++
Sbjct: 377 CVPVIICDYYVLPFSEVLDWSKFSINITSDKIPEIKKILKAVPNERYLRMQKRVKQ--VQ 434

Query: 353 RAMLFPQPAQPGDAFHQILNGL 374
           R  +  +PAQP D  H IL+ +
Sbjct: 435 RHFVINRPAQPYDMLHMILHSV 456


>gi|356533401|ref|XP_003535253.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
           max]
          Length = 523

 Score =  128 bits (321), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 99/353 (28%), Positives = 160/353 (45%), Gaps = 52/353 (14%)

Query: 39  KLKVYVYE---LPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYT 95
           + KV+VYE   LP       +  D  C +       F+H   ++   RT +P++A+ F+ 
Sbjct: 185 QFKVFVYEEGELP-------VFHDGPCSSIYSTEGSFIHAIEMNEHFRTRDPKKANVFFL 237

Query: 96  P-----------IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVV 144
           P           I  + D  P          R +R  + +I++ +PYWNR+ GADHF + 
Sbjct: 238 PFSIAWMVRYVYIRNSYDFGPI--------KRTVRDYVNVIATRYPYWNRSLGADHFMLS 289

Query: 145 PHDFGACFHYQEEKAI---ERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
            HD+G     +  K+I    +  + +L  A   + F       + + S       P  K 
Sbjct: 290 CHDWGP----ETSKSIPYLRKNSIRVLCNANTSEGFDP-----IKDASFPEINLQPGLKD 340

Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
                PP + RSI  +F G     N+ P      R      WEN   +           +
Sbjct: 341 SFVGGPPASKRSILAFFAG----GNHGP-----IRPILLEHWENKDEDIQVHKYLPKGVS 391

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           YY  ++ + FCLCP G+   SPR+VEA+  GC+PV+I++  V PF+D + W+   + V+ 
Sbjct: 392 YYGMLRNSKFCLCPSGYEVASPRVVEAIYTGCVPVLISEHYVPPFSDVLNWKMFSVNVSV 451

Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           +++P L  ILTSI P   +R Q+ +    ++R      P +  D FH IL+ +
Sbjct: 452 KEIPNLKDILTSISPRQYIRMQKRVGQ--IRRHFEVHSPPKRYDVFHMILHSV 502


>gi|297802132|ref|XP_002868950.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297314786|gb|EFH45209.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 428

 Score =  128 bits (321), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 107/351 (30%), Positives = 159/351 (45%), Gaps = 52/351 (14%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
           + KVY+Y  P   +     + PR +T  +A+E +  + +  S  RTL+PEEAD F+ PI 
Sbjct: 106 RFKVYIY--PDG-DPNTFYQTPRKVTGKYASEGYFFQNIRESRFRTLDPEEADLFFIPI- 161

Query: 99  PTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
            +C               ++++ +  + + +PYWNRT GADHFFV  HD G        +
Sbjct: 162 -SCHKMRGKGTSYENMTVIVQNYVDGLIAKYPYWNRTLGADHFFVTCHDVGV-------R 213

Query: 159 AIERGILPLLQRATLVQTFGQRNHVC---LNEGSITIPPYAPPQKMQAHQIPPDTPRSIF 215
           A E G L L++    V        VC    N G I     A PQ +Q   +P        
Sbjct: 214 AFE-GSLLLIKNTIRV--------VCSPSYNVGFIPHKDVALPQVLQPFALPAGGN---- 260

Query: 216 VYFRGLFYDVNNDPEGGYYA-------RGARAAVWEN-----FKNNPLFDISTDHPTTYY 263
                   DV N    G++A       R   A VWEN       NN + + +T H   Y 
Sbjct: 261 --------DVENRTTLGFWAGHRNSKIRVILARVWENDTELDISNNRI-NRATGH-LVYQ 310

Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
           +   R  FC+CP G    S R+ +++ +GCIPVI++D   LPF D + W +  + + E+D
Sbjct: 311 KRFYRTKFCICPGGSQVNSARITDSIHYGCIPVILSDYYDLPFNDILNWRKFAVVLREQD 370

Query: 324 VPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           V  L  IL +IP    +     L    +++   +  P    DAFH I+  L
Sbjct: 371 VYNLKQILKNIPHSEFISLHNNLV--KVQKHFQWNSPPVKFDAFHMIMYEL 419


>gi|9758008|dbj|BAB08605.1| unnamed protein product [Arabidopsis thaliana]
          Length = 408

 Score =  128 bits (321), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 104/351 (29%), Positives = 151/351 (43%), Gaps = 39/351 (11%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP-- 96
           + K+YVY    K  +  L  D  C +       F++     +  RT NP++A  FY P  
Sbjct: 79  QFKIYVY----KEGEPPLFHDGPCKSIYSMEGSFIYEIETDTRFRTNNPDKAHVFYLPFS 134

Query: 97  IYPTCDLTPTGLPLPFKSPR-MMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQ 155
           +              F   R  ++  I L+   +PYWNR+ GADHF +  HD+G    + 
Sbjct: 135 VVKMVRYVYERNSRDFSPIRNTVKDYINLVGDKYPYWNRSIGADHFILSCHDWGPEASFS 194

Query: 156 EEKAIERGILPLLQRATLVQTFGQRNHVCLNE-----GSITIPPYAPPQKMQAHQIPPDT 210
                   I  L   A   + F  R  V + E     GS+T               P  +
Sbjct: 195 HPHLGHNSIRALCN-ANTSERFKPRKDVSIPEINLRTGSLT----------GLVGGPSPS 243

Query: 211 PRSIFVYFRGLFYDVNNDPEGGYYA--RGARAAVWENFKNNPLFDISTDHPTTYYEDMQR 268
            R I  +F            GG +   R      WEN  N+          T+Y + M+ 
Sbjct: 244 SRPILAFF-----------AGGVHGPVRPVLLQHWENKDNDIRVHKYLPRGTSYSDMMRN 292

Query: 269 AIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLD 328
           + FC+CP G+   SPR+VEA+  GC+PV+I    V PF+D + W    + V+ ED+P L 
Sbjct: 293 SKFCICPSGYEVASPRIVEALYSGCVPVLINSGYVPPFSDVLNWRSFSVIVSVEDIPNLK 352

Query: 329 TILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL-ARKL 378
           TILTSI P   LR  R +    ++R      PA+  D FH IL+ +  R+L
Sbjct: 353 TILTSISPRQYLRMYRRVL--KVRRHFEVNSPAKRFDVFHMILHSIWVRRL 401


>gi|296084508|emb|CBI25529.3| unnamed protein product [Vitis vinifera]
          Length = 388

 Score =  128 bits (321), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 81/262 (30%), Positives = 129/262 (49%), Gaps = 23/262 (8%)

Query: 116 RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ 175
           R++   I ++S+ +PYWNR+ GADHF V  HD+           + +  + +L  A   +
Sbjct: 136 RLVTDYIYVVSNKYPYWNRSNGADHFLVSCHDWAPEISIVTPD-LYKHFIRVLCNANTSE 194

Query: 176 TFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYA 235
            F     + L E +I      PP     H   P   R I  +F G         E GY  
Sbjct: 195 RFQPIRDISLPEVNIPKGKLGPP-----HLDKPPNQRHILAFFSGR--------ESGY-- 239

Query: 236 RGARAAVWENFK-NNPLFDISTDHPTT--YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFG 292
              R  ++ ++K N+    +    P+   Y + M  + FCLCP GW   SPR+VEA+  G
Sbjct: 240 --MRTLLFRSWKENDDEVQVYEHLPSNRDYAKSMVDSKFCLCPSGWEVASPRVVEAIAAG 297

Query: 293 CIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMK 352
           C+PVII D  VLPF++ + W +  + +  + +P++  IL ++P E  LR Q+ +    ++
Sbjct: 298 CVPVIICDYYVLPFSEVLDWSKFSINITSDKIPEIKKILKAVPNERYLRMQKRVKQ--VQ 355

Query: 353 RAMLFPQPAQPGDAFHQILNGL 374
           R  +  +PAQP D  H IL+ +
Sbjct: 356 RHFVINRPAQPYDMLHMILHSV 377


>gi|302837063|ref|XP_002950091.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
 gi|300264564|gb|EFJ48759.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
          Length = 593

 Score =  127 bits (320), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 123/424 (29%), Positives = 175/424 (41%), Gaps = 96/424 (22%)

Query: 42  VYVYELPSKYNKKLL-LKDPRCLTH-----MFAAEIFMHRFLLSSPVRTLNPEEADWFYT 95
           +YVY+L   Y  KLL  + P   T      ++AA+  +H  LL S  RT +PEEAD+FY 
Sbjct: 183 IYVYDLEPLYQSKLLQYRWPDNGTQWTDVWVYAADTLLHESLLISEHRTFDPEEADFFYV 242

Query: 96  PIYPTCDLTPTGL--------------------------------PLPF-KSPR------ 116
           P Y +C   P G+                                 LP  + PR      
Sbjct: 243 PHYASCLPFPIGVWADAPWFPEPGARAHHHLCEVLPATRVITVTCRLPVNRGPRVKQMVN 302

Query: 117 MMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGI-LPLLQRATLVQ 175
           M+R  +  I+S +PYW R  G DH ++  HD GAC+      AI   I L    R  L  
Sbjct: 303 MVRDTLDWINSTFPYWRRRGGRDHIWLFTHDEGACW---APTAINASIWLTHWGRTELNH 359

Query: 176 TFGQ------------------------RNHVCLN-EGSITIPPYAPPQKMQAH--QIPP 208
           T                           + H C N E  + IP +  P    A   Q  P
Sbjct: 360 TSNTAFLADKYDSDFAGPLQPEGFVKYIKGHPCFNPEKDLVIPAFKAPSHYHASPLQGNP 419

Query: 209 DTPRSIFVYFRGLFYDV--NNDPEGGYYARGARAAVWENFKNNPLFDI-------STDHP 259
              R +  +FRG   DV  N  P    Y+RG R  +++  K     +         +D  
Sbjct: 420 ARERDLLFFFRG---DVGKNRLPN---YSRGVRQQIYKMAKEGGWAEKYRFYIGDGSDVE 473

Query: 260 TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV 319
             Y E + RAIFCL   G   WS R+ +AV+ GCIPV+IAD +   F + +  +   + +
Sbjct: 474 GDYSEMLSRAIFCLVAPG-DGWSARMEDAVLHGCIPVVIADGVEAVFENVLELDAFALRL 532

Query: 320 AEEDVPKLDTILTSIPPEVILRKQRLLANPSMK-RAMLFPQPAQPGDAFHQILNGLARKL 378
            +E VP+L  +L ++P   I  KQ  L     + R    P   +  DAF  I+  L  ++
Sbjct: 533 PQEAVPRLLDVLRAVPQRAIRSKQAHLGRVWQRYRWASLP---KLDDAFATIMQWLHSRI 589

Query: 379 PHDK 382
           PH +
Sbjct: 590 PHTR 593


>gi|255575457|ref|XP_002528630.1| catalytic, putative [Ricinus communis]
 gi|223531919|gb|EEF33733.1| catalytic, putative [Ricinus communis]
          Length = 574

 Score =  127 bits (320), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 92/316 (29%), Positives = 139/316 (43%), Gaps = 40/316 (12%)

Query: 72  FMHRFLLSSPVRTLNPEEADWFYTP-----------IYPTCDLTPTGLPLPFKSPRMMRS 120
           F+HR  +    RT +PE+A  ++ P           +  + D  P          R +R 
Sbjct: 268 FIHRMEIDDQFRTKDPEKAHVYFLPFSVAMMVQFVYVRDSHDFGPI--------KRTVRD 319

Query: 121 AIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR 180
            + L++  +PYWNR+ GADHF +  HD+G    +      +  I  L   A   + F   
Sbjct: 320 YVNLVAGKYPYWNRSLGADHFMLACHDWGPETSFSLPDLAKNSIRALCN-ANTSERFNPI 378

Query: 181 NHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYA--RGA 238
             V   E  I +        +     P  + RSI  +F            GG +   R  
Sbjct: 379 KDVSFPE--INLQTGTTKGFIGG---PSPSKRSILAFF-----------AGGLHGPIRPI 422

Query: 239 RAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 298
               WEN  N+           +YYE M+++ FCLCP G+   SPR+VEA+  GC+PV+I
Sbjct: 423 LLEHWENKDNDMKVHRYLPKGVSYYEMMRKSKFCLCPSGYEVASPRVVEALYTGCVPVLI 482

Query: 299 ADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFP 358
           +D  V PF+D + W+   + V   D+P L  ILTSI     +R QR +    ++R     
Sbjct: 483 SDHYVPPFSDVLNWKSFSVEVPVSDIPNLKRILTSISSRQYIRMQRRVLQ--VRRHFEVN 540

Query: 359 QPAQPGDAFHQILNGL 374
            P +  D FH IL+ +
Sbjct: 541 SPPKRYDVFHMILHSI 556


>gi|19699001|gb|AAL91236.1| unknown protein [Arabidopsis thaliana]
 gi|30023662|gb|AAP13364.1| At2g28110 [Arabidopsis thaliana]
          Length = 214

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 58/125 (46%), Positives = 83/125 (66%), Gaps = 4/125 (3%)

Query: 32  LEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLS--SPVRTLNPEE 89
           ++ D    LK+YVY+LPSK+NK  L  D RC  H+FAAE+ +H+  LS    VRT +P E
Sbjct: 86  IKTDVFNNLKIYVYDLPSKFNKDWLAND-RCTNHLFAAEVALHKAFLSLEGDVRTEDPYE 144

Query: 90  ADWFYTPIYPTCDL-TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDF 148
           AD+F+ P+Y +C+  T  G P    +  ++  AI+L+S+ +P+WNRT G+DH F   HDF
Sbjct: 145 ADFFFVPVYVSCNFSTINGFPAIGHARSLINDAIKLVSTQYPFWNRTSGSDHVFTATHDF 204

Query: 149 GACFH 153
           G+CFH
Sbjct: 205 GSCFH 209


>gi|359484710|ref|XP_002266490.2| PREDICTED: probable glycosyltransferase At5g20260 [Vitis vinifera]
          Length = 480

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 83/295 (28%), Positives = 136/295 (46%), Gaps = 26/295 (8%)

Query: 86  NPEEADWFYTPIYPTCDLTPTGLPLPFKS---PRMMRSAIQLISSNWPYWNRTEGADHFF 142
           +P+EA+ FY P+  T  +     P  +     PR++   I  ++  +PYWNR++GADHF 
Sbjct: 195 HPDEANAFYIPMSLTRVVHFIYEPPHYHGKWIPRLVTDYINFVADKYPYWNRSKGADHFL 254

Query: 143 VVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQ 202
           V  HD+       +  A++  +     RA       +R H   +   I+IP    P+   
Sbjct: 255 VSCHDWAP-----DVSALKPDLYKHFIRALCNANTSERFHPIRD---ISIPEINIPRGKL 306

Query: 203 A--HQIPPDTPRSIFVYFRGLFYDVNNDPEGGY-YARGARAAVWENFKNNPLFDISTDHP 259
              H   P   R I  +F G          G + Y R      W+   +           
Sbjct: 307 GPPHLDQPPNKRPILAFFAG----------GAHGYVRSVLFKYWKEKDDEVQVFERLPRN 356

Query: 260 TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV 319
             Y + M  + FCLCP G+   SPR+VEA+  GC+P+II D   LPF+D + W +  +++
Sbjct: 357 RNYSKSMGDSKFCLCPSGYEVASPRIVEAIAAGCVPMIICDHYSLPFSDVLDWSKFSIYI 416

Query: 320 AEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
             + +P++  IL ++P E  L  Q+ +    ++R     +PA+P D  H IL+ +
Sbjct: 417 TSDKIPEIKKILKAVPTETYLEMQKRVKQ--VQRHFAINRPARPYDMLHMILHSV 469


>gi|15233650|ref|NP_195517.1| Exostosin family protein [Arabidopsis thaliana]
 gi|4467110|emb|CAB37544.1| putative protein [Arabidopsis thaliana]
 gi|7270787|emb|CAB80469.1| putative protein [Arabidopsis thaliana]
 gi|15293111|gb|AAK93666.1| unknown protein [Arabidopsis thaliana]
 gi|21280961|gb|AAM45007.1| unknown protein [Arabidopsis thaliana]
 gi|332661466|gb|AEE86866.1| Exostosin family protein [Arabidopsis thaliana]
          Length = 425

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 106/351 (30%), Positives = 159/351 (45%), Gaps = 52/351 (14%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
           + KVY+Y  P   +     + PR +T  +A+E +  + +  S  RTL+P+EAD F+ PI 
Sbjct: 103 RFKVYIY--PDG-DPNTFYQTPRKVTGKYASEGYFFQNIRESRFRTLDPDEADLFFIPI- 158

Query: 99  PTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
            +C               ++++ +  + + +PYWNRT GADHFFV  HD G        +
Sbjct: 159 -SCHKMRGKGTSYENMTVIVQNYVDGLIAKYPYWNRTLGADHFFVTCHDVGV-------R 210

Query: 159 AIERGILPLLQRATLVQTFGQRNHVC---LNEGSITIPPYAPPQKMQAHQIPPDTPRSIF 215
           A E    PLL + T+         VC    N G I     A PQ +Q   +P        
Sbjct: 211 AFEGS--PLLIKNTI-------RVVCSPSYNVGFIPHKDVALPQVLQPFALPAGGN---- 257

Query: 216 VYFRGLFYDVNNDPEGGYYA-------RGARAAVWEN-----FKNNPLFDISTDHPTTYY 263
                   DV N    G++A       R   A VWEN       NN + + +T H   Y 
Sbjct: 258 --------DVENRTTLGFWAGHRNSKIRVILAHVWENDTELDISNNRI-NRATGH-LVYQ 307

Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
           +   R  FC+CP G    S R+ +++ +GCIPVI++D   LPF D + W +  + + E+D
Sbjct: 308 KRFYRTKFCICPGGSQVNSARITDSIHYGCIPVILSDYYDLPFNDILNWRKFAVVLREQD 367

Query: 324 VPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           V  L  IL +IP    +     L    +++   +  P    DAFH I+  L
Sbjct: 368 VYNLKQILKNIPHSEFVSLHNNLV--KVQKHFQWNSPPVKFDAFHMIMYEL 416


>gi|302753728|ref|XP_002960288.1| glycosylransferase-like protein [Selaginella moellendorffii]
 gi|300171227|gb|EFJ37827.1| glycosylransferase-like protein [Selaginella moellendorffii]
          Length = 372

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 97/350 (27%), Positives = 169/350 (48%), Gaps = 34/350 (9%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAE-IFMHRFLLSSPVRTLNPEEADWFYTPI- 97
            KVYVY+   +   +L+ K P  LT ++++E  F+H    +S   T +P+EA  F+ P  
Sbjct: 32  FKVYVYK---EGEPRLVHKGP--LTGIYSSEGRFIHEMNQNSRFVTHDPQEAHMFFLPYS 86

Query: 98  --YPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQ 155
             +   DL   G          ++  + LI+S  P+WN T G+DHFF   HD+G     +
Sbjct: 87  VAHMVLDLYVPGSHTMLPLATFIKDYVNLIASKHPFWNLTRGSDHFFTSCHDWGPAT-AR 145

Query: 156 EEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIF 215
           +   + +  + ++  + L + F       L E    +     P K+     P  + R I 
Sbjct: 146 DHPELRKNSVKVVCNSDLTEEFVPDKDASLPE--TYLHAVKLPTKLGG---PGPSKRPIL 200

Query: 216 VYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP------TTYYEDMQRA 269
            +F G  +            R A    W++ + +P   I    P      T+Y + M+ +
Sbjct: 201 AFFAGQMHG---------RVRPALIKHWKD-RGDPDMRIYEVLPPEVARRTSYVQHMKSS 250

Query: 270 IFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDT 329
            FC+C +G+   SPR+VE++ + C+PV+IAD+ VLPF+D + W    + V+E+DVP+L  
Sbjct: 251 KFCICAMGFEVNSPRIVESIYYDCVPVLIADNFVLPFSDVLNWGSFSLTVSEKDVPRLKE 310

Query: 330 ILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL-ARKL 378
           +L ++  +   + Q  L    +++  L+   A+  D FH IL+ +  R+L
Sbjct: 311 LLLAVSEDRYRKMQSRLKK--VRKHFLWHDSAERFDMFHMILHSVWTRRL 358


>gi|296084509|emb|CBI25530.3| unnamed protein product [Vitis vinifera]
          Length = 387

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 83/295 (28%), Positives = 136/295 (46%), Gaps = 26/295 (8%)

Query: 86  NPEEADWFYTPIYPTCDLTPTGLPLPFKS---PRMMRSAIQLISSNWPYWNRTEGADHFF 142
           +P+EA+ FY P+  T  +     P  +     PR++   I  ++  +PYWNR++GADHF 
Sbjct: 102 HPDEANAFYIPMSLTRVVHFIYEPPHYHGKWIPRLVTDYINFVADKYPYWNRSKGADHFL 161

Query: 143 VVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQ 202
           V  HD+       +  A++  +     RA       +R H   +   I+IP    P+   
Sbjct: 162 VSCHDWAP-----DVSALKPDLYKHFIRALCNANTSERFHPIRD---ISIPEINIPRGKL 213

Query: 203 A--HQIPPDTPRSIFVYFRGLFYDVNNDPEGGY-YARGARAAVWENFKNNPLFDISTDHP 259
              H   P   R I  +F G          G + Y R      W+   +           
Sbjct: 214 GPPHLDQPPNKRPILAFFAG----------GAHGYVRSVLFKYWKEKDDEVQVFERLPRN 263

Query: 260 TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV 319
             Y + M  + FCLCP G+   SPR+VEA+  GC+P+II D   LPF+D + W +  +++
Sbjct: 264 RNYSKSMGDSKFCLCPSGYEVASPRIVEAIAAGCVPMIICDHYSLPFSDVLDWSKFSIYI 323

Query: 320 AEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
             + +P++  IL ++P E  L  Q+ +    ++R     +PA+P D  H IL+ +
Sbjct: 324 TSDKIPEIKKILKAVPTETYLEMQKRVKQ--VQRHFAINRPARPYDMLHMILHSV 376


>gi|384490457|gb|EIE81679.1| hypothetical protein RO3G_06384 [Rhizopus delemar RA 99-880]
          Length = 793

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 98/370 (26%), Positives = 162/370 (43%), Gaps = 46/370 (12%)

Query: 34  DDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLL--SSPVR----TLNP 87
           +  V K K+YVY+ P        ++  +C    + +EI +H  L   +SP+     T NP
Sbjct: 429 EQAVCKPKIYVYKAPESIQVPEQIQKEKCHESNYNSEIILHNQLTDPTSPIYEHYVTENP 488

Query: 88  EEADWFYTPIYPTCDLTPTGLPLPFK-------SPRMMRSAIQLISSNWPYWNRTEGADH 140
           EEAD+F+ P + +C L        +          + +   + ++   +PYWN+T G +H
Sbjct: 489 EEADFFFIPFFGSCYLYNCWYENKWNWDERCEVDAKYVDPLMDMVIQEYPYWNKTGGRNH 548

Query: 141 FFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG----SITIPPYA 196
             + P D    F Y +         P  Q A  ++T G + +  ++       I IP   
Sbjct: 549 IMIHPMD--KTFTYYQSN-------PRFQSAIFLKTVGDKRNKWMSRHRYHRDIVIPSAT 599

Query: 197 ---------PPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFK 247
                    P   + A   P    R IF  F+G   DV    E   Y+ G R+  + +F 
Sbjct: 600 RMIHHLRANPLDYLNAQGQPKSGKRDIFALFQGCCPDVQPTDE---YSNGIRSLFFNHFA 656

Query: 248 NNPLFDISTD-HPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 306
           + P ++I        Y E + RA + L P+GW   + R+ E + FG +PV+IAD I+ PF
Sbjct: 657 HYPGYEIGQSVADEEYLEKLSRAKYGLSPMGWTLDTTRIWEFMAFGVVPVVIADGIIEPF 716

Query: 307 ADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDA 366
              + W++  + +  ++V +LD IL SI  +    KQ+ L     +  +         DA
Sbjct: 717 EFDVDWDKFIVRIRRDEVHRLDEILKSIDDKTYEYKQKNLWEFGRRVGLEM-------DA 769

Query: 367 FHQILNGLAR 376
           +H I+  L R
Sbjct: 770 WHFIVRELCR 779


>gi|159485716|ref|XP_001700890.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158281389|gb|EDP07144.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 704

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 108/369 (29%), Positives = 162/369 (43%), Gaps = 69/369 (18%)

Query: 42  VYVYELPSKYNKKLL---LKDPRCLTH--------------MFAAEIFMHRFLLSSPVRT 84
           +YVY+L   YN +LL   +    C TH               +  E  +H +LL S  RT
Sbjct: 269 IYVYDLDPLYNARLLQYRIVQTWC-THRHYHTGNTSTWSATFYGLEAALHEYLLISEHRT 327

Query: 85  LNPEEADWFYTPIYPTCDLTPT------------GLPLPFKSPRMMRSAIQLISSNWPYW 132
            +PEEAD+FY P Y  C + P             G P   +   M+R  ++ I   +P+W
Sbjct: 328 FDPEEADYFYVPFYGACMIYPVAGWADYPWFWTPGGPRVMQVINMIREIVEWIDKQYPFW 387

Query: 133 NRTEGADHFFVVPHDFGACF------------HYQE---EKAIERGILPLLQRATLV--- 174
            R  G DH ++  HD GAC+            H+     E       +       +V   
Sbjct: 388 KRRGGRDHIWLFTHDEGACWAPSVIKDSVWLTHWGRLDPEHTSNTAFVGDNYTHDMVNWR 447

Query: 175 QTFGQ----RNHVCLN-EGSITIPPY-APPQKMQA-HQIPPDTPRSIFVYFRGLFYDVNN 227
           Q  G     + H C + +  + +P + +PP  +++  Q  P  PR IF +F+G   DV  
Sbjct: 448 QPEGYIKYIKGHPCYDPQKDLVVPNFKSPPHYVRSPLQSTPSKPRDIFFFFKG---DVGK 504

Query: 228 DPEGGYYARGARAAV--------WENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWA 279
                +Y+RG R  +        W N + + + D    H   Y + + R++FCL   G  
Sbjct: 505 H-RLSHYSRGIRQKIYKMAMEQDWANTQKSLIGDGGNVH-GDYSDLLSRSLFCLVAPG-D 561

Query: 280 PWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVI 339
            WSPRL +AV+ GCIPVIIAD +   F   +  +   + VAE DVP++  IL ++    I
Sbjct: 562 GWSPRLEDAVLHGCIPVIIADRVHAVFESVLDIDSFAVRVAEADVPRVMDILRAVSDIKI 621

Query: 340 LRKQRLLAN 348
             KQ  L  
Sbjct: 622 RLKQSRLGQ 630


>gi|302768010|ref|XP_002967425.1| glycosyltransferase-like protein [Selaginella moellendorffii]
 gi|300165416|gb|EFJ32024.1| glycosyltransferase-like protein [Selaginella moellendorffii]
          Length = 372

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 97/350 (27%), Positives = 169/350 (48%), Gaps = 34/350 (9%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAE-IFMHRFLLSSPVRTLNPEEADWFYTPI- 97
            KVYVY+   +   +L+ K P  LT ++++E  F+H    +S   T +P+EA  F+ P  
Sbjct: 32  FKVYVYK---EGEPRLVHKGP--LTGIYSSEGRFIHEMNQNSRFVTHDPQEAHMFFLPYS 86

Query: 98  --YPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQ 155
             +   DL   G          ++  + LI+S  P+WN T G+DHFF   HD+G     +
Sbjct: 87  VAHMVLDLYVPGSHSMLPLATFIKDYVNLIASKHPFWNLTRGSDHFFASCHDWGPAT-AR 145

Query: 156 EEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIF 215
           +   + +  + ++  + L + F       L E    +     P K+     P  + R I 
Sbjct: 146 DHPELRKNSVKVVCNSDLTEEFVPDKDASLPE--TYLHAVKLPTKLGG---PGPSKRPIL 200

Query: 216 VYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP------TTYYEDMQRA 269
            +F G  +            R A    W++ + +P   I    P      T+Y + M+ +
Sbjct: 201 AFFAGQMHG---------RVRPALIKHWKD-RGDPDMRIYEVLPPDVARRTSYVQHMKSS 250

Query: 270 IFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDT 329
            FC+C +G+   SPR+VE++ + C+PV+IAD+ VLPF+D + W    + V+E+DVP+L  
Sbjct: 251 KFCICAMGFEVNSPRIVESIYYDCVPVLIADNFVLPFSDVLNWGSFSLTVSEKDVPRLKE 310

Query: 330 ILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL-ARKL 378
           +L ++  +   + Q  L    +++  L+   A+  D FH IL+ +  R+L
Sbjct: 311 LLLAVSEDRYRKMQSRLKK--VRKHFLWHDSAERFDMFHMILHSVWTRRL 358


>gi|255579651|ref|XP_002530666.1| catalytic, putative [Ricinus communis]
 gi|223529799|gb|EEF31735.1| catalytic, putative [Ricinus communis]
          Length = 528

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 110/348 (31%), Positives = 162/348 (46%), Gaps = 37/348 (10%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSS-PVRTLNPEEADWFYTPIY 98
           LKVY+Y  P    K+ +   P  L  ++A+E +  +F+  +    T +PE+A  FY P Y
Sbjct: 192 LKVYIY--PD--GKRPIFHVPH-LNGIYASEGWFMKFMEENRQFVTRDPEKAHLFYLP-Y 245

Query: 99  PTCDLTPTGLPLPFKS-----PRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFH 153
               L    L +P           MR    +I++ +P+WNRT G DHF V  HD+G    
Sbjct: 246 SARQLQ-MALYVPNSHNLRPLSIFMRDYANMIATKYPFWNRTHGRDHFLVACHDWGPYTL 304

Query: 154 YQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRS 213
              E+  +  I  L         F     V L E +I IP    P K     I   + R 
Sbjct: 305 TMHEELTKNTIKALCNADASEGIFDPTKDVSLPETTIRIP--RRPLKNVGGGIRV-SQRP 361

Query: 214 IFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPL-----FDISTDHPTTYYEDMQR 268
           I  +F G  +            R      W+N K+  L               Y + M+ 
Sbjct: 362 ILAFFAGNMHG---------RVRPTLLQYWQN-KDEDLKIYGPLPARISRKMNYVQHMKS 411

Query: 269 AIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLD 328
           + +C+CP+G    SPR+VEA+ + C+PVIIAD+ VLPF+D + W    + VAE+D+PKL 
Sbjct: 412 SRYCICPMGHEVNSPRIVEAIYYECVPVIIADNFVLPFSDVLDWSAFSIVVAEKDIPKLK 471

Query: 329 TILTSIPPEVILRKQRLLANPSM-KRAMLF-PQPAQPGDAFHQILNGL 374
            IL +IP   + R   +L N  M +R  L+ P+P +  D FH IL+ +
Sbjct: 472 EILLAIP---LRRYLTMLTNLKMLQRHFLWNPRPLR-YDLFHMILHSI 515


>gi|255582309|ref|XP_002531945.1| catalytic, putative [Ricinus communis]
 gi|223528391|gb|EEF30427.1| catalytic, putative [Ricinus communis]
          Length = 453

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 80/293 (27%), Positives = 135/293 (46%), Gaps = 22/293 (7%)

Query: 87  PEEADWFYTPIYPTCDLTPTGLPLPFKS----PRMMRSAIQLISSNWPYWNRTEGADHFF 142
           PEEA  F+ P+     +     P+   S     R++   +  +++ +PYWNR+ GADHF 
Sbjct: 166 PEEAQVFFLPVSVANIINFIYKPIVTYSRDQLQRLVTDYVGTVANKYPYWNRSSGADHFL 225

Query: 143 VVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQ 202
           V  HD+         + + R  + +L  A   + F  +  V + E +I      PP    
Sbjct: 226 VSCHDWAPDISTANPE-LYRNFIRVLCNANTSERFNPKRDVSIPEINIPSGKLGPP---- 280

Query: 203 AHQIPPDTPRSIFVYFRGLFYDVNNDPEGGY-YARGARAAVWENFKNNPLFDISTDHPTT 261
            HQ    + R+I  +F G          G + Y R      W+   +        D    
Sbjct: 281 LHQASSPSNRTILAFFAG----------GSHGYIRKLLLEHWKGKDSEIQVHEYLDKKQN 330

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y++ M ++ FCLCP G+   SPR+V A+  GC+PV I+D+  LPF+D + W +  + +  
Sbjct: 331 YFKLMGQSRFCLCPSGYEVASPRVVTAIQLGCVPVTISDNYTLPFSDILDWSKFSVHIPS 390

Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
             + ++ TIL  I P   L+  + +    ++R  +  +PAQP D  H +L+ +
Sbjct: 391 GKIQEIKTILKGISPRQYLKMHKRVM--LVRRHFMLNRPAQPFDMIHMMLHSI 441


>gi|296084507|emb|CBI25528.3| unnamed protein product [Vitis vinifera]
          Length = 862

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 83/294 (28%), Positives = 135/294 (45%), Gaps = 26/294 (8%)

Query: 87  PEEADWFYTPIYPTCDLTPTGLPLPFKS---PRMMRSAIQLISSNWPYWNRTEGADHFFV 143
           P+EA+ FY P+  T  +     P  +     PR++   I  ++  +PYWNR++GADHF V
Sbjct: 578 PDEANAFYIPMSLTKIVHFIYEPPHYYGKWIPRLVTDYINFVADKYPYWNRSKGADHFLV 637

Query: 144 VPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQA 203
             HD+       +  A++  +     RA       +R H   +   I+IP    P+    
Sbjct: 638 SCHDWAP-----DVSALKPDLYKHFIRALCNANTSERFHPIRD---ISIPEINIPRGKLG 689

Query: 204 --HQIPPDTPRSIFVYFRGLFYDVNNDPEGGY-YARGARAAVWENFKNNPLFDISTDHPT 260
             H   P   R I  +F G          G + Y R      W+   +            
Sbjct: 690 PPHLDQPPNKRPILAFFAG----------GAHGYVRSVLFKYWKEKDDEVQVFERLPGNR 739

Query: 261 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
            Y + M  + FCLCP G+   SPR+VEA+  GC+P+II D   LPF+D + W +  +++ 
Sbjct: 740 NYSKSMGDSKFCLCPSGYEVASPRIVEAIAAGCVPMIICDHYSLPFSDVLDWSKFSIYIT 799

Query: 321 EEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
            + +P++  IL ++P E  L  Q+ +    ++R     +PA+P D  H IL+ +
Sbjct: 800 SDKIPEIKKILKAVPTETYLEMQKRVK--QVQRHFAINRPARPYDMLHMILHSV 851


>gi|224071129|ref|XP_002303362.1| predicted protein [Populus trichocarpa]
 gi|222840794|gb|EEE78341.1| predicted protein [Populus trichocarpa]
          Length = 386

 Score =  125 bits (314), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 107/354 (30%), Positives = 167/354 (47%), Gaps = 49/354 (13%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSS-PVRTLNPEEADWFYTP-- 96
           LKVY+Y  P     K +   P  L  ++A+E +  +F+ +S    + +PE+A  FY P  
Sbjct: 50  LKVYIY--PD--GDKPIFHQPH-LYGIYASEGWFMKFMEASREFVSRDPEKAHLFYLPYS 104

Query: 97  -------IY--PTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHD 147
                  +Y   + +L P  +         MR    +I++ +PYWNRT G DHF V  HD
Sbjct: 105 ARQLEVAVYVPNSHNLRPLSI--------FMRDYANMIAAKYPYWNRTHGRDHFLVACHD 156

Query: 148 FGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIP 207
           +G       E+  +  +  L         F     V L E +I  P    P +     I 
Sbjct: 157 WGPYALTMHEELTKNTMKALCNADVSEGIFTAGQDVSLPETTIRSP--KRPLRNVGGGIR 214

Query: 208 PDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN-----PLFDISTDHPTTY 262
             + R I  +F G  +            R      W N  ++     PL  I      TY
Sbjct: 215 V-SQRPILAFFAGNLHG---------RVRPTLLKYWHNKDDDMKIYGPL-PIGISRKMTY 263

Query: 263 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
            + M+ + +C+CP+G+   SPR+VEA+ + C+PVIIAD+ VLPF + + W    + VAE+
Sbjct: 264 VQHMKSSKYCICPMGYEVNSPRIVEAIYYECVPVIIADNFVLPFNEVLDWSAFSVVVAEK 323

Query: 323 DVPKLDTILTSIPPEVILRKQRLLAN-PSMKRAMLF-PQPAQPGDAFHQILNGL 374
           D+PKL  IL +IP   + R   +LAN  ++++  L+ P+P +  D FH IL+ +
Sbjct: 324 DIPKLKEILLAIP---LRRYLTMLANLKTVQKHFLWNPRPLR-YDLFHMILHSI 373


>gi|357138738|ref|XP_003570946.1| PREDICTED: probable glycosyltransferase At5g11130-like
           [Brachypodium distachyon]
          Length = 405

 Score =  125 bits (314), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 101/351 (28%), Positives = 162/351 (46%), Gaps = 54/351 (15%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
            KVY+Y  P   + K   + PR LT  +A+E +  + +  S  RT +P+ AD F+ PI P
Sbjct: 84  FKVYIY--PDG-DPKTFYQTPRKLTGKYASEGYFFQNIRESRFRTEDPDSADLFFVPISP 140

Query: 100 TCDLTPTGLPLPFKSPRMM-RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
                  G    +++  ++ +  ++ + + +PYWNRT GADHFFV  HD G        +
Sbjct: 141 H---KMRGKGTSYENMTIIVKDYVEGLINKYPYWNRTLGADHFFVTCHDVGV-------R 190

Query: 159 AIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY---APPQKMQAHQIPPDTPRSIF 215
           A E   LP + + ++         VC    ++   P+   A PQ +Q   +P        
Sbjct: 191 AFEG--LPFMVKNSI-------RVVCSPSYNVDFIPHKDVALPQVLQPFALPKGGN---- 237

Query: 216 VYFRGLFYDVNNDPEGGYYA-------RGARAAVWEN-----FKNNPLFDISTDHPTTYY 263
                   DV N    G++A       R   A VWEN       NN +     +    Y 
Sbjct: 238 --------DVENRTNLGFWAGHRNSKIRVILARVWENDTELAISNNRISRAIGE--LVYQ 287

Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
           +   R  FC+CP G    S R+ +++ +GC+PVI++D   LPF D + W++  + + E D
Sbjct: 288 KQFYRTKFCICPGGSQVNSARISDSIHYGCVPVILSDYYDLPFNDVLDWKKFAIVLKERD 347

Query: 324 VPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           V +L +IL SI  E  +     L    +++  ++  P  P DAFH ++  L
Sbjct: 348 VYELKSILKSISQEEFVALHNSLVQ--IQKHFVWHSPPIPYDAFHMVMYEL 396


>gi|357125316|ref|XP_003564340.1| PREDICTED: probable glycosyltransferase At5g03795-like
           [Brachypodium distachyon]
          Length = 553

 Score =  125 bits (313), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 85/268 (31%), Positives = 129/268 (48%), Gaps = 33/268 (12%)

Query: 118 MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ-T 176
           +++ I +IS+ +PYWNRT+GADHFFV  HD+G  +  +    + R  +  L  A L +  
Sbjct: 297 VKNYIDMISAKFPYWNRTKGADHFFVACHDWGP-YTTKLHDELRRNTIKALCNADLSEGV 355

Query: 177 FGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRG----------LFYDVN 226
           F +   V L E  +     +P + ++     P T RSI  +F G          L Y   
Sbjct: 356 FIRGRDVSLPETFVR----SPRRPLRDIGGKPATERSILAFFAGQMHGRVRPILLQYWGG 411

Query: 227 NDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLV 286
            D +   Y R  R                      Y + M+ + +C+CP+G+   SPR+V
Sbjct: 412 KDTDMRIYGRLPRRITRR---------------MNYVQHMKSSKYCICPMGYEVNSPRIV 456

Query: 287 EAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLL 346
           EA+ + CIPVIIAD+ VLPF DA+ W    + V E+DVP+L  IL  IP    +  Q  +
Sbjct: 457 EAIYYECIPVIIADNFVLPFDDALDWSTFSVVVPEKDVPRLKEILLRIPESRYITMQSNV 516

Query: 347 ANPSMKRAMLFPQPAQPGDAFHQILNGL 374
               +++  L+       D FH IL+ +
Sbjct: 517 KK--VQKHFLWHAKPVKYDIFHMILHSV 542


>gi|159470363|ref|XP_001693329.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158277587|gb|EDP03355.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 626

 Score =  125 bits (313), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 111/369 (30%), Positives = 167/369 (45%), Gaps = 48/369 (13%)

Query: 39  KLKVYVYELP------SKYNKKLLLKDPRCLTHMFAA-EIFMHRFLLSSPVRTLNPEEAD 91
           KLK+Y+Y+LP        YN     +DP     M++A E+FM  FL  +  RT NP EA+
Sbjct: 242 KLKIYMYDLPWDVAFPGAYNDGFFGRDP-----MYSAYELFMRYFLQDNVTRTENPWEAN 296

Query: 92  WFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGAC 151
            FY P+     +      +P         AI  + S WP+W+R+ G DHF+ +  D G C
Sbjct: 297 LFYVPMLLYFYIGNVRDAVP-----QTAWAINHVRSRWPFWDRSGGRDHFYFMTGDRGTC 351

Query: 152 FHYQE--EKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPD 209
              +E  ++AI+     +    T       +++ C+      +    PP  M A  +P D
Sbjct: 352 HLPRELQDQAIKVVHWGMQVAGTDWIGLDNKDYACIQLKRDLV---VPPINMFAEILPTD 408

Query: 210 TPRSI-FVYFRG--------LFYDVNNDPEGGYYARGARAAVWE--------NFKNNPLF 252
           T +    V   G        LF+      +   Y+ G R A+ E        N  + P  
Sbjct: 409 TVKHYQTVVANGGQDFGRTLLFFFAGGIAQSMEYSGGTRQAIKELLTSVHIANGNSTPAD 468

Query: 253 DISTDHPTTYYEDMQ-RAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
            +  +  T  Y+ +   + FC+ P G+  W  RLV+A+ FGCIPVII D +   F D +P
Sbjct: 469 VVFVEGRTQEYKKLLLTSKFCIAPYGFG-WGLRLVQAIEFGCIPVIIQDHVYQAFEDFLP 527

Query: 312 WEEIGMFVAEEDVPKLDTILTSIPPE--VILRKQRLLANPSMKRAMLFPQPAQPGDAFHQ 369
           +EE  + +   DVP+L  IL S  PE    LR    L      RA ++ +    G+A+  
Sbjct: 528 YEEFSVRLPLRDVPRLLDILRSYSPEQQAALR----LGMAKYYRAFVWHREFG-GEAYEW 582

Query: 370 ILNGLARKL 378
            L GL R+L
Sbjct: 583 TLAGLQRRL 591


>gi|147775379|emb|CAN73458.1| hypothetical protein VITISV_022180 [Vitis vinifera]
          Length = 333

 Score =  125 bits (313), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 83/296 (28%), Positives = 135/296 (45%), Gaps = 30/296 (10%)

Query: 87  PEEADWFYTPIYPTCDLTPTGLPLPFKS---PRMMRSAIQLISSNWPYWNRTEGADHFFV 143
           P+EA+ FY P+  T  +     P  +     PR++   I  ++  +PYWNR++GADHF V
Sbjct: 49  PDEANAFYIPMSLTKIVHFIYEPPHYYGKWIPRLVTDYINFVADKYPYWNRSKGADHFLV 108

Query: 144 VPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNH----VCLNEGSITIPPYAPPQ 199
             HD+       +  A++  +     RA       +R H    + + E +I      PP 
Sbjct: 109 SCHDWAP-----DVSALKPDLYKHFIRALCNANTSERFHPIRDISIPEINIPXGKLGPPH 163

Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGY-YARGARAAVWENFKNNPLFDISTDH 258
             Q     P   R I  +F G          G + Y R      W+   +          
Sbjct: 164 LDQ-----PPNKRPILAFFAG----------GAHGYVRSVLFKYWKEKDDEVQVFERLPG 208

Query: 259 PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
              Y + M  + FCLCP G+   SPR+VEA+  GC+P+II D   LPF+D + W +  ++
Sbjct: 209 NRNYSKSMGDSKFCLCPSGYEVASPRIVEAIAAGCVPMIICDHYSLPFSDVLDWSKFSIY 268

Query: 319 VAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           +  + +P++  IL ++P E  L  Q+ +    ++R     +PA+P D  H IL+ +
Sbjct: 269 ITSDKIPEIKKILKAVPTETYLEMQKRVKQ--VQRHFAINRPARPYDMLHMILHSV 322


>gi|449531189|ref|XP_004172570.1| PREDICTED: probable glycosyltransferase At5g03795-like, partial
           [Cucumis sativus]
          Length = 344

 Score =  124 bits (312), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 105/348 (30%), Positives = 167/348 (47%), Gaps = 38/348 (10%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAE-IFMHRFLLSSPVRTLNPEEADWFYTPIY 98
           LKVY+Y   S+     +   P  L  ++A+E  FM     +    T +PE+A  FY   Y
Sbjct: 9   LKVYIYRDGSR----PIFHTPH-LRGIYASEGWFMKLMEENRQFVTKDPEKAHLFYLA-Y 62

Query: 99  PTCDLTPTGLPLPFKSPRM------MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
            +  L  T L +P  S  M      +R  +  I+  +PYWNRT G DHF V  HD+G  +
Sbjct: 63  SSRQLQ-TALYVP-DSHNMKPLSIYLRDHVNWIAGKYPYWNRTHGYDHFLVACHDWGP-Y 119

Query: 153 HYQEEKAIERGILPLLQRATLVQ-TFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTP 211
              E + + +  +  L  A L +  F     V L E +I  P   P + +   ++   + 
Sbjct: 120 TVNEHRELSQHTIKALCNADLSEGVFKLGKDVSLPETTIRTPR-KPLRNVGGKRV---SQ 175

Query: 212 RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPL-----FDISTDHPTTYYEDM 266
           R I  +F G  +            R      W N K++ +       +      TY + M
Sbjct: 176 RPILAFFAGNMHG---------RVRPILLKHW-NDKDDDIRVYGPLPLRVSRKMTYIQHM 225

Query: 267 QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK 326
           + + +C+CP+G+   SPR++EA+ + C+PVIIAD+ VLPF++ + W    + VAE+D+PK
Sbjct: 226 KSSKYCICPMGYEVNSPRIIEAIYYECVPVIIADNFVLPFSEFLDWSAFSVVVAEKDIPK 285

Query: 327 LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           L  ILT+IP +  L  Q +      K  +  P+P +  D FH +L+ +
Sbjct: 286 LKEILTAIPLKRYLTMQ-INVKMVQKHFLWNPKPLK-YDLFHMVLHSI 331


>gi|449469432|ref|XP_004152424.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
           sativus]
          Length = 472

 Score =  124 bits (312), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 105/348 (30%), Positives = 167/348 (47%), Gaps = 38/348 (10%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAE-IFMHRFLLSSPVRTLNPEEADWFYTPIY 98
           LKVY+Y   S+     +   P  L  ++A+E  FM     +    T +PE+A  FY   Y
Sbjct: 137 LKVYIYRDGSR----PIFHTPH-LRGIYASEGWFMKLMEENRQFVTKDPEKAHLFYLA-Y 190

Query: 99  PTCDLTPTGLPLPFKSPRM------MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
            +  L  T L +P  S  M      +R  +  I+  +PYWNRT G DHF V  HD+G  +
Sbjct: 191 SSRQLQ-TALYVP-DSHNMKPLSIYLRDHVNWIAGKYPYWNRTHGYDHFLVACHDWGP-Y 247

Query: 153 HYQEEKAIERGILPLLQRATLVQ-TFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTP 211
              E + + +  +  L  A L +  F     V L E +I  P   P + +   ++   + 
Sbjct: 248 TVNEHRELSQHTIKALCNADLSEGVFKLGKDVSLPETTIRTPR-KPLRNVGGKRV---SQ 303

Query: 212 RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPL-----FDISTDHPTTYYEDM 266
           R I  +F G  +            R      W N K++ +       +      TY + M
Sbjct: 304 RPILAFFAGNMHG---------RVRPILLKHW-NDKDDDIRVYGPLPLRVSRKMTYIQHM 353

Query: 267 QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK 326
           + + +C+CP+G+   SPR++EA+ + C+PVIIAD+ VLPF++ + W    + VAE+D+PK
Sbjct: 354 KSSKYCICPMGYEVNSPRIIEAIYYECVPVIIADNFVLPFSEFLDWSAFSVVVAEKDIPK 413

Query: 327 LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           L  ILT+IP +  L  Q +      K  +  P+P +  D FH +L+ +
Sbjct: 414 LKEILTAIPLKRYLTMQ-INVKMVQKHFLWNPKPLK-YDLFHMVLHSI 459


>gi|359484706|ref|XP_002266646.2| PREDICTED: probable glycosyltransferase At5g20260-like [Vitis
           vinifera]
          Length = 480

 Score =  124 bits (312), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 83/294 (28%), Positives = 135/294 (45%), Gaps = 26/294 (8%)

Query: 87  PEEADWFYTPIYPTCDLTPTGLPLPFKS---PRMMRSAIQLISSNWPYWNRTEGADHFFV 143
           P+EA+ FY P+  T  +     P  +     PR++   I  ++  +PYWNR++GADHF V
Sbjct: 196 PDEANAFYIPMSLTKIVHFIYEPPHYYGKWIPRLVTDYINFVADKYPYWNRSKGADHFLV 255

Query: 144 VPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQA 203
             HD+       +  A++  +     RA       +R H   +   I+IP    P+    
Sbjct: 256 SCHDWAP-----DVSALKPDLYKHFIRALCNANTSERFHPIRD---ISIPEINIPRGKLG 307

Query: 204 --HQIPPDTPRSIFVYFRGLFYDVNNDPEGGY-YARGARAAVWENFKNNPLFDISTDHPT 260
             H   P   R I  +F G          G + Y R      W+   +            
Sbjct: 308 PPHLDQPPNKRPILAFFAG----------GAHGYVRSVLFKYWKEKDDEVQVFERLPGNR 357

Query: 261 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
            Y + M  + FCLCP G+   SPR+VEA+  GC+P+II D   LPF+D + W +  +++ 
Sbjct: 358 NYSKSMGDSKFCLCPSGYEVASPRIVEAIAAGCVPMIICDHYSLPFSDVLDWSKFSIYIT 417

Query: 321 EEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
            + +P++  IL ++P E  L  Q+ +    ++R     +PA+P D  H IL+ +
Sbjct: 418 SDKIPEIKKILKAVPTETYLEMQKRVKQ--VQRHFAINRPARPYDMLHMILHSV 469


>gi|15233924|ref|NP_195005.1| Exostosin family protein [Arabidopsis thaliana]
 gi|3063691|emb|CAA18582.1| putative protein [Arabidopsis thaliana]
 gi|7270226|emb|CAB79996.1| putative protein [Arabidopsis thaliana]
 gi|332660719|gb|AEE86119.1| Exostosin family protein [Arabidopsis thaliana]
          Length = 593

 Score =  124 bits (311), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 102/349 (29%), Positives = 167/349 (47%), Gaps = 42/349 (12%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEI-FMHRFLLSSPVRTLNPEEADWFYTPI 97
           KLKVYVY    +  K+ +L  P  L  ++A+E  FM +   S    T +P +A  FY P 
Sbjct: 266 KLKVYVY----REGKRPVLHKP-VLKGIYASEGWFMKQLKSSRTFVTKDPRKAHLFYLPF 320

Query: 98  YPTCDLTPTGLPLPFKSP-----RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
             +  +    L +P         + +++ + +ISS + +WN+T G+DHF V  HD+    
Sbjct: 321 --SSKMLEETLYVPGSHSDKNLIQFLKNYLDMISSKYSFWNKTGGSDHFLVACHDWAPS- 377

Query: 153 HYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPR 212
              E +      +  L  + + + F     V L E +I +P     + ++A    P + R
Sbjct: 378 ---ETRQYMAKCIRALCNSDVSEGFVFGKDVALPETTILVPR----RPLRALGGKPVSQR 430

Query: 213 SIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN--PLFDISTDHPTT-----YYED 265
            I  +F G  +        GY     R  + +N+  N  P   I ++ P +     Y E 
Sbjct: 431 QILAFFAGGMH--------GY----LRPLLLQNWGGNRDPDMKIFSEIPKSKGKKSYMEY 478

Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
           M+ + +C+CP G    SPR+VEA+ + C+PVII+D+ V PF + + WE   +FV E+D+P
Sbjct: 479 MKSSKYCICPKGHEVNSPRVVEALFYECVPVIISDNFVPPFFEVLNWESFAVFVLEKDIP 538

Query: 326 KLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
            L  IL SI  E   R+ ++      K  +   +P +  D FH IL+ +
Sbjct: 539 DLKNILVSITEER-YREMQMRVKMVQKHFLWHSKPER-FDIFHMILHSI 585


>gi|297804498|ref|XP_002870133.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297315969|gb|EFH46392.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 540

 Score =  124 bits (311), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 102/346 (29%), Positives = 160/346 (46%), Gaps = 33/346 (9%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAE-IFMHRFLLSSPVRTLNPEEADWFYTPIY 98
           LKVY+Y  P    +K +   P  L  ++A+E  FM     ++   T NPE A  FY P  
Sbjct: 198 LKVYIY--PD--GEKPIFHQPH-LNGIYASEGWFMKLMESNTQFVTKNPERAHLFYMPYS 252

Query: 99  PTCDLTPTGLPLPFKSPRM---MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQ 155
                T   +P       +   +R  + ++S+ +P+WNRT G+DHF V  HD+G  +   
Sbjct: 253 VKQLQTSIFVPGSHNIKPLSIFLRDYVNMLSTKYPFWNRTHGSDHFLVACHDWGP-YTVN 311

Query: 156 EEKAIERGILPLLQRATLVQ-TFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSI 214
           E   + R  +  L  A L    F     V L E SI             +++   + R I
Sbjct: 312 EHPELRRNTIKALCNADLADGIFIPGKDVSLPETSIRNAGKPLRNIGNGNRV---SQRPI 368

Query: 215 FVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN-----PLFDISTDHPTTYYEDMQRA 269
             +F G  +            R      W N  ++     PL   +     TY + M+ +
Sbjct: 369 LAFFAGNLHG---------RVRPKLLKHWRNKDDDMKIYGPLPH-NVARKMTYVQHMKSS 418

Query: 270 IFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDT 329
            +CLCP+G+   SPR+VEA+ + C+PV+IAD+ +LPF+D + W    + V E+++P+L  
Sbjct: 419 KYCLCPMGYEVNSPRIVEAIYYECVPVVIADNFMLPFSDVLDWSAFSVVVPEKEIPRLKE 478

Query: 330 ILTSIPPEVILRKQRLLANPSM-KRAMLFPQPAQPGDAFHQILNGL 374
           IL  IP    L+ Q   +N  M +R  L+    +  D FH IL+ +
Sbjct: 479 ILLEIPMRRYLKMQ---SNVKMVQRHFLWSPKPRKYDVFHMILHSI 521


>gi|413947133|gb|AFW79782.1| putative homeodomain-like transcription factor superfamily protein
           [Zea mays]
          Length = 544

 Score =  124 bits (311), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 83/261 (31%), Positives = 123/261 (47%), Gaps = 19/261 (7%)

Query: 118 MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTF 177
           +++ I LIS+ +PYWNRT+GADHFFV  HD+G  +  +    + +  +  L  A L + F
Sbjct: 290 VKNYIDLISAKYPYWNRTKGADHFFVACHDWGP-YTTKLHDELRKNTIKALCNADLSEGF 348

Query: 178 GQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARG 237
             R        +    P  P + +      P   R+I  +F G  +            R 
Sbjct: 349 FIRGKDVSLPETFLRSPRRPLRDIGGR---PAAQRTILAFFAGQMHG---------RVRP 396

Query: 238 ARAAVWENFKNNPLFDISTDHPTT----YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGC 293
                W N  ++        H  T    Y + M+ + +C+CP+G+   SPR+VEA+ + C
Sbjct: 397 VLLKYWGNKDDDMRIYSRLPHRITRKRNYVQHMKSSKYCICPMGYEVNSPRIVEAIYYEC 456

Query: 294 IPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKR 353
           +PVIIAD  VLPF  A+ W    + V E DVPKL  IL +IP    +  Q  +     K 
Sbjct: 457 VPVIIADHFVLPFDAALNWSTFSVVVPESDVPKLKQILLAIPESRYITLQANVKR-VQKH 515

Query: 354 AMLFPQPAQPGDAFHQILNGL 374
            M  P P +  D FH IL+ +
Sbjct: 516 FMWHPNPVK-YDIFHMILHSV 535


>gi|297802718|ref|XP_002869243.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297315079|gb|EFH45502.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 583

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 103/350 (29%), Positives = 166/350 (47%), Gaps = 44/350 (12%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEI-FMHRFLLSSPVRTLNPEEADWFYTPI 97
           KLKVYVY    +  K+ +L  P  L  ++A+E  FM +   S    T +P +A  FY P 
Sbjct: 256 KLKVYVY----REGKRPVLHKP-VLKGIYASEGWFMKQLKSSRTFVTKDPRKAHLFYLPF 310

Query: 98  YPTCDLTPTGLPLPFKSP-----RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
             +  +    L +P         + +++ + +ISS + +WN+T G+DHF V  HD+    
Sbjct: 311 --SSKMLEETLYVPGSHSDQNLIQFLKNYLDMISSKYNFWNKTGGSDHFLVACHDWAPS- 367

Query: 153 HYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPR 212
              E +      +  L  + + + F     V L E +I +P     + ++A    P + R
Sbjct: 368 ---ETRQYMAKCIRALCNSDVSEGFVFGKDVALPETTILVPR----RPLRALGGKPVSQR 420

Query: 213 SIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN--PLFDISTDHPTT-----YYED 265
            I  +F G  +        GY     R  +  N+  N  P   I ++ P +     Y E 
Sbjct: 421 QILAFFAGGMH--------GY----LRPLLLRNWGGNRDPDMKIFSEIPKSKGKKSYMEY 468

Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
           M+ + FC+CP G    SPR+VEA+ + C+PVII+D+ V PF + + WE   +FV E+D+P
Sbjct: 469 MKSSKFCICPKGHEVNSPRVVEALFYECVPVIISDNFVPPFFEVLNWEAFAVFVLEKDIP 528

Query: 326 KLDTILTSIPPEVILRKQRLLANPSM-KRAMLFPQPAQPGDAFHQILNGL 374
            L  IL SI  E   R + +     M ++  L+    +  D FH IL+ +
Sbjct: 529 DLKNILVSITEE---RYREMQTRVKMVQKHFLWHSKPERFDIFHMILHSI 575


>gi|384251887|gb|EIE25364.1| exostosin-like glycosyltransferase [Coccomyxa subellipsoidea C-169]
          Length = 705

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 102/366 (27%), Positives = 162/366 (44%), Gaps = 70/366 (19%)

Query: 42  VYVYELPSKYNKKLL---LKDPRCLTHMFAA-------------EIFMHRFLLSSPVRTL 85
           +Y+ ELP  YN ++L   +    C    F +             E  +H  LL SP RTL
Sbjct: 276 IYMVELPPIYNSRMLQYRIHKDTCTWRGFDSGNASFITDWTYQIEPALHEMLLQSPHRTL 335

Query: 86  NPEEADWFYTPIYPTCDLTPT------------GLPLPFKSPRMMRSAIQLISSNWPYWN 133
           +PE AD+FY P+Y +C + P             G P    +  MM  A + + +  PYWN
Sbjct: 336 DPEAADFFYVPVYTSCFIHPVYGWADTPWFHNPGSPRVMHAATMMLEAKRWLETELPYWN 395

Query: 134 RTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQT----------------- 176
           RT G DH +++ HD G+C+   E ++    IL    R  L                    
Sbjct: 396 RTGGRDHIWLISHDEGSCWAPSEIRS--SIILSHWGRKALDHESYSAYPFDNYSDNAVHP 453

Query: 177 ----FGQRN----HVCLN-EGSITIPPYAPPQKMQAHQIPP--DTPRSIFVYFRGLFYDV 225
                G R+    H C + +  + IP + PP ++    +    + PR + ++FRG    +
Sbjct: 454 EWRPHGWRHIIEGHPCYDPDKDLIIPAFVPPARIVPSPLTGAREDPRPLLLFFRGDV-GL 512

Query: 226 NNDPEGGYYARGARAAVW-----ENFKNNPLFDIST--DHPTTYYEDMQRAIFCLCPLGW 278
           N  P   +Y+RG R  ++     + ++      I T  D P  Y E +  + FCL   G 
Sbjct: 513 NRRP---HYSRGIRQRIYALSKEQRWREKYRIWIGTKEDTPGGYSELLSSSKFCLVVPG- 568

Query: 279 APWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEV 338
             WSPR  +A++ GC+PV++ D +   F   + WEE  + + E ++  L  IL SI P  
Sbjct: 569 DGWSPRAEDAMLHGCVPVVVNDGVDQVFETLLDWEEFAVRIPEREMEFLPEILLSISPSR 628

Query: 339 ILRKQR 344
           + + Q+
Sbjct: 629 LQQLQK 634


>gi|357440259|ref|XP_003590407.1| hypothetical protein MTR_1g061760 [Medicago truncatula]
 gi|355479455|gb|AES60658.1| hypothetical protein MTR_1g061760 [Medicago truncatula]
          Length = 508

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 91/322 (28%), Positives = 151/322 (46%), Gaps = 51/322 (15%)

Query: 72  FMHRFLLSSPVRTLNPEEADWFYTP-----------IYPTCDLTPTGLPLPFKSPRMMRS 120
           F+H   L+   RT +P++A  ++ P           +  + D  P          + +  
Sbjct: 206 FIHAIELNDQFRTRDPQKAHVYFLPFSVVMLVRFVYLRDSRDFGPIR--------KTVTD 257

Query: 121 AIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR 180
            I +I+  +PYWNR+ GADHF +  HD+G    +          +P L + ++       
Sbjct: 258 YINVIAGKYPYWNRSLGADHFMLACHDWGPETSFS---------VPYLHKNSI------- 301

Query: 181 NHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARG--- 237
             V  N    T   + P + +   +I   T  SI  +  GL    +  P   ++A G   
Sbjct: 302 -RVLCNAN--TSERFNPAKDVSFPEINLQTG-SINGFLGGL--SASKRPILAFFAGGLHG 355

Query: 238 -ARAAVWENFKNNPLFDISTDH----PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFG 292
             RA + E+++NN   D+          +YYE ++++ FCLCP G+   SPR+VEA+  G
Sbjct: 356 HIRAILLEHWENNKDQDMMIQKYLPKGVSYYEMLRKSKFCLCPSGYEVASPRIVEAIYTG 415

Query: 293 CIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMK 352
           C+PV+I+D  V PF+D + W+   + ++ ED+PKL  IL  I P   +R QR +    ++
Sbjct: 416 CVPVLISDHYVPPFSDVLNWKSFSVEISVEDIPKLKDILMRISPTQYIRMQRRVVQ--IR 473

Query: 353 RAMLFPQPAQPGDAFHQILNGL 374
           R      P +  D FH IL+ +
Sbjct: 474 RHFEVHSPPKRFDVFHMILHSV 495


>gi|125538390|gb|EAY84785.1| hypothetical protein OsI_06153 [Oryza sativa Indica Group]
          Length = 332

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 101/355 (28%), Positives = 163/355 (45%), Gaps = 56/355 (15%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
            KVY+Y  P   + K   + PR LT  +A+E +  + +  S  RT +P++A  F+ PI P
Sbjct: 5   FKVYMY--PDG-DPKTFYQTPRKLTGKYASEGYFFQNIRESRFRTGDPDKAHLFFVPISP 61

Query: 100 TCDLTPTGLPLPFKSPRMM-RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
                  G    +++  ++ +  ++ + + +PYWNRT GADHFFV  HD G        +
Sbjct: 62  H---KMRGKGTSYENMTIIVKDYVEGLINKYPYWNRTLGADHFFVTCHDVGV-------R 111

Query: 159 AIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY---APPQKMQAHQIPPDTPRSIF 215
           A E   LP + + ++         VC    ++   P+   A PQ +Q   +P        
Sbjct: 112 AFEG--LPFMVKNSI-------RVVCSPSYNVDFIPHKDIALPQVLQPFALPEGGN---- 158

Query: 216 VYFRGLFYDVNNDPEGGYYA-------RGARAAVWEN-----FKNNPLFDISTDHPTTYY 263
                   DV N    G++A       R   A VWEN       NN +     +    Y 
Sbjct: 159 --------DVENRTILGFWAGHRNSKIRVILAQVWENDTELAISNNRISRAIGE--LVYQ 208

Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
           +   R  FC+CP G    S R+ +++ +GC+PVI++D   LPF D + W +  + + E D
Sbjct: 209 KQFYRTKFCICPGGSQVNSARISDSIHYGCVPVILSDYYDLPFNDILDWRKFAVVLKERD 268

Query: 324 VPKLDTILTSIPPEVILRKQRLLAN----PSMKRAMLFPQPAQPGDAFHQILNGL 374
           V +L +IL SI  E  +   + L      P +++  ++  P  P DAFH ++  L
Sbjct: 269 VYQLKSILKSISQEEFVELHKSLVQNISCPQVQKHFVWHSPPLPYDAFHMVMYEL 323


>gi|147789666|emb|CAN71921.1| hypothetical protein VITISV_031190 [Vitis vinifera]
          Length = 310

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 82/295 (27%), Positives = 135/295 (45%), Gaps = 26/295 (8%)

Query: 86  NPEEADWFYTPIYPTCDLTPTGLPLPFKS---PRMMRSAIQLISSNWPYWNRTEGADHFF 142
           +P+EA+ FY P+  T  +     P  +     PR++   I  ++  +PYWNR++GADHF 
Sbjct: 25  HPDEANAFYIPMSLTRVVHFIYEPPHYXGKWIPRLVXDYINFVADKYPYWNRSKGADHFL 84

Query: 143 VVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQ 202
           V  HD+       +  A++  +     RA       +R H   +   I+IP    P+   
Sbjct: 85  VSCHDWAP-----DVSALKPDLYKHFIRALCNANTSERFHPIRD---ISIPEINIPRGKL 136

Query: 203 A--HQIPPDTPRSIFVYFRGLFYDVNNDPEGGY-YARGARAAVWENFKNNPLFDISTDHP 259
              H   P   R I  +F G          G + Y R      W+   +           
Sbjct: 137 GPPHLDQPPNKRPILAFFAG----------GAHGYVRSVLFKYWKEKDDEVQVFERLPRN 186

Query: 260 TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV 319
             Y + M  + FCLCP G+   SPR+V+A+  GC+P+II D   LPF+D + W +  +++
Sbjct: 187 RNYSKSMGDSKFCLCPSGYEVASPRIVKAIAAGCVPMIICDHYSLPFSDXLDWSKFSIYI 246

Query: 320 AEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
             + +P++  IL ++P E  L  Q+ +    ++R     +PA P D  H IL+ +
Sbjct: 247 TSDKIPEIKKILKAVPTETYLEMQKRVKQ--VQRHFAINRPAXPYDMLHMILHSV 299


>gi|115434052|ref|NP_001041784.1| Os01g0107700 [Oryza sativa Japonica Group]
 gi|52076197|dbj|BAD44851.1| pectin-glucuronyltransferase -like [Oryza sativa Japonica Group]
 gi|52076236|dbj|BAD44890.1| pectin-glucuronyltransferase -like [Oryza sativa Japonica Group]
 gi|113531315|dbj|BAF03698.1| Os01g0107700 [Oryza sativa Japonica Group]
 gi|215697737|dbj|BAG91731.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222617591|gb|EEE53723.1| hypothetical protein OsJ_00064 [Oryza sativa Japonica Group]
          Length = 550

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 80/259 (30%), Positives = 129/259 (49%), Gaps = 15/259 (5%)

Query: 118 MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ-T 176
           ++  I  IS+ +PYWNRT+GADHFFV  HD+G  +  +    + +  +  L  A L +  
Sbjct: 294 VKKYIDFISTKFPYWNRTKGADHFFVACHDWGP-YTTKLHDELRKNTIKALCNADLSEGV 352

Query: 177 FGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYA- 235
           F     V L E  +     +P + ++     P   RSI  +F G  +         Y+  
Sbjct: 353 FIHGRDVSLPETFLR----SPRRPLRGIGGKPAAERSILAFFAGQMHGRVRPVLLQYWGG 408

Query: 236 RGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIP 295
           + A   +++   +             Y + M+ + +C+CP+G+   SPR+VEA+ + C+P
Sbjct: 409 KDADMRIYDRLPHR------ITRRMNYIQHMKSSKYCICPMGYEVNSPRIVEAIYYECVP 462

Query: 296 VIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAM 355
           VIIAD+ VLPF DA+ W    + + E+DVPKL  IL +IP +  +  Q  +     K  +
Sbjct: 463 VIIADNFVLPFDDALNWSAFSVVIPEKDVPKLKQILLAIPDDQYMAMQSNVQR-VQKHFI 521

Query: 356 LFPQPAQPGDAFHQILNGL 374
             P P +  D FH IL+ +
Sbjct: 522 WHPNPIK-YDIFHMILHSI 539


>gi|224068931|ref|XP_002326234.1| predicted protein [Populus trichocarpa]
 gi|222833427|gb|EEE71904.1| predicted protein [Populus trichocarpa]
          Length = 328

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 95/316 (30%), Positives = 141/316 (44%), Gaps = 40/316 (12%)

Query: 72  FMHRFLLSSPVRTLNPEEADWFYTP---------IYP--TCDLTPTGLPLPFKSPRMMRS 120
           F+HR  +    RT +P++A  ++ P         +Y   + D  P          R +  
Sbjct: 33  FIHRMEIDGHFRTKDPDKAHVYFLPFSVAMMVRFVYERESRDFGPIR--------RTVSD 84

Query: 121 AIQLISSNWPYWNRTEGADHFFVVPHDFG--ACFHYQEEKAIERGILPLLQRATLVQTFG 178
            I LIS  +P+WNR+ GADHF +  HD+G  A F       I    +  L  A   + F 
Sbjct: 85  YINLISGKYPFWNRSLGADHFMLACHDWGPEASFSVPHLGKIS---IRALCNANTSEKFN 141

Query: 179 QRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGA 238
               V L E ++         K     + P + RSI  +F G  +            R  
Sbjct: 142 PIKDVSLPEINLRTGSI----KGFVGGLSP-SKRSILAFFAGRLHGP---------IRPV 187

Query: 239 RAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 298
               WEN  ++           +YYE M+ + FCLCP G+   SPR+VEA+  GC+PV+I
Sbjct: 188 VLEHWENKDDDIKVHQQLPKGVSYYEMMRGSKFCLCPSGYEVASPRIVEALYAGCVPVLI 247

Query: 299 ADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFP 358
           +D  V PF+D + W+   + V   D+P L  ILTSI P   +R QR +    ++R     
Sbjct: 248 SDHYVPPFSDVLNWKSFSVEVPVSDIPSLKKILTSISPRQYIRMQRRVLQ--VRRHFEVN 305

Query: 359 QPAQPGDAFHQILNGL 374
            P +  D FH IL+ +
Sbjct: 306 SPPKRFDVFHMILHSI 321


>gi|357478261|ref|XP_003609416.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
 gi|355510471|gb|AES91613.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
          Length = 380

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 85/306 (27%), Positives = 141/306 (46%), Gaps = 33/306 (10%)

Query: 80  SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLP-FKSPRMMRSAI---QLISSNWPYWNRT 135
           SP  T NP+EA  F  P+  T  +     PL  +   ++M   I    +I+  + YWNR+
Sbjct: 86  SPFSTHNPDEAHVFMLPLSVTNMVHYLYNPLTTYSRDQIMHVTIDYTNIIAHKYRYWNRS 145

Query: 136 EGADHFFVVPHDFGACFHYQEE-KAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPP 194
           +GADH  V  HD+      +   K + + ++ +L  A   + F  +  V +         
Sbjct: 146 KGADHLLVSCHDWAPEISRESSGKELFKNLIKVLCNANTSEGFDPKRDVSM--------- 196

Query: 195 YAPPQKMQAHQIPPDTP------RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN 248
             P   +Q +++    P      RSI  +F G         E G   R      W+    
Sbjct: 197 --PEMNLQGYKLSSPIPSKESNNRSILAFFAG--------GEHGM-IRKTLLDQWKGKDK 245

Query: 249 NPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
             L          Y++ M ++ FCLCP G+   SPRLVE++  GC+PVI++D+  LPF+D
Sbjct: 246 EVLVYEYLPKKLKYFKLMGKSKFCLCPSGYEVASPRLVESINTGCVPVIVSDNYQLPFSD 305

Query: 309 AIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFH 368
            + W +  + +  + + ++ TIL S+P    L+  R +    ++R  +   PA+P D FH
Sbjct: 306 VLDWSKFSLHIPSKRISEIKTILKSVPHARYLKLHRRVLK--VQRHFVLNPPAKPFDVFH 363

Query: 369 QILNGL 374
            IL+ +
Sbjct: 364 MILHSI 369


>gi|159466264|ref|XP_001691329.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158279301|gb|EDP05062.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 815

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 104/375 (27%), Positives = 160/375 (42%), Gaps = 82/375 (21%)

Query: 42  VYVYELPSKYNKKLL-LKDPR--CLTHMFAA-------------EIFMHRFLLSSPVRTL 85
           +YVY+LP+ YN ++L  ++ R  C    F +             E+ MH  LL S  RT 
Sbjct: 370 IYVYDLPAAYNSRMLQYRNDRGMCTWRAFGSRNHTETFAWTYGLEVLMHEMLLQSEHRTF 429

Query: 86  NPEEADWFYTPIYPTCDL-------------TPTGLPLPFKSPRMMRSAIQLISSNWPYW 132
           +PE AD+FY P+Y +C +              P+G P       MM     LI  ++PYW
Sbjct: 430 DPEAADFFYVPVYGSCFIFPLHCYADGPWWYAPSG-PRVMHVTNMMLEVRDLIRKHFPYW 488

Query: 133 NRTEGADHFFVVPHDFGACF-------------HYQEEKAIERG-------------ILP 166
           +R  G DH +++ HD GAC+             H+    +  +              + P
Sbjct: 489 DRRGGRDHIWLMTHDEGACYAPSDIYGSSIFLTHWGRRDSDHKSNTAFTPDNYTQEYVHP 548

Query: 167 LLQRATLVQTFGQRNHVCLNEG-SITIPPYAPPQKMQAHQI--PPDTPRSIFVYFRGLFY 223
           +  +  L    G   H C   G  + +P    P       +   P  PR I +Y RG   
Sbjct: 549 MQDKGWLHLIEG---HPCYTPGKDLIVPALKLPHHFSGSPLLFHPPRPRDILLYLRG--- 602

Query: 224 DVNND--PEGGYYARGARAAV--------WENFKNNPLFDISTDHPTTYYEDMQRAIFCL 273
           DV     P    Y+RG R  +        W+N K N +    +D P  Y E +  + FC+
Sbjct: 603 DVGKHRLPN---YSRGIRQRLYKLWKDHDWQN-KYNAMIGDGSDVPGGYSEHLASSKFCV 658

Query: 274 CPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV--AEEDVPKLDTIL 331
              G   WS RL +AV+ GC+PVI+ D++   F + + +    + V  AE ++  L   L
Sbjct: 659 VAPG-DGWSARLEDAVLHGCVPVIVMDNVSAVFEEQLDFNSFSIRVGEAEAELAYLPERL 717

Query: 332 TSIPPEVILRKQRLL 346
            S+PP ++   Q+ L
Sbjct: 718 KSVPPRILEGMQKKL 732


>gi|212721192|ref|NP_001132213.1| exostosin-like protein [Zea mays]
 gi|194693780|gb|ACF80974.1| unknown [Zea mays]
 gi|413935957|gb|AFW70508.1| exostosin-like protein [Zea mays]
          Length = 403

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 100/350 (28%), Positives = 159/350 (45%), Gaps = 52/350 (14%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
            KVY+Y  P   + K   + PR LT  +A+E +  + +  S  RT +P++A  F+ PI P
Sbjct: 82  FKVYIY--PDG-DPKTFYQTPRKLTGKYASEGYFFQNIRESRFRTDDPDQAHLFFVPISP 138

Query: 100 TCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKA 159
              +   G         +++  ++ + + +PYWNRT GADHFFV  HD G        +A
Sbjct: 139 H-KMRGKGTSYE-NMTVIVKDYVEGLINKYPYWNRTLGADHFFVTCHDVGV-------RA 189

Query: 160 IERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY---APPQKMQAHQIPPDTPRSIFV 216
            E   LP + + ++         VC    ++   P+   A PQ +Q   +P         
Sbjct: 190 FEG--LPFMVKNSI-------RVVCSPSYNVDFIPHKDIALPQVLQPFALPEGGN----- 235

Query: 217 YFRGLFYDVNNDPEGGYYA-------RGARAAVWEN-----FKNNPLFDISTDHPTTYYE 264
                  D+ N    G++A       R   A VWEN       NN +     +    Y +
Sbjct: 236 -------DIENRTTLGFWAGHRNSKIRVILARVWENDTELAISNNRISRAIGE--LVYQK 286

Query: 265 DMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDV 324
              R  FC+CP G    S R+ +++ +GC+PVI++D   LPF DA+ W +  + + E DV
Sbjct: 287 QFYRTKFCICPGGSQVNSARISDSIHYGCVPVILSDYYDLPFNDALDWRKFAVILRERDV 346

Query: 325 PKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
            +L  IL SI  E  +   + L    +++  ++  P    DAFH I+  L
Sbjct: 347 YQLKNILKSISQEEFVSLHKSLVQ--VQKHFVWHSPPVSYDAFHMIMYEL 394


>gi|359484716|ref|XP_002263848.2| PREDICTED: probable glycosyltransferase At5g20260-like [Vitis
           vinifera]
 gi|296084516|emb|CBI25537.3| unnamed protein product [Vitis vinifera]
          Length = 477

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 85/304 (27%), Positives = 137/304 (45%), Gaps = 28/304 (9%)

Query: 80  SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSP---------RMMRSAIQLISSNWP 130
           SP    +P+EA  F+ P+  +       L LP  SP         R++   +++++  +P
Sbjct: 182 SPFIAKHPDEAHAFFLPL--SVVKVVQFLYLPITSPEDYSRKRLQRIVTDYVKVVADKYP 239

Query: 131 YWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSI 190
           YWNR+ GADHF V  HD+     Y   +   +  + +L  A   + F     V L E ++
Sbjct: 240 YWNRSGGADHFMVSCHDWAPSVSYANPELF-KNFIRVLCNANSSEGFRPGRDVSLPEVNL 298

Query: 191 TIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNP 250
                 PP   Q     P   R +  +F G  +            R      W++  N  
Sbjct: 299 PAGELGPPHLGQ-----PSNNRPVLAFFAGRAHG---------NIRKILFEHWKDQDNEV 344

Query: 251 LFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAI 310
           L          Y + M ++ FCLCP G+   SPR+VEA+  GC+PVII+++  LPF D +
Sbjct: 345 LVHERLHKGQNYAKLMGQSKFCLCPSGYEVASPRVVEAIHAGCVPVIISNNYSLPFNDVL 404

Query: 311 PWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQI 370
            W +  + +    +P++ TIL  I     L+ Q  +    ++R  +  +PA+P D  H I
Sbjct: 405 DWSQFSIQIPVAKIPEIKTILLGISKNKYLKMQERVLR--VRRHFVLNRPARPFDIIHMI 462

Query: 371 LNGL 374
           L+ L
Sbjct: 463 LHSL 466


>gi|195624236|gb|ACG33948.1| exostosin-like [Zea mays]
          Length = 403

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 100/350 (28%), Positives = 159/350 (45%), Gaps = 52/350 (14%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
            KVY+Y  P   + K   + PR LT  +A+E +  + +  S  RT +P++A  F+ PI P
Sbjct: 82  FKVYIY--PDG-DPKTFYQTPRKLTGKYASEGYFFQNIRESRFRTDDPDQAHLFFVPISP 138

Query: 100 TCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKA 159
              +   G         +++  ++ + + +PYWNRT GADHFFV  HD G        +A
Sbjct: 139 H-KMRGKGTSYE-NMTVIVKDYVEGLINKYPYWNRTLGADHFFVTCHDVGV-------RA 189

Query: 160 IERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY---APPQKMQAHQIPPDTPRSIFV 216
            E   LP + + ++         VC    ++   P+   A PQ +Q   +P         
Sbjct: 190 FEG--LPFMVKNSI-------RVVCSPSYNVDFIPHKDIALPQVLQPFALPEGGN----- 235

Query: 217 YFRGLFYDVNNDPEGGYYA-------RGARAAVWEN-----FKNNPLFDISTDHPTTYYE 264
                  D+ N    G++A       R   A VWEN       NN +     +    Y +
Sbjct: 236 -------DIENRTTLGFWAGHRNSKIRVILARVWENDTELAISNNRISRAIGE--LVYQK 286

Query: 265 DMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDV 324
              R  FC+CP G    S R+ +++ +GC+PVI++D   LPF DA+ W +  + + E DV
Sbjct: 287 QFYRTKFCICPGGSQVNSARISDSIHYGCVPVILSDYYDLPFNDALDWRKFAVILRERDV 346

Query: 325 PKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
            +L  IL SI  E  +   + L    +++  ++  P    DAFH I+  L
Sbjct: 347 YQLKNILKSISQEEFVSLHKSLVQ--VQKHFVWHSPPVSYDAFHMIMYEL 394


>gi|356570806|ref|XP_003553575.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
           max]
          Length = 537

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 93/356 (26%), Positives = 153/356 (42%), Gaps = 58/356 (16%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP-- 96
           + KV+VYE      +  +  +  C +       F+H   ++   RT +PEEA  F+ P  
Sbjct: 207 QFKVFVYE----EGEPPVFHNGPCKSIYSMEGNFIHAIEMNDQFRTRDPEEAHVFFLPFS 262

Query: 97  ---------IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHD 147
                    +  + D  P          + +   + +I   +PYWNR+ GADHF++  HD
Sbjct: 263 VAMLVQFVYVRDSHDFGPI--------KKTVTDYVNVIGGRYPYWNRSLGADHFYLACHD 314

Query: 148 FGACFHYQEEKAI---ERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAH 204
           +G     +  ++I    +  + +L  A   + F     V             P   +Q  
Sbjct: 315 WGP----ETSRSIPNLNKNSIRVLCNANTSEGFKPSKDVSF-----------PEINLQTG 359

Query: 205 QI------PPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDH 258
            I      P  + R +  +F G  +            R      WEN   +         
Sbjct: 360 SINGFIGGPSASRRPLLAFFAGGLHGP---------IRPVLLEHWENKDEDIQVHKYLPK 410

Query: 259 PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
             +YYE ++++ FCLCP G+   SPR+VEA+  GC+PV+I+D  V PF D + W+   + 
Sbjct: 411 GVSYYEMLRKSKFCLCPSGYEVASPRVVEAIYTGCVPVLISDHYVPPFNDVLNWKSFSVE 470

Query: 319 VAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           V+ +D+P+L  IL SI P   +R QR +    ++R      P +  D FH IL+ +
Sbjct: 471 VSVKDIPRLKEILLSISPRQYIRMQRRVG--QVRRHFEVHSPPKRYDVFHMILHSV 524


>gi|326519558|dbj|BAK00152.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 564

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 83/266 (31%), Positives = 126/266 (47%), Gaps = 30/266 (11%)

Query: 118 MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTF 177
           +++ I +IS+ +PYWNRT+GADHFFV  HD+G       ++  +  I  L         F
Sbjct: 307 VKNYIDMISAKFPYWNRTKGADHFFVACHDWGPYTTKLHDELRKNTIKALCNADVSEGVF 366

Query: 178 GQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRG---------LFYDVNND 228
            +   V L E  I     +  + ++     P   RSI  +F G         L Y    D
Sbjct: 367 IRGRDVSLPETYIK----SARRPVRDIGGKPAAERSILAFFAGQMHGRVRPVLKYWGGKD 422

Query: 229 PEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEA 288
            +   Y+R  R                      Y + M+ + +C+CP+G+   SPR+VEA
Sbjct: 423 TDMRIYSRIPRQITRR---------------MNYAKHMKSSKYCICPMGYEVNSPRIVEA 467

Query: 289 VVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLAN 348
           + + C+PVIIAD+ VLPF DA+ W    + VAE+DVPKL  IL +IP    +  +  +  
Sbjct: 468 IYYECVPVIIADNFVLPFDDALDWSAFSVVVAEKDVPKLKAILLAIPESRYITMRSNVKK 527

Query: 349 PSMKRAMLFPQPAQPGDAFHQILNGL 374
             ++R  L+       D FH IL+ +
Sbjct: 528 --VQRHFLWHAKPVKYDIFHMILHSV 551


>gi|218187361|gb|EEC69788.1| hypothetical protein OsI_00072 [Oryza sativa Indica Group]
          Length = 536

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 80/259 (30%), Positives = 127/259 (49%), Gaps = 15/259 (5%)

Query: 118 MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ-T 176
           ++  I  IS+ +PYWNRT+GADHFFV  HD+G  +  +    + +  +  L  A L +  
Sbjct: 280 VKKYIDFISTKFPYWNRTKGADHFFVACHDWGP-YTTKLHDELRKNTIKALCNADLSEGV 338

Query: 177 FGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYA- 235
           F     V L E  +     +P + ++     P   RSI  +F G  +         Y+  
Sbjct: 339 FIHGRDVSLPETFLR----SPRRPLRGIGGKPAAERSILAFFAGQMHGRVRPVLLQYWGG 394

Query: 236 RGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIP 295
           + A   +++   +             Y + M+ + +C+CP+G+   SPR+VEA+ + C+P
Sbjct: 395 KDADMRIYDRLPHR------ITRRMNYIQHMKSSKYCICPMGYEVNSPRIVEAIYYECVP 448

Query: 296 VIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAM 355
           VIIAD+ VLPF DA  W    + + E+DVPKL  IL  IP +  +  Q  +     K  +
Sbjct: 449 VIIADNFVLPFDDAFNWSAFSVVILEKDVPKLKQILLEIPDDQYMAMQSNVQR-VQKHFI 507

Query: 356 LFPQPAQPGDAFHQILNGL 374
             P P +  D FH IL+ +
Sbjct: 508 WHPNPIK-YDIFHMILHSI 525


>gi|359496494|ref|XP_002269459.2| PREDICTED: probable glycosyltransferase At5g03795-like [Vitis
           vinifera]
          Length = 554

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 93/306 (30%), Positives = 146/306 (47%), Gaps = 26/306 (8%)

Query: 84  TLNPEEADWFYTPIYPTCDLTPTGLPLPFKS-----PRMMRSAIQLISSNWPYWNRTEGA 138
           T +P++A  FY P Y    L  T L +P           +R  + +I++ +P+WNRT G+
Sbjct: 243 TRDPKKAHLFYLP-YSARQLE-TALYVPNSHNIRPLSIFLRDHVNMIAAKYPFWNRTHGS 300

Query: 139 DHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ-TFGQRNHVCLNEGSITIPPYAP 197
           DHF V  HD+G  +   E + + R  +  L  A L +  F     V L E +I   P  P
Sbjct: 301 DHFLVACHDWGP-YTVNEHQELSRNTIKALCNADLSEGIFVAGKDVSLPETTIR-NPRRP 358

Query: 198 PQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYA-RGARAAVWENFKNNPLFDIST 256
            + +   ++   + R I  +F G  +         Y++ +     ++    N        
Sbjct: 359 LRNVGGRRV---SQRPILAFFAGNMHGRVRPTLLKYWSDKDEDMRIYGPLPNR------I 409

Query: 257 DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG 316
               +Y + M+ + FC+CP+G+   SPR+VEA+ + C+PVIIAD+ V P  D + W    
Sbjct: 410 SRKMSYIQHMKSSRFCICPMGYEVNSPRIVEAIYYECVPVIIADNFVPPLNDVLDWTAFS 469

Query: 317 MFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSM--KRAMLFPQPAQPGDAFHQILNGL 374
           + VAE+D+PKL  IL +IP    L  Q    N  M  K  +  P+P +  D FH IL+ +
Sbjct: 470 VIVAEKDIPKLKEILLAIPLRRYLVMQ---TNVKMVQKHFLWNPKPVR-YDLFHMILHSI 525

Query: 375 ARKLPH 380
              L H
Sbjct: 526 WVVLKH 531


>gi|357485421|ref|XP_003612998.1| Exostosin-like protein [Medicago truncatula]
 gi|355514333|gb|AES95956.1| Exostosin-like protein [Medicago truncatula]
          Length = 415

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 101/352 (28%), Positives = 158/352 (44%), Gaps = 54/352 (15%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
           K KVY+Y  P   + K   + PR LT  +A+E +  + +  S  RTL+P+EA  F+ PI 
Sbjct: 93  KFKVYIY--PDG-DSKTFYQTPRKLTGKYASEGYFFQNIRESRFRTLDPDEAHLFFIPI- 148

Query: 99  PTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
            +C               ++++ ++ + S +PYWNRT GADHFFV  HD G        +
Sbjct: 149 -SCHKMRGKGTSYENMTIIVQNYVESLISKYPYWNRTLGADHFFVTCHDVGV-------R 200

Query: 159 AIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY---APPQKMQAHQIPPDTPRSIF 215
           A E   LPLL + ++         VC     +   P+   A PQ +Q   +P        
Sbjct: 201 ATEG--LPLLVKNSI-------RAVCSPSYDVGFIPHKDVALPQVLQPFALPAGGN---- 247

Query: 216 VYFRGLFYDVNNDPEGGYYA-------RGARAAVWENFKNNPLFDISTDHPT------TY 262
                   DV N    G++A       R   A VWEN   +   DIS +  +       Y
Sbjct: 248 --------DVENRTSLGFWAGHRNSKIRVILARVWEN---DTELDISNNRISRATGHLVY 296

Query: 263 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
            +      FC+CP G    S R+ +++ +GCIPVI+++   LPF D + W +  + + E 
Sbjct: 297 QKRFYSTKFCICPGGSQVNSARIADSIHYGCIPVILSNYYDLPFNDILDWRKFAVVLKES 356

Query: 323 DVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           DV +L  IL +   +  +     L    +++   +  P    DAFH ++  L
Sbjct: 357 DVYQLKQILKNKSQDEFVALHNNLV--KIQKHFQWNSPPVRYDAFHMVMYDL 406


>gi|224143179|ref|XP_002324871.1| predicted protein [Populus trichocarpa]
 gi|222866305|gb|EEF03436.1| predicted protein [Populus trichocarpa]
          Length = 334

 Score =  122 bits (305), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 88/301 (29%), Positives = 140/301 (46%), Gaps = 30/301 (9%)

Query: 83  RTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMR---SAIQLISSNWPYWNRTEGAD 139
           +T +P  A  ++ P   T  +     PL +    M +     ++++SS +P+WNRT+GAD
Sbjct: 45  KTRDPRRAHVYFMPFSVTWMVKYLYKPLTYDHTAMKQFVADYVRVVSSKYPFWNRTQGAD 104

Query: 140 HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSI---TIPP-- 194
           HF +  HD+G    +         I  +L  A   + F  R  V L E  +    +PP  
Sbjct: 105 HFMLSCHDWGPHASHGNPFLYNTSIR-VLCNANSSEGFSPRKDVSLPEIHLYGGNVPPKL 163

Query: 195 YAPPQKMQAHQIPPDTPRSIFVYFRGLFYD-VNNDPEGGYYARGARAAVWENFKNNPLFD 253
            +PP        P  +PR    +F G  +  +       +  R     V+E    +    
Sbjct: 164 ISPP--------PATSPRPYLAFFSGGLHGPIRPILLDHWKGRDPDLQVYEYLPKD---- 211

Query: 254 ISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE 313
                   YY  M R+ FCLCP G    SPR+VEA+   C+PVI++D  VLPF+D + WE
Sbjct: 212 ------LDYYSFMLRSKFCLCPSGHEVASPRIVEAIYAECVPVILSDHYVLPFSDVLRWE 265

Query: 314 EIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNG 373
              + V   ++P+L  +L S+P E   R +  L   ++++  +  QPA+  D FH IL+ 
Sbjct: 266 AFAIQVNVSEIPRLKEVLISVPEERYRRLKEGLR--AIRKHFVLNQPAKRFDVFHMILHS 323

Query: 374 L 374
           +
Sbjct: 324 I 324


>gi|8953375|emb|CAB96648.1| putative protein [Arabidopsis thaliana]
          Length = 336

 Score =  122 bits (305), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 89/301 (29%), Positives = 138/301 (45%), Gaps = 23/301 (7%)

Query: 79  SSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPF----KSPRMMRSAIQLISSNWPYWNR 134
           +S  +  +PEEA  FY P+     +     P       +   +++  I LIS+ +PYWNR
Sbjct: 41  NSRFKAASPEEATVFYIPVGIVNIIRFVYRPYTSYARDRLQNIVKDYISLISNRYPYWNR 100

Query: 135 TEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPP 194
           + GADHFF+  HD+       + +  +  I  L   A   + F     V L E  I IP 
Sbjct: 101 SRGADHFFLSCHDWAPDVSAVDPELYKHFIRALCN-ANSSEGFTPMRDVSLPE--INIPH 157

Query: 195 YAPPQKMQAHQIPPDTPRSIFVYFRGLFY-DVNNDPEGGYYARGARAAVWENFKNNPLFD 253
               Q    H   P   R +  +F G  + DV       +  +     V+EN        
Sbjct: 158 ---SQLGFVHTGEPPQNRKLLAFFAGGSHGDVRKILFQHWKEKDKDVLVYENLPKT---- 210

Query: 254 ISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE 313
                   Y + M +A FCLCP GW   SPR+VE++  GC+PVIIAD  VLPF+D + W+
Sbjct: 211 ------MNYTKMMDKAKFCLCPSGWEVASPRIVESLYSGCVPVIIADYYVLPFSDVLNWK 264

Query: 314 EIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNG 373
              + +    +P +  IL +I  E  L  QR +    +++  +  +P++P D  H I++ 
Sbjct: 265 TFSVHIPISKMPDIKKILEAITEEEYLNMQRRVL--EVRKHFVINRPSKPYDMLHMIMHS 322

Query: 374 L 374
           +
Sbjct: 323 I 323


>gi|334187609|ref|NP_196674.2| putative glycosyltransferase [Arabidopsis thaliana]
 gi|292630779|sp|Q9LFP3.2|GLYT4_ARATH RecName: Full=Probable glycosyltransferase At5g11130
 gi|332004254|gb|AED91637.1| putative glycosyltransferase [Arabidopsis thaliana]
          Length = 480

 Score =  122 bits (305), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 89/301 (29%), Positives = 138/301 (45%), Gaps = 23/301 (7%)

Query: 79  SSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPF----KSPRMMRSAIQLISSNWPYWNR 134
           +S  +  +PEEA  FY P+     +     P       +   +++  I LIS+ +PYWNR
Sbjct: 185 NSRFKAASPEEATVFYIPVGIVNIIRFVYRPYTSYARDRLQNIVKDYISLISNRYPYWNR 244

Query: 135 TEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPP 194
           + GADHFF+  HD+       + +  +  I  L   A   + F     V L E  I IP 
Sbjct: 245 SRGADHFFLSCHDWAPDVSAVDPELYKHFIRALCN-ANSSEGFTPMRDVSLPE--INIPH 301

Query: 195 YAPPQKMQAHQIPPDTPRSIFVYFRGLFY-DVNNDPEGGYYARGARAAVWENFKNNPLFD 253
               Q    H   P   R +  +F G  + DV       +  +     V+EN        
Sbjct: 302 ---SQLGFVHTGEPPQNRKLLAFFAGGSHGDVRKILFQHWKEKDKDVLVYENLPKT---- 354

Query: 254 ISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE 313
                   Y + M +A FCLCP GW   SPR+VE++  GC+PVIIAD  VLPF+D + W+
Sbjct: 355 ------MNYTKMMDKAKFCLCPSGWEVASPRIVESLYSGCVPVIIADYYVLPFSDVLNWK 408

Query: 314 EIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNG 373
              + +    +P +  IL +I  E  L  QR +    +++  +  +P++P D  H I++ 
Sbjct: 409 TFSVHIPISKMPDIKKILEAITEEEYLNMQRRVL--EVRKHFVINRPSKPYDMLHMIMHS 466

Query: 374 L 374
           +
Sbjct: 467 I 467


>gi|357119191|ref|XP_003561329.1| PREDICTED: probable glycosyltransferase At5g03795-like
           [Brachypodium distachyon]
          Length = 569

 Score =  122 bits (305), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 95/347 (27%), Positives = 153/347 (44%), Gaps = 43/347 (12%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
            K+YVY+      +  +  D  C         F+H   + + +RT +P  A  F+ P   
Sbjct: 237 FKIYVYD----EGEPPIYHDGPCHNIYSTEGRFIHAMEMENRMRTTDPGLAHVFFLPFSI 292

Query: 100 TCDLTPTGLPLPFKSPRMMRSA---IQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQE 156
                   +P       + R+    I ++S+  PYWNR++GADHF +  HD+G      +
Sbjct: 293 AKMEKTIYVPGSHTMEPLRRTVFDYIDVLSTKHPYWNRSQGADHFMLSCHDWGPYVSSVD 352

Query: 157 EKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPD------T 210
                  I  L    T             +EG I     + P+    +    D      +
Sbjct: 353 GNLFSNSIRVLCNANT-------------SEGFIPSKDVSLPEINHLNDFKKDIGGPSAS 399

Query: 211 PRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN-NPLFDISTDHP--TTYYEDMQ 267
            R I  +F G     N+ P         R  + +++K  +P   +S   P   +Y E M+
Sbjct: 400 GRPILAFFAG----GNHGP--------VRPLLLKHWKGKDPDVQVSEYLPAGVSYVETMR 447

Query: 268 RAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKL 327
           R+ FCLCP G+   SPR+ EA+   C+PV+IADD VLPF+D + W    + VA  D+P +
Sbjct: 448 RSKFCLCPSGFEVASPRVAEAIYVECVPVVIADDYVLPFSDVLSWPAFSLRVAVRDIPDI 507

Query: 328 DTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
             IL+++ P   +R QR +   +++R  +     Q  D FH IL+ +
Sbjct: 508 KRILSAVSPRRYIRMQRRVR--AVRRHFMLNGVPQRYDVFHMILHSI 552


>gi|102139904|gb|ABF70053.1| exostosin family protein [Musa acuminata]
          Length = 585

 Score =  122 bits (305), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 99/342 (28%), Positives = 162/342 (47%), Gaps = 26/342 (7%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEI-FMHRFLLSSPVRTLNPEEADWFYTPIY 98
           LKVY+YE       K L   P  L  ++A+E  FM     ++     +P +A  FY P Y
Sbjct: 253 LKVYIYE----DGPKPLCHTPH-LDGIYASEGWFMKLMEENTQFVVKDPNKAHLFYLP-Y 306

Query: 99  PTCDLTPTGLPLPFKSPR----MMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHY 154
            +  L         +S +     +R  +  I++ +P+WNRT GADHF V  HD+ A +  
Sbjct: 307 SSRQLRTHLYVAGSRSMQPLSIFLRDYVNSIAAKYPFWNRTRGADHFLVACHDW-ATYTT 365

Query: 155 QEEKAIERGILPLLQRATLVQ-TFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRS 213
              + + +  + ++  A + +  F +   V L E  +  P    P+K    +  P + RS
Sbjct: 366 NLHEDLRKNTIKVVCNADVSEGVFVRGKDVSLAETYVRTP--NSPRKAIGGR--PASRRS 421

Query: 214 IFVYFRGLFYD-VNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFC 272
           I  +F G  +  V       +  R     ++E   +     +S      Y E M+ + FC
Sbjct: 422 ILAFFAGQMHGRVRPILLRHWRGRDRDMRIYEVLPDEIAAKMS------YIEHMKSSKFC 475

Query: 273 LCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILT 332
           +CP+G+   SPR+VEA+ + C+PVIIA++ VLPF + + W    + VAE+D+PKL  IL 
Sbjct: 476 ICPMGYEVNSPRIVEAIYYDCVPVIIANNFVLPFEEVLDWGAFSVVVAEKDIPKLKQILL 535

Query: 333 SIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
            I     +R Q  +    +++  L+       D FH IL+ +
Sbjct: 536 GISGRRYVRMQTNVRR--LRKHFLWNDKPVKYDLFHMILHSI 575


>gi|359476148|ref|XP_002285239.2| PREDICTED: probable glycosyltransferase At5g11130 [Vitis vinifera]
          Length = 472

 Score =  121 bits (304), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 92/307 (29%), Positives = 139/307 (45%), Gaps = 32/307 (10%)

Query: 79  SSPVRTLNPEEADWFYTPI-------YPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPY 131
           + P    +P+EA  FY PI       Y     T +      +   ++   I +IS  +P+
Sbjct: 176 NCPFSARHPDEAMLFYIPISVVNIIRYVYQPYTNSASYSRVRLQGLVADYIDVISRKYPF 235

Query: 132 WNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSIT 191
           WNR+ GADHF V  HD+         K   R  + +L  A   + F     V L E  + 
Sbjct: 236 WNRSSGADHFMVSCHDWAPEISAANPKFF-RHFIRVLCNANTSEGFKPVRDVSLPEILVP 294

Query: 192 IPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPL 251
                PP   Q     P T RSI  +F G              A G   ++  ++     
Sbjct: 295 YRMLGPPYLGQ-----PPTNRSILAFFAGG-------------AHGKVRSILFHYWKEKD 336

Query: 252 FDISTDH--PTT--YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFA 307
            DI      PTT  Y E M R+ FCLCP G+   SPR+VE++  GC+PVII+D+  LPF+
Sbjct: 337 EDIQVHEYLPTTLNYTELMGRSKFCLCPSGFEVASPRVVESIYAGCVPVIISDNYSLPFS 396

Query: 308 DAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAF 367
           D + W +  + +    +P+  TIL +IP +  L KQ+ +    ++R     +PA+  D  
Sbjct: 397 DVLDWSQFSVHIPIARIPETKTILQAIPIQEYLTKQKTVMQ--VQRHFTLNRPAKRFDVL 454

Query: 368 HQILNGL 374
           H +L+ +
Sbjct: 455 HMVLHSI 461


>gi|30683790|ref|NP_567512.2| Exostosin family protein [Arabidopsis thaliana]
 gi|19347795|gb|AAL86348.1| unknown protein [Arabidopsis thaliana]
 gi|26983908|gb|AAN86206.1| unknown protein [Arabidopsis thaliana]
 gi|332658395|gb|AEE83795.1| Exostosin family protein [Arabidopsis thaliana]
          Length = 542

 Score =  121 bits (304), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 102/346 (29%), Positives = 162/346 (46%), Gaps = 33/346 (9%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP-VRTLNPEEADWFYTPIY 98
           LKVY+Y  P     K +  +P  L  ++A+E +  + + S+    T NPE A  FY P Y
Sbjct: 200 LKVYIY--PD--GDKPIFHEPH-LNGIYASEGWFMKLMESNKQFVTKNPERAHLFYMP-Y 253

Query: 99  PTCDLTPTGL--------PLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGA 150
               L  +          PL       +R  + ++S  +P+WNRT G+DHF V  HD+G 
Sbjct: 254 SVKQLQKSIFVPGSHNIKPLSI----FLRDYVNMLSIKYPFWNRTHGSDHFLVACHDWGP 309

Query: 151 CFHYQEEKAIERGILPLLQRATLVQ-TFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPD 209
            +   E   ++R  +  L  A L    F     V L E SI             +++   
Sbjct: 310 -YTVNEHPELKRNAIKALCNADLSDGIFVPGKDVSLPETSIRNAGRPLRNIGNGNRV--- 365

Query: 210 TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRA 269
           + R I  +F G  +     P+   + R       E+ K       +     TY + M+ +
Sbjct: 366 SQRPILAFFAGNLHG-RVRPKLLKHWRNKD----EDMKIYGPLPHNVARKMTYVQHMKSS 420

Query: 270 IFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDT 329
            +CLCP+G+   SPR+VEA+ + C+PV+IAD+ +LPF+D + W    + V E+++P+L  
Sbjct: 421 KYCLCPMGYEVNSPRIVEAIYYECVPVVIADNFMLPFSDVLDWSAFSVVVPEKEIPRLKE 480

Query: 330 ILTSIPPEVILRKQRLLANPSM-KRAMLFPQPAQPGDAFHQILNGL 374
           IL  IP    L+ Q   +N  M +R  L+    +  D FH IL+ +
Sbjct: 481 ILLEIPMRRYLKMQ---SNVKMVQRHFLWSPKPRKYDVFHMILHSI 523


>gi|296082060|emb|CBI21065.3| unnamed protein product [Vitis vinifera]
          Length = 402

 Score =  121 bits (303), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 92/307 (29%), Positives = 139/307 (45%), Gaps = 32/307 (10%)

Query: 79  SSPVRTLNPEEADWFYTPI-------YPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPY 131
           + P    +P+EA  FY PI       Y     T +      +   ++   I +IS  +P+
Sbjct: 106 NCPFSARHPDEAMLFYIPISVVNIIRYVYQPYTNSASYSRVRLQGLVADYIDVISRKYPF 165

Query: 132 WNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSIT 191
           WNR+ GADHF V  HD+         K   R  + +L  A   + F     V L E  + 
Sbjct: 166 WNRSSGADHFMVSCHDWAPEISAANPKFF-RHFIRVLCNANTSEGFKPVRDVSLPEILVP 224

Query: 192 IPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPL 251
                PP   Q     P T RSI  +F G              A G   ++  ++     
Sbjct: 225 YRMLGPPYLGQ-----PPTNRSILAFFAGG-------------AHGKVRSILFHYWKEKD 266

Query: 252 FDISTDH--PTT--YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFA 307
            DI      PTT  Y E M R+ FCLCP G+   SPR+VE++  GC+PVII+D+  LPF+
Sbjct: 267 EDIQVHEYLPTTLNYTELMGRSKFCLCPSGFEVASPRVVESIYAGCVPVIISDNYSLPFS 326

Query: 308 DAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAF 367
           D + W +  + +    +P+  TIL +IP +  L KQ+ +    ++R     +PA+  D  
Sbjct: 327 DVLDWSQFSVHIPIARIPETKTILQAIPIQEYLTKQKTVMQ--VQRHFTLNRPAKRFDVL 384

Query: 368 HQILNGL 374
           H +L+ +
Sbjct: 385 HMVLHSI 391


>gi|296083497|emb|CBI23466.3| unnamed protein product [Vitis vinifera]
          Length = 585

 Score =  121 bits (303), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 91/300 (30%), Positives = 144/300 (48%), Gaps = 26/300 (8%)

Query: 84  TLNPEEADWFYTPIYPTCDLTPTGLPLPFKS-----PRMMRSAIQLISSNWPYWNRTEGA 138
           T +P++A  FY P Y    L  T L +P           +R  + +I++ +P+WNRT G+
Sbjct: 243 TRDPKKAHLFYLP-YSARQLE-TALYVPNSHNIRPLSIFLRDHVNMIAAKYPFWNRTHGS 300

Query: 139 DHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ-TFGQRNHVCLNEGSITIPPYAP 197
           DHF V  HD+G  +   E + + R  +  L  A L +  F     V L E +I   P  P
Sbjct: 301 DHFLVACHDWGP-YTVNEHQELSRNTIKALCNADLSEGIFVAGKDVSLPETTIR-NPRRP 358

Query: 198 PQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYA-RGARAAVWENFKNNPLFDIST 256
            + +   ++   + R I  +F G  +         Y++ +     ++    N        
Sbjct: 359 LRNVGGRRV---SQRPILAFFAGNMHGRVRPTLLKYWSDKDEDMRIYGPLPNR------I 409

Query: 257 DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG 316
               +Y + M+ + FC+CP+G+   SPR+VEA+ + C+PVIIAD+ V P  D + W    
Sbjct: 410 SRKMSYIQHMKSSRFCICPMGYEVNSPRIVEAIYYECVPVIIADNFVPPLNDVLDWTAFS 469

Query: 317 MFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSM--KRAMLFPQPAQPGDAFHQILNGL 374
           + VAE+D+PKL  IL +IP    L  Q    N  M  K  +  P+P +  D FH IL+ +
Sbjct: 470 VIVAEKDIPKLKEILLAIPLRRYLVMQ---TNVKMVQKHFLWNPKPVR-YDLFHMILHSI 525


>gi|242064274|ref|XP_002453426.1| hypothetical protein SORBIDRAFT_04g005890 [Sorghum bicolor]
 gi|241933257|gb|EES06402.1| hypothetical protein SORBIDRAFT_04g005890 [Sorghum bicolor]
          Length = 403

 Score =  121 bits (303), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 99/350 (28%), Positives = 159/350 (45%), Gaps = 52/350 (14%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
            KVY+Y  P   + K   + PR LT  +A+E +  + +  S  RT +P++A  F+ PI P
Sbjct: 82  FKVYIY--PDG-DPKTFYQTPRKLTGKYASEGYFFQNIRESRFRTDDPDQAHLFFVPISP 138

Query: 100 TCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKA 159
              +   G         +++  ++ + + +PYWNRT GADHFFV  HD G        +A
Sbjct: 139 H-KMRGKGTTYE-NMTVIVKDYVEGLINKYPYWNRTLGADHFFVTCHDVGV-------RA 189

Query: 160 IERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY---APPQKMQAHQIPPDTPRSIFV 216
            E   LP + + ++         VC    ++   P+   A PQ +Q   +P         
Sbjct: 190 FEG--LPFMVKNSI-------RVVCSPSYNVDFIPHKDIALPQVLQPFALPEGGN----- 235

Query: 217 YFRGLFYDVNNDPEGGYYA-------RGARAAVWEN-----FKNNPLFDISTDHPTTYYE 264
                  D+ N    G++A       R   A VWEN       NN +     +    Y +
Sbjct: 236 -------DIENRTILGFWAGHRNSKIRVILARVWENDTELAISNNRISRAIGE--LVYQK 286

Query: 265 DMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDV 324
              R  FC+CP G    S R+ +++ +GC+PVI++D   LPF DA+ W +  + + E DV
Sbjct: 287 QFYRTKFCICPGGSQVNSARISDSIHYGCVPVILSDYYDLPFNDALDWRKFAVILRERDV 346

Query: 325 PKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
            +L  IL SI  E  +   + L    +++  ++  P    DAFH ++  L
Sbjct: 347 YQLKNILKSISQEEFVSLHKSLVQ--VQKHFVWHSPPVSYDAFHMVMYEL 394


>gi|227202794|dbj|BAH56870.1| AT4G38040 [Arabidopsis thaliana]
          Length = 407

 Score =  121 bits (303), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 98/312 (31%), Positives = 145/312 (46%), Gaps = 50/312 (16%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
           + KVY+Y  P   +     + PR +T  +A+E +  + +  S  RTL+P+EAD F+ PI 
Sbjct: 103 RFKVYIY--PDG-DPNTFYQTPRKVTGKYASEGYFFQNIRESRFRTLDPDEADLFFIPI- 158

Query: 99  PTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
            +C               ++++ +  + + +PYWNRT GADHFFV  HD G        +
Sbjct: 159 -SCHKMRGKGTSYENMTVIVQNYVDGLIAKYPYWNRTLGADHFFVTCHDVGV-------R 210

Query: 159 AIERGILPLLQRATLVQTFGQRNHVC---LNEGSITIPPYAPPQKMQAHQIPPDTPRSIF 215
           A E    PLL + T+         VC    N G I     A PQ +Q   +P        
Sbjct: 211 AFEGS--PLLIKNTI-------RVVCSPSYNVGFIPHKDVALPQVLQPFALPAGGN---- 257

Query: 216 VYFRGLFYDVNNDPEGGYYA-------RGARAAVWEN-----FKNNPLFDISTDHPTTYY 263
                   DV N    G++A       R   A VWEN       NN + + +T H   Y 
Sbjct: 258 --------DVENRTTLGFWAGHRNSKIRVILAHVWENDTELDISNNRI-NRATGH-LVYQ 307

Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
           +   R  FC+CP G    S R+ +++ +GCIPVI++D   LPF D + W +  + + E+D
Sbjct: 308 KRFYRTKFCICPGGSQVNSARITDSIHYGCIPVILSDYYDLPFNDILNWRKFAVVLREQD 367

Query: 324 VPKLDTILTSIP 335
           V  L  IL +IP
Sbjct: 368 VYNLKQILKNIP 379


>gi|168059393|ref|XP_001781687.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666856|gb|EDQ53500.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 376

 Score =  121 bits (303), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 100/373 (26%), Positives = 172/373 (46%), Gaps = 41/373 (10%)

Query: 22  ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRC-LTH------MFAAE-IFM 73
           +R  G+ G++L    +  L +  YEL  K  K  + KD R  L H      ++A+E  F+
Sbjct: 4   KRNYGTRGNILTRSALVNL-LRSYELMEKVFKVYIYKDGRKPLVHSGPQLGIYASEGQFI 62

Query: 74  HRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLP-----LPFKSPRMMRSAIQLISSN 128
            R   +S   T +P  A  F+ P      +T   +P     LP  +   ++  ++ ++  
Sbjct: 63  ERMEAASEFLTDDPSRAHMFFLPYSVYRMVTHLYVPNSRSMLPLAT--FIKDYVEALARQ 120

Query: 129 WPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG 188
           +PYWNRT+GADHFFV  HD+G     ++   +    + ++  A L + F       L E 
Sbjct: 121 YPYWNRTKGADHFFVSCHDWGPATA-RDHPTLRSNAVKVVCNADLTEEFVVGKDASLPEV 179

Query: 189 SITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN 248
            +       P K+     P    R    +F G  +               R  + +++K+
Sbjct: 180 YMHKSKTKAPIKLGG---PGYDERPYLAFFAGQMHG------------RVRPILLDHWKD 224

Query: 249 -NPLFDISTDHPT------TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 301
            +P   I    P       +Y + M+ + +C+C  G+   SPR+VE++ + C+PVIIAD+
Sbjct: 225 KDPDLMIYGVLPKPIAKQISYVQHMKMSKYCICAAGYEVNSPRIVESIHYDCVPVIIADN 284

Query: 302 IVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPA 361
            VLPF+D + W+   + + E D+PKL  IL  IP +     Q  L    +++  ++ +  
Sbjct: 285 FVLPFSDVLNWDAFSVTMPESDIPKLKAILNDIPEKTYRSMQIRLRK--IRQHFVWHKKP 342

Query: 362 QPGDAFHQILNGL 374
           +  D FH IL+ +
Sbjct: 343 EKYDVFHMILHSV 355


>gi|125581086|gb|EAZ22017.1| hypothetical protein OsJ_05674 [Oryza sativa Japonica Group]
          Length = 408

 Score =  121 bits (303), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 100/354 (28%), Positives = 160/354 (45%), Gaps = 54/354 (15%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
            KVY+Y  P   + K   + PR LT  +A+E +  + +  S  RT +P++A  F+ PI P
Sbjct: 81  FKVYMY--PDG-DPKTFYQTPRKLTGKYASEGYFFQNIRESRFRTGDPDKAHLFFVPISP 137

Query: 100 TCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKA 159
              +   G         +++  ++ + + +PYWNRT GADHFFV  HD G        +A
Sbjct: 138 H-KMRGKGTSYE-NMTIIVKDYVEGLINKYPYWNRTLGADHFFVTCHDVGV-------RA 188

Query: 160 IERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY---APPQKMQAHQIPPDTPRSIFV 216
            E   LP + + ++         VC    ++   P+   A PQ +Q   +P         
Sbjct: 189 FEG--LPFMVKNSI-------RVVCSPSYNVDFIPHKDIALPQVLQPFALPEGGN----- 234

Query: 217 YFRGLFYDVNNDPEGGYYA-------RGARAAVWEN-----FKNNPLFDISTDHPTTYYE 264
                  DV N    G++A       R   A VWEN       NN +     +    Y +
Sbjct: 235 -------DVENRTILGFWAGHRNSKIRVILAQVWENDTELAISNNRISRAIGE--LVYQK 285

Query: 265 DMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDV 324
                 FC+CP G    S R+ +++ +GC+PVI++D   LPF D + W +  + + E DV
Sbjct: 286 QFYHTKFCICPGGSQVNSARISDSIHYGCVPVILSDYYDLPFNDILDWRKFAVVLKERDV 345

Query: 325 PKLDTILTSIPPEVILRKQRLLAN----PSMKRAMLFPQPAQPGDAFHQILNGL 374
            +L +IL SI  E  +   + L      P +++  ++  P  P DAFH ++  L
Sbjct: 346 YQLKSILKSISQEEFVELHKSLVQNISCPQVQKHFVWHSPPLPYDAFHMVMYEL 399


>gi|449443279|ref|XP_004139407.1| PREDICTED: probable glycosyltransferase At3g42180-like [Cucumis
           sativus]
 gi|449494024|ref|XP_004159425.1| PREDICTED: probable glycosyltransferase At3g42180-like [Cucumis
           sativus]
          Length = 447

 Score =  120 bits (302), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 88/303 (29%), Positives = 140/303 (46%), Gaps = 26/303 (8%)

Query: 80  SPVRTLNPEEADWFYTPIYPTCDL----TPTGLPLPFKSPRMMRSA---IQLISSNWPYW 132
           SP R  +P+EA  F  P+  T  +     P   P  +   RM R     I+++++ +PYW
Sbjct: 152 SPFRASHPDEAHVFLLPLSITNIIHFIYRPITSPADYNRDRMHRVTTDYIRVVANRYPYW 211

Query: 133 NRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITI 192
           NR+ GADHF V  HD+           + +  + ++  A + + F     + L E +I  
Sbjct: 212 NRSNGADHFVVSCHDWAPEIS-DANPQLFKNFIRVVCNANITEGFRPNIDIPLPEINIHP 270

Query: 193 PPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGY-YARGARAAVWENFKNNPL 251
               PP   Q    PP+  R I  +F G          G + Y R      W+   N   
Sbjct: 271 GTLGPPDLGQ----PPER-RPILAFFAG----------GAHGYIRKILIKHWKEKDNEVQ 315

Query: 252 FDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
                     Y + +  + FCLCP G+   SPR+VEA+  GC+PVII+D+  LPF+D + 
Sbjct: 316 VHEYLPKTQNYTKLIGESKFCLCPSGYEVASPRVVEAIYGGCVPVIISDNYSLPFSDVLD 375

Query: 312 WEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQIL 371
           W    + +  + +P++ TIL +I  E  L+  + +    +KR     +PA+P D  H +L
Sbjct: 376 WSRFSVQIPVQRIPEIKTILKAISEEKYLKLYKGVIK--VKRHFKINRPAKPFDVIHMLL 433

Query: 372 NGL 374
           + L
Sbjct: 434 HSL 436


>gi|296088784|emb|CBI38234.3| unnamed protein product [Vitis vinifera]
          Length = 326

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 103/354 (29%), Positives = 161/354 (45%), Gaps = 58/354 (16%)

Query: 39  KLKVYVY--ELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP 96
           K KV+VY    P+ Y      + PR LT  +A+E +  + +  S  RT +P++A  F+ P
Sbjct: 4   KFKVFVYPDGDPNTY-----YQTPRKLTGKYASEGYFFQNIRQSQFRTDDPDQAHLFFIP 58

Query: 97  IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQE 156
           I  +C               ++++ +Q + S +PYWNRT GADHFF+  HD G       
Sbjct: 59  I--SCHKMRGKGTSYENMTIIVQNYVQSLMSKYPYWNRTLGADHFFLTCHDVGV------ 110

Query: 157 EKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY---APPQKMQAHQIPPDTPRS 213
            +A E G+  L++ +  V        VC     +   P+   A PQ +Q   +P      
Sbjct: 111 -RATE-GVPLLVKNSIRV--------VCSPSYDVGFIPHKDVALPQVLQPFALPTGGR-- 158

Query: 214 IFVYFRGLFYDVNNDPEGGYYA-------RGARAAVWEN-----FKNNPLFDISTDHPTT 261
                     D+ N    G++A       R   A +WEN      KNN + + +T H   
Sbjct: 159 ----------DIKNRTTLGFWAGHRNSKIRVILARIWENDTELDIKNNRI-NRATGH-LV 206

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y     R  FC+CP G    S R+ +++ +GC+PVI++D   LPF D + W +  + + E
Sbjct: 207 YQNKFYRTKFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDILDWRKFSVILKE 266

Query: 322 EDVPKLDTILTSIP-PEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
            DV +L  IL  IP  E I     L+    +++   +  P    DAFH ++  L
Sbjct: 267 RDVYRLKYILKDIPDAEFIALHDNLV---KVQKHFQWNTPPIKYDAFHMVMYEL 317


>gi|356518346|ref|XP_003527840.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
           max]
          Length = 633

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 98/345 (28%), Positives = 158/345 (45%), Gaps = 37/345 (10%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEI-FMHRFLLSSPVRTLNPEEADWFYTPIY 98
           LKVYVY    +   K ++  P  L+ ++A+E  FM     S    T +P+ A  FY P  
Sbjct: 305 LKVYVY----REGDKAIMHSP-ILSGLYASEGWFMKHMEASKQFVTTDPKNAHLFYLPF- 358

Query: 99  PTCDLTPTGLPLPFKS----PRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHY 154
            +       L +P  S     + + + + +I+    +WNRT GADHF V  HD+      
Sbjct: 359 -SSQRLVDALWVPKSSYGNLIQYLSNYVDMIAGKHHFWNRTGGADHFLVACHDWAPA--- 414

Query: 155 QEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSI 214
            E K      L  L  A + Q F     + L E ++   P  P + +  +Q+   + R  
Sbjct: 415 -ETKQHMAKCLRALCNADVKQGFVFGKDMSLPE-TVVRSPRNPTRSIGGNQV---SKRKT 469

Query: 215 FVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT-----YYEDMQRA 269
             +F G  +          Y R      WEN   +P   I    P +     Y + M+ +
Sbjct: 470 LAFFAGQMHG---------YVRPILLQHWEN--KDPDMKIFGRLPKSKGNRNYIQYMKSS 518

Query: 270 IFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDT 329
            +C+C  G+   SPR+VEA+++ C+PVI++D+ V PF + + WE   +FV E+D+P L  
Sbjct: 519 KYCICAKGYEVNSPRVVEAILYECVPVILSDNFVPPFFEMLNWESFAVFVLEKDIPNLKN 578

Query: 330 ILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           IL SIP +  L+ Q ++     +  +   +     D FH IL+ +
Sbjct: 579 ILLSIPRKRYLQMQ-MMVRKVQQHFLWHNKSPVKYDIFHMILHSI 622


>gi|359481952|ref|XP_002284018.2| PREDICTED: probable glycosyltransferase At5g03795 [Vitis vinifera]
          Length = 546

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 93/347 (26%), Positives = 152/347 (43%), Gaps = 40/347 (11%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP-- 96
           +LKVYVY+      +  L  +  C +       F+H+  + S  RT +PE+A  F+ P  
Sbjct: 213 QLKVYVYD----EGEPPLFHNGPCKSIYSMEGNFIHKMEMDSHFRTKDPEKAHLFFLPFS 268

Query: 97  ---------IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHD 147
                    +  + DL P          + +   + ++S+ +PYWNR+ GADHF +  HD
Sbjct: 269 VAMLVRFVYVRDSHDLGPI--------KQTVIDYVNVVSTKYPYWNRSLGADHFMLACHD 320

Query: 148 FGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIP 207
           +G    +     + +  + +L  A   + F     V   E  I +   +    +     P
Sbjct: 321 WGPETSFSI-PYLHKNSIRVLCNANTSEGFNPSKDVSFPE--INLLTGSTDSFIGG---P 374

Query: 208 PDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQ 267
             + R++  +F G  +            R      WEN   +           +YYE M+
Sbjct: 375 SPSHRTLLAFFAGGLHGP---------IRPILLEHWENKDEDVKVHKYLPKGVSYYEMMR 425

Query: 268 RAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKL 327
           ++ +CLCP G+   SPR+VEA+  GC+PV+I+D  V PF+D + W+   + V   ++P L
Sbjct: 426 KSKYCLCPSGYEVASPRVVEALYTGCVPVLISDHYVPPFSDVLNWKSFSVEVPVREIPNL 485

Query: 328 DTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
             IL  I P   +R QR       +R      P +  D FH IL+ L
Sbjct: 486 KRILMDISPRQYIRMQR--RGIQARRHFEVNSPPKRYDVFHMILHSL 530


>gi|297807149|ref|XP_002871458.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297317295|gb|EFH47717.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 336

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 88/303 (29%), Positives = 147/303 (48%), Gaps = 27/303 (8%)

Query: 79  SSPVRTLNPEEADWFYTPIYPTCDLTPTGLPL-PFKSPRM---MRSAIQLISSNWPYWNR 134
           +S  +  +PEEA  FY P+     +     P   +   R+   ++  I LIS+ +PYWNR
Sbjct: 41  NSRYKAASPEEATVFYIPVGIVNIIRFVYRPYTSYARDRLQNIVKDYISLISNRYPYWNR 100

Query: 135 TEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPP 194
           + GADHFF+  HD+       + + + R  +  L  A   + F     V L E +I   P
Sbjct: 101 SRGADHFFLSCHDWAPDVSAVDPE-LYRHFIRALCNANASEGFTPMRDVSLPEINI---P 156

Query: 195 YAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDI 254
           ++    +   + P +  R +  +F            GG +    R  ++E +K      +
Sbjct: 157 HSQLGFVHTGEAPQN--RKLLAFF-----------AGGSHGE-VRKILFEQWKEKDKDVL 202

Query: 255 STDH-PTT--YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
             ++ P T  Y + M +A FCLCP GW   SPR+VE++  GC+PVIIAD  VLPF+D + 
Sbjct: 203 VYEYLPKTMNYTKMMDKAKFCLCPSGWEVASPRIVESLYSGCVPVIIADSYVLPFSDVLN 262

Query: 312 WEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQIL 371
           W+   + +    +P +  IL +I  E  L  QR +    +++  +  +P++P D  H I+
Sbjct: 263 WKTFSVHIPISKMPDIKKILEAISEEEYLEMQRRVL--EVRKHFVINRPSKPYDMLHMIM 320

Query: 372 NGL 374
           + +
Sbjct: 321 HSI 323


>gi|242056743|ref|XP_002457517.1| hypothetical protein SORBIDRAFT_03g008610 [Sorghum bicolor]
 gi|241929492|gb|EES02637.1| hypothetical protein SORBIDRAFT_03g008610 [Sorghum bicolor]
          Length = 521

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 82/258 (31%), Positives = 128/258 (49%), Gaps = 13/258 (5%)

Query: 118 MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ-T 176
           +++ I LIS+ +PYWNRT+GADHFFV  HD+G  +  +    + +  +  L  A L +  
Sbjct: 267 VKNYIDLISAKYPYWNRTKGADHFFVACHDWGP-YTTKLHDELRKNTIKALCNADLSEGI 325

Query: 177 FGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYAR 236
           F +   V L E  +     +P + ++     P   R+I  +F G  +         Y+  
Sbjct: 326 FIRGKDVSLPETFLR----SPRRPLRDIGGRPAAQRTILAFFAGQMHGRVRPVLLKYW-- 379

Query: 237 GARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPV 296
           G + A    +   P           Y + M+ + +C+CP+G+   SPR+VEA+ + C+PV
Sbjct: 380 GDKDADMRIYSRLPH---RITRKRNYVQHMKSSKYCICPMGYEVNSPRIVEAIYYECVPV 436

Query: 297 IIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAML 356
           IIAD+ VLPF  A+ W    + V E DVPKL  IL +IP    +  Q  +     K  + 
Sbjct: 437 IIADNFVLPFDAALNWSAFSVVVPESDVPKLKEILLAIPESRYITLQSNVKR-VQKHFLW 495

Query: 357 FPQPAQPGDAFHQILNGL 374
            P P +  D FH IL+ +
Sbjct: 496 HPNPVK-YDIFHMILHSV 512


>gi|449518133|ref|XP_004166098.1| PREDICTED: probable glycosyltransferase At5g20260-like, partial
           [Cucumis sativus]
          Length = 362

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 88/301 (29%), Positives = 142/301 (47%), Gaps = 22/301 (7%)

Query: 79  SSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLP-FKSPRMMR---SAIQLISSNWPYWNR 134
            SP     PEEA  F+ PI     +     P+  +   R++R     ++++++ +PYWNR
Sbjct: 68  KSPFSAHEPEEAQVFFLPISIVYIVDYIYKPITTYARDRLVRIFTDYVRVVANKYPYWNR 127

Query: 135 TEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPP 194
           T GADHF V  HD+      +E+  + +  + +L  A   + F       L E  I +PP
Sbjct: 128 TRGADHFMVSCHDWAPEV-TKEDPNLFKYFIRVLCNANTSEGFNPMRDASLPE--INLPP 184

Query: 195 YAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGY-YARGARAAVWENFKNNPLFD 253
                  +  Q P +  RSI  +F G          G + + R      W++  +     
Sbjct: 185 TFHLNLPRLGQPPQN--RSILAFFAG----------GAHGFIRHILMQHWKDKDHEIQVH 232

Query: 254 ISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE 313
                   Y E + R+ FCLCP G+   SPRLVEA+  GC+PV+I+D   LPF D + W 
Sbjct: 233 EYLPPSQNYTELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVISDYYSLPFDDVLDWS 292

Query: 314 EIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNG 373
           +  M +  E +P++ TIL  +  +  L+ QR +    ++R     +PA+  D FH +L+ 
Sbjct: 293 KFSMRIPSERIPEIKTILRGVSMKKYLKLQRGVMK--VQRHFEIHRPAKAFDMFHMVLHS 350

Query: 374 L 374
           +
Sbjct: 351 V 351


>gi|449465629|ref|XP_004150530.1| PREDICTED: probable glycosyltransferase At5g20260-like [Cucumis
           sativus]
          Length = 299

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 88/301 (29%), Positives = 142/301 (47%), Gaps = 22/301 (7%)

Query: 79  SSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLP-FKSPRMMR---SAIQLISSNWPYWNR 134
            SP     PEEA  F+ PI     +     P+  +   R++R     ++++++ +PYWNR
Sbjct: 5   KSPFSAHEPEEAQVFFLPISIVYIVDYIYKPITTYARDRLVRIFTDYVRVVANKYPYWNR 64

Query: 135 TEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPP 194
           T GADHF V  HD+      +E+  + +  + +L  A   + F       L E  I +PP
Sbjct: 65  TRGADHFMVSCHDWAPEV-TKEDPNLFKYFIRVLCNANTSEGFNPMRDASLPE--INLPP 121

Query: 195 YAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGY-YARGARAAVWENFKNNPLFD 253
                  +  Q P +  RSI  +F G          G + + R      W++  +     
Sbjct: 122 TFHLNLPRLGQPPQN--RSILAFFAG----------GAHGFIRHILMQHWKDKDHEIQVH 169

Query: 254 ISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE 313
                   Y E + R+ FCLCP G+   SPRLVEA+  GC+PV+I+D   LPF D + W 
Sbjct: 170 EYLPPSQNYTELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVISDYYSLPFDDVLDWS 229

Query: 314 EIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNG 373
           +  M +  E +P++ TIL  +  +  L+ QR +    ++R     +PA+  D FH +L+ 
Sbjct: 230 KFSMRIPSERIPEIKTILRGVSMKKYLKLQRGVMK--VQRHFEIHRPAKAFDMFHMVLHS 287

Query: 374 L 374
           +
Sbjct: 288 V 288


>gi|225464075|ref|XP_002268836.1| PREDICTED: probable glycosyltransferase At5g03795-like [Vitis
           vinifera]
          Length = 417

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 103/354 (29%), Positives = 161/354 (45%), Gaps = 58/354 (16%)

Query: 39  KLKVYVY--ELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP 96
           K KV+VY    P+ Y      + PR LT  +A+E +  + +  S  RT +P++A  F+ P
Sbjct: 95  KFKVFVYPDGDPNTY-----YQTPRKLTGKYASEGYFFQNIRQSQFRTDDPDQAHLFFIP 149

Query: 97  IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQE 156
           I  +C               ++++ +Q + S +PYWNRT GADHFF+  HD G       
Sbjct: 150 I--SCHKMRGKGTSYENMTIIVQNYVQSLMSKYPYWNRTLGADHFFLTCHDVGV------ 201

Query: 157 EKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY---APPQKMQAHQIPPDTPRS 213
            +A E G+  L++ +  V        VC     +   P+   A PQ +Q   +P      
Sbjct: 202 -RATE-GVPLLVKNSIRV--------VCSPSYDVGFIPHKDVALPQVLQPFALPTGGR-- 249

Query: 214 IFVYFRGLFYDVNNDPEGGYYA-------RGARAAVWEN-----FKNNPLFDISTDHPTT 261
                     D+ N    G++A       R   A +WEN      KNN + + +T H   
Sbjct: 250 ----------DIKNRTTLGFWAGHRNSKIRVILARIWENDTELDIKNNRI-NRATGH-LV 297

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y     R  FC+CP G    S R+ +++ +GC+PVI++D   LPF D + W +  + + E
Sbjct: 298 YQNKFYRTKFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDILDWRKFSVILKE 357

Query: 322 EDVPKLDTILTSIP-PEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
            DV +L  IL  IP  E I     L+    +++   +  P    DAFH ++  L
Sbjct: 358 RDVYRLKYILKDIPDAEFIALHDNLV---KVQKHFQWNTPPIKYDAFHMVMYEL 408


>gi|226498050|ref|NP_001147269.1| exostosin-like [Zea mays]
 gi|195609318|gb|ACG26489.1| exostosin-like [Zea mays]
 gi|413926327|gb|AFW66259.1| exostosin-like protein [Zea mays]
          Length = 403

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 98/351 (27%), Positives = 161/351 (45%), Gaps = 54/351 (15%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
            KVY+Y  P   + K   + PR LT  +A+E +  + +  S  RT +P++A  F+ PI P
Sbjct: 82  FKVYIY--PDG-DPKTFYQTPRKLTGKYASEGYFFQNIRESRFRTDDPDQAHLFFVPISP 138

Query: 100 TCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKA 159
              +   G         +++  ++ + + +PYWNRT GADHFFV  HD G        +A
Sbjct: 139 H-KMRGKGTSYE-NMTVIVKDYVEGLINKYPYWNRTLGADHFFVTCHDVGV-------RA 189

Query: 160 IERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY---APPQKMQAHQIPPDTPRSIFV 216
            E G+  +++ +  V        VC    ++   P+   A PQ +Q   +          
Sbjct: 190 FE-GLKFMVKNSIRV--------VCSPSYNVDFIPHKDIALPQVLQPFALHEGGN----- 235

Query: 217 YFRGLFYDVNNDPEGGYYA-------RGARAAVWENFKNNPLFDISTDHPT------TYY 263
                  D++N    G++A       R   A VWEN   +    IS +  +       Y 
Sbjct: 236 -------DIDNRVILGFWAGHRNSKIRVILARVWEN---DTELAISNNRISRAIGELVYQ 285

Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
           +   R  FC+CP G    S R+ +++ +GC+PVI++D   LPF DA+ W +  + + E D
Sbjct: 286 KQFYRTKFCICPGGSQVNSARISDSIHYGCVPVILSDYYDLPFNDALDWRKFAVILRERD 345

Query: 324 VPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           V +L  IL SI  E  +   + L    +++  ++  P    DAFH ++  L
Sbjct: 346 VYQLKNILKSISQEEFISLHKSLVQ--VQKHFVWHSPPVSYDAFHMVMYEL 394


>gi|297740031|emb|CBI30213.3| unnamed protein product [Vitis vinifera]
          Length = 337

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 93/347 (26%), Positives = 152/347 (43%), Gaps = 40/347 (11%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP-- 96
           +LKVYVY+      +  L  +  C +       F+H+  + S  RT +PE+A  F+ P  
Sbjct: 4   QLKVYVYD----EGEPPLFHNGPCKSIYSMEGNFIHKMEMDSHFRTKDPEKAHLFFLPFS 59

Query: 97  ---------IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHD 147
                    +  + DL P          + +   + ++S+ +PYWNR+ GADHF +  HD
Sbjct: 60  VAMLVRFVYVRDSHDLGPI--------KQTVIDYVNVVSTKYPYWNRSLGADHFMLACHD 111

Query: 148 FGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIP 207
           +G    +     + +  + +L  A   + F     V   E  I +   +    +     P
Sbjct: 112 WGPETSFSI-PYLHKNSIRVLCNANTSEGFNPSKDVSFPE--INLLTGSTDSFIGG---P 165

Query: 208 PDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQ 267
             + R++  +F G  +            R      WEN   +           +YYE M+
Sbjct: 166 SPSHRTLLAFFAGGLHGP---------IRPILLEHWENKDEDVKVHKYLPKGVSYYEMMR 216

Query: 268 RAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKL 327
           ++ +CLCP G+   SPR+VEA+  GC+PV+I+D  V PF+D + W+   + V   ++P L
Sbjct: 217 KSKYCLCPSGYEVASPRVVEALYTGCVPVLISDHYVPPFSDVLNWKSFSVEVPVREIPNL 276

Query: 328 DTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
             IL  I P   +R QR       +R      P +  D FH IL+ L
Sbjct: 277 KRILMDISPRQYIRMQR--RGIQARRHFEVNSPPKRYDVFHMILHSL 321


>gi|115444695|ref|NP_001046127.1| Os02g0187200 [Oryza sativa Japonica Group]
 gi|46390046|dbj|BAD15422.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
 gi|46390077|dbj|BAD15452.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
 gi|113535658|dbj|BAF08041.1| Os02g0187200 [Oryza sativa Japonica Group]
 gi|215694561|dbj|BAG89554.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 402

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 99/350 (28%), Positives = 159/350 (45%), Gaps = 52/350 (14%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
            KVY+Y  P   + K   + PR LT  +A+E +  + +  S  RT +P++A  F+ PI P
Sbjct: 81  FKVYMY--PDG-DPKTFYQTPRKLTGKYASEGYFFQNIRESRFRTGDPDKAHLFFVPISP 137

Query: 100 TCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKA 159
              +   G         +++  ++ + + +PYWNRT GADHFFV  HD G        +A
Sbjct: 138 H-KMRGKGTSYE-NMTIIVKDYVEGLINKYPYWNRTLGADHFFVTCHDVGV-------RA 188

Query: 160 IERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY---APPQKMQAHQIPPDTPRSIFV 216
            E   LP + + ++         VC    ++   P+   A PQ +Q   +P         
Sbjct: 189 FEG--LPFMVKNSI-------RVVCSPSYNVDFIPHKDIALPQVLQPFALPEGGN----- 234

Query: 217 YFRGLFYDVNNDPEGGYYA-------RGARAAVWEN-----FKNNPLFDISTDHPTTYYE 264
                  DV N    G++A       R   A VWEN       NN +     +    Y +
Sbjct: 235 -------DVENRTILGFWAGHRNSKIRVILAQVWENDTELAISNNRISRAIGE--LVYQK 285

Query: 265 DMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDV 324
                 FC+CP G    S R+ +++ +GC+PVI++D   LPF D + W +  + + E DV
Sbjct: 286 QFYHTKFCICPGGSQVNSARISDSIHYGCVPVILSDYYDLPFNDILDWRKFAVVLKERDV 345

Query: 325 PKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
            +L +IL SI  E  +   + L    +++  ++  P  P DAFH ++  L
Sbjct: 346 YQLKSILKSISQEEFVELHKSLVQ--VQKHFVWHSPPLPYDAFHMVMYEL 393


>gi|159478873|ref|XP_001697525.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158274404|gb|EDP00187.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 597

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 109/388 (28%), Positives = 157/388 (40%), Gaps = 88/388 (22%)

Query: 42  VYVYELPSKYNKKLLLK--DPRCLTH----------------MFAAEIFMHRFLLSSPVR 83
           +YVY+L   Y  K+L     PR   H                ++A +  +H  LL SP R
Sbjct: 209 IYVYDLEPMYQSKILQYRVTPRWCVHRWHVWPANHTEWTDLWVYAMDTLLHESLLGSPHR 268

Query: 84  TLNPEEADWFYTPIYPTCDLTPTGLPLPF------KSPR------MMRSAIQLISSNWPY 131
           T +PEEAD+FY P   +C   P G    F        PR      ++   +Q I++ +P+
Sbjct: 269 TFDPEEADFFYVPHQASCLPFPIGAWADFPWFPDAGGPRTRQMLNLVIDTVQWINATFPF 328

Query: 132 WNRTEGADHFFVVPHDFGACF------------HYQEEK------------AIERGILPL 167
           W R  G DH F   HD GAC+            H+   +              +R    +
Sbjct: 329 WQRRGGRDHIFTFTHDEGACWAPNIVNNSIWLTHWGRTELNHTSNTAYLLDKYDRDTPTI 388

Query: 168 LQRATLVQTFGQRNHVCLN-EGSITIPPYAPPQKMQAHQI--PPDTPRSIFVYF------ 218
           LQ    V  F  + H C N E  + IP +  P    +  +   P   R +  +F      
Sbjct: 389 LQPDGFVHLF--KGHPCYNPEKDLVIPAFKAPGHYASSGLVGAPTRERDLLFFFSPPVSA 446

Query: 219 -----------RGLFYDVNNDPEGGYYARGARAAVWENFK------NNPLFDISTDHPTT 261
                      RG   DV    +   Y+RG R A++   K       +  +    D    
Sbjct: 447 PGAASCPHASTRG---DVGKRRQPN-YSRGVRQAIYRAAKAGDWAAKHKFYIGGHDDVKG 502

Query: 262 YYEDM-QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
            Y DM  RA FCL   G   WS R+ +AV+ GCIPVIIAD +   F   +  +  G+ + 
Sbjct: 503 EYSDMLSRAKFCLVAPG-DGWSARMEDAVLHGCIPVIIADGVHAVFESILDIDGFGLRIP 561

Query: 321 EEDVPKLDTILTSIPPEVILRKQRLLAN 348
           +E VP++  IL ++PP  I  KQ  L  
Sbjct: 562 QEQVPRILDILLAVPPRAIRSKQAHLGR 589


>gi|225429942|ref|XP_002281263.1| PREDICTED: probable glycosyltransferase At5g03795 [Vitis vinifera]
          Length = 675

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 106/396 (26%), Positives = 173/396 (43%), Gaps = 51/396 (12%)

Query: 2   WVTAI--ALLLAFVSADNTPKIERISGSAGDVLEDDPVGK---------LKVYVYELPSK 50
           W +A+   LL A    +N P I+   G    +  +  V K         LKVY Y    +
Sbjct: 291 WSSAVDKELLYAKSQIENAPIIKNDPGLHASLYRNVSVFKRSYELMENTLKVYTY----R 346

Query: 51  YNKKLLLKDPRCLTHMFAAEI-FMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLP 109
             ++ +   P  +  ++A+E  FM     +    T N  +A  FY P   +  +    L 
Sbjct: 347 EGERPVFHQP-PIKGIYASEGWFMKLMQANKKFVTKNGRKAHLFYLPF--SSLMLEEALY 403

Query: 110 LPFKSPRM-----MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGI 164
           +P    R      +++ + +I + +P+WNRT GADHF V  HD+       E   +    
Sbjct: 404 VPNSHSRKNLEQYLKNYLDMIGAKYPFWNRTGGADHFLVACHDWAPS----ETLKLMANS 459

Query: 165 LPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYD 224
           +  L  + + + F     V L E  + IP   P +++      P + R I  +F G  + 
Sbjct: 460 IRALCNSDIREGFKLGKDVSLPETCVRIPQ-NPLRQLGG---KPPSQRRILAFFAGSMHG 515

Query: 225 VNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPT------TYYEDMQRAIFCLCPLGW 278
                    Y R      WEN   +P   I    P        Y + M+ + +C+C  G+
Sbjct: 516 ---------YVRPILLKYWEN--KDPDMKIYGRMPKAKKGTMNYIQHMKSSKYCICAKGY 564

Query: 279 APWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEV 338
              SPR+VEA+ + C+PVII+D+ V PF   + WE   +F+ E+D+P L +IL SIP + 
Sbjct: 565 EVNSPRVVEAIFYECVPVIISDNFVPPFFGVLNWESFAVFILEKDIPNLKSILLSIPEKS 624

Query: 339 ILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
            L  Q  +    +++  L+       D FH IL+ +
Sbjct: 625 YLEIQMRVKQ--VQQHFLWHAKPVKYDVFHMILHSV 658


>gi|356505300|ref|XP_003521429.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
           max]
          Length = 534

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 93/353 (26%), Positives = 150/353 (42%), Gaps = 52/353 (14%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP-- 96
           + KV+VYE      +  +  +  C +       F+H   ++   RT +PE+A  F+ P  
Sbjct: 204 QFKVFVYE----EGEPPVFHNGPCKSIYSMEGNFIHAIEMNDQFRTRDPEKAHVFFLPFS 259

Query: 97  ---------IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHD 147
                    +  + D  P          + +   + +I+  +PYWNR+ GADHF++  HD
Sbjct: 260 VAMLVQFVYVRDSHDFGPI--------KKTVTDYVNVIAGRYPYWNRSLGADHFYLACHD 311

Query: 148 FGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQI- 206
           +G           E  I  +L  A   + F     V             P   +Q   I 
Sbjct: 312 WGPETSRSIPNLNENSIR-VLCNANTSEGFKPSKDVSF-----------PEINLQTGSIN 359

Query: 207 -----PPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
                P  + R +  +F G  +            R      WEN   +           +
Sbjct: 360 GFIGGPSASGRPLLAFFAGGLHGP---------IRPVLLEHWENRDEDIQVHKYLPKGVS 410

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           YYE ++++ FCLCP G+   SPR+VEA+  GC+PV+I+D  V PF D + W+   + V+ 
Sbjct: 411 YYEMLRKSRFCLCPSGYEVASPRVVEAIYTGCVPVLISDHYVPPFNDVLNWKSFSVEVSV 470

Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           +D+P+L  IL SI P   +R QR +    ++R      P +  D FH IL+ +
Sbjct: 471 KDIPRLKEILLSISPRHYIRMQRRVG--LVRRHFEVHSPPKRYDVFHMILHSV 521


>gi|358343604|ref|XP_003635890.1| Exostosin-like protein [Medicago truncatula]
 gi|358344094|ref|XP_003636128.1| Exostosin-like protein [Medicago truncatula]
 gi|355501825|gb|AES83028.1| Exostosin-like protein [Medicago truncatula]
 gi|355502063|gb|AES83266.1| Exostosin-like protein [Medicago truncatula]
          Length = 435

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 101/346 (29%), Positives = 156/346 (45%), Gaps = 58/346 (16%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
           K KVY+Y  P   + K   + PR LT  +A+E +  + +  S  RTL+P++A  F+ PI 
Sbjct: 129 KFKVYIY--PDG-DSKTFYQTPRKLTGKYASEGYFFQNIRESRFRTLDPDQAHLFFIPI- 184

Query: 99  PTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
            +C               ++++ ++ + S +PYWNRT GADHFFV  HD G        +
Sbjct: 185 -SCHKMRGKGTSYDNMTIIVQNYVESLISKYPYWNRTLGADHFFVTCHDVGV-------R 236

Query: 159 AIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY---APPQKMQAHQIPPDTPRSIF 215
           A E   LPLL + ++         VC     +   P+   A PQ +Q   +P        
Sbjct: 237 ATEG--LPLLVKNSI-------RAVCSPSYDVGFIPHKDVALPQVLQPFALPAGGN---- 283

Query: 216 VYFRGLFYDVNNDPEGGYYA-------RGARAAVWENFKNNPLFDISTDHPTTYYEDMQR 268
                   DV N    G++A       R   A VWEN   +   DIS +   T       
Sbjct: 284 --------DVENRTSLGFWAGHRNSKIRVILARVWEN---DTELDISNNRIYTK------ 326

Query: 269 AIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLD 328
             FC+CP G    S R+ +++ +GCIPVI+++   LPF D + W +  +   E DV +L 
Sbjct: 327 --FCICPGGSQVNSARIADSIHYGCIPVILSNYYDLPFNDILDWRKFAVVHNESDVYQLK 384

Query: 329 TILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
            IL +   +  +     L N  +++   +  P    DAFH ++  L
Sbjct: 385 QILKNKSQDEFI----ALHNNLIQKHFQWNSPPVRYDAFHMVMYDL 426


>gi|307107123|gb|EFN55367.1| hypothetical protein CHLNCDRAFT_52572 [Chlorella variabilis]
          Length = 815

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 111/424 (26%), Positives = 178/424 (41%), Gaps = 107/424 (25%)

Query: 42  VYVYELPSKYNKKLL---LKDPRCLTHMFA--------------AEIFMHRFLLSSPVRT 84
           +YVYELP  +N+ LL   +    C+  +F               AE  +H  LL S  RT
Sbjct: 375 IYVYELPPIFNQVLLQYRVDHGSCVHRLFTDGNGTNWEDSGGYLAETGLHEALLQSKHRT 434

Query: 85  LNPEEADWFYTPIYPTCDLTPT-------------GLPLPFKSPRMMRSAIQLISSNWPY 131
           L+PEEAD+FY P+Y +C + P               +P    +  M+      + ++ PY
Sbjct: 435 LDPEEADYFYIPVYSSCYMYPIHGFADTPFFHAFHKIPRVHATTNMLIEVYHWLRAHHPY 494

Query: 132 WNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQR-ATLVQTFGQ----------- 179
           W+R+ G DH  +  HD G+C+            LP + R AT++  +G+           
Sbjct: 495 WDRSGGRDHIILQSHDEGSCW------------LPAVLRPATMLTHWGRMDLGHTSSTGY 542

Query: 180 ---------RNHVCLNEGS---------------ITIPPYAPPQKMQAHQIPP--DTPRS 213
                    R+ + + EG+               + +PP   P K +   +       R+
Sbjct: 543 IDDVYSRPARHPIYMPEGTEGKLGDFPCYDPAKDLVVPPMTSPLKYELSPLVGAFTRNRT 602

Query: 214 IFVYFRGLFYDVNNDPEGGYYARGARAAV---------WENFK-----NNPLFDISTDHP 259
              +F+G     NN P    Y+RG R  +         W  FK      NP      D  
Sbjct: 603 TLAFFKGRTQQ-NNKP----YSRGIRQTLENLCRDKDWWGKFKIWIGEGNP-----PDMD 652

Query: 260 TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV 319
            TY + +  + FC    G   +SPR  +AV  GC+PVII D++ L F   I + +  + +
Sbjct: 653 RTYSQLLASSTFCFVLPG-DGFSPRFEDAVQHGCLPVIIQDEVHLAFESIIDYRKFVVRI 711

Query: 320 AEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
            ++D+ ++  IL +IPPE +   Q+ LA    K +    +P    +    +L G  RK+P
Sbjct: 712 QQKDMERVPEILGAIPPEKVQTMQKALATVWRKWSYTGYRPY--ANVTLDLLEGYRRKVP 769

Query: 380 HDKS 383
            D+ 
Sbjct: 770 EDQQ 773


>gi|224142559|ref|XP_002324623.1| predicted protein [Populus trichocarpa]
 gi|222866057|gb|EEF03188.1| predicted protein [Populus trichocarpa]
          Length = 337

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 84/302 (27%), Positives = 134/302 (44%), Gaps = 24/302 (7%)

Query: 80  SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLP----FKSPRMMRSA---IQLISSNWPYW 132
           SP R  +P+EA  F+ P+     +    LP+     +   R+ R     + +++  +PYW
Sbjct: 42  SPFRAQDPDEAHVFFLPVSVASIVHFIYLPITAAADYSRDRLRRVVTDYVHIVAKKYPYW 101

Query: 133 NRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITI 192
           NR+ GADHF V  HD+         +   + I  +L  A +   F     V L E  +  
Sbjct: 102 NRSNGADHFMVSCHDWAPDVSIANSELFNKFIR-VLCNANISIGFRPPRDVLLPEIYL-- 158

Query: 193 PPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLF 252
            P++        Q P + P  I  +F G             Y R      W+N  N    
Sbjct: 159 -PFSGLGTTHMGQAPNNRP--ILAFFEG---------RAHGYIRQVLFKHWKNKDNEVQV 206

Query: 253 DISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 312
                    Y   M ++ FCLCP G+   SPR+VEA+  GC+PVII+++  LPF+D + W
Sbjct: 207 HELLPKGKNYTRLMGQSKFCLCPSGFEVASPRVVEAIYQGCVPVIISNNYSLPFSDVLNW 266

Query: 313 EEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILN 372
            +  + +  E +P++  IL  I     LR    +    ++R  +  +PA+P D  H +L+
Sbjct: 267 SQFSVQIPVEKIPEIKMILQRISNSKYLRMHERVKR--VQRHFVLNRPAKPFDVIHMVLH 324

Query: 373 GL 374
            L
Sbjct: 325 SL 326


>gi|356548516|ref|XP_003542647.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
           [Glycine max]
          Length = 340

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 84/301 (27%), Positives = 136/301 (45%), Gaps = 23/301 (7%)

Query: 80  SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPR------MMRSAIQLISSNWPYWN 133
           S  R  NP +A  F  P +   ++         + P       ++   I++I+  +PYWN
Sbjct: 44  SHFRARNPNQAHVFLIP-FSIVNIVQYVYNRNLRQPGSQSIQLLVEDYIRVIAHKYPYWN 102

Query: 134 RTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIP 193
           RTEGADHF +  HD+G    Y   K   +  + +L  A   + F     V + E  + + 
Sbjct: 103 RTEGADHFLLSCHDWGPTISYANPKLF-KNFIRVLCNANTSEGFRPNKDVSIPE--VNLL 159

Query: 194 PYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFD 253
           P          Q P D  R+I  +F G  +            R      W++  N+    
Sbjct: 160 PRGTLGSPNRGQHPND--RTILAFFAGREHGA---------IRTILLNHWKDKDNDVQIY 208

Query: 254 ISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE 313
            S      Y + M ++ FCLCP G+   SPR+VEA+  GC+PV+I+     PF D + W 
Sbjct: 209 ESLPKGKVYTKLMGQSKFCLCPSGYEVASPRVVEAIYAGCVPVLISSSYSPPFTDVLNWS 268

Query: 314 EIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNG 373
           +  + +  E +P++ TIL S+ P+  L+ Q  +    ++R     +PA+P D  H IL+ 
Sbjct: 269 QFSVEIPVEKIPEIKTILQSVSPKKYLKLQMNVL--RVQRHFTINRPAKPFDLMHMILHS 326

Query: 374 L 374
           +
Sbjct: 327 I 327


>gi|147805375|emb|CAN76361.1| hypothetical protein VITISV_035438 [Vitis vinifera]
          Length = 459

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 86/312 (27%), Positives = 138/312 (44%), Gaps = 35/312 (11%)

Query: 80  SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSP---------RMMRSAIQLISSNWP 130
           SP    +P+EA  F+ P+  +       L LP  SP         R++   +++++  +P
Sbjct: 155 SPFIAKHPDEAHAFFLPL--SVVKVVQFLYLPITSPEDYSRKRLQRVVTDYVKVVADKYP 212

Query: 131 YWNRTEGADHFFVVPHDFGACFHYQ--------EEKAIERGILPLLQRATLVQTFGQRNH 182
           YWNR+ GADHF V  HD+ +    +            + +  + +L  A   + F     
Sbjct: 213 YWNRSGGADHFMVSCHDWVSSSSLEPMAPSVSYANPELFKNFIRVLCNANSSEGFRPGRD 272

Query: 183 VCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAV 242
           V L E ++      PP   Q     P   R I  +F G  +            R      
Sbjct: 273 VSLPEVNLPAGELGPPHLGQ-----PSNNRPILAFFAGRAHG---------NIRKILFEH 318

Query: 243 WENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 302
           W++  N  L          Y + M ++ FCLCP G+   SPR+VEA+  GC+PVII++  
Sbjct: 319 WKDQDNEVLVHERLHKGQNYAKLMGQSKFCLCPSGYEVASPRVVEAIHAGCVPVIISNXY 378

Query: 303 VLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQ 362
            LPF D + W +  + + E  +P++ TIL  I     L+ Q  +    ++R  +  +PA+
Sbjct: 379 SLPFNDVLDWSQFSIQIPEAKIPEIKTILLGISKNKYLKMQERVLR--VRRHFVLNRPAR 436

Query: 363 PGDAFHQILNGL 374
           P D  H IL+ L
Sbjct: 437 PFDIIHMILHSL 448


>gi|302852375|ref|XP_002957708.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
 gi|300257002|gb|EFJ41257.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
          Length = 600

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 112/384 (29%), Positives = 170/384 (44%), Gaps = 60/384 (15%)

Query: 36  PVGKLKVYVYELP------SKYNKKLLLKDPRCLTHMFAA-EIFMHRFLLSSPVRTLNPE 88
           P  KLK+Y+Y+LP       +YN     +D      M+AA E FM  FL    VRT NP 
Sbjct: 228 PRSKLKIYMYDLPWEVAFPYEYNDGHFGRDK-----MYAAYEYFMTYFLQDHAVRTENPY 282

Query: 89  EADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDF 148
           EA+ FY P+     +     P+P      +  A+  + + WP++NRT G DHF+ +  D 
Sbjct: 283 EANLFYIPMLAYFYIANVRNPVP-----QVTLALDYVRTKWPFYNRTGGRDHFYFLTGDR 337

Query: 149 GACF--HYQEEKAIERGILPLLQRATLVQTFGQRNHVCLN-EGSITIPPYA-----PPQK 200
           GAC    + ++  I+     L            R + C+  +  + IPP        P +
Sbjct: 338 GACSTPRWLQDSCIKLVHFGLQGEELPGTGVPNREYGCVQVKRDLVIPPINLFTDLVPSE 397

Query: 201 MQAH------QIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDI 254
            QA+      +   D+ R +  +F G    V   PE   Y+ G R A+        L   
Sbjct: 398 TQAYYKWLVSKKGYDSNRKLLFFFAG---GVGQVPE---YSGGVRQAI------KGLLSS 445

Query: 255 STDHPT----------TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL 304
            T  P            Y E +Q + FC+ P G+  W  RL++A+ +GCIP+II D +  
Sbjct: 446 LTPKPEDVEFFEGRVHNYKELLQSSKFCIAPYGFG-WGLRLIQAIEYGCIPLIIQDHVYQ 504

Query: 305 PF---ADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPA 361
           PF    D +P+EE  + +   D+P +  +L S   E  L + R L      +A ++ +  
Sbjct: 505 PFERPKDFLPYEEFSVRMGLVDIPYMIELLRSY-TEAQLAQLR-LGMAKYYQAFIWNR-E 561

Query: 362 QPGDAFHQILNGLARKLPHDKSTY 385
             G A+   L GL R+L H  S Y
Sbjct: 562 YGGLAYEWTLAGLERRLAHMNSEY 585


>gi|356562054|ref|XP_003549290.1| PREDICTED: probable glycosyltransferase At5g25310-like [Glycine
           max]
          Length = 482

 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 92/302 (30%), Positives = 137/302 (45%), Gaps = 30/302 (9%)

Query: 83  RTLNPEEADWFYTPIYPTCDLTPTGLPLPFKS---PRMMRSAIQLISSNWPYWNRTEGAD 139
           RT +P  A  F+ P   T  +     PL F      + +   ++++S+  P+WN T GAD
Sbjct: 191 RTNDPNAAHVFFLPFSVTWMVKYLYTPLSFNVTPLKKFVSDYVRVVSTRHPFWNITHGAD 250

Query: 140 HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQ 199
           HF +  HD+G     Q    +    + +L  A   + F  R  V L E    I  Y    
Sbjct: 251 HFMLACHDWGP-HASQGNPFLYNTSIRVLCNANTSEGFNPRKDVSLPE----IHLYGGEV 305

Query: 200 KMQAHQIPPDT-PRSIFVYFRGLFYDVNNDPEGGYYA--RGARAAVWENFKNNPLFDIST 256
             +    PPDT PR    +F            GG +   R A    W+N   N +  +  
Sbjct: 306 SPKLLSPPPDTAPRRYLAFF-----------SGGLHGPIRPALLGHWKNHDENDVIRVYE 354

Query: 257 DHPTT--YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEE 314
             P    YY  M  + FCLCP G    SPR+VEA+   C+PVI+++  VLPF+D + WE 
Sbjct: 355 YLPKDLDYYSFMLTSKFCLCPSGHEVASPRIVEAIYAECVPVILSEYYVLPFSDVLQWEA 414

Query: 315 IGMFVAEEDVPKLDTILTSIPPEVILRKQRLLAN--PSMKRAMLFPQPAQPGDAFHQILN 372
             + V   D+P+L  IL++I  +    K R L     +++R     +PA+  D FH IL+
Sbjct: 415 FSVQVDVSDIPRLKEILSAISED----KYRKLKEGVKAVRRHFTLNRPAKRFDVFHMILH 470

Query: 373 GL 374
            +
Sbjct: 471 SI 472


>gi|414881250|tpg|DAA58381.1| TPA: hypothetical protein ZEAMMB73_758303 [Zea mays]
          Length = 474

 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 95/342 (27%), Positives = 158/342 (46%), Gaps = 25/342 (7%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTL-NPEEADWFYTPIY 98
           LKVY+Y+      ++ +   P  L+ ++A+E +  + L  S    + +  +A  FY P Y
Sbjct: 137 LKVYIYQ----DGRRPIFHTP-PLSGIYASEGWFMKLLKESRRHVVADAGKAHLFYLP-Y 190

Query: 99  PTCDLTPTGLPLPFKSPR----MMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHY 154
            +  L  T       + R     +R+ ++ ++S +P+WNRT GADHF V  HD+G     
Sbjct: 191 SSQQLRLTLYEAGSHNLRPLAAYLRNFVRGLASKYPFWNRTRGADHFLVACHDWGPYTTT 250

Query: 155 QEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSI 214
                 +  I  L    +    F     V L E +I      P + ++     P + RSI
Sbjct: 251 AHRDLRKNAIKALCNADSSEGIFTPGKDVSLPETTIR----TPRRPLRYVGGLPVSRRSI 306

Query: 215 FVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLC 274
             +F G  +         ++  G      ++ +   L          Y + M+ + FCLC
Sbjct: 307 LAFFAGNVHGRVRPVLLRHWGNGQD----DDMRVYSLLPSRVSRRMNYIQHMKNSRFCLC 362

Query: 275 PLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSI 334
           P+G+   SPR+VEA+ + C+PVIIAD+ VLPF++ + W    + +AE+D+P L  IL  I
Sbjct: 363 PMGYEVNSPRIVEALYYECVPVIIADNFVLPFSEVLDWSAFSVVIAEKDIPDLKKILKGI 422

Query: 335 PPEVILRKQRLLANPSMKRAMLFPQPAQP--GDAFHQILNGL 374
                LR+   + +   +    F   A+P   D FH IL+ +
Sbjct: 423 S----LRRYVAMHDSVKRLQRHFLWHARPIKYDLFHMILHSI 460


>gi|356548353|ref|XP_003542567.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
           max]
          Length = 505

 Score =  118 bits (296), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 94/352 (26%), Positives = 155/352 (44%), Gaps = 50/352 (14%)

Query: 39  KLKVYVYE---LPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYT 95
           + KV+VYE   LP       +  +  C +       F+H   ++   RT +P++A  F+ 
Sbjct: 175 QFKVFVYEEGELP-------VFHEGPCASIYSTEGSFIHAIEMNEHFRTRDPKKAHVFFL 227

Query: 96  P-----------IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVV 144
           P           I  + D  P          R +R  I +I++ +PYWNR+ GADHF + 
Sbjct: 228 PFSVVMMVRYVYIRDSHDFGPI--------KRTVRDYINVIAARYPYWNRSLGADHFMLS 279

Query: 145 PHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAH 204
            HD+G     +    + +  + +L  A   + F  R  V   E ++   P          
Sbjct: 280 CHDWGPEAS-KFSPYLRKNSIRVLCNANTSEGFDPRKDVSFPEINLQRGPIDGLLGG--- 335

Query: 205 QIPPDTPRSIFVYFRGLFYDVNNDPEGGYYA--RGARAAVWENFKNNPLFDISTDHPTTY 262
             P  + RSI  +F            GG +   R      WE    +           +Y
Sbjct: 336 --PSASQRSILAFF-----------AGGIHGPIRPILLEHWEKKDEDIQVHQYLPKGVSY 382

Query: 263 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
           Y  ++++ FCLCP G+   SPR+VEA+  GC+PV+I+D  V PF+D + W+   + V+ +
Sbjct: 383 YGMLRKSKFCLCPSGYEVASPRVVEAIYTGCVPVLISDHYVPPFSDVLNWKMFSVEVSMK 442

Query: 323 DVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           ++P L  IL +I P   +R Q+ +    ++R      P +  D FH IL+ +
Sbjct: 443 EIPNLKDILMNISPRKYIRMQKRVRQ--IRRHFEVHSPPKRYDVFHMILHSV 492


>gi|302852377|ref|XP_002957709.1| hypothetical protein VOLCADRAFT_119761 [Volvox carteri f.
           nagariensis]
 gi|300257003|gb|EFJ41258.1| hypothetical protein VOLCADRAFT_119761 [Volvox carteri f.
           nagariensis]
          Length = 1481

 Score =  118 bits (296), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 102/377 (27%), Positives = 159/377 (42%), Gaps = 84/377 (22%)

Query: 42  VYVYELPSKYNKKLL---LKDPRCLTH---------------MFAAEIFMHRFLLSSPVR 83
           +YVY+L   Y  K+L   +  P C+                 ++AA+  +H  LL S  R
Sbjct: 307 IYVYDLEPLYQSKILQYRISPPWCVHRRHDLPGNQTVWSDGWVYAADTLLHELLLISEHR 366

Query: 84  TLNPEEADWFYTPIYPTCDLTPTGLPLPFKS------------PR------MMRSAIQLI 125
           T +PEEAD+FY P   +C      LP P  S            PR      M+R  +  I
Sbjct: 367 TFDPEEADFFYVPHSASC------LPFPMGSWADYPWFLGPGGPRIRQMVNMLREVVDWI 420

Query: 126 SSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGI-LPLLQRATLVQTFGQ----- 179
              +P+W R  G DH ++  HD GAC+     K +E    L    R  L    G      
Sbjct: 421 DKTYPFWRRRGGRDHIWLFTHDEGACW---APKVLENSTWLTHWGRMGLEHRSGTAFLAD 477

Query: 180 -------------------RNHVCLNE-GSITIPPYAPPQKMQAHQI--PPDTPRSIFVY 217
                              + H C +    + +P +  P+  ++  +       R IF++
Sbjct: 478 KYDIDFVSPHQPEGFLTHIKGHPCYDSTKDLVVPAFKQPRHYRSSPLLGSATKQRDIFLF 537

Query: 218 FRGLFYDVNNDPEGGYYARGARAAVWE-----NFKNNPLFDISTDHPTTYYED-MQRAIF 271
           FRG   DV       +Y+RG R  +++     N+K+  +    T      Y D + R+ F
Sbjct: 538 FRG---DVGKH-RMAHYSRGVRQKLYKLSVENNWKSKNVLIGGTHEVRGEYSDLLSRSQF 593

Query: 272 CLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTIL 331
           CL   G   WS RL +AV+ GCIPVI+ D++ + F   +  +   + + E+ +P++  IL
Sbjct: 594 CLVAAG-DGWSARLEDAVLHGCIPVIVIDEVHVVFESILNVDSFAVRIDEQQLPQILDIL 652

Query: 332 TSIPPEVILRKQRLLAN 348
            +IP   I  KQ  L +
Sbjct: 653 AAIPERKIRAKQAHLGH 669



 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 87/293 (29%), Positives = 129/293 (44%), Gaps = 36/293 (12%)

Query: 67   FAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTP------------TGLPLPFKS 114
            + AE  +H +LL S  RT +PEEAD+FY P Y TC + P            TG P   + 
Sbjct: 1043 YGAESALHEYLLLSEHRTFDPEEADFFYVPYYGTCMIWPVLHWADFPYFHTTGGPRILQV 1102

Query: 115  PRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLV 174
              M+   +  I+  +P+W R  G DH F+ PHD GAC+              +L  AT +
Sbjct: 1103 INMLIDTVDWINKMYPFWGRRGGRDHIFLFPHDEGACW-----------APNVLVNATWL 1151

Query: 175  QTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYY 234
              +G+ + +  ++ S     Y          +    P       RG       DP   Y 
Sbjct: 1152 THWGRTDMIHESKTSFDADNYT------RDYVGWRQPGGFVNLIRG---HPCYDPVKIY- 1201

Query: 235  ARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCI 294
             R A+   W++ K+N L   + D P  Y + + R++FCL   G   WS R  +AV+ GCI
Sbjct: 1202 -RLAKENNWQD-KHNILIGDAADVPGDYSDLLSRSLFCLVATG-DGWSARTEDAVLHGCI 1258

Query: 295  PVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLA 347
            PVII D + + F      +E  + + E +  ++  IL  IP   I   Q  L 
Sbjct: 1259 PVIIIDGVHIKFETVFSVDEFSIRIPEANASRILEILKEIPKTKIRSIQAHLG 1311


>gi|15239475|ref|NP_198516.1| Exostosin family protein [Arabidopsis thaliana]
 gi|332006752|gb|AED94135.1| Exostosin family protein [Arabidopsis thaliana]
          Length = 547

 Score =  118 bits (296), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 92/308 (29%), Positives = 146/308 (47%), Gaps = 33/308 (10%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTL-NPEEADWFYTPI 97
           KLK+YVY    K   K +   P     ++A+E +  + + S+    + +P +A  FY PI
Sbjct: 239 KLKIYVY----KEGGKPIFHTP-MPRGIYASEGWFMKLMESNKKFVVKDPRKAHLFYIPI 293

Query: 98  YPTCDLTPTGLPLPFKSPRMM----RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFH 153
             +     + L L F++P+ +    +  + LI+  + +WNRT GADHF V  HD+G    
Sbjct: 294 --SIKALRSSLGLDFQTPKSLADHLKEYVDLIAGKYKFWNRTGGADHFLVACHDWGNKLT 351

Query: 154 YQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRS 213
            +  K   R     L  + + Q F       L    I     AP + +        + R 
Sbjct: 352 TKTMKNSVRS----LCNSNVAQGFRIGTDTALPVTYIR-SSEAPLEYLGG---KTSSERK 403

Query: 214 IFVYFRGLFYDVNNDPEGGYYARGARAAVWENFK-NNPLFDISTDHPTT---YYEDMQRA 269
           I  +F G  +          Y R     +WEN + +  +F      P +   Y E M+ +
Sbjct: 404 ILAFFAGSMHG---------YLRPILVKLWENKEPDMKIFGPMPRDPKSKKQYREYMKSS 454

Query: 270 IFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDT 329
            +C+C  G+   +PR+VEA++  C+PVIIAD+ V PF + + WEE  +FV E+D+P L  
Sbjct: 455 RYCICARGYEVHTPRVVEAIINECVPVIIADNYVPPFFEVLNWEEFAVFVEEKDIPNLRN 514

Query: 330 ILTSIPPE 337
           IL SIP +
Sbjct: 515 ILLSIPED 522


>gi|255548866|ref|XP_002515489.1| catalytic, putative [Ricinus communis]
 gi|223545433|gb|EEF46938.1| catalytic, putative [Ricinus communis]
          Length = 481

 Score =  118 bits (296), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 87/303 (28%), Positives = 149/303 (49%), Gaps = 34/303 (11%)

Query: 83  RTLNPEEADWFYTPIYPTCDLTPTGLPLPF-KSP--RMMRSAIQLISSNWPYWNRTEGAD 139
           RT +P+ A  ++ P   T  +     PL +  SP  + +   ++++S+ +P+WNRT GAD
Sbjct: 192 RTRDPKRAHVYFMPFSVTWMVKYLYKPLTYDHSPLRQFVADYVRVLSTKYPFWNRTHGAD 251

Query: 140 HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSI---TIPP-- 194
           HF +  HD+G     + +  +    + +L  A   + F  R  V L E  +    +PP  
Sbjct: 252 HFMLACHDWGP-HASRGDHLLYNTSIRVLCNANTSEGFNPRKDVSLPEIHLYGGNVPPQL 310

Query: 195 YAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYA--RGARAAVWENFKNN-PL 251
            +PP        P +T R    +F            GG +   R      W++ +++  +
Sbjct: 311 LSPP--------PANTTRPHLAFF-----------AGGLHGPIRPLLLKHWKDRESDLRV 351

Query: 252 FDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
           F+    H   YY  M R+ FCLCP G    SPR+VE++   C+PVI++D  VLPF+D + 
Sbjct: 352 FEYLPKH-LDYYSFMLRSKFCLCPSGHEVASPRIVESIYAECVPVILSDHYVLPFSDVLR 410

Query: 312 WEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQIL 371
           W+   + +   ++P+L+ +L S+P E   R +  L   +++   +  QPA+  D FH IL
Sbjct: 411 WDAFSIQLNVSEIPRLEEVLRSVPEEKYERLKEGLR--TVRTHFMLNQPAKRFDVFHMIL 468

Query: 372 NGL 374
           + +
Sbjct: 469 HSI 471


>gi|115470965|ref|NP_001059081.1| Os07g0188700 [Oryza sativa Japonica Group]
 gi|34394611|dbj|BAC83913.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
 gi|50508944|dbj|BAD31848.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
 gi|113610617|dbj|BAF20995.1| Os07g0188700 [Oryza sativa Japonica Group]
 gi|125599387|gb|EAZ38963.1| hypothetical protein OsJ_23384 [Oryza sativa Japonica Group]
 gi|215707162|dbj|BAG93622.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 606

 Score =  118 bits (296), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 96/353 (27%), Positives = 158/353 (44%), Gaps = 50/353 (14%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP--- 96
            KV+VYE      +  +  D  C +       F++   + + +RT +P++A  F+ P   
Sbjct: 272 FKVFVYE----EGEPPVFHDGPCRSIYSTEGRFIYAMEMENRMRTRDPDQAHVFFLPFSV 327

Query: 97  ------IYP--TCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDF 148
                 IY   + D+ P          R +   I ++S+ +P+WNR+ GADHF +  HD+
Sbjct: 328 VKMVKMIYEPNSHDMDPL--------RRTISDYINVVSTKYPHWNRSLGADHFMLSCHDW 379

Query: 149 GACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPP 208
           G              I  +L  A   + F     V L E  I +      +++     P 
Sbjct: 380 GPYVSSANGHLFSNSIR-VLCNANTSEGFDPSRDVSLPE--INLRSDVVDRQVGG---PS 433

Query: 209 DTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTD------HPTTY 262
            + R I  +F G     ++ P         R  + +++      DI         H  +Y
Sbjct: 434 ASHRPILAFFAG----GDHGP--------VRPLLLQHWGKGQDADIQVSEYLPRRHGMSY 481

Query: 263 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
            + M+R+ FCLCP G+   SPR+VEA+   C+PV+I DD  LPFAD + W    + VA  
Sbjct: 482 TDMMRRSRFCLCPSGYEVASPRVVEAIYLECVPVVIGDDYTLPFADVLNWAAFSVRVAVG 541

Query: 323 DVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPA-QPGDAFHQILNGL 374
           D+P+L  IL ++ P   +R QR +   +++R  +    A +  D FH IL+ +
Sbjct: 542 DIPRLKEILAAVSPRQYIRMQRRVR--AVRRHFMVSDGAPRRFDVFHMILHSI 592


>gi|356565159|ref|XP_003550812.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
           At5g20260-like [Glycine max]
          Length = 488

 Score =  118 bits (296), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 80/300 (26%), Positives = 137/300 (45%), Gaps = 35/300 (11%)

Query: 87  PEEADWFYTPIYPTCDLTPTGLPLP-FKSPRMMRSAI---QLISSNWPYWNRTEGADHFF 142
           P+EA  F  PI  T  +     PL  +   ++MR  +    +I+  +PYWNRT+GADHF 
Sbjct: 201 PDEAHVFMLPISVTQIVRYVYNPLTTYSRDQLMRITVDYTNIIAHRYPYWNRTKGADHFL 260

Query: 143 VVPHDFGACFHYQEE-KAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
              HD+      +E  + + + I+ +L  A   + F     V +           P   +
Sbjct: 261 ASCHDWAPDISREESGRELFKNIIRVLCNANTSEGFKPEKDVPM-----------PEMNL 309

Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAV-------WENFKNNPLFDI 254
           Q  ++    P           +D+NN     ++A GA   +       W++         
Sbjct: 310 QGFKLSSPIPG----------FDLNNRSILAFFAGGAHGRIRKILLEHWKDKDEEVQVHE 359

Query: 255 STDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEE 314
                  Y   M ++ FCLCP G+   SPR+VE++  GC+PVI++D   LPF+D + W +
Sbjct: 360 YLPKGVDYQGLMGQSKFCLCPSGYEVASPRIVESINIGCVPVIVSDYYQLPFSDVLDWSK 419

Query: 315 IGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
             + +    + ++ TIL ++P    L+ Q+ +    ++R     +PA+P D FH IL+ +
Sbjct: 420 FSLHIPSRRIAEIKTILKNVPHAKYLKLQKRVM--KVQRHFELNRPAKPFDVFHMILHSI 477


>gi|356565155|ref|XP_003550810.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
           [Glycine max]
          Length = 496

 Score =  118 bits (296), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 87/308 (28%), Positives = 135/308 (43%), Gaps = 36/308 (11%)

Query: 80  SPVRTLNPEEADWF-------------YTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLIS 126
           S  R  +PEEA  F             Y PI    D  P  L L      ++   I +I 
Sbjct: 200 SQFRARHPEEAHVFFLPFSIANVVHYVYKPILKQSDYEPVRLQL------LVEDYISVIE 253

Query: 127 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLN 186
             +PYWNR++GADHF +  HD+         +  +  I  L   A   + F     V + 
Sbjct: 254 DKYPYWNRSKGADHFLLSCHDWAPKVSNGNPELFQSFIRALCN-ANTSEGFHPNRDVSIP 312

Query: 187 EGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENF 246
           E  + +    PP   Q     P++ R+I  +F G  +            R      W++ 
Sbjct: 313 EVYLPVGKLGPPSLGQH----PNS-RTILAFFAGGVHG---------EIRKILLKHWKDK 358

Query: 247 KNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 306
            N             Y + M ++ FCLCP G    SPR+VEA+  GC+PVII D+  LPF
Sbjct: 359 DNEVRVHEYLPKSQNYTKLMGQSKFCLCPSGHEVASPRVVEAIHAGCVPVIICDNYSLPF 418

Query: 307 ADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDA 366
           +D + W +  + V+ + +P++ +IL SI  +  LR    +    ++R  +  +PA+P D 
Sbjct: 419 SDVLHWSQFSVKVSVQKIPEIKSILQSISRKKYLRLHMNVLR--VRRHFMINRPAKPFDM 476

Query: 367 FHQILNGL 374
            H IL+ +
Sbjct: 477 MHMILHSI 484


>gi|363808264|ref|NP_001242494.1| uncharacterized protein LOC100777368 [Glycine max]
 gi|255635742|gb|ACU18220.1| unknown [Glycine max]
          Length = 409

 Score =  118 bits (296), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 99/353 (28%), Positives = 160/353 (45%), Gaps = 56/353 (15%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
           K KVY+Y  P   +     + PR LT  +A+E +  + +  S  RT NP+EA  F+ PI 
Sbjct: 87  KFKVYIY--PDG-DPNTFYQTPRKLTGKYASEGYFFQNIRDSRFRTENPDEAHLFFIPI- 142

Query: 99  PTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
            +C               ++++ ++ + S +PYWNRT GADHFFV  HD G        +
Sbjct: 143 -SCHKMRGKGTSYENMTIIVQNYVESLISKYPYWNRTLGADHFFVTCHDVGV-------R 194

Query: 159 AIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY---APPQKMQAHQIPPDTPRSIF 215
           A E G+  L++ +           VC     +   P+   A PQ +Q   +P        
Sbjct: 195 ATE-GLEFLVKNSIRA--------VCSPSYDVGFIPHKDVALPQVLQPFALPAGGN---- 241

Query: 216 VYFRGLFYDVNNDPEGGYYA-------RGARAAVWENFKNNPLFDISTDHPT------TY 262
                   D+ N    G++A       R   A VWEN   +   DIS +  +       Y
Sbjct: 242 --------DIENRTTLGFWAGHRNSKIRVILARVWEN---DTELDISNNRISRATGHLVY 290

Query: 263 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
            +   R+ FC+CP G    S R+ +++ +GCIPV++++   LPF D + W +  + + E 
Sbjct: 291 QKRFYRSKFCICPGGSQVNSARIADSIHYGCIPVMLSNYYDLPFNDILDWNKFAVVLKES 350

Query: 323 DVPKLDTILTSIP-PEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           DV +L  IL +I   E +     L+    +++   +  P+   DAFH ++  L
Sbjct: 351 DVYQLKQILKNISDAEFVTLHNNLV---KVQKHFQWNSPSIRFDAFHLVMYDL 400


>gi|125557509|gb|EAZ03045.1| hypothetical protein OsI_25187 [Oryza sativa Indica Group]
          Length = 601

 Score =  118 bits (295), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 97/355 (27%), Positives = 157/355 (44%), Gaps = 54/355 (15%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP--- 96
            KV+VYE      +  +  D  C +       F++   + + +RT +P +A  F+ P   
Sbjct: 267 FKVFVYE----EGEPPVFHDGPCHSIYSTEGRFIYAMEMENRMRTRDPNQAHVFFLPFSV 322

Query: 97  ------IYP--TCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDF 148
                 IY   + D+ P          R +   I ++S+ +P+WNR+ GADHF +  HD+
Sbjct: 323 VKMVKMIYEPNSHDMDPL--------RRTISDYINVVSTKYPHWNRSLGADHFMLSCHDW 374

Query: 149 GACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQI-- 206
           G              I  +L  A   + F     V L E ++          + A Q+  
Sbjct: 375 GPYVSSANGHLFSNSIR-VLCNANTSEGFDPSRDVSLPEINLR-------SDVVARQVGG 426

Query: 207 PPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTD------HPT 260
           P  + R I  +F G     ++ P         R  + +++      DI         H  
Sbjct: 427 PSASHRPILAFFAG----GDHGP--------VRPLLLQHWGKGQDADIQVSEYLPRRHSM 474

Query: 261 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
           +Y + M+R+ FCLCP G+   SPR+VEA+   C+PV+I DD  LPFAD + W    + VA
Sbjct: 475 SYTDMMRRSRFCLCPSGYEVASPRVVEAIYLECVPVVIGDDYALPFADVLNWAAFSVRVA 534

Query: 321 EEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQ-PGDAFHQILNGL 374
             D+P+L  IL ++ P   +R QR +   +++R  +    A    D FH IL+ +
Sbjct: 535 VGDIPRLKEILAAVSPRQYIRMQRRVR--AVRRHFMVSDGAPWRFDVFHMILHSI 587


>gi|224142567|ref|XP_002324627.1| predicted protein [Populus trichocarpa]
 gi|222866061|gb|EEF03192.1| predicted protein [Populus trichocarpa]
          Length = 342

 Score =  118 bits (295), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 84/306 (27%), Positives = 142/306 (46%), Gaps = 27/306 (8%)

Query: 79  SSPVRTLNPEEADWFYTPIYPTCDLTPTGLP-LPFKSPRMMR---SAIQLISSNWPYWNR 134
            SP    +P+EA  F+ PI     +     P + F   ++ R     +++++  + YWNR
Sbjct: 41  KSPFIARHPDEAHAFFLPISVAYIIHYVYKPRITFARDQLQRLVTDYVRVVADKYTYWNR 100

Query: 135 TEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPP 194
           T+GADHF V  HD+      +    + R  + +L  A + + F  +  V + E  + +  
Sbjct: 101 TQGADHFSVSCHDWAPDVS-RANPELFRYFVRVLCNANISEGFRPQRDVSIPEIFLPVGK 159

Query: 195 YAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGY-YARGARAAVWENFKNNP--- 250
             PP++       P + RSI  +F G          G + + R      W+   +     
Sbjct: 160 LGPPREYTK----PPSKRSILAFFAG----------GAHGHIRKVLLTHWKEKDDEVQVH 205

Query: 251 --LFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
             L   +  +   Y+E M ++ FCLCP G    SPR+V A+  GC+PV I+ +  LPF+D
Sbjct: 206 EYLTQRNKKNTNLYFELMGQSKFCLCPSGHEVASPRVVTAIQLGCVPVTISANYSLPFSD 265

Query: 309 AIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFH 368
            + W +  + +  E +P++ TIL  I     L  QR +    ++R  +  +PAQP D  H
Sbjct: 266 VLDWSKFSVDIPPEKIPEIKTILKGISSRRYLTMQRRVMQ--IQRHFMLNRPAQPYDMLH 323

Query: 369 QILNGL 374
            IL+ +
Sbjct: 324 MILHSV 329


>gi|302804767|ref|XP_002984135.1| glycosyltransferase-like protein [Selaginella moellendorffii]
 gi|300147984|gb|EFJ14645.1| glycosyltransferase-like protein [Selaginella moellendorffii]
          Length = 376

 Score =  118 bits (295), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 96/363 (26%), Positives = 156/363 (42%), Gaps = 30/363 (8%)

Query: 15  ADNTPKIERISGSAGDVLED--DPVGKLKVYVYEL-PSKYNKKLLLKDPRCLTHMFAAEI 71
           ADN P   R +      L+D  D V  +KV+VY   P      + L      +  +A+E 
Sbjct: 29  ADNGPYHSRRA-----FLDDYRDMVNTMKVFVYPCSPRDPFSHIFLPTSSAPSGNYASEA 83

Query: 72  FMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPY 131
           +  + L  S + T +P +AD F+ P   T       + +  + P  +R  ++ IS  WPY
Sbjct: 84  YFKKALAGSGMVTDDPSQADLFFMPFSITRLRNDPKVGVG-RMPAFVRDYVKNISHRWPY 142

Query: 132 WNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSIT 191
           WNRT G+DHF+V  H  G     + +      I  +      VQ F     V +      
Sbjct: 143 WNRTGGSDHFYVACHSIGKVALEKAQHVKLNAIQVVCSSNYYVQGFIPHKDVAI------ 196

Query: 192 IPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPL 251
             P   P+     +I     R +  +F G     +N P      R      W N  +  +
Sbjct: 197 --PQIWPRSESFREIKTIEQRKVLAFFAG----GSNSP-----VRANVVRTWRN--DTQI 243

Query: 252 FDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
               +    +Y E + R+ FCL   G+   + RL +A  +GC+PV+IA+   LPF+  + 
Sbjct: 244 HAYPSRIQGSYAEALLRSKFCLHVKGYEVNTARLGDAFFYGCVPVVIANHYDLPFSSVLN 303

Query: 312 WEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQIL 371
           W+   + V   ++PKL  IL+ I  E   +  RL+ +   +R   +  P +  DAF+ ++
Sbjct: 304 WKSFSVVVTTANIPKLKAILSGISREDYSQMHRLVLDA--RRHFQWHAPPREYDAFYMVM 361

Query: 372 NGL 374
             L
Sbjct: 362 YQL 364


>gi|449431924|ref|XP_004133750.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
           sativus]
          Length = 412

 Score =  117 bits (294), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 100/356 (28%), Positives = 158/356 (44%), Gaps = 54/356 (15%)

Query: 35  DPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFY 94
           D   K KVY+Y  P   +     + PR LT  +A+E +  + +  S  RT +P++A  F+
Sbjct: 86  DMESKFKVYIY--PDG-DPNTFYQTPRKLTGKYASEGYFFQNIRESRFRTEDPDQAHLFF 142

Query: 95  TPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHY 154
            PI  +C               ++++ ++ + S +PYWNRT GADHFFV  HD G     
Sbjct: 143 IPI--SCHKMRGKGTSYENMTVIVQNYVEGLISKYPYWNRTLGADHFFVTCHDVGV---- 196

Query: 155 QEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY---APPQKMQAHQIPPDTP 211
              +A E G+  L++ A  V        VC     +   P+   A PQ +Q   +P    
Sbjct: 197 ---RASE-GLPFLIKNAIRV--------VCSPSYDVGFIPHKDVALPQVLQPFALPAGGN 244

Query: 212 RSIFVYFRGLFYDVNNDPEGGYYA-------RGARAAVWENFKNNPLFDISTDHPTT--- 261
                       D  N    G++A       R   A VWEN   +   DIS +  +    
Sbjct: 245 ------------DTENRTTLGFWAGHRNSKIRVILARVWEN---DTELDISNNRISRATG 289

Query: 262 ---YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
              Y +   +  FC+CP G    S R+ +++ +GC+PVI++D   LPF D + W +  + 
Sbjct: 290 HLLYQKRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDILDWRKFSVI 349

Query: 319 VAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           V E DV +L  IL  I     ++  + L    +++   +  P    DAFH ++  L
Sbjct: 350 VKERDVYQLKQILKDISDIEFIKLHKNLMQ--VQKHFQWNSPPIKYDAFHMVMYDL 403


>gi|224071347|ref|XP_002303415.1| predicted protein [Populus trichocarpa]
 gi|222840847|gb|EEE78394.1| predicted protein [Populus trichocarpa]
          Length = 326

 Score =  117 bits (294), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 97/352 (27%), Positives = 158/352 (44%), Gaps = 54/352 (15%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
             K+Y+Y  P   +     + PR LT  +A+E +  + +  S  RTL+P++A  F+ PI 
Sbjct: 4   NFKIYIY--PDG-DPNTFYQTPRKLTGKYASEGYFFQNIRESRFRTLDPDQAHLFFIPI- 59

Query: 99  PTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
            +C               ++ + ++ + + + YWNRT GADHFFV  HD G        +
Sbjct: 60  -SCHKMRGKGTSYENMTVIVENYVESLIAKYSYWNRTLGADHFFVTCHDVGV-------R 111

Query: 159 AIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY---APPQKMQAHQIPPDTPRSIF 215
           A E G+  L++ A  V        VC     +   P+   A PQ +Q   +P        
Sbjct: 112 ATE-GVPFLIKNAIRV--------VCSPSYDVGFIPHKDVALPQVLQPFALPAGGN---- 158

Query: 216 VYFRGLFYDVNNDPEGGYYA--RGAR-----AAVWENFKNNPLFDISTDHPT------TY 262
                   DV N    G++A  R +R     A VWEN   +   DIS++          Y
Sbjct: 159 --------DVENRTTLGFWAGHRNSRIRVILARVWEN---DTELDISSNRINRATGHLVY 207

Query: 263 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
            +      FC+CP G    S R+ +++ +GC+PVI+++   LPF D + W +  + + E+
Sbjct: 208 QKRFYGTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWHKFSVILKEQ 267

Query: 323 DVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           DV +L  IL  IP    +   + L    +++   +  P    DAFH ++  L
Sbjct: 268 DVYQLKQILKDIPDNQFVSLHKNLI--KVQKHFQWNSPPIKNDAFHMVMYDL 317


>gi|224120366|ref|XP_002331030.1| predicted protein [Populus trichocarpa]
 gi|222872960|gb|EEF10091.1| predicted protein [Populus trichocarpa]
          Length = 339

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 88/303 (29%), Positives = 136/303 (44%), Gaps = 35/303 (11%)

Query: 86  NPEEADWFYTPIYPTCDLTPTGLP-LPFKSPRMMR---SAIQLISSNWPYWNRTEGADHF 141
           +PEEA  F  P+     +     P + F   ++ R     +++I+   PYWNRT GADHF
Sbjct: 48  HPEEAQAFLLPVSVAYIIHYVYRPRITFSRDQLQRLVTDYVRVIADKHPYWNRTHGADHF 107

Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
            V  HD+      + +  + +  +  L  A   + F  +  V + E  + +    PPQ+ 
Sbjct: 108 SVSCHDWAPDVS-RADPGLFKYFIRALCNANTSEGFQPQRDVSIPEIFLPVGKLGPPQEY 166

Query: 202 QAHQIPPDTPRSIFVYFRGLFYD----------VNNDPEGGYYARGARAAVWENFKNNPL 251
                 P + RSI  +F G  +              D E   +    R    +N KNN L
Sbjct: 167 AQ----PPSKRSILAFFAGGAHGHIRKILLERWKEKDDEIQVHEYLTR----KNKKNNNL 218

Query: 252 FDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
                     Y+E M ++ FCLCP G    SPR+V A+  GC+PV I+D+  LPF+D + 
Sbjct: 219 ----------YFELMGQSKFCLCPSGHEVASPRVVTAIQLGCVPVTISDNYSLPFSDVLD 268

Query: 312 WEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQIL 371
           W +  + +  E +P +  IL  I     L  QR +    ++R     +PAQP D  H IL
Sbjct: 269 WSKFSVDIPSEKIPDIKIILKGISVRRYLTMQRRVMQ--IRRHFTLNRPAQPYDMLHMIL 326

Query: 372 NGL 374
           + +
Sbjct: 327 HSV 329


>gi|168062371|ref|XP_001783154.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162665352|gb|EDQ52040.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 351

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 77/258 (29%), Positives = 125/258 (48%), Gaps = 15/258 (5%)

Query: 122 IQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRN 181
           ++ I+S +PYWNRT GADHFFV  HD+        ++ +    + ++  A L   F  + 
Sbjct: 87  VERITSKYPYWNRTRGADHFFVSCHDWAPLSTILHDE-LHNNSMKVVCNADLTANFDIQK 145

Query: 182 HVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAA 241
            V + +            ++    +PP   R    ++ G  + +   P    + RG  ++
Sbjct: 146 DVSIPQAVKG----GNQSELDIDNLPPGK-RDYLAFYAGQMHGLVR-PVLIQHWRGKDSS 199

Query: 242 --VWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIA 299
             V+E        +IS      Y + M+R+ FCLCP G+   SPR+VEA++ GC+PVIIA
Sbjct: 200 MKVYEVLPPEIAKNIS------YAQHMKRSKFCLCPKGFEVNSPRIVEAILSGCVPVIIA 253

Query: 300 DDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQ 359
           D+ VLPF++ + W +  + V E+D+P L  ILT++P       Q  L         L  Q
Sbjct: 254 DNFVLPFSNVLDWSKFSITVEEKDIPNLKRILTNVPDGTYRSMQSCLKYIRRHFVWLEDQ 313

Query: 360 PAQPGDAFHQILNGLARK 377
                D+FH  +  + R+
Sbjct: 314 EDTQYDSFHMTMYSIWRQ 331


>gi|356562965|ref|XP_003549738.1| PREDICTED: probable glycosyltransferase At3g42180-like [Glycine
           max]
          Length = 473

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 90/311 (28%), Positives = 137/311 (44%), Gaps = 42/311 (13%)

Query: 80  SPVRTLNPEEADWF-------------YTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLIS 126
           SP +  +PE+A  F             Y P     D  P  L       R++   I +++
Sbjct: 176 SPFKATHPEQAHLFLLPYSVSKVIRYVYKPRRSRSDYDPDRLQ------RLVADYINILA 229

Query: 127 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLN 186
           + +PYWNR++GADHF V  HD+G        +  +  I  L   A   + F     V + 
Sbjct: 230 NRYPYWNRSKGADHFLVSCHDWGPRISDANPELFKYFIRALCN-ANTSEGFQPNRDVSIP 288

Query: 187 EGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAV--WE 244
           E  +      PP   Q     P+  R+I  +F            GG + +  +  +  W+
Sbjct: 289 EVYLPSGKLGPPNMGQH----PNN-RTILAFF-----------AGGAHGKIRKKLLKRWK 332

Query: 245 NFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL 304
           N                Y + M  + FCLCP G    SPR+VEA+  GC+PVII D+  L
Sbjct: 333 NKDKEVQVHEYLPKGQDYTKLMGLSKFCLCPSGHEVASPRVVEAIYAGCVPVIICDNYSL 392

Query: 305 PFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLAN-PSMKRAMLFPQPAQP 363
           PF D + W +  M +A E +P++ TIL S+  +  L    L +N   ++R  +  +PA+P
Sbjct: 393 PFIDVLNWRKFSMEIAVERMPEIKTILQSVSKDKYL---ELYSNVRRVRRHFVINRPAKP 449

Query: 364 GDAFHQILNGL 374
            D  H IL+ L
Sbjct: 450 FDLIHMILHSL 460


>gi|224087070|ref|XP_002308053.1| predicted protein [Populus trichocarpa]
 gi|222854029|gb|EEE91576.1| predicted protein [Populus trichocarpa]
          Length = 339

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 86/299 (28%), Positives = 135/299 (45%), Gaps = 27/299 (9%)

Query: 86  NPEEADWFYTPIYPTCDLTPTGLP-LPFKSPRMMR---SAIQLISSNWPYWNRTEGADHF 141
           +PEEA  F  P+     +     P + F   ++ R     +++I+   PYWNRT GADHF
Sbjct: 48  HPEEAQAFLLPVSVAYIIHYVYRPRITFSRDQLQRLVTDYVRVIADKHPYWNRTHGADHF 107

Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
            V  HD+      + +  + +  +  L  A   + F  +  V + E  + +    PPQ+ 
Sbjct: 108 SVSCHDWAPDVS-RADPGLFKYFIRALCNANTSEGFQPQRDVSIPEIFLPVGKLGPPQEY 166

Query: 202 QAHQIPPDTPRSIFVYFRGLFYD------VNNDPEGGYYARGARAAVWENFKNNPLFDIS 255
                 P + RSI  +F G  +       +    E     +       +N KNN L    
Sbjct: 167 AQ----PPSKRSILAFFAGGAHGHIRKILLERWKEKDDEIQVHEYLTQKNKKNNNL---- 218

Query: 256 TDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEI 315
                 Y+E M ++ FCLCP G    SPR+V A+  GC+PV I+D+  LPF+D + W + 
Sbjct: 219 ------YFELMGQSKFCLCPSGHEVASPRVVTAIQLGCVPVTISDNYSLPFSDVLDWSKF 272

Query: 316 GMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
            + +  E +P +  IL  I     L  QR +    ++R     +PAQP D  H IL+ +
Sbjct: 273 SVDIPSEKIPDIKIILKGISVRRYLTMQRRVMQ--IRRHFTLNRPAQPYDMLHMILHSV 329


>gi|9758523|dbj|BAB08970.1| unnamed protein product [Arabidopsis thaliana]
          Length = 559

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 99/352 (28%), Positives = 166/352 (47%), Gaps = 44/352 (12%)

Query: 39  KLKVYVYELPSK--YNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTL-NPEEADWFYT 95
           KLK+YVY+   K  ++  +    PR +   +A+E +  + + S+    + +P +A  FY 
Sbjct: 225 KLKIYVYKEGGKPIFHTPM----PRGI---YASEGWFMKLMESNKKFVVKDPRKAHLFYI 277

Query: 96  PIYPTCDLTPTGLPLPFKSPRMM----RSAIQLISSNWPYWNRTEGADHFFVVPHDFGAC 151
           PI  +     + L L F++P+ +    +  + LI+  + +WNRT GADHF V  HD+G  
Sbjct: 278 PI--SIKALRSSLGLDFQTPKSLADHLKEYVDLIAGKYKFWNRTGGADHFLVACHDWGNK 335

Query: 152 FHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTP 211
              +  K   R     L  + + Q F       L    I     AP + +        + 
Sbjct: 336 LTTKTMKNSVRS----LCNSNVAQGFRIGTDTALPVTYIR-SSEAPLEYLGG---KTSSE 387

Query: 212 RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFK-NNPLFDISTDHPTT---YYEDMQ 267
           R I  +F G  +          Y R     +WEN + +  +F      P +   Y E M+
Sbjct: 388 RKILAFFAGSMHG---------YLRPILVKLWENKEPDMKIFGPMPRDPKSKKQYREYMK 438

Query: 268 RAI-----FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
            +      +C+C  G+   +PR+VEA++  C+PVIIAD+ V PF + + WEE  +FV E+
Sbjct: 439 SSSSHFNRYCICARGYEVHTPRVVEAIINECVPVIIADNYVPPFFEVLNWEEFAVFVEEK 498

Query: 323 DVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           D+P L  IL SIP +  +  Q  +   ++++  L+ +     D FH IL+ +
Sbjct: 499 DIPNLRNILLSIPEDRYIGMQARV--KAVQQHFLWHKKPVKFDQFHMILHSI 548


>gi|449482564|ref|XP_004156325.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
           sativus]
          Length = 516

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 92/356 (25%), Positives = 152/356 (42%), Gaps = 58/356 (16%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP-- 96
           ++K++VYE      +  L  +  C +       F+H   + S  RT +P +A  F+ P  
Sbjct: 189 EMKIFVYE----EGEPPLFHNGPCKSIYSTEGNFIHAIEMDSQFRTKDPNKAHVFFLPLS 244

Query: 97  ---------IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHD 147
                    ++ + D TP            +   I +I + +P+WNR+ GADHF +  HD
Sbjct: 245 VAMLVRFVYVHDSHDFTPIR--------HTVVDYINVIGTKYPFWNRSLGADHFMLSCHD 296

Query: 148 FGACFHYQEEKAI---ERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAH 204
           +G     +  K++    +  + +L  A   + F     V             P   +Q  
Sbjct: 297 WGP----EASKSVPNLYKNSIRVLCNANTSEGFNPSKDVSF-----------PEINLQTG 341

Query: 205 QI------PPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDH 258
            +      P  + R I  +F G  +            R      WEN   +         
Sbjct: 342 HLTGFLGGPSPSHRPILAFFAGGLHGP---------IRPILIQQWENQDQDIQVHQYLPK 392

Query: 259 PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
             +Y + M+++ FCLCP G+   SPR+VEA+  GC+PV+I+D  V PF+D I W+   + 
Sbjct: 393 GVSYIDMMRKSKFCLCPSGYEVASPRIVEAIYTGCVPVLISDHYVPPFSDVINWKSFSVE 452

Query: 319 VAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           V+ +D+P L TILT I     LR  R +    ++R      P +  D +H IL+ +
Sbjct: 453 VSVDDIPNLKTILTGISTRQYLRMYRRVVK--VRRHFEVNSPPKRYDVYHMILHSV 506


>gi|449451243|ref|XP_004143371.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
           sativus]
          Length = 516

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 92/356 (25%), Positives = 152/356 (42%), Gaps = 58/356 (16%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP-- 96
           ++K++VYE      +  L  +  C +       F+H   + S  RT +P +A  F+ P  
Sbjct: 189 EMKIFVYE----EGEPPLFHNGPCKSIYSTEGNFIHAIEMDSQFRTKDPNKAHVFFLPLS 244

Query: 97  ---------IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHD 147
                    ++ + D TP            +   I +I + +P+WNR+ GADHF +  HD
Sbjct: 245 VAMLVRFVYVHDSHDFTPIR--------HTVVDYINVIGTKYPFWNRSLGADHFMLSCHD 296

Query: 148 FGACFHYQEEKAI---ERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAH 204
           +G     +  K++    +  + +L  A   + F     V             P   +Q  
Sbjct: 297 WGP----EASKSVPNLYKNSIRVLCNANTSEGFNPSKDVSF-----------PEINLQTG 341

Query: 205 QI------PPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDH 258
            +      P  + R I  +F G  +            R      WEN   +         
Sbjct: 342 HLTGFLGGPSPSHRPIMAFFAGGLHGP---------IRPILIQRWENQDQDIQVHQYLPK 392

Query: 259 PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
             +Y + M+++ FCLCP G+   SPR+VEA+  GC+PV+I+D  V PF+D I W+   + 
Sbjct: 393 GVSYIDMMRKSKFCLCPSGYEVASPRIVEAIYTGCVPVLISDHYVPPFSDVINWKSFSVE 452

Query: 319 VAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           V+ +D+P L TILT I     LR  R +    ++R      P +  D +H IL+ +
Sbjct: 453 VSVDDIPNLKTILTGISTRQYLRMYRRVVK--VRRHFEVNSPPKRYDVYHMILHSV 506


>gi|302780894|ref|XP_002972221.1| glycosyltransferase-like protein [Selaginella moellendorffii]
 gi|300159688|gb|EFJ26307.1| glycosyltransferase-like protein [Selaginella moellendorffii]
          Length = 376

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 96/363 (26%), Positives = 156/363 (42%), Gaps = 30/363 (8%)

Query: 15  ADNTPKIERISGSAGDVLED--DPVGKLKVYVYEL-PSKYNKKLLLKDPRCLTHMFAAEI 71
           ADN P   R +      L+D  D V  +KV+VY   P      + L      +  +A+E 
Sbjct: 29  ADNGPYHSRRA-----FLDDYRDMVNTMKVFVYPCSPRDPFSHIFLPTSSAPSGNYASEA 83

Query: 72  FMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPY 131
           +  + L  S + T +P +AD F+ P   T       + +  + P  +R  ++ IS  WPY
Sbjct: 84  YFKKALAESGMVTDDPSQADLFFMPFSITRLRNDPKVGVG-RMPAFVRDYVKNISHRWPY 142

Query: 132 WNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSIT 191
           WNRT G+DHF+V  H  G     + +      I  +      VQ F     V +      
Sbjct: 143 WNRTGGSDHFYVACHSIGKVALEKAQHVRLNAIQVVCSSNYYVQGFIPHKDVAM------ 196

Query: 192 IPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPL 251
             P   P+     +I     R +  +F G     +N P      R      W N  +  +
Sbjct: 197 --PQIWPRSESFREIKTIEQRKVLAFFAG----GSNSP-----VRANVVRTWRN--DTQI 243

Query: 252 FDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
               +    +Y E + R+ FCL   G+   + RL +A  +GC+PV+IA+   LPF+  + 
Sbjct: 244 HAYPSRIQGSYAEALLRSKFCLHVKGYEVNTARLGDAFFYGCVPVVIANYYDLPFSSVLN 303

Query: 312 WEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQIL 371
           W+   + V   ++PKL  IL+ I  E   +  RL+ +   +R   +  P +  DAF+ ++
Sbjct: 304 WKSFSVVVTTANIPKLKAILSGISREDYSQMHRLVLDA--RRHFQWHAPPREYDAFYMVM 361

Query: 372 NGL 374
             L
Sbjct: 362 YQL 364


>gi|326529697|dbj|BAK04795.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 348

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 75/259 (28%), Positives = 123/259 (47%), Gaps = 14/259 (5%)

Query: 118 MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTF 177
           +R  ++ +++ +P+WNRT GADHF V  HD+G           +  I  L    +    F
Sbjct: 90  LRDFVKGLAAKYPFWNRTRGADHFLVACHDWGPYTTTAHRDLSKNSIKALCNADSSEGIF 149

Query: 178 GQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARG 237
                V L E +I      P + ++     P + R I  +F G  +         ++ +G
Sbjct: 150 TPGKDVSLPETTIR----TPKRPLRYVGGLPVSRRRILAFFAGNVHGRVRPVLLQHWGKG 205

Query: 238 ARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVI 297
                 ++ +   L          Y + M+ + FCLCP+G+   SPR+VEA+ + C+PVI
Sbjct: 206 QD----DDMRVYALLPGRVSRTMNYIQHMKNSKFCLCPMGYEVNSPRIVEALYYECVPVI 261

Query: 298 IADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLAN--PSMKRAM 355
           IAD+ VLPF+D + W    + VAE+D+P+L  IL  I     LR+   + +    ++R  
Sbjct: 262 IADNFVLPFSDVLDWSAFSVVVAEKDIPELKRILQGIS----LRRYVAMHDCVKRLQRHF 317

Query: 356 LFPQPAQPGDAFHQILNGL 374
           L+       D FH IL+ +
Sbjct: 318 LWYDRPLRYDLFHMILHSI 336


>gi|302830910|ref|XP_002947021.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
 gi|300268065|gb|EFJ52247.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
          Length = 834

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 104/372 (27%), Positives = 156/372 (41%), Gaps = 78/372 (20%)

Query: 42  VYVYELPSKYNKKLL-LKDPRCL-----------THMFA----AEIFMHRFLLSSPVRTL 85
           +YVY+LP  YN ++L  ++ + L           T +FA     E+  H  LL S  RT 
Sbjct: 391 IYVYDLPPAYNARMLQYRNDKGLCTWRGFGSGNRTEIFAWTYGLEVLFHEMLLQSEHRTF 450

Query: 86  NPEEADWFYTPIYPTCDL-------------TPTGLPLPFKSPRMMRSAIQLISSNWPYW 132
           +PE AD+FY P+Y +C +              P+G P       MM      I  ++PYW
Sbjct: 451 DPEAADYFYVPVYGSCFIFPLHCYADGPWWHVPSG-PRVMHVTNMMLEVRDWIRKHFPYW 509

Query: 133 NRTEGADHFFVVPHDFGACF--------------------HYQEEKAI------ERGILP 166
           +R  G DH +++ HD GAC+                    H+    A       +  + P
Sbjct: 510 DRRGGRDHIWLMTHDEGACYAPTEIYNSSIFLTHWGRIDKHHASNTAFTPDNYTQEYVHP 569

Query: 167 LLQRATLVQTFGQRNHVCLNEGS-ITIPPYAPPQKMQAHQI--PPDTPRSIFVYFRGLFY 223
                 L    G   H C   G  + +P    P   +   +   P   R I +Y RG   
Sbjct: 570 EQPGGWLHLIDG---HPCYTPGKDLVVPALKLPHHFRQSPLLFHPPRQRDILLYLRG--- 623

Query: 224 DVNND--PEGGYYARGARAAVWENFKN-------NPLFDISTDHPTTYYEDMQRAIFCLC 274
           DV     P    Y+RG R  ++  +++       N +    +D P  Y E + R+ FCL 
Sbjct: 624 DVGKHRLPN---YSRGIRQRLYRLWRDQQWLQGYNVMIGDGSDVPGDYSEHLSRSKFCLV 680

Query: 275 PLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSI 334
             G   WSPRL +AV+ GC+PVII D +   + D +  E   + V E+++  L   L  +
Sbjct: 681 VPG-DGWSPRLEDAVLHGCVPVIIMDGVHGVWEDQLELERFSIRVGEDELEGLPQQLAVV 739

Query: 335 PPEVILRKQRLL 346
           P  V+   QR L
Sbjct: 740 PQRVLEDMQRKL 751


>gi|115456850|ref|NP_001052025.1| Os04g0109900 [Oryza sativa Japonica Group]
 gi|38345576|emb|CAE01775.2| OSJNBa0027H06.12 [Oryza sativa Japonica Group]
 gi|113563596|dbj|BAF13939.1| Os04g0109900 [Oryza sativa Japonica Group]
 gi|125589112|gb|EAZ29462.1| hypothetical protein OsJ_13536 [Oryza sativa Japonica Group]
          Length = 441

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 98/347 (28%), Positives = 163/347 (46%), Gaps = 30/347 (8%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP------VRTLNPEEADW 92
           + KVYVYE      +  +L +  C         F+ +  L SP      VRT +P  A  
Sbjct: 103 RFKVYVYE----EGEPPILHEGPCKNIYTIEGSFIEQLELMSPSDAGGGVRTWDPTRAHA 158

Query: 93  FYTPIYPTCDLT-----PTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHD 147
           F+ P   +  +      P+    P ++  ++   ++++++  P+WNR+ GADHF +  HD
Sbjct: 159 FFLPFSVSQMVKFVYRPPSQDRAPLRA--IVADYVRVVAARHPFWNRSAGADHFMLSCHD 216

Query: 148 FGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIP 207
           +G  +  + +  +    +  L  A   + F     V + E  I +     P+++ A   P
Sbjct: 217 WGP-YASRGQPELYTNAIRALCNANTSEGFRPGKDVSVPE--INLYDGDMPRELLA-PAP 272

Query: 208 PDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQ 267
               R +  +F G  +    D    ++ +G  AA +      P+++        YY  M+
Sbjct: 273 GLESRPLLAFFAGGRHGHVRDLLLRHW-KGRDAATF------PVYEYDLPAAGDYYSFMR 325

Query: 268 RAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKL 327
           RA FCLCP G    SPR+VEA+   C+PV+IAD   LPFAD + WE   + VA  D+P+L
Sbjct: 326 RARFCLCPSGHEVASPRVVEAIQAECVPVVIADGYALPFADVLRWEAFSVAVAVGDIPRL 385

Query: 328 DTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
              L  IP   + R +R +    +KR ++  QP +  D F+ IL+ +
Sbjct: 386 RERLERIPAAEVERLRRGVR--LVKRHLMLQQPPRRLDMFNMILHSV 430


>gi|159480086|ref|XP_001698117.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158273916|gb|EDO99702.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 821

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 108/386 (27%), Positives = 157/386 (40%), Gaps = 85/386 (22%)

Query: 36  PVGKLK--VYVYELPSKYNKKLLLKDPRCLTH------------------MFAAEIFMHR 75
           P G+ +  +YVY++P KY  ++L    R L H                   ++ E   H 
Sbjct: 342 PGGRRRPLIYVYDVPPKYTSRMLQY--RLLAHACLWRRWLDGNNTELTGWTYSVETMFHE 399

Query: 76  FLLSSPVRTLNPEEADWFYTPIYPTCDLTPT-------GLPLPFKSPRMMRSAIQL---- 124
            LL S  RT +PEEAD+FY P Y TC   P            P+   R M  A  L    
Sbjct: 400 LLLQSEHRTFDPEEADFFYVPHYVTCYFWPIMGWADAPWWHAPYVDSRPMHGANMLTELH 459

Query: 125 --ISSNWPYWNRTEGADHFFVVPHDFGACF-------------HYQEEKAIERGILPLLQ 169
             + +N PYW+R  G DH +++  D GAC+             H+   +A        LQ
Sbjct: 460 GWLRTNLPYWDRRGGRDHIWLMAADEGACWMPTAIYNTSIVLTHWGRLEANHTSGTAYLQ 519

Query: 170 RATLVQTFG-QR-----------NHVCLN-EGSITIPPYAPPQKMQAHQI--PPDTPRSI 214
                  +G QR            H C + +  + IP + PP       +   P   R I
Sbjct: 520 DVYDRPVYGFQRWPGVDYHHDIEGHPCFDPKKDLVIPAFKPPFHFARSPLLGAPPLQRDI 579

Query: 215 FVYFRGLFYDVNNDPEGGY----YARGARAAVWENFKNNPLFD---ISTDHPTT----YY 263
            +YFRG          G +    Y+RG R  + +       F+   I   H       Y 
Sbjct: 580 LLYFRG--------DSGAFRLPQYSRGIRQRITDLSNRQDWFNRYKIVISHGGMVGGDYS 631

Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
           E + R+ FCL   G   WSPR  +A++ GCIPV++ D +   F   + W+   + + E+D
Sbjct: 632 EHLARSKFCLVAPG-DGWSPRAEDAILHGCIPVVVMDGVQAVFESILDWDSFSLRIREDD 690

Query: 324 --VPKLDTILTSIPPEVILRKQRLLA 347
             +  L  +L SI PE +   QR LA
Sbjct: 691 AALEALPQLLASISPERLAHMQRHLA 716


>gi|297808029|ref|XP_002871898.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297317735|gb|EFH48157.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 610

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 94/351 (26%), Positives = 159/351 (45%), Gaps = 48/351 (13%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTL-NPEEADWFYTP-- 96
           LKVYVY    K   + +   P  L  ++A+E +  + +  +   T+ +P +A  +Y P  
Sbjct: 281 LKVYVY----KEGNRPIFHTP-ILKGLYASEGWFMKLMEGNKQYTVKDPRKAHLYYMPFS 335

Query: 97  -------IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFG 149
                  +Y       T L       + ++   + ISS +P++NRT+GADHF V  HD+ 
Sbjct: 336 ARMLEYTLYVRNSHNRTNLR------QFLKEYTEHISSKYPFFNRTDGADHFLVACHDWA 389

Query: 150 ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPD 209
               Y+    +E  I  L   A +   F     + L E  +     A    ++     P 
Sbjct: 390 P---YETRHHMEHCIKALCN-ADVTAGFKIGRDISLPETYVR----AAKNPLRDLGGKPP 441

Query: 210 TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP------TTYY 263
           + R    ++ G  +          Y R      W++   +P   I    P        Y 
Sbjct: 442 SQRRTLAFYAGSMHG---------YLRQILLQHWKD--KDPDMKIFGRMPFGVASKMNYI 490

Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
           E M+ + +C+CP G+   SPR+VE++ + C+PVII+D+ V PF + + W    + VAE+D
Sbjct: 491 EQMKSSKYCICPKGYEVNSPRVVESIFYECVPVIISDNFVPPFFEVLDWSAFSVIVAEKD 550

Query: 324 VPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           +P+L  IL+SIP E  ++ Q  +A    +R  L+    +  D FH +L+ +
Sbjct: 551 IPRLKDILSSIPEEKYVKMQ--MAVRKAQRHFLWHAKPEKYDLFHMVLHSI 599


>gi|255582307|ref|XP_002531944.1| catalytic, putative [Ricinus communis]
 gi|223528390|gb|EEF30426.1| catalytic, putative [Ricinus communis]
          Length = 363

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 84/332 (25%), Positives = 143/332 (43%), Gaps = 53/332 (15%)

Query: 79  SSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPF----KSPRMMRSAIQLISSNWPYWNR 134
            SP    + +EA  F+ P+     +    LP+      K  R+ +  +++++  +P+WNR
Sbjct: 40  ESPFIARHADEAHVFFLPVSVAHIVEYIYLPITSYDRDKLIRVFKDYVKVVADKYPFWNR 99

Query: 135 TEGADHFFVVPHD------------------------FGACFHY-------QEEKAIERG 163
           + G+DHF +  HD                        F   FH        Q +  + + 
Sbjct: 100 SSGSDHFMLSCHDWAMVHINSIPVELVRAFALGRNLKFEISFHSHRAPEISQHDPELYKN 159

Query: 164 ILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFY 223
           ++ ++  A   + F     V L E  + IPP         H   P   R I  +F G   
Sbjct: 160 LIRVICNANTSEGFLPTRDVTLPE--LNIPPGGFDH---VHHCLPSHKRRILAFFAG--- 211

Query: 224 DVNNDPEGGY-YARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWS 282
                  G + Y R      W+N  +             Y + M ++ FCLCP G+   S
Sbjct: 212 -------GAHGYIRKILLHHWKNKDDEVQVHEYLSKDEDYRKLMGQSKFCLCPSGYEVAS 264

Query: 283 PRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRK 342
           PR+VE++  GCIPVII+D   LPF+D + W +I + +  E +P++ TIL  +  +  LR 
Sbjct: 265 PRIVESIYAGCIPVIISDHYNLPFSDVLDWSQISVQIPVEKIPEIKTILKGVSNDKYLRM 324

Query: 343 QRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           Q+ +    ++R     +P++P D  H +L+ +
Sbjct: 325 QKRVRR--VQRHFEINRPSKPFDVLHMVLHSV 354


>gi|356565157|ref|XP_003550811.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
           [Glycine max]
          Length = 411

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 83/284 (29%), Positives = 128/284 (45%), Gaps = 25/284 (8%)

Query: 92  WFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGAC 151
           + Y PI    D  P  L       R++   I +I++ +PYWNR+EGADHF +  HD+G  
Sbjct: 137 YVYKPIRKHSDYEPIRLQ------RLVEDYIGVIANKYPYWNRSEGADHFLLSCHDWGPK 190

Query: 152 FHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTP 211
             Y   K   +  + +L  A   + F     V + E  +      PP   Q    P D  
Sbjct: 191 VSYGNPKLF-KNFIRVLCNANTSEGFLPNKDVSIPEVYLPKGKLGPPNLGQR---PND-- 244

Query: 212 RSIFVYFRGLFY-DVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAI 270
           RSI  +F G  + D+          R      W+   N+            Y + M ++ 
Sbjct: 245 RSILAFFAGREHGDI----------RKILLNHWKGKDNDIQVHEYLPKGKNYTQLMGQSK 294

Query: 271 FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTI 330
           FCLCP G+   SPR+VEA+  GC+PV+I+     PF D + W +  + +  E + ++ TI
Sbjct: 295 FCLCPSGYEVASPRVVEAIHAGCVPVLISSSYSPPFTDVLNWSQFSVEIPVEKISEIKTI 354

Query: 331 LTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           L SI     LR    +    ++R  +  +PA+P D  H IL+ +
Sbjct: 355 LQSISRNRYLRLHMNVLR--VRRHFMLNRPAKPFDLMHMILHSI 396


>gi|224084457|ref|XP_002307304.1| predicted protein [Populus trichocarpa]
 gi|222856753|gb|EEE94300.1| predicted protein [Populus trichocarpa]
          Length = 406

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 94/333 (28%), Positives = 158/333 (47%), Gaps = 51/333 (15%)

Query: 58  KDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRM 117
           + PR LT  +A+E +  + +  S  +T +P++A  F+ PI  +C     G  + +++  +
Sbjct: 100 QTPRKLTGKYASEGYFFQNIRESRFQTQDPDQAHLFFIPI--SCH-KMRGKGISYENMTI 156

Query: 118 M-RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQT 176
           +  + ++ + S +PYWNRT GADHFFV  HD G        +A E G+  L++ A  V  
Sbjct: 157 IVDNYVESLKSKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRV-- 206

Query: 177 FGQRNHVCLNEGSITIPPY---APPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGY 233
                 VC     +   P+   A PQ +Q   +P                DV      G+
Sbjct: 207 ------VCSPSYDVGFIPHKDIALPQVLQPFALPAGGN------------DVEKRTTLGF 248

Query: 234 YA--RGAR-----AAVWEN-----FKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPW 281
           +A  R +R     A VWEN       NN + + +T H   Y +    + +C+CP G    
Sbjct: 249 WAGHRNSRIRVILARVWENDTELDISNNRI-NRATGH-LVYQKRFYGSKYCICPGGSQVN 306

Query: 282 SPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILR 341
           S R+ +++ +GCIPVI+++   LPF D + W +  + + E+DV +L  IL  IP   ++ 
Sbjct: 307 SARIADSIHYGCIPVILSNYYDLPFNDILDWHKFSVILKEQDVYRLKQILKDIPDNKLVS 366

Query: 342 KQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
             + L    +++   +  P    DAFH ++  L
Sbjct: 367 LHKNLV--KVQKHFQWNSPPVKYDAFHMVMYDL 397


>gi|356575096|ref|XP_003555678.1| PREDICTED: probable glycosyltransferase At3g07620-like [Glycine
           max]
          Length = 493

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 84/309 (27%), Positives = 140/309 (45%), Gaps = 24/309 (7%)

Query: 71  IFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSA---IQLISS 127
           IF++   ++S  RT NP+EA  ++ P      L     P+      + R+    + +IS 
Sbjct: 194 IFINSLEINSQFRTQNPDEAHVYFLPFSVVMILEHLFHPVIRDKAVLERTIGDYVHIISH 253

Query: 128 NWPYWNRTEGADHFFVVPHDFG--ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCL 185
            + YWNR+ GADHF +  HD+G  A ++ +E   I    + +L  A + + F  +     
Sbjct: 254 KYKYWNRSYGADHFMLSCHDWGPRATWYVKELYFIA---IRVLCNANISEHFNPKKDASF 310

Query: 186 NEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWEN 245
            E ++         +      PP   R+I  +F G  +            R      WE 
Sbjct: 311 PEINLV----NGETRGLIGGYPP-CNRTILAFFAGQMHG---------RIRPVLFQHWEG 356

Query: 246 FKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
              + L          Y+E M+++ +C+CP G+   SPR+VEA+   C+PVII+   VLP
Sbjct: 357 KDKDVLVYEKLPDGVPYHETMKKSKYCICPSGFEVASPRIVEAIYAQCVPVIISQQYVLP 416

Query: 306 FADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
           F+D + W+   + +   DVPKL  IL  I  +  +R Q       ++R  +   P +  D
Sbjct: 417 FSDVLNWDSFSVQILVSDVPKLKEILLGISEDKYMRLQE--GVKQVQRHFVVNNPPKRYD 474

Query: 366 AFHQILNGL 374
            FH I++ +
Sbjct: 475 VFHMIIHSI 483


>gi|297811281|ref|XP_002873524.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297319361|gb|EFH49783.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 546

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 103/350 (29%), Positives = 162/350 (46%), Gaps = 47/350 (13%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEI-FM------HRFLLSSPVRTLNPEEADW 92
           LKVY+Y   S+ ++ +  +    +  ++A+E  FM      HRFL      T +P  A  
Sbjct: 220 LKVYIY---SEGDRPIFHQPEAIMEGIYASEGWFMKLMESNHRFL------TKDPNIAHL 270

Query: 93  FYTPIYPTCDLTPTGLPLPFKSPR----MMRSAIQLISSNWPYWNRTEGADHFFVVPHDF 148
           FY P + T  L          S R     +++ + LI+SN+P+WNRT G+DHFF   HD+
Sbjct: 271 FYLP-FSTRILQQKLYVHDSHSRRNLVKYLKNYLDLIASNYPFWNRTRGSDHFFTACHDW 329

Query: 149 GACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPP 208
                  E +      +  L  A +   F     V L E  I+     P   +  ++   
Sbjct: 330 APA----ETRGPYINCIRSLCNADVGVDFVVGKDVSLPETKIS-SSQNPNGNIGGNR--- 381

Query: 209 DTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDIST----DHPTTYYE 264
            + R+I  +F G  +        GY     R  +   + + P  D+      DH  +Y  
Sbjct: 382 PSKRTILAFFAGNLH--------GY----VRPILLNQWSSRPEPDMKIFNRIDH-KSYIR 428

Query: 265 DMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDV 324
            M+R+ FC+C  G+   SPR+VE+V++GC+PVII+D+ V PF + + WE   +FV E+++
Sbjct: 429 YMKRSRFCVCAKGYEVNSPRVVESVLYGCVPVIISDNFVPPFLEILNWESFAVFVPEKEI 488

Query: 325 PKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           P L  IL SIP    +  Q+ +     K  M         D FH IL+ +
Sbjct: 489 PNLRKILISIPVRRYVEMQKRVMK-VQKHFMWHDGEPVRYDVFHMILHSV 537


>gi|242053777|ref|XP_002456034.1| hypothetical protein SORBIDRAFT_03g029210 [Sorghum bicolor]
 gi|241928009|gb|EES01154.1| hypothetical protein SORBIDRAFT_03g029210 [Sorghum bicolor]
          Length = 517

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 92/340 (27%), Positives = 153/340 (45%), Gaps = 21/340 (6%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTL-NPEEADWFYTPIY 98
           LKVY+Y+      ++ +   P  L+ ++A+E +  + L  S    + +  +A  FY P Y
Sbjct: 182 LKVYIYQ----DGRRPIFHTP-PLSGIYASEGWFMKLLKESRRHVVADAGKAHLFYLP-Y 235

Query: 99  PTCDLTPTGLPLPFKSPR----MMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHY 154
            +  L  T       + R     +R+ ++ +++ +P+WNRT GADHF V  HD+G     
Sbjct: 236 SSQQLRLTLYQADSHNLRPLAAYLRNFVRGLANKYPFWNRTRGADHFLVACHDWGPYTTA 295

Query: 155 QEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSI 214
                 +  I  L    +    F     V L E +I      P + ++     P + RSI
Sbjct: 296 AHRDLRKNAIKALCNADSSEGIFTPGKDVSLPETTIR----NPRRPLRYVGGLPVSRRSI 351

Query: 215 FVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLC 274
             +F G  +         ++  G      +  +   L          Y + M+ + FCLC
Sbjct: 352 LAFFAGNVHGRVRPVLLRHWGDGQD----DEMRVYSLLPNRVSRRMNYIQHMKNSRFCLC 407

Query: 275 PLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSI 334
           P+G+   SPR+VEA  + C+PVIIAD+ VLP ++ + W    + VAE+D+P L  IL  I
Sbjct: 408 PMGYEVNSPRIVEAFYYECVPVIIADNFVLPLSEVLDWSAFSVVVAEKDIPDLKKILQGI 467

Query: 335 PPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
            P   +     +    ++R  L+       D FH IL+ +
Sbjct: 468 SPRRYVAMHSCVKR--LQRHFLWHARPIKYDLFHMILHSI 505


>gi|449501885|ref|XP_004161484.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
           LOC101226446 [Cucumis sativus]
          Length = 859

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 93/350 (26%), Positives = 151/350 (43%), Gaps = 63/350 (18%)

Query: 42  VYVYELPSKYNKKLLLKDP---RCLTHMF-------------AAEIFMHRFLLSSPVRTL 85
           +Y+Y+LP  +N +LL        C+  M+              AE+  +  +L+SP RTL
Sbjct: 350 IYIYDLPPGFNSQLLQGRHWKFECVNRMYNERNATMWTDDLYGAEMAFYESILASPHRTL 409

Query: 86  NPEEADWFYTPIYPTCDLTPTGLP--LPFKSPRMMRSAIQL---------ISSNWPYWNR 134
           N EEAD+F+ P+  +C +T       L  +    +RS + L         I   +PYWNR
Sbjct: 410 NGEEADFFFVPVLDSCIITRADDAPHLSLRDYMGLRSFLTLDFYKKAHDHIVEQYPYWNR 469

Query: 135 TEGADHFFVVPHDFGACFH----YQEEKAIERGILPLLQRATLVQTFGQR---------- 180
           + G DH +    D GAC+     +     +  G        +    +G            
Sbjct: 470 SSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYWGDNWDNIPSSKRG 529

Query: 181 NHVCLN-EGSITIPPYAPPQKMQAHQ---IPPDTPRSIFVYFRGLF---YDVNNDPEGGY 233
           NH C + E  + +P +  P   +  +     P   R  F +F G     Y+    PE   
Sbjct: 530 NHPCFDPEKDLVVPAWKRPDGSRLSKKLWARPREERKTFFFFNGNLGPAYE-RGRPE-ST 587

Query: 234 YARGARAAVWENFKNNPLFD--ISTDHPT----------TYYEDMQRAIFCLCPLGWAPW 281
           Y+ G R  V E F ++P  +  +   H             Y+ED+  ++FC    G   W
Sbjct: 588 YSMGIRQKVAEEFGSSPNKEGKLGKQHAADVIVTPLRSENYHEDLASSVFCGVMPG-DGW 646

Query: 282 SPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTIL 331
           S R+ ++++ GCIPVII D I LP+ + + ++   + + E+D+P L  IL
Sbjct: 647 SGRMEDSILQGCIPVIIQDGIFLPYENVLNYDSFAVRIGEDDIPNLINIL 696


>gi|90265152|emb|CAH67778.1| H0201G08.5 [Oryza sativa Indica Group]
 gi|125546968|gb|EAY92790.1| hypothetical protein OsI_14593 [Oryza sativa Indica Group]
          Length = 441

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 98/345 (28%), Positives = 160/345 (46%), Gaps = 26/345 (7%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP------VRTLNPEEADW 92
           + KVYVYE      +  +L +  C         F+ +  L SP      VRT +P  A  
Sbjct: 103 RFKVYVYE----EGEPPILHEGPCKNIYTIEGSFIEQLELMSPSDAGGGVRTWDPTRAHA 158

Query: 93  FYTPIYPTCDLTPTGLPLPFKSP---RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFG 149
           F+ P   +  +     P     P    ++   ++++++  P+WNR+ GADHF +  HD+G
Sbjct: 159 FFLPFSVSQMVKFVYRPPSQDRPPLRAIVADYVRVVAARHPFWNRSAGADHFMLSCHDWG 218

Query: 150 ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPD 209
             +  + +  +    +  L  A   + F     V + E  I +     P+++ A   P  
Sbjct: 219 P-YASRGQPELYTNAIRALCNANTSEGFRPGKDVSVPE--INLYDGDMPRELLA-PAPGL 274

Query: 210 TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRA 269
             R +  +F G  +    D    ++ +G  AA +      P+++        YY  M+RA
Sbjct: 275 ESRPLLAFFAGGRHGHVRDLLLRHW-KGRDAATF------PVYEYDLPAAGDYYSFMRRA 327

Query: 270 IFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDT 329
            FCLCP G    SPR+VEA+   C+PV+IAD   LPFAD + WE   + VA  D+P+L  
Sbjct: 328 RFCLCPSGHEVASPRVVEAIQAECVPVVIADGYALPFADVLRWEAFSVAVAVGDIPRLRE 387

Query: 330 ILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
            L  IP   + R +R +    +KR ++  QP +  D F+ IL+ +
Sbjct: 388 RLERIPAAEVERLRRGVR--LVKRHLMLQQPPRRLDMFNMILHSV 430


>gi|255548864|ref|XP_002515488.1| catalytic, putative [Ricinus communis]
 gi|223545432|gb|EEF46937.1| catalytic, putative [Ricinus communis]
          Length = 474

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 85/298 (28%), Positives = 138/298 (46%), Gaps = 33/298 (11%)

Query: 87  PEEADWFYTPIYPTCDLTPTGLPLP----FKSPRMMRSAIQLISSNWPYWNRTEGADHFF 142
           P+EA  F+ P+     +     P       +   +++  + +ISS +P+WNR++GADHF 
Sbjct: 185 PDEAHAFFLPVSIVSIVKYVYRPYSDYSRIRLQNVVKDYVGVISSKYPFWNRSDGADHFL 244

Query: 143 VVPHDFG---ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQ 199
              HD+    +  H +  K   R    +L  A   + F     V L E  +     +P  
Sbjct: 245 TSCHDWAPDVSAGHPELYKYFTR----VLCNANTSEGFVPERDVSLPEIRLRDRKLSP-- 298

Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPL-FDISTDH 258
             + H +PP   R I  +F G         E G+     R  ++E++K       +    
Sbjct: 299 --EPHSLPPKD-RRILAFFAG--------GEHGH----VRTKLFEHWKGKDRDVQVYEYL 343

Query: 259 PTT--YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG 316
           P T  Y E M  + FCLCP GW   SPR+ EA+  GC+PVII+D   LPF+D + W +  
Sbjct: 344 PKTLNYTELMSHSKFCLCPSGWEVASPRVPEAIYSGCVPVIISDYYYLPFSDVLDWSKFS 403

Query: 317 MFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           + +    +P++ T+L  IP    L  Q+ +    ++R     +PA+P D  H +L+ +
Sbjct: 404 VHIPVARIPEIKTVLQKIPMRKYLTMQKRVIQ--VQRHFKLNRPAKPYDVLHMVLHSI 459


>gi|255563758|ref|XP_002522880.1| catalytic, putative [Ricinus communis]
 gi|223537865|gb|EEF39480.1| catalytic, putative [Ricinus communis]
          Length = 406

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 97/351 (27%), Positives = 158/351 (45%), Gaps = 52/351 (14%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
             KVY+Y  P   +     + PR LT  +A+E +  + +  S  RT +P++A  F+ PI 
Sbjct: 84  NFKVYIY--PDG-DPNTFYQTPRKLTGKYASEGYFFQNIRESKFRTNDPDQAHLFFIPI- 139

Query: 99  PTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
            +C               ++++ ++ ++  +PYWNRT GADHFFV  HD G        +
Sbjct: 140 -SCHKMRGKGTSYENMTIIVQNYVESLAVKYPYWNRTLGADHFFVTCHDVGV-------R 191

Query: 159 AIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY---APPQKMQAHQIPPDTPRSIF 215
           A E G+  L++ A  V        VC     +   P+   A PQ +Q   +P        
Sbjct: 192 ATE-GVPFLVKNAIRV--------VCSPSYDVGFIPHKDVALPQVLQPFALPAGGN---- 238

Query: 216 VYFRGLFYDVNNDPEGGYYA-------RGARAAVWEN-----FKNNPLFDISTDHPTTYY 263
                   D+ N    G++A       R   A VWEN       NN + + +T H   Y 
Sbjct: 239 --------DLENRTTLGFWAGHRNSKIRVILARVWENDTELDISNNRI-NRATGH-LVYQ 288

Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
           +   R  FC+CP G    S R+ +++ +GC+PVI++D   LPF D + W+   + + E+D
Sbjct: 289 KRFYRTKFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDILDWKRFSVILKEKD 348

Query: 324 VPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           V +L  +L  I  +  +     L    +++   +  P    DAFH ++  L
Sbjct: 349 VYRLKQVLKDISDDEFVALHENLV--EVQKHFQWNSPPIKYDAFHMVMFDL 397


>gi|449437619|ref|XP_004136589.1| PREDICTED: uncharacterized protein LOC101206674 [Cucumis sativus]
          Length = 791

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 93/350 (26%), Positives = 151/350 (43%), Gaps = 63/350 (18%)

Query: 42  VYVYELPSKYNKKLLLKDP---RCLTHMF-------------AAEIFMHRFLLSSPVRTL 85
           +Y+Y+LP  +N +LL        C+  M+              AE+  +  +L+SP RTL
Sbjct: 350 IYIYDLPPGFNSQLLQGRHWKFECVNRMYNERNATMWTDDLYGAEMAFYESILASPHRTL 409

Query: 86  NPEEADWFYTPIYPTCDLTPTGLP--LPFKSPRMMRSAIQL---------ISSNWPYWNR 134
           N EEAD+F+ P+  +C +T       L  +    +RS + L         I   +PYWNR
Sbjct: 410 NGEEADFFFVPVLDSCIITRADDAPHLSLRDYMGLRSFLTLDFYKKAHDHIVEQYPYWNR 469

Query: 135 TEGADHFFVVPHDFGACFH----YQEEKAIERGILPLLQRATLVQTFGQR---------- 180
           + G DH +    D GAC+     +     +  G        +    +G            
Sbjct: 470 SSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYWGDNWDNIPSSKRG 529

Query: 181 NHVCLN-EGSITIPPYAPPQKMQAHQ---IPPDTPRSIFVYFRGLF---YDVNNDPEGGY 233
           NH C + E  + +P +  P   +  +     P   R  F +F G     Y+    PE   
Sbjct: 530 NHPCFDPEKDLVVPAWKRPDGSRLSKKLWARPREERKTFFFFNGNLGPAYE-RGRPE-ST 587

Query: 234 YARGARAAVWENFKNNPLFD--ISTDHPT----------TYYEDMQRAIFCLCPLGWAPW 281
           Y+ G R  V E F ++P  +  +   H             Y+ED+  ++FC    G   W
Sbjct: 588 YSMGIRQKVAEEFGSSPNKEGKLGKQHAADVIVTPLRSENYHEDLASSVFCGVMPG-DGW 646

Query: 282 SPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTIL 331
           S R+ ++++ GCIPVII D I LP+ + + ++   + + E+D+P L  IL
Sbjct: 647 SGRMEDSILQGCIPVIIQDGIFLPYENVLNYDSFAVRIGEDDIPNLINIL 696


>gi|413956860|gb|AFW89509.1| hypothetical protein ZEAMMB73_873038 [Zea mays]
          Length = 783

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 91/357 (25%), Positives = 156/357 (43%), Gaps = 71/357 (19%)

Query: 42  VYVYELPSKYNKKLLLKDP---RCLT-------------HMFAAEIFMHRFLLSSPVRTL 85
           +YVY+LP++++  LL       +C+               ++ A+I +H  +L+SP RTL
Sbjct: 342 IYVYDLPAEFDSHLLEGRHYRFQCVNRIYDDMNRTIWTQQLYGAQIALHESILASPHRTL 401

Query: 86  NPEEADWFYTPIYPTCDLTPTG-----------LPLPFKSPRMMRSAIQLISSNWPYWNR 134
           N +EAD+FY P+  +C +T +                + +    R A   I+  +PYWNR
Sbjct: 402 NGDEADYFYVPVLDSCLITRSDDAPHLLLPRDLRRRSYHALEYYRMAHGHIAQQYPYWNR 461

Query: 135 TEGADHFFVVPHDFGACFH----YQEEKAIERGILPLLQRATLVQTFGQR---------- 180
           T G DH +    D GAC+     ++    +  G      + +    +             
Sbjct: 462 TSGRDHIWFFSWDEGACYAPKEIWKSMMLVHWGNTNTKHKNSTTAYWADNWDDIPLDKRG 521

Query: 181 NHVCLN-EGSITIPPYAPPQ------KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGY 233
           NH C +    + +P +  P       K+ A    P   R+   YF G   ++ +  EGG 
Sbjct: 522 NHPCFDPRKDLVLPAWKEPNPGAIWLKLWAR---PRNNRTTLFYFNG---NLGSAYEGGR 575

Query: 234 ----YARGARAAVWENFKNNP----------LFDISTDHPTT--YYEDMQRAIFCLCPLG 277
               Y+ G R  +   F + P            D++  +  T  YYE++  ++FC    G
Sbjct: 576 PEDTYSMGIRQKLAAEFGSTPNKQGRLGRQHAADVTVTYLRTEKYYEELASSVFCGVLPG 635

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSI 334
              WS R+ ++++ GCIPVII D I LP+ + + +    + + E+D+P L + L  I
Sbjct: 636 -DGWSGRMEDSMLQGCIPVIIQDGIFLPYENVLNYNSFAVRIQEDDIPGLISTLRGI 691


>gi|356546126|ref|XP_003541482.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
           [Glycine max]
          Length = 490

 Score =  115 bits (287), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 84/302 (27%), Positives = 133/302 (44%), Gaps = 24/302 (7%)

Query: 80  SPVRTLNPEEADWFYTPIYPTCDLTPTGLPL-------PFKSPRMMRSAIQLISSNWPYW 132
           S  R  +PEEA  F+ PI     +     P+       P +   ++   I +I   +PYW
Sbjct: 194 SQFRARHPEEAHVFFLPISIANVVHYVYKPILKQSDYEPVRLQHLVEDYIGVIQDKYPYW 253

Query: 133 NRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITI 192
           NR+ GADHF +  HD+G    Y   +  +  I  L   A   + F     V + E  + +
Sbjct: 254 NRSIGADHFLLSCHDWGPKVSYGNPELFQTFIRALCN-ANTSEGFHPNRDVSIPEVYLPV 312

Query: 193 PPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLF 252
               P    Q     P++ R+   +F G  +            R      W++  N  L 
Sbjct: 313 GKLGPASLGQH----PNS-RTTLAFFAGGVHG---------EIRKILLKHWKDKDNEVLV 358

Query: 253 DISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 312
                    Y + M ++ FCLCP G    SPR+VEA+  GC+PVII D+  LPF+D + W
Sbjct: 359 HEYLPKGQDYTKLMGQSKFCLCPSGHEVASPRVVEAIHAGCVPVIICDNYSLPFSDVLNW 418

Query: 313 EEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILN 372
            +  + +  E +P++ +IL SI     LR    +    ++R  +  +P +P D  H IL+
Sbjct: 419 SQFSVEIPVEKIPEIKSILQSISRNKYLRLH--MNVLRVRRHFMINRPTKPFDMMHMILH 476

Query: 373 GL 374
            +
Sbjct: 477 SI 478


>gi|356518344|ref|XP_003527839.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
           max]
          Length = 637

 Score =  115 bits (287), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 98/392 (25%), Positives = 169/392 (43%), Gaps = 44/392 (11%)

Query: 2   WVTAI--ALLLAFVSADNTPKIERISGSAGDVLEDDPVGK---------LKVYVYELPSK 50
           W +A+   LL A +  +N P +  +      +  +    K         LKVYVY    +
Sbjct: 260 WSSAVDQELLQARLEIENAPIVNNVENLYAPLFRNISRFKRSYELMEKTLKVYVY----R 315

Query: 51  YNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPL 110
              K ++  P  L    +   FM     S    T +P++A  FY P           +P 
Sbjct: 316 EGDKPIMHSPYLLGIYASEGWFMRLMEASKQFVTKDPKKAHLFYLPFSSRMLEETLYVPN 375

Query: 111 PFKSPRM---MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPL 167
              S  +   +++ + +I+    +WNRT GADHF V  HD+       E +      L  
Sbjct: 376 SHSSRNLIQYLKNYVDMIAGKHRFWNRTGGADHFLVACHDWAPT----ETRQHMARCLRA 431

Query: 168 LQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNN 227
           L  A + + F     + L E  +      P + +  +++   + R    +F G  +    
Sbjct: 432 LCNADVKEGFVLGKDISLPETYVR-NAQKPTRNIGGNRV---SKRKTLAFFAGGMHG--- 484

Query: 228 DPEGGYYARGARAAVWENFKNNPLFDI-----STDHPTTYYEDMQRAIFCLCPLGWAPWS 282
                 Y R      WEN   +P   I      +     Y + M+ + +C+C  G+   S
Sbjct: 485 ------YVRPILLQHWEN--KDPAMKIFGILPKSKGNRNYIQYMKSSKYCICAKGYEVNS 536

Query: 283 PRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRK 342
           PR+VEA+++ C+PVI++D+ V PF + + WE   +FV E+D+P L  IL SIP +  L+ 
Sbjct: 537 PRVVEAILYECVPVILSDNFVPPFFEMLNWESFAVFVLEKDIPNLKNILLSIPQKRYLQM 596

Query: 343 QRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           Q ++    +++  L+ +     D FH +L+ +
Sbjct: 597 QMMVR--KVQQHFLWHRSPVKYDIFHMVLHSI 626


>gi|15239502|ref|NP_197954.1| Exostosin family protein [Arabidopsis thaliana]
 gi|110738111|dbj|BAF00988.1| hypothetical protein [Arabidopsis thaliana]
 gi|332006108|gb|AED93491.1| Exostosin family protein [Arabidopsis thaliana]
          Length = 654

 Score =  115 bits (287), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 103/348 (29%), Positives = 160/348 (45%), Gaps = 40/348 (11%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVR--TLNPEEADWFYTPI 97
           LKVY Y    K   K ++  P  L  ++A+E +    + S+  +  T +P +A  FY P 
Sbjct: 324 LKVYAY----KEGNKPIMHSP-ILRGIYASEGWFMNIIESNNNKFVTKDPAKAHLFYLP- 377

Query: 98  YPTCDLTPTGLPLPFKSPR----MMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFH 153
           + +  L  T       S R     ++  I  IS+ +P+WNRT GADHF    HD+     
Sbjct: 378 FSSRMLEVTLYVQDSHSHRNLIKYLKDYIDFISAKYPFWNRTSGADHFLAACHDWAPS-- 435

Query: 154 YQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIP--PYAPPQKMQAHQIPPDTP 211
            +  K + + I  L   + + + F       L E  +  P  P +      A+Q      
Sbjct: 436 -ETRKHMAKSIRALCN-SDVKEGFVFGKDTSLPETFVRDPKKPLSNMGGKSANQ------ 487

Query: 212 RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT-----YYEDM 266
           R I  +F G        P+ GY  R    + W N K+ P   I    P T     Y + M
Sbjct: 488 RPILAFFAG-------KPDHGYL-RPILLSYWGNNKD-PDLKIFGKLPRTKGNKNYLQFM 538

Query: 267 QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK 326
           + + +C+C  G+   SPR+VEA+ + C+PVII+D+ V PF + + WE   +F+ E+D+P 
Sbjct: 539 KTSKYCICAKGFEVNSPRVVEAIFYDCVPVIISDNFVPPFFEVLNWESFAIFIPEKDIPN 598

Query: 327 LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           L  IL SI PE   R  ++      K  +   +P +  D FH IL+ +
Sbjct: 599 LKKILMSI-PESRYRSMQMRVKKVQKHFLWHAKPEK-YDMFHMILHSI 644


>gi|357462311|ref|XP_003601437.1| hypothetical protein MTR_3g080690 [Medicago truncatula]
 gi|355490485|gb|AES71688.1| hypothetical protein MTR_3g080690 [Medicago truncatula]
          Length = 450

 Score =  115 bits (287), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 81/309 (26%), Positives = 139/309 (44%), Gaps = 24/309 (7%)

Query: 71  IFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSA---IQLISS 127
           IF++    ++  RT NP EA  ++ P      L     P+      + R+    + +IS 
Sbjct: 151 IFINLLENNTLFRTQNPNEAHVYFLPFSVVMILEHLFHPVIRDKAVLGRTIGDYVHIISH 210

Query: 128 NWPYWNRTEGADHFFVVPHDFG--ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCL 185
            + YWNR+ GADHF +  HD+G  A ++ +E   I    + +L  A + + F  +     
Sbjct: 211 KYAYWNRSYGADHFMLSCHDWGPRATWYVKELYFIA---IRVLCNANISEHFNPKKDASF 267

Query: 186 NEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWEN 245
            E  I +        +  +   P   R+I  +F G         +     R      W+N
Sbjct: 268 PE--INLVSGETTGLLGGY---PTWNRTILAFFAG---------QMNGRIRPVLFQHWKN 313

Query: 246 FKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
              + L         +Y+E M+ + +C+CP GW   SPR+VEA+   C+P++I+   VLP
Sbjct: 314 KDKDVLVYEKLPEKISYHETMKMSKYCICPSGWEVASPRIVEAIYAECVPILISQQYVLP 373

Query: 306 FADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
           F+D + W+   + +   ++PKL  IL  I  E  +R Q       ++R  +   P +  D
Sbjct: 374 FSDVLNWDSFSVQIEVSEIPKLKEILLGISEEKYMRLQE--GVKQVQRHFVVNNPPKKYD 431

Query: 366 AFHQILNGL 374
            FH I++ +
Sbjct: 432 VFHMIIHSI 440


>gi|334186614|ref|NP_001190743.1| Exostosin family protein [Arabidopsis thaliana]
 gi|332658396|gb|AEE83796.1| Exostosin family protein [Arabidopsis thaliana]
          Length = 589

 Score =  114 bits (286), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 93/314 (29%), Positives = 147/314 (46%), Gaps = 29/314 (9%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP-VRTLNPEEADWFYTPIY 98
           LKVY+Y  P     K +  +P  L  ++A+E +  + + S+    T NPE A  FY P Y
Sbjct: 200 LKVYIY--PD--GDKPIFHEPH-LNGIYASEGWFMKLMESNKQFVTKNPERAHLFYMP-Y 253

Query: 99  PTCDLTPTGL--------PLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGA 150
               L  +          PL       +R  + ++S  +P+WNRT G+DHF V  HD+G 
Sbjct: 254 SVKQLQKSIFVPGSHNIKPLSI----FLRDYVNMLSIKYPFWNRTHGSDHFLVACHDWGP 309

Query: 151 CFHYQEEKAIERGILPLLQRATLVQ-TFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPD 209
            +   E   ++R  +  L  A L    F     V L E SI             +++   
Sbjct: 310 -YTVNEHPELKRNAIKALCNADLSDGIFVPGKDVSLPETSIRNAGRPLRNIGNGNRV--- 365

Query: 210 TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRA 269
           + R I  +F G  +     P+   + R       E+ K       +     TY + M+ +
Sbjct: 366 SQRPILAFFAGNLHG-RVRPKLLKHWRNKD----EDMKIYGPLPHNVARKMTYVQHMKSS 420

Query: 270 IFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDT 329
            +CLCP+G+   SPR+VEA+ + C+PV+IAD+ +LPF+D + W    + V E+++P+L  
Sbjct: 421 KYCLCPMGYEVNSPRIVEAIYYECVPVVIADNFMLPFSDVLDWSAFSVVVPEKEIPRLKE 480

Query: 330 ILTSIPPEVILRKQ 343
           IL  IP    L+ Q
Sbjct: 481 ILLEIPMRRYLKMQ 494


>gi|357478275|ref|XP_003609423.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
 gi|355510478|gb|AES91620.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
          Length = 398

 Score =  114 bits (286), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 89/320 (27%), Positives = 142/320 (44%), Gaps = 37/320 (11%)

Query: 58  KDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRM 117
           K P   TH   A +F   F +S  +R        + Y P     D  P  L L      +
Sbjct: 96  KSPFKATHPELAHVFFLPFSVSKVIR--------YVYKPRKSRSDYNPHRLQL------L 141

Query: 118 MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTF 177
           +   I+++++ +PYWN ++GADHF +  HD+G    Y   K  +  I  L   A   + F
Sbjct: 142 VEDYIKIVANKYPYWNISQGADHFLLSCHDWGPRVSYANPKLFKHFIRALCN-ANTSEGF 200

Query: 178 GQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGY-YAR 236
                V + + ++ +    PP   Q     P+  R+I  +F G          G +   R
Sbjct: 201 WPNRDVSIPQLNLPVGKLGPPNTDQH----PNN-RTILTFFAG----------GAHGKIR 245

Query: 237 GARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPV 296
                 W++                Y + M  + FCLCP G    SPR+VEA+  GC+PV
Sbjct: 246 KKLLKSWKDKDEEVQVHEYLPKGQDYTKLMGLSKFCLCPSGHEVASPRVVEAIYAGCVPV 305

Query: 297 IIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANP--SMKRA 354
           II D+  LPF+D + W +  M +A + +P++ TIL +I       K R+L +    +++ 
Sbjct: 306 IICDNYSLPFSDVLNWSQFSMEIAVDRIPEIKTILQNITET----KYRVLYSNVRRVRKH 361

Query: 355 MLFPQPAQPGDAFHQILNGL 374
               +PA+P D  H IL+ +
Sbjct: 362 FEMNRPAKPFDLIHMILHSV 381


>gi|168033546|ref|XP_001769276.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162679541|gb|EDQ65988.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 408

 Score =  114 bits (286), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 82/270 (30%), Positives = 128/270 (47%), Gaps = 39/270 (14%)

Query: 122 IQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRN 181
           +  ++S +PYWNRT GADHFFV  HD+ A         +    + ++  A L   F    
Sbjct: 147 VDTLASKYPYWNRTHGADHFFVSCHDW-APLSTMLHGELHTNSMKVVCNADLTVNFDIEK 205

Query: 182 HVCL-------NEGSITIPPYAPPQK----MQAHQIPPDTPRSIFVYFRGLFYDVNNDPE 230
            V +       N+  + +    P ++      A Q+       +  Y++G       DP 
Sbjct: 206 DVSIPQTLKGGNQSDLDVGSLGPEERDFLAFYAGQMHGTVRPVLLDYWKG------KDPT 259

Query: 231 GGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVV 290
                      V+E   ++   +IS      Y + M+R+ +CLCP G+   SPR+VEA++
Sbjct: 260 ---------MKVYEVLPSDIAVNIS------YAQHMKRSRYCLCPKGFEVNSPRIVEAIL 304

Query: 291 FGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPS 350
            GC+PVIIAD+ VLP+ D + W +  + V EED+P L  IL+SI        QR L    
Sbjct: 305 SGCVPVIIADNFVLPYNDVLDWTKFSVTVPEEDIPDLKKILSSISNVTYRSMQRRLR--Y 362

Query: 351 MKRAMLF---PQPAQPGDAFHQILNGLARK 377
           ++R  L+   P+  Q  D+FH  L  + R+
Sbjct: 363 IRRHFLWLEDPEDTQ-YDSFHMTLYSIWRQ 391


>gi|159478058|ref|XP_001697121.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158274595|gb|EDP00376.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 802

 Score =  114 bits (286), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 101/362 (27%), Positives = 154/362 (42%), Gaps = 73/362 (20%)

Query: 42  VYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTC 101
           +Y+Y+L     K L L+       + A E ++H  LL S  RT +P+EAD+FY P+Y TC
Sbjct: 381 IYIYDLTDYTTKFLQLR-------LGAVEAYLHETLLQSEHRTFDPDEADFFYVPVYVTC 433

Query: 102 DLTPT-GLP------LPFKSPRMMRSAIQL------ISSNWPYWNRTEGADHFFVVPHDF 148
            + P  G         P    R M  +  L      ISS +P+WNR  G DH +++  D 
Sbjct: 434 YMWPILGWADGPWWYAPLAHTRTMHVSNMLSEVHAHISSTFPWWNRRGGRDHIWLMAADE 493

Query: 149 GACF--------------------HYQEEKAIERGILPLLQRATLVQTFG------QRNH 182
           GAC+                     +Q   A ++    +          G       R H
Sbjct: 494 GACYMPTAIYNTSIILTHWGRMDLEHQSNTAYQQDNYNMAMPGEFKAWPGMDWQSRMRGH 553

Query: 183 VCLN-EGSITIPPYAPPQKMQAHQIPPDTP--RSIFVYFRGLFYDVNND--PEGGYYARG 237
            C +    + IP +      +   +    P  R +  YFRG   D+     P+   Y+RG
Sbjct: 554 PCYDPRKDLVIPAFKSVDHFRDSPLLGGAPLVRDLLCYFRG---DIGQARFPQ---YSRG 607

Query: 238 ARAAVWENFKNNPLFDISTDHPTT----------YYEDMQRAIFCLCPLGWAPWSPRLVE 287
            R  ++  +  N   D +  H             Y E + R+ FCL   G   WSPR  +
Sbjct: 608 LRQKLFHLWHKN---DWAAKHKIYIGNGEMVRGPYSEHLLRSRFCLVLPG-DGWSPRAED 663

Query: 288 AVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED--VPKLDTILTSIPPEVILRKQRL 345
           AV+ GCIPV+I D++   F   + WE   + + E+D  +  L  +L ++PPE + + QR 
Sbjct: 664 AVLHGCIPVVIMDNVHAVFESILDWESFSIRIREDDAALEALPQLLEAVPPERVAKMQRN 723

Query: 346 LA 347
           LA
Sbjct: 724 LA 725


>gi|224120358|ref|XP_002331028.1| predicted protein [Populus trichocarpa]
 gi|222872958|gb|EEF10089.1| predicted protein [Populus trichocarpa]
          Length = 334

 Score =  114 bits (286), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 80/303 (26%), Positives = 142/303 (46%), Gaps = 27/303 (8%)

Query: 79  SSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPF----KSPRMMRSAIQLISSNWPYWNR 134
            SP    N +EA  F+ PI     +    LP+      +  R+ +  + ++++ +PYWNR
Sbjct: 41  KSPFLARNHDEAHAFFLPISVAYIVEFVYLPITTYHRERLVRIFKDYVTVVANKYPYWNR 100

Query: 135 TEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPP 194
           + G DHF V  HD+      +++  + + ++ ++  A   + F  R    L E +     
Sbjct: 101 SRGGDHFMVSCHDWAPQVS-RDDPELYKNLIRVMCNANTSEGFRPRRDATLPELN----- 154

Query: 195 YAPPQKMQ--AHQIPPDTPRSIFVYFRGLFY-DVNNDPEGGYYARGARAAVWENFKNNPL 251
             PP K+      + P   R IF +F G  + D+       +  +     V E    +  
Sbjct: 155 -CPPLKLTPACRGLAPHE-RKIFAFFAGGAHGDIRKILLRHWKEKDDEIQVHEYLPKD-- 210

Query: 252 FDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
                     Y E M ++ FCLCP G+   SPR+ E++  GC+PVII+D   LPF+D + 
Sbjct: 211 --------QDYMELMGQSKFCLCPSGFEVASPRVAESIYSGCVPVIISDHYNLPFSDVLD 262

Query: 312 WEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQIL 371
           W +  + +  E +P++ TIL  I  +  L+ Q+ +    ++R  +  +PA+P D  H +L
Sbjct: 263 WSQFSVQIPVEKIPEIKTILRGISYDEYLKMQKGVMK--VQRHFVLNRPAKPYDVLHMVL 320

Query: 372 NGL 374
           + +
Sbjct: 321 HSV 323


>gi|356565153|ref|XP_003550809.1| PREDICTED: probable glycosyltransferase At5g20260-like [Glycine
           max]
          Length = 398

 Score =  114 bits (285), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 78/304 (25%), Positives = 138/304 (45%), Gaps = 28/304 (9%)

Query: 80  SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKS-------PRMMRSAIQLISSNWPYW 132
           SP +  NP+EA  F+ P+     +     P   ++        R++   I +++  +PYW
Sbjct: 101 SPFKARNPDEAHAFFLPLSVVNVVHYVYKPYMSQNDYSRDRLQRLVEDYIGVVADKYPYW 160

Query: 133 NRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITI 192
           NR+ GADHF +  HD+     +       +  + +L  A   + F  +  V + E  + +
Sbjct: 161 NRSNGADHFLLSCHDWAPEISHANPDLF-KNFIRVLCNANNSEGFQPKRDVSIPEVYLPV 219

Query: 193 PPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYA--RGARAAVWENFKNNP 250
               PP   Q         R+I  +F            GG +   R      W++  N+ 
Sbjct: 220 GKLGPPNLGQH-----PMNRTILAFF-----------SGGAHGDIRKLLLKHWKDKDNHV 263

Query: 251 LFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAI 310
                      Y E M  + FCLCP G+   SPR+VEA+  GC+PVII+++  LPF+D +
Sbjct: 264 QVHEYLPKGQNYTELMGLSKFCLCPSGYEVASPRVVEAINAGCVPVIISENYSLPFSDVL 323

Query: 311 PWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQI 370
            W +  + ++ E++  + TIL ++  +   +  R +    ++R  +  +PA+P D  H I
Sbjct: 324 NWSQFSIQISVENISDIKTILQNVTQKKYKKLHRNVR--RVQRHFVMNRPAKPFDLMHMI 381

Query: 371 LNGL 374
           L+ +
Sbjct: 382 LHSI 385


>gi|356537024|ref|XP_003537031.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
           max]
          Length = 472

 Score =  114 bits (285), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 95/358 (26%), Positives = 154/358 (43%), Gaps = 62/358 (17%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
           + KV+VYE      +  +  +  C +       F+H   ++   RT +P++A  F+ P  
Sbjct: 142 QFKVFVYE----EGETPVFHNGPCKSIYSMEGNFIHAIEMNDHFRTKDPKKAHVFFLPF- 196

Query: 99  PTCDLTPTGLPLPFKSPRMMRS----------AIQLISSNWPYWNRTEGADHFFVVPHDF 148
                    + + F   R  R            I LI++ + YWNR+ GADHF +  HD+
Sbjct: 197 ------SVVMMVRFVYQRDSRDFGPIRKTVIDYINLIAARYSYWNRSLGADHFMLACHDW 250

Query: 149 GACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYA----PPQKMQAH 204
           G       E ++    LP L + ++         +C    S    P      P   +Q  
Sbjct: 251 GP------EASLS---LPYLHKNSI-------RVLCNANTSEGFKPAKDVSFPEINLQTG 294

Query: 205 QI------PPDTPRSIFVYFRGLFYDVNNDPEGGYYA--RGARAAVWENFKNNPLFDIST 256
            I      P  + RSI  +F            GG +   R      WEN   +       
Sbjct: 295 SINGFIGGPSASKRSILAFF-----------AGGVHGPIRPILLEHWENKDEDIQVHKYL 343

Query: 257 DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG 316
               +YY+ ++ + FCLCP G+   SPR+VEA+  GC+PV+I++  V PF+D + W+   
Sbjct: 344 PKGVSYYDKLRNSKFCLCPSGYEVASPRVVEAIYTGCVPVLISEHYVPPFSDVLNWKSFS 403

Query: 317 MFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           + ++ +D+P L  IL SI P   +R QR +    ++R      P +  D FH IL+ +
Sbjct: 404 VELSVKDIPNLKDILMSISPRQYIRMQRRVIQ--IQRHFEVHSPPKRFDVFHMILHSV 459


>gi|297812207|ref|XP_002873987.1| hypothetical protein ARALYDRAFT_326421 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319824|gb|EFH50246.1| hypothetical protein ARALYDRAFT_326421 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 654

 Score =  114 bits (285), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 84/300 (28%), Positives = 134/300 (44%), Gaps = 22/300 (7%)

Query: 80  SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSA----IQLISSNWPYWNRT 135
           SP     PEEA  F  P+     +     PL   S   +       + +++  +PYWNR+
Sbjct: 361 SPFAASGPEEAHAFLLPVSIANVVHYLYRPLVTYSREQLHKVFLDYVNVVAHKYPYWNRS 420

Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
            GADHFFV  HD+         + + + ++ +L  A   + F  +  V + E +I     
Sbjct: 421 LGADHFFVSCHDWAPDVSGSNPEMM-KNLIRVLCNANTSEGFMPQRDVSIPEINIPRGQL 479

Query: 196 APPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGY-YARGARAAVWENFKNNPLFDI 254
            PPQ  ++        R I  +F G          G + Y R      W++         
Sbjct: 480 GPPQLSRS----SGHDRPILAFFSG----------GSHGYIRKILLQHWKDKDEEVQVHE 525

Query: 255 STDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEE 314
              +   Y++ M  A FCLCP G+   SPR+V A+  GC+PVII+D   LPF+D + W +
Sbjct: 526 YLTNNKDYFKLMATARFCLCPSGYEVASPRVVAAINLGCVPVIISDHYALPFSDVLDWTK 585

Query: 315 IGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
             + V  E +P++ TIL SI        QR +    ++R  +  +P+QP D    +L+ +
Sbjct: 586 FTIHVPSEKIPEIKTILKSISWRRYKVLQRRVLQ--VQRHFVINRPSQPFDMLRMLLHSV 643


>gi|307111637|gb|EFN59871.1| hypothetical protein CHLNCDRAFT_133704 [Chlorella variabilis]
          Length = 833

 Score =  114 bits (285), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 99/359 (27%), Positives = 149/359 (41%), Gaps = 72/359 (20%)

Query: 42  VYVYELPSKYNKKLL----LKDPRCLTH---------------MFAAEIFMHRFLLSSPV 82
           +YVYELP  YN  +L     KD    TH               ++  E  +H  LL S  
Sbjct: 371 IYVYELPPMYNALMLQYRVAKDD--CTHRVFNEQNESYTPFLWLYQPETGIHEMLLQSEH 428

Query: 83  RTLNPEEADWFYTPIYPTCDLTPT----------GLPLPFKS---PRMMRSAIQLISSNW 129
           RTL+PEEAD+FY P+Y +C + P           G P   ++     M       + S++
Sbjct: 429 RTLDPEEADYFYLPVYASCAIWPVLYSNDFPYFHGGPAAQRTHGATNMFMEVQSWVRSHF 488

Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGS 189
           PYW+R  G DH  +  HD G+C+            LP + R  +V +   R  V    G+
Sbjct: 489 PYWDRNGGRDHIVLTVHDEGSCW------------LPAVLRPAIVMSHWGRTDVNPPAGT 536

Query: 190 ----------ITIPPYAPPQKM-QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGA 238
                     +  P + P   + +  + P   P  +     G        PE   Y+RG 
Sbjct: 537 GYDADTYSNEVRHPVWQPEGHLSKLGEFPCYDPSKVTYILGGRI-----QPENARYSRGT 591

Query: 239 RA---------AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAV 289
           R            W+ ++ +         P  Y E M R++FCL  +G   +S R  +AV
Sbjct: 592 RQFLANISEAEGWWDKYRIHVGAGSPPGGPGDYSECMARSVFCLALMG-DGYSSRFDDAV 650

Query: 290 VFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLAN 348
           + GCIPVI+ D I L +   +      + V + D+ ++  IL ++P E I R Q  LA 
Sbjct: 651 LHGCIPVIVQDGIELTWHSLLDIPAYSLRVPQADMARIPQILQAVPQEDIARMQANLAK 709


>gi|218192182|gb|EEC74609.1| hypothetical protein OsI_10218 [Oryza sativa Indica Group]
          Length = 737

 Score =  114 bits (285), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 100/394 (25%), Positives = 168/394 (42%), Gaps = 68/394 (17%)

Query: 17  NTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLL------------LKDPRCLT 64
           N P ++ IS S+ +V       +  +YVY+LP++++  LL            + D +  T
Sbjct: 271 NLPDLKNISSSSINVKVVAKKKRPLIYVYDLPAEFDSHLLEGRHYKFQCVNRIYDDKNRT 330

Query: 65  ----HMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMM-- 118
                ++ A+I ++  +L+SP RTLN +EAD+FY P   +C +T +      + PR +  
Sbjct: 331 IWTQQLYGAQIALYESILASPHRTLNGDEADYFYVPALDSCLITRSDDAPHLQMPRDLRL 390

Query: 119 ---------RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFH----YQEEKAIERGIL 165
                    R     I+  +PYWNRT G DH +    D GAC+     +     +  G  
Sbjct: 391 RSYHTLEYYRMTYDHIAQRYPYWNRTSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWGNT 450

Query: 166 PLLQRATLVQTFGQR----------NHVCLN-EGSITIPPYAPPQ------KMQAHQIPP 208
               + +    +             NH C +    + +P +  P       K+ A     
Sbjct: 451 NTKHKNSTTAYWADNWNYIPIDRRGNHPCFDPRKDLVLPAWKQPNPAAIWLKLWAR---T 507

Query: 209 DTPRSIFVYFRGLFYDVNNDPEG-GYYARGARAAVWENFKNNP----------LFDISTD 257
              R+   YF G       D      Y+ G R  +   F + P            +++  
Sbjct: 508 RNNRTTLFYFNGNLGPAYKDGRHEDTYSMGIRQKLAAEFGSTPDKQGKLGRQHTANVTVT 567

Query: 258 HPTT--YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEI 315
           +  T  YYE++  +IFC    G   WS R+ ++++ GCIPVII D I+LP+ + + +   
Sbjct: 568 YLRTEKYYEELASSIFCGVLPG-DGWSGRMEDSMLQGCIPVIIQDGILLPYENMLNYNSF 626

Query: 316 GMFVAEEDVPKLDTILTSIPP---EVILRKQRLL 346
            + + E+D+P L  IL  I     E +LR  R +
Sbjct: 627 AVRIQEDDIPNLIRILRGINETQVEFMLRNVRQI 660


>gi|224142972|ref|XP_002324801.1| predicted protein [Populus trichocarpa]
 gi|222866235|gb|EEF03366.1| predicted protein [Populus trichocarpa]
          Length = 398

 Score =  114 bits (285), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 92/349 (26%), Positives = 157/349 (44%), Gaps = 43/349 (12%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAE-IFMHRFLLSSPVRTLNPEEADWFYTPIY 98
           LKVY+Y+   +  + +L + P  L  ++A+E  FM     +    T +P+++  FY P  
Sbjct: 68  LKVYIYK---EGERPILHQAP--LKGIYASEGWFMKLLETNKKFVTKDPKKSHLFYLPFS 122

Query: 99  PTCDLTPTGLPLPFKSPRM---MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQ 155
                    +P       +   +++ + +IS+ +P+WNRT GADHF V  HD+      Q
Sbjct: 123 SRNLEVNLYVPNSHSHKNLIQYLKNYLDMISAKYPFWNRTRGADHFLVACHDWAPTETRQ 182

Query: 156 EEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIF 215
                 R +     +   V  FG+     L E ++      P   ++     P + RSI 
Sbjct: 183 HMANCIRALCNSDAKGGFV--FGK--DAALPETTVR----TPQNLLRDLGGKPASKRSIL 234

Query: 216 VYFRGLFYDV----------NNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYED 265
            +F G  +            N DP+   + +              L  +       Y + 
Sbjct: 235 AFFAGSMHGYLRPILLQHWGNKDPDVKVFGK--------------LPKVKGRGKMNYPQY 280

Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
           M+ + +C+C  G+   SPR+VEA+ + C+PVII+D+ V PF + + WE   +FV E+D+P
Sbjct: 281 MKSSKYCICAKGFEVNSPRVVEAIFYECVPVIISDNFVPPFFEVLNWESFAVFVLEKDIP 340

Query: 326 KLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
            L  IL SI PE   R+ ++      +  +   +P +  D FH IL+ +
Sbjct: 341 NLKNILLSI-PENKYREMQMRVKKVQQHFLWHARPVK-YDIFHMILHSV 387


>gi|356496874|ref|XP_003517290.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
           max]
          Length = 404

 Score =  114 bits (285), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 99/353 (28%), Positives = 158/353 (44%), Gaps = 56/353 (15%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
           K KVY+Y  P   +     + PR LT  +A+E +  + +  S   T NP+EA  F+ PI 
Sbjct: 82  KFKVYIY--PDG-DPNTFYQTPRKLTGKYASEGYFFQNIRESRFCTENPDEAHLFFIPI- 137

Query: 99  PTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
            +C               ++++ ++ + S +PYWNRT GADHFFV  HD G        +
Sbjct: 138 -SCHKMRGKGTSYENMTIIVQNYVESLISKYPYWNRTLGADHFFVTCHDVGV-------R 189

Query: 159 AIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY---APPQKMQAHQIPPDTPRSIF 215
           A E G+  L++ +           VC     +   P+   A PQ +Q   +P        
Sbjct: 190 ATE-GLEFLVKNSIRA--------VCSPSYDVGFIPHKDVALPQVLQPFALPAGGN---- 236

Query: 216 VYFRGLFYDVNNDPEGGYYA-------RGARAAVWENFKNNPLFDISTDHPT------TY 262
                   D+ N    G++A       R   A VWEN   +   DIS +  +       Y
Sbjct: 237 --------DIENRTTLGFWAGHRNSKIRVILARVWEN---DTELDISNNRISRATGHLVY 285

Query: 263 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
            +   R+ FC+CP G    S R+ +++ +GCIPVI+++   LPF D + W +  + + E 
Sbjct: 286 QKRFYRSKFCICPGGSQVNSARIADSIHYGCIPVILSNYYDLPFNDILDWNKFAVVLKES 345

Query: 323 DVPKLDTILTSIP-PEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           DV +L  IL +I   E +     L+    +++   +  P    DAFH ++  L
Sbjct: 346 DVYQLKQILKNISDAEFVTLHNNLV---KVQKHFQWNSPPIRFDAFHLVMYDL 395


>gi|356546040|ref|XP_003541440.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
           max]
          Length = 491

 Score =  114 bits (285), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 95/354 (26%), Positives = 154/354 (43%), Gaps = 54/354 (15%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
           + KV+VYE      +  +  +  C +       F+H   ++   RT +P++A  F+ P  
Sbjct: 161 QFKVFVYE----EGETPVFHNGPCKSIYSMEGNFIHAIEMNDHFRTKDPKKAHVFFLPFS 216

Query: 99  PTC--------DLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFG- 149
                      D    G   P K  + +   + LI++ +PYWNR+ GADHF +  HD+G 
Sbjct: 217 VVMMVRFVYERDSRDFG---PIK--KTVIDYVNLIATRYPYWNRSLGADHFMLACHDWGP 271

Query: 150 -ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQI-- 206
            A F       + +  + +L  A   + F     V             P   +Q   I  
Sbjct: 272 EASFSL---PYLHKNSIRVLCNANTSEGFKPAKDVSF-----------PEINLQTGSING 317

Query: 207 ----PPDTPRSIFVYFRGLFYDVNNDPEGGYYA--RGARAAVWENFKNNPLFDISTDHPT 260
               P  + RSI  +F            GG +   R      WEN   +           
Sbjct: 318 FVGGPSASKRSILAFF-----------AGGVHGPIRPILLEHWENKDEDIQVHKYLPKGV 366

Query: 261 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
           +YY  ++++ FCLCP G+   SPR+VEA+  GC+PV+I++  V PF+D + W+   + ++
Sbjct: 367 SYYGMLRKSKFCLCPSGYEVASPRVVEAIYTGCVPVLISEHYVPPFSDVLNWKSFSVELS 426

Query: 321 EEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
            +D+P L  IL SI P   +R QR +    ++R      P +  D FH IL+ +
Sbjct: 427 VKDIPILKDILMSISPRQHIRMQRRVGQ--IRRHFEVHSPPKRFDVFHMILHSV 478


>gi|302820946|ref|XP_002992138.1| glycosyltransferase-like protein [Selaginella moellendorffii]
 gi|300140064|gb|EFJ06793.1| glycosyltransferase-like protein [Selaginella moellendorffii]
          Length = 342

 Score =  114 bits (285), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 98/345 (28%), Positives = 149/345 (43%), Gaps = 61/345 (17%)

Query: 58  KDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKS--- 114
           + PR +T  +A+E +  + L  S   T NP +A  F+ PI  +C      +P    S   
Sbjct: 20  QTPRKITGKYASEGYFFQNLRESKFVTKNPNKAHLFFIPI--SCHKMRGKVPYYLTSNWN 77

Query: 115 ----------PRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGI 164
                       +++  ++ +   +PYWNRT GADHFFV  HD GA       +A  + +
Sbjct: 78  KMQGISYEKMADIVQEYVEGLIVKYPYWNRTLGADHFFVTCHDVGA-------RATNK-V 129

Query: 165 LPLLQRATLVQTFGQRNHVC---LNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGL 221
             L++ +  V        VC    N   I     A PQ +Q   +P           RG 
Sbjct: 130 ANLVKNSIRV--------VCSPSYNGDFIPHKDIAMPQVLQPFALP-----------RG- 169

Query: 222 FYDVNNDPEGGYYA-------RGARAAVWEN-----FKNNPLFDISTDHPTTYYEDMQRA 269
             DV N    G++A       R   A +WE        NN +   + +    Y +   R+
Sbjct: 170 GNDVRNRTILGFWAGHRNSKIRVVLAKLWEEDDVLAISNNRISRATGE--LVYQKQFYRS 227

Query: 270 IFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDT 329
            FC+CP G    S R+V+++ +GC+PVI++D   LPF D + W+   + + E DV  L  
Sbjct: 228 KFCICPGGSQVNSARIVDSIHYGCVPVILSDHYDLPFNDVLDWKRFALLLRERDVGDLKL 287

Query: 330 ILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
            L S+  E  L   R L      R   +  P +P DAFH ++  L
Sbjct: 288 KLQSVSKEQYLSLHRGLVEVVQDR-FEWHTPPRPYDAFHMVVYEL 331


>gi|334187775|ref|NP_197468.2| Exostosin family protein [Arabidopsis thaliana]
 gi|332005353|gb|AED92736.1| Exostosin family protein [Arabidopsis thaliana]
          Length = 610

 Score =  114 bits (284), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 93/351 (26%), Positives = 158/351 (45%), Gaps = 48/351 (13%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTL-NPEEADWFYTP-- 96
           LKVYVY    K   + +   P  L  ++A+E +  + +  +   T+ +P +A  +Y P  
Sbjct: 281 LKVYVY----KEGNRPIFHTP-ILKGLYASEGWFMKLMEGNKQYTVKDPRKAHLYYMPFS 335

Query: 97  -------IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFG 149
                  +Y       T L       + ++   + ISS +P++NRT+GADHF V  HD+ 
Sbjct: 336 ARMLEYTLYVRNSHNRTNLR------QFLKEYTEHISSKYPFFNRTDGADHFLVACHDWA 389

Query: 150 ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPD 209
               Y+    +E  I  L   A +   F     + L E  +     A    ++     P 
Sbjct: 390 P---YETRHHMEHCIKALCN-ADVTAGFKIGRDISLPETYVR----AAKNPLRDLGGKPP 441

Query: 210 TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP------TTYY 263
           + R    ++ G  +          Y R      W++   +P   I    P        Y 
Sbjct: 442 SQRRTLAFYAGSMHG---------YLRQILLQHWKD--KDPDMKIFGRMPFGVASKMNYI 490

Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
           E M+ + +C+CP G+   SPR+VE++ + C+PVII+D+ V PF + + W    + VAE+D
Sbjct: 491 EQMKSSKYCICPKGYEVNSPRVVESIFYECVPVIISDNFVPPFFEVLDWSAFSVIVAEKD 550

Query: 324 VPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           +P+L  IL SIP +  ++ Q  +A    +R  L+    +  D FH +L+ +
Sbjct: 551 IPRLKDILLSIPEDKYVKMQ--MAVRKAQRHFLWHAKPEKYDLFHMVLHSI 599


>gi|125585107|gb|EAZ25771.1| hypothetical protein OsJ_09611 [Oryza sativa Japonica Group]
          Length = 779

 Score =  114 bits (284), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 100/394 (25%), Positives = 168/394 (42%), Gaps = 68/394 (17%)

Query: 17  NTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLL------------LKDPRCLT 64
           N P ++ IS S+ +V       +  +YVY+LP++++  LL            + D +  T
Sbjct: 313 NLPDLKNISSSSINVKVVAKKKRPLIYVYDLPAEFDSHLLEGRHYKFQCVNRIYDDKNRT 372

Query: 65  ----HMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMM-- 118
                ++ A+I ++  +L+SP RTLN +EAD+FY P   +C +T +      + PR +  
Sbjct: 373 IWTQQLYGAQIALYESILASPHRTLNGDEADYFYVPALDSCLITRSDDAPHLQMPRDLRL 432

Query: 119 ---------RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFH----YQEEKAIERGIL 165
                    R     I+  +PYWNRT G DH +    D GAC+     +     +  G  
Sbjct: 433 RSYHTLEYYRMTYDHIAQRYPYWNRTSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWGNT 492

Query: 166 PLLQRATLVQTFGQR----------NHVCLN-EGSITIPPYAPPQ------KMQAHQIPP 208
               + +    +             NH C +    + +P +  P       K+ A     
Sbjct: 493 NTKHKNSTTAYWADNWNYIPIDRRGNHPCFDPRKDLVLPAWKQPNPAAIWLKLWART--- 549

Query: 209 DTPRSIFVYFRGLFYDVNNDPEG-GYYARGARAAVWENFKNNP----------LFDISTD 257
              R+   YF G       D      Y+ G R  +   F + P            +++  
Sbjct: 550 RNNRTTLFYFNGNLGPAYKDGRHEDTYSMGIRQKLAAEFGSTPDKQGKLGRQHTANVTVT 609

Query: 258 HPTT--YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEI 315
           +  T  YYE++  +IFC    G   WS R+ ++++ GCIPVII D I+LP+ + + +   
Sbjct: 610 YLRTEKYYEELASSIFCGVLPG-DGWSGRMEDSMLQGCIPVIIQDGILLPYENMLNYNSF 668

Query: 316 GMFVAEEDVPKLDTILTSIPP---EVILRKQRLL 346
            + + E+D+P L  IL  I     E +LR  R +
Sbjct: 669 AVRIQEDDIPNLIRILRGINETQVEFMLRNVRQI 702


>gi|297805260|ref|XP_002870514.1| hypothetical protein ARALYDRAFT_915839 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297316350|gb|EFH46773.1| hypothetical protein ARALYDRAFT_915839 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 536

 Score =  114 bits (284), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 100/346 (28%), Positives = 156/346 (45%), Gaps = 57/346 (16%)

Query: 39  KLKVYVYELPSK--YNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTL-NPEEADWFYT 95
           KLKVYVY+   K  ++K L    PR +   +A+E +  + + S+    + +P +A  FY 
Sbjct: 227 KLKVYVYKEGGKPIFHKPL----PRGI---YASEGWFMKLMESNKKFVVRDPRKAHLFYI 279

Query: 96  PIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQ 155
           PI                    ++  + LI+  + +WNRT GADHF V  HD+G     +
Sbjct: 280 PI------------------NHLKEYVDLIAGKYKFWNRTGGADHFIVACHDWGNKLTKK 321

Query: 156 EEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIF 215
             K   R     L  + + Q F       L      I     P +    + P  + R I 
Sbjct: 322 TMKNSVRA----LCNSNVAQGFRIGTDTAL--PVTYIRSAESPLEYLGGKTP--SKRKIL 373

Query: 216 VYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP------TTYYEDMQRA 269
            +F G  +          Y R     +WEN    P   I    P      T Y E M+ +
Sbjct: 374 AFFAGSMHG---------YLRPILVKLWEN--KEPDMKIVGPMPRDPESKTQYREYMKSS 422

Query: 270 IFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDT 329
            +C+C  G+   +PR+VEA++  C+PVIIAD+ V PF + + WEE  +FV E+++  L  
Sbjct: 423 KYCICARGYEVHTPRVVEAIINECVPVIIADNYVPPFFEILNWEEFAVFVEEKEIANLRN 482

Query: 330 ILTSIPPE-VILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           IL SI  E  I+ + R+ A   +++  L+ +     D FH IL+ +
Sbjct: 483 ILLSISEERYIVMQARVKA---VQQHFLWHKKPVKFDLFHMILHSI 525


>gi|449462352|ref|XP_004148905.1| PREDICTED: probable glycosyltransferase At5g25310-like [Cucumis
           sativus]
 gi|449523501|ref|XP_004168762.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
           At5g25310-like [Cucumis sativus]
          Length = 684

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 95/345 (27%), Positives = 160/345 (46%), Gaps = 37/345 (10%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP-VRTLNPEEADWFYTPIY 98
           LKVY+Y   ++    +  + P  L  ++A+E +  + L S+    T NP +A  FY P +
Sbjct: 355 LKVYIYREGAR---PIFHQGP--LQSIYASEGWFMKILESNKKFVTKNPRKAHLFYLP-F 408

Query: 99  PTCDLTPTGLPLPFKSPR----MMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHY 154
            +  L          S +     +++ +  I++ +P+WNRT GADHF V  HD+      
Sbjct: 409 SSRQLEEVLYVRDSHSHKNLIQHLKNYLDFIAAKYPHWNRTGGADHFLVACHDWAPA--- 465

Query: 155 QEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSI 214
           +  K + + I  L   + + + F     V L E  + +    P + +  +   P + R I
Sbjct: 466 ETRKYMAKCIRALCN-SDVKEGFVFGKDVSLPETFVRVA-RNPLRDVGGN---PSSKRPI 520

Query: 215 FVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPT-----TYYEDMQRA 269
             +F G  +          Y R      WE  + +P   IS   P       Y   M+ +
Sbjct: 521 LAFFAGSMHG---------YLRSTLLEYWE--RKDPDMKISGPMPKVKGSKNYLWHMKNS 569

Query: 270 IFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDT 329
            +C+C  G+   SPR+VE++++ C+PVII+D+ V P  + + WE   +FVAE+D+P L  
Sbjct: 570 KYCICAKGYEVNSPRVVESILYECVPVIISDNFVPPLFEVLNWESFAVFVAEKDIPNLKK 629

Query: 330 ILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           IL SIP +     Q  +    ++   L+    Q  D FH IL+ +
Sbjct: 630 ILLSIPEKRYREMQMRVK--KLQPHFLWHAKPQKYDMFHMILHSI 672


>gi|297819890|ref|XP_002877828.1| hypothetical protein ARALYDRAFT_906539 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323666|gb|EFH54087.1| hypothetical protein ARALYDRAFT_906539 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 107

 Score =  113 bits (283), Expect = 1e-22,   Method: Composition-based stats.
 Identities = 48/105 (45%), Positives = 75/105 (71%), Gaps = 1/105 (0%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
           +K+YVY+LP+ +N   +    RC TH+FAAE+ +HR LLSS VRTL+PEEAD+F+ P+Y 
Sbjct: 1   MKIYVYDLPASFNDDWVTASDRCATHLFAAEVAIHRTLLSSDVRTLDPEEADFFFVPVYV 60

Query: 100 TCDLTPT-GLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFV 143
           +C+ + + G P    +  +  SA+  +S ++P+W+RT+G+DH FV
Sbjct: 61  SCNFSKSNGFPSLSHARSLFSSAVDFLSDHYPFWDRTQGSDHVFV 105


>gi|358348291|ref|XP_003638181.1| hypothetical protein MTR_121s0013 [Medicago truncatula]
 gi|355504116|gb|AES85319.1| hypothetical protein MTR_121s0013 [Medicago truncatula]
          Length = 550

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 98/350 (28%), Positives = 155/350 (44%), Gaps = 36/350 (10%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEI-FMHRFLLSSPVRTLNPEEADWFYTPI 97
           KLKVY+Y    +  +K +   P+ +  ++A+E  FM     +      +P++A  FY P 
Sbjct: 225 KLKVYIY----REGEKPIFHQPK-MRGIYASEGWFMKLMEGNKRFIVKDPKKAHLFYLPF 279

Query: 98  YPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEE 157
             +  +    L    K  + +   + +I+  + +WNRT GADHF V  HD+ +    Q  
Sbjct: 280 --SSQMLRANLSDNKKMEQYLDKYVNIIAGKYRFWNRTGGADHFLVACHDWASRITRQPM 337

Query: 158 KAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVY 217
           K   R     L  A + + F       L   +  I     P +  A + P +  R+I  +
Sbjct: 338 KNCIRS----LCNANVAKGFQIGKDTTL--PATYIHSVMNPLRKIAGKHPSE--RTILAF 389

Query: 218 FRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDI------STDHPTTYYEDMQRAIF 271
           F G  +          Y R      WEN    P   I        +    Y + M  + +
Sbjct: 390 FAGSMHG---------YLRPILLKHWEN--KEPDMKIFGAMARDAEGKRIYMDYMNSSKY 438

Query: 272 CLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTIL 331
           C+C  G+  +SPR+VEA+   C+PVII+D+ V PF + + WE   +FV E DVP L +IL
Sbjct: 439 CICARGYEVYSPRIVEAIFSECVPVIISDNYVPPFFEVLKWEAFSVFVRERDVPNLRSIL 498

Query: 332 TSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL-ARKLPH 380
            SI  E  L     L    +++  L+ +     D FH IL+ +   +L H
Sbjct: 499 LSITEEKYLALH--LGVKKVQQHFLWHKVPVKYDLFHMILHSIWNNRLSH 546


>gi|15239118|ref|NP_196722.1| Exostosin family protein [Arabidopsis thaliana]
 gi|7573418|emb|CAB87721.1| putative protein [Arabidopsis thaliana]
 gi|332004317|gb|AED91700.1| Exostosin family protein [Arabidopsis thaliana]
          Length = 546

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 98/345 (28%), Positives = 160/345 (46%), Gaps = 37/345 (10%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSS-PVRTLNPEEADWFYTPIY 98
           LKVYVY   S+ ++ +  +    +  ++A+E +  + + SS    T +P +A  FY P  
Sbjct: 220 LKVYVY---SEGDRPIFHQPEAIMEGIYASEGWFMKLMESSHRFLTKDPTKAHLFYIPF- 275

Query: 99  PTCDLTPTGLPLPFKSPR-----MMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFH 153
            +  +    L +     R      + + I LI+SN+P WNRT G+DHFF   HD+     
Sbjct: 276 -SSRILQQKLYVHDSHSRNNLVKYLGNYIDLIASNYPSWNRTCGSDHFFTACHDWAPT-- 332

Query: 154 YQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRS 213
             E +      +  L  A +   F     V L E  ++     P  K+   +    + R+
Sbjct: 333 --ETRGPYINCIRALCNADVGIDFVVGKDVSLPETKVS-SLQNPNGKIGGSR---PSKRT 386

Query: 214 IFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDIST----DHPTTYYEDMQRA 269
           I  +F G  +        GY     R  +   + + P  D+      DH  +Y   M+R+
Sbjct: 387 ILAFFAGSLH--------GY----VRPILLNQWSSRPEQDMKIFNRIDH-KSYIRYMKRS 433

Query: 270 IFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDT 329
            FC+C  G+   SPR+VE++++GC+PVII+D+ V PF + + WE   +FV E+++P L  
Sbjct: 434 RFCVCAKGYEVNSPRVVESILYGCVPVIISDNFVPPFLEILNWESFAVFVPEKEIPNLRK 493

Query: 330 ILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           IL SIP    +  Q+ +     K  M         D FH IL+ +
Sbjct: 494 ILISIPVRRYVEMQKRVLK-VQKHFMWHDGEPVRYDIFHMILHSV 537


>gi|26450497|dbj|BAC42362.1| unknown protein [Arabidopsis thaliana]
          Length = 546

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 98/345 (28%), Positives = 160/345 (46%), Gaps = 37/345 (10%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSS-PVRTLNPEEADWFYTPIY 98
           LKVYVY   S+ ++ +  +    +  ++A+E +  + + SS    T +P +A  FY P  
Sbjct: 220 LKVYVY---SEGDRPIFHQPEAIMEGIYASEGWFMKLMESSHRFLTKDPTKAHLFYIPF- 275

Query: 99  PTCDLTPTGLPLPFKSPR-----MMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFH 153
            +  +    L +     R      + + I LI+SN+P WNRT G+DHFF   HD+     
Sbjct: 276 -SSRILQQKLYVHDSHSRNNLVKYLGNYIDLIASNYPSWNRTCGSDHFFTACHDWAPT-- 332

Query: 154 YQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRS 213
             E +      +  L  A +   F     V L E  ++     P  K+   +    + R+
Sbjct: 333 --ETRGPYINCIRALCNADVGIDFVVGKDVSLPETKVS-SLQNPNGKIGGSR---PSKRT 386

Query: 214 IFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDIST----DHPTTYYEDMQRA 269
           I  +F G  +        GY     R  +   + + P  D+      DH  +Y   M+R+
Sbjct: 387 ILAFFAGSLH--------GY----VRPILLNQWSSRPEQDMKIFNRIDH-KSYIRYMKRS 433

Query: 270 IFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDT 329
            FC+C  G+   SPR+VE++++GC+PVII+D+ V PF + + WE   +FV E+++P L  
Sbjct: 434 RFCVCAKGYEVNSPRVVESILYGCVPVIISDNFVPPFLEILNWESFAVFVPEKEIPNLRK 493

Query: 330 ILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           IL SIP    +  Q+ +     K  M         D FH IL+ +
Sbjct: 494 ILISIPVRRYVEMQKRVLK-VQKHFMWHDGEPVRYDIFHMILHSV 537


>gi|218188734|gb|EEC71161.1| hypothetical protein OsI_03019 [Oryza sativa Indica Group]
          Length = 499

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 99/349 (28%), Positives = 158/349 (45%), Gaps = 39/349 (11%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAE-IFMHRFLLSSPVRTLNPEEADWFYTPIY 98
           LKVY+Y+      ++ +   P  L+ ++A+E  FM     S      +P +A  FY P Y
Sbjct: 164 LKVYIYQ----DGRRPIFHTP-PLSGIYASEGWFMKLLKESRRFAVTDPAKAHLFYLP-Y 217

Query: 99  PTCDLTPTGLPLPFKS-----PRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFH 153
            +  L    L +P           +R  ++ +++ +P+WNRT GADHF V  HD+G+ + 
Sbjct: 218 SSQQLR-ISLYVPDSHNLRPLAAYLRDFVKGLAAKYPFWNRTRGADHFLVACHDWGS-YT 275

Query: 154 YQEEKAIERGILPLLQRATLVQ-TFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPR 212
                 + R  +  L  A   +  F     V L E +I      P + ++     P + R
Sbjct: 276 TTAHGDLRRNTVKALCNADSSEGIFTPGRDVSLPETTIR----TPRRPLRYVGGLPVSRR 331

Query: 213 SIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPL-----FDISTDHPTTYYEDMQ 267
            I  +F G  +            R      W + +++ +              +Y + M+
Sbjct: 332 GILAFFAGNVHG---------RVRPVLLKHWGDGRDDDMRVYGPLPARVSRRMSYIQHMK 382

Query: 268 RAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKL 327
            + FCLCP+G+   SPR+VEA+ + C+PVIIAD+ VLP +D + W    + VAE+DVP L
Sbjct: 383 NSRFCLCPMGYEVNSPRIVEALYYECVPVIIADNFVLPLSDVLDWSAFAVVVAEKDVPDL 442

Query: 328 DTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP--GDAFHQILNGL 374
             IL  I     LRK   +     +    F   A+P   D FH IL+ +
Sbjct: 443 KKILQGI----TLRKYVAMHGCVKRLQRHFLWHARPLRYDLFHMILHSI 487


>gi|356495238|ref|XP_003516486.1| PREDICTED: probable glycosyltransferase At3g07620-like [Glycine
           max]
          Length = 610

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 95/343 (27%), Positives = 150/343 (43%), Gaps = 35/343 (10%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEI-FMHRFLLSSPVRTLNPEEADWFYTPI 97
           KLKV++Y    +   K +   P+ +  ++A+E  FM     +      +P +A  FY P 
Sbjct: 285 KLKVFIY----REGAKPIFHQPK-MRGIYASEGWFMKLMEGNKRFIVKDPRKAHLFYLPF 339

Query: 98  YPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEE 157
             +  +    L  P +  + +   ++LI+  + +WNRT+GADHF V  HD+ +    Q  
Sbjct: 340 --SSQMLRVTLSNPKQMEQHLEKYVELIAGRYRFWNRTDGADHFLVACHDWASRITRQPM 397

Query: 158 KAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVY 217
           K    G +  L  + + + F       L      I     P K  A +  P + RS   +
Sbjct: 398 K----GCIRSLCNSNVAKGFQIGKDTTL--PVTYIHSVMDPLKECAGK--PPSERSALAF 449

Query: 218 FRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDI------STDHPTTYYEDMQRAIF 271
           F G  +          Y R      W N    P   I        +    Y E M  + +
Sbjct: 450 FAGSMHG---------YLRPILLKHWAN--KEPDMKIFGPMPRDLEGKKMYMEYMNSSKY 498

Query: 272 CLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTIL 331
           C+C  G+   +PR++EA+  GC+PVII+D+ V P  + + WE   +FV E DVP L  IL
Sbjct: 499 CICARGYEVHTPRIIEAIFSGCVPVIISDNYVPPLFEVLKWEAFSLFVRERDVPSLRDIL 558

Query: 332 TSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
            SIP E  L     L    +++  L+ +     D FH IL+ +
Sbjct: 559 LSIPEEKYLALH--LGVKKVQQHFLWHKVPVKYDLFHMILHAI 599


>gi|357118563|ref|XP_003561022.1| PREDICTED: probable glycosyltransferase At5g25310-like
           [Brachypodium distachyon]
          Length = 479

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 99/375 (26%), Positives = 162/375 (43%), Gaps = 66/375 (17%)

Query: 44  VYELPSKYNKKLLLKDPRCLTHMFAAE------------------IFMHRFLLSSP---- 81
           VY  P+ +++  +  + R   H++A                     F+ +  L SP    
Sbjct: 112 VYRNPAAFHRSYVEMERRFKVHVYAEGEPPILHAGPCKNIYTIEGRFIEQLELMSPGGAG 171

Query: 82  VRTLNPEEADWFYTPIYPTCDLTPTGLPLPF-KSP--RMMRSAIQLISSNWPYWNRTEGA 138
           VRT +PE A  F+ P   +  +    +PL + ++P   ++   ++++++   +WNR+ GA
Sbjct: 172 VRTWDPERAHAFFLPFSVSQMVQFAYVPLSYDRAPLRALVADYVRVVAARHRFWNRSSGA 231

Query: 139 DHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPP 198
           DHF +  HD+G      + +    GI  L   A   + F     V + E ++        
Sbjct: 232 DHFMLSCHDWGPEASRGDPELYGNGIRALCN-ANTSEGFRPGKDVSIPEINL-------- 282

Query: 199 QKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAV-------WENFK--NN 249
                     DTPR + +   GL    +  P   ++A G    V       W+     N 
Sbjct: 283 -------YDGDTPRQLLLPAPGL----SERPYLAFFAGGRHGHVRDLLLREWKGRDPDNF 331

Query: 250 PLFDIS----------TDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIA 299
           P+++             D    YY  MQRA FCLCP G    SPR+VEA+  GC+PV+++
Sbjct: 332 PVYEYDLPTTTNTTGGGDKQHDYYSYMQRARFCLCPSGHEVASPRVVEAIHAGCVPVLVS 391

Query: 300 DDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQ 359
           D    PFAD + WE   + V   D+P+L  +L SIP   + R +       +K+     Q
Sbjct: 392 DGYAPPFADVLRWEGFSVSVPVADIPRLREVLESIPAAEVERLRD--GGRLVKQHFTLRQ 449

Query: 360 PAQPGDAFHQILNGL 374
           P +  D FH IL+ +
Sbjct: 450 PPERLDMFHMILHSV 464


>gi|302761638|ref|XP_002964241.1| xylosyltransferase-like protein [Selaginella moellendorffii]
 gi|300167970|gb|EFJ34574.1| xylosyltransferase-like protein [Selaginella moellendorffii]
          Length = 332

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 86/296 (29%), Positives = 127/296 (42%), Gaps = 24/296 (8%)

Query: 84  TLNPEEADWFYTPIYPTCDLTPTGLPLPFKSP-----RMMRSAIQLISSNWPYWNRTEGA 138
           T +PE+A  F+ P   +  +  T L  P         R  R  I +IS  +  WNR+ GA
Sbjct: 45  TSDPEKAHLFFLPF--SVAMMVTYLYTPGSHDMGPLGRFTRDYIDVISHRYSAWNRSRGA 102

Query: 139 DHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPP 198
           DHF V  HD+G          +   I  L    T         +V   + S+        
Sbjct: 103 DHFMVSCHDWGPHISRAHPDLMANSIRVLCNANT------SEGYVPSKDASLPEIHLVGG 156

Query: 199 QKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDH 258
           Q       PP   R    +F G     ++ P      R      W+    +         
Sbjct: 157 QVPSVLGGPPPEERRYLAFFAG----GDHGP-----VRPVLFKYWKEKDEDVRVFEKLPS 207

Query: 259 PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
              Y + M  + +CLCP G+   SPR+VEA+   C+PV+IADD VLPF+D + W+   + 
Sbjct: 208 RDAYLDYMSHSKYCLCPGGYEVNSPRIVEAIYNDCVPVVIADDFVLPFSDVLDWDAFSVK 267

Query: 319 VAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           V E D+P+L TIL +IP    L  Q  ++   ++R   F QP +  D F+ IL+ +
Sbjct: 268 VLERDIPRLKTILQAIPTARYLEMQARVSK--VRRHFRFNQPPERYDVFNMILHSV 321


>gi|357466145|ref|XP_003603357.1| hypothetical protein MTR_3g106690 [Medicago truncatula]
 gi|355492405|gb|AES73608.1| hypothetical protein MTR_3g106690 [Medicago truncatula]
          Length = 393

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 85/288 (29%), Positives = 134/288 (46%), Gaps = 35/288 (12%)

Query: 92  WFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGAC 151
           + YTP   + D+TP          + +   ++++S  +P+WNRT GADHF +  HD+G  
Sbjct: 125 YLYTP--SSYDITPL--------TQFVSDYVRVVSMRYPFWNRTHGADHFMLACHDWGP- 173

Query: 152 FHYQEEK-AIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDT 210
            H  +    +    + +L  A   + F     V L E    I  Y      +   +PP+ 
Sbjct: 174 -HASKGNPFLYNTSIRVLCNANTSEGFNPLKDVSLPE----IHLYGGEVSPKLLSLPPEN 228

Query: 211 -PRSIFVYFRGLFYDVNNDPEGGYYA--RGARAAVWENFKNNPLFDISTDHPTTYYEDMQ 267
            PR    +F            GG +   R      W+N   + L +        YY  M 
Sbjct: 229 APRRYLAFF-----------AGGMHGPIRPILLQHWKNRDKDILVNEYLPKGIDYYSIML 277

Query: 268 RAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKL 327
            + FCLCP G+   SPR+VE++   C+PVI+++  VLPF+D + WE   + V   D+P+L
Sbjct: 278 SSKFCLCPSGFEVASPRIVESIYAECVPVILSNYYVLPFSDVLQWEAFSVQVDVSDIPRL 337

Query: 328 DTILTSIPPEVILR-KQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
             IL++IP     + KQ + A   ++R     QPA+  D FH IL+ +
Sbjct: 338 KEILSAIPESKYKKLKQGVRA---VRRHFTLNQPAKRFDVFHMILHSI 382


>gi|255567222|ref|XP_002524592.1| catalytic, putative [Ricinus communis]
 gi|223536145|gb|EEF37800.1| catalytic, putative [Ricinus communis]
          Length = 388

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 80/307 (26%), Positives = 137/307 (44%), Gaps = 20/307 (6%)

Query: 71  IFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAI---QLISS 127
           +F+      +  RTLNP+EA  F+ P      +     P+      + R+ +   ++IS 
Sbjct: 88  LFLSLMETDTKFRTLNPDEAQVFFLPFSVVMIIEHLFHPIIRDKAVLERTVVDYVRIISH 147

Query: 128 NWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNE 187
            + YWNR+ GADHF +  HD+G    + E +     I  +L  A   + F  +      E
Sbjct: 148 KYTYWNRSLGADHFMLSCHDWGPRATWYERQLYFNSI-RVLCNANTSEYFNPKKDASFPE 206

Query: 188 GSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFK 247
            ++     A         +PP   R+I  +F G  +            R      W+   
Sbjct: 207 INLITGEIAD----LTGGLPPSN-RTILAFFSGKMHGK---------LRPLLFQHWKEKD 252

Query: 248 NNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFA 307
            + L   +     +Y E M+++ +C+CP G    SPR+VEA+   C+PV+I+ + V PF+
Sbjct: 253 KDVLVYETFPEGLSYQEMMKKSRYCICPSGHEVASPRIVEAIYAECVPVLISQNYVFPFS 312

Query: 308 DAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAF 367
           D + WE   + V+  D+  L  IL  IP +  LR +  +    +++  L   P +  D F
Sbjct: 313 DVLNWESFSIQVSVSDISNLKNILLGIPEDQYLRMRERVKQ--VQQHFLINNPPKRYDVF 370

Query: 368 HQILNGL 374
           H I++ +
Sbjct: 371 HMIIHSI 377


>gi|242043198|ref|XP_002459470.1| hypothetical protein SORBIDRAFT_02g005180 [Sorghum bicolor]
 gi|241922847|gb|EER95991.1| hypothetical protein SORBIDRAFT_02g005180 [Sorghum bicolor]
          Length = 584

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 100/355 (28%), Positives = 158/355 (44%), Gaps = 41/355 (11%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
           +LKVYVYE      +  +  D  C +       F+H    ++ +RT +P +A  F+ P  
Sbjct: 237 QLKVYVYE----EGEPPVFHDGPCRSIYSTEGRFIHAMETATRLRTSDPSQAHVFFLPFS 292

Query: 99  PTCDLTPTGLP-----LPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFH 153
               +     P      P K  R +   +++IS  +P+WNR+ GADHF +  HD+G    
Sbjct: 293 VVKMVKTIYEPGSHDMAPLK--RTVADYLRVISDKYPFWNRSAGADHFMLSCHDWGPYVS 350

Query: 154 YQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRS 213
               +     I  +L  A   + F     V L E  I +   A  +++      P   R 
Sbjct: 351 SANAELFGNSIR-VLCNANTSEGFDLAKDVSLPE--INLRSDAVERQVGG----PSASRR 403

Query: 214 IFVYFRGLFYDVNNDPEGGYYARGARAAVW---ENFKNNPLFDISTDHPTT--------- 261
            F+ F   F   N+ P      R A  A W      +++P   +S   PT          
Sbjct: 404 PFLAF---FAGGNHGP-----VRPALLAHWGPGSGREDDPDVRVSEYLPTRGGRAGASAA 455

Query: 262 -YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD-DIVLPFADAIPWEEIGMFV 319
            Y + M+R+ FCLCP G+   SPRL EAV   C+PV++ D +  LPFAD + W+   + V
Sbjct: 456 AYTDMMRRSRFCLCPGGYEVASPRLAEAVYLECVPVVVDDGEYALPFADVLNWDAFAVRV 515

Query: 320 AEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
              DVP++  IL+++ P   + + +       +  M+   P +  DAFH IL+ +
Sbjct: 516 RVADVPRIKEILSAVSPRQYI-RMQRRVRMVRRHFMVHGGPPRRYDAFHMILHSV 569


>gi|334187794|ref|NP_197526.5| Exostosin family protein [Arabidopsis thaliana]
 gi|332005439|gb|AED92822.1| Exostosin family protein [Arabidopsis thaliana]
          Length = 458

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 83/302 (27%), Positives = 136/302 (45%), Gaps = 26/302 (8%)

Query: 80  SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSA----IQLISSNWPYWNRT 135
           SP    NPEEA  F  P+     +     PL   S   +       + +++  +PYWNR+
Sbjct: 166 SPFAANNPEEAHAFLLPVSVANIVHYLYRPLVTYSREQLHKVFLDYVDVVAHKYPYWNRS 225

Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
            GADHF+V  HD+         + + + ++ +L  A   + F  +  V + E +I     
Sbjct: 226 LGADHFYVSCHDWAPDVSGSNPELM-KNLIRVLCNANTSEGFMPQRDVSIPEINIPGGHL 284

Query: 196 APPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGY-YARGARAAVWENFKNNPLFDI 254
            PP+  ++        R I  +F G          G + Y R      W++         
Sbjct: 285 GPPRLSRS----SGHDRPILAFFAG----------GSHGYIRRILLQHWKDKDEEVQVHE 330

Query: 255 STDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEE 314
                  Y++ M  A FCLCP G+   SPR+V A+  GC+PVII+D   LPF+D + W +
Sbjct: 331 YLAKNKDYFKLMATARFCLCPSGYEVASPRVVAAINLGCVPVIISDHYALPFSDVLDWTK 390

Query: 315 IGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANP--SMKRAMLFPQPAQPGDAFHQILN 372
             + V  + +P++ TIL SI      R+ R+L      ++R  +  +P+QP D    +L+
Sbjct: 391 FTIHVPSKKIPEIKTILKSIS----WRRYRVLQRRVLQVQRHFVINRPSQPFDMLRMLLH 446

Query: 373 GL 374
            +
Sbjct: 447 SV 448


>gi|292630785|sp|Q3E9A4.3|GLYT5_ARATH RecName: Full=Probable glycosyltransferase At5g20260
          Length = 466

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 83/302 (27%), Positives = 136/302 (45%), Gaps = 26/302 (8%)

Query: 80  SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSA----IQLISSNWPYWNRT 135
           SP    NPEEA  F  P+     +     PL   S   +       + +++  +PYWNR+
Sbjct: 174 SPFAANNPEEAHAFLLPVSVANIVHYLYRPLVTYSREQLHKVFLDYVDVVAHKYPYWNRS 233

Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
            GADHF+V  HD+         + + + ++ +L  A   + F  +  V + E +I     
Sbjct: 234 LGADHFYVSCHDWAPDVSGSNPELM-KNLIRVLCNANTSEGFMPQRDVSIPEINIPGGHL 292

Query: 196 APPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGY-YARGARAAVWENFKNNPLFDI 254
            PP+  ++        R I  +F G          G + Y R      W++         
Sbjct: 293 GPPRLSRS----SGHDRPILAFFAG----------GSHGYIRRILLQHWKDKDEEVQVHE 338

Query: 255 STDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEE 314
                  Y++ M  A FCLCP G+   SPR+V A+  GC+PVII+D   LPF+D + W +
Sbjct: 339 YLAKNKDYFKLMATARFCLCPSGYEVASPRVVAAINLGCVPVIISDHYALPFSDVLDWTK 398

Query: 315 IGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANP--SMKRAMLFPQPAQPGDAFHQILN 372
             + V  + +P++ TIL SI      R+ R+L      ++R  +  +P+QP D    +L+
Sbjct: 399 FTIHVPSKKIPEIKTILKSIS----WRRYRVLQRRVLQVQRHFVINRPSQPFDMLRMLLH 454

Query: 373 GL 374
            +
Sbjct: 455 SV 456


>gi|302815783|ref|XP_002989572.1| xylosyltransferase-like protein [Selaginella moellendorffii]
 gi|300142750|gb|EFJ09448.1| xylosyltransferase-like protein [Selaginella moellendorffii]
          Length = 332

 Score =  112 bits (279), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 86/296 (29%), Positives = 126/296 (42%), Gaps = 24/296 (8%)

Query: 84  TLNPEEADWFYTPIYPTCDLTPTGLPLPFKSP-----RMMRSAIQLISSNWPYWNRTEGA 138
           T +PE+A  F+ P   +  +  T L  P         R  R  I +IS  +  WNR+ GA
Sbjct: 45  TSDPEKAHLFFLPF--SVAMMVTYLYTPGSHDMGPLGRFTRDYIDVISHRYSSWNRSRGA 102

Query: 139 DHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPP 198
           DHF V  HD+G          +   I  L    T         +V   + S+        
Sbjct: 103 DHFMVSCHDWGPHISRAHPDLMANSIRVLCNANT------SEGYVPSKDASLPEIHLVGG 156

Query: 199 QKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDH 258
           Q       PP   R    +F G     ++ P      R      W+    +         
Sbjct: 157 QVPSVLGGPPPEERRYLAFFAG----GDHGP-----VRPVLFKYWKEKDEDVRVFEKLPS 207

Query: 259 PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
              Y + M  + +CLCP G+   SPR+VEA+   C+PV+IADD VLPF+D + W+   + 
Sbjct: 208 RDAYLDYMSHSKYCLCPGGYEVNSPRIVEAIYNDCVPVVIADDFVLPFSDVLDWDAFSVK 267

Query: 319 VAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           V E D+P+L TIL +IP    L  Q   +   ++R   F QP +  D F+ IL+ +
Sbjct: 268 VLERDIPRLKTILQAIPTARYLEMQARASK--VRRHFRFNQPPERYDVFNMILHSV 321


>gi|302835519|ref|XP_002949321.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
 gi|300265623|gb|EFJ49814.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
          Length = 855

 Score =  112 bits (279), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 106/405 (26%), Positives = 170/405 (41%), Gaps = 90/405 (22%)

Query: 19  PKIERISGSAGDVLEDDPVGKLK--VYVYELPSKYNKKLLLKD------------PRCLT 64
           P  E  +G+   V E    G+++  +Y+Y++P ++  ++                P   T
Sbjct: 379 PNSEGAAGTGSAVGEQHTPGRMRPLIYIYDMPPEFTSRMHQYKNVHEHCSYRRFIPSNRT 438

Query: 65  HMFA----AEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTG--LPLPF------ 112
            ++A     E + H  L  SP RT +PEEAD+FY P+Y TC + P      +PF      
Sbjct: 439 ELYADTYSVEAYFHEVLSISPHRTFDPEEADFFYVPVYYTCWMWPINGWADMPFYGAPTS 498

Query: 113 -----KSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFH----YQEEKAIE-- 161
                 +  +   A   I SN+P+W+R  G DH ++  HD GAC+     YQ    +   
Sbjct: 499 WHRYSNAANLWLKAKTWIQSNFPFWDRRGGRDHIWMTNHDEGACYMPTEIYQTSIMLTHW 558

Query: 162 ------------------------RGILPLLQRATLVQTFGQRNHVCLN-EGSITIPPYA 196
                                   +G+L      TL Q      H C +    + IP + 
Sbjct: 559 GRMDLNHTSNTAYRPDNYSDGITWKGVLDGKDVKTLYQ-----GHPCYDPRKDLVIPAFK 613

Query: 197 PPQKMQAHQIPPDTPRS--IFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPL--- 251
            P       +    PR   I +Y RG   DV    E   Y+RG R  +++   +N     
Sbjct: 614 TPDHFSQSPLLGSWPRQRDILLYLRG---DVGKHREPN-YSRGIRQKLYKLAVDNEWAKK 669

Query: 252 --------FDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 303
                   F+I      +Y E + R++FC    G   +SPR  +AV+ GC+P+II D+  
Sbjct: 670 HRIFIGEQFEIQ----GSYGEHLSRSLFCAVVPG-DGYSPRFEDAVLHGCLPLIIVDNTH 724

Query: 304 LPFADAIPWEEIGMFVAEEDVPK-LDTILTSIPPEVILRKQRLLA 347
           + F   I  +   + ++E  + + L  +LT+I P+ I R QR L+
Sbjct: 725 VLFESIIDVDSFSLRISEAALNEYLPHLLTAISPDQIARMQRRLS 769


>gi|109946595|gb|ABG48476.1| At3g42180 [Arabidopsis thaliana]
          Length = 425

 Score =  112 bits (279), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 88/309 (28%), Positives = 141/309 (45%), Gaps = 36/309 (11%)

Query: 79  SSPVRTLNPEEADWFYTPIYPTCDLT-----PTGLPLPFKSPRMMR---SAIQLISSNWP 130
           S   R   PEEA  F+ P +   ++      P   P  F   R+ R     + +++   P
Sbjct: 129 SGRFRASRPEEAHAFFLP-FSVANIVHYVYQPITSPADFNRARLHRIFNDYVDVVAHKHP 187

Query: 131 YWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSI 190
           +WN++ GADHF V  HD+       + +  +  +  L    T   + G R ++  +   I
Sbjct: 188 FWNQSNGADHFMVSCHDWAPDVPDSKPEFFKNFMRGLCNANT---SEGFRRNIDFSIPEI 244

Query: 191 TIPP--YAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN 248
            IP     PP   Q     P+  R+I  +F G  +        GY     R  ++ ++K 
Sbjct: 245 NIPKRKLKPPFMGQN----PEN-RTILAFFAGRAH--------GY----IREVLFSHWKG 287

Query: 249 NPLFDISTDHPT---TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
                   DH T    Y+E +  + FCLCP G+   SPR VEA+  GC+PV+I+D+  LP
Sbjct: 288 KDKDVQVYDHLTKGQNYHELIGHSKFCLCPSGYEVASPREVEAIYSGCVPVVISDNYSLP 347

Query: 306 FADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
           F D + W +  + +  + +P +  IL  IP +  LR  R +    ++R  +  +PAQP D
Sbjct: 348 FNDVLDWSKFSVEIPVDKIPDIKKILQEIPHDKYLRMYRNVM--KVRRHFVVNRPAQPFD 405

Query: 366 AFHQILNGL 374
             H IL+ +
Sbjct: 406 VIHMILHSV 414


>gi|26451681|dbj|BAC42936.1| unknown protein [Arabidopsis thaliana]
          Length = 270

 Score =  112 bits (279), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 78/266 (29%), Positives = 131/266 (49%), Gaps = 29/266 (10%)

Query: 116 RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ 175
           + +++ + +ISS + +WN+T G+DHF V  HD+       E +      +  L  + + +
Sbjct: 19  QFLKNYLDMISSKYSFWNKTGGSDHFLVACHDWAP----SETRQYMAKCIRALCNSDVSE 74

Query: 176 TFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYA 235
            F     V L E +I +P     + ++A    P + R I  +F G  +        GY  
Sbjct: 75  GFVFGKDVALPETTILVPR----RPLRALGGKPVSQRQILAFFAGGMH--------GY-- 120

Query: 236 RGARAAVWENFKNN--PLFDISTDHPTT-----YYEDMQRAIFCLCPLGWAPWSPRLVEA 288
              R  + +N+  N  P   I ++ P +     Y E M+ + +C+CP G    SPR+VEA
Sbjct: 121 --LRPLLLQNWGGNRDPDMKIFSEIPKSKGKKSYMEYMKSSKYCICPKGHEVNSPRVVEA 178

Query: 289 VVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLAN 348
           + + C+PVII+D+ V PF + + WE   +FV E+D+P L  IL SI  E   R+ ++   
Sbjct: 179 LFYECVPVIISDNFVPPFFEVLNWESFAVFVLEKDIPDLKNILVSITEER-YREMQMRVK 237

Query: 349 PSMKRAMLFPQPAQPGDAFHQILNGL 374
              K  +   +P +  D FH IL+ +
Sbjct: 238 MVQKHFLWHSKPER-FDIFHMILHSI 262


>gi|6735291|emb|CAB68119.1| putative protein [Arabidopsis thaliana]
          Length = 340

 Score =  112 bits (279), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 88/309 (28%), Positives = 141/309 (45%), Gaps = 36/309 (11%)

Query: 79  SSPVRTLNPEEADWFYTPIYPTCDLT-----PTGLPLPFKSPRMMR---SAIQLISSNWP 130
           S   R   PEEA  F+ P +   ++      P   P  F   R+ R     + +++   P
Sbjct: 44  SGRFRASRPEEAHAFFLP-FSVANIVHYVYQPITSPADFNRARLHRIFNDYVDVVAHKHP 102

Query: 131 YWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSI 190
           +WN++ GADHF V  HD+       + +  +  +  L    T   + G R ++  +   I
Sbjct: 103 FWNQSNGADHFMVSCHDWAPDVPDSKPEFFKNFMRGLCNANT---SEGFRRNIDFSIPEI 159

Query: 191 TIPP--YAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN 248
            IP     PP   Q     P+  R+I  +F G  +        GY     R  ++ ++K 
Sbjct: 160 NIPKRKLKPPFMGQN----PEN-RTILAFFAGRAH--------GY----IREVLFSHWKG 202

Query: 249 NPLFDISTDHPT---TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
                   DH T    Y+E +  + FCLCP G+   SPR VEA+  GC+PV+I+D+  LP
Sbjct: 203 KDKDVQVYDHLTKGQNYHELIGHSKFCLCPSGYEVASPREVEAIYSGCVPVVISDNYSLP 262

Query: 306 FADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
           F D + W +  + +  + +P +  IL  IP +  LR  R +    ++R  +  +PAQP D
Sbjct: 263 FNDVLDWSKFSVEIPVDKIPDIKKILQEIPHDKYLRMYRNVM--KVRRHFVVNRPAQPFD 320

Query: 366 AFHQILNGL 374
             H IL+ +
Sbjct: 321 VIHMILHSV 329


>gi|413952956|gb|AFW85605.1| hypothetical protein ZEAMMB73_586394 [Zea mays]
          Length = 509

 Score =  111 bits (278), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 89/306 (29%), Positives = 138/306 (45%), Gaps = 36/306 (11%)

Query: 82  VRTLNPEEADWFYTPIYPTCDLTPTGLPLPF-KSP--RMMRSAIQLISSNWPYWNRTEGA 138
           VRT  P  A  F+ P   +  +     P  + K+P   ++   +++++S  PYWNR+ GA
Sbjct: 216 VRTSEPARAHAFFLPFSVSQMVQFAYRPNTYDKTPLRAIVADYVRVVASRHPYWNRSAGA 275

Query: 139 DHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPP 198
           DHF +  HD+G        +    GI  L                C    S    P+   
Sbjct: 276 DHFMLACHDWGPEASTGHPELHANGIRAL----------------CNANSSEGFRPWQDV 319

Query: 199 QKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAV-------WENFKNN-- 249
                +    D PR +     G    V + P   ++A G    +       W+    +  
Sbjct: 320 SVPDINLYDGDMPRQLLAPAPG----VTSRPFLAFFAGGRHGHIRDLLLRHWKGRDPDFF 375

Query: 250 PLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 309
           P+++   +    YY  M+RA FCLCP G+   SPR+VE++   C+PVI++D  VLPFAD 
Sbjct: 376 PVYEHRHEDGFDYYSFMRRARFCLCPSGYQVASPRVVESIHAECVPVILSDGYVLPFADV 435

Query: 310 IPWEEIGMFVAEEDVPKLDTILTSIP-PEVILRKQRLLANPSMKRAMLFPQPAQPGDAFH 368
           + WE   + V   D+P+L  +L  IP PEV   +Q +     +KR  +  QP Q  D F+
Sbjct: 436 LRWEAFSVAVPVADIPRLREVLERIPAPEVEKLQQGVRL---VKRHFMLNQPPQRLDMFN 492

Query: 369 QILNGL 374
            IL+ +
Sbjct: 493 MILHSV 498


>gi|145339080|ref|NP_189804.4| putative glycosyltransferase [Arabidopsis thaliana]
 gi|292630777|sp|Q3EAR7.2|GLYT2_ARATH RecName: Full=Probable glycosyltransferase At3g42180
 gi|332644208|gb|AEE77729.1| putative glycosyltransferase [Arabidopsis thaliana]
          Length = 470

 Score =  111 bits (278), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 88/309 (28%), Positives = 141/309 (45%), Gaps = 36/309 (11%)

Query: 79  SSPVRTLNPEEADWFYTPIYPTCDLT-----PTGLPLPFKSPRMMR---SAIQLISSNWP 130
           S   R   PEEA  F+ P +   ++      P   P  F   R+ R     + +++   P
Sbjct: 174 SGRFRASRPEEAHAFFLP-FSVANIVHYVYQPITSPADFNRARLHRIFNDYVDVVAHKHP 232

Query: 131 YWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSI 190
           +WN++ GADHF V  HD+       + +  +  +  L    T   + G R ++  +   I
Sbjct: 233 FWNQSNGADHFMVSCHDWAPDVPDSKPEFFKNFMRGLCNANT---SEGFRRNIDFSIPEI 289

Query: 191 TIPP--YAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN 248
            IP     PP   Q     P+  R+I  +F G  +        GY     R  ++ ++K 
Sbjct: 290 NIPKRKLKPPFMGQN----PEN-RTILAFFAGRAH--------GY----IREVLFSHWKG 332

Query: 249 NPLFDISTDHPT---TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
                   DH T    Y+E +  + FCLCP G+   SPR VEA+  GC+PV+I+D+  LP
Sbjct: 333 KDKDVQVYDHLTKGQNYHELIGHSKFCLCPSGYEVASPREVEAIYSGCVPVVISDNYSLP 392

Query: 306 FADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
           F D + W +  + +  + +P +  IL  IP +  LR  R +    ++R  +  +PAQP D
Sbjct: 393 FNDVLDWSKFSVEIPVDKIPDIKKILQEIPHDKYLRMYRNVM--KVRRHFVVNRPAQPFD 450

Query: 366 AFHQILNGL 374
             H IL+ +
Sbjct: 451 VIHMILHSV 459


>gi|302825130|ref|XP_002994198.1| glycosyltransferase-like protein [Selaginella moellendorffii]
 gi|300137940|gb|EFJ04733.1| glycosyltransferase-like protein [Selaginella moellendorffii]
          Length = 762

 Score =  111 bits (278), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 104/384 (27%), Positives = 164/384 (42%), Gaps = 67/384 (17%)

Query: 22  ERISGSAGDVLEDDPVGKLK--VYVYELPSKYNKKLLLKDP---RCLTHMF--------- 67
           +R S S    +++  V K +  +Y+Y+LP++YN  LL       +C+T ++         
Sbjct: 300 QRSSESDDQQVQEISVEKRRPLIYIYDLPAEYNSHLLEGRHFKFQCVTRVYDGVNATFWS 359

Query: 68  ----AAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPT------------GLPLP 111
                AE+     LL+SP RT+N +EAD+F+ P+   C +T              GL   
Sbjct: 360 EYLEGAELAFLEGLLASPHRTMNGDEADYFFAPVLGACAITRADDAPHFSMEKHMGLRGY 419

Query: 112 FKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQE----EKAIERGILPL 167
           F S  + ++A   I   +P+WNR+ G DH ++ P D GAC   +E       +  G    
Sbjct: 420 F-SGELYKNAYMHIKEQYPFWNRSSGRDHIWLFPWDEGACSAPKEIWNGTMLVHWGNTNS 478

Query: 168 LQRATLVQTFG----------QRNHVCLNE-GSITIPPYAPPQKMQAHQ---IPPDTPRS 213
             + +    F           + +H C +    I +P +  P      +     P   R 
Sbjct: 479 KHKKSTTGYFADSWDDIPKEWRGDHPCYDPLKDIVLPAWKNPDPRSVAERFWSRPREERK 538

Query: 214 IFVYFRGLF---YDVNNDPEGGYYARGARAAVWENFKNNPLFD------------ISTDH 258
              YF G     YD    PE   Y+ G R  V E F + P               ++   
Sbjct: 539 TLFYFNGNLGKGYDFGR-PE-DRYSMGIRQRVAEEFGSTPNNHGKLGRQAAPDVVVTPQR 596

Query: 259 PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
              Y +++  + FC    G   WS R+ +AV+ GCIPVII D I LP+   + +E   + 
Sbjct: 597 SDDYAKELSSSRFCGVFPG-DGWSGRMEDAVLHGCIPVIIQDGIHLPYESLLDYESFTVR 655

Query: 319 VAEEDVPKLDTILTSIPPEVILRK 342
           VAE+ +P+L TIL +I    +  K
Sbjct: 656 VAEDKIPELITILRNISNAEVESK 679


>gi|356554237|ref|XP_003545455.1| PREDICTED: probable glycosyltransferase At5g25310-like [Glycine
           max]
          Length = 483

 Score =  111 bits (278), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 90/302 (29%), Positives = 136/302 (45%), Gaps = 31/302 (10%)

Query: 83  RTLNPEEADWFYTPIYPTCDLTPTGLPLPFKS---PRMMRSAIQLISSNWPYWNRTEGAD 139
           RT +P  A  ++ P   T  +     PL F      + +   +++IS+  P+WN T GAD
Sbjct: 193 RTNDPNAAHVYFLPFSVTWMVKYLYTPLSFNVTPLKQFVSDYVRVISTRHPFWNITHGAD 252

Query: 140 HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQ 199
           HF +  HD+G     Q    +    + +L  A   + F  R  V L E    I  Y    
Sbjct: 253 HFMLACHDWGP-HASQGNPFLYNTSIRVLCNANTSEGFNPRKDVSLPE----IHLYGGEV 307

Query: 200 KMQAHQIPPDT-PRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDH 258
             +    PPDT PR    +F G  +               R A+  ++KN+   DI    
Sbjct: 308 SPKLLSPPPDTAPRRYLAFFSGGLHG------------PIRPALLRHWKNDNDDDIRVYE 355

Query: 259 ----PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEE 314
                  YY  M  + FCLCP G    SPR+VEA+   C+PVI+++  VLPF+D + WE 
Sbjct: 356 YLPKDLDYYSFMLNSKFCLCPSGHEVASPRIVEAIYAECVPVILSEYYVLPFSDVLQWEA 415

Query: 315 IGMFVAEEDVPKLDTILTSIPPEVILRKQRLLAN--PSMKRAMLFPQPAQPGDAFHQILN 372
             + V   D+P+L  IL++I  +    K R L     +++      +PA+  D FH IL+
Sbjct: 416 FSVQVDVSDIPRLKEILSAISED----KYRKLKEGVKAVRGHFTLNRPAKRFDVFHMILH 471

Query: 373 GL 374
            +
Sbjct: 472 SI 473


>gi|224092292|ref|XP_002309546.1| predicted protein [Populus trichocarpa]
 gi|222855522|gb|EEE93069.1| predicted protein [Populus trichocarpa]
          Length = 417

 Score =  111 bits (278), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 102/391 (26%), Positives = 168/391 (42%), Gaps = 48/391 (12%)

Query: 4   TAIALLLAFVSADNTPKIERISGSAGDVLEDDPVGK---------LKVYVYELPSKYNKK 54
           +++ +L A    +N+P I+  +     +  +  + +         LKVYVY+      +K
Sbjct: 44  SSVEMLFAKSQIENSPLIKNETRLYAPIYRNVSMFRRSYELMEKMLKVYVYQ----DGEK 99

Query: 55  LLLKDPRCLTHMFAAE-IFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFK 113
            +   P  L  ++A+E  FM     +    T +P +A  FY P   +  L    L +   
Sbjct: 100 PIFHQP-ILDGIYASEGWFMKHMEANENFVTKDPGKAHLFYLPF--SSRLLELTLYVRHS 156

Query: 114 SPRM-----MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLL 168
             R      MR+   +I++ + +WNRT GADHF    HD+       E +      +  L
Sbjct: 157 HSRTNLIEYMRNYAGMIAAKYHFWNRTGGADHFVAACHDWAPA----ETRGPLLNCIRAL 212

Query: 169 QRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNND 228
             A +   F     V L E  +      P + ++ +   P + R I  +F G  +     
Sbjct: 213 CNADIEVGFSIGKDVSLPETYVR-SAQNPLKNLEGN---PPSQRPILAFFAGNMHG---- 264

Query: 229 PEGGYYARGARAAVWENFKNNPLFDISTDHP-----TTYYEDMQRAIFCLCPLGWAPWSP 283
                Y R      W N   +P   I    P     T Y + M+ + FC+CP G    SP
Sbjct: 265 -----YVRPVLLDYWGN--KDPDMKIFGPMPHVKGNTNYIQHMKSSKFCICPRGHEVNSP 317

Query: 284 RLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQ 343
           R+VEA+   C+PVII+D+ V PF + + WE   + V E+D+P L  IL SI  E  +   
Sbjct: 318 RIVEAIFLECVPVIISDNFVPPFFEVLDWESFAVIVLEKDIPNLKNILVSISEEKYIEMH 377

Query: 344 RLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           + +    +++  L+    +  D FH IL+ +
Sbjct: 378 KRVK--KVQQHFLWHSKPEKYDLFHMILHSV 406


>gi|357520911|ref|XP_003630744.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
 gi|355524766|gb|AET05220.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
          Length = 653

 Score =  111 bits (277), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 97/351 (27%), Positives = 152/351 (43%), Gaps = 48/351 (13%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEI-FMHRFLLSSPVRTLNPEEADWFYTP-- 96
           LKVY+Y        K +   P  L  ++A+E  FM     +      +P +A  FY P  
Sbjct: 322 LKVYIY----MEGNKPIFHQP-ILKGLYASEGWFMKLMEENKQFVVKDPAKAHLFYMPFS 376

Query: 97  -------IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFG 149
                  +Y       T L       + ++     IS+ + Y+NRT GADHF V  HD+ 
Sbjct: 377 SRMLEFSVYVRNSHNRTNLR------QYLKEYTDKISAKYRYFNRTGGADHFLVACHDWA 430

Query: 150 ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPD 209
               Y+    +E  I  L   + + Q F     V L E    +     PQ+    +  P 
Sbjct: 431 P---YETRHHMEYCIKALCN-SDVTQGFKIGRDVSLPE--TMVRSVRNPQRDLGGK--PP 482

Query: 210 TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDI------STDHPTTYY 263
             RSI  ++ G  +          Y R      W+  + +P   I         H   Y 
Sbjct: 483 QQRSILAFYAGNMHG---------YLRSILLKHWK--EKDPDMKIFGPMPHGVAHKMNYI 531

Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
           E M+ + +C+CP G+   SPR+VEA+ + C+PVII+D+ V PF + + W+   + +AE+D
Sbjct: 532 EHMKSSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEVLNWDSFSLILAEKD 591

Query: 324 VPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           +P L  IL S+P E  L+ Q  L    +++  L+       D FH  L+ +
Sbjct: 592 IPNLKQILLSVPEEKYLKLQ--LGVRRVQKHFLWHTKPLKYDLFHMTLHSI 640


>gi|224090294|ref|XP_002308967.1| predicted protein [Populus trichocarpa]
 gi|222854943|gb|EEE92490.1| predicted protein [Populus trichocarpa]
          Length = 793

 Score =  111 bits (277), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 101/397 (25%), Positives = 168/397 (42%), Gaps = 78/397 (19%)

Query: 28  AGDVLEDDPVGKLK---VYVYELPSKYNKKLLLKDP---RCLT-------------HMFA 68
            G +++ + V K K   +Y+Y+LP K+N  LL        C+               ++ 
Sbjct: 334 TGKLVDLNAVVKKKRPLIYIYDLPPKFNSLLLEGRHFKFECVNRLYNDNNATIWTDQLYG 393

Query: 69  AEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLP--LPFKSPRMMRSAIQL-- 124
           A++ ++  +L+SP RTLN EEAD+F+ P+  +C +T       L  +    +RS++ L  
Sbjct: 394 AQMALYESILASPYRTLNGEEADFFFVPVLDSCIITRADDAPHLSMEQHLGLRSSLTLEF 453

Query: 125 -------ISSNWPYWNRTEGADHFFVVPHDFGACFHYQE--------------EKAIERG 163
                  I  ++P+WNR+ G DH +    D GAC+  +E               K     
Sbjct: 454 YRKAYDHIVEHYPFWNRSSGRDHLWSFSWDEGACYAPKEIWNSMMVVHWGNTNSKHNHST 513

Query: 164 ILPLLQRATLVQTFGQRNHVCLN-EGSITIPPYAPPQ------KMQAHQIPPDTPRSIFV 216
                     + +  +  H C + +  + +P +  P       K+ A    P   R    
Sbjct: 514 TAYWADNWDKISSDRRGKHPCFDPDKDLVLPAWKRPDVNALSTKLWAR---PLEKRKTLF 570

Query: 217 YFRGLF--YDVNNDPEGGYYARGARAAVWENFKNNPLFD------------ISTDHPTTY 262
           YF G      +N  PE   Y+ G R  + E F + P  D            +S     +Y
Sbjct: 571 YFNGNLGPAYLNGRPE-ALYSMGIRQKLAEEFGSTPNKDGNLGKQHAENVIVSPLRSESY 629

Query: 263 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
           +ED+  ++FC    G   WS R+ ++++ GCIPV+I D I LP+ + + +E   + + E+
Sbjct: 630 HEDLASSVFCGVMPG-DGWSGRMEDSILQGCIPVVIQDGIYLPYENVLNYESFAVRILED 688

Query: 323 DVPKLDTILTSIPPEVILRK--------QRLLANPSM 351
           ++P L  IL       I  K        QR L   SM
Sbjct: 689 EIPNLIKILQGFNETEIENKLTSVQKIGQRFLYRDSM 725


>gi|115451051|ref|NP_001049126.1| Os03g0174300 [Oryza sativa Japonica Group]
 gi|108706455|gb|ABF94250.1| exostosin family protein, putative, expressed [Oryza sativa
           Japonica Group]
 gi|113547597|dbj|BAF11040.1| Os03g0174300 [Oryza sativa Japonica Group]
 gi|215695509|dbj|BAG90700.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 576

 Score =  111 bits (277), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 95/376 (25%), Positives = 161/376 (42%), Gaps = 65/376 (17%)

Query: 17  NTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLL------------LKDPRCLT 64
           N P ++ IS S+ +V       +  +YVY+LP++++  LL            + D +  T
Sbjct: 192 NLPDLKNISSSSINVKVVAKKKRPLIYVYDLPAEFDSHLLEGRHYKFQCVNRIYDDKNRT 251

Query: 65  ----HMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMM-- 118
                ++ A+I ++  +L+SP RTLN +EAD+FY P   +C +T +      + PR +  
Sbjct: 252 IWTQQLYGAQIALYESILASPHRTLNGDEADYFYVPALDSCLITRSDDAPHLQMPRDLRL 311

Query: 119 ---------RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFH----YQEEKAIERGIL 165
                    R     I+  +PYWNRT G DH +    D GAC+     +     +  G  
Sbjct: 312 RSYHTLEYYRMTYDHIAQRYPYWNRTSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWGNT 371

Query: 166 PLLQRATLVQTFGQR----------NHVCLN-EGSITIPPYAPPQ------KMQAHQIPP 208
               + +    +             NH C +    + +P +  P       K+ A     
Sbjct: 372 NTKHKNSTTAYWADNWNYIPIDRRGNHPCFDPRKDLVLPAWKQPNPAAIWLKLWART--- 428

Query: 209 DTPRSIFVYFRGLFYDVNNDPEG-GYYARGARAAVWENFKNNP----------LFDISTD 257
              R+   YF G       D      Y+ G R  +   F + P            +++  
Sbjct: 429 RNNRTTLFYFNGNLGPAYKDGRHEDTYSMGIRQKLAAEFGSTPDKQGKLGRQHTANVTVT 488

Query: 258 HPTT--YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEI 315
           +  T  YYE++  +IFC    G   WS R+ ++++ GCIPVII D I+LP+ + + +   
Sbjct: 489 YLRTEKYYEELASSIFCGVLPG-DGWSGRMEDSMLQGCIPVIIQDGILLPYENMLNYNSF 547

Query: 316 GMFVAEEDVPKLDTIL 331
            + + E+D+P L  IL
Sbjct: 548 AVRIQEDDIPNLIRIL 563


>gi|255550904|ref|XP_002516500.1| catalytic, putative [Ricinus communis]
 gi|223544320|gb|EEF45841.1| catalytic, putative [Ricinus communis]
          Length = 456

 Score =  111 bits (277), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 84/301 (27%), Positives = 136/301 (45%), Gaps = 33/301 (10%)

Query: 84  TLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPR-----MMRSAIQLISSNWPYWNRTEGA 138
           T +P+EA  FY P   +  L    L +     R      M++   +I++ +P+W+RT GA
Sbjct: 168 TKDPKEAHLFYIPF--SSRLLELTLYVRKSHSRNNLIEYMKNYTDMIAAKYPFWSRTGGA 225

Query: 139 DHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPP 198
           DHF    HD+       E +      +  L  A +   F     V L E  +      P 
Sbjct: 226 DHFVAACHDWAPA----ETRGRMLNCIRALCNADIDVGFRIGKDVSLPETYVR-SAQNPL 280

Query: 199 QKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDH 258
           + +  +   P + R I  +F G  +          + R      WEN   +P   I    
Sbjct: 281 KNLDGN---PPSQRPILAFFAGNVHG---------FVRPILLEYWEN--KDPEMKIFGPM 326

Query: 259 P-----TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE 313
           P     T Y + M+ + +C+CP G    SPR+VE++ + C+PVII+D+ V PF + + WE
Sbjct: 327 PRVKGNTNYIQLMKSSKYCICPRGHEVNSPRIVESIFYECVPVIISDNYVPPFFEVLDWE 386

Query: 314 EIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNG 373
              +FV E+D+P L  IL SIP E  +   + +    +++  L+    +  D FH IL+ 
Sbjct: 387 SFAVFVLEKDIPNLKKILLSIPEETYVEMHKRVK--KVQQHFLWHSEPEKHDLFHMILHS 444

Query: 374 L 374
           +
Sbjct: 445 V 445


>gi|449527873|ref|XP_004170933.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
           sativus]
          Length = 466

 Score =  111 bits (277), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 89/318 (27%), Positives = 146/318 (45%), Gaps = 43/318 (13%)

Query: 67  FAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTC-----DLTPTGLPLPFKSPRMMRSA 121
           +A+E +  + L+ S   T +P+EAD+F+ P   T       ++ +G+P        +R  
Sbjct: 169 YASESYFKKSLIKSHFITNDPKEADFFFLPFSITGLRNDRRVSVSGIP------NFIRDY 222

Query: 122 IQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRN 181
           I  +S  +PYWNRT GADHF+V  H  G     +  +A          ++++VQ     +
Sbjct: 223 IFDVSHKYPYWNRTGGADHFYVACHSVGRSAMDKSSEA----------KSSIVQVVCSSS 272

Query: 182 HVCLNEGSITIPPYAPPQKMQAHQIPPD---TPRSIFVYFRGLFYDVNNDPEGGYYARGA 238
           +     G I+    A PQ     + P +   + R+   +F G      N P      R A
Sbjct: 273 YFL--TGYISHKDAALPQIWPRKEDPSNLASSKRTRLAFFAGAM----NSP-----TRQA 321

Query: 239 RAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 298
              VW   K++ +F  S    T Y +++ R+ FCL   G+   + R+ +++ +GC+PVII
Sbjct: 322 LVQVWG--KDSEIFAYSGRLKTPYADELLRSKFCLHVKGFEVNTARVGDSIFYGCVPVII 379

Query: 299 ADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFP 358
           A+   LPF D + W+   + V   D+P+L  IL  I  E   R    L +  +K    F 
Sbjct: 380 ANYYDLPFGDILNWKSFSIVVTTSDIPRLKEILKGINDEEYAR----LQSNVLKVRKHFK 435

Query: 359 QPAQP--GDAFHQILNGL 374
             + P   D FH ++  L
Sbjct: 436 WHSSPVDYDTFHMVMYQL 453


>gi|240256346|ref|NP_197913.4| putative glycosyltransferase [Arabidopsis thaliana]
 gi|292630776|sp|Q3E7Q9.2|GLYT6_ARATH RecName: Full=Probable glycosyltransferase At5g25310
 gi|332006042|gb|AED93425.1| putative glycosyltransferase [Arabidopsis thaliana]
          Length = 480

 Score =  110 bits (276), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 99/354 (27%), Positives = 160/354 (45%), Gaps = 51/354 (14%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLS-----SPVRTLNPEEADWF 93
           + KVYVYE      +  L+ D  C + ++A E    RF+       +  RT +P +A  +
Sbjct: 150 RFKVYVYE----EGEPPLVHDGPCKS-VYAVE---GRFITEMEKRRTKFRTYDPNQAYVY 201

Query: 94  YTPIYPTCDL----TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFG 149
           + P   T  +           P K+   +   I+L+S+N P+WNRT GADHF +  HD+G
Sbjct: 202 FLPFSVTWLVRYLYEGNSDAKPLKT--FVSDYIRLVSTNHPFWNRTNGADHFMLTCHDWG 259

Query: 150 ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPD 209
                Q  + +    + ++  A   + F     V L E  I +       K++  +    
Sbjct: 260 P-LTSQANRDLFNTSIRVMCNANSSEGFNPTKDVTLPE--IKLYGGEVDHKLRLSKTLSA 316

Query: 210 TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAV-------WENFK-NNPLFDISTDHPTT 261
           +PR                P  G++A G    V       W+    + P+++    H   
Sbjct: 317 SPR----------------PYLGFFAGGVHGPVRPILLKHWKQRDLDMPVYEYLPKH-LN 359

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           YY+ M+ + FC CP G+   SPR++EA+   CIPVI++ + VLPF D + WE   + V  
Sbjct: 360 YYDFMRSSKFCFCPSGYEVASPRVIEAIYSECIPVILSVNFVLPFTDVLRWETFSVLVDV 419

Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPS-MKRAMLFPQPAQPGDAFHQILNGL 374
            ++P+L  IL SI  E   + + L +N   ++R      P Q  DAFH  L+ +
Sbjct: 420 SEIPRLKEILMSISNE---KYEWLKSNLRYVRRHFELNDPPQRFDAFHLTLHSI 470


>gi|159478515|ref|XP_001697348.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158274506|gb|EDP00288.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 620

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 111/397 (27%), Positives = 164/397 (41%), Gaps = 67/397 (16%)

Query: 28  AGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNP 87
           AG     +P G  K+YVYE P +Y+   LL   R L       I +   L+S  +RT+NP
Sbjct: 145 AGQGYRPNPRGP-KIYVYEFPPQYHVWSLLWLDRPLN------IILWERLMSLGLRTVNP 197

Query: 88  EEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHD 147
           EEAD+F+    P C          FK          ++     YWNR  G DH      D
Sbjct: 198 EEADYFF---LPGCGRGCNKWDQKFK---------YIMEHYAKYWNRRNGRDHLMTHAGD 245

Query: 148 FGACFHYQEEKAIERGILPLLQRATLVQTFG-------QRNHVCLN-----EGSITIPPY 195
           +G C     EKA      P +   T++  +G       +  H   N     +  I +P  
Sbjct: 246 WGRC-----EKAWGPDSAPFIANLTMLNHWGITVDRSQETEHDLFNACHIPDQDIQVPVL 300

Query: 196 AP---PQ--------KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPE----GGYYARGARA 240
                PQ        K +AH +     +++     G     N+  E      YY+ G RA
Sbjct: 301 CGDLYPQFEHNVWHPKRRAHPVN----KTVLASVAGSICGWNSAEEPPCKNKYYSFGVRA 356

Query: 241 AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 300
           A+W   ++ P F I+   P    + M  + FC  P G A    R V +V  GC+PVII+D
Sbjct: 357 ALWTTLRDKPGFHIAKRVPV-LGQSMAESEFCFAPTG-AGHGKRQVVSVTLGCMPVIISD 414

Query: 301 DIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAM----- 355
            +  PF   + W + G+++AE D+P ++ IL    P+    K + L   +   A      
Sbjct: 415 HVAQPFEPFLDWNDFGVWIAEADLPDVEAILRGFTPQQKAAKMKKLYCAARHVAYTTVFG 474

Query: 356 -LFPQPAQPGDAFHQILNGLARKLPHDKSTYL--KPG 389
            LF       DA   I N L  ++  D + ++  KPG
Sbjct: 475 GLFDGDTGEYDAIATIFNIL--RMDQDFADFMACKPG 509


>gi|356546130|ref|XP_003541484.1| PREDICTED: probable glycosyltransferase At5g20260-like [Glycine
           max]
          Length = 468

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 78/307 (25%), Positives = 140/307 (45%), Gaps = 34/307 (11%)

Query: 80  SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSP---------RMMRSAIQLISSNWP 130
           SP +  NP+EA  F+ P +   ++       P+ S          R++   I +++  +P
Sbjct: 171 SPFKAKNPDEAHAFFLP-FSVVNVVHYAYK-PYMSQNDYRRDRLQRLVEDYIVVVADKYP 228

Query: 131 YWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSI 190
           YWNR+ GADHF +  HD+     +       +  + +L  A   + F  +  V + E  +
Sbjct: 229 YWNRSNGADHFLLSCHDWAPEISHANPDLF-KNFIRVLCNANNSEGFQPKRDVSIPEVYL 287

Query: 191 TIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYA--RGARAAVWENFKN 248
           ++    PP   Q         R+I  +F            GG +   R      W++  N
Sbjct: 288 SVGKLGPPNLGQH-----PMNRTILAFF-----------SGGAHGDIRKLLLKHWKDKDN 331

Query: 249 NPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
                        Y E M  + FCLCP G+   SPR+VEA+   C+PVII+++  LP +D
Sbjct: 332 QVQVHEYLPKGQNYTELMGLSKFCLCPSGYEVASPRVVEAINAVCVPVIISENYSLPLSD 391

Query: 309 AIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLAN-PSMKRAMLFPQPAQPGDAF 367
            + W +  + ++ E++P + TIL ++  +   + ++L  N   ++R  +  +PA+P D  
Sbjct: 392 VLNWSQFSIQISVENIPDIKTILQNVTQK---KYKKLYRNVRRVRRHFVMHRPAKPFDLM 448

Query: 368 HQILNGL 374
           H I++ +
Sbjct: 449 HMIIHSI 455


>gi|449461995|ref|XP_004148727.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
           sativus]
 gi|449501299|ref|XP_004161331.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
           sativus]
          Length = 664

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 98/351 (27%), Positives = 152/351 (43%), Gaps = 48/351 (13%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEI-FMHRFLLSSPVRTLNPEEADWFYTP-- 96
           LK+YVY    +  KK +   P  L  ++A+E  FM     +      +P +A  FY P  
Sbjct: 335 LKIYVY----RDGKKPIFHQP-ILKGLYASEGWFMKLMEGNKRFVVKDPRKAHLFYMPFS 389

Query: 97  -------IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFG 149
                  +Y       T L       + ++   + I++ +PYWNRT GADHF    HD+ 
Sbjct: 390 SRMLEYTLYVRNSHNRTNLR------QFLKEYAENIAAKYPYWNRTGGADHFLAGCHDWA 443

Query: 150 ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPD 209
               Y+    +E  I  L   A +   F     V L E  +      P + +      P 
Sbjct: 444 P---YETRHHMEHCIKALCN-ADVTVGFKIGRDVSLPETYVR-SARNPLRDLGG---KPA 495

Query: 210 TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP------TTYY 263
           + R I  ++ G  +          Y R      W++   NP   I    P        Y 
Sbjct: 496 SQRHILAFYAGNMHG---------YVRPILLKYWKD--KNPDMKIFGPMPPGVASKMNYI 544

Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
           + M+ + +C+CP G+   SPR+VEA+ + C+PVII+D+ V PF + + WE   + VAE+D
Sbjct: 545 QHMKSSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEVLDWEAFSVIVAEKD 604

Query: 324 VPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           +P L  IL SIP +  L  Q  L    +++  L+       D FH  L+ +
Sbjct: 605 IPNLQDILLSIPKDRYLEMQ--LRVRKVQKHFLWHAKPLKYDLFHMTLHSI 653


>gi|297818734|ref|XP_002877250.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323088|gb|EFH53509.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 422

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 86/309 (27%), Positives = 141/309 (45%), Gaps = 36/309 (11%)

Query: 79  SSPVRTLNPEEADWFYTPIYPTCDLT-----PTGLPLPFKSPRMMR---SAIQLISSNWP 130
           S   R   PEEA  F+ P +   ++      P   P  F   R+ R     + +++   P
Sbjct: 126 SGRFRASRPEEAHAFFLP-FSVANIVHYVYQPITSPADFNRARLHRIFNDYVDVVARKHP 184

Query: 131 YWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSI 190
           +WN++ GADHF V  HD+       + +  +  +  L    T   T G R ++      I
Sbjct: 185 FWNQSNGADHFMVSCHDWAPDVADSKPEFFKDFMRGLCNANT---TEGFRPNI-----DI 236

Query: 191 TIPPYAPPQKMQAHQIPPDTP--RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN 248
           +IP    P++         TP  R+I  +F G  +        GY     R  ++ ++K 
Sbjct: 237 SIPEINIPKRKLKPPFMGQTPENRTILAFFAGRAH--------GY----IREVLFTHWKG 284

Query: 249 NPLFDISTDHPT---TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
                   DH T    Y+E +  + FCLCP G+   SPR VEA+  GC+PV+I+D+  LP
Sbjct: 285 KDKDVQVYDHLTKGQNYHELIGHSKFCLCPSGYEVASPREVEAIYSGCVPVVISDNYSLP 344

Query: 306 FADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
           F D + W +  + +  + +P +  IL  IP +  +R  + +    ++R  +  +PAQP D
Sbjct: 345 FNDVLDWSKFSVEIPVDKIPDIKKILQEIPHDKYIRMYQNVM--KVRRHFVVNRPAQPFD 402

Query: 366 AFHQILNGL 374
             H IL+ +
Sbjct: 403 VIHMILHSV 411


>gi|357113818|ref|XP_003558698.1| PREDICTED: uncharacterized protein LOC100844507 [Brachypodium
           distachyon]
          Length = 781

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 98/381 (25%), Positives = 162/381 (42%), Gaps = 69/381 (18%)

Query: 17  NTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLL------------LKDPRCLT 64
           N P + ++S S  +V       +  +YVY+LP++++  LL            + D +  T
Sbjct: 316 NLPDLNKLSKSPINVNAVVKKKRPLIYVYDLPAEFDSHLLEGRHFKLQCVNRIYDDKNRT 375

Query: 65  ----HMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTC----------DLTPTGLPL 110
                ++ A++ ++  +L+SP RTLN +EAD+FY P+  +C           LTP  L L
Sbjct: 376 IWTEQLYGAQMALYESILASPHRTLNGDEADYFYVPVLDSCLITRSDDAPHLLTPEDLHL 435

Query: 111 -PFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFH----YQEEKAIERGIL 165
             + +    R A   IS  + YWNRT G DH +    D GAC+     +     +  G  
Sbjct: 436 RSYHALEYYRKAYDHISQRYAYWNRTSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWGNT 495

Query: 166 PLLQRATLVQTFGQR----------NHVCLN-EGSITIPPYAPPQ------KMQAHQIPP 208
                 +    +             NH C +    + +P +  P+      K+ A    P
Sbjct: 496 NTKHENSTTAYWADNWDDIPLDRRGNHPCFDPRKDLVLPAWKVPEPGAIWLKLWAR---P 552

Query: 209 DTPRSIFVYFRGLF---YDVNNDPEGGYYARGARAAVWENFKNNPLFD------------ 253
              R+   YF G     Y+    PE   Y+ G R  +   F + P  +            
Sbjct: 553 RINRTTLFYFNGNLGPAYE-QGRPEDT-YSMGIRQKLAAEFGSTPSKEGKLGRQHTANVT 610

Query: 254 ISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE 313
           ++      YYE++  ++FC    G   WS R+ ++++ GCIPVII D I LP+ + + + 
Sbjct: 611 VTYLRSEKYYEELASSVFCGALPG-DGWSGRMEDSMLQGCIPVIIQDGIFLPYENVLNYN 669

Query: 314 EIGMFVAEEDVPKLDTILTSI 334
              + + E D+P L  IL  I
Sbjct: 670 SFAVRIQEHDIPNLIRILGGI 690


>gi|167517883|ref|XP_001743282.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163778381|gb|EDQ91996.1| predicted protein [Monosiga brevicollis MX1]
          Length = 503

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 94/333 (28%), Positives = 143/333 (42%), Gaps = 46/333 (13%)

Query: 65  HMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRM--MRSAI 122
           + +  E  + + L SSP     PEEA  F  P   T            ++     + S I
Sbjct: 196 YKYGVERRLPQVLASSPYAVQQPEEATHFLIPFQCTAHRYTVADRAGGQNAAEAGLASWI 255

Query: 123 QLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTF----- 177
             IS+ +PYWNR+ GA+HF+V  HD G+    Q  +A ++ ++ L+  A     F     
Sbjct: 256 ASISAAYPYWNRSAGANHFYVCSHDMGSSAVAQLSRAAQQNLIGLVNTADRRDGFFNVHR 315

Query: 178 --------GQRNHVCLNEG---SITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVN 226
                   G     CL  G   S+T   +A             TPR+   +  G   ++ 
Sbjct: 316 DLATAPHIGDGCDTCLQGGTRLSVTREAWA------------GTPRNRLAFMAG---NLQ 360

Query: 227 NDPEGGYYARGARAAVWENFKNNPLFDI--STDHPTTYYEDMQRAIFCLCPLGWAPWSPR 284
             P         R  + + F  +P F +   T     Y + +  + FCL   G+  W+PR
Sbjct: 361 RGP--------VRPRLRQFFDGDPDFLLVDGTLAAAHYRQALAESEFCLVVRGFRVWTPR 412

Query: 285 LVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQR 344
           L++AV  GCIPVIIAD   LPF+  + W    +FV E DVP+L  IL +   +  L +  
Sbjct: 413 LMDAVWSGCIPVIIADGYELPFSSLLHWPSFAVFVPEHDVPRLKDILLAKLSQAPLLRAN 472

Query: 345 LLANPSMKRAMLFPQPAQPGDAFHQILNGLARK 377
           LLA     + + +     P DAF  ++  LA +
Sbjct: 473 LLA---ASQYLTYHSNWVPLDAFDILMLQLAAR 502


>gi|115438799|ref|NP_001043679.1| Os01g0640600 [Oryza sativa Japonica Group]
 gi|55297176|dbj|BAD68851.1| pectin-glucuronyltransferase-like [Oryza sativa Japonica Group]
 gi|113533210|dbj|BAF05593.1| Os01g0640600 [Oryza sativa Japonica Group]
          Length = 501

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 98/350 (28%), Positives = 155/350 (44%), Gaps = 39/350 (11%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAE-IFMHRFLLSSPVRTLNPEEADWFYTPIY 98
           LKVY+Y+      ++ +   P  L+ ++A+E  FM     S      +P +A  FY P Y
Sbjct: 164 LKVYIYQ----DGRRPIFHTP-PLSGIYASEGWFMKLLKESRRFAVTDPAKAHLFYLP-Y 217

Query: 99  PTCDLTPTGLPLPFKS-----PRMMRSAIQLISSNWPYWNRTEGADHFFVVPHD-FGACF 152
            +  L    L +P           +R  ++ +++ +P+WNRT GADHF V  HD     +
Sbjct: 218 SSQQLR-ISLYVPDSHNLRPLAAYLRDFVKGLAAKYPFWNRTRGADHFLVACHDWLQGSY 276

Query: 153 HYQEEKAIERGILPLLQRATLVQ-TFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTP 211
                  + R  +  L  A   +  F     V L E +I      P + ++     P + 
Sbjct: 277 TTTAHGDLRRNTVKALCNADSSEGIFTPGRDVSLPETTIR----TPRRPLRYVGGLPVSR 332

Query: 212 RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPL-----FDISTDHPTTYYEDM 266
           R I  +F G  +            R      W + +++ +              +Y + M
Sbjct: 333 RGILAFFAGNVHG---------RVRPVLLKHWGDGRDDDMRVYGPLPARVSRRMSYIQHM 383

Query: 267 QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK 326
           + + FCLCP+G+   SPR+VEA+ + C+PVIIAD+ VLP +D + W    + VAE+DVP 
Sbjct: 384 KNSRFCLCPMGYEVNSPRIVEALYYECVPVIIADNFVLPLSDVLDWSAFAVVVAEKDVPD 443

Query: 327 LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP--GDAFHQILNGL 374
           L  IL  I     LRK   +     +    F   A+P   D FH IL+ +
Sbjct: 444 LKKILQGI----TLRKYVAMHGCVKRLQRHFLWHARPLRYDLFHMILHSI 489


>gi|449442160|ref|XP_004138850.1| PREDICTED: probable glycosyltransferase At5g25310-like [Cucumis
           sativus]
 gi|449530249|ref|XP_004172108.1| PREDICTED: probable glycosyltransferase At5g25310-like [Cucumis
           sativus]
          Length = 465

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 86/310 (27%), Positives = 145/310 (46%), Gaps = 47/310 (15%)

Query: 83  RTLNPEEADWFYTPI-------YPTCDLTPTGLPLPFKSPRMMRS-AIQLISSNWPYWNR 134
           RT++P  A   + P        Y   D +    PL     RM  S  ++++S  +P+WN+
Sbjct: 175 RTVDPSRAHVLFMPFSVAWMVKYLYKDGSYDQTPL-----RMFVSDYVEVVSKKYPFWNK 229

Query: 135 TEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNE-----GS 189
           T GADHF +  HD+G     +  + +    + +L  A   + F  +  V L E     G 
Sbjct: 230 TNGADHFILACHDWGPIA-TEGNRFLYNTSIRVLCNANSSEGFNPQKDVSLPEIHLYDGE 288

Query: 190 ITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN 249
           I+      P+ + A       P   F               GG +    R  +  ++KN 
Sbjct: 289 IS------PKLLSASNSHHHRPHLAFF-------------AGGLHG-PIRPILLNHWKNR 328

Query: 250 PLFDISTDH----PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
              +I           YY++M ++ FCLCP G+   SPR+VEA+   C+PVII++  VLP
Sbjct: 329 THTNIHVYEYLPKQLDYYDEMLQSRFCLCPSGYEVASPRIVEAIYAECVPVIISERYVLP 388

Query: 306 FADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLAN-PSMKRAMLFPQPAQPG 364
           F+D + WE   + V+  ++P+L+ IL  +  E   R ++L+    ++++  +  +PA+  
Sbjct: 389 FSDVLRWEGFSIEVSVSEIPRLEEILMGVSEE---RYEKLIQGLRTVRKHFVLNRPAKRF 445

Query: 365 DAFHQILNGL 374
           DAFH IL+ +
Sbjct: 446 DAFHMILHSV 455


>gi|296090519|emb|CBI40850.3| unnamed protein product [Vitis vinifera]
          Length = 416

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 96/350 (27%), Positives = 154/350 (44%), Gaps = 50/350 (14%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
             KVY+Y  P   +     + PR LT  +A+E +  + +  S  RT +P++A  F+ PI 
Sbjct: 94  NFKVYIY--PDG-DPNTFYQTPRKLTGKYASEGYFFQNIRDSRFRTNDPDQAHLFFIPI- 149

Query: 99  PTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
            +C               ++++ +  + S +PYWNRT GADHFFV  HD G        +
Sbjct: 150 -SCHKMRGKGTSYENMTVIVQNYVGSLISKYPYWNRTLGADHFFVTCHDVGV-------R 201

Query: 159 AIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY---APPQKMQAHQIPPDTPRSIF 215
           A E G+  L++ +  V        VC     +   P+   A PQ +Q   +P        
Sbjct: 202 ATE-GVPFLVKNSIRV--------VCSPSYDVGFIPHKDVALPQVLQPFALPAGGN---- 248

Query: 216 VYFRGLFYDVNNDPEGGYYA-------RGARAAVWENFKNNPLFDISTDHPT---TYYED 265
                   D+ N    G++A       R   A VWEN     + +   +       Y + 
Sbjct: 249 --------DIENRTTLGFWAGHRNSKIRVILARVWENDTELDIMNNRINRAAGELLYQKR 300

Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
             R  FC+CP G    S R+ +++ +GC+PVI+++   LPF D + W +  + + E DV 
Sbjct: 301 FYRTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFSVVLKELDVY 360

Query: 326 KLDTILTSIP-PEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           +L  IL  IP  E +     L+    +++   +  P    DAFH I+  L
Sbjct: 361 QLKQILKDIPDAEFVALHNNLV---QVQKHFQWNSPPIRYDAFHMIMYEL 407


>gi|443705810|gb|ELU02170.1| hypothetical protein CAPTEDRAFT_209273 [Capitella teleta]
          Length = 473

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 89/339 (26%), Positives = 142/339 (41%), Gaps = 34/339 (10%)

Query: 16  DNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHR 75
           DN+  I          LE     K+ +Y  EL  K N+ +  K        +  E    +
Sbjct: 158 DNSEAIAHKDNFGAPYLEMMSSLKIFMYTSELDDKVNRGVHWK--------YGVESLFIK 209

Query: 76  FLLSSPVRTLNPEEADWFYTPIYPTC--DLTPTGLPLPFKSPRMMRSAIQLISSNWPYWN 133
            L  S   T + EEA +F+ P       ++          +  ++ + ++ ISS + YW+
Sbjct: 210 LLSKSSFVTKDAEEAHFFFLPFQCATYRNVIRDRAAAQNFTENLVSNILKDISSRYTYWD 269

Query: 134 RTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIP 193
           R+ GADHF+V  HD GA      +  +++  + L+  A     F       +    I +P
Sbjct: 270 RSLGADHFYVCAHDMGASSVAAADANLQKNAIALVNTADYADPF------YVPHKDIALP 323

Query: 194 PYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYA----RGARAAVWENFKNN 249
           P+    K     I     +S               P   +YA     G    V++++ N+
Sbjct: 324 PHPAHGKGSLPDIGRGGGKS------------TERPNLAFYAGNLDSGQLRPVFKDWLND 371

Query: 250 PLFDISTDH--PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFA 307
               I   H     Y +++Q A FCL P G   WSP +++AV  GC+PVII+D   LP  
Sbjct: 372 SDIHIHHGHMSDNVYIKNLQSAKFCLVPRGHRVWSPVVMDAVWTGCVPVIISDYYDLPLH 431

Query: 308 DAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLL 346
             I W    +F+ E++V  L + L SIP E + R Q  +
Sbjct: 432 GLIDWTHFAVFLKEKEVLSLKSKLKSIPEEKLRRMQSYI 470


>gi|222618931|gb|EEE55063.1| hypothetical protein OsJ_02778 [Oryza sativa Japonica Group]
          Length = 482

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 98/344 (28%), Positives = 155/344 (45%), Gaps = 46/344 (13%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAE-IFMHRFLLSSPVRTLNPEEADWFYTPIY 98
           LKVY+Y+      ++ +   P  L+ ++A+E  FM     S      +P +A  FY P Y
Sbjct: 164 LKVYIYQ----DGRRPIFHTP-PLSGIYASEGWFMKLLKESRRFAVTDPAKAHLFYLP-Y 217

Query: 99  PTCDLTPTGLPLPFKS-----PRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFH 153
            +  L    L +P           +R  ++ +++ +P+WNRT GADHF V  HD+G+ + 
Sbjct: 218 SSQQLR-ISLYVPDSHNLRPLAAYLRDFVKGLAAKYPFWNRTRGADHFLVACHDWGS-YT 275

Query: 154 YQEEKAIERGILPLLQRATLVQ-TFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPR 212
                 + R  +  L  A   +  F     V L E +I      P + ++     P + R
Sbjct: 276 TTAHGDLRRNTVKALCNADSSEGIFTPGRDVSLPETTIR----TPRRPLRYVGGLPVSRR 331

Query: 213 SIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFC 272
            I  +F G  +            R      W + +++ +              M+ + FC
Sbjct: 332 GILAFFAGNVHG---------RVRPVLLKHWGDGRDDDM------------RHMKNSRFC 370

Query: 273 LCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILT 332
           LCP+G+   SPR+VEA+ + C+PVIIAD+ VLP +D + W    + VAE+DVP L  IL 
Sbjct: 371 LCPMGYEVNSPRIVEALYYECVPVIIADNFVLPLSDVLDWSAFAVVVAEKDVPDLKKILQ 430

Query: 333 SIPPEVILRKQRLLANPSMKRAMLFPQPAQP--GDAFHQILNGL 374
            I     LRK   +     +    F   A+P   D FH IL+ +
Sbjct: 431 GI----TLRKYVAMHGCVKRLQRHFLWHARPLRYDLFHMILHSI 470


>gi|359495862|ref|XP_002266299.2| PREDICTED: probable glycosyltransferase At5g03795 [Vitis vinifera]
          Length = 594

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 95/350 (27%), Positives = 153/350 (43%), Gaps = 50/350 (14%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
             KVY+Y      +     + PR LT  +A+E +  + +  S  RT +P++A  F+ PI 
Sbjct: 272 NFKVYIYP---DGDPNTFYQTPRKLTGKYASEGYFFQNIRDSRFRTNDPDQAHLFFIPI- 327

Query: 99  PTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
            +C               ++++ +  + S +PYWNRT GADHFFV  HD G        +
Sbjct: 328 -SCHKMRGKGTSYENMTVIVQNYVGSLISKYPYWNRTLGADHFFVTCHDVGV-------R 379

Query: 159 AIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY---APPQKMQAHQIPPDTPRSIF 215
           A E G+  L++ +  V        VC     +   P+   A PQ +Q   +P        
Sbjct: 380 ATE-GVPFLVKNSIRV--------VCSPSYDVGFIPHKDVALPQVLQPFALPAGGN---- 426

Query: 216 VYFRGLFYDVNNDPEGGYYA-------RGARAAVWENFKNNPLFDISTDHPTT---YYED 265
                   D+ N    G++A       R   A VWEN     + +   +       Y + 
Sbjct: 427 --------DIENRTTLGFWAGHRNSKIRVILARVWENDTELDIMNNRINRAAGELLYQKR 478

Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
             R  FC+CP G    S R+ +++ +GC+PVI+++   LPF D + W +  + + E DV 
Sbjct: 479 FYRTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFSVVLKELDVY 538

Query: 326 KLDTILTSIP-PEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           +L  IL  IP  E +     L+    +++   +  P    DAFH I+  L
Sbjct: 539 QLKQILKDIPDAEFVALHNNLV---QVQKHFQWNSPPIRYDAFHMIMYEL 585


>gi|326511980|dbj|BAJ95971.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 784

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 96/355 (27%), Positives = 155/355 (43%), Gaps = 67/355 (18%)

Query: 42  VYVYELPSKYNKKLL------------LKDPRCLT----HMFAAEIFMHRFLLSSPVRTL 85
           +YVY+LP++++  LL            + D +  T     ++ A++ ++  +L+SP RTL
Sbjct: 341 IYVYDLPAEFDSHLLEGRHYKLECVNRIYDEKNRTIWTRQLYGAQMALYESILASPHRTL 400

Query: 86  NPEEADWFYTPIYPTCDLT----------PTGLPL-PFKSPRMMRSAIQLISSNWPYWNR 134
           N +EAD+FY P+  +C +T          P  L L  + +    R A   I+  +PYWNR
Sbjct: 401 NGDEADYFYVPVLDSCLITRSDDAPHLRMPEDLRLRSYHTLEYYRKAYDHIAQRYPYWNR 460

Query: 135 TEGADHFFVVPHDFGACFHYQE-------------EKAIERGILPLLQRATLVQTFGQR- 180
           T G DH +    D GAC+  +E                 E+              F +R 
Sbjct: 461 TSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWGNTNTKHEKSTTAYWADNWDDIPFDRRG 520

Query: 181 NHVCLN-EGSITIPPYAPPQ------KMQAHQIPPDTPRSIFVYFRGLFYDVNND--PEG 231
           NH C +    + +P +  P       K+ A    P   R+   YF G       +  PE 
Sbjct: 521 NHPCFDPRKDLVLPAWKEPNPGAIWLKLWAR---PKINRTTLFYFNGNLGPAYEEGRPED 577

Query: 232 GYYARGARAAVWENFKNNP-----LFDISTDHPTT-------YYEDMQRAIFCLCPLGWA 279
             Y+ G R  +   F + P     L    T + T        YYE++  +IFC    G  
Sbjct: 578 T-YSMGIRQKLAAEFGSTPNKQGKLGRQQTANVTVTYLKSEMYYEELASSIFCGVLPG-D 635

Query: 280 PWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSI 334
            WS R+ ++++ GCIPVII D I LP+ + + +    + + E+D+P L  +L  +
Sbjct: 636 GWSGRMEDSMLQGCIPVIIQDGIFLPYENVLNYNSFSVRIQEDDIPNLIKVLQGL 690


>gi|242094892|ref|XP_002437936.1| hypothetical protein SORBIDRAFT_10g005120 [Sorghum bicolor]
 gi|241916159|gb|EER89303.1| hypothetical protein SORBIDRAFT_10g005120 [Sorghum bicolor]
          Length = 412

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 90/307 (29%), Positives = 142/307 (46%), Gaps = 29/307 (9%)

Query: 83  RTLNPEEADWFYTPIYPTCDLTPTGLPLPF-KSP--RMMRSAIQLISSNWPYWNRTEGAD 139
           RT +P  A  F+ P   +  +     P  + K+P   ++   +++++S  PYWNR+ GAD
Sbjct: 109 RTSDPARAHAFFLPFSVSKMVQFAYRPDTYDKTPLRAIVADYVRVVASRHPYWNRSAGAD 168

Query: 140 HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQ 199
           HF +  HD+G        +    GI  L   A   + F     V + E  I +     P+
Sbjct: 169 HFMLSCHDWGPEASRGHPELHANGIRALCN-ANTSEGFRPGQDVSVPE--INLYTGDMPR 225

Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN--PLFDISTD 257
           ++ A   PP   R    +F G  +          + R      W+    +  P+++   +
Sbjct: 226 QLLAPPAPPLASRPFLAFFAGGRHG---------HVRDLLLRRWKGHDPDVFPVYEHEHE 276

Query: 258 H----------PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFA 307
           H          P  YY  M+RA FCLCP G+   SPR+VEA+   C+PVI++D   LPFA
Sbjct: 277 HSHGRRQQDGAPLDYYWYMRRARFCLCPSGYEVASPRVVEAIHAECVPVILSDGYALPFA 336

Query: 308 DAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAF 367
           D + WE   + V   D+P+L  +L  IP   + R QR +    +KR  +  QP +  D F
Sbjct: 337 DVLRWEAFSVAVPVADIPRLREVLERIPAPEVERLQRGVR--LVKRHFMLHQPPERLDMF 394

Query: 368 HQILNGL 374
           + IL+ +
Sbjct: 395 NMILHSV 401


>gi|449446746|ref|XP_004141132.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
           sativus]
          Length = 466

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 89/318 (27%), Positives = 145/318 (45%), Gaps = 43/318 (13%)

Query: 67  FAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTC-----DLTPTGLPLPFKSPRMMRSA 121
           +A+E +  + L+ S   T +P+EAD+F  P   T       ++ +G+P        +R  
Sbjct: 169 YASESYFKKSLIKSHFITNDPKEADFFSLPFSITGLRNDRRVSVSGIP------NFIRDY 222

Query: 122 IQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRN 181
           I  +S  +PYWNRT GADHF+V  H  G     +  +A          ++++VQ     +
Sbjct: 223 IFDVSHKYPYWNRTGGADHFYVACHSVGRSAMDKSSEA----------KSSIVQVVCSSS 272

Query: 182 HVCLNEGSITIPPYAPPQKMQAHQIPPD---TPRSIFVYFRGLFYDVNNDPEGGYYARGA 238
           +     G I+    A PQ     + P +   + R+   +F G      N P      R A
Sbjct: 273 YFL--TGYISHKDAALPQIWPRKEDPSNLASSKRTRLAFFAGAM----NSP-----TRQA 321

Query: 239 RAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 298
              VW   K++ +F  S    T Y +++ R+ FCL   G+   + R+ +++ +GC+PVII
Sbjct: 322 LVQVWG--KDSEIFAYSGRLKTPYADELLRSKFCLHVKGFEVNTARVGDSIFYGCVPVII 379

Query: 299 ADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFP 358
           A+   LPF D + W+   + V   D+P+L  IL  I  E   R    L +  +K    F 
Sbjct: 380 ANYYDLPFGDILNWKSFSIVVTTSDIPRLKEILKGINDEEYAR----LQSNVLKVRKHFK 435

Query: 359 QPAQP--GDAFHQILNGL 374
             + P   D FH ++  L
Sbjct: 436 WHSSPVDYDTFHMVMYQL 453


>gi|255567220|ref|XP_002524591.1| catalytic, putative [Ricinus communis]
 gi|223536144|gb|EEF37799.1| catalytic, putative [Ricinus communis]
          Length = 507

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 90/348 (25%), Positives = 149/348 (42%), Gaps = 42/348 (12%)

Query: 39  KLKVYVYEL--PSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP 96
           + K+YVYE   P  Y+      D  C +   +   F+H        RTL+P+EA  ++ P
Sbjct: 176 QFKIYVYEEGGPPMYH------DGPCKSIYSSEGRFIHELEKGKLYRTLDPDEALVYFLP 229

Query: 97  IYPTCDLTPTGLPLPFKSPRMMRSA---IQLISSNWPYWNRTEGADHFFVVPHDFGACFH 153
                 +    +P   ++  + R+    I +IS+  P+WNR+ GADHF +  HD+G    
Sbjct: 230 FSVVMMVEYLYVPDSHETNAIGRAIVDYIHVISNKHPFWNRSLGADHFMLSCHDWGPRAS 289

Query: 154 YQEEKAIERGILPLLQRATLVQTFGQRN-------HVCLNEGSITIPPYAPPQKMQAHQI 206
                     I  +L  A   + F           H+   E S  +   +P +       
Sbjct: 290 SYVPHLFNSSIR-VLCNANTSEGFNPSKDASFPEIHLKTGEISGLLGGVSPSR------- 341

Query: 207 PPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM 266
                RSI  +F G  +          + R      W+N   +        +  +Y   +
Sbjct: 342 -----RSILAFFAGRLHG---------HIRQILLEQWKNKDEDVQVYDQMPNGVSYESML 387

Query: 267 QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK 326
           + + FCLCP G+   SPR+VEA+   C+PV+I+D+ V PF+D + W+   + +   D+PK
Sbjct: 388 KTSRFCLCPSGYEVASPRIVEAIYTECVPVLISDNYVPPFSDVLNWKAFSVQIQVRDIPK 447

Query: 327 LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           +  IL  I     LR QR L    ++R  +   P +  D FH  ++ +
Sbjct: 448 IKEILMGISQRQYLRMQRRLKQ--VQRHFVVNGPPKRFDMFHMTIHSI 493


>gi|297808545|ref|XP_002872156.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297317993|gb|EFH48415.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 334

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 77/261 (29%), Positives = 121/261 (46%), Gaps = 30/261 (11%)

Query: 122 IQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRN 181
           I+L+S+N P+WNRT GADHF +  HD+G     Q +  +    + ++  A   + F    
Sbjct: 86  IRLVSTNHPFWNRTNGADHFMLACHDWGP-LTSQADNDLFNTSIRVMCNANSSEGFNPSK 144

Query: 182 HVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAA 241
            V L E  +      P  K++  +    +PR                P  G++A G    
Sbjct: 145 DVTLPEIKLYGGEVDP--KLRLSKTLSASPR----------------PYLGFFAGGVHGP 186

Query: 242 VWENFKNN--------PLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGC 293
           V     N+        P+++    H   YY+ M+ + FC CP G+   SPR++EA+   C
Sbjct: 187 VRPILLNHWKQRDPDMPVYEYLPKH-LNYYDFMRSSKFCFCPSGYEVASPRVIEAIYSEC 245

Query: 294 IPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKR 353
           IPVI++ + VLPF D + WE   + V   ++P+L  IL SI  E     +R L    ++R
Sbjct: 246 IPVILSVNFVLPFTDVLRWETFSVLVDVSEIPRLKEILMSISDEKYEWLKRNLR--YVRR 303

Query: 354 AMLFPQPAQPGDAFHQILNGL 374
                 P +  DAFH  L+ +
Sbjct: 304 HFELNDPPKRFDAFHLTLHSI 324


>gi|75223627|gb|ABA18110.1| exostosin family protein [Arabidopsis arenosa]
          Length = 340

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 88/310 (28%), Positives = 142/310 (45%), Gaps = 38/310 (12%)

Query: 79  SSPVRTLNPEEADWFYTPIYPTCDLT-----PTGLPLPFKSPRMMR---SAIQLISSNWP 130
           S   R   PEEA  F+ P +   ++      P   P  F   R+ R     + +++   P
Sbjct: 44  SGRFRASRPEEAHAFFLP-FSVANIVHYVYQPITSPADFNRARLHRIFNDYVDVVACKHP 102

Query: 131 YWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSI 190
           +WN++ GADHF V  HD+       + +  +  +  L    T   T G R ++  +   I
Sbjct: 103 FWNQSNGADHFMVSCHDWAPDVADSKPEFFKDFMRGLCNANT---TEGFRPNIDFSIPEI 159

Query: 191 TIPP--YAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN 248
            IP     PP   Q     P+  R+I  +F G  +        GY     R  ++ ++K 
Sbjct: 160 NIPKRKLKPPFMGQT----PEN-RTILAFFAGRAH--------GY----IREVLFTHWKG 202

Query: 249 NPLFDISTDHPT---TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
                   DH T    Y+E +  + FCLCP G+   SPR VEA+  GC+PV+I+D+  LP
Sbjct: 203 KDKDVQVYDHLTKGQNYHELIGHSKFCLCPSGYEVASPREVEAIYSGCVPVVISDNYSLP 262

Query: 306 FADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANP-SMKRAMLFPQPAQPG 364
           F D + W +  + +  + +P +  IL  IP +   + +R+  N   ++R  +  +PAQP 
Sbjct: 263 FKDVLDWSKFSVEIPVDKIPDIKKILQEIPHD---KYRRMYQNVMKVRRHFVVNRPAQPF 319

Query: 365 DAFHQILNGL 374
           D  H IL+ +
Sbjct: 320 DVIHMILHSV 329


>gi|159470095|ref|XP_001693195.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158277453|gb|EDP03221.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 659

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 103/390 (26%), Positives = 156/390 (40%), Gaps = 76/390 (19%)

Query: 42  VYVYELPSKYNKKLL---LKDPRCLTHMF--------------AAEIFMHRFLLSSPVRT 84
           +YVY++   YN K+L   L    C TH F                E  +H +LL S  RT
Sbjct: 297 IYVYDMEPLYNAKMLQYRLASAWC-THRFYTSGNGTAYSPWCYGVESGLHEYLLLSEHRT 355

Query: 85  LNPEEADWFYTPIYPTCDLTP------------TGLPLPFKSPRMMRSAIQLISSNWPYW 132
            +PEEAD+FY P+Y +C + P             G      +  M+  A   I +N+P+W
Sbjct: 356 FDPEEADFFYVPVYISCLIWPVLNYADFPVFYSNGGTRVMHAVNMLSEARDWIDANYPFW 415

Query: 133 NRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITI 192
            R  G DH +  PHD GAC+                   ++V +      + L       
Sbjct: 416 KRRGGRDHIWTFPHDEGACW----------------APNSIVSS------IWLTHWGRMD 453

Query: 193 PPYAPPQKMQAHQIPPD--TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNP 250
           P +       A     D  +PR    Y   +      DP+  Y    + A  W   K N 
Sbjct: 454 PDHTSKSSFDADNYTRDFVSPRQPKGYTHLIQGHGCYDPKKIYNM--SIANNWRQ-KYNV 510

Query: 251 LFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAI 310
           L     D    Y + + R++FCL   G   WS R  +AV+ GCIPV++ D + + F    
Sbjct: 511 LVGDGQDVQGDYSDLLSRSLFCLVATG-DGWSARTEDAVLHGCIPVVVIDGVHMKFETLF 569

Query: 311 PWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLL------ANPSMKRAMLFPQPAQP- 363
             +   + + E DV  + TIL ++P E +   Q  L      A+P ++ A    + A+P 
Sbjct: 570 DVDSFSIRIPEADVANILTILKALPEERVRAMQANLGQASNAADPLVREAAELSKSAEPR 629

Query: 364 -----------GDAFHQILNGLARKLPHDK 382
                       DAF  I+  L  ++P  +
Sbjct: 630 FPRPFRGDPAVDDAFATIIQWLHSRIPDTR 659


>gi|168029668|ref|XP_001767347.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|168029742|ref|XP_001767384.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162681411|gb|EDQ67838.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162681448|gb|EDQ67875.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 333

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 92/348 (26%), Positives = 153/348 (43%), Gaps = 42/348 (12%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAE-IFMHRFLLSSPVRTLNPEEADWFYTPI 97
           + KVYVY   S+  + L+   P     ++A E  F+      +P  T +P+ A  ++ P 
Sbjct: 4   RFKVYVY---SEGEEPLVHNGP--CKEIYAVEGRFIQELQGDNPFVTHDPDNAHVYFLPF 58

Query: 98  Y-----------PTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPH 146
                        + D+ P  L         +   + ++   +P+WNR+ GADHF +  H
Sbjct: 59  SVAMMVAYLYEKESGDMDPLRL--------FVGDYVDVLMHKYPFWNRSGGADHFMLSCH 110

Query: 147 DFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQI 206
           D+G     +      R I  +L  A   + +     V L E  I +     P ++     
Sbjct: 111 DWGPLITRENMNLGTRSIR-VLCNANSSEGYVPWKDVSLPE--IHLVGGHIPAELGG--- 164

Query: 207 PPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM 266
           PP   R    +F G     ++ P      R      WE   ++ +          Y+E M
Sbjct: 165 PPAKDRPHLAFFAGR----DHGP-----VRPQLFKHWEGKDDDVIVYQWLPAHLKYHELM 215

Query: 267 QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK 326
           + + +C+CP G+   SPR+VEA+   C+PVIIAD  +LPF+D + WE   + V E D+P 
Sbjct: 216 KTSRYCICPGGYEVNSPRIVEAIYNECVPVIIADSFILPFSDVLNWESFSLHVKESDIPN 275

Query: 327 LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           L +IL ++  E     Q  ++   ++R  +  QP +  D FH IL+ +
Sbjct: 276 LKSILQNVTMETYTSMQERVS--QVQRHFVLHQPPKRYDVFHMILHSV 321


>gi|224142181|ref|XP_002324437.1| predicted protein [Populus trichocarpa]
 gi|222865871|gb|EEF03002.1| predicted protein [Populus trichocarpa]
          Length = 334

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 92/343 (26%), Positives = 152/343 (44%), Gaps = 32/343 (9%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAE-IFMHRFLLSSPVRTLNPEEADWFYTPIY 98
           LKVYVY    K  +K +    + +  ++A+E  FM     +      +P +A  FY P  
Sbjct: 5   LKVYVY----KEGEKPIFHQSK-MRGIYASEGWFMKLIEGNKKFVVRDPRKAHLFYLPFS 59

Query: 99  PTCDLTPTGLPLPFKSPRM---MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQ 155
           P    T            +   +++ + L++  + +WNRT G DHF V  HD+ +    Q
Sbjct: 60  PHMLRTALFDHNSLNQKELAEFLKNYVDLVAKKYSFWNRTGGTDHFLVGCHDWAS----Q 115

Query: 156 EEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIF 215
             +   R  + +L  + + + F       L      I     P K    + P + P  I 
Sbjct: 116 MTRHHMRNCIRVLCNSNVAKGFKIGKDTTL--PVTYIRSVENPLKELGGKSPSERP--IL 171

Query: 216 VYFRGLFYDVNNDPEGGYYARGARAAVWENFK--NNPLFDISTD--HPTTYYEDMQRAIF 271
            +F G  +          Y R      WEN +     L  +S D      Y E M+R+ +
Sbjct: 172 AFFAGNMHG---------YLRPILLEYWENKEPDMKILGPMSRDIAGKRRYREYMKRSKY 222

Query: 272 CLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTIL 331
           C+C  G+   +PR+VE++ + C+PVII+D+ V P  + + WE   +F+ E+D+P L  IL
Sbjct: 223 CICARGYEVHTPRVVESIFYECVPVIISDNYVPPLFEVLNWEAFSVFIQEKDIPNLRNIL 282

Query: 332 TSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
            SIP E  +  Q  L    +++  L+ +     D FH IL+ +
Sbjct: 283 LSIPQEKYVAMQ--LGVKKVQQHFLWHKKPVKYDLFHMILHSV 323


>gi|302837496|ref|XP_002950307.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
 gi|300264312|gb|EFJ48508.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
          Length = 368

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 100/374 (26%), Positives = 159/374 (42%), Gaps = 43/374 (11%)

Query: 39  KLKVYVYELPSKYNKKLLL-KDPRCLTHMFAA-EIFMHRFLLSSPVRTLNPEEADWFYTP 96
           + K+Y+YELP +      L +D     +++ A E FM  FL+   VRT NP EA+ FY P
Sbjct: 10  EFKIYMYELPWEIAFPYELGEDVHTRDNIYTAYEEFMKYFLVDDMVRTQNPYEANLFYVP 69

Query: 97  IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQ- 155
                 LT               + I+ I + WP++NRT G DHF     D  +C H+Q 
Sbjct: 70  A-----LTYFYATNVRNGQWQAEAVIEYIRTKWPFYNRTGGRDHFVFFTGDRASC-HFQR 123

Query: 156 --EEKAIERGILPLLQRATLVQTFGQRNHVCL-NEGSITIPPYA------------PPQK 200
             ++  I+     +  R         R++ C+ N+  + +PP              P  K
Sbjct: 124 WIQDSVIKVVHFGMQHRNLTWNEISNRDYACIQNKRDLVVPPRTVNLGPLLPSFSTPYYK 183

Query: 201 MQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLF--DIS--- 255
                   D  R++  +F G   D       G Y+ G R A+ +   +      D+    
Sbjct: 184 WLVSNQGYDGNRTLLFFFAGGVAD-------GEYSGGVRLAIKQMLSSITHLPADVKFVE 236

Query: 256 ---TDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 312
                    Y+  ++ + FC+ P G   W  RLV+AV  GC+PVII D +   F D +P+
Sbjct: 237 GRVGGGEDEYFAMIRASKFCIAPYG-HGWGNRLVQAVHLGCVPVIIQDYVYQAFEDFLPY 295

Query: 313 EEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILN 372
           E+  + +   DVP +  +L S     + R +  LA     RA ++ +    G A+   L 
Sbjct: 296 EDFSVRMRLADVPHMIDLLRSYSEADLARLRLGLAR--YYRAFIWEREYD-GLAYEWTLA 352

Query: 373 GLARKLPHDKSTYL 386
           GL R+  +  + Y 
Sbjct: 353 GLQRRAYNLAAEYF 366


>gi|357130589|ref|XP_003566930.1| PREDICTED: probable glycosyltransferase At5g03795-like
           [Brachypodium distachyon]
          Length = 510

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 76/271 (28%), Positives = 126/271 (46%), Gaps = 31/271 (11%)

Query: 118 MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQ------EEKAIERGILPLLQRA 171
           +R  +Q +++ +P+WNR  GADHF V  HD+   ++ Q        + + R  +  L  A
Sbjct: 245 LRDFVQGLAAKYPFWNRNRGADHFLVACHDWRKFYYLQGPYTTTAHRDLRRNSIKALCNA 304

Query: 172 -TLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPE 230
            +  + F     V L E +I      P + ++     P + R I  +F G  +       
Sbjct: 305 DSSERIFSPGKDVSLPETTIR----TPKRPLRYVGGLPVSRRRILAFFAGNVHG------ 354

Query: 231 GGYYARGARAAVWENFKNNPL-----FDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRL 285
                R      W + +++ +              +Y + M+ + FCLCP+G    SPR+
Sbjct: 355 ---RVRPVLLKHWGDGRDDDMRVYGPLPNRVSRQMSYIQHMKNSRFCLCPMGHEVNSPRI 411

Query: 286 VEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRL 345
           VEA+ + C+PV+IAD+ VLPF+D + W    + VAE+D+P L  IL  I     LR+   
Sbjct: 412 VEALYYECVPVVIADNFVLPFSDVLDWTAFSVVVAEKDIPDLKKILQGIS----LRRYVA 467

Query: 346 LANPSMKRAMLFPQPAQP--GDAFHQILNGL 374
           + +   +    F   A+P   D FH IL+ +
Sbjct: 468 MHDCVKRLQRHFLWHARPLRYDLFHMILHSI 498


>gi|443732873|gb|ELU17436.1| hypothetical protein CAPTEDRAFT_228334 [Capitella teleta]
          Length = 506

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 91/324 (28%), Positives = 147/324 (45%), Gaps = 42/324 (12%)

Query: 70  EIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPR----MMRSAIQLI 125
           E+F+H  L SSP  T +P EA +F+ P    C      +    ++ R    M+   +  I
Sbjct: 202 ELFIH-LLKSSPYITQDPSEATFFFLPF--RCFAYRKTISDRDRAQRFTEEMVSKILYEI 258

Query: 126 SSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQR--ATLVQTFGQRNHV 183
            SN+ +W+RT GADHF+V  HDFG         AI  G  P L +    +V T    +  
Sbjct: 259 KSNYSFWDRTLGADHFYVCAHDFGP--------AIVAGSDPFLHKNAIAMVNTADYEHIY 310

Query: 184 CLNEGSITIPPYAPPQKMQ-------AHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYAR 236
            +    I++PP+    K          H + P + R++  ++ G     N D       R
Sbjct: 311 YVPHKDISLPPHPSHGKNSLANIGKGGHGLNP-SDRTVLAFYAG-----NLD-------R 357

Query: 237 G-ARAAVWENFKNNPLFDISTDHPT--TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGC 293
           G  R ++ + +  +  F I   H T   Y   ++ + FCL   G   WSP L++A+ FGC
Sbjct: 358 GRIRPSIKDFWSTDIDFRIFMGHLTDERYQHYLKTSKFCLILRGNEAWSPCLMDAIWFGC 417

Query: 294 IPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKR 353
           +PVII+D   LP    + W +  + + E  V  L  IL ++ P+ +   Q  L    +  
Sbjct: 418 VPVIISDYYDLPLHGMLDWNQFAVVIRESKVKSLKEILLAVSPQKLTSMQEKLKQ--VYG 475

Query: 354 AMLFPQPAQPGDAFHQILNGLARK 377
             ++  P +P DAF  ++  L ++
Sbjct: 476 HFVWNDPPRPYDAFQSVMYQLWKR 499


>gi|75214755|gb|ABA18105.1| exostosin family protein [Capsella rubella]
          Length = 423

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 85/308 (27%), Positives = 140/308 (45%), Gaps = 34/308 (11%)

Query: 79  SSPVRTLNPEEADWFYTPIYPTCDLT-----PTGLPLPFKSPRMMR---SAIQLISSNWP 130
           SS  R + PEEA  F+ P +   ++      P   P  F   R+ R     + +++  +P
Sbjct: 127 SSRFRAVRPEEAHVFFLP-FSVANIVHYVYKPIASPADFNRARLHRIFNDYVDVVARKYP 185

Query: 131 YWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSI 190
           +W ++ GADHF V  HD+        +    +  +  L  A   + F       + E +I
Sbjct: 186 FWKQSNGADHFMVSCHDWAPDVP-DSKPEFFKDFMRGLCNANTSEGFKPSIDFSIPEINI 244

Query: 191 TIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNP 250
                 PP   Q     P+  R+I  +F G  +        GY     R  ++ ++K   
Sbjct: 245 PKGKLKPPFMGQN----PEN-RTILAFFAGRAH--------GY----IREVLFTHWKGKD 287

Query: 251 LFDISTDHPT---TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFA 307
                 DH T    Y+E    + FCLCP G+   SPR VEA+  GC+PV+I+D+  LPF+
Sbjct: 288 KDVQVYDHLTKGQNYHELTGHSKFCLCPSGYEVASPREVEAIYSGCVPVVISDNYSLPFS 347

Query: 308 DAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRK-QRLLANPSMKRAMLFPQPAQPGDA 366
           D + W +  + +  + +P +  IL  IP +  +R  Q +L    +++  +  +PAQP D 
Sbjct: 348 DVLDWSKFSVEIPVDRIPDIKNILQEIPHDKYIRMYQNVL---KVRKHFVVNRPAQPFDV 404

Query: 367 FHQILNGL 374
            H IL+ +
Sbjct: 405 IHMILHSV 412


>gi|356510025|ref|XP_003523741.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
           max]
          Length = 619

 Score =  108 bits (269), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 99/346 (28%), Positives = 150/346 (43%), Gaps = 39/346 (11%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEI-FMHRFLLSSPVRTLNPEEADWFYTPIY 98
           LKVY+Y+       + +  +P  L  ++A+E  FM     +    T +P +A  FY P  
Sbjct: 291 LKVYIYQ----DGDRPIFHEP-LLDGIYASEGWFMKLMEANKQFVTRDPGKAHLFYIPF- 344

Query: 99  PTCDLTPTGLPLPFKSPR-----MMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFH 153
            +  L    L +     R      M++ + +I+  +P+WNRT GADHF V  HD+     
Sbjct: 345 -SSRLLQQTLYVRNSHRRSNLIEYMKNYVDMIAGKYPFWNRTSGADHFVVACHDWAPA-- 401

Query: 154 YQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRS 213
             E +      +  L  A +   F     V L E  I      P + +      P + R 
Sbjct: 402 --ETRGRMLSCIRALCNADIEVGFKIGKDVSLPETYIR-SSENPVKNIGG---DPPSKRP 455

Query: 214 IFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP-----TTYYEDMQR 268
           I  +F G  +          Y R      WEN    P   IS   P       Y + M+ 
Sbjct: 456 ILAFFAGGLHG---------YVRPILLKHWEN--KEPDMKISGPLPHVRGNVNYIQLMKS 504

Query: 269 AIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLD 328
           + FC+C  G    SPR+VEA+   CIPVII+D+ + PF + + WE   +FV EE++P L 
Sbjct: 505 SKFCICARGHEVNSPRVVEAIFHECIPVIISDNFIPPFFEILNWESFAVFVKEEEIPNLR 564

Query: 329 TILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
            IL SI  E  L   +  A    +  +   +P +  D FH +L+ +
Sbjct: 565 NILLSISEERYLEMHK-RAKKVQEHFLWHAEPVK-YDLFHMLLHSI 608


>gi|75214619|gb|ABA18091.1| exostosin family protein [Olimarabidopsis pumila]
          Length = 343

 Score =  108 bits (269), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 87/309 (28%), Positives = 139/309 (44%), Gaps = 36/309 (11%)

Query: 79  SSPVRTLNPEEADWFYTPIYPTCDLT-----PTGLPLPFKSPRMMR---SAIQLISSNWP 130
           S   R   PEEA  F+ P +   ++      P   P  F   R+ R     +++++   P
Sbjct: 45  SGRFRAGRPEEAHAFFLP-FSVANIVHYVSQPIASPADFNRARLHRIFNDYVEVVARKHP 103

Query: 131 YWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSI 190
           +WN++ GADHF V  HD+       + +  +  I  L    T   + G R  +  +   I
Sbjct: 104 FWNQSNGADHFMVSCHDWAPDVAGSKPEFFKDFIRGLCNANT---SEGFRPSIDFSIPEI 160

Query: 191 TIPP--YAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN 248
            IP     PP   Q     P+  R+I  +F G  +        GY     R  ++ ++K 
Sbjct: 161 NIPKGKLKPPFMGQT----PEN-RTILAFFAGRAH--------GY----IREVLFTHWKG 203

Query: 249 NPLFDISTDHPT---TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
                   DH T    Y+E +  + FCLCP G+   SPR VEA+  GC+PV+I+D+  LP
Sbjct: 204 KDKDVQVYDHLTKGQNYHELIGHSKFCLCPSGYEVASPREVEAIYSGCVPVVISDNYSLP 263

Query: 306 FADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
           F D + W +  + +  + +P +  IL  IP E  ++    +    + R  +  +PAQP D
Sbjct: 264 FNDVLDWSKFSVEIPVDKIPDIKKILQEIPHEKYIKMYHNVM--KVGRHFVVNRPAQPFD 321

Query: 366 AFHQILNGL 374
             H IL+ +
Sbjct: 322 VIHMILHSV 330


>gi|255546379|ref|XP_002514249.1| catalytic, putative [Ricinus communis]
 gi|223546705|gb|EEF48203.1| catalytic, putative [Ricinus communis]
          Length = 334

 Score =  108 bits (269), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 92/343 (26%), Positives = 156/343 (45%), Gaps = 32/343 (9%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAE-IFMHRFLLSSPVRTLNPEEADWFYTPIY 98
           LKVYVY    K  +K +    + +  ++A+E  FM     +      +P +A  F+ P  
Sbjct: 5   LKVYVY----KEGEKPIFHQSK-MRGIYASEGWFMKHIEGNKKFLVKDPRKAHLFFLPFS 59

Query: 99  PTCDLTPT-GLPLPFKS--PRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQ 155
           P    T   G  L  +      +++ + L++  + +WNRT G DHF V  HD+ +    +
Sbjct: 60  PQMLRTVIFGQKLQSQKDLEEYLKNYVDLVARKYSFWNRTGGTDHFLVGCHDWAS----R 115

Query: 156 EEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIF 215
             +   +  + +L  A + + F       L      I     P K    + P +  R   
Sbjct: 116 ITRKYMQNCIRVLCNANVAKGFKIGKDTTL--PVTYIRSAENPLKDVGGKHPSE--RYTL 171

Query: 216 VYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN-PLFD---ISTDHPTTYYEDMQRAIF 271
            +F G  +          Y R      WEN +++  +F       +    Y E M+ + +
Sbjct: 172 AFFAGGMHG---------YLRPILVQFWENKESDMKIFGPMPRDIEGKRLYREYMKSSKY 222

Query: 272 CLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTIL 331
           C+C  G+   +PR+VEA+++ C+PVII+D+ V PF + + WE   +FV E+D+P L +IL
Sbjct: 223 CICARGYEVHTPRIVEAILYECVPVIISDNYVPPFFEVLNWEAFSVFVQEKDIPNLRSIL 282

Query: 332 TSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
            SIP E  L  Q  L    +++  L+ +     D FH IL+ +
Sbjct: 283 LSIPEEKYLEMQ--LRVKMVQQHFLWHKNPVKYDLFHMILHSV 323


>gi|302799042|ref|XP_002981280.1| glycosyltransferase-like protein [Selaginella moellendorffii]
 gi|300150820|gb|EFJ17468.1| glycosyltransferase-like protein [Selaginella moellendorffii]
          Length = 352

 Score =  108 bits (269), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 94/336 (27%), Positives = 152/336 (45%), Gaps = 40/336 (11%)

Query: 53  KKLLLKDPRCLTHMFAAE-IFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLP 111
           K L+ + P+  T ++A+E +F+      +P     P+ A  F+ P +    +        
Sbjct: 35  KPLMHEGPK--TGIYASEGLFIATMERGNPFAVTEPKIATMFFIP-FSLKQMVDYMYDTN 91

Query: 112 FKSPRMMRSAI----QLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPL 167
             S + ++S I    + ++S +PY N T G DHFFV  HD+       E++  +R I+ +
Sbjct: 92  SHSMKNIQSYIAGYLRRLASKYPYMNATNGIDHFFVSCHDWA--LMALEKQDCQRNIVKV 149

Query: 168 LQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTP---RSIFVYFRGLFYD 224
           +  A   + F     V L E  +    ++P        I  DT    R    +F G  + 
Sbjct: 150 VCNADSSRGFNTSRDVSLPETRVRQGKHSP--------IIRDTSGMDRPYLAFFAGQMHG 201

Query: 225 VNNDPEGGYYARGARAAVWENFKNNPLFDI------STDHPTTYYEDMQRAIFCLCPLGW 278
                      R    A W++   +P   I      S     +Y E M+ + +C+C  G+
Sbjct: 202 K---------LRPVLLAHWKD--KDPEMKIYEVLPPSVAERISYSEHMRLSKYCICAAGF 250

Query: 279 APWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEV 338
              SPRLVEA+V  C+PVI+AD+ VLPF++ I W+ I + VAE+DV  L  IL  IP   
Sbjct: 251 EVNSPRLVEAIVNECVPVILADNFVLPFSEVINWDSISVTVAEKDVANLKAILAGIPLRT 310

Query: 339 ILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
               Q  L +  +KR   +    +  D F+ I++ L
Sbjct: 311 YKEMQARLKH--VKRHFEWKNSPEKYDIFNMIVHSL 344


>gi|356507416|ref|XP_003522463.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
           At3g07620-like [Glycine max]
          Length = 471

 Score =  107 bits (268), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 81/310 (26%), Positives = 135/310 (43%), Gaps = 40/310 (12%)

Query: 72  FMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPY 131
           F+H        RT +P+EA  +Y P +    + P GL        +++  IQ+I+   P+
Sbjct: 183 FIHEMEKGRYYRTYDPDEAFVYYLP-FSGVYVDPLGL--------VVKDYIQVIAHKHPF 233

Query: 132 WNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNE---- 187
           WNR+ G DHF +  HD+G       +      I  +L  A + + F     V   E    
Sbjct: 234 WNRSLGYDHFMLSCHDWGPLVSSYVDHFYNNAIR-VLCNANVSEGFKPAKDVSFPEIKLI 292

Query: 188 -GSIT--IPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWE 244
            G +T  +  Y P Q            R+I  +F G  +          Y R    + W+
Sbjct: 293 KGEVTNLVGGYPPSQ------------RTILAFFAGHQHG---------YIRXLLQSTWK 331

Query: 245 NFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL 304
           N   +           +YY  ++ + FCLCP G+   SPR+V+A+   C+PV+I+D  V 
Sbjct: 332 NKDQDMQIYEELPEGISYYTKLRSSKFCLCPSGYEVASPRVVKAIFAECVPVLISDGYVP 391

Query: 305 PFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPG 364
           PF+D + W    + V  +D+P +  IL  I     LR  + +    ++R  +  +P +  
Sbjct: 392 PFSDVLNWNSFSVQVDVKDIPNIKKILMGISERQYLRMYKRVKQ--VQRHFVPNEPPKRY 449

Query: 365 DAFHQILNGL 374
           D FH  ++ +
Sbjct: 450 DMFHMTVHSI 459


>gi|357478279|ref|XP_003609425.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
 gi|355510480|gb|AES91622.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
          Length = 617

 Score =  107 bits (268), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 77/303 (25%), Positives = 135/303 (44%), Gaps = 26/303 (8%)

Query: 80  SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSP-------RMMRSAIQLISSNWPYW 132
           SP +  +P EA  F+ P      +     P+  K         RM+   + +++  +PYW
Sbjct: 320 SPFKAKHPNEAHIFFLPFSVANVVQYVYKPIMSKKDFNRDRLHRMVEDYVNVVAHKYPYW 379

Query: 133 NRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITI 192
           NR+ GADHF +  HD+           + +    +L  A   + F  +  V + E  + +
Sbjct: 380 NRSNGADHFLLSCHDWAPEIS-DANPNLFKNFTRVLCNANTSEGFQPKRDVSIPEVYLPV 438

Query: 193 PPYAPPQKMQAHQIPPDTPRSIFVYFRGLFY-DVNNDPEGGYYARGARAAVWENFKNNPL 251
               PP   Q+   P +  R+I  +F G  + D+       +  + A+  V E       
Sbjct: 439 GKLGPPNLGQS---PLN--RTILAFFSGGAHGDIRKLLLNHWKNKDAQVQVHEYLPKG-- 491

Query: 252 FDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
                     Y E M  + FCLCP G+   SPR+VEA+  GC+PVII+ +  LPF D + 
Sbjct: 492 --------QNYTELMGLSKFCLCPSGYEVASPRIVEAINAGCVPVIISQNYSLPFNDVLN 543

Query: 312 WEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQIL 371
           W E  + +  E + ++  IL ++  +  ++    +    +++  +  +PA+P D  H IL
Sbjct: 544 WSEFSVEIPVEKIVEIKNILQNVTKDKYMKLHMNVM--KVQKHFVMNRPAKPFDVMHMIL 601

Query: 372 NGL 374
           + +
Sbjct: 602 HSI 604


>gi|356563757|ref|XP_003550126.1| PREDICTED: probable glycosyltransferase At5g03795 [Glycine max]
          Length = 645

 Score =  107 bits (268), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 91/344 (26%), Positives = 153/344 (44%), Gaps = 35/344 (10%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEI-FMHRFLLSSPVRTLNPEEADWFYTPIY 98
           LKVYVY    +   + ++  P   T ++A+E  FM +   +    T +P +A  FY P  
Sbjct: 317 LKVYVY----REGARPIMHSP-FFTGLYASEGWFMKQMEANKRFLTRDPNKAHLFYLPFS 371

Query: 99  PTCDLTPTGLPLPFKSPRMMR---SAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQ 155
                    +        +++   + +++I+  + +WNRT GADHF V  HD+       
Sbjct: 372 SRMLEETLYVQNSHNHKNLVQYLHNYVEMIAGKYTFWNRTGGADHFLVGCHDWAPG---- 427

Query: 156 EEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIF 215
           E K      +  L  A + + F       L E  +      P + +  +     + R+  
Sbjct: 428 ETKVDMANCIRSLCNADVKEGFVFGKDASLPETYVR-DAKIPTKDLSGNSA---SKRTTL 483

Query: 216 VYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT-----YYEDMQRAI 270
            +F G  +          Y R      WEN   +P   I    P +     Y + M+ + 
Sbjct: 484 AFFAGSMHG---------YVRPILLQHWEN--KDPDMKIFGRLPKSKGNRNYIQYMKSSK 532

Query: 271 FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTI 330
           +C+C  G+   SPR+VEA+ + C+PVII+D+ V PF + + WE   + V E+D+P L  I
Sbjct: 533 YCICAKGYEVNSPRVVEAIFYECVPVIISDNFVPPFLEVLNWESFAVIVLEKDIPNLKNI 592

Query: 331 LTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           L SIP +  LR Q  +    +++  L+ +     D FH IL+ +
Sbjct: 593 LLSIPEKQYLRLQMRVK--KVQQHFLWHKNPVKYDIFHMILHSV 634


>gi|359484345|ref|XP_002280595.2| PREDICTED: probable glycosyltransferase At5g03795-like [Vitis
           vinifera]
 gi|297738776|emb|CBI28021.3| unnamed protein product [Vitis vinifera]
          Length = 665

 Score =  107 bits (268), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 106/390 (27%), Positives = 163/390 (41%), Gaps = 53/390 (13%)

Query: 8   LLLAFVSADNTPKIERISGSAGDVLEDDPVGK---------LKVYVYELPSKYNKKLLLK 58
           LL A     N P I    G    V  +  + K         LK+Y+Y    +  +K +  
Sbjct: 295 LLSARSEIQNAPVIRNTPGLYASVYRNVSMFKRSYELMERVLKIYIY----REGEKPIFH 350

Query: 59  DPRCLTHMFAAEI-FMHRFLLSSPVRTLNPEEADWFYTPI---------YPTCDLTPTGL 108
            PR L  ++A+E  FM     +      +P +A  FY P          Y     TP  L
Sbjct: 351 QPR-LRGIYASEGWFMKLIEGNKRFVVRDPRKAHLFYVPFSSKMLRTVFYEQNSSTPRDL 409

Query: 109 PLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLL 168
              FK+       + LI+  + +WNRT GADH  V  HD+      Q         +  L
Sbjct: 410 EKYFKN------YVGLIAGKYRFWNRTGGADHLIVACHDWAPRITRQ----CSWNSIRAL 459

Query: 169 QRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNND 228
             + +   F       L      I     P K    + P   P  I  +F G  +     
Sbjct: 460 CNSNIASGFKIGKDTTL--PVTYIRKSEDPLKYLGGKPPSQRP--ILAFFAGSMHG---- 511

Query: 229 PEGGYYARGARAAVWENFKNN-PLFD-ISTDH--PTTYYEDMQRAIFCLCPLGWAPWSPR 284
                Y R      WEN + +  +F  +S D    + Y + M+ + +C+C  G+   +PR
Sbjct: 512 -----YLRPILLQYWENKEQDIKIFGPMSRDDGGKSRYRDHMKSSKYCICARGYEVHTPR 566

Query: 285 LVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQR 344
           +VEA+ + C+PVII+D+ V PF + + WE   +F+ E+DVP L  IL SIP E  L+ Q 
Sbjct: 567 VVEAIFYECVPVIISDNYVPPFFEILNWEAFAVFILEKDVPNLRNILLSIPEEKYLQMQ- 625

Query: 345 LLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
            +    +++  L+ +     D FH IL+ +
Sbjct: 626 -MRVKMVQQHFLWHKKPVKYDLFHMILHSV 654


>gi|302829362|ref|XP_002946248.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
 gi|300269063|gb|EFJ53243.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
          Length = 656

 Score =  107 bits (268), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 103/388 (26%), Positives = 169/388 (43%), Gaps = 62/388 (15%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTH-MFAAEIFMHRFLLS--SPVRTLNPEEADWFYT 95
            LK+Y+Y++P         +D     H  + + +      L+  S +RT NP EA+ FY 
Sbjct: 290 SLKIYMYDIPPNIVGPHQFEDGNGGIHPQYESFLRFQGLFLNDVSGIRTENPHEANLFYI 349

Query: 96  PI---YPTCDLT-PTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGAC 151
           P    Y + +L  PTG  +          A+  +++ +P++NRT G DHF ++  D GAC
Sbjct: 350 PAFTYYSSSNLGDPTGAAV---------RAVNWVAATFPFFNRTGGRDHFVLLSGDRGAC 400

Query: 152 FHYQEEKAIERGILPLLQRATLVQTFG--------------QRNHVCLNEG-SITIPPY- 195
           +            LP  +    V  FG                 + C   G  + +PPY 
Sbjct: 401 YLK---------TLPQTENLIRVTHFGYERPNITDMGPLVTNTEYGCFKAGRDVVMPPYV 451

Query: 196 ----APPQKMQAHQIPPDTPRSIFVYFRGLFY---DV-NNDPEGGYYARGARAAVWENFK 247
               A  Q ++A    P    ++      L +   D+ +N+PE   Y+ G R A+     
Sbjct: 452 KSNVAGIQGVRAKLEEPGGAEALLAGKDTLLFFSGDIRHNEPE---YSGGVRQALALLLA 508

Query: 248 NNPLFDI------STDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 301
           N    D+             Y   ++R+ FCL P G   W  RL+ A+   CIPVII D 
Sbjct: 509 NTSYPDVVFKGGYMMMGMGEYESLLRRSKFCLAPYGHG-WGIRLIHAITHACIPVIIQDK 567

Query: 302 IVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPA 361
           +  PF D + + +  + V++ ++P+L  IL ++P   +LR   +  N  + RA L+ QP 
Sbjct: 568 VRQPFEDILHYPDFSVRVSKAELPRLVEILRAVPEPDLLRM--IKENSRVYRAFLW-QPE 624

Query: 362 QPGDAFHQILNGLARKLPHDKSTYLKPG 389
             G A++  +  L R+L H +    + G
Sbjct: 625 LGGLAYNITIASLRRRLSHVRGDLYEAG 652


>gi|147846684|emb|CAN80640.1| hypothetical protein VITISV_016911 [Vitis vinifera]
          Length = 1363

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 95/368 (25%), Positives = 159/368 (43%), Gaps = 68/368 (18%)

Query: 28  AGDVLEDDPVGKLK---VYVYELPSKYNKKLL------------LKDPRCLTH----MFA 68
           +G ++  D V K K   +YVY+LP ++N  LL            + D R  T+    ++ 
Sbjct: 351 SGSLVNLDAVVKKKRPLIYVYDLPPEFNSLLLEGRHFKFECVNRIYDDRNATYWTEQLYG 410

Query: 69  AEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLP--LPFKSPRMMRSAIQL-- 124
           A++ ++  +L+SP RTL+ EEAD+F+ P+  +C +        L   +   +RS++ L  
Sbjct: 411 AQMAIYESILASPHRTLDGEEADFFFVPVLDSCIIVRADDAPHLNMHAHGGLRSSLTLEF 470

Query: 125 -------ISSNWPYWNRTEGADHFFVVPHDFGACFHYQE--------------EKAIERG 163
                  I   +P+WNR+ G DH +    D GAC+  +E               K     
Sbjct: 471 YKTAYDHIVEQYPFWNRSSGRDHIWFFSWDEGACYAPKEIWDSMMLVHWGNTNSKHNHST 530

Query: 164 ILPLLQRATLVQTFGQRNHVCLN-EGSITIPPYAPPQKMQAHQ---IPPDTPRSIFVYFR 219
                     V +  + NH C +    + +P +  P  +         P   R    YF 
Sbjct: 531 TAYWADNWDSVSSDRRGNHPCFDPYKDLVLPAWKRPDVVSLSSKLWSRPREQRKTLFYFN 590

Query: 220 GLFYDVNNDPEGGY----YARGARAAVWENFKNNPLFD--ISTDHPT----------TYY 263
           G   ++    EGG     Y+ G R  V E F ++P  +  +   H             Y+
Sbjct: 591 G---NLGPAYEGGRPETTYSMGIRQKVAEEFGSSPNKEGKLGKQHAEDVIVTPLRSGNYH 647

Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
           E +  ++FC    G   WS R  ++++ GCIPV+I D I LPF + + +E   + + E++
Sbjct: 648 ESLASSVFCGVMPG-DGWSGRFEDSILQGCIPVVIQDGIFLPFENMLNYESFAVRIREDE 706

Query: 324 VPKLDTIL 331
           +P L  IL
Sbjct: 707 IPNLIKIL 714


>gi|224092294|ref|XP_002309547.1| predicted protein [Populus trichocarpa]
 gi|222855523|gb|EEE93070.1| predicted protein [Populus trichocarpa]
          Length = 471

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 93/349 (26%), Positives = 151/349 (43%), Gaps = 43/349 (12%)

Query: 40  LKVYVY---ELPSKYNKKLLLKDPRCLTHMFAAE-IFMHRFLLSSPVRTLNPEEADWFYT 95
           LKVY+Y   E+P       +   P  L  ++A+E  FM     +    T + ++A  FY 
Sbjct: 141 LKVYIYKEGEMP-------IFHQP-LLNGIYASEGWFMKLLEGNKKFVTKDSKKAHLFYL 192

Query: 96  PIYPTCDLTPTGLPLPFKSPRM---MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
           P           +P       +   ++  + +IS  +P+WNRT+GADHF    HD+    
Sbjct: 193 PFSSRYLEIRLYVPNSHSHKNLIEYLKKYLDMISEKYPFWNRTQGADHFLAACHDWAPS- 251

Query: 153 HYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPR 212
              E +      +  L  +   + F       L E +  +    P + +  ++    + R
Sbjct: 252 ---ETRQHMANCIRALCNSDAKEDFVYGKDASLPE-TYVLTQENPLRDLGGNRA---SKR 304

Query: 213 SIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPT-------TYYED 265
           SI  +F G  +          Y R      WEN   +P   I    P         Y   
Sbjct: 305 SILAFFAGSMHG---------YLRPILLQHWEN--KDPDMKIFGRLPKVKGRGKMNYARY 353

Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
           M+ + +C+C  G+   SPR+VEA+ + C+PVII+D+ V PF + + WE   +FV E+D+P
Sbjct: 354 MKSSKYCICAKGYEVNSPRVVEAIFYECVPVIISDNFVPPFLEVLNWESFAVFVLEKDIP 413

Query: 326 KLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
            L  IL SIP +   R Q  +    +++  L+       D FH IL+ +
Sbjct: 414 NLKKILLSIPAKKYRRMQMRVKR--VQQHFLWHARPVKYDVFHMILHSI 460


>gi|159479930|ref|XP_001698039.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158273838|gb|EDO99624.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 810

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 96/374 (25%), Positives = 150/374 (40%), Gaps = 73/374 (19%)

Query: 42  VYVYELPSKYNKKLLLK---DPRCL-------------THMFAAEIFMHRFLLSSPVRTL 85
           +YVY++P  Y  ++L        CL              + +  E  MH  +L S  RT 
Sbjct: 373 IYVYDVPPDYTSRMLQYRNFGDTCLWRRWHDGNFTGITGYTYGIETLMHELMLQSEHRTF 432

Query: 86  NPEEADWFYTPIYPTCDL-------------TPTGLPLPFKSPRMMRSAIQLISSNWPYW 132
           +PEEAD+FY P+Y TC                P GL +      M+      + +  PYW
Sbjct: 433 DPEEADFFYVPMYITCYFWPILGWADGPWWHAPNGLRV-MHGANMITELHDWLRTKLPYW 491

Query: 133 NRTEGADHFFVVPHDFGACF-------------HYQEEKAIERGILPLLQRATLVQ---- 175
           +R  G DH +++  D GAC+             H+       +     LQ     +    
Sbjct: 492 DRRGGRDHIWLMAADEGACWMPKAVYDTSIVLTHWGRLDPEHKSNTAYLQDNYTAKPESA 551

Query: 176 -------TFGQR--NHVCLN-EGSITIPPYAPPQKMQAHQI--PPDTPRSIFVYFRGLFY 223
                   FG R   H C +    + +P +  P       +   P   R + ++FRG   
Sbjct: 552 FDAWRGVDFGDRIKGHPCFDPRKDLVVPAFKSPNHFPRSPLIGAPPLERDLLLFFRG--- 608

Query: 224 DVNNDPEGGYYARGARAAVW-----ENFKNNPLFDISTDHP--TTYYEDMQRAIFCLCPL 276
           DV       +Y+RG R  ++      ++ N     I +       Y E + R+ FCL   
Sbjct: 609 DVGAS-RLPHYSRGIRQRLFHLAHKHDWYNRFKIAIGSGDSLKGDYSEQLARSKFCLVAP 667

Query: 277 GWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED--VPKLDTILTSI 334
           G   WSPR  +A++ GCIPV++ D +   F   + W+   + + E+D  +  L  +L SI
Sbjct: 668 G-DGWSPRAEDAILHGCIPVVVMDGVQAVFESILDWDSFSLRIREDDAALEALPQLLASI 726

Query: 335 PPEVILRKQRLLAN 348
            PE +   QR LA 
Sbjct: 727 SPERLAHMQRHLAR 740


>gi|224087074|ref|XP_002308055.1| predicted protein [Populus trichocarpa]
 gi|222854031|gb|EEE91578.1| predicted protein [Populus trichocarpa]
          Length = 338

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 77/268 (28%), Positives = 123/268 (45%), Gaps = 29/268 (10%)

Query: 122 IQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRN 181
           +++I+  + YWNRT GADHF +  HD+G        +  +  I  L   A   + F  + 
Sbjct: 88  VRVIADKYTYWNRTNGADHFSISCHDWGPDISRTNPELFKYFIRALCN-ANTSEGFQPQR 146

Query: 182 HVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYF--------RGLFYDVNNDPEGGY 233
            V + E  + +     P++       P + R I  +F        R +      D +G  
Sbjct: 147 DVSVPEIFLHVGKLGLPREGAQ----PPSKRPILAFFAGGAHGRIRKVLLKRWKDKDGEI 202

Query: 234 YARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGC 293
                   V +  KNN L          Y++ M ++ FCLCP G    SPR+V A+  GC
Sbjct: 203 ---QVHEYVTQRKKNNNL----------YFKLMGQSKFCLCPSGHEVASPRVVTAIQLGC 249

Query: 294 IPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKR 353
           +PVII+D+  LPF+D + W +  + +  E + ++ TIL  I  +  L  QR +     +R
Sbjct: 250 VPVIISDNYSLPFSDVLDWSKFSVNIPSEKIQEIKTILKGISHKRYLTMQRRVIQA--QR 307

Query: 354 AMLFPQPAQPGDAFHQILNGL-ARKLPH 380
                +PA+P D  H IL+ +  R+L H
Sbjct: 308 HFTLNRPAKPYDMIHMILHSIWLRRLNH 335


>gi|226496701|ref|NP_001141657.1| uncharacterized protein LOC100273782 [Zea mays]
 gi|194705444|gb|ACF86806.1| unknown [Zea mays]
          Length = 497

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 105/362 (29%), Positives = 158/362 (43%), Gaps = 58/362 (16%)

Query: 38  GKLKVYVYELPSKYNKKLL--LKDP-RCLTHMFAAEIFMHRF------------------ 76
           G L+VYVYE+PSK+   LL   +D  R   ++ +    +HR                   
Sbjct: 109 GLLRVYVYEMPSKFTYDLLRLFRDSYRDTDNLTSNGSPVHRLIEQHSIDYWLWADLIALD 168

Query: 77  ---LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWN 133
              LL S +R    EEAD FY P + T       L    +   + R A++ ++   P W 
Sbjct: 169 SQRLLKSVIRVQQQEEADIFYVPFFTTISYF---LLEKQECKALYREALKWVTDQ-PAWQ 224

Query: 134 RTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIP 193
           R+EG DH   V H +         +++++ I  LL        + +   V L +    I 
Sbjct: 225 RSEGRDHVIPVHHPWS---FKSVRRSVKKAIW-LLPDMDSTGNWYKPGQVYLEKD--VIL 278

Query: 194 PYAPPQKMQAHQ--IPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPL 251
           PY P   +  H+  +   + RSI ++FRG          GG      R+ + E  K+   
Sbjct: 279 PYVPNVDLCDHKCVLETQSKRSILLFFRGRL----KRNAGG----KIRSKLVEELKSAK- 329

Query: 252 FDISTDHPTT-------YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL 304
            DI  +  +T         + M+++ FCL P G  P S RL +A+V GCIPVII+D++ L
Sbjct: 330 -DIVIEEGSTGAQGKAAAQDGMRKSFFCLSPAGDTPSSARLFDAIVTGCIPVIISDELEL 388

Query: 305 PFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPA 361
           PF   + + EI +FV+  D  +   L   L  I  + I   Q  L   S  R  L+  PA
Sbjct: 389 PFEGILDYREIALFVSASDAVQPGWLLKYLRGINAKRIREIQSNLVKYS--RHFLYSSPA 446

Query: 362 QP 363
           QP
Sbjct: 447 QP 448


>gi|359484343|ref|XP_002283936.2| PREDICTED: uncharacterized protein LOC100268163 [Vitis vinifera]
          Length = 738

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 96/351 (27%), Positives = 153/351 (43%), Gaps = 48/351 (13%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEI-FMHRFLLSSPVRTLNPEEADWFYTP-- 96
           LKVYVY    K  +K +   P  L  ++A+E  FM     +      +P +A  FY P  
Sbjct: 409 LKVYVY----KDGEKPIFHQP-ILKGLYASEGWFMKLMERNKHFVVKDPRQAQLFYMPFS 463

Query: 97  -------IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFG 149
                  +Y       T L       + ++   + I++ + +WNRT GADHF V  HD+ 
Sbjct: 464 SRMLEYKLYVRNSHNRTNLR------QYLKQYSEKIAAKYRFWNRTGGADHFLVACHDWA 517

Query: 150 ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPD 209
               Y+    +E+ I  L   A +   F     V L E  +      P + +      P 
Sbjct: 518 P---YETRHHMEQCIKALCN-ADVTAGFKIGRDVSLPETYVR-SARNPLRDLGG---KPP 569

Query: 210 TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP------TTYY 263
           + R I  ++ G  +          Y R      W++   +P   I    P        Y 
Sbjct: 570 SERHILAFYAGNMHG---------YLRPILLKYWKD--KDPDMKIYGPMPPGVASKMNYI 618

Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
           + M+ + FC+CP G+   SPR+VEA+ + C+PVII+D+ V PF D + W    + +AE+D
Sbjct: 619 QHMKSSKFCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFDVLDWGAFSIILAEKD 678

Query: 324 VPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           +P L  +L SIP +  L+ Q  L    +++  L+       D FH  L+ +
Sbjct: 679 IPNLKDVLLSIPNDKYLQMQ--LGVRKVQKHFLWHAKPLKYDLFHMTLHSI 727


>gi|225441752|ref|XP_002277596.1| PREDICTED: uncharacterized protein LOC100267584 [Vitis vinifera]
          Length = 794

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 101/410 (24%), Positives = 172/410 (41%), Gaps = 71/410 (17%)

Query: 28  AGDVLEDDPVGKLK---VYVYELPSKYNKKLL------------LKDPRCLTH----MFA 68
           +G ++  D V K K   +YVY+LP ++N  LL            + D R  T+    ++ 
Sbjct: 335 SGSLVNLDAVVKKKRPLIYVYDLPPEFNSLLLEGRHFKFECVNRIYDDRNATYWTEQLYG 394

Query: 69  AEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLP--LPFKSPRMMRSAIQL-- 124
           A++ ++  +L+SP RTL+ EEAD+F+ P+  +C +        L   +   +RS++ L  
Sbjct: 395 AQMAIYESILASPHRTLDGEEADFFFVPVLDSCIIVRADDAPHLNMHAHGGLRSSLTLEF 454

Query: 125 -------ISSNWPYWNRTEGADHFFVVPHDFGACFHYQE--------------EKAIERG 163
                  I   +P+WNR+ G DH +    D GAC+  +E               K     
Sbjct: 455 YKTAYDHIVEQYPFWNRSSGRDHIWFFSWDEGACYAPKEIWDSMMLVHWGNTNSKHNHST 514

Query: 164 ILPLLQRATLVQTFGQRNHVCLN-EGSITIPPYAPPQKMQAHQ---IPPDTPRSIFVYFR 219
                     V +  + NH C +    + +P +  P  +         P   R    YF 
Sbjct: 515 TAYWADNWDSVSSDRRGNHPCFDPYKDLVLPAWKRPDVVSLSSKLWSRPREQRKTLFYFN 574

Query: 220 GLFYDVNNDPEGGY----YARGARAAVWENFKNNPLFD--ISTDHPT----------TYY 263
           G   ++    EGG     Y+ G R  V E F ++P  +  +   H             Y+
Sbjct: 575 G---NLGPAYEGGRPETTYSMGIRQKVAEEFGSSPNKEGKLGKQHAEDVIVTPLRSGNYH 631

Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
           E +  ++FC    G   WS R  ++++ GCIPV+I D I LPF + + +E   + + E++
Sbjct: 632 ESLASSVFCGVMPG-DGWSGRFEDSILQGCIPVVIQDGIFLPFENMLNYESFAVRIREDE 690

Query: 324 VPKLDTILTSIPP---EVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQI 370
           +P L  IL  +     E  L   R +    + R  +  +  +   AF  +
Sbjct: 691 IPNLIKILRGMNETEIEFKLENVRKIWQRFLYRDSILLEAERQKTAFGNV 740


>gi|414880038|tpg|DAA57169.1| TPA: hypothetical protein ZEAMMB73_490377 [Zea mays]
          Length = 497

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 105/362 (29%), Positives = 158/362 (43%), Gaps = 58/362 (16%)

Query: 38  GKLKVYVYELPSKYNKKLL--LKDP-RCLTHMFAAEIFMHRF------------------ 76
           G L+VYVYE+PSK+   LL   +D  R   ++ +    +HR                   
Sbjct: 109 GLLRVYVYEMPSKFTYDLLRLFRDSYRDTDNLTSNGSPVHRLIEQHSIDYWLWADLIALD 168

Query: 77  ---LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWN 133
              LL S +R    EEAD FY P + T       L    +   + R A++ ++   P W 
Sbjct: 169 SQRLLKSVIRVQQQEEADIFYVPFFTTISYF---LLEKQECKALYREALKWVTDQ-PAWQ 224

Query: 134 RTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIP 193
           R+EG DH   V H +         +++++ I  LL        + +   V L +    I 
Sbjct: 225 RSEGRDHVIPVHHPWS---FKSVRRSVKKAIW-LLPDMDSTGNWYKPGQVYLEKD--VIL 278

Query: 194 PYAPPQKMQAHQ--IPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPL 251
           PY P   +  H+  +   + RSI ++FRG          GG      R+ + E  K+   
Sbjct: 279 PYVPNVDLCDHKCVLETQSKRSILLFFRGRL----KRNAGG----KIRSKLVEELKSAK- 329

Query: 252 FDISTDHPTT-------YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL 304
            DI  +  +T         + M+++ FCL P G  P S RL +A+V GCIPVII+D++ L
Sbjct: 330 -DIVIEEGSTGAQGKAAAQDGMRKSFFCLSPAGDTPSSARLFDAIVSGCIPVIISDELEL 388

Query: 305 PFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPA 361
           PF   + + EI +FV+  D  +   L   L  I  + I   Q  L   S  R  L+  PA
Sbjct: 389 PFEGILDYREIALFVSASDAVQPGWLLKYLRGINAKRIREIQSNLVKYS--RHFLYSSPA 446

Query: 362 QP 363
           QP
Sbjct: 447 QP 448


>gi|297739695|emb|CBI29877.3| unnamed protein product [Vitis vinifera]
          Length = 822

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 101/410 (24%), Positives = 172/410 (41%), Gaps = 71/410 (17%)

Query: 28  AGDVLEDDPVGKLK---VYVYELPSKYNKKLL------------LKDPRCLTH----MFA 68
           +G ++  D V K K   +YVY+LP ++N  LL            + D R  T+    ++ 
Sbjct: 363 SGSLVNLDAVVKKKRPLIYVYDLPPEFNSLLLEGRHFKFECVNRIYDDRNATYWTEQLYG 422

Query: 69  AEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLP--LPFKSPRMMRSAIQL-- 124
           A++ ++  +L+SP RTL+ EEAD+F+ P+  +C +        L   +   +RS++ L  
Sbjct: 423 AQMAIYESILASPHRTLDGEEADFFFVPVLDSCIIVRADDAPHLNMHAHGGLRSSLTLEF 482

Query: 125 -------ISSNWPYWNRTEGADHFFVVPHDFGACFHYQE--------------EKAIERG 163
                  I   +P+WNR+ G DH +    D GAC+  +E               K     
Sbjct: 483 YKTAYDHIVEQYPFWNRSSGRDHIWFFSWDEGACYAPKEIWDSMMLVHWGNTNSKHNHST 542

Query: 164 ILPLLQRATLVQTFGQRNHVCLN-EGSITIPPYAPPQKMQAHQ---IPPDTPRSIFVYFR 219
                     V +  + NH C +    + +P +  P  +         P   R    YF 
Sbjct: 543 TAYWADNWDSVSSDRRGNHPCFDPYKDLVLPAWKRPDVVSLSSKLWSRPREQRKTLFYFN 602

Query: 220 GLFYDVNNDPEGGY----YARGARAAVWENFKNNPLFD--ISTDHPT----------TYY 263
           G   ++    EGG     Y+ G R  V E F ++P  +  +   H             Y+
Sbjct: 603 G---NLGPAYEGGRPETTYSMGIRQKVAEEFGSSPNKEGKLGKQHAEDVIVTPLRSGNYH 659

Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
           E +  ++FC    G   WS R  ++++ GCIPV+I D I LPF + + +E   + + E++
Sbjct: 660 ESLASSVFCGVMPG-DGWSGRFEDSILQGCIPVVIQDGIFLPFENMLNYESFAVRIREDE 718

Query: 324 VPKLDTILTSIPP---EVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQI 370
           +P L  IL  +     E  L   R +    + R  +  +  +   AF  +
Sbjct: 719 IPNLIKILRGMNETEIEFKLENVRKIWQRFLYRDSILLEAERQKTAFGNV 768


>gi|147799433|emb|CAN76867.1| hypothetical protein VITISV_012309 [Vitis vinifera]
          Length = 1908

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 96/351 (27%), Positives = 152/351 (43%), Gaps = 48/351 (13%)

Query: 40   LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEI-FMHRFLLSSPVRTLNPEEADWFYTP-- 96
            LKVYVY    K  +K +   P  L  ++A+E  FM     +      +P +A  FY P  
Sbjct: 1065 LKVYVY----KDGEKPIFHQP-ILKGLYASEGWFMKLMERNKXFVVKDPRQAQLFYMPFS 1119

Query: 97   -------IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFG 149
                   +Y       T L       + ++   + I++ + +WNRT G DHF V  HD+ 
Sbjct: 1120 SRMLEYKLYVRNSHNRTNLR------QYLKQYSEKIAAKYRFWNRTGGXDHFLVACHDWA 1173

Query: 150  ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPD 209
                Y+    +E+ I  L   A +   F     V L E  +      P + +      P 
Sbjct: 1174 P---YETRHHMEQCIKALCN-ADVTAGFKIGRDVSLPETYVR-SARNPLRDLGG---KPP 1225

Query: 210  TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP------TTYY 263
            + R I  ++ G  +          Y R      W++   +P   I    P        Y 
Sbjct: 1226 SERHILAFYAGNMHG---------YLRPILLKYWKD--KDPDMKIYGPMPPGVASKMNYI 1274

Query: 264  EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
            + M+ + FC+CP G+   SPR+VEA+ + C+PVII+D+ V PF D + W    + +AE+D
Sbjct: 1275 QHMKSSKFCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFDVLDWGAFSIILAEKD 1334

Query: 324  VPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
            +P L  +L SIP E  L+ Q  L    +++  L+       D FH  L+ +
Sbjct: 1335 IPNLKDVLLSIPNEKYLQMQ--LGVRKVQKHFLWHAKPLKYDLFHMTLHSI 1383



 Score =  101 bits (252), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 108/430 (25%), Positives = 171/430 (39%), Gaps = 85/430 (19%)

Query: 8   LLLAFVSADNTPKIERISGSAGDVLEDDPVGK---------LKVYVYELPSKYNKKLLLK 58
           LL A     N P I    G    V  +  + K         LK+Y+Y    +  +K +  
Sbjct: 295 LLSARSEIQNAPVIRNTPGLYASVYRNVSMFKRSYELMERVLKIYIY----REGEKPIFH 350

Query: 59  DPRCLTHMFAAEI-FMHRFLLSSPVRTLNPEEADWFYTPI---------YPTCDLTPTGL 108
            PR L  ++A+E  FM     +      +P +A  FY P          Y     TP  L
Sbjct: 351 QPR-LRGIYASEGWFMKLIEGNKRFVVRDPRKAHLFYVPFSSKMLRTVFYEQNSSTPRDL 409

Query: 109 PLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIE------- 161
              FK+       + LI+  + +WNRT GADH  V  HD+   +       I        
Sbjct: 410 EKYFKN------YVGLIAGKYRFWNRTGGADHLIVACHDWNPIYRTISTNTIRIKSQAIT 463

Query: 162 ----------------------RGILPLLQRATLVQTFGQRNHVCLNE-------GSITI 192
                                 +G      R T   ++     +C +        G  T 
Sbjct: 464 MPPFIFVGGESTYDLVSGTFSNKGFNSQAPRITRQCSWNSIRALCNSNIASGFKIGKDTT 523

Query: 193 PPYAPPQK----MQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN 248
            P    +K    ++     P + R I  +F G  +          Y R      WEN + 
Sbjct: 524 LPVTYIRKSEDPLKYLGGKPPSQRPILAFFAGSMHG---------YLRPILLQYWENKEQ 574

Query: 249 N-PLFD-ISTDH--PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL 304
           +  +F  +S D    + Y + M+ + +C+C  G+   +PR+VEA+ + C+PVII+D+ V 
Sbjct: 575 DIKIFGPMSRDDGGKSRYRDHMKSSKYCICARGYEVHTPRVVEAIFYECVPVIISDNYVP 634

Query: 305 PFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPG 364
           PF + + WE   +F+ E+DVP L  IL SIP E  L+ Q  +    +++  L+ +     
Sbjct: 635 PFFEILNWEAFAVFILEKDVPNLRNILLSIPEEKYLQMQ--MRVKMVQQHFLWHKKPVKY 692

Query: 365 DAFHQILNGL 374
           D FH IL+ +
Sbjct: 693 DLFHMILHSV 702


>gi|302772499|ref|XP_002969667.1| glycosyltransferase-like protein [Selaginella moellendorffii]
 gi|300162178|gb|EFJ28791.1| glycosyltransferase-like protein [Selaginella moellendorffii]
          Length = 352

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 91/333 (27%), Positives = 152/333 (45%), Gaps = 34/333 (10%)

Query: 53  KKLLLKDPRCLTHMFAAE-IFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLP 111
           K L+ + P+  T ++A+E +F+      +P     P+ A  F+ P +    +        
Sbjct: 35  KPLMHEGPK--TGIYASEGLFIATMERGNPFAVTEPKIATMFFIP-FSLKQMVDYMYDTN 91

Query: 112 FKSPRMMRSAI----QLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPL 167
             S + ++S I    + ++S +PY N T G DHFFV  HD+       E++  +R I+ +
Sbjct: 92  SHSMKNIQSYIAGYLRRLASKYPYMNATNGIDHFFVSCHDWA--LMALEKQDCQRNIVKV 149

Query: 168 LQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNN 227
           +  A   + F     V L E  +    ++P  +  +        R    +F G  +    
Sbjct: 150 VCNADSSRGFNTSRDVSLPETRVRQGKHSPIIRDIS-----GMDRPYLAFFAGQMHG--- 201

Query: 228 DPEGGYYARGARAAVWENFKNNPLFDI------STDHPTTYYEDMQRAIFCLCPLGWAPW 281
                   R    A W++   +P   I      S     +Y E M+ + +C+C  G+   
Sbjct: 202 ------KLRPVLLAHWKD--KDPEMKIYEVLPPSVAERISYSEHMRLSKYCICAAGFEVN 253

Query: 282 SPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILR 341
           SPRLVEA+V  C+PVI+AD+ VLPF++ I W+ I + VAE+DV  L  IL  IP      
Sbjct: 254 SPRLVEAIVNECVPVILADNFVLPFSEVINWDSISVTVAEKDVANLKAILAGIPLRRYKE 313

Query: 342 KQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
            Q  L +  +KR  ++    +  D F+ I++ L
Sbjct: 314 MQARLKH--VKRHFVWKNSPEKYDIFNMIVHSL 344


>gi|384252594|gb|EIE26070.1| exostosin-domain-containing protein [Coccomyxa subellipsoidea
           C-169]
          Length = 898

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 101/391 (25%), Positives = 160/391 (40%), Gaps = 92/391 (23%)

Query: 42  VYVYELPSKYNKKLL---LKDPRCLTHMFA--------------AEIFMHRFL--LSSPV 82
           +YVY+LP+++N ++L   L    C+  +F                  F  R L  LSS  
Sbjct: 448 IYVYDLPAEFNSRMLQYRLNKRECVWRLFKDNNESYINSWTYGIESAFHERLLQALSSEH 507

Query: 83  RTLNPEEADWFYTPIYPTCDLTP---------------------------TGLPLPFKSP 115
           RTL+PEEAD+F+ P+Y +C L P                           TG     ++ 
Sbjct: 508 RTLDPEEADFFFMPVYTSCFLHPVWGYVDHPWYYGPTIDCRRDGDMQICQTGANRVMQAM 567

Query: 116 RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ 175
            M+  A + +  N P+W R  G DH +++ HD G+C+  +E +     IL    R  +  
Sbjct: 568 FMLLEAQKWVEVNHPWWRRKGGRDHIWLITHDEGSCWAPKEIRL--SIILSHWGRKDVNH 625

Query: 176 TFGQ--------------------------RNHVCLNE-GSITIPPYAPPQKMQAHQIP- 207
           T                             + H C +    + IP    P +  A+  P 
Sbjct: 626 TSNSAFKPWDNYTQEVIHPEWWPEGYTHHIKGHACYDPIKDLIIPNLKHPAEF-ANFSPL 684

Query: 208 ---PDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNP-------LFDISTD 257
              P  PR I   FRG   DV       +Y+RG R  ++   + +        L     D
Sbjct: 685 VGHPQPPRDILFLFRG---DVGKH-RLPHYSRGIRQRLFALAQEHDWAGRHAILIGDRDD 740

Query: 258 HPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGM 317
               Y E + R+ FCL   G   +SPR  +A++ GC+PV++ D++   F+  + W    +
Sbjct: 741 VAGDYSELLTRSKFCLVAPG-DGFSPRAEDAILHGCVPVVVMDEVDPVFSSILDWSAFSL 799

Query: 318 FVAEEDVPKLDTILTSIPPEVILRKQRLLAN 348
            +AE D+ +L  IL ++P   +   QR L N
Sbjct: 800 RIAEADIEQLPQILLAVPEARLQAMQRSLRN 830


>gi|449466083|ref|XP_004150756.1| PREDICTED: probable glycosyltransferase At3g07620-like [Cucumis
           sativus]
          Length = 341

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 95/352 (26%), Positives = 154/352 (43%), Gaps = 51/352 (14%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTL-NPEEADWFYTPIY 98
           LKVYVY+      +K +   P  LT ++A+E +  + L  +    + +PE+A  FY P  
Sbjct: 13  LKVYVYD----EGEKPIFHQP-ILTGIYASEGWFMKLLEDNKKFVVKDPEKAHLFYLPF- 66

Query: 99  PTCDLTPTGLPLPFKSPRMM----RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHY 154
            +     +     F++ R +    ++ I +I   + +WN+  G+DHF V  HD+      
Sbjct: 67  -SSQFLRSAFGNKFRNKRDLQKPLKNYIDVIGKKYRFWNKNGGSDHFLVACHDWAPKLTK 125

Query: 155 QEEKAIER--------GILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQI 206
           +  K   R        G   + +  +L  TF     V   E  IT     PP        
Sbjct: 126 RLVKNCIRALCNANGAGDFEIGKDTSLPVTF-----VHSTEDLITKIGGKPP-------- 172

Query: 207 PPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFD----ISTDHPTTY 262
              + R+   +F G  +          Y R      WEN + + +       S +    Y
Sbjct: 173 ---SERTTLAFFAGSMHG---------YLRPILLHYWENKEPDMMIVGPMPNSIEGKNAY 220

Query: 263 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
            E M+ + +C+C  G+   SPR++EA++  CIPVII+D+ V P  + + WE   +FV E 
Sbjct: 221 MEQMKSSKYCICARGYQVHSPRVIEAILNECIPVIISDNYVPPLFEVLNWESFSVFVKER 280

Query: 323 DVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           ++P L  IL SIP E   R          +  +   +PA+  DAFH IL+ +
Sbjct: 281 EIPNLRDILLSIPEEN-YRAMHSRVKMVQQHFLWHEKPAK-YDAFHMILHSI 330


>gi|255546377|ref|XP_002514248.1| catalytic, putative [Ricinus communis]
 gi|223546704|gb|EEF48202.1| catalytic, putative [Ricinus communis]
          Length = 676

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 95/351 (27%), Positives = 151/351 (43%), Gaps = 48/351 (13%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEI-FMHRFLLSSPVRTLNPEEADWFYTP-- 96
           LKVY+Y    K  KK +   P  +  ++A+E  FM     +      +P  A  FY P  
Sbjct: 347 LKVYIY----KDGKKPIFHLP-IMKGLYASEGWFMKLMQGNKHFLVKDPRRAHLFYMPFS 401

Query: 97  -------IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFG 149
                  +Y       T L       + ++   + I++ +P+WNRT+GADHF V  HD+ 
Sbjct: 402 SRMLEYTLYVRNSHNRTNLR------QYLKDYSEKIAAKYPFWNRTDGADHFLVACHDWA 455

Query: 150 ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPD 209
               Y+    +E  I  L   A +   F     + L E  +     +    ++     P 
Sbjct: 456 P---YETRHHMEHCIKALCN-ADVTAGFKIGRDISLPETYVR----SARNPLRDLGGKPP 507

Query: 210 TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP------TTYY 263
           + R I  ++ G  +          Y R      W++   +P   I    P        Y 
Sbjct: 508 SQRHILAFYAGSMHG---------YLRPILLKYWKD--KDPSMKIFGPMPPGVASKMNYI 556

Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
           + M+ + +C+CP G+   SPR+VEA+ + C+PVII+D+ V PF +   W    + +AE+D
Sbjct: 557 QHMKSSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEVFNWGAFSLILAEKD 616

Query: 324 VPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           +P L  IL SIP E  L  Q L      K  +  P P +  D F+  L+ +
Sbjct: 617 IPNLKEILLSIPEEKYLEMQ-LGVRKVQKHFLWHPSPMK-YDLFYMTLHAI 665


>gi|302851968|ref|XP_002957506.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
 gi|300257148|gb|EFJ41400.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
          Length = 706

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 93/375 (24%), Positives = 150/375 (40%), Gaps = 71/375 (18%)

Query: 39  KLKVYVYELPSKYN-KKLLLKDPRCLTHMFAAEI-FMHRFLLSSPVRTLNPEEADWFYTP 96
           KLK+YVY+LP + + +K    +P  L  M+ AE+ FM + L    VRT NP EA+ F  P
Sbjct: 250 KLKIYVYDLPERVSYRKPWHDEPALLDTMYLAELLFMEQLLGDWSVRTENPWEANLFVLP 309

Query: 97  IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGAC----- 151
            Y        G P      +   +    + SN+P+WN T G +H     +D G C     
Sbjct: 310 TYTIYYTGNIGFP-----AKHFANVFNYVRSNYPFWNLTGGRNHVAFATNDRGCCDLYKL 364

Query: 152 ---------------FHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYA 196
                          F    + A+E       + A    + G        +G  T+    
Sbjct: 365 ARSNPELQHPIKVVHFSQTSDPALEAAFREFKRLAVHQTSGGAEGEAMEEDGGGTVMQEL 424

Query: 197 PPQKMQAHQIPPDTPRSIFVYFRGLF-------------------------YD-----VN 226
           P   ++    PP T  ++ +     F                         YD     V+
Sbjct: 425 PEALVRFRGFPPYTLEALRMEREPCFRPEQDVAVPNYLERGWIGRLQEAYAYDREGNAVH 484

Query: 227 NDPE-------GGY------YARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCL 273
            D +        GY      Y+ G R  +   + N    D++ +      E M ++ FCL
Sbjct: 485 RDKQRPYLFYFNGYSKPDMAYSGGVRQGLLSMYHNLTRGDVAINPGCCTAEYMLQSRFCL 544

Query: 274 CPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTS 333
           CPLG+  W  RL +A+  GC+PVI+ D     F D +P+E+  + +   ++ +L  +L +
Sbjct: 545 CPLGYG-WGIRLTQAMQSGCVPVIVQDHTYSAFWDLLPYEKFSVRINRHNLHRLFDLLDA 603

Query: 334 IPPEVILRKQRLLAN 348
           + PE +   Q+ LA+
Sbjct: 604 VTPEQLKDLQKGLAD 618


>gi|255571564|ref|XP_002526728.1| catalytic, putative [Ricinus communis]
 gi|223533917|gb|EEF35642.1| catalytic, putative [Ricinus communis]
          Length = 728

 Score =  105 bits (261), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 86/313 (27%), Positives = 138/313 (44%), Gaps = 51/313 (16%)

Query: 65  HMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLP--LPFKSPRMMRSAI 122
           H++ A++ ++  LL+SP RTLN EEAD+F+ PI  +C +T       L  +    +RS++
Sbjct: 326 HLYGAQMALYESLLASPYRTLNGEEADFFFVPILDSCIITRADDAPHLSMQDHMGLRSSL 385

Query: 123 QL---------ISSNWPYWNRTEGADHFFVVPHDFGACFHYQE--------------EKA 159
            L         I  ++PYWNRT G DH +    D GAC+  +E               K 
Sbjct: 386 TLEYYRKAYDHIVEHYPYWNRTSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWGNTNSKH 445

Query: 160 IERGILPLLQRATLVQTFGQRNHVCLN-EGSITIPPYAPPQ------KMQAHQIPPDTPR 212
                         + +  +  H C + +  + +P +  P       K+ A    P   R
Sbjct: 446 NHSTTAYWADNWDKISSDRRGRHPCFDPDKDLVLPAWKRPDVSALSTKLWAR---PLERR 502

Query: 213 SIFVYFRGLFYDV--NNDPEGGYYARGARAAVWENFKNNPLFD--ISTDH---------- 258
               +F G       N  PE   Y+ G R  + E F ++P  D  +   H          
Sbjct: 503 KTLFFFNGNLGPAYPNGRPELS-YSMGIRQKLAEEFGSSPNKDGKLGKQHAEDVIVTPLR 561

Query: 259 PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
              Y+ED+  +IFC    G   WS R+ ++++ GCIPVII D I LP+ + + +E   + 
Sbjct: 562 SENYHEDLASSIFCGVLPG-DGWSGRMEDSILQGCIPVIIQDGIFLPYENVLNYESFAVR 620

Query: 319 VAEEDVPKLDTIL 331
           + E+++  L  IL
Sbjct: 621 IREDEISNLLKIL 633


>gi|15231488|ref|NP_187419.1| putative glycosyltransferase [Arabidopsis thaliana]
 gi|75207507|sp|Q9SSE8.1|GLYT1_ARATH RecName: Full=Probable glycosyltransferase At3g07620
 gi|6466945|gb|AAF13080.1|AC009176_7 hypothetical protein [Arabidopsis thaliana]
 gi|332641055|gb|AEE74576.1| putative glycosyltransferase [Arabidopsis thaliana]
          Length = 470

 Score =  105 bits (261), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 99/359 (27%), Positives = 158/359 (44%), Gaps = 65/359 (18%)

Query: 40  LKVYVYEL--PSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPI 97
            K+YVYE   P  ++  L  KD   +  +F    FM   +L    RT +P++A  ++ P 
Sbjct: 143 FKIYVYEEGDPPIFHYGLC-KDIYSMEGLFLN--FMENDVLK--YRTRDPDKAHVYFLPF 197

Query: 98  YPTCDLTPTGLPLPFKSP---RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHY 154
                L     P+        R++   +Q+IS  +PYWN ++G DHF +  HD+G     
Sbjct: 198 SVVMILHHLFDPVVRDKAVLERVIADYVQIISKKYPYWNTSDGFDHFMLSCHDWG----- 252

Query: 155 QEEKAIERGILPLLQRAT--LVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPR 212
                          RAT  + + F     V  N     I  Y  P+K        D P 
Sbjct: 253 --------------HRATWYVKKLFFNSIRVLCNAN---ISEYFNPEK--------DAP- 286

Query: 213 SIFVYFRGLFYDVNNDPEG---------GYYARGARAAV-------WENFKNNPLFDIST 256
             F     L  D+NN   G          ++A  +   +       W+    + L   + 
Sbjct: 287 --FPEINLLTGDINNLTGGLDPISRTTLAFFAGKSHGKIRPVLLNHWKEKDKDILVYENL 344

Query: 257 DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG 316
                Y E M+++ FC+CP G    SPR+ EA+  GC+PV+I+++ VLPF+D + WE+  
Sbjct: 345 PDGLDYTEMMRKSRFCICPSGHEVASPRVPEAIYSGCVPVLISENYVLPFSDVLNWEKFS 404

Query: 317 MFVAEEDVPKLDTILTSIPPEVILRKQRLLAN-PSMKRAMLFPQPAQPGDAFHQILNGL 374
           + V+ +++P+L  IL  IP E   R  RL      +KR +L   P +  D F+ I++ +
Sbjct: 405 VSVSVKEIPELKRILMDIPEE---RYMRLYEGVKKVKRHILVNDPPKRYDVFNMIIHSI 460


>gi|168019297|ref|XP_001762181.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162686585|gb|EDQ72973.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 344

 Score =  105 bits (261), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 98/354 (27%), Positives = 152/354 (42%), Gaps = 57/354 (16%)

Query: 39  KLKVYVY--ELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP 96
           K KVYVY    P  Y      + PR LT  +++E +  + L  S   T +   AD F+ P
Sbjct: 21  KFKVYVYPDGDPETY-----YQTPRKLTGKYSSEGYFFQNLRESRFVTNDSAAADLFFLP 75

Query: 97  IYPTC-DLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQ 155
           +  +C  +   GL    K   ++R+ ++ +   +P+WNRT GADHFFV  HD G     +
Sbjct: 76  V--SCHKMRGKGLSYE-KMADIVRAYVESLIIKYPFWNRTVGADHFFVTCHDVGVRATAK 132

Query: 156 EEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIF 215
            E  ++  I     R     ++        N   I     A PQ +Q   +P        
Sbjct: 133 VEHLVKNSI-----RVVCSPSY--------NGSFIPHKDVALPQVLQPFPLPAGGD---- 175

Query: 216 VYFRGLFYDVNNDPEGGYYARGARAAVWENF------------KNNPLFDISTDHPTTYY 263
                   D++N    G++A    + V  N              NN L   + D+   Y 
Sbjct: 176 --------DIHNRTVLGFWAGHRNSKVRVNLADAWQYDPILFVANNRLNRSTGDY--IYQ 225

Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIP-VIIADDIVLPFADAIPWEEIGMFVAEE 322
               R+ FC+CP G    S R+ E++ +GC+P VI+AD   LPF D + W +  + V E 
Sbjct: 226 NQFYRSKFCICPAGSQVNSARIAESIHYGCVPEVIMADFYDLPFNDILDWRKFSLVVRER 285

Query: 323 DVPKLDTILTSIPPEVILRKQRLL--ANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           +   L  IL +    V ++K R+L      ++R   +  P    DAFH ++  L
Sbjct: 286 EYDNLKKILQA----VTVQKYRMLHAGVRQVRRHFEWHSPPIKYDAFHMVMYEL 335


>gi|356518828|ref|XP_003528079.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
           max]
          Length = 505

 Score =  105 bits (261), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 79/315 (25%), Positives = 135/315 (42%), Gaps = 38/315 (12%)

Query: 72  FMHRFLLSSPVRTLNPEEADWFYTP---------IY---PTCDLTPTGLPLPFKSPRMMR 119
           F+H        RT +P+EA  +Y P         +Y      +L P GL        +++
Sbjct: 205 FIHEMEKGRYYRTYDPDEAFVYYLPFSVVMLVEYVYDRGSNYNLDPLGL--------VVK 256

Query: 120 SAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQ 179
             IQ+I+   P+WNR+ G DH  +  HD+G       +      I  +L  A   + F  
Sbjct: 257 DYIQIIAHKHPFWNRSLGHDHVMLSCHDWGPLVSSYVDHLYNNAI-RVLCNANTSEGFKP 315

Query: 180 RNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGAR 239
              V   E  +         +++     P + R+I  +F G  +          Y R   
Sbjct: 316 AKDVSFPEIKLI------KGEVKGLGGYPPSQRTILAFFAGHLHG---------YIRYLL 360

Query: 240 AAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIA 299
            + W+N   +           +YY  ++ + FCLCP G+   SPR+VEA+   C+PV+I+
Sbjct: 361 LSTWKNKDQDMQIYEELPEGISYYTKLRSSKFCLCPSGYEVASPRVVEAIFAECVPVLIS 420

Query: 300 DDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQ 359
           D  V PF+D + W    + V  +D+P +  IL  I  +  LR  + +    ++R  +  +
Sbjct: 421 DSYVPPFSDVLNWNSFSVQVNVKDIPNIKRILMEISEKQYLRMHKRVKQ--VQRHFVPNE 478

Query: 360 PAQPGDAFHQILNGL 374
           P +  D FH  ++ +
Sbjct: 479 PPKRYDMFHMTVHSI 493


>gi|449511061|ref|XP_004163851.1| PREDICTED: probable glycosyltransferase At3g07620-like [Cucumis
           sativus]
          Length = 341

 Score =  104 bits (260), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 94/354 (26%), Positives = 157/354 (44%), Gaps = 55/354 (15%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTL-NPEEADWFYTPIY 98
           LKVYVY+      +K +   P  LT ++A+E +  + L  +    + +PE+A  FY P  
Sbjct: 13  LKVYVYD----EGEKPIFHQP-ILTGIYASEGWFMKLLEDNKKFVVKDPEKAHLFYLPF- 66

Query: 99  PTCDLTPTGLPLPFKSPRMM----RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHY 154
            +     +     F++ R +    ++ I +I   + +WN+  G+DHF V  HD+      
Sbjct: 67  -SSQFLRSAFGNKFRNKRDLQKPLKNYIDVIGKKYRFWNKNGGSDHFLVACHDWAPKLTK 125

Query: 155 QEEKAIER--------GILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQI 206
           +  K   R        G   + +  +L  TF     V   E  IT     PP        
Sbjct: 126 RLVKNCIRALCNANGAGDFEIGKDTSLPVTF-----VHSTEDLITKIGGKPP-------- 172

Query: 207 PPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFD----ISTDHPTTY 262
              + R+   +F G  +          Y R      WEN + + +       S +    Y
Sbjct: 173 ---SERTTLAFFAGSMHG---------YLRPILLHYWENKEPDMMIVGPMPNSIEGKNAY 220

Query: 263 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
            E M+ + +C+C  G+   SPR++EA++  CIPVII+D+ V P  + + WE   +FV E 
Sbjct: 221 MEQMKSSKYCICARGYQVHSPRVIEAILNECIPVIISDNYVPPLFEVLNWESFSVFVKER 280

Query: 323 DVPKLDTILTSIPPEVILRKQRLLAN--PSMKRAMLFPQPAQPGDAFHQILNGL 374
           ++P L  IL SIP E      R++ +    +++  L+ +     DAFH IL+ +
Sbjct: 281 EIPYLRDILLSIPEE----NYRVMHSRVKMVQQHFLWHEKPAKYDAFHMILHSI 330


>gi|159490314|ref|XP_001703124.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158270754|gb|EDO96589.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 490

 Score =  104 bits (260), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 99/368 (26%), Positives = 143/368 (38%), Gaps = 51/368 (13%)

Query: 42  VYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTC 101
           VYVYELP         K     TH     I  ++ LL S  R  + ++ADW+Y PI    
Sbjct: 105 VYVYELPPDLTTWTNTKRLDRSTH-----IHFYQRLLGSGARIADGDKADWYYIPIRQRM 159

Query: 102 DLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIE 161
                         R +  A+  IS+ +P+WNRT G+ HF +   D GA     E +   
Sbjct: 160 T----------ADSRFLSEAVAYISATYPWWNRTGGSRHFVIHTGDLGA----DETQLGA 205

Query: 162 RGILPLLQ-----RATLVQTFGQRNHVCLNEGSITIPPYAPP--------QKMQAHQIPP 208
           R   P +        T+ + F         +  + IP +  P        ++   H +  
Sbjct: 206 RLQAPNITWLTHWGLTMDKVFSGWKKAHRPDKDVVIPVFLTPGHFKHFGLERTPLHPLMD 265

Query: 209 DTPRSIFVYFRGLFYDVNNDPEGGY----------YARGARAAVWENFKNNPLFDISTDH 258
              R+   +F G        P+ G           Y+ G R  V  +  + P F +    
Sbjct: 266 KQERTTTFFFAGRICGDRKPPKTGSWPNCGPRSPGYSAGVRQLVHHHHWDPPGFKVVLHE 325

Query: 259 PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
           P  Y   +  + FCL PLG      R +     GC+PV IADD+  PF     W + G+ 
Sbjct: 326 PN-YGAALGSSKFCLAPLG-GGHGQRQIIVSFMGCLPVCIADDVYEPFEPQYNWTQFGVR 383

Query: 319 VAEEDVPKLDTILTSIPPEVILRKQRLLANPSMK------RAMLFPQPAQPGDAFHQILN 372
            AE D+P+L TIL S+  +    KQR L   +           LF +  +  DAF   L 
Sbjct: 384 PAESDIPELHTILESVSAKEYAAKQRALRCAAQHFVYSSIVGGLFGEDGRY-DAFETTLE 442

Query: 373 GLARKLPH 380
            L  K  H
Sbjct: 443 VLRVKAEH 450


>gi|147784355|emb|CAN72733.1| hypothetical protein VITISV_033460 [Vitis vinifera]
          Length = 321

 Score =  104 bits (260), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 91/324 (28%), Positives = 141/324 (43%), Gaps = 50/324 (15%)

Query: 68  AAEIFMHRFLLSSPVRTL-NPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLIS 126
             ++F+  FL   P   L N      F+  +   CD T            ++++ +Q + 
Sbjct: 22  GTKLFLRFFLDIPPCNNLMNFPSLVVFFLSLSNLCDATVLQGTSYENMTIIVQNYVQSLM 81

Query: 127 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLN 186
           S +PYWNRT GADHFF+  HD G        +A E   +PLL + ++         VC  
Sbjct: 82  SKYPYWNRTLGADHFFLTCHDXGV-------RATEG--VPLLVKNSI-------RVVCSP 125

Query: 187 EGSITIPPY---APPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYA-------R 236
              +   P+   A PQ +Q   +P                D+ N    G++A       R
Sbjct: 126 SYDVGFIPHKDVALPQVLQPFALPTGGR------------DIKNRTTLGFWAGHRNSKIR 173

Query: 237 GARAAVWEN-----FKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVF 291
              A +WEN      KNN + + +T H   Y     R  FC+CP G    S R+ +++ +
Sbjct: 174 VILARIWENDTELDIKNNRI-NRATGH-LVYQNKFYRTKFCICPGGSQVNSARIADSIHY 231

Query: 292 GCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIP-PEVILRKQRLLANPS 350
           GC+PVI++D   LPF D + W +  + + E DV +L  IL  IP  E I     L+    
Sbjct: 232 GCVPVILSDYYDLPFNDILDWRKFSVILKERDVYRLKYILKDIPDAEFIALHDNLV---K 288

Query: 351 MKRAMLFPQPAQPGDAFHQILNGL 374
           +++   +  P    DAFH ++  L
Sbjct: 289 VQKHFQWNTPPIKYDAFHMVMYEL 312


>gi|242059031|ref|XP_002458661.1| hypothetical protein SORBIDRAFT_03g037670 [Sorghum bicolor]
 gi|241930636|gb|EES03781.1| hypothetical protein SORBIDRAFT_03g037670 [Sorghum bicolor]
          Length = 499

 Score =  104 bits (259), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 103/360 (28%), Positives = 155/360 (43%), Gaps = 58/360 (16%)

Query: 40  LKVYVYELPSKYNKKLL--LKDP-RCLTHMFAAEIFMHRF-------------------- 76
           L+VYVYE+P K+   LL   +D  R   ++ +    +HR                     
Sbjct: 113 LRVYVYEMPGKFTYDLLRLFRDSYRDTDNLTSNGSPVHRLIEQHSIDYWLWADLIAPESQ 172

Query: 77  -LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRT 135
            LL S VR    EEAD FY P + T       L    +   + R A++ ++   P W R+
Sbjct: 173 RLLKSVVRVQRQEEADIFYVPFFTTISYF---LLEKQECKALYREALKWVTDQ-PAWQRS 228

Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
           EG DH   V H +     ++  +   +  + LL        + +   V L +    I PY
Sbjct: 229 EGRDHVIPVHHPWS----FKSVRRFVKKAIWLLPDMDSTGNWYKPGQVYLEKD--VILPY 282

Query: 196 APPQKMQAHQIPPDT--PRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFD 253
            P   +  H+   +T   RSI ++FRG          GG      R+ + E  K+    D
Sbjct: 283 VPNVDLCDHKCVLETQFKRSILLFFRGRL----KRNAGG----KIRSKLVEELKSAE--D 332

Query: 254 ISTDHPTTYY-------EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 306
           I  +  +          + M++++FCL P G  P S RL +A+V GCIPVII+D++ LPF
Sbjct: 333 IVIEEGSAGAQGKAAAQDGMRKSLFCLSPAGDTPSSARLFDAIVSGCIPVIISDELELPF 392

Query: 307 ADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP 363
              + + EI +FV+  D  +   L   L  I  + I   Q  L   S  R  L+  PAQP
Sbjct: 393 EGILDYREIALFVSSSDAVQPGWLVKYLRGIDAKRIREIQSNLVKYS--RHFLYSSPAQP 450


>gi|297805042|ref|XP_002870405.1| hypothetical protein ARALYDRAFT_493575 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297316241|gb|EFH46664.1| hypothetical protein ARALYDRAFT_493575 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 509

 Score =  104 bits (259), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 84/305 (27%), Positives = 134/305 (43%), Gaps = 30/305 (9%)

Query: 80  SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLP----FKSPRMMR---SAIQLISSNWPYW 132
           S  R  +PE+A  F+ P      +     P+     F   R+ R     + ++++   YW
Sbjct: 214 SRFRADHPEDAHVFFIPFSVAKVIHFVYKPITSVEGFSRARLHRLIEDYVDVVATKHRYW 273

Query: 133 NRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITI 192
           NR++G DHF V  HD+         K  E+ I  L    T   + G R +V      ++I
Sbjct: 274 NRSKGGDHFMVSCHDWAPDVIDGNPKLFEKFIRALCNANT---SEGFRPNV-----DVSI 325

Query: 193 PPYAPPQKMQAHQIPPDTPR--SIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNP 250
           P    P+          +PR  SI  +F G  +            R      W+   N  
Sbjct: 326 PEIYLPKGKLGPSFLGKSPRIRSILAFFAGRSHG---------EIRKILFKHWKEMDNEV 376

Query: 251 LFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAI 310
                      Y + M  + FCLCP GW   SPR VEA+  GC+PVII+D+  LPF+D +
Sbjct: 377 QVYDRLPPGKDYTKTMGMSKFCLCPSGWEVASPREVEAIYAGCVPVIISDNYSLPFSDVL 436

Query: 311 PWEEIGMFVAEEDVPKLDTILTSIPPEVILRK-QRLLANPSMKRAMLFPQPAQPGDAFHQ 369
            W+   + +    +P++ TIL S+     L+  +R+L    +K+  +  +PA+P D  H 
Sbjct: 437 NWDSFSIQIPVSRIPEIKTILQSVSLVRYLKMYKRVL---EVKQHFVLNRPAKPYDVMHM 493

Query: 370 ILNGL 374
           +L+ +
Sbjct: 494 MLHSI 498


>gi|42568129|ref|NP_198314.2| xylogalacturonan beta-1,3-xylosyltransferase [Arabidopsis thaliana]
 gi|292630939|sp|Q94AA9.2|XGD1_ARATH RecName: Full=Xylogalacturonan beta-1,3-xylosyltransferase;
           AltName: Full=Protein XYLOGALACTURONAN DEFICIENT 1
 gi|332006509|gb|AED93892.1| xylogalacturonan beta-1,3-xylosyltransferase [Arabidopsis thaliana]
          Length = 500

 Score =  104 bits (259), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 74/262 (28%), Positives = 120/262 (45%), Gaps = 23/262 (8%)

Query: 116 RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ 175
           R++   + ++++  PYWNR++G DHF V  HD+         K  E+ I  L    T   
Sbjct: 248 RLIEDYVDVVATKHPYWNRSQGGDHFMVSCHDWAPDVIDGNPKLFEKFIRGLCNANT--- 304

Query: 176 TFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPR--SIFVYFRGLFYDVNNDPEGGY 233
           + G R +V      ++IP    P+          +PR  SI  +F G  +          
Sbjct: 305 SEGFRPNV-----DVSIPEIYLPKGKLGPSFLGKSPRVRSILAFFAGRSHG--------- 350

Query: 234 YARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGC 293
             R      W+   N             Y + M  + FCLCP GW   SPR VEA+  GC
Sbjct: 351 EIRKILFQHWKEMDNEVQVYDRLPPGKDYTKTMGMSKFCLCPSGWEVASPREVEAIYAGC 410

Query: 294 IPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRK-QRLLANPSMK 352
           +PVII+D+  LPF+D + W+   + +    + ++ TIL S+     L+  +R+L    +K
Sbjct: 411 VPVIISDNYSLPFSDVLNWDSFSIQIPVSRIKEIKTILQSVSLVRYLKMYKRVL---EVK 467

Query: 353 RAMLFPQPAQPGDAFHQILNGL 374
           +  +  +PA+P D  H +L+ +
Sbjct: 468 QHFVLNRPAKPYDVMHMMLHSI 489


>gi|414883870|tpg|DAA59884.1| TPA: hypothetical protein ZEAMMB73_690759, partial [Zea mays]
          Length = 577

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 98/367 (26%), Positives = 153/367 (41%), Gaps = 64/367 (17%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP------VRTLNPEEADW 92
           +LKVYVYE      +  +  D  C +       F+H     +       +RT +P  A  
Sbjct: 233 QLKVYVYE----EGEPPVFHDGPCRSIYSTEGRFIHSMETETEAEEGRRLRTRDPARAHV 288

Query: 93  FYTPIYPTCDLTPTGLP-----LPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHD 147
           F+ P      +     P      P K  R +   ++++SS +PYWNR+ GADHF +  HD
Sbjct: 289 FFLPFSVVKMVQTIYEPGSRDMAPLK--RTVADYVRVLSSKYPYWNRSLGADHFMLSCHD 346

Query: 148 FGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGS---------ITIPPYAPP 198
           +G                P +  A   Q FG    V  N  +         +++P     
Sbjct: 347 WG----------------PYVSSAN-AQLFGNSIRVLCNANTSEGFDPARDVSLPQVNLR 389

Query: 199 QKMQAHQI--PPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPL---FD 253
                 Q+  P  + R +  +F G     N+ P      R A  A W             
Sbjct: 390 SDAVERQVGGPSASRRPVLAFFAG----GNHGP-----VRPALLAHWGPGGRRGGDPDVR 440

Query: 254 ISTDHPT-----TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD-DIVLPFA 307
           +S   P      +Y + M+R+ FCLCP G+   SPRL EA+  GC+PV++ D +  LPFA
Sbjct: 441 VSEYLPRGGGAPSYADMMRRSRFCLCPGGYEVASPRLAEALYLGCVPVVVDDGEYALPFA 500

Query: 308 DAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAF 367
           D + W+   + +   D+P+L  IL ++ P   + + +       +  ML   P +  DAF
Sbjct: 501 DVLDWDAFALRLRVADIPRLKEILAAVSPRQYI-RMQRRVRMVRRHFMLHGGPPRRYDAF 559

Query: 368 HQILNGL 374
           H IL+ +
Sbjct: 560 HMILHSV 566


>gi|356500491|ref|XP_003519065.1| PREDICTED: uncharacterized protein LOC100783624 [Glycine max]
          Length = 795

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 94/376 (25%), Positives = 161/376 (42%), Gaps = 65/376 (17%)

Query: 15  ADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLL------------LKDPRC 62
           AD+T   E++      V +  P+    VYVY+LP ++N  LL            + D   
Sbjct: 331 ADDTHANEKMINLNAVVAKKRPL----VYVYDLPPEFNSLLLEGRHFKLECVNRIYDGNN 386

Query: 63  LT----HMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLP--LPFKSPR 116
           +T     ++ A+I ++  LL+SP RTLN EEAD+F+ P+  +C +T       L  +   
Sbjct: 387 ITVWTDQLYGAQIALYESLLASPHRTLNGEEADFFFVPVLDSCIITRADDAPHLSMQEHM 446

Query: 117 MMRSAIQL---------ISSNWPYWNRTEGADHFFVVPHDFGACFH----YQEEKAIERG 163
            +RS++ L         I   +PYWNR+ G DH +    D GAC+     +     +  G
Sbjct: 447 GLRSSLTLEYYKKAYIHIVEQYPYWNRSSGRDHVWSFSWDEGACYAPKEIWNSMMLVHWG 506

Query: 164 ILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQ--AHQIP------------PD 209
                   +    +        ++   T P + P + +   A ++P              
Sbjct: 507 NTNTKHNHSTTAYWADNWDKISSDKRGTHPCFDPDKDLVLPAWKVPDANVLTSKLWAWSH 566

Query: 210 TPRSIFVYFRGLFYDV--NNDPEGGYYARGARAAVWENFKNNPLFD------------IS 255
             R    YF G       +  PE   Y+ G R  + E F ++P  D            ++
Sbjct: 567 EKRKTLFYFNGNLGPAYPHGRPE-DTYSMGIRQKLAEEFGSSPNKDGKLGKQHAKDVIVT 625

Query: 256 TDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEI 315
            +    Y+ D+  ++FC    G   WS R+ ++++ GCIPV+I D I LP+ + + ++  
Sbjct: 626 PERSENYHLDLASSVFCGVFPG-DGWSGRMEDSILQGCIPVVIQDGIFLPYENVLNYDSF 684

Query: 316 GMFVAEEDVPKLDTIL 331
            + + E ++P L  IL
Sbjct: 685 AVRIPEAEIPNLIKIL 700


>gi|159470401|ref|XP_001693348.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158277606|gb|EDP03374.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 356

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 91/331 (27%), Positives = 146/331 (44%), Gaps = 40/331 (12%)

Query: 68  AAEIFMHRFLLSSPVRTLNPEEADWFYTP---IYPTCDLTPTGLPLPFKSPRMMRSAIQL 124
           A E F+  F+  + VRT NP EA  FY P    + + +L P    L         + +  
Sbjct: 27  AYEYFLKYFITDNIVRTENPYEAHLFYVPALNFFYSGNLRPPEYHL--------EAVMDH 78

Query: 125 ISSNWPYWNRTEGADHFFVVPHDFGACFHYQE-EKAIERGILPLLQRATLVQTFGQRN-- 181
           + + WP++NR+ G DHF  +  D GAC   ++ + ++ + +   +Q+  L  T  + N  
Sbjct: 79  VKTAWPFYNRSGGRDHFIFLTGDRGACHMPRDMQDSMIKVVHFGMQKQGLNWTSMEHNKE 138

Query: 182 HVCLN-EGSITIPPYAPPQKMQAHQIPPDTPRSIFVYF------------RGLFYDVNND 228
           + C+     + +PP+    K       P  P     YF            R + +     
Sbjct: 139 YGCIRMRQDLVVPPHPNDHK-------PLWPVGAAAYFQRIAAAGGHDAGRNITFLFAGG 191

Query: 229 PEGGYYARGARAAVWENFKN--NPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLV 286
              G Y+ G R AV     N  +P           Y + + R+ FCL   G   W  R++
Sbjct: 192 VGEGEYSGGTRQAVRALLLNITDPAIMFVEGRRDDYVDLLWRSQFCLAAYGHG-WGIRVM 250

Query: 287 EAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLL 346
           +++ FGCIPVII D +   F D +P+EE  + +   DVP+L  +L S  PE +   +  L
Sbjct: 251 QSIQFGCIPVIIQDHVYQAFEDFLPYEEFSVRLPLRDVPRLLELLRSYSPEQLAALR--L 308

Query: 347 ANPSMKRAMLFPQPAQPGDAFHQILNGLARK 377
                 RA ++ +  Q G+AF   L GL R+
Sbjct: 309 GMAKYFRAFIWNRD-QGGEAFEWTLAGLQRR 338


>gi|297820564|ref|XP_002878165.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297324003|gb|EFH54424.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 792

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 96/368 (26%), Positives = 158/368 (42%), Gaps = 68/368 (18%)

Query: 42  VYVYELPSKYNKKLL------------LKDPRCLT----HMFAAEIFMHRFLLSSPVRTL 85
           +Y+Y+LP  +N  L+            + D R  T    +++ +++  +  +L++  RTL
Sbjct: 349 IYIYDLPPDFNSLLIEGRHFKLECVNRIYDERNATVWTDYLYGSQMAFYENILATAHRTL 408

Query: 86  NPEEADWFYTPIYPTC-----DLTP-------TGLPLPFKSPRMMRSAIQLISSNWPYWN 133
           N EEAD+F+ P+  +C     D  P       TGL   F +    + A + I   +PYWN
Sbjct: 409 NGEEADFFFVPVLDSCIINRADDAPHINMQNHTGLRSSF-TLEFYKRAYEHIVEKYPYWN 467

Query: 134 RTEGADHFFVVPHDFGACFH----YQEEKAIERGILPLLQRATLVQTFGQR--------- 180
           R+ G DH +    D GAC+     +     +  G        +    +G           
Sbjct: 468 RSAGRDHIWFFSWDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYWGDNWDDISDERR 527

Query: 181 -NHVCLN-EGSITIPPYAPPQ--KMQA-HQIPPDTPRSIFVYFRGLF---YDVNNDPEGG 232
            +H C +    + IP +  P    M+A +   P   R    YF G     Y+    PE  
Sbjct: 528 GDHPCFDPRKDLVIPAWKVPDPYSMRANYWARPREKRKTLFYFNGNLGPAYE-KGRPEDS 586

Query: 233 YYARGARAAVWENFKNNPLFD--ISTDH----------PTTYYEDMQRAIFCLCPLGWAP 280
            Y+ G R  + E F ++P  +  +   H             Y++D+  +IFC    G   
Sbjct: 587 -YSMGIRQKLAEEFGSSPNKEGKLGKQHAEDVIVTPLRSDNYHKDIANSIFCGAFPG-DG 644

Query: 281 WSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVIL 340
           WS R+ ++++ GC+PVII D I LP+ + + +E   + V+E+D+P L   L       I 
Sbjct: 645 WSGRMEDSILQGCVPVIIQDGIYLPYENMLNYESFAVRVSEDDIPNLINTLRGFSETEI- 703

Query: 341 RKQRLLAN 348
             Q  LAN
Sbjct: 704 --QFRLAN 709


>gi|302824287|ref|XP_002993788.1| glycosyltransferase-like protein [Selaginella moellendorffii]
 gi|300138384|gb|EFJ05154.1| glycosyltransferase-like protein [Selaginella moellendorffii]
          Length = 353

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 99/359 (27%), Positives = 152/359 (42%), Gaps = 58/359 (16%)

Query: 29  GDVLEDD---PVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTL 85
            DV ++D      +LKV+VY  P  Y K         L   +A+E +  R L+ S   T 
Sbjct: 31  ADVFQEDYEQMEQQLKVFVYPDPVVYTK---------LAGKYASEGYFFRNLMESRFVTT 81

Query: 86  NPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVP 145
           +PEEA  F+ PI     L   GL     S  +  S ++ + + +PYWNRT GADHFFV  
Sbjct: 82  DPEEAQLFFVPI-SCARLKEEGLDHDEISDNVA-SFVESVIAKFPYWNRTMGADHFFVTC 139

Query: 146 HDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQ 205
           H+ G                    RAT       +N + +   S    P+ P + +   Q
Sbjct: 140 HEIGT-------------------RATSKVAHLVKNSIRVVCASSYSGPFIPHKDVALPQ 180

Query: 206 I--PPDTPRSIFVYFRGLFYDVNNDPEGGYYARGA----RAAVWENFKNNPLFDISTDHP 259
           I  P  +PR           D       G++A  A    R  + + ++ +    IST H 
Sbjct: 181 ILQPFPSPRG--------GDDTEKRETLGFWAGPANSKTRILLTKTWQEDSDMVISTKHV 232

Query: 260 TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIP----VIIADDIVLPFADAIPWEEI 315
               +   R+ FC+CP G    + R+VE++ FGC+P    +I++D   LPF D + W + 
Sbjct: 233 GM--QQFYRSKFCICPSGTRVSTARIVESIHFGCVPGKSYLILSDHYDLPFNDVLDWRKF 290

Query: 316 GMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
            + + E+D   L   L  + P   L +  L A    +    +  P    D FH ++  L
Sbjct: 291 AVILPEQDAGTLKDAL-ELAPYATLHRNLLQAQAHFE----WHSPPIKYDTFHMVMYEL 344


>gi|15146187|gb|AAK83577.1| AT5g33290/F19N2_10 [Arabidopsis thaliana]
 gi|18252279|gb|AAL62020.1| AT5g33290/F19N2_10 [Arabidopsis thaliana]
          Length = 341

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 74/262 (28%), Positives = 120/262 (45%), Gaps = 23/262 (8%)

Query: 116 RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ 175
           R++   + ++++  PYWNR++G DHF V  HD+         K  E+ I  L    T   
Sbjct: 89  RLIEDYVDVVATKHPYWNRSQGGDHFMVSCHDWAPDEIDGNPKLFEKFIRGLCNANT--- 145

Query: 176 TFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPR--SIFVYFRGLFYDVNNDPEGGY 233
           + G R +V      ++IP    P+          +PR  SI  +F G  +          
Sbjct: 146 SEGFRPNV-----DVSIPEIYLPKGKLGPSFLGKSPRVRSILAFFAGRSHG--------- 191

Query: 234 YARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGC 293
             R      W+   N             Y + M  + FCLCP GW   SPR VEA+  GC
Sbjct: 192 EIRKILFQHWKEMDNEVQVYDRLPPGKDYTKTMGMSKFCLCPSGWEVASPREVEAIYAGC 251

Query: 294 IPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRK-QRLLANPSMK 352
           +PVII+D+  LPF+D + W+   + +    + ++ TIL S+     L+  +R+L    +K
Sbjct: 252 VPVIISDNYSLPFSDVLNWDSFSIQIPVSRIKEIKTILQSVSLVRYLKMYKRVL---EVK 308

Query: 353 RAMLFPQPAQPGDAFHQILNGL 374
           +  +  +PA+P D  H +L+ +
Sbjct: 309 QHFVLNRPAKPYDVMHMMLHSI 330


>gi|356518348|ref|XP_003527841.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
           At5g03795-like [Glycine max]
          Length = 549

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 88/322 (27%), Positives = 139/322 (43%), Gaps = 33/322 (10%)

Query: 63  LTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRM---MR 119
           L  ++A+E +  + + ++   T +P +A  FY P           +    +   +   M+
Sbjct: 240 LDGIYASEGWFMKLMEANKXVTGDPGKAHLFYIPFSSRLLQQTLYVRNSHRHSNLIEYMK 299

Query: 120 SAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQ 179
           + +++I+  +P+WNRT GADHF V  HD+       E +      +  L  A +   F  
Sbjct: 300 NYVKMIAGKYPFWNRTSGADHFVVACHDWAPA----ETRGRMLSSIRALCNADIEVGFKI 355

Query: 180 RNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGAR 239
              V L E  I      P + ++     P + R I  +F G  +          Y     
Sbjct: 356 GKDVSLPETYIR-SSENPVKNIEG---DPPSQRPILAFFAGGLH---------VYVXPIL 402

Query: 240 AAVWENFKNNPLFDISTDHP-----TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCI 294
              WEN    P   IS   P       Y + M+ + FC+   G    SPR+VEA+   CI
Sbjct: 403 LKHWEN--KEPDMKISGPLPHVRGNVNYIQFMKSSKFCIHARGHEVNSPRVVEAIFHECI 460

Query: 295 PVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRA 354
           PVII+D+ + PF + + WE   +FV EE++P L  IL SI  E  L   + +     K  
Sbjct: 461 PVIISDNFIPPFFEILNWESFAVFVTEEEIPNLRNILLSISEERYLEMHKRVK----KVQ 516

Query: 355 MLFPQPAQP--GDAFHQILNGL 374
             FP  A+P   D  H +L+ +
Sbjct: 517 EHFPWHAEPVKDDLSHMLLHSI 538


>gi|224129246|ref|XP_002320537.1| predicted protein [Populus trichocarpa]
 gi|222861310|gb|EEE98852.1| predicted protein [Populus trichocarpa]
          Length = 395

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 76/314 (24%), Positives = 139/314 (44%), Gaps = 34/314 (10%)

Query: 71  IFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSA---IQLISS 127
           +F+     ++  RT NP+EA  ++ P      +     P+      + R+    +++IS 
Sbjct: 95  VFLSLMETNTKFRTSNPDEAHVYFLPFSVVMIIEHLFHPIIRDKAVLERTVSDYVRIISH 154

Query: 128 NWPYWNRTEGADHFFVVPHDFG--ACFHYQEEKAIERGILPLLQRATLVQTFGQRN---- 181
            + YWNR+ GADHF +  HD+G  A ++ ++   +    + +L  A   + F  +     
Sbjct: 155 KYLYWNRSLGADHFMLSCHDWGPRATWYVRQ---LYYNSIRVLCNANTSEYFNPKKDASF 211

Query: 182 -HVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARA 240
             + L  G IT              +PP   R++  +F G  +            R A  
Sbjct: 212 PEINLKTGEIT---------GLTGGLPPSN-RTVLAFFAGKMHGK---------LRPALL 252

Query: 241 AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 300
             W     +     +     +Y+E M+++ +C+CP G    SPR+ EA+   C+PV+I+ 
Sbjct: 253 QHWMGKDKDVQVYETLPQGISYHEMMKKSKYCICPSGHEVASPRIAEAIYAECVPVLISQ 312

Query: 301 DIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQP 360
             + PF+D + W+   + V   ++P L  IL  IP +  LR Q  +    ++R  +   P
Sbjct: 313 HYIFPFSDVLNWDSFTIQVPVTEIPNLKNILEGIPEDQYLRMQERVR--QVQRHFVVNNP 370

Query: 361 AQPGDAFHQILNGL 374
            +  D FH I++ +
Sbjct: 371 PRRYDVFHMIIHSI 384


>gi|356546124|ref|XP_003541481.1| PREDICTED: probable glycosyltransferase At5g20260-like, partial
           [Glycine max]
          Length = 290

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 79/298 (26%), Positives = 141/298 (47%), Gaps = 28/298 (9%)

Query: 86  NPEEADWFYTPIYPTCDLTPTGLPLP-FKSPRMMRSAI---QLISSNWPYWNRTEGADHF 141
           NP+EA  F  PI     +     PL  +   ++M   I    +I+  +PYWNRT GADHF
Sbjct: 1   NPDEAHVFMLPISVAQIVRYVYNPLTTYSRDQLMWITIDYTNIIAHRYPYWNRTRGADHF 60

Query: 142 FVVPHDFG--ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQ 199
               HD+        +  K + + I+ +L  A   + F     V + E ++     + P 
Sbjct: 61  LASCHDWAPPDISRAESGKELFKNIIRVLYNANKSEGFKPEKDVPMPEVNLQGFKLSSP- 119

Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN-NPLFDISTDH 258
                 + P+  RSI  +F            GG + R  R  + +++K+ +    +    
Sbjct: 120 ---ILGLDPNN-RSILAFF-----------AGGVHGR-IREILLQHWKDKDEEVQVHEYL 163

Query: 259 P--TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG 316
           P    Y+  M ++ FCLCP G+   SPR+VE++  GC+PVI++D   LPF+D +   +  
Sbjct: 164 PKGVDYHGLMGQSKFCLCPSGYEVASPRIVESINIGCVPVIVSDYYQLPFSDVLDRSKFS 223

Query: 317 MFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           + +    + ++ T+L ++P    L+ Q+ +    ++R  +  +PA+  + FH IL+ +
Sbjct: 224 LHIPSRRIAEIKTMLKNVPHAKYLKLQKRVM--KVQRHFVLNRPAKSFNVFHMILHSI 279


>gi|357142838|ref|XP_003572711.1| PREDICTED: probable glycosyltransferase At5g20260-like
           [Brachypodium distachyon]
          Length = 484

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 85/312 (27%), Positives = 138/312 (44%), Gaps = 52/312 (16%)

Query: 86  NPEEADWFYTPIYPTCDL-------TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGA 138
           +P+EA+ F  PI   C+L         T    P +  +++   + +IS   PYWNR+ GA
Sbjct: 191 HPDEANVFLLPI-SVCNLVHYVYRLNTTAHLAPLR--KLLADYVAVISDKHPYWNRSGGA 247

Query: 139 DHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRN-----HVCLNEGSITIP 193
           DH  V  HD+ A    +    +    + +L  A + + F  R       V L +G + +P
Sbjct: 248 DHVLVSCHDW-APLVSEGSPELRDNAIRVLCNANVSEGFVPRKDATLPEVNLADGVLRLP 306

Query: 194 PYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWE-NFKNNPLF 252
               P++           R+   +F            GG      RA + +   + +P  
Sbjct: 307 TQGLPRQN----------RTTLAFF-----------AGGMLGEIRRALLEQWAGREDPEM 345

Query: 253 DISTDHP--------TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL 304
           D+    P          Y+  M RA FCLCP G+   SPR+VE+V  GC+PVII++   L
Sbjct: 346 DVHEYLPPHGGGPGYDDYHALMGRARFCLCPSGFEVASPRVVESVFAGCVPVIISEGYPL 405

Query: 305 PFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE--VILRKQRLLANPSMKRAMLFPQPAQ 362
           PF D + W ++ + V    +P+L  IL  +      +LR + L A    +R  +  +PA+
Sbjct: 406 PFGDVLDWSKMSVAVPAARIPELKAILRGVSERRYRVLRARVLQA----QRHFVLHRPAR 461

Query: 363 PGDAFHQILNGL 374
             D  H +L+ +
Sbjct: 462 RFDMIHMVLHSI 473


>gi|356546741|ref|XP_003541781.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
           max]
          Length = 410

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 92/356 (25%), Positives = 152/356 (42%), Gaps = 55/356 (15%)

Query: 39  KLKVYVY--ELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP 96
           + KV+VY    P  Y        PR LT  +A+E +  + +  S   T +P  A  F+ P
Sbjct: 87  EFKVFVYPDGDPETY-----FHTPRKLTGKYASEGYFFKNIRESRFFTDDPRRAHLFFLP 141

Query: 97  IYPTC-DLTPTGLPLPFKSPRMMRSA---IQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
           I  +C  +   GL     + RM+      ++ +   +PYWNRT GADHFFV  HD G   
Sbjct: 142 I--SCHKMRGRGLT----NERMIDEVEKYVEHLKFEYPYWNRTLGADHFFVTCHDIGV-- 193

Query: 153 HYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPR 212
                    +G+  +++ +  V    + +    ++G I       PQ       PP    
Sbjct: 194 ------KATKGVPHMMKNSIRVICSSRYD----DDGYIPHKDVTLPQVQLPFFHPPGGN- 242

Query: 213 SIFVYFRGLFYDVNNDPEGGYYARGARA-------AVWENFK----NNPLFDISTDHPTT 261
                      D+ N     ++A  + +       A+W+N       N   D+    P  
Sbjct: 243 -----------DIKNRNTLAFWAGRSDSRLKEDLIAIWDNDTEIDIQNSRVDLRATGPVV 291

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y E + ++ FCLCP G    S R+ +++ FGC+PVI++    LPF D + W +  + + E
Sbjct: 292 YMEKLYKSKFCLCPHG-PIGSSRIADSIHFGCVPVIMSKYYDLPFNDILDWTQFSIVLKE 350

Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARK 377
            DV +L   L SI  +  +     +    +++   +  P    DAFH ++  L R+
Sbjct: 351 TDVYQLKYTLRSISEKHFITLNHNIVK--IQKHFKWNTPPVRQDAFHMVMYELWRR 404


>gi|320163884|gb|EFW40783.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
          Length = 478

 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 91/399 (22%), Positives = 152/399 (38%), Gaps = 83/399 (20%)

Query: 37  VGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP 96
           +  L++Y+Y++      + L+ D       + AE      L +S   T  P++A+ F+ P
Sbjct: 103 LDSLRIYMYDIALGREMRWLVDDK------YGAEQLFINLLATSAFHTTAPDKANMFFMP 156

Query: 97  IYPTCDLTPTGLPL--PFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHY 154
              T         +     +  +      ++ + + +WN + G DHF++  HD G     
Sbjct: 157 FRCTAYRRSVQERVLGDIVAKNVTAQYFDVVMNKYRWWNVSSGTDHFYICGHDMGTAVTA 216

Query: 155 QEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPP-------------------- 194
               A+ +  +       LV T    +   +    I++PP                    
Sbjct: 217 LSHPALVKNAI------GLVNTADYDDARYIPHKDISLPPNIDVLPSAHVATEEEITADL 270

Query: 195 ----------YAPPQKMQAH-----------------------QIPPDTPRSIFVYFRGL 221
                     Y   ++  AH                        I P   R+   YF G 
Sbjct: 271 IRLEMARDRLYRATRQKVAHPDMNFEPLMERRMGKLVQYGLGGLIHPREKRTKLAYFAGP 330

Query: 222 FYDVNNDPEGGYYARGARAAVWENFKNNP---LFDISTDHPTTYYEDMQRAIFCLCPLGW 278
            +          Y R  R  V + F N+    LF+     P  YY ++  + FCL   G+
Sbjct: 331 LH----------YGR-VRPKVRDAFANDTDIVLFEGRHAQPILYYNELATSKFCLFLRGY 379

Query: 279 APWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEV 338
             WSPRL++AV  GCIPVII+D   LP    + W E  + + E  +P+L   L ++    
Sbjct: 380 RAWSPRLMDAVFMGCIPVIISDHYDLPLGQLLDWSEFSITIPEARIPRLKQTLLAVSDAQ 439

Query: 339 ILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARK 377
           + R Q  LA   + +  ++  P +P DAFH +L  L R+
Sbjct: 440 LSRMQNRLAE--VYQHFVWNDPPKPFDAFHMVLWQLWRR 476


>gi|168059257|ref|XP_001781620.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666934|gb|EDQ53576.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 460

 Score =  102 bits (253), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 102/403 (25%), Positives = 161/403 (39%), Gaps = 74/403 (18%)

Query: 34  DDPVGK----LKVYVYELPSKYNKKLLLKDPRC-----------------LTHMFAAEIF 72
           DD  G     LK+Y+YELP K+N  +L KD                    +    + E +
Sbjct: 57  DDSTGSYSCPLKIYMYELPRKFNMGMLKKDKNQEIPWTNHVAPPWKQKFEVNKQHSVEYW 116

Query: 73  MHRFLL---------SSPVRTLNPEEADWFYTPIYPTCDLTPTG---------------L 108
           +  +LL         ++ VR ++PE+AD F+ P +        G               +
Sbjct: 117 LMVYLLDGWDRKDGKTAAVRVIDPEQADVFFVPFFSALSFNSHGHGMSEGAAADKRLQIV 176

Query: 109 PLPF-----KSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERG 163
            L F      S  ++++ +  I S   +W  ++G DH  V  H   A  HY++       
Sbjct: 177 LLTFGRHVNASCHLVQAGLVDILSKSKWWQASQGRDHILVAHHP-NALRHYRDM------ 229

Query: 164 ILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAP--PQKMQAHQIPPDTPRSIFVYFRGL 221
              L Q   +V  FG+ +          + PY    P   Q +   P + R   ++F+G 
Sbjct: 230 ---LNQSIFIVADFGRYDKTVARLSKDVVAPYVHVLPSYDQDNPADPFSLRKTLLFFQGR 286

Query: 222 FYDVNNDPEGGYYARGARAAVWENFKNNPLFD--ISTDHPTTYYEDMQRAIFCLCPLGWA 279
            +      +G    R   A +  N  +    D   S +   T    M+ + FCL P G  
Sbjct: 287 IH-----RKGDGIVRTKLAELLANNSDVHYVDSLASAEAIATSTAGMRTSRFCLHPAGDT 341

Query: 280 PWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPP 336
           P S RL +A+V  C+PVII+D I LPF D + +++  +F + E+  K   L   L SI  
Sbjct: 342 PSSCRLFDAIVSHCVPVIISDRIELPFEDDLNYKDFSIFFSSEESVKPGHLLRTLRSITR 401

Query: 337 EVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
           E  LR    L   S      +  P +  DA + I   +  K+P
Sbjct: 402 ERWLRMWNALKTVS--HHFEYQHPPKKDDAVNMIFKQVQHKVP 442


>gi|356511373|ref|XP_003524401.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
           max]
          Length = 643

 Score =  102 bits (253), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 94/347 (27%), Positives = 152/347 (43%), Gaps = 40/347 (11%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTL-NPEEADWFYTPIY 98
           LKVY+Y    K   K +   P  +  ++A+E +  + +  +    L +P +A  FY P  
Sbjct: 314 LKVYIY----KDGNKPIFHQP-IMKGLYASEGWFMKLMEENKHFVLKDPAKAHLFYMPF- 367

Query: 99  PTCDLTPTGLPLPFKSPR-----MMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFH 153
            +  +    L +     R      ++     IS+ + Y+NRT GADHF V  HD+     
Sbjct: 368 -SSRMLEHALYVRNSHNRTNLRQFLKDYTDKISAKYRYFNRTGGADHFLVACHDWAP--- 423

Query: 154 YQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRS 213
           Y+    +E  I  L   A + Q F     V L E  +       PQ+    + P   P  
Sbjct: 424 YETRHHMEYCIKALCN-ADVTQGFKIGRDVSLPEAYVR--SVRDPQRDLGGKPPHQRP-- 478

Query: 214 IFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP------TTYYEDMQ 267
           I  ++ G  +          Y R      W++   +P   I    P        Y   M+
Sbjct: 479 ILAFYAGNMHG---------YLRPILLKHWKD--KDPDMKIYGPMPHGAASKMNYINHMK 527

Query: 268 RAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKL 327
            + +C+CP G+   SPR+VEA+ + C+PVII+D+ V PF + + W+   + +AE+D+P L
Sbjct: 528 NSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEVLNWDAFSIILAEKDIPNL 587

Query: 328 DTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
             IL S+  E  L+ Q L    + K      +P +  D FH  L+ +
Sbjct: 588 KQILLSVSQEKYLKLQ-LGVRKAQKHFFWHVKPLKY-DLFHMTLHSI 632


>gi|302823230|ref|XP_002993269.1| glycosyltransferase-like protein [Selaginella moellendorffii]
 gi|300138939|gb|EFJ05690.1| glycosyltransferase-like protein [Selaginella moellendorffii]
          Length = 353

 Score =  101 bits (252), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 98/359 (27%), Positives = 152/359 (42%), Gaps = 58/359 (16%)

Query: 29  GDVLEDD---PVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTL 85
            DV ++D      +LKV+VY  P  Y K         L   +A+E +  R L+ S   T 
Sbjct: 31  ADVFQEDYEQMEQQLKVFVYPDPVVYTK---------LAGKYASEGYFFRNLMESRFVTT 81

Query: 86  NPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVP 145
           +PE+A  F+ PI     L   GL     S  +  S ++ + + +PYWNRT GADHFFV  
Sbjct: 82  DPEKAQLFFVPI-SCARLREEGLDHDEISDNVA-SFVESVIAKFPYWNRTMGADHFFVTC 139

Query: 146 HDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQ 205
           H+ G                    RAT       +N + +   S    P+ P + +   Q
Sbjct: 140 HEIGT-------------------RATSKVAHLVKNSIRVVCASSYSGPFIPHKDVALPQ 180

Query: 206 I--PPDTPRSIFVYFRGLFYDVNNDPEGGYYARGA----RAAVWENFKNNPLFDISTDHP 259
           I  P  +PR           D       G++A  A    R  + + ++ +    IST H 
Sbjct: 181 ILQPFPSPRG--------GDDTEKRETLGFWAGPANSKTRILLTKTWQEDSDMVISTKHV 232

Query: 260 TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIP----VIIADDIVLPFADAIPWEEI 315
               +   R+ FC+CP G    + R+VE++ FGC+P    +I++D   LPF D + W + 
Sbjct: 233 GM--QQFYRSKFCICPSGTRVSTARIVESIHFGCVPGKSFLILSDHYDLPFNDVLDWRKF 290

Query: 316 GMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
            + + E+D   L   L  + P   L +  L A    +    +  P    D FH ++  L
Sbjct: 291 AVILPEQDAGTLKDAL-ELAPYATLHRNLLQAQAHFE----WHSPPIKYDTFHMVMYEL 344


>gi|224284379|gb|ACN39924.1| unknown [Picea sitchensis]
          Length = 787

 Score =  101 bits (252), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 92/353 (26%), Positives = 148/353 (41%), Gaps = 63/353 (17%)

Query: 42  VYVYELPSKYNKKLL------------LKDPRCLT----HMFAAEIFMHRFLLSSPVRTL 85
           +Y+Y+LP ++N  LL            + D R  +     ++ +++ ++  LL+SP RTL
Sbjct: 340 IYIYDLPPEFNIHLLEGRHFRFQCVNRIYDDRNKSLWTDQLYGSQMALYESLLASPYRTL 399

Query: 86  NPEEADWFYTPIYPTCDLTPTGLP--LPFKSPRMMRSAIQL---------ISSNWPYWNR 134
           N EEAD+FY P+   C +T       L  K+   +RS + L         I  ++ YWNR
Sbjct: 400 NGEEADYFYVPVLDACLITRADDAPHLSMKNHMGLRSYLTLDFYKKAYDHIMEHYTYWNR 459

Query: 135 TEGADHFFVVPHDFGACFHYQE--------------EKAIERGILPLLQRATLVQTFGQR 180
           + G DH +    D GAC+  +E               K        L      +    + 
Sbjct: 460 SSGHDHIWFFAWDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYLADNWDHIPIERRG 519

Query: 181 NHVCLN-EGSITIPPYAPPQKMQAHQ---IPPDTPRSIFVYFRGLFYDV--NNDPEGGYY 234
            H C + E  + +P +  P                R    YF G       NN PE   Y
Sbjct: 520 RHPCFDPEKDLVLPAWKRPDPYNVKARFWARSRRERFTLFYFNGNLGASFKNNRPEPT-Y 578

Query: 235 ARGARAAVWENFKNNP-------------LFDISTDHPTTYYEDMQRAIFCLCPLGWAPW 281
           + G R  +   F + P             +  +S   P  YY ++  ++FC    G   W
Sbjct: 579 SLGIRQKLAAEFASEPNKEGKFGRQSTKDVIVVSQKSP-NYYSELGSSLFCGVFPG-DGW 636

Query: 282 SPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSI 334
           S R+ ++V+ GCIPVII D I + + + + ++   + +AE+D+P L  IL  I
Sbjct: 637 SGRMEDSVLQGCIPVIIQDGIQVAYENVLNYDSFAVRIAEDDIPHLVQILRGI 689


>gi|302845214|ref|XP_002954146.1| hypothetical protein VOLCADRAFT_118607 [Volvox carteri f.
           nagariensis]
 gi|300260645|gb|EFJ44863.1| hypothetical protein VOLCADRAFT_118607 [Volvox carteri f.
           nagariensis]
          Length = 401

 Score =  101 bits (252), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 91/363 (25%), Positives = 155/363 (42%), Gaps = 62/363 (17%)

Query: 77  LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTE 136
           LLSS VRT + + AD+F+ P+          +    +S   + + +  I  +WP+W R  
Sbjct: 4   LLSSGVRTADGDAADFFFIPLV---------MRTKGQSANHLTAVVSYIQQHWPWWGRYG 54

Query: 137 GAD-HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGS------ 189
           G   H  VVP D G        + +   +L L++  T +  +G  +H   +EG+      
Sbjct: 55  GGHRHLLVVPADLG-------RRMLPEELLKLVENVTFLTHWG--SHTNHSEGAWVESHR 105

Query: 190 ----ITIPP---------YAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYAR 236
               I +PP         ++P   + + +      R+  ++F G      ++P  G    
Sbjct: 106 PGKDIVVPPLHNADEPIVFSPLHTLHSKR---RRQRTSGLFFSGRICSDGSEPHRGRCRT 162

Query: 237 GA----RAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFG 292
            +    R  V ++  N   + ++T     Y   +    FCL P G   +  R V+A V G
Sbjct: 163 NSQGNVRHKVLKHHWNRTTWTLTT-RAKAYASALSSHTFCLSP-GGGGYGRRSVQAAVMG 220

Query: 293 CIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVI--LRKQRLLANPS 350
           C+PV+I D +  PF   + W +  M V E+D+P L TIL S+    I  +++Q   A   
Sbjct: 221 CVPVLIGDGLHQPFEPELDWSQFSMSVPEQDIPHLHTILESMNSSTIAAMQEQLRCAAQH 280

Query: 351 MKRAMLFPQP-AQPG--DAFHQILNGLARKLPH---DKSTYLKPGGKFLNW-------TS 397
           +  +  F +   + G  DAF  ++  L  +  H   D S Y +   +F ++       T 
Sbjct: 281 LYYSTTFGEVMGEDGRYDAFETLMEVLRMRRDHPSLDPSEYARTDSRFADFINCKLEPTG 340

Query: 398 GPV 400
           GPV
Sbjct: 341 GPV 343


>gi|159470093|ref|XP_001693194.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158277452|gb|EDP03220.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 673

 Score =  101 bits (251), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 93/347 (26%), Positives = 146/347 (42%), Gaps = 70/347 (20%)

Query: 42  VYVYELPSKYNKKLL---LKDPRCLTH---------------MFAAEIFMHRFLLSSPVR 83
           +YVY+L   Y  ++L   +  P C+                 ++AA+  +H  LL SP R
Sbjct: 254 IYVYDLEPLYQARILQYRVSPPWCVFRRHDLPANRTEWSDMWVYAADTLLHELLLVSPHR 313

Query: 84  TLNPEEADWFYTPIYPTCDLTPTGLPLP---------FKSP---------RMMRSAIQLI 125
           T +PEEAD+FY P   +C      LP P         FK P          M+      I
Sbjct: 314 TFDPEEADFFYVPHQASC------LPFPIGNWADWPWFKGPGGPRIRQMLNMIMETRDWI 367

Query: 126 SSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCL 185
             ++P+W R  G DH +   HD GAC+             P +   ++  T   R     
Sbjct: 368 DQHYPFWKRRGGRDHIWTFTHDEGACW------------APNVLNTSIWLTHWGRMDPDH 415

Query: 186 NEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGL-FYDVNNDPEGGYYARGARAAVWE 244
              +  +P         A+Q     P    V+ +G   Y    D     + R       +
Sbjct: 416 TSNTAFVPDRYDRDFKSAYQ-----PEGYRVHMQGHPCYRPGQDLVIPAFKRP------D 464

Query: 245 NFKNNPLFDIST---DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 301
           +++ +PL   ++   + P  Y + + R++FCL   G   WS RL +AV+ GCIPVII D+
Sbjct: 465 HYRASPLAAATSKPRELPGDYSDMLSRSLFCLVAAG-DGWSARLEDAVLHGCIPVIIIDN 523

Query: 302 IVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLAN 348
           + + F   +  +   + +AE DV ++  IL +IP   I  KQ  L +
Sbjct: 524 VHVVFESILDIDSFSVRIAEADVDRILEILQAIPERKIRFKQAHLGH 570


>gi|413926326|gb|AFW66258.1| hypothetical protein ZEAMMB73_148778 [Zea mays]
          Length = 267

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 76/274 (27%), Positives = 126/274 (45%), Gaps = 49/274 (17%)

Query: 117 MMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQT 176
           +++  ++ + + +PYWNRT GADHFFV  HD G        +A E G+  +++ +  V  
Sbjct: 18  IVKDYVEGLINKYPYWNRTLGADHFFVTCHDVGV-------RAFE-GLKFMVKNSIRV-- 67

Query: 177 FGQRNHVCLNEGSITIPPY---APPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGY 233
                 VC    ++   P+   A PQ +Q   +                 D++N    G+
Sbjct: 68  ------VCSPSYNVDFIPHKDIALPQVLQPFALHEGGN------------DIDNRVILGF 109

Query: 234 YA-------RGARAAVWENFKNNPLFDISTDHPT------TYYEDMQRAIFCLCPLGWAP 280
           +A       R   A VWEN   +    IS +  +       Y +   R  FC+CP G   
Sbjct: 110 WAGHRNSKIRVILARVWEN---DTELAISNNRISRAIGELVYQKQFYRTKFCICPGGSQV 166

Query: 281 WSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVIL 340
            S R+ +++ +GC+PVI++D   LPF DA+ W +  + + E DV +L  IL SI  E  +
Sbjct: 167 NSARISDSIHYGCVPVILSDYYDLPFNDALDWRKFAVILRERDVYQLKNILKSISQEEFI 226

Query: 341 RKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
              + L    +++  ++  P    DAFH ++  L
Sbjct: 227 SLHKSLV--QVQKHFVWHSPPVSYDAFHMVMYEL 258


>gi|357125540|ref|XP_003564451.1| PREDICTED: probable glucuronosyltransferase GUT1-like [Brachypodium
           distachyon]
          Length = 495

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 98/360 (27%), Positives = 156/360 (43%), Gaps = 58/360 (16%)

Query: 40  LKVYVYELPSKYNKKLL--LKDP-RCLTHMFAAEIFMHRF-------------------- 76
           L+VYVYE+P K+   LL   +D  R  T++ +    +HR                     
Sbjct: 109 LRVYVYEMPRKFTYDLLRLFRDSYRDTTNLTSNGSPVHRLIEQHSIDYWLWADLIAPESQ 168

Query: 77  -LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRT 135
            LL + +R    EEAD FY P + T       L    K   + R A++ ++   P W R+
Sbjct: 169 RLLKNVIRVQQQEEADIFYVPFFTTISYF---LLEKQKCKALYREALKWVTDQ-PAWQRS 224

Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
           EG DH   V H +     ++  +   +  + LL        + +   V L +    I PY
Sbjct: 225 EGRDHIIPVHHPWS----FKSVRRFVKKAIWLLPDMDSTGNWYKPGQVYLEKD--VILPY 278

Query: 196 APPQKMQAHQIPPDT--PRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFD 253
            P   +  ++   +T   RS+ ++FRG          GG      R+ +    K+    D
Sbjct: 279 VPNVDLCDYKCASETQSKRSMLLFFRGRL----KRNAGG----KVRSKLVTELKDAE--D 328

Query: 254 ISTDHPTTYYED-------MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 306
           +  +  T   E        M++++FCL P G  P S RL +A+V GCIPVII+D++ LPF
Sbjct: 329 VVIEEGTAGAEGKVAAQNGMRKSLFCLNPAGDTPSSARLFDAIVSGCIPVIISDELELPF 388

Query: 307 ADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP 363
              + + +I +FV+  D  +   L   L  I  + +   Q  L   S  R  ++ +PAQP
Sbjct: 389 EGILDYRKIALFVSSSDALQPGWLVKYLRGIDAKRVREMQSNLVKYS--RHFIYSKPAQP 446


>gi|302790752|ref|XP_002977143.1| glycosyltransferase-like protein [Selaginella moellendorffii]
 gi|300155119|gb|EFJ21752.1| glycosyltransferase-like protein [Selaginella moellendorffii]
          Length = 342

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 92/347 (26%), Positives = 147/347 (42%), Gaps = 65/347 (18%)

Query: 58  KDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKS--- 114
           + PR +T  +A+E +  + L  S   T NP +A  F+ PI  +C      +P    S   
Sbjct: 20  QTPRKITGKYASEGYFFQNLRESKFVTKNPNKAHLFFIPI--SCHKMRGKVPYYLTSNWN 77

Query: 115 ----------PRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGI 164
                       +++  ++ +   +PYWNRT GADHFFV  HD GA       +A  + +
Sbjct: 78  KMQGISYEKMADIVQEYVEGLIVKYPYWNRTLGADHFFVTCHDVGA-------RATNK-V 129

Query: 165 LPLLQRATLVQTFGQRNHVC---LNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGL 221
             L++ +  V        VC    N   I     A PQ +Q   +P              
Sbjct: 130 ANLVKNSIRV--------VCSPSYNGDFIPHKDIAMPQVLQPFALPRGGN---------- 171

Query: 222 FYDVNNDPEGGYYA-------RGARAAVWEN-----FKNNPLFDISTDHPTTYYEDMQRA 269
             DV N    G++A       R   A +WE        NN +   + +    Y +   R+
Sbjct: 172 --DVRNRTILGFWAGHRNSKIRVVLAKLWEEDDVLAISNNRISRATGE--LVYQKQFYRS 227

Query: 270 IFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDT 329
            FC+CP G    S R+V+++ +GC+PVI++D   LPF D + W+   + + E DV  L  
Sbjct: 228 KFCICPGGSQVNSARIVDSIHYGCVPVILSDHYDLPFNDVLDWKRFALLLRERDVGDLKL 287

Query: 330 ILTSIPPEVILRKQRLLAN--PSMKRAMLFPQPAQPGDAFHQILNGL 374
              S         + L+A+    ++    +  P +P DAFH ++  L
Sbjct: 288 FFFSFFLFS---SRVLIASLFRQVQDRFEWHTPPRPYDAFHMVVYEL 331


>gi|159489064|ref|XP_001702517.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158280539|gb|EDP06296.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 489

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 85/335 (25%), Positives = 139/335 (41%), Gaps = 63/335 (18%)

Query: 40  LKVYVYELPSKYNK-KLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
           +K+YVYELP   N    + +  R L      ++ + R +LSS VRT + + AD+++ P+ 
Sbjct: 119 VKIYVYELPPVANTWTYIARIDRPL-----VQVLLQR-MLSSGVRTTDGDSADYYFIPLL 172

Query: 99  PTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGAD-HFFVVPHDFGACFHYQEE 157
                    +     +   + + +  +  +WP+W+RT G   H  V P D G        
Sbjct: 173 ---------MRTRTHTVNHLAAVVHYVRKHWPWWDRTGGGHRHLLVAPGDIG-------R 216

Query: 158 KAIERGILPLLQRATLVQTFG-QRNHVCLN-------EGSITIPPYAPPQK------MQA 203
           + +   +L + +  T +  +G  RNH   N          I +PP  PP +      + A
Sbjct: 217 RILTPELLHMTENCTFLTHWGLHRNHSGGNWLASHRPGKDIVVPPLTPPDEPIVYSPLHA 276

Query: 204 HQIPPDTPRSIFVYFRGLFYDVNNDPEGGY-------YARGARAAVWENFKNNPLFDIST 256
                   R   ++F G     N  P  G        Y+ G R  V  +  N P + I+T
Sbjct: 277 TLKKNRKSRLGELFFAGRICGDNQKPTDGKCSEKRQDYSAGTRQQVAHHHWNRPNWTITT 336

Query: 257 DHPTTYYEDMQRAIFCLCPLGWA-----------------PWSPRLVEAVVFGCIPVIIA 299
            H   Y E +   IFCL P G A                  +  R V++++ GCIPV + 
Sbjct: 337 -HTPAYAEALSTHIFCLSPTGTARLSHARLDLSTLQRPGGGYGRRSVQSLLMGCIPVTVT 395

Query: 300 DDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSI 334
           D +  PF   + W    + V E+D+ +L  +LT +
Sbjct: 396 DHVHQPFEPEVDWARFSVPVREDDIAQLHHVLTGL 430


>gi|449520748|ref|XP_004167395.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
           IRX7-like [Cucumis sativus]
          Length = 517

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 101/367 (27%), Positives = 156/367 (42%), Gaps = 63/367 (17%)

Query: 36  PVGK-LKVYVYELPSKYNKKLL---LKDPRCLTHMFAAEIFMHRF--------------- 76
           P+G  ++VYVYE+P K+   LL       R  +++ +    +HR                
Sbjct: 123 PLGSPIRVYVYEMPWKFTYDLLWTFRNTYRETSNLTSNGSPVHRLIEQHSIDYWLWADLI 182

Query: 77  ------LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWP 130
                 LL   VR    EEAD FY P + T       L    +   + R A++ ++   P
Sbjct: 183 APESERLLKGVVRVYRQEEADLFYIPFFTTISFF---LLEKQQCKALYREALKWVTDQ-P 238

Query: 131 YWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSI 190
            W R+EG DH   V H +     ++  +   +  + LL        + +   V L +  I
Sbjct: 239 AWKRSEGRDHILPVHHPWS----FKTVRKFMKNAIWLLPDMDSTGNWYKPGQVFLEKDLI 294

Query: 191 TIPPYAP-----PQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYAR------GAR 239
              PY P      +K  ++Q    + RSI ++FRG    +  +  G   A+      GA 
Sbjct: 295 L--PYVPNVELCDRKCLSYQ---QSKRSILLFFRG---RLKRNAGGKIRAKLGGELSGAD 346

Query: 240 AAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIA 299
             + E                     M+++IFCL P G  P S RL +A+V GCIPVI++
Sbjct: 347 DVLIEEGTAG------EGGKAAAQTGMRKSIFCLSPAGDTPSSARLFDAIVSGCIPVIVS 400

Query: 300 DDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAML 356
           D++ LPF   + + +I +FV+  D  K   L T L S     I R Q+ LA   + R  +
Sbjct: 401 DELELPFEGILDYRKIALFVSSSDALKSGWLLTYLRSFSAADIRRLQQNLAK--LSRHFI 458

Query: 357 FPQPAQP 363
           +  PAQP
Sbjct: 459 YSSPAQP 465


>gi|297833558|ref|XP_002884661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297330501|gb|EFH60920.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 470

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 98/360 (27%), Positives = 157/360 (43%), Gaps = 67/360 (18%)

Query: 40  LKVYVYEL--PSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPI 97
            K+YVYE   P  ++  L  KD   +  +F    FM   +L    RT +P++A  ++ P 
Sbjct: 143 FKIYVYEEGDPPIFHYGLC-KDIYSMEGLFLN--FMENDVLK--YRTRDPDKAHVYFLPF 197

Query: 98  YPTCDLTPTGLPLPFKSPRMMRSAI----QLISSNWPYWNRTEGADHFFVVPHDFGACFH 153
                L     P+  +   ++ S I    Q+IS  +PYWN ++G DHF +  HD+G    
Sbjct: 198 SVVMILHHLFDPV-VRDKAVLESVIADYVQIISKKYPYWNTSDGFDHFMLSCHDWG---- 252

Query: 154 YQEEKAIERGILPLLQRAT--LVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTP 211
                           RAT  + + F     V  N     I  Y  P+K        D P
Sbjct: 253 ---------------HRATWYVKKLFFNSIRVLCNAN---ISEYFNPEK--------DAP 286

Query: 212 RSIFVYFRGLFYDVNNDPEG---------GYYARGARAAV-------WENFKNNPLFDIS 255
              F     L  ++NN   G          ++A  +   +       W+    + L   +
Sbjct: 287 ---FPEINLLTGEINNLTGGLDPISRKTLAFFAGKSHGKIRPVLLNHWKEKDKDILVYEN 343

Query: 256 TDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEI 315
                 Y E M++  FC+CP G    SPR+ EA+  GC+PV+I+++ VLPF+D + WE+ 
Sbjct: 344 LPEDLNYTEMMRKTRFCICPSGHEVASPRIPEAIYSGCVPVLISENYVLPFSDVLNWEKF 403

Query: 316 GMFVAEEDVPKLDTILTSIPPEVILRKQRLLAN-PSMKRAMLFPQPAQPGDAFHQILNGL 374
            + V+ +++P+L  IL  IP E   R  RL      +K  +L   P +  D F+ I++ +
Sbjct: 404 SVSVSVKEIPELKRILMDIPEE---RYMRLYEGVKKVKMHILVNDPPKRYDVFNMIIHSI 460


>gi|297807715|ref|XP_002871741.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297317578|gb|EFH48000.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 510

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 96/360 (26%), Positives = 153/360 (42%), Gaps = 49/360 (13%)

Query: 36  PVG-KLKVYVYELPSKYNKKLL----------------------LKDPRCLTHMFAAEIF 72
           PVG  +KVYVYE+P K+   LL                      L +   + +   A++ 
Sbjct: 112 PVGFPIKVYVYEMPKKFTFDLLWLFHNTYKETSNATSNGSPVHRLIEQHSIDYWLWADLI 171

Query: 73  M--HRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWP 130
                  L S VR    ++AD+FY P + T       L    +   + R A++ ++   P
Sbjct: 172 SPESERRLKSVVRVHKQQDADFFYVPFFTTISFF---LLEKQQCKALYREALKWVTDQ-P 227

Query: 131 YWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSI 190
            W R+EG DH F + H +     ++  +   +  + LL        + +   V L +  I
Sbjct: 228 AWKRSEGRDHIFPIHHPWS----FKSVRKFVKNAIWLLPDMDSTGNWYKPGQVSLEKDLI 283

Query: 191 TIPPYAPPQKM-QAHQIPPDTP-RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN 248
              PY P   +  A  +    P R+  ++FRG           G   R    A     K 
Sbjct: 284 L--PYVPNVDICDAKCLSESAPMRTTLLFFRGRL-----KRNAGGKIRAKLGAELSGVKG 336

Query: 249 NPLFDISTDHPTTYYED--MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 306
             + + +            M+R++FCLCP G  P S RL +A+V GCIPVI++D++  PF
Sbjct: 337 VIISEGTAGEGGKLAAQGGMRRSLFCLCPAGDTPSSARLFDAIVSGCIPVIVSDELEFPF 396

Query: 307 ADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP 363
              + ++++ + V+  DV +   L   L S+ P  I   Q+ LA  S  R  L+  PAQP
Sbjct: 397 EGILDYKKVAVLVSSNDVVQPGWLVNHLRSLTPFQIKELQKNLAQYS--RHFLYSSPAQP 454


>gi|224053619|ref|XP_002297901.1| predicted protein [Populus trichocarpa]
 gi|222845159|gb|EEE82706.1| predicted protein [Populus trichocarpa]
          Length = 457

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 82/289 (28%), Positives = 127/289 (43%), Gaps = 37/289 (12%)

Query: 65  HMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQL 124
           HMF         LL SP+ T  PE+A +F+ P           +    K  + +      
Sbjct: 152 HMFKVS------LLHSPLLTATPEKAHFFFLPFSINDLRNDPRVHSEAKISQFVAQYTSS 205

Query: 125 ISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVC 184
           ISS++ +WN + GADHF+V  H  G     +E  +   G+     + T   ++ QR ++ 
Sbjct: 206 ISSSFRFWNASGGADHFYVCCHSVG-----REAPSRHHGLRNNAIQLTCCSSYFQRFYLS 260

Query: 185 LNEGSITIPPYAPPQKMQAHQIPPDTP-----------RSIFVYFRGLFYDVNNDPEGGY 233
             +  +   P   P+  Q    PP              R   VYF G    V N      
Sbjct: 261 HKDVGL---PQVWPRTDQTALNPPHASVCYLDVNTYRCRHRLVYFAG---RVQNSQ---- 310

Query: 234 YARGARAAVWENFKNNPLFDISTDHPTTYYED-MQRAIFCLCPLGWAPWSPRLVEAVVFG 292
             R     +W    N+  FDI   +PT  YE+  +R+ FCL   G+   + R+ +A+ +G
Sbjct: 311 -VRQQLVNLW---GNDTQFDIFNGNPTFPYEEGFKRSKFCLHVKGYEVNTARVSDAIHYG 366

Query: 293 CIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILR 341
           CIPVII++   LPFA+ + W +  + + + D+  L T L SI  E+ LR
Sbjct: 367 CIPVIISNYYDLPFANVLDWSKFSVVINQRDIAFLKTKLLSIKREMYLR 415


>gi|42566010|ref|NP_191322.3| exostosin family protein [Arabidopsis thaliana]
 gi|44917463|gb|AAS49056.1| At3g57630 [Arabidopsis thaliana]
 gi|46931284|gb|AAT06446.1| At3g57630 [Arabidopsis thaliana]
 gi|332646159|gb|AEE79680.1| exostosin family protein [Arabidopsis thaliana]
          Length = 793

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 92/367 (25%), Positives = 156/367 (42%), Gaps = 68/367 (18%)

Query: 29  GDVLEDDPVGKLK---VYVYELPSKYNKKLL------------LKDPRCLT----HMFAA 69
           G+++    V K K   +Y+Y+LP  +N  L+            + D R  T    +++ +
Sbjct: 334 GNLINLSAVVKKKRPLIYIYDLPPDFNSLLIEGRHFKFECVNRIYDERNATVWTDYLYGS 393

Query: 70  EIFMHRFLLSSPVRTLNPEEADWFYTPIYPTC-----DLTP-------TGLPLPFKSPRM 117
           ++  +  +L++  RT+N EEAD+F+ P+  +C     D  P       TGL     +   
Sbjct: 394 QMAFYENILATAHRTMNGEEADFFFVPVLDSCIINRADDAPHINMQNHTGLRSSL-TLEF 452

Query: 118 MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFH----YQEEKAIERGILPLLQRATL 173
            + A + I   +PYWNR+ G DH +    D GAC+     +     +  G        + 
Sbjct: 453 YKRAYEHIVEKYPYWNRSAGRDHIWFFSWDEGACYAPKEIWNSMMLVHWGNTNSKHNHST 512

Query: 174 VQTFGQR----------NHVCLN-EGSITIPPYAPP---QKMQAHQIPPDTPRSIFVYFR 219
              FG            +H C +    + IP +  P      + +   P   R    YF 
Sbjct: 513 TAYFGDNWDDISDERRGDHPCFDPRKDLVIPAWKVPDPYSMRKNYWERPREKRKTLFYFN 572

Query: 220 GLF---YDVNNDPEGGYYARGARAAVWENFKNNPLFD--ISTDH----------PTTYYE 264
           G     Y+    PE  Y + G R  + E F ++P  +  +   H             Y++
Sbjct: 573 GNLGPAYE-KGRPEDSY-SMGIRQKLAEEFGSSPNKEGKLGKQHAEDVIVTPLRSDNYHK 630

Query: 265 DMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDV 324
           D+  +IFC    G   WS R+ ++++ GC+PVII D I LP+ + + +E   + V E+D+
Sbjct: 631 DIANSIFCGAFPG-DGWSGRMEDSILQGCVPVIIQDGIYLPYENMLNYESFAVRVNEDDI 689

Query: 325 PKLDTIL 331
           P L   L
Sbjct: 690 PNLINTL 696


>gi|42572713|ref|NP_974452.1| exostosin family protein [Arabidopsis thaliana]
 gi|110740929|dbj|BAE98560.1| hypothetical protein [Arabidopsis thaliana]
 gi|332646160|gb|AEE79681.1| exostosin family protein [Arabidopsis thaliana]
          Length = 791

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 92/367 (25%), Positives = 156/367 (42%), Gaps = 68/367 (18%)

Query: 29  GDVLEDDPVGKLK---VYVYELPSKYNKKLL------------LKDPRCLT----HMFAA 69
           G+++    V K K   +Y+Y+LP  +N  L+            + D R  T    +++ +
Sbjct: 332 GNLINLSAVVKKKRPLIYIYDLPPDFNSLLIEGRHFKFECVNRIYDERNATVWTDYLYGS 391

Query: 70  EIFMHRFLLSSPVRTLNPEEADWFYTPIYPTC-----DLTP-------TGLPLPFKSPRM 117
           ++  +  +L++  RT+N EEAD+F+ P+  +C     D  P       TGL     +   
Sbjct: 392 QMAFYENILATAHRTMNGEEADFFFVPVLDSCIINRADDAPHINMQNHTGLRSSL-TLEF 450

Query: 118 MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFH----YQEEKAIERGILPLLQRATL 173
            + A + I   +PYWNR+ G DH +    D GAC+     +     +  G        + 
Sbjct: 451 YKRAYEHIVEKYPYWNRSAGRDHIWFFSWDEGACYAPKEIWNSMMLVHWGNTNSKHNHST 510

Query: 174 VQTFGQR----------NHVCLN-EGSITIPPYAPP---QKMQAHQIPPDTPRSIFVYFR 219
              FG            +H C +    + IP +  P      + +   P   R    YF 
Sbjct: 511 TAYFGDNWDDISDERRGDHPCFDPRKDLVIPAWKVPDPYSMRKNYWERPREKRKTLFYFN 570

Query: 220 GLF---YDVNNDPEGGYYARGARAAVWENFKNNPLFD--ISTDH----------PTTYYE 264
           G     Y+    PE  Y + G R  + E F ++P  +  +   H             Y++
Sbjct: 571 GNLGPAYE-KGRPEDSY-SMGIRQKLAEEFGSSPNKEGKLGKQHAEDVIVTPLRSDNYHK 628

Query: 265 DMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDV 324
           D+  +IFC    G   WS R+ ++++ GC+PVII D I LP+ + + +E   + V E+D+
Sbjct: 629 DIANSIFCGAFPG-DGWSGRMEDSILQGCVPVIIQDGIYLPYENMLNYESFAVRVNEDDI 687

Query: 325 PKLDTIL 331
           P L   L
Sbjct: 688 PNLINTL 694


>gi|296085067|emb|CBI28482.3| unnamed protein product [Vitis vinifera]
          Length = 634

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 78/306 (25%), Positives = 141/306 (46%), Gaps = 42/306 (13%)

Query: 83  RTLNPEEADWFYTPIYPTCDLTPTGLPL---PFKSPRMMRSAIQLISSNWPYWNRTEGAD 139
           RT +P+EA  ++ P      +     P+    +    ++   +++IS  + YWNR+ GAD
Sbjct: 348 RTQDPDEAHVYFLPFSVVMIIHHLFDPIVRDKYVMKHVVSDYVKVISQKYRYWNRSLGAD 407

Query: 140 HFFVVPHDFG--ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNE------GSIT 191
           HF +  HD+G  A ++  +   +    + LL  A   + F  R    + E       +I 
Sbjct: 408 HFMLSCHDWGPRATWYVPQ---LYYNSIRLLCNANTSECFNPRKDASIPEINLIDGETIG 464

Query: 192 IPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFK-NNP 250
           +    PP K           R+I  +F            GG + R  R A+ +++K  + 
Sbjct: 465 LTGGLPPSK-----------RTILAFF-----------AGGLHGR-IRPALLQHWKEKDE 501

Query: 251 LFDISTDHPT--TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
              +    P   +Y + M+++ +C+CP G    SPR+VEA+   C+PV+I+   VLPF+D
Sbjct: 502 QVQVYETLPEGLSYPDLMKKSKYCICPSGHEVASPRIVEAIYAECVPVLISQHYVLPFSD 561

Query: 309 AIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFH 368
            + W    + V+  ++P L  IL  IP +  +R Q  +    +++  +   P +  D FH
Sbjct: 562 VLDWGSFSIQVSVNEIPNLKKILLGIPQDRYIRMQERVKQ--VQQHFVVNNPPKRFDVFH 619

Query: 369 QILNGL 374
            I++ +
Sbjct: 620 MIIHSI 625


>gi|359485860|ref|XP_002264111.2| PREDICTED: probable glycosyltransferase At3g07620-like [Vitis
           vinifera]
          Length = 410

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 78/304 (25%), Positives = 138/304 (45%), Gaps = 38/304 (12%)

Query: 83  RTLNPEEADWFYTPIYPTCDLTPTGLPL---PFKSPRMMRSAIQLISSNWPYWNRTEGAD 139
           RT +P+EA  ++ P      +     P+    +    ++   +++IS  + YWNR+ GAD
Sbjct: 124 RTQDPDEAHVYFLPFSVVMIIHHLFDPIVRDKYVMKHVVSDYVKVISQKYRYWNRSLGAD 183

Query: 140 HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNE------GSITIP 193
           HF +  HD+G    +   +     I  LL  A   + F  R    + E       +I + 
Sbjct: 184 HFMLSCHDWGPRATWYVPQLYYNSI-RLLCNANTSECFNPRKDASIPEINLIDGETIGLT 242

Query: 194 PYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFK-NNPLF 252
              PP K           R+I  +F            GG + R  R A+ +++K  +   
Sbjct: 243 GGLPPSK-----------RTILAFF-----------AGGLHGR-IRPALLQHWKEKDEQV 279

Query: 253 DISTDHPT--TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAI 310
            +    P   +Y + M+++ +C+CP G    SPR+VEA+   C+PV+I+   VLPF+D +
Sbjct: 280 QVYETLPEGLSYPDLMKKSKYCICPSGHEVASPRIVEAIYAECVPVLISQHYVLPFSDVL 339

Query: 311 PWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQI 370
            W    + V+  ++P L  IL  IP +  +R Q  +    +++  +   P +  D FH I
Sbjct: 340 DWGSFSIQVSVNEIPNLKKILLGIPQDRYIRMQERVKQ--VQQHFVVNNPPKRFDVFHMI 397

Query: 371 LNGL 374
           ++ +
Sbjct: 398 IHSI 401


>gi|223943427|gb|ACN25797.1| unknown [Zea mays]
          Length = 380

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 90/302 (29%), Positives = 134/302 (44%), Gaps = 40/302 (13%)

Query: 77  LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTE 136
           LL S +R    EEAD FY P + T       L    +   + R A++ ++   P W R+E
Sbjct: 55  LLKSVIRVQQQEEADIFYVPFFTTISYF---LLEKQECKALYREALKWVTDQ-PAWQRSE 110

Query: 137 GADHFFVVPHDFGACFHYQEEKAIERGI---LPLLQRATLVQTFGQRNHVCLNEGSITIP 193
           G DH   V H +         K++ R +   + LL        + +   V L +    I 
Sbjct: 111 GRDHVIPVHHPWSF-------KSVRRSVKKAIWLLPDMDSTGNWYKPGQVYLEKD--VIL 161

Query: 194 PYAPPQKMQAHQ--IPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPL 251
           PY P   +  H+  +   + RSI ++FRG          GG      R+ + E  K+   
Sbjct: 162 PYVPNVDLCDHKCVLETQSKRSILLFFRGRL----KRNAGG----KIRSKLVEELKSAK- 212

Query: 252 FDISTDHPTT-------YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL 304
            DI  +  +T         + M+++ FCL P G  P S RL +A+V GCIPVII+D++ L
Sbjct: 213 -DIVIEEGSTGAQGKAAAQDGMRKSFFCLSPAGDTPSSARLFDAIVSGCIPVIISDELEL 271

Query: 305 PFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPA 361
           PF   + + EI +FV+  D  +   L   L  I  + I   Q  L   S  R  L+  PA
Sbjct: 272 PFEGILDYREIALFVSASDAVQPGWLLKYLRGINAKRIREIQSNLVKYS--RHFLYSSPA 329

Query: 362 QP 363
           QP
Sbjct: 330 QP 331


>gi|449453962|ref|XP_004144725.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
           IRX7-like [Cucumis sativus]
          Length = 518

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 101/367 (27%), Positives = 155/367 (42%), Gaps = 63/367 (17%)

Query: 36  PVGK-LKVYVYELPSKYNKKLL---LKDPRCLTHMFAAEIFMHRF--------------- 76
           P+G  ++VYVYE+P K+   LL       R  +++ +    +HR                
Sbjct: 123 PLGSPIRVYVYEMPWKFTYDLLWTFRNTYRETSNLTSNGSPVHRLIEQHSIDYWLWADLI 182

Query: 77  ------LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWP 130
                 LL   VR    EEAD FY P + T       L    +   + R A++ ++   P
Sbjct: 183 APESERLLKGVVRVYRQEEADLFYIPFFTTISFF---LLEKQQCKALYREALKWVTDQ-P 238

Query: 131 YWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSI 190
            W R+EG DH   V H +     ++  +   +  + LL        + +   V L +  I
Sbjct: 239 AWKRSEGRDHILPVHHPWS----FKTVRKFMKNAIWLLPDMDSTGNWYKPGQVFLEKDLI 294

Query: 191 TIPPYAP-----PQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYAR------GAR 239
              PY P       K  ++Q    + RSI ++FRG    +  +  G   A+      GA 
Sbjct: 295 L--PYVPNVELCDSKCLSYQ---QSKRSILLFFRG---RLKRNAGGKIRAKLGGELSGAD 346

Query: 240 AAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIA 299
             + E                     M+++IFCL P G  P S RL +A+V GCIPVI++
Sbjct: 347 DVLIEEGTAG------EGGKAAAQTGMRKSIFCLSPAGDTPSSARLFDAIVSGCIPVIVS 400

Query: 300 DDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAML 356
           D++ LPF   + + +I +FV+  D  K   L T L S     I R Q+ LA   + R  +
Sbjct: 401 DELELPFEGILDYRKIALFVSSSDALKSGWLLTYLRSFSAADIRRLQQNLAK--LSRHFI 458

Query: 357 FPQPAQP 363
           +  PAQP
Sbjct: 459 YSSPAQP 465


>gi|242065760|ref|XP_002454169.1| hypothetical protein SORBIDRAFT_04g025885 [Sorghum bicolor]
 gi|241934000|gb|EES07145.1| hypothetical protein SORBIDRAFT_04g025885 [Sorghum bicolor]
          Length = 338

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 81/312 (25%), Positives = 136/312 (43%), Gaps = 43/312 (13%)

Query: 80  SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSP--------RMMRSAIQLISSNWPY 131
           +P    +P EA  F  P+   C+L     P   ++         R +   + +++  +PY
Sbjct: 42  NPFAARHPGEAHAFLLPV-SVCNLVQYIYPFYRRNTTAYMAHMRRALADYVDVVAGRYPY 100

Query: 132 WNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRN-----HVCLN 186
           WNR+ GADH  V  HD+      +    +    + +L  A   ++F  R       V L 
Sbjct: 101 WNRSRGADHVMVSCHDWAPLVS-EANGELYANAIRVLCNANTSESFRPRKDATLPEVNLG 159

Query: 187 EGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENF 246
           +G +  P +          +PP+  R+   +F G  +          + R A    W   
Sbjct: 160 DGLLRRPTFG---------MPPEN-RTTLAFFAGGMHG---------HIRKALLGYWLG- 199

Query: 247 KNNPLFDISTDHPT--TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL 304
           + +P  DI    P    Y+  M  A FCLCP G+   SPR+VE+V  GC+PVII+D    
Sbjct: 200 RKDPDMDIHEYLPKGQDYHALMASARFCLCPSGFEVASPRVVESVFSGCVPVIISDGYPP 259

Query: 305 PFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE--VILRKQRLLANPSMKRAMLFPQPAQ 362
           PF+D + W ++ + V    +P+L  IL  +      +LR + L A    +R  +  +P+Q
Sbjct: 260 PFSDVLDWSKMSVTVPPARIPELKDILKGVSERRYRVLRARVLQA----QRHFVVHRPSQ 315

Query: 363 PGDAFHQILNGL 374
             D    +++ +
Sbjct: 316 RFDMIRMVMHSI 327


>gi|242059273|ref|XP_002458782.1| hypothetical protein SORBIDRAFT_03g040220 [Sorghum bicolor]
 gi|241930757|gb|EES03902.1| hypothetical protein SORBIDRAFT_03g040220 [Sorghum bicolor]
          Length = 426

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 90/360 (25%), Positives = 148/360 (41%), Gaps = 42/360 (11%)

Query: 40  LKVYVYELPSKYNKKLLLKD-------PRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADW 92
           L++YVY        + LL+          CL   +  ++ +H+FLL S  RT N ++A+ 
Sbjct: 53  LRIYVYAEDEIQGLRALLRGRDGTITAATCLKGQWGTQVKIHQFLLKSRFRTFNKDQANL 112

Query: 93  FYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
           F+ P Y  C      L     S + +      + S  PY+ R+ G DH FV P   GA  
Sbjct: 113 FFVPSYVKCVRMTGAL-----SDKEINQTYVKVLSQMPYFRRSGGRDHIFVFPSGAGA-- 165

Query: 153 HYQEEKAIERGILPLLQRATLVQTFG----QRNHVCLNEGSITIPPYAPPQKM-----QA 203
                  + R     L R+ ++   G    +R     N     I P      M     +A
Sbjct: 166 ------HLFRSWATFLNRSIILTPEGDRTDKRGTSAFNTWKDIIIPGNVDDSMVKSDARA 219

Query: 204 HQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP---T 260
            Q  P T R     F G         + G       A  + +   +P+  +S  +     
Sbjct: 220 VQPIPLTKRKYLANFLG-----RAQGKAGRLQLVELAKQYPDKLESPVLKLSGPNKLGRI 274

Query: 261 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
            Y++ ++ A FCL P G + W+ R  E+    C+PVI++D++ LPF + I + EI +   
Sbjct: 275 EYFKHLRNAKFCLAPRGESSWTLRFYESFFVECVPVILSDEVELPFQNVIDYGEISIKWP 334

Query: 321 EEDV-PKLDTILTSIPPEVILRKQRLLANPSMKRAM-LFPQPAQPGDAFHQILNGLARKL 378
              + P+L   L SI  E   R + ++ +    R + ++    +P  A   IL  L +K+
Sbjct: 335 SSRIGPELLEYLESISDE---RIEEMIGHGRQMRCLWVYAADTEPCSAMSGILTELQKKV 391


>gi|413937765|gb|AFW72316.1| hypothetical protein ZEAMMB73_504926 [Zea mays]
          Length = 472

 Score = 99.0 bits (245), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 81/304 (26%), Positives = 134/304 (44%), Gaps = 30/304 (9%)

Query: 80  SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKS-----PRMMRSAIQLISSNWPYWNR 134
           +P    +P EA  F  P+   C+L      L   +      R +   + +++  +PYWNR
Sbjct: 179 NPFAARDPGEAHAFLLPV-SVCNLVHYIYRLNTTAYMAHMRRALADYVDVVAHKYPYWNR 237

Query: 135 TEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPP 194
           + GADH  V  HD+      +  + +    + +L  A   + F  R    L E ++    
Sbjct: 238 SRGADHVIVSCHDWAPLVS-EANRELYANAIRVLCNANTSEGFRPRKDATLPEVNLADGL 296

Query: 195 YAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDI 254
              P       +PP+  R+   +F G  +          + R A    W   + +P  DI
Sbjct: 297 LRRPTL----GLPPEN-RTTLAFFAGGMHG---------HIRRALLGYWLG-RKDPDMDI 341

Query: 255 STDHPT--TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 312
               P    Y+  M RA FCLCP G+   SPR+VE+V  GC+PVII+D    PF+D + W
Sbjct: 342 HEYLPAGQDYHALMARARFCLCPSGFEVASPRVVESVFTGCVPVIISDGYPPPFSDVLDW 401

Query: 313 EEIGMFVAEEDVPKLDTILTSIPPE--VILRKQRLLANPSMKRAMLFPQPAQPGDAFHQI 370
            ++ + V    +P+L  +L  +      +LR + L A    +R  +  +PA+  D    +
Sbjct: 402 SKMSVTVPPARIPELKAVLKGVSERRYRVLRARVLQA----QRHFVVHRPARRFDMIRMV 457

Query: 371 LNGL 374
           L+ +
Sbjct: 458 LHSI 461


>gi|307104488|gb|EFN52741.1| hypothetical protein CHLNCDRAFT_138314 [Chlorella variabilis]
          Length = 647

 Score = 99.0 bits (245), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 91/343 (26%), Positives = 147/343 (42%), Gaps = 64/343 (18%)

Query: 64  THMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPT--------------GLP 109
           T  ++ E  +   +L S  RTL+PEEAD+FY P++P+C + P                  
Sbjct: 231 TWTYSLEFGLLEMMLQSEHRTLDPEEADFFYVPVFPSCFIWPVRSTADSLRDFYYGWAQS 290

Query: 110 LPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQ 169
               +  ++  A   + +++PYW+R  G DH ++V HD  +C+     K+    IL    
Sbjct: 291 RVQGAANLLLEAYHWLRAHYPYWDRRGGRDHIWLVTHDEASCYVPAAIKSASI-ILSHWG 349

Query: 170 RATLVQTFGQ---RNHVCLN--------EGS-----ITIPPYAPPQKM---------QAH 204
           R     T G     N   LN        EGS     ++ P + P + +           H
Sbjct: 350 RKDPNHTSGTGFPGNVYHLNVSHPHWEPEGSMAKLDLSQPCHDPVKDLVLPLMKTPDHYH 409

Query: 205 QIP----PDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFD----IST 256
           Q P    P   R+   + RG  +  +  PE   Y+RG R  +W   + +   D    +  
Sbjct: 410 QSPLVGAPTRNRTWLAFHRGRQHKTDA-PE---YSRGVRQRLWSASQEHGWLDKYGILLG 465

Query: 257 DHPTT-----------YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
           ++P++           Y + +  +IFCL   G   WS R+ +A + GCIPVI+ D++ + 
Sbjct: 466 ENPSSPGAEEVKLAGDYSQLLASSIFCLVLPG-DGWSARMDDATLHGCIPVIVMDEVDVS 524

Query: 306 FADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLAN 348
           F   I  ++  + VA+ DV +L  IL  I  E     QR L  
Sbjct: 525 FESVIDLQQFTVRVAQADVERLPEILLEISQERRQEMQRALGR 567


>gi|225461772|ref|XP_002285599.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
           IRX7-like [Vitis vinifera]
          Length = 513

 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 96/366 (26%), Positives = 155/366 (42%), Gaps = 47/366 (12%)

Query: 40  LKVYVYELPSKYNKKLL----------------------LKDPRCLTHMFAAEIFM--HR 75
           ++VYVYE+P+K+   LL                      L +   + +   A++      
Sbjct: 128 IRVYVYEMPAKFTYDLLWLFRNTYKETSNRTSNGSPVHRLIEQHSIDYWLWADLTAPESE 187

Query: 76  FLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRT 135
            LL + VR    EEAD FY P + T       L  P +   + R A++ ++   P W R+
Sbjct: 188 RLLKNVVRVHRQEEADLFYIPFFTTISFF---LLEPEQWKPLYREALKWVTDQ-PAWKRS 243

Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
           EG DH   V H +     ++  +   +  + LL        + +   V L +  I   PY
Sbjct: 244 EGRDHILPVHHPWS----FKTVRKSMKNAIWLLPDMDSTGNWYKPGQVSLEKDLIL--PY 297

Query: 196 APPQKMQAHQIPPDT--PRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFD 253
            P   +   +   ++   R   ++FRG    +  +  G   A+       ++        
Sbjct: 298 VPNVDLCDAKCSSESESKRKTLLFFRG---RLKRNAGGKIRAKLMAELSGDDGVVIQEGT 354

Query: 254 ISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE 313
                       M+++IFCL P G  P S RL +A+V GCIPVI++D++ LPF   + + 
Sbjct: 355 AGEGGKEAAQRGMRKSIFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEGILDYR 414

Query: 314 EIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQ---PGDAF 367
           +I +FV+  D  +   L T L SI P  I   QR LA  S  R  ++  PAQ   P D  
Sbjct: 415 KIALFVSSSDAMQPGWLLTFLKSISPAQIKEMQRNLAKYS--RHFVYSSPAQLLGPEDLV 472

Query: 368 HQILNG 373
            +++ G
Sbjct: 473 WRMMAG 478


>gi|356557595|ref|XP_003547101.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
           At5g03795-like [Glycine max]
          Length = 412

 Score = 98.6 bits (244), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 83/327 (25%), Positives = 140/327 (42%), Gaps = 48/327 (14%)

Query: 60  PRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTC-DLTPTGLPLPFKSPRMM 118
           PR LT  +A+E +  + +  S   T +P  A  F+ PI  +C  +   GL +     RM+
Sbjct: 107 PRKLTGKYASEGYFFKNIRESRFFTDDPRRAHLFFLPI--SCHKMRGRGLTIE----RMI 160

Query: 119 RSA---IQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ 175
                 ++ +   +PYWNRT GADHFFV  HD G            +G+  L + +  V 
Sbjct: 161 DEVEKYVEHLKLKYPYWNRTLGADHFFVTCHDIGV--------KATKGVPHLTKNSIRVA 212

Query: 176 TFGQRN------HVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDP 229
                +      H  +    + +P + PP +           R+ F ++ G         
Sbjct: 213 CSSSYDDDDYVPHKDVTLPQVQLPFFHPPGENDIKN------RNTFAFWAG--------- 257

Query: 230 EGGYYARGARAAVWENFK----NNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRL 285
                 +    A+W+N       N   D+    P  Y E + ++ FCLCP G  P    L
Sbjct: 258 RSDSRLKDDLMAMWDNDTELDIQNXRVDLRATGPVVYMEKLYKSKFCLCPHG--PVGNSL 315

Query: 286 V-EAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQR 344
           + +++ FGC+PVI+ +   LPF D + W +  + + E ++  L  IL SI  +  +   R
Sbjct: 316 IADSIHFGCVPVIMPNYYDLPFNDILDWSQFSVVLKETNIYLLKDILRSISEKHFISLNR 375

Query: 345 LLANPSMKRAMLFPQPAQPGDAFHQIL 371
            +    +++   +  P    DAFH ++
Sbjct: 376 NIV--KIQKHFKWNTPPVRQDAFHMVM 400


>gi|224129250|ref|XP_002320538.1| predicted protein [Populus trichocarpa]
 gi|222861311|gb|EEE98853.1| predicted protein [Populus trichocarpa]
          Length = 332

 Score = 98.6 bits (244), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 82/343 (23%), Positives = 141/343 (41%), Gaps = 34/343 (9%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
            K+Y+Y    K     +  D  C +   +   F+H         T +P+EA  ++ P   
Sbjct: 5   FKIYIY----KEGDPPMFHDGPCKSIYSSEGRFIHELEKGKSFTTTDPDEALVYFLPFSV 60

Query: 100 TCDLTPTGLPLPFKSPRMMRSA---IQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQE 156
              +    +P   +   +  +    I +I+  +P+WNR+ GADHF +  HD+G       
Sbjct: 61  VMLVQYLYVPGSHEIDAIGNTVVDYINVIADKYPFWNRSLGADHFILSCHDWGPRTSSYV 120

Query: 157 EKAIERGILPLLQRATLVQTFGQRN-----HVCLNEGSITIPPYAPPQKMQAHQIPPDTP 211
                  I  +L  A   + F  +       + L  G IT     P            + 
Sbjct: 121 PHLFNNSI-RVLCNANTSEGFNPKKDASFPEIHLRTGEITGLVGGPSP----------SR 169

Query: 212 RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIF 271
           RSI  +F G  +          + R      W++   +        +  +Y   ++ + F
Sbjct: 170 RSILAFFAGRLHG---------HIRRLLLEQWKDKDQDVQVHDQLRNGMSYDSMLKNSRF 220

Query: 272 CLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTIL 331
           CLCP G+   SPR+VEA+   C+PV+I+D  V PF+D + W+   + V  +D+PK+  IL
Sbjct: 221 CLCPSGYEVASPRIVEAIYAECVPVLISDGYVPPFSDVLNWKAFSIQVQVKDIPKIKDIL 280

Query: 332 TSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
             I     LR QR +    ++R  +     +  D FH  ++ +
Sbjct: 281 MGISQRQYLRMQRRVKQ--VQRHFVVNGIPKRFDVFHMTIHSI 321


>gi|168051833|ref|XP_001778357.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162670236|gb|EDQ56808.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 406

 Score = 98.6 bits (244), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 94/383 (24%), Positives = 160/383 (41%), Gaps = 67/383 (17%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRC------------------LTHMFAAEIFMHRFLLS-- 79
           LKVY+Y++P K+N  L+  D +                   +    + E +M  +LL   
Sbjct: 31  LKVYMYDIPRKFNFGLMTMDNKNEDLPWGNHAAPPWSQQWEVNKQHSVEYWMTVYLLDGW 90

Query: 80  -------SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLP---FKSPRMMRSAIQLISSNW 129
                  + +R  +P +AD F+ P + +      G  +     +  + ++  +  I  N 
Sbjct: 91  DRKDGRRAAIRVRDPYQADVFFVPFFASLSFNNYGYGMEGPGAELDKNLQECVVNILLNS 150

Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATL-VQTFGQ--RNHVCLN 186
            +W  ++G DH  V+ H   A  HY+           LL  + L V  FG+   +  CL 
Sbjct: 151 KWWKASQGRDHVIVLHHP-NAFRHYRH----------LLNSSMLIVADFGRFSTDVACL- 198

Query: 187 EGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENF 246
           +  I  P     Q          + R I +YF+G    ++   +G   A+ A+A + E  
Sbjct: 199 QKDIVAPYEHVVQSYVDDHSNSFSQRHILLYFQG---RIHRKADGIVRAKLAKALMNEKD 255

Query: 247 KNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 306
            +    + S++        M+ + FCL P G  P S RL +A+V  C+PVI++D I LPF
Sbjct: 256 VHYMDSEASSEALAEATSGMRSSRFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDRIELPF 315

Query: 307 ADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLAN----------PSMKRAML 356
            D I + E  +F + E+         ++ P+ +LR  R +             ++     
Sbjct: 316 EDDIDYNEFSLFFSSEE---------AVRPQYLLRILRGINETKWTQMWTKLKAVSHHFE 366

Query: 357 FPQPAQPGDAFHQILNGLARKLP 379
           F  PA+  DA + I   + RKLP
Sbjct: 367 FQHPAKKDDAVNMIFKQVQRKLP 389


>gi|356503460|ref|XP_003520526.1| PREDICTED: uncharacterized protein LOC100775594 [Glycine max]
          Length = 761

 Score = 98.2 bits (243), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 92/383 (24%), Positives = 157/383 (40%), Gaps = 74/383 (19%)

Query: 15  ADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLK---DPRCLT------- 64
           ADN    E++  +   V +  P+    +YVY+LP  +N  LL        C+        
Sbjct: 296 ADNQHFDEKVINAKAVVAKKRPL----IYVYDLPPVFNSLLLEGRHFKQNCVNRLYDVYN 351

Query: 65  ------HMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTC--DLTPTGLPLPFKSPR 116
                  ++ A+I ++  +L+SP RTLN +EAD+F+ P+  +C  D       L  ++  
Sbjct: 352 ATIWTDELYGAQIALYESILASPHRTLNGDEADFFFVPVLDSCLIDRADHAPHLSTQNHE 411

Query: 117 MMRSAIQL---------ISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPL 167
            +RS + L         I   +PYWN + G DH +    D GAC+     K I   ++ +
Sbjct: 412 GLRSFLTLDFYKNAYNHIVEQYPYWNCSSGRDHIWFFSWDEGACY---APKEIWSSMMLV 468

Query: 168 LQRATLVQTFGQRN-----------------HVCLN-EGSITIPPYAPPQKMQAHQIP-- 207
               T  + +                     H C + E  + IP +   +    H +   
Sbjct: 469 HWGNTNTKHYHSTTAYCPDNWDGIPSDRRGFHPCFDPEKDLVIPAW---KVTHVHVLSSK 525

Query: 208 ----PDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFD---------- 253
               P   R    YF G            +Y+ G R  + E F + P  +          
Sbjct: 526 LWAWPLEKRKTLFYFNGNLGPAYPYGRNEWYSMGIRQKLAEEFGSKPNKEGKLGKQRAKD 585

Query: 254 --ISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
             ++ +    Y  ++  ++FC    G   WS R+ ++V+ GCIPVII D I LP+ + + 
Sbjct: 586 VVVTAERSENYEVELASSVFCGVLPG-DGWSGRMEDSVLQGCIPVIIQDGIFLPYENVLN 644

Query: 312 WEEIGMFVAEEDVPKLDTILTSI 334
           ++   + + E+++P L  IL  I
Sbjct: 645 YDSFAVRIPEDEIPNLIKILRGI 667


>gi|356533217|ref|XP_003535163.1| PREDICTED: uncharacterized protein LOC100807663 [Glycine max]
          Length = 795

 Score = 98.2 bits (243), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 96/383 (25%), Positives = 161/383 (42%), Gaps = 79/383 (20%)

Query: 15  ADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLL------------LKDPRC 62
           AD+T   E++      V +  P+    VYVY+LP ++N  LL            + D   
Sbjct: 331 ADDTHANEKMINLNAVVAKKRPL----VYVYDLPPEFNSLLLEGRHYKLECVNRIYDDNN 386

Query: 63  LT----HMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLP--LPFKSPR 116
           +T     ++ A+I ++  LL+SP RTLN EEAD+F+ P+  +C +T       L  +   
Sbjct: 387 ITVWTDQLYGAQIALYESLLASPHRTLNGEEADFFFVPVLDSCIITRADDAPHLSMQEHM 446

Query: 117 MMRSAIQL---------ISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPL 167
            +RS++ L         I   +PYW+ + G DH +    D GAC+  +E       I   
Sbjct: 447 GLRSSLTLEYYKNTYTHIVEQYPYWSHSSGRDHIWSFSWDEGACYAPKE-------IWNS 499

Query: 168 LQRATLVQTFGQRNHVC----------LNEGSITIPPYAPPQK---MQAHQIP------- 207
           +       T  + NH            ++     I P   P K   + A ++P       
Sbjct: 500 MMLVHWGNTNTKHNHSTTAYWADNWDKISSDRRGIHPCFDPDKDLVLPAWKVPDAYVLTS 559

Query: 208 -----PDTPRSIFVYFRGLFYDV--NNDPEGGYYARGARAAVWENFKNNPLFD------- 253
                    R    YF G       +  PE   Y+ G R  + E F ++P  D       
Sbjct: 560 KLWARSHEKRKTLFYFNGNLGPAYPHGRPEDT-YSMGIRQKLAEEFGSSPNKDGKLGKQH 618

Query: 254 -----ISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
                ++ +    Y+ D+  ++FC    G   WS R+ ++++ GCIPV+I D I LP+ +
Sbjct: 619 AKDVIVTPERSEDYHMDLASSVFCGVFPG-DGWSGRMEDSILQGCIPVVIQDGIFLPYEN 677

Query: 309 AIPWEEIGMFVAEEDVPKLDTIL 331
            + ++   + + E ++P L   L
Sbjct: 678 VLNYDSFAVRIPEAEIPNLIKTL 700


>gi|168033234|ref|XP_001769121.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162679650|gb|EDQ66095.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 765

 Score = 97.8 bits (242), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 102/383 (26%), Positives = 161/383 (42%), Gaps = 75/383 (19%)

Query: 16  DNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLL------------LKDPRCL 63
           D + +  RIS  A ++++  P+    VYVY+LP+++  + L            L D    
Sbjct: 305 DVSDETNRIS-VASNIMKKRPL----VYVYDLPAEFTTQFLQGRHFKFECVNRLYDVDNA 359

Query: 64  T----HMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTC---------DLTPTGLPL 110
           T    +++ A I ++  LL+S  RT N +EAD+FY P    C          LT  G  +
Sbjct: 360 TIWTENLYGAGIALYESLLASEHRTTNGDEADFFYVPFLQACIVEQGDAAPHLTFQGKYM 419

Query: 111 PFKSPRMMRSAIQL---ISSNWPYWNRTEGADHFFVVPHDFGAC------------FHYQ 155
             +       + Q+   I  N+PYWNR+ G DH +  P D GAC             H+ 
Sbjct: 420 GLRQYFAGDYSKQIYFHIQQNYPYWNRSAGRDHIWFFPWDEGACSAPKEIWNSMMLSHWG 479

Query: 156 EEKAIERGILPLLQRAT--LVQTFGQRNHVCLN-EGSITIP------PYAPPQKMQAHQI 206
              A  +      +     L+    + +H C +    + +P      PY   Q + +   
Sbjct: 480 NTNAKHKASTTAYRADNWDLIPPEWRGDHPCYDPAKDLVLPAWKFPDPYPIVQNLSSRH- 538

Query: 207 PPDTPRSIFVYFRGLF---YDVNNDPEGGYYARGARAAVWENFKNNPLFD---------- 253
                R    YF G     YD N  PE G Y+ G R  +   F + P             
Sbjct: 539 --RQDRPTLFYFNGNLGSAYD-NGRPEPG-YSMGIRQKLAAEFGSQPNKKGLLGRQAVDD 594

Query: 254 --ISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
             +       Y  ++ ++ FC    G   WS R+ ++++ GCIPVII D I LPF + + 
Sbjct: 595 VVVQAQRSPQYKLELSKSRFCGVLPG-DGWSGRMEDSILSGCIPVIIQDGIHLPFENVLD 653

Query: 312 WEEIGMFVAEEDVPKLDTILTSI 334
           +E   + VAE+++  L TIL +I
Sbjct: 654 YESFTVRVAEDNIHNLITILKAI 676


>gi|147775380|emb|CAN73459.1| hypothetical protein VITISV_022181 [Vitis vinifera]
          Length = 319

 Score = 97.8 bits (242), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 44/113 (38%), Positives = 69/113 (61%), Gaps = 2/113 (1%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + M  + FCLCP GW   SPR+VEA+  GC+PVII D  VLPF++ + W +  + +  
Sbjct: 198 YAKSMGDSKFCLCPSGWEVASPRVVEAIAAGCVPVIICDYYVLPFSEVLDWSKFSINITS 257

Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           + +P++  IL ++P E  LR Q+ +    ++R  +  +PAQP D  H IL+ +
Sbjct: 258 DKIPEIKKILKAVPNERYLRMQKRVKQ--VQRHFVINRPAQPYDMLHMILHSV 308



 Score = 43.9 bits (102), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 17/37 (45%), Positives = 24/37 (64%)

Query: 116 RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
           R++   I ++S  +PYWNR+ GADHF V  HD+   F
Sbjct: 82  RLVTDYIYVVSDKYPYWNRSNGADHFLVSCHDWVQSF 118


>gi|297597828|ref|NP_001044591.2| Os01g0811400 [Oryza sativa Japonica Group]
 gi|255673802|dbj|BAF06505.2| Os01g0811400 [Oryza sativa Japonica Group]
          Length = 497

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 96/358 (26%), Positives = 153/358 (42%), Gaps = 54/358 (15%)

Query: 40  LKVYVYELPSKYNKKLL--LKDP-RCLTHMFAAEIFMHRF-------------------- 76
           L+VYVYE+PS++   LL   +D  R  +++ +    +HR                     
Sbjct: 113 LRVYVYEMPSRFTYDLLRLFRDSYRETSNLTSNGSPVHRLVEQHSIDYWLWADLIAPESQ 172

Query: 77  -LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRT 135
            LL + +R    EEAD FY P + T       L    +   + R A++ ++   P W R+
Sbjct: 173 RLLKNVIRVRRQEEADIFYVPFFTTISYF---LLEKQECKALYREALKWVTDQ-PAWQRS 228

Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
           EG DH   V H +     ++  +   +  + LL        + +   V L +    I PY
Sbjct: 229 EGRDHVIPVHHPWS----FKSVRRFVKKAIWLLPDMDSTGNWYKPGQVYLEKD--VILPY 282

Query: 196 APPQKMQAHQIPPDTP--RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFD 253
            P   +   +   +T   RS  ++FRG          GG      R+ +    K+     
Sbjct: 283 VPNVDLCDSKCVSETQSRRSTLLFFRGRL----RRNAGG----KIRSKLVTELKDAEGII 334

Query: 254 I-----STDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
           I       D        M++++FCL P G  P S RL +A+V GCIPVI++D++ LPF  
Sbjct: 335 IEEGTAGADGKAAAQNGMRKSLFCLNPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEG 394

Query: 309 AIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP 363
            + + +I +FV+  D  +   L   L SI  + I + Q  L   S  R  L+  PA+P
Sbjct: 395 ILDYRKIALFVSSNDAVQPGWLVKYLRSIDAKRIRQMQSNLLKYS--RHFLYSSPARP 450


>gi|212722910|ref|NP_001131485.1| uncharacterized protein LOC100192822 [Zea mays]
 gi|194691662|gb|ACF79915.1| unknown [Zea mays]
          Length = 426

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 91/359 (25%), Positives = 144/359 (40%), Gaps = 40/359 (11%)

Query: 40  LKVYVYELPSKYNKKLLLKD-------PRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADW 92
           L++YVY        + LL+          CL   +  ++ +H+FLL S  RT N + A+ 
Sbjct: 53  LRIYVYAEDEIQGLRALLRGRDGTINAATCLKGQWGTQVKVHQFLLKSRFRTFNKDHANL 112

Query: 93  FYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
           F+ P Y  C      L     S + +      + S  PY+ R+ G DH FV P   GA  
Sbjct: 113 FFVPSYVKCVRMTGAL-----SDKEINQTYVKVLSQMPYFRRSGGRDHIFVFPSGAGA-- 165

Query: 153 HYQEEKAIERGILPLLQRATLVQTFG----QRNHVCLNEGSITIPPYAPPQKMQAHQIP- 207
           H     AI       L R+ ++   G    +R     N     I P      M     P 
Sbjct: 166 HLFRSWAI------FLNRSIILTPEGDRTDKRGTSAFNTWKDIIIPGNVDDSMVKSDAPA 219

Query: 208 ----PDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP---T 260
               P T R     F G         + G       A  + +   +P   +S  +     
Sbjct: 220 VQPIPLTKRKYLANFLG-----RAQGKAGRLQLVELAKQYPDKLESPELKLSGPNKLGRI 274

Query: 261 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
            Y++ ++ A FCL P G + W+ R  E+    C+PVI++D++ LPF + I + EI +   
Sbjct: 275 EYFKHLRNAKFCLAPRGESSWTLRFYESFFVECVPVILSDEVELPFQNVIDYSEISIKWP 334

Query: 321 EEDV-PKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKL 378
              + P+L   L SI  E I  ++ +     M+   ++    +P  A   IL  L +K+
Sbjct: 335 SSRIGPELLEYLESISDERI--EEMIGHGREMRCLWVYAADTEPCSAMSGILTELQKKV 391


>gi|55297487|dbj|BAD68203.1| exostosin family protein-like [Oryza sativa Japonica Group]
 gi|55297674|dbj|BAD68245.1| exostosin family protein-like [Oryza sativa Japonica Group]
          Length = 512

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 96/358 (26%), Positives = 153/358 (42%), Gaps = 54/358 (15%)

Query: 40  LKVYVYELPSKYNKKLL--LKDP-RCLTHMFAAEIFMHRF-------------------- 76
           L+VYVYE+PS++   LL   +D  R  +++ +    +HR                     
Sbjct: 113 LRVYVYEMPSRFTYDLLRLFRDSYRETSNLTSNGSPVHRLVEQHSIDYWLWADLIAPESQ 172

Query: 77  -LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRT 135
            LL + +R    EEAD FY P + T       L    +   + R A++ ++   P W R+
Sbjct: 173 RLLKNVIRVRRQEEADIFYVPFFTTISYF---LLEKQECKALYREALKWVTDQ-PAWQRS 228

Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
           EG DH   V H +     ++  +   +  + LL        + +   V L +    I PY
Sbjct: 229 EGRDHVIPVHHPWS----FKSVRRFVKKAIWLLPDMDSTGNWYKPGQVYLEKD--VILPY 282

Query: 196 APPQKMQAHQIPPDTP--RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFD 253
            P   +   +   +T   RS  ++FRG          GG      R+ +    K+     
Sbjct: 283 VPNVDLCDSKCVSETQSRRSTLLFFRGRL----RRNAGG----KIRSKLVTELKDAEGII 334

Query: 254 I-----STDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
           I       D        M++++FCL P G  P S RL +A+V GCIPVI++D++ LPF  
Sbjct: 335 IEEGTAGADGKAAAQNGMRKSLFCLNPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEG 394

Query: 309 AIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP 363
            + + +I +FV+  D  +   L   L SI  + I + Q  L   S  R  L+  PA+P
Sbjct: 395 ILDYRKIALFVSSNDAVQPGWLVKYLRSIDAKRIRQMQSNLLKYS--RHFLYSSPARP 450


>gi|356553158|ref|XP_003544925.1| PREDICTED: probable glucuronosyltransferase Os01g0926700-like
           [Glycine max]
          Length = 440

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 97/378 (25%), Positives = 156/378 (41%), Gaps = 43/378 (11%)

Query: 40  LKVYVYELPSKYNKKLLLK-------DPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADW 92
           LK+YVY+       K LL+       D  CL   + +++ +H+ LL S  RT   EEAD 
Sbjct: 67  LKIYVYQEDEIDGLKELLRGRDAKITDEACLKGQWGSQVKIHKLLLQSKQRTWKKEEADL 126

Query: 93  FYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
           F+ P Y  C     GL     + + + S    + S  PY+  + G +H FV P   GA  
Sbjct: 127 FFVPSYVKCARMMGGL-----NDKEINSTYVKVISQMPYFRLSGGRNHIFVFPSGAGAHL 181

Query: 153 HYQEEKAIERGIL--PLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDT 210
                  I R I+  P   R        +R+    N     I P      M   +    T
Sbjct: 182 FKSWATYINRSIILTPEGDRTD------KRDTSAFNTWKDIIIPGNIDDGMT--KTGDTT 233

Query: 211 PRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLF----DISTDHP-----TT 261
            + + +  R    +     +G    +  R  + E  K  P      D+    P       
Sbjct: 234 VQPLPLSKRKYLANYLGRAQG----KAGRLKLIELSKQFPEKLECPDLKFSGPDKLGRKE 289

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGM-FVA 320
           Y+E ++ + FCL P G + W+ R  E+    C+PVI++D I LPF + I + +I + + +
Sbjct: 290 YFEHLRNSKFCLAPRGESSWTLRFYESFFVECVPVILSDQIELPFQNVIDYSQISIKWPS 349

Query: 321 EEDVPKLDTILTSIPPEVILRKQRLLANPSMKRA-MLFPQPAQPGDAFHQILNGLARKLP 379
            +  P+L   L SIP E I   ++++A     R   ++   ++   A   I+  L RK+ 
Sbjct: 350 SQIGPELLQYLESIPDEEI---EKIIARGRQVRCWWVYASDSESCSAMRGIMWELQRKVR 406

Query: 380 ---HDKSTYLKPGGKFLN 394
              H   T+    G  +N
Sbjct: 407 QFHHSAETFWLHNGSIVN 424


>gi|168004467|ref|XP_001754933.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162694037|gb|EDQ80387.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 378

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 94/386 (24%), Positives = 155/386 (40%), Gaps = 80/386 (20%)

Query: 44  VYELPSKYNKKLLLKD------------------PRCLTHMFAAEIFMHRFLLS------ 79
           +YELP KYN  LL +D                     + +  + E ++  +LLS      
Sbjct: 1   MYELPRKYNLGLLQRDNPDQELPWTSDVIPPWKMEFEVNNQHSVEYWLMVYLLSGRDRKK 60

Query: 80  ---SPVRTLNPEEADWFYTPIYPTCDLTPTG----LPLPFKSPRMMRSAIQLISSNWPYW 132
              + VR  +PE+A+ F+ P + +      G     P   K   +    ++++S N  +W
Sbjct: 61  GNMAAVRVKDPEQAEVFFVPFFASLSFNSFGRNMAAPNAAKDRELQEGVVEMLS-NSKWW 119

Query: 133 NRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITI 192
            +++G DH  V+ H     F Y  +         + Q   +V  FG+ N          +
Sbjct: 120 QKSQGRDHIIVIHH--PNAFRYYRDM--------MNQSMFIVADFGRYNQTVARLKKDIV 169

Query: 193 PPYAP--PQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNP 250
            PYA   P   + +   P + R   ++F+G    V    +G   A+              
Sbjct: 170 APYAHVVPSYNEDNPSDPFSARKTLLFFQG---RVRRKADGVIRAK-------------- 212

Query: 251 LFDISTDHPTTYYED--------------MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPV 296
           L  +  +    YYED              M+ + FCL P G  P S RL +A+V  C+PV
Sbjct: 213 LGKLLMNQTDVYYEDSLARTEAIAMSTQGMRFSRFCLHPAGDTPSSCRLFDAIVSHCVPV 272

Query: 297 IIADDIVLPFADAIPWEEIGMFVAEEDV---PKLDTILTSIPPEVILRKQRLLANPSMKR 353
           I++D I LPF D + + E  +F + ++      L   L SI  E  L+    L   ++  
Sbjct: 273 IVSDRIELPFEDDLDYSEFSIFFSAKEAIIPGHLLGTLRSITRERWLQMWNKLK--AISH 330

Query: 354 AMLFPQPAQPGDAFHQILNGLARKLP 379
              +  P++  DA + I   + RKLP
Sbjct: 331 HFEYQNPSKEDDAVNLIFKQVQRKLP 356


>gi|125528116|gb|EAY76230.1| hypothetical protein OsI_04166 [Oryza sativa Indica Group]
          Length = 513

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 96/358 (26%), Positives = 153/358 (42%), Gaps = 54/358 (15%)

Query: 40  LKVYVYELPSKYNKKLL--LKDP-RCLTHMFAAEIFMHRF-------------------- 76
           L+VYVYE+PS++   LL   +D  R  +++ +    +HR                     
Sbjct: 114 LRVYVYEMPSRFTYDLLRLFRDSYRETSNLTSNGSPVHRLVEQHSIDYWLWADLIAPESQ 173

Query: 77  -LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRT 135
            LL + +R    EEAD FY P + T       L    +   + R A++ ++   P W R+
Sbjct: 174 RLLKNVIRVRRQEEADIFYVPFFTTISYF---LLEKQECKALYREALKWVTDQ-PAWQRS 229

Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
           EG DH   V H +     ++  +   +  + LL        + +   V L +    I PY
Sbjct: 230 EGRDHVIPVHHPWS----FKSVRRFVKKAIWLLPDMDSTGNWYKPGQVYLEKD--VILPY 283

Query: 196 APPQKMQAHQIPPDTP--RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFD 253
            P   +   +   +T   RS  ++FRG          GG      R+ +    K+     
Sbjct: 284 VPNVDLCDSKCVSETQSRRSTLLFFRGRL----RRNAGGKI----RSKLVTELKDAEGII 335

Query: 254 I-----STDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
           I       D        M++++FCL P G  P S RL +A+V GCIPVI++D++ LPF  
Sbjct: 336 IEEGTAGADGKAAAQNGMRKSLFCLNPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEG 395

Query: 309 AIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP 363
            + + +I +FV+  D  +   L   L SI  + I + Q  L   S  R  L+  PA+P
Sbjct: 396 ILDYRKIALFVSSNDAVQPGWLVKYLRSIDAKRIRQMQSNLLKYS--RHFLYSSPARP 451


>gi|326517848|dbj|BAK03842.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 451

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 92/366 (25%), Positives = 148/366 (40%), Gaps = 53/366 (14%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP---VRTLNPEEADWFYT 95
           + KVYVYE   +    +L   P    +           LL+ P   VRT + + A  F+ 
Sbjct: 98  RFKVYVYE---EGEPPILHTGPCKDIYTIEGRFIEQLELLAPPAPGVRTRDADRAHAFFL 154

Query: 96  PIYPTCDLTPTGLPLPF-KSP--RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
           P      +      L + + P   ++   +++++S  P+WNR+ GADHF +  HD+G   
Sbjct: 155 PFSVAQMMQFAYRQLSYDRGPLLSLVGDYVRVVASRHPFWNRSAGADHFMLSCHDWGPDA 214

Query: 153 HYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPR 212
              + +    GI  L   A   + F     V + E  I +     P+++     P  + R
Sbjct: 215 SKGDPELYANGIRALCN-ANTSEGFRPGKDVSIPE--INLYDGDTPRQLLGPS-PGLSAR 270

Query: 213 SIFVYFRG------------------------LFYDVNNDPEGGYYARGARAAVWENFKN 248
               +F G                          YD+ +   G    R  R         
Sbjct: 271 PYLAFFAGGRHGHVRDLLLRHWKGRDPATFPVYEYDIPSTTGGNSSGRHNRRG------- 323

Query: 249 NPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
                   D  + Y+  M R+ FCLCP G    SPR+VEA+   C+PV++++    PFAD
Sbjct: 324 -------RDRQSDYFAYMHRSRFCLCPSGHEVASPRVVEAIHAECVPVLVSEGYAPPFAD 376

Query: 309 AIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFH 368
            + WE   + V   D+P+L  +L  IP   + R +  +    +KR     QP +  D FH
Sbjct: 377 VLRWESFSVSVPVVDIPRLKEVLEGIPMAEVERLREGVR--LVKRHFTLRQPPERLDMFH 434

Query: 369 QILNGL 374
            IL+ +
Sbjct: 435 MILHSV 440


>gi|358054164|dbj|GAA99700.1| hypothetical protein E5Q_06404 [Mixia osmundae IAM 14324]
          Length = 420

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 88/331 (26%), Positives = 136/331 (41%), Gaps = 56/331 (16%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVR---TLNPEEADWFYT 95
            LK+YVY+LP         +D +C    +++E+ +   L S+P     T +P EA++F  
Sbjct: 78  SLKIYVYDLPEHLRLGRAQED-KCRWSAYSSELHLTHMLASTPAARYVTHDPSEANFFLV 136

Query: 96  PIYPTCDL----TPTGLPLPFK---SPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDF 148
           P++P C L       G     +     ++++ A+  I S  P W R  G DH    P DF
Sbjct: 137 PLFPACYLFHCWDTAGWNRDLRCNVDEKLIQPAMAYIQSQ-PSWQRHSGRDHVMFHPMDF 195

Query: 149 GACFHYQEEKAIERGILPLLQRA---TLVQTFGQRNHVCLNEGSITIPPYA--PPQKMQA 203
           G  ++  + + + R +   +      T    +  R  V +   +  +  Y   P   +  
Sbjct: 196 GDTYYSTDSRVMMRQMSYFVTNGDARTNGTIYRPRKDVVIPSATYLLHSYRYHPRDYLDE 255

Query: 204 H-----QIPPD-----------------------------TPRSIFVYFRGLFYDVNNDP 229
           H     Q+P                               T R+I  YFRGL  DV  D 
Sbjct: 256 HGHPLSQMPKRPKDGTAVSDHVNIYEPTRARKWFASKRDPTGRTILAYFRGLGADVQPDD 315

Query: 230 EGGYYARG---ARAAVWENFKNNPLFDISTDHPTTYYE-DMQRAIFCLCPLGWAPWSPRL 285
           E     R     R    + F + P FD+S +     Y  ++  A + L P G    S R+
Sbjct: 316 EYSLGVRSLFYGRKGSHDGFASLPGFDVSVESENAEYAVELAHARYGLTPPGHTLDSTRI 375

Query: 286 VEAVVFGCIPVII-ADDIVLPFADAIPWEEI 315
            E + FG +PVII AD +VLPFA  +PW E+
Sbjct: 376 WEYLAFGVVPVIIGADGLVLPFAQHLPWSEM 406


>gi|302142837|emb|CBI20132.3| unnamed protein product [Vitis vinifera]
          Length = 379

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 85/305 (27%), Positives = 134/305 (43%), Gaps = 23/305 (7%)

Query: 77  LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTE 136
           LL + VR    EEAD FY P + T       L  P +   + R A++ ++   P W R+E
Sbjct: 55  LLKNVVRVHRQEEADLFYIPFFTTISFF---LLEPEQWKPLYREALKWVTDQ-PAWKRSE 110

Query: 137 GADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYA 196
           G DH   V H +     ++  +   +  + LL        + +   V L +    I PY 
Sbjct: 111 GRDHILPVHHPWS----FKTVRKSMKNAIWLLPDMDSTGNWYKPGQVSLEKD--LILPYV 164

Query: 197 PPQKMQAHQIPPDT--PRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDI 254
           P   +   +   ++   R   ++FRG    +  +  G   A+       ++         
Sbjct: 165 PNVDLCDAKCSSESESKRKTLLFFRG---RLKRNAGGKIRAKLMAELSGDDGVVIQEGTA 221

Query: 255 STDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEE 314
                      M+++IFCL P G  P S RL +A+V GCIPVI++D++ LPF   + + +
Sbjct: 222 GEGGKEAAQRGMRKSIFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEGILDYRK 281

Query: 315 IGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQ---PGDAFH 368
           I +FV+  D  +   L T L SI P  I   QR LA  S  R  ++  PAQ   P D   
Sbjct: 282 IALFVSSSDAMQPGWLLTFLKSISPAQIKEMQRNLAKYS--RHFVYSSPAQLLGPEDLVW 339

Query: 369 QILNG 373
           +++ G
Sbjct: 340 RMMAG 344


>gi|358054165|dbj|GAA99701.1| hypothetical protein E5Q_06403 [Mixia osmundae IAM 14324]
          Length = 419

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 88/331 (26%), Positives = 136/331 (41%), Gaps = 56/331 (16%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVR---TLNPEEADWFYT 95
            LK+YVY+LP         +D +C    +++E+ +   L S+P     T +P EA++F  
Sbjct: 77  SLKIYVYDLPEHLRLGRAQED-KCRWSAYSSELHLTHMLASTPAARYVTHDPSEANFFLV 135

Query: 96  PIYPTCDL----TPTGLPLPFK---SPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDF 148
           P++P C L       G     +     ++++ A+  I S  P W R  G DH    P DF
Sbjct: 136 PLFPACYLFHCWDTAGWNRDLRCNVDEKLIQPAMAYIQSQ-PSWQRHSGRDHVMFHPMDF 194

Query: 149 GACFHYQEEKAIERGILPLLQRA---TLVQTFGQRNHVCLNEGSITIPPYA--PPQKMQA 203
           G  ++  + + + R +   +      T    +  R  V +   +  +  Y   P   +  
Sbjct: 195 GDTYYSTDSRVMMRQMSYFVTNGDARTNGTIYRPRKDVVIPSATYLLHSYRYHPRDYLDE 254

Query: 204 H-----QIPPD-----------------------------TPRSIFVYFRGLFYDVNNDP 229
           H     Q+P                               T R+I  YFRGL  DV  D 
Sbjct: 255 HGHPLSQMPKRPKDGTAVSDHVNIYEPTRARKWFASKRDPTGRTILAYFRGLGADVQPDD 314

Query: 230 EGGYYARG---ARAAVWENFKNNPLFDISTDHPTTYYE-DMQRAIFCLCPLGWAPWSPRL 285
           E     R     R    + F + P FD+S +     Y  ++  A + L P G    S R+
Sbjct: 315 EYSLGVRSLFYGRKGSHDGFASLPGFDVSVESENAEYAVELAHARYGLTPPGHTLDSTRI 374

Query: 286 VEAVVFGCIPVII-ADDIVLPFADAIPWEEI 315
            E + FG +PVII AD +VLPFA  +PW E+
Sbjct: 375 WEYLAFGVVPVIIGADGLVLPFAQHLPWSEM 405


>gi|357474305|ref|XP_003607437.1| Cation proton exchanger [Medicago truncatula]
 gi|355508492|gb|AES89634.1| Cation proton exchanger [Medicago truncatula]
          Length = 1198

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 104/360 (28%), Positives = 151/360 (41%), Gaps = 58/360 (16%)

Query: 40  LKVYVYELPSKYNKKLLL---KDPRCLTHMFAAEIFMHRF-------------------- 76
           LKVYVY +P+K+   LLL      R  +++ +    +HR                     
Sbjct: 112 LKVYVYNMPNKFTYDLLLLFRNTYRDTSNLTSNGSPVHRLIEQHSIDYWLWADLIAPESE 171

Query: 77  -LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRT 135
            LL S VR    EEAD+FY P + T       L    +   + R A++ I+ + P W R+
Sbjct: 172 RLLKSVVRVHRQEEADFFYIPFFTTISFF---LLEKQQCKALYREALKWIT-DQPAWKRS 227

Query: 136 EGADHFFVVPH--DFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIP 193
            G DH   V H   F     Y + KAI   +LP +         GQ       E  + +P
Sbjct: 228 GGRDHILPVHHPWSFKTVRRYVK-KAI--WLLPDMDSTGNWYKPGQ----VYLEKDLILP 280

Query: 194 PYAPPQKMQAHQIPPDTP-RSIFVYFRGLFYDVNNDPEGGYYA------RGARAAVWENF 246
             A      A  +    P R+  ++FRG    +  +  G   +      RGA   V E  
Sbjct: 281 YVANVDFCDATCLSEINPKRNTLLFFRG---RLKRNAGGKIRSKLVDQLRGADGVVIEEG 337

Query: 247 KNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 306
            +                 M++++FCL P G  P S RL +A+V GCIPVI++D++ LPF
Sbjct: 338 TSGE------GGKEAAQNGMRKSLFCLNPAGDTPSSARLFDAIVSGCIPVIVSDELELPF 391

Query: 307 ADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP 363
              + + +I +FV+  D  K   L   L  I    I   Q+ LA  S  R  L+  PAQP
Sbjct: 392 EGILDYRKIALFVSSNDALKPSWLLKYLKDIRSAHIKEMQQNLAKYS--RHFLYSSPAQP 449


>gi|15237870|ref|NP_197191.1| Exostosin family protein [Arabidopsis thaliana]
 gi|9755690|emb|CAC01702.1| putative protein [Arabidopsis thaliana]
 gi|15810401|gb|AAL07088.1| unknown protein [Arabidopsis thaliana]
 gi|23296585|gb|AAN13125.1| unknown protein [Arabidopsis thaliana]
 gi|332004972|gb|AED92355.1| Exostosin family protein [Arabidopsis thaliana]
          Length = 511

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 89/355 (25%), Positives = 149/355 (41%), Gaps = 48/355 (13%)

Query: 40  LKVYVYELPSKYNKKLL----------------------LKDPRCLTHMFAAEIFM--HR 75
           +KVYVYE+P K+   LL                      L +   + +   A++      
Sbjct: 118 IKVYVYEMPKKFTFDLLWLFHNTYKETSNATSNGSPVHRLIEQHSIDYWLWADLISPESE 177

Query: 76  FLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRT 135
             L S VR    ++AD+FY P + T       L    +   + R A++ ++   P W R+
Sbjct: 178 RRLKSVVRVQKQQDADFFYVPFFTTISFF---LLEKQQCKALYREALKWVTDQ-PAWKRS 233

Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
           EG DH F + H +     ++  +   +  + LL        + +   V L +  I   PY
Sbjct: 234 EGRDHIFPIHHPWS----FKSVRKFVKNAIWLLPDMDSTGNWYKPGQVSLEKDLIL--PY 287

Query: 196 APPQKMQAHQIPPDTP--RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFD 253
            P   +   +   ++   R+  ++FRG           G   R    A     K+  + +
Sbjct: 288 VPNVDICDTKCLSESAPMRTTLLFFRGRL-----KRNAGGKIRAKLGAELSGIKDIIISE 342

Query: 254 ISTDH--PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
            +            M+R++FCLCP G  P S RL +A+V GCIPVI++D++  PF   + 
Sbjct: 343 GTAGEGGKLAAQRGMRRSLFCLCPAGDTPSSARLFDAIVSGCIPVIVSDELEFPFEGILD 402

Query: 312 WEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP 363
           ++++ + V+  D  +   L   L S+ P  +   Q  LA  S  R  L+  PAQP
Sbjct: 403 YKKVAVLVSSSDAIQPGWLVNHLRSLTPFQVKGLQNNLAQYS--RHFLYSSPAQP 455


>gi|297850568|ref|XP_002893165.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297339007|gb|EFH69424.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 462

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 97/392 (24%), Positives = 151/392 (38%), Gaps = 71/392 (18%)

Query: 40  LKVYVYELPSKYNKKLLL-------KDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADW 92
           LK+YVY+       K LL       K   CL   + +++ +H+ LL S  RT+  +EAD 
Sbjct: 89  LKIYVYDENEIDGLKELLYGRDGSVKTTACLKGQWGSQVKIHKLLLESKFRTIKKDEADL 148

Query: 93  FYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
           F+ P Y  C     GL     + + +      + S  PY+ R+ G DH FV P   GA  
Sbjct: 149 FFVPAYVKCVRMLGGL-----NDKEINQTYVKVLSQMPYFRRSGGRDHIFVFPSGAGAHL 203

Query: 153 HYQEEKAIERGIL--PLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIP--- 207
                  I R I+  P   R        +++    N     I P      M  +  P   
Sbjct: 204 FRSWSTFINRSIILTPEADRTD------KKDTTAFNTWKDIIIPGNVDDAMTKNGQPDVQ 257

Query: 208 --PDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP------ 259
             P + R     + G              A+G    +        L D+S  +P      
Sbjct: 258 PLPLSKRKYLANYLG-------------RAQGKAGRL-------KLIDLSKQYPDKLECP 297

Query: 260 ------------TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFA 307
                       TTY+E ++ A FCL P G + W+ R  E+    C+PV+++D   LPF 
Sbjct: 298 DLKFSGTEKFGRTTYFEHLRNAKFCLAPRGESSWTLRFYESFFVECVPVLLSDHAELPFQ 357

Query: 308 DAIPWEEIGMFVAEEDV-PKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQP-AQPGD 365
           + I + ++ +      +  +    L SI  + I   + ++A     R +    P + P  
Sbjct: 358 NVIDYAQVSIKWPSTRIGAEFLDYLASISDKDI---EGMIARGREIRCLFVYGPDSAPCS 414

Query: 366 AFHQILNGLARKLPH---DKSTYLKPGGKFLN 394
           A   IL  L RK+ H      T+    G F+N
Sbjct: 415 AVKGILWELQRKVRHFQQSTETFWLHNGSFVN 446


>gi|357474287|ref|XP_003607428.1| Exostosin family protein-like protein [Medicago truncatula]
 gi|355508483|gb|AES89625.1| Exostosin family protein-like protein [Medicago truncatula]
          Length = 502

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 106/360 (29%), Positives = 153/360 (42%), Gaps = 58/360 (16%)

Query: 40  LKVYVYELPSKYNKKLLL---KDPRCLTHMFAAEIFMHRF-------------------- 76
           LKVYVY +P+K+   LLL      R  +++ +    +HR                     
Sbjct: 112 LKVYVYNMPNKFTYDLLLLFRNTYRDTSNLTSNGSPVHRLIEQHSIDYWLWADLIAPESE 171

Query: 77  -LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRT 135
            LL S VR    EEAD+FY P + T       L    +   + R A++ I+ + P W R+
Sbjct: 172 RLLKSVVRVHRQEEADFFYIPFFTTISFF---LLEKQQCKALYREALKWIT-DQPAWKRS 227

Query: 136 EGADHFFVVPH--DFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIP 193
            G DH   V H   F     Y + KAI   +LP +         GQ   V L E  + +P
Sbjct: 228 GGRDHILPVHHPWSFKTVRRYVK-KAI--WLLPDMDSTGNWYKPGQ---VYL-EKDLILP 280

Query: 194 PYAPPQKMQAHQIPPDTP-RSIFVYFRGLFYDVNNDPEGGYYA------RGARAAVWENF 246
             A      A  +    P R+  ++FRG    +  +  G   +      RGA   V E  
Sbjct: 281 YVANVDFCDATCLSEINPKRNTLLFFRG---RLKRNAGGKIRSKLVDQLRGADGVVIEEG 337

Query: 247 KNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 306
            +                 M++++FCL P G  P S RL +A+V GCIPVI++D++ LPF
Sbjct: 338 TSGE------GGKEAAQNGMRKSLFCLNPAGDTPSSARLFDAIVSGCIPVIVSDELELPF 391

Query: 307 ADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP 363
              + + +I +FV+  D  K   L   L  I    I   Q+ LA  S  R  L+  PAQP
Sbjct: 392 EGILDYRKIALFVSSNDALKPSWLLKYLKDIRSAHIKEMQQNLAKYS--RHFLYSSPAQP 449


>gi|326522384|dbj|BAK07654.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 496

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 94/355 (26%), Positives = 151/355 (42%), Gaps = 48/355 (13%)

Query: 40  LKVYVYELPSKYNKKLL--LKDP-RCLTHMFAAEIFMHRF-------------------- 76
           L+VYVYE+P K+   LL   +D  R  T++ +    +HR                     
Sbjct: 110 LRVYVYEMPRKFTYDLLRLFRDSYRDTTNLTSNGSPVHRLIEQHSIDYWLWADLIAPESQ 169

Query: 77  -LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRT 135
            LL + +R    EEAD FY P + T       L    +   + R A++ ++   P W R+
Sbjct: 170 RLLKNVIRVERQEEADIFYVPFFTTISYF---LLEKQECKALYREALKWVTDQ-PAWQRS 225

Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
           EG DH   V H +     ++  +   +  + LL        + +   V L +  I   PY
Sbjct: 226 EGRDHVIPVHHPWS----FKSVRRFVKKAIWLLPDMDSTGNWYKPGQVYLEKDVIL--PY 279

Query: 196 APPQKMQAHQIPPDT--PRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFD 253
            P   +  ++   +T   RS  ++FRG           G   R       +N ++  + +
Sbjct: 280 VPNVDLCDYKCVSETQSKRSTLLFFRGRL-----KRNAGGKIRSKLVTELQNIEDIIIEE 334

Query: 254 IST--DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
            S            M++++FCL P G  P S RL +A+V GCIPVII+D++ LPF   + 
Sbjct: 335 GSAGAKGKVAALTGMRKSLFCLNPAGDTPSSARLFDAIVSGCIPVIISDELELPFEGILD 394

Query: 312 WEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP 363
           + +I +FV+  D  +   L   L  +  + +   Q  L   S  R  L+  PAQP
Sbjct: 395 YSKIALFVSSTDAVQPGWLVKYLRGVDGKRVREMQSNLLKYS--RHFLYSSPAQP 447


>gi|224133768|ref|XP_002321656.1| predicted protein [Populus trichocarpa]
 gi|222868652|gb|EEF05783.1| predicted protein [Populus trichocarpa]
          Length = 407

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 94/366 (25%), Positives = 156/366 (42%), Gaps = 57/366 (15%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
           LKV+VY  P    K       + L   +A+E +    L +    T NP+EA  F+ P+  
Sbjct: 65  LKVFVY--PGGNPKTCYHSIDKKLKSNYASEHYFFMNLRNGSFLTENPDEAHLFFIPL-- 120

Query: 100 TCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFG----ACFHYQ 155
           +C        LP     ++++ ++ ++  +PYWNRT GADHFFV  H  G    A F + 
Sbjct: 121 SCQPMEDQDALPRYKEMVIQNYVRALTIKYPYWNRTLGADHFFVSCHGIGNRATAAFPFL 180

Query: 156 EEKAIERGILP----------------LLQRATLVQTFGQRNHVCLNEGSITIPPYAP-P 198
            + AI     P                +L+ +   +  G  N   +    I + P    P
Sbjct: 181 LKNAIRLVCSPSYDSNYIPHKDVSLPQILELSFPPEGDGMWNDSTMESLPIQLSPVETHP 240

Query: 199 QKMQ----AHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDI 254
            + +    A     +  +++ V+++GL      + E  +     RA V + F+       
Sbjct: 241 SRTKLCFWAGSPNSEVRKNLRVHYKGL-----EEFEIHFVENVKRALVLDTFQ------- 288

Query: 255 STDHPTTYYEDMQRAIFCLCPLGWAPWSPR-LVEAVVFGCIPVIIADDIVLPFADAIPWE 313
                    +++ R+ FC+CP G        L E++ FGC+PVI++D   LPF D + W 
Sbjct: 289 ---------KEIHRSKFCICPRGKTQVGGVCLAESMAFGCVPVIMSDYYDLPFNDILDWN 339

Query: 314 EIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP--GDAFHQIL 371
              + + E DVP +  IL  I PE +  K R      +K +  F    +P   D FH ++
Sbjct: 340 AFSVILKEHDVPIMGEILKGI-PEDMFEKMR---QNVLKVSKYFKWHFRPVKYDEFHMVM 395

Query: 372 NGLARK 377
             L ++
Sbjct: 396 YELWKR 401


>gi|307106650|gb|EFN54895.1| hypothetical protein CHLNCDRAFT_135016 [Chlorella variabilis]
          Length = 584

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 91/345 (26%), Positives = 147/345 (42%), Gaps = 35/345 (10%)

Query: 68  AAEIFMHRFLLSSPVRTLNPEEADWFYTPI----YPTCDLTPTGLPLPFKSPRMM--RSA 121
           A + F+ +FLLS PVRT +P EA+ FY P+    Y      P+  P       MM  ++ 
Sbjct: 247 AYQQFLEQFLLS-PVRTEDPSEANLFYIPMLLYGYSGTPGGPSRAPQVDSLCNMMPGQAH 305

Query: 122 IQL----ISSNWPYWNRTEGADHFFVVPHDFGACFHY----QEEKAIERGILPLLQRATL 173
           I L    I+  WPYWNRT G DHF+  P D GAC+H     Q  K    G+        L
Sbjct: 306 IDLVLDQIAHKWPYWNRTRGRDHFYWAPADRGACYHKGLAEQAIKVSHFGLHATNNSIDL 365

Query: 174 VQTFGQR----NHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRG----LFYDV 225
              +       +H C +     + P  P +K+ A  +       +    +G     ++  
Sbjct: 366 GDLYSHNQMSPDHGCYHPLRDVVAP--PFEKLAASWLNTTLRLGLDGNIKGKNATFYFSG 423

Query: 226 NNDPEGGYYARGARAAVWENFK--NNPLFDISTD--HPTTYYEDMQRAIFCLCPLGWAPW 281
           N       Y+ G R  +    K  ++P F           Y + ++ + FCL P G   +
Sbjct: 424 NVQGINLMYSGGTRQKLQALIKQWDDPEFGFVEGRLQEGAYEQRIRESRFCLAPYGHG-Y 482

Query: 282 SPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILR 341
             RL + +  G IPVI+ + +  P  D +P+E   + +  +D+P+L  IL  I       
Sbjct: 483 GMRLGQCIFAGSIPVIVQEHVFQPLEDVLPYEAFSIRLTNDDLPQLREILRGITEA---- 538

Query: 342 KQRLLANPSMKRAMLFP-QPAQPGDAFHQILNGLARKLPHDKSTY 385
           + R L    ++ ++      +  G AF   ++ L R+  + KS +
Sbjct: 539 QYRELMTGLLRYSLALSWDTSLGGTAFDYTISALRRRYMNLKSLH 583


>gi|302835858|ref|XP_002949490.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
 gi|300265317|gb|EFJ49509.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
          Length = 499

 Score = 95.5 bits (236), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 73/302 (24%), Positives = 125/302 (41%), Gaps = 40/302 (13%)

Query: 70  EIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNW 129
            ++  + LL S  R  + + ADW++ P+              F     +R A+  I ++ 
Sbjct: 93  HVYFWQRLLGSGARVADGDLADWYFIPVRQR----------SFSDSWFLREALSYIRTHH 142

Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQ--RATLVQTFGQRNHVCLNE 187
           P+WNRTEG  H  +   D+G     ++ + +   +  L     +T      +       E
Sbjct: 143 PWWNRTEGHRHMVLHTGDWGLGEVAKDVRQMSLNVTWLTHWGLSTDRPNIQRWTRAFRPE 202

Query: 188 GSITIPPYAPP--------QKMQAHQIPPDTPRSI-------FVYFRG-LFYDVN----- 226
             + IP Y  P         +   H +   + R+         ++F G + +D       
Sbjct: 203 RDVVIPVYISPGHFVHFGINRSPLHPVTAASRRTAARPRNESLLFFAGRICHDAKRPNPD 262

Query: 227 -----NDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPW 281
                 D    +Y  G R   + +  N   F +    P  Y   M R++FCL P G A  
Sbjct: 263 TFPACGDDTAEWYGGGVREKFFVSHWNRSGFHVVRSEPR-YSHYMSRSVFCLAPPG-AGH 320

Query: 282 SPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILR 341
             R ++A+  GC+PV +AD +  PF  A+ WEE G+ +AE+D+P+   +L  +  E +  
Sbjct: 321 GQRQIQALFMGCVPVTVADGVYEPFEPALSWEEWGLRIAEQDIPRAHELLGGLTREQLAE 380

Query: 342 KQ 343
           KQ
Sbjct: 381 KQ 382


>gi|18394994|ref|NP_564141.1| exostosin-like protein [Arabidopsis thaliana]
 gi|332191985|gb|AEE30106.1| exostosin-like protein [Arabidopsis thaliana]
          Length = 462

 Score = 95.1 bits (235), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 97/396 (24%), Positives = 151/396 (38%), Gaps = 79/396 (19%)

Query: 40  LKVYVYELPSKYNKKLLL-------KDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADW 92
           LK+YVY+       K LL       K   CL   + +++ +H+ LL S  RT+  +EAD 
Sbjct: 89  LKIYVYDENEIDGLKELLYGRDGSVKTTACLKGQWGSQVKIHKLLLESKFRTIKKDEADL 148

Query: 93  FYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
           F+ P Y  C     GL     + + +      + S  PY+ R+ G DH FV P   GA  
Sbjct: 149 FFVPAYVKCVRMLGGL-----NDKEINQTYVKVLSQMPYFRRSGGRDHIFVFPSGAGAHL 203

Query: 153 HYQEEKAIERGIL--PLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIP--- 207
                  I R I+  P   R        +++    N     I P      M  +  P   
Sbjct: 204 FRSWSTFINRSIILTPEADRTD------KKDTTAFNSWKDIIIPGNVDDAMTKNGQPDVQ 257

Query: 208 --PDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP------ 259
             P + R     + G              A+G    +        L D+S   P      
Sbjct: 258 PLPLSKRKYLANYLG-------------RAQGKAGRL-------KLIDLSKQFPDKLECP 297

Query: 260 ------------TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFA 307
                       TTY+E ++ A FCL P G + W+ R  E+    C+PV+++D   LPF 
Sbjct: 298 DLKFSGTEKFGRTTYFEHLRNAKFCLAPRGESSWTLRFYESFFVECVPVLLSDHAELPFQ 357

Query: 308 DAIPWEEIGM-----FVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQP-A 361
           + I + ++ +      +  E +  L +I       +I R +++       R +    P +
Sbjct: 358 NVIDYAQVSIKWPSTRIGSEFLDYLASISDRDIEGMIARGRKI-------RCLFVYGPDS 410

Query: 362 QPGDAFHQILNGLARKLPH---DKSTYLKPGGKFLN 394
            P  A   IL  L RK+ H      T+    G F+N
Sbjct: 411 APCSAVKGILWELQRKVRHFQQSTETFWLHNGSFVN 446


>gi|359481095|ref|XP_002265438.2| PREDICTED: probable glycosyltransferase At5g03795-like [Vitis
           vinifera]
          Length = 336

 Score = 95.1 bits (235), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 85/314 (27%), Positives = 136/314 (43%), Gaps = 35/314 (11%)

Query: 67  FAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLIS 126
           +A+E +  + L+ S   T +P +AD F+ P +    L              +R  I  IS
Sbjct: 33  YASESYFKKVLMKSHFITKDPSKADLFFLP-FSIARLRHDPRVGVGGIQDFIRDYIFNIS 91

Query: 127 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLN 186
            N+PYWN+T GADHF+V  H  G       EKA E        +   +Q     ++    
Sbjct: 92  QNYPYWNQTGGADHFYVACHSIGRS---AMEKADE-------VKLNAIQVVCSSSYFL-- 139

Query: 187 EGSITIPPYAPPQKMQAHQIPPD---TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVW 243
            G I     + PQ       PPD   + R    +F G      N P         R  + 
Sbjct: 140 SGYIAHKDASLPQIWPRQGDPPDLALSERKKLAFFAGSI----NSP--------VRERLL 187

Query: 244 ENFKNNPLFDISTDHPTT-YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 302
           + ++N+    +     TT Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+  
Sbjct: 188 QVWRNDSEISVHFGRLTTPYADELLGSKFCLHVKGFEINTARIADSLYYGCVPVIIANHY 247

Query: 303 VLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQ 362
            LPFAD + W+   + VA  D+P L  +L  I     L +  +L +  +K    F     
Sbjct: 248 DLPFADILNWKSFSIVVATLDIPLLKQVLKGIS----LNEYLMLQSNVLKVRNHFQWHVS 303

Query: 363 P--GDAFHQILNGL 374
           P   DAF+ ++  L
Sbjct: 304 PVDYDAFYMVMYEL 317


>gi|21537341|gb|AAM61682.1| unknown [Arabidopsis thaliana]
          Length = 462

 Score = 95.1 bits (235), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 97/396 (24%), Positives = 151/396 (38%), Gaps = 79/396 (19%)

Query: 40  LKVYVYELPSKYNKKLLL-------KDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADW 92
           LK+YVY+       K LL       K   CL   + +++ +H+ LL S  RT+  +EAD 
Sbjct: 89  LKIYVYDENEIDGLKELLYGRDGSVKTTACLKGQWGSQVKIHKLLLESKFRTIKKDEADL 148

Query: 93  FYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
           F+ P Y  C     GL     + + +      + S  PY+ R+ G DH FV P   GA  
Sbjct: 149 FFVPAYVKCVRMLGGL-----NDKEINQTYVKVLSQMPYFRRSGGRDHIFVFPSGAGAHL 203

Query: 153 HYQEEKAIERGIL--PLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIP--- 207
                  I R I+  P   R        +++    N     I P      M  +  P   
Sbjct: 204 FRSWSTFINRSIILTPEADRTD------KKDTTAFNSWKDIIIPGNVDDAMTKNGQPDVQ 257

Query: 208 --PDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP------ 259
             P + R     + G              A+G    +        L D+S   P      
Sbjct: 258 PLPLSKRKYLANYLG-------------RAQGKAGRL-------KLIDLSKQFPDKLECP 297

Query: 260 ------------TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFA 307
                       TTY+E ++ A FCL P G + W+ R  E+    C+PV+++D   LPF 
Sbjct: 298 DLKFSGTEKFGRTTYFEHLRNAKFCLAPRGESSWTLRFYESFFVECVPVLLSDHAELPFQ 357

Query: 308 DAIPWEEIGM-----FVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQP-A 361
           + I + ++ +      +  E +  L +I       +I R +++       R +    P +
Sbjct: 358 NVIDYAQVSIKWPSTRIGSEFLDYLASISDRDIEGMIARGRKI-------RCLFVYGPDS 410

Query: 362 QPGDAFHQILNGLARKLPH---DKSTYLKPGGKFLN 394
            P  A   IL  L RK+ H      T+    G F+N
Sbjct: 411 APCSAVKGILWELQRKVRHFQQSTETFWLHNGSFVN 446


>gi|18650643|gb|AAL75891.1| At1g21480/F24J8_23 [Arabidopsis thaliana]
          Length = 462

 Score = 95.1 bits (235), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 97/396 (24%), Positives = 151/396 (38%), Gaps = 79/396 (19%)

Query: 40  LKVYVYELPSKYNKKLLL-------KDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADW 92
           LK+YVY+       K LL       K   CL   + +++ +H+ LL S  RT+  +EAD 
Sbjct: 89  LKIYVYDENEIDGLKELLYGRDGSVKTTACLKGQWGSQVKIHKLLLESKFRTIKKDEADL 148

Query: 93  FYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
           F+ P Y  C     GL     + + +      + S  PY+ R+ G DH FV P   GA  
Sbjct: 149 FFVPAYVKCVRMLGGL-----NDKEINQTYVKVLSQMPYFRRSGGRDHIFVFPSGAGAHL 203

Query: 153 HYQEEKAIERGIL--PLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIP--- 207
                  I R I+  P   R        +++    N     I P      M  +  P   
Sbjct: 204 FRSWSTFINRSIILTPEADRTD------KKDTTAFNSWKDIIIPGNVDDAMTKNGQPDVQ 257

Query: 208 --PDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP------ 259
             P + R     + G              A+G    +        L D+S   P      
Sbjct: 258 PLPLSKRKYLANYLG-------------RAQGKAGRL-------KLIDLSKQFPDKLECP 297

Query: 260 ------------TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFA 307
                       TTY+E ++ A FCL P G + W+ R  E+    C+PV+++D   LPF 
Sbjct: 298 DLKFSGTEKFGRTTYFEHLRNAKFCLAPRGESSWTLRFYESFFVECVPVLLSDHAELPFQ 357

Query: 308 DAIPWEEIGM-----FVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQP-A 361
           + I + ++ +      +  E +  L +I       +I R +++       R +    P +
Sbjct: 358 NVIDYAQVSIKWPSTRIGSEFLDYLASISDRDIEGMIARGRKI-------RCLFVYGPDS 410

Query: 362 QPGDAFHQILNGLARKLPH---DKSTYLKPGGKFLN 394
            P  A   IL  L RK+ H      T+    G F+N
Sbjct: 411 APCSAVKGILWELQRKVRHFQQSTETFWLHNGSFVN 446


>gi|168059172|ref|XP_001781578.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666988|gb|EDQ53629.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 344

 Score = 95.1 bits (235), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 82/342 (23%), Positives = 145/342 (42%), Gaps = 26/342 (7%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCL-THMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPI 97
           KL+++VY    K    ++ +    + +  +A+E F  + LL+S   T    EAD+F+ P+
Sbjct: 17  KLRIFVYPHDRKDPFHMIFESGNKVPSGNYASEEFFQQSLLTSTFLTKTASEADFFFMPV 76

Query: 98  YPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEE 157
             T       + +         + I  + S W YWNR+ GADHF++  H        +  
Sbjct: 77  SITKARMDKRINVG-GLQSFCANYITDVRSQWSYWNRSNGADHFYLSCHSIARNAMDRVP 135

Query: 158 KAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVY 217
              +  I  L   +  + ++       +     ++P   P    +  ++   T R    +
Sbjct: 136 DVRQNAIQLLCPASYFLPSY-------ITHKDASVPQIWPRLGKEPEEVRTITQRKRLAF 188

Query: 218 FRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLG 277
           F G      N P      R      W N  ++ +       P  Y E +    FCL   G
Sbjct: 189 FAGAL----NSP-----VRKDLERTWAN--DSKILVHKGRVPYPYSEALLTTKFCLHAKG 237

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
           +   + RL +A+ +GC+PV+IA+   LPF D + W +  + V+  D+P L   L ++  E
Sbjct: 238 FEVNTARLGDAMYYGCVPVVIANYYDLPFQDILDWTKFSIVVSSLDIPLLKKTLEAVTDE 297

Query: 338 --VILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARK 377
               L +Q LLA    ++   +  P +  DAFH ++  L ++
Sbjct: 298 QYAELHRQVLLA----RKHFQWHAPPEEYDAFHTVMYELWKR 335


>gi|255558009|ref|XP_002520033.1| catalytic, putative [Ricinus communis]
 gi|223540797|gb|EEF42357.1| catalytic, putative [Ricinus communis]
          Length = 507

 Score = 95.1 bits (235), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 94/359 (26%), Positives = 147/359 (40%), Gaps = 56/359 (15%)

Query: 40  LKVYVYELPSKYNKKLL---LKDPRCLTHMFAAEIFMHRF-------------------- 76
           ++VYVYE+P+K+   LL       R   ++ +    +HR                     
Sbjct: 119 IRVYVYEMPNKFTYDLLWLFRNTYRDTVNLTSNGSPVHRLIEQHSIDYWLWADLIAPETE 178

Query: 77  -LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRT 135
            LL S VR    EEAD FY P + T       L    +   + R A++ ++   P W R+
Sbjct: 179 RLLKSVVRVYRQEEADLFYIPFFTTISFF---LLEKQQCKALYREALKWVTDQ-PAWKRS 234

Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
            G DH   V H +     ++  +   +  + LL        + +   V L +  I   PY
Sbjct: 235 GGRDHILPVHHPWS----FKSVRRYVKNAIWLLPDMDSTGNWYKPGQVFLEKDLIL--PY 288

Query: 196 APPQKMQAHQIPPD--TPRSIFVYFRGLFYDVNNDPEGGYYAR------GARAAVWENFK 247
            P   +   +   +  + R+  ++FRG    +  +  G   A+      GA   V E   
Sbjct: 289 VPNVDLCDAKCASENESKRTTLLFFRG---RLKRNAGGKIRAKLVAELSGAEGVVVEEGT 345

Query: 248 NNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFA 307
                             M+++IFCL P G  P S RL +A+V GCIPVI++D++ LPF 
Sbjct: 346 AGE------GGKAAAQTGMRKSIFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELELPFE 399

Query: 308 DAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP 363
             + + +I +FV+  D  +   L   L  + P      QR L   S  R  L+  PAQP
Sbjct: 400 GILDYRKIAVFVSSSDAIQPGWLIKFLKDVSPAQTREMQRNLVKYS--RHFLYSSPAQP 456


>gi|47497631|dbj|BAD19700.1| exostosin-like protein [Oryza sativa Japonica Group]
          Length = 345

 Score = 94.7 bits (234), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 77/300 (25%), Positives = 133/300 (44%), Gaps = 24/300 (8%)

Query: 86  NPEEADWFYTPIYPTCDL-------TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGA 138
            P++A  F  PI   C+L         TG   P +   ++   +++++  +PYWNR+ GA
Sbjct: 48  RPDDAHAFLLPI-SVCNLVHYVYRLNATGDLAPLRG--LVADYVRVVAERYPYWNRSRGA 104

Query: 139 DHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPP 198
           DH  V  HD+ A       + +    + +L  A   + F  R    L E ++       P
Sbjct: 105 DHVIVSCHDW-APMVTSAHRQLYGNAIRVLCNANTSEGFRPRKDATLPEVNLADGVLRRP 163

Query: 199 QKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDH 258
                  +PP+  R+   +F G  +    +    ++  G +     +   +    +    
Sbjct: 164 TA----GLPPEN-RTTLAFFAGGRHGHIRESLLRHWLIGNKGGAAADGDGDGDMRVHEYL 218

Query: 259 PT--TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG 316
           P    Y+  M  A FCLCP G+   SPR+VE+V  GC+PVII++    PF D + W ++ 
Sbjct: 219 PAGEDYHAQMAAARFCLCPSGFEVASPRVVESVFAGCVPVIISEGYPPPFGDVLDWGKMS 278

Query: 317 MFVAEEDVPKLDTILTSIPPE--VILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           + V    +P+L  IL  +      +LR + L A    +R  +  +PA+  D  H +L+ +
Sbjct: 279 VAVPAARIPELRAILRRVSERRYRVLRARVLQA----QRHFVLHRPARRFDMIHMVLHSI 334


>gi|359485862|ref|XP_002264076.2| PREDICTED: probable glycosyltransferase At5g03795-like [Vitis
           vinifera]
          Length = 484

 Score = 94.7 bits (234), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 83/342 (24%), Positives = 142/342 (41%), Gaps = 32/342 (9%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
            K+YVYE      +  +  +  C +       F+H     S  RT +P++A  ++ P   
Sbjct: 157 FKIYVYE----EGEPPMFHNGPCKSIYSTEGRFIHEMEKGSVYRTTDPDQALLYFLPFSV 212

Query: 100 TCDLTPTGLPLPFKSPRMMRSAI---QLISSNWPYWNRTEGADHFFVVPHDFG----ACF 152
              +    +P   +   + ++ I    LIS N P+WNR+ GADHF +  HD+G       
Sbjct: 213 VMMVQYLYVPDSHEIHAIEKTVIDYINLISHNHPFWNRSLGADHFMLSCHDWGPRASTSV 272

Query: 153 HYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPR 212
            Y    +I      +L  A   + F     V   E  +     + P       + P + R
Sbjct: 273 PYLYNNSIR-----VLCNANTSEGFNPSKDVSFPEIHLRTGEMSGP----LGGLSP-SRR 322

Query: 213 SIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFC 272
            I  +F G  +          + R      W++   +        +  +Y   ++++ FC
Sbjct: 323 PILGFFAGRLHG---------HIRYLLLEQWKDKDKDLQVYDQLPNGLSYDSMLKKSRFC 373

Query: 273 LCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILT 332
           LCP G+   SPR+VEA+   C+PV+I+D+ V PF D + W+   + V   D+  +  IL 
Sbjct: 374 LCPSGYEVASPRVVEAIYAECVPVLISDNYVPPFNDVLNWKSFAVQVQVRDIANIKRILM 433

Query: 333 SIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
            I     LR  R +    ++R  +     Q  D FH  ++ +
Sbjct: 434 GISQTQYLRMYRRVKQ--VQRHFMVNAAPQRFDVFHMTIHSI 473


>gi|159470791|ref|XP_001693540.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158283043|gb|EDP08794.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 510

 Score = 94.7 bits (234), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 100/383 (26%), Positives = 152/383 (39%), Gaps = 71/383 (18%)

Query: 41  KVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPT 100
           +VY+Y+LP +        D R     +    F+   L ++  R  +P  ADWFY P+   
Sbjct: 121 RVYIYDLPPELTT--WRNDDR--LDRWTTRHFLE-MLTATGARVGDPAAADWFYLPVRLR 175

Query: 101 CDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAI 160
                           ++R A++ + +  P++N T G DHF +   D G           
Sbjct: 176 SS----------SDGHVLRRALEYVQAAQPWFNATGGKDHFVLAVGDMG-------RLES 218

Query: 161 ERGILPLLQRATLVQTFG--------------QRNHVCLNEGSITIPPYAPPQKMQAHQI 206
           ERG  PL    T V  +G              + +H   N   I +P Y   +K+Q   I
Sbjct: 219 ERG--PLSANVTFVSHWGLYRSKAEQLQSPHWRASH--RNATDIVLPVYLTLRKLQKFGI 274

Query: 207 -------------PPDTPRS---IFVYFRGLFYDVN--------NDPEGGYYARGARAAV 242
                        PPD       +F +   +  D +        N P+   Y+   R AV
Sbjct: 275 LGSRHHPKFATVAPPDVRERNGPLFWFAGRVCQDSSPPRTDVWPNCPKAMGYSAMTRQAV 334

Query: 243 WENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 302
           + +  N   F +       Y + M  A FC  P+G      R  +A + GC+PV+I D +
Sbjct: 335 YFHHWNRTGFAVLRGD-KQYAKHMLTAKFCFGPMG-GGHGQRQFQAALAGCVPVVIGDGV 392

Query: 303 VLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQ 362
           +  +   + W + G+ VAE D+P+L TIL +I PE   RK R L   +   A      A 
Sbjct: 393 LEAWEPYLDWNDFGVRVAEADIPRLHTILGAIGPEEYARKVRSLRCAAQHMAFSSVTGAY 452

Query: 363 PG-----DAFHQILNGLARKLPH 380
            G     DAF  +L  LA +  H
Sbjct: 453 MGESGRFDAFETLLAVLAARARH 475


>gi|159489402|ref|XP_001702686.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158280708|gb|EDP06465.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 615

 Score = 94.7 bits (234), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 83/338 (24%), Positives = 135/338 (39%), Gaps = 52/338 (15%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
            +K+YVYELP        +K      H+    +F  R L+S+ +RT+N +EAD+F+ P+ 
Sbjct: 178 SIKIYVYELPPNVTSWFNIKRLDRPLHL----LFWQR-LMSAGLRTVNGDEADYFFIPLN 232

Query: 99  PTCDLTPTGLP--LPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQE 156
               + P      LP+            I + WPYW+R  G  H  +   D G    ++ 
Sbjct: 233 TRTLMAPEQAAWILPY------------IRNTWPYWDRDNGHRHLIIHTGDMGL---HEL 277

Query: 157 EKAIERGILPLLQRATLVQTFGQRNH-------------------VCLNEGSITIPPYAP 197
              + R +   L   T +  +G   +                   V +      + P  P
Sbjct: 278 PLGLRRKMNETLSNITWLTHWGLHTYHPIGTWFPAHRPGKDIVIPVMITTPGFQLSPLNP 337

Query: 198 PQKMQAHQIPPDTPRSIFVYFRGLFYDVNN---------DPEGGYYARGARAAVWENFKN 248
               +A +      R    +F G                 P+   Y+   R  V+ +  N
Sbjct: 338 AVAEKAAKRGRPYTREQTFFFAGRICGDRKPPDPLTHECAPKRTDYSASVRQRVYFHHHN 397

Query: 249 NPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
              F + T   + Y +++    FCL P G      R V   + GCIPV I D +  PF  
Sbjct: 398 RTGFKVLTG-TSKYMQEITSHKFCLAPTGGG-HGKRQVLVALMGCIPVTITDGVYQPFEP 455

Query: 309 AIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLL 346
            +PW +  + VAE+D+P+L  +L ++PPE + + Q  L
Sbjct: 456 ELPWADFSVPVAEDDIPRLHEVLEALPPEQVEQMQSRL 493


>gi|296085068|emb|CBI28483.3| unnamed protein product [Vitis vinifera]
          Length = 416

 Score = 94.7 bits (234), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 83/342 (24%), Positives = 142/342 (41%), Gaps = 32/342 (9%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
            K+YVYE      +  +  +  C +       F+H     S  RT +P++A  ++ P   
Sbjct: 89  FKIYVYE----EGEPPMFHNGPCKSIYSTEGRFIHEMEKGSVYRTTDPDQALLYFLPFSV 144

Query: 100 TCDLTPTGLPLPFKSPRMMRSAI---QLISSNWPYWNRTEGADHFFVVPHDFG----ACF 152
              +    +P   +   + ++ I    LIS N P+WNR+ GADHF +  HD+G       
Sbjct: 145 VMMVQYLYVPDSHEIHAIEKTVIDYINLISHNHPFWNRSLGADHFMLSCHDWGPRASTSV 204

Query: 153 HYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPR 212
            Y    +I      +L  A   + F     V   E  +     + P       + P + R
Sbjct: 205 PYLYNNSIR-----VLCNANTSEGFNPSKDVSFPEIHLRTGEMSGPLG----GLSP-SRR 254

Query: 213 SIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFC 272
            I  +F G  +          + R      W++   +        +  +Y   ++++ FC
Sbjct: 255 PILGFFAGRLHG---------HIRYLLLEQWKDKDKDLQVYDQLPNGLSYDSMLKKSRFC 305

Query: 273 LCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILT 332
           LCP G+   SPR+VEA+   C+PV+I+D+ V PF D + W+   + V   D+  +  IL 
Sbjct: 306 LCPSGYEVASPRVVEAIYAECVPVLISDNYVPPFNDVLNWKSFAVQVQVRDIANIKRILM 365

Query: 333 SIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
            I     LR  R +    ++R  +     Q  D FH  ++ +
Sbjct: 366 GISQTQYLRMYRRVKQ--VQRHFMVNAAPQRFDVFHMTIHSI 405


>gi|125540270|gb|EAY86665.1| hypothetical protein OsI_08049 [Oryza sativa Indica Group]
          Length = 482

 Score = 94.7 bits (234), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 77/300 (25%), Positives = 133/300 (44%), Gaps = 24/300 (8%)

Query: 86  NPEEADWFYTPIYPTCDL-------TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGA 138
            P++A  F  PI   C+L         TG   P +   ++   +++++  +PYWNR+ GA
Sbjct: 185 RPDDAHAFLLPI-SVCNLVHYVYRLNATGDLAPLRG--LVADYVRVVAERYPYWNRSRGA 241

Query: 139 DHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPP 198
           DH  V  HD+ A       + +    + +L  A   + F  R    L E ++       P
Sbjct: 242 DHVIVSCHDW-APMVTSAHRQLYGNAIRVLCNANTSEGFRPRKDATLPEVNLADGVLRRP 300

Query: 199 QKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDH 258
                  +PP+  R+   +F G  +    +    ++  G +     +   +    +    
Sbjct: 301 TA----GLPPEN-RTTLAFFAGGRHGHIRESLLRHWLIGNKGGAAADGDGDGDMRVHEYL 355

Query: 259 PT--TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG 316
           P    Y+  M  A FCLCP G+   SPR+VE+V  GC+PVII++    PF D + W ++ 
Sbjct: 356 PAGEDYHAQMAAARFCLCPSGFEVASPRVVESVFAGCVPVIISEGYPPPFGDVLDWGKMS 415

Query: 317 MFVAEEDVPKLDTILTSIPPE--VILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           + V    +P+L  IL  +      +LR + L A    +R  +  +PA+  D  H +L+ +
Sbjct: 416 VAVPAARIPELRAILRRVSERRYRVLRARVLQA----QRHFVLHRPARRFDMIHMVLHSI 471


>gi|307103001|gb|EFN51266.1| hypothetical protein CHLNCDRAFT_141180 [Chlorella variabilis]
          Length = 404

 Score = 94.7 bits (234), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 102/379 (26%), Positives = 155/379 (40%), Gaps = 63/379 (16%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRF---LLSSPVRTLNPEEADWFYT 95
             K+Y+YEL ++    L L       H+  A    H+F   LL SPVRT +P EA+ F+ 
Sbjct: 56  NFKIYMYELSAQLAYDLDLASGPEEDHIHLAH---HKFIEQLLMSPVRTEDPSEANLFFV 112

Query: 96  PI----YPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGAC 151
           P     Y    L    L L              I+S++PYWNR++G DH F + +D GAC
Sbjct: 113 PALSWSYGGNALNAVHLDL----------VADHIASHYPYWNRSQGRDHIFWLTNDRGAC 162

Query: 152 FHYQEEKAIERGILPLLQRATLVQTFGQRN---HVCLNEGSITIPPYAPPQKMQAHQIPP 208
                       +    + A  +  FG       V    G+ T P  A    ++    PP
Sbjct: 163 -----------ALTGRTEAAIKLTHFGLNTINISVGWGPGAATNPENACYNPLRDVVAPP 211

Query: 209 --DTPRSIFVYFRGLFYD---------------VNNDPEGGYYARGARAAVWENFK--NN 249
             D  R +    R L  +               V+ND E   Y+   R  + E  K  N+
Sbjct: 212 FDDMARELMEVSRKLSVEDIIAAKTSLFFFSGAVSNDSE---YSGNTRQLLRELVKRWND 268

Query: 250 PLFDISTDHPT---TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 306
           P     T+  T    Y + ++ + FC    G+  +  RL+  V  G +P++I + +  P 
Sbjct: 269 PEIIFETEGDTGLGDYVKRLRASKFCPAVFGYG-FGMRLLTCVFSGSVPLVIQERVAQPL 327

Query: 307 ADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDA 366
            D +P+E   + +    +P L  IL SI  +   + QRL+      R     +PA  G A
Sbjct: 328 EDLLPYETFSLRLNNGHLPDLPRILRSITDQ---QYQRLVQGLVRYRDAFHWEPAAGGKA 384

Query: 367 FHQILNGLARKLPHDKSTY 385
           F   +  L R+  + KS Y
Sbjct: 385 FEYTIASLRRRHLNFKSLY 403


>gi|296089301|emb|CBI39073.3| unnamed protein product [Vitis vinifera]
          Length = 467

 Score = 94.7 bits (234), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 85/314 (27%), Positives = 136/314 (43%), Gaps = 35/314 (11%)

Query: 67  FAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLIS 126
           +A+E +  + L+ S   T +P +AD F+ P +    L              +R  I  IS
Sbjct: 152 YASESYFKKVLMKSHFITKDPSKADLFFLP-FSIARLRHDPRVGVGGIQDFIRDYIFNIS 210

Query: 127 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLN 186
            N+PYWN+T GADHF+V  H  G       EKA E        +   +Q     ++    
Sbjct: 211 QNYPYWNQTGGADHFYVACHSIGRS---AMEKADE-------VKLNAIQVVCSSSYFL-- 258

Query: 187 EGSITIPPYAPPQKMQAHQIPPD---TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVW 243
            G I     + PQ       PPD   + R    +F G      N P         R  + 
Sbjct: 259 SGYIAHKDASLPQIWPRQGDPPDLALSERKKLAFFAGSI----NSP--------VRERLL 306

Query: 244 ENFKNNPLFDISTDHPTT-YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 302
           + ++N+    +     TT Y +++  + FCL   G+   + R+ +++ +GC+PVIIA+  
Sbjct: 307 QVWRNDSEISVHFGRLTTPYADELLGSKFCLHVKGFEINTARIADSLYYGCVPVIIANHY 366

Query: 303 VLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQ 362
            LPFAD + W+   + VA  D+P L  +L  I     L +  +L +  +K    F     
Sbjct: 367 DLPFADILNWKSFSIVVATLDIPLLKQVLKGIS----LNEYLMLQSNVLKVRNHFQWHVS 422

Query: 363 P--GDAFHQILNGL 374
           P   DAF+ ++  L
Sbjct: 423 PVDYDAFYMVMYEL 436


>gi|60657600|gb|AAX33321.1| secondary cell wall-related glycosyltransferase family 47 [Populus
           tremula x Populus tremuloides]
          Length = 509

 Score = 94.4 bits (233), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 93/380 (24%), Positives = 152/380 (40%), Gaps = 62/380 (16%)

Query: 40  LKVYVYELPSKYNKKLLLKDPR-----------------CLTHMFAAEIFMHRFLLS--- 79
           LKV++Y+LPS+++  LL   P+                  L    + E ++   LL+   
Sbjct: 119 LKVFMYDLPSEFHFGLLDWKPQGGSVWPDLRAKVPAYPGGLNLQHSIEYWLTMDLLASEV 178

Query: 80  -------SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPF----KSPRMMRSAIQLISSN 128
                  S VR  N  EAD  + P + +          P     K+  +    ++ ++S 
Sbjct: 179 PGIPRAGSAVRVQNSSEADVIFVPFFSSISYNRYSKVNPHQKKSKNKSLEEKLVKFVTSQ 238

Query: 129 WPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG 188
              W R+ G DH  +  H     +            + L     ++  FG+ +    N G
Sbjct: 239 -KEWKRSGGRDHIILAHHPNSMLY----------ARMKLWTAMFILADFGRYSPNIANVG 287

Query: 189 SITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN 248
              I PY    K  A+       R   +YF+G  Y      +GG+    AR  ++   K+
Sbjct: 288 KDVIAPYKHVIKSYANDSSNFDSRPTLLYFQGAIY----RKDGGF----ARQELFYALKD 339

Query: 249 NP-----LFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 303
                     +  D  +   + M  + FCL   G  P S RL +A+   C+PVII+DDI 
Sbjct: 340 EKDVHFQFGSVQKDGVSKASQGMHSSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDDIE 399

Query: 304 LPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRK-QRLLANPSMKRAMLFPQ 359
           LP+ D + + +  +FV   D  +   L  ++ SI  +   R  QRL     ++    F  
Sbjct: 400 LPYEDVLDYSQFCIFVRTSDAVREKFLINLVRSIKKDEWTRMWQRL---KEVENFFEFQY 456

Query: 360 PAQPGDAFHQILNGLARKLP 379
           P++ GDA   I   +ARK+P
Sbjct: 457 PSKEGDAVQMIWQAVARKVP 476


>gi|218189647|gb|EEC72074.1| hypothetical protein OsI_05008 [Oryza sativa Indica Group]
          Length = 174

 Score = 94.4 bits (233), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 42/54 (77%), Positives = 46/54 (85%)

Query: 28  AGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP 81
            GDVLEDDP GKLKV+VYE+P KYN  LL KD RCL HMFAAEIFMH+FLLS+P
Sbjct: 110 GGDVLEDDPTGKLKVFVYEMPRKYNLNLLAKDSRCLQHMFAAEIFMHQFLLSAP 163


>gi|356544337|ref|XP_003540609.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase F8H-like
           [Glycine max]
          Length = 494

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 98/356 (27%), Positives = 148/356 (41%), Gaps = 50/356 (14%)

Query: 40  LKVYVYELPSKYNKKLL--LKDP-RCLTHMFAAEIFMHRF-------------------- 76
           L+VYVY++P K+   LL   K+  R  +++ +    +HR                     
Sbjct: 104 LRVYVYDMPPKFTHDLLWLFKNTYRDTSNLTSNGSPVHRLIEQHSIDYWLWADLIAPQSE 163

Query: 77  -LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRT 135
            LL+S VR    EEAD FY P + T       L    +   + R A++ I+   P W R+
Sbjct: 164 RLLTSVVRVHRQEEADLFYIPFFTTISFF---LMEKQQCKALYREALKWITDQ-PAWKRS 219

Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
            G DH   V H +     ++  +   +  + LL        + +   V L +  I   PY
Sbjct: 220 GGRDHILPVHHPWS----FKSVRRYVKNAIWLLPDMDSTGNWYKPGQVYLEKDLIL--PY 273

Query: 196 APPQKMQAHQIPPDT--PRSIFVYFRGLFYDVNND---PEGGYYARGARAAVWENFKNNP 250
            P   +   +   +T   RS  ++FRG            + G    GA   V E      
Sbjct: 274 VPNVDLCDAKCLSETNPKRSTLLFFRGRLKRNAGGKIRSKLGAELSGADGVVIEEGT--- 330

Query: 251 LFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAI 310
                          M++++FCL P G  P S RL +A+V GCIPVII+D++ LPF   +
Sbjct: 331 ---AGEGGKEAAQRGMRKSLFCLSPAGDTPSSARLFDAIVSGCIPVIISDELELPFEGIL 387

Query: 311 PWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP 363
            + +I +F++  D  K   L   L  I P  I   Q+ LA  S  R  L+  PA P
Sbjct: 388 DYRKIAVFISSNDAVKPGWLLKYLKGIRPAHIKEMQQNLAKYS--RHFLYSSPALP 441


>gi|224123938|ref|XP_002319201.1| predicted protein [Populus trichocarpa]
 gi|222857577|gb|EEE95124.1| predicted protein [Populus trichocarpa]
          Length = 450

 Score = 94.4 bits (233), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 93/380 (24%), Positives = 153/380 (40%), Gaps = 62/380 (16%)

Query: 40  LKVYVYELPSKYNKKLLLKDPR-----------------CLTHMFAAEIFMHRFLLS--- 79
           LKV++Y+LPS+++  LL   P+                  L    + E ++   LL+   
Sbjct: 80  LKVFMYDLPSEFHFGLLDWKPQGGSVWPDLRAKVPAYPGGLNLQHSIEYWLTMDLLASEI 139

Query: 80  -------SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPF----KSPRMMRSAIQLISSN 128
                  S VR  N  EAD  + P + +          P     K+  +    ++ ++S 
Sbjct: 140 PGIPRAGSAVRVQNSSEADVIFVPFFSSISYNRYSKVNPHQKKSKNKSLEEKLVKFVTSQ 199

Query: 129 WPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG 188
              W R+ G DH  +  H            ++    + L     ++  FG+ +    N G
Sbjct: 200 -KEWKRSGGRDHIILAHH----------PNSMLYARMKLWTAMFILADFGRYSPNIANVG 248

Query: 189 SITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN 248
              I PY    K  A+       R   +YF+G  Y      +GG+    AR  ++   K+
Sbjct: 249 KDVIAPYKHVIKSYANDSSNFDSRPTLLYFQGAIY----RKDGGF----ARQELFYALKD 300

Query: 249 NP-----LFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 303
                     +  D  +   + M  + FCL   G  P S RL +A+   C+PVII+DDI 
Sbjct: 301 EKDVHFQFGSVQKDGVSKASQGMHSSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDDIE 360

Query: 304 LPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRK-QRLLANPSMKRAMLFPQ 359
           LP+ D + + +  +FV   D  +   L  ++ SI  +   R  QRL     ++    F  
Sbjct: 361 LPYEDVLDYSQFCIFVRTSDAVREKFLINLVRSIKKDEWTRMWQRL---KEVENFFEFQY 417

Query: 360 PAQPGDAFHQILNGLARKLP 379
           P++ GDA   I   +ARK+P
Sbjct: 418 PSKEGDAVQMIWQAVARKVP 437


>gi|302812737|ref|XP_002988055.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
 gi|300144161|gb|EFJ10847.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
          Length = 459

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 99/397 (24%), Positives = 159/397 (40%), Gaps = 66/397 (16%)

Query: 34  DDPVGKLKVYVYELPSKY----------------NKKLLLKDPRCLTHMFAAEIFMHRFL 77
           DD V  LKVY+Y+LP+++                N   L + P  L    + E ++   L
Sbjct: 79  DDRV--LKVYMYDLPAEFHFGMLDAAISGGSWPRNISSLPRYPGGLYQQHSPEYWLTADL 136

Query: 78  LSSP-----------VRTLNPEEADWFYTPIYPTCDLTP--------TGLPLPFKSPRMM 118
           LSS            VR  +P  AD F+ P + +              G     ++ R+ 
Sbjct: 137 LSSADPSSRKSPCSAVRVADPATADIFFVPFFSSLSYNRYCRTGHRFQGGRGCVENDRLE 196

Query: 119 RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG 178
           +  ++ +      W R  G DH  V+        H+     + R +L   +   +V  FG
Sbjct: 197 KRLVEFLRGQ-ELWRRNGGVDHVIVM--------HHPNSLMVARSLLK--EAMFVVADFG 245

Query: 179 QRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGA 238
           + +    N     + PY       A        R   ++F+G         EGG      
Sbjct: 246 RFSRAVANMRKDIVAPYKHVIPSFARDATTFESRETLLFFQGAIVR----KEGGII---- 297

Query: 239 RAAVWENFKNNPLFDIST-----DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGC 293
           R  ++E  K++P     T     D   +    M+ A FCL   G  P S RL +A+   C
Sbjct: 298 RQKLYEILKDSPGVHFVTGNTQKDGIRSATAGMRNAKFCLHLAGDTPSSNRLFDAIASHC 357

Query: 294 IPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTI---LTSIPPEVILRKQRLLANPS 350
           +PVII+D+I LPF D + + +  +FV  +   +   +   L  I  +   RK  +L   S
Sbjct: 358 VPVIISDEIELPFEDELDYSQFCVFVESDKALRKGFVVRALERIGRDEWTRKWAML--KS 415

Query: 351 MKRAMLFPQPAQPGDAFHQILNGLARKLPHDKSTYLK 387
           ++R   +  P+ P DA H    G+A+++P  KST  K
Sbjct: 416 VERHFEYQHPSLPEDAVHMTWRGIAKRVPALKSTAHK 452


>gi|2245029|emb|CAB10449.1| limonene cyclase like protein [Arabidopsis thaliana]
 gi|7268424|emb|CAB78716.1| limonene cyclase like protein [Arabidopsis thaliana]
          Length = 1024

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 44/115 (38%), Positives = 71/115 (61%), Gaps = 4/115 (3%)

Query: 261  TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
            TY + M+ + +CLCP+G+   SPR+VEA+ + C+PV+IAD+ +LPF+D + W    + V 
Sbjct: 894  TYVQHMKSSKYCLCPMGYEVNSPRIVEAIYYECVPVVIADNFMLPFSDVLDWSAFSVVVP 953

Query: 321  EEDVPKLDTILTSIPPEVILRKQRLLANPSM-KRAMLFPQPAQPGDAFHQILNGL 374
            E+++P+L  IL  IP    L+ Q   +N  M +R  L+    +  D FH IL+ +
Sbjct: 954  EKEIPRLKEILLEIPMRRYLKMQ---SNVKMVQRHFLWSPKPRKYDVFHMILHSI 1005


>gi|307111406|gb|EFN59640.1| hypothetical protein CHLNCDRAFT_133110 [Chlorella variabilis]
          Length = 767

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 92/338 (27%), Positives = 140/338 (41%), Gaps = 65/338 (19%)

Query: 67  FAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTP-----TGLPLPFKSP------ 115
           +  E      L+ S  RTL+PEEAD+FY P++ +C + P     + L   F +P      
Sbjct: 372 YGLESAFLEMLMQSEHRTLDPEEADFFYVPVFTSCFIYPVRSGASSLQDWFYAPVQNRVQ 431

Query: 116 ---RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF-------------------- 152
               M+  A   I S+ P+W R  G DH ++V HD G+C+                    
Sbjct: 432 GAANMLLEAFHWIQSHHPWWERRGGRDHIWLVTHDEGSCWVPAAIRPSIILSHWGRMDLN 491

Query: 153 -----HYQEE---KAIERGILPLLQRATLVQ--TFGQR--NHVCLNE-GSITIPPYAPPQ 199
                 Y E+   +A  R +    Q+  L +   F Q+   H C +    + +P    P 
Sbjct: 492 HSSTTGYWEDDYRQANARKLQHAEQQQHLFEPDGFQQKIAGHACYDPVKDLVVPLIKTPN 551

Query: 200 KMQAHQI--PPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAV--------WENFKNN 249
           + +   +   P   R+   + RG    VN+  E   Y+RG R  V        W     +
Sbjct: 552 RNKHSPLFGAPTRNRTWLAFHRGR---VNH--EFPRYSRGVRQRVDNASREHQWLENYGS 606

Query: 250 PLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 309
              D S      Y E +  +IFCL   G   WS R+ +A+  GCIPV+I DD+ + F   
Sbjct: 607 KFGDESLQ--GDYSELLASSIFCLVLQG-DGWSARMDDAMSHGCIPVVIIDDVHVSFESV 663

Query: 310 IPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLA 347
           +   +  + V   DV +L  IL ++  E     QR LA
Sbjct: 664 LDLSQFSLRVKSADVERLPEILQAVSQERREELQRNLA 701


>gi|356538648|ref|XP_003537813.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase F8H-like
           [Glycine max]
          Length = 502

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 96/355 (27%), Positives = 148/355 (41%), Gaps = 48/355 (13%)

Query: 40  LKVYVYELPSKYNKKLL--LKDP-RCLTHMFAAEIFMHRF-------------------- 76
           L+VYVY++P K+   LL   K+  R  +++ +    +HR                     
Sbjct: 112 LRVYVYDMPPKFTHDLLWLFKNTYRDTSNLTSNGSPVHRLIEQHSIDYWLWADLIAPQSE 171

Query: 77  -LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRT 135
            LL+S VR    EEAD FY P + T       L    +   + R A++ I+   P W R+
Sbjct: 172 RLLTSVVRVHRQEEADLFYIPFFTTISFF---LMEKQQCKALYREALKWITDQ-PAWKRS 227

Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
            G DH   V H +     ++  +   +  + LL        + +   V L +  I   PY
Sbjct: 228 GGRDHILPVHHPWS----FKSVRRYVKNAIWLLPDMDSTGNWYKPGQVYLEKDLIL--PY 281

Query: 196 APPQKMQAHQIPPDT--PRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFD 253
            P   +   +   +T   RS  ++FRG           G   R    A         + +
Sbjct: 282 VPNVDLCDAKCLSETNPKRSTLLFFRGRL-----KRNAGGKIRSKLGAELSGVDGVVIEE 336

Query: 254 ISTDH--PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
            +            M++++FCL P G  P S RL +A+V GCIPVII+D++ LPF   + 
Sbjct: 337 GTAGDGGKEAAQRGMRKSLFCLSPAGDTPSSARLFDAIVSGCIPVIISDELELPFEGILD 396

Query: 312 WEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP 363
           + +I +F++  D  K   L   L  I P  I   Q+ L   S  R  L+  PAQP
Sbjct: 397 YRKIAVFISSIDAVKPGWLLKYLKGIRPAHIKAMQQNLVKYS--RHFLYSSPAQP 449


>gi|302790387|ref|XP_002976961.1| glycosyltransferase-like protein [Selaginella moellendorffii]
 gi|300155439|gb|EFJ22071.1| glycosyltransferase-like protein [Selaginella moellendorffii]
          Length = 408

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 74/312 (23%), Positives = 134/312 (42%), Gaps = 24/312 (7%)

Query: 34  DDPVGKLKVYVYELPSKYNKKLLLKDP----RCLTHMFAAEIFMHRFLLSSPVRTLNPEE 89
           ++ V   KVYVY   +    ++ L  P    R L + F+  +F    LL+S   T +P E
Sbjct: 75  EEMVKSFKVYVYPFGNSDYSQVFLPHPDPYDRKLGNFFSEHMFKIN-LLNSTFATRDPGE 133

Query: 90  ADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFG 149
           A  F+ P           +         + S ++ IS  + +WNRTEG DHF+V  H  G
Sbjct: 134 AHLFFMPFSINAMRNHPRIRSEAMISSFVESYVEEISQRYKFWNRTEGVDHFYVGCHSVG 193

Query: 150 ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPD 209
                +   +  R +     + T    + Q+ +V   + ++   P   P+ +    +PP+
Sbjct: 194 -----RNAASNSRALQQNAIQVTCSANYYQKLYVPHKDVAL---PQVWPRPLDTFIVPPE 245

Query: 210 TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRA 269
             R+   +F G             + R     +W N  +  +F  +     +Y + + R+
Sbjct: 246 -KRTKLAFFSG--------RAQNSHLRETLLKLWSNDSDMDIFAGTMQ--GSYEDALSRS 294

Query: 270 IFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDT 329
            FCL   G+   + R+ +A+ FGC+PVII++   LP ++ + W    + ++   +P L  
Sbjct: 295 KFCLHVKGYEVNTARISDALHFGCVPVIISNQYDLPLSNVLNWRSFSIVLSYTQIPALKA 354

Query: 330 ILTSIPPEVILR 341
            L S+  +   R
Sbjct: 355 KLQSVTHDEYAR 366


>gi|125547211|gb|EAY93033.1| hypothetical protein OsI_14833 [Oryza sativa Indica Group]
          Length = 429

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 81/320 (25%), Positives = 136/320 (42%), Gaps = 37/320 (11%)

Query: 36  PVGKLKVYVYELPSKYNKKLLLKD-------PRCLTHMFAAEIFMHRFLLSSPVRTLNPE 88
           P   L++YVY        + LL+          C+   +  ++ +H+ LLSS  RT + +
Sbjct: 51  PHRDLRIYVYAEDEVDGLRALLRGRNDDVTAATCIKGQWGTQVKIHQLLLSSRFRTFDKD 110

Query: 89  EADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDF 148
           EAD F+ P Y  C +  TG         + ++ ++++ S  PY+ R+ G DH FV P   
Sbjct: 111 EADLFFVPTYVKC-VRMTG---KLNDKEINQTYVKVVLSQMPYFRRSGGRDHIFVFPSGA 166

Query: 149 GACFHYQEEKAIERGILPLLQRATLVQTFGQRNH----VCLNE-GSITIPPYAPPQKMQA 203
           GA         + R     L R+ ++   G R         N    I IP       +++
Sbjct: 167 GA--------HLFRSWATFLNRSIILTPEGDRTDKRGISAFNTWKDIIIPGNVDDSMVKS 218

Query: 204 HQIP----PDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
            ++     P T R     F G         + G       A  + +   +P   +S    
Sbjct: 219 DRLAVKPIPLTKRKYLANFLG-----RAQGKVGRLQLVKLAKQYPDKLESPELKLSGPDK 273

Query: 260 ---TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG 316
                Y++ ++ A FCL P G + W+ R  E+    C+PVI++D++ LPF + I + E+ 
Sbjct: 274 LGRIDYFKHLRNAKFCLAPRGESSWTLRFYESFFVECVPVILSDEVELPFQNVIDYTEVS 333

Query: 317 M-FVAEEDVPKLDTILTSIP 335
           + + A +  P L   L SIP
Sbjct: 334 IKWPASKIGPGLLEYLESIP 353


>gi|255569522|ref|XP_002525728.1| catalytic, putative [Ricinus communis]
 gi|223535028|gb|EEF36711.1| catalytic, putative [Ricinus communis]
          Length = 336

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 79/321 (24%), Positives = 135/321 (42%), Gaps = 49/321 (15%)

Query: 67  FAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRM--------M 118
           +A+E +  + L+ S   T +P +AD F+ P           +      PR+        +
Sbjct: 33  YASESYFKKVLMKSHFITKDPTKADLFFLPF---------SIARLRHDPRIGVEGIQDFI 83

Query: 119 RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG 178
           R+ +  IS  +PYWNRT G DHF+V  H  G     + E+     I  +   +  +    
Sbjct: 84  RAYVYNISQKYPYWNRTGGTDHFYVACHSIGRTAMEKAEEVKFNAIQVVCSSSYYL---- 139

Query: 179 QRNHVCLNEGSITIPPYAPPQKMQAHQIPPD---TPRSIFVYFRGLFYDVNNDPEGGYYA 235
                    G I     + PQ       PP+   + R    +F G      N P      
Sbjct: 140 --------SGYIAHKDASLPQVWPRQGDPPNLASSERQKLAFFAGSI----NSP-----V 182

Query: 236 RGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIP 295
           R     VW N  ++ ++       T+Y +++  + FCL   G+   + R+ +++ +GC+P
Sbjct: 183 RERLLQVWRN--DSEIYVHYGRLNTSYADELLGSKFCLHVKGFEVNTARIADSLYYGCVP 240

Query: 296 VIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE--VILRKQRLLANPSMKR 353
           +IIA+   LPF D + WE   + VA  D+  L  IL  +  +  V+L+   L     +++
Sbjct: 241 IIIANHYDLPFTDILNWESFSVVVATLDILYLKKILQGVSSDRYVMLQSNVL----KVRK 296

Query: 354 AMLFPQPAQPGDAFHQILNGL 374
              +  P    DAFH ++  L
Sbjct: 297 HFQWHFPPVDYDAFHMVMYEL 317


>gi|302782213|ref|XP_002972880.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
 gi|300159481|gb|EFJ26101.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
          Length = 459

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 99/397 (24%), Positives = 159/397 (40%), Gaps = 66/397 (16%)

Query: 34  DDPVGKLKVYVYELPSKY----------------NKKLLLKDPRCLTHMFAAEIFMHRFL 77
           DD V  LKVY+Y+LP+++                N   L + P  L    + E ++   L
Sbjct: 79  DDRV--LKVYMYDLPAEFHFGMLDAAISGRPWPRNISSLPRYPGGLYQQHSPEYWLTADL 136

Query: 78  LSS-----------PVRTLNPEEADWFYTPIYPTCDLTP--------TGLPLPFKSPRMM 118
           LSS            VR  +P  AD F+ P + +              G     ++ R+ 
Sbjct: 137 LSSTDPSSRKSPCSAVRVADPATADIFFVPFFSSLSYNRYCRTGHRFQGGRGCVENDRLE 196

Query: 119 RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG 178
           +  ++ +      W R  GADH  V+        H+     + R +L   +   +V  FG
Sbjct: 197 KRLVEFLRGQ-ELWRRNGGADHVIVM--------HHPNSLMVARSLLK--EAMFVVADFG 245

Query: 179 QRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGA 238
           + +    N     + PY       A        R   ++F+G         EGG      
Sbjct: 246 RFSRAVANMRKDIVAPYKHVIPSFARDATTFESRETLLFFQGAIVR----KEGGII---- 297

Query: 239 RAAVWENFKNNPLFDIST-----DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGC 293
           R  ++E  K++P     T     D   +    M+ A FCL   G  P S RL +A+   C
Sbjct: 298 RQKLYEILKDSPGVHFVTGNTQKDGIRSATAGMRNAKFCLHLAGDTPSSNRLFDAIASHC 357

Query: 294 IPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTI---LTSIPPEVILRKQRLLANPS 350
           +PVII+D+I LPF D + + +  +FV  +   +   +   L  I  +   RK  +L   S
Sbjct: 358 VPVIISDEIELPFEDELDYSQFCVFVESDKALRKGFVVRALERIGRDEWTRKWAML--KS 415

Query: 351 MKRAMLFPQPAQPGDAFHQILNGLARKLPHDKSTYLK 387
           ++R   +  P+ P DA H    G+A+++P  KS   K
Sbjct: 416 VERHFEYQHPSLPEDAVHMTWRGIAKRVPALKSMAHK 452


>gi|42571577|ref|NP_973879.1| exostosin-like protein [Arabidopsis thaliana]
 gi|332191986|gb|AEE30107.1| exostosin-like protein [Arabidopsis thaliana]
          Length = 410

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 78/310 (25%), Positives = 120/310 (38%), Gaps = 63/310 (20%)

Query: 40  LKVYVYELPSKYNKKLLL-------KDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADW 92
           LK+YVY+       K LL       K   CL   + +++ +H+ LL S  RT+  +EAD 
Sbjct: 89  LKIYVYDENEIDGLKELLYGRDGSVKTTACLKGQWGSQVKIHKLLLESKFRTIKKDEADL 148

Query: 93  FYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
           F+ P Y  C     GL     + + +      + S  PY+ R+ G DH FV P   GA  
Sbjct: 149 FFVPAYVKCVRMLGGL-----NDKEINQTYVKVLSQMPYFRRSGGRDHIFVFPSGAGAHL 203

Query: 153 HYQEEKAIERGIL--PLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIP--- 207
                  I R I+  P   R        +++    N     I P      M  +  P   
Sbjct: 204 FRSWSTFINRSIILTPEADRTD------KKDTTAFNSWKDIIIPGNVDDAMTKNGQPDVQ 257

Query: 208 --PDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP------ 259
             P + R     + G              A+G    +        L D+S   P      
Sbjct: 258 PLPLSKRKYLANYLG-------------RAQGKAGRL-------KLIDLSKQFPDKLECP 297

Query: 260 ------------TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFA 307
                       TTY+E ++ A FCL P G + W+ R  E+    C+PV+++D   LPF 
Sbjct: 298 DLKFSGTEKFGRTTYFEHLRNAKFCLAPRGESSWTLRFYESFFVECVPVLLSDHAELPFQ 357

Query: 308 DAIPWEEIGM 317
           + I + ++ +
Sbjct: 358 NVIDYAQVSI 367


>gi|51477380|gb|AAU04753.1| EXO [Cucumis melo]
          Length = 343

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 82/345 (23%), Positives = 149/345 (43%), Gaps = 38/345 (11%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
           LK+YVY    K  +  +     C +       F+H     +   T +P++A  ++ P   
Sbjct: 16  LKIYVY----KEGEPPMFHGGPCKSIYSTEGRFIHEMEKGNLYTTNDPDQALLYFLPFSV 71

Query: 100 TCDLTPTGLPLPFKSPRMMRSA---IQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQE 156
              +    +P   +   + R+    I +IS   P+W+R+ GADHF +  HD+G       
Sbjct: 72  VNLVQYLYVPNSHEVNAIGRAITDYINVISKKHPFWDRSLGADHFMLSCHDWGP------ 125

Query: 157 EKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQ-KMQAHQIP------PD 209
                   +PLL   + ++     N   ++EG +     + P+  ++  +I         
Sbjct: 126 ---RTTSYVPLLFNNS-IRVLCNAN---VSEGFLPSKDASFPEIHLRTGEIDGLIGGLSP 178

Query: 210 TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRA 269
           + RS+  +F G  +          + R      W+    + L         +Y   ++++
Sbjct: 179 SRRSVLAFFAGRLHG---------HIRYLLLQEWKEKDEDVLVYEELPSGISYNSMLKKS 229

Query: 270 IFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDT 329
            FCLCP G+   SPR+VEA+   C+PV+I++  V PF+D + W+   + +  +D+P +  
Sbjct: 230 RFCLCPSGYEVASPRVVEAIYAECVPVLISESYVPPFSDVLNWKSFSVQIQVKDIPNIKK 289

Query: 330 ILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           IL  I     LR QR +    ++R        +  DAFH IL+ +
Sbjct: 290 ILKGISQTQYLRMQRRVKQ--VQRHFALNGTPKRFDAFHMILHSI 332


>gi|388493832|gb|AFK34982.1| unknown [Lotus japonicus]
          Length = 229

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 44/132 (33%), Positives = 75/132 (56%), Gaps = 2/132 (1%)

Query: 243 WENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 302
           WEN   +     S     +YY+ ++++ FCLCP G+   SPR+VEA+  GC+PV+I++  
Sbjct: 87  WENKDGDMQVYSSLPKGVSYYDMLRKSKFCLCPSGYEVASPRVVEAIYTGCVPVLISEHY 146

Query: 303 VLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQ 362
           V PF+D + W+   + V+ +D+P+L  IL S+     +R QR +    ++R      P +
Sbjct: 147 VPPFSDVLNWKSFSLEVSVKDIPRLKEILLSVNTRQYIRMQRRVGQ--IRRHFEIHSPPK 204

Query: 363 PGDAFHQILNGL 374
             D FH +L+ +
Sbjct: 205 RFDVFHMVLHSV 216


>gi|22330483|ref|NP_176908.2| exostosin-like protein [Arabidopsis thaliana]
 gi|115311405|gb|ABI93883.1| At1g67410 [Arabidopsis thaliana]
 gi|332196520|gb|AEE34641.1| exostosin-like protein [Arabidopsis thaliana]
          Length = 430

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 92/379 (24%), Positives = 156/379 (41%), Gaps = 53/379 (13%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRC-------------------LTHMFAAEIFMHRFLLS- 79
           L+V++Y+LP K+N  + + DP                     +    + E ++   LL+ 
Sbjct: 53  LRVFMYDLPRKFN--IAMMDPHSSDVEPITGKNLPSWPQTSGIKRQHSVEYWLMASLLNG 110

Query: 80  -----SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLP---FKSPRMMRSAIQLISSNWPY 131
                  +R  +P+ AD FY P + +      G  +     +  R+++  +     N  Y
Sbjct: 111 GEDENEAIRVFDPDLADVFYVPFFSSLSFNTHGKNMTDPDTEFDRLLQVELMEFLENSKY 170

Query: 132 WNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSIT 191
           WNR+ G DH  V+P      F +  ++ +   IL       +V  FG+ +          
Sbjct: 171 WNRSGGKDH--VIPMTHPNAFRFLRQQ-VNASIL-------IVVDFGRYSKDMARLSKDV 220

Query: 192 IPPY-----APPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENF 246
           + PY     +  ++       P   R+  +YFRG   +     EG    R  +     + 
Sbjct: 221 VSPYVHVVESLNEEGDDGMGDPFEARTTLLYFRG---NTVRKDEGKIRLRLEKLLAGNSD 277

Query: 247 KNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 306
            +      +T +     E M+ + FCL P G  P S RL +A+V  CIPVII+D I LPF
Sbjct: 278 VHFEKSVATTQNIKVSTEGMRSSKFCLHPAGDTPSSCRLFDAIVSHCIPVIISDKIELPF 337

Query: 307 ADAIPWEEIGMFVAEEDVPKLDTILTSI---PPEVILRKQRLLANPSMKRAMLFPQPAQP 363
            D I + E  +F + ++  +   IL ++   P E  L   + L N S      F  P + 
Sbjct: 338 EDEIDYSEFSLFFSIKESLEPGYILNNLRQFPKEKWLEMWKRLKNVS--HHFEFQYPPKR 395

Query: 364 GDAFHQILNGLARKLPHDK 382
            DA + +   +  K+P+ K
Sbjct: 396 EDAVNMLWRQVKHKIPYVK 414


>gi|326491429|dbj|BAJ94192.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 400

 Score = 92.0 bits (227), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 87/354 (24%), Positives = 147/354 (41%), Gaps = 57/354 (16%)

Query: 40  LKVYVYELPSKYNKKLLLKD-------PRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADW 92
           L+VYVY        + LL+          CL   +  ++ +H+ LL S  RTL+ +EA+ 
Sbjct: 54  LRVYVYAEDEVDGLRALLRGRDGAVSAATCLKGQWGTQVKIHQLLLRSRYRTLDKDEANL 113

Query: 93  FYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
           F+ P Y  C     GL     + + +      + S  PY+ R+ G DH FV P   GA  
Sbjct: 114 FFVPSYVKCVRMTGGL-----TDKEINQTYVKVLSQMPYFRRSGGRDHIFVFPSGAGAHL 168

Query: 153 HYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPR 212
                    R     L R+ ++   G R     ++  I                      
Sbjct: 169 F--------RSWATFLNRSIILTPEGDRT----DKRGI---------------------- 194

Query: 213 SIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN---NPLFDISTDHP---TTYYEDM 266
           S F  ++ +    N D   G   R     + + + +   +P   +S         Y++ +
Sbjct: 195 SAFNTWKDIIIPGNVDDSMGKVGRLKLVELAKQYPDKLESPELKLSGPDKLGRIDYFKHL 254

Query: 267 QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGM-FVAEEDVP 325
           + A FCL P G + W+ R  E+    C+PV+++D++ LPF + I + +I + + A +  P
Sbjct: 255 RNAKFCLAPRGESSWTLRFYESFFVECVPVLLSDEVELPFQNVIDYTKISIKWPASKIGP 314

Query: 326 KLDTILTSIPPEVILRKQRLLANPSMKRAM-LFPQPAQPGDAFHQILNGLARKL 378
           +L   L SIP E   R + ++A     R M ++    +P  A   I+  L RK+
Sbjct: 315 ELFQYLESIPEE---RIEEMIARGREVRCMWVYALDTEPCSAMTAIMWELQRKV 365


>gi|255076909|ref|XP_002502118.1| glycosyltransferase family 47 protein [Micromonas sp. RCC299]
 gi|226517383|gb|ACO63376.1| glycosyltransferase family 47 protein [Micromonas sp. RCC299]
          Length = 717

 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 102/413 (24%), Positives = 171/413 (41%), Gaps = 81/413 (19%)

Query: 41  KVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPT 100
           ++Y+Y LP K+N  + L+    LT     ++   R LLSS  R  NPE+AD F+ PI PT
Sbjct: 227 RIYIYPLPPKFNGHVDLR----LTDRPLEQMIYER-LLSSHHRVANPEDADLFFLPI-PT 280

Query: 101 CDLTPTGLPLPFKSPRM---MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEE 157
                 GL      P +      AI+ + + W +  + E  +   V   D+G C  + E+
Sbjct: 281 RAAFRGGLDNVGGWPGVNDFFHEAIEYVDNTWEWSKKHEWRNTIMVFTGDWGPCEWFSEK 340

Query: 158 KAIE-------------RGILPLLQRATLVQTFGQR--NHVCLNEG-------SITIPPY 195
            + +             R I  ++  A ++  +G    + + L  G        + IPP 
Sbjct: 341 YSKKENDPDYEAFWKKRRRINEVIANAIVLTHWGLTIADDLYLGGGPCFDPAKDVLIPPV 400

Query: 196 AP--------------------------------PQKMQAHQIPPDTPRSIFVYFRGLFY 223
            P                                P   +      D PR   ++F G + 
Sbjct: 401 NPHFGLGPFDPDGWKAPMGTRRIEFDVGLRGSDVPFGSERAMTEQDEPRRWLLFFAGAWV 460

Query: 224 DVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYE-DMQRAIFCLCPLGWAPWS 282
           D     +  Y  R A A      +   +  +   H   +YE +   + FC+ P G + W 
Sbjct: 461 D-----KPAYADRRAIAEAMAGREQEGIHVVQ--HAGQFYEKNYASSTFCIAPTG-SGWG 512

Query: 283 PRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVI--L 340
            R+  A   GCIPVI+ D+I  P+ D +P++E  + VA+ D+PK+  I+ +I PE +  +
Sbjct: 513 RRMNLATQSGCIPVIVQDNIAAPYDDVLPYDEFSVRVAKADIPKIPDIVKAITPEKLDRM 572

Query: 341 RKQRLLANPSMKRAMLFPQPAQPG---DAFHQILNGLARKLPHDKSTYLKPGG 390
           R+Q   A  +++ + +       G   DAF  ++  L  +L     T++KP G
Sbjct: 573 RQQLACAARALQWSSILGSDFGEGGENDAFALLMLTLQHRL----VTHVKPEG 621


>gi|356577103|ref|XP_003556667.1| PREDICTED: probable glucuronosyltransferase Os02g0520750-like
           [Glycine max]
          Length = 459

 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 86/320 (26%), Positives = 133/320 (41%), Gaps = 51/320 (15%)

Query: 82  VRTLNPEEADWFYTPIYPTCDL---------TPTGLPLPFKSPRMMRSAIQLISSNWPYW 132
           VR  +PEEAD F+ P + +  L         + +GL  P  S    + A+        YW
Sbjct: 136 VRVADPEEADLFFVPFFSSLSLIVNPVRPPGSNSGLEKPVYSDEENQEALVEWLEKQEYW 195

Query: 133 NRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITI 192
            R  G DH  V            +  A+ R I  +     LV  FG+   +  ++GS+ +
Sbjct: 196 KRNNGRDHVIVA----------SDPNAMYRVIDRVRNAVLLVSDFGR---LRPDQGSL-V 241

Query: 193 PPYAPPQKMQAHQIPPDT---PRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN 249
                P   +    P D     R   ++F G  Y      EGG   R     + EN K  
Sbjct: 242 KDVVVPYSHRIRTYPGDVGVEDRKTLLFFMGNRYR----KEGGK-IRDLLFQILENEK-- 294

Query: 250 PLFDISTDHPTTYYED-------MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 302
              D+   H     E        M  + FCL P G  P + RL +A+V  CIPVI++D+I
Sbjct: 295 ---DVIIKHGAQSRESRRAASHGMHTSKFCLHPAGDTPSACRLFDAIVSLCIPVIVSDNI 351

Query: 303 VLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQ 359
            LPF D I + +I +FV      K   L + L ++ P+ +L  Q+      +K    + +
Sbjct: 352 ELPFEDTIDYRKIAVFVETSSAIKPGHLLSKLRAVTPDRVLEYQK-----KLKEVKRYFE 406

Query: 360 PAQPGDAFHQILNGLARKLP 379
             +P    ++I   +++KLP
Sbjct: 407 YEEPDGTINEIWRQVSKKLP 426


>gi|302848153|ref|XP_002955609.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
 gi|300259018|gb|EFJ43249.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
          Length = 572

 Score = 91.7 bits (226), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 93/337 (27%), Positives = 139/337 (41%), Gaps = 63/337 (18%)

Query: 41  KVYVYELPSKYN-KKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
           K+YVYE+P  ++ K+ + K  R   HM      M R +LS   RT +PE+AD+FY P   
Sbjct: 140 KIYVYEIPPDFHVKRDIHKVDRPPLHM----ALMER-ILSGGHRTADPEKADFFYIPASA 194

Query: 100 TCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKA 159
             DL         K   ++   +  I   WP WN+T GA H      D G C        
Sbjct: 195 R-DL---------KRAFLLEPLLNYIIEAWPIWNQTGGARHIMPAEGDVGTC-------E 237

Query: 160 IERGILPLLQRATLVQTFGQ-----------RNHV-CLNEGSITIPPYAPPQKMQAH-QI 206
           +   I  +    T +Q +G             N V C+  G   + P+     M +H + 
Sbjct: 238 LPMKIRNMTANVTWLQFWGMYDFHPHWTQIFHNRVPCMVPGRDIVVPF---MAMSSHDRF 294

Query: 207 PPDTP---------RSIFVYFRGLFYDVNND----PEGGYYAR-----GARAAVWENFKN 248
             +TP         R+   +F G      N     P   YY +     G R AV+ ++  
Sbjct: 295 VIETPLHPRNQKRNRTNTFFFAGGICGSGNKRALPPHCTYYKQVRYSGGVRQAVYYHYHK 354

Query: 249 NPLFDI--STDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 306
            P + +   TD    Y  D   +IFCL   G   W  R + A ++GCIPV   D +   F
Sbjct: 355 RPGWRVVPGTD---DYARDYASSIFCLAAAG-GGWGKRGIVATMYGCIPVAATDMLYEAF 410

Query: 307 ADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQ 343
              + W   G+ V++  +P+L  +L +  PE I + Q
Sbjct: 411 EPEMDWNRFGVRVSQAQIPQLGDMLEAFTPEQIRQMQ 447


>gi|224144703|ref|XP_002325382.1| predicted protein [Populus trichocarpa]
 gi|222862257|gb|EEE99763.1| predicted protein [Populus trichocarpa]
          Length = 505

 Score = 91.7 bits (226), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 93/359 (25%), Positives = 148/359 (41%), Gaps = 56/359 (15%)

Query: 40  LKVYVYELPSKYNKKLL---LKDPRCLTHMFAAEIFMHRF-------------------- 76
           ++VY+YE+PSK+   LL       R   ++ +    +HR                     
Sbjct: 118 IRVYLYEMPSKFTYDLLWLFRNTYRNTDNLTSNGSPVHRLIEQHSVDYWLWADLIAPESE 177

Query: 77  -LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRT 135
            LL S VR    E+AD FY P + T       L    +   + R A++ ++   P W R+
Sbjct: 178 RLLKSVVRVERQEDADLFYVPFFTTISFF---LLEKQQCKALYREALKWVTDQ-PAWKRS 233

Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
           EG +H F + H +     ++  +   +  + LL        + +   V L +  I   PY
Sbjct: 234 EGRNHIFPIHHPWS----FKSVRRYVKNAIWLLPDMDSTGNWYKPGQVFLEKDLIL--PY 287

Query: 196 APPQKMQAHQI--PPDTPRSIFVYFRGLFYDVNNDPEGGYYAR------GARAAVWENFK 247
            P   +   +     ++ RS  +YFRG    +  +  G   A+      GA     E   
Sbjct: 288 VPNVNLCDTKCISESESKRSTLLYFRG---RLKRNAGGKIRAKLVAELSGAEGVFIEEGT 344

Query: 248 NNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFA 307
                             M+++IFCL P G  P S RL +A+V GCIPV+++D++ LPF 
Sbjct: 345 ------AGEGGKAAAQIGMRKSIFCLSPAGDTPSSARLFDAIVSGCIPVVVSDELELPFE 398

Query: 308 DAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP 363
             + + +I +FV+  D  +   L   L  I    I   QR LA  S  R  ++  PA P
Sbjct: 399 GILDYRKIALFVSSSDAVQPGWLLKFLKGISLAQIRGMQRNLAKYS--RHFIYSSPALP 455


>gi|255565307|ref|XP_002523645.1| catalytic, putative [Ricinus communis]
 gi|223537097|gb|EEF38731.1| catalytic, putative [Ricinus communis]
          Length = 452

 Score = 91.3 bits (225), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 106/434 (24%), Positives = 182/434 (41%), Gaps = 87/434 (20%)

Query: 3   VTAIALLLAFVSADNT-------------PKIERISGSAGDVLEDDPVG---KLKVYVYE 46
           + +I L+LA  +  NT             P+   +SG  G V E++      K+K+++Y+
Sbjct: 13  IFSICLILALYAVFNTFFNPTTSSLLYPSPEDNSLSGFPGKVTENNDNNNINKVKIFMYD 72

Query: 47  LPSKYNKKLLL--------KDPRCLT---HMFAAEIFM--------HRFLLSSPVRTLNP 87
           LP K+   ++         KD   +    H    E ++        H  + S  V+  +P
Sbjct: 73  LPKKFTTGIIQQHALARGSKDTSNVKYPGHQHMGEWYLFSDLNRPEHGRIGSPVVKVDDP 132

Query: 88  EEADWFYTPIYPTCDL---------TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGA 138
           +EAD FY P++ +  L         T  GL   +    M    ++ +     YW R  G 
Sbjct: 133 DEADLFYVPVFSSLSLIVNPVRPAGTEPGLVQHYSDEEMQEQLVEWLEQQ-EYWKRNNGR 191

Query: 139 DHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSIT---IPPY 195
           DH  ++  D  A +   +   ++  IL       L+  FG+   V  ++GS+    I PY
Sbjct: 192 DHV-IIAGDPNALYRVLDR--VKNAIL-------LLSDFGR---VRPDQGSLVKDIIVPY 238

Query: 196 APPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDIS 255
           +    +    I     R+  ++F G  Y      +GG      R  +++  ++    D+ 
Sbjct: 239 SHRINVYNGDIGV-RDRNTLLFFMGNRYR----KDGG----KIRDLLFQMLESEE--DVV 287

Query: 256 TDHPTTYYED-------MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
             H T   E+       M  + FCL P G  P + RL +++V  C+PVI++D I LPF D
Sbjct: 288 IKHGTQSRENRRAASRGMHTSKFCLNPAGDTPSACRLFDSIVSLCVPVIVSDSIELPFED 347

Query: 309 AIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
            I + +I +FV   D  K   L  +L  +  E IL  Q+      +K+   + +      
Sbjct: 348 VIDYTKIAIFVETTDSLKPGYLVKLLREVTSERILEYQK-----ELKKVTRYFEYDNSNG 402

Query: 366 AFHQILNGLARKLP 379
             ++I   +A+KLP
Sbjct: 403 TVNEIWRQVAQKLP 416


>gi|357508695|ref|XP_003624636.1| Exostosin-like protein [Medicago truncatula]
 gi|87162615|gb|ABD28410.1| Exostosin-like [Medicago truncatula]
 gi|116831751|gb|ABK28848.1| exostosin-like protein [Medicago truncatula]
 gi|355499651|gb|AES80854.1| Exostosin-like protein [Medicago truncatula]
          Length = 486

 Score = 91.3 bits (225), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 75/310 (24%), Positives = 133/310 (42%), Gaps = 32/310 (10%)

Query: 67  FAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLIS 126
           +A+E +  + L+ S   T +P +AD F+ P +    L              +R  +Q + 
Sbjct: 187 YASESYFKKALMKSHFITKDPTKADLFFMP-FSIASLRHDRRVGVGGIQDFIRDYVQNMI 245

Query: 127 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLN 186
             +PYWNRT GADHF+V  H  G     +        I  +   +  +  +      CL 
Sbjct: 246 HKYPYWNRTNGADHFYVACHSIGRSAMDKAPDVKFNAIQVVCSSSYFLSGYIAHKDACL- 304

Query: 187 EGSITIPPYAPPQKMQAHQIPPD---TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVW 243
                      PQ    ++ PP+   + R    +F G   +VN+        R      W
Sbjct: 305 -----------PQIWPRNENPPNLVSSNRKKLAFFAG---EVNSP------VRINLVETW 344

Query: 244 ENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 303
           +N  +  +F  +    T Y +++  + FC    G+   + R+ +++ +GC+PVIIA+   
Sbjct: 345 KN--DTEIFVHNGRLKTPYGDELLGSKFCFHVRGYEVNTARIGDSLYYGCVPVIIANYYD 402

Query: 304 LPFADAIPWEEIGMFVAEEDVPKLDTILTSI--PPEVILRKQRLLANPSMKRAMLFPQPA 361
           LPFAD + W+   + V   D+P L  IL  I    E ++ ++ +L    ++    +  P 
Sbjct: 403 LPFADVLNWKSFSVVVTTLDIPLLKKILKGIVNSGEYLMLQKNVLK---VREHFQWHSPP 459

Query: 362 QPGDAFHQIL 371
              DAF+ ++
Sbjct: 460 IDFDAFYMVM 469


>gi|356546128|ref|XP_003541483.1| PREDICTED: LOW QUALITY PROTEIN: xylogalacturonan
           beta-1,3-xylosyltransferase-like [Glycine max]
          Length = 252

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 82/321 (25%), Positives = 131/321 (40%), Gaps = 78/321 (24%)

Query: 58  KDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRM 117
           K P   TH   A +F+  F +S  +R        + Y P     D  P  L       R+
Sbjct: 4   KSPFKATHPEQAHLFLLPFSVSKVIR--------YVYKPRRSRSDYDPDRLQ------RL 49

Query: 118 MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTF 177
           +   I ++++ +P WNR+ GADHF V  HD+           + +  +  L  A   + F
Sbjct: 50  VLDYINIVANRYPNWNRSRGADHFLVSFHDW-----LDANPEVFKYFIRALCNANTSEGF 104

Query: 178 GQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARG 237
                V + E  + +P     +++Q H+  P          +GL                
Sbjct: 105 QPSRDVSITE--VYLPSRKLDKEVQVHEYLP----------KGL---------------- 136

Query: 238 ARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVI 297
                                   Y + M +  FCLCP      SPR+VEA+  GC+PVI
Sbjct: 137 -----------------------EYTKLMGQRKFCLCP-SXQVASPRVVEAIYVGCVPVI 172

Query: 298 IADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLAN-PSMKRAML 356
           I D+  LPF+D +   +   F+A E +P+  TIL ++  +   +   L +N   ++R  +
Sbjct: 173 ICDNYSLPFSDVVKRSK---FIAVERIPETKTILQNVSKD---KYMELYSNVKRVRRHFV 226

Query: 357 FPQPAQPGDAFHQILNGLARK 377
             +PA+P D  H IL+ L  K
Sbjct: 227 INRPAKPFDLIHMILHSLRNK 247


>gi|356571729|ref|XP_003554026.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
           max]
          Length = 487

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 87/342 (25%), Positives = 138/342 (40%), Gaps = 66/342 (19%)

Query: 67  FAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRM--------- 117
           + +E +  + L+ S   T +P EAD F+               LPF   R+         
Sbjct: 185 YTSESYFKKVLMKSHFITKDPPEADLFF---------------LPFSMARLWHDRRVGVG 229

Query: 118 -----MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRAT 172
                +R  I  IS  +PYWN T GADHF+V  H  G     +        I  +   + 
Sbjct: 230 GIQDFIRDYIHNISHRYPYWNNTGGADHFYVACHSIGRSAMDKAPDEKFNAIQVVCSSSY 289

Query: 173 LVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPD---TPRSIFVYFRGLFYDVNNDP 229
            +  +      CL            PQ       PP+   + R    +F G      N P
Sbjct: 290 FLTGYFAHKDACL------------PQIWPRKGNPPNLVSSKRKRLAFFAGGV----NSP 333

Query: 230 EGGYYARGARAAVWENFKNNPLFDISTDH---PTTYYEDMQRAIFCLCPLGWAPWSPRLV 286
                    R  + E +KN+   +I   H    T Y +++  + FCL   G+   + R+ 
Sbjct: 334 --------VRVKLLETWKNDS--EIFVHHGRLKTPYADELLGSKFCLHVKGFEVNTARIG 383

Query: 287 EAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLL 346
           +++ +GC+PVIIA+   LPFAD + W+   + V   D+P L  IL  I   +   K  +L
Sbjct: 384 DSLYYGCVPVIIANYYDLPFADVLNWKSFSVVVTTLDIPLLKKILKDI---ISSNKYLML 440

Query: 347 ANPSMKRAMLFP--QPAQPGDAFHQILNGLARKLPHDKSTYL 386
            +  +K    F    P Q  DAF+ ++  L  +    K+T++
Sbjct: 441 QSNVLKVRKHFQWHSPPQDFDAFYMVMYELWLRRSSIKNTWV 482


>gi|260813939|ref|XP_002601673.1| hypothetical protein BRAFLDRAFT_94550 [Branchiostoma floridae]
 gi|229286975|gb|EEN57685.1| hypothetical protein BRAFLDRAFT_94550 [Branchiostoma floridae]
          Length = 786

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 116/468 (24%), Positives = 179/468 (38%), Gaps = 102/468 (21%)

Query: 13  VSADNTPKIERISGSAGDV-------LEDDPVGK---------------------LKVYV 44
           V  D +  + R  G AG +       L+DD  G+                     L+V+V
Sbjct: 317 VKDDESSPLPRREGQAGHISGKRKVYLKDDTTGQPSPKPVKRPESAGNAAQTSSPLRVFV 376

Query: 45  YELPSKYNKKLL----LKDPRCL---------------------THMFAAEIFMHRFLLS 79
           Y+LPS++N  L+    +K+ RC                      THMF+ E+ +H+ LLS
Sbjct: 377 YDLPSEFNSGLVHCIQVKN-RCYQLQDYGMGLEFARYGNVSFRSTHMFSLEVILHQKLLS 435

Query: 80  SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLP-FKSPRMMRSAIQLISSNWPYWNRTEGA 138
           S  RTL+PE+AD FY P YP   L     P+    SP + R   Q I+SN+PY+ + +  
Sbjct: 436 STFRTLDPEKADVFYIPYYPA--LAAACEPVSTIDSPALDRELWQFITSNYPYFQQGK-- 491

Query: 139 DHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNH---------VCLNEGS 189
                 PH        +E   +  GIL    R +   TF    H         +  +   
Sbjct: 492 ------PHMMALGRIEREHADVTGGILK--TRESRSVTFVAIEHESDPKTLKFIRRSGLP 543

Query: 190 ITIPPYAPPQKM----------QAHQIPPDTPRSIFVYFRG---LFYDVNNDPEGGYYAR 236
           + + PY     +          ++ +   D PR + V F G   + +D+           
Sbjct: 544 MVVAPYPSCGHLLSDNKFGGESKSERTQLDIPRDVLVLFAGSRRMSHDIRRILSQQLRPT 603

Query: 237 GARAAVWENF-KNNPLFDISTDHPTTYYED----MQRAIFCLCPLGWAPWSPRLVEAVVF 291
             +     +  K N  F        ++ E+    M  ++FCL P G +P      +AV  
Sbjct: 604 SEKYDATSSLNKQNVWFITQECRDRSWQENLVEWMHHSVFCLQPPGDSPTRKSFFDAVQC 663

Query: 292 GCIPVIIADDI--VLPFADAIPWEEIGMFVAEEDV----PKLDTILTSIPPEVILRKQRL 345
           GCIPVI   D   V PF D + + +  + V + D       +  IL  IP  VI  K+  
Sbjct: 664 GCIPVIFKLDHEPVYPFDDVLDYSKFTVKVTDGDFFQEKRSIVDILQDIPEAVIAAKRAE 723

Query: 346 LANPSMKRAMLFP--QPAQPGDAFHQILNGLARKLPHDKSTYLKPGGK 391
           L   +      +P        DAF  I+  + R      +   + G K
Sbjct: 724 LRQVTPLLQYSYPPLPETHVQDAFDMIMQEIGRTRGGRSNVRRRYGSK 771


>gi|222635067|gb|EEE65199.1| hypothetical protein OsJ_20321 [Oryza sativa Japonica Group]
          Length = 551

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 52/124 (41%), Positives = 70/124 (56%), Gaps = 6/124 (4%)

Query: 253 DISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 312
           D   +    YY  M+R+ FCLCP G    SPR+VEA+  GC+PV++AD    PFAD + W
Sbjct: 420 DAGGEGGNPYYWYMRRSRFCLCPSGHEVASPRVVEAIHAGCVPVVVADGYAPPFADVLRW 479

Query: 313 EEIGMFVAEEDVPKLDTILTSIP-PEVILRKQRLLANPSM-KRAMLFPQPAQPGDAFHQI 370
           E   + VA  DVP+L  +L  IP PEV    +RL     + KR  +  QP +  D FH I
Sbjct: 480 EAFSVAVAVADVPRLRELLERIPAPEV----ERLRDGVRLVKRHFMLHQPPERLDMFHMI 535

Query: 371 LNGL 374
           L+ +
Sbjct: 536 LHSV 539



 Score = 43.1 bits (100), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 40/160 (25%), Positives = 76/160 (47%), Gaps = 22/160 (13%)

Query: 10  LAFVSADNTPKIERI-------SGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRC 62
           L+F    +  ++ER+       +G +   +E +   + KVYVYE      +  +  +  C
Sbjct: 106 LSFFGDADHARLERVYRNPRRSTGGSRSYVEMER--RFKVYVYE----EGEPPIAHEGPC 159

Query: 63  LTHMFAAE-IFMHRFLLSSP----VRTLNPEEADWFYTPIYPTCDLTPTGLPLPFK-SP- 115
             +++A E  F+    L +P    VRT +P  A   + P+  +  +     PL +  SP 
Sbjct: 160 -KNIYAVEGRFIEELELMAPPLGGVRTWDPARAHALFLPLSVSQMVQLAYRPLSYDLSPL 218

Query: 116 -RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHY 154
             ++   + +++S   +WNR+ GADHF +  HD+ +  H+
Sbjct: 219 RAIVADYVAVVASRHRFWNRSAGADHFMLSCHDWASTNHF 258


>gi|218197695|gb|EEC80122.1| hypothetical protein OsI_21883 [Oryza sativa Indica Group]
          Length = 545

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 52/124 (41%), Positives = 70/124 (56%), Gaps = 6/124 (4%)

Query: 253 DISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 312
           D   +    YY  M+R+ FCLCP G    SPR+VEA+  GC+PV++AD    PFAD + W
Sbjct: 414 DAGGEGGNPYYWYMRRSRFCLCPSGHEVASPRVVEAIHAGCVPVVVADGYAPPFADVLRW 473

Query: 313 EEIGMFVAEEDVPKLDTILTSIP-PEVILRKQRLLANPSM-KRAMLFPQPAQPGDAFHQI 370
           E   + VA  DVP+L  +L  IP PEV    +RL     + KR  +  QP +  D FH I
Sbjct: 474 EAFSVAVAVADVPRLRELLERIPAPEV----ERLRDGVRLVKRHFMLHQPPERLDMFHMI 529

Query: 371 LNGL 374
           L+ +
Sbjct: 530 LHSV 533



 Score = 42.0 bits (97), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 34/124 (27%), Positives = 61/124 (49%), Gaps = 13/124 (10%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAE-IFMHRFLLSSP----VRTLNPEEADWF 93
           + KVYVYE      +  +  +  C  +++A E  F+    L +P    VRT +P  A   
Sbjct: 137 RFKVYVYE----EGEPPIAHEGPC-KNIYAVEGRFIEELELMAPPLGGVRTWDPARAHAL 191

Query: 94  YTPIYPTCDLTPTGLPLPFK-SP--RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGA 150
           + P+  +  +     PL +  SP   ++   + +++S   +WNR+ GADHF +  HD+ +
Sbjct: 192 FLPLSVSQMVQLAYRPLSYDLSPLRAIVADYVAVVASRHRFWNRSAGADHFMLSCHDWAS 251

Query: 151 CFHY 154
             H+
Sbjct: 252 TNHF 255


>gi|52075645|dbj|BAD44815.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
 gi|52075653|dbj|BAD44823.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
          Length = 514

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 52/124 (41%), Positives = 70/124 (56%), Gaps = 6/124 (4%)

Query: 253 DISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 312
           D   +    YY  M+R+ FCLCP G    SPR+VEA+  GC+PV++AD    PFAD + W
Sbjct: 383 DAGGEGGNPYYWYMRRSRFCLCPSGHEVASPRVVEAIHAGCVPVVVADGYAPPFADVLRW 442

Query: 313 EEIGMFVAEEDVPKLDTILTSIP-PEVILRKQRLLANPSM-KRAMLFPQPAQPGDAFHQI 370
           E   + VA  DVP+L  +L  IP PEV    +RL     + KR  +  QP +  D FH I
Sbjct: 443 EAFSVAVAVADVPRLRELLERIPAPEV----ERLRDGVRLVKRHFMLHQPPERLDMFHMI 498

Query: 371 LNGL 374
           L+ +
Sbjct: 499 LHSV 502



 Score = 38.9 bits (89), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 33/119 (27%), Positives = 58/119 (48%), Gaps = 13/119 (10%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAE-IFMHRFLLSSP----VRTLNPEEADWF 93
           + KVYVYE      +  +  +  C  +++A E  F+    L +P    VRT +P  A   
Sbjct: 137 RFKVYVYE----EGEPPIAHEGPC-KNIYAVEGRFIEELELMAPPLGGVRTWDPARAHAL 191

Query: 94  YTPIYPTCDLTPTGLPLPFK-SP--RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFG 149
           + P+  +  +     PL +  SP   ++   + +++S   +WNR+ GADHF +  HD+ 
Sbjct: 192 FLPLSVSQMVQLAYRPLSYDLSPLRAIVADYVAVVASRHRFWNRSAGADHFMLSCHDWA 250


>gi|413915952|gb|AFW55884.1| secondary cell wall glycosyltransferase family 47 [Zea mays]
          Length = 484

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 96/396 (24%), Positives = 157/396 (39%), Gaps = 63/396 (15%)

Query: 28  AGDVLEDDPVGKLKVYVYELPSKYNKKLLL--------------------KDPRCLTHMF 67
           A  V+  DP   ++V++Y++P +++  LL                     + P  L    
Sbjct: 95  AAGVVRCDPRDAVRVFMYDMPPEFHFGLLGWSPPSPDSVWPDVTAASPPPRYPGGLNQQH 154

Query: 68  AAEIFMHRFLLSSP--------VRTLNPEEADWFYTPIYPTCDLTPTGLPLP---FKSPR 116
           + E ++   LLSS         VR  +  +AD  + P + +        P+P       R
Sbjct: 155 SVEYWLTLDLLSSSPPCGRHSAVRVSDSRDADLVFVPFFASLSYNRHYRPVPPEKVSRDR 214

Query: 117 MMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQT 176
           +++  +    +  P W R  GADH  +V H   +  H +           +L  A  V +
Sbjct: 215 VLQEKLVRYLAARPEWRRYGGADHV-IVAHHPNSLLHARA----------VLHPAVFVLS 263

Query: 177 -FGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGG--- 232
            FG+      +     I PY    K  A+       R   +YFRG  Y      EGG   
Sbjct: 264 DFGRYPPRVASLEKDVIAPYKHMAKTYANDSAGFDDRPTLLYFRGAIYR----KEGGSIR 319

Query: 233 ---YYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAV 289
              YY       V+ +F +     +     +   + M  + FCL   G  P S RL +A+
Sbjct: 320 QELYYMLKEEKDVYFSFGS-----VQDHGASKASQGMHSSKFCLNIAGDTPSSNRLFDAI 374

Query: 290 VFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLL 346
           V  C+PVII+DDI LP+ D + + +  +FV   D  K   L  +L+ +  +   +    L
Sbjct: 375 VTHCVPVIISDDIELPYEDVLDYSKFSIFVRSSDAVKKGYLMRLLSGVSKQQWTKMWDRL 434

Query: 347 ANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDK 382
               + +   +  P+Q  DA   I   L+RK+P  K
Sbjct: 435 KE--VDKHFEYQYPSQKDDAVQMIWQALSRKVPSIK 468


>gi|224144815|ref|XP_002325425.1| predicted protein [Populus trichocarpa]
 gi|222862300|gb|EEE99806.1| predicted protein [Populus trichocarpa]
          Length = 332

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 78/316 (24%), Positives = 133/316 (42%), Gaps = 43/316 (13%)

Query: 67  FAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRM--------M 118
           +A+E +  + L+ S   T +P +AD F+ P       + T L      PR+        +
Sbjct: 33  YASESYFKKALMKSHFITKDPAKADLFFLP------FSITRLR---HDPRVGVGGIQDFI 83

Query: 119 RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG 178
           R  I  IS  +P+WNRT GADHF+   H  G     + E+     I  +   +  +  + 
Sbjct: 84  RDYILNISRKYPFWNRTGGADHFYAACHSIGRSAMEKSEEVKFNAIQVVCSSSYFLSGY- 142

Query: 179 QRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGA 238
                 +    ++ P           Q+     R    +F G      N P      R  
Sbjct: 143 ------IAHKDVSFP------GCHLSQVVKCDYRKKLAFFAGSI----NSP-----VRER 181

Query: 239 RAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 298
               W N  ++ +F       T Y +++  + FCL   G+   + R+ +++ +GC+PVII
Sbjct: 182 LLHSWRN--DSEIFAHFGRLTTPYADELLGSKFCLHVKGFEVNTARIGDSLYYGCVPVII 239

Query: 299 ADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFP 358
           A+   LPFAD + W+   + VA  D+P L  IL  I  +  L  Q+ +    +++   + 
Sbjct: 240 ANHYDLPFADILNWKSFSVVVATLDIPLLKKILKGISSDQYLMFQKKVL--EVRKHFQWH 297

Query: 359 QPAQPGDAFHQILNGL 374
            P    DAF+ ++  L
Sbjct: 298 CPPVDYDAFYMVMYEL 313


>gi|359472749|ref|XP_002276440.2| PREDICTED: probable glycosyltransferase At5g11130-like [Vitis
           vinifera]
 gi|147815974|emb|CAN68074.1| hypothetical protein VITISV_007510 [Vitis vinifera]
 gi|297737986|emb|CBI27187.3| unnamed protein product [Vitis vinifera]
          Length = 461

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 96/383 (25%), Positives = 149/383 (38%), Gaps = 68/383 (17%)

Query: 38  GKLKVYVYELPSKYN--------------KKLLLKDPRCLT---HMFAAEIFMHRFLLSS 80
           G +KVY+Y+LP K+               +K+ + D   L    H  + E ++   L+  
Sbjct: 70  GPVKVYMYDLPRKFTYGVIESYAVARGGLEKVPVDDVSSLKYPGHQHSGEWYLFSDLIRE 129

Query: 81  P------VRTLNPEEADWFYTPIYPTCDLTPT--------GLPLPFKSPRMMRSAIQLIS 126
                  VR  +PEEAD FY   + +  L           G    +    M  S ++ + 
Sbjct: 130 DRGRRYVVRVSDPEEADLFYVSFFSSLSLVVNPIRPANGEGAGTGYSDEEMQESLMEWLE 189

Query: 127 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLN 186
               YW R  G DH F+           Q+  A+   +  +     LV  FG+      +
Sbjct: 190 QQ-EYWKRNNGRDHVFIC----------QDPNALHLIVDRVKNGVLLVSDFGRLRSDTAS 238

Query: 187 EGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENF 246
                I PYA   K  + +I  +  +S+ ++F G  Y      EGG      R  +++  
Sbjct: 239 LVKDVILPYAHRIKSYSGEIGVENRKSL-LFFMGNRYR----KEGG----KIRDLLFQIL 289

Query: 247 KNNPLFDISTDHPTTYYED-------MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIA 299
           +     D+   H     E        M  + FCL P G  P + RL +A+V  C+PVI++
Sbjct: 290 EQEE--DVIIKHGAQSRESRRMASQGMHSSKFCLHPAGDTPSACRLFDAIVSLCVPVIVS 347

Query: 300 DDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAML 356
           D I LPF D I + +I +FV      K   L   L  I  E IL  QR      M+    
Sbjct: 348 DQIELPFEDVIDYRKIAIFVDSTSAVKPGFLVKNLRKITRERILEYQR-----EMQEVTR 402

Query: 357 FPQPAQPGDAFHQILNGLARKLP 379
           + +         +I   ++ KLP
Sbjct: 403 YFEYEDTNGTVSEIWRQVSMKLP 425


>gi|307111456|gb|EFN59690.1| hypothetical protein CHLNCDRAFT_133230 [Chlorella variabilis]
          Length = 569

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 91/356 (25%), Positives = 144/356 (40%), Gaps = 38/356 (10%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP-- 96
             K+Y+YEL +++       D          + FM  FL  SPVRT +P EA  FY P  
Sbjct: 243 NFKIYMYELNTQW-----AYDNDHHIAWTGHDPFMEEFL-ESPVRTEDPSEASLFYIPAF 296

Query: 97  IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQE 156
           +Y        G           +  +  I + WPYW+R  G DHF  VP D G C     
Sbjct: 297 LYSYSGNMAGG-------DEHTQLLLDHIRATWPYWDRHGGRDHFLFVPADRGTCPWGSR 349

Query: 157 EKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQI--PPDTPRSI 214
              + R I+     +T         H    +G      Y P + + A     P   P + 
Sbjct: 350 FSDLIR-IVHFGMHSTRTNHNPHFGH----QGHPEFGCYNPLRDIVAAGTGAPLSLPWAG 404

Query: 215 FVYFRGLFYDVNNDPEGGYYAR-----GARAAVWENFKNNPLFDISTDHPTTYYEDMQRA 269
           +++F   F       +  Y  R         A W    N+P F  S  +   Y    + A
Sbjct: 405 WLFF---FAGSIRTDDNVYSGRTRLILSELVAQW----NDPEFSFSGGYVNNYPAGFREA 457

Query: 270 IFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDT 329
            FCL P G+  +  RL ++++ GC+PV+I + +  P+ + +P+E   + ++ ED+P+L  
Sbjct: 458 KFCLAPWGYG-FGMRLHQSILGGCVPVVIQEHVFQPYEEVLPYETFSLRLSNEDLPQLRE 516

Query: 330 ILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDKSTY 385
            L S+  E   + + LL      +     +    G AF   +  L R+  +  S Y
Sbjct: 517 TLRSVTDE---QYRELLEGVVRYKEAFSWERHLGGRAFDYTIASLRRRWLNSLSLY 569


>gi|302773760|ref|XP_002970297.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
 gi|300161813|gb|EFJ28427.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
          Length = 437

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 95/404 (23%), Positives = 169/404 (41%), Gaps = 65/404 (16%)

Query: 13  VSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKD------------- 59
           + +++   I R SG            KLKV++YELP KYN  L  +D             
Sbjct: 39  IRSESRLGISRTSGKNA---------KLKVFMYELPRKYNFGLFNRDGPAQEIPWKNLSN 89

Query: 60  -----PRCLTHMFAAEIFMHRFLLS-------SPVRTLNPEEADWFYTPIYPTCDLTPTG 107
                 + L    + E +M   LL        +  R  +P EAD F+ P + +      G
Sbjct: 90  LPGPHTQGLKKQHSVEYWMTLDLLDEGGREFRAAQRVSDPGEADVFFVPYFASLSFNVFG 149

Query: 108 L----PLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERG 163
           +    P      ++    I+ +S + P++ R+ G DH  V+ H     F + +    +R 
Sbjct: 150 VSMRDPETEHDKKLQVGMIEYLSKS-PWYQRSGGRDHVLVLHHP--NAFRFLK----DRL 202

Query: 164 ILPLLQRATLVQTFGQ--RNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGL 221
            L LL    +V  FG+  +    L++  +    +  P         P   R+  ++F+G 
Sbjct: 203 NLSLL----VVADFGRFPKGVAALHKDVVAPYSHMVPTYNGDDGTDPFEERTTLLFFQG- 257

Query: 222 FYDVNNDPEGGYYARGARAAVWEN-----FKNNPLFDISTDHPTTYYEDMQRAIFCLCPL 276
              V    +G    R   AA+ EN     F+     + + +      + M+ + FCL P 
Sbjct: 258 --RVKRKDDG--VVRTQLAAILENQPRVHFEEGIATNFTVEQAM---QGMRSSRFCLHPA 310

Query: 277 GWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPP 336
           G  P S RL +A+V  C+PVI++D I LPF D + + E  +F + ++  +   +L ++  
Sbjct: 311 GDTPSSCRLFDAIVSHCVPVIVSDKIELPFEDELDYSEFSLFFSVDEAVRPGFLLGALEK 370

Query: 337 EVILRKQRLLAN-PSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
               R  ++      + R   +  P+Q  DA + + + + +K+P
Sbjct: 371 FSKRRWMKMWRRLKQVTRHFEYQHPSQRDDAVNMLWSQIHKKVP 414


>gi|159481468|ref|XP_001698801.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158273512|gb|EDO99301.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 967

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 99/402 (24%), Positives = 158/402 (39%), Gaps = 98/402 (24%)

Query: 32  LEDDPVGKLKVYVYELPSKYNKKLLLKDPRC-----------------LTHMFAAEIFMH 74
           L +D  G  +     LP  + K+L L   RC                     ++ E++ H
Sbjct: 472 LTNDHCGYRQSSHTPLPFNFPKRLCLCSARCDHCGYRQFSGPSNSSELFADGYSVEVYFH 531

Query: 75  RFLLSSPVRTLNPEEADWFYTPIYPTCDL-------------TPTGLPLPFKSPRMMRSA 121
             L  SP RT +P+EAD+FY P+Y TC +              PT    P  +  +  +A
Sbjct: 532 EVLAISPHRTFDPDEADFFYLPVYYTCWMWPVNGWADTPFWGAPTSWHRPSNAANLWLAA 591

Query: 122 IQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQE--EKAIERGILPLLQRATLVQT--- 176
            + I  ++PYW+R  G DH ++  HD GAC+   E    +I   +L    R  L  T   
Sbjct: 592 KRWIQQHFPYWDRRGGRDHIWMTNHDEGACYMPTEIYNSSI---MLTHWGRLDLNHTSNT 648

Query: 177 -FGQRN----------------------HVCLN-EGSITIPPYAPPQKMQAHQIP--PDT 210
            +G  N                      H C + +  + IP + PP+  +   +   P  
Sbjct: 649 AYGPDNYSTGLTWPDINGGRDVTELWAGHPCYDPKKDLVIPGFKPPEHYKRSPLLGFPPY 708

Query: 211 PRSIFVYFRGLFYDVNND--PEGGYYARGARAAVWENFKNNPLFDISTDHP--------- 259
            R I +Y RG   DV     P    Y+RG R  +   +K +       +H          
Sbjct: 709 QRDILLYLRG---DVGKHRLPN---YSRGTRQKL---YKLSQAHGWIAEHRIFIGEKYEL 759

Query: 260 -TTYYEDMQRAIFCLCPLGWAP------------WSPRLVEAVVFGCIPVIIADDIVLPF 306
              Y + + R++FC       P            ++ R  +AV+ GC+P++I D     F
Sbjct: 760 VGDYSDHLARSVFCAVVPAPPPTSKPPNLQPGDGYAMRFEDAVLHGCLPLVIMDRTHAVF 819

Query: 307 ADAIPWEEIGMFVAEEDVPK-LDTILTSIPPEVILRKQRLLA 347
              +  +   + ++E  + + L  +L +I PE I R QR LA
Sbjct: 820 ESILDIDGFSLRISEAALDEHLPALLKAIAPEQIERMQRRLA 861


>gi|224120296|ref|XP_002318294.1| predicted protein [Populus trichocarpa]
 gi|222858967|gb|EEE96514.1| predicted protein [Populus trichocarpa]
          Length = 382

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 85/323 (26%), Positives = 136/323 (42%), Gaps = 50/323 (15%)

Query: 78  LSSPVRTLN-PEEADWFYTPIYPTCDLTPTGLPL-------PFKSPRMMRSAIQLISSNW 129
           + SPV  +N PEEAD FY P++ +  LT   + +       P  S   M+  +       
Sbjct: 53  VGSPVVKVNDPEEADLFYVPVFSSLSLTVNPVQVGKVPVSDPVYSDEKMQDELVEWLEKQ 112

Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGS 189
            YW R  G DH               +  A+ R +  +     L+  FG+   V  ++GS
Sbjct: 113 EYWRRNNGRDHVLFAG----------DPNALYRVLDRVKNAVLLLSDFGR---VRSDQGS 159

Query: 190 IT---IPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENF 246
           +    I PYA    +    I  D  R   ++F G  Y      +GG      R  +++  
Sbjct: 160 LVKDVIVPYAHRINVYNGDIGVD-ERKTLLFFMGNRYR----KDGG----KIRDMLFQLL 210

Query: 247 KNNPLFDISTDHPTTYYED-------MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIA 299
           +     D+   H T   E        M  + FCL P G  P + RL +++V  C+P+I++
Sbjct: 211 EKEE--DVLISHGTQSRESRRTATLGMHTSKFCLNPAGDTPSACRLFDSIVSLCVPLIVS 268

Query: 300 DDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAML 356
           D I LPF D I + +I +FV  E   K   L  +L ++  E IL  Q+      M+    
Sbjct: 269 DSIELPFEDVIDYRKIAIFVDTESSLKPGYLVKLLRAVSTERILEYQK-----EMREVKR 323

Query: 357 FPQPAQPGDAFHQILNGLARKLP 379
           + + + P    ++I   + +KLP
Sbjct: 324 YFEYSDPNGTVNEIWREIGQKLP 346


>gi|357126548|ref|XP_003564949.1| PREDICTED: probable glycosyltransferase At3g42180-like
           [Brachypodium distachyon]
          Length = 432

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 85/311 (27%), Positives = 133/311 (42%), Gaps = 33/311 (10%)

Query: 82  VRTLNPEEADWFYTPIYPTCDLTPTGLPLP---FKSPRMMRSAIQLISSNWP--YWNRTE 136
           VR  +PE A+ F+ P + +      G  +     ++ R+++  ++LI   W   YW R+ 
Sbjct: 124 VRVADPEAAEAFFVPFFSSLSFNVHGRNMTDPDTEADRLLQ--VELIDVLWKSKYWQRSA 181

Query: 137 GADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYA 196
           G DH  V+P      F +  +  +   +L       +V  FG+      +     + PY 
Sbjct: 182 GRDH--VIPMHHPNAFRFLRD-MVNASVL-------IVADFGRYTQELASLRKDVVAPYV 231

Query: 197 PPQKMQAHQIPPD--TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDI 254
                  +  PPD    R   ++FRG         EG   A+ A+       K+   F+ 
Sbjct: 232 HVVDSFINDDPPDPFEARPTLLFFRG---RTVRKAEGKIRAKLAKIL---KDKDGVRFED 285

Query: 255 ST---DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
           S    +   T  E M+ + FCL P G  P S RL +A+V  CIPVI++  I LPF D I 
Sbjct: 286 SLATGEGINTSTEGMRSSKFCLHPAGDTPSSCRLFDAIVSHCIPVIVSSRIELPFEDEID 345

Query: 312 WEEIGMFVAEEDVPKLDTILTSI---PPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFH 368
           + E  +F + E+  K D +L  +   P E  +     L N S      F  P + GDA +
Sbjct: 346 YSEFSLFFSVEEALKPDYLLDQLRQMPKEKWVEMWSKLKNVSSHYE--FQYPTRKGDAVN 403

Query: 369 QILNGLARKLP 379
            I   +  K+P
Sbjct: 404 MIWRQVRHKIP 414


>gi|159473729|ref|XP_001694986.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158276365|gb|EDP02138.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 703

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 101/415 (24%), Positives = 158/415 (38%), Gaps = 91/415 (21%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTH-MFAAEIFMHRFLLSS-PVRTLNPEEADWFYTP 96
           K+KVYVY+LPS    +    D   L   M+ AE+     LL    VRT NP EA  FY P
Sbjct: 272 KIKVYVYDLPSNVVHRREFHDQWSLIDLMYNAELEFTELLLGDWGVRTENPWEAALFYVP 331

Query: 97  IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQE 156
            +        G P       +++ A + +  N P++N T G +H F   +D G C     
Sbjct: 332 TFTYWYTGNVGHPY-----FVIQHATKWLQENSPFFNLTGGRNHIFWATNDRGVCKLQMA 386

Query: 157 EKAIERGI----------LPLLQRATLVQTFGQRNHVCLN-------------------- 186
              ++  I           P   R  + QT     H+ L                     
Sbjct: 387 PPEMQHSIKLVHFGQSPRRPYAARMKVGQTTDPTAHLALRGALPQPGTRFPEFPAEFTAM 446

Query: 187 ------------EGSITIPPYAPPQKMQAH---------QIP-----------PDTP-RS 213
                       E  +  P Y  P  ++ H         Q+P           PD P R+
Sbjct: 447 DILEESEICIRPEKDVVTPNYIMPTWVEPHNYNQVWNVTQLPDGTRQVTRKRGPDVPVRN 506

Query: 214 IFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYE--------- 264
           + +YF G        P+  Y ++G R  + + F     +D    +    Y+         
Sbjct: 507 LTLYFAGF-----TKPQMSY-SQGVRQLIHKLFGPGGKYDPKGPNARPDYKVGGPGGGEA 560

Query: 265 --DMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
              MQ++ FCL P+G + W  RLVEA++ GC+PVII D +     D +P+ E  + V   
Sbjct: 561 ATYMQQSRFCLAPMG-SGWGIRLVEAMISGCVPVIIQDHVYQAHWDVVPFPEFSIRVGRH 619

Query: 323 DVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARK 377
           D+ +L  +L  + P+ +   + L A         F      G A++  +  L ++
Sbjct: 620 DLHRLVELLDDVAPQEL---EELQAGIERYHRAFFWDAQWGGLAYNYTIQALKQR 671


>gi|449487654|ref|XP_004157734.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
           [Cucumis sativus]
          Length = 464

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 102/412 (24%), Positives = 161/412 (39%), Gaps = 70/412 (16%)

Query: 12  FVSADNTPKIERISGSAGDVLEDD--PVGKLKVYVYELPSKYNKKLL------------- 56
           F SA++    +  S    D+L      +  +K+Y+Y++P+++   ++             
Sbjct: 43  FSSANSVIVSDEFSSQDTDLLNSSGKSLSPVKIYLYDVPTRFTYGVIENHGIARGGKPVP 102

Query: 57  -LKDPRCLTHMFAAEIFMHRFLL--------SSPVRTLNPEEADWFYTPIYPTCDLTPTG 107
            + D +   H   AE F+   LL        S+ VR  +PEEAD FY P + +  L    
Sbjct: 103 DVTDLKYPGHQHMAEWFLFTDLLRPESERIGSAVVRVFDPEEADLFYVPFFSSLSLIVNP 162

Query: 108 LPLPFKSPRMMRSAI------QLISSNW----PYWNRTEGADHFFVVPHDFGACFHYQEE 157
           +     S +  R  +      Q     W     YW R+ G DH  +           Q+ 
Sbjct: 163 IRPATGSDQQQRKLVYSDEETQDAFMEWLEKQEYWKRSNGRDHVIIA----------QDP 212

Query: 158 KAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVY 217
            A+ R I  +     LV  FG+      +     I PY+         I  +  R   ++
Sbjct: 213 NALYRLIDRVKNSILLVSDFGRLRADQASLVKDVIVPYSHRINTYTGDIGVEN-RKTLLF 271

Query: 218 FRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYED-------MQRAI 270
           F G  Y      EGG      R  ++   +     D+   H T   E        M  + 
Sbjct: 272 FMGNRYR----KEGG----KIRDMLFNILELEQ--DVIIKHGTQSRESRRAATHGMHTSK 321

Query: 271 FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK---L 327
           FCL P G  P + RL ++VV  C+PVI++D I LPF D I + +I +F       K   L
Sbjct: 322 FCLNPAGDTPSACRLFDSVVSLCVPVIVSDSIELPFEDVIDYSKIAVFFDSVSAVKPEFL 381

Query: 328 DTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
            + L  I  E IL  QR      MK+   + +        ++I   +++KLP
Sbjct: 382 ISKLRRISEERILDYQR-----EMKKIKRYFEYTDSNGTVNEIWRQVSQKLP 428


>gi|116787500|gb|ABK24530.1| unknown [Picea sitchensis]
          Length = 446

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 92/378 (24%), Positives = 151/378 (39%), Gaps = 56/378 (14%)

Query: 40  LKVYVYELPSKYNKKLLLKDPR------------------CLTHMFAAEIFMHRFLLS-- 79
           LKVY+Y+LP ++N  +L K+                     L    + E +M  +LL   
Sbjct: 66  LKVYMYDLPRRFNLGMLKKNSSDLDLPWTSSKIPPWPQRSGLKKQHSIEYWMMVYLLGQH 125

Query: 80  -------SPVRTLNPEEADWFYTPIYPTCDLTPTGL----PLPFKSPRMMRSAIQLISSN 128
                  + VR  +P++AD FY P + +      G     P      ++    + ++  +
Sbjct: 126 VGEEGERTAVRVADPDQADVFYVPFFASLSFNTHGQNMRDPETEFDKKLQIEVVDMLKRS 185

Query: 129 WPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQ--RNHVCLN 186
              W R+ G DH  V+ H     F   E  A             +V  FG+  R+   L 
Sbjct: 186 -KSWQRSGGRDHVIVIHHPNAFRFLRDEVNA----------SIFVVADFGRYPRSVSFLR 234

Query: 187 EGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENF 246
           +  +    +     +      P   R++ +YFRG         EG  + R   A +  N 
Sbjct: 235 KDVVAPYVHVVDTYVNDDSSDPFESRTMLLYFRG---RTKRKDEG--FVRLKLAKILGNH 289

Query: 247 KNNPLFD--ISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL 304
           K     D   +T+      + M+ + FCL P G  P S RL +A+V  C+PVI++D I L
Sbjct: 290 KRVHFEDSLATTEGFEVAKQGMRSSRFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDRIEL 349

Query: 305 PFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPA 361
           PF D I ++E  +F + ++  +   L   L + P E  L+    L    +     +  P 
Sbjct: 350 PFEDEIDYQEFSLFFSVKEALRPGYLMQKLETFPKEKWLKMWNKLKQ--VAHHFEYQYPP 407

Query: 362 QPGDAFHQILNGLARKLP 379
              DA + +   + RKLP
Sbjct: 408 IKDDAVNMLWRQIHRKLP 425


>gi|302793388|ref|XP_002978459.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
 gi|300153808|gb|EFJ20445.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
          Length = 437

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 92/404 (22%), Positives = 167/404 (41%), Gaps = 65/404 (16%)

Query: 13  VSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKD------------- 59
           + +++   I R SG            KLKV++YELP KYN  L  +D             
Sbjct: 39  IRSESRLGISRTSGKNA---------KLKVFMYELPRKYNFGLFDRDGPAQEIPWKNLSN 89

Query: 60  -----PRCLTHMFAAEIFMHRFLLS-------SPVRTLNPEEADWFYTPIYPTCDLTPTG 107
                 + L    + E +M   LL        +  R  +P EAD F+ P + +      G
Sbjct: 90  LPGPHTQGLKKQHSVEYWMTLDLLDEGGREFRAAQRVSDPGEADVFFVPYFASLSFNVFG 149

Query: 108 L----PLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERG 163
           +    P      ++    I+ +S + P++ R+ G DH  V+ H     F + +++     
Sbjct: 150 VSMRDPETEHDKKLQVGMIEYLSKS-PWYQRSGGRDHVLVLHHP--NAFRFLKDR----- 201

Query: 164 ILPLLQRATLVQTFGQ--RNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGL 221
              L     +V  FG+  +    L++  +    +  P         P   R+  ++F+G 
Sbjct: 202 ---LNSSLLVVADFGRFPKGVAALHKDVVAPYSHMVPTYNGDDGSDPFEERTTLLFFQG- 257

Query: 222 FYDVNNDPEGGYYARGARAAVWEN-----FKNNPLFDISTDHPTTYYEDMQRAIFCLCPL 276
              V    +G    R   AA+ EN     F+     + + +      + M+ + FCL P 
Sbjct: 258 --RVKRKDDG--VVRTQLAAILENQPRVHFEEGIATNFTVEQAM---QGMRSSRFCLHPA 310

Query: 277 GWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPP 336
           G  P S RL +A+V  C+PVI++D I LPF D + + E  +F + ++  +   +L ++  
Sbjct: 311 GDTPSSCRLFDAIVSHCVPVIVSDKIELPFEDELDYSEFSLFFSVDEAVRPGFLLGALEK 370

Query: 337 EVILRKQRLLAN-PSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
               R  ++      + R   +  P+Q  DA + + + + +K+P
Sbjct: 371 FSKRRWMKMWRRLKQVTRHFEYQHPSQRDDAVNMLWSQIHKKVP 414


>gi|242046000|ref|XP_002460871.1| hypothetical protein SORBIDRAFT_02g036640 [Sorghum bicolor]
 gi|241924248|gb|EER97392.1| hypothetical protein SORBIDRAFT_02g036640 [Sorghum bicolor]
          Length = 500

 Score = 88.2 bits (217), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 100/378 (26%), Positives = 152/378 (40%), Gaps = 77/378 (20%)

Query: 36  PVGKLKVYVYELPSKYNKKLLL--------------------KDPRCLTHMFAAEIFMHR 75
           P   +K+Y+Y+LP+K+   ++                     +  R   H  +AE ++ +
Sbjct: 100 PPAPVKIYLYDLPAKFTYGVVRSYTATRAPSGSADAAAALPDEQLRYPGHQHSAEWWLFK 159

Query: 76  FLL-----SSPV-RTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSP----------RMMR 119
            LL       PV R  +P +AD FY P + +  L       P +SP              
Sbjct: 160 DLLRRGPRDRPVARVDDPSDADLFYVPFFSSLSLVVN----PIRSPPAANASGAAAAYSD 215

Query: 120 SAIQLISSNW----PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ 175
            A+Q     W    PYW R  G DH F+           Q+  A+ R I  +     LV 
Sbjct: 216 DAMQEELLEWLERQPYWRRHMGRDHVFIC----------QDPNALYRVIDRISNAVLLVS 265

Query: 176 TFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYA 235
            FG+      +     I PY+        ++  D  R + ++F G  Y      EGG   
Sbjct: 266 DFGRLRSDQASLVKDVILPYSHRINSFKGEVGVDG-RPLLLFFMGNRYR----KEGG--- 317

Query: 236 RGARAAVWENFKNNPLFDISTDHPTTYYED-------MQRAIFCLCPLGWAPWSPRLVEA 288
              R A+++  +N    D++  H T   E        M  + FCL P G  P + RL +A
Sbjct: 318 -KVRDALFQILENED--DVTIKHGTQSRESRRAARQGMHSSKFCLHPAGDTPSACRLFDA 374

Query: 289 VVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRL 345
           +V  C+PVI++D I LPF D I + +I +FV      +   L ++L  I  E IL  QR 
Sbjct: 375 LVSLCVPVIVSDYIELPFEDIIDYNKISIFVGTSKAVQPGYLTSMLRRISSERILEYQR- 433

Query: 346 LANPSMKRAMLFPQPAQP 363
                +KR   +  P  P
Sbjct: 434 -ETKKVKRYFEYEDPNGP 450


>gi|224062946|ref|XP_002300942.1| predicted protein [Populus trichocarpa]
 gi|222842668|gb|EEE80215.1| predicted protein [Populus trichocarpa]
          Length = 460

 Score = 88.2 bits (217), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 88/355 (24%), Positives = 137/355 (38%), Gaps = 32/355 (9%)

Query: 40  LKVYVYELPSKYNKKLLLKD-------PRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADW 92
           LK+YVYE       K LL+          CL   +  ++ +H  LL S  RT   EEAD 
Sbjct: 87  LKIYVYEEDEIDGLKELLRGRDGKISADACLKGQWGTQVKIHGLLLESRFRTRKKEEADL 146

Query: 93  FYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
           F+ P Y  C     GL     + + +      + S  PY+ R+ G DH FV P   GA  
Sbjct: 147 FFVPAYVKCVRMMGGL-----NDKEINHTYVKVLSQMPYFRRSGGRDHIFVFPSGAGAHL 201

Query: 153 HYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIP----- 207
                  I R I+      T      +++    N     I P      M   +I      
Sbjct: 202 FRSWATYINRSII----LTTEADRTDKKDTSAFNTWKDIIIPGNVEDGMTKRRIAMVQPL 257

Query: 208 PDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP---TTYYE 264
           P + R     + G         + G       A  + +   +P    S         Y++
Sbjct: 258 PLSKRKYLANYLG-----RAQGKVGRLKLIELAKQYPDKLESPELKFSGPGKFGRMEYFQ 312

Query: 265 DMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDV 324
            ++ A FCL P G + W+ R  E+    C+PVI++D    PF + I + +I +      +
Sbjct: 313 HLRNAKFCLAPRGESSWTLRFYESFFVECVPVILSDQAEFPFQNVIDYTQISIKWPSTRI 372

Query: 325 P-KLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKL 378
             +L   L SIP E I  +Q + A   ++   ++    +   A   I+  L RK+
Sbjct: 373 GLELLEYLESIPDENI--EQMIAAGRQIRCLWVYAPEFESCSAMQGIMWELQRKV 425


>gi|302845052|ref|XP_002954065.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
 gi|300260564|gb|EFJ44782.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
          Length = 600

 Score = 87.8 bits (216), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 89/338 (26%), Positives = 137/338 (40%), Gaps = 53/338 (15%)

Query: 40  LKVYVYELPSKYNKKL-LLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
           +KVY+YELPS        ++  R +  MF         L+SS +RTL+  +AD+FY PI 
Sbjct: 163 VKVYIYELPSNMTSWYPFMRMDRPVHLMFWQR------LMSSGMRTLDGNKADYFYIPIN 216

Query: 99  PTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
                T TG      +   +   +  I   +P+W++  G  H  +   D G         
Sbjct: 217 -----TRTG----SLAREELEWTLPYIKKTYPWWSKDNGNRHLIIHTGDMGI---NDFPL 264

Query: 159 AIERGILPLLQRATLVQTFGQRNH-------------------VCLNEGSITIPPYAPPQ 199
           A  R +   L   T +  +G   +                   V +      + P  P  
Sbjct: 265 ATRRELNESLSNITWLTHWGLHEYHPIAKWYPAHRPGKDIVIPVMIMTQGFHLSPMNP-- 322

Query: 200 KMQAHQIPPDTPR--SIFVYFRGLFYDVNN--DPEGGY-------YARGARAAVWENFKN 248
           +M+A       PR  +  ++F G      +  DP+ G        Y+ G R AV+   +N
Sbjct: 323 RMEAEIKAQGAPRLRNGTLFFAGRICGDRDLPDPKTGKCGPGHEDYSFGVRQAVYLQHRN 382

Query: 249 NPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
              F I     +TY ED+    FCL P+G      R +     GC+PV+I D ++ PF  
Sbjct: 383 VKGFRIVA-WTSTYLEDISSHKFCLAPVG-GGHGKRNILVAFMGCLPVLIGDHVLQPFEP 440

Query: 309 AIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLL 346
            I W    + V E D+P L  IL ++P   +  KQ+ L
Sbjct: 441 EIDWSRFSISVPEADIPDLPRILANVPASEVASKQKRL 478


>gi|412986347|emb|CCO14773.1| predicted protein [Bathycoccus prasinos]
          Length = 445

 Score = 87.8 bits (216), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 102/379 (26%), Positives = 154/379 (40%), Gaps = 37/379 (9%)

Query: 28  AGDVLEDDPVGKLKVYVYELPSKYNKKLLL--------KDPRCLTHMFAAEIFMHRFLLS 79
           A + ++DD    +K+YVY       KKLL         K   C   M+ +++ +  FLL 
Sbjct: 73  AEESIKDDISPPIKIYVYNDAD--VKKLLFPGKETQAYKSGVCGMKMYGSQVHIADFLLK 130

Query: 80  SP-VRTLNPEEADWFYTPIYPTC--DLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTE 136
           S  +RT NP +AD+F+ P +P C  D  P G  L   +   +   +  +    PY  ++ 
Sbjct: 131 SKELRTENPSDADFFFLPGWPKCMLDAPPNGAGL---TDDELAKRLNGVIEKLPYIKKSG 187

Query: 137 GADHFFVVPHDFGACFHYQEEKAIERGILPLLQR------ATLVQTFGQRNHVCLNEGSI 190
           G DH FV P   G   +      I   I    +        TL   F     V L     
Sbjct: 188 GRDHVFVWPSGRGPTLYKNWRCKIPNSIFLTPEGFYTDPYRTLAPYFDPWKDVVL----- 242

Query: 191 TIPPYAPPQKMQAHQIPPDTP-RSIFVYFRGLFYDVNNDPEGGYYARGA-RAAVWENFKN 248
             P +   +K    +    T  R+    F G   D         + +   R  + +  K 
Sbjct: 243 --PGFMDGRKDSYLETNKRTSKRTKLASFAGTVPDGQALKGDEKHVKAHPRERLLKLSKK 300

Query: 249 NP--LFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 306
            P  L  IS   P  Y E +  + FC+ P G +PW+ R  E    GC+PVII+D + LPF
Sbjct: 301 YPDDLLAISGRTPK-YAEILGDSKFCIVPRGLSPWTLRTYETFFAGCVPVIISDSVRLPF 359

Query: 307 ADAIPWEEIGMFVAEEDVPK-LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
            + + W  I +   E  + + L T L SIP E I +  R      ++    +   A   +
Sbjct: 360 QEFLDWSLISIKWPEAKIDESLLTYLKSIPDEEIEKIVR--RGEQVRCVFAYQADATKCN 417

Query: 366 AFHQILNGLARKLPHDKST 384
           AF  I+  L+ K  +  ST
Sbjct: 418 AFSAIMWALSLKDRNVAST 436


>gi|359476146|ref|XP_002285237.2| PREDICTED: probable glycosyltransferase At5g25310-like [Vitis
           vinifera]
 gi|296082059|emb|CBI21064.3| unnamed protein product [Vitis vinifera]
          Length = 478

 Score = 87.8 bits (216), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 42/113 (37%), Positives = 66/113 (58%), Gaps = 2/113 (1%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           YY  M ++ +CLCP G    SPR+VEA+   C+PVI++D  VLPF+D + WE   + V  
Sbjct: 358 YYSLMLQSKYCLCPSGHEVASPRIVEAIYSECVPVILSDHYVLPFSDVLRWEAFSVKVEA 417

Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
            ++P+L  +L +I  E   R +  +   +++R     QPA+  D FH IL+ +
Sbjct: 418 SEIPRLKEVLQAISEEKYTRLKEGVR--AVRRHFELNQPAKRFDVFHMILHSV 468


>gi|297841429|ref|XP_002888596.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297334437|gb|EFH64855.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 429

 Score = 87.8 bits (216), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 91/379 (24%), Positives = 151/379 (39%), Gaps = 53/379 (13%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRC-------------------LTHMFAAEIFMHRFLLS- 79
           L+V++Y+LP K+N  ++  DP                     +    + E ++   LL+ 
Sbjct: 52  LRVFMYDLPRKFNVAMM--DPHSSDVEPLTGKNLPSWPQTSGIKRQHSVEYWLMASLLNG 109

Query: 80  -----SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLP---FKSPRMMRSAIQLISSNWPY 131
                  +R  +P+ AD FY P + +      G  +     +  R ++  +        Y
Sbjct: 110 GDDDNEAIRVFDPDLADAFYVPFFSSLSFNTHGKNMTDPDTEFDRQLQVELMEFLEGSEY 169

Query: 132 WNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSIT 191
           WNR+ G DH  V+P      F +  ++ +   IL       +V  FG+            
Sbjct: 170 WNRSGGKDH--VIPMTHPNAFRFLRQQ-VNASIL-------IVVDFGRYAKDMARLSKDV 219

Query: 192 IPPYAPPQKMQAHQ-----IPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENF 246
           + PY    +    +       P   R+  +YFRG   +     EG    R  +     + 
Sbjct: 220 VSPYVHVVESLNEEDDDGLTDPFEARTTLLYFRG---NTVRKDEGKIRLRLEKLLAGNSD 276

Query: 247 KNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 306
            +      +T +     E M+ + FCL P G  P S RL +A+V  CIPVII+D I LPF
Sbjct: 277 VHFEKSVATTQNIKVSTEGMRSSKFCLHPAGDTPSSCRLFDAIVSHCIPVIISDKIELPF 336

Query: 307 ADAIPWEEIGMFVAEEDVPKLDTILTSI---PPEVILRKQRLLANPSMKRAMLFPQPAQP 363
            D I + E  +F + ++  +   IL  +   P E  L   + L N S      F  P + 
Sbjct: 337 EDEIDYSEFSLFFSIKESLEPGYILNKLRQFPKEKWLEMWKRLKNVS--HHFEFQYPPKR 394

Query: 364 GDAFHQILNGLARKLPHDK 382
            DA + +   +  K+P+ K
Sbjct: 395 EDAVNMLWRQVKHKIPNVK 413


>gi|359473867|ref|XP_002268876.2| PREDICTED: probable glycosyltransferase At5g03795-like [Vitis
           vinifera]
 gi|296085575|emb|CBI29307.3| unnamed protein product [Vitis vinifera]
          Length = 449

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 81/342 (23%), Positives = 147/342 (42%), Gaps = 33/342 (9%)

Query: 37  VGKLKVYVYELPSKYNK-----KLLLKDPRCLTHMFAAEIFMHRF---LLSSPVRTLNPE 88
           + KLK++VY   S         ++ L +P       A     H F   LL S + T +P+
Sbjct: 114 LHKLKIFVYPDASMNQSSSPFARVFLPNPNPFHPKLANYFSEHMFKVALLRSSLLTPHPQ 173

Query: 89  EADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDF 148
           +A +F+ P           +         +      IS  + +WN + G DHF++  H  
Sbjct: 174 DAHFFFLPFSVNTLRNDPRVHSEASISDFVTQYTTRISWEYKFWNASRGTDHFYICCHSV 233

Query: 149 GACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY--APPQKMQAHQI 206
           G     +E  +    +     + T   ++ QR  + ++   + +P     PP+K+    +
Sbjct: 234 G-----REAASKHHDLHNNAIQVTCSSSYFQR--LYISHKDVGLPQVWPRPPEKL----L 282

Query: 207 PPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM 266
            P   R   V+F G    V N      + R    AVW N  +  LF  S   P  Y E +
Sbjct: 283 NPPELRHKLVFFAG---RVQNS-----HIRQELMAVWGNDTDIDLFSGSP--PFPYEEGL 332

Query: 267 QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK 326
           +++ +CL   G+   + R+ +A+ +GCIPVI+++   LPF++ + W +  + ++ + +  
Sbjct: 333 RKSKYCLHVKGYEVNTARVCDAIHYGCIPVIVSNYYDLPFSNVLDWSKFSVIISHKSIAT 392

Query: 327 LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFH 368
           L  IL SI  +  L   + L    ++R   +    +  D+FH
Sbjct: 393 LKKILLSISKQKYLSMYQNLC--LVRRHFAWHTTPRGYDSFH 432


>gi|125572389|gb|EAZ13904.1| hypothetical protein OsJ_03829 [Oryza sativa Japonica Group]
          Length = 504

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 89/346 (25%), Positives = 150/346 (43%), Gaps = 38/346 (10%)

Query: 40  LKVYVYELPSKYNKKLL--LKDP-RCLTHMFAAEIFMHRFLLSSPVR------TLNPEEA 90
           L+VYVYE+PS++   LL   +D  R  +++ +    +HR +    +        + PE  
Sbjct: 113 LRVYVYEMPSRFTYDLLRLFRDSYRETSNLTSNGSPVHRLVEQHSIDYWLWADLIAPESQ 172

Query: 91  DWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLI---SSNWPYWNRTEGADHFFVVPHD 147
                 I  + D       +  +S R++++ I+      ++ P W R+EG DH   V H 
Sbjct: 173 RLLKNVIRHSIDYWLWADLIAPESQRLLKNVIREALKWVTDQPAWQRSEGRDHVIPVHHP 232

Query: 148 FGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIP 207
           +     ++  +   +  + LL        + +   V L +    I PY P   +   +  
Sbjct: 233 WS----FKSVRRFVKKAIWLLPDMDSTGNWYKPGQVYLEKD--VILPYVPNVDLCDSKCV 286

Query: 208 PDTP--RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDI-----STDHPT 260
            +T   RS  ++FRG          GG      R+ +    K+     I       D   
Sbjct: 287 SETQSRRSTLLFFRGRL----RRNAGGKI----RSKLVTELKDAEGIIIEEGTAGADGKA 338

Query: 261 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
                M++++FCL P G  P S RL +A+V GCIPVI++D++ LPF   + + +I +FV+
Sbjct: 339 AAQNGMRKSLFCLNPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEGILDYRKIALFVS 398

Query: 321 EEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP 363
             D  +   L   L SI  + I + Q  L   S  R  L+  PA+P
Sbjct: 399 SNDAVQPGWLVKYLRSIDAKRIRQMQSNLLKYS--RHFLYSSPARP 442


>gi|449455387|ref|XP_004145434.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
           [Cucumis sativus]
          Length = 464

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 101/412 (24%), Positives = 160/412 (38%), Gaps = 70/412 (16%)

Query: 12  FVSADNTPKIERISGSAGDVLEDD--PVGKLKVYVYELPSKYNKKLL------------- 56
           F SA++    +  S    D+L      +  +K+Y+Y++P+++   ++             
Sbjct: 43  FSSANSVIVSDEFSSQDTDLLNSSGKSLSPVKIYLYDVPTRFTYGVIENHGIARGGKPVP 102

Query: 57  -LKDPRCLTHMFAAEIFMHRFLL--------SSPVRTLNPEEADWFYTPIYPTCDLTPTG 107
            + D +   H   AE F+   LL        S+ VR  +PE AD FY P + +  L    
Sbjct: 103 DVTDLKYPGHQHMAEWFLFTDLLRPESERIGSAVVRVFDPEVADLFYVPFFSSLSLIVNP 162

Query: 108 LPLPFKSPRMMRSAI------QLISSNW----PYWNRTEGADHFFVVPHDFGACFHYQEE 157
           +     S +  R  +      Q     W     YW R+ G DH  +           Q+ 
Sbjct: 163 IRPATGSDQQQRKLVYSDEETQDAFMEWLEKQEYWKRSNGRDHVIIA----------QDP 212

Query: 158 KAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVY 217
            A+ R I  +     LV  FG+      +     I PY+         I  +  R   ++
Sbjct: 213 NALYRLIDRVKNSILLVSDFGRLRADQASLVKDVIVPYSHRINTYTGDIGVEN-RKTLLF 271

Query: 218 FRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYED-------MQRAI 270
           F G  Y      EGG      R  ++   +     D+   H T   E        M  + 
Sbjct: 272 FMGNRYR----KEGG----KIRDMLFNILEQEQ--DVIIKHGTQSRESRRAATHGMHTSK 321

Query: 271 FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK---L 327
           FCL P G  P + RL ++VV  C+PVI++D I LPF D I + +I +F       K   L
Sbjct: 322 FCLNPAGDTPSACRLFDSVVSLCVPVIVSDSIELPFEDVIDYSKIAVFFDSVSAVKPEFL 381

Query: 328 DTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
            + L  I  E IL  QR      MK+   + +        ++I   +++KLP
Sbjct: 382 ISKLRRISEERILDYQR-----EMKKIKRYFEYTDSNGTVNEIWRQVSQKLP 428


>gi|168012366|ref|XP_001758873.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162690010|gb|EDQ76379.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 426

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 87/356 (24%), Positives = 144/356 (40%), Gaps = 49/356 (13%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCL---THMFAAEIFMHRF-------------------- 76
           L +YVYE+P+K+   LL      L    ++ +    +HR                     
Sbjct: 38  LNIYVYEMPAKFTTDLLWLFHNSLDQTVNLTSNGSPVHRLIQQHSVDFWLFSDLMTREDK 97

Query: 77  -LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRT 135
            LL +  R  + E+AD +Y P + T    P  L    +S  + R A++ I+     W R+
Sbjct: 98  RLLKTFRRVSHQEQADVYYVPFFTTI---PFFLLSRVQSRTLYREAVKWITRQ-AAWQRS 153

Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
            G DH   V H +    H    +   +  + LL        + +   V L E  + +P  
Sbjct: 154 GGRDHVLAVHHPWSMKSH----RRFLKSAIWLLSDLDSSGNWYKEGEVSL-EKDVIMPYV 208

Query: 196 APPQKMQAHQIPPDTP-RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFD- 253
           A       + +    P R   ++F+G     +         R   AAV  + K   +F  
Sbjct: 209 ANVDACDDNCLATSKPSRKTLLFFQGRIVRGSAGK-----VRSRLAAVLRDEKERIVFQE 263

Query: 254 --ISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
                +   T    M+ ++FCL P G  P S RL +A+V GCIPV+++D++ LPF   + 
Sbjct: 264 GFSGAEGKATAQHGMRSSVFCLSPAGDTPSSARLFDAIVSGCIPVVVSDELELPFEGILD 323

Query: 312 WEEIGMFV----AEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP 363
           + ++ +FV    A +    +  +    P +V   +QRL       R   +  PAQP
Sbjct: 324 YRQVALFVPAARAAQKGWLVAHLRNKTPQDVAAMQQRL---AQYGRHFRYGTPAQP 376


>gi|51535578|dbj|BAD37522.1| pectin-glucuronyltransferase-like [Oryza sativa Japonica Group]
          Length = 342

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 93/348 (26%), Positives = 144/348 (41%), Gaps = 65/348 (18%)

Query: 58  KDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPI--------YPTCDLTPTGLP 109
           + PR LT  +A+E +  + +  S  RT + E+A  F+ PI         P+  L  T   
Sbjct: 20  QTPRKLTGKYASEGYFFQNIRESRFRTDDLEKAHLFFVPISPHKMRGKVPSSLLLVTYAW 79

Query: 110 LPFKSPRMMRSAIQL--------ISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIE 161
           L        RS + L         SS   +W    GADHFFV  HD G        +A E
Sbjct: 80  LILHIRSYDRSILFLDLYWWCPLCSSFRGHWG--VGADHFFVTCHDVGV-------RAFE 130

Query: 162 RGILPLLQRATLVQTFGQRNHVC---LNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYF 218
              LP + + ++         VC    N G I     A PQ +Q   +P           
Sbjct: 131 G--LPFIIKNSI-------RVVCSPSYNAGYIPHKDVALPQILQPFALPAGGN------- 174

Query: 219 RGLFYDVNNDPEGGYYA-------RGARAAVWEN-----FKNNPLFDISTDHPTTYYEDM 266
                D+ N    G++A       R   A +WEN       NN +     +    Y +  
Sbjct: 175 -----DIENRTILGFWAGHRNSKIRVILARIWENDTELAISNNRINRAIGN--LVYQKHF 227

Query: 267 QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK 326
            R  FC+CP G    S R+ +++ +GC+PVI++D   L F+  + W +  + + E DV +
Sbjct: 228 FRTKFCVCPGGSQVNSARISDSIHYGCMPVILSDYYDLRFSGILNWRKFAVVLKESDVYE 287

Query: 327 LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           L +IL S+  +  +   + L    +++   +  P  P DAFH I+  L
Sbjct: 288 LKSILKSLSQKEFVSLHKSLVQ--VQKHFEWHSPPVPYDAFHMIMYEL 333


>gi|225457795|ref|XP_002265632.1| PREDICTED: probable glucuronosyltransferase Os03g0107900 [Vitis
           vinifera]
 gi|302142755|emb|CBI19958.3| unnamed protein product [Vitis vinifera]
          Length = 459

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 92/387 (23%), Positives = 153/387 (39%), Gaps = 39/387 (10%)

Query: 40  LKVYVYELPSKYNKKLLL--KDPR-----CLTHMFAAEIFMHRFLLSSPVRTLNPEEADW 92
           LK+YVYE       K LL  +D       C+T  +  ++ +HR LL S  RT   EEAD 
Sbjct: 86  LKIYVYEEDEIDGLKSLLYGRDGSIPTEVCVTGQWGTQVKIHRLLLKSRFRTRRKEEADL 145

Query: 93  FYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
           F+ P Y  C     GL     + + +      + S  PY+  + G +H FV P   G   
Sbjct: 146 FFVPTYIKCVRMKGGL-----NDKEIDQMYVKVLSQMPYFRLSGGRNHIFVFPSGAGPHL 200

Query: 153 HYQEEKAIERGIL--PLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAH-----Q 205
                  + R I+  P   R        +++    N     I P     +M  +     Q
Sbjct: 201 FKSWATYLNRSIILTPEGDRTD------KKDTSAFNTWKDIIIPGNVADEMTTNGATFVQ 254

Query: 206 IPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP---TTY 262
             P + R     F G         + G       A  + +   +P    S         Y
Sbjct: 255 PLPLSKRKFLANFLG-----RAQRKLGRLQLIELAKQYPDKLESPELQFSGPDKLGRIEY 309

Query: 263 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGM-FVAE 321
           +  ++ A FC  P G + W+ R  E+    C+PVI++D + LPF + I + ++ + + + 
Sbjct: 310 FHHLRNAKFCFAPRGESSWTLRFYESFFVECVPVILSDQVELPFQNVIDYTQVSIKWPSS 369

Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP-- 379
           +  P+L   L SIP +VI  ++ +     ++   ++   ++P  A   I+  L RK+   
Sbjct: 370 QIGPQLLEYLESIPDKVI--EEMISRGREVRCWWVYASESEPCSAMRGIMWELQRKVRKF 427

Query: 380 -HDKSTYLKPGGKFLNWTSGPVGDLKP 405
                T+    G  +N       + KP
Sbjct: 428 HQSTETFWLHNGSIINRNLEQFSNWKP 454


>gi|357157638|ref|XP_003577864.1| PREDICTED: probable glycosyltransferase At3g07620-like
           [Brachypodium distachyon]
          Length = 485

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 75/309 (24%), Positives = 123/309 (39%), Gaps = 27/309 (8%)

Query: 80  SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLP---FKSPRMMRSAIQLISSNWPYWNRTE 136
           +  R  + ++AD  + P + +        P+P       R ++  +    +  P W R+ 
Sbjct: 175 AAARVADAQDADVVFVPFFASLSYNRHSKPVPPEKVSRDRALQEKLVRYLAARPEWKRSG 234

Query: 137 GADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYA 196
           GADH  +V H   +  H +           L     ++  FG+ +    +     + PY 
Sbjct: 235 GADHV-IVAHHPNSLLHARSA---------LFPAVFVLSDFGRYHPRVASLEKDLVAPYR 284

Query: 197 PPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNP----LF 252
              K   +       R   +YFRG  Y      EGG      R  ++   K+       F
Sbjct: 285 HMAKTFVNDTAGFDDRPTLLYFRGAIYR----KEGG----NIRQELYNMLKDEKDVFFSF 336

Query: 253 DISTDHPTTY-YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
               DH  +   + M  + FCL   G  P S RL +A+V  C+PVII+DDI LP+ D + 
Sbjct: 337 GSVQDHGVSKASQGMHSSKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDDIELPYEDVLD 396

Query: 312 WEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLAN-PSMKRAMLFPQPAQPGDAFHQI 370
           + +  +FV   D  K   ++  I      R  R+      + +   +  P++  DA   I
Sbjct: 397 YSKFSIFVRSSDAVKRGYLMKLIRGVTKHRWTRMWKRLKEVDKHFEYQFPSRKDDAVQMI 456

Query: 371 LNGLARKLP 379
              LARK+P
Sbjct: 457 WQALARKVP 465


>gi|242055419|ref|XP_002456855.1| hypothetical protein SORBIDRAFT_03g044100 [Sorghum bicolor]
 gi|241928830|gb|EES01975.1| hypothetical protein SORBIDRAFT_03g044100 [Sorghum bicolor]
          Length = 432

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 93/371 (25%), Positives = 149/371 (40%), Gaps = 49/371 (13%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRC-----------LTHMFAAEIFMHRFLLSSP------- 81
           L+V++Y+LP++++  ++  D              +    + E +M   L           
Sbjct: 61  LRVFMYDLPARFHVAMMGADDGAGFPAWPPSAGGIRRQHSVEYWMMASLQDGAAGPDGGR 120

Query: 82  --VRTLNPEEADWFYTPIYPTCDLTPTGLPLP---FKSPRMMRSAIQLISSNWPYWNRTE 136
             VR  +P+ AD F+ P + +      G  +     ++ R+++  I  I     YW R+ 
Sbjct: 121 EAVRVRDPDAADAFFVPFFSSLSFNVHGRNMTDPDTEADRLLQVEIVDILWKSKYWQRSA 180

Query: 137 GADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYA 196
           G DH  V+P      F +     +   IL       +V  FG+      +     + PY 
Sbjct: 181 GRDH--VIPMHHPNAFRFLRA-MVNASIL-------IVSDFGRYTKELASLRKDVVAPYV 230

Query: 197 PPQKMQAHQIPPD--TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDI 254
                     PPD    R   ++FRG    V  D EG   A+  +       K    F+ 
Sbjct: 231 HVVDSFLDDDPPDPFEARHTLLFFRG--RTVRKD-EGKIRAKLGKVL---KGKEGVRFED 284

Query: 255 ST---DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
           S    D      E M+ + FCL P G  P S RL +A+V  C+PVI++  I LPF D I 
Sbjct: 285 SIATGDGIKISTEGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSSRIELPFEDEID 344

Query: 312 WEEIGMFVAEEDVPKLDTILT---SIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFH 368
           + E  +F + E+  + D +L     IP +  +     L N S      F  P + GDA +
Sbjct: 345 YSEFSLFFSVEEALRPDYLLNQLRQIPKKKWVDMWSKLKNVSHHYE--FQYPPRKGDAVN 402

Query: 369 QILNGLARKLP 379
            I   +  K+P
Sbjct: 403 MIWRQVRHKIP 413


>gi|168043245|ref|XP_001774096.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674642|gb|EDQ61148.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 351

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 86/353 (24%), Positives = 152/353 (43%), Gaps = 37/353 (10%)

Query: 34  DDPVGKLKVYVYELPSKYN--KKLLLKDPRCLTHMFAAEI------FMHRFLLSSPVRTL 85
           +D    L++Y+Y     YN  +     +P  +      E       F +  + S    T 
Sbjct: 15  NDMAKNLRIYLYPASQNYNFTQYEYGMNPSEMVSELGVETSSTTDTFFNLLVESKRFVTD 74

Query: 86  NPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVP 145
           + + A  ++ PI         G   P K    +R  +Q + + +  W+ + GADHF+   
Sbjct: 75  DADGAHLYFLPISIDRVWAAVG---PAKVGEHLRHYLQWLRNTYKLWDLSLGADHFYFSS 131

Query: 146 HDFGACFHYQEEKAIERGILPLLQRATLVQTFG-QRNHVCLNEGSITIPPYAPPQKMQAH 204
           H +    H        R  L L + A  V +   +RN        I++P Y      +  
Sbjct: 132 HAYDPINH--------RNNLELTKNAIQVASSPLRRNQNFFPHKDISLPSYKSQHIAEVQ 183

Query: 205 QIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDI-STDHPTTYY 263
            +   + R   V+      D+  DP          A+V + + ++  F + S D P+  +
Sbjct: 184 NLVGASQRPKLVFVSSPPEDI--DP--------IVASVIQKWTSDSDFHVESADQPSPPF 233

Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV--LPFADAIPWEEIGMFVAE 321
           E +  + FC+     A  +  +V+++  GC+PV+IAD I+  LPF D + W+E  + +  
Sbjct: 234 EKLLSSRFCVSVSPQAMLN--VVDSLRLGCVPVLIADSIIYDLPFQDVLNWKEFSVVLGV 291

Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           ++ P L T+L+SI  +   RK + L + + K  M +  P +P DAFH  L+ L
Sbjct: 292 KESPNLKTLLSSISTDE-YRKMQYLGHQASKH-MEWNDPPKPWDAFHMTLHEL 342


>gi|302847711|ref|XP_002955389.1| hypothetical protein VOLCADRAFT_96311 [Volvox carteri f.
           nagariensis]
 gi|300259231|gb|EFJ43460.1| hypothetical protein VOLCADRAFT_96311 [Volvox carteri f.
           nagariensis]
          Length = 1222

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 94/387 (24%), Positives = 149/387 (38%), Gaps = 80/387 (20%)

Query: 41  KVYVYELP---SKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPI 97
           +VYVY+LP   + +   + L  P          +F  R + ++ VR  + + ADW++ P+
Sbjct: 576 RVYVYDLPPSMTTWRNDMRLDRP-------TLRLFQER-ITATGVRVGDGDTADWYFIPV 627

Query: 98  YPT--CDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQ 155
                CD T             + +A+  I    P+WNRT+G  HF +   D G      
Sbjct: 628 VLRHFCDAT------------TLSAAVSYIREMHPWWNRTQGHRHFVIATGDMG------ 669

Query: 156 EEKAIERGILPLLQRATLVQTFG----------QRNHVCLNEGSITIPPYAPPQKMQAHQ 205
                ERG   L    T V  +G          + +H   N   I +P +    K+    
Sbjct: 670 -RSESERG--HLTANVTFVSYWGLHAPKLSSGWRASH--RNATDIVLPVFLGSPKLSRMG 724

Query: 206 I------------PPDTPRS----IFVYFRGLFYDVNN-----------DPEGGYYARGA 238
           I             P   R     IF +   +  D +             P    Y+ G 
Sbjct: 725 IFTSRLHPKFATKAPHELRERNGPIFFFAGRICGDHSKPQVDGVWPNCKSPHNMGYSGGT 784

Query: 239 RAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 298
           R  +  +  N   + I       Y + +  + FC  P G      R ++AV  GC+PV+I
Sbjct: 785 RQKIHFHHWNRTGYFIQLG-DRHYAKHLLTSKFCFGPTG-GGHGQRQMQAVQAGCVPVVI 842

Query: 299 ADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFP 358
           +DD++  F   + W   G+ +AE D+P++  +L +I PE   RK+ LL   +   A    
Sbjct: 843 SDDVLEAFEPFLDWNTFGVRLAEADIPRMHEVLEAISPEEYARKEVLLRCAAQHMAFSTV 902

Query: 359 QPAQPG-----DAFHQILNGLARKLPH 380
             +  G     DAF  +L  L  K  H
Sbjct: 903 TGSYIGESGRYDAFETLLEILRAKAAH 929



 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 37/124 (29%), Positives = 60/124 (48%), Gaps = 6/124 (4%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +  + +C  P G      R ++AV  GC+PV+I+DD++  F   + W   G+ +AE
Sbjct: 255 YAKHLLTSKYCFGPTG-GGHGQRQMQAVQAGCVPVVISDDVLEAFEPFLDWNTFGVRLAE 313

Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPG-----DAFHQILNGLAR 376
            D+P++  +L +I PE    K+ LL   +   A      +  G     DAF  +L  L  
Sbjct: 314 ADIPRMHEVLEAISPEEYAHKEVLLRCAAQHMAFSTVTGSYIGESGRYDAFETLLEILRA 373

Query: 377 KLPH 380
           K  H
Sbjct: 374 KAAH 377



 Score = 46.6 bits (109), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 32/112 (28%), Positives = 49/112 (43%), Gaps = 21/112 (18%)

Query: 41   KVYVYELPSKYNKKLL-LKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPI-- 97
            +VYVY+LP      L  ++  R  T +F   +       ++ VR  + + ADW++ P+  
Sbjct: 1037 RVYVYDLPPSMTTWLNDMRLDRPTTRLFQERV------TATGVRVGDGDTADWYFIPVTL 1090

Query: 98   YPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFG 149
              TCD T             +  A+  I    P+WNRT G  HF +   D G
Sbjct: 1091 RHTCDAT------------TLSGAMSYIREMHPWWNRTHGHRHFVIATGDGG 1130


>gi|224136688|ref|XP_002322391.1| predicted protein [Populus trichocarpa]
 gi|222869387|gb|EEF06518.1| predicted protein [Populus trichocarpa]
          Length = 457

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 98/381 (25%), Positives = 166/381 (43%), Gaps = 70/381 (18%)

Query: 39  KLKVYVYELPSKYNKKL-----LLKDPRCLT------HMFAAEIFMHRFL-------LSS 80
           K++V++Y+LP K+   +     L +    L+      H    E +M+  L       + S
Sbjct: 71  KVRVFMYDLPKKFTTGIIENHALARGSSDLSKVSYPGHQHMGEWYMYLDLSRPDLDRVGS 130

Query: 81  PVRTLN-PEEADWFYTPIYPTCDL--------TPTGLPLPFKSPRMMRSAIQLISSNWPY 131
           PV  +N PEEAD FY P++ +  L        T  G    +   +M    ++ +     Y
Sbjct: 131 PVVKVNDPEEADLFYVPVFSSLSLIVNPARAGTVPGSDPVYSDEKMQEELVEWLEEQ-EY 189

Query: 132 WNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSI- 190
           W R  G DH  V   D  A +     + ++R    +L    L+  FG+   V  ++GS+ 
Sbjct: 190 WRRNNGRDHV-VFAGDPNALY-----RVLDRVKNVVL----LLSDFGR---VRSDQGSLI 236

Query: 191 --TIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN 248
              I PY+    +    I  +  R   ++F G  Y      +GG      R  +++  + 
Sbjct: 237 KDVIVPYSHRINVYNGDIGVE-ERKTLLFFMGNRYR----KDGG----KIRDLLFQMLEK 287

Query: 249 NPLFDISTDHPTTYYED-------MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 301
               D+   H T   E+       M  + FCL P G  P + RL +++V  C+P+I++D 
Sbjct: 288 EE--DVVIRHGTQSRENRRTATRGMHTSKFCLNPAGDTPSACRLFDSIVSLCVPLIVSDS 345

Query: 302 IVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFP 358
           I LPF D I + +I +FV  E   K   L  +L ++  E IL  Q+ +    +KR  ++ 
Sbjct: 346 IELPFEDVIDYRKIAIFVDTESSLKPGYLVRMLRAVSTEKILEYQKQMRE--VKRYFVY- 402

Query: 359 QPAQPGDAFHQILNGLARKLP 379
             +      ++I   +A+KLP
Sbjct: 403 --SDSNGTVNEIWREVAQKLP 421


>gi|302838103|ref|XP_002950610.1| hypothetical protein VOLCADRAFT_104742 [Volvox carteri f.
           nagariensis]
 gi|300264159|gb|EFJ48356.1| hypothetical protein VOLCADRAFT_104742 [Volvox carteri f.
           nagariensis]
          Length = 1119

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 97/374 (25%), Positives = 144/374 (38%), Gaps = 76/374 (20%)

Query: 40  LKVYVYELPSKYNKKLLLKDP-RCLTHMFAAEI-FMHRFLLSSPVRTLNPEEADWFYTPI 97
           +K+YVY+LPS    +   +D    +  M+ AE+ F  R L    VRT NP EA  FY P 
Sbjct: 298 IKIYVYDLPSIVLHRREFQDQWSYIDLMYNAELEFTERLLGDWSVRTENPWEAALFYVPT 357

Query: 98  YPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEE 157
           +        G P       + +  +  +    P++N T G +H     +D GAC     +
Sbjct: 358 FTYWFTGNVGHPT-----FIFQHVVHHLQHLAPFFNLTGGRNHVMWATNDRGACKLQMSQ 412

Query: 158 -------KAIERGILP---LLQRATL------VQTFGQRNHV------------------ 183
                  K +  G  P    LQ+A +      + T G    +                  
Sbjct: 413 LELQHPIKMVHFGQAPRHAYLQKAHMMGSGHGIDTLGGVASLLGALPHPGHRFEEFPEFT 472

Query: 184 ---CLNEGSITIPP---YAPPQKMQAHQIPPDTPRSI----FVYFRGLFYDVNNDPE--- 230
               L E  I + P      P  +    + P   R +    FV    +     +D E   
Sbjct: 473 AADILEEHEICLRPEKDVVTPNVLHQGWVEPGAYRQVWDVKFVDGERVVTRQQDDLEKRT 532

Query: 231 -----GGY------YARGARAAVWENFKNNPLFD----------ISTDHPTTYYEDMQRA 269
                GGY      Y++G R A+   F     +D          + TD      + M R+
Sbjct: 533 YTLFFGGYTKSIMAYSQGVRQALRSMFGPGGKYDPNGPNARSDFLVTDPRHDAIDLMARS 592

Query: 270 IFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDT 329
            FCL P+G A W  RL EA+V GC+PV+I D +  P  D +P+EE  +  +  DV  L  
Sbjct: 593 KFCLAPMG-AGWGIRLAEAMVRGCVPVVIQDHVYQPLWDVVPFEEFSLRFSRRDVADLVD 651

Query: 330 ILTSIPPEVILRKQ 343
            L  +  E + R Q
Sbjct: 652 HLDDVTSEQLARLQ 665


>gi|297826983|ref|XP_002881374.1| hypothetical protein ARALYDRAFT_482474 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297327213|gb|EFH57633.1| hypothetical protein ARALYDRAFT_482474 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 446

 Score = 85.9 bits (211), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 98/397 (24%), Positives = 164/397 (41%), Gaps = 78/397 (19%)

Query: 31  VLEDDPVG------KLKVYVYELPSKYNKKLL---------LKDP--RCLTHMFAAEIFM 73
           + EDD  G      K++VY+Y LP ++   L+         +K P     T  +     M
Sbjct: 44  IREDDDEGRAPIQPKVRVYMYNLPKRFTYGLIDQHSIARGGIKKPVDDVTTLKYPGHQHM 103

Query: 74  HRFLLSSP-------------VRTLNPEEADWFYTPIYPTCDL-TPTGLPLP----FKSP 115
           H + L S              VR L+P++AD FY P++ +  L    G P+     +   
Sbjct: 104 HEWYLFSDLNRPEVDRSGSPIVRVLDPDDADLFYVPVFSSLSLIVNAGRPVEPGSGYSDE 163

Query: 116 RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ 175
           +M    ++ +     +W R  G DH  V+P          +  A+ R +  +     LV 
Sbjct: 164 KMQEGLMEWLEGQ-EWWRRNGGRDH--VIPAG--------DPNALYRILDRVKNSVLLVA 212

Query: 176 TFGQRNHVCLNEGSIT---IPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGG 232
            FG+  H   ++GS     + PY+    +   +I     R+  ++F G  Y      +GG
Sbjct: 213 DFGRLRH---DQGSFVKDVVIPYSHRVNLFNGEIGV-QDRNTLLFFMGNRYR----KDGG 264

Query: 233 YYARGARAAVWENFKNNPLFDISTDHPTTYYED-------MQRAIFCLCPLGWAPWSPRL 285
                 R  +++  +     D++  H T   E+       M  + FCL P G  P + RL
Sbjct: 265 ----KVRDLLFQVLEKED--DVTIKHGTQSRENRRAATKGMHTSKFCLNPAGDTPSACRL 318

Query: 286 VEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRK 342
            +++V  C+PVI++D I LPF D I + +  +FV      +   L  +L  I  + IL  
Sbjct: 319 FDSIVSLCVPVIVSDSIELPFEDVIDYRKFSIFVEANAALQPGFLVQMLRKIKTKKILEY 378

Query: 343 QRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
           QR      M+    +     P  A  +I   +++KLP
Sbjct: 379 QR-----EMQPVRRYFDYDNPNGAVKEIWRQVSQKLP 410


>gi|356527767|ref|XP_003532479.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
           At5g03795-like [Glycine max]
          Length = 467

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 64/216 (29%), Positives = 96/216 (44%), Gaps = 25/216 (11%)

Query: 125 ISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVC 184
           IS+  PY+NRT GADHF    HD+     Y+    +E  I  L   A + Q F     V 
Sbjct: 223 ISAKIPYFNRTGGADHFLAACHDWAP---YETRHHMEYCIKALCN-ADVTQGFKIGRDVS 278

Query: 185 LNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWE 244
           L E  +       PQ+    + P   P  I  +  G  +          Y        W+
Sbjct: 279 LPEAYVR--SVRDPQRDLGGKPPHQXP--ILAFHAGNMHG---------YLHPILLKHWK 325

Query: 245 NFKNNPLFDISTDHP------TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 298
           +   +P   I    P        Y   M+ + +C+CP G+   SPR+VEA+   C+PVII
Sbjct: 326 D--KDPDMKIYGPMPHGVTSKMNYINHMKNSKYCICPKGYEVNSPRMVEAIFXECVPVII 383

Query: 299 ADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSI 334
           +D+ V  F + + W+   + +AE+D+P L  IL S+
Sbjct: 384 SDNFVPHFFEVLNWDVFSIILAEKDIPNLKQILLSV 419


>gi|357168395|ref|XP_003581626.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10-like
           [Brachypodium distachyon]
          Length = 348

 Score = 85.1 bits (209), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 85/344 (24%), Positives = 141/344 (40%), Gaps = 61/344 (17%)

Query: 73  MHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYW 132
           +H+ LL S  RTL+ +EA  F+ P Y  C      L     + + +      + S  PY+
Sbjct: 42  IHQLLLKSRFRTLDKDEAHLFFVPSYVKCVRMTGAL-----TDKEINQTYVKVLSQMPYF 96

Query: 133 NRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITI 192
            R+ G DH FV P   GA         + R     L R+ ++   G R     ++  I  
Sbjct: 97  RRSGGRDHIFVFPSGAGA--------HLFRSWATFLNRSIILTPEGDRT----DKRGI-- 142

Query: 193 PPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN---N 249
                               S F  ++ +    N D   G   R     + + + +   +
Sbjct: 143 --------------------SAFNTWKDIIIPGNVDDSMGKAGRLKLVELAKQYPDKLES 182

Query: 250 PLFDISTDHP---TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 306
           P   +S         Y++ ++ A FCL P G + W+ R  E+    C+PVI++D++ LPF
Sbjct: 183 PELKLSGPDKLGRIDYFKHLRNAKFCLAPRGESSWTLRFYESFFVECVPVILSDEVELPF 242

Query: 307 ADAIPWEEIGM-FVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQP-AQPG 364
            + I + EI + + + +  P+L   L SIP E   R + ++A     R +    P  +P 
Sbjct: 243 QNMIDYTEISIKWPSSKISPELFEYLESIPEE---RIEEMIARGREVRCLWVYAPDLEPC 299

Query: 365 DAFHQILNGLARKLP-----------HDKSTYLKPGGKFLNWTS 397
            A   IL  L RK+            H++ST  +   +F  WT+
Sbjct: 300 SAMAAILWELQRKVRRFHQSAQTFWLHNRSTVNRDLVEFHQWTT 343


>gi|224142185|ref|XP_002324439.1| predicted protein [Populus trichocarpa]
 gi|222865873|gb|EEF03004.1| predicted protein [Populus trichocarpa]
          Length = 208

 Score = 85.1 bits (209), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 43/114 (37%), Positives = 64/114 (56%), Gaps = 2/114 (1%)

Query: 261 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
            Y   MQR+ +C+CP G+   SPR+VEA+ + C+PVII+D+ V PF D + W    + +A
Sbjct: 92  NYIHHMQRSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFDVLDWGAFSLILA 151

Query: 321 EEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           E+D+  L  IL SIP E  L+ Q  L     +R  L+       D F+  L+ +
Sbjct: 152 EKDISNLKEILLSIPKEKYLQMQ--LGVRKAQRHFLWHASPMKYDLFYMTLHSI 203


>gi|255550902|ref|XP_002516499.1| catalytic, putative [Ricinus communis]
 gi|223544319|gb|EEF45840.1| catalytic, putative [Ricinus communis]
          Length = 601

 Score = 85.1 bits (209), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 41/114 (35%), Positives = 68/114 (59%), Gaps = 2/114 (1%)

Query: 261 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
            Y + M+ + +C+C  G+   SPR+VEA+++ C+PVII+D+ V PF + + WE   +FV 
Sbjct: 479 NYVQHMKSSRYCICARGYEVNSPRIVEAILYECVPVIISDNYVPPFFEVLNWESFAVFVL 538

Query: 321 EEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
           E+D+P L  IL SI PE   R+ ++      +  +   +P +  D FH IL+ +
Sbjct: 539 EKDIPNLKNILLSI-PEKRYREMQMRVKMVQQHFLWHARPVK-YDLFHMILHSV 590



 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 40/132 (30%), Positives = 63/132 (47%), Gaps = 25/132 (18%)

Query: 30  DVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAE-IFMHRFLLSSPVRTLNPE 88
           +++E++    LKVY+Y    K  +K +L  P  L  ++A+E  FM +   S    T    
Sbjct: 319 EIMEEN----LKVYIY----KEGEKPILHQP-VLKGIYASEGWFMKQLEASKKFVTKKSR 369

Query: 89  EADWFYTPIYPTCDLTPTGLPLPFKSP---------RMMRSAIQLISSNWPYWNRTEGAD 139
           +A  FY P       +   L L    P         + +++ + LI + +P+WNRTEG D
Sbjct: 370 KAHLFYLP------FSSRNLELQLYVPDSHSRKNLIKYLKNYLDLIVAKYPFWNRTEGVD 423

Query: 140 HFFVVPHDFGAC 151
           HF V  HD+ A 
Sbjct: 424 HFLVACHDWAAS 435


>gi|357122399|ref|XP_003562903.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
           [Brachypodium distachyon]
          Length = 498

 Score = 85.1 bits (209), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 102/387 (26%), Positives = 154/387 (39%), Gaps = 72/387 (18%)

Query: 40  LKVYVYELPSKYNKKLLL--------------------KDPRCLTHMFAAEIFMHRFLL- 78
           +K+Y+Y+LPSK+   ++                     ++ R   H  +AE ++ + L  
Sbjct: 101 VKIYLYDLPSKFTYGVVRSYMSARAPPGSADAAATLPDEELRYPGHQHSAEWWLFKDLRR 160

Query: 79  ----SSPV-RTLNPEEADWFYTPIYPTCDLTPTGL-PL-----------PFKSPRMMRSA 121
                 PV R  +P EAD FY P + +  L    + PL           P  S   M+  
Sbjct: 161 RGPRERPVARVDDPAEADLFYVPFFSSLSLVVNPIRPLAAANASVAAAEPAYSDEAMQDE 220

Query: 122 IQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRN 181
           +       PYW R  G DH F+           Q+  A+ R +  +     LV  FG+  
Sbjct: 221 LVEWLERQPYWRRHRGRDHVFIC----------QDPNALYRVVDRISNAVLLVSDFGRLR 270

Query: 182 HVCLNEGSITIPPYAPPQKMQAHQIPP-DTPRSIFVYFRGLFYDVNNDPEGGYYARGARA 240
               +     I PY+       H+I P     SI      LF+  N   + G   R    
Sbjct: 271 GDQASLVKDVILPYS-------HRINPFQGDVSIEARPALLFFMGNRYRKEGGKVRDTLF 323

Query: 241 AVWEN-----FKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIP 295
            V EN      K+     +S    T   + M  + FCL P G  P + RL +A+V  C+P
Sbjct: 324 QVLENEGDVIIKHGTQSRVSRRMAT---QGMHSSKFCLHPAGDTPSACRLFDALVSLCVP 380

Query: 296 VIIADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMK 352
           VII+D I LPF D I +  I +FV      +   L ++L  +  E IL  QR      +K
Sbjct: 381 VIISDHIELPFEDVIDYSNISIFVDTSKAVQPGFLTSMLRRVSSERILEYQR-----EIK 435

Query: 353 RAMLFPQPAQPGDAFHQILNGLARKLP 379
           R   + +   P    +QI + ++ K P
Sbjct: 436 RVKHYFEYEDPNGPVNQIWHQVSMKAP 462


>gi|302783725|ref|XP_002973635.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
 gi|300158673|gb|EFJ25295.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
          Length = 452

 Score = 85.1 bits (209), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 90/335 (26%), Positives = 135/335 (40%), Gaps = 67/335 (20%)

Query: 40  LKVYVYELPSKYNKKLL---LKDPRC----------LTHMFAAEIFMHRFLLSSP----- 81
           L+VYV +LP +++  LL    +   C          L    +AE ++ R LL SP     
Sbjct: 56  LRVYVADLPREFHHGLLESYCRSQNCCSTGEYPTNPLLKQHSAEFWLLRDLLDSPSKKKE 115

Query: 82  --VRTLNPEEADWFYTPIYPTCDLT---PTGLPLPFKSPR-------MMRSAIQLISSNW 129
             VR  +   AD  + P +           G    F+            R  ++L++S+ 
Sbjct: 116 NFVRVWDSRLADVVFVPFFAALSAQIQLRGGHRGEFRKKSSKNSDFDRQRRVVELVTSSL 175

Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG----QRNHVCL 185
             W R+ G DH FV+  D  A +H +E+  I   +        LV  FG    +     L
Sbjct: 176 E-WRRSNGVDHVFVLA-DPMAMWHVREQ--ISTAMF-------LVVDFGGWYLEDAKNKL 224

Query: 186 NEGSITIPPYAPPQKM----QAHQIPP-----DTPRSIFVYFRGLFYDVNNDPEGGYYAR 236
           N  +I       P K       H +PP     D  R++ +YFRG      +    G    
Sbjct: 225 NSSTIIQHSQVSPIKDVIIPHTHLLPPLKIADDQHRTVLLYFRG----ARHRHRSGL--- 277

Query: 237 GARAAVWENFKNNPLFDISTDHP-----TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVF 291
             R  +W+   N P   +    P           M+ + FCL P G  P S RL +A+  
Sbjct: 278 -VREKLWKILDNEPEVLLEKGLPDDAGLAEATRGMRSSEFCLTPAGDTPSSCRLYDAIAS 336

Query: 292 GCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK 326
            CIPVI++DDI LPF   + +EE  +FV+  D  +
Sbjct: 337 LCIPVIVSDDIQLPFEGFVNYEEFCVFVSTRDATQ 371


>gi|302787839|ref|XP_002975689.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
 gi|300156690|gb|EFJ23318.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
          Length = 452

 Score = 84.7 bits (208), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 90/335 (26%), Positives = 135/335 (40%), Gaps = 67/335 (20%)

Query: 40  LKVYVYELPSKYNKKLL---LKDPRC----------LTHMFAAEIFMHRFLLSSP----- 81
           L+VYV +LP +++  LL    +   C          L    +AE ++ R LL SP     
Sbjct: 56  LRVYVADLPREFHHGLLESYCRSQNCCSTGEYPTNPLLKQHSAEFWLLRDLLDSPSKKKE 115

Query: 82  --VRTLNPEEADWFYTPIYPTCDLT---PTGLPLPFKSPR-------MMRSAIQLISSNW 129
             VR  +   AD  + P +           G    F+            R  ++L++S+ 
Sbjct: 116 NFVRVWDSRLADVVFVPFFAALSAQIQLRGGHRGEFRKRSSKNSDFDRQRRVVELVTSSL 175

Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG----QRNHVCL 185
             W R+ G DH FV+  D  A +H +E+  I   +        LV  FG    +     L
Sbjct: 176 E-WRRSNGVDHVFVLA-DPMAMWHVREQ--ISTAVF-------LVVDFGGWYLEDAKNKL 224

Query: 186 NEGSITIPPYAPPQKM----QAHQIPP-----DTPRSIFVYFRGLFYDVNNDPEGGYYAR 236
           N  +I       P K       H +PP     D  R++ +YFRG      +    G    
Sbjct: 225 NSSTIIQHSQVSPIKDVIIPHTHLLPPLKIADDQHRTVLLYFRG----ARHRHRSGL--- 277

Query: 237 GARAAVWENFKNNPLFDISTDHP-----TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVF 291
             R  +W+   N P   +    P           M+ + FCL P G  P S RL +A+  
Sbjct: 278 -VREKLWKILDNEPEVLLEEGLPDDAGLAEATRGMRSSEFCLTPAGDTPSSCRLYDAIAS 336

Query: 292 GCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK 326
            CIPVI++DDI LPF   + +EE  +FV+  D  +
Sbjct: 337 LCIPVIVSDDIQLPFEGFVNYEEFCVFVSARDATQ 371


>gi|356565097|ref|XP_003550781.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
           [Glycine max]
          Length = 472

 Score = 84.7 bits (208), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 83/329 (25%), Positives = 131/329 (39%), Gaps = 42/329 (12%)

Query: 80  SPVRTLNPEEADWFYTPIYPTCDL---TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTE 136
           + +R  N  +AD  + P + +      +           RM++  +  +      W R+ 
Sbjct: 166 TAIRVQNSRQADVVFVPFFSSLSYNRHSKIHGKEKVSVNRMLQQRLVQLLMEREEWKRSG 225

Query: 137 GADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY- 195
           G DH  V  H            +I R    L     ++  FG+      N     I PY 
Sbjct: 226 GRDHVIVAHH----------PNSILRARRKLGSAMLVLADFGRYPSQLANIKKDIIAPYR 275

Query: 196 -----APPQKMQAHQIPPDTPRSIFVYFRGLFY--DVNNDPEGGYYARGARAAVWENFKN 248
                 P  +  +++      RS  +YF+G  Y  D     +  YY       V   F +
Sbjct: 276 HLVSTVPRAESASYE-----ERSTLLYFQGAIYRKDGGAIRQKLYYLLKDEKDVHFAFGS 330

Query: 249 NPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
                I  +      + M  + FCL   G  P S RL +A+V  C+PVII+D+I LPF D
Sbjct: 331 -----IRKNGINQASQGMALSKFCLNVAGDTPSSNRLFDAIVSHCVPVIISDEIELPFED 385

Query: 309 AIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
            + + E G+FV   D  +   L  +L SI PE   +    L +  + +   +  P+QPGD
Sbjct: 386 VLDYSEFGLFVHASDAVRKGYLLNLLRSIKPEKWTQMWERLKD--ITQHFEYQYPSQPGD 443

Query: 366 AFHQILNGLARKLP------HDKSTYLKP 388
           A + I   +A K+       H K+ Y +P
Sbjct: 444 AVNMIWEEVAHKISSLQFNLHRKNRYQRP 472


>gi|302843170|ref|XP_002953127.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
 gi|300261514|gb|EFJ45726.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
          Length = 616

 Score = 84.3 bits (207), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 87/298 (29%), Positives = 118/298 (39%), Gaps = 58/298 (19%)

Query: 77  LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQL---ISSNWPYWN 133
           ++SS  RTLN  EAD+FY             +P+ F+   M   A ++   + + WP+WN
Sbjct: 231 IISSGHRTLNGGEADYFY-------------IPVDFR--HMFSEAFEVLEYVQATWPFWN 275

Query: 134 RTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG-QRNH------VCLN 186
            T G         D G C    E K +   +  L   A  +  +G  R H       C  
Sbjct: 276 ATGG---------DLGGC----EGKQLML-LRNLTSAAIWLTPWGLTRKHPRVWWPGCHR 321

Query: 187 EG-SITIPPYAPPQKMQAHQIPPDT---PRSIFVYFRGLFYDVNNDPEGG---------- 232
            G  I IP  A    M    + P     PR+   YF G     N DP+            
Sbjct: 322 PGHDIVIPVLAQANNMIMTPLNPKVKPLPRNTTFYFAGKICGDNKDPKEDTSSWPICQTP 381

Query: 233 ---YYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAV 289
               Y+ G R  V+        F +     ++Y  DM  + FCL P G      R V   
Sbjct: 382 RNPLYSAGVRQLVYFYHSKRDGF-VVRPRSSSYVRDMSTSKFCLAPTG-GGHGKRQVLVG 439

Query: 290 VFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLA 347
            +GCIPV I D ++ PF   + W    + V EEDVP L TIL +I    +   QR LA
Sbjct: 440 RYGCIPVPITDYVLQPFEPELDWPAFSVTVKEEDVPNLHTILAAINDTKLAEMQRALA 497


>gi|159483641|ref|XP_001699869.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158281811|gb|EDP07565.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 427

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 85/314 (27%), Positives = 123/314 (39%), Gaps = 68/314 (21%)

Query: 72  FMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPY 131
           FM R +LS   RT NPEEAD+FY P          G     K   +++  +  IS+ WP+
Sbjct: 125 FMER-ILSGGHRTHNPEEADFFYIP----------GSSRDLKKAFLLQPLLAYISTTWPF 173

Query: 132 WNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQR-----ATLVQTFGQ------- 179
           WN T GA H      D G C             LPL  R      T +Q +G        
Sbjct: 174 WNATGGARHIMPAEGDVGTC------------ELPLKVRLFTANVTWLQFWGMYDFHPHW 221

Query: 180 ----RNHV-CLNEGSITIPPYAPPQKMQAH-QIPPDTP---------RSIFVYFRGLFYD 224
                N + C+  G   + P+     M +H +   +TP         R+   +F G    
Sbjct: 222 TQIFHNRIPCMVPGRDIVVPF---MAMSSHDRFVIETPLHPRNQKRNRTNTFFFAGGVCG 278

Query: 225 VNND----PEGGYYAR-----GARAAVWENFKNNPLFDI--STDHPTTYYEDMQRAIFCL 273
             N     P   YY +     G R AV+ +F N   + +   TD    Y  D   + FCL
Sbjct: 279 SGNKRALPPHCTYYKQVRYSGGVRQAVYLHFHNRTGWRVVPGTD---DYARDYASSRFCL 335

Query: 274 CPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTS 333
              G   W  R + A ++GCIPV   D +   F   + W   G+ + + ++P+L   L +
Sbjct: 336 AAAG-GGWGKRGIVAAMYGCIPVAATDMLYEAFEPEMDWGRFGVRITQAEIPQLADKLEA 394

Query: 334 IPPEVILRKQRLLA 347
                + R Q   A
Sbjct: 395 YSEAEVARMQERTA 408


>gi|226529660|ref|NP_001151894.1| secondary cell wall-related glycosyltransferase family 47 [Zea
           mays]
 gi|195650683|gb|ACG44809.1| secondary cell wall-related glycosyltransferase family 47 [Zea
           mays]
          Length = 488

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 95/400 (23%), Positives = 155/400 (38%), Gaps = 67/400 (16%)

Query: 28  AGDVLEDDPVGKLKVYVYELPSKYNKKLLL--------------------KDPRCLTHMF 67
           A   +  DP   ++V++Y++P +++  LL                     + P  L    
Sbjct: 95  AAGAVRCDPRDAVRVFMYDMPPEFHFGLLGWSPPSPDSVWPDVTAASPPPRYPGGLNQQH 154

Query: 68  AAEIFMHRFLLS------------SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLP---F 112
           + E ++   LLS            S VR  +  +AD  + P + +        P+P    
Sbjct: 155 SVEYWLTLDLLSSSSSSSPPCGRHSAVRVSDSRDADLVFVPFFASLSYNRHYRPVPPEKV 214

Query: 113 KSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRAT 172
              R ++  +    +  P W R  GADH  +V H   +  H +           +L  A 
Sbjct: 215 SRDRALQEKLVRYLAARPEWRRFGGADH-VIVAHHPNSLLHARA----------VLHPAV 263

Query: 173 LVQT-FGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEG 231
            V + FG+      +     I PY    K  A+       R   +YFRG  Y      EG
Sbjct: 264 FVLSDFGRYPPRVASLEKDVIAPYKHMAKTYANDSAGFDDRPTLLYFRGAIY----RKEG 319

Query: 232 G------YYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRL 285
           G      YY       V+ +F +     +     +   + M  + FCL   G  P S RL
Sbjct: 320 GSIRQELYYMLKEEKDVYFSFGS-----VQDHGASKASQGMHSSKFCLNIAGDTPSSNRL 374

Query: 286 VEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRK 342
            +A+V  C+PVII+DDI LP+ D + + +  +FV   D  K   L  +L+ +  +   + 
Sbjct: 375 FDAIVTHCVPVIISDDIELPYEDVLDYSKFSIFVRSSDAVKKGYLMRLLSGVSKQQWTKM 434

Query: 343 QRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDK 382
              L    + +   +  P+Q  DA   I   L+RK+P  K
Sbjct: 435 WDRLKE--VDKHFEYQYPSQKDDAVQMIWQALSRKVPSIK 472


>gi|412988024|emb|CCO19420.1| exostosin-like glycosyltransferase [Bathycoccus prasinos]
          Length = 909

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 86/342 (25%), Positives = 137/342 (40%), Gaps = 70/342 (20%)

Query: 66  MFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSA---- 121
            ++ E  +H FLL S  RT+NPE AD F+ P Y TC       P P  S  ++++     
Sbjct: 502 FYSLENTLHEFLLRSAHRTINPENADVFFIPQYGTCYRLAYQTPSPQVSLSLIKTKPGDR 561

Query: 122 ----------IQLISSNWP-------------YWNRTEGADHFFVVPHDFGACFHYQEEK 158
                     +     N P             Y++R EG DH  +  +D GA  H+ +  
Sbjct: 562 SHAANLFLERVTEYVRNIPFNVINNEKGEIQSYFDRNEGRDHAVIAAYDEGA-VHFPD-- 618

Query: 159 AIERGIL------------------------PLLQRATLVQTFGQRNHVCLNEGSITIPP 194
           +I   I                          L+++  +   +   N        I  PP
Sbjct: 619 SIANAIFITHWGNTGYPRNSSHTAYSPDKWDELVKQGVVTGAWRAYNR----NKDIVAPP 674

Query: 195 YAPPQKMQAHQ------IPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN 248
           ++ P+  +  +        P T R+ F +F G        P G  Y+RG R  V   ++N
Sbjct: 675 WSQPKTNEVREPADVNSWTPATQRTTFCFFSGNL--GLEKPWGEDYSRGLRQKVARRWQN 732

Query: 249 NPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
              FDI + H   Y   ++ + FCL   G   WS  L   +  GCIPVI+ D + +P+  
Sbjct: 733 VYGFDILS-HTDDYLGRIRSSKFCLALPG-DGWSGGLSVYIRNGCIPVIVQDGVDMPWEG 790

Query: 309 A-IPWEEIGMFVAEEDVP-KLDTILTSIPPEVILRKQRLLAN 348
             + + +  + V E DV  +L ++L ++ PE +   Q  L N
Sbjct: 791 TFLDYSKFSIRVREGDVENRLQSVLETVTPEELQNLQNGLKN 832


>gi|414882087|tpg|DAA59218.1| TPA: hypothetical protein ZEAMMB73_484283 [Zea mays]
          Length = 479

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 77/320 (24%), Positives = 127/320 (39%), Gaps = 29/320 (9%)

Query: 80  SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLP---FKSPRMMRSAIQLISSNWPYWNRTE 136
           S VR  +P +AD  + P + +        PLP       + ++  +    +  P W R  
Sbjct: 170 SAVRVTDPRDADLVFVPFFASLSYNRHSRPLPPEKVGRDKALQEKLVGYLTARPEWRRFG 229

Query: 137 GADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYA 196
           GADH  +V H   +  H +   A+   +        ++  FG+      +     I PY 
Sbjct: 230 GADHV-IVAHHPNSLLHARA--ALSPAVF-------VLSDFGRYPPRVASLEKDVIAPYK 279

Query: 197 PPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGG------YYARGARAAVWENFKNNP 250
              K   +       R   +YFRG  Y      EGG      YY       V+ +F +  
Sbjct: 280 HMAKTFVNDSAGFDDRPTLLYFRGAIYR----KEGGTIRQELYYMLKDEKDVYFSFGS-- 333

Query: 251 LFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAI 310
              +     +   + M  + FCL   G  P S R+ +A+V  C+PVII+DDI LP+ D +
Sbjct: 334 ---VQDHGASKASQGMHSSKFCLNIAGDTPSSNRMFDAIVSHCVPVIISDDIELPYEDVL 390

Query: 311 PWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLAN-PSMKRAMLFPQPAQPGDAFHQ 369
            + +  +FV   D  +   ++  +      R   + +    + R   +  P+Q  DA   
Sbjct: 391 DYSKFSIFVRSSDAVEKGHLMRLLSGVSKQRWTEMWSRLREVDRHFEYQYPSQKDDAVQM 450

Query: 370 ILNGLARKLPHDKSTYLKPG 389
           I   L+RK+P  K    + G
Sbjct: 451 IWRSLSRKVPSIKLKVHRSG 470


>gi|357436207|ref|XP_003588379.1| Exostosin-2 [Medicago truncatula]
 gi|355477427|gb|AES58630.1| Exostosin-2 [Medicago truncatula]
          Length = 551

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 81/304 (26%), Positives = 133/304 (43%), Gaps = 55/304 (18%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEI-FMHRFLLSSPVRTLNPEEADWFYTPI 97
           +LKVYVY    +   + +L  P  LT ++A+E  FM     +    T NP++A  FY P 
Sbjct: 253 RLKVYVY----REGARPILHSP-FLTGIYASEGWFMKLMEANKRFVTKNPKKAHLFYLPF 307

Query: 98  YPTCDLTPTGLPLPFKSPRMMRSAIQ-------LISSNWPYWNRTEGADHFFVVPHDFGA 150
             +  +    L +  K+    ++ IQ       +I++   +WNRT GADHF V  HD+  
Sbjct: 308 --SSRMLEEALYV--KNSHSHKNLIQYLHDYVDMIAARHSFWNRTGGADHFLVGCHDWAP 363

Query: 151 CFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPD- 209
                E K      +  L  A + + F     V   + S+      P   ++  QIP   
Sbjct: 364 S----ETKLRLANCIRSLCNADVKEGF-----VFGKDASL------PETYVRNAQIPTRD 408

Query: 210 ------TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDI-----STDH 258
                 + ++   +F G  +          Y R      WEN   +P   I     ++  
Sbjct: 409 LGGNSFSKKTTLAFFAGSMHG---------YVRPILLKHWEN--KDPDMKIFGKLPNSKG 457

Query: 259 PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
            + Y   M+ + +C+C  G+   SPR+VEA+ + C+PVII+D+ V PF + + WE   + 
Sbjct: 458 NSNYIHYMKSSKYCICAKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEVLDWESFSVI 517

Query: 319 VAEE 322
           V + 
Sbjct: 518 VQKH 521


>gi|449452903|ref|XP_004144198.1| PREDICTED: probable glycosyltransferase At3g07620-like [Cucumis
           sativus]
          Length = 429

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 87/374 (23%), Positives = 149/374 (39%), Gaps = 51/374 (13%)

Query: 40  LKVYVYELPSKYNKKLLLKD---------------PR--CLTHMFAAEIFMHRFLL---- 78
           L+VY+Y+LP ++N  +L +                PR   L    + E +M   LL    
Sbjct: 52  LRVYMYDLPRRFNVGILNRRNLDQTPVTASTWPPWPRNSGLKRQHSVEYWMMGSLLHEAT 111

Query: 79  ---SSPVRTLNPEEADWFYTPIYPTCDLTPTGLPL---PFKSPRMMRSAIQLISSNWPYW 132
                 VR ++PE AD F+ P + +      G  +     +    ++  +    S   YW
Sbjct: 112 GDGRDAVRVMDPENADAFFVPFFSSLSFNSHGRNMTDPATEVDHQLQIDLMKFLSESKYW 171

Query: 133 NRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITI 192
            R++G DH  V+P      F +   +        +     +V  FG+      N G   +
Sbjct: 172 QRSKGRDH--VIPMTHPNAFRFLRNQ--------VNASIQIVVDFGRYPKTMSNLGKDVV 221

Query: 193 PPYAPPQKMQAHQIPPD--TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNP 250
            PY           PPD    R   ++F+G  +  ++        R   A + + + +  
Sbjct: 222 APYVHVVSSFIDDNPPDPFESRPTLLFFQGKTFRKDDG-----IIRVKLAKILDGYDDVH 276

Query: 251 LFDISTDHPT--TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
               +    +  T  + M+ + FCL P G  P S RL +A+V  C+PVI++D I LP+ D
Sbjct: 277 YERSAATEKSIKTSSQGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPYED 336

Query: 309 AIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
            I + +  +F + E+  +   +   L   P E  +   + L   S  R   F  P +  D
Sbjct: 337 EIDYSQFTLFFSFEEALQPGYMVEKLREFPKERWIEMWKQLKEIS--RHYEFQYPPKKED 394

Query: 366 AFHQILNGLARKLP 379
           A + +   +  KLP
Sbjct: 395 AVNMLWRQVKHKLP 408


>gi|356519776|ref|XP_003528545.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
           [Glycine max]
          Length = 452

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 82/321 (25%), Positives = 136/321 (42%), Gaps = 53/321 (16%)

Query: 82  VRTLNPEEADWFYTPIYPTCDL---------TPTGLPLPFKSPRMMRSAIQLISSNWPYW 132
           V   +PEEAD F+ P + +  L         + +G   P  S    + A+        YW
Sbjct: 130 VLVADPEEADLFFVPFFSSLSLIVNPVRPPGSNSGSEKPVYSDEENQEALVEWLEKQEYW 189

Query: 133 NRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSIT- 191
            R  G DH  V            +  A+ R I  +     LV  FG+   +  ++GS+  
Sbjct: 190 KRNSGRDHVIVA----------SDPNAMYRVIDRVRNAVLLVSDFGR---LRPDQGSLVK 236

Query: 192 --IPPYAPPQKMQAHQIPPDTP-RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN 248
             + PY+   +++ +Q       R+  ++F G  Y      EGG   R     + EN K 
Sbjct: 237 DVVVPYS--HRIRTYQGDAGVEDRNTLLFFMGNRYR----KEGGKI-RDILFKILENEK- 288

Query: 249 NPLFDISTDHPTTYYED-------MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 301
               D+   H     E        M  + FCL P G  P + RL +A+V  CIPVI++D+
Sbjct: 289 ----DVIIKHGAQSRESRRAASQGMHTSKFCLHPAGDTPSACRLFDAIVSLCIPVIVSDN 344

Query: 302 IVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFP 358
           I LPF D I + ++ +F+      K   L + L ++ P+ +L  Q+      +K    + 
Sbjct: 345 IELPFEDTIDYRKLAVFIETSSAIKPGYLVSKLRALTPDRVLAYQK-----ELKEVKRYF 399

Query: 359 QPAQPGDAFHQILNGLARKLP 379
           +  +P    ++I   +++KLP
Sbjct: 400 EYEEPDGTVNEIWRQVSKKLP 420


>gi|302835860|ref|XP_002949491.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
 gi|300265318|gb|EFJ49510.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
          Length = 554

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 101/357 (28%), Positives = 139/357 (38%), Gaps = 56/357 (15%)

Query: 42  VYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTC 101
           VYVYELP      L  K     TH+    +F  R LL S  R  + + ADW+Y PI    
Sbjct: 121 VYVYELPPHLFTWLNTKRLDRSTHL----MFYQR-LLGSGARVADGDLADWYYIPI---- 171

Query: 102 DLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIE 161
               T     F     ++ AI+ I   +P+WNRT GA HF +   D GA         + 
Sbjct: 172 -RLRTATDSAF-----LKYAIEYIREAYPWWNRTGGARHFVIHTGDLGA-------DEVM 218

Query: 162 RGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVY-FRG 220
             +  +    T +  +G           +T+         +AH+  PD  R+   +  RG
Sbjct: 219 DDVYGMAANMTWLTHWG-----------LTVDKNTSGW-WKAHR--PDKARAGARWGTRG 264

Query: 221 LFY---DVNND--------PEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRA 269
            +Y    VN          P    +  G R  V  +  N   F I T     Y + +  +
Sbjct: 265 GYYTRVSVNRRRGSHMWGPPSPAPHRAGVRQKVHFHHWNRTGFRIVT-FERNYGKALVSS 323

Query: 270 IFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDT 329
            FCL PLG      R +     GCIPV IAD +  PF     W E  +  AE D+P+L  
Sbjct: 324 KFCLAPLG-GGHGQRQIIVSYMGCIPVCIADGVYEPFEPQTDWTEFAVRPAEADIPRLHE 382

Query: 330 ILTSIPPEVILRKQ----RLLANPSMKRAMLFPQPAQPG--DAFHQILNGLARKLPH 380
           IL  I     L +     R  A   +  +M+     + G  DAF   L  L  K  H
Sbjct: 383 ILEGISAGNKLAEMQVALRCAAQHLLYSSMVGGLFGEDGRYDAFETTLEVLRVKAAH 439


>gi|449489243|ref|XP_004158257.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
           At3g07620-like [Cucumis sativus]
          Length = 429

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 87/374 (23%), Positives = 148/374 (39%), Gaps = 51/374 (13%)

Query: 40  LKVYVYELPSKYNKKLLLKD---------------PR--CLTHMFAAEIFMHRFLL---- 78
           L+VY+Y+LP ++N  +L +                PR   L    + E +M   LL    
Sbjct: 52  LRVYMYDLPRRFNVGILNRRNLDQTPVTASTWPPWPRNSGLKRQHSVEYWMMGSLLHEAT 111

Query: 79  ---SSPVRTLNPEEADWFYTPIYPTCDLTPTGLPL---PFKSPRMMRSAIQLISSNWPYW 132
                 VR ++PE AD F+ P + +      G  +     +    ++  +    S   YW
Sbjct: 112 GDGRDAVRVMDPENADAFFVPFFSSLSFNSHGRNMTDPATEVDHQLQIDLMKFLSESKYW 171

Query: 133 NRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITI 192
            R++G DH  V+P      F +   +        +     +V  FG+      N G   +
Sbjct: 172 QRSKGRDH--VIPMTHPNAFRFLRNQ--------VNASIQIVVDFGRYPKTMSNLGKDVV 221

Query: 193 PPYAPPQKMQAHQIPPD--TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNP 250
            PY           PPD    R   ++F+G  +  ++        R   A + + + +  
Sbjct: 222 APYVHVVSSFIDDNPPDPFESRPTLLFFQGKTFRKDDG-----IIRVKLAKILDGYDDVH 276

Query: 251 LFDISTDHPT--TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
               +    +  T  + M+ + FCL P G  P S RL +A+V  C+PVI++D I LP+ D
Sbjct: 277 YERSAATEKSIKTSSQGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPYED 336

Query: 309 AIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
            I + +  +F   E+  +   +   L   P E  +   + L   S  R   F  P +  D
Sbjct: 337 EIDYSQFTLFFXFEEALQPGYMVEKLREFPKERWIEMWKQLKEIS--RHYEFQYPPKKED 394

Query: 366 AFHQILNGLARKLP 379
           A + +   +  KLP
Sbjct: 395 AVNMLWRQVKHKLP 408


>gi|159490316|ref|XP_001703125.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158270755|gb|EDO96590.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 528

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 85/315 (26%), Positives = 133/315 (42%), Gaps = 49/315 (15%)

Query: 42  VYVYELPSK----YNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPI 97
           VYVYELP +    YN + L +     THM    +F  R LLSS  R  + ++AD++Y P+
Sbjct: 181 VYVYELPPELSAWYNHRRLDRP----THM----LFAQR-LLSSGARIADGDQADYYYIPV 231

Query: 98  YPTCDLTPTGLPLP-FKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQE 156
                +     P   F S   +     +I +    W R E   +   +  +     H+  
Sbjct: 232 RQRSTMFALAAPCSSFMSAHTLVPGRFMIRTGCGDWGRAEMDPNLLQLTENMTWLTHWGL 291

Query: 157 EKAIERGILPLL-----QRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTP 211
                  IL  +     ++  +V  F    H    + S+   P  P  K +    P D  
Sbjct: 292 TTDRHDKILRWVPSYRPEKDVVVPVFISPGH--FVKFSMIHTPLNPANKAK----PRDKA 345

Query: 212 RSIF---VYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQR 268
           R  F   + F   +  V++    GY+   AR+                     Y + + R
Sbjct: 346 RFFFAGRICFNSKWVFVSHWNRSGYHV--ARSE------------------KRYGQYLAR 385

Query: 269 AIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLD 328
           +++CL P G A    R ++A+  GC+PV IAD +  PF  A+ W + G+ VAE DVP++ 
Sbjct: 386 SLYCLAPPG-AGHGQRQIQALFMGCVPVTIADGVAEPFEPAVNWTDWGVRVAEADVPQMH 444

Query: 329 TILTSIPPEVILRKQ 343
           T+L  I PE +  KQ
Sbjct: 445 TLLDDIGPEQLAVKQ 459


>gi|224093268|ref|XP_002309859.1| predicted protein [Populus trichocarpa]
 gi|222852762|gb|EEE90309.1| predicted protein [Populus trichocarpa]
          Length = 420

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 75/307 (24%), Positives = 129/307 (42%), Gaps = 59/307 (19%)

Query: 59  DPR-CLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRM 117
           DPR  L   +A+E +  + L+ S   T +P  A  F  P+  +C  T        +  + 
Sbjct: 101 DPRDKLKRKYASEHYFLKNLIPSSFFTDDPTVAHLFLIPL--SCKKTGG------REEKD 152

Query: 118 MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGA-----------------CFHYQEEKAI 160
           + + ++ + S++PYWNRT GADHF+   H   +                 C    + K I
Sbjct: 153 IENYVKSLISSYPYWNRTLGADHFYFSCHGIDSGTIEEVPLLMKNVIRLVCSPSYDSKYI 212

Query: 161 ERGILPLLQRATLVQTFGQ---RNHVCLNEGSITIPPYAPPQKMQ----AHQIPPDTPRS 213
               + L Q   L    G         ++   +  P    P++ +    A  +  D  ++
Sbjct: 213 PHKDISLPQTLELSLHDGDDVWSRSTVMSRPLMIYPEMMLPRRTKLGFWAGSLNSDVRKN 272

Query: 214 IFVYFRGLFYDVNNDPEGGY--YARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIF 271
           + V+++G        PE  +  + +  +AA+ + ++N                ++  + F
Sbjct: 273 LQVFYKGA-------PEFNFHFFDKMKKAAILDAYEN----------------ELYGSKF 309

Query: 272 CLCPLGWAP-WSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTI 330
           C+CP G     S  L E++ FGC+PVI+ D    PF D + W    + + EE VP L+ I
Sbjct: 310 CICPRGNNHVGSVCLTESMTFGCVPVILHDYYDFPFNDVLDWNNFSVILKEEHVPDLEKI 369

Query: 331 LTSIPPE 337
           L  IP E
Sbjct: 370 LKGIPEE 376


>gi|226508962|ref|NP_001152546.1| exostosin-like [Zea mays]
 gi|195657373|gb|ACG48154.1| exostosin-like [Zea mays]
 gi|414878975|tpg|DAA56106.1| TPA: exostosin-like protein [Zea mays]
          Length = 427

 Score = 82.0 bits (201), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 83/314 (26%), Positives = 133/314 (42%), Gaps = 39/314 (12%)

Query: 82  VRTLNPEEADWFYTPIYPTCDLTPTGLPLP---FKSPRMMRSAIQLISSNWP--YWNRTE 136
           VR  +P+ AD F+ P + +      G  +     ++ R+++  ++L+   W   YW R+ 
Sbjct: 118 VRVRDPDAADAFFVPFFSSLSFNVHGRNMTDPDTEADRLLQ--VELVDILWKSKYWQRSA 175

Query: 137 GADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYA 196
           G DH  V+P      F +     +   IL       +V  FG+      +     + PY 
Sbjct: 176 GRDH--VIPMHHPNAFRFL-RAMVNASIL-------IVSDFGRYTKELASLRKDVVAPYV 225

Query: 197 PPQKMQAHQIPPD--TPRSIFVYFRGLFYDVNNDPEGGYYA------RGARAAVWEN-FK 247
                     PPD    R   ++FRG    V  D EG   +      +G     +E+   
Sbjct: 226 HVVGSFLDDDPPDPFEARHTLLFFRG--RTVRKD-EGKIRSKLEKILKGKEGVRFEDSIA 282

Query: 248 NNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFA 307
                +IST       E M+ + FCL P G  P S RL +A+V  C+PVI++  I LPF 
Sbjct: 283 TGDGINIST-------EGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSSRIELPFE 335

Query: 308 DAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRK--QRLLANPSMKRAMLFPQPAQPGD 365
           D I + E  +F + E+  + D +L  +  +V  RK     L   ++     F  P + GD
Sbjct: 336 DEIDYSEFSLFFSVEEALRPDYLLNEL-RQVPKRKWVDMWLKLKNVSHHYEFQYPPRKGD 394

Query: 366 AFHQILNGLARKLP 379
           A + I   +  K+P
Sbjct: 395 AVNMIWRQVRHKIP 408


>gi|326521044|dbj|BAJ92885.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 289

 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 79/281 (28%), Positives = 118/281 (41%), Gaps = 51/281 (18%)

Query: 64  THMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLT----------PTGLPL-PF 112
            +M  A + ++  +L+SP RTLN +EAD+FY P+  +C +T          P  L L  +
Sbjct: 4   NYMDTATMALYESILASPHRTLNGDEADYFYVPVLDSCLITRSDDAPHLRMPEDLRLRSY 63

Query: 113 KSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQE-------------EKA 159
            +    R A   I+  +PYWNRT G DH +    D GAC+  +E                
Sbjct: 64  HTLEYYRKAYDHIAQRYPYWNRTSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWGNTNTK 123

Query: 160 IERGILPLLQRATLVQTFGQR-NHVCLN-EGSITIPPYAPPQ------KMQAHQIPPDTP 211
            E+              F +R NH C +    + +P +  P       K+ A    P   
Sbjct: 124 HEKSTTAYWADNWDDIPFDRRGNHPCFDPRKDLVLPAWKEPNPGAIWLKLWAR---PKIN 180

Query: 212 RSIFVYFRGLFYDVNND--PEGGYYARGARAAVWENFKNNP-----LFDISTDHPTT--- 261
           R+   YF G       +  PE   Y+ G R  +   F + P     L    T + T    
Sbjct: 181 RTTLFYFNGNLGPAYEEGRPEDT-YSMGIRQKLAAEFGSTPNKQGKLGRQQTANVTVTYL 239

Query: 262 ----YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 298
               YYE++  +IFC    G   WS R+ ++++ GCIPVII
Sbjct: 240 KSEMYYEELASSIFCGVLPG-DGWSGRMEDSMLQGCIPVII 279


>gi|255543306|ref|XP_002512716.1| catalytic, putative [Ricinus communis]
 gi|223548677|gb|EEF50168.1| catalytic, putative [Ricinus communis]
          Length = 426

 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 74/275 (26%), Positives = 116/275 (42%), Gaps = 26/275 (9%)

Query: 63  LTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAI 122
           L + F+  IF    L SS V TL+P +A +F+ P                     +    
Sbjct: 126 LGNYFSEHIFKVALLRSSLV-TLDPAKALFFFLPFSINNLRNDPRFHSEESISEFVAHYT 184

Query: 123 QLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNH 182
             IS  + YWN + GADHF+V  H  G         A+    + L   ++  Q F     
Sbjct: 185 TTISQRFSYWNASAGADHFYVCCHSVGRQA-ASRHPALHNNAIQLTCSSSYFQRF----- 238

Query: 183 VCLNEGSITIPPY--APPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARA 240
             ++   + +P     PPQ      + P   R   VYF G    V N        R    
Sbjct: 239 -FVSHKDVGLPQVWPRPPQTA----LNPPHARHRLVYFAG---RVQNSQ-----VRRELV 285

Query: 241 AVWENFKNNPLFDISTDHPTTYYED-MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIA 299
            +W    N+   DI    P+  YE+  +R+ +CL   G+   + R+ +++ +GCIPVII+
Sbjct: 286 NLW---GNDTEMDIINGSPSFPYEEGFKRSKYCLHVKGYEVNTARVSDSIHYGCIPVIIS 342

Query: 300 DDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSI 334
           +   LPFA  + W +  + + + D+P L T L +I
Sbjct: 343 NYYDLPFATVLDWSKFSVVINQADIPFLKTTLLAI 377


>gi|356570361|ref|XP_003553358.1| PREDICTED: probable glycosyltransferase At3g07620-like [Glycine
           max]
          Length = 520

 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 75/311 (24%), Positives = 121/311 (38%), Gaps = 31/311 (9%)

Query: 80  SPVRTLNPEEADWFYTPIYPTC---DLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTE 136
           S +R  N  EAD  + P + +     L+ TG        ++++  +    +    W R+ 
Sbjct: 197 SVIRVRNSSEADVIFVPFFSSLCYNRLSKTGPHEKRSRNKLLQEKLVKYVTAQEEWKRSG 256

Query: 137 GADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYA 196
           G DH  +  H            ++    + L     ++  FG+      N     I PY 
Sbjct: 257 GKDHVILAHH----------PNSMLDARMKLWPGTFILSDFGRYPTNIANVEKDVIAPYK 306

Query: 197 PPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN-----PL 251
                  +       R+  +YF+G  Y      +GG+     R  ++   KN        
Sbjct: 307 HVVGSYDNDQSSFDSRTTLLYFQGAIYR----KDGGH----VRHELYYLLKNEKDVHFSF 358

Query: 252 FDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
             +         E M+ + FCL   G  P S RL +A+   C+PVII+DDI LP+ D + 
Sbjct: 359 GSVQKGGVRKATEGMRSSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDDIELPYEDVLD 418

Query: 312 WEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFH 368
           + +  +FV   D  K   L   + SI  E   R    L    ++    F  P++ GDA  
Sbjct: 419 YSQFCIFVRTRDALKKRYLINFIRSIGKEEWTRMWNRL--KEVESFFEFQFPSKEGDAVQ 476

Query: 369 QILNGLARKLP 379
            I   +ARK+P
Sbjct: 477 MIWKAIARKVP 487


>gi|357478277|ref|XP_003609424.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
 gi|355510479|gb|AES91621.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
          Length = 229

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 43/115 (37%), Positives = 67/115 (58%), Gaps = 6/115 (5%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + M  + FCLCP G    SPR+VEA+  GC+PVII D+  LPF+D + W +  M +A 
Sbjct: 102 YTKLMGLSKFCLCPSGHEVASPRVVEAIYAGCVPVIICDNYSLPFSDVLNWSQFSMEIAV 161

Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANP--SMKRAMLFPQPAQPGDAFHQILNGL 374
           + +P++ TIL +I       K R+L +    +++     +PA+P D  H IL+ +
Sbjct: 162 DRIPEIKTILQNITET----KYRVLYSNVRRVRKHFEMNRPAKPFDLIHMILHSV 212


>gi|223975191|gb|ACN31783.1| unknown [Zea mays]
 gi|414887161|tpg|DAA63175.1| TPA: exostosin-like protein [Zea mays]
          Length = 500

 Score = 81.6 bits (200), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 97/386 (25%), Positives = 154/386 (39%), Gaps = 72/386 (18%)

Query: 40  LKVYVYELPSKYNKKLL--LKDPRCLT------------------HMFAAEIFMHRFLL- 78
           +K+Y+Y+LP+K+   ++   K  R  +                  H  +AE ++ + LL 
Sbjct: 104 VKIYLYDLPAKFTYGVVRSYKAARATSGSADAAATLPDEQLRYPGHQHSAEWWLFKDLLR 163

Query: 79  ----SSPV-RTLNPEEADWFYTPIYPTCDL--TPTGLPLPFK--------SPRMMRSAIQ 123
                 PV R  +P +AD FY P + +  L   P   PL           S   ++  + 
Sbjct: 164 RRPQDRPVARVDDPSDADLFYVPFFSSLSLVVNPIRSPLAANASGAGTAYSDEALQEELL 223

Query: 124 LISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHV 183
                  YW R  G DH F+           Q+  A+ R +  +     LV  FG+    
Sbjct: 224 EWLERQLYWQRHRGRDHVFIC----------QDPNALYRVVDRISNAVLLVSDFGRLRSD 273

Query: 184 CLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVW 243
             +     I PY+        ++  D  R   ++F G  Y      EGG      R A++
Sbjct: 274 QASLVKDVILPYSHRINSFKGEVGVDG-RPSLLFFMGNRYR----KEGG----KVRDALF 324

Query: 244 ENFKNNPLFDISTDHPTTYYED-------MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPV 296
           +  +N    D++  H T   E        M  + FCL P G  P + RL +A+V  C+PV
Sbjct: 325 QILENED--DVTIKHGTQSRESRREATQGMHSSKFCLHPAGDTPSACRLFDALVSLCVPV 382

Query: 297 IIADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKR 353
           I +D I LPF D I + +I +FV      +   L + L  I  E IL  QR      +K+
Sbjct: 383 IASDYIELPFEDIIDYNKISIFVGTSKAVQPGYLTSTLRRISSERILEYQR-----EIKK 437

Query: 354 AMLFPQPAQPGDAFHQILNGLARKLP 379
              + +   P    ++I   ++ K P
Sbjct: 438 VRHYFEYEDPNGPVNEIWRQVSLKAP 463


>gi|307110230|gb|EFN58466.1| hypothetical protein CHLNCDRAFT_140476 [Chlorella variabilis]
          Length = 632

 Score = 81.6 bits (200), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 87/344 (25%), Positives = 134/344 (38%), Gaps = 53/344 (15%)

Query: 55  LLLKDPRCLTHM----FAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPT---- 106
           + L+  R   H+    +A E      LL S  RTL+PEEAD+FY P++ +C + P     
Sbjct: 220 VCLRAQRGAHHLWEWTYALEAGFLEMLLQSEHRTLDPEEADFFYVPVFTSCFINPVRDGA 279

Query: 107 -GLPLPF---------KSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQE 156
             L   F          +  M+  A   + + +PYW R  G DH ++V HD  +C+    
Sbjct: 280 DSLRDFFYGVSHNRVQGAANMLLEAYHWVQAMFPYWERRGGRDHIWLVTHDEASCWVPAA 339

Query: 157 EKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPP---QKMQAHQ-------- 205
            ++    IL    R     T G      +    +T P + P     K+   Q        
Sbjct: 340 IRSTSI-ILSHWGRMDAHHTSGTGYSADVYSNDVTHPQFEPDGFLGKLNLTQPCYDPVKD 398

Query: 206 --IP----------------PDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAV---WE 244
             +P                P   R+   + RG     N     G   R A+AA    W 
Sbjct: 399 LVVPLMKTPEHYRLSPLVGAPPRQRTWLAFHRGRVQADNPPYSRGIRQRLAKAAAEGGWL 458

Query: 245 NFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL 304
                 + +  T     Y E +  ++FC    G   WS R+ +A++ GCIPV+I D++ +
Sbjct: 459 EKHKIAVGEYDTLQ-GDYSELLASSVFCPVIPG-DGWSARMDDAMLHGCIPVLIMDEVQV 516

Query: 305 PFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLAN 348
            F   +      + + E D  KL  IL ++  E     QR LA 
Sbjct: 517 SFESVVDLSTFTIRIPEADAEKLPDILQAVTQERREEMQRALAR 560


>gi|195611602|gb|ACG27631.1| exostosin-like [Zea mays]
          Length = 456

 Score = 81.6 bits (200), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 94/351 (26%), Positives = 140/351 (39%), Gaps = 67/351 (19%)

Query: 40  LKVYVYELPSKY------------------NKKLLLKDP--RCLTHMFAAEIFMHRFLL- 78
           +K+Y+Y+LP+K+                  N    L D   R   H  +AE ++ + LL 
Sbjct: 60  VKIYLYDLPAKFTYGVVRSYKAARATSGSANAAATLPDEQLRYPGHQHSAEWWLFKDLLR 119

Query: 79  ----SSPV-RTLNPEEADWFYTPIYPTCDL--TPTGLPLPFK--------SPRMMRSAIQ 123
                 PV R  +P +AD FY P + +  L   P   PL           S   ++  + 
Sbjct: 120 RRPQDRPVARVDDPSDADLFYVPFFSSLSLVVNPIRSPLAANASGAGTAYSDEALQEELL 179

Query: 124 LISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHV 183
                  YW R  G DH F+           Q+  A+ R +  +     LV  FG+    
Sbjct: 180 EWLERQLYWQRHRGRDHVFIC----------QDPNALYRVVDRISNAVLLVSDFGRLRSD 229

Query: 184 CLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVW 243
             +     I PY+        ++  D  R   ++F G  Y      EGG      R A++
Sbjct: 230 QASLVKDVILPYSHRINSFKGEVGVDG-RPSLLFFMGNRYR----KEGG----KVRDALF 280

Query: 244 ENFKNNPLFDISTDHPTTYYED-------MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPV 296
           +  +N    D++  H T   E        M  + FCL P G  P + RL +A+V  C+PV
Sbjct: 281 QILENED--DVTIKHGTQSRESRRAATQGMHSSKFCLHPAGDTPSACRLFDALVSLCVPV 338

Query: 297 IIADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQR 344
           I +D I LPF D I + +I +FV      +   L + L  I  E IL  QR
Sbjct: 339 IASDYIELPFEDIIDYNKISIFVGTSKAVQPGYLTSTLRRISSERILEYQR 389


>gi|255580236|ref|XP_002530948.1| catalytic, putative [Ricinus communis]
 gi|223529463|gb|EEF31420.1| catalytic, putative [Ricinus communis]
          Length = 512

 Score = 81.6 bits (200), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 89/380 (23%), Positives = 149/380 (39%), Gaps = 62/380 (16%)

Query: 40  LKVYVYELPSKYNKKLL------------LKD-----PRCLTHMFAAEIFMHRFLLS--- 79
           LKV++Y+LP +++ +LL            L+      P  L    + E ++   LL+   
Sbjct: 110 LKVFMYDLPPEFHFELLDWKAQGDSVWPDLRTKIPGYPGGLNLQHSIEYWLTLDLLASEI 169

Query: 80  -------SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPF----KSPRMMRSAIQLISSN 128
                  S +R  N  EAD  + P + +          P     K+  +    ++ ++S 
Sbjct: 170 SGIPRAGSAIRVRNSSEADVIFVPFFSSLSYNRYSKVNPHQKRSKNKLLQEKLVRYVTSQ 229

Query: 129 WPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG 188
              W R++G DH  +  H            ++    + L     ++  FG+      N  
Sbjct: 230 ME-WKRSQGQDHIILAHH----------PNSMLDARMKLWPALFILADFGRYPPNIANVD 278

Query: 189 SITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN 248
              I PY    +  A        R   +YF+G  Y      +GG+    AR  ++   K+
Sbjct: 279 KDLIAPYKHVIRSYADDSSTFDSRPTLLYFQGAIY----RKDGGF----ARQELFYLLKD 330

Query: 249 NP-----LFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 303
                     +  D      + M  + FCL   G  P S RL +A+   C+PVII+DDI 
Sbjct: 331 EKDVHFQFGSVQKDGINKASQGMHTSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDDIE 390

Query: 304 LPFADAIPWEEIGMFVAEEDVPKLDTILTSI----PPEVILRKQRLLANPSMKRAMLFPQ 359
           LP+ D + + +  +FV   D  K   ++  I      E     Q+L     ++R   F  
Sbjct: 391 LPYEDVLDYSQFCIFVRTSDAIKEKFLINLIRGIGKDEWTQMWQKL---KEVERFFEFQY 447

Query: 360 PAQPGDAFHQILNGLARKLP 379
           P++ GDA   I   +ARK+P
Sbjct: 448 PSKEGDAVQMIWQAVARKVP 467


>gi|326432404|gb|EGD77974.1| hypothetical protein PTSG_09607 [Salpingoeca sp. ATCC 50818]
          Length = 377

 Score = 81.6 bits (200), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 72/232 (31%), Positives = 96/232 (41%), Gaps = 22/232 (9%)

Query: 138 ADHFFVVPHDFGACFHYQEE----KAIERGILPLLQRATLVQTFGQRNHVCLNEGS-ITI 192
           A    V+  D G C ++ E     + ++R I   L+ ATL+   G   + C   G  + I
Sbjct: 116 ARFLVVLSSDHGPCHNFNEHLTGRRLVKRWIDRSLENATLLMNDGSLVNKCYRPGKDVVI 175

Query: 193 PPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLF 252
           PP             P T R  F +F G    +             R  +     N    
Sbjct: 176 PPSTWIGNATFACSRPITDRKHFAFFAGAASSL------------IREYIINELGNEDWL 223

Query: 253 DISTD-HPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
            I  D     Y  +M  A+FCL P G A WSPRLVEA+  GCIPVIIAD    PF D + 
Sbjct: 224 FIPHDLQHEEYMCEMGNAVFCLAPRGRAAWSPRLVEALEAGCIPVIIADMNHEPFHDVLD 283

Query: 312 WEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAML-FPQPAQ 362
           +    + V E+ +  L   L SI    +    RL AN    RA   +P PA+
Sbjct: 284 YSTFTVQVHEDKLETLGEQLHSISSGQV---ARLHANGQRARAHFRYPPPAR 332


>gi|115487106|ref|NP_001066040.1| Os12g0124400 [Oryza sativa Japonica Group]
 gi|77552918|gb|ABA95714.1| secondary cell wall-related glycosyltransferase family 47,
           putative, expressed [Oryza sativa Japonica Group]
 gi|113648547|dbj|BAF29059.1| Os12g0124400 [Oryza sativa Japonica Group]
 gi|215678612|dbj|BAG92267.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218186350|gb|EEC68777.1| hypothetical protein OsI_37312 [Oryza sativa Indica Group]
          Length = 475

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 75/310 (24%), Positives = 127/310 (40%), Gaps = 29/310 (9%)

Query: 80  SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLP----FKSPRMMRSAIQLISSNWPYWNRT 135
           + VR  +  +AD  + P + +         +P     +   +    ++ + +  P W R+
Sbjct: 166 AAVRVADSRDADVVFVPFFASLSYNRHSRVVPPEKVSRDKELQEKLVRYLMAQ-PEWKRS 224

Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
            GADH  +V H   +  H +        + P++    ++  FG+ +    +     I PY
Sbjct: 225 GGADHV-IVAHHPNSLLHARSV------LFPVV---FVLSDFGRYHPRVASLEKDVIAPY 274

Query: 196 APPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNP----L 251
               K   +       R   +YFRG  +      EGG      R  ++   K+       
Sbjct: 275 KHMAKTFVNDSAGFDDRPTLLYFRGAIFR----KEGG----NIRQELYYMLKDEKDVYFA 326

Query: 252 FDISTDH-PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAI 310
           F    DH  +   + M  + FCL   G  P S RL +A+V  C+PVII+DDI LP+ DA+
Sbjct: 327 FGSVQDHGASKASKGMHASKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDDIELPYEDAL 386

Query: 311 PWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLAN-PSMKRAMLFPQPAQPGDAFHQ 369
            + +  +FV   D  K   ++  I      +  R+      + +   +  P+Q  DA   
Sbjct: 387 DYSKFSIFVRSSDAVKKGYLMRLIRGVSKHQWTRMWNRLKEVDKHFEYQYPSQKDDAVQM 446

Query: 370 ILNGLARKLP 379
           I   LARK+P
Sbjct: 447 IWQALARKVP 456


>gi|302804107|ref|XP_002983806.1| hypothetical protein SELMODRAFT_445668 [Selaginella moellendorffii]
 gi|300148643|gb|EFJ15302.1| hypothetical protein SELMODRAFT_445668 [Selaginella moellendorffii]
          Length = 1068

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 95/349 (27%), Positives = 149/349 (42%), Gaps = 51/349 (14%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
           + KVYVY +    +   L +D +         +F+   L S  V T +PE AD F  P  
Sbjct: 55  RFKVYVYPMIQNASAPDL-RDGKAARPGSIDRVFVDSLLASGFV-TDDPEAADLFLLP-- 110

Query: 99  PTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
            +        P P      ++S IQ +   +PYW R+ GADHFFV  HD  + +      
Sbjct: 111 ASISAIWKKRPDPKGIAHSLKSYIQQLRDLYPYWQRSLGADHFFVSCHDITSDW------ 164

Query: 159 AIERGILPLLQRATLVQTFGQRNHVC---LNEGSITIPPYA----PPQKMQAHQIPPDTP 211
              R +L L + A  +  F    H     L    IT+PP      PPQ+ +         
Sbjct: 165 --SRNVLELKKNAIQIACFPLARHGAQEFLAHKDITMPPAGGSIDPPQRRR--------- 213

Query: 212 RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIF 271
                 +    YD ++    GY AR   A+ W++ ++     ++ D      + +    F
Sbjct: 214 ------WNLAVYDSSSQ---GYAARDVPAS-WKSDESFVAGAVALD-----LQLLVTTRF 258

Query: 272 CLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV--LPFADAIPWEEIGMFVAEEDVPKLDT 329
           CL  LG +     ++ AV  GCIPVI +   +  LPF D + W    + ++ + + +   
Sbjct: 259 CLS-LGSSD-RHLVIPAVRSGCIPVIFSAGKLSDLPFQDILDWNSFAIVLSRDQLHQTKG 316

Query: 330 ILTSIPPEVILRKQRLLANPS-MKRAMLFPQPAQPGDAFHQILNGLARK 377
           IL SI  E   ++ RL  N +   + M +  P QP DAF+ +L  L R+
Sbjct: 317 ILESIDEE---KRSRLQENGARAAKHMEWHSPPQPEDAFYMVLYQLWRR 362


>gi|3668093|gb|AAC61825.1| unknown protein [Arabidopsis thaliana]
          Length = 460

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 90/378 (23%), Positives = 157/378 (41%), Gaps = 68/378 (17%)

Query: 39  KLKVYVYELPSKYNKKLL---------LKDP--RCLTHMFAAEIFMHRFLLSSP------ 81
           +++VY+Y LP ++   L+         +K P     T  +     MH + L S       
Sbjct: 59  RVRVYMYNLPKRFTYGLIEQHSIARGGIKKPVGDVTTLKYPGHQHMHEWYLFSDLNQPEV 118

Query: 82  -------VRTLNPEEADWFYTPIYPTCDL-TPTGLPLP----FKSPRMMRSAIQLISSNW 129
                  VR  +P +AD FY P++ +  L    G P+     +   +M    ++ +    
Sbjct: 119 DRSGSPIVRVSDPADADLFYVPVFSSLSLIVNAGRPVEAGSGYSDEKMQEGLVEWLEGQ- 177

Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGS 189
            +W R  G DH  V+P          +  A+ R +  +     LV  FG+   +  ++GS
Sbjct: 178 EWWRRNAGRDH--VIPAG--------DPNALYRILDRVKNAVLLVSDFGR---LRPDQGS 224

Query: 190 IT---IPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENF 246
                + PY+    +   +I  +  R+  ++F G  Y      +GG      R  +++  
Sbjct: 225 FVKDVVIPYSHRVNLFNGEIGVED-RNTLLFFMGNRYR----KDGG----KVRDLLFQVL 275

Query: 247 KNNPLFDISTDHPTTYYED-------MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIA 299
           +     D++  H T   E+       M  + FCL P G  P + RL +++V  C+P+I++
Sbjct: 276 EKED--DVTIKHGTQSRENRRAATKGMHTSKFCLNPAGDTPSACRLFDSIVSLCVPLIVS 333

Query: 300 DDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAML 356
           D I LPF D I + +  +FV      +   L  +L  I  + IL  QR + +    R  L
Sbjct: 334 DSIELPFEDVIDYRKFSIFVEANAALQPGFLVQMLRKIKTKKILEYQREMKS-KFDRTKL 392

Query: 357 FPQPAQPGDAFHQILNGL 374
                +P  ++H  LN L
Sbjct: 393 LMSMHKPNRSYHFHLNNL 410


>gi|302795426|ref|XP_002979476.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
 gi|300152724|gb|EFJ19365.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
          Length = 434

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 92/394 (23%), Positives = 163/394 (41%), Gaps = 63/394 (15%)

Query: 40  LKVYVYELPSKYN---------KKLLLKDPRCLTH-----------MFAAEIFMHRFLLS 79
           L++++++LPSK+           + + ++ + L +            +  E    R L  
Sbjct: 62  LRIFMHDLPSKFTYGVVERYLRSRGIARNDKRLRYPGTQHSAEWWLFYDLEQGEDRRLSD 121

Query: 80  SPVRTLNPEEADWFYTPIYPTCDLT-PTGLPLPFKSPRMMRSAIQLISS---NWPYWNRT 135
           S VR +NP+EAD FY P + +  L    G     + P       + + +       W + 
Sbjct: 122 SSVRVMNPQEADVFYVPFFSSLSLVVGNGKSEDDEDPYSDEDTQEELMAWLEEQESWKKN 181

Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
           +G DH  +           Q+  A++R    L     L+  F +      +     + PY
Sbjct: 182 KGRDHVVIC----------QDPNALKRLRDRLKNTVLLLSDFERFKPDQASLVKDVVLPY 231

Query: 196 APPQKMQAHQIPPDT-PRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDI 254
               ++ ++     T  R   ++F G  Y      EGG      R  +++     P  D+
Sbjct: 232 T--HRIDSYSNENVTLDRDTLLFFMGNRYR----KEGG----KIRDQLFQVLDVEP--DM 279

Query: 255 STDHPTTYYED-------MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFA 307
              H T   E        MQ + FCL P G  P + RL +A+V  C+PVI++DDI LPF 
Sbjct: 280 VMKHGTQSREGRRLAKVGMQTSKFCLHPAGDTPSACRLFDAIVSVCVPVIVSDDIELPFE 339

Query: 308 DAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPG 364
           D + + E  +FV   +  +   L + L SI P+++ +KQ+ L     +    + +  + G
Sbjct: 340 DELDYSEFAIFVPSINALEPGYLGSYLRSISPDLLKQKQQRL-----REVRKYFEYEEKG 394

Query: 365 DAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSG 398
            A   I   + +KLP  + T +    + +  +SG
Sbjct: 395 GAVEMIWLQVKKKLPFIR-TMINRDKRLVERSSG 427


>gi|359473738|ref|XP_002272591.2| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
           At5g25310-like [Vitis vinifera]
          Length = 437

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 93/383 (24%), Positives = 148/383 (38%), Gaps = 59/383 (15%)

Query: 38  GKLKVYVYELPSKYNKKLLLK------------------DPRCLTHMFAAEIFMHRFLL- 78
           G L VY+Y+LP +++  +L +                      L    + E +M   LL 
Sbjct: 51  GPLMVYMYDLPRRFHVGMLRRRSPADESPVTAENLPPWPSNSGLKKQHSVEYWMMASLLY 110

Query: 79  --------SSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLP---FKSPRMMRSAIQLISS 127
                      VR  +PE AD F+ P + +      G  +     +  R ++  I  I  
Sbjct: 111 DGGGGNETREAVRVWDPEMADAFFVPFFSSLSFNTHGHNMTDPDTEFDRQLQIDILKILR 170

Query: 128 NWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNE 187
              YW R+ G DH  V+P      F +  E+ +   IL       +V  FG+      N 
Sbjct: 171 ESKYWQRSGGRDH--VIPMHHPNAFRFFREQ-VNTSIL-------IVADFGRYPKEISNL 220

Query: 188 GSITIPPYAPPQKMQAHQIPPD--TPRSIFVYFRGLFYDVNNDPEGGYYAR------GAR 239
               + PY            PD    R+  ++FRG    +  D EG    +      G  
Sbjct: 221 RKDVVAPYVHVVDSFTDDNSPDPYESRTTLLFFRGR--TIRKD-EGIVRDKLVKLLAGXD 277

Query: 240 AAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIA 299
             +  +F +           T   + M+ + FCL P G  P S RL +A+V  C+PVI++
Sbjct: 278 DYLQLHFHHRSYLSFLVXQST---QGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVS 334

Query: 300 DDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAML 356
           D I LP+ D I + +  +F ++++  +   +   L  IP E  +   R L   S      
Sbjct: 335 DQIELPYEDEIDYTQFSIFFSDKEALEPGYMIEQLRQIPKERWVEMWRHLKYISHHYE-- 392

Query: 357 FPQPAQPGDAFHQILNGLARKLP 379
           F  P + GDA   +   +  KLP
Sbjct: 393 FQYPPKKGDAIDMLWRQVKHKLP 415


>gi|302814814|ref|XP_002989090.1| hypothetical protein SELMODRAFT_447545 [Selaginella moellendorffii]
 gi|300143191|gb|EFJ09884.1| hypothetical protein SELMODRAFT_447545 [Selaginella moellendorffii]
          Length = 1522

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 94/348 (27%), Positives = 144/348 (41%), Gaps = 49/348 (14%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
           + KVYVY +    +   L +D +         +F+   L S  V T +PE AD F  P  
Sbjct: 55  RFKVYVYPMIQNASAPDL-RDGKAARPGSIDRVFVDSLLASGFV-TDDPEAADLFLLP-- 110

Query: 99  PTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
            +        P P      ++S IQ +   +PYW R+ GADHFFV  HD  + +      
Sbjct: 111 ASISAIWKKRPDPKGIAHSLKSYIQQLRDLYPYWQRSLGADHFFVSCHDITSDW------ 164

Query: 159 AIERGILPLLQRATLVQTFGQRNHVC---LNEGSITIPPYA----PPQKMQAHQIPPDTP 211
              R +L L + A  +  F    H     L    IT+PP      PPQ+ + +       
Sbjct: 165 --SRNVLELKKNAIQIACFPLARHGAQEFLAHKDITMPPAGGSIDPPQRRRWN------- 215

Query: 212 RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIF 271
             + VY          D     YA     A W++ ++     +  D      + +    F
Sbjct: 216 --LAVY----------DSSSQGYAASDVPASWKSDESFVAGAVKMD-----LQLLVTTRF 258

Query: 272 CLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV--LPFADAIPWEEIGMFVAEEDVPKLDT 329
           CL  LG +     ++ AV  GCIPVI +   +  LPF D + W    + ++ + + +   
Sbjct: 259 CLS-LGSSD-RHLVIPAVRSGCIPVIFSAGKLSDLPFQDILDWNSFAIVLSRDQLHQTKA 316

Query: 330 ILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARK 377
           IL SI  E + R Q   A  +  + M +  P QP DAF+ +L  L R+
Sbjct: 317 ILESIDEEKLSRLQENGARAA--KHMEWHSPPQPEDAFYMVLYQLWRR 362


>gi|449465854|ref|XP_004150642.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
           sativus]
          Length = 312

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 73/305 (23%), Positives = 124/305 (40%), Gaps = 82/305 (26%)

Query: 69  AEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSN 128
           AE   ++ LL+SP  T +P++A  F+ P  P              S R +   I+ + ++
Sbjct: 65  AESLFYKSLLNSPYTTHDPDQAHLFFIPFSPHI------------STRSLARLIRTLRTD 112

Query: 129 WPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG 188
            PYWNRT GADHFF+     G           +R ++ L + A  V +F       +   
Sbjct: 113 LPYWNRTLGADHFFLSSSGIGYI--------SDRNVVELKKNAIQVSSFPVSPGKFIPHK 164

Query: 189 SITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN 248
            +++PP +         +PP TP                                     
Sbjct: 165 DVSLPPVS--------TLPPRTP------------------------------------- 179

Query: 249 NPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV--LPF 306
                      + Y + + ++ FCL        S  + EA+ FGC+PV+I+D  +  LP 
Sbjct: 180 -----------SCYGDKLAKSDFCLFEYEGGDVSG-IGEALRFGCVPVVISDRWIQDLPL 227

Query: 307 ADAIPWEEIGMFVA-EEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
            D + WEE+ +FVA    +  +  +L  +  E + R ++L A  +  +  ++  P QP D
Sbjct: 228 MDVVRWEEMAVFVAGGGGIEGVKKVLRRVDGERLDRMKKLGA--AAAQHFVWNSPPQPLD 285

Query: 366 AFHQI 370
           AF+ +
Sbjct: 286 AFNTV 290


>gi|302792172|ref|XP_002977852.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
 gi|300154555|gb|EFJ21190.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
          Length = 434

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 94/399 (23%), Positives = 158/399 (39%), Gaps = 73/399 (18%)

Query: 40  LKVYVYELPSKYN---------KKLLLKDPRCLTH-----------MFAAEIFMHRFLLS 79
           L++++Y+LPSK+           + + ++ + L +            +  E    R L  
Sbjct: 62  LRIFMYDLPSKFTYGVVERYLRSRGIARNDKRLRYPGTQHSAEWWLFYDLEQGEDRRLSD 121

Query: 80  SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNW----PYWNRT 135
           + VR +NP+EAD FY P + +  L                  IQ     W      W + 
Sbjct: 122 ASVRVMNPQEADVFYVPFFSSLSLVVGNGKSEDDEDPYSDEDIQEELMAWLEEQESWKKN 181

Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
           +G DH  +           Q+  A++R    L     L+  F +      +     + PY
Sbjct: 182 KGRDHVVIC----------QDPNALKRLRDRLKNTVLLLSDFERFKPDQASLVKDVVLPY 231

Query: 196 APPQKMQAHQIPP------DTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN 249
                   H+I           R   ++F G  Y      EGG      R  +++     
Sbjct: 232 T-------HRIDSYFNENVTLDRDTLLFFMGNRYR----KEGG----KIRDQLFQVLDVE 276

Query: 250 PLFDISTDHPTTYYED-------MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 302
           P  D+   H T   E        MQ + FCL P G  P + RL +A+V  C+PVI++DDI
Sbjct: 277 P--DMVMKHGTQSREGRRLAKVGMQTSKFCLHPAGDTPSACRLFDAIVSVCVPVIVSDDI 334

Query: 303 VLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQ 359
            LPF D + + E  +FV   +  +   L + L SI P+++ +KQ+ L     +    + +
Sbjct: 335 ELPFEDELDYSEFAIFVPSINALEPGYLGSYLRSISPDLLKQKQQRL-----REVRKYFE 389

Query: 360 PAQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSG 398
             + G A   I   + +KLP  + T +    + +  +SG
Sbjct: 390 YEEKGGAVEMIWLEVKKKLPFIR-TMINRDKRLVERSSG 427


>gi|224093262|ref|XP_002309856.1| predicted protein [Populus trichocarpa]
 gi|222852759|gb|EEE90306.1| predicted protein [Populus trichocarpa]
          Length = 326

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 91/356 (25%), Positives = 143/356 (40%), Gaps = 63/356 (17%)

Query: 39  KLKVYVYE---------LPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEE 89
           + KV+VY+         LPSK+N +            + +E +    L  SP  T +  E
Sbjct: 4   EFKVFVYQDRNITKHCDLPSKHNSR------------YESEEYFFSNLKMSPFLTDDAAE 51

Query: 90  ADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFG 149
           A  F+ PI+       T      +    +   ++ + S +PYWNRT GADHFFV   D  
Sbjct: 52  AHLFFIPIFSQ---KMTKKRSEDERAIAVEDFVKSLISKYPYWNRTLGADHFFVTCADIN 108

Query: 150 ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQ--AHQIP 207
                    A  R I  L++ +  V      N   +    +++P   PP  +    + I 
Sbjct: 109 VT-------ATAR-IANLMKNSIKVMCTPSYNDEYVPHKDVSLPQRVPPLALTPAGNNI- 159

Query: 208 PDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPT------T 261
             T R    ++RGL    NN        R      WEN  +  LF      P+       
Sbjct: 160 --TNRITLAFWRGL----NNSD-----IRQKLLEAWEN--DLELFIQKGRKPSLEQGDLV 206

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVE-AVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
           ++E    + +C+CP G  P   R +  A+ +GC+PVI++D   LPF D + W +  + + 
Sbjct: 207 HHEAFNNSKYCICPGG--PELDRTIALAIHYGCVPVIMSDYYDLPFKDILDWRKFSIILE 264

Query: 321 EEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPG--DAFHQILNGL 374
           E  V  L   L     E++  + R +   ++     F     P   DAFH  +  L
Sbjct: 265 ESQVYYLREHL----KEMLEHEYRAMQTNTVMVRKHFQWNLVPAKYDAFHMTMYDL 316


>gi|3176669|gb|AAC18793.1| End is cut off [Arabidopsis thaliana]
          Length = 440

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 97/407 (23%), Positives = 158/407 (38%), Gaps = 87/407 (21%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRC-------------------LTHMFAAEIFMHRFLLS- 79
           L+V++Y+LP K+N  + + DP                     +    + E ++   LL+ 
Sbjct: 53  LRVFMYDLPRKFN--IAMMDPHSSDVEPITGKNLPSWPQTSGIKRQHSVEYWLMASLLNG 110

Query: 80  -----SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLP---FKSPRMMRSAIQLISSNWPY 131
                  +R  +P+ AD FY P + +      G  +     +  R+++  +     N  Y
Sbjct: 111 GEDENEAIRVFDPDLADVFYVPFFSSLSFNTHGKNMTDPDTEFDRLLQVELMEFLENSKY 170

Query: 132 WNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSIT 191
           WNR+ G DH  V+P      F +  ++ +   IL       +V  FG+ +          
Sbjct: 171 WNRSGGKDH--VIPMTHPNAFRFLRQQ-VNASIL-------IVVDFGRYSKDMARLSKDV 220

Query: 192 IPPY-----APPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGAR------- 239
           + PY     +  ++       P   R+  +YFRG   +     EG    R  +       
Sbjct: 221 VSPYVHVVESLNEEGDDGMGDPFEARTTLLYFRG---NTVRKDEGKIRLRLEKLLAGNSD 277

Query: 240 ------AAVWENFKNNPLFDISTDHPTTYY-----------------EDMQRAIFCLCPL 276
                  A  +N K + L      +    Y                 E M+ + FCL P 
Sbjct: 278 VHFEKSVATTQNIKVSDLEQNRGRYLMLTYQNDSNCSTCVLFCYQSTEGMRSSKFCLHPA 337

Query: 277 GWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSI-- 334
           G  P S RL +A+V  CIPVII+D I LPF D I + E  +F + ++  +   IL ++  
Sbjct: 338 GDTPSSCRLFDAIVSHCIPVIISDKIELPFEDEIDYSEFSLFFSIKESLEPGYILNNLRQ 397

Query: 335 -PPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPH 380
            P E  L   + L N S      F  P +  DA    +N L R++ H
Sbjct: 398 FPKEKWLEMWKRLKNVS--HHFEFQYPPKREDA----VNMLWRQVKH 438


>gi|224144904|ref|XP_002325457.1| predicted protein [Populus trichocarpa]
 gi|222862332|gb|EEE99838.1| predicted protein [Populus trichocarpa]
          Length = 463

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 86/378 (22%), Positives = 149/378 (39%), Gaps = 58/378 (15%)

Query: 40  LKVYVYELPSKYNKKLL----LKD-------------PRCLTHMFAAEIFMHRFLLS--- 79
           LK+++Y+LPS+++  LL    L D             P  L    + E ++   LL+   
Sbjct: 73  LKIFMYDLPSEFHFGLLDLKPLGDSVWPDLRAKVPEYPGGLNLQHSIEYWLTLDLLASEV 132

Query: 80  -------SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSP---RMMRSAIQLISSNW 129
                  S VR  N  EAD  + P + +          P +     ++++  +    ++ 
Sbjct: 133 PGIPRAGSAVRVRNSSEADVIFVPFFSSLCYNRYSKVNPHQKKSKDKLLQEKLVKFLTSQ 192

Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGS 189
             W R+ G DH  +  H            ++    + L     ++  FG+      N   
Sbjct: 193 KEWKRSGGRDHVLLAHH----------PNSMLDARVKLWPAIFILADFGRYPPNIANVAK 242

Query: 190 ITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN 249
             I PY    +   +       R   +YF+G  Y      +GG+    AR  ++   K+ 
Sbjct: 243 DVIAPYKHVIRSYVNDSSNFDSRPTLLYFQGAIY----RKDGGF----ARQELFYLLKDE 294

Query: 250 P-----LFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL 304
                    +  D      + M  + FCL   G  P S RL +A+   C+PVII+DDI L
Sbjct: 295 KEVHFQFGSVQKDGVGKASQGMHSSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDDIEL 354

Query: 305 PFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPA 361
           P+ + + + +  +FV   D  +   L  ++ SI  +   R  + L    ++    F  P+
Sbjct: 355 PYENVLDYSQFCIFVRTSDAVREKFLVNLIRSIKKDEWTRMWKRL--KEVENFFEFQYPS 412

Query: 362 QPGDAFHQILNGLARKLP 379
           + GDA   I   +ARK+P
Sbjct: 413 REGDAVQMIWQAVARKVP 430


>gi|357475227|ref|XP_003607899.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
 gi|355508954|gb|AES90096.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
          Length = 481

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 97/387 (25%), Positives = 160/387 (41%), Gaps = 76/387 (19%)

Query: 40  LKVYVYELPSKYNKKL-----LLKDPRCLT------------HMFAAEIFMHRFLL---- 78
           +KVY+Y+LP ++   +     L +  R  T            H   AE ++   L     
Sbjct: 92  VKVYLYDLPKRFTYGVIHHHSLARGSRANTDEKDVTSFKYPGHQHMAEWYLFSDLSRPDS 151

Query: 79  ----SSPVRTLNPEEADWFYTPIYPTCDLT-----PTGL-PLPFK---SPRMMRSAIQLI 125
               S  VR  +PEEAD F+ P + +  L      P G   +P K   S    + A+   
Sbjct: 152 ERSGSPVVRVSDPEEADLFFVPFFSSLSLIVNPVRPAGSGSVPEKTAYSDEENQEALMEW 211

Query: 126 SSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCL 185
                +W R++G DH  V            +  A+ R +  +     LV  FG+   +  
Sbjct: 212 LEMQEFWKRSKGRDHVIVA----------SDPNAMYRVVDRVKNCVLLVSDFGR---LRP 258

Query: 186 NEGSIT---IPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAV 242
           ++GS+    I PY+   +     I  D  R+  ++F G  Y      EGG      R  +
Sbjct: 259 DQGSLVKDVIVPYSHRIRTYDGGIGVD-KRNTLLFFMGNRYR----KEGG----KIRDTL 309

Query: 243 WENFKNNPLFDISTDHPTTYYED-------MQRAIFCLCPLGWAPWSPRLVEAVVFGCIP 295
           ++  +     D+   H     E        M  + FCL P G  P + RL +A+V  C+P
Sbjct: 310 FQILEKED--DVIIKHGAQSRESRRAASQGMHTSKFCLHPAGDTPSACRLFDAIVSLCVP 367

Query: 296 VIIADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMK 352
           VI++D I LPF D I + +I +FV      +   L +IL  + P+ I+  Q+      +K
Sbjct: 368 VIVSDSIELPFEDTIDYRKIAVFVETAAAIQPGYLVSILRGMAPDRIVEYQK-----ELK 422

Query: 353 RAMLFPQPAQPGDAFHQILNGLARKLP 379
               + +  +P    ++I   +++KLP
Sbjct: 423 EVKRYFKYDEPDGTVNEIWRQISKKLP 449


>gi|449503277|ref|XP_004161922.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
           sativus]
          Length = 310

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 73/305 (23%), Positives = 123/305 (40%), Gaps = 82/305 (26%)

Query: 69  AEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSN 128
           AE    + LL+SP  T +P++A  F+ P  P              S R +   I+ + ++
Sbjct: 63  AESLFFKSLLNSPYATHDPDQAHLFFIPFSPHI------------STRSLARLIRTLRTD 110

Query: 129 WPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG 188
            PYWNRT GADHFF+     G           +R ++ L + A  V +F       +   
Sbjct: 111 LPYWNRTLGADHFFLSSSGIGYI--------SDRNVVELKKNAIQVSSFPVSPGKFIPHK 162

Query: 189 SITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN 248
            +++PP +         +PP TP                                     
Sbjct: 163 DVSLPPVS--------TLPPRTP------------------------------------- 177

Query: 249 NPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV--LPF 306
                      + Y + + ++ FCL        S  + EA+ FGC+PV+I+D  +  LP 
Sbjct: 178 -----------SCYGDKLAKSDFCLFEYEGGDVSG-IGEALRFGCVPVVISDRWIQDLPL 225

Query: 307 ADAIPWEEIGMFVA-EEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
            D + WEE+ +FVA    +  +  +L  +  E + R ++L A  +  +  ++  P QP D
Sbjct: 226 MDVVRWEEMAVFVAGGGGIEGVKKVLRRVDGERLDRMKKLGA--AAAQHFVWNSPPQPLD 283

Query: 366 AFHQI 370
           AF+ +
Sbjct: 284 AFNTV 288


>gi|194707922|gb|ACF88045.1| unknown [Zea mays]
          Length = 240

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 60/185 (32%), Positives = 87/185 (47%), Gaps = 24/185 (12%)

Query: 191 TIPPYAPPQKMQAHQ--IPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN 248
            I PY P   +  H+  +   + RSI ++FRG          GG      R+ + E  K+
Sbjct: 19  VILPYVPNVDLCDHKCVLETQSKRSILLFFRGRL----KRNAGGKI----RSKLVEELKS 70

Query: 249 NPLFDISTDHPTT-------YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 301
               DI  +  +T         + M+++ FCL P G  P S RL +A+V GCIPVII+D+
Sbjct: 71  AK--DIVIEEGSTGAQGKAAAQDGMRKSFFCLSPAGDTPSSARLFDAIVSGCIPVIISDE 128

Query: 302 IVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFP 358
           + LPF   + + EI +FV+  D  +   L   L  I  + I   Q  L   S  R  L+ 
Sbjct: 129 LELPFEGILDYREIALFVSASDAVQPGWLLKYLRGINAKRIREIQSNLVKYS--RHFLYS 186

Query: 359 QPAQP 363
            PAQP
Sbjct: 187 SPAQP 191


>gi|222616547|gb|EEE52679.1| hypothetical protein OsJ_35064 [Oryza sativa Japonica Group]
          Length = 528

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 75/308 (24%), Positives = 126/308 (40%), Gaps = 29/308 (9%)

Query: 82  VRTLNPEEADWFYTPIYPTCDLTPTGLPLP----FKSPRMMRSAIQLISSNWPYWNRTEG 137
           VR  +  +AD  + P + +         +P     +   +    ++ + +  P W R+ G
Sbjct: 221 VRVADSRDADVVFVPFFASLSYNRHSRVVPPEKVSRDKELQEKLVRYLMAQ-PEWKRSGG 279

Query: 138 ADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAP 197
           ADH  +V H   +  H +        + P++    ++  FG+ +    +     I PY  
Sbjct: 280 ADHV-IVAHHPNSLLHARSV------LFPVV---FVLSDFGRYHPRVASLEKDVIAPYKH 329

Query: 198 PQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNP----LFD 253
             K   +       R   +YFRG  +      EGG      R  ++   K+       F 
Sbjct: 330 MAKTFVNDSAGFDDRPTLLYFRGAIFR----KEGG----NIRQELYYMLKDEKDVYFAFG 381

Query: 254 ISTDH-PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 312
              DH  +   + M  + FCL   G  P S RL +A+V  C+PVII+DDI LP+ DA+ +
Sbjct: 382 SVQDHGASKASKGMHASKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDDIELPYEDALDY 441

Query: 313 EEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLAN-PSMKRAMLFPQPAQPGDAFHQIL 371
            +  +FV   D  K   ++  I      +  R+      + +   +  P+Q  DA   I 
Sbjct: 442 SKFSIFVRSSDAVKKGYLMRLIRGVSKHQWTRMWNRLKEVDKHFEYQYPSQKDDAVQMIW 501

Query: 372 NGLARKLP 379
             LARK+P
Sbjct: 502 QALARKVP 509


>gi|27817890|dbj|BAC55656.1| exostosin family protein-like protein [Oryza sativa Japonica Group]
          Length = 453

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 94/393 (23%), Positives = 154/393 (39%), Gaps = 76/393 (19%)

Query: 36  PVGKLKVYVYELPSKYNKKLLLK--------------------DPRCLTHMFAAEIFMHR 75
           P   +K+Y+Y+LP+K+   ++                      + R   H  +AE ++ +
Sbjct: 53  PPAPVKIYMYDLPAKFTYGVVRSYMAARARAGAAGAASAIPDDELRYPGHQHSAEWWLFK 112

Query: 76  FL-----LSSPV-RTLNPEEADWFYTPIYPTCDLTPTGL------------PLPFKSPRM 117
            L     L  PV R  +P +AD FY P + +  L    +              P  S   
Sbjct: 113 DLRRRGPLDRPVARVDDPADADLFYVPFFSSLSLVVNPIRPSAAAANASDAAEPAYSDES 172

Query: 118 MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTF 177
            +  + +     PYW R +G DH F+           Q+  A+ R +  +     L+  F
Sbjct: 173 TQEELLVWLERQPYWRRHQGRDHVFIC----------QDPNALYRVVDRISNAVLLISDF 222

Query: 178 GQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDT-PRSIFVYFRGLFYDVNNDPEGGYYAR 236
           G+      +     I PYA   ++ + Q       R   ++F G  Y      EGG    
Sbjct: 223 GRLRSEQASLVKDVILPYA--HRINSFQGDVGVESRPSLLFFMGNRYR----KEGG---- 272

Query: 237 GARAAVWENFKNNPLFDISTDHPTTYYED-------MQRAIFCLCPLGWAPWSPRLVEAV 289
             R  +++  +N    D+   H     E        M  + FCL P G  P + RL +A+
Sbjct: 273 KVRDTLFQVLENEA--DVIIKHGAQSRESRRMATRGMHSSKFCLHPAGDTPSACRLFDAL 330

Query: 290 VFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLL 346
           V  C+PVI++D I LPF D I +  I +FV      +   L + L  I  + IL  QR  
Sbjct: 331 VSLCVPVIVSDYIELPFEDVIDYRNISIFVETSKAVQPGFLTSTLRGISSQRILEYQR-- 388

Query: 347 ANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
               +K+   + +   P    +QI + ++ K P
Sbjct: 389 ---EIKKVKHYFEYEDPNGPVNQIWHQVSSKAP 418


>gi|222637299|gb|EEE67431.1| hypothetical protein OsJ_24780 [Oryza sativa Japonica Group]
          Length = 500

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 94/393 (23%), Positives = 154/393 (39%), Gaps = 76/393 (19%)

Query: 36  PVGKLKVYVYELPSKYNKKLLLK--------------------DPRCLTHMFAAEIFMHR 75
           P   +K+Y+Y+LP+K+   ++                      + R   H  +AE ++ +
Sbjct: 100 PPAPVKIYMYDLPAKFTYGVVRSYMAARARAGAADAASAIPDDELRYPGHQHSAEWWLFK 159

Query: 76  FL-----LSSPV-RTLNPEEADWFYTPIYPTCDLTPTGL------------PLPFKSPRM 117
            L     L  PV R  +P +AD FY P + +  L    +              P  S   
Sbjct: 160 DLRRRGPLDRPVARVDDPADADLFYVPFFSSLSLVVNPIRPSAAAANASDAAEPAYSDES 219

Query: 118 MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTF 177
            +  + +     PYW R +G DH F+           Q+  A+ R +  +     L+  F
Sbjct: 220 TQEELLVWLERQPYWRRHQGRDHVFIC----------QDPNALYRVVDRISNAVLLISDF 269

Query: 178 GQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDT-PRSIFVYFRGLFYDVNNDPEGGYYAR 236
           G+      +     I PYA   ++ + Q       R   ++F G  Y      EGG    
Sbjct: 270 GRLRSEQASLVKDVILPYA--HRINSFQGDVGVESRPSLLFFMGNRYR----KEGG---- 319

Query: 237 GARAAVWENFKNNPLFDISTDHPTTYYED-------MQRAIFCLCPLGWAPWSPRLVEAV 289
             R  +++  +N    D+   H     E        M  + FCL P G  P + RL +A+
Sbjct: 320 KVRDTLFQVLENEA--DVIIKHGAQSRESRRMATRGMHSSKFCLHPAGDTPSACRLFDAL 377

Query: 290 VFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLL 346
           V  C+PVI++D I LPF D I +  I +FV      +   L + L  I  + IL  QR  
Sbjct: 378 VSLCVPVIVSDYIELPFEDVIDYRNISIFVETSKAVQPGFLTSTLRGISSQRILEYQR-- 435

Query: 347 ANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
               +K+   + +   P    +QI + ++ K P
Sbjct: 436 ---EIKKVKHYFEYEDPNGPVNQIWHQVSSKAP 465


>gi|115472841|ref|NP_001060019.1| Os07g0567000 [Oryza sativa Japonica Group]
 gi|113611555|dbj|BAF21933.1| Os07g0567000 [Oryza sativa Japonica Group]
          Length = 500

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 94/393 (23%), Positives = 154/393 (39%), Gaps = 76/393 (19%)

Query: 36  PVGKLKVYVYELPSKYNKKLLLK--------------------DPRCLTHMFAAEIFMHR 75
           P   +K+Y+Y+LP+K+   ++                      + R   H  +AE ++ +
Sbjct: 100 PPAPVKIYMYDLPAKFTYGVVRSYMAARARAGAAGAASAIPDDELRYPGHQHSAEWWLFK 159

Query: 76  FL-----LSSPV-RTLNPEEADWFYTPIYPTCDLTPTGL------------PLPFKSPRM 117
            L     L  PV R  +P +AD FY P + +  L    +              P  S   
Sbjct: 160 DLRRRGPLDRPVARVDDPADADLFYVPFFSSLSLVVNPIRPSAAAANASDAAEPAYSDES 219

Query: 118 MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTF 177
            +  + +     PYW R +G DH F+           Q+  A+ R +  +     L+  F
Sbjct: 220 TQEELLVWLERQPYWRRHQGRDHVFIC----------QDPNALYRVVDRISNAVLLISDF 269

Query: 178 GQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDT-PRSIFVYFRGLFYDVNNDPEGGYYAR 236
           G+      +     I PYA   ++ + Q       R   ++F G  Y      EGG    
Sbjct: 270 GRLRSEQASLVKDVILPYA--HRINSFQGDVGVESRPSLLFFMGNRYR----KEGG---- 319

Query: 237 GARAAVWENFKNNPLFDISTDHPTTYYED-------MQRAIFCLCPLGWAPWSPRLVEAV 289
             R  +++  +N    D+   H     E        M  + FCL P G  P + RL +A+
Sbjct: 320 KVRDTLFQVLENEA--DVIIKHGAQSRESRRMATRGMHSSKFCLHPAGDTPSACRLFDAL 377

Query: 290 VFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLL 346
           V  C+PVI++D I LPF D I +  I +FV      +   L + L  I  + IL  QR  
Sbjct: 378 VSLCVPVIVSDYIELPFEDVIDYRNISIFVETSKAVQPGFLTSTLRGISSQRILEYQR-- 435

Query: 347 ANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
               +K+   + +   P    +QI + ++ K P
Sbjct: 436 ---EIKKVKHYFEYEDPNGPVNQIWHQVSSKAP 465


>gi|255542540|ref|XP_002512333.1| catalytic, putative [Ricinus communis]
 gi|223548294|gb|EEF49785.1| catalytic, putative [Ricinus communis]
          Length = 434

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 82/373 (21%), Positives = 146/373 (39%), Gaps = 48/373 (12%)

Query: 40  LKVYVYELPSKYNKKLL-------------------------LKDPRCLTHMFAAEIFMH 74
           LKVY+Y+LP +++  ++                         L+    + +   A +   
Sbjct: 56  LKVYMYDLPRRFHVGMMDHGGDAKNDTPVTGENLPTWPKNSGLRKQHSVEYWLMASLLYE 115

Query: 75  RFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPL---PFKSPRMMRSAIQLISSNWPY 131
                  VR L+PE+AD F+ P + +      G  +     +  R ++  +  +     Y
Sbjct: 116 GADEREAVRVLDPEKADAFFVPFFSSLSFNTHGHTMTDPETEIDRQLQVDVIDMLYKSKY 175

Query: 132 WNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSIT 191
           W ++ G DH  V+P      F +  ++ +   IL       +V  FG+            
Sbjct: 176 WQKSGGRDH--VIPMTHPNAFRFLRQQ-LNASIL-------IVADFGRYPKSMSTLSKDV 225

Query: 192 IPPYAPP-QKMQAHQIP-PDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN 249
           + PY          ++  P   R+  ++FRG   +     EG   A+ A+     +  + 
Sbjct: 226 VAPYVHVVDSFTDDEVSNPFESRTTLLFFRG---NTIRKDEGKVRAKLAKILTGYDDIHF 282

Query: 250 PLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 309
                + +      E M+ + FCL P G  P S RL +A+V  C+PVI++D I LP+ D 
Sbjct: 283 ERSSATAETIKASTEGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPYEDE 342

Query: 310 IPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDA 366
           I + +  +F +  +  +   +   L  +P E  L   R L   S+     F  P +  DA
Sbjct: 343 IDYSQFSVFFSVNEAIQPGYMVDQLRQLPKERWLEMWRKLK--SISHHFEFQYPPEKEDA 400

Query: 367 FHQILNGLARKLP 379
              +   +  KLP
Sbjct: 401 VDMLWREVKHKLP 413


>gi|356512371|ref|XP_003524893.1| PREDICTED: probable glycosyltransferase At3g07620-like [Glycine
           max]
          Length = 427

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 94/394 (23%), Positives = 153/394 (38%), Gaps = 54/394 (13%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHM-------------------FAAEIFMHRFLLS- 79
           L+V++Y+LP ++N  ++  D R    M                    + E +M   LL+ 
Sbjct: 52  LRVFMYDLPRRFNVGMI--DRRSAAEMPVTVEDWPAWPVNWGLKKQHSVEYWMMGSLLNV 109

Query: 80  ----SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSP-----RMMRSAIQLISSNWP 130
                 VR  +PE A  F+ P + +      G  +  K P     R ++  +  +     
Sbjct: 110 GGGREVVRVSDPELAQAFFVPFFSSLSFNTHGHTM--KDPATQIDRQLQVDLMELLKKSN 167

Query: 131 YWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSI 190
           YW R+ G DH F + H     F +  ++        L +   +V  FG+      N    
Sbjct: 168 YWQRSGGRDHVFPMTHP--NAFRFLRDQ--------LNESIQVVVDFGRYPRGMSNLNKD 217

Query: 191 TIPPYAPPQKMQAHQIPPD--TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN 248
            + PY           P D    RS  ++FRG  Y      EG    + A+     +  +
Sbjct: 218 VVSPYVHVVDSFTDDEPQDPYESRSTLLFFRGRTY---RKDEGIVRVKLAKILAGYDDVH 274

Query: 249 NPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
                 + ++     + M+ + FCL P G  P S RL +A+V  CIPVI++D I LPF D
Sbjct: 275 YERSVATEENIKASSKGMRSSKFCLHPAGDTPSSCRLFDAIVSHCIPVIVSDQIELPFED 334

Query: 309 AIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
            I + +  +F + ++  +   +   L   P E      R L   S+     F  P +  D
Sbjct: 335 EIDYSQFSVFFSFKEALQPGYMIDQLRKFPKEKWTEMWRQLK--SISHHYEFRYPPKRED 392

Query: 366 AFHQILNGLARKLPHDK-STYLKPGGKFLNWTSG 398
           A   +   +  KLP  K S +     K  +W  G
Sbjct: 393 AVDMLWRQVKHKLPGVKLSVHRNRRLKIPDWWQG 426


>gi|168061581|ref|XP_001782766.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162665739|gb|EDQ52413.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 212

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 48/142 (33%), Positives = 71/142 (50%), Gaps = 10/142 (7%)

Query: 82  VRTLNPEEADWFYTPIYPTCDLTP-TGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADH 140
           + TL+P EAD+F+  +Y +C  +P TG P   ++ ++M +A+  +S+   +WNR+ G D 
Sbjct: 50  IGTLDPYEADYFFILVYVSCKFSPKTGTPWLGRARKLMEAAVNHVSTKMEFWNRSGGRDQ 109

Query: 141 FFVVPHDFGACFHYQEEKAIERGIL-PLLQRATL-VQTFGQRNHVCLNEGSITIPPYAPP 198
            F    D   CFH  E +A    I   L   + L VQ F    H C     I IPPY  P
Sbjct: 110 IFAASQDNSVCFHTLETEAWNTRIYTKLFNPSNLGVQDF----HPCQAAEHIQIPPYVSP 165

Query: 199 QKMQAHQIPPDTPRSIFVYFRG 220
               ++      PR+IF +F G
Sbjct: 166 SVAASY---IKDPRNIFAFFSG 184


>gi|147818414|emb|CAN68950.1| hypothetical protein VITISV_039537 [Vitis vinifera]
          Length = 488

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 91/382 (23%), Positives = 148/382 (38%), Gaps = 66/382 (17%)

Query: 40  LKVYVYELPSKYNKKLL------------LKD-----PRCLTHMFAAEIFMHRFLLSS-- 80
           LKV++Y+LP++++  LL            ++      P  L    + E ++   LLSS  
Sbjct: 98  LKVFMYDLPAEFHFGLLDWAPAGESVWPDIRTKFPLYPGGLNLQHSIEYWLTLDLLSSEF 157

Query: 81  --------PVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPR------MMRSAIQLIS 126
                    +R  N  EAD  + P + +      G    FK  +      +++  +    
Sbjct: 158 PENQNARXAIRVHNSSEADVIFVPFFSSLSYNHFG---KFKGRQKKNENNLLQDKLVKFL 214

Query: 127 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLN 186
           +    W R+EG DH  +  H            ++    + L     ++  FG+      N
Sbjct: 215 TAQEEWIRSEGRDHIIMAHH----------PNSMLDARMKLWPAIFILSDFGRYPPNIAN 264

Query: 187 EGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGY------YARGARA 240
            G   I PY    K   +       R   +YF+G  Y      +GG+      Y      
Sbjct: 265 VGKDLIAPYKHVIKSFINDTSDFDSRPTLLYFQGAIYR----KDGGFIRQELFYLLKDEK 320

Query: 241 AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 300
            V   F N     I+        + M  + FCL   G  P S RL +A+   C+PVII+D
Sbjct: 321 DVHFAFGNTQGNGINKAS-----QGMHSSKFCLNIAGDTPSSNRLFDAIASHCVPVIISD 375

Query: 301 DIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLF 357
           +I LP+ D + + +  +FV   D  K   L  ++ SI  +   R  R L    ++    F
Sbjct: 376 EIELPYEDVLDYSQFCIFVRTSDALKDKFLXKLIRSIKKDEWTRMWRRL--KEVENFFEF 433

Query: 358 PQPAQPGDAFHQILNGLARKLP 379
             P++ GDA   I   + RK+P
Sbjct: 434 QYPSKEGDAVQMIWQAITRKVP 455


>gi|297814922|ref|XP_002875344.1| hypothetical protein ARALYDRAFT_904882 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297321182|gb|EFH51603.1| hypothetical protein ARALYDRAFT_904882 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 64

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Composition-based stats.
 Identities = 34/63 (53%), Positives = 46/63 (73%)

Query: 268 RAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKL 327
           R+IFCLCP GW  WSPRLVE+ V GC+PV+IA+ I LPF++ + W EI + +A++D   L
Sbjct: 2   RSIFCLCPFGWGIWSPRLVESAVSGCVPVVIANGIQLPFSEIVRWPEILLMMAKKDDMNL 61

Query: 328 DTI 330
             I
Sbjct: 62  QKI 64


>gi|168060641|ref|XP_001782303.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666233|gb|EDQ52893.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 426

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 88/385 (22%), Positives = 155/385 (40%), Gaps = 62/385 (16%)

Query: 38  GKLKVYVYELPSKYNKKLLL------------------KDPRCLTHMFAAEIFMHRFLLS 79
            +LKV++Y+LPS+++  ++                   K P  L    + E ++   L++
Sbjct: 37  ARLKVFMYDLPSEFHYGMISEFTPKKNQIWPQNVSDIPKYPGGLYQQHSPEYWLISDLVT 96

Query: 80  SPV----------RTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSA---IQLIS 126
           S +          R    ++A     P + +            +  ++ R+    + LIS
Sbjct: 97  SDMPDRSTPCTVFRVKRWQDAGVILIPFFASLSYNKYSRAPLLRGKKLDRNQELQLNLIS 156

Query: 127 --SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVC 184
             S+ P W  +EG++H  V+ H   A  H +E+    R ++       +V  FG+     
Sbjct: 157 FLSSQPAWRASEGSNHVVVIHHP-NAMLHTREKF---RSVM------FVVADFGRYGAEV 206

Query: 185 LNEGSITIPPYA---PPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAA 241
            N     + PY    P              R+  ++F+G         EGG   +     
Sbjct: 207 ANMAKDVVAPYKHVIPNFDEDVDAALSFKSRTTLLFFQGAIAR----KEGGIIRQQLYEL 262

Query: 242 VWEN----FKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVI 297
           + E     F N    +      T     M+++ FCL   G  P S RL +AV   C+P+I
Sbjct: 263 LGEEPNIIFSNGTTSNAGIRSATA---GMRQSKFCLHLAGDTPSSNRLFDAVASHCVPLI 319

Query: 298 IADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRA 354
           I+++I LPF D + + E  +FV   D  +   +  +L+++  +   R    L    ++R 
Sbjct: 320 ISNEIELPFEDVLNYSEFSLFVNSSDALRKGFVTDLLSNVGEKEWTRMHDRLRQ--VERH 377

Query: 355 MLFPQPAQPGDAFHQILNGLARKLP 379
             +  PAQ GDA H     +ARK+P
Sbjct: 378 FQYQLPAQIGDAVHMTWEAIARKVP 402


>gi|218201205|gb|EEC83632.1| hypothetical protein OsI_29363 [Oryza sativa Indica Group]
          Length = 566

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 102/397 (25%), Positives = 148/397 (37%), Gaps = 87/397 (21%)

Query: 40  LKVYVYELPSKYNKKLL--------------LKDPRC-------LTHMFAAEIFMHRFLL 78
           L+VY+Y+LP +++  +L                DPR        L    + E ++   +L
Sbjct: 179 LRVYLYDLPPEFHFGMLGWDGKAAGAAWPDVAGDPRAVPRYPGGLNLQHSVEYWLTLDIL 238

Query: 79  SS-------------PVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLI 125
           SS              VR  N   AD F  P + +                  RS  QL 
Sbjct: 239 SSTTSGDHRRRRPCTAVRVTNASLADVFLVPFFASLSYNRQSKSPHGGHGSGGRSDRQLQ 298

Query: 126 SSNWPY------WNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQ 179
                Y      W R  GADH  VVPH   +    +   +    +L           FG+
Sbjct: 299 GELVRYLARREEWRRWGGADHL-VVPHHPNSMMDARRRLSAAMFVL---------SDFGR 348

Query: 180 RNHVCLNEGSITIPPYAPPQKMQAHQIPP----DTP----RSIFVYFRGLFYDVNNDPEG 231
                 N     I PY        H +P     D+P    R +  YF+G  +  N    G
Sbjct: 349 YPPDVANLRKDVIAPYK-------HVVPSLGDGDSPGFEQRPVLAYFQGAIHRKN----G 397

Query: 232 GYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAI-------FCLCPLGWAPWSPR 284
           G      R  +++  K+    D+   + +     ++RA        FCL   G  P S R
Sbjct: 398 GR----VRQRLYQLIKDEK--DVHFTYGSVRQNGIRRATKGMASSKFCLNIAGDTPSSNR 451

Query: 285 LVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILR 341
           L +A+V  C+PVII+DDI LPF D + + +  +FV   D  K   L  +L  I  E    
Sbjct: 452 LFDAIVSHCVPVIISDDIELPFEDVLDYSDFCVFVRASDAVKRGFLLHLLRGISQEEWTA 511

Query: 342 KQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKL 378
             R L    +     +  P+QPGDA   I   +ARK+
Sbjct: 512 MWRRL--KEVAHHFEYQYPSQPGDAVQMIWGAVARKM 546


>gi|291242407|ref|XP_002741099.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 524

 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 101/429 (23%), Positives = 161/429 (37%), Gaps = 84/429 (19%)

Query: 17  NTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYN------------------------ 52
           N P +   +   G    + P    KVYVY+LP K+N                        
Sbjct: 75  NIPNVTFNTNKNGKHERNSP---YKVYVYDLPPKFNVNLSDCVKKVDGCFHLDEKMFGMG 131

Query: 53  KKLLLKDPRCL---THMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP--TCDLTPTG 107
            +LL +D +     TH F+ E+ +H  +L S  RT+NP+ AD FY P YP   C    + 
Sbjct: 132 SRLLRRDSQFSYRNTHQFSLEVILHHKILHSRYRTMNPKHADIFYIPFYPGLAC-FCRSF 190

Query: 108 LPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPL 167
               F    + +     ++  WP++   E        PH        +E  +   GIL  
Sbjct: 191 QKSSFDLDLLHKELWHYLTEKWPFFEMRE--------PHAMALGKIEREHWSQRCGILKG 242

Query: 168 LQRATLVQTFG--------QRNHVCLNEGSITIPPYAP----PQKMQAHQIP----PDTP 211
            + A  +Q  G         R++   N   + + PY       +  +AH+          
Sbjct: 243 NKYANRIQFIGIEEEYKTAYRSYFERNGQHVLVAPYPSYGHFIEGEEAHRNDFTKGAKYD 302

Query: 212 RSIFVYF----------RGLFYDVNNDPEGGY--YARGARA--AVW----ENFKNNPLFD 253
           R++FV            R +  D        Y  Y         VW    E  +NN  ++
Sbjct: 303 RNVFVLMAASSRASHEVRKILQDQLTRTSKSYNTYVNTEEQYDGVWYVTPECGQNNAAWE 362

Query: 254 ISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD---IVLPFADAI 310
             T+  T   E M+ ++FCL P G +P      ++V   CIPVI   +   +  PF   +
Sbjct: 363 QGTELFTV--EWMRHSVFCLQPPGDSPTRKSFYDSVTAHCIPVIFLPEKARVKYPFQRLL 420

Query: 311 PWEEIGMFVAEE----DVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDA 366
            + E  +    E    + P +  +L  IP + +++ Q  L N S +    +P      DA
Sbjct: 421 NYSEFTVNFGLETFLLEKPDIVDLLRKIPEDYVIQLQNNLLNVSKRLQYSYPSNIDSDDA 480

Query: 367 FHQILNGLA 375
           F  IL+ L 
Sbjct: 481 FQMILDELG 489


>gi|225439168|ref|XP_002268382.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase [Vitis
           vinifera]
          Length = 488

 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 91/382 (23%), Positives = 148/382 (38%), Gaps = 66/382 (17%)

Query: 40  LKVYVYELPSKYNKKLL------------LKD-----PRCLTHMFAAEIFMHRFLLSS-- 80
           LKV++Y+LP++++  LL            ++      P  L    + E ++   LLSS  
Sbjct: 98  LKVFMYDLPAEFHFGLLDWAPAGESVWPDIRTKFPLYPGGLNLQHSIEYWLTLDLLSSEF 157

Query: 81  --------PVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPR------MMRSAIQLIS 126
                    +R  N  EAD  + P + +      G    FK  +      +++  +    
Sbjct: 158 PENQNARAAIRVHNSSEADVIFVPFFSSLSYNHFG---KFKGRQKKNENNLLQDKLVKFL 214

Query: 127 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLN 186
           +    W R+EG DH  +  H            ++    + L     ++  FG+      N
Sbjct: 215 TAQEEWIRSEGRDHIIMAHH----------PNSMLDARMKLWPAIFILSDFGRYPPNIAN 264

Query: 187 EGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGY------YARGARA 240
            G   I PY    K   +       R   +YF+G  Y      +GG+      Y      
Sbjct: 265 VGKDVIAPYKHVIKSFINDTSDFDSRPTLLYFQGAIYR----KDGGFIRQELFYLLKDEK 320

Query: 241 AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 300
            V   F N     I+        + M  + FCL   G  P S RL +A+   C+PVII+D
Sbjct: 321 DVHFAFGNTQGNGINKAS-----QGMHSSKFCLNIAGDTPSSNRLFDAIASHCVPVIISD 375

Query: 301 DIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLF 357
           +I LP+ D + + +  +FV   D  K   L  ++ SI  +   R  R L    ++    F
Sbjct: 376 EIELPYEDVLDYSQFCIFVRTSDALKDKFLIKLIRSIKKDEWTRMWRRL--KEVENFFEF 433

Query: 358 PQPAQPGDAFHQILNGLARKLP 379
             P++ GDA   I   + RK+P
Sbjct: 434 QYPSKEGDAVQMIWQAITRKVP 455


>gi|30686300|ref|NP_850241.1| Exostosin family protein [Arabidopsis thaliana]
 gi|50253448|gb|AAT71926.1| At2g35100 [Arabidopsis thaliana]
 gi|53828615|gb|AAU94417.1| At2g35100 [Arabidopsis thaliana]
 gi|110737872|dbj|BAF00874.1| hypothetical protein [Arabidopsis thaliana]
 gi|330253970|gb|AEC09064.1| Exostosin family protein [Arabidopsis thaliana]
          Length = 447

 Score = 78.6 bits (192), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 91/383 (23%), Positives = 157/383 (40%), Gaps = 72/383 (18%)

Query: 39  KLKVYVYELPSKYNKKLL---------LKDP--RCLTHMFAAEIFMHRFLLSSP------ 81
           +++VY+Y LP ++   L+         +K P     T  +     MH + L S       
Sbjct: 59  RVRVYMYNLPKRFTYGLIEQHSIARGGIKKPVGDVTTLKYPGHQHMHEWYLFSDLNQPEV 118

Query: 82  -------VRTLNPEEADWFYTPIYPTCDL-TPTGLPLP----FKSPRMMRSAIQLISSNW 129
                  VR  +P +AD FY P++ +  L    G P+     +   +M    ++ +    
Sbjct: 119 DRSGSPIVRVSDPADADLFYVPVFSSLSLIVNAGRPVEAGSGYSDEKMQEGLVEWLEGQ- 177

Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGS 189
            +W R  G DH  V+P          +  A+ R +  +     LV  FG+   +  ++GS
Sbjct: 178 EWWRRNAGRDH--VIPAG--------DPNALYRILDRVKNAVLLVSDFGR---LRPDQGS 224

Query: 190 IT---IPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENF 246
                + PY+    +   +I  +  R+  ++F G  Y      +GG      R  +++  
Sbjct: 225 FVKDVVIPYSHRVNLFNGEIGVED-RNTLLFFMGNRYR----KDGG----KVRDLLFQVL 275

Query: 247 KNNPLFDISTDHPTTYYED-------MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIA 299
           +     D++  H T   E+       M  + FCL P G  P + RL +++V  C+P+I++
Sbjct: 276 EKED--DVTIKHGTQSRENRRAATKGMHTSKFCLNPAGDTPSACRLFDSIVSLCVPLIVS 333

Query: 300 DDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAML 356
           D I LPF D I + +  +FV      +   L  +L  I  + IL  QR      MK    
Sbjct: 334 DSIELPFEDVIDYRKFSIFVEANAALQPGFLVQMLRKIKTKKILEYQR-----EMKSVRR 388

Query: 357 FPQPAQPGDAFHQILNGLARKLP 379
           +     P  A  +I   ++ KLP
Sbjct: 389 YFDYDNPNGAVKEIWRQVSHKLP 411


>gi|168000699|ref|XP_001753053.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162695752|gb|EDQ82094.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 471

 Score = 78.2 bits (191), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 81/314 (25%), Positives = 131/314 (41%), Gaps = 39/314 (12%)

Query: 68  AAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTC---DLTPTGLPLPFKSPRMMRSAIQL 124
            A++F  R L  S   T   + A  F  P        DL PT +         +R  ++ 
Sbjct: 163 TADLFF-RLLTRSEFVTEKAKRAQLFLLPFSIDVLWVDLGPTQVA------EKLRRYLEK 215

Query: 125 ISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR-NHV 183
           + +N+PYW  + GADHF++  H F       E  +  R IL L + +        R N  
Sbjct: 216 VRTNYPYWESSLGADHFYLSCHAF-------EHNSKHRNILELGKNSIQAACAPLRHNQK 268

Query: 184 CLNEGSITIPPYAPPQKMQAHQ-IPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAV 242
                 +  P Y P  +    Q I     R+   YF G   DV       ++        
Sbjct: 269 FYPHKDVVFPQYKPVGEEDVRQAILGRRNRTSLAYFSGC-PDVTTPLLSAFHT------- 320

Query: 243 WENFKNNPLFDISTD---HPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIA 299
           WE    +P F +  +   H  + Y ++ R+ FC+  L    +S  LV+A+ FGC+PV+++
Sbjct: 321 WE---TDPDFIVEANPSPHRLSVYRNLARSRFCVSVLPHDTFS--LVDALRFGCVPVLLS 375

Query: 300 DDIV--LPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLF 357
                 LPF   + W +  + +  ED+P L  IL ++      R+ + L + ++K  + +
Sbjct: 376 KLTFHDLPFQGFLNWGQFAVVLGIEDLPNLKQILANV-SSTKHREMQYLGHQAIKH-LEW 433

Query: 358 PQPAQPGDAFHQIL 371
             P    DAFH  L
Sbjct: 434 NNPPVAYDAFHMTL 447


>gi|297815694|ref|XP_002875730.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297321568|gb|EFH51989.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 479

 Score = 78.2 bits (191), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 84/344 (24%), Positives = 126/344 (36%), Gaps = 58/344 (16%)

Query: 83  RTLNPEEADWFYTPIYPTCDLTPTGLPLPF-KSPRMMRSAIQLIS--SNWPYWNRTEGAD 139
           R  N  EAD  + P + +          P  K+ R      +L++  +    W R+ G D
Sbjct: 158 RVYNSSEADVIFVPFFSSLSYNRFSKVNPHQKTSRNKDLQGKLVTFLTGQEEWKRSGGRD 217

Query: 140 HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQ 199
           H  +  H            ++      L     ++  FG+      N     I PY    
Sbjct: 218 HVVLAHH----------PNSMLDARNKLFPAMFILSDFGRYPPTVANVEKDIIAPYKHVI 267

Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
           K   +       R I +YF+G  Y      +GG             F    LF +  D  
Sbjct: 268 KAYENDTSGFDSRPILLYFQGAIYR----KDGG-------------FVRQELFYLLQDEK 310

Query: 260 TTYY--------------EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
             ++              + M  + FCL   G  P S RL +A+   C+PVII+DDI LP
Sbjct: 311 DVHFSFGSVRNGGINKASQGMHNSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDDIELP 370

Query: 306 FADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQ 362
           F D I + E  +FV   D  K   L  ++  I  E   R    L    +++   F  P++
Sbjct: 371 FEDVIDYSEFAVFVRTSDALKENFLVNLIRGISKEEWTRMWNRL--KEVEKYYEFHFPSK 428

Query: 363 PGDAFHQILNGLARKLP------HDKSTY---LKPGGKFLNWTS 397
             DA   I   +ARK+P      H    Y   +   GK  +W+S
Sbjct: 429 VDDAVQMIWQAIARKVPGVKMRIHKSRRYSGSVSDTGKESSWSS 472


>gi|374922021|gb|AFA26188.1| hypothetical protein, partial [Lolium perenne]
          Length = 282

 Score = 78.2 bits (191), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 64/258 (24%), Positives = 107/258 (41%), Gaps = 44/258 (17%)

Query: 119 RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFH----YQEEKAIERGILPLLQRATLV 174
           R A   I+  +PYWNRT G DH +    D GAC+     +     +  G        +  
Sbjct: 2   RMAYDHIAQRYPYWNRTSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWGNTNTKHENSTT 61

Query: 175 QTFGQR----------NHVCLN-EGSITIPPYAPPQ------KMQAHQIPPDTPRSIFVY 217
             +             +H C +    + +P +  P       K+ A    P + R    Y
Sbjct: 62  AYWADNWDNIPLDRRGDHPCFDPTKDLVLPAWKDPDPAAIWLKLWAR---PRSNRRTLFY 118

Query: 218 FRGLFYDVNNDPEGGY----YARGARAAVWENFKNNP----------LFDISTDH--PTT 261
           F G   ++ +  E G     Y+ G R  +   F + P          + +++  H     
Sbjct: 119 FNG---NLGSAYEQGRPEDTYSMGIRQKLAAEFGSTPNKQGKLGRQHVANVTVTHLRSEK 175

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           YYE++  +IFC    G   WS R+ ++++ GCIPVII D I LP+ + + +    + + E
Sbjct: 176 YYEELASSIFCGVLPG-DGWSGRMEDSMLQGCIPVIIQDGIFLPYENVLNYNSFAVRIQE 234

Query: 322 EDVPKLDTILTSIPPEVI 339
           +D+P L T+L  +    I
Sbjct: 235 DDIPNLITVLRGMNETQI 252


>gi|291237398|ref|XP_002738622.1| PREDICTED: tout-velu-like [Saccoglossus kowalevskii]
          Length = 783

 Score = 78.2 bits (191), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 84/352 (23%), Positives = 133/352 (37%), Gaps = 58/352 (16%)

Query: 41  KVYVYELPSKYNKKLL-------------LKD-------------PRCLTHMFAAEIFMH 74
           K+YVY++P+ +N+ +L             L+D                 TH FA E  +H
Sbjct: 399 KIYVYDMPAAFNEDILDCVHTKVRGECIHLQDGGFGKMLWTDNNISYHFTHQFALEPIIH 458

Query: 75  RFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNR 134
             LL+S  RTLN  +AD FY P Y            P  +   + +     S N P+   
Sbjct: 459 HKLLNSTQRTLNASDADLFYLPYYAGLKCFCHDRYTPGVTAGDLNNKFWEYSLNLPF--- 515

Query: 135 TEGADHFFVV-----PHDFGAC--FHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNE 187
            +   HF  +      H    C        K I   ++   QR      F +  H    E
Sbjct: 516 IKTKPHFMALGKIEREHCSSGCPLLRSAHSKHILYLMIEQEQRRRSRVAFKRDGH----E 571

Query: 188 GSITIPPYAPPQKMQAHQIPP--DTPRSIFVYF-------RGLFYDVNNDPEGGYYARGA 238
             + + PY            P  +  RSI V         +     +  D +    A G 
Sbjct: 572 DEVIVVPYPSYAHFTTEDAVPRFNVSRSILVLMCAGVRRTQSFRVKLRQDLQKEENATGR 631

Query: 239 RAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 298
              V+  F      + ++     +   MQ+++FCL P G +P      ++V+ GCIPV  
Sbjct: 632 HRGVY--FHTRECMEETSRKVIDF---MQQSVFCLQPWGDSPTRKSFYDSVLSGCIPVRF 686

Query: 299 ADDIVLPFADAIPWEEIGMFVAEEDVPKLDT----ILTSIPPEVILRKQRLL 346
             D++ PF D I ++E  +FV + ++   +T     L  +P E I + Q  L
Sbjct: 687 LKDVIYPFEDRINYDEFSLFVDKNELETTNTSIVDYLAKVPKERIEKMQDKL 738


>gi|356525152|ref|XP_003531191.1| PREDICTED: probable glycosyltransferase At3g07620-like isoform 1
           [Glycine max]
          Length = 427

 Score = 77.8 bits (190), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 88/372 (23%), Positives = 144/372 (38%), Gaps = 49/372 (13%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRC-----------------LTHMFAAEIFMHRFLLSS-- 80
           L+V++Y+LP ++N  ++ +                     L    + E +M   LL++  
Sbjct: 52  LRVFMYDLPRRFNVGMIDRRSASETPVTVEDWPAWPVNWGLKKQHSVEYWMMGSLLNAGE 111

Query: 81  ---PVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSP-----RMMRSAIQLISSNWPYW 132
               VR  +PE A  F+ P + +      G  +  K P     R ++  +  +     YW
Sbjct: 112 GREAVRVSDPELAQAFFVPFFSSLSFNTHGHTM--KDPATQIDRQLQVDLMELLKKSKYW 169

Query: 133 NRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITI 192
            R+ G DH F + H     F         RG   L +   +V  FG+      N     +
Sbjct: 170 QRSGGRDHVFPMTHPNAFRF--------LRG--QLNESIQVVVDFGRYPRGMSNLNKDVV 219

Query: 193 PPYAPPQKMQAHQIPPD--TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNP 250
            PY           P D    RS  ++FRG  Y      EG    + A+     +  +  
Sbjct: 220 SPYVHVVDSFTDDEPQDPYESRSTLLFFRGRTY---RKDEGIVRVKLAKILAGYDDVHYE 276

Query: 251 LFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAI 310
               + ++     + M+ + FCL P G  P S RL +A+V  C+PVI++D I LPF D I
Sbjct: 277 RSVATEENIKASSKGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPFEDDI 336

Query: 311 PWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAF 367
            + +  +F + ++  +   +   L   P E      R L   S+     F  P +  DA 
Sbjct: 337 DYSQFSVFFSFKEALQPGYMIDQLRKFPKEKWTEMWRQLK--SISHHYEFEYPPKREDAV 394

Query: 368 HQILNGLARKLP 379
             +      KLP
Sbjct: 395 DMLWRQAKHKLP 406


>gi|326490259|dbj|BAJ84793.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 498

 Score = 77.8 bits (190), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 102/394 (25%), Positives = 153/394 (38%), Gaps = 84/394 (21%)

Query: 40  LKVYVYELPSKY------------------NKKLLLKDP--RCLTHMFAAEIFMHRFLLS 79
           +K+Y+Y+LPSK+                  +   +L D   R   H  +AE ++ + L  
Sbjct: 99  VKIYMYDLPSKFTYGVVRSYMSARGPSGSSDAAAVLADEELRYPGHQHSAEWWLFKDLRQ 158

Query: 80  S-----PV-RTLNPEEADWFYTPIYPTCDL--TPTGLPL------------PFKSPRMMR 119
                 PV R  +P EAD FY P + +  L   P   P             P+ S   M+
Sbjct: 159 RGPRDRPVARVDDPVEADLFYVPFFSSLSLVVNPIRPPAAANASEAAATVGPWYSDEAMQ 218

Query: 120 SAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQ 179
             +        YW R  G DH F+           Q+  A+ R +  +     LV  FG+
Sbjct: 219 DELVEWLERQSYWRRYRGRDHVFIC----------QDPNALYRVVDRISNAVLLVSDFGR 268

Query: 180 RNHVCLNEGSITIPPYAPPQKMQAHQIPP------DTPRSIFVYFRGLFYDVNNDPEGGY 233
                 +     I PY+       H+I P         R   ++F G  Y      EGG 
Sbjct: 269 LRGDQASLVKDVILPYS-------HRINPFKGDVNVDSRPALLFFMGNRYR----KEGGK 317

Query: 234 YARGARAAVWEN-----FKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEA 288
             R     V EN      K+     +S    T   + M  + FCL P G  P + RL +A
Sbjct: 318 I-RDTLFQVLENEGDVIIKHGAQSRVSRRMAT---QGMHSSKFCLHPAGDTPSACRLFDA 373

Query: 289 VVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRL 345
           +V  C+PVI++D I LPF D I +  I +FV      +   L ++L  +  E IL  QR 
Sbjct: 374 LVSLCVPVIVSDHIELPFEDVIDYSNISIFVDTSKAIQPGFLTSMLRKVSSERILEYQR- 432

Query: 346 LANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
                ++R   + +   P    +QI   ++ K P
Sbjct: 433 ----EIQRVKHYFEYEDPNGPVNQIWRQVSMKAP 462


>gi|115476598|ref|NP_001061895.1| Os08g0438600 [Oryza sativa Japonica Group]
 gi|42408650|dbj|BAD09870.1| Exostosin family-like protein [Oryza sativa Japonica Group]
 gi|42408898|dbj|BAD10156.1| Exostosin family-like protein [Oryza sativa Japonica Group]
 gi|113623864|dbj|BAF23809.1| Os08g0438600 [Oryza sativa Japonica Group]
 gi|215715329|dbj|BAG95080.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215740942|dbj|BAG97437.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 566

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 102/400 (25%), Positives = 151/400 (37%), Gaps = 93/400 (23%)

Query: 40  LKVYVYELPSKYNKKLL--------------LKDPRC-------LTHMFAAEIFMHRFLL 78
           L+VY+Y+LP +++  +L                DPR        L    + E ++   +L
Sbjct: 179 LRVYLYDLPPEFHFGMLGWDGKAAGAAWPDVAGDPRAVPRYPGGLNLQHSVEYWLTLDIL 238

Query: 79  SS-------------PVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLI 125
           SS              VR  N   AD F  P + +                  RS  QL 
Sbjct: 239 SSTTSGDHRRRRPCTAVRVTNASLADVFLVPFFASLSYNRQSKSPHGGHGSGGRSDRQLQ 298

Query: 126 SSNWPY------WNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQ 179
                Y      W R  GADH  VVPH   +    +   +    +L           FG+
Sbjct: 299 GELVRYLARREEWRRWGGADHL-VVPHHPNSMMDARRRLSAAMFVL---------SDFGR 348

Query: 180 RNHVCLNEGSITIPPYAPPQKMQAHQIPP----DTP----RSIFVYFRGLFYDVNNDPEG 231
                 N     I PY        H +P     D+P    R +  YF+G  +  N    G
Sbjct: 349 YPPDVANLRKDVIAPYK-------HVVPSLGDGDSPGFEQRPVLAYFQGAIHRKN----G 397

Query: 232 GYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAI-------FCLCPLGWAPWSPR 284
           G      R  +++  K+    D+   + +     ++RA        FCL   G  P S R
Sbjct: 398 GR----VRQRLYQLIKDEK--DVHFTYGSVRQNGIRRATKGMASSKFCLNIAGDTPSSNR 451

Query: 285 LVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPE---V 338
           L +A+V  C+PVII+DDI LPF D + +    +FV   D  K   L  +L  I  E    
Sbjct: 452 LFDAIVSHCVPVIISDDIELPFEDVLDYSAFCVFVRASDAVKRGFLLHLLRGISQEEWTA 511

Query: 339 ILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKL 378
           + R+ + +A+        +  P+QPGDA   I   +ARK+
Sbjct: 512 MWRRLKEVAHH-----FEYQYPSQPGDAVQMIWGAVARKM 546


>gi|15230692|ref|NP_190126.1| exostosin family protein [Arabidopsis thaliana]
 gi|6967106|emb|CAB72489.1| putative protein [Arabidopsis thaliana]
 gi|332644505|gb|AEE78026.1| exostosin family protein [Arabidopsis thaliana]
          Length = 475

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 84/344 (24%), Positives = 125/344 (36%), Gaps = 58/344 (16%)

Query: 83  RTLNPEEADWFYTPIYPTCDLTPTGLPLPF-KSPRMMRSAIQLIS--SNWPYWNRTEGAD 139
           R  N  EAD  + P + +          P  K+ R      +L++  +    W R+ G D
Sbjct: 154 RVYNSSEADVIFVPFFSSLSYNRFSKVNPHQKTSRNKDLQGKLVTFLTAQEEWKRSGGRD 213

Query: 140 HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQ 199
           H  +  H            ++      L     ++  FG+      N     I PY    
Sbjct: 214 HVVLAHH----------PNSMLDARNKLFPAMFILSDFGRYPPTVANVEKDVIAPYKHVI 263

Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
           K   +       R I +YF+G  Y      +GG             F    LF +  D  
Sbjct: 264 KAYENDTSGFDSRPILLYFQGAIYR----KDGG-------------FVRQELFYLLQDEK 306

Query: 260 TTYY--------------EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
             ++              + M  + FCL   G  P S RL +A+   C+PVII+DDI LP
Sbjct: 307 DVHFSFGSVRNGGINKASQGMHNSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDDIELP 366

Query: 306 FADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQ 362
           F D I + E  +FV   D  K   L  ++  I  E   R    L    +++   F  P++
Sbjct: 367 FEDVIDYSEFSVFVRTSDALKENFLVNLIRGITKEEWTRMWNRL--KEVEKYYEFHFPSK 424

Query: 363 PGDAFHQILNGLARKLP------HDKSTY---LKPGGKFLNWTS 397
             DA   I   +ARK+P      H    Y   +   GK   W+S
Sbjct: 425 VDDAVQMIWQAIARKVPGVKMRIHKSRRYSGSVSETGKESRWSS 468


>gi|302842849|ref|XP_002952967.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
 gi|300261678|gb|EFJ45889.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
          Length = 728

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 85/334 (25%), Positives = 139/334 (41%), Gaps = 62/334 (18%)

Query: 41  KVYVYELPSKYNK-KLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
           ++YVYELP + N  + L +  R L ++      + + LLS+ +R  +   AD+F  PI  
Sbjct: 232 RIYVYELPPEMNTFQNLDRLDRPLMYL------IWQRLLSAGLRVADAASADFFLVPIR- 284

Query: 100 TCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGAD-HFFVVPHDFGACFHYQEEK 158
                   + + + S R+++ A+  I S WPYWN T G   H FV   D+G       ++
Sbjct: 285 --------VRMAYDSDRVIQ-AVSYIRSVWPYWNATRGGGRHIFVHTGDWG------RDE 329

Query: 159 AIERGILPLLQRATLVQTFG-QRNHVCLN-------EGSITIPPYAPPQKMQAHQIPPDT 210
             E   L L + AT +  +G  R+H              + +P       +  +Q+P  +
Sbjct: 330 LSEDAQL-LTRNATWLTHWGLARDHEFAGWRQSHRPGKDVVLPLMLAASLLSTYQLPRAS 388

Query: 211 P---------RSIFVYFRGLFYDVNNDPE-GGYY---------------ARGARAAVWEN 245
           P         R+  ++F G        P   G Y               A   RA  + +
Sbjct: 389 PLHPAGPRPERTTTLFFAGRICGSRATPSLNGTYPNCPNVLGSEDAYSAATRQRAYFYHH 448

Query: 246 FKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
            + N     ++  P     +M  A FCL P G      R V A + GC+PV + D ++ P
Sbjct: 449 GRANWKLVTASRAPAA---EMATAKFCLAPSGGGQ-GKRSVLAPLMGCVPVPVTDGLMQP 504

Query: 306 FADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVI 339
           F   + WE   + V E D+P +  +L  + PE +
Sbjct: 505 FEPELRWERFAVGVRERDLPVMHELLDRLMPEQV 538


>gi|296085881|emb|CBI31205.3| unnamed protein product [Vitis vinifera]
          Length = 455

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 87/356 (24%), Positives = 140/356 (39%), Gaps = 47/356 (13%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTL-NPEEADWFYTPIY 98
           LKV++Y+LP++++  LL   P       A E          P+  + N  EAD  + P +
Sbjct: 98  LKVFMYDLPAEFHFGLLDWAP-------AGESVWPDIRTKFPLYPVHNSSEADVIFVPFF 150

Query: 99  PTCDLTPTGLPLPFKSPR------MMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
            +      G    FK  +      +++  +    +    W R+EG DH  +  H      
Sbjct: 151 SSLSYNHFG---KFKGRQKKNENNLLQDKLVKFLTAQEEWIRSEGRDHIIMAHH------ 201

Query: 153 HYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPR 212
                 ++    + L     ++  FG+      N G   I PY    K   +       R
Sbjct: 202 ----PNSMLDARMKLWPAIFILSDFGRYPPNIANVGKDVIAPYKHVIKSFINDTSDFDSR 257

Query: 213 SIFVYFRGLFYDVNNDPEGGY------YARGARAAVWENFKNNPLFDISTDHPTTYYEDM 266
              +YF+G  Y      +GG+      Y       V   F N     I+        + M
Sbjct: 258 PTLLYFQGAIYR----KDGGFIRQELFYLLKDEKDVHFAFGNTQGNGINKAS-----QGM 308

Query: 267 QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK 326
             + FCL   G  P S RL +A+   C+PVII+D+I LP+ D + + +  +FV   D  K
Sbjct: 309 HSSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDEIELPYEDVLDYSQFCIFVRTSDALK 368

Query: 327 ---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
              L  ++ SI  +   R  R L    ++    F  P++ GDA   I   + RK+P
Sbjct: 369 DKFLIKLIRSIKKDEWTRMWRRL--KEVENFFEFQYPSKEGDAVQMIWQAITRKVP 422


>gi|401888173|gb|EJT52138.1| hypothetical protein A1Q1_06676 [Trichosporon asahii var. asahii CBS
            2479]
          Length = 1189

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 81/328 (24%), Positives = 139/328 (42%), Gaps = 49/328 (14%)

Query: 64   THMFAAEIFMHRFL--LSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSA 121
            TH F+A+  M+R +  +S P      EEAD    P+Y  C+           +  ++   
Sbjct: 868  THCFSADSMMYRAMRHISVPA-----EEADLIVLPVYQHCE----------GADFILHDV 912

Query: 122  IQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEE--KAIERGIL---PLLQRATLVQT 176
                 ++ P  N  E A    V+ HD+G C  +  E   A +   L   PLL+   +   
Sbjct: 913  WHHAQASIPGINTGEKAVSL-VMTHDWGICLAFTWEIWSARQDHRLYPDPLLKNTLVFSV 971

Query: 177  FGQRNHVCLNEGS-ITIPPYA--PPQKMQA----HQIPPDTPRSIFVYFRGLFYDVNND- 228
             G     C      + +PP +   P+ + +      I P + R   + + G ++      
Sbjct: 972  MGDWFSPCYRPAQDVVVPPRSCTSPRLLASFPTVSHIKPVSERPRLISWAGTYWGSGKSE 1031

Query: 229  ------PEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWS 282
                  P GG    G R  +      N +     D    Y E++  A FC  P G A WS
Sbjct: 1032 RLRLACPRGGA---GMRELLPGAGPQNHI-----DKYDDYLEELNTARFCPQPRGIAGWS 1083

Query: 283  PRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRK 342
            PR+ +A+  GCIPV+ ++D   PFA  I W +I + V   ++  ++ +L SIP   + R 
Sbjct: 1084 PRVNDAIFAGCIPVLTSEDTHYPFAGLIDWSQISVRVHPTELDHVEELLASIP---LARL 1140

Query: 343  QRLLAN-PSMKRAMLFPQPAQPGDAFHQ 369
            +++ AN  +++ A ++P   +  +   +
Sbjct: 1141 EQIQANIVAIRDAFMYPADGKAEEELQK 1168


>gi|218199855|gb|EEC82282.1| hypothetical protein OsI_26516 [Oryza sativa Indica Group]
          Length = 393

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 79/320 (24%), Positives = 126/320 (39%), Gaps = 50/320 (15%)

Query: 83  RTLNPEEADWFYTPIYPTCDLTPTGL------------PLPFKSPRMMRSAIQLISSNWP 130
           R  +P +AD FY P + +  L    +              P  S    +  + +     P
Sbjct: 66  RVDDPADADLFYVPFFSSLSLVVNPIRPSAAAANASDAAEPAYSDESTQEELLVWLERQP 125

Query: 131 YWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSI 190
           YW R +G DH F+           Q+  A+ R +  +     L+  FG+      +    
Sbjct: 126 YWRRHQGRDHVFIC----------QDPNALYRVVDRISNAVLLISDFGRLRSEQASLVKD 175

Query: 191 TIPPYAPPQKMQAHQIPPDT-PRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN 249
            I PYA   ++ + Q       R   ++F G  Y      EGG      R  +++  +N 
Sbjct: 176 VILPYA--HRINSFQGDVGVESRPSLLFFMGNRYR----KEGG----KVRDTLFQVLENE 225

Query: 250 PLFDISTDHPTTYYED-------MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 302
              D+   H     E        M  + FCL P G  P + RL +A+V  C+PVI++D I
Sbjct: 226 A--DVIIKHGAQSRESRRMATRGMHSSKFCLHPAGDTPSACRLFDALVSLCVPVIVSDYI 283

Query: 303 VLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQ 359
            LPF D I +  I +FV      +   L + L  I  + IL  QR      +K+   + +
Sbjct: 284 ELPFEDVIDYRNISIFVETSKAVQPGFLTSTLRGISSQRILEYQR-----EIKKVKHYFE 338

Query: 360 PAQPGDAFHQILNGLARKLP 379
              P    +QI + ++ K P
Sbjct: 339 YEDPNGPVNQIWHQVSSKAP 358


>gi|168042843|ref|XP_001773896.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674740|gb|EDQ61244.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 373

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 66/274 (24%), Positives = 108/274 (39%), Gaps = 28/274 (10%)

Query: 113 KSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRAT 172
           K+  +    +Q +    P W  + G DH  V+ H     F           +   L++A 
Sbjct: 103 KNQELQEKLMQFLEKQ-PAWQASGGVDHVIVIHHPNSGYF-----------MRDHLRKAM 150

Query: 173 -LVQTFGQRNHVCLNEGSITIPPYAPP-QKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPE 230
            +V  FG+      N G   + PY       +A        R   ++F+G         E
Sbjct: 151 FVVADFGRYASDVANIGKDIVAPYKHVVNDFEAEATISYEKRKTLLFFQGAIMR----KE 206

Query: 231 GGYYARGARAAVWENFKNNPLFDISTDHPT-----TYYEDMQRAIFCLCPLGWAPWSPRL 285
           GG      R  +++     P       + T     +  E MQ + FCL   G  P S RL
Sbjct: 207 GGI----IRLQLYKLLNGEPDVHFEGGNTTNSAIRSASEGMQNSKFCLNLAGDTPSSNRL 262

Query: 286 VEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRL 345
            +A+   C+PVII+DDI +PF D + +    +F+   D  K + I+  +      +  ++
Sbjct: 263 FDAIASHCVPVIISDDIEVPFEDTLNYSTFSIFIKSSDALKSNFIIDLLRGVSREKWTKM 322

Query: 346 LAN-PSMKRAMLFPQPAQPGDAFHQILNGLARKL 378
            A    ++    +  P QP DA H     +ARK+
Sbjct: 323 WATLKQVEHHFKYQYPTQPDDAVHMTWKAIARKI 356


>gi|356506196|ref|XP_003521873.1| PREDICTED: probable glucuronosyltransferase GUT1-like [Glycine max]
          Length = 505

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 78/313 (24%), Positives = 117/313 (37%), Gaps = 34/313 (10%)

Query: 80  SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQ--LIS--SNWPYWNRT 135
           S +R  N  E+D  + P + +               R M   +Q  L+   +    W R+
Sbjct: 191 SVIRVRNSSESDVVFVPFFSSLCYNRFSSKTNPHEKRSMNKVLQEKLVKYVTEQEEWKRS 250

Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
            G DH  V  H            ++    + L     ++  FG+      N     I PY
Sbjct: 251 GGKDHVIVAHH----------PNSMLDARMKLWPGTFILSDFGRYPTNIANVEKDVIAPY 300

Query: 196 APPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGG------YYARGARAAVWENFKNN 249
                   +       R   +YF+G  Y      +GG      YY       V  +F N 
Sbjct: 301 KHVVGSYDNDQSSFDSRPTLLYFQGAIYR----KDGGHVRHELYYLVKNEKDVHFSFGN- 355

Query: 250 PLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 309
               +         E M+ + FCL   G  P S RL +A+   C+PVII+D+I LP+ D 
Sbjct: 356 ----VEKGGVRNAAEGMRSSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDEIELPYEDV 411

Query: 310 IPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDA 366
           I + +  +FV   D  K   L   + SI  E   R    L    ++    F  P++ GDA
Sbjct: 412 IDYSQFCVFVRTRDALKKRYLINFIRSIGKEEWTRMWNRL--KEVESFFEFQFPSKEGDA 469

Query: 367 FHQILNGLARKLP 379
              I   +ARK+P
Sbjct: 470 VQMIWKAVARKVP 482


>gi|115441893|ref|NP_001045226.1| Os01g0921300 [Oryza sativa Japonica Group]
 gi|19386797|dbj|BAB86176.1| OJ1485_B09.5 [Oryza sativa Japonica Group]
 gi|57899432|dbj|BAD88370.1| exostosin-like [Oryza sativa Japonica Group]
 gi|113534757|dbj|BAF07140.1| Os01g0921300 [Oryza sativa Japonica Group]
 gi|125573139|gb|EAZ14654.1| hypothetical protein OsJ_04578 [Oryza sativa Japonica Group]
 gi|215741014|dbj|BAG97509.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767487|dbj|BAG99715.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 437

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 76/307 (24%), Positives = 129/307 (42%), Gaps = 25/307 (8%)

Query: 82  VRTLNPEEADWFYTPIYPTCDLTPTGLPL---PFKSPRMMRSAIQLISSNWPYWNRTEGA 138
           VR  +P+ A+ F+ P + +      G  +     ++ R+++  +  I     YW R+ G 
Sbjct: 128 VRVTDPDAAEAFFVPFFSSLSFNVHGRNMTDPETEADRLLQVELMEILWKSKYWQRSAGR 187

Query: 139 DHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAP- 197
           DH  V+P      F +  +  +   IL       +V  FG+      +     + PY   
Sbjct: 188 DH--VIPMHHPNAFRFLRD-MVNASIL-------IVADFGRYTKELASLRKDVVAPYVHV 237

Query: 198 -PQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDIST 256
               +      P   R   ++FRG    V  D EG   A+ A+       K+   F+ S 
Sbjct: 238 VDSFLNDDPPDPFDDRPTLLFFRG--RTVRKD-EGKIRAKLAKIL---KGKDGVRFEDSL 291

Query: 257 ---DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE 313
              +   T  E M+ + FCL P G  P S RL +A+V  C+PVI++  I LPF D I + 
Sbjct: 292 ATGEGIKTSTEGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSSRIELPFEDEIDYS 351

Query: 314 EIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLAN-PSMKRAMLFPQPAQPGDAFHQILN 372
           E  +F + E+  + D +L  +      +   + +   ++     F  P + GDA + I  
Sbjct: 352 EFSLFFSVEEALRPDYLLNQLRQIQKTKWVEIWSKLKNVSHHYEFQNPPRKGDAVNMIWR 411

Query: 373 GLARKLP 379
            +  K+P
Sbjct: 412 QVKHKVP 418


>gi|219124193|ref|XP_002182394.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217406355|gb|EEC46295.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 573

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 84/376 (22%), Positives = 138/376 (36%), Gaps = 74/376 (19%)

Query: 60  PRCLTHMFAAEIFMHRFLL--SSPVRTLNPEEADWFYTPIYPTCDL--TPTGLPLPFKSP 115
           P  +   F+ E+ +   L    S +RT +PE+A  FY P  P+ +      G P  FK+ 
Sbjct: 144 PFSMHAQFSTELLVREILTHPDSCLRTYDPEQASLFYVPYLPSMEFHAGARGRPPSFKTS 203

Query: 116 RMMRSAIQLISSNW-----------PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGI 164
           +   + ++ +  ++            YW R  G+DH  V         H ++++     +
Sbjct: 204 KYANAILRALEGDYQPWTDHFGLTPKYWQRRNGSDHILVFSEPLQGLTHPKKKRGNYHFV 263

Query: 165 L-------PLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQ------------KMQAHQ 205
                   P++    L  TF      C  +  +   P    +             +Q   
Sbjct: 264 HTQKQLAPPIVVSVELSTTFVNMYPSCAQKNILMPYPITDGRYFNGDLDKEARWAIQNRS 323

Query: 206 IPPDTPRSIFVYFR-----GLFYDVNNDPEGGYYARGA-------RAAVWENFKNNPLFD 253
           +     +S  V        G   D    P   +Y  G        RAA+ +N+K  P F 
Sbjct: 324 LDSIDSKSSPVLVAEKDPVGTLADAR--PIAQWYRAGVHGECVPLRAALQQNYKCTPSFP 381

Query: 254 ISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD----- 308
                PTTY   M+ A FC CP G    + R+ +AV+ GCIP+I++ D V P +D     
Sbjct: 382 SFKRTPTTYPLGMRMATFCPCPGGDTASAKRMFDAVLAGCIPIILSHDFVWPLSDEFEPE 441

Query: 309 --------AIPWEEIGMFVAEED-------------VPKLDTILTSIPPEVILRKQRLLA 347
                   A+ W      V + D             +P +  +L +IP   I R +R L 
Sbjct: 442 MLIKVSDFALRWNASNFVVRKFDNQCRPSVANTNYALPSVQELLEAIPASEIRRLRRGLR 501

Query: 348 NPSMKRAMLFPQPAQP 363
           +     +   P+   P
Sbjct: 502 HAQQAYSYYKPRKGFP 517


>gi|392575274|gb|EIW68408.1| hypothetical protein TREMEDRAFT_32575 [Tremella mesenterica DSM 1558]
          Length = 1176

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 77/329 (23%), Positives = 132/329 (40%), Gaps = 54/329 (16%)

Query: 65   HMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQL 124
            H F+A+  ++R +    V   + EEAD    P+Y  C          F    +   A++ 
Sbjct: 839  HCFSADNILYRAMRHISV---SAEEADLIVLPVYQHC------TDEEFMLHDLTAFAVKT 889

Query: 125  ISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERG---ILP--LLQRATLVQTFGQ 179
            I          +     FV+ HD+G C ++  E    RG   + P  +L+ A +    G 
Sbjct: 890  IPGVL-----NQDKKLAFVLTHDWGICVNFAWEIWSARGESHLTPDWILRNALVWSVMGD 944

Query: 180  RNHVCLN-EGSITIPPYAPPQ-KMQAH-----QIPPDTPRSIFVYFRGLFYDVNND---- 228
             N  C      I +PP       ++ H      + P   R+  V + G ++    +    
Sbjct: 945  YNSPCYRPHQDIVVPPRTCKSIDLREHFPEITNVTPMRQRTKLVTWSGTYWGTGKNMRLR 1004

Query: 229  ---PEGGYYAR-----GARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAP 280
                 GG         G   + W N++              Y +++  A FC  P G A 
Sbjct: 1005 LTCERGGAGKEELVPGGGPMSSWYNWE--------------YMKEISGARFCPQPTGIAG 1050

Query: 281  WSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVIL 340
            WSPR+ +A+  GCIPV+ A+    PFAD + W +  + +   ++ +L+ IL++IP E + 
Sbjct: 1051 WSPRINDAIYAGCIPVLTAEGTHYPFADFLDWSKFSIRIKPTELDQLERILSAIPLEQLE 1110

Query: 341  RKQRLLANPSMKRAMLFPQPAQPGDAFHQ 369
              Q  L    ++ A ++     P D   +
Sbjct: 1111 EMQANLM--LVREAFIYSTDENPEDELKR 1137


>gi|302855272|ref|XP_002959133.1| hypothetical protein VOLCADRAFT_121779 [Volvox carteri f.
           nagariensis]
 gi|300255495|gb|EFJ39797.1| hypothetical protein VOLCADRAFT_121779 [Volvox carteri f.
           nagariensis]
          Length = 771

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 78/330 (23%), Positives = 130/330 (39%), Gaps = 51/330 (15%)

Query: 41  KVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP-IYP 99
           ++Y+YE P   +   +L   R L       I +   ++S  +R ++P  AD+F+ P    
Sbjct: 263 RIYIYEFPPYMHMWGMLWLDRPLN------IIVWERIISMGLREVDPARADYFFIPGCGR 316

Query: 100 TCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKA 159
            CD                     +++    YW + +G DH    P D+G C H  +  A
Sbjct: 317 GCDKWDDKFHF-------------ILAHYGQYWTQNQGRDHIMTHPGDWGRCEHSWDAFA 363

Query: 160 IERGILPLLQRATLVQTFG-------QRNHVCLN-----EGSITIPP-----YAPPQKMQ 202
            +      +   T++Q +G       +  H   N        I +PP     Y   +   
Sbjct: 364 DK-----FISNVTMLQHWGMTVDRSSEVEHNLFNTCHKPNQDILVPPMCGDLYPQFEYNI 418

Query: 203 AHQIPPDTP--RSIFVYFRGLFYDVNNDPE----GGYYARGARAAVWENFKNNPLFDIST 256
            H    + P  ++      G     N+  E      YY+ G RAA+W+  ++ P F I+ 
Sbjct: 419 WHPNRKENPITKTNLASVAGSICGWNSVEEPPCRNRYYSLGVRAALWQ-LRDVPGFHIA- 476

Query: 257 DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG 316
                  + M  + FC  P G A +  R V A   GC+PVII+D +  P+   + W E G
Sbjct: 477 KRVAMMGQSMAESEFCFAPTG-AGYGKRNVMATTLGCMPVIISDHVAQPYEPFLNWNEFG 535

Query: 317 MFVAEEDVPKLDTILTSIPPEVILRKQRLL 346
           +++ E     ++ IL    P+    K   L
Sbjct: 536 VWIPESQAKDVEIILRGFTPQQKAEKMEKL 565


>gi|357463031|ref|XP_003601797.1| hypothetical protein MTR_3g085480 [Medicago truncatula]
 gi|355490845|gb|AES72048.1| hypothetical protein MTR_3g085480 [Medicago truncatula]
          Length = 425

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 82/371 (22%), Positives = 145/371 (39%), Gaps = 46/371 (12%)

Query: 40  LKVYVYELPSKYNKKLLLK-----------------DPRCLTHMFAAEIFM------HRF 76
           L+VY+Y+LP ++N  +L                   D + L    + E +M         
Sbjct: 50  LRVYMYDLPRRFNVGMLDGRNTTEAPVTIADYPLWPDNQGLRRQHSVEYWMMGSLLNGGG 109

Query: 77  LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPL---PFKSPRMMRSAIQLISSNWPYWN 133
             S  VR L+PE  D ++ P + +      G  +     +    ++  +  +     YW 
Sbjct: 110 NGSEAVRVLDPEVVDVYFVPFFSSLSFNTHGHHMRDPETEIDHQLQIDLMGLLGQSKYWQ 169

Query: 134 RTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIP 193
           R+ G DH F + H     F +  ++        L +   +V  FG+      N     + 
Sbjct: 170 RSGGRDHIFPMTH--PNAFRFLRDQ--------LNESIQVVVDFGRYPKGVSNLNKDVVS 219

Query: 194 PYAPPQKMQAHQIPPD--TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPL 251
           PY           P D    R+  ++FRG     +   +G   A+  +     +  +   
Sbjct: 220 PYVHFVDSYVDDEPHDPFESRTTLLFFRG---GTHRKDKGIVRAKFTKILAGFDDVHYER 276

Query: 252 FDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
              + ++     + M+ + FCL P G  P S RL +A+V  C+PVI++D I LPF + I 
Sbjct: 277 SSATGENIKLSSKGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDKIELPFENEID 336

Query: 312 WEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFH 368
           + +  +F + ++  +   +   L S P +      R L N S      F  P +  DA +
Sbjct: 337 YSQFSLFFSFKEALEPGYMINQLRSFPKQNWTEMWRQLKNISHHYE--FHYPPEREDAVN 394

Query: 369 QILNGLARKLP 379
            +   +  KLP
Sbjct: 395 MLWRQIKHKLP 405


>gi|168017876|ref|XP_001761473.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162687479|gb|EDQ73862.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 399

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 88/386 (22%), Positives = 149/386 (38%), Gaps = 48/386 (12%)

Query: 40  LKVYVYELPSKYNKKLLLK-------DPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADW 92
           L++Y+Y++      + LL           C+   +  ++ +H+ L+ S  RTL+   A++
Sbjct: 16  LQIYIYDVDEIDGLRELLSGRDHSIASSACVRGQWGTQVKIHKLLMKSRYRTLDKSRANF 75

Query: 93  FYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
           F+ P+Y  C     GL     +   ++   Q+     PY++R+ G DH FV P   GA  
Sbjct: 76  FFVPVYVKCVRIFGGLNEKEVNEHFLKILRQM-----PYFHRSGGRDHIFVFPSGAGA-- 128

Query: 153 HYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPR 212
                  + +G    L R+  +   G R          T      P  +     P ++  
Sbjct: 129 ------HLVKGWPNFLNRSIFLTPEGDRTDKKAFSSFNTWKDIIIPGNVDIINHPSNSAT 182

Query: 213 SIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNP--------LFDISTDHPTTYYE 264
           S     +  +     +  G    +  R  + E  K  P         F  S       Y 
Sbjct: 183 SPLPLSKRKYVA---NYLGRAQGKKGRLQLIELAKQFPAELDAPELAFQGSAKLGRIEYY 239

Query: 265 D-MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGM-FVAEE 322
           + ++ A FCL P G + W+ R  EA    C+PVI++D+I LP+ + + +    + + +  
Sbjct: 240 NRLRNAKFCLAPRGESSWTLRFYEAFFVECVPVILSDEIELPYQNVLDYSGFSIKWPSSR 299

Query: 323 DVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPG-DAFHQILNGLARKLP-- 379
              +L   L SIP   I   +R+L      R +    P   G  A   IL  L RK+   
Sbjct: 300 TNEELLRYLRSIPEFEI---ERMLVLGRNIRCLFTYAPDSEGCTAMTGILWELQRKVRAF 356

Query: 380 ---------HDKSTYLKPGGKFLNWT 396
                    H+ S   +   KF  W+
Sbjct: 357 HQYHHTFWLHNASIVNRDLAKFHEWS 382


>gi|357141481|ref|XP_003572240.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
           [Brachypodium distachyon]
          Length = 543

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 77/325 (23%), Positives = 131/325 (40%), Gaps = 37/325 (11%)

Query: 82  VRTLNPEEADWFYTPIYPTCDL---TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGA 138
           VR  N  +AD F+ P + +      +           R++++ +    +    W R  G 
Sbjct: 235 VRVTNASQADVFFVPFFASLSYNRHSKLQGKEKMSRNRLLQAELVKYLARQEEWRRWGGK 294

Query: 139 DHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPP 198
           DH  VVPH   +    Q  K +   +        ++  FG+      N     + PY   
Sbjct: 295 DHL-VVPHHPNSMM--QARKKLSAAMY-------VLSDFGRYPPDVANLKKDVVAPYKHV 344

Query: 199 -QKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTD 257
            + ++  + P    R +  YF+G  +      +GG      R  +++  K+    D+   
Sbjct: 345 VRSLRDDESPTFDQRPVLAYFQGAIHR----KDGG----KVRQKLYQLLKDEK--DVHFT 394

Query: 258 HPTTYYEDMQRAI-------FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAI 310
           + +     ++RA        FCL   G  P S RL +A+V  C+PV+I+DDI LPF D +
Sbjct: 395 YGSVRQNGIRRATKGMASSKFCLNIAGDTPSSNRLFDAIVSHCVPVMISDDIELPFEDVL 454

Query: 311 PWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRA---MLFPQPAQPGDAF 367
            + E  +FV   D  +   +L  +    I R +       +K       +  P++P DA 
Sbjct: 455 DYSEFCVFVRASDAVRKGFLLRLL--RGITRDEWNTMWERLKEVAHHFEYQYPSKPDDAV 512

Query: 368 HQILNGLARKLPHDKSTYLKPGGKF 392
             I   +ARK+ H     L   G+F
Sbjct: 513 QMIWGAVARKM-HSLKLQLHKSGRF 536


>gi|321261385|ref|XP_003195412.1| hypothetical protein CGB_G5240C [Cryptococcus gattii WM276]
 gi|317461885|gb|ADV23625.1| Conserved hypothetical protein [Cryptococcus gattii WM276]
          Length = 1130

 Score = 75.5 bits (184), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 77/321 (23%), Positives = 125/321 (38%), Gaps = 36/321 (11%)

Query: 64   THMFAAEIFMHRFL--LSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSA 121
            T  F+A+  M+R L  +S P      +EA+    P+Y  CD T       F     M  A
Sbjct: 806  TDCFSADSIMYRALRHMSVPA-----DEAELVVVPVYQQCDGTQ------FLLHDAMHHA 854

Query: 122  IQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEE---KAIERGILP--LLQRATLVQT 176
             + I        +        V+ HD+G C  +  +      ER + P  +L  A +   
Sbjct: 855  SETIQGV-----KNGEKKVALVLTHDWGICVDFAWDIWSARGERALHPDGILNNALVWSV 909

Query: 177  FGQRNHVCLN-EGSITIPPYAPPQKM------QAHQIPPDTPRSIFVYFRGLFYDVNNDP 229
             G  +  C      + IP                  I P   RS  + + G +       
Sbjct: 910  MGDYDSPCYRPHQDVVIPARTCRSNTLRETFPNVEAIKPMRERSNLLMWSGTYSGTGKSE 969

Query: 230  EGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAV 289
                      A   E  K       S    + Y +D+  A FC  P G A WSP+  +A+
Sbjct: 970  RIRLTCNRGGAGDRELIKGGG--KQSNFASSDYMKDLNNARFCAQPRGIAGWSPQTSDAI 1027

Query: 290  VFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLAN- 348
              GCIPV I++    PFAD + W ++ + VA  ++ K++ +L +IP   + + + L AN 
Sbjct: 1028 YAGCIPVFISEGTHYPFADFLDWSKLSVRVAPTELDKIEKVLAAIP---LSKVEELQANL 1084

Query: 349  PSMKRAMLFPQPAQPGDAFHQ 369
              ++ A L+    +P +   +
Sbjct: 1085 VCVREAFLYSGDGKPEEELER 1105


>gi|255566500|ref|XP_002524235.1| conserved hypothetical protein [Ricinus communis]
 gi|223536512|gb|EEF38159.1| conserved hypothetical protein [Ricinus communis]
          Length = 337

 Score = 75.5 bits (184), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 78/304 (25%), Positives = 131/304 (43%), Gaps = 58/304 (19%)

Query: 77  LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTE 136
           L +S   TLNPE+A  F+ P        P+ L     SPR +   I+ + + +PYWNRT 
Sbjct: 71  LQNSHFITLNPEQAHLFFIPF-------PSDL-----SPRSLARVIRDLRTEFPYWNRTL 118

Query: 137 GADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYA 196
           GADHF++     G    Y+     +R ++ L + +  +  F   N   +    IT+PP  
Sbjct: 119 GADHFYISCTGLG----YES----DRNLVELKKNSVQISCFPSPNGKFVPHKDITLPPLV 170

Query: 197 PPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDIST 256
           P    ++       P   FV +           +G    RG    + E+           
Sbjct: 171 PSTIHKSSN--KRRPYKAFVKY-----------DGVEELRGDLEVLIES----------- 206

Query: 257 DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV--LPFADAIPWEE 314
             P+   ++  R+ FCL    +A     + EA+  GC+P++I +  +  LP  D + W+E
Sbjct: 207 -QPS---DEKTRSEFCL--FDYAANISGIGEALSSGCVPLVITERPIQDLPLMDVLRWQE 260

Query: 315 IGMFVAEED--VPKLDTIL--TSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQI 370
           I + V   D     +  +L  T    +   R +RL A  S  + +++ +  +P DAFH +
Sbjct: 261 IAVIVGSSDDGFKWVKRVLNGTCSRGDTCERMRRLGAGAS--QHLVWNETPEPYDAFHMV 318

Query: 371 LNGL 374
           +  L
Sbjct: 319 MYQL 322


>gi|125544214|gb|EAY90353.1| hypothetical protein OsI_11935 [Oryza sativa Indica Group]
          Length = 218

 Score = 74.7 bits (182), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 45/116 (38%), Positives = 63/116 (54%), Gaps = 12/116 (10%)

Query: 261 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
           +Y + M+ + FCLCP+G+   SPR+VEA+ + C+PVIIAD+ VL       W    + VA
Sbjct: 101 SYIQHMKNSRFCLCPMGYEVNSPRIVEALYYECVPVIIADNFVL------DWSAFAVVVA 154

Query: 321 EEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP--GDAFHQILNGL 374
           E+DVP L  IL  I     LRK   +     +    F   A+P   D FH IL+ +
Sbjct: 155 EKDVPDLKKILQGIT----LRKYVAMHGCVKRLQRHFLWHARPLRYDLFHMILHSI 206


>gi|242041079|ref|XP_002467934.1| hypothetical protein SORBIDRAFT_01g036720 [Sorghum bicolor]
 gi|241921788|gb|EER94932.1| hypothetical protein SORBIDRAFT_01g036720 [Sorghum bicolor]
          Length = 462

 Score = 74.3 bits (181), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 92/396 (23%), Positives = 156/396 (39%), Gaps = 64/396 (16%)

Query: 26  GSAGDVLEDDPV-GKLKVYVYELPSKYNKKLL--------------LKD-----PRCLTH 65
           G AG   + DP    L+V++Y+LP +++  LL              ++D     P  L  
Sbjct: 53  GGAGPGRKCDPAEALLRVFMYDLPPEFHFGLLDWKPPGFGGGVWPDVRDGVPEYPGGLNL 112

Query: 66  MFAAEIFMHRFLLSS---------PVRTLNPEEADWFYTPIYPTCDLTPTGLPLP----F 112
             + E ++   LL+S          VR     +AD  + P + +         +P     
Sbjct: 113 QHSIEYWLTLDLLASEQGAPTPCAAVRVRRAADADVVFVPFFASLSFNRHSRVVPPARDS 172

Query: 113 KSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQR-- 170
           +   + R  ++ +++  P W RT G DH  +  H                G+L    R  
Sbjct: 173 EDRALQRRLLEFLAAR-PEWRRTGGRDHVVLAHH--------------PNGMLDARYRFW 217

Query: 171 --ATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFY--DVN 226
               ++  FG+      N     I PY       A+       R   +YF+G  Y  D  
Sbjct: 218 PCVFVLCDFGRYPPSVANLDKDVIAPYRHLVANFANDTAGYDDRPTLLYFQGAIYRKDGG 277

Query: 227 NDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLV 286
           +  +  YY       V  +F +     ++ +        M+ + FCL   G  P S RL 
Sbjct: 278 SIRQELYYLLKDEKDVHFSFGS-----VAGNGIEQSTHGMRSSKFCLNIAGDTPSSNRLF 332

Query: 287 EAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQ 343
           +++V  C+PVII+D+I LPF D + + +  + V   D  K   L +++T I  E      
Sbjct: 333 DSIVSHCVPVIISDEIELPFEDVLDYSKFSVIVRGADAVKKGFLMSLITGISQEEWAHMW 392

Query: 344 RLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
             L    +++  ++  P+Q  DA   I   +ARK+P
Sbjct: 393 NKLKE--VEKHFVYQYPSQTDDAVQMIWKAIARKVP 426


>gi|168032656|ref|XP_001768834.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162679946|gb|EDQ66387.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 368

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 72/315 (22%), Positives = 126/315 (40%), Gaps = 40/315 (12%)

Query: 83  RTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSA---IQLIS--SNWPYWNRTEG 137
           R  N + AD    P + +        P      +M R+    + L+S   + P W  + G
Sbjct: 57  RVKNWQIADVILIPFFASLSYNKYSRPAVRGRKKMDRNQELQVNLLSFLRSQPAWRASNG 116

Query: 138 ADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYA- 196
           ADH  ++ H     +  ++ ++             +V  FG+ +    N     + PY  
Sbjct: 117 ADHVLIIHHPNAMVYKREQFRSA----------MFVVADFGRYDAEVANIAKDVVAPYKH 166

Query: 197 --PPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDI 254
             P        +     R+  ++F+G         EGG      R  ++E  ++    D+
Sbjct: 167 IIPNFDDDIDSVSSFNTRTTLLFFQGAIVR----KEGGII----RQKLYELLRDES--DV 216

Query: 255 STDHPTTYY-------EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFA 307
              + TT           M+++ FCL   G  P S RL +AV   C+P+I++DDI LPF 
Sbjct: 217 VFVNGTTTSAGIRSATSGMRQSKFCLHMEGDTPSSNRLFDAVASHCVPLIVSDDIELPFE 276

Query: 308 DAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPG 364
           D I + E  +FV   D  +   L  +L +   +   R    +    +++   +  P++ G
Sbjct: 277 DVINYTEFCLFVNSSDALRKGFLTNLLRNFGEKEWTRMHDRMR--EVQKHFEYQLPSEIG 334

Query: 365 DAFHQILNGLARKLP 379
           DA       +ARK+P
Sbjct: 335 DAVQMTWEAIARKVP 349


>gi|125528898|gb|EAY77012.1| hypothetical protein OsI_04969 [Oryza sativa Indica Group]
          Length = 416

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 71/273 (26%), Positives = 117/273 (42%), Gaps = 22/273 (8%)

Query: 113 KSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRAT 172
           ++ R+++  +  I     YW R+ G DH  V+P      F +  +  +   IL       
Sbjct: 141 EADRLLQVELMEILWKSKYWQRSAGRDH--VIPMHHPNAFRFLRD-MVNASIL------- 190

Query: 173 LVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPD--TPRSIFVYFRGLFYDVNNDPE 230
           +V  FG+      +     + PY        +  PPD    R   ++FRG    V  D E
Sbjct: 191 IVADFGRYTKELASLRKDVVAPYVHVVDSFLNDDPPDPFDARPTLLFFRG--RTVRKD-E 247

Query: 231 GGYYARGARAAVWENFKNNPLFDIST---DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVE 287
           G   A+ A+       K+   F+ S    +   T  E M+ + FCL P G  P S RL +
Sbjct: 248 GKIRAKLAKIL---KGKDGVRFEDSLATGEGIKTSTEGMRSSKFCLHPAGDTPSSCRLFD 304

Query: 288 AVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLA 347
           A+V  C+PVI++  I LPF D I + E  +F + E+  + D +L  +      +   + +
Sbjct: 305 AIVSHCVPVIVSSRIELPFEDEIDYSEFSLFFSVEEALRPDYLLNQLRQIQKTKWVEIWS 364

Query: 348 N-PSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
              ++     F  P + GDA + I   +  K+P
Sbjct: 365 KLKNVSHHYEFQNPPRKGDAVNMIWRQVKHKVP 397


>gi|159472635|ref|XP_001694450.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158276674|gb|EDP02445.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 883

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 46/144 (31%), Positives = 70/144 (48%), Gaps = 29/144 (20%)

Query: 42  VYVYELPSKYNKKLL---LKDPRCLTHMFA--------------AEIFMHRFLLSSPVRT 84
           +YVY++ S +   LL   ++   CL   FA              AE  +H   L+S  RT
Sbjct: 497 IYVYDMKSDFGTDLLQYRIEGSHCLYRSFARANQTSFVGYNAYAAEPVLHELFLTSEHRT 556

Query: 85  LNPEEADWFYTPIYPTCDLT----------PTGL--PLPFKSPRMMRSAIQLISSNWPYW 132
           L+PEEAD+F+ P+   C             P GL  P    +  M R A + +++ +PY+
Sbjct: 557 LDPEEADFFFVPVNVGCLFDVYGWNEIPRWPRGLLGPRTHGATMMQREAARWLNATFPYF 616

Query: 133 NRTEGADHFFVVPHDFGACFHYQE 156
            R  G DH ++ PHD GAC+ ++E
Sbjct: 617 ARRGGRDHIWLNPHDEGACYVWRE 640


>gi|159477447|ref|XP_001696822.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158275151|gb|EDP00930.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 1279

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 44/140 (31%), Positives = 68/140 (48%), Gaps = 29/140 (20%)

Query: 42   VYVYELPSKYNKKLL---LKDPRCL-------------THMFAAEIFMHRFLLSSPVRTL 85
            +Y+Y+L   +N ++L   L    CL             TH ++ E +    L++S  RT 
Sbjct: 1125 IYMYDLDPLFNTRMLQYRLVRTSCLHRMFRADNSSTFVTHGYSIETYFIELLMTSGHRTY 1184

Query: 86   NPEEADWFYTPIYPTCDLTPT---------GLPL----PFKSPRMMRSAIQLISSNWPYW 132
            +PEEAD+F+ P+  TC   P          G+PL    P +   M R A + I   +P+W
Sbjct: 1185 DPEEADFFFVPLLVTCYFWPVLGWSDHPWFGMPLAIVRPHQGAFMYRDAKRWIQKTYPWW 1244

Query: 133  NRTEGADHFFVVPHDFGACF 152
            +R  G DH ++  HD GAC+
Sbjct: 1245 DRRGGKDHIWMTAHDEGACW 1264


>gi|224129654|ref|XP_002320639.1| predicted protein [Populus trichocarpa]
 gi|222861412|gb|EEE98954.1| predicted protein [Populus trichocarpa]
          Length = 375

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 98/403 (24%), Positives = 153/403 (37%), Gaps = 82/403 (20%)

Query: 39  KLKVYVYELPSKYNKKLLLKD-------------------PRCLTHMFAAEIFMHRFLLS 79
           KLKV+VY+LP +++  LL                      P  L    + E ++   LL+
Sbjct: 2   KLKVFVYDLPPEFHFGLLGWKGNTNQTWPNVDSHSRIPPYPGGLNLQHSVEYWLTLDLLA 61

Query: 80  S-------PVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSP----RMMRSAIQLISSN 128
           S        VR  N  +AD  + P + +         L  K      +M+++ +    + 
Sbjct: 62  SNTPKVGTAVRVQNSSQADIVFVPFFSSLSYNRHS-KLHGKEKVSVNKMLQTKLVQFLTA 120

Query: 129 WPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG 188
              W R  G DH  +V H   +  H +++         L     ++  FG+      N G
Sbjct: 121 RDEWKRFGGNDHL-IVAHHPNSMLHARKK---------LGSAMFVLADFGRYPVEIANLG 170

Query: 189 SITIPPYAPPQKMQAHQIPPDTP-----RSIFVYFRGLFYDVNNDPEGG------YYARG 237
              I PY    K     IP         R I ++F+G  Y      +GG      YY   
Sbjct: 171 KDIIAPY----KHVVRTIPSGESAQFDRRPILMHFQGAIYR----KDGGAIRQELYYLLK 222

Query: 238 ARAAV---WENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCI 294
               V   +  ++ N +   +        + M  + FCL   G  P S RL +A+   C+
Sbjct: 223 DEKDVHFTFGTYRGNGIKKAA--------QGMASSKFCLNIAGDTPSSNRLFDAIASHCV 274

Query: 295 PVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRA 354
           PVII+DDI LPF D + + E  +FV   D  K   +L  +    I + Q       +K  
Sbjct: 275 PVIISDDIELPFEDVLDYSEFCLFVRASDAVKKGYLLDLL--RGIEKDQWTKLWERLKEI 332

Query: 355 ---MLFPQPAQPGDAFHQILNGLARKLP------HDKSTYLKP 388
                +  P+QPGDA   +   + RK        H K+ Y +P
Sbjct: 333 APHFEYSYPSQPGDAVDMVWKAVLRKTSSVQFKRHRKNRYARP 375


>gi|108863947|gb|ABA91286.2| secondary cell wall-related glycosyltransferase family 47,
           putative, expressed [Oryza sativa Japonica Group]
 gi|215769393|dbj|BAH01622.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 476

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 84/379 (22%), Positives = 146/379 (38%), Gaps = 56/379 (14%)

Query: 40  LKVYVYELPSKYNKKLLL------------------------KDPRCLTHMFAAEIFMHR 75
           ++V++Y+LP +++  LL                         + P  L    + E ++  
Sbjct: 95  VRVFMYDLPPEFHFGLLGWSPPTDGAADAAMWPDVGSGAAAPRYPGGLNQQHSVEYWLTL 154

Query: 76  FL-------LSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLP----FKSPRMMRSAIQL 124
            L         + VR  +  +AD  + P + +         +P     +   +    ++ 
Sbjct: 155 DLLSSSSPPCGAAVRVADSRDADVVFVPFFASLSYNRHSRVVPPEKVSRDKGLQERLVRY 214

Query: 125 ISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVC 184
           + +  P W R+ GADH  +V H   +  H +           L     ++  FG+ +   
Sbjct: 215 LMAQ-PEWKRSGGADH-VIVAHHPNSLLHARSV---------LFPAVFVLSDFGRYHPRV 263

Query: 185 LNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWE 244
            +     I PY    K   +       R   +YFRG  +      EGG   +     + +
Sbjct: 264 ASLEKDVIAPYKHMAKTFVNDSAGFDDRPTLLYFRGAIF----RKEGGNIRQELHYMLKD 319

Query: 245 NFKNNPLFDISTDH-PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 303
                  F    DH  +   + M  + FCL   G  P S RL +A+V  C+PVII+DDI 
Sbjct: 320 EKDVYFAFGSVQDHGASKASQGMHASKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDDIE 379

Query: 304 LPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMK---RAMLFPQP 360
           LP+ DA+ + +  +FV   D  K   ++  I    + + Q  +    +K   +   +  P
Sbjct: 380 LPYEDALDYSKFSIFVRSSDAVKKGYLMRLI--RGVSKHQWTMMWRRLKEVDKHFEYQYP 437

Query: 361 AQPGDAFHQILNGLARKLP 379
           +Q  DA   I   LARK+P
Sbjct: 438 SQKDDAVQMIWQTLARKVP 456


>gi|297839341|ref|XP_002887552.1| F25A4.34 [Arabidopsis lyrata subsp. lyrata]
 gi|297333393|gb|EFH63811.1| F25A4.34 [Arabidopsis lyrata subsp. lyrata]
          Length = 458

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 77/325 (23%), Positives = 127/325 (39%), Gaps = 32/325 (9%)

Query: 79  SSPVRTLNPEEADWFYTPIYPTCDLTPTGL---PLPFKSPRMMRSAIQLISSNWPYWNRT 135
           S+ +R  N  EAD  + P + +                  R+++  +     +   W R 
Sbjct: 142 SAAIRVKNSNEADIVFVPFFASLSYNRKSKLRGNETISGDRLLQERLVEFLKSQDEWKRF 201

Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQT-FGQRNHVCLNEGSITIPP 194
           +G DH  +  H     +               L  A  V + FG+ +    N     I P
Sbjct: 202 DGKDHLIIAHHPNSLLYAKN-----------FLGSAMFVLSDFGRYSSANANLEKDIIAP 250

Query: 195 YAPPQK-MQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFD 253
           Y    K +  ++  P   R +  YF+G  Y      +GG   +     + +    +  F 
Sbjct: 251 YLHVVKTISNNESAPFEKRPVLAYFQGAIYR----KDGGTIRQELYNLLRDEKDVHFAFG 306

Query: 254 ISTDHPTTYY-EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 312
               + T    + M  + FCL   G  P S RL +A+V  C+PVII+D I LPF D++ +
Sbjct: 307 TVRRNGTKQTGKGMASSKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDQIELPFEDSLDY 366

Query: 313 EEIGMFV-AEEDVPK--LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQ 369
               +FV A E V K  L  +L  I  +   +K   L    +     +  P+ PGD+ + 
Sbjct: 367 SGFSVFVHASEAVKKGFLVNLLRGITEDQWKKKWGRL--KEVAGCFEYRFPSHPGDSVNM 424

Query: 370 ILNGLARKLP------HDKSTYLKP 388
           I + ++ KL       H K+ Y + 
Sbjct: 425 IWSAVSHKLSSLQFDVHRKNRYRRS 449


>gi|260782087|ref|XP_002586123.1| hypothetical protein BRAFLDRAFT_105910 [Branchiostoma floridae]
 gi|229271215|gb|EEN42134.1| hypothetical protein BRAFLDRAFT_105910 [Branchiostoma floridae]
          Length = 551

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 96/416 (23%), Positives = 161/416 (38%), Gaps = 85/416 (20%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCL-------------------------THMFAAEIFMH 74
            ++Y+YELP+++N+ L+    R L                         THMFA E+ +H
Sbjct: 125 FRLYIYELPAEFNRNLVSCVVRELGGCFRLGSFGMGPEFARHGNMSYRHTHMFALEVILH 184

Query: 75  RFLLSSPVRTLNPEEADWFYTPIY--------PTCDLTPTGLPLP--------FKSPRMM 118
           +  L SP RTL+P  AD FY P Y          C  T    P            +  + 
Sbjct: 185 QKALYSPSRTLDPHSADAFYIPYYAGLCTRHHSGCSTTKNISPYAGLACLCPGLDATALN 244

Query: 119 RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQR---ATLVQ 175
           R     ++S +P++ R  G  H   +    G     Q  +      LP  +R   A + Q
Sbjct: 245 RKLFSHVTSRYPFYFR--GRPHLMAL----GKIEREQWTQDCSLLTLPQARRVVFAGIEQ 298

Query: 176 TF--GQRNHVCLNEGSITIPPY--------APPQ-----KMQAHQIPPDT-PRSIFVYFR 219
            F    R H       + + PY        A  Q      M+A ++  DT PR +FV+  
Sbjct: 299 EFSPALRAHFGRRGSPLIVAPYPAFGHVISAGSQGDVKSHMKAGEL--DTVPRDVFVFLA 356

Query: 220 GLFYDVNNDPEG---GYYARG----ARAAVWENFKNNPLFDISTDHPTTY----YEDMQR 268
               + +   +G    ++  G    +  A       +P++ ++ +    +     E M+ 
Sbjct: 357 ASSRNAHKIRQGLRPQFHVTGQPYSSEEAARVRRDGSPVWLLTPECRGNWEGKVVEWMRH 416

Query: 269 AIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD--IVLPFADAIPWEEIGMFVAEEDVPK 326
           ++FCL P G +P      +AV  GC+PV    +  +  PF   + + +  + +  +DV  
Sbjct: 417 SVFCLQPPGDSPTRKSFYDAVACGCVPVTFTLEHPVRYPFDQVLNYSDFSVIIDGKDVTD 476

Query: 327 LD----TILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKL 378
            +     IL  IP E I   Q  L   +      +P      DAF  +L  +A+++
Sbjct: 477 RNITILNILRKIPSERIKMLQDNLKKVAPLLQYSYPSTVPSQDAFTMVLEEMAQRV 532


>gi|218185155|gb|EEC67582.1| hypothetical protein OsI_34941 [Oryza sativa Indica Group]
          Length = 483

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 86/382 (22%), Positives = 150/382 (39%), Gaps = 55/382 (14%)

Query: 40  LKVYVYELPSKYNKKLLL------------------------KDPRCLTHMFAAEIFMHR 75
           ++V++Y+LP +++  LL                         + P  L    + E ++  
Sbjct: 95  VRVFMYDLPPEFHFGLLGWSPPTDGAADAAMWPDVGSGAAAPRYPGGLNQQHSVEYWLTL 154

Query: 76  FL-------LSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLP----FKSPRMMRSAIQL 124
            L         + VR  +  +AD  + P + +         +P     +   +    ++ 
Sbjct: 155 DLLSSSSPPCGAAVRVADSRDADVVFVPFFASLSYNRHSRVVPPEKVSRDKGLQERLVRY 214

Query: 125 ISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVC 184
           + +  P W R+ GADH  +V H   +  H +           L     ++  FG+ +   
Sbjct: 215 LMAQ-PEWKRSGGADH-VIVAHHPNSLLHARSV---------LFPAVFVLSDFGRYHPRV 263

Query: 185 LNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFY--DVNNDPEGGYYARGARAAV 242
            +     I PY    K   +       R   +YFRG  +  +V  D   G   R     +
Sbjct: 264 ASLEKDVIAPYKHMAKTFVNDSAGFDDRPTLLYFRGAIFRKEVKIDSWKGGNIRQELHYM 323

Query: 243 WENFKNNPL-FDISTDH-PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 300
            ++ K+    F    DH  +   + M  + FCL   G  P S RL +A+V  C+PVII+D
Sbjct: 324 LKDEKDVYFAFGSVQDHGASKASQGMHASKFCLNIAGDTPSSNRLFDAIVSHCVPVIISD 383

Query: 301 DIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMK---RAMLF 357
           DI LP+ DA+ + +  +FV   D  K   ++  I    + + Q  +    +K   +   +
Sbjct: 384 DIELPYEDALDYSKFSIFVRSSDAVKKGYLMRLI--RGVSKHQWTMMWRRLKEVDKHFEY 441

Query: 358 PQPAQPGDAFHQILNGLARKLP 379
             P+Q  DA   I   LARK+P
Sbjct: 442 QYPSQKDDAVQMIWQTLARKVP 463


>gi|242079279|ref|XP_002444408.1| hypothetical protein SORBIDRAFT_07g021480 [Sorghum bicolor]
 gi|241940758|gb|EES13903.1| hypothetical protein SORBIDRAFT_07g021480 [Sorghum bicolor]
          Length = 539

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 97/426 (22%), Positives = 158/426 (37%), Gaps = 86/426 (20%)

Query: 34  DDPVGKLKVYVYELPSKYNKKLL-------------LKDPRCLTHMFAAEIFMHRF---- 76
           D     L+VYVY+LP++++  +L             ++D R   H        H      
Sbjct: 126 DADSAALRVYVYDLPAEFHFGMLGWDGKGKPAAWPDVRDARAAPHYPGGLNLQHSVAYWL 185

Query: 77  ---LLSSP---------------------VRTLNPEEADWFYTPIYPTCDL---TPTGLP 109
              +LSS                      VR  N   AD F+ P + +      +     
Sbjct: 186 TLDILSSALPPGTGTGSDDVVIRDRPCVAVRVTNASLADVFFVPFFASLSYNRHSKLRRG 245

Query: 110 LPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQ 169
                 R +++ +         W R  G +H  +VPH   +    +++ +    +L    
Sbjct: 246 EKVNRNRFLQAELVRYLMRKEEWRRWGGKNHL-IVPHHPNSMMEARKKLSAAMFVL---- 300

Query: 170 RATLVQTFGQRN-HVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYD---- 224
                  FG+ + HV   +  +  P     +       P    R I  YF+G  +     
Sbjct: 301 -----SDFGRYSPHVANLKKDVIAPYMHVVRSFGDGDSPAFDQRPILAYFQGAIHRKAVR 355

Query: 225 ------VNNDP--EGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAI------ 270
                 V N P  +GG      R  +++  K+    D+   + +     ++RA       
Sbjct: 356 ALCSVLVANRPAFQGG----KVRQKLYQLLKDER--DVHFTYGSVRQNGIRRATAGMSTS 409

Query: 271 -FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK--- 326
            FCL   G  P S RL +A+V  C+PVII+DDI LPF D + + E  +FV   D  K   
Sbjct: 410 KFCLNIAGDTPSSNRLFDAIVSHCVPVIISDDIELPFEDVLDYSEFCVFVRSADAAKRGF 469

Query: 327 LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDKSTYL 386
           L  +L  I  +   +    L    + R   +  P++ GDA   I + +ARK+ H     L
Sbjct: 470 LLRLLRGISRDEWTKMWMRLKK--VTRHFEYQYPSRSGDAVQMIWSAVARKM-HSVQLQL 526

Query: 387 KPGGKF 392
              G+F
Sbjct: 527 HKRGRF 532


>gi|326530125|dbj|BAK08342.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 461

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 88/380 (23%), Positives = 146/380 (38%), Gaps = 62/380 (16%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRC------------------LTHMFAAEIFMHRFLLSS- 80
           L+VY+Y+LP +++  +L  +P                    L    + E ++   LL+S 
Sbjct: 69  LRVYMYDLPLEFHFGMLDWEPGSGGGLWPDVRHGVPEYPGGLNLQHSIEYWLTLDLLASE 128

Query: 81  --------PVRTLNPEEADWFYTPIYPTCDLTPTGLPLP----FKSPRMMRSAIQLISSN 128
                    VR  +P  AD  + P + +         +P     +   + R  I+ +++ 
Sbjct: 129 QGAPTPCNAVRVRDPARADVVFVPFFASLSFNRHSKVVPPARTSEDRTLQRRLIEFLAAR 188

Query: 129 WPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG 188
            P W R+ G DH  +  H  G          +      L     ++  FG+  H   N  
Sbjct: 189 -PEWRRSGGRDHVVLAHHPNG----------MLDARYKLWPCVFVLCDFGRYPHSVANID 237

Query: 189 SITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGG------YYARGARAAV 242
              I PY        +       R   +YF+G  Y      +GG      YY       V
Sbjct: 238 KDVIAPYQHVVDDFLNDSTGYDDRPTLLYFQGAIYR----KDGGFIRQELYYLLKDEKDV 293

Query: 243 WENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 302
             +F +     ++ +        M+ + FCL   G  P S RL +++V  C+PVII+D+I
Sbjct: 294 HFSFGS-----VAGNGIEESTRGMRASKFCLNIAGDTPSSNRLFDSIVSHCVPVIISDEI 348

Query: 303 VLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQ 359
            LPF D + + +  + V   D  K   L  ++  I PE        L    ++    +  
Sbjct: 349 ELPFEDMLDYSKFCIIVRGADAVKKGFLINLIKGISPEEWTSMWNKLRE--VEGHFEYQY 406

Query: 360 PAQPGDAFHQILNGLARKLP 379
           P+QP DA   I   +ARK+P
Sbjct: 407 PSQPEDAVQMIWKTIARKVP 426


>gi|159477331|ref|XP_001696764.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158275093|gb|EDP00872.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 967

 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 71/262 (27%), Positives = 104/262 (39%), Gaps = 45/262 (17%)

Query: 124 LISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR--- 180
           L SS   +W+R  G DH F++ +D GAC+  QE       +L    R   V   G     
Sbjct: 542 LDSSGKSFWDRRGGRDHIFMMLNDEGACWMPQEVYNTSI-VLTHWGRMDNVHVCGSAWGY 600

Query: 181 -----------------------NHVCLNEGS-ITIPPYAPPQKMQAHQI--PPDTPRSI 214
                                   H C   G  + +P   PP    +  +   P   R I
Sbjct: 601 DNYSAPLDSWKPYVDGDWRKEYDGHPCYTPGKDLVVPSLKPPSHYASSPLLGAPPLERDI 660

Query: 215 FVYFRGLFYDVNNDP-EGGYYARG-----ARAAVWENFKNNPLFDISTDH--PTTYYEDM 266
            +Y RG     +  P    +Y+RG     A+ A   N+ +     I        +Y E +
Sbjct: 661 LLYLRG-----DTGPYRAHWYSRGIRQRLAKLAYKHNWADKYRIYIGEGWQISGSYSEHL 715

Query: 267 QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK 326
            R+ FC+   G   WS R  +A++ GCIP++I D +   F   + W+   + + EE V +
Sbjct: 716 ARSTFCVVAPG-DGWSARAEDAILHGCIPLVIMDGVHAVFESIVEWDAFAVRIREEAVNE 774

Query: 327 -LDTILTSIPPEVILRKQRLLA 347
            L   L S  PE I R QR LA
Sbjct: 775 DLPKFLLSFSPEQIERMQRRLA 796


>gi|357508931|ref|XP_003624754.1| Secondary cell wall-related glycosyltransferase family [Medicago
           truncatula]
 gi|355499769|gb|AES80972.1| Secondary cell wall-related glycosyltransferase family [Medicago
           truncatula]
          Length = 510

 Score = 72.0 bits (175), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 71/309 (22%), Positives = 116/309 (37%), Gaps = 31/309 (10%)

Query: 82  VRTLNPEEADWFYTPIYPTCDL---TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGA 138
            R  N  EAD  + P + +      + TG        ++++  +     N   W R+ G 
Sbjct: 187 TRVRNSTEADVIFVPFFSSLTYNRHSKTGPHERRSRNKVLQEKLVRYLMNQEEWKRSGGR 246

Query: 139 DHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPP 198
           DH  +  H            ++    + L     ++  FG+      N     I PY   
Sbjct: 247 DHLILAHH----------PNSMLDARMKLWPATFILSDFGRYPPNIANVDKDVIAPYKHV 296

Query: 199 QKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN-----PLFD 253
                        R   +YF+G  Y      +GGY    AR  ++   K           
Sbjct: 297 IASYVDDQSTFDSRKTLLYFQGAIYR----KDGGY----ARQELFYLLKEEKDVHFSFGS 348

Query: 254 ISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE 313
           +           M+ + FCL   G  P S RL +A+   C+PVII+D+I LP+ D + + 
Sbjct: 349 VQKGGVRNATNGMRSSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDEIELPYEDVLDYS 408

Query: 314 EIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQI 370
           +  +FV   D  K   L   + SI  +   R    L    +++   F  P++ GDA   I
Sbjct: 409 KFCVFVRTRDAVKKKYLINFIRSIGKDEWTRMWNRL--KEVEKFFEFQFPSKEGDAVEMI 466

Query: 371 LNGLARKLP 379
              ++RK+P
Sbjct: 467 WQAVSRKVP 475


>gi|299116802|emb|CBN74915.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 1052

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 76/304 (25%), Positives = 121/304 (39%), Gaps = 52/304 (17%)

Query: 38  GKLKVYVYELPSKYNKKLLL-------------KDPRCLTHMFAAEIFMHRFLLSSPVRT 84
           GK+ V+VY+   + +   L+             +D R        EI++HR +L+ P R 
Sbjct: 155 GKVSVFVYDGVPELDHSWLVPCYRQVRDGVSPWQDERADMAQDMGEIWLHRAMLAHPWRV 214

Query: 85  LNPEEADWFYTPIYPTCDLTPTGLPLPFKS-PRMMRSAIQLISSNWPYWNRTEGADHFFV 143
            NPEEAD FY P+YP             K+   ++ ++++ ++ +  Y+ R  GADH  V
Sbjct: 215 ANPEEADLFYVPMYPVLSTKLGNNRCGGKTHDELINTSVEYLALSSVYFRRFGGADHTLV 274

Query: 144 VPHDFGACFHYQEEKAIERGILP-LLQRATLV----QTFGQRNHVCLNEGSITIPPYAPP 198
                  C  +  + A+  G  P +L R T+V    +        C  +  +TIP  A  
Sbjct: 275 -------CAWWNCKSAL--GPKPRMLLRRTVVGINEKMLEWTRWGCGLDKMVTIPYTASS 325

Query: 199 QKMQAHQIPPDTP--RSIFVYFRGLFYD---------VNNDPEGGYYARGARAAVWENFK 247
               +  I       R I  +F G             V    EG     G   + W    
Sbjct: 326 VLTTSEMIGGRAAEDRDIPFFFVGTARGRPERQNLDVVTGMAEGSVMMLGDHQSDWGM-- 383

Query: 248 NNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI-VLPF 306
                     + T Y   + R+ FC CP G    S R+ +AV  GC P++    + VLPF
Sbjct: 384 ----------NSTQYAAHIARSRFCFCPRGDTESSRRIFDAVAAGCTPIVTEASVAVLPF 433

Query: 307 ADAI 310
           ++ +
Sbjct: 434 SEHV 437


>gi|356497977|ref|XP_003517832.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
           [Glycine max]
          Length = 509

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 86/378 (22%), Positives = 135/378 (35%), Gaps = 58/378 (15%)

Query: 40  LKVYVYELPSKYNKKLL-----------------------LKDPRCLTHMFAAEIFMHRF 76
           LKV++Y+LP +++  LL                       L     +      +I    F
Sbjct: 119 LKVFMYDLPPEFHFGLLDWKPSGNSVWPDVRTNIPGYPGGLNLQHSIEFWLTLDILASEF 178

Query: 77  LLSSPVRTL----NPEEADWFYTPIYPTCDLTPTGLPLPF---KSPRMMRSAIQLISSNW 129
             +S  RT+    N  EAD  + P + +          P       ++++  +       
Sbjct: 179 PQASKARTVIRVQNSSEADIIFVPFFSSLSYNRYSKSKPHVKKSKNKILQEKLVTYLMAQ 238

Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGS 189
             W R+ G DH  +  H            ++    + L     ++  FG+      N   
Sbjct: 239 EEWKRSGGKDHLILAHH----------PNSMLDARMKLWPATFILSDFGRYPPNIANVEK 288

Query: 190 ITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN 249
             I PY        +       R   +YF+G  Y      +GG     AR  ++   K+ 
Sbjct: 289 DVIAPYKHLISSYVNDNSNFDSRPTLLYFQGAIY----RKDGGL----ARQELFYLLKDE 340

Query: 250 -----PLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL 304
                    I  D      E M+ + FCL   G  P S RL +A+   C+PVII+D I L
Sbjct: 341 KDVHFSFGSIGKDGIKKATEGMRASKFCLNIAGDTPSSNRLFDAIASHCVPVIISDKIEL 400

Query: 305 PFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPA 361
           P+ D I + E  +FV   D  K   L   +  I  E   R    L    ++    F  P+
Sbjct: 401 PYEDVIDYSEFCIFVRTSDAIKEKFLINFIRGIAKEEWTRMWNKL--KEVEHFFEFHFPS 458

Query: 362 QPGDAFHQILNGLARKLP 379
           +  DA   I   +ARK+P
Sbjct: 459 KENDAVQMIWQAVARKVP 476


>gi|359359070|gb|AEV40977.1| putative exostosin family domain-containing protein [Oryza
           punctata]
          Length = 459

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 77/294 (26%), Positives = 129/294 (43%), Gaps = 34/294 (11%)

Query: 119 RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKA------IERGILPLLQRAT 172
           R  +  ++++ P W R+ G DH FV+  D  A +H ++E A      ++ G    L   +
Sbjct: 166 REVVDRVTAH-PAWRRSGGRDHVFVLT-DPVAMWHVRKEIAPAILLVVDFGGWYKLDSNS 223

Query: 173 LVQTFGQR-NHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEG 231
               F     H  ++     I PY     +   Q+  +  R+  +YF+G      +   G
Sbjct: 224 ASSNFSHMIQHTQVSLLKDVIVPYT--HLLPTMQLSENKERTTLLYFKG----AKHRHRG 277

Query: 232 GYYARGARAAVWENFKNNPLFDISTDHPTTYYED-----MQRAIFCLCPLGWAPWSPRLV 286
           G      R  +W+   N P   +   +P     +     M+ + FCL P G  P S RL 
Sbjct: 278 GL----VREKLWDLMVNEPDVVMEEGYPNATGREQSIKGMRTSEFCLHPAGDTPTSCRLF 333

Query: 287 EAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDV--PK-LDTILTSIPPEVILRKQ 343
           +AV   CIPVI++D+I LPF   I + E  +FV+  +   PK L   L ++P +     +
Sbjct: 334 DAVASLCIPVIVSDEIELPFEGMIDYTEFVIFVSVSNAMRPKWLTNYLRNVPRQKKDEFR 393

Query: 344 RLLAN--PSMKRAMLFP----QPAQPGDAFHQILNGLARKLPHDKSTYLKPGGK 391
           R +A+  P  +   ++P      AQ G A + I   + +KLP  +   ++   K
Sbjct: 394 RNMAHVQPIFEYDSIYPGRMTSAAQDG-AVNHIWKKIHQKLPMIQEAVIREKRK 446


>gi|297833038|ref|XP_002884401.1| hypothetical protein ARALYDRAFT_477609 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297330241|gb|EFH60660.1| hypothetical protein ARALYDRAFT_477609 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 495

 Score = 71.6 bits (174), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 77/322 (23%), Positives = 119/322 (36%), Gaps = 32/322 (9%)

Query: 80  SPVRTLNPEEADWFYTPIYPTCD---LTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTE 136
           + VR  N  EAD  + P + +      +           + ++  +    ++   W  + 
Sbjct: 192 AAVRVKNSSEADVVFVPFFSSLSYNRFSKVNQKQKKSQDKELQVNVVKYVTSQKEWKISG 251

Query: 137 GADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYA 196
           G DH  +  H            ++      L     +V  FG+ +    N     + PY 
Sbjct: 252 GKDHVIMAHH----------PNSMSTARHKLYPAMFVVADFGRYSPHVANIDKDIVAPYK 301

Query: 197 PPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDIST 256
                 A+       R I +YF+G  Y       GG+  +     V  +F +     +  
Sbjct: 302 HLVPSYANDTSGFDGRPILLYFQGAIYR----KAGGFVRQELYKDVHFSFGS-----VRN 352

Query: 257 DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG 316
              T   E M+ + FCL   G  P S RL +A+   CIPVII+DDI LP+ D + + E  
Sbjct: 353 HGITKAGEGMRSSKFCLNIAGDTPSSNRLFDAIASHCIPVIISDDIELPYEDVLNYNEFC 412

Query: 317 MFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD-AFHQILN 372
           +FV   D  K   L  ++ SI  +   +    L        + FP     GD A   I  
Sbjct: 413 LFVRSSDALKKGFLMGLVKSIGRDEYNKMWLRLKEVERYFDLRFPTKDDEGDYAVQMIWE 472

Query: 373 GLARKLP------HDKSTYLKP 388
            +ARK P      H    Y KP
Sbjct: 473 AVARKAPLVKMKVHRFQRYTKP 494


>gi|5882750|gb|AAD55303.1|AC008263_34 F25A4.34 [Arabidopsis thaliana]
 gi|12324821|gb|AAG52383.1|AC011765_35 unknown protein; 115857-117304 [Arabidopsis thaliana]
          Length = 458

 Score = 71.6 bits (174), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 78/329 (23%), Positives = 127/329 (38%), Gaps = 40/329 (12%)

Query: 79  SSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSP---RMMRSAIQLISSNWPYWNRT 135
           S+ +R  N  EAD  + P + +            ++    R+++  +     +   W R 
Sbjct: 142 SAAIRVKNSNEADIVFVPFFASLSYNRKSKLRGNETSSDDRLLQERLVEFLKSQDEWKRF 201

Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQT-FGQRNHVCLNEGSITIPP 194
           +G DH  V  H     +               L  A  V + FG+ +    N     I P
Sbjct: 202 DGKDHLIVAHHPNSLLYARN-----------FLGSAMFVLSDFGRYSSAIANLEKDIIAP 250

Query: 195 YAPPQK-MQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNP--- 250
           Y    K +  ++      R +  YF+G  Y      +GG      R  ++   K+     
Sbjct: 251 YVHVVKTISNNESASFEKRPVLAYFQGAIYR----KDGG----TIRQELYNLLKDEKDVH 302

Query: 251 --LFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
                +  +      + M  + FCL   G  P S RL +A+V  C+PVII+D I LPF D
Sbjct: 303 FAFGTVRGNGTKQTGKGMASSKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDQIELPFED 362

Query: 309 AIPWEEIGMFV-AEEDVPK--LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
            + +    +FV A E V K  L  IL  I  +   +K   L    +     +  P+Q GD
Sbjct: 363 TLDYSGFSVFVHASEAVKKEFLVNILRGITEDQWKKKWGRL--KEVAGCFEYRFPSQVGD 420

Query: 366 AFHQILNGLARKLP------HDKSTYLKP 388
           + + I + ++ KL       H K+ Y + 
Sbjct: 421 SVNMIWSAVSHKLSSLQFDVHRKNRYRRS 449


>gi|326525254|dbj|BAK07897.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 440

 Score = 71.6 bits (174), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 77/309 (24%), Positives = 126/309 (40%), Gaps = 29/309 (9%)

Query: 82  VRTLNPEEADWFYTPIYPTCDLTPTGLPLP---FKSPRMMRSAIQLISSNWPYWNRTEGA 138
           VR  +P  A+ F+ P + +      G  +     ++ R+++  +  I     YW R+ G 
Sbjct: 132 VRVRDPAAAEAFFVPFFSSLSFNVHGRNMTDPDTEADRLLQVELMDILGKSEYWQRSAGR 191

Query: 139 DHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAP- 197
           DH  V+P      F +  +  +   +L       +V  FG+      +     + PY   
Sbjct: 192 DH--VIPMHHPNAFRFMRD-MVNASVL-------IVSDFGRYTKELASLRKDVVAPYVHV 241

Query: 198 -PQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDIS- 255
               +  +   P       ++FRG         EG    RG  A + ++ ++   F+ S 
Sbjct: 242 VDSFLDDNASDPFEADPTLLFFRGRPV---RKAEGKI--RGKLAKILKD-RDGVRFEDSL 295

Query: 256 --TDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE 313
              D      + M+ + FCL P G  P S RL +A+V  CIPVII+  I LPF D I + 
Sbjct: 296 AIGDGIKISTDGMRSSKFCLHPAGDTPSSCRLFDAIVSHCIPVIISSRIELPFEDEIDYS 355

Query: 314 EIGMFVAEEDVPKLDTILTSI---PPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQI 370
           E   F + E+  + D +L  +   P E  +     L N S      F  P +  DA + I
Sbjct: 356 EFSPFFSVEEALEPDYLLNQLRQMPKEKWVEMWSKLKNVSSHYE--FQYPPRKDDAVNMI 413

Query: 371 LNGLARKLP 379
              +  K+P
Sbjct: 414 WRHVRHKIP 422


>gi|18410670|ref|NP_565089.1| exostosin-like protein [Arabidopsis thaliana]
 gi|16209709|gb|AAL14411.1| At1g74680/F1M20_36 [Arabidopsis thaliana]
 gi|27363220|gb|AAO11529.1| At1g74680/F1M20_36 [Arabidopsis thaliana]
 gi|332197500|gb|AEE35621.1| exostosin-like protein [Arabidopsis thaliana]
          Length = 461

 Score = 71.6 bits (174), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 78/329 (23%), Positives = 127/329 (38%), Gaps = 40/329 (12%)

Query: 79  SSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSP---RMMRSAIQLISSNWPYWNRT 135
           S+ +R  N  EAD  + P + +            ++    R+++  +     +   W R 
Sbjct: 145 SAAIRVKNSNEADIVFVPFFASLSYNRKSKLRGNETSSDDRLLQERLVEFLKSQDEWKRF 204

Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQT-FGQRNHVCLNEGSITIPP 194
           +G DH  V  H     +               L  A  V + FG+ +    N     I P
Sbjct: 205 DGKDHLIVAHHPNSLLYARN-----------FLGSAMFVLSDFGRYSSAIANLEKDIIAP 253

Query: 195 YAPPQK-MQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNP--- 250
           Y    K +  ++      R +  YF+G  Y      +GG      R  ++   K+     
Sbjct: 254 YVHVVKTISNNESASFEKRPVLAYFQGAIYR----KDGG----TIRQELYNLLKDEKDVH 305

Query: 251 --LFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
                +  +      + M  + FCL   G  P S RL +A+V  C+PVII+D I LPF D
Sbjct: 306 FAFGTVRGNGTKQTGKGMASSKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDQIELPFED 365

Query: 309 AIPWEEIGMFV-AEEDVPK--LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
            + +    +FV A E V K  L  IL  I  +   +K   L    +     +  P+Q GD
Sbjct: 366 TLDYSGFSVFVHASEAVKKEFLVNILRGITEDQWKKKWGRL--KEVAGCFEYRFPSQVGD 423

Query: 366 AFHQILNGLARKLP------HDKSTYLKP 388
           + + I + ++ KL       H K+ Y + 
Sbjct: 424 SVNMIWSAVSHKLSSLQFDVHRKNRYRRS 452


>gi|302832868|ref|XP_002947998.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
 gi|300266800|gb|EFJ50986.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
          Length = 638

 Score = 71.6 bits (174), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 78/322 (24%), Positives = 124/322 (38%), Gaps = 62/322 (19%)

Query: 41  KVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPT 100
           ++YVY++P K++        R    +F   +F  R L S  V   + EEADWF+ P+   
Sbjct: 150 RIYVYDIPHKFSS--WYNPTRVDRELFW--VFWERLLGSGAV-VADGEEADWFWLPVK-- 202

Query: 101 CDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAI 160
                  L       R++  AIQ + + WP+++R +G  HF +   D G          +
Sbjct: 203 -------LRSTTDGYRLL-EAIQYVRTEWPWYDRLQGHRHFVIHTGDTG-------RGEV 247

Query: 161 ERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYF-- 218
            R I       T +  +G                +       AH+ P     ++FV    
Sbjct: 248 AREIRDATANMTWLHHWG------------LWEDWNASGWKAAHR-PGKIGLALFVSRCG 294

Query: 219 -RGLFYDVNNDPEG--GYYA-----------RGARAAVWENFKNNPLFDISTDHPTTYYE 264
             GL      D  G  G+ A              R AV  +  N   + I T       +
Sbjct: 295 DSGLLGSPTRDVGGVDGWAAPPGAEIAEDGHTAVRGAVHYHHHNRTGYKIVTGDSRYPLD 354

Query: 265 DMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDV 324
            +     C C L             V GC+P+II+D ++ PF   + W+ IG+ +A ED+
Sbjct: 355 LLTYKYVCDCVL-----------VSVMGCLPLIISDSVMQPFEPEMDWDRIGLRLAHEDI 403

Query: 325 PKLDTILTSIPPEVILRKQRLL 346
           P L   L +I  E + R++  L
Sbjct: 404 PTLHERLAAISDEELDRRRAAL 425


>gi|255547405|ref|XP_002514760.1| catalytic, putative [Ricinus communis]
 gi|223546364|gb|EEF47866.1| catalytic, putative [Ricinus communis]
          Length = 490

 Score = 71.6 bits (174), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 96/388 (24%), Positives = 146/388 (37%), Gaps = 73/388 (18%)

Query: 38  GKLKVYVYELPSKYNKKLL------------LKD-------PRCLTHMFAAEIFMHRFLL 78
           G LKVY+Y++P +++  LL            + D       P  L    + E ++   LL
Sbjct: 110 GLLKVYMYDMPPEFHFGLLGWKGKANQIWPNVDDLDHIPLYPGGLNLQHSIEYWLTLDLL 169

Query: 79  SS----------PVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSP----RMMRSAIQL 124
           +S           VR  N  +AD  + P + +         L  K      +M+++ +  
Sbjct: 170 ASNRPKVVRPCGAVRVDNSSQADIIFVPYFSSLSYNRHS-KLHGKEKVSMNKMLQNRLVE 228

Query: 125 ISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVC 184
                  W R+ G DH  V         H+       R +L       ++  FG+     
Sbjct: 229 FLMGQDEWKRSGGRDHLIVA--------HHPNSMLDARKMLG--AAMFVLADFGRYPVEI 278

Query: 185 LNEGSITIPPYAPPQKMQAHQIPPDTP-----RSIFVYFRGLFYDVNNDPEGG------Y 233
            N     I PY    K     IP         R I V+F+G  Y      +GG      Y
Sbjct: 279 ANLKKDVIAPY----KHVVRTIPSGESAQFEERPILVFFQGAIYR----KDGGIIRQELY 330

Query: 234 YARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGC 293
           Y       V   F       +  +      + M  + FCL   G  P S RL +A+V  C
Sbjct: 331 YLLKDEKDVHFTFGT-----VRKNGVNKAGQGMASSKFCLNIAGDTPSSNRLFDAIVSHC 385

Query: 294 IPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKR 353
           +PVII+DDI LPF D + + E  +FV   D  K   +L  +  + I R +  +    +K 
Sbjct: 386 VPVIISDDIELPFEDVLDYSEFSVFVRASDAVKEGYLLNLL--QSIDRDKWTMMWERLKE 443

Query: 354 A---MLFPQPAQPGDAFHQILNGLARKL 378
                 +  P+Q GDA   I   ++RKL
Sbjct: 444 IAPHFEYQYPSQSGDAVDMIWQAVSRKL 471


>gi|15228598|ref|NP_187015.1| exostosin-like protein [Arabidopsis thaliana]
 gi|6091757|gb|AAF03467.1|AC009327_6 hypothetical protein [Arabidopsis thaliana]
 gi|30102722|gb|AAP21279.1| At3g03650 [Arabidopsis thaliana]
 gi|110736553|dbj|BAF00242.1| hypothetical protein [Arabidopsis thaliana]
 gi|332640446|gb|AEE73967.1| exostosin-like protein [Arabidopsis thaliana]
          Length = 499

 Score = 71.6 bits (174), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 72/308 (23%), Positives = 115/308 (37%), Gaps = 22/308 (7%)

Query: 80  SPVRTLNPEEADWFYTPIYPTCD---LTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTE 136
           + +R  N  EAD  + P + +      +           + ++  +    ++   W  + 
Sbjct: 190 AAIRVKNSSEADVVFVPFFSSLSYNRFSKVNQKQKKSQDKELQENVVKYVTSQKEWKTSG 249

Query: 137 GADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYA 196
           G DH  +  H            ++      L     +V  FG+ +    N     + PY 
Sbjct: 250 GKDHVIMAHH----------PNSMSTARHKLFPAMFVVADFGRYSPHVANVDKDIVAPYK 299

Query: 197 PPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDIST 256
                  +       R I +YF+G  Y       GG+  +     + E    +  F    
Sbjct: 300 HLVPSYVNDTSGFDGRPILLYFQGAIYR----KAGGFVRQELYNLLKEEKDVHFSFGSVR 355

Query: 257 DHPTTYY-EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEI 315
           +H  +   E M+ + FCL   G  P S RL +A+   CIPVII+DDI LP+ D + + E 
Sbjct: 356 NHGISKAGEGMRSSKFCLNIAGDTPSSNRLFDAIASHCIPVIISDDIELPYEDVLNYNEF 415

Query: 316 GMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD-AFHQIL 371
            +FV   D  K   L  ++ SI  E   +    L        + FP     GD A   I 
Sbjct: 416 CLFVRSSDALKKGFLMGLVRSIGREEYNKMWLRLKEVERYFDLRFPVKDDEGDYAVQMIW 475

Query: 372 NGLARKLP 379
             +ARK P
Sbjct: 476 KAVARKAP 483


>gi|219363227|ref|NP_001136572.1| hypothetical protein precursor [Zea mays]
 gi|194696226|gb|ACF82197.1| unknown [Zea mays]
 gi|414866559|tpg|DAA45116.1| TPA: hypothetical protein ZEAMMB73_757616 [Zea mays]
          Length = 453

 Score = 71.6 bits (174), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 86/381 (22%), Positives = 150/381 (39%), Gaps = 63/381 (16%)

Query: 40  LKVYVYELPSKYNKKLL--------------LKD-----PRCLTHMFAAEIFMHRFLLSS 80
           L+V++Y+LP +++  LL              ++D     P  L    + E ++   LL+S
Sbjct: 60  LRVFMYDLPPEFHFGLLDWKPPGFGGGVWPDVRDGVPDYPGGLNLQHSIEYWLTLDLLAS 119

Query: 81  ---------PVRTLNPEEADWFYTPIYPTCDLTPTGLPLP----FKSPRMMRSAIQLISS 127
                      R  +  +AD  + P + +         +P     +   + R  ++ +++
Sbjct: 120 EQGAPTPCAAARVRHAADADVVFVPFFASLSFNRHSRVVPPARNSEDRALQRRLLEFLAA 179

Query: 128 NWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQR----ATLVQTFGQRNHV 183
             P W RT G DH  +  H                G+L    R      ++  FG+    
Sbjct: 180 R-PEWRRTGGRDHVVLAHH--------------PNGMLDARYRFWPCVFVLCDFGRYPPS 224

Query: 184 CLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFY--DVNNDPEGGYYARGARAA 241
             N     I PY       A+       R   +YF+G  Y  D  +  +  YY       
Sbjct: 225 VANLDKDIIAPYRHLVANFANDTAGYDDRPTLLYFQGAIYRKDGGSIRQELYYLLKDEKD 284

Query: 242 VWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 301
           V  +F +     ++ +      + M+ + FCL   G  P S RL +++V  C+PVII+D+
Sbjct: 285 VHFSFGS-----VAGNGIEQATQGMRSSKFCLNIAGDTPSSNRLFDSIVSHCVPVIISDE 339

Query: 302 IVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFP 358
           I LPF D + + +  + V   D  K   L +++  I  E   R    L    +++   + 
Sbjct: 340 IELPFEDVLDYSKFSVIVRGADAVKKGFLKSLIKGISQEEWTRMWNKLKE--VEKHFEYQ 397

Query: 359 QPAQPGDAFHQILNGLARKLP 379
            P+Q  DA   I   +ARK+P
Sbjct: 398 YPSQTDDAVQMIWKAIARKVP 418


>gi|15241619|ref|NP_199306.1| Exostosin family protein [Arabidopsis thaliana]
 gi|30694651|ref|NP_851132.1| Exostosin family protein [Arabidopsis thaliana]
 gi|10177484|dbj|BAB10875.1| unnamed protein product [Arabidopsis thaliana]
 gi|15081733|gb|AAK82521.1| AT5g44930/K21C13_11 [Arabidopsis thaliana]
 gi|27363262|gb|AAO11550.1| At5g44930/K21C13_11 [Arabidopsis thaliana]
 gi|332007793|gb|AED95176.1| Exostosin family protein [Arabidopsis thaliana]
 gi|332007794|gb|AED95177.1| Exostosin family protein [Arabidopsis thaliana]
          Length = 443

 Score = 71.2 bits (173), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 98/420 (23%), Positives = 168/420 (40%), Gaps = 75/420 (17%)

Query: 3   VTAIALLLAFVSADNTPKIERISGSAGDVLEDDPVG--------KLKVYVYELPSKYNKK 54
           V ++ L+  FV+    P     SGS    L D            K KVY+YELP+ +   
Sbjct: 20  VLSVFLVFVFVNTFFYPSFYSDSGSIRRNLVDSRESFHFPGNFRKTKVYMYELPTNFTYG 79

Query: 55  LLL-----KDPRCLTHMFAAEIFMHRFLLSSP-------------VRTLNPEEADWFYTP 96
           ++      K        +     MH + L S              VR  +P EAD FY  
Sbjct: 80  VIEQHGGEKSDDVTGLKYPGHQHMHEWYLYSDLTRPEVKRVGSPIVRVFDPAEADLFYVS 139

Query: 97  IYPTCDL-TPTGLP-LPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHY 154
            + +  L   +G P   +    M  S +  + S   +W R  G DH  V           
Sbjct: 140 AFSSLSLIVDSGRPGFGYSDEEMQESLVSWLESQ-EWWRRNNGRDHVIVAG--------- 189

Query: 155 QEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSIT---IPPYAPPQKMQAHQIPPDTP 211
            +  A++R ++  ++ A L+ T   R  +  ++GS+    I PY+   ++ A++      
Sbjct: 190 -DPNALKR-VMDRVKNAVLLVTDFDR--LRADQGSLVKDVIIPYS--HRIDAYEGELGVK 243

Query: 212 -RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAI 270
            R+  ++F G  Y      +GG      R  +++  +     D+     T   E+M RA+
Sbjct: 244 QRTNLLFFMGNRYR----KDGG----KVRDLLFKLLEKEE--DVVIKRGTQSRENM-RAV 292

Query: 271 --------FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
                   FCL   G    + RL +A+   C+PVI++D I LPF D I + +  +F+  +
Sbjct: 293 KQGMHTSKFCLHLAGDTSSACRLFDAIASLCVPVIVSDGIELPFEDVIDYRKFSIFLRRD 352

Query: 323 DVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
              K   +   L  + P  IL+ Q++     MK    +        + ++I   + +K+P
Sbjct: 353 AALKPGFVVKKLRKVKPGKILKYQKV-----MKEVRRYFDYTHLNGSVNEIWRQVTKKIP 407


>gi|159471277|ref|XP_001693783.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158283286|gb|EDP09037.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 611

 Score = 71.2 bits (173), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 78/344 (22%), Positives = 134/344 (38%), Gaps = 72/344 (20%)

Query: 41  KVYVYELPSKYN--KKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
           +VY+Y+LP +    +  +  D   + H           L +S  R  +  +ADW++ P+ 
Sbjct: 157 RVYIYDLPPELTTWRSDINLDRWTIRHFL-------EMLTASGARVADGAKADWYFIPVR 209

Query: 99  PTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
                             +++ AI  +    P++N T G  HF +   D G         
Sbjct: 210 LRSS----------SDAYVLQRAIHHLRHAHPWFNATGGGRHFVIAVGDMG-------RL 252

Query: 159 AIERGILPLLQRATLVQTFG---------------QRNHVCLNEGSITIPPYAPPQKMQA 203
             ERG  PL    T V  +G               + +H   N   I +P Y   +K++ 
Sbjct: 253 ESERG--PLSANVTFVSHWGLASSKKAERLNSSPWRASH--RNATDIVLPVYISLRKLEK 308

Query: 204 HQI-------------PPDTPRS---IFVYFRGLFYDVN---------NDPEGGYYARGA 238
           + I             PP+       +F +   +  D +         N P    Y+   
Sbjct: 309 YGITRSRHHPKFATMAPPEIRERNGPLFWFAGRICQDRSKPATDGVWPNCPGAMGYSAMT 368

Query: 239 RAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 298
           R AV  +  + P + ++      Y   +  + FC   +G      R ++A + GC+PV+I
Sbjct: 369 RQAVHFHHWDRPGYFVAPG-DKNYSRHLLTSKFCFGAMGGG-HGQRQLQAALAGCVPVVI 426

Query: 299 ADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRK 342
            D ++  +   + W + G+ VAE D+P+L TIL +I PE   RK
Sbjct: 427 GDGVLEAWEPYLDWNDFGVRVAEADIPRLHTILGAIGPEEYARK 470


>gi|159483551|ref|XP_001699824.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158281766|gb|EDP07520.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 740

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 44/133 (33%), Positives = 66/133 (49%), Gaps = 8/133 (6%)

Query: 248 NNPLF--DISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
           N P F   I+  H  T    M  + FCL P GW  W  RL+EAV  GC+PV++ D +  P
Sbjct: 596 NRPDFVLSIAGRHSQTL---MASSRFCLAPSGWG-WGVRLLEAVACGCVPVVVQDQVYQP 651

Query: 306 FADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
             D +P++E  + +    + +L  +L ++ P  +   Q  LA     RA L+   +  G 
Sbjct: 652 LWDVVPYDEFAVVLPRSQLHRLPQLLDAVGPGQLAALQAGLAR--WHRAFLYRHHSPSGL 709

Query: 366 AFHQILNGLARKL 378
           AF+  L  L R+L
Sbjct: 710 AFNYTLAALRRRL 722


>gi|255075893|ref|XP_002501621.1| hypothetical protein MICPUN_99882 [Micromonas sp. RCC299]
 gi|226516885|gb|ACO62879.1| hypothetical protein MICPUN_99882 [Micromonas sp. RCC299]
          Length = 918

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 51/179 (28%), Positives = 81/179 (45%), Gaps = 50/179 (27%)

Query: 42  VYVYELPSKYNKKLL--LKDPR-CLT-------------HMFAAEIFMHRFLLSSPVRTL 85
           +YVY+L  K+    L   +D R C+T             +++ AE+ +H  LL SP RT 
Sbjct: 419 IYVYDLLPKFTTAQLQHRQDVRKCVTRFAEEGNATRFEDNLYGAEVALHELLLDSPHRTD 478

Query: 86  NPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSA-----------------------I 122
           NPE AD+F+ P+Y  C ++    P P  S ++                           +
Sbjct: 479 NPEIADFFFVPMYHFCFISRLQQPTPGHSQQLFSRTRGVGCDLRGSHVDAAFQHLFVPVL 538

Query: 123 QLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRN 181
           + +  ++P+WNRT+GADH     HD GAC+  +E              ATL+  +G+R+
Sbjct: 539 EHLRRDYPWWNRTDGADHIVPFLHDEGACYAPRE-----------FGDATLLVHWGRRD 586


>gi|356516800|ref|XP_003527081.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
           [Glycine max]
          Length = 493

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 94/399 (23%), Positives = 150/399 (37%), Gaps = 78/399 (19%)

Query: 40  LKVYVYELPSKY-------------------NKKLLLKDPRCLTHMFAAEIFMHRFLLSS 80
           LKV++Y+LP ++                   N K + + P  L    + E ++   LLSS
Sbjct: 114 LKVFMYDLPPEFHFGLLGWKRSVNQTWPEVNNPKRIPRYPGGLNLQHSMEYWLTLDLLSS 173

Query: 81  PV-------RTLNPEEADWFYTPIYPTCDL---TPTGLPLPFKSPRMMRSAIQLISSNWP 130
            V       R  +  +AD  + P + +      +           +M++  +        
Sbjct: 174 KVGQPCTAIRVQDSSQADVIFVPFFSSLSYNRHSKLNGEEKVSLNKMLQDRLVQFLMGQK 233

Query: 131 YWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSI 190
            W R+ G DH  V  H            ++      L     ++  FG+      N    
Sbjct: 234 EWKRSGGKDHLIVAHH----------PNSLLDARRKLGAAMLVLADFGRYPTELANIKKD 283

Query: 191 TIPPYA------PPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGG------YYARGA 238
            I PY       P  K  + +      R+  VYF+G  Y      +GG      YY    
Sbjct: 284 IIAPYRHLVSTIPKAKSASFE-----KRTTLVYFQGAIYR----KDGGAIRQELYYLLKD 334

Query: 239 RAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 298
              V   F +     I  +      + M  + FCL   G  P S RL +A+V  C+PVII
Sbjct: 335 EKDVHFTFGS-----IGGNGINQASQGMAMSKFCLNIAGDTPSSNRLFDAIVSHCVPVII 389

Query: 299 ADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRK-QRLLANPSMKRA 354
           +D+I LPF D + + +  +FV   D  K   L  +L SI  +   +  +RL     +   
Sbjct: 390 SDEIELPFEDVLDYSDFSIFVRASDSMKKGYLLNLLRSITQKEWSKMWERL---KQITHH 446

Query: 355 MLFPQPAQPGDAFHQILNGLARKLP------HDKSTYLK 387
             +  P+QPGDA + I   + RK+       H K+ Y +
Sbjct: 447 FEYQYPSQPGDAVNMIWQQVERKISSIRFNLHRKNRYQR 485


>gi|414587899|tpg|DAA38470.1| TPA: hypothetical protein ZEAMMB73_317055 [Zea mays]
          Length = 326

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 38/74 (51%), Positives = 50/74 (67%), Gaps = 11/74 (14%)

Query: 20  KIERISGSAGDV---------LEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMF--A 68
           +IE+I  S  ++         LED+P+GKLKVYV++L SKYNKKLL KDPRCL HMF  +
Sbjct: 130 RIEKILKSGANIVLTTKRINGLEDNPIGKLKVYVHDLLSKYNKKLLQKDPRCLNHMFHVS 189

Query: 69  AEIFMHRFLLSSPV 82
            +I     LLS+P+
Sbjct: 190 QKISFSCPLLSAPI 203


>gi|224145217|ref|XP_002325567.1| predicted protein [Populus trichocarpa]
 gi|222862442|gb|EEE99948.1| predicted protein [Populus trichocarpa]
          Length = 476

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 88/350 (25%), Positives = 135/350 (38%), Gaps = 66/350 (18%)

Query: 83  RTLNPEEADWFYTPIYPT--CDLTPTGLPLPFKSP------RMMRSAIQLISSNWPYWNR 134
           R  +  EAD  + P + T   ++        F+        R  +  + ++  N   W R
Sbjct: 139 RVFDCNEADVVFVPFFATLSAEMELAKGKGSFRRKEGNEDYRRQKQVVDIVR-NSDAWKR 197

Query: 135 TEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG---------------- 178
           + G DH FV+  D  A +H + E  I   IL       LV  FG                
Sbjct: 198 SGGKDHVFVLT-DPVAMWHLRAE--IAPAIL-------LVVDFGGWYRLDSKSSNGSSSD 247

Query: 179 --QRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYAR 236
             Q   V L +  I   PY     +   Q+  +  RS  +YF+G      +   GG    
Sbjct: 248 MIQHTQVSLLKDVIV--PYT--HLLPRLQLSENKKRSTLLYFKG----AKHRHRGGI--- 296

Query: 237 GARAAVWENFKNNPLFDISTDHPTTYYED-----MQRAIFCLCPLGWAPWSPRLVEAVVF 291
             R  +W+   N P   I    P     +     M+ + FCL P G  P S RL +A+  
Sbjct: 297 -VREKLWDLLVNEPGVIIEEGFPNATGREQSIRGMRSSEFCLHPAGDTPSSCRLFDAIQS 355

Query: 292 GCIPVIIADDIVLPFADAIPWEEIGMFVAEED-------VPKLDTILTSIPPEVILRKQR 344
            CIPV+++D+I LPF   + + E  +FVA +D       V +L +I  S+      R+  
Sbjct: 356 LCIPVVVSDNIELPFEGMVDYTEFAVFVAVDDALKPRWLVDRLRSI--SVKQRNEFRRNM 413

Query: 345 LLANPSMKRAMLFP---QPAQPGDAFHQILNGLARKLPHDKSTYLKPGGK 391
               P ++     P    P  P  A + I   + +KLP  K   ++   K
Sbjct: 414 AKVQPILQYDNGHPGGIGPISPDGAVNHIWKKVLQKLPAIKEAVVRERRK 463


>gi|359359217|gb|AEV41121.1| putative exostosin family domain-containing protein [Oryza
           officinalis]
          Length = 460

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 77/288 (26%), Positives = 124/288 (43%), Gaps = 40/288 (13%)

Query: 116 RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKA------IERG---ILP 166
           R  R  +  ++++ P W R+ G DH FV+  D  A +H ++E A      ++ G    L 
Sbjct: 164 RRQREVVDRVTAH-PAWRRSGGRDHVFVLT-DPVAMWHVRKEIAPSILLVVDFGGWYKLD 221

Query: 167 LLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVN 226
               ++ V    Q   V L +  I   PY     +   Q+  +  R   +YF+G      
Sbjct: 222 SNSASSNVSHMIQHTQVSLLKDVIV--PYT--HLLPTMQLSENKDRLTLLYFKG----AK 273

Query: 227 NDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYED-----MQRAIFCLCPLGWAPW 281
           +   GG      R  +W+   N P   +   +P     +     M+ + FCL P G  P 
Sbjct: 274 HRHRGGL----VREKLWDLMVNEPDVVMEEGYPNATGREQSIKGMRTSEFCLHPAGDTPT 329

Query: 282 SPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEV 338
           S RL +AV   CIPVI++D+I LPF   I + E  +FV+  +  +   L   L ++P + 
Sbjct: 330 SCRLFDAVASLCIPVIVSDEIELPFEGMIDYTEFTIFVSVSNAMRPKWLTNYLRNVPRQQ 389

Query: 339 -------ILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
                  + R Q +    S+ R  +    AQ G A + I   + +KLP
Sbjct: 390 KDEFRRNMARVQPIFEYDSIYRGRM-TSAAQDG-AVNHIWKKIYQKLP 435


>gi|359359170|gb|AEV41075.1| putative exostosin family domain-containing protein [Oryza minuta]
          Length = 317

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 77/288 (26%), Positives = 124/288 (43%), Gaps = 40/288 (13%)

Query: 116 RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKA------IERG---ILP 166
           R  R  +  ++++ P W R+ G DH FV+  D  A +H ++E A      ++ G    L 
Sbjct: 21  RRQREVVDRVTAH-PAWRRSGGRDHVFVLT-DPVAMWHVRKEIAPSILLVVDFGGWYKLD 78

Query: 167 LLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVN 226
               ++ V    Q   V L +  I   PY     +   Q+  +  R   +YF+G      
Sbjct: 79  SNSASSNVSHMIQHTQVSLLKDVIV--PYT--HLLPTMQLSENKDRLTLLYFKG----AK 130

Query: 227 NDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYED-----MQRAIFCLCPLGWAPW 281
           +   GG      R  +W+   N P   +   +P     +     M+ + FCL P G  P 
Sbjct: 131 HRHRGGL----VREKLWDLMVNEPDVVMEEGYPNATGREQSIKGMRTSEFCLHPAGDTPT 186

Query: 282 SPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEV 338
           S RL +AV   CIPVI++D+I LPF   I + E  +FV+  +  +   L   L ++P + 
Sbjct: 187 SCRLFDAVASLCIPVIVSDEIELPFEGMIDYTEFTIFVSVSNAMRPKWLTNYLRNVPRQQ 246

Query: 339 -------ILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
                  + R Q +    S+ R  +    AQ G A + I   + +KLP
Sbjct: 247 KDEFRRNMARVQPIFEYDSIYRGRM-TSAAQDG-AVNHIWKKIYQKLP 292


>gi|359359122|gb|AEV41028.1| putative exostosin family domain-containing protein [Oryza minuta]
          Length = 459

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 64/232 (27%), Positives = 103/232 (44%), Gaps = 27/232 (11%)

Query: 119 RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKA------IERGILPLLQRAT 172
           R  +  ++++ P W R+ G DH FV+  D  A +H ++E A      ++ G    L   +
Sbjct: 166 REVVDRVTAH-PAWRRSGGRDHVFVLT-DPVAMWHVRKEIAPAILLVVDFGGWYKLDSNS 223

Query: 173 LVQTFGQR-NHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEG 231
               F     H  ++     I PY     +   Q+  +  R+  +YF+G      +   G
Sbjct: 224 ASSNFSHMIQHTQVSLLKDVIVPYT--HLLPTMQLSENKERTTLLYFKG----AKHRHRG 277

Query: 232 GYYARGARAAVWENFKNNPLFDISTDHPTTYYED-----MQRAIFCLCPLGWAPWSPRLV 286
           G      R  +W+   N P   +   +P     +     M+ + FCL P G  P S RL 
Sbjct: 278 GL----VREKLWDLMVNEPDVVMEEGYPNATGREQSIKGMRTSEFCLHPAGDTPTSCRLF 333

Query: 287 EAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDV--PK-LDTILTSIP 335
           +AV   CIPVI++D+I LPF   I + E  +FV+  +   PK L   L ++P
Sbjct: 334 DAVASLCIPVIVSDEIELPFEGMIDYTEFVIFVSVSNAMRPKWLTNYLRNVP 385


>gi|357112427|ref|XP_003558010.1| PREDICTED: probable glycosyltransferase At3g07620-like
           [Brachypodium distachyon]
          Length = 464

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 73/313 (23%), Positives = 124/313 (39%), Gaps = 35/313 (11%)

Query: 80  SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLP----FKSPRMMRSAIQLISSNWPYWNRT 135
           + VR  +P  AD  + P + +         +P     +   + R  I+ +++  P W R+
Sbjct: 139 NAVRVRDPARADVVFVPFFASLSFNRHSKVVPPARTSEDRALQRRLIEFLAAR-PEWRRS 197

Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
            G DH  +  H  G          +      L     ++  FG+  H   N     I PY
Sbjct: 198 GGRDHVVLAHHPNG----------MLDARYKLWPCVFVLCDFGRYPHSVANIDKDVIAPY 247

Query: 196 APPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGG------YYARGARAAVWENFKNN 249
                   +       R   +YF+G  Y      +GG      YY       V  +F + 
Sbjct: 248 LHVVGNFFNDSAGYDARPTLLYFQGAIYR----KDGGFIRQELYYLLKDEKDVHFSFGS- 302

Query: 250 PLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 309
               ++ +      + M+ + FCL   G  P S RL +++V  C+P+II+D+I LPF D 
Sbjct: 303 ----VAGNGIEQSTQGMRASKFCLNIAGDTPSSNRLFDSIVSHCVPIIISDEIELPFEDV 358

Query: 310 IPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMK---RAMLFPQPAQPGDA 366
           + + +  + V   D  K   ++  I  + I R++       +K   R   +  P+Q  DA
Sbjct: 359 LDYSKFCIIVRGVDAVKKGFLINLI--KGISRQEWTSMWNKLKEVERHFEYQYPSQHDDA 416

Query: 367 FHQILNGLARKLP 379
              I   +ARK+P
Sbjct: 417 VQMIWKTIARKVP 429


>gi|449528899|ref|XP_004171439.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
           At3g07620-like, partial [Cucumis sativus]
          Length = 173

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 44/143 (30%), Positives = 69/143 (48%), Gaps = 15/143 (10%)

Query: 240 AAVWENFKNNPLFDISTDHPTT------YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGC 293
           A VWEN   +   DIS +  +       Y +   +  FC+CP G    S R+ +++ +GC
Sbjct: 29  ARVWEN---DTELDISNNRISRATGHLLYQKRFYKTKFCICPGGSQVNSARIADSIHYGC 85

Query: 294 IPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKR 353
           +PVI++D   LPF D + W +  + V E DV +L  IL  I     ++  + L    M+ 
Sbjct: 86  VPVILSDYYDLPFNDILDWRKFSVIVKERDVYQLKQILKDISDIEFIKLHKNL----MQV 141

Query: 354 AMLFPQPAQP--GDAFHQILNGL 374
              F   + P   DAFH ++  L
Sbjct: 142 QKHFQWNSXPIKYDAFHMVMYDL 164


>gi|4678284|emb|CAB41192.1| putative protein [Arabidopsis thaliana]
          Length = 736

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 76/320 (23%), Positives = 130/320 (40%), Gaps = 64/320 (20%)

Query: 29  GDVLEDDPVGKLK---VYVYELPSKYNKKLLLKDP---RCLTHMF---AAEIFMHRFLLS 79
           G+++    V K K   +Y+Y+LP  +N  L+        C+  ++    A +  +  +L+
Sbjct: 334 GNLINLSAVVKKKRPLIYIYDLPPDFNSLLIEGRHFKFECVNRIYDERNATMAFYENILA 393

Query: 80  SPVRTLNPEEADWFYTPIYPTC-----DLTP-------TGLPLPFKSPRMMRSAIQLISS 127
           +  RT+N EEAD+F+ P+  +C     D  P       TGL     +    + A + I  
Sbjct: 394 TAHRTMNGEEADFFFVPVLDSCIINRADDAPHINMQNHTGLRSSL-TLEFYKRAYEHIVE 452

Query: 128 NWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR------- 180
            +PYWNR+ G DH ++          +     +  G        +    FG         
Sbjct: 453 KYPYWNRSAGRDHIWI----------WNSMMLVHWGNTNSKHNHSTTAYFGDNWDDISDE 502

Query: 181 ---NHVCLN-EGSITIPPYAPP---QKMQAHQIPPDTPRSIFVYFRGLF---YDVNNDPE 230
              +H C +    + IP +  P      + +   P   R    YF G     Y+    PE
Sbjct: 503 RRGDHPCFDPRKDLVIPAWKVPDPYSMRKNYWERPREKRKTLFYFNGNLGPAYE-KGRPE 561

Query: 231 GGYYARGARAAVWENFKNNPLFD--ISTDH----------PTTYYEDMQRAIFCLCPLGW 278
              Y+ G R  + E F ++P  +  +   H             Y++D+  +IFC    G 
Sbjct: 562 DS-YSMGIRQKLAEEFGSSPNKEGKLGKQHAEDVIVTPLRSDNYHKDIANSIFCGAFPG- 619

Query: 279 APWSPRLVEAVVFGCIPVII 298
             WS R+ ++++ GC+PVII
Sbjct: 620 DGWSGRMEDSILQGCVPVII 639


>gi|168016669|ref|XP_001760871.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162687880|gb|EDQ74260.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 378

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 69/267 (25%), Positives = 108/267 (40%), Gaps = 46/267 (17%)

Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGS 189
           P W  + G DH FV+ H            +++     L     +V  FG+ +    N   
Sbjct: 123 PAWKASGGRDHVFVIHH----------PNSMQATRNRLRNSLFIVSDFGRYDSEVANIQK 172

Query: 190 ITIPPYAPPQKMQAHQIPP----DTP---RSIFVYFRGLFYDVNNDPEGGYYARGARAAV 242
             + PY        H IP     D+    R I ++F+G         EGG      R  +
Sbjct: 173 DVVAPYK-------HVIPTFDFDDSSFHTRKILLFFQGAIVR----KEGG----KIRHEL 217

Query: 243 WENFKNNPLFDIST-----DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVI 297
           +   K+ P    +T     D   +    M+ + FCL   G  P S RL +++V  C+PVI
Sbjct: 218 YRLLKDKPGVRFTTGNTALDGFQSATIGMRSSKFCLNMAGDTPSSNRLFDSIVSHCVPVI 277

Query: 298 IADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPE--VILRKQRLLANPSMK 352
           I+DDI LPF D + +    +F+      K   +  +L ++  E    L  Q LL     +
Sbjct: 278 ISDDIELPFEDTLDYSNFCIFINSSLALKPGYVINMLRNVSEEEWTQLWNQLLLVEHHFE 337

Query: 353 RAMLFPQPAQPGDAFHQILNGLARKLP 379
               +  P +  DA + +   +ARKLP
Sbjct: 338 ----YQHPTRKNDAVNMVWKDIARKLP 360


>gi|357162592|ref|XP_003579459.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
           [Brachypodium distachyon]
          Length = 474

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 60/217 (27%), Positives = 92/217 (42%), Gaps = 24/217 (11%)

Query: 116 RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIE-------RGILPLL 168
           R  R  +  ++++ P W R+ G DH FV+  D  A +H + E A          G   L 
Sbjct: 178 RRQREVVDRVTAH-PAWRRSGGRDHVFVLT-DPMAMWHVRAEIAPAILLVVDFGGWYKLD 235

Query: 169 QRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNND 228
            ++    +     H  ++     I PY     +   Q+  +  R   +YF+G      + 
Sbjct: 236 SKSAGSNSSHMIQHTQVSLLKDVIIPYT--HLLPTLQLSENMDRPTLLYFKG----AKHR 289

Query: 229 PEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYED-----MQRAIFCLCPLGWAPWSP 283
             GG      R  +W+   N P   +    P     +     M+ + FCL P G  P S 
Sbjct: 290 HRGGL----VREKLWDVMINEPGVVMEEGFPNATGREQSIKGMRTSEFCLHPAGDTPSSC 345

Query: 284 RLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
           RL +AV   CIPVI++DDI LPF   I + E  +FV+
Sbjct: 346 RLFDAVASLCIPVIVSDDIELPFEGMIDYTEFSIFVS 382


>gi|384251773|gb|EIE25250.1| hypothetical protein COCSUDRAFT_40557 [Coccomyxa subellipsoidea
           C-169]
          Length = 280

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 45/149 (30%), Positives = 71/149 (47%), Gaps = 7/149 (4%)

Query: 234 YARGARAAVWENFKNNPLFDI----STDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAV 289
           Y+ G R  +   F  +P F +         T Y  D  R+ FCL   G A W  RL  A+
Sbjct: 115 YSHGVRQTLMMLFSQHPGFKLIDTGERGGYTQYMADFGRSTFCLAATG-AGWGVRLKLAL 173

Query: 290 VFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANP 349
           + GCIPVIIAD++ +PF D +P+++  + V E  + +L  +L +I     L K+  +   
Sbjct: 174 MHGCIPVIIADNVQMPFEDVLPYQDFAVHVREHALYRLPEVLDAILSTEGLVKRMQINVS 233

Query: 350 SMKRAMLFPQPAQPGDAFHQILNGLARKL 378
            + R   +  P     A   ++  L RK+
Sbjct: 234 CIWRYFTWRDP--QARAIDALICSLRRKM 260


>gi|168010648|ref|XP_001758016.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162690893|gb|EDQ77258.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 481

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 85/391 (21%), Positives = 147/391 (37%), Gaps = 73/391 (18%)

Query: 38  GKLKVYVYELPSKYNKKLLLKD------------------PRCLTHMFAAEIFMHRFLLS 79
            KLK+++Y+LP +++  +L++                   P  L    + E ++   LL+
Sbjct: 97  AKLKLFMYDLPPEFHYSMLVEQAYTGGQIWPKNISDIPPYPGGLYQQHSPEYWLTNDLLT 156

Query: 80  SPV----------RTLNPEEADWFYTPIYP-------TCDLTPTGLPLPF---KSPRMMR 119
           S +          R  +   AD  + P +        T      G  L     K+ ++  
Sbjct: 157 SNMAGRQSACTAFRVNDWRAADLMFVPFFASLAYNRYTKSEHKVGGELDLVGDKNQKLQE 216

Query: 120 SAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRAT-LVQTFG 178
             ++ +    P W  + G+DH  V+ H     FH          +     +A  +V  FG
Sbjct: 217 KLLKFLEQQ-PAWQASGGSDHIVVIHHP--NSFH---------AMRNFFSKAIFIVADFG 264

Query: 179 QRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGA 238
           +      N     + PY            P   R I ++F+G         +GG      
Sbjct: 265 RYPSEVANLRKDVVAPYKHVIPSFVDDSTPFEEREILLFFQGTIVR----KQGGV----I 316

Query: 239 RAAVWENFKNNPLFDISTD-------HPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVF 291
           R  ++E  KN                H  T    M+R+  CL   G  P S RL +A+  
Sbjct: 317 RQQLYEMLKNEKGVHFEEGSAGSAGIHSAT--TGMRRSKCCLNIAGDTPSSNRLFDAIAS 374

Query: 292 GCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSM 351
            C+PVII+D+I LPF D + +    +F+   D  +   ++  I    + RK+ +     +
Sbjct: 375 HCVPVIISDEIELPFEDELDYSGFSIFINSTDAVQEKFVINLI--RSVSRKEWMRLWKRL 432

Query: 352 KRAML---FPQPAQPGDAFHQILNGLARKLP 379
           K   L   +  P +P DA + +   +A K+P
Sbjct: 433 KEVSLHFEYQHPTKPYDAVNMVWRAVAHKVP 463


>gi|32488405|emb|CAE02830.1| OSJNBa0043A12.35 [Oryza sativa Japonica Group]
 gi|90265244|emb|CAH67697.1| H0624F09.5 [Oryza sativa Indica Group]
          Length = 464

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 78/287 (27%), Positives = 126/287 (43%), Gaps = 38/287 (13%)

Query: 116 RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKA------IERG---ILP 166
           R  R  +  ++++ P W R+ G DH FV+  D  A +H ++E A      ++ G    L 
Sbjct: 168 RRQREVVDRVTAH-PAWRRSGGRDHVFVLT-DPVAMWHVRKEIAPAILLVVDFGGWYKLD 225

Query: 167 LLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVN 226
               ++ V    Q   V L +  I   PY     +    +  +  R   +YF+G      
Sbjct: 226 SNSASSNVSHMIQHTQVSLLKDVIV--PYT--HLLPTMHLSENKDRPTLLYFKG----AK 277

Query: 227 NDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYED-----MQRAIFCLCPLGWAPW 281
           +   GG      R  +W+   N P   +   +P     +     M+ + FCL P G  P 
Sbjct: 278 HRHRGGL----VREKLWDLMVNEPDVVMEEGYPNATGREQSIKGMRTSEFCLHPAGDTPT 333

Query: 282 SPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDV--PK-LDTILTSIPPEV 338
           S RL +AV   CIPVI++D+I LPF   I + E  +FV+  +   PK L   L ++P + 
Sbjct: 334 SCRLFDAVASLCIPVIVSDEIELPFEGMIDYTEFAIFVSVNNSMRPKWLTNYLRNVPRQQ 393

Query: 339 ILRKQRLLAN--PSMKRAMLFP----QPAQPGDAFHQILNGLARKLP 379
               +R +A+  P  +   ++P      AQ G A + I   + +KLP
Sbjct: 394 KDEFRRNMAHVQPIFEYDSIYPGRMASAAQDG-AVNHIWKKIHQKLP 439


>gi|320164456|gb|EFW41355.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 380

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 84/336 (25%), Positives = 137/336 (40%), Gaps = 38/336 (11%)

Query: 36  PVGKLKVYVYELPSKYNKKLLLKDP---RC---LTHMFAAEIFMHRFLLSSPVRTLNPEE 89
           P+G  K+YVY       K++L   P    C   L  M+  EI +  ++  S   T NPEE
Sbjct: 4   PIGIFKIYVY-------KRILNAMPTRDECRDSLQIMWYEEIELPYWIRDSIHHTDNPEE 56

Query: 90  ADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFG 149
           A +FY P    C          F   R    +++ +  + PY++R  G DH  + P    
Sbjct: 57  AHFFYIPTMVKCLFNLN--RARFNETRQFLISVRHLHRS-PYFHRNNGHDHALLNP---- 109

Query: 150 ACFHYQEEKAIERGILPLLQRA------TLVQTFGQRNHVCLNEGSITIPPY---APPQK 200
               Y    ++  G     + A      T + T   R +       I +P Y   A    
Sbjct: 110 GGGSYNVTSSVLHGSYLFGRGAGHYSNVTKLLTEAYRPYAYFAGRDIIVPGYPDDAFFSY 169

Query: 201 MQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPT 260
            + +Q      R +F+Y  G+        + G  A   +    ++    PL  + T   +
Sbjct: 170 QETYQDALRERRRLFLYTGGVQLSYQRR-QLGRLAELLKIPSAKSSFYAPLVLLQTRKVS 228

Query: 261 TYYEDMQRAI----FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG 316
           +   + Q+ +    FC  P G +PW+ R  +A + GCIPV+   + VLPF + I W+ I 
Sbjct: 229 SNKFEYQQLVKDFTFCAAPRGTSPWTQRFYDAAIVGCIPVLFDRNFVLPFPNQIDWDSIV 288

Query: 317 MFVAEE---DVPKLDTILT-SIPPEVILRKQRLLAN 348
           +  +E+       LD +   S   E I  ++R LA+
Sbjct: 289 VRFSEDFSHSFSFLDHLYQLSQDVEAIRERRRKLAD 324


>gi|412988049|emb|CCO19445.1| exostosin-like glycosyltransferase [Bathycoccus prasinos]
          Length = 663

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 47/134 (35%), Positives = 71/134 (52%), Gaps = 11/134 (8%)

Query: 39  KLKVYVYE-LPSKYNKKLLLK-DPRCLTHMFA-AEIFMHRFLLSSPVRTLNPEEADWFYT 95
           + KV+VYE +P   N  L  K   +C  + +A AE  +   +  S V T +PE AD++  
Sbjct: 184 RFKVFVYEDVPEALNADLRTKRSDKCKDNGYANAEWKIPELIAKSEVYTPDPELADFYVV 243

Query: 96  PIYPTC---DLTPTG----LPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDF 148
           P++P C   D    G    +    K  +M ++AI  I+ N+PYW R+EG DH F+ P + 
Sbjct: 244 PLFPECYVRDKLEKGGADYVTAVRKVNKMYQAAIDRIAGNYPYWRRSEGRDHVFIFPAEK 303

Query: 149 GACFHYQEEKAIER 162
           GA  +   EK +ER
Sbjct: 304 GAE-NVLNEKTLER 316



 Score = 47.4 bits (111), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 31/125 (24%), Positives = 56/125 (44%), Gaps = 5/125 (4%)

Query: 265 DMQRAIFCLCP--LGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
           +M+ +  CL     G   WS  L +A++ GC+P+I+ DD    F+D + WE     +   
Sbjct: 527 EMRNSETCLITPHGGLEGWSTALSDAILSGCVPLIVHDDFEPYFSDVLDWENFSYKIPTR 586

Query: 323 DVPK--LDTIL-TSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
           +  +   D +L        I R +R        RA ++    Q GDA   +   L R++ 
Sbjct: 587 EALRNAKDALLHEKSKASSITRSKRENDLKIAARASVWNDKWQSGDALDALFESLRRRVR 646

Query: 380 HDKST 384
           + +++
Sbjct: 647 YHRNS 651


>gi|218195798|gb|EEC78225.1| hypothetical protein OsI_17865 [Oryza sativa Indica Group]
 gi|222629749|gb|EEE61881.1| hypothetical protein OsJ_16573 [Oryza sativa Japonica Group]
          Length = 452

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 78/287 (27%), Positives = 126/287 (43%), Gaps = 38/287 (13%)

Query: 116 RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKA------IERG---ILP 166
           R  R  +  ++++ P W R+ G DH FV+  D  A +H ++E A      ++ G    L 
Sbjct: 156 RRQREVVDRVTAH-PAWRRSGGRDHVFVLT-DPVAMWHVRKEIAPAILLVVDFGGWYKLD 213

Query: 167 LLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVN 226
               ++ V    Q   V L +  I   PY     +    +  +  R   +YF+G      
Sbjct: 214 SNSASSNVSHMIQHTQVSLLKDVIV--PYT--HLLPTMHLSENKDRPTLLYFKG----AK 265

Query: 227 NDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYED-----MQRAIFCLCPLGWAPW 281
           +   GG      R  +W+   N P   +   +P     +     M+ + FCL P G  P 
Sbjct: 266 HRHRGGL----VREKLWDLMVNEPDVVMEEGYPNATGREQSIKGMRTSEFCLHPAGDTPT 321

Query: 282 SPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDV--PK-LDTILTSIPPEV 338
           S RL +AV   CIPVI++D+I LPF   I + E  +FV+  +   PK L   L ++P + 
Sbjct: 322 SCRLFDAVASLCIPVIVSDEIELPFEGMIDYTEFAIFVSVNNSMRPKWLTNYLRNVPRQQ 381

Query: 339 ILRKQRLLAN--PSMKRAMLFP----QPAQPGDAFHQILNGLARKLP 379
               +R +A+  P  +   ++P      AQ G A + I   + +KLP
Sbjct: 382 KDEFRRNMAHVQPIFEYDSIYPGRMASAAQDG-AVNHIWKKIHQKLP 427


>gi|115469550|ref|NP_001058374.1| Os06g0680900 [Oryza sativa Japonica Group]
 gi|113596414|dbj|BAF20288.1| Os06g0680900 [Oryza sativa Japonica Group]
 gi|125556494|gb|EAZ02100.1| hypothetical protein OsI_24187 [Oryza sativa Indica Group]
          Length = 477

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 72/262 (27%), Positives = 106/262 (40%), Gaps = 26/262 (9%)

Query: 83  RTLNPEEADWFYTPIY--PTCDLTPTGLPLPFKSP--RMMRSAIQLISSNWPYWNRTEGA 138
           R  +P EA  F+ PI              + F  P  R++   +  +++ +P+WNR+ GA
Sbjct: 182 RARHPGEAHAFFLPISVASIAGYVYRRDMIDFWDPQLRLVAGYVDGLAAMYPFWNRSRGA 241

Query: 139 DHFFVVPHDFGACFHYQEEKAIERG-ILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAP 197
           DHF V  H +         KA  RG  + ++  A +   F     V L           P
Sbjct: 242 DHFLVSCHQWAPIL--SAAKAELRGNAIRVMCDADMSDGFDPATDVAL-----------P 288

Query: 198 PQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTD 257
           P    A   P   P+      R +         GG   R A  A WE   +  +      
Sbjct: 289 PVVASARATP---PQGRVASERTVLAFFAAGGGGGGAVREALLARWEGRDDRVVVYGRLP 345

Query: 258 HPTTYYEDMQRAIFCLCPL----GWAPWSPRLVEAVVFGCIPVIIADDIVL-PFADAIPW 312
               + E M+RA FCLCP     G A  S R+VEA+  GC+PV++ D     PF+D + W
Sbjct: 346 AGVDHGELMRRARFCLCPCGGGEGAAAASRRVVEAITAGCVPVLVDDGGYSPPFSDVLDW 405

Query: 313 EEIGMFVAEEDVPKLDTILTSI 334
               + V  E V ++  IL  +
Sbjct: 406 ARFSVAVPAERVGEIKDILGGV 427


>gi|58269252|ref|XP_571782.1| hypothetical protein [Cryptococcus neoformans var. neoformans JEC21]
 gi|57228018|gb|AAW44475.1| conserved hypothetical protein [Cryptococcus neoformans var.
            neoformans JEC21]
          Length = 1132

 Score = 68.6 bits (166), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 37/109 (33%), Positives = 60/109 (55%), Gaps = 4/109 (3%)

Query: 262  YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
            Y  D+  A FC  P G   WSP+  +A+  GCIPV I++    PFAD + W ++ + VA 
Sbjct: 1002 YMNDLNNARFCPQPRGITGWSPQTNDAIYAGCIPVFISEGTHYPFADFLDWSKLSVRVAP 1061

Query: 322  EDVPKLDTILTSIPPEVILRKQRLLAN-PSMKRAMLFPQPAQPGDAFHQ 369
             ++ K++ IL +IP   + + + L AN  SM+ A L+    +P +   +
Sbjct: 1062 TELDKIEKILAAIP---LSKVEELQANLVSMREAFLYSGDEKPEEELER 1107


>gi|134114275|ref|XP_774385.1| hypothetical protein CNBG3660 [Cryptococcus neoformans var.
            neoformans B-3501A]
 gi|50257020|gb|EAL19738.1| hypothetical protein CNBG3660 [Cryptococcus neoformans var.
            neoformans B-3501A]
          Length = 1132

 Score = 68.6 bits (166), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 37/109 (33%), Positives = 60/109 (55%), Gaps = 4/109 (3%)

Query: 262  YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
            Y  D+  A FC  P G   WSP+  +A+  GCIPV I++    PFAD + W ++ + VA 
Sbjct: 1002 YMNDLNNARFCPQPRGITGWSPQTNDAIYAGCIPVFISEGTHYPFADFLDWSKLSVRVAP 1061

Query: 322  EDVPKLDTILTSIPPEVILRKQRLLAN-PSMKRAMLFPQPAQPGDAFHQ 369
             ++ K++ IL +IP   + + + L AN  SM+ A L+    +P +   +
Sbjct: 1062 TELDKIEKILAAIP---LSKVEELQANLVSMREAFLYSGDEKPEEELER 1107


>gi|168017794|ref|XP_001761432.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162687438|gb|EDQ73821.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 377

 Score = 68.6 bits (166), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 65/282 (23%), Positives = 110/282 (39%), Gaps = 43/282 (15%)

Query: 113 KSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRAT 172
           K+ ++    ++ +    P W  ++G DH  V+ H      H   +    R +L +L    
Sbjct: 112 KNQKLQEKLLEYLKQQ-PAWQASDGCDHILVMHH--PNSMHAMRDSF--RNVLFVLA--- 163

Query: 173 LVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGL----------- 221
               FG+      N     + PY        +       R   ++F+G            
Sbjct: 164 ---DFGRYPPDVANVEKDVVAPYKHIIPSFDNDSSSFEDRETLLFFQGTIVRKQGGVIRQ 220

Query: 222 -FYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAP 280
             Y++  D EG ++  G+  +                H  T    M+ + FCL   G  P
Sbjct: 221 QLYEMLKDEEGVHFEEGSSGS-------------EGVHSAT--SGMRGSKFCLNIAGDTP 265

Query: 281 WSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVIL 340
            S RL +++   C+PVII+DDI LPF D + + E  +F+  ED  K   ++  +    I 
Sbjct: 266 SSNRLFDSIASHCVPVIISDDIELPFEDELDYSEFCVFIKSEDALKEKYVINLL--RSIT 323

Query: 341 RKQRLLANPSMK---RAMLFPQPAQPGDAFHQILNGLARKLP 379
           R Q       +K   R   +  P +P DA + +   +AR+ P
Sbjct: 324 RVQWTFLWKRLKAVARHFEYQHPTKPYDAVNMVWRAIARRAP 365


>gi|307106602|gb|EFN54847.1| hypothetical protein CHLNCDRAFT_134888 [Chlorella variabilis]
          Length = 833

 Score = 68.2 bits (165), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 47/142 (33%), Positives = 66/142 (46%), Gaps = 31/142 (21%)

Query: 42  VYVYELPSKYNKKLL------------LKDPRCLT------HMFAAEIFMHRFLLSSPVR 83
           +YVYELP  YN  +L              D R  T      H++  E  +H  LL S  R
Sbjct: 371 IYVYELPPFYNSVMLQYRVSREGCVHRFFDDRNATVFNDMMHLYNPEPGLHEALLQSEHR 430

Query: 84  TLNPEEADWFYTPIYPTCDLTPT----------GLPLPFKS---PRMMRSAIQLISSNWP 130
           TL+P+EAD+FY P + +C L P           G P+ +++     M       I S++P
Sbjct: 431 TLDPDEADFFYIPAFVSCFLFPVLSATDFPYFHGGPVAWRTHAAANMFIEVYHWIRSHYP 490

Query: 131 YWNRTEGADHFFVVPHDFGACF 152
           YW+R  G DH     HD G+C+
Sbjct: 491 YWDRNGGRDHIVGSFHDEGSCW 512



 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 36/150 (24%), Positives = 68/150 (45%), Gaps = 15/150 (10%)

Query: 208 PDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAV---------WENFKNNPLFDISTDH 258
           P   R I  +F+G     N  PE   Y+RG R  +         W   K +   ++    
Sbjct: 611 PTRERRILAFFKGRTQQSN--PE---YSRGIRQTLENLTREHDWWGKHKVHVGEEMPEGE 665

Query: 259 PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
             +Y   + +++FC   +G   +S R  +A++ GCIPV+I D +   +++ +      + 
Sbjct: 666 SDSYSAMLAQSVFCFALMG-DGFSSRTDDAIIHGCIPVLIQDGVEPTWSNLLDTGSYSVR 724

Query: 319 VAEEDVPKLDTILTSIPPEVILRKQRLLAN 348
           + ++D+ ++  IL +I  E + R Q  L  
Sbjct: 725 ILQKDMERVPEILQAISKEDVARMQANLGK 754


>gi|224098485|ref|XP_002311191.1| predicted protein [Populus trichocarpa]
 gi|222851011|gb|EEE88558.1| predicted protein [Populus trichocarpa]
          Length = 345

 Score = 68.2 bits (165), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 67/247 (27%), Positives = 109/247 (44%), Gaps = 40/247 (16%)

Query: 77  LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTE 136
           L +SP  T NPEEA  F+ P                 S R +   I+ +   +PYWNRT 
Sbjct: 69  LQASPFVTQNPEEAHLFFVPFASNL------------STRSIARFIRDLRMEFPYWNRTL 116

Query: 137 GADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYA 196
           GADHF+V     G    Y+     +R ++ L + +  +  F       +    I++PP A
Sbjct: 117 GADHFYVSCAGLG----YES----DRNLVELKKNSVQISCFPVPEGKFVPHKDISLPPLA 168

Query: 197 PPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDIST 256
             +  +A   P +         R + Y V +   GG   + ++ A      ++ L +   
Sbjct: 169 --RITRASHAPGN---------RTVRYLVRH---GG--VKDSKLANELRNDSDFLMESEP 212

Query: 257 DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV--LPFADAIPWEE 314
            +  T  E +  ++FCL   G A  S  + EA+ FGC+PV++ D  +  LP  D + W++
Sbjct: 213 SNEMTLVERLGSSMFCLFEDG-ADISG-IGEALRFGCVPVMVTDRPMQDLPLMDVLSWQK 270

Query: 315 IGMFVAE 321
           I +FV  
Sbjct: 271 IAVFVGS 277


>gi|224135879|ref|XP_002327326.1| predicted protein [Populus trichocarpa]
 gi|222835696|gb|EEE74131.1| predicted protein [Populus trichocarpa]
          Length = 449

 Score = 67.8 bits (164), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 76/295 (25%), Positives = 114/295 (38%), Gaps = 53/295 (17%)

Query: 128 NWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG--------- 178
           N   W R+ G DH FV+  D  A +H + E      I P +    LV  FG         
Sbjct: 164 NSEAWKRSGGKDHVFVLT-DPVAMWHVRAE------IAPAI---LLVVDFGGWYRLDSKS 213

Query: 179 -------QRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEG 231
                     H  ++     I PY     +   Q   +  R+  +YF+G      +   G
Sbjct: 214 SNGSSSDMIRHTQVSLLKDVIVPYT--HLLPRFQFSENKKRNTLLYFKG----AKHRHRG 267

Query: 232 GYYARGARAAVWENFKNNPLFDISTDHPTTYYED-----MQRAIFCLCPLGWAPWSPRLV 286
           G      R  +W+   N P   +    P     +     M+ + FCL P G  P S RL 
Sbjct: 268 GI----VRENLWDLLVNEPGVIMEEGFPNATGRELSIRGMRTSEFCLHPAGDTPTSCRLF 323

Query: 287 EAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLL 346
           +A+   CIPVI++D+I LPF   + + E  +FVA +D  K   ++  +    I  KQ+  
Sbjct: 324 DAIQSLCIPVIVSDNIELPFEGILDYTEFSVFVAGDDALKPTWLMDHL--RSISEKQKEE 381

Query: 347 ANPSMKRAMLFPQ----------PAQPGDAFHQILNGLARKLPHDKSTYLKPGGK 391
              +M +  L  Q          P  P  A + I   +  KLP  K   ++   K
Sbjct: 382 LRRNMAKIQLIYQYENGHPGGIGPISPNGAVNHIWKKIHEKLPVIKEAIVREKRK 436


>gi|168014691|ref|XP_001759885.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689015|gb|EDQ75389.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 383

 Score = 67.8 bits (164), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 69/285 (24%), Positives = 115/285 (40%), Gaps = 49/285 (17%)

Query: 113 KSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRAT 172
           K+  +    +Q +  + P W  + G +H  V+ H           +A+   +   L    
Sbjct: 118 KNEELQAKLVQYLEKH-PAWKASNGKNHVMVIHHP-------NSMQAVRDRLRNALY--- 166

Query: 173 LVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDT-------PRSIFVYFRGLFYDV 225
           +V  FG+  +   N     + PY        H +P  T        RS  VYF+G     
Sbjct: 167 VVSDFGRYENETANIRKDVVAPYK-------HVLPTFTDDSSSFHTRSTVVYFQGSIVR- 218

Query: 226 NNDPEGGYYARGARAAVWENFKNNPLFDISTD-------HPTTYYEDMQRAIFCLCPLGW 278
               EGG      R  +++  K+ P    +T        H  T    M+ + FCL   G 
Sbjct: 219 ---KEGG----KIRHELYDLLKDEPDVHFTTGITASEGFHSAT--RGMRSSRFCLNLAGD 269

Query: 279 APWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK----LDTILTSI 334
            P S RL +++   C+PVII+DD+ LPF D + +    +F+      +    ++ +    
Sbjct: 270 TPSSNRLFDSIASHCVPVIISDDLELPFEDDLNYSSFCIFINSTRALQPGYVINLLRNVS 329

Query: 335 PPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
             E  L  +RLL    ++R   +  P+   DA + +   +ARKLP
Sbjct: 330 SEEWTLMWERLLV---VERHFEYQFPSVANDAVNMVWKAIARKLP 371


>gi|449462180|ref|XP_004148819.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase F8H-like
           [Cucumis sativus]
 gi|449524512|ref|XP_004169266.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase F8H-like
           [Cucumis sativus]
          Length = 458

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 41/138 (29%), Positives = 70/138 (50%), Gaps = 8/138 (5%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGM-FVA 320
           Y+E ++ A FCL P G + W+ R  E+    C+PV+++D + LPF + I + +I + + +
Sbjct: 308 YFEHLRNAKFCLAPRGESSWTLRFYESFFVECVPVVLSDQVELPFQNVIDYSQISIKWPS 367

Query: 321 EEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAM-LFPQPAQPGDAFHQILNGLARKLP 379
            E  P+L   L SIP E I    +++A     R + ++   ++P      IL  L RK+ 
Sbjct: 368 SEIGPQLLEYLESIPDETI---DKMIARGRRVRCLWVYASDSEPCSTMQGILWELQRKVR 424

Query: 380 ---HDKSTYLKPGGKFLN 394
                  T+    G F+N
Sbjct: 425 QFHQSTETFWLHNGSFVN 442



 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 42/133 (31%), Positives = 60/133 (45%), Gaps = 12/133 (9%)

Query: 40  LKVYVYELPSKYNKKLLL--KDPR-----CLTHMFAAEIFMHRFLLSSPVRTLNPEEADW 92
           LK+YVY+       K L+  +D +     CL   +  ++ +HR LL S  RT N EEAD+
Sbjct: 85  LKIYVYDETEIQGLKALMYGRDGKITAAACLKGQWGTQVKIHRLLLQSRFRTRNKEEADF 144

Query: 93  FYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
           F+ P Y  C     GL     + + +  A   +    PY+  + G DH FV P   GA  
Sbjct: 145 FFVPAYVKCVRMLGGL-----NDKEINEAYIQVLGQMPYFRLSGGRDHIFVFPSGAGAHL 199

Query: 153 HYQEEKAIERGIL 165
                  I R I+
Sbjct: 200 FKSWATYINRSII 212


>gi|449444196|ref|XP_004139861.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
           [Cucumis sativus]
          Length = 478

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 70/278 (25%), Positives = 111/278 (39%), Gaps = 35/278 (12%)

Query: 83  RTLNPEEADWFYTPIYPT--CDLTPTGLPLPFKSP------RMMRSAIQLISSNWPYWNR 134
           R    EEAD  + P + T   ++        F+           R+ +  + S    W +
Sbjct: 141 RVFKAEEADVIFVPFFATMSAEMQLGMAKGAFRKKVGNEDYERQRNVMDFLKST-DAWKK 199

Query: 135 TEGADHFFVVPHDFGACFHYQEEKAIE-------RGILPLLQRATLVQTFGQRNHVCLNE 187
           + G DH FV+  D  A +H + E A          G   L  +++   +     H  ++ 
Sbjct: 200 SGGRDHVFVLT-DPVAMWHVKTEIAPAVLLVVDFGGWFRLDTKSSNGSSPDMIQHTQVSV 258

Query: 188 GSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFK 247
               I PY     +    +  +  R   +YF+G          GG      R  +W+   
Sbjct: 259 LKDVIVPYT--HLLPRLHLSANKKRQTLLYFKG----AKRRHRGGL----VREKLWDLLV 308

Query: 248 NNPLFDISTDHPTTYYED-----MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 302
           N P   +    P    ++     M+ + FCL P G  P S RL +A+   CIPV+++D+I
Sbjct: 309 NEPDVIMEEGFPNATGKEQSIKGMRSSEFCLHPAGDTPTSCRLFDAIQSLCIPVVVSDNI 368

Query: 303 VLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPE 337
            LPF D + + E  +FVA  D  K   L   L +IP E
Sbjct: 369 ELPFEDMVDYSEFSVFVAVNDALKPNWLVKHLRTIPEE 406


>gi|326429052|gb|EGD74622.1| hypothetical protein PTSG_05987 [Salpingoeca sp. ATCC 50818]
          Length = 452

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 40/117 (34%), Positives = 60/117 (51%), Gaps = 2/117 (1%)

Query: 265 DMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD-AIPWEEIGMFVAEED 323
           +M  + FCLCP G    + R+ EAV+ GCIPVI++D    PF   A   +   + V E+D
Sbjct: 337 EMATSEFCLCPRGLTAGTRRIFEAVLVGCIPVIVSDGYTWPFPHLAAELDAASVRVPEKD 396

Query: 324 VPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPH 380
             ++  IL  +     + K+  LA+ +       P P QPGDAF+ I+  +A    H
Sbjct: 397 AARVLDILGHVSRRERVAKRVRLAHLAHNVTYHLPAP-QPGDAFYNIIRAIADTQQH 452


>gi|323456573|gb|EGB12440.1| hypothetical protein AURANDRAFT_70660 [Aureococcus anophagefferens]
          Length = 1731

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 44/74 (59%)

Query: 260  TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV 319
              Y E M+RA +C+   G++PWSPRL EAV  GC+P  ++  +  P+A  + W    + +
Sbjct: 1512 ANYTESMRRAKYCVVTEGFSPWSPRLSEAVALGCVPCFLSPSLAPPYATVLDWSAFSVEI 1571

Query: 320  AEEDVPKLDTILTS 333
            AE DV +L  +L +
Sbjct: 1572 AEADVGRLPEVLAA 1585


>gi|297611147|ref|NP_001065633.2| Os11g0128000 [Oryza sativa Japonica Group]
 gi|255679743|dbj|BAF27478.2| Os11g0128000, partial [Oryza sativa Japonica Group]
          Length = 199

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 40/119 (33%), Positives = 61/119 (51%), Gaps = 5/119 (4%)

Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
           + M  + FCL   G  P S RL +A+V  C+PVII+DDI LP+ DA+ + +  +FV   D
Sbjct: 63  QGMHASKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDDIELPYEDALDYSKFSIFVRSSD 122

Query: 324 VPKLDTILTSIPPEVILRKQRLLANPSMK---RAMLFPQPAQPGDAFHQILNGLARKLP 379
             K   ++  I    + + Q  +    +K   +   +  P+Q  DA   I   LARK+P
Sbjct: 123 AVKKGYLMRLIRG--VSKHQWTMMWRRLKEVDKHFEYQYPSQKDDAVQMIWQTLARKVP 179


>gi|194701034|gb|ACF84601.1| unknown [Zea mays]
 gi|413955857|gb|AFW88506.1| hypothetical protein ZEAMMB73_716681 [Zea mays]
          Length = 462

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 86/385 (22%), Positives = 148/385 (38%), Gaps = 71/385 (18%)

Query: 40  LKVYVYELPSKYNKKLL--------------LKD-----PRCLTHMFAAEIFMHRFLLSS 80
           L+V++Y+LP +++  LL              ++D     P  L    + E ++   LL+S
Sbjct: 68  LRVFMYDLPPEFHFGLLDWKPPGFGGGVWPDIRDGVPDYPGGLNLQHSIEYWLTLDLLAS 127

Query: 81  P---------VRTLNPEEADWFYTPIYPTCDLTPTGLPLP----FKSPRMMRSAIQLISS 127
                      R  +  +AD  + P + +         +P     +   + R  ++ +++
Sbjct: 128 EQGAPTPCAVARVRHAADADVVFVPFFASLSFNRHSRVVPPARDSEDRALQRRLLEFLAA 187

Query: 128 NWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQR----ATLVQTFGQRNHV 183
             P W RT G DH  +  H                G+L    R      ++  FG+    
Sbjct: 188 R-PEWRRTGGRDHVVLAHH--------------PNGMLDARYRFWPCVFVLCDFGRYPPS 232

Query: 184 CLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGG------YYARG 237
             N     I PY       A+       R   +YF+G  Y      +GG      YY   
Sbjct: 233 VANLDKDVIAPYRHLVANFANDTAGYDDRPTLLYFQGAIYR----KDGGFIRQELYYLLK 288

Query: 238 ARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVI 297
               V  +F +     ++ +      + M+ + FCL   G  P S RL +++V  C+PV 
Sbjct: 289 DEKDVHFSFGS-----VAGNGIEQATQGMRSSKFCLNIAGDTPSSNRLFDSIVSHCVPVT 343

Query: 298 IADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRA 354
           I+D+I LPF D + + +  + V   D  K   L  ++  I  E   R    L    +++ 
Sbjct: 344 ISDEIELPFEDVLDYSKFSVIVRGADAVKKGFLMNLIKGISREEWTRMWNRLKE--VEKH 401

Query: 355 MLFPQPAQPGDAFHQILNGLARKLP 379
             +  P+Q  DA   I   +ARK+P
Sbjct: 402 FEYQYPSQTDDAVQMIWKAIARKVP 426


>gi|343171902|gb|AEL98655.1| exostosin-like protein, partial [Silene latifolia]
          Length = 250

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 59/213 (27%), Positives = 90/213 (42%), Gaps = 14/213 (6%)

Query: 173 LVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPD--TPRSIFVYFRGLFYDVNNDPE 230
           +V  FG+ +    N     + PY            PD    R+  ++FRG    V  D E
Sbjct: 26  IVADFGRYSKSMSNLRKDVVAPYVHVVDSYTDDDAPDPFKARTTLLFFRG--RTVRKD-E 82

Query: 231 GGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVV 290
           G   AR A+  V     +      S +      + M+ + FCL P G  P S RL +A+V
Sbjct: 83  GVVRARLAKILVDYEDVHYERSVASEESIKLSTQGMRSSKFCLDPAGDTPSSCRLFDAIV 142

Query: 291 FGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLA 347
             C+PVI++D I LPF D + +    +F + E+  K   +   L  +P E  L   + L 
Sbjct: 143 SHCVPVIVSDQIELPFEDELDYSNFSIFFSTEEALKPGYMVEELRKVPEEKWLNMYKKLK 202

Query: 348 NPSMKRAMLFPQPAQPGDAFHQILNGLARKLPH 380
           N S      +P   + G      +N L R++ H
Sbjct: 203 NISHHYEFQYPPKREDG------VNMLWRQIRH 229


>gi|343171904|gb|AEL98656.1| exostosin-like protein, partial [Silene latifolia]
          Length = 250

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 59/213 (27%), Positives = 90/213 (42%), Gaps = 14/213 (6%)

Query: 173 LVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPD--TPRSIFVYFRGLFYDVNNDPE 230
           +V  FG+ +    N     + PY            PD    R+  ++FRG    V  D E
Sbjct: 26  IVADFGRYSKSMSNLRKDVVAPYVHVVDSYTDDDAPDPFKARTTLLFFRG--RTVRKD-E 82

Query: 231 GGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVV 290
           G   AR A+  V     +      S +      + M+ + FCL P G  P S RL +A+V
Sbjct: 83  GVVRARLAKILVDYEDIHYERSVASEESIKLSTQGMRSSKFCLDPAGDTPSSCRLFDAIV 142

Query: 291 FGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLA 347
             C+PVI++D I LPF D + +    +F + E+  K   +   L  +P E  L   + L 
Sbjct: 143 SHCVPVIVSDQIELPFEDELDYSNFSIFFSTEEALKPGYMVEELRKVPEEKWLNMYKKLK 202

Query: 348 NPSMKRAMLFPQPAQPGDAFHQILNGLARKLPH 380
           N S      +P   + G      +N L R++ H
Sbjct: 203 NISHHYEFQYPPKREDG------VNMLWRQIRH 229


>gi|222635945|gb|EEE66077.1| hypothetical protein OsJ_22095 [Oryza sativa Japonica Group]
          Length = 265

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 40/140 (28%), Positives = 69/140 (49%), Gaps = 9/140 (6%)

Query: 240 AAVWEN-----FKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCI 294
           A +WEN       NN +     +    Y +   R  FC+CP G    S R+ +++ +GC+
Sbjct: 121 ARIWENDTELAISNNRINRAIGN--LVYQKHFFRTKFCVCPGGSQVNSARISDSIHYGCM 178

Query: 295 PVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRA 354
           PVI++D   L F+  + W +  + + E DV +L +IL S+  +  +   + L    +++ 
Sbjct: 179 PVILSDYYDLRFSGILNWRKFAVVLKESDVYELKSILKSLSQKEFVSLHKSLVQ--VQKH 236

Query: 355 MLFPQPAQPGDAFHQILNGL 374
             +  P  P DAFH I+  L
Sbjct: 237 FEWHSPPVPYDAFHMIMYEL 256


>gi|196004286|ref|XP_002112010.1| hypothetical protein TRIADDRAFT_24279 [Trichoplax adhaerens]
 gi|190585909|gb|EDV25977.1| hypothetical protein TRIADDRAFT_24279 [Trichoplax adhaerens]
          Length = 668

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 83/330 (25%), Positives = 131/330 (39%), Gaps = 51/330 (15%)

Query: 35  DPVGKLKVYVY---ELPSKYNKKLLLKDP-RCLTHMFAAEIFMHRFLLSSPVRTLNPEEA 90
           D   +L VYVY    LP  Y     + DP   +T  +     M + ++ SP    NP+ A
Sbjct: 35  DDHRRLTVYVYPRYLLPDLYG----ISDPVNTITREYRR---MLQAIIDSPFYVDNPKTA 87

Query: 91  DWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGA 150
                     C L P+   L  +   +  + + L  +N PYWN+ +    F ++P   G 
Sbjct: 88  ----------CLLIPSYDVLNQRHVNLQGAGLTL--TNLPYWNQGKNHLLFNMLP---GQ 132

Query: 151 CFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG-SITIPPYAPPQKMQAHQIPPD 209
             H+       RG      RA L    G  + +    G  I+IP Y+  Q    HQ   D
Sbjct: 133 WPHFNTSLDARRG------RAMLAG--GGFSSLSYRYGFDISIPIYSLHQDDDYHQRSQD 184

Query: 210 TP---------RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFD-----IS 255
                      R   +    L YD   D      A   R  + +    N          S
Sbjct: 185 ENYHNNISHHHRPWLLLSSQLHYDRKFDNRLMNLANDNRILILQRCSVNSSHKAYYQRCS 244

Query: 256 TDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEI 315
                +Y   +Q + FCL   G+       ++A+ FGCIPV+++D+ +LPF++ + W+  
Sbjct: 245 HKQSISYPTILQDSTFCLMLRGYRLIQSNFLDALKFGCIPVVLSDEYILPFSEVLDWKRA 304

Query: 316 GMFVAEEDVPKLDTILTSIPPEVI--LRKQ 343
            +   E+ +  L  +L+SI  +    LRKQ
Sbjct: 305 ALVFREDQLLSLPAVLSSISTKTRHNLRKQ 334


>gi|356576131|ref|XP_003556187.1| PREDICTED: probable glucuronosyltransferase Os01g0926400-like
           [Glycine max]
          Length = 481

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 71/297 (23%), Positives = 119/297 (40%), Gaps = 36/297 (12%)

Query: 112 FKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRA 171
           +K  R +  A++    N   WNR+ G DH FV+  D  A +H ++E A    ++      
Sbjct: 183 YKRQRQVMDAVK----NTHAWNRSGGRDHVFVLT-DPVAMWHVKDEIAPAVLLVVDFGGW 237

Query: 172 TLVQTFGQRN--------HVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFY 223
             + + G  N        H  ++     I PY     +    +  +  R   +YF+G   
Sbjct: 238 YRLDSRGGSNCSESDVIPHTQVSVIKDVIVPYT--HLLPRLDLSDNKERHQLLYFKG--- 292

Query: 224 DVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYED-----MQRAIFCLCPLGW 278
              +   GG      R  +W+   + P   +    P     +     MQ + FCL P G 
Sbjct: 293 -AKHRHRGGII----REKLWDLLVSEPGVIMEEGFPNATGREQSIKGMQTSEFCLHPAGD 347

Query: 279 APWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEV 338
            P S RL +A+   CIPVI++D+I LPF   + + E  +F A  D  K   +++ +    
Sbjct: 348 TPTSCRLFDAIQSLCIPVIVSDNIELPFEGMVDYAEFSVFAAVSDALKPSWLVSHLQSFS 407

Query: 339 ILRKQRLLANPSMKRAMLFPQPAQPG--------DAFHQILNGLARKLPHDKSTYLK 387
             +K R   N +  + +       PG         A + I   + +KLP  K   ++
Sbjct: 408 KEQKDRFRQNMARVQPIFVYDNGHPGGIGPIPADGAVNHIWKKVHQKLPMIKEAIIR 464


>gi|212722274|ref|NP_001131151.1| uncharacterized protein LOC100192459 [Zea mays]
 gi|194690716|gb|ACF79442.1| unknown [Zea mays]
 gi|413955856|gb|AFW88505.1| hypothetical protein ZEAMMB73_716681 [Zea mays]
          Length = 391

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 72/315 (22%), Positives = 121/315 (38%), Gaps = 43/315 (13%)

Query: 82  VRTLNPEEADWFYTPIYPTCDLTPTGLPLP----FKSPRMMRSAIQLISSNWPYWNRTEG 137
            R  +  +AD  + P + +         +P     +   + R  ++ +++  P W RT G
Sbjct: 67  ARVRHAADADVVFVPFFASLSFNRHSRVVPPARDSEDRALQRRLLEFLAAR-PEWRRTGG 125

Query: 138 ADHFFVVPHDFGACFHYQEEKAIERGILPLLQR----ATLVQTFGQRNHVCLNEGSITIP 193
            DH  +  H                G+L    R      ++  FG+      N     I 
Sbjct: 126 RDHVVLAHH--------------PNGMLDARYRFWPCVFVLCDFGRYPPSVANLDKDVIA 171

Query: 194 PYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGG------YYARGARAAVWENFK 247
           PY       A+       R   +YF+G  Y      +GG      YY       V  +F 
Sbjct: 172 PYRHLVANFANDTAGYDDRPTLLYFQGAIYR----KDGGFIRQELYYLLKDEKDVHFSFG 227

Query: 248 NNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFA 307
           +     ++ +      + M+ + FCL   G  P S RL +++V  C+PV I+D+I LPF 
Sbjct: 228 S-----VAGNGIEQATQGMRSSKFCLNIAGDTPSSNRLFDSIVSHCVPVTISDEIELPFE 282

Query: 308 DAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPG 364
           D + + +  + V   D  K   L  ++  I  E   R    L    +++   +  P+Q  
Sbjct: 283 DVLDYSKFSVIVRGADAVKKGFLMNLIKGISREEWTRMWNRLK--EVEKHFEYQYPSQTD 340

Query: 365 DAFHQILNGLARKLP 379
           DA   I   +ARK+P
Sbjct: 341 DAVQMIWKAIARKVP 355


>gi|224112673|ref|XP_002316258.1| predicted protein [Populus trichocarpa]
 gi|222865298|gb|EEF02429.1| predicted protein [Populus trichocarpa]
          Length = 339

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 73/303 (24%), Positives = 122/303 (40%), Gaps = 52/303 (17%)

Query: 77  LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTE 136
           L +SP  T NPEEA  ++ P                 S R +   I+ +   +PYWNRT 
Sbjct: 69  LQNSPFVTQNPEEAHLYFVPFSSNL------------STRSVARFIRDLRMEFPYWNRTL 116

Query: 137 GADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYA 196
           GADHF+V     G    Y+     +R ++ L + +  +  F       +    IT PP+A
Sbjct: 117 GADHFYVSCAGLG----YES----DRNLVELKKNSVQISCFPTTEGRFVPHKDITFPPHA 168

Query: 197 PPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDIST 256
              +   +            + R              Y     + +    + +  F I +
Sbjct: 169 QGNRTAKY----------LGFVR--------------YNEVKESNLVNELRKDSDFLIES 204

Query: 257 D--HPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV--LPFADAIPW 312
           +  +  T    +  ++FCL   G A  S  + EA+ FGC+PV++ D  +  LP  D I W
Sbjct: 205 EPSNGMTLVGRLGSSVFCLFEYG-ADVSG-IGEALRFGCVPVMVMDRPMQDLPLMDVIGW 262

Query: 313 EEIGMFVAEE-DVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQIL 371
           ++I +FV     V ++   L     +     +R L   + +  +    P QP D+FH ++
Sbjct: 263 QKIAIFVGSRGGVKEVKRELDRTCKDDECAGRRRLGVVASQHFVWNHMP-QPYDSFHMVM 321

Query: 372 NGL 374
             L
Sbjct: 322 YQL 324


>gi|320165867|gb|EFW42766.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 508

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 73/305 (23%), Positives = 119/305 (39%), Gaps = 46/305 (15%)

Query: 36  PVGKLKVYVYELPSKYNKKLLLKDPRCL---THMFAAEIFMHRFLLSSPVRTLNPEEADW 92
           P+G LK+YVY+   K     +++  +CL   T M+  E+ +  +++ S   T NP+EA  
Sbjct: 129 PIGTLKIYVYKEMMK-----VMEPHKCLQTVTPMWLEEVELPSWVVKSIHYTNNPDEAHM 183

Query: 93  FYTPIYPTC--DLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGA 150
           F+ P    C  D   T   L  +   M    + ++ +   Y+ R  G DHF + P   G 
Sbjct: 184 FFIPAMVRCILDFNRTQFHLTSEFTEM----VDVLHTKHDYYRRNHGHDHFIINP---GG 236

Query: 151 CFHYQEEKAIERGILPL-----LQRATLVQTFGQRNHVCLNEGSITIP---PYAPPQKMQ 202
                    +   + P+        AT + +   R+    +     IP    Y   + M 
Sbjct: 237 GSMNVISSLLAGELHPVAANDWYSNATKLLSEAARSRAYFSGLDFVIPGSADYIFGKFMD 296

Query: 203 AHQIPPDTPRSIFVYFRG------------LFYDVNNDPEGGYYARGARAAVWENFKNNP 250
             Q   +    +F+Y  G            L   V  D E   + R     +  N  ++P
Sbjct: 297 VSQKIEEERPMLFLYLGGTSLGDQRQALGRLRKLVQGDSEQAAFFRD--KVLIANKIDDP 354

Query: 251 LFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAI 310
           +       P  Y   +Q   FC  P G +PW+ R  ++++ GCIPV         F D I
Sbjct: 355 V-------PELYSLRIQNFTFCAAPHGTSPWTQRFYDSLISGCIPVQFDRRFRFGFYDHI 407

Query: 311 PWEEI 315
            W+ I
Sbjct: 408 DWDSI 412


>gi|227206252|dbj|BAH57181.1| AT3G03650 [Arabidopsis thaliana]
          Length = 244

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 60/212 (28%), Positives = 87/212 (41%), Gaps = 9/212 (4%)

Query: 173 LVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGG 232
           +V  FG+ +    N     + PY        +       R I +YF+G  Y       GG
Sbjct: 21  VVADFGRYSPHVANVDKDIVAPYKHLVPSYVNDTSGFDGRPILLYFQGAIYR----KAGG 76

Query: 233 YYARGARAAVWENFKNNPLFDISTDHPTTYY-EDMQRAIFCLCPLGWAPWSPRLVEAVVF 291
           +  +     + E    +  F    +H  +   E M+ + FCL   G  P S RL +A+  
Sbjct: 77  FVRQELYNLLKEEKDVHFSFGSVRNHGISKAGEGMRSSKFCLNIAGDTPSSNRLFDAIAS 136

Query: 292 GCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLAN 348
            CIPVII+DDI LP+ D + + E  +FV   D  K   L  ++ SI  E   +    L  
Sbjct: 137 HCIPVIISDDIELPYEDVLNYNEFCLFVRSSDALKKGFLMGLVRSIGREEYNKMWLRLKE 196

Query: 349 PSMKRAMLFPQPAQPGD-AFHQILNGLARKLP 379
                 + FP     GD A   I   +ARK P
Sbjct: 197 VERYFDLRFPVKDDEGDYAVQMIWKAVARKAP 228


>gi|168043163|ref|XP_001774055.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674601|gb|EDQ61107.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 541

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 69/296 (23%), Positives = 122/296 (41%), Gaps = 49/296 (16%)

Query: 68  AAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISS 127
            A++F  R LL S   T  P++A  F  P          G   P +    +R  +Q + +
Sbjct: 178 TADLF-SRLLLRSKFSTELPQKAQLFLLPFSIDALRVDLG---PSRISDHLRRYVQNVRT 233

Query: 128 NWPYWNRTEGADHFFVVPHDFGACFHYQ-----EEKAIERGILPLLQRATLVQTFGQRNH 182
           ++PYWN + GA+HF++    F     ++     E+ +I+    PL Q           N 
Sbjct: 234 SYPYWNLSLGANHFYLSSQAFENNNKHRNVLELEKNSIQAACAPLRQ-----------NQ 282

Query: 183 VCLNEGSITIPPYAPPQKMQAH-QIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAA 241
                     P Y P  + + +  +   T R++  YF G   D             A   
Sbjct: 283 NFYPHKDFIFPRYKPITQTEFYAALEGRTSRTVLAYFGGTLADT-----------PALVF 331

Query: 242 VWENFKNNPLFDISTD---HPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 298
           + + +K++P F++  D   H  + Y  + R+ FC+       +    V+A+ FGC+ V++
Sbjct: 332 ILDAWKSDPDFEVEVDPSPHRISVYRQLARSKFCVNVPSRDTFD--FVDAIRFGCVLVLL 389

Query: 299 ADDIV--LPFADAIPWEEIGMFVAEEDV--PKLDTILTSIPPEVILRKQRLLANPS 350
           +  +   LPF   + W +   F A+  V   KL+           + +++LLAN S
Sbjct: 390 SKSVFLDLPFQGFLDWRQ---FAAQHQVWPSKLEQRHCG-----FMMREKLLANCS 437


>gi|34394612|dbj|BAC83914.1| limonene cyclase like protein [Oryza sativa Japonica Group]
 gi|50508943|dbj|BAD31847.1| limonene cyclase like protein [Oryza sativa Japonica Group]
          Length = 332

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 36/103 (34%), Positives = 55/103 (53%), Gaps = 3/103 (2%)

Query: 273 LCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILT 332
           L  LG A     + EA+   C+PV+I DD  LPFAD + W    + VA  D+P+L  IL 
Sbjct: 218 LLELGLAKARATIREAIYLECVPVVIGDDYTLPFADVLNWAAFSVRVAVGDIPRLKEILA 277

Query: 333 SIPPEVILRKQRLLANPSMKRAMLFPQPA-QPGDAFHQILNGL 374
           ++ P   +R QR +   +++R  +    A +  D FH IL+ +
Sbjct: 278 AVSPRQYIRMQRRV--RAVRRHFMVSDGAPRRFDVFHMILHSI 318


>gi|159483755|ref|XP_001699926.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158281868|gb|EDP07622.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 785

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 59/229 (25%), Positives = 100/229 (43%), Gaps = 34/229 (14%)

Query: 165 LPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQK-------MQAHQIPPD-------- 209
           LP LQ+        +R      E  +  PPY   ++        +A+   PD        
Sbjct: 565 LPALQQ--------EREQCYRPEHDVAFPPYLSDREGNWFSVMKEAYDYTPDGKATFKRG 616

Query: 210 TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRA 269
             R    YF G        P+  Y A G R  +   F N+   D+S +      + M R+
Sbjct: 617 VARDTLFYFNGF-----TKPDLAYSA-GVRQGLLALFGNSTRADLSINKGGGS-QRMLRS 669

Query: 270 IFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDT 329
            FC  P+G+  W  RL +A++ GC+P+++ D +     D +P+E+  + V+  ++ +L  
Sbjct: 670 RFCFTPMGFG-WGIRLSQAMLTGCVPIMVHDHVWPTLWDVLPYEQFSIRVSRHNMYRLLD 728

Query: 330 ILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKL 378
            L SI P+ + R Q  +A     +A ++ QP   G A++  L  L  +L
Sbjct: 729 YLESITPQQLARLQDGVAQ--WHKAFVW-QPEVGGLAYNYTLTSLHHRL 774



 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 38/119 (31%), Positives = 56/119 (47%), Gaps = 7/119 (5%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTH-MFAAEI-FMHRFLLSSPVRTLNPEEADWFYTP 96
           +LK+YVY+LP        L D   L H ++ AEI   HR L  + V T NP EA+ FY P
Sbjct: 373 ELKIYVYDLPQHVAYMRPLGDHWPLHHSIYLAEIELYHRLLGDTTVVTENPWEANLFYVP 432

Query: 97  IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQ 155
            +    +   G P      ++  +    +   +P+WN T G +H     +D G C  Y+
Sbjct: 433 THTYYYIGNIGFP-----GKLYTAVFHHVRQQYPWWNLTAGRNHVVSNSNDRGCCDLYR 486


>gi|359491711|ref|XP_002284930.2| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase isoform 1
           [Vitis vinifera]
          Length = 498

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 57/192 (29%), Positives = 83/192 (43%), Gaps = 28/192 (14%)

Query: 212 RSIFVYFRGLFYDVNNDPEGG------YYARGARAAVWENFKNNPLFDISTDHPTTYYED 265
           R + VYF+G  Y      +GG      YY       V   F +     +  +      E 
Sbjct: 311 RPLLVYFQGAIYR----KDGGAIRQELYYLLRDEKDVHFTFGS-----VRGNGINGASEG 361

Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
           M  + FCL   G  P S RL +A+V  C+PVII+D+I LPF D + + E  +FV   D  
Sbjct: 362 MASSKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDEIELPFEDVLDYSEFCIFVRASDAV 421

Query: 326 K---LDTILTSIPPEVILRK-QRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP-- 379
           K   L  +L  I  E   +  +RL     +     +  P+Q GDA   I   ++RK+   
Sbjct: 422 KNGFLLNLLRGIKREKWTKMWERL---KEIAHHFEYQYPSQAGDAVDMIWGAVSRKISSI 478

Query: 380 ----HDKSTYLK 387
               H K+ Y +
Sbjct: 479 QNKLHRKNRYRR 490


>gi|255644422|gb|ACU22716.1| unknown [Glycine max]
          Length = 189

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 43/134 (32%), Positives = 65/134 (48%), Gaps = 11/134 (8%)

Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
           + M  + FCL   G  P S RL +A+V  C+PVII+D+I LPF D + + +  +FV   D
Sbjct: 51  QGMAMSKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDEIELPFEDVLDYSDFSIFVRASD 110

Query: 324 VPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP- 379
             K   L  +L SI  +   +    L    +     +  P+QPGDA + I   + RK+  
Sbjct: 111 SMKKGYLLNLLRSITQKEWSKMWERL--KQITHHFEYQYPSQPGDAVNMIWQQVERKISS 168

Query: 380 -----HDKSTYLKP 388
                H K+ Y + 
Sbjct: 169 IRFNLHRKNRYQRS 182


>gi|297733976|emb|CBI15223.3| unnamed protein product [Vitis vinifera]
          Length = 455

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 57/192 (29%), Positives = 83/192 (43%), Gaps = 28/192 (14%)

Query: 212 RSIFVYFRGLFYDVNNDPEGG------YYARGARAAVWENFKNNPLFDISTDHPTTYYED 265
           R + VYF+G  Y      +GG      YY       V   F +     +  +      E 
Sbjct: 268 RPLLVYFQGAIYR----KDGGAIRQELYYLLRDEKDVHFTFGS-----VRGNGINGASEG 318

Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
           M  + FCL   G  P S RL +A+V  C+PVII+D+I LPF D + + E  +FV   D  
Sbjct: 319 MASSKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDEIELPFEDVLDYSEFCIFVRASDAV 378

Query: 326 K---LDTILTSIPPEVILRK-QRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP-- 379
           K   L  +L  I  E   +  +RL     +     +  P+Q GDA   I   ++RK+   
Sbjct: 379 KNGFLLNLLRGIKREKWTKMWERL---KEIAHHFEYQYPSQAGDAVDMIWGAVSRKISSI 435

Query: 380 ----HDKSTYLK 387
               H K+ Y +
Sbjct: 436 QNKLHRKNRYRR 447


>gi|145352087|ref|XP_001420390.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144580624|gb|ABO98683.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 517

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 85/397 (21%), Positives = 145/397 (36%), Gaps = 69/397 (17%)

Query: 30  DVLEDDPVGKLKVYVYELPSKYNKKLLLKDPR-CLTHMFAAEIFMHRFLLSSPVRTLNPE 88
           + L D P G L++Y  + P     K    +        +  E ++ R +   P  T + +
Sbjct: 140 EALNDLP-GTLRLYTLKTPRVTRLKYAFAEKIPTFNRGYGVETYLAREIERGPFATRSFK 198

Query: 89  EADWFYTPIYPTCDLTPTGLPLPFKSP----------------RMMRSAIQLISSNWPYW 132
           +A   + P+ P      T        P                R+ R   ++ + N   W
Sbjct: 199 DATAIFIPVRPYLQRLLTAEAYVMAGPAKGSNNNGAIRQAIRDRLSRDIERVKAINAEAW 258

Query: 133 NRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLL--------------------QRAT 172
              +      +   D G       +  +  G + +                     +RA 
Sbjct: 259 TSKQKCARVVMTNIDIGLSAFDSSDDEVRHGAVVITGNSELPIQNSEDAKDDEARARRAN 318

Query: 173 LVQT-FGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEG 231
            VQ  F    HV +  G   I  + P + ++   +     R+I V FRG           
Sbjct: 319 AVQAGFDPARHVAIWFG---ISSHLPREVVRMGALKSTNVRTIEVSFRG----------- 364

Query: 232 GYYARGARAAVWENFKNNPL---FDIST---DHPTTYYEDMQRAIFCLCPLGWAPWSPRL 285
             +  G R  V+   K       +D+ST   D P  Y   + ++ +CL   G    + RL
Sbjct: 365 SMHRGGVRRVVFPTLKQAEAGRGWDLSTSGQDKPRDYMTMLSKSKYCLYVYGDRAHTARL 424

Query: 286 VEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRL 345
            + + FGC+PVI+AD   LPF+    W +  + V E+DV  L +IL     +  LR++ +
Sbjct: 425 YDIITFGCVPVIVADGYDLPFSWLFDWSKFSVRVLEDDVATLPSILDRADYDS-LRRELV 483

Query: 346 LANPSMK---RAMLFPQPAQPGDAFHQILNGLARKLP 379
             +   +   R  +F      GDAF   + G+ R+L 
Sbjct: 484 KVHSFFQYHNRGSIF------GDAFWITMLGVRRQLA 514


>gi|405958592|gb|EKC24705.1| Xyloglucan galactosyltransferase KATAMARI1 [Crassostrea gigas]
          Length = 497

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 82/335 (24%), Positives = 128/335 (38%), Gaps = 73/335 (21%)

Query: 19  PKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLL----LKDPRCL----------- 63
           PK   +  SAG V  +      K+Y+Y+LP K+N ++L    +   RC            
Sbjct: 56  PKTCTVLPSAG-VSTNGVKTPFKIYIYDLPKKFNLEILKIYDVWHARCYSFEFCGFGARL 114

Query: 64  -----------THMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPF 112
                      +H F+ E+ +H  L  SP RTL+PE+AD FY P Y        GL   +
Sbjct: 115 FNLESGVHVHDSHQFSLEVLVHHLLQLSPYRTLDPEQADLFYIPAY-------IGLQCLY 167

Query: 113 KSPRMMRSAIQLISSNWPYWNR----TEGADHFFVVPHDFGACFHYQEEKAIERGILPLL 168
            S   + +  +LI+  + Y         G  HF        +     E +   +G  P L
Sbjct: 168 ASFDNVSATNKLINELFVYLQSQPYFASGKPHF--------SSLAKIEREMQSKGCCPYL 219

Query: 169 ---QRATLVQTF-----GQRNHVCLNEGSITIP----PYAPPQKMQAHQIPPDTPRSIFV 216
              Q A +  TF       R    LN+  IT+P     +        +Q    +PR++F+
Sbjct: 220 LHPQSANI--TFLSIERETRYQSALNQRVITVPYPSYIHLDGSVTSRNQYLHSSPRNVFI 277

Query: 217 ----------YFRGLFYDVNNDPEGGYYARGARAAVWEN-FKNNPLFDISTDHPTTYYED 265
                     ++R L  D   +     Y        W + F          DH   Y   
Sbjct: 278 LLAAGTRRSNHYRSLILDQFREKTHLSYPEYTATNQWRSEFPMVMYITKECDHSAKYSTV 337

Query: 266 --MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 298
             M +++FCL P G +P      +A++ GC+PV+ 
Sbjct: 338 RWMLQSVFCLQPPGDSPTRKSFYDALLSGCVPVLF 372


>gi|242077606|ref|XP_002448739.1| hypothetical protein SORBIDRAFT_06g032380 [Sorghum bicolor]
 gi|241939922|gb|EES13067.1| hypothetical protein SORBIDRAFT_06g032380 [Sorghum bicolor]
          Length = 456

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 68/246 (27%), Positives = 104/246 (42%), Gaps = 45/246 (18%)

Query: 116 RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ 175
           R  R  +  ++S+ P W R+ G DH FV+  D  A +H + E      I P +    LV 
Sbjct: 160 RRQREVVDRVTSH-PAWRRSGGRDHVFVMT-DPVAMWHVRAE------IAPAI---LLVV 208

Query: 176 TFG----------QRN------HVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFR 219
            FG           RN      H  ++     I PY     +    +  +  R   +YF+
Sbjct: 209 DFGGWYKVDSKSANRNSSRMIQHTQVSLLKDVIVPYT--HLLPTLLLSENKDRPTLLYFK 266

Query: 220 GLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYED-----MQRAIFCLC 274
           G      +   GG      R  +W+   N P   +    P     +     MQ + FCL 
Sbjct: 267 G----AKHRHRGGL----VREKLWDLLGNEPDVIMEEGFPNATGREQSIKGMQTSEFCLH 318

Query: 275 PLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDV--PK-LDTIL 331
           P G  P S RL +A+   CIPVI++D++ LP+   I + E  +FV+  +   PK L + L
Sbjct: 319 PAGDTPTSCRLFDAIASLCIPVIVSDEVELPYEGMIDYTEFSIFVSVRNAMRPKWLTSYL 378

Query: 332 TSIPPE 337
            +IP +
Sbjct: 379 RNIPKQ 384


>gi|224097754|ref|XP_002311068.1| predicted protein [Populus trichocarpa]
 gi|222850888|gb|EEE88435.1| predicted protein [Populus trichocarpa]
          Length = 380

 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 70/309 (22%), Positives = 122/309 (39%), Gaps = 29/309 (9%)

Query: 82  VRTLNPEEADWFYTPIYPTCDLTPTGL----PLPFKSPRMMRSAIQLISSNWPYWNRTEG 137
           VR L+PE A+ ++ P + +      G     P   K  ++    I  +  +  YW R+ G
Sbjct: 72  VRVLDPEIAEAYFVPFFSSLSFNTHGRNMTDPETEKDRQLQVDLIDFLQKS-KYWQRSGG 130

Query: 138 ADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAP 197
            DH  V+P      F +  +  +   IL       +V  FG+            + PY  
Sbjct: 131 RDH--VIPMTHPNAFRFLRQ-LVNASIL-------IVADFGRYPKSLSTLSKDVVSPYV- 179

Query: 198 PQKMQAHQ----IPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFD 253
              + + +    + P   R   ++FRG   +     +G   A+  +     +        
Sbjct: 180 -HNVDSFKDDDLLDPFESRKTLLFFRG---NTVRKDKGKVRAKLEKILAGYDDVRYERSS 235

Query: 254 ISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE 313
            + +      + M+ + FCL P G  P S RL +A+V  C+PVI++D I LP+ D I + 
Sbjct: 236 PTAEAIQASTQGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDLIELPYEDEIDYS 295

Query: 314 EIGMFVAEEDVPKLDTI---LTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQI 370
           +  +F +  +  + D +   L   P +  +   R L   S      F  P    DA + +
Sbjct: 296 QFSIFFSINEAIQPDYLVNQLRKFPKDRWIEMWRQLKKIS--HHFEFQYPPVKEDAVNML 353

Query: 371 LNGLARKLP 379
              +  KLP
Sbjct: 354 WRQVKNKLP 362


>gi|255585230|ref|XP_002533317.1| catalytic, putative [Ricinus communis]
 gi|223526861|gb|EEF29074.1| catalytic, putative [Ricinus communis]
          Length = 478

 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 71/300 (23%), Positives = 122/300 (40%), Gaps = 40/300 (13%)

Query: 116 RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQ-----R 170
           +  +  I+ + S+   W R+ G DH FV+  D  A +H + E  I   +L ++      R
Sbjct: 182 KRQKEVIEFVKSS-DAWKRSGGKDHVFVLT-DPVAMWHVRAE--IAPAVLLVVDFGGWYR 237

Query: 171 ATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIP--PDTPRSIFVYFRGLFYDVNND 228
                + G  +++  +     +     P      Q+P   +  R   +YF+G        
Sbjct: 238 LDSKSSDGNSSNIIRHTQVSLLKDVIVPYTHLLPQLPLSENKKRQTLLYFKG-------- 289

Query: 229 PEGGYYARGA--RAAVWENFKNNPLFDISTDHPTTYYED-----MQRAIFCLCPLGWAPW 281
               Y  RG   R  +W+   N P   +    P     +     M+ + FCL P G  P 
Sbjct: 290 --AKYRHRGGMVREKLWDLLVNEPGVIMEEGFPNATGREQSIKGMRTSEFCLHPAGDTPT 347

Query: 282 SPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILR 341
           S RL +A+   CIP+I++D+I LPF   + + E  +F+A +D  K + ++  +  + I +
Sbjct: 348 SCRLFDAIQSLCIPIIVSDNIELPFEGIVDYLEFSVFMAVDDALKPNWLVDHL--KSISK 405

Query: 342 KQRLLANPSMKRAMLFPQ----------PAQPGDAFHQILNGLARKLPHDKSTYLKPGGK 391
           KQR      M       +          P  P  A + I   + +KLP  K   ++   K
Sbjct: 406 KQRDEFRQKMAEVQSIFEYDNGYAGGIGPVPPNGAVNHIWKKVHQKLPIIKEAIVREKRK 465


>gi|159473595|ref|XP_001694919.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158276298|gb|EDP02071.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 674

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 46/148 (31%), Positives = 75/148 (50%), Gaps = 11/148 (7%)

Query: 234 YARGARAAVWENFKNNPLFDISTDH---PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVV 290
           Y+ G R  +   F N    D+S +    P+     M R+ FC  P+G+  W  RL +A +
Sbjct: 527 YSGGVRQGLLALFGNTTRPDVSINKGGGPSL----MLRSRFCFTPMGFG-WGVRLTQAAM 581

Query: 291 FGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPS 350
            GC+PV++ D +     D +P+E+  + V+  ++ +L  IL SI  E +   Q  LA+  
Sbjct: 582 TGCVPVMVQDHVWPTLWDVLPYEKFSIRVSRHNLYRLFEILDSITAEELASLQAGLAH-- 639

Query: 351 MKRAMLFPQPAQPGDAFHQILNGLARKL 378
             RA ++ QP   G A++  L  L R+L
Sbjct: 640 WHRAFVW-QPEFGGLAYNYTLESLQRRL 666



 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 37/128 (28%), Positives = 55/128 (42%), Gaps = 7/128 (5%)

Query: 39  KLKVYVYELPSKYNKKLLLKD--PRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP 96
           +LK+YVY+LP          D  P       A   F +R L     RT NP EA+ FY P
Sbjct: 255 RLKIYVYDLPESVVHMRSHSDEWPLHFPIYLAEHEFFNRLLGDWATRTENPWEANLFYIP 314

Query: 97  IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQE 156
            +    +   G P      ++    +  +  N+P+WN T G +H     +D G C  Y+ 
Sbjct: 315 TFTYYYIGNVGQP-----GKLFSRVVSYVRHNYPFWNMTGGRNHILTSVNDRGCCDIYRL 369

Query: 157 EKAIERGI 164
              ++R I
Sbjct: 370 GPDVQRPI 377


>gi|125576050|gb|EAZ17272.1| hypothetical protein OsJ_32791 [Oryza sativa Japonica Group]
          Length = 506

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 40/119 (33%), Positives = 61/119 (51%), Gaps = 5/119 (4%)

Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
           + M  + FCL   G  P S RL +A+V  C+PVII+DDI LP+ DA+ + +  +FV   D
Sbjct: 370 QGMHASKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDDIELPYEDALDYSKFSIFVRSSD 429

Query: 324 VPKLDTILTSIPPEVILRKQRLLANPSMK---RAMLFPQPAQPGDAFHQILNGLARKLP 379
             K   ++  I    + + Q  +    +K   +   +  P+Q  DA   I   LARK+P
Sbjct: 430 AVKKGYLMRLI--RGVSKHQWTMMWRRLKEVDKHFEYQYPSQKDDAVQMIWQTLARKVP 486


>gi|302766093|ref|XP_002966467.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
 gi|300165887|gb|EFJ32494.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
          Length = 453

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 70/277 (25%), Positives = 111/277 (40%), Gaps = 62/277 (22%)

Query: 83  RTLNPEEADWFYTPIYP--TCDLTPTGLPLPFKSPR------MMRSAIQLISSNWPYWNR 134
           R L+P++AD  + P +   + +         F+  +        R+ +++++S+   W R
Sbjct: 113 RVLDPDQADVVFVPFFAALSAEAQLRNGKGHFRHRKDNEDYERQRAVMEIVTSS-SRWQR 171

Query: 135 TEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG---------QRN---- 181
           + G DH FV+  D  A +H++ E  I   IL       LV  FG          RN    
Sbjct: 172 SGGRDHVFVLT-DPMAMYHFRAE--IANSIL-------LVVDFGGWYMEDAKSSRNLSSP 221

Query: 182 ----HVCLNEGSITIPPYAPPQKMQAHQIPP------DTPRSIFVYFRGLFYDVNNDPEG 231
               H  ++     I PY        H +P       +  RS  +YF+G  Y        
Sbjct: 222 QPIYHTQVSLIKDVIVPYT-------HLLPTLALSQDNAVRSTLLYFKGARYR------- 267

Query: 232 GYYARGARAAVWENFKNNPLFDISTDHPT-----TYYEDMQRAIFCLCPLGWAPWSPRLV 286
            +     R  +W      P   +    P         + M+ + FCL P G  P S RL 
Sbjct: 268 -HRTGLVRDQLWSVLDGEPGVLLEEGFPNRTGQVQAVQGMRNSHFCLHPAGDTPSSCRLF 326

Query: 287 EAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
           +AV   CIPVI++D I LPF   + + +  +FV+  D
Sbjct: 327 DAVASLCIPVIVSDSIELPFEGMLDYTQFAIFVSVHD 363


>gi|222640628|gb|EEE68760.1| hypothetical protein OsJ_27458 [Oryza sativa Japonica Group]
          Length = 518

 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 43/121 (35%), Positives = 62/121 (51%), Gaps = 11/121 (9%)

Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
           + M  + FCL   G  P S RL +A+V  C+PVII+DDI LPF D + +    +FV   D
Sbjct: 383 KGMASSKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDDIELPFEDVLDYSAFCVFVRASD 442

Query: 324 VPK---LDTILTSIPPE---VILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARK 377
             K   L  +L  I  E    + R+ + +A+        +  P+QPGDA   I   +ARK
Sbjct: 443 AVKRGFLLHLLRGISQEEWTAMWRRLKEVAHH-----FEYQYPSQPGDAVQMIWGAVARK 497

Query: 378 L 378
           +
Sbjct: 498 M 498


>gi|405121937|gb|AFR96705.1| hypothetical protein CNAG_03480 [Cryptococcus neoformans var. grubii
            H99]
          Length = 1125

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 36/109 (33%), Positives = 59/109 (54%), Gaps = 4/109 (3%)

Query: 262  YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
            Y  D+  A FC  P G   WSP+  +A+  GCIPV IA+    PFA  + W ++ + VA 
Sbjct: 995  YINDLNNARFCPQPRGITGWSPQTNDAIYAGCIPVFIAEGTHYPFAGFLDWSKLSVRVAP 1054

Query: 322  EDVPKLDTILTSIPPEVILRKQRLLAN-PSMKRAMLFPQPAQPGDAFHQ 369
             ++ K++ IL +IP   + + + L AN  S++ A L+    +P +   +
Sbjct: 1055 TELDKIEKILAAIP---LSKVEELQANLVSVREAFLYSGDEKPEEELER 1100


>gi|219362519|ref|NP_001137073.1| uncharacterized protein LOC100217246 [Zea mays]
 gi|194691480|gb|ACF79824.1| unknown [Zea mays]
 gi|413919898|gb|AFW59830.1| hypothetical protein ZEAMMB73_270023 [Zea mays]
          Length = 458

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 61/228 (26%), Positives = 94/228 (41%), Gaps = 46/228 (20%)

Query: 116 RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ 175
           R  R  +  ++S+ P W R+ G DH FV+  D  A +H + E      I P +    LV 
Sbjct: 162 RRQREVVDRVTSH-PAWRRSSGRDHIFVLT-DPVAMWHVRAE------IAPAI---LLVV 210

Query: 176 TFG------------------QRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVY 217
            FG                  Q   V L +  I    +  P  +    +  +  R   +Y
Sbjct: 211 DFGGWYKVDSKSSSKNSSRVIQHTQVSLLKDVIVPYTHLLPTLL----LSENKDRRTLLY 266

Query: 218 FRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYED-----MQRAIFC 272
           F+G      +   GG      R  +W+   N P   +    P     +     ++ + FC
Sbjct: 267 FKG----AKHRHRGGL----VREKLWDLLGNEPDVIMEEGFPNATGREQSIKGLRTSEFC 318

Query: 273 LCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
           L P G  P S RL +A+   CIPVI++D++ LPF   I + EI +FV+
Sbjct: 319 LHPAGDTPTSCRLFDAIASLCIPVIVSDEVELPFEGIIDYTEISIFVS 366


>gi|320165878|gb|EFW42777.1| hypothetical protein CAOG_07909 [Capsaspora owczarzaki ATCC 30864]
          Length = 508

 Score = 64.7 bits (156), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 78/330 (23%), Positives = 126/330 (38%), Gaps = 23/330 (6%)

Query: 34  DDPVGKLKVYVY-ELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADW 92
           D P+G LK++VY E+         +K    L  M+  E  +  ++++S   T NPEEA  
Sbjct: 126 DAPIGVLKIFVYTEMIKNMEPHNCVK---ALVPMWREEAELPTWVVNSIHYTTNPEEAHM 182

Query: 93  FYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPH--DFGA 150
           F+ P    C +     P   +S   + +AI ++     Y+ R  G DHF + P     G 
Sbjct: 183 FFIPAMGRCMIAIHDRPHVLQSDSFL-NAIDILHVKNDYFRRRYGYDHFIINPGGGSLGL 241

Query: 151 CFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQ---KMQAHQIP 207
                   +    I      AT + +   R          TIP  A  +    M+ H  P
Sbjct: 242 ITDILWGSSSSATINTFYSNATKLLSESVRPRGYFAGRDFTIPGSADYRFGPYMKIHHQP 301

Query: 208 PDTPRSIFVYFRGLFYDVNNDPEG----GYYARGARAAVWENFKNNPLF--DISTDHPTT 261
                 +F++             G           +AA    F++  L    I+   P+ 
Sbjct: 302 LADRPMLFMFLGDTGLREQRQALGRLKVALQGDSEQAAF---FRDKVLIASKINDPDPSL 358

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y +  Q   FC  P G +PW+ R  ++++ GCIPV         + D + W+ I +    
Sbjct: 359 YPKRTQNFTFCAAPHGTSPWTQRFYDSLISGCIPVQFDRRFRFGYYDHVDWDAIVIRYPT 418

Query: 322 EDVPKLDTI----LTSIPPEVILRKQRLLA 347
             V K   +      S+  E I  +QR +A
Sbjct: 419 AQVDKFSFLEYLYKLSLDVEFIRERQRRIA 448


>gi|324508547|gb|ADY43609.1| Exostosin-1b, partial [Ascaris suum]
          Length = 628

 Score = 64.7 bits (156), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 30/79 (37%), Positives = 45/79 (56%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y   M  + FCL P G    S R +E++  GCIPVI++DD  LPF++ I W +  +   E
Sbjct: 292 YETTMSNSTFCLTPRGRRLGSFRFLESLRLGCIPVILSDDWELPFSEIIDWSQAAVIAHE 351

Query: 322 EDVPKLDTILTSIPPEVIL 340
           + V  +  +L +IP E +L
Sbjct: 352 DTVLTISDVLNAIPLERVL 370


>gi|449502905|ref|XP_004161776.1| PREDICTED: probable glucuronosyltransferase Os01g0926700-like
           [Cucumis sativus]
          Length = 482

 Score = 64.3 bits (155), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 68/281 (24%), Positives = 109/281 (38%), Gaps = 37/281 (13%)

Query: 83  RTLNPEEADWFYTPIYPT--CDLTPTGLPLPFKSP------RMMRSAIQLISSNWPYWNR 134
           R    EEAD  + P + T   ++        F+           R+ +  + S    W +
Sbjct: 141 RVFEAEEADVIFVPFFATMSAEMQLGMAKGAFRKKVGNEDYERQRNVMDFLKST-DAWKK 199

Query: 135 TEGADHFFVVPHDFG---ACFHYQEEKAIE-------RGILPLLQRATLVQTFGQRNHVC 184
           + G DH     H      A +H + E A          G   L  +++   +     H  
Sbjct: 200 SGGRDHVLFSLHSLTDPVAMWHVKAEIAPAVLLVVDFGGWFRLDTKSSNGSSPDMIQHTQ 259

Query: 185 LNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWE 244
           ++     I PY     +    +  +  R   +YF+G      +   GG      R  +W+
Sbjct: 260 VSVLKDVIVPYT--HLLPRLHLSANKKRQTLLYFKG----AKHRHRGGL----VREKLWD 309

Query: 245 NFKNNPLFDISTDHPTTYYED-----MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIA 299
              N P   +    P    ++     M+ + FCL P G  P S RL +A+   CIPV+++
Sbjct: 310 LLVNEPDVIMEEGFPNATGKEQSIKGMRSSEFCLHPAGDTPTSCRLFDAIQSLCIPVVVS 369

Query: 300 DDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPE 337
           D+I LPF D + + E  +FVA  D  K   L   L +IP E
Sbjct: 370 DNIELPFEDMVDYSEFSVFVAVNDALKPNWLVKHLRTIPEE 410


>gi|356507133|ref|XP_003522325.1| PREDICTED: probable glycosyltransferase At3g42180-like [Glycine
           max]
          Length = 484

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 93/398 (23%), Positives = 149/398 (37%), Gaps = 76/398 (19%)

Query: 40  LKVYVYELPSKYNKKLLL-------------------KDPRCLTHMFAAEIFMHRFLLSS 80
           LKV++Y+LP +++  LL                    + P  L    + E ++   LLSS
Sbjct: 112 LKVFMYDLPPEFHFGLLGWKGSVNQTWPEVDNPERIPRYPGGLNLQHSMEYWLTLDLLSS 171

Query: 81  PV-------RTLNPEEADWFYTPIYPTCDLTP----TGLPLPFKSPRMMRSAIQLISSNW 129
            V       R  +  +AD  + P + +          G      + R+    +Q +    
Sbjct: 172 KVGQPCTAIRVQDSSQADVIFVPFFSSLSYNRHSKLNGQEKVSLNKRLQDRLVQFLMGR- 230

Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGS 189
             W R+ G DH  V  H            ++      L     ++  FG+      N   
Sbjct: 231 KEWKRSGGKDHLIVAHH----------PNSLLDARRRLGAAMLVLADFGRYPVELANIKK 280

Query: 190 ITIPPYAPPQKMQAHQIPPDTP-----RSIFVYFRGLFYDVNNDPEGGYYARGARAAVW- 243
             I PY    +     IP         R+  VYF+G  Y      +GG      R  ++ 
Sbjct: 281 DIIAPY----RHLVGTIPRAESASFEKRTTLVYFQGAIYR----KDGG----AIRQELYY 328

Query: 244 ----ENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIA 299
               EN  +     I  +      + M  + FCL   G  P S RL +A+V  C+PVII+
Sbjct: 329 LLKDENDVHFTFGSIGGNGINQASQGMALSKFCLNIAGDTPSSNRLFDAIVSHCVPVIIS 388

Query: 300 DDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRK-QRLLANPSMKRAM 355
           D+I LPF D + + +  + V   D  K   L  +L SI  +   +  +RL     +    
Sbjct: 389 DEIELPFEDDLDYSDFSIIVHASDAMKKGYLLNLLRSIKRDEWNKMWERL---KQITHHF 445

Query: 356 LFPQPAQPGDAFHQILNGLARKLP------HDKSTYLK 387
            +  P+QPGDA + I   +  K+       H K+ Y +
Sbjct: 446 EYQYPSQPGDAVNMIWQQVEHKISSIRFNLHRKNRYQR 483


>gi|326920358|ref|XP_003206441.1| PREDICTED: exostosin-2-like [Meleagris gallopavo]
          Length = 718

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 55/104 (52%), Gaps = 8/104 (7%)

Query: 241 AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 300
           AV +   NN +FD        Y + +Q A FC+   G       L + +  GC+PVIIAD
Sbjct: 295 AVRKRCHNNQIFD--------YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVIIAD 346

Query: 301 DIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQR 344
             +LPF++ + W+   + + EE +P++ +IL S+P   I   QR
Sbjct: 347 SYILPFSEVLDWKRASVVIPEEKMPEMYSILQSVPQRQIEEMQR 390


>gi|302760669|ref|XP_002963757.1| galactosyltransferase-like protein [Selaginella moellendorffii]
 gi|300169025|gb|EFJ35628.1| galactosyltransferase-like protein [Selaginella moellendorffii]
          Length = 411

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 93/411 (22%), Positives = 149/411 (36%), Gaps = 72/411 (17%)

Query: 26  GSAGDVLEDDPVGKL-KVYVYELPSKYNKKLLLK-------------------------- 58
            S   +LE+DPV K  +++VY+LP ++N  LL                            
Sbjct: 3   ASENAILEEDPVCKGGRIFVYDLPPRFNADLLANCSTLNPWLSLCDALSHGGLGKPMTTT 62

Query: 59  -------DPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLP 111
                   P   T  F+ E+  H  +L  P  T + + A+ FY P Y   D++   L  P
Sbjct: 63  PWPSSKPSPWFYTEQFSGEVIFHTRILRHPCVTNDSDSANVFYVPFYAGLDVSRY-LWRP 121

Query: 112 FKSPRMMRSAIQLIS--SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQ 169
            K+        +L+   S  P W R  G DHF ++       F   EE A   G+L + +
Sbjct: 122 SKAEDRDHLGHKLVEWLSTQPAWTRARGRDHFTMIGR-ITWDFRRPEENAWGSGLLNMAE 180

Query: 170 RATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRS-----IFVYFRGLFYD 224
              + +   + N     E  +  P    PQ     Q   +  R+     +F +       
Sbjct: 181 MKNMTRLAIESNPWEGGEYGVPYPTSFHPQNEHQLQEWQEFVRNKERGLVFSFAGATRKR 240

Query: 225 VNNDPEGGYYA-----RGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWA 279
           + ND      A     RGA +A+          D   + P    +    + FCL P G  
Sbjct: 241 IPNDFRLELLAQCSDSRGACSAM-------DCSDSKCETPEPVVQLFLNSTFCLQPRGDG 293

Query: 280 PWSPRLVEAVVFGCIPVIIADDI-------VLPFADAIPWEEIGMFVAEEDV---PKLDT 329
                + ++V+ GCIPV   +           P  D    E   +F+  EDV    K+  
Sbjct: 294 YTRRSIFDSVLAGCIPVFFWNQSSYWQYKWFFPEED----ESYSVFIDREDVRKGTKIME 349

Query: 330 ILTSIPPEVI--LRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKL 378
           +L+    E +  +R   + A P +  A          DAF   ++G+ R +
Sbjct: 350 VLSRFSQERVKAMRNTLIDALPKLVYATA-DHELSGADAFDTAIDGVLRSM 399


>gi|302800594|ref|XP_002982054.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
 gi|300150070|gb|EFJ16722.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
          Length = 453

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 77/304 (25%), Positives = 123/304 (40%), Gaps = 63/304 (20%)

Query: 83  RTLNPEEADWFYTPIYP--TCDLTPTGLPLPFKSPR-----MMRSAIQLISSNWPYWNRT 135
           R L+P++AD  + P +   + +         F+  +       + A+  I ++   W R+
Sbjct: 113 RVLDPDQADVVFVPFFAALSAEAQLRNGKGHFRHRKDNEDYERQKAVMEIVTSSSRWQRS 172

Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG---------QRN----- 181
            G DH FV+  D  A +H++ E  I   IL       LV  FG          RN     
Sbjct: 173 GGRDHVFVLT-DPMAMYHFRAE--IANSIL-------LVVDFGGWYMEDAKSSRNLSSPQ 222

Query: 182 ---HVCLNEGSITIPPYAPPQKMQAHQIPP------DTPRSIFVYFRGLFYDVNNDPEGG 232
              H  ++     I PY        H +P       +  R+  +YF+G  Y         
Sbjct: 223 PIYHTQVSLIKDVIVPYT-------HLLPTLALSQDNAVRTTLLYFKGARYR-------- 267

Query: 233 YYARGARAAVWENFKNNPLFDISTDHPT-----TYYEDMQRAIFCLCPLGWAPWSPRLVE 287
           +     R  +W      P   +    P         + M+ + FCL P G  P S RL +
Sbjct: 268 HRTGLVRDQLWSVLDGEPGVLLEEGFPNRTGQVQAVQGMRNSHFCLHPAGDTPSSCRLFD 327

Query: 288 AVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED--VPK-LDTILTSIPPEVILRKQR 344
           AV   CIPVI++D I LPF   + + +  +FV+  D  +PK L   L+S+  +V  + + 
Sbjct: 328 AVASLCIPVIVSDSIELPFEGMLDYTQFAIFVSVHDALLPKWLVRHLSSLSSKVRNQMRH 387

Query: 345 LLAN 348
            LA+
Sbjct: 388 NLAS 391


>gi|443722437|gb|ELU11306.1| hypothetical protein CAPTEDRAFT_208188 [Capitella teleta]
          Length = 449

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 44/133 (33%), Positives = 61/133 (45%), Gaps = 7/133 (5%)

Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD-DIVL--PFADAIPWE----EIGMF 318
           MQRA+FCL P G +P      +AV+ GCIPVI  D D+ +  PF   + +     EI   
Sbjct: 317 MQRALFCLQPPGDSPTRKSFYDAVISGCIPVIFKDADVTVRYPFDSHLNYSAFCVEIDAS 376

Query: 319 VAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKL 378
               D   LD +   +    I   QR L   +      FP    P DAF  ILN +  +L
Sbjct: 377 AVRRDRTALDALRELVSQRNIQHMQRDLQTAAACLQYSFPFHHSPNDAFAMILNQIEVRL 436

Query: 379 PHDKSTYLKPGGK 391
            ++ S  ++P G 
Sbjct: 437 QNNGSVDVQPLGS 449



 Score = 44.7 bits (104), Expect = 0.085,   Method: Compositional matrix adjust.
 Identities = 28/89 (31%), Positives = 40/89 (44%), Gaps = 23/89 (25%)

Query: 41  KVYVYELPSKYNKKL--LLKDPRCL---------------------THMFAAEIFMHRFL 77
           ++YVYELP +YN+ +    +   C                      T  FA E+ +H  +
Sbjct: 62  RIYVYELPGEYNRDIAQCFEGNECEKLGSCGYGPLIAQHGNLQVRNTWQFALEVIVHHRM 121

Query: 78  LSSPVRTLNPEEADWFYTPIYPTCDLTPT 106
           L+SP RTL+  EA+ FY P Y   D   T
Sbjct: 122 LASPYRTLDINEANAFYLPYYSGLDCLCT 150


>gi|389744786|gb|EIM85968.1| hypothetical protein STEHIDRAFT_168993 [Stereum hirsutum FP-91666
            SS1]
          Length = 1111

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 77/331 (23%), Positives = 129/331 (38%), Gaps = 72/331 (21%)

Query: 65   HMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQL 124
            H F A+  M + + +  VR    +EA+  + P Y   D                    ++
Sbjct: 772  HCFTADTQMEKGVSAMGVRR---DEAEMTFIPSYQQWD-----------------GDHRM 811

Query: 125  ISSNWPYWNRT--EGADHFFVV---PHDFGACF-------HYQEEKAIERGILPLLQRAT 172
            +   W Y NR   +G D   V+    HDFG C        H +E + +   + P ++  T
Sbjct: 812  LERTWEY-NRDAFDGFDASKVIIPFTHDFGQCLAFEWDVWHMRERQGLR--VHPFVRSTT 868

Query: 173  LVQTFGQRNHVCLN-EGSITIPPYA--PPQKMQAH----QIPPDTPRSIFVYFRGLFY-- 223
                 G  +  C      + IPP     P   ++      + P   R + V F G+ +  
Sbjct: 869  AWSVMGDLHSACYRPHQDVIIPPRTCLSPSLFKSFPTVADVRPARDRRVLVAFNGVLWGT 928

Query: 224  ---------------DVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQR 268
                           D +++     +A G       N K+     + T+    Y   +  
Sbjct: 929  GALNRNRLVCPRSHWDSDDNASRRLHASGP------NLKSL----VGTNGDYEYMSLLND 978

Query: 269  AIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLD 328
             +FC  P G   W+ RLV+++  GCIPV+I      PF D + W +I + V   D+ +L+
Sbjct: 979  TVFCPQPAGTTGWATRLVDSMYAGCIPVLIGQASHFPFYDMLDWGKISIRVEPSDLAQLE 1038

Query: 329  TILTSIPPEVILRKQRLLANPSMKR-AMLFP 358
             IL S     +   +RL AN  + R A+++P
Sbjct: 1039 DILFS--RYSLDDIERLQANIMLVRDALVYP 1067


>gi|9454585|gb|AAF87908.1|AC015447_18 Hypothetical protein [Arabidopsis thaliana]
          Length = 414

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 41/133 (30%), Positives = 59/133 (44%), Gaps = 12/133 (9%)

Query: 40  LKVYVYELPSKYNKKLLL-------KDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADW 92
           LK+YVY+       K LL       K   CL   + +++ +H+ LL S  RT+  +EAD 
Sbjct: 89  LKIYVYDENEIDGLKELLYGRDGSVKTTACLKGQWGSQVKIHKLLLESKFRTIKKDEADL 148

Query: 93  FYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
           F+ P Y  C     GL     + + +      + S  PY+ R+ G DH FV P   GA  
Sbjct: 149 FFVPAYVKCVRMLGGL-----NDKEINQTYVKVLSQMPYFRRSGGRDHIFVFPSGAGAHL 203

Query: 153 HYQEEKAIERGIL 165
                  I R I+
Sbjct: 204 FRSWSTFINRSII 216


>gi|387015840|gb|AFJ50039.1| Exostosin-2-like [Crotalus adamanteus]
          Length = 718

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 33/83 (39%), Positives = 47/83 (56%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC+   G       L +A+  GCIPV+IAD  VLPF++ + W+   + + E
Sbjct: 308 YPQVLQEATFCIVLRGARLGQAVLSDALQAGCIPVVIADSYVLPFSEVLDWKRASVVIPE 367

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +P+L  IL SIP   I   QR
Sbjct: 368 EKMPELYNILQSIPQRQIEEMQR 390


>gi|296081850|emb|CBI20855.3| unnamed protein product [Vitis vinifera]
          Length = 618

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 90/396 (22%), Positives = 148/396 (37%), Gaps = 88/396 (22%)

Query: 2   WVTAI--ALLLAFVSADNTPKIERISGSAGDVLEDDPVGK---------LKVYVYELPSK 50
           W +A+   LL A    +N P I+   G    +  +  V K         LKVY Y    +
Sbjct: 263 WSSAVDKELLYAKSQIENAPIIKNDPGLHASLYRNVSVFKRSYELMENTLKVYTY----R 318

Query: 51  YNKKLLLKDPRCLTHMFAAEI-FMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLP 109
             ++ +   P  +  ++A+E  FM     +    T N  +A  FY P   +  +    L 
Sbjct: 319 EGERPVFHQP-PIKGIYASEGWFMKLMQANKKFVTKNGRKAHLFYLPF--SSLMLEEALY 375

Query: 110 LPFKSPRM-----MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGI 164
           +P    R      +++ + +I + +P+WNRT GADHF V  HD+       E   +    
Sbjct: 376 VPNSHSRKNLEQYLKNYLDMIGAKYPFWNRTGGADHFLVACHDWAPS----ETLKLMANS 431

Query: 165 LPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYD 224
           +  L  + + + F     V L E  + IP   P +++      P + R I  +F G  + 
Sbjct: 432 IRALCNSDIREGFKLGKDVSLPETCVRIPQ-NPLRQLGG---KPPSQRRILAFFAGSMHG 487

Query: 225 VNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPT------TYYEDMQRAIFCLCPLGW 278
                    Y R      WEN   +P   I    P        Y + M+ + +C+C  G+
Sbjct: 488 ---------YVRPILLKYWEN--KDPDMKIYGRMPKAKKGTMNYIQHMKSSKYCICAKGY 536

Query: 279 APWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEV 338
              SPR                                     +D+P L +IL SIP + 
Sbjct: 537 EVNSPR-------------------------------------KDIPNLKSILLSIPEKS 559

Query: 339 ILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
            L  Q  +    +++  L+       D FH IL+ +
Sbjct: 560 YLEIQMRVKQ--VQQHFLWHAKPVKYDVFHMILHSV 593


>gi|18399194|ref|NP_564443.1| exostosin-like protein [Arabidopsis thaliana]
 gi|5091619|gb|AAD39607.1|AC007454_6 F23M19.7 [Arabidopsis thaliana]
 gi|15450928|gb|AAK96735.1| Unknown protein [Arabidopsis thaliana]
 gi|20148711|gb|AAM10246.1| unknown protein [Arabidopsis thaliana]
 gi|332193570|gb|AEE31691.1| exostosin-like protein [Arabidopsis thaliana]
          Length = 477

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 59/214 (27%), Positives = 89/214 (41%), Gaps = 34/214 (15%)

Query: 128 NWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNE 187
           N   W R+ G DH FV+  D  A +H +EE A+   IL       LV  FG        +
Sbjct: 191 NTKAWKRSNGRDHVFVLT-DPVAMWHVREEIAL--SIL-------LVVDFGG----WFRQ 236

Query: 188 GSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLF----YDVNNDPEGGYYARGARAAVW 243
            S +    + P+++Q  Q+     + + V +  L        N       Y +GA+    
Sbjct: 237 DSKSSNGTSLPERIQHTQV--SVIKDVIVPYTHLLPRLDLSQNQRRHSLLYFKGAKHRHR 294

Query: 244 ENFKNNPLFDISTDHPTTYYED--------------MQRAIFCLCPLGWAPWSPRLVEAV 289
                  L+D+  + P    E+              M+ + FCL P G  P S RL +A+
Sbjct: 295 GGLIREKLWDLLVNEPGVVMEEGFPNATGREQSIRGMRNSEFCLHPAGDTPTSCRLFDAI 354

Query: 290 VFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
              CIPVI++D I LPF   I + E  +F +  D
Sbjct: 355 QSLCIPVIVSDTIELPFEGIIDYSEFSVFASVSD 388


>gi|302845050|ref|XP_002954064.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
 gi|300260563|gb|EFJ44781.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
          Length = 1122

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 71/291 (24%), Positives = 108/291 (37%), Gaps = 46/291 (15%)

Query: 71  IFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWP 130
           +F  R L+S+ +RT++ +EAD+++ P+    +L P           M+   +  I   +P
Sbjct: 636 LFWQR-LMSAGIRTVDGDEADYYFIPVNTRTELAPG----------MVEWVLSYIRRTYP 684

Query: 131 YWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSI 190
           +W++  G  H  +   D G               LP   R+ L   F   N   L    I
Sbjct: 685 WWSKDNGNRHLIIHTGDMGI------------ADLPADMRSRLKSAFS--NITWLTHWGI 730

Query: 191 TIPPYAPPQK-MQAHQIPPDTPRSIFVYFRGLFYD-VNNDPEGGYYARGARAAVWENF-- 246
               Y P  K   AH+   D    + V  +G     +N   E     R    A    F  
Sbjct: 731 Y--QYHPVAKWYPAHRPGKDVVLPVMVTTQGFHLSPLNPRVEARARRRNQTMARSGTFFF 788

Query: 247 ----------KNNPLFDISTDHP----TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFG 292
                      +    D S   P         D+    FCL PLG      R V   + G
Sbjct: 789 AGRICGDRKPPDPATGDCSRTRPDYSGGVRQLDISSHKFCLAPLG-GGHGKRQVLVSLMG 847

Query: 293 CIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQ 343
           C+PV+I + ++ PF   I W    + V E D+P L  IL +I  + +   Q
Sbjct: 848 CVPVLIGNGVLQPFEPEIDWSRFSVSVPEADIPDLPRILANISDQRVADMQ 898


>gi|339247053|ref|XP_003375160.1| exostosin-1 [Trichinella spiralis]
 gi|316971539|gb|EFV55297.1| exostosin-1 [Trichinella spiralis]
          Length = 1019

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 33/84 (39%), Positives = 51/84 (60%)

Query: 261 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
            Y E M  + FCL P G    S R +EA+  GCIPVI+++D VLPF++ I W++  +   
Sbjct: 604 NYEELMANSKFCLVPRGRRLGSFRFLEALEKGCIPVILSNDWVLPFSEVIDWDQAVVRGD 663

Query: 321 EEDVPKLDTILTSIPPEVILRKQR 344
           E  + +L ++L + P  VILR ++
Sbjct: 664 ERTLFQLPSLLRAYPESVILRMRQ 687


>gi|168017435|ref|XP_001761253.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162687593|gb|EDQ73975.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 336

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 43/82 (52%)

Query: 263 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
           YED+    F L P G  P S R +E +  G IPV+IAD+ V PF   IPW    +     
Sbjct: 214 YEDLMNTTFALVPAGRQPSSYRFIEVLAAGSIPVLIADNYVKPFDSLIPWYTCAIQFPTT 273

Query: 323 DVPKLDTILTSIPPEVILRKQR 344
           ++ ++   L  + PE  L++QR
Sbjct: 274 EIKRIVNTLRKVSPEEKLKRQR 295


>gi|307108297|gb|EFN56537.1| hypothetical protein CHLNCDRAFT_51514 [Chlorella variabilis]
          Length = 810

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 43/141 (30%), Positives = 64/141 (45%), Gaps = 30/141 (21%)

Query: 42  VYVYELPSKYNKKLLL---KDPRCL---------THM----FAAEIFMHRFLLSSPVRTL 85
           +YVYE+P++Y   ++        C+         TH+    +AAE      LL S  RTL
Sbjct: 340 IYVYEMPAEYVTLMIQYRWGGTACVPRYFDFHNETHLSEDLYAAESGFLEMLLQSEHRTL 399

Query: 86  NPEEADWFYTPIYPTCDLTPTGLPLP--------------FKSPRMMRSAIQLISSNWPY 131
           +PEEAD+ Y P Y +C +TP                      +  M+  A   I ++ PY
Sbjct: 400 DPEEADFLYVPAYTSCLITPVQRTADSLRDMWYGVENLRVHAATHMLLEAYYWIKAHAPY 459

Query: 132 WNRTEGADHFFVVPHDFGACF 152
           WNR  G DH ++V  D  +C+
Sbjct: 460 WNRRGGWDHIWLVTFDEASCY 480



 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 40/159 (25%), Positives = 67/159 (42%), Gaps = 5/159 (3%)

Query: 181 NHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSI--FVYFRGLFYDVNNDPEGGYYARGA 238
           +H C  +  + +P    PQ      +   TPRS     + RG     N     G   R A
Sbjct: 561 SHPCHAQQDLVVPLMKTPQTFYKSPLLGATPRSRTWLAFHRGRVQQDNPPYSRGLRQRLA 620

Query: 239 RAAVWENF--KNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPV 296
            A+    +  K+  +          Y E +  + FCL   G   WS R+ +A++ GCIPV
Sbjct: 621 NASAAGGWLEKHKIVMGEHDVVEGDYSELLASSTFCLVLPG-DGWSARMDDAMLHGCIPV 679

Query: 297 IIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIP 335
           II D++ + F   +      + + + D  +L  +L ++P
Sbjct: 680 IIMDNVHVSFESILDLAAFTVRIPQADAERLPEVLAAVP 718


>gi|159473731|ref|XP_001694987.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158276366|gb|EDP02139.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 721

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 50/182 (27%), Positives = 77/182 (42%), Gaps = 22/182 (12%)

Query: 208 PDTP-RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFD-----------IS 255
           PD P R+  +YF G    +        Y++G R  + + F     +D           I 
Sbjct: 524 PDAPARTTTLYFGGYTKPIMA------YSQGVRQTIHKMFGPGGKYDPEGPNARKDFVIG 577

Query: 256 TDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEI 315
                   + M+ A FCL P+G A W  RL EA+V GC+PVII D I     D +P+EE 
Sbjct: 578 GPAGGAAVDSMKLAKFCLAPMG-AGWGIRLAEAMVSGCVPVIIQDHIYQAHWDILPFEEF 636

Query: 316 GMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLA 375
            + +   ++ +L  IL  + P+ +     L A         F      G A++  +  L 
Sbjct: 637 SIRIGRNELHQLVDILDDVSPQQL---DSLQAGIERYHRAFFWDAHWGGLAYNYTVQALK 693

Query: 376 RK 377
           R+
Sbjct: 694 RR 695



 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 36/115 (31%), Positives = 55/115 (47%), Gaps = 7/115 (6%)

Query: 39  KLKVYVYELPSKYNKKLLLKDPRCLTH-MFAAEI-FMHRFLLSSPVRTLNPEEADWFYTP 96
           ++KVYVY+LP+    +    D   L   M+ +E+ F  + L    VRT NP EA  FY P
Sbjct: 300 EIKVYVYDLPNNVVHRREFNDQWALIDLMYNSELEFTDKLLGDWGVRTENPWEAALFYVP 359

Query: 97  IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGAC 151
            +        G P       +++ A + +  N P++N T G +H F   +D G C
Sbjct: 360 TFTYWFTGNVGHPY-----YVIQHATKWLQENSPFFNLTGGRNHIFWATNDRGVC 409


>gi|393907069|gb|EFO19790.2| exostosin-1 [Loa loa]
          Length = 677

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 32/99 (32%), Positives = 56/99 (56%), Gaps = 1/99 (1%)

Query: 243 WENFKNNPLFDISTDHPTTYYE-DMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 301
           W+ +++    + + ++    YE  M  + FCL P G    S R +EA+  GCIPV+++DD
Sbjct: 271 WKKYEDGRCEEDNVEYDHWDYEMTMANSTFCLTPRGRRLGSFRFLEALRLGCIPVVLSDD 330

Query: 302 IVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVIL 340
             LPF++ I W +  +   E+ V  +  +L++IP + IL
Sbjct: 331 WELPFSEVIDWRQAVVIGHEDTVLTISDVLSAIPFDRIL 369


>gi|384245403|gb|EIE18897.1| hypothetical protein COCSUDRAFT_45043 [Coccomyxa subellipsoidea
           C-169]
          Length = 420

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 53/184 (28%), Positives = 76/184 (41%), Gaps = 45/184 (24%)

Query: 1   MWVTAIALLLAFVSADNTPKIERISGSAGDVLEDDPVGKLK--VYVYELPSKYNKKLLLK 58
           M + A   L+A VS            +AG        G+ K  VYVY+LP K+NK +  K
Sbjct: 1   MGIAAFLCLVALVS----------EAAAGGWRHTQGRGQAKEFVYVYDLPEKFNKDI--K 48

Query: 59  DPRCLTH--MFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLT--------PTGL 108
           +   + H   +  +  +H+ L  S V T +P  A  FY P+Y              P+  
Sbjct: 49  ELPTIWHPEQYDIDQVLHKHLTHSEVNTKDPSVAKVFYIPVYLGRYFNAQWQRFSDPSDA 108

Query: 109 PLPFK---------------------SPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHD 147
            L  K                     +  ++RSAI  +  N+PYWN + GADHF V  +D
Sbjct: 109 WLINKECHGLDSVDCWAEKWKVAENATSDLVRSAIAHVKENYPYWNASNGADHFMVFSYD 168

Query: 148 FGAC 151
            G C
Sbjct: 169 HGKC 172


>gi|312084451|ref|XP_003144281.1| exostosin-1 [Loa loa]
          Length = 694

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 32/99 (32%), Positives = 56/99 (56%), Gaps = 1/99 (1%)

Query: 243 WENFKNNPLFDISTDHPTTYYE-DMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 301
           W+ +++    + + ++    YE  M  + FCL P G    S R +EA+  GCIPV+++DD
Sbjct: 271 WKKYEDGRCEEDNVEYDHWDYEMTMANSTFCLTPRGRRLGSFRFLEALRLGCIPVVLSDD 330

Query: 302 IVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVIL 340
             LPF++ I W +  +   E+ V  +  +L++IP + IL
Sbjct: 331 WELPFSEVIDWRQAVVIGHEDTVLTISDVLSAIPFDRIL 369


>gi|449439621|ref|XP_004137584.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
           IRX7-like [Cucumis sativus]
          Length = 494

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 91/408 (22%), Positives = 149/408 (36%), Gaps = 59/408 (14%)

Query: 17  NTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLL----LKD------------- 59
           N  K E    +  D+  D    +L+V++Y+LP  Y+  LL     KD             
Sbjct: 89  NFVKDEVSVENQSDLGCDPAKARLRVFMYDLPPLYHFGLLGWKGEKDQIWPYVSNRSQIP 148

Query: 60  --PRCLTHMFAAEIFMHRFLLSS----------PVRTLNPEEADWFYTPIYPTCDL---T 104
             P  L    + E ++   LLSS           VR  +  +AD  + P + +      +
Sbjct: 149 PYPGGLNLQHSMEYWLTLDLLSSNVPDMDHTCTAVRVKDSSQADVIFVPFFSSLSYNQHS 208

Query: 105 PTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGI 164
            +         ++++  +         W RT G +H  +  H           K +   +
Sbjct: 209 KSHGKEKINVNKILQQKLIDFLFGQKEWRRTGGKNHLVIAHHPNSML---DARKKLGSAM 265

Query: 165 LPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQK-MQAHQIPPDTPRSIFVYFRGLFY 223
             L         FG+      N     I PY    K + + +      R I VYF+G  Y
Sbjct: 266 FVLAD-------FGRYPAAIANIEKDIIAPYRHIVKTVPSSKSATFDERPILVYFQGAIY 318

Query: 224 DVNNDPEGG------YYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLG 277
                 +GG      YY       V   F +     +  +      + M  + FCL   G
Sbjct: 319 R----KDGGVVRQELYYLLKDEEDVHFTFGS-----VKGNGINKAGQGMASSKFCLNIAG 369

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
             P S RL +++   C+PVII+DDI LP+ D + + E  +FV   D  +   +L  +   
Sbjct: 370 DTPSSNRLFDSIASHCVPVIISDDIELPYEDILDYSEFCVFVRAADSIRKGYLLNLLRGI 429

Query: 338 VILRKQRLLAN-PSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDKST 384
              R  ++      +     +  P+Q GDA   I   ++RK+   KS 
Sbjct: 430 GRERWTKMWDRIKEIVHEFEYQYPSQSGDAVDMIWQAVSRKVSKIKSN 477


>gi|449531319|ref|XP_004172634.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
           sativus]
          Length = 307

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 38/115 (33%), Positives = 59/115 (51%), Gaps = 5/115 (4%)

Query: 35  DPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFY 94
           D   K KVY+Y      +     + PR LT  +A+E +  + +  S  RT +P++A  F+
Sbjct: 146 DMESKFKVYIYP---DGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFRTEDPDQAHLFF 202

Query: 95  TPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFG 149
            PI  +C               ++++ ++ + S +PYWNRT GADHFFV  HD G
Sbjct: 203 IPI--SCHKMRGKGTSYENMTVIVQNYVEGLISKYPYWNRTLGADHFFVTCHDVG 255


>gi|224119530|ref|XP_002318097.1| predicted protein [Populus trichocarpa]
 gi|222858770|gb|EEE96317.1| predicted protein [Populus trichocarpa]
          Length = 243

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 33/83 (39%), Positives = 48/83 (57%), Gaps = 2/83 (2%)

Query: 68  AAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISS 127
           A+E +    L  SP  T NP+EA  F+  I+ +C       PLP    R+++  ++ + S
Sbjct: 85  ASEHYFFMNLRDSPFLTKNPQEAHLFF--IFISCLPLSDEEPLPGYRERVIKRYVKGLIS 142

Query: 128 NWPYWNRTEGADHFFVVPHDFGA 150
            +PYWNRT GADHFFV  H+ G+
Sbjct: 143 TYPYWNRTLGADHFFVSCHNIGS 165


>gi|170583869|ref|XP_001896766.1| exostosin-1 [Brugia malayi]
 gi|158595933|gb|EDP34387.1| exostosin-1, putative [Brugia malayi]
          Length = 411

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 29/75 (38%), Positives = 44/75 (58%)

Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
           M  + FCL P G    S R +EA+  GCIPV+++DD  LPF++ I W +  +   E+ V 
Sbjct: 156 MSNSTFCLTPRGRRLGSFRFLEALRLGCIPVVLSDDWELPFSEVIDWRQAVIIGHEDTVL 215

Query: 326 KLDTILTSIPPEVIL 340
            +  +L +IP + IL
Sbjct: 216 TISDVLNAIPLDRIL 230


>gi|402589373|gb|EJW83305.1| hypothetical protein WUBG_05784 [Wuchereria bancrofti]
          Length = 689

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 29/75 (38%), Positives = 45/75 (60%)

Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
           M  + FCL P G    S R +EA+  GCIPV+++DD  LPF++ I W +  +   E+ V 
Sbjct: 296 MSNSTFCLTPRGRRLGSFRFLEALRLGCIPVVLSDDWELPFSEVIDWRQAVIIGHEDTVL 355

Query: 326 KLDTILTSIPPEVIL 340
            +  +L++IP + IL
Sbjct: 356 TISDVLSAIPLDRIL 370


>gi|195486003|ref|XP_002091324.1| GE13596 [Drosophila yakuba]
 gi|194177425|gb|EDW91036.1| GE13596 [Drosophila yakuba]
          Length = 760

 Score = 62.0 bits (149), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 42/147 (28%), Positives = 70/147 (47%), Gaps = 11/147 (7%)

Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
           K   H I  +T  S+F         ++N  +            W   ++N   + + ++ 
Sbjct: 266 KRYVHGIGSETRNSLF--------HLHNGRDMVLVTTCRHGKSWRELQDNRCDEDNREYD 317

Query: 260 TTYYEDM-QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
              YE + Q + FCL P G    S R +EA+  GCIPV++++  VLPF   I W++  ++
Sbjct: 318 RYDYETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIW 377

Query: 319 VAEEDVPKLDTILTSIPPEVI--LRKQ 343
             E  + ++  I+ SIP E I  LR+Q
Sbjct: 378 ADERLLLQVPDIVRSIPAERIFALRQQ 404


>gi|3435314|gb|AAC32397.1| putative tumor suppressor homolog [Drosophila melanogaster]
          Length = 760

 Score = 62.0 bits (149), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 42/147 (28%), Positives = 70/147 (47%), Gaps = 11/147 (7%)

Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
           K   H I  +T  S+F         ++N  +            W   ++N   + + ++ 
Sbjct: 266 KRYVHGIGSETRNSLF--------HLHNGRDMVLVTTCRHGKSWRELQDNRCDEDNREYD 317

Query: 260 TTYYEDM-QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
              YE + Q + FCL P G    S R +EA+  GCIPV++++  VLPF   I W++  ++
Sbjct: 318 RYDYETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIW 377

Query: 319 VAEEDVPKLDTILTSIPPEVI--LRKQ 343
             E  + ++  I+ SIP E I  LR+Q
Sbjct: 378 ADERLLLQVPDIVRSIPAERIFALRQQ 404


>gi|195583406|ref|XP_002081513.1| GD11060 [Drosophila simulans]
 gi|194193522|gb|EDX07098.1| GD11060 [Drosophila simulans]
          Length = 760

 Score = 62.0 bits (149), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 42/147 (28%), Positives = 70/147 (47%), Gaps = 11/147 (7%)

Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
           K   H I  +T  S+F         ++N  +            W   ++N   + + ++ 
Sbjct: 266 KRYVHGIGSETRNSLF--------HLHNGRDMVLVTTCRHGKSWRELQDNRCDEDNREYD 317

Query: 260 TTYYEDM-QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
              YE + Q + FCL P G    S R +EA+  GCIPV++++  VLPF   I W++  ++
Sbjct: 318 RYDYETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIW 377

Query: 319 VAEEDVPKLDTILTSIPPEVI--LRKQ 343
             E  + ++  I+ SIP E I  LR+Q
Sbjct: 378 ADERLLLQVPDIVRSIPAERIFALRQQ 404


>gi|255570432|ref|XP_002526175.1| catalytic, putative [Ricinus communis]
 gi|223534552|gb|EEF36251.1| catalytic, putative [Ricinus communis]
          Length = 453

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 36/119 (30%), Positives = 62/119 (52%), Gaps = 5/119 (4%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y+E ++ A FCL P G + W+ R  E+    C+PV+++D   LPF + I +  + +    
Sbjct: 303 YFEHLRNAKFCLAPRGESSWTLRFYESFFVECVPVLLSDQAELPFQNVIDYTHVSIKWPS 362

Query: 322 EDV-PKLDTILTSIPPEVILRKQRLLANPSMKRAM-LFPQPAQPGDAFHQILNGLARKL 378
             + P+L   L SIP E I   +R++AN    R + ++   ++   A   I+  L RK+
Sbjct: 363 TKIGPELLEYLESIPDEDI---ERMIANGRQVRCLWVYAPESEQCSAMQGIMWELQRKV 418



 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 39/133 (29%), Positives = 57/133 (42%), Gaps = 12/133 (9%)

Query: 40  LKVYVYELPSKYNKKLLLKD-------PRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADW 92
           +K+YVYE       K LL+          C+   +  ++ +HR L +S  RT   EEAD 
Sbjct: 80  MKIYVYEEKEIDGLKELLRGRDGKISADTCVKGQWGTQVKIHRLLQNSRYRTRKKEEADL 139

Query: 93  FYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
           F+ P Y  C     GL     + + +      + S  PY+ R+ G DH FV P   GA  
Sbjct: 140 FFVPAYVKCVRMLGGL-----NDKEINLTYVKVLSQMPYFRRSGGRDHIFVFPSGAGAHL 194

Query: 153 HYQEEKAIERGIL 165
                  I R ++
Sbjct: 195 FRSWATYINRSVI 207


>gi|17137330|ref|NP_477231.1| tout-velu, isoform A [Drosophila melanogaster]
 gi|442623684|ref|NP_001260971.1| tout-velu, isoform B [Drosophila melanogaster]
 gi|61212925|sp|Q9V730.1|EXT1_DROME RecName: Full=Exostosin-1; AltName: Full=Protein tout-velu
 gi|7303172|gb|AAF58236.1| tout-velu, isoform A [Drosophila melanogaster]
 gi|60678089|gb|AAX33551.1| LD10920p [Drosophila melanogaster]
 gi|85701308|dbj|BAE78509.1| Tout-velu [Drosophila melanogaster]
 gi|220960240|gb|ACL92656.1| ttv-PA [synthetic construct]
 gi|440214386|gb|AGB93503.1| tout-velu, isoform B [Drosophila melanogaster]
          Length = 760

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 42/147 (28%), Positives = 70/147 (47%), Gaps = 11/147 (7%)

Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
           K   H I  +T  S+F         ++N  +            W   ++N   + + ++ 
Sbjct: 266 KRYVHGIGSETRNSLF--------HLHNGRDMVLVTTCRHGKSWRELQDNRCDEDNREYD 317

Query: 260 TTYYEDM-QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
              YE + Q + FCL P G    S R +EA+  GCIPV++++  VLPF   I W++  ++
Sbjct: 318 RYDYETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIW 377

Query: 319 VAEEDVPKLDTILTSIPPEVI--LRKQ 343
             E  + ++  I+ SIP E I  LR+Q
Sbjct: 378 ADERLLLQVPDIVRSIPAERIFALRQQ 404


>gi|195334376|ref|XP_002033859.1| GM21554 [Drosophila sechellia]
 gi|194125829|gb|EDW47872.1| GM21554 [Drosophila sechellia]
          Length = 760

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 42/147 (28%), Positives = 70/147 (47%), Gaps = 11/147 (7%)

Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
           K   H I  +T  S+F         ++N  +            W   ++N   + + ++ 
Sbjct: 266 KRYVHGIGSETRNSLF--------HLHNGRDMVLVTTCRHGKSWRELQDNRCDEDNREYD 317

Query: 260 TTYYEDM-QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
              YE + Q + FCL P G    S R +EA+  GCIPV++++  VLPF   I W++  ++
Sbjct: 318 RYDYETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIW 377

Query: 319 VAEEDVPKLDTILTSIPPEVI--LRKQ 343
             E  + ++  I+ SIP E I  LR+Q
Sbjct: 378 ADERLLLQVPDIVRSIPAERIFALRQQ 404


>gi|124301269|gb|ABN04853.1| Exostosin-like [Medicago truncatula]
          Length = 196

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 37/119 (31%), Positives = 63/119 (52%), Gaps = 8/119 (6%)

Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
           + M  + FCL P G  P + RL +A+V  C+PVI++D I LPF D I + +I +FV    
Sbjct: 51  QGMHTSKFCLHPAGDTPSACRLFDAIVSLCVPVIVSDSIELPFEDTIDYRKIAVFVETAA 110

Query: 324 VPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
             +   L +IL  + P+ I+  Q+      +K    + +  +P    ++I   +++KLP
Sbjct: 111 AIQPGYLVSILRGMAPDRIVEYQK-----ELKEVKRYFKYDEPDGTVNEIWRQISKKLP 164


>gi|194883052|ref|XP_001975618.1| GG20465 [Drosophila erecta]
 gi|190658805|gb|EDV56018.1| GG20465 [Drosophila erecta]
          Length = 760

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 42/147 (28%), Positives = 70/147 (47%), Gaps = 11/147 (7%)

Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
           K   H I  +T  S+F         ++N  +            W   ++N   + + ++ 
Sbjct: 266 KRYVHGIGSETRNSLF--------HLHNGRDMVLVTTCRHGKSWRELQDNRCDEDNREYD 317

Query: 260 TTYYEDM-QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
              YE + Q + FCL P G    S R +EA+  GCIPV++++  VLPF   I W++  ++
Sbjct: 318 RYDYETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIW 377

Query: 319 VAEEDVPKLDTILTSIPPEVI--LRKQ 343
             E  + ++  I+ SIP E I  LR+Q
Sbjct: 378 ADERLLLQVPDIVRSIPAERIFALRQQ 404


>gi|71897059|ref|NP_001026520.1| exostosin-2 [Gallus gallus]
 gi|60098757|emb|CAH65209.1| hypothetical protein RCJMB04_7p22 [Gallus gallus]
          Length = 567

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 33/104 (31%), Positives = 55/104 (52%), Gaps = 8/104 (7%)

Query: 241 AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 300
           AV +   N+ +FD        Y + +Q A FC+   G       L + +  GC+PVIIAD
Sbjct: 295 AVRKRCHNSQMFD--------YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVIIAD 346

Query: 301 DIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQR 344
             +LPF++ + W+   + + E+ +P++ +IL S+P   I   QR
Sbjct: 347 SYILPFSEVLDWKRASVVIPEDKMPEMYSILQSVPQRQIEEMQR 390


>gi|332030617|gb|EGI70305.1| Exostosin-2 [Acromyrmex echinatior]
          Length = 594

 Score = 61.2 bits (147), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 30/83 (36%), Positives = 46/83 (55%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q + FCL   G       L++A+  GCIPVIIAD +++PF D I W +  +F+ E
Sbjct: 187 YPDSLQTSTFCLIIRGARLAQSALLDAMAAGCIPVIIADSLMMPFHDVIDWTKAAVFIRE 246

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
            D+     +L  I P+ I+  Q 
Sbjct: 247 VDILLTIQLLKKISPQRIMDMQE 269


>gi|224128538|ref|XP_002320357.1| predicted protein [Populus trichocarpa]
 gi|222861130|gb|EEE98672.1| predicted protein [Populus trichocarpa]
          Length = 137

 Score = 61.2 bits (147), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 41/117 (35%), Positives = 58/117 (49%), Gaps = 9/117 (7%)

Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
           M  + FCL   G  P S RL +A+   C+PVII+D I LPF D + + E G+FV   D  
Sbjct: 1   MASSKFCLNIAGDTPSSNRLFDAIASQCVPVIISDGIELPFEDVLDYSEFGVFVRASDAV 60

Query: 326 K---LDTILTSIPPE--VILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARK 377
           K   L  +L  I  +   IL ++     P  +    +  P+QPGDA   +   + RK
Sbjct: 61  KKGYLLYLLRGIKKDQWTILWERLKEIAPQFE----YRYPSQPGDAVDMVWEAVLRK 113


>gi|357508935|ref|XP_003624756.1| Cysteine synthase [Medicago truncatula]
 gi|355499771|gb|AES80974.1| Cysteine synthase [Medicago truncatula]
          Length = 407

 Score = 61.2 bits (147), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 39/120 (32%), Positives = 61/120 (50%), Gaps = 5/120 (4%)

Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
           M+ + FCL   G  P S RL +A+   C+PVII+D+I LP+ D + + +  +FV   D  
Sbjct: 45  MRSSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDEIELPYEDVLDYSKFCVFVRTRDAV 104

Query: 326 K---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDK 382
           K   L   + SI  +   R    L    +++   F  P++ GDA   I   ++RK+P  K
Sbjct: 105 KKKYLINFIRSIGKDEWTRMWNRL--KEVEKFFEFQFPSKEGDAVEMIWQAVSRKVPFMK 162


>gi|449533617|ref|XP_004173769.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
           IRX7-like, partial [Cucumis sativus]
          Length = 330

 Score = 61.2 bits (147), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 70/318 (22%), Positives = 119/318 (37%), Gaps = 34/318 (10%)

Query: 80  SPVRTLNPEEADWFYTPIYPTCDL---TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTE 136
           + VR  +  +AD  + P + +      + +         ++++  +         W RT 
Sbjct: 17  TAVRVKDSSQADVIFVPFFSSLSYNQHSKSHGKEKINVNKILQQKLIDFLFGQKEWRRTG 76

Query: 137 GADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYA 196
           G +H  +  H            ++      L     ++  FG+      N     I PY 
Sbjct: 77  GKNHLVIAHH----------PNSMLDARKKLGSAMFVLADFGRYPAAIANIEKDIIAPYR 126

Query: 197 PPQK-MQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGG------YYARGARAAVWENFKNN 249
              K + + +      R I VYF+G  Y      +GG      YY       V   F + 
Sbjct: 127 HIVKTVPSSKSATFDERPILVYFQGAIYR----KDGGVVRQELYYLLKDEEDVHFTFGS- 181

Query: 250 PLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 309
               +  +      + M  + FCL   G  P S RL +++   C+PVII+DDI LP+ D 
Sbjct: 182 ----VKGNGINKAGQGMASSKFCLNIAGDTPSSNRLFDSIASHCVPVIISDDIELPYEDI 237

Query: 310 IPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDA 366
           + + E  +FV   D  +   L  +L  I  E   +    +    +     +  P+Q GDA
Sbjct: 238 LDYSEFCVFVRAADSIRKGYLLNLLRGIGRERWTKMWDRI--KEIVHEFEYQYPSQSGDA 295

Query: 367 FHQILNGLARKLPHDKST 384
              I   ++RK+   KS 
Sbjct: 296 VDMIWQAVSRKVSKIKSN 313


>gi|223975397|gb|ACN31886.1| unknown [Zea mays]
          Length = 134

 Score = 61.2 bits (147), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 39/116 (33%), Positives = 60/116 (51%), Gaps = 3/116 (2%)

Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
           M+ + FCL P G  P S RL +A+V  C+PVI++  I LPF D I + E  +F + E+  
Sbjct: 1   MRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSSRIELPFEDEIDYSEFSLFFSVEEAL 60

Query: 326 KLDTILTSIPPEVILRK--QRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
           + D +L  +  +V  RK     L   ++     F  P + GDA + I   +  K+P
Sbjct: 61  RPDYLLNEL-RQVPKRKWVDMWLKLKNVSHHYEFQYPPRKGDAVNMIWRQVRHKIP 115


>gi|255642058|gb|ACU21295.1| unknown [Glycine max]
          Length = 340

 Score = 61.2 bits (147), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 41/133 (30%), Positives = 59/133 (44%), Gaps = 12/133 (9%)

Query: 40  LKVYVYELPSKYNKKLLLK-------DPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADW 92
           LK+YVY+       K LL+       D  CL   + +++ +H+ LL S  RT   EEAD 
Sbjct: 67  LKIYVYQENEIDGLKELLRGRDAKITDEACLKGQWGSQVKIHKLLLQSKQRTWKKEEADL 126

Query: 93  FYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
           F+ P Y  C     GL     + + + S    + S  PY+  + G +H FV P   GA  
Sbjct: 127 FFVPSYVKCARMMGGL-----NDKEINSTYVKVISQMPYFRLSGGRNHIFVFPSGAGAHL 181

Query: 153 HYQEEKAIERGIL 165
                  I R I+
Sbjct: 182 FKSWATYINRSII 194


>gi|298711085|emb|CBJ26480.1| Glycosyltransferase, family GT47 [Ectocarpus siliculosus]
          Length = 561

 Score = 61.2 bits (147), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 33/97 (34%), Positives = 48/97 (49%), Gaps = 1/97 (1%)

Query: 248 NNPLFDIST-DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 306
           N  L D S  D  +  YED+    F L P G +P + RL EA+  G +PV I  D V PF
Sbjct: 424 NKGLCDASERDFDSVSYEDLMNTTFALLPAGRSPATYRLAEALSAGALPVFIHQDFVKPF 483

Query: 307 ADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQ 343
              +PW E       E+ P++   L ++P + + + Q
Sbjct: 484 PGQVPWSEFSFSFPPEEAPRILETLRAVPDKKLAQMQ 520


>gi|326932982|ref|XP_003212589.1| PREDICTED: exostosin-1c-like [Meleagris gallopavo]
          Length = 734

 Score = 61.2 bits (147), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 74/286 (25%), Positives = 120/286 (41%), Gaps = 39/286 (13%)

Query: 79  SSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGA 138
           SS   T +PEEA  F   I  T D  P        S   +RS  Q I S +P WN   G 
Sbjct: 130 SSRYYTPHPEEACLFVLSI-DTLDRDPL-------SAHYIRSVEQRIRS-FPLWN--SGR 178

Query: 139 DHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPP 198
           +H     +  G   +Y EE   + G   +L +A+    F  +N     + SI + P   P
Sbjct: 179 NHLIFNLYA-GTWPNYTEELGFDIG-HAMLAKAS----FHSKNFRPGFDVSIPLFPREHP 232

Query: 199 QK------MQAHQIPPDTPRSIFVYFRGLFY-------------DVNNDPEGGYYARGAR 239
           Q+      +  + +PP   +   + F+G  Y              ++N  +         
Sbjct: 233 QRGGQSGWLHHNSVPPK--KKYLLVFKGKRYLTGIGSGTRNALHHIHNGRDIVSLTTCRH 290

Query: 240 AAVWENFKNNPLFDISTDHPT-TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 298
              WE  K+      + D+    Y E +  + FC+ P G    S R +EA+   CIPV++
Sbjct: 291 GKDWERHKDTRCDQDNVDYEKFDYQELLHNSTFCIVPRGRRLGSFRFLEALQAACIPVLL 350

Query: 299 ADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQR 344
           +D   LPF++AI W +  +  +E  + ++ + +  I PE +L  Q+
Sbjct: 351 SDGWELPFSEAIDWGKAAVVGSERLLLQIPSAVRCIRPERVLAFQQ 396


>gi|402218829|gb|EJT98904.1| hypothetical protein DACRYDRAFT_110241 [Dacryopinax sp. DJM-731 SS1]
          Length = 1153

 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 77/322 (23%), Positives = 123/322 (38%), Gaps = 58/322 (18%)

Query: 65   HMFAAEIFMHRFLLSSPVRTLN--PEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAI 122
            H F+A++ M       P+R ++   EEA+  + P Y                      A 
Sbjct: 816  HCFSADVSMEL-----PLREVSVPAEEAEMIFMPFY-------------------QNRAQ 851

Query: 123  QLISSNWPYWNRT-EGAD-HFFVVP--HDFGACFHYQEEKAIERGILPLLQRATLV---- 174
             L    W +   T  G D H  V+P  HDFGAC  +  E ++ R       RA       
Sbjct: 852  WLFGEEWKFMRETIPGLDPHKVVIPFTHDFGACMWW--EHSVYRAREERFDRAKEARDSI 909

Query: 175  --QTFGQRNHVCLNE-GSITIPPY--APPQKMQA----HQIPPDTPRSIFVYFRGLFYDV 225
              Q     N  C      + +PP   A PQ   A     ++ P   R++   F+G ++  
Sbjct: 910  AWQVMADMNTPCYAPLQDVVMPPRTCASPQLYAAFSDMARVKPARQRNVLATFKGSYWGT 969

Query: 226  NNDPEGGYYARGARAAVWENFKNNPL---------FDISTDHPTTYYEDMQRAIFCLCPL 276
              +          R    E+     L         +D   D+  +Y   +   I+C  P 
Sbjct: 970  GANTRRKLNCE-KRLRTLEDVATPRLETEQRLMTVWDSLGDY-ESYPAILNDTIWCPLPE 1027

Query: 277  GWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPP 336
            G   W+ RL + V  GCIPV +      PF D + W ++ + +  +D+ +++ +L S   
Sbjct: 1028 GVTGWATRLEDVVYGGCIPVFVGHASQYPFYDMLDWSKLSIAIERKDLQRIEEVLMSYTM 1087

Query: 337  EVILRKQRLLANPSMKRAMLFP 358
            E I R Q  L    ++ A L+P
Sbjct: 1088 EEIERFQTNLM--LVRDAFLYP 1107


>gi|332017043|gb|EGI57842.1| Exostosin-1 [Acromyrmex echinatior]
          Length = 911

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 36/124 (29%), Positives = 59/124 (47%), Gaps = 1/124 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y ++N  +  +         W  F++      + ++ T  YE +   A FCL P G
Sbjct: 254 RNALYHLHNGKDLVFVTTCRHGKTWREFQDEHCQQDNQEYDTYDYEILLMNATFCLVPRG 313

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+  GCIPVI+++   LPF + I W +  +F  E  + ++  IL S+   
Sbjct: 314 RRLGSFRFLEALRSGCIPVILSNSWALPFHERIDWNQAVIFSDERLLLQIPDILRSVSNV 373

Query: 338 VILR 341
            IL+
Sbjct: 374 QILK 377


>gi|356560377|ref|XP_003548469.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
           At5g03795-like [Glycine max]
          Length = 334

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 71/268 (26%), Positives = 104/268 (38%), Gaps = 50/268 (18%)

Query: 67  FAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLIS 126
           +A+E +  +  + S   T +P EAD F+ P +    L              +R  IQ IS
Sbjct: 95  YASESYFKKVPMKSHFITKDPTEADLFFLP-FSIARLRHNRRVGVGGKQDFIRDYIQNIS 153

Query: 127 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLN 186
             +PYWNRT GADHF+V  H  G                  + +A  V+ F     VC +
Sbjct: 154 HKYPYWNRTGGADHFYVACHSIGR---------------SAMDKAPDVK-FNAIQVVCSS 197

Query: 187 E----GSITIPPYAPPQKMQAHQIPP---DTPRSIFVYFRGLFYDVNNDPEGGYYARGAR 239
                G+I       PQ       PP    + R    +F G      N P         R
Sbjct: 198 SYFLTGNIAHKDTCLPQIWPRKGNPPILVSSKRKRLAFFAGGV----NSP--------VR 245

Query: 240 AAVWENFKNNPLFDISTDH---PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPV 296
             + E +KN+   +I   H    T Y +++  + F L   G+   + R         I V
Sbjct: 246 VKLLETWKNDS--EIFVHHGRLKTPYADELLGSKFGLHVKGFEVNTTR---------IGV 294

Query: 297 IIADDIVLPFADAIPWEEIGMFVAEEDV 324
           IIA+   LPFAD + W+   + V   D+
Sbjct: 295 IIANYYDLPFADVLNWKSFSVVVTTLDI 322


>gi|125586098|gb|EAZ26762.1| hypothetical protein OsJ_10674 [Oryza sativa Japonica Group]
          Length = 364

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 38/119 (31%), Positives = 59/119 (49%), Gaps = 5/119 (4%)

Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
           + M+ + FCL   G  P S RL +++V  C+P+II+D+I LPF D + + +  + V   D
Sbjct: 213 QGMRASKFCLNIAGDTPSSNRLFDSIVSHCVPIIISDEIELPFEDVLDYSKFCIIVRGAD 272

Query: 324 VPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
             K   L  ++  I  E   R    L    ++R   +  P+Q  DA   I   +ARK P
Sbjct: 273 AVKKGFLMNLINGISREDWTRMWNRLK--EVERHFEYQYPSQNDDAVQMIWKAIARKAP 329


>gi|224051022|ref|XP_002199808.1| PREDICTED: exostosin-2 [Taeniopygia guttata]
          Length = 718

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 54/104 (51%), Gaps = 8/104 (7%)

Query: 241 AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 300
           AV +    N +FD        Y + +Q + FC+   G       L + +  GC+PVIIAD
Sbjct: 295 AVRKRCYKNQVFD--------YPQVLQESTFCVVLRGARLGQAVLSDVLQAGCVPVIIAD 346

Query: 301 DIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQR 344
             +LPF++ + W+   + + EE +P++ +IL SIP   I   QR
Sbjct: 347 SYILPFSEVLDWKRASVVIPEEKMPEMYSILQSIPQRQIEEMQR 390


>gi|449270126|gb|EMC80844.1| Exostosin-2, partial [Columba livia]
          Length = 714

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 33/104 (31%), Positives = 54/104 (51%), Gaps = 8/104 (7%)

Query: 241 AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 300
           AV +    N +FD        Y + +Q + FC+   G       L + +  GC+PVIIAD
Sbjct: 291 AVRKRCHKNQVFD--------YPQVLQESTFCVVLRGARLGQAVLSDVLQAGCVPVIIAD 342

Query: 301 DIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQR 344
             +LPF++ + W+   + + EE +P++ +IL S+P   I   QR
Sbjct: 343 SYILPFSEVLDWKRASVVIPEEKMPEMYSILQSVPQRQIEEMQR 386


>gi|327259699|ref|XP_003214673.1| PREDICTED: exostosin-2-like [Anolis carolinensis]
          Length = 718

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 33/97 (34%), Positives = 51/97 (52%), Gaps = 8/97 (8%)

Query: 248 NNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFA 307
           NN +FD        Y + +Q A FC+   G       L + +  GCIPV+IAD  +LPF+
Sbjct: 302 NNQVFD--------YPQVLQEATFCIVLRGARLGQAVLSDVLQAGCIPVVIADSYILPFS 353

Query: 308 DAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQR 344
           + + W+   + + EE + ++ +IL SIP   I   QR
Sbjct: 354 EVLDWKRASVVIPEEKMSEMYSILHSIPQRQIEEMQR 390


>gi|297738432|emb|CBI27633.3| unnamed protein product [Vitis vinifera]
          Length = 441

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 38/119 (31%), Positives = 60/119 (50%), Gaps = 5/119 (4%)

Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
           + M+ + FCL P G  P S RL +A+V  C+PVI++D I LP+ D I + +  +F ++++
Sbjct: 303 QGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPYEDEIDYTQFSIFFSDKE 362

Query: 324 VPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
             +   +   L  IP E  +   R L   S      F  P + GDA   +   +  KLP
Sbjct: 363 ALEPGYMIEQLRQIPKERWVEMWRHLKYISHHYE--FQYPPKKGDAIDMLWRQVKHKLP 419


>gi|388512347|gb|AFK44235.1| unknown [Lotus japonicus]
          Length = 267

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 63/254 (24%), Positives = 102/254 (40%), Gaps = 21/254 (8%)

Query: 131 YWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSI 190
           +W R+ G DH F + H     F   +          L +   +V  FG R     N    
Sbjct: 9   HWQRSRGRDHVFPMTHPNAFRFLRNQ----------LNESIQVVVDFG-RYPKGSNLNKD 57

Query: 191 TIPPYAPPQKMQAHQIPPD--TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN 248
            + PY           P D    R   ++FRG  +  +   EG   A+ A+     +  +
Sbjct: 58  VVSPYVHVVDSFTDDEPQDPYESRPTLLFFRGRTFRKD---EGIVRAKLAKILTGFDDVH 114

Query: 249 NPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
                 + ++     + M+ + FCL P G  P S RL +A+V  C+PVI++D I LPF D
Sbjct: 115 YERSFATGENIKLSSQGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPFED 174

Query: 309 AIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
            I + +  +F + ++  +   +   L   P +      R L N S      F  P +  D
Sbjct: 175 EIDYSQFSLFFSFKEALQPGYMIDQLRKFPKDKWSEMWRQLKNISHHYE--FQYPPKKED 232

Query: 366 AFHQILNGLARKLP 379
           A + +   +  KLP
Sbjct: 233 AVNMLWRQVKHKLP 246


>gi|156378657|ref|XP_001631258.1| predicted protein [Nematostella vectensis]
 gi|156218295|gb|EDO39195.1| predicted protein [Nematostella vectensis]
          Length = 563

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 23/66 (34%), Positives = 43/66 (65%)

Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
           MQRA FCL   G       L+++++ GCIP++++DD +LPF++ + W+   + V+E ++ 
Sbjct: 234 MQRATFCLVIRGARLGQTALLDSLMMGCIPIVVSDDYILPFSEVLDWKRAAVVVSENEID 293

Query: 326 KLDTIL 331
           ++  IL
Sbjct: 294 RIPLIL 299


>gi|338711993|ref|XP_001489915.2| PREDICTED: LOW QUALITY PROTEIN: exostosin-2 [Equus caballus]
          Length = 728

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 40/131 (30%), Positives = 66/131 (50%), Gaps = 9/131 (6%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC+   G       L + +  GC+PV+IAD  +LPF++ + W+   + V E
Sbjct: 308 YPQVLQEATFCMVLRGARLGQAVLSDVLRAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367

Query: 322 EDVPKLDTILTSIPPEVILRKQR-LLANPSMKRAMLFPQPAQ-----PGDAF-HQILNGL 374
           E +  + +IL SIP   I   QR L   P+ + + L     +     PG+ +  +I+N  
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQRQLFQEPARRESWLVANHXRSSLIXPGEEWGSEIIND- 426

Query: 375 ARKLPHDKSTY 385
            R  P+  S+Y
Sbjct: 427 -RIYPYAASSY 436


>gi|356534007|ref|XP_003535549.1| PREDICTED: LOW QUALITY PROTEIN: xylogalacturonan
           beta-1,3-xylosyltransferase-like [Glycine max]
          Length = 373

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 72/295 (24%), Positives = 110/295 (37%), Gaps = 30/295 (10%)

Query: 83  RTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSP--RMMRSAIQLISSNWPYWNRTEGADH 140
           R L+P  AD  + P + T             +   +  R  +  + S    WNR+ G DH
Sbjct: 48  RVLDPLLADVVFVPFFATLSANKGAFRKKHGNDDYKRQRQVVDAVKSTQ-VWNRSGGRDH 106

Query: 141 FFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSI---TIPPYAP 197
            FV+    GA         +  G      R       G+ + V   + S+    I PY  
Sbjct: 107 VFVLT---GAFCKNPSFSFVPGGDFGGWSRGGGGSNCGESDVVPHTQVSVIKDVIVPYM- 162

Query: 198 PQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTD 257
              +    +  +  R   +YF+G      +   GG      R  +W+   + P   +   
Sbjct: 163 -HLLPRLDLSENKVRHQLLYFKG----AKHRHRGGII----REKLWDLLVSEPGVIMEEG 213

Query: 258 HPTTYYED-----MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 312
            P     +     M+ + FCL P G  P S RL +A+   CIPVI++D I LPF   + +
Sbjct: 214 FPNATGREQSIKGMRTSEFCLHPAGDTPTSCRLFDAIQSLCIPVIVSDIIELPFEGMVDY 273

Query: 313 EEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPG 364
            E  +F A  D  K   L   L S   E   +K R   N +  + +       PG
Sbjct: 274 AEFSVFPAVNDARKPSWLGNHLQSFSKE---QKDRFRQNMAQVQPIFVYDNGHPG 325


>gi|297851878|ref|XP_002893820.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297339662|gb|EFH70079.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 478

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 57/214 (26%), Positives = 86/214 (40%), Gaps = 34/214 (15%)

Query: 128 NWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNE 187
           N   W R+ G DH FV+  D  A +H +EE A+            LV  FG        +
Sbjct: 192 NTEAWKRSNGRDHVFVLT-DPVAMWHVREEIALS---------ILLVVDFGG----WFRQ 237

Query: 188 GSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLF----YDVNNDPEGGYYARGARAAVW 243
            S +    + P++++  Q+     + + V +  L        N       Y +GA+    
Sbjct: 238 DSKSSNGTSLPERIEHTQV--SVIKDVIVPYTHLLPSLDLSQNQRRHSLLYFKGAKHRHR 295

Query: 244 ENFKNNPLFDISTDHPTTYYED--------------MQRAIFCLCPLGWAPWSPRLVEAV 289
                  L+D+  D      E+              M+ + FCL P G  P S RL +A+
Sbjct: 296 GGLIREKLWDLLVDEQGIVMEEGFPNATGREQSIIGMRNSEFCLHPAGDTPTSCRLFDAI 355

Query: 290 VFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
              CIPVI++D I LPF   I + E  +FV   D
Sbjct: 356 QSLCIPVIVSDTIELPFEGIIDYSEFSVFVPVSD 389


>gi|326437066|gb|EGD82636.1| hypothetical protein PTSG_03293 [Salpingoeca sp. ATCC 50818]
          Length = 366

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 38/113 (33%), Positives = 54/113 (47%), Gaps = 8/113 (7%)

Query: 40  LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY- 98
           L+VY+Y LP        L+ P    + +AAE    R L +S   T +PEEA  F+  +  
Sbjct: 106 LRVYMYPLPES------LQLPPTRDYKYAAEATFTRMLRASTFSTDSPEEAQLFFVRVSC 159

Query: 99  PTCDLTPTGLPLPFKSPRMMRSAI-QLISSNWPYWNRTEGADHFFVVPHDFGA 150
                T        ++     +A+   +   +PYWNRT+G DHFFV  HD GA
Sbjct: 160 AEARFTQRDREAGQRAADAHATAVLAHVQQRYPYWNRTQGRDHFFVCGHDMGA 212



 Score = 43.1 bits (100), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 19/50 (38%), Positives = 29/50 (58%)

Query: 282 SPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTIL 331
           +PRL++A+  GC+PV IAD    P A  + W  + +F+AE D   +   L
Sbjct: 260 TPRLMDAIWAGCVPVFIADHYDPPLAKYVDWALLAVFIAEADAAHIKAHL 309


>gi|215686890|dbj|BAG89740.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215693850|dbj|BAG89049.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 264

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 52/189 (27%), Positives = 85/189 (44%), Gaps = 23/189 (12%)

Query: 205 QIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYE 264
            +  +  R   +YF+G      +   GG      R  +W+   N P   +   +P     
Sbjct: 60  HLSENKDRPTLLYFKG----AKHRHRGGL----VREKLWDLMVNEPDVVMEEGYPNATGR 111

Query: 265 D-----MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV 319
           +     M+ + FCL P G  P S RL +AV   CIPVI++D+I LPF   I + E  +FV
Sbjct: 112 EQSIKGMRTSEFCLHPAGDTPTSCRLFDAVASLCIPVIVSDEIELPFEGMIDYTEFAIFV 171

Query: 320 AEEDVPK---LDTILTSIPPEVILRKQRLLAN--PSMKRAMLFP----QPAQPGDAFHQI 370
           +  +  +   L   L ++P +     +R +A+  P  +   ++P      AQ G A + I
Sbjct: 172 SVNNSMRPKWLTNYLRNVPRQQKDEFRRNMAHVQPIFEYDSIYPGRMASAAQDG-AVNHI 230

Query: 371 LNGLARKLP 379
              + +KLP
Sbjct: 231 WKKIHQKLP 239


>gi|307177266|gb|EFN66444.1| Exostosin-1 [Camponotus floridanus]
          Length = 711

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 35/124 (28%), Positives = 59/124 (47%), Gaps = 1/124 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y ++N  +  +         W  F++      + ++ T  YE +   A FCL P G
Sbjct: 254 RNALYHLHNGKDLVFVTTCRHGKAWREFQDEHCQQNNQEYDTYDYEILLMNATFCLVPRG 313

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+  GCIPVI+++   LPF + I W +  +F  E  + ++  I+ S+   
Sbjct: 314 RRLGSFRFLEALRAGCIPVILSNGWALPFHERIDWTQAAIFSDERLLLQIPDIVRSVSNV 373

Query: 338 VILR 341
            IL+
Sbjct: 374 QILK 377


>gi|345305619|ref|XP_001510258.2| PREDICTED: exostosin-2 [Ornithorhynchus anatinus]
          Length = 615

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 47/83 (56%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC+   G       L + +  GC+PVI+AD  +LPF++ + W+   + + E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQSVLSDVLQAGCVPVILADSYILPFSEVLDWKRASVVIPE 367

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +P++ +IL SIP   I   QR
Sbjct: 368 EKMPEMYSILQSIPQRQIEEMQR 390


>gi|115452759|ref|NP_001049980.1| Os03g0324700 [Oryza sativa Japonica Group]
 gi|108707908|gb|ABF95703.1| secondary cell wall-related glycosyltransferase family 47,
           putative, expressed [Oryza sativa Japonica Group]
 gi|113548451|dbj|BAF11894.1| Os03g0324700 [Oryza sativa Japonica Group]
 gi|125543687|gb|EAY89826.1| hypothetical protein OsI_11372 [Oryza sativa Indica Group]
 gi|215768347|dbj|BAH00576.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 468

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 38/119 (31%), Positives = 59/119 (49%), Gaps = 5/119 (4%)

Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
           + M+ + FCL   G  P S RL +++V  C+P+II+D+I LPF D + + +  + V   D
Sbjct: 317 QGMRASKFCLNIAGDTPSSNRLFDSIVSHCVPIIISDEIELPFEDVLDYSKFCIIVRGAD 376

Query: 324 VPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
             K   L  ++  I  E   R    L    ++R   +  P+Q  DA   I   +ARK P
Sbjct: 377 AVKKGFLMNLINGISREDWTRMWNRLKE--VERHFEYQYPSQNDDAVQMIWKAIARKAP 433


>gi|308491504|ref|XP_003107943.1| hypothetical protein CRE_12808 [Caenorhabditis remanei]
 gi|308249890|gb|EFO93842.1| hypothetical protein CRE_12808 [Caenorhabditis remanei]
          Length = 847

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 38/154 (24%), Positives = 66/154 (42%), Gaps = 14/154 (9%)

Query: 194 PYAPPQKMQAHQIPPDTPRSIFVYFRG-------------LFYDVNNDPEGGYYARGARA 240
           PY    +   H  P +  R     F+G             L + ++N  +          
Sbjct: 148 PYQIESQRALHNEPKEEKRRYLASFKGKRYVYGIGSGTRNLVHHLHNGDDIVMVTTCKHN 207

Query: 241 AVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIA 299
             W+ ++++     + ++    Y+D+   + FCL P G    S R +E +  GCIPV+I+
Sbjct: 208 NDWQVYQDDRCQRDNEEYDRWEYDDLLSNSTFCLVPRGRRLGSFRFLETLRSGCIPVVIS 267

Query: 300 DDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTS 333
           D  +LPF++ I W    + VAE D   +  +L S
Sbjct: 268 DSWILPFSETIDWHSAAIVVAERDALSIPELLMS 301


>gi|115461230|ref|NP_001054215.1| Os04g0670600 [Oryza sativa Japonica Group]
 gi|113565786|dbj|BAF16129.1| Os04g0670600, partial [Oryza sativa Japonica Group]
          Length = 275

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 52/189 (27%), Positives = 85/189 (44%), Gaps = 23/189 (12%)

Query: 205 QIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYE 264
            +  +  R   +YF+G      +   GG      R  +W+   N P   +   +P     
Sbjct: 71  HLSENKDRPTLLYFKG----AKHRHRGGL----VREKLWDLMVNEPDVVMEEGYPNATGR 122

Query: 265 D-----MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV 319
           +     M+ + FCL P G  P S RL +AV   CIPVI++D+I LPF   I + E  +FV
Sbjct: 123 EQSIKGMRTSEFCLHPAGDTPTSCRLFDAVASLCIPVIVSDEIELPFEGMIDYTEFAIFV 182

Query: 320 AEEDVPK---LDTILTSIPPEVILRKQRLLAN--PSMKRAMLFP----QPAQPGDAFHQI 370
           +  +  +   L   L ++P +     +R +A+  P  +   ++P      AQ G A + I
Sbjct: 183 SVNNSMRPKWLTNYLRNVPRQQKDEFRRNMAHVQPIFEYDSIYPGRMASAAQDG-AVNHI 241

Query: 371 LNGLARKLP 379
              + +KLP
Sbjct: 242 WKKIHQKLP 250


>gi|443712570|gb|ELU05824.1| hypothetical protein CAPTEDRAFT_92231 [Capitella teleta]
          Length = 668

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 70/282 (24%), Positives = 118/282 (41%), Gaps = 46/282 (16%)

Query: 77  LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTE 136
           L  S   T NPEEA  F  P   T D       L  +  + M S IQ +    PYWN  +
Sbjct: 62  LRQSRYHTTNPEEACLF-VPAIDTLDRDK----LSAEYIQNMESKIQSL----PYWN--D 110

Query: 137 GADH--FFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPP 194
           G +H  F + P   G   HY E          +L +A++   + + N        I+ P 
Sbjct: 111 GLNHIIFNLYP---GTWPHYDETDLGFNTGKAMLAKASVSDMWFRPNF------DISFPL 161

Query: 195 YAPPQKMQAHQ--------IPPDTPRSIFVYFRGL-------------FYDVNNDPEGGY 233
           +    K +  +        +PP   RS  + F+G               Y ++ND +   
Sbjct: 162 FHKEHKFKGGEPGFLTENLVPP--LRSYTLSFKGKRYLTGIGSETRNSLYHIHNDDDIVM 219

Query: 234 YARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLGWAPWSPRLVEAVVFG 292
                    W++ K++     + ++    Y+ +   + FCL P G    S R +EA+   
Sbjct: 220 LTTCKHGKSWKDMKDDRCERDNAEYEKYDYKILLHNSTFCLVPRGRRLGSYRFLEALQAA 279

Query: 293 CIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSI 334
           CIPV ++++ VLPF++ I W +  ++  E  + ++ +I+ SI
Sbjct: 280 CIPVFLSNNWVLPFSEVIDWNQAAIWGDERLLLQIPSIVRSI 321


>gi|145355370|ref|XP_001421936.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144582175|gb|ABP00230.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 490

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 38/131 (29%), Positives = 60/131 (45%), Gaps = 13/131 (9%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y   M+ + FCL   G    SPRL+E+++FGC+PVI+ADD  LP +  + W    + + E
Sbjct: 350 YTAKMKNSRFCLYMRGTRVHSPRLIESMLFGCVPVILADDYELPLSWLVDWSAFSVMIPE 409

Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFP-----QPAQPGDAFHQILNGLAR 376
            D   +        P+ + R         M+  M+ P     +    GDAF     G+ R
Sbjct: 410 RDFQTI--------PDALERANSDWDAMHMRLQMVLPLFLYRRRPLVGDAFWATALGVER 461

Query: 377 KLPHDKSTYLK 387
           +L   ++   K
Sbjct: 462 QLRRRRAECTK 472


>gi|157110617|ref|XP_001651176.1| exostosin-2 [Aedes aegypti]
 gi|108878646|gb|EAT42871.1| AAEL005626-PA [Aedes aegypti]
          Length = 711

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 29/80 (36%), Positives = 48/80 (60%), Gaps = 2/80 (2%)

Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
           ++   FCL   G     P L++A+  GCIPVI+AD+++LPF+D + W+ I + + E ++ 
Sbjct: 307 LENGQFCLIARGVRLSQPTLMDALAAGCIPVIMADNLILPFSDILDWDLISIRIYENNLH 366

Query: 326 KLDTILTSIPPEVI--LRKQ 343
            + T L ++  E I  LR Q
Sbjct: 367 SVITTLKAVSKERIQELRAQ 386


>gi|123504861|ref|XP_001328850.1| Exostosin family protein [Trichomonas vaginalis G3]
 gi|121911798|gb|EAY16627.1| Exostosin family protein [Trichomonas vaginalis G3]
          Length = 325

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 72/307 (23%), Positives = 123/307 (40%), Gaps = 33/307 (10%)

Query: 73  MHRFLLSSPVRTLNPEEADWFYTPIYP-TCDLTPTGLPLPFKSPRMMRSAIQLISSNWPY 131
           +H  LL+SP+ T N  +AD FY P Y   C    +G   P +    +   ++       Y
Sbjct: 6   VHEQLLASPILTNNTNDADLFYIPHYSRMC----SGFTPPEERWEELPDYLEKYGH---Y 58

Query: 132 WNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSIT 191
           + R    DHF +  H          + AI+    P++    L   + +            
Sbjct: 59  FTRYSTVDHFMM--HSVPNYGDKPADIAIDDSRQPII--GVLDFKWSEMIKSPWTHAKSQ 114

Query: 192 IPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPL 251
           I P+   +     +I P   R I V+   +    N+  +     R     +++  KN+  
Sbjct: 115 ILPFITLKS----KINPKAKRKIPVF---VAMSTNHLAKNSANLRKNLTEIFKKIKNSEF 167

Query: 252 FDISTDHPTTYYE-------DMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL 304
             IS   P +  +        M  + FC+ P G AP S RL +A+   CIP+I+AD + L
Sbjct: 168 IKISRTSPKSVRDILAVLPTKMGSSDFCIIPPGDAPTSKRLYDAISHLCIPIIVADYMTL 227

Query: 305 PF-ADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAML---FPQP 360
           PF   +I + E  + +  +D+ K+  ++ +     I   ++ L    + R M    +  P
Sbjct: 228 PFDGTSINYTECVIQIPSKDIEKIPDLVNNFDKNKIKEMRKKL---EIVREMFIWDYKNP 284

Query: 361 AQPGDAF 367
              G AF
Sbjct: 285 PNAGQAF 291


>gi|359493064|ref|XP_002270238.2| PREDICTED: probable glucuronosyltransferase Os01g0926700-like
           [Vitis vinifera]
          Length = 483

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 66/264 (25%), Positives = 102/264 (38%), Gaps = 38/264 (14%)

Query: 83  RTLNPEEADWFYTPIYPT--CDLTPTGLPLPFKSPR------MMRSAIQLISSNWPYWNR 134
           R  +  EAD  + P + T   ++   G    F+           R  ++ +      W R
Sbjct: 146 RVFDVNEADVVFVPFFATISAEIQLGGGKGVFRKKEGNEDYERQRQVMEFVRGT-EAWKR 204

Query: 135 TEGADHFFVVPHDFGACFHYQEEKAIE-------RGILPLLQRATLVQTFGQRNHVCLNE 187
           + G DH FV+  D  A +H + E A          G   L  +A+         H  ++ 
Sbjct: 205 SGGRDHVFVLT-DPVAMWHVKAEIAPAILLVVDFGGWYKLDSKASNNSLSEMIQHTQVSL 263

Query: 188 GSITIPPYA---PPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWE 244
               I PY    P   +  +QI     R   +YF+G      +   GG      R  +W+
Sbjct: 264 LKDVIVPYTHLLPRLHLSENQI-----RQTLLYFKG----AKHRHRGGL----VREKLWD 310

Query: 245 NFKNNPLFDISTDHPTTYYED-----MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIA 299
                    +    P     +     M+ + FCL P G  P S RL +A+   CIPVI++
Sbjct: 311 LLVYEQGVIMEEGFPNATGREQSIKGMRTSEFCLHPAGDTPTSCRLFDAIQSLCIPVIVS 370

Query: 300 DDIVLPFADAIPWEEIGMFVAEED 323
           D+I LPF   + + E  +FVA  D
Sbjct: 371 DNIELPFEGMVDYSEFSVFVAVRD 394


>gi|440798055|gb|ELR19126.1| hypothetical protein ACA1_335890 [Acanthamoeba castellanii str.
           Neff]
          Length = 350

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 50/98 (51%)

Query: 251 LFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAI 310
           L D S      Y++ M    F L   G    S RL EA+    +PVI+AD+ VLPF++A+
Sbjct: 221 LDDSSRGDSGDYHDLMLNTKFALIVQGNGLHSYRLTEAMRANAVPVILADNYVLPFSEAV 280

Query: 311 PWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLAN 348
            W+EI +FV E     +  ++  I  E + R +  LA 
Sbjct: 281 RWDEIAIFVPESQWASIPDVIGRIDDEALARMREKLAT 318


>gi|291190070|ref|NP_001167078.1| Exostosin-2 [Salmo salar]
 gi|223647990|gb|ACN10753.1| Exostosin-2 [Salmo salar]
          Length = 724

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 47/83 (56%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q + FC+   G       L + +  GC+PVI+AD  +LPF++ + W+   + + E
Sbjct: 314 YPQILQESSFCVVLRGARLGQATLSDVLQAGCVPVILADSYILPFSEVLDWKRASVVIPE 373

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +P++ TIL SIP   +   QR
Sbjct: 374 EKLPEMYTILKSIPHRQVEEMQR 396


>gi|303271929|ref|XP_003055326.1| glycosyltransferase family 47 protein [Micromonas pusilla CCMP1545]
 gi|226463300|gb|EEH60578.1| glycosyltransferase family 47 protein [Micromonas pusilla CCMP1545]
          Length = 595

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 38/142 (26%), Positives = 71/142 (50%), Gaps = 11/142 (7%)

Query: 248 NNPLFDISTDHPTT---YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL 304
           N   +D+ +D  TT   Y + + R+ FCL   G   ++PRLVEA++FGC+PVIIAD   L
Sbjct: 450 NRTDWDLDSDGATTPSGYMKLLARSKFCLHVRGTRVYAPRLVEAMLFGCVPVIIADGYDL 509

Query: 305 PFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAM---LFPQPA 361
           P +  + W+   + + E      + +  +   E++        + +++R +   ++  P 
Sbjct: 510 PLSWFLDWDAFSVRMTER-----EGVNATRAAEIVDAADWREKHEALRRVVGFFMYHDPP 564

Query: 362 QPGDAFHQILNGLARKLPHDKS 383
             GDA      G+ R++   ++
Sbjct: 565 VFGDALWATAAGIERQISRGRA 586


>gi|56606002|ref|NP_001008400.1| exostosin-2 [Danio rerio]
 gi|55247892|gb|AAV48783.1| exostosin-2 [Danio rerio]
          Length = 719

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 47/83 (56%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q + FC+   G       L + +  GC+PVI+AD  +LPF++ + W+   + + E
Sbjct: 309 YPQILQESSFCVVLRGARLGQATLSDVLQAGCVPVIMADSYILPFSEVLDWKRASVVIPE 368

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +P++ TIL SIP   +   QR
Sbjct: 369 EKLPEMYTILKSIPHRQVEEMQR 391


>gi|383847805|ref|XP_003699543.1| PREDICTED: exostosin-2-like [Megachile rotundata]
          Length = 707

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 44/83 (53%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FCL   G       L+E +  G IPVIIAD + +PF D I W    +F+ E
Sbjct: 300 YPDVLQTATFCLVIRGARLGQSTLLECMATGSIPVIIADSLAMPFHDVIDWTRAAIFIRE 359

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
            D+  + ++L  + P+ I   Q+
Sbjct: 360 VDILSVISVLKKVSPKRITELQK 382


>gi|341881773|gb|EGT37708.1| CBN-RIB-1 protein [Caenorhabditis brenneri]
          Length = 383

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 29/74 (39%), Positives = 41/74 (55%), Gaps = 1/74 (1%)

Query: 263 YEDM-QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           YED+   + FCL P G    S R +E +  GCIPV+I+D  +LPF + I W    + VAE
Sbjct: 264 YEDLLTNSTFCLVPRGRRLGSFRFLETLRSGCIPVVISDSWILPFTETIDWSSAAIVVAE 323

Query: 322 EDVPKLDTILTSIP 335
            D   +  +L S+ 
Sbjct: 324 RDALSIPELLMSMS 337


>gi|297688867|ref|XP_002821894.1| PREDICTED: exostosin-2 isoform 2 [Pongo abelii]
          Length = 728

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 76/329 (23%), Positives = 127/329 (38%), Gaps = 52/329 (15%)

Query: 35  DPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFY 94
           +P  K+KVY+Y L  KY     +     ++  +  E+ M               ++D++ 
Sbjct: 99  NPKNKIKVYIYAL-KKYVDDFGVSVSNTISREYN-ELLMA------------ISDSDYYT 144

Query: 95  TPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVV-----PHDFG 149
             I   C   P+   L   + R+  +A  L  +    W+R  G++H         P D+ 
Sbjct: 145 DDINRACLFVPSIDVLNQNTLRIKETAQAL--AQLSRWDR--GSNHLLFNMLPGGPPDYN 200

Query: 150 ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG-SITIPPYAPPQKMQAHQIPP 208
                  ++A+  G              G  +     +G  ++IP Y+P   + A    P
Sbjct: 201 TALDVPRDRALLAG--------------GGFSTWTYRQGYDVSIPVYSP---LSAEVDLP 243

Query: 209 DT---PRSIFVYFR--GLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPT--- 260
           +    PR  F+     GL  +   D E      G    V +   N     +S        
Sbjct: 244 EKGPGPRQYFLLSSQVGLHPEYREDLEALQVKHGESVLVLDKCTNLSEGVLSVRKRCHKH 303

Query: 261 ---TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGM 317
               Y + +Q A FC+   G       L + +  GC+PV+IAD  +LPF++ + W+   +
Sbjct: 304 QVFDYPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASV 363

Query: 318 FVAEEDVPKLDTILTSIPPEVILRKQRLL 346
            V EE +  + +IL SIP   I   QR L
Sbjct: 364 VVPEEKMSDVYSILQSIPQRQIEEMQRQL 392


>gi|198429575|ref|XP_002120379.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
          Length = 482

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 65/276 (23%), Positives = 108/276 (39%), Gaps = 39/276 (14%)

Query: 12  FVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEI 71
           ++   N   +++ + S  D  E   V  LKVY+YE       K+           +A E 
Sbjct: 202 YIKQPNYKVLKKTAQSPNDYRE--MVNNLKVYIYE------TKIGTDHHPHRVGGYAVER 253

Query: 72  FMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLT--PTGLPLPFKSPRMMRSAIQLISSNW 129
                L  S  RT +P  A +F+ PI  +  +   PT      ++ R+  + +  I + +
Sbjct: 254 VFQELLEKSNFRTQHPNLATFFFIPIRCSSYILDYPTEHEGLMEAKRVTANILHEIQTQY 313

Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGS 189
           PYW+++ GA+HF++  HD GA         +  G++       LV T    +   +    
Sbjct: 314 PYWSQSSGANHFYICSHDVGA--------KVAEGLMK--NAIGLVSTADYDDPYFIPHKD 363

Query: 190 ITIPPYAPPQKMQAHQIPPDTP------RSIFVYFRGLFYDVNNDPEGGYYARGARAAVW 243
           I+IPP         H I           R+I  +F G       D   G      R   W
Sbjct: 364 ISIPPTPSSGLSNIHLIGKGGALVDVRGRNILAFFAG-------DITSG----RIRPLAW 412

Query: 244 ENFKNNPLFDISTD--HPTTYYEDMQRAIFCLCPLG 277
             + ++   +I      P+ Y E +++A FCL   G
Sbjct: 413 RTWYSDQDIEIINRILKPSAYIEKLKKAKFCLIFRG 448


>gi|198457239|ref|XP_001360599.2| GA10084 [Drosophila pseudoobscura pseudoobscura]
 gi|198135910|gb|EAL25174.2| GA10084 [Drosophila pseudoobscura pseudoobscura]
          Length = 765

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 57/104 (54%), Gaps = 3/104 (2%)

Query: 243 WENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 301
           W   ++N   + + ++    YE + Q + FCL P G    S R +EA+  GCIPV++++ 
Sbjct: 305 WRELQDNRCDEDNREYDRYDYETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNA 364

Query: 302 IVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVI--LRKQ 343
            VLPF   I W++  ++  E  + ++  I+ SI  E I  LR+Q
Sbjct: 365 WVLPFESKIDWKQAAIWADERLLLQVPDIVRSISAERIFALRQQ 408


>gi|195381971|ref|XP_002049706.1| GJ20607 [Drosophila virilis]
 gi|194144503|gb|EDW60899.1| GJ20607 [Drosophila virilis]
          Length = 757

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 41/147 (27%), Positives = 69/147 (46%), Gaps = 11/147 (7%)

Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
           K   H I  +T  S+F         ++N  +            W   ++N   + + ++ 
Sbjct: 262 KRYVHGIGSETRNSLF--------HLHNGRDMVMVTTCRHGKSWRELQDNRCDEDNREYD 313

Query: 260 TTYYEDM-QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
              YE + Q + FCL P G    S R +EA+  GCIPV++++  VLPF   I W++  ++
Sbjct: 314 RYDYETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIW 373

Query: 319 VAEEDVPKLDTILTSIPPEVI--LRKQ 343
             E  + ++  I+ SI  E I  LR+Q
Sbjct: 374 ADERLLLQVPDIVRSISAERIFALRQQ 400


>gi|260825668|ref|XP_002607788.1| hypothetical protein BRAFLDRAFT_64151 [Branchiostoma floridae]
 gi|229293137|gb|EEN63798.1| hypothetical protein BRAFLDRAFT_64151 [Branchiostoma floridae]
          Length = 733

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 29/88 (32%), Positives = 46/88 (52%)

Query: 257 DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG 316
           D    Y + +Q A FC+   G       L +A++ GCIPVI  D  V+PF++ + W+   
Sbjct: 320 DRVYNYPDILQEATFCMILRGARMGQSALSDAMMAGCIPVIAIDTYVMPFSEVLDWKRAA 379

Query: 317 MFVAEEDVPKLDTILTSIPPEVILRKQR 344
           + + EED+P +  +L  I  E I   +R
Sbjct: 380 VILREEDLPDVHNVLRRISQERITNMRR 407


>gi|195431447|ref|XP_002063753.1| GK15839 [Drosophila willistoni]
 gi|194159838|gb|EDW74739.1| GK15839 [Drosophila willistoni]
          Length = 776

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 41/147 (27%), Positives = 69/147 (46%), Gaps = 11/147 (7%)

Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
           K   H I  +T  S+F         ++N  +            W   ++N   + + ++ 
Sbjct: 273 KRYVHGIGSETRNSLF--------HLHNGRDMVLVTTCRHGKSWRELQDNRCDEDNREYD 324

Query: 260 TTYYEDM-QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
              YE + Q + FCL P G    S R +EA+  GCIPV++++  VLPF   I W++  ++
Sbjct: 325 RYDYETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIW 384

Query: 319 VAEEDVPKLDTILTSIPPEVI--LRKQ 343
             E  + ++  I+ SI  E I  LR+Q
Sbjct: 385 ADERLLLQVPDIVRSISAERIFALRQQ 411


>gi|347963135|ref|XP_311070.5| AGAP000081-PA [Anopheles gambiae str. PEST]
 gi|333467342|gb|EAA06302.6| AGAP000081-PA [Anopheles gambiae str. PEST]
          Length = 761

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 41/135 (30%), Positives = 62/135 (45%), Gaps = 14/135 (10%)

Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
           +    FCL   G     P L+EA+  GCIPV++AD+ VLPFAD + WE + + + E ++ 
Sbjct: 357 LTTGTFCLVARGVRLGQPALLEAMAAGCIPVVMADNYVLPFADLLDWELLAVRLPEANLH 416

Query: 326 KLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLA-RKLPHDKST 384
            +  +L +I  E +   Q      +  R++     A        +L  L  R  PH   T
Sbjct: 417 TIVPVLRAISAERVAEMQ------AQIRSVYRRYFASLDRIVLTVLEQLNDRIFPHRSRT 470

Query: 385 YLKPGGKFLNWTSGP 399
           Y       L+W  GP
Sbjct: 471 Y-------LHWNVGP 478


>gi|332210817|ref|XP_003254509.1| PREDICTED: exostosin-2 isoform 2 [Nomascus leucogenys]
          Length = 728

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 37/112 (33%), Positives = 56/112 (50%), Gaps = 13/112 (11%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC+   G       L + +  GC+PV+IAD  +LPF++ + W+   + V E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367

Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD---AFHQI 370
           E +  + +IL SIP   I   QR           LF +PA+  +   A HQI
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQR----------QLFMEPARRENWSAANHQI 409


>gi|145337133|ref|NP_176534.2| root hair specific 8 [Arabidopsis thaliana]
 gi|6633849|gb|AAF19708.1|AC008047_15 F2K11.17 [Arabidopsis thaliana]
 gi|332195980|gb|AEE34101.1| root hair specific 8 [Arabidopsis thaliana]
          Length = 664

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 70/299 (23%), Positives = 115/299 (38%), Gaps = 56/299 (18%)

Query: 42  VYVYELPSKYNKKLLLKDPRCL---------------------------THMFAAEIFMH 74
           VYVY+LPSK+NK LL +    +                           TH ++ E   H
Sbjct: 284 VYVYDLPSKFNKDLLRECSDMVPWADFCNYFKNDAFGELMESMGKGWFRTHQYSLEPIFH 343

Query: 75  RFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKS-PRMMRSAIQLISSNW---- 129
             +L  P R  N  +A  FY P Y   D+    L   FK+    ++  + +    W    
Sbjct: 344 SRILKHPCRVHNETQAKLFYVPFYGGMDV----LRWHFKNVSSDVKDVLPIEIVKWLGSK 399

Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGS 189
             W +  G DH FV+       F   ++ +    +L + +     +   +RN   +N+ +
Sbjct: 400 KSWRKNSGKDHVFVLGK-ISWDFRRVDKYSWGSSLLEMQEMKNPTKLLIERNPWEVNDIA 458

Query: 190 ITIPPYAPPQKMQAHQIPPD----TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWEN 245
           I  P Y  P+      I  +     PR   + F G     N  PE        R+ + + 
Sbjct: 459 IPHPTYFHPKTDTDIAIWQNKILGKPRRSLISFAGAARPGN--PE------SIRSILIDQ 510

Query: 246 FKNNP-------LFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVI 297
            +++P         D   D   +  E  + + FCL P G +P    + ++++ GCIPVI
Sbjct: 511 CRSSPNQCRFLNCTDGGCDKSESVIELFRDSEFCLQPPGDSPTRKSIFDSLILGCIPVI 569


>gi|195025057|ref|XP_001985992.1| GH20787 [Drosophila grimshawi]
 gi|193901992|gb|EDW00859.1| GH20787 [Drosophila grimshawi]
          Length = 754

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 41/147 (27%), Positives = 69/147 (46%), Gaps = 11/147 (7%)

Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
           K   H I  +T  S+F         ++N  +            W   ++N   + + ++ 
Sbjct: 260 KRYVHGIGSETRNSLF--------HLHNGRDMVMVTTCRHGKSWRELQDNRCDEDNREYD 311

Query: 260 TTYYEDM-QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
              YE + Q + FCL P G    S R +EA+  GCIPV++++  VLPF   I W++  ++
Sbjct: 312 RYDYETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIW 371

Query: 319 VAEEDVPKLDTILTSIPPEVI--LRKQ 343
             E  + ++  I+ SI  E I  LR+Q
Sbjct: 372 ADERLLLQVPDIVRSISAERIFALRQQ 398


>gi|2251238|gb|AAB62718.1| multiple exostoses type II protein EXT2.I [Homo sapiens]
          Length = 728

 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 31/92 (33%), Positives = 49/92 (53%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC+   G       L + +  GC+PV+IAD  +LPF++ + W+   + V E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367

Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKR 353
           E +  + +IL SIP   I   QR L    ++R
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQRQLFMEPVRR 399


>gi|414879654|tpg|DAA56785.1| TPA: hypothetical protein ZEAMMB73_800597 [Zea mays]
          Length = 265

 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 35/118 (29%), Positives = 52/118 (44%), Gaps = 12/118 (10%)

Query: 40  LKVYVYELPSKYNKKLLLKD-------PRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADW 92
           L++YVY        + LL+          CL   +  ++ +H+FLL S  RT N + A+ 
Sbjct: 53  LRIYVYAEDEIQGLRALLRGRDGTINAATCLKGQWGTQVKVHQFLLKSRFRTFNKDHANL 112

Query: 93  FYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGA 150
           F+ P Y  C      L     S + +      + S  PY+ R+ G DH FV P   GA
Sbjct: 113 FFVPSYVKCVRMTGAL-----SDKEINQTYVKVLSQMPYFRRSGGRDHIFVFPSGAGA 165


>gi|332836218|ref|XP_003313041.1| PREDICTED: exostosin-2 isoform 1 [Pan troglodytes]
          Length = 728

 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 76/329 (23%), Positives = 126/329 (38%), Gaps = 52/329 (15%)

Query: 35  DPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFY 94
           +P  K+KVY+Y L  KY     +     ++  +  E+ M               ++D++ 
Sbjct: 99  NPKNKIKVYIYAL-KKYVDDFGVSVSNAISREYN-ELLMA------------ISDSDYYT 144

Query: 95  TPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVV-----PHDFG 149
             I   C   P+   L   + R+  +A  L  +    W+R  G +H         P D+ 
Sbjct: 145 DDINRACLFVPSIDVLNQNTLRIKETAQAL--AQLSRWDR--GTNHLLFNMLPGGPPDYN 200

Query: 150 ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG-SITIPPYAPPQKMQAHQIPP 208
                  ++A+  G              G  +     +G  ++IP Y+P   + A    P
Sbjct: 201 TALDVPRDRALLAG--------------GGFSTWTYRQGYDVSIPVYSP---LSAEVDLP 243

Query: 209 DT---PRSIFVYFR--GLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPT--- 260
           +    PR  F+     GL  +   D E      G    V +   N     +S        
Sbjct: 244 EKGPGPRQYFLLSSQVGLHPEYREDLEALQVKHGESVLVLDKCTNLSEGVLSVRKRCHKH 303

Query: 261 ---TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGM 317
               Y + +Q A FC+   G       L + +  GC+PV+IAD  +LPF++ + W+   +
Sbjct: 304 QVFDYPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASV 363

Query: 318 FVAEEDVPKLDTILTSIPPEVILRKQRLL 346
            V EE +  + +IL SIP   I   QR L
Sbjct: 364 VVPEEKMSDVYSILQSIPQRQIEEMQRQL 392


>gi|308491052|ref|XP_003107717.1| CRE-RIB-1 protein [Caenorhabditis remanei]
 gi|308249664|gb|EFO93616.1| CRE-RIB-1 protein [Caenorhabditis remanei]
          Length = 347

 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 28/72 (38%), Positives = 41/72 (56%), Gaps = 1/72 (1%)

Query: 263 YEDM-QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y+D+   + FCL P G    S R +E +  GCIPV+I+D  +LPF++ I W    + VAE
Sbjct: 229 YDDLLSNSTFCLVPRGRRLGSFRFLETLRSGCIPVVISDSWILPFSETIDWHSAAIVVAE 288

Query: 322 EDVPKLDTILTS 333
            D   +  +L S
Sbjct: 289 RDALSIPELLMS 300


>gi|414879653|tpg|DAA56784.1| TPA: hypothetical protein ZEAMMB73_800597 [Zea mays]
          Length = 238

 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 35/118 (29%), Positives = 52/118 (44%), Gaps = 12/118 (10%)

Query: 40  LKVYVYELPSKYNKKLLLKD-------PRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADW 92
           L++YVY        + LL+          CL   +  ++ +H+FLL S  RT N + A+ 
Sbjct: 53  LRIYVYAEDEIQGLRALLRGRDGTINAATCLKGQWGTQVKVHQFLLKSRFRTFNKDHANL 112

Query: 93  FYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGA 150
           F+ P Y  C      L     S + +      + S  PY+ R+ G DH FV P   GA
Sbjct: 113 FFVPSYVKCVRMTGAL-----SDKEINQTYVKVLSQMPYFRRSGGRDHIFVFPSGAGA 165


>gi|312378632|gb|EFR25154.1| hypothetical protein AND_09769 [Anopheles darlingi]
          Length = 732

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 45/78 (57%)

Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
           ++  +FCL   G     P L+EA+  GCIPVI+AD++VLPF++ + WE + + V E  + 
Sbjct: 327 LESGVFCLIARGVRLAQPVLLEAMATGCIPVIVADNLVLPFSNILDWELLSVRVYESQLH 386

Query: 326 KLDTILTSIPPEVILRKQ 343
            +  +L  +  + I   Q
Sbjct: 387 SVLALLKRVSDQRIRELQ 404


>gi|91076108|ref|XP_968944.1| PREDICTED: similar to tout-velu CG10117-PA [Tribolium castaneum]
 gi|270014706|gb|EFA11154.1| hypothetical protein TcasGA2_TC004758 [Tribolium castaneum]
          Length = 719

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 39/143 (27%), Positives = 65/143 (45%), Gaps = 9/143 (6%)

Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
           K   H I  DT        R   Y ++N  +            W++ K+      + ++ 
Sbjct: 238 KRYVHGIGSDT--------RNSLYHLHNRKDMIMVTTCRHGKSWKDMKDERCDQDNKEYD 289

Query: 260 TTYYEDM-QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
              YE + Q + FCL P G    S R +EA+  GCIPV++++   LPFA  I W +  ++
Sbjct: 290 KYDYEVLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNGWALPFAQKIDWSKAAIW 349

Query: 319 VAEEDVPKLDTILTSIPPEVILR 341
             E  + ++  I+ S+ P  IL+
Sbjct: 350 ADERLLLQVPYIVRSLAPAKILQ 372


>gi|440897880|gb|ELR49485.1| Exostosin-2, partial [Bos grunniens mutus]
          Length = 731

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 31/83 (37%), Positives = 45/83 (54%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC+   G       L + +  GC+PVIIAD  VLPF++ + W+   + V E
Sbjct: 321 YPQVLQEATFCMVLRGARLGQAVLSDVLRAGCVPVIIADSYVLPFSEVLDWKRASVVVPE 380

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +  + +IL SIP   I   QR
Sbjct: 381 EKMSDVYSILQSIPRRQIEEMQR 403


>gi|20138161|sp|O77783.1|EXT2_BOVIN RecName: Full=Exostosin-2; AltName:
           Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
           acetylglucosaminyl-proteoglycan
           4-alpha-N-acetylglucosaminyltransferase; AltName:
           Full=HS-polymerase; Short=HS-POL; AltName: Full=Multiple
           exostoses protein 2 homolog
 gi|3599568|gb|AAC35386.1| glucuronyl/N-acetylglucosaminyl transferase [Bos taurus]
          Length = 718

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 31/83 (37%), Positives = 45/83 (54%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC+   G       L + +  GC+PVIIAD  VLPF++ + W+   + V E
Sbjct: 308 YPQVLQEATFCMVLRGARLGQAVLSDVLRAGCVPVIIADSYVLPFSEVLDWKRASVVVPE 367

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +  + +IL SIP   I   QR
Sbjct: 368 EKMSDVYSILQSIPRRQIEEMQR 390


>gi|195150443|ref|XP_002016164.1| GL11446 [Drosophila persimilis]
 gi|194110011|gb|EDW32054.1| GL11446 [Drosophila persimilis]
          Length = 676

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 57/104 (54%), Gaps = 3/104 (2%)

Query: 243 WENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 301
           W   ++N   + + ++    YE + Q + FCL P G    S R +EA+  GCIPV++++ 
Sbjct: 305 WRELQDNRCDEDNREYDRYDYETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNA 364

Query: 302 IVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVI--LRKQ 343
            VLPF   I W++  ++  E  + ++  I+ SI  E I  LR+Q
Sbjct: 365 WVLPFESKIDWKQAAIWADERLLLQVPDIVRSISAERIFALRQQ 408


>gi|302780765|ref|XP_002972157.1| galactosyltransferase-like protein [Selaginella moellendorffii]
 gi|300160456|gb|EFJ27074.1| galactosyltransferase-like protein [Selaginella moellendorffii]
          Length = 470

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 76/319 (23%), Positives = 122/319 (38%), Gaps = 59/319 (18%)

Query: 64  THMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQ 123
           T  F+ EI  HR +L    RTL+P+ A  F+ P Y   D++         S  +     Q
Sbjct: 131 TDQFSGEIIFHRRMLDHRCRTLDPDGASAFFVPFYAGLDISRNLWASGKSSSDVDSLGEQ 190

Query: 124 L---ISSNWPYWNRTEGADHFFV---VPHDF-----------GACFHYQEEKAIERGILP 166
           L   +    P++NR+ GADHF V   +  DF            + FH  E ++++R ++ 
Sbjct: 191 LLHWLQRQHPHFNRSGGADHFLVAGRISWDFRRMPSAAGEWGSSLFHQIEMRSVKRLVI- 249

Query: 167 LLQRATLVQTFGQRNHVCLNEGSITIP----PYAPPQKMQAHQIPPDTPRSIFVYFRGLF 222
                       +RN    +E  +  P    P +     Q  +    +PR   V F G  
Sbjct: 250 ------------ERNPWDDSELGVPYPTSFHPSSDEDLAQWVEFVQGSPRPHLVAFAG-- 295

Query: 223 YDVNNDPEGGYYA----------RGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFC 272
                 P  GY +          R A+  +             T  P    +    ++FC
Sbjct: 296 -----SPRPGYRSDFRQVLLGQCRAAQRGISGCLDCTADTAGCTSDPLRVTKLFLSSVFC 350

Query: 273 LCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL-PFADAIPW--EEIGMFVAEEDVPK--- 326
           L P G +     L ++++ GCIPV+  +      +   +P   EE  +F+  + V     
Sbjct: 351 LQPRGDSFTRKSLFDSLISGCIPVLFWNQSAYWQYELYLPRDPEEYSVFIPHQSVKNGTN 410

Query: 327 -LDTILTSIPPEVILRKQR 344
            LD +L  +  E I R QR
Sbjct: 411 VLD-VLQGVSRERIGRMQR 428


>gi|117935049|ref|NP_803462.2| exostosin-2 [Bos taurus]
 gi|117306659|gb|AAI26571.1| Exostoses (multiple) 2 [Bos taurus]
 gi|296479641|tpg|DAA21756.1| TPA: exostosin-2 [Bos taurus]
          Length = 718

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 31/83 (37%), Positives = 45/83 (54%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC+   G       L + +  GC+PVIIAD  VLPF++ + W+   + V E
Sbjct: 308 YPQVLQEATFCMVLRGARLGQAVLSDVLRAGCVPVIIADSYVLPFSEVLDWKRASVVVPE 367

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +  + +IL SIP   I   QR
Sbjct: 368 EKMSDVYSILQSIPRRQIEEMQR 390


>gi|321468156|gb|EFX79142.1| hypothetical protein DAPPUDRAFT_304945 [Daphnia pulex]
          Length = 729

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 53/235 (22%), Positives = 100/235 (42%), Gaps = 23/235 (9%)

Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG----------Q 179
           P+WN   G  +  +     G    Y E+  ++ G   L + +  VQ +            
Sbjct: 160 PHWN---GGQNHIIFNFYSGTWPDYTEDLGMDIGRAILAKASISVQNYRPSFDISLPLVH 216

Query: 180 RNHVCLNEGSITIPPYAP--PQKMQAHQIPPDTPRSIF---VYFRGLFYDVNNDPEGGYY 234
           + H  L  G   +P YA   P   +++ +     R ++      R   Y ++N  +    
Sbjct: 217 KEH--LERGGDILPIYAENIPAASKSYLLAFKGKRYVYGIGSETRNSLYHLHNSRDVIMV 274

Query: 235 ARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLGWAPWSPRLVEAVVFGC 293
                   W+  K+    + + ++    YE +   + FCL P G    S R +E +  GC
Sbjct: 275 TTCKHGKSWKELKDERCEEDNAEYDRYDYEILLHNSTFCLVPRGRRLGSFRFIEVLQAGC 334

Query: 294 IPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVI--LRKQRLL 346
           IPV+++++ V+PF++ I W+   ++  E  + ++  I+ SI  E +  LR+Q  L
Sbjct: 335 IPVLLSNNWVIPFSEIIDWKTSAIWADERLLLQVPDIVRSIEAERVMALRQQSQL 389


>gi|322790909|gb|EFZ15575.1| hypothetical protein SINV_11283 [Solenopsis invicta]
          Length = 676

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 44/83 (53%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FCL   G       L++A+  GCIP IIAD +++PF D I W +  +F+ E
Sbjct: 269 YPDSLQTATFCLIIRGARLAQSVLLDAMAAGCIPAIIADSLMMPFHDVIDWTKAAVFIRE 328

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
            D+     +L  I  + I+  Q 
Sbjct: 329 VDILLTIQLLKKISHQRIMEMQE 351


>gi|426368047|ref|XP_004051026.1| PREDICTED: exostosin-2 isoform 2 [Gorilla gorilla gorilla]
          Length = 728

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 30/85 (35%), Positives = 46/85 (54%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC+   G       L + +  GC+PV+IAD  +LPF++ + W+   + V E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367

Query: 322 EDVPKLDTILTSIPPEVILRKQRLL 346
           E +  + +IL SIP   I   QR L
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQRQL 392


>gi|313235568|emb|CBY11023.1| unnamed protein product [Oikopleura dioica]
          Length = 663

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 30/87 (34%), Positives = 48/87 (55%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y   ++ + FCL P G    S R +E +   C+PV++ADD VLPF++ I WE   +   E
Sbjct: 232 YISLLKNSTFCLVPRGRRLGSFRFIETLQQACVPVLLADDWVLPFSEVIDWERSTISWEE 291

Query: 322 EDVPKLDTILTSIPPEVILRKQRLLAN 348
           + + +L   L  + P  +LR ++  AN
Sbjct: 292 KLLLELGQHLEDVSPADVLRMRQEGAN 318


>gi|296010875|ref|NP_001171554.1| exostosin-2 isoform 3 [Homo sapiens]
 gi|119588476|gb|EAW68070.1| exostoses (multiple) 2, isoform CRA_c [Homo sapiens]
          Length = 728

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 30/85 (35%), Positives = 46/85 (54%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC+   G       L + +  GC+PV+IAD  +LPF++ + W+   + V E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367

Query: 322 EDVPKLDTILTSIPPEVILRKQRLL 346
           E +  + +IL SIP   I   QR L
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQRQL 392


>gi|449681334|ref|XP_002163001.2| PREDICTED: exostosin-1a-like [Hydra magnipapillata]
          Length = 469

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 36/130 (27%), Positives = 66/130 (50%), Gaps = 2/130 (1%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPT-TYYEDMQRAIFCLCPLG 277
           R   Y ++N+ +            W+ F ++   + ++++    Y E +  + FCL P G
Sbjct: 140 RNSLYLIHNNEDIILLTTCIHEKNWQKFADSRCEEDNSNYDRFNYTELLANSTFCLIPRG 199

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIP-P 336
               S R +EA+ +GCIPVI+++   LPF D I W +  + + E  + +L +IL  I   
Sbjct: 200 RRLASFRFLEAIQYGCIPVIMSNGWDLPFNDVIDWVKFSIVLDESLLLQLPSILRGISFD 259

Query: 337 EVILRKQRLL 346
           +V+  KQ+ +
Sbjct: 260 QVLAMKQQTI 269


>gi|218198616|gb|EEC81043.1| hypothetical protein OsI_23835 [Oryza sativa Indica Group]
          Length = 250

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 24/67 (35%), Positives = 42/67 (62%)

Query: 271 FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTI 330
           FC+CP G    S R+ +++ +GC+PVI++D   LPF+  + W +  + + E DV +L +I
Sbjct: 166 FCVCPGGSQVNSARISDSIHYGCVPVILSDYYDLPFSGILNWRKFAVVLKESDVYELKSI 225

Query: 331 LTSIPPE 337
           L S+  +
Sbjct: 226 LKSLSQK 232


>gi|344281096|ref|XP_003412316.1| PREDICTED: LOW QUALITY PROTEIN: exostosin-2-like [Loxodonta
           africana]
          Length = 728

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 47/86 (54%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC+   G       L + +  GC+PV+IAD  +LPF++ + W+   + V E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367

Query: 322 EDVPKLDTILTSIPPEVILRKQRLLA 347
           E +  + +IL SIP   I   QR ++
Sbjct: 368 EKMSDMYSILQSIPQRQIEEMQRQVS 393


>gi|297688871|ref|XP_002821896.1| PREDICTED: exostosin-2 isoform 4 [Pongo abelii]
          Length = 718

 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 75/327 (22%), Positives = 126/327 (38%), Gaps = 52/327 (15%)

Query: 35  DPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFY 94
           +P  K+KVY+Y L  KY     +     ++  +  E+ M               ++D++ 
Sbjct: 99  NPKNKIKVYIYAL-KKYVDDFGVSVSNTISREYN-ELLMA------------ISDSDYYT 144

Query: 95  TPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVV-----PHDFG 149
             I   C   P+   L   + R+  +A  L  +    W+R  G++H         P D+ 
Sbjct: 145 DDINRACLFVPSIDVLNQNTLRIKETAQAL--AQLSRWDR--GSNHLLFNMLPGGPPDYN 200

Query: 150 ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG-SITIPPYAPPQKMQAHQIPP 208
                  ++A+  G              G  +     +G  ++IP Y+P   + A    P
Sbjct: 201 TALDVPRDRALLAG--------------GGFSTWTYRQGYDVSIPVYSP---LSAEVDLP 243

Query: 209 DT---PRSIFVYFR--GLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPT--- 260
           +    PR  F+     GL  +   D E      G    V +   N     +S        
Sbjct: 244 EKGPGPRQYFLLSSQVGLHPEYREDLEALQVKHGESVLVLDKCTNLSEGVLSVRKRCHKH 303

Query: 261 ---TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGM 317
               Y + +Q A FC+   G       L + +  GC+PV+IAD  +LPF++ + W+   +
Sbjct: 304 QVFDYPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASV 363

Query: 318 FVAEEDVPKLDTILTSIPPEVILRKQR 344
            V EE +  + +IL SIP   I   QR
Sbjct: 364 VVPEEKMSDVYSILQSIPQRQIEEMQR 390


>gi|395815565|ref|XP_003781296.1| PREDICTED: exostosin-2 [Otolemur garnettii]
          Length = 718

 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 30/83 (36%), Positives = 45/83 (54%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC+   G       L E +  GC+PV+IAD  +LPF++ + W+   + V E
Sbjct: 308 YPQVLQDATFCVVLRGARLGQAVLSEVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +  + +IL SIP   I   QR
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQR 390


>gi|17541994|ref|NP_502180.1| Protein RIB-1 [Caenorhabditis elegans]
 gi|6434267|emb|CAB61014.1| Protein RIB-1 [Caenorhabditis elegans]
 gi|130381603|dbj|BAF48989.1| heparan sulfate polymerase [Caenorhabditis elegans]
          Length = 382

 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 27/72 (37%), Positives = 40/72 (55%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y E +  + FCL P G    S R +E +  GC+PV+I+D  +LPF++ I W    + VAE
Sbjct: 262 YDELLANSTFCLVPRGRRLGSFRFLETLRSGCVPVVISDSWILPFSETIDWNSAAIVVAE 321

Query: 322 EDVPKLDTILTS 333
            D   +  +L S
Sbjct: 322 RDALSIPELLMS 333


>gi|3023724|sp|O01704.1|EXT1_CAEEL RecName: Full=Multiple exostoses homolog 1; AltName: Full=Related
           to mammalian RIB protein 1
 gi|2058697|gb|AAC47509.1| multiple exostoses homolog 1 [Caenorhabditis elegans]
          Length = 378

 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 27/72 (37%), Positives = 40/72 (55%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y E +  + FCL P G    S R +E +  GC+PV+I+D  +LPF++ I W    + VAE
Sbjct: 258 YDELLANSTFCLVPRGRRLGSFRFLETLRSGCVPVVISDSWILPFSETIDWNSAAIVVAE 317

Query: 322 EDVPKLDTILTS 333
            D   +  +L S
Sbjct: 318 RDALSIPELLMS 329


>gi|302832215|ref|XP_002947672.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
 gi|300267020|gb|EFJ51205.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
          Length = 790

 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 43/131 (32%), Positives = 66/131 (50%), Gaps = 12/131 (9%)

Query: 210 TPRSIFVYFRG-LFYDVNNDPEGGYYARGARAAV--------WENFKNNPLFDISTDHPT 260
           + R++F++FRG L      DPE   Y+R  R  +        W    N  L D ST H  
Sbjct: 502 SERNVFLFFRGDLRLAPGQDPEC-KYSRCIRQTLYNLSISERWREKYNVLLGDTSTVH-G 559

Query: 261 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
            Y   + +++FCL   G   WSPRL +AV+ GCIPVII D++   F   +      + + 
Sbjct: 560 DYSVLLSQSLFCLVAPGDG-WSPRLEDAVLHGCIPVIIMDEVQAVFESILDLPSFSVRIP 618

Query: 321 EEDVPKLDTIL 331
           + ++ ++ TIL
Sbjct: 619 QANMTQIVTIL 629


>gi|342320455|gb|EGU12395.1| Proteophosphoglycan ppg4 [Rhodotorula glutinis ATCC 204091]
          Length = 1322

 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 76/335 (22%), Positives = 124/335 (37%), Gaps = 38/335 (11%)

Query: 41   KVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFM--HRFLLSSPVRTLNPEEADWFYTPIY 98
            K +V +LP   +  L  K P C     +   FM  H  + +   R++  +EA +   P Y
Sbjct: 973  KGHVVQLPDTIDNHL--KSPACFDLERSPLPFMGDHYLVEAMKNRSVTLDEASFVLVPYY 1030

Query: 99   PTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVP--HDFGACFHYQE 156
              C           K    +  A   I++N              V+P  HDFG+C  + +
Sbjct: 1031 QGCYYNYLQENTFKKLADTVGFAETAIATNPAITG------DRIVIPFTHDFGSCTGWWQ 1084

Query: 157  EKAIERGILPL--LQRATLVQTFGQRNHVCLN-EGSITIPPYAPPQK--MQAHQIPPD-- 209
            +     G  P   + +A   Q  G  N  C+  +  + +P      K   +  + P D  
Sbjct: 1085 KLEDVLGHSPPSPMDQAVAWQVNGDYNTRCIKVDRDVVVPAVTKHTKALFETFKTPADVA 1144

Query: 210  --TPRSIFVYF----RGLFYDVNNDPEGGYYARGARAAV-WENFKNNPLFDISTDHPTTY 262
                R    +F    RG           G   +   +A+ ++ F               Y
Sbjct: 1145 PVNSRKHLAFFAGGVRGFGAIARTKIGCGRTGQDPNSAILYQQFSPG----------QRY 1194

Query: 263  YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
               +  + FCL P G   W  R  EA+  GCIP  I D  + PF D + +    + + E 
Sbjct: 1195 LGTLNASKFCLLPRGIPAWMTRTFEAIYAGCIPAFIVDRNLFPFQDILDYSRFSVTIPEA 1254

Query: 323  DVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLF 357
            D  +++ IL++  PE +   Q  L    ++ A LF
Sbjct: 1255 DAHRIEEILSAYTPEQLSELQANLVK--VREAFLF 1287


>gi|332836216|ref|XP_508383.3| PREDICTED: exostosin-2 isoform 3 [Pan troglodytes]
          Length = 718

 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 75/327 (22%), Positives = 125/327 (38%), Gaps = 52/327 (15%)

Query: 35  DPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFY 94
           +P  K+KVY+Y L  KY     +     ++  +  E+ M               ++D++ 
Sbjct: 99  NPKNKIKVYIYAL-KKYVDDFGVSVSNAISREYN-ELLMA------------ISDSDYYT 144

Query: 95  TPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVV-----PHDFG 149
             I   C   P+   L   + R+  +A  L  +    W+R  G +H         P D+ 
Sbjct: 145 DDINRACLFVPSIDVLNQNTLRIKETAQAL--AQLSRWDR--GTNHLLFNMLPGGPPDYN 200

Query: 150 ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG-SITIPPYAPPQKMQAHQIPP 208
                  ++A+  G              G  +     +G  ++IP Y+P   + A    P
Sbjct: 201 TALDVPRDRALLAG--------------GGFSTWTYRQGYDVSIPVYSP---LSAEVDLP 243

Query: 209 DT---PRSIFVYFR--GLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPT--- 260
           +    PR  F+     GL  +   D E      G    V +   N     +S        
Sbjct: 244 EKGPGPRQYFLLSSQVGLHPEYREDLEALQVKHGESVLVLDKCTNLSEGVLSVRKRCHKH 303

Query: 261 ---TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGM 317
               Y + +Q A FC+   G       L + +  GC+PV+IAD  +LPF++ + W+   +
Sbjct: 304 QVFDYPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASV 363

Query: 318 FVAEEDVPKLDTILTSIPPEVILRKQR 344
            V EE +  + +IL SIP   I   QR
Sbjct: 364 VVPEEKMSDVYSILQSIPQRQIEEMQR 390


>gi|395543711|ref|XP_003773757.1| PREDICTED: exostosin-2 [Sarcophilus harrisii]
          Length = 917

 Score = 58.2 bits (139), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 72/322 (22%), Positives = 131/322 (40%), Gaps = 48/322 (14%)

Query: 35  DPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFY 94
           +P  K+KVY+Y L  KY   + +     ++  +         LL++        ++D++ 
Sbjct: 99  NPKDKIKVYIYSL-KKYVDDVGVPVSNTISREYNE-------LLTAI------SDSDFYT 144

Query: 95  TPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPH---DFGAC 151
             I   C   P+   L  K+ R+  +A  L  +    W+R      F ++P    D+   
Sbjct: 145 DDINRACLFVPSIDVLNQKTLRIKETAQAL--AQLARWDRGTNQLLFNMLPGEPPDYNTA 202

Query: 152 FHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG-SITIPPYAPPQKMQAHQIPPDT 210
                ++A+  G              G  +     +G  ++IP Y+P   + A    P+ 
Sbjct: 203 LDVPRDRALLAG--------------GGFSTWTYRQGYDVSIPVYSP---LSAEVDLPER 245

Query: 211 PRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAI 270
              + +   GL         G   +R  R  VW   +   + +   D  +     ++ A 
Sbjct: 246 GPGLVIRALGL---------GNEGSRPRRDLVWGLMELRDVMESMLDVESEGL-GLREAS 295

Query: 271 FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTI 330
           FC+   G       L + +  GC+PVIIAD  +LPF++ + W+   + + EE +  + ++
Sbjct: 296 FCIVLRGARLGQAALSDVLQAGCVPVIIADSYILPFSEVLDWKRASVVIPEEKMRDMYSV 355

Query: 331 LTSIPPEVILRKQR-LLANPSM 351
           L SIP   I   QR + A PS+
Sbjct: 356 LRSIPQRQIEEMQRQVRAGPSV 377


>gi|395742863|ref|XP_003777829.1| PREDICTED: exostosin-2 [Pongo abelii]
          Length = 751

 Score = 58.2 bits (139), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 75/327 (22%), Positives = 126/327 (38%), Gaps = 52/327 (15%)

Query: 35  DPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFY 94
           +P  K+KVY+Y L  KY     +     ++  +  E+ M               ++D++ 
Sbjct: 132 NPKNKIKVYIYAL-KKYVDDFGVSVSNTISREYN-ELLMA------------ISDSDYYT 177

Query: 95  TPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVV-----PHDFG 149
             I   C   P+   L   + R+  +A  L  +    W+R  G++H         P D+ 
Sbjct: 178 DDINRACLFVPSIDVLNQNTLRIKETAQAL--AQLSRWDR--GSNHLLFNMLPGGPPDYN 233

Query: 150 ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG-SITIPPYAPPQKMQAHQIPP 208
                  ++A+  G              G  +     +G  ++IP Y+P   + A    P
Sbjct: 234 TALDVPRDRALLAG--------------GGFSTWTYRQGYDVSIPVYSP---LSAEVDLP 276

Query: 209 DT---PRSIFVYFR--GLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPT--- 260
           +    PR  F+     GL  +   D E      G    V +   N     +S        
Sbjct: 277 EKGPGPRQYFLLSSQVGLHPEYREDLEALQVKHGESVLVLDKCTNLSEGVLSVRKRCHKH 336

Query: 261 ---TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGM 317
               Y + +Q A FC+   G       L + +  GC+PV+IAD  +LPF++ + W+   +
Sbjct: 337 QVFDYPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASV 396

Query: 318 FVAEEDVPKLDTILTSIPPEVILRKQR 344
            V EE +  + +IL SIP   I   QR
Sbjct: 397 VVPEEKMSDVYSILQSIPQRQIEEMQR 423


>gi|297688865|ref|XP_002821893.1| PREDICTED: exostosin-2 isoform 1 [Pongo abelii]
          Length = 731

 Score = 58.2 bits (139), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 75/327 (22%), Positives = 126/327 (38%), Gaps = 52/327 (15%)

Query: 35  DPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFY 94
           +P  K+KVY+Y L  KY     +     ++  +  E+ M               ++D++ 
Sbjct: 112 NPKNKIKVYIYAL-KKYVDDFGVSVSNTISREYN-ELLMA------------ISDSDYYT 157

Query: 95  TPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVV-----PHDFG 149
             I   C   P+   L   + R+  +A  L  +    W+R  G++H         P D+ 
Sbjct: 158 DDINRACLFVPSIDVLNQNTLRIKETAQAL--AQLSRWDR--GSNHLLFNMLPGGPPDYN 213

Query: 150 ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG-SITIPPYAPPQKMQAHQIPP 208
                  ++A+  G              G  +     +G  ++IP Y+P   + A    P
Sbjct: 214 TALDVPRDRALLAG--------------GGFSTWTYRQGYDVSIPVYSP---LSAEVDLP 256

Query: 209 DT---PRSIFVYFR--GLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPT--- 260
           +    PR  F+     GL  +   D E      G    V +   N     +S        
Sbjct: 257 EKGPGPRQYFLLSSQVGLHPEYREDLEALQVKHGESVLVLDKCTNLSEGVLSVRKRCHKH 316

Query: 261 ---TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGM 317
               Y + +Q A FC+   G       L + +  GC+PV+IAD  +LPF++ + W+   +
Sbjct: 317 QVFDYPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASV 376

Query: 318 FVAEEDVPKLDTILTSIPPEVILRKQR 344
            V EE +  + +IL SIP   I   QR
Sbjct: 377 VVPEEKMSDVYSILQSIPQRQIEEMQR 403


>gi|332836220|ref|XP_003313042.1| PREDICTED: exostosin-2 isoform 2 [Pan troglodytes]
          Length = 751

 Score = 58.2 bits (139), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 75/327 (22%), Positives = 125/327 (38%), Gaps = 52/327 (15%)

Query: 35  DPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFY 94
           +P  K+KVY+Y L  KY     +     ++  +  E+ M               ++D++ 
Sbjct: 132 NPKNKIKVYIYAL-KKYVDDFGVSVSNAISREYN-ELLMA------------ISDSDYYT 177

Query: 95  TPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVV-----PHDFG 149
             I   C   P+   L   + R+  +A  L  +    W+R  G +H         P D+ 
Sbjct: 178 DDINRACLFVPSIDVLNQNTLRIKETAQAL--AQLSRWDR--GTNHLLFNMLPGGPPDYN 233

Query: 150 ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG-SITIPPYAPPQKMQAHQIPP 208
                  ++A+  G              G  +     +G  ++IP Y+P   + A    P
Sbjct: 234 TALDVPRDRALLAG--------------GGFSTWTYRQGYDVSIPVYSP---LSAEVDLP 276

Query: 209 DT---PRSIFVYFR--GLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPT--- 260
           +    PR  F+     GL  +   D E      G    V +   N     +S        
Sbjct: 277 EKGPGPRQYFLLSSQVGLHPEYREDLEALQVKHGESVLVLDKCTNLSEGVLSVRKRCHKH 336

Query: 261 ---TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGM 317
               Y + +Q A FC+   G       L + +  GC+PV+IAD  +LPF++ + W+   +
Sbjct: 337 QVFDYPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASV 396

Query: 318 FVAEEDVPKLDTILTSIPPEVILRKQR 344
            V EE +  + +IL SIP   I   QR
Sbjct: 397 VVPEEKMSDVYSILQSIPQRQIEEMQR 423


>gi|410973605|ref|XP_003993238.1| PREDICTED: exostosin-2 [Felis catus]
          Length = 718

 Score = 58.2 bits (139), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 46/83 (55%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC+   G       L + +  GC+PV+IAD  VLPF++ + W+   + V E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLRAGCVPVVIADSYVLPFSEVLDWKRASVVVPE 367

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E + ++ +IL S+P   I   QR
Sbjct: 368 EKMSEVYSILQSVPQRQIEEMQR 390


>gi|322795454|gb|EFZ18199.1| hypothetical protein SINV_05251 [Solenopsis invicta]
          Length = 711

 Score = 58.2 bits (139), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 35/124 (28%), Positives = 58/124 (46%), Gaps = 1/124 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y ++N  +  +         W   ++      + ++ T  YE +   A FCL P G
Sbjct: 254 RNALYHLHNGKDLVFVTTCRHGKAWRELQDEHCQQDNQEYDTYDYEILLMNATFCLVPRG 313

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+  GCIPVI+++   LPF + I W +  +F  E  + ++  IL S+   
Sbjct: 314 RRLGSFRFLEALRSGCIPVILSNGWALPFHERIDWTQAVIFSDERLLLQIPDILRSVSNV 373

Query: 338 VILR 341
            IL+
Sbjct: 374 QILK 377


>gi|397473541|ref|XP_003808267.1| PREDICTED: exostosin-2 [Pan paniscus]
 gi|410305316|gb|JAA31258.1| exostosin 2 [Pan troglodytes]
          Length = 751

 Score = 58.2 bits (139), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 75/327 (22%), Positives = 125/327 (38%), Gaps = 52/327 (15%)

Query: 35  DPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFY 94
           +P  K+KVY+Y L  KY     +     ++  +  E+ M               ++D++ 
Sbjct: 132 NPKNKIKVYIYAL-KKYVDDFGVSVSNAISREYN-ELLMA------------ISDSDYYT 177

Query: 95  TPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVV-----PHDFG 149
             I   C   P+   L   + R+  +A  L  +    W+R  G +H         P D+ 
Sbjct: 178 DDINRACLFVPSIDVLNQNTLRIKETAQAL--AQLSRWDR--GTNHLLFNMLPGGPPDYN 233

Query: 150 ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG-SITIPPYAPPQKMQAHQIPP 208
                  ++A+  G              G  +     +G  ++IP Y+P   + A    P
Sbjct: 234 TALDVPRDRALLAG--------------GGFSTWTYRQGYDVSIPVYSP---LSAEVDLP 276

Query: 209 DT---PRSIFVYFR--GLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPT--- 260
           +    PR  F+     GL  +   D E      G    V +   N     +S        
Sbjct: 277 EKGPGPRQYFLLSSQVGLHPEYREDLEALQVKHGESVLVLDKCTNLSEGVLSVRKRCHKH 336

Query: 261 ---TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGM 317
               Y + +Q A FC+   G       L + +  GC+PV+IAD  +LPF++ + W+   +
Sbjct: 337 QVFDYPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASV 396

Query: 318 FVAEEDVPKLDTILTSIPPEVILRKQR 344
            V EE +  + +IL SIP   I   QR
Sbjct: 397 VVPEEKMSDVYSILQSIPQRQIEEMQR 423


>gi|296088686|emb|CBI38136.3| unnamed protein product [Vitis vinifera]
          Length = 291

 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 57/207 (27%), Positives = 84/207 (40%), Gaps = 29/207 (14%)

Query: 132 WNRTEGADHFFVVPHDFGACFHYQEEKAIE-------RGILPLLQRATLVQTFGQRNHVC 184
           W R+ G DH FV+  D  A +H + E A          G   L  +A+         H  
Sbjct: 10  WKRSGGRDHVFVLT-DPVAMWHVKAEIAPAILLVVDFGGWYKLDSKASNNSLSEMIQHTQ 68

Query: 185 LNEGSITIPPYA---PPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAA 241
           ++     I PY    P   +  +QI     R   +YF+G      +   GG      R  
Sbjct: 69  VSLLKDVIVPYTHLLPRLHLSENQI-----RQTLLYFKG----AKHRHRGGL----VREK 115

Query: 242 VWENFKNNPLFDISTDHPTTYYED-----MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPV 296
           +W+         +    P     +     M+ + FCL P G  P S RL +A+   CIPV
Sbjct: 116 LWDLLVYEQGVIMEEGFPNATGREQSIKGMRTSEFCLHPAGDTPTSCRLFDAIQSLCIPV 175

Query: 297 IIADDIVLPFADAIPWEEIGMFVAEED 323
           I++D+I LPF   + + E  +FVA  D
Sbjct: 176 IVSDNIELPFEGMVDYSEFSVFVAVRD 202


>gi|302791425|ref|XP_002977479.1| galactosyltransferase-like protein [Selaginella moellendorffii]
 gi|300154849|gb|EFJ21483.1| galactosyltransferase-like protein [Selaginella moellendorffii]
          Length = 466

 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 78/323 (24%), Positives = 122/323 (37%), Gaps = 67/323 (20%)

Query: 64  THMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQ 123
           T  F+ EI  HR +L    RTL+P+ A  F+ P Y   D++         S  +     Q
Sbjct: 131 TDQFSGEIIFHRRMLDHRCRTLDPDGASAFFVPFYAGLDISRNLWASGKSSSDVDSLGEQ 190

Query: 124 L---ISSNWPYWNRTEGADHFFV---VPHDF-----------GACFHYQEEKAIERGILP 166
           L   +    P++NR+ GADHF V   +  DF            + FH  E ++++R ++ 
Sbjct: 191 LLHWLQRQHPHFNRSGGADHFLVAGRISWDFRRMPSAAGEWGSSLFHQIEMRSVKRLVI- 249

Query: 167 LLQRATLVQTFGQRNHVCLNEGSITIP----PYAPPQKMQAHQIPPDTPRSIFVYFRGLF 222
                       +RN    +E  +  P    P +     Q  +    +PR   V F G  
Sbjct: 250 ------------ERNPWDDSELGVPYPTSFHPSSDEDLAQWVEFVQGSPRPHLVAFAG-- 295

Query: 223 YDVNNDPEGGYYA----------RGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFC 272
                 P  GY +          R A   +             T  P    +    ++FC
Sbjct: 296 -----SPRPGYRSDFRQVLLGQCRAAPRGISRCLDCTADTAGCTSDPLRVTKLFLSSVFC 350

Query: 273 LCPLGWAPWSPRLVEAVVFGCIPVIIAD-------DIVLPFADAIPWEEIGMFVAEEDVP 325
           L P G +     L ++++ GCIPV+  +       ++ LP       EE  +F+  + V 
Sbjct: 351 LQPRGDSFTRKSLFDSLISGCIPVLFWNQSAYWQYELYLPRDP----EEYSVFIPHQSVK 406

Query: 326 K----LDTILTSIPPEVILRKQR 344
                LD +L  I  E I R QR
Sbjct: 407 NGTNVLD-VLQGISRERIGRMQR 428


>gi|222629835|gb|EEE61967.1| hypothetical protein OsJ_16742 [Oryza sativa Japonica Group]
          Length = 196

 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 33/115 (28%), Positives = 56/115 (48%), Gaps = 11/115 (9%)

Query: 36  PVGKLKVYVYELPSKYNKKLLLKD-------PRCLTHMFAAEIFMHRFLLSSPVRTLNPE 88
           P   L++YVY        + LL+          C+   +  ++ +H+ LLSS  RT + +
Sbjct: 51  PHRDLRIYVYAEDEVDGLRALLRGRNDDVTAATCIKGQWGTQVKIHQLLLSSRFRTFDKD 110

Query: 89  EADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFV 143
           EAD F+ P Y  C +  TG         + ++ ++++ S  PY+ R+ G DH FV
Sbjct: 111 EADLFFVPTYVKC-VRMTG---KLNDKEINQTYVKVVLSQMPYFRRSGGRDHIFV 161


>gi|431915722|gb|ELK16055.1| Exostosin-2 [Pteropus alecto]
          Length = 1849

 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 45/83 (54%)

Query: 262  YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
            Y + +Q A FC+   G       L + +  GC+PV+IAD  +LPF++ + W+   + V E
Sbjct: 1439 YPQVLQEATFCVVLRGARLGQAVLSDVLRAGCVPVVIADSYILPFSEVLDWKRASVVVPE 1498

Query: 322  EDVPKLDTILTSIPPEVILRKQR 344
            E +  + +IL SIP   I   Q+
Sbjct: 1499 EKISDVYSILQSIPQRQIEEMQK 1521


>gi|47230478|emb|CAF99671.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 722

 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 47/83 (56%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q + FC+   G       L + +  GC+PVI+AD  +LPF++ + W+   +F+ E
Sbjct: 309 YPQILQESSFCVVLRGARLGQAVLSDVLQAGCVPVILADSYILPFSEVLDWKRASVFIPE 368

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E + ++ +IL SIP   +   QR
Sbjct: 369 EKLSEMYSILKSIPHRQVEEMQR 391


>gi|307176975|gb|EFN66281.1| Exostosin-2 [Camponotus floridanus]
          Length = 563

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 34/95 (35%), Positives = 47/95 (49%), Gaps = 10/95 (10%)

Query: 250 PLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 309
           P FDIS          +  A FCL   G       L++A+  GCIPVIIAD + +PF D 
Sbjct: 154 PGFDIS----------LPSATFCLIIRGARLAQSSLLDAMAAGCIPVIIADSLTMPFHDV 203

Query: 310 IPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQR 344
           I W +  +FV E D+  +  +L  I  + I+  Q 
Sbjct: 204 IDWTKAAVFVREVDILLIIQLLKKISHQRIMEMQE 238


>gi|348558689|ref|XP_003465149.1| PREDICTED: exostosin-2-like isoform 2 [Cavia porcellus]
          Length = 669

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 45/83 (54%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC+   G       L + +  GC+PVIIAD  +LPF++ + W+   + V E
Sbjct: 307 YPQVLQEATFCVVLRGARLGQAALSDVLQAGCVPVIIADSYILPFSEVLDWKRASVVVPE 366

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +  + +IL SIP   +   QR
Sbjct: 367 EKIADVYSILQSIPRRQMEEMQR 389


>gi|348558687|ref|XP_003465148.1| PREDICTED: exostosin-2-like isoform 1 [Cavia porcellus]
          Length = 717

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 45/83 (54%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC+   G       L + +  GC+PVIIAD  +LPF++ + W+   + V E
Sbjct: 307 YPQVLQEATFCVVLRGARLGQAALSDVLQAGCVPVIIADSYILPFSEVLDWKRASVVVPE 366

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +  + +IL SIP   +   QR
Sbjct: 367 EKIADVYSILQSIPRRQMEEMQR 389


>gi|196014406|ref|XP_002117062.1| hypothetical protein TRIADDRAFT_1026 [Trichoplax adhaerens]
 gi|190580284|gb|EDV20368.1| hypothetical protein TRIADDRAFT_1026 [Trichoplax adhaerens]
          Length = 657

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 28/79 (35%), Positives = 48/79 (60%), Gaps = 1/79 (1%)

Query: 263 YEDMQ-RAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y+D+   + FCL P G    S R +EA+ FGCIP+++++  VLPF++ I W++  + + E
Sbjct: 228 YQDLLINSTFCLVPRGRRLGSFRFLEALQFGCIPIVLSNGWVLPFSEVIDWKKACVQIDE 287

Query: 322 EDVPKLDTILTSIPPEVIL 340
             +  +  ++ SI  E IL
Sbjct: 288 RQLFDVPELIESISDEKIL 306


>gi|426245345|ref|XP_004016473.1| PREDICTED: exostosin-2 [Ovis aries]
          Length = 718

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 31/83 (37%), Positives = 45/83 (54%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC+   G       L + +  GC+PVIIAD  VLPF++ + W+   + V E
Sbjct: 308 YPQVLQEATFCMVLRGARLGQAVLSDVLRAGCVPVIIADSYVLPFSEVLDWKRASVVVPE 367

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +  + +IL SIP   I   QR
Sbjct: 368 EKMLDVYSILQSIPRRQIEEMQR 390


>gi|355752199|gb|EHH56319.1| Exostosin-2, partial [Macaca fascicularis]
          Length = 733

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 45/83 (54%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC+   G       L + +  GC+PV+IAD  +LPF++ + W+   + V E
Sbjct: 323 YPQVLQEATFCVVLRGARLGQAMLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 382

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +  + +IL SIP   I   QR
Sbjct: 383 EKMSDVYSILQSIPQRQIEEMQR 405


>gi|332210815|ref|XP_003254508.1| PREDICTED: exostosin-2 isoform 1 [Nomascus leucogenys]
          Length = 718

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 75/327 (22%), Positives = 125/327 (38%), Gaps = 52/327 (15%)

Query: 35  DPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFY 94
           +P  K+KVY+Y L  KY     +     ++  +  E+ M               ++D++ 
Sbjct: 99  NPKNKIKVYIYAL-KKYVDDFGVSVSNTISREYN-ELLMA------------ISDSDYYT 144

Query: 95  TPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVV-----PHDFG 149
             I   C   P+   L   + R+  +A  L  +    W+R  G +H         P D+ 
Sbjct: 145 DDINRACLFVPSIDVLNQNTLRIKETAQAL--AQLSRWDR--GMNHLLFNMLPGGPPDYN 200

Query: 150 ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG-SITIPPYAPPQKMQAHQIPP 208
                  ++A+  G              G  +     +G  ++IP Y+P   + A    P
Sbjct: 201 TALDVPRDRALLAG--------------GGFSTWTYRQGYDVSIPVYSP---LSAEVDLP 243

Query: 209 DT---PRSIFVYFR--GLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPT--- 260
           +    PR  F+     GL  +   D E      G    V +   N     +S        
Sbjct: 244 EKGPGPRQYFLLSSQVGLHPEYREDLEALQVKHGESVLVLDKCTNLSEGVLSVRKRCHKH 303

Query: 261 ---TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGM 317
               Y + +Q A FC+   G       L + +  GC+PV+IAD  +LPF++ + W+   +
Sbjct: 304 QVFDYPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASV 363

Query: 318 FVAEEDVPKLDTILTSIPPEVILRKQR 344
            V EE +  + +IL SIP   I   QR
Sbjct: 364 VVPEEKMSDVYSILQSIPQRQIEEMQR 390


>gi|224085069|ref|XP_002307477.1| predicted protein [Populus trichocarpa]
 gi|222856926|gb|EEE94473.1| predicted protein [Populus trichocarpa]
          Length = 460

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 40/133 (30%), Positives = 57/133 (42%), Gaps = 12/133 (9%)

Query: 40  LKVYVYELPSKYNKKLLLKDPR-------CLTHMFAAEIFMHRFLLSSPVRTLNPEEADW 92
           LK+YVYE       K LL+          C+   +  ++ +HR LL S  RT    EA+ 
Sbjct: 87  LKIYVYEEDEIDGLKELLRGREGKISADACVKGQWGTQVKIHRLLLQSRFRTRKKGEANL 146

Query: 93  FYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
           F+ P Y  C     GL        +  + ++ + S  PY+ R+ G DH FV P   GA  
Sbjct: 147 FFVPAYAKCVRMMGGL----NDKEINHTYVKAL-SQMPYFRRSGGRDHIFVFPSGAGAHL 201

Query: 153 HYQEEKAIERGIL 165
                  I R I+
Sbjct: 202 FRSWATYINRSII 214



 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 33/119 (27%), Positives = 58/119 (48%), Gaps = 5/119 (4%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y++ ++ A FC+ P G + W+ R  E+    C+PVI++D    PF + I + +I +    
Sbjct: 310 YFQHLRNAKFCVAPRGESSWTLRFYESFFVECVPVILSDQAEFPFQNVIDYTQISIKWPS 369

Query: 322 EDVP-KLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPA-QPGDAFHQILNGLARKL 378
             +  +L   L SIP E +   +R++A     R +    P  +   A   I+  L RK+
Sbjct: 370 TRIGLELLEYLESIPDEDV---ERMIAAGRQVRCLWVYAPELELCSAMQGIMWELQRKV 425


>gi|221041480|dbj|BAH12417.1| unnamed protein product [Homo sapiens]
          Length = 337

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 45/83 (54%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC+   G       L + +  GC+PV+IAD  +LPF++ + W+   + V E
Sbjct: 189 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 248

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +  + +IL SIP   I   QR
Sbjct: 249 EKMSDVYSILQSIPQRQIEEMQR 271


>gi|118482536|gb|ABK93189.1| unknown [Populus trichocarpa]
          Length = 230

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 40/133 (30%), Positives = 57/133 (42%), Gaps = 12/133 (9%)

Query: 40  LKVYVYELPSKYNKKLLLKDPR-------CLTHMFAAEIFMHRFLLSSPVRTLNPEEADW 92
           LK+YVYE       K LL+          C+   +  ++ +HR LL S  RT    EA+ 
Sbjct: 87  LKIYVYEEDEIDGLKELLRGREGRISADACVKGQWGTQVKIHRLLLQSRFRTRKKGEANL 146

Query: 93  FYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
           F+ P Y  C     GL        +  + ++ + S  PY+ R+ G DH FV P   GA  
Sbjct: 147 FFVPAYAKCVRMMGGL----NDKEINHTYVKAL-SQMPYFRRSGGRDHIFVFPSGAGAHL 201

Query: 153 HYQEEKAIERGIL 165
                  I R I+
Sbjct: 202 FRSWATYINRSII 214


>gi|46370069|ref|NP_997005.1| exostosin-2 isoform 2 [Homo sapiens]
 gi|3023739|sp|Q93063.1|EXT2_HUMAN RecName: Full=Exostosin-2; AltName:
           Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
           acetylglucosaminyl-proteoglycan
           4-alpha-N-acetylglucosaminyltransferase; AltName:
           Full=Multiple exostoses protein 2; AltName:
           Full=Putative tumor suppressor protein EXT2
 gi|1518042|gb|AAB07008.1| EXT2 [Homo sapiens]
 gi|1519605|gb|AAC51219.1| multiple exostosis 2 [Homo sapiens]
 gi|1621113|gb|AAC50764.1| hereditary multiple exostoses gene 2 protein [Homo sapiens]
 gi|14603196|gb|AAH10058.1| Exostoses (multiple) 2 [Homo sapiens]
 gi|119588474|gb|EAW68068.1| exostoses (multiple) 2, isoform CRA_b [Homo sapiens]
 gi|189065454|dbj|BAG35293.1| unnamed protein product [Homo sapiens]
 gi|325463523|gb|ADZ15532.1| exostoses (multiple) 2 [synthetic construct]
          Length = 718

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 45/83 (54%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC+   G       L + +  GC+PV+IAD  +LPF++ + W+   + V E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +  + +IL SIP   I   QR
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQR 390


>gi|332210819|ref|XP_003254510.1| PREDICTED: exostosin-2 isoform 3 [Nomascus leucogenys]
          Length = 751

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 75/327 (22%), Positives = 125/327 (38%), Gaps = 52/327 (15%)

Query: 35  DPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFY 94
           +P  K+KVY+Y L  KY     +     ++  +  E+ M               ++D++ 
Sbjct: 132 NPKNKIKVYIYAL-KKYVDDFGVSVSNTISREYN-ELLMA------------ISDSDYYT 177

Query: 95  TPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVV-----PHDFG 149
             I   C   P+   L   + R+  +A  L  +    W+R  G +H         P D+ 
Sbjct: 178 DDINRACLFVPSIDVLNQNTLRIKETAQAL--AQLSRWDR--GMNHLLFNMLPGGPPDYN 233

Query: 150 ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG-SITIPPYAPPQKMQAHQIPP 208
                  ++A+  G              G  +     +G  ++IP Y+P   + A    P
Sbjct: 234 TALDVPRDRALLAG--------------GGFSTWTYRQGYDVSIPVYSP---LSAEVDLP 276

Query: 209 DT---PRSIFVYFR--GLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPT--- 260
           +    PR  F+     GL  +   D E      G    V +   N     +S        
Sbjct: 277 EKGPGPRQYFLLSSQVGLHPEYREDLEALQVKHGESVLVLDKCTNLSEGVLSVRKRCHKH 336

Query: 261 ---TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGM 317
               Y + +Q A FC+   G       L + +  GC+PV+IAD  +LPF++ + W+   +
Sbjct: 337 QVFDYPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASV 396

Query: 318 FVAEEDVPKLDTILTSIPPEVILRKQR 344
            V EE +  + +IL SIP   I   QR
Sbjct: 397 VVPEEKMSDVYSILQSIPQRQIEEMQR 423


>gi|444707556|gb|ELW48821.1| Exostosin-2 [Tupaia chinensis]
          Length = 740

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 45/83 (54%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FCL   G       L + +  GC+PVI+AD  +LPF++ + W+   + V E
Sbjct: 330 YPQVLQEATFCLVLRGARLGQAVLSDVLQAGCVPVIVADSYILPFSEVLDWKRASVAVPE 389

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +  + +IL S+P   I   QR
Sbjct: 390 EKLSDVYSILQSVPQRQIEEMQR 412


>gi|268536482|ref|XP_002633376.1| C. briggsae CBR-RIB-1 protein [Caenorhabditis briggsae]
          Length = 349

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 32/116 (27%), Positives = 57/116 (49%), Gaps = 1/116 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R L + ++N  +          + W+ ++++     + ++    Y+D+   + FCL P G
Sbjct: 186 RNLVHHLHNGEDIVMVTTCKHNSDWQAYQDDRCQGDNNEYDRWEYDDLLANSTFCLVPRG 245

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTS 333
               S R +E +  GCIPV+I+D  VLPF++   W    + VAE D   +  +L S
Sbjct: 246 RRLGSFRFLETLRSGCIPVVISDSWVLPFSETTDWNSAVIVVAERDALSIPELLMS 301


>gi|426368045|ref|XP_004051025.1| PREDICTED: exostosin-2 isoform 1 [Gorilla gorilla gorilla]
          Length = 718

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 45/83 (54%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC+   G       L + +  GC+PV+IAD  +LPF++ + W+   + V E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +  + +IL SIP   I   QR
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQR 390


>gi|390470425|ref|XP_002755247.2| PREDICTED: exostosin-2 isoform 1 [Callithrix jacchus]
          Length = 718

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 45/83 (54%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC+   G       L + +  GC+PV+IAD  +LPF++ + W+   + V E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +  + +IL SIP   I   QR
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQR 390


>gi|46250443|gb|AAH68545.1| Exostoses (multiple) 2 [Homo sapiens]
          Length = 718

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 45/83 (54%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC+   G       L + +  GC+PV+IAD  +LPF++ + W+   + V E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +  + +IL SIP   I   QR
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQR 390


>gi|296010873|ref|NP_000392.3| exostosin-2 isoform 1 [Homo sapiens]
 gi|119588477|gb|EAW68071.1| exostoses (multiple) 2, isoform CRA_d [Homo sapiens]
          Length = 751

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 45/83 (54%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC+   G       L + +  GC+PV+IAD  +LPF++ + W+   + V E
Sbjct: 341 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 400

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +  + +IL SIP   I   QR
Sbjct: 401 EKMSDVYSILQSIPQRQIEEMQR 423


>gi|73982548|ref|XP_533196.2| PREDICTED: exostosin-2 isoform 2 [Canis lupus familiaris]
          Length = 718

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 45/83 (54%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC+   G       L + +  GC+PV+IAD  +LPF++ + W+   + V E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLRAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +  + +IL SIP   I   QR
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQR 390


>gi|403254631|ref|XP_003920065.1| PREDICTED: exostosin-2 [Saimiri boliviensis boliviensis]
          Length = 718

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 45/83 (54%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC+   G       L + +  GC+PV+IAD  +LPF++ + W+   + V E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +  + +IL SIP   I   QR
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQR 390


>gi|432090368|gb|ELK23794.1| Exostosin-2 [Myotis davidii]
          Length = 718

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 45/83 (54%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC+   G       L + +  GC+PV+IAD  +LPF++ + W+   + V E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLRAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +  + +IL SIP   I   QR
Sbjct: 368 EKMADVYSILQSIPQRQIEEMQR 390


>gi|297268065|ref|XP_001106468.2| PREDICTED: exostosin-2 [Macaca mulatta]
          Length = 718

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 45/83 (54%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC+   G       L + +  GC+PV+IAD  +LPF++ + W+   + V E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAILSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +  + +IL SIP   I   QR
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQR 390


>gi|119588473|gb|EAW68067.1| exostoses (multiple) 2, isoform CRA_a [Homo sapiens]
 gi|119588475|gb|EAW68069.1| exostoses (multiple) 2, isoform CRA_a [Homo sapiens]
          Length = 731

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 45/83 (54%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC+   G       L + +  GC+PV+IAD  +LPF++ + W+   + V E
Sbjct: 321 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 380

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +  + +IL SIP   I   QR
Sbjct: 381 EKMSDVYSILQSIPQRQIEEMQR 403


>gi|291384884|ref|XP_002708900.1| PREDICTED: exostosin 2 [Oryctolagus cuniculus]
          Length = 765

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 44/83 (53%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC+   G       L + +  GC+PV+IAD  +LPF++ + W+   + V E
Sbjct: 355 YPQVLQEATFCVVLRGARLGQAALSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 414

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +  +  IL SIP   I   QR
Sbjct: 415 EKMADVYKILQSIPQRQIQEMQR 437


>gi|126332604|ref|XP_001362716.1| PREDICTED: exostosin-2 [Monodelphis domestica]
          Length = 718

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 29/88 (32%), Positives = 46/88 (52%)

Query: 257 DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG 316
           D    Y + +Q A FC+   G       L + +  GC+PVIIAD  +LPF++ + W+   
Sbjct: 303 DQVFDYPQVLQEASFCMVLRGARLGQAVLSDVLQAGCVPVIIADSYILPFSEVLDWKRAS 362

Query: 317 MFVAEEDVPKLDTILTSIPPEVILRKQR 344
           + + EE +  + ++L SIP   I   QR
Sbjct: 363 VVIPEEKMRDMYSVLRSIPQRQIEEMQR 390


>gi|402893736|ref|XP_003910045.1| PREDICTED: exostosin-2-like isoform 1 [Papio anubis]
          Length = 535

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 45/83 (54%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC+   G       L + +  GC+PV+IAD  +LPF++ + W+   + V E
Sbjct: 341 YPQVLQEATFCVVLRGARLGQAILSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 400

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +  + +IL SIP   I   QR
Sbjct: 401 EKMSDVYSILQSIPQRQIEEMQR 423


>gi|432851754|ref|XP_004067068.1| PREDICTED: exostosin-2-like [Oryzias latipes]
          Length = 719

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 46/83 (55%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q + FC+   G       L + +  GC+PVI+AD  +LPF++ + W+   + + E
Sbjct: 309 YPQILQESSFCVVLRGARLGQATLSDVLQAGCVPVILADSYILPFSEVLDWKRASVVIPE 368

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E + ++ TIL SIP   +   QR
Sbjct: 369 EKLSEMYTILKSIPHRQVEEMQR 391


>gi|402893738|ref|XP_003910046.1| PREDICTED: exostosin-2-like isoform 2 [Papio anubis]
          Length = 515

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 45/83 (54%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC+   G       L + +  GC+PV+IAD  +LPF++ + W+   + V E
Sbjct: 321 YPQVLQEATFCVVLRGARLGQAILSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 380

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +  + +IL SIP   I   QR
Sbjct: 381 EKMSDVYSILQSIPQRQIEEMQR 403


>gi|194698238|gb|ACF83203.1| unknown [Zea mays]
 gi|413919897|gb|AFW59829.1| hypothetical protein ZEAMMB73_270023 [Zea mays]
          Length = 264

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 35/120 (29%), Positives = 56/120 (46%), Gaps = 13/120 (10%)

Query: 206 IPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYED 265
           +  +  R   +YF+G      +   GG      R  +W+   N P   +    P     +
Sbjct: 61  LSENKDRRTLLYFKG----AKHRHRGGL----VREKLWDLLGNEPDVIMEEGFPNATGRE 112

Query: 266 -----MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
                ++ + FCL P G  P S RL +A+   CIPVI++D++ LPF   I + EI +FV+
Sbjct: 113 QSIKGLRTSEFCLHPAGDTPTSCRLFDAIASLCIPVIVSDEVELPFEGIIDYTEISIFVS 172


>gi|168277694|dbj|BAG10825.1| exostosin-2 [synthetic construct]
          Length = 746

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 45/83 (54%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC+   G       L + +  GC+PV+IAD  +LPF++ + W+   + V E
Sbjct: 336 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 395

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +  + +IL SIP   I   QR
Sbjct: 396 EKMSDVYSILQSIPQRQIEEMQR 418


>gi|148235505|ref|NP_001080448.1| exostosin 2 [Xenopus laevis]
 gi|27881752|gb|AAH44703.1| Ext2-prov protein [Xenopus laevis]
          Length = 718

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 34/111 (30%), Positives = 57/111 (51%), Gaps = 9/111 (8%)

Query: 234 YARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGC 293
           Y+ GA AA+ +    N ++D        Y + +Q + FC+   G       L + +  GC
Sbjct: 289 YSDGA-AALRKRCYKNTVYD--------YPQILQESTFCIVLRGARLGQGLLSDVLQAGC 339

Query: 294 IPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQR 344
           +PVIIAD  VLPF++ + W+   + + EE + ++ +IL  IP   +   QR
Sbjct: 340 VPVIIADSYVLPFSEVLDWKRASVVIPEEKMFEMYSILQGIPQRQVEEMQR 390


>gi|348509623|ref|XP_003442347.1| PREDICTED: exostosin-2-like [Oreochromis niloticus]
          Length = 719

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 46/83 (55%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q + FC+   G       L + +  GC+PVI+AD  +LPF++ + W+   + + E
Sbjct: 309 YPQILQESSFCVVLRGARLGQAALSDVLQAGCVPVILADSYILPFSEVLDWKRASVVIPE 368

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E + ++ TIL SIP   +   QR
Sbjct: 369 EKLSEMYTILKSIPHRQVEEMQR 391


>gi|299472640|emb|CBN78292.1| Glycosyltransferase, family GT47 [Ectocarpus siliculosus]
          Length = 587

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/81 (34%), Positives = 41/81 (50%)

Query: 263 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
           Y D+    F L P G +P + RL EA+  G +PV I  D V PF   IPW +       E
Sbjct: 473 YGDLMNTTFALLPAGRSPATYRLAEALSAGALPVFIHQDFVKPFPGKIPWSDFSFSFPPE 532

Query: 323 DVPKLDTILTSIPPEVILRKQ 343
           +VP++   L ++P   + + Q
Sbjct: 533 EVPRMLKTLRAVPDRELAQMQ 553


>gi|194755878|ref|XP_001960206.1| GF13249 [Drosophila ananassae]
 gi|190621504|gb|EDV37028.1| GF13249 [Drosophila ananassae]
          Length = 765

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 33/104 (31%), Positives = 56/104 (53%), Gaps = 3/104 (2%)

Query: 243 WENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 301
           W   ++N   + + ++    YE + Q + FCL P G    S R +EA+  GCIPV++++ 
Sbjct: 302 WRELQDNRCDEDNREYDRYDYETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNA 361

Query: 302 IVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVI--LRKQ 343
            VLPF   I W++  ++  E  + ++   + SI  E I  LR+Q
Sbjct: 362 WVLPFESKIDWKQAAIWADERLLLQVPDTVRSISVERIFALRQQ 405


>gi|354491358|ref|XP_003507822.1| PREDICTED: exostosin-2 [Cricetulus griseus]
          Length = 718

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 45/83 (54%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC+   G       L + +  GC+PV+IAD  +LPF++ + W+   + V E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +  + +IL +IP   I   QR
Sbjct: 368 EKMSDMYSILQNIPQRQIEEMQR 390


>gi|56751461|ref|YP_172162.1| hypothetical protein syc1452_c [Synechococcus elongatus PCC 6301]
 gi|56686420|dbj|BAD79642.1| hypothetical protein [Synechococcus elongatus PCC 6301]
          Length = 788

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 41/146 (28%), Positives = 65/146 (44%), Gaps = 10/146 (6%)

Query: 197 PPQKMQAHQIPPDTPRSIFVYFRGL--------FYDVNNDPEGGYYARGARAAVWENFKN 248
           PP  +      P   RS  + FRG            ++  PE    A   + + W     
Sbjct: 134 PPMPLLDQHCYPSRDRSTLLSFRGANSHPVREQLQRLHQPPEIA--AELIQQSYWGTLNY 191

Query: 249 NPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
               +  +     Y + + R+ F + P G   +S RL+E +  G IPVI+ADD VLPF++
Sbjct: 192 VDEAEGLSAEQQVYTDLIARSRFSVAPRGHDIFSYRLLEVMAGGAIPVILADDWVLPFSE 251

Query: 309 AIPWEEIGMFVAEEDVPKLDTILTSI 334
            + W E  + VAE+   +L  +L +I
Sbjct: 252 LLDWSEFSLSVAEDRCWELPQLLQAI 277


>gi|344257014|gb|EGW13118.1| Exostosin-2 [Cricetulus griseus]
          Length = 701

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 45/83 (54%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC+   G       L + +  GC+PV+IAD  +LPF++ + W+   + V E
Sbjct: 291 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 350

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +  + +IL +IP   I   QR
Sbjct: 351 EKMSDMYSILQNIPQRQIEEMQR 373


>gi|307215037|gb|EFN89864.1| Exostosin-2 [Harpegnathos saltator]
          Length = 594

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 44/83 (53%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +  A FCL   G       L++A+  GCIPVIIAD +++PF D I W +  + V E
Sbjct: 187 YPDILHSATFCLIIRGARLAQSVLLDAMAAGCIPVIIADSLIMPFHDVIDWTKAAILVRE 246

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
            D+  +  +L  I  + I+  Q 
Sbjct: 247 VDILLIIQLLKKISHQRIVEMQE 269


>gi|308799499|ref|XP_003074530.1| Acetylglucosaminyltransferase EXT1/exostosin 1 (ISS) [Ostreococcus
           tauri]
 gi|116000701|emb|CAL50381.1| Acetylglucosaminyltransferase EXT1/exostosin 1 (ISS), partial
           [Ostreococcus tauri]
          Length = 439

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 31/83 (37%), Positives = 46/83 (55%), Gaps = 3/83 (3%)

Query: 252 FDIST---DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
           +D+ST   D P  Y   + ++ +CL   G    + RL + + FGC+PVI+AD   LPF+ 
Sbjct: 328 WDLSTSGQDKPRDYMMLLSKSRYCLYVYGDRAHTARLYDIITFGCVPVIVADGYDLPFSW 387

Query: 309 AIPWEEIGMFVAEEDVPKLDTIL 331
              W +  + V E+DV KL  IL
Sbjct: 388 LFDWSKFSVRVPEDDVAKLPGIL 410


>gi|81298858|ref|YP_399066.1| TPR repeat-containing protein [Synechococcus elongatus PCC 7942]
 gi|81167739|gb|ABB56079.1| TPR repeat [Synechococcus elongatus PCC 7942]
          Length = 788

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 41/146 (28%), Positives = 65/146 (44%), Gaps = 10/146 (6%)

Query: 197 PPQKMQAHQIPPDTPRSIFVYFRGL--------FYDVNNDPEGGYYARGARAAVWENFKN 248
           PP  +      P   RS  + FRG            ++  PE    A   + + W     
Sbjct: 134 PPMPLLDQHCYPSRDRSTLLSFRGANSHPVREQLQRLHQPPEIA--AELIQQSYWGTLNY 191

Query: 249 NPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
               +  +     Y + + R+ F + P G   +S RL+E +  G IPVI+ADD VLPF++
Sbjct: 192 VDEAEGLSAEQQVYTDLIARSRFSVAPRGHDIFSYRLLEVMAGGAIPVILADDWVLPFSE 251

Query: 309 AIPWEEIGMFVAEEDVPKLDTILTSI 334
            + W E  + VAE+   +L  +L +I
Sbjct: 252 LLDWSEFSLSVAEDRCWELPQLLQAI 277


>gi|410912512|ref|XP_003969733.1| PREDICTED: exostosin-2-like [Takifugu rubripes]
          Length = 719

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 46/83 (55%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q + FC+   G       L + +  GC+PVI+AD  +LPF++ + W+   +F+ E
Sbjct: 309 YPQVLQDSSFCVVLRGARLGQAALSDVLQAGCVPVILADSYILPFSEVLDWKRASVFIPE 368

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E + ++  IL SIP   +   QR
Sbjct: 369 EKLSEMYGILKSIPHRQVEEMQR 391


>gi|307197832|gb|EFN78943.1| Exostosin-1 [Harpegnathos saltator]
          Length = 711

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 34/124 (27%), Positives = 58/124 (46%), Gaps = 1/124 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y ++N  +  +         W   ++      + ++ T  YE +   A FCL P G
Sbjct: 254 RNALYHLHNGKDLVFVTTCRHGKAWRELQDEHCQQDNQEYDTYDYEILLMNATFCLVPRG 313

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+  GCIPVI+++   LPF + I W +  +F  E  + ++  I+ S+   
Sbjct: 314 RRLGSFRFLEALRAGCIPVILSNGWALPFHERIDWTQAVIFSDERLLLQIPDIVRSVSNV 373

Query: 338 VILR 341
            IL+
Sbjct: 374 QILK 377


>gi|348538380|ref|XP_003456670.1| PREDICTED: exostosin-1a-like [Oreochromis niloticus]
          Length = 776

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 57/236 (24%), Positives = 94/236 (39%), Gaps = 20/236 (8%)

Query: 118 MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTF 177
           +RS IQ    N P WN  +G +H     +  G+   Y E+   E G   L + +  V  F
Sbjct: 203 IRSRIQ----NLPTWN--DGRNHLIFNLYS-GSWPDYTEDLGFEVGQAMLAKASADVVNF 255

Query: 178 GQRNHVCL------------NEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDV 225
                + +              G +T+    P +K Q           I    R   Y +
Sbjct: 256 RSNYDISIPLFSKDHPLKGGGIGYLTLNDAPPSRKYQLVFKGKRYLTGIGSETRNALYHI 315

Query: 226 NNDPEGGYYARGARAAVWENFKNNPLFDISTDHPT-TYYEDMQRAIFCLCPLGWAPWSPR 284
           +N  +            WE  K++     + D+    Y E +  + FCL P G    S R
Sbjct: 316 HNGEDIILLTTCKHGKDWEKHKDSRCDRDNEDYSKFDYQELLHNSTFCLVPRGRRLGSFR 375

Query: 285 LVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVIL 340
            +EA+   CIPVI+++   LPF++ I W +  +   E  + ++ +I  S+  + IL
Sbjct: 376 FLEALQAACIPVILSNGWELPFSEVIDWRKAAIIGDERLLLQVPSITRSVGRDRIL 431


>gi|195123279|ref|XP_002006135.1| GI20872 [Drosophila mojavensis]
 gi|193911203|gb|EDW10070.1| GI20872 [Drosophila mojavensis]
          Length = 761

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 40/147 (27%), Positives = 68/147 (46%), Gaps = 11/147 (7%)

Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
           K   H I  +T  S+F         ++N  +            W   ++N   + + ++ 
Sbjct: 266 KRYVHGIGSETRNSLF--------HLHNGRDMVMVTTCKHGKSWRELQDNRCDEDNREYD 317

Query: 260 TTYYEDM-QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
              Y  + Q + FCL P G    S R +EA+  GCIPV++++  VLPF   I W++  ++
Sbjct: 318 RYDYGTLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIW 377

Query: 319 VAEEDVPKLDTILTSIPPEVI--LRKQ 343
             E  + ++  I+ SI  E I  LR+Q
Sbjct: 378 ADERLLLQVPDIVRSISAERIFALRQQ 404


>gi|340719750|ref|XP_003398310.1| PREDICTED: exostosin-1-like isoform 1 [Bombus terrestris]
 gi|340719752|ref|XP_003398311.1| PREDICTED: exostosin-1-like isoform 2 [Bombus terrestris]
          Length = 711

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 34/124 (27%), Positives = 58/124 (46%), Gaps = 1/124 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y ++N  +  +         W   ++      + ++ T  YE +   A FCL P G
Sbjct: 254 RNALYHLHNGKDLVFVTTCRHGKAWRELQDEHCQQDNQEYDTYDYEILLMNATFCLVPRG 313

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+  GCIPVI+++   LPF + I W +  +F  E  + ++  I+ S+   
Sbjct: 314 RRLGSFRFLEALRAGCIPVILSNGWALPFHERIDWTQAVIFSDERLLLQIPDIVRSVSNV 373

Query: 338 VILR 341
            IL+
Sbjct: 374 HILK 377


>gi|242020116|ref|XP_002430502.1| Exostosin-2, putative [Pediculus humanus corporis]
 gi|212515659|gb|EEB17764.1| Exostosin-2, putative [Pediculus humanus corporis]
          Length = 696

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 42/155 (27%), Positives = 68/155 (43%), Gaps = 3/155 (1%)

Query: 190 ITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN 249
           I++P ++P  K     +     +  FV         + + E G  +   +  + EN   +
Sbjct: 219 ISLPLFSPYAK-SIKTLNKSQEKKWFVINSQANLHSDYERELGALSGVVKLELCEN--GD 275

Query: 250 PLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 309
           P F         Y   +Q A FC+   G       L+E++  GCIPVI AD +VLPF D 
Sbjct: 276 PRFRCHQGIAYNYPSVLQHATFCIIIRGARLAQQALLESLSAGCIPVIAADLMVLPFQDV 335

Query: 310 IPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQR 344
           I W+   + + E D+  L   L+S+  +  L  Q+
Sbjct: 336 IDWKRASITILESDLSSLIEKLSSVSDDKKLELQQ 370


>gi|350421075|ref|XP_003492724.1| PREDICTED: exostosin-1-like isoform 1 [Bombus impatiens]
 gi|350421078|ref|XP_003492725.1| PREDICTED: exostosin-1-like isoform 2 [Bombus impatiens]
          Length = 711

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 34/124 (27%), Positives = 58/124 (46%), Gaps = 1/124 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y ++N  +  +         W   ++      + ++ T  YE +   A FCL P G
Sbjct: 254 RNALYHLHNGKDLVFVTTCRHGKAWRELQDEHCQQDNQEYDTYDYEILLMNATFCLVPRG 313

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+  GCIPVI+++   LPF + I W +  +F  E  + ++  I+ S+   
Sbjct: 314 RRLGSFRFLEALRAGCIPVILSNGWALPFHERIDWTQAVIFSDERLLLQIPDIVRSVSNV 373

Query: 338 VILR 341
            IL+
Sbjct: 374 HILK 377


>gi|170054218|ref|XP_001863025.1| exostosin-2 [Culex quinquefasciatus]
 gi|167874545|gb|EDS37928.1| exostosin-2 [Culex quinquefasciatus]
          Length = 758

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 26/80 (32%), Positives = 47/80 (58%), Gaps = 2/80 (2%)

Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
           ++   FCL   G     P L++A+  GCIPVI+AD++VLPF + + W+ + + + E ++ 
Sbjct: 354 LENGQFCLVARGVRLSQPTLMDALASGCIPVIMADNLVLPFGEVLDWDLVSIRIHENNLH 413

Query: 326 KLDTILTSIPPEVI--LRKQ 343
            + + L ++  E +  LR Q
Sbjct: 414 SVISTLKAVSKERVQELRAQ 433


>gi|440791537|gb|ELR12775.1| exostosin family protein [Acanthamoeba castellanii str. Neff]
          Length = 504

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 26/74 (35%), Positives = 40/74 (54%), Gaps = 1/74 (1%)

Query: 249 NPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
            P+F  +  H T Y E + +++FC    G  P +  L   +  GCIP++ +D   LPF D
Sbjct: 299 GPMFGATVPH-TYYVEQITQSVFCAVARGHTPTTRALFNMLAGGCIPILFSDRWHLPFVD 357

Query: 309 AIPWEEIGMFVAEE 322
            +PW E+ +F  EE
Sbjct: 358 QLPWHEMVIFAPEE 371


>gi|401412800|ref|XP_003885847.1| conserved hypothetical protein [Neospora caninum Liverpool]
 gi|325120267|emb|CBZ55821.1| conserved hypothetical protein [Neospora caninum Liverpool]
          Length = 1577

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 35/84 (41%), Positives = 47/84 (55%), Gaps = 4/84 (4%)

Query: 266  MQRAIFCLCPL-GWAPWSPRLVEAVVFGCIPVIIA--DDIVLPFADAIPWEEIGMFVAEE 322
            + RA FCLCP  GW P S  + EA+   CIPVII   +++ LP      W +I +FV   
Sbjct: 1422 ISRATFCLCPQEGWLP-SLCIFEALAHACIPVIIGGEEELWLPGGCLFDWIQIAVFVPLS 1480

Query: 323  DVPKLDTILTSIPPEVILRKQRLL 346
              P    IL+ IP + IL KQ++L
Sbjct: 1481 RAPYTSLILSLIPEKEILEKQQML 1504


>gi|335281986|ref|XP_003122905.2| PREDICTED: exostosin-2-like [Sus scrofa]
          Length = 718

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 45/83 (54%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC+   G       L + +  GC+PV+IAD  VLPF++ + W+   + V E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLRAGCVPVVIADSYVLPFSEVLDWKRASVVVPE 367

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +  + +IL SIP   +   QR
Sbjct: 368 EKMSDVYSILQSIPRRQMEEMQR 390


>gi|302854903|ref|XP_002958955.1| hypothetical protein VOLCADRAFT_100271 [Volvox carteri f.
           nagariensis]
 gi|300255701|gb|EFJ39990.1| hypothetical protein VOLCADRAFT_100271 [Volvox carteri f.
           nagariensis]
          Length = 705

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 48/168 (28%), Positives = 79/168 (47%), Gaps = 15/168 (8%)

Query: 221 LFYDVNNDPEGGYYARGARAAVWENFK-----NNPLFDISTDHPTT--YYEDMQRAIFCL 273
            F+  +  P    Y+ GAR A+  + K          DI     T   Y     R+ FCL
Sbjct: 508 FFFAGSVRPRDTSYSGGARQALSAHLKALMASGGNYSDIQFVEGTVPDYEALYMRSRFCL 567

Query: 274 CPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD--AIPWEEIGMFVAEEDVPKLDTIL 331
            P G A +  RL  A+   CIPVII D +  P+     +P+ +  + +++ D+P +  IL
Sbjct: 568 APHG-AGFGVRLTLAMTHACIPVIIQDQVYQPYESDGLLPYSQFSLRLSKSDIPYIVDIL 626

Query: 332 TSIPPEVILRKQRL-LANPSMKRAMLFPQPAQPGDAFHQILNGLARKL 378
            S+  E   R++R+ LA      A L+ +P+  G A++  +  L ++L
Sbjct: 627 RSVSTE---RQKRMRLAMAKYHHAFLW-EPSLGGRAYNYTIRALNQRL 670


>gi|350423159|ref|XP_003493402.1| PREDICTED: exostosin-2-like [Bombus impatiens]
          Length = 708

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 27/79 (34%), Positives = 41/79 (51%)

Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
           +Q A FCL   G       L+E +  G IP+IIAD + +P+   I W    +FV E D+ 
Sbjct: 305 LQTATFCLVIRGARLGQSTLLECMAAGSIPIIIADSLTMPYHGIIDWSRAAIFVREVDIL 364

Query: 326 KLDTILTSIPPEVILRKQR 344
            + ++L  I P+ I+  Q 
Sbjct: 365 SIISVLKKISPQRIIELQE 383


>gi|340727245|ref|XP_003401958.1| PREDICTED: exostosin-2-like isoform 2 [Bombus terrestris]
          Length = 710

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 27/79 (34%), Positives = 41/79 (51%)

Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
           +Q A FCL   G       L+E +  G IP+IIAD + +P+   I W    +FV E D+ 
Sbjct: 305 LQTATFCLVIRGARLGQSTLLECMAAGSIPIIIADSLTMPYHGIIDWSRAAIFVREVDIL 364

Query: 326 KLDTILTSIPPEVILRKQR 344
            + ++L  I P+ I+  Q 
Sbjct: 365 SIISVLKKISPQRIIELQE 383


>gi|401886285|gb|EJT50333.1| hypothetical protein A1Q1_00388 [Trichosporon asahii var. asahii
           CBS 2479]
          Length = 1041

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 33/106 (31%), Positives = 50/106 (47%), Gaps = 2/106 (1%)

Query: 261 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
           TY + +    FC  P G A W+ RL + V  GCIPV++ D   + +     W    + V 
Sbjct: 891 TYLDLLGDTRFCPIPFGTAGWTYRLSDVVYAGCIPVLVGDQSHMTYWSMFDWSLFSVQVF 950

Query: 321 EEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDA 366
           E ++  L+ IL+ I  E   RKQ  L    ++ A L+P      +A
Sbjct: 951 EHELDHLERILSGITEEDAQRKQDALM--LVREAFLYPSEGHVEEA 994


>gi|340727243|ref|XP_003401957.1| PREDICTED: exostosin-2-like isoform 1 [Bombus terrestris]
          Length = 708

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 27/79 (34%), Positives = 41/79 (51%)

Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
           +Q A FCL   G       L+E +  G IP+IIAD + +P+   I W    +FV E D+ 
Sbjct: 305 LQTATFCLVIRGARLGQSTLLECMAAGSIPIIIADSLTMPYHGIIDWSRAAIFVREVDIL 364

Query: 326 KLDTILTSIPPEVILRKQR 344
            + ++L  I P+ I+  Q 
Sbjct: 365 SIISVLKKISPQRIIELQE 383


>gi|294956185|ref|XP_002788843.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239904455|gb|EER20639.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 87

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 22/53 (41%), Positives = 35/53 (66%)

Query: 261 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE 313
           TY  +M+ +IFC  P G  PW+ R+ +A++ GCIPV++++ IV PF   + W 
Sbjct: 12  TYDAEMKDSIFCFIPRGNTPWTRRIFDAIISGCIPVVLSNAIVFPFESLLDWS 64


>gi|328788198|ref|XP_001121972.2| PREDICTED: exostosin-2 [Apis mellifera]
          Length = 708

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 29/79 (36%), Positives = 41/79 (51%)

Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
           +Q A FCL   G       L+E +  G IPVIIAD + +PF   I W    +F+ E D+ 
Sbjct: 305 LQTATFCLVIRGARLGQTTLLECMAAGSIPVIIADSLTMPFHGIIDWTRAVIFIREVDIL 364

Query: 326 KLDTILTSIPPEVILRKQR 344
            L ++L  I  E I+  Q+
Sbjct: 365 SLISVLKKISQERIIELQQ 383


>gi|255070683|ref|XP_002507423.1| glycosyltransferase family 47 protein [Micromonas sp. RCC299]
 gi|226522698|gb|ACO68681.1| glycosyltransferase family 47 protein [Micromonas sp. RCC299]
          Length = 698

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 34/123 (27%), Positives = 58/123 (47%), Gaps = 6/123 (4%)

Query: 212 RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIF 271
           R I + FRG       +   G+Y    ++  W+   +  +       P+ Y   M+ + F
Sbjct: 486 RPIEMSFRGTLRGGVRERILGHYLSVGKSRNWDLRSDGQV------SPSRYMRLMRDSKF 539

Query: 272 CLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTIL 331
           CL   G    SPRL+E ++FGC+PVI+AD  V P +    W +  + + E +  +L  +L
Sbjct: 540 CLHVRGTRVQSPRLIEGMLFGCVPVIVADGYVPPLSWLFDWSKFSVRLPEVEHERLPEVL 599

Query: 332 TSI 334
             +
Sbjct: 600 QGV 602


>gi|380025736|ref|XP_003696624.1| PREDICTED: LOW QUALITY PROTEIN: exostosin-2-like [Apis florea]
          Length = 700

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 30/83 (36%), Positives = 43/83 (51%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FCL   G       L+E +  G IPVIIAD + +PF   I W    +F+ E
Sbjct: 295 YPDVLQTATFCLVIRGARLGQTTLLECMAAGSIPVIIADSLTMPFHGIIDWTRAVIFIRE 354

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
            D+  L ++L  I  E I+  Q+
Sbjct: 355 VDILSLISVLKKISQERIIELQQ 377


>gi|358335371|dbj|GAA53901.1| exostosin-2 [Clonorchis sinensis]
          Length = 1030

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 31/87 (35%), Positives = 49/87 (56%), Gaps = 8/87 (9%)

Query: 262 YYEDMQRAIFCLC----PLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGM 317
           Y E + R++FCL     P+G       L +++  GCIPVI  D+ +LPF++ + W +I +
Sbjct: 693 YGEALARSLFCLIIQIPPVGQFA----LFDSMNAGCIPVIADDNFILPFSEVLDWSKIAI 748

Query: 318 FVAEEDVPKLDTILTSIPPEVILRKQR 344
            V   ++ K+ T LTS   E I + QR
Sbjct: 749 RVRHSELHKIVTTLTSFTSEEIAQFQR 775


>gi|355566589|gb|EHH22968.1| Exostosin-2 [Macaca mulatta]
          Length = 751

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 45/83 (54%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC+   G       L + +  GC+PV+IAD  +LPF++ + W+   + V E
Sbjct: 341 YPQVLQEATFCVVLRGARLGQAMLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 400

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +  + +IL SIP   I   +R
Sbjct: 401 EKMSDVYSILQSIPQRQIEEMKR 423


>gi|307111745|gb|EFN59979.1| hypothetical protein CHLNCDRAFT_56485 [Chlorella variabilis]
          Length = 478

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 30/91 (32%), Positives = 45/91 (49%), Gaps = 14/91 (15%)

Query: 76  FLLSSPVRTLNPEEADWFYTPIYPTCDLTPT--------------GLPLPFKSPRMMRSA 121
            L+ S  RTL+ EEAD+FY P++ +C + P               G      +  M+  A
Sbjct: 320 MLMQSEHRTLDAEEADFFYVPVFTSCFIWPVRDGADSLYDFFYSVGHNRVQGATNMLLEA 379

Query: 122 IQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
              I S+ P+W R  G DH ++V HD G+C+
Sbjct: 380 FHWIQSHQPWWERRGGRDHIWLVTHDEGSCW 410


>gi|351696937|gb|EHA99855.1| Exostosin-2 [Heterocephalus glaber]
          Length = 717

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 44/83 (53%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC+   G       L + +  GC+PV+IAD  +LPF++ + W+   + V E
Sbjct: 307 YPQVLQEATFCVVLRGARLGQAALSDVLRAGCVPVVIADSYILPFSEVLDWKRASVVVPE 366

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +  +  +L SIP   +   QR
Sbjct: 367 EKIADMYGVLQSIPRRQMEEMQR 389


>gi|395801788|ref|ZP_10481043.1| hypothetical protein FF52_07909 [Flavobacterium sp. F52]
 gi|395435977|gb|EJG01916.1| hypothetical protein FF52_07909 [Flavobacterium sp. F52]
          Length = 341

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 28/110 (25%), Positives = 51/110 (46%), Gaps = 4/110 (3%)

Query: 257 DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG 316
           D    ++ ++Q  ++  C  G   +S R  EA++ G IPV++  D+ LPFA+ I W +  
Sbjct: 226 DSTLEFFTNIQNNLYVFCLRGNGNFSIRFYEALIMGRIPVLVDTDVRLPFANTINWSKHC 285

Query: 317 MFVAE----EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQ 362
           + V+E    E +       + +  E +    R L    + R   F Q ++
Sbjct: 286 LLVSEKNLLEKIVSFHNAKSQVELEAMQFGNRKLIKEKLNRIDYFIQFSK 335


>gi|449460844|ref|XP_004148154.1| PREDICTED: xyloglucan galactosyltransferase KATAMARI1 homolog
           [Cucumis sativus]
          Length = 471

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 44/147 (29%), Positives = 60/147 (40%), Gaps = 38/147 (25%)

Query: 35  DPVGKLKVYVYELPSKYNKKLLLK----DP---RC------------------------- 62
           DP    +V+VY+LP  +NK LL      DP   RC                         
Sbjct: 67  DPCNSGRVFVYDLPPFFNKDLLQNCESLDPWTSRCDDVSNGGFGRQATELNGVVPDGLTP 126

Query: 63  ---LTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMR 119
               +  +  E  MH  +L+   RTL+PE A  FY P Y    +    L L + +    R
Sbjct: 127 AWFWSEQYMLEPIMHNRILNYKCRTLDPESATAFYIPFYAGLSIGRY-LWLNYTTSDRDR 185

Query: 120 SAIQLIS--SNWPYWNRTEGADHFFVV 144
            + +LI    N PYWNR+ G DHF  +
Sbjct: 186 DSEKLIEWVQNEPYWNRSNGGDHFITL 212


>gi|302794690|ref|XP_002979109.1| hypothetical protein SELMODRAFT_418827 [Selaginella moellendorffii]
 gi|300153427|gb|EFJ20066.1| hypothetical protein SELMODRAFT_418827 [Selaginella moellendorffii]
          Length = 435

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 42/143 (29%), Positives = 71/143 (49%), Gaps = 16/143 (11%)

Query: 260 TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV 319
           T++ +    +IFCL P G  P S RL +A+V GCIPVI++D++  PF   + + ++ +F 
Sbjct: 281 TSWSQRHTFSIFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELEPPFEGLVDYRKVALF- 339

Query: 320 AEEDVPKLDTI----LTSIPPEVILRKQRLLANPSMK--RAMLFPQPAQ---PGDAFHQI 370
               VP + T     L S    +  R+  +L +  ++  R   +  PAQ   P D   Q 
Sbjct: 340 ----VPSVKTTEKGWLVSYLRAITARQLSMLRSHMLEFSRHFQYSSPAQQLGPEDLTWQA 395

Query: 371 LNGLAR--KLPHDKSTYLKPGGK 391
           + G  +  +L   ++  L  GG+
Sbjct: 396 VAGKLQSIRLHIRRAQRLVDGGR 418



 Score = 45.8 bits (107), Expect = 0.043,   Method: Compositional matrix adjust.
 Identities = 42/151 (27%), Positives = 64/151 (42%), Gaps = 33/151 (21%)

Query: 25  SGSAGDV---LEDDPVGK--LKVYVYELPSKYNKKLL----------------------L 57
           +GSA D+   +ED       L+VYVY++P K+  +LL                      L
Sbjct: 78  TGSAEDLDRGMEDRARSSQVLRVYVYDMPEKFTLQLLRLFRDTYKETANLTSNGSPVHRL 137

Query: 58  KDPRCLTHMFAAEIFM--HRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSP 115
            +   + +   A++     + LL S  R LNP EAD FY P + T       L    +  
Sbjct: 138 IEQHSIDYWLYADLLAPESQRLLKSVKRVLNPTEADIFYIPFFTTISYF---LMEKQQCK 194

Query: 116 RMMRSAIQLISSNWPYWNRTEGADHFFVVPH 146
           ++ R A+  + +N   W R+ G DH   V H
Sbjct: 195 QLYREALSWV-TNQAAWKRSGGRDHVLPVHH 224


>gi|116786286|gb|ABK24052.1| unknown [Picea sitchensis]
          Length = 208

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 37/125 (29%), Positives = 60/125 (48%), Gaps = 13/125 (10%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGM---- 317
           Y++ ++ A FCL P G + W+ R  EA    C+PVI++D I LPF + + + +  +    
Sbjct: 58  YFQHLRNAKFCLAPRGESSWTLRFYEAFFVECVPVILSDQIELPFQNVLDYSQFSIKWPA 117

Query: 318 -FVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPG-DAFHQILNGLA 375
             +  E +  LD+I  +    +I R Q++       R +    P   G  A   IL  L 
Sbjct: 118 TRIGVELLEYLDSITDTEIKRMIARGQQV-------RCLWAYAPESVGCSAMTGILWELQ 170

Query: 376 RKLPH 380
           RK+ H
Sbjct: 171 RKVRH 175


>gi|443721868|gb|ELU10993.1| hypothetical protein CAPTEDRAFT_157657 [Capitella teleta]
          Length = 712

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 25/83 (30%), Positives = 49/83 (59%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q + FCL           L +A+  GC+PVI+AD  +LPF++ I W+   + VAE
Sbjct: 303 YPQVLQESEFCLVIRTSRLGQLTLSDAMKAGCVPVIVADSFILPFSEVIDWKRAAIVVAE 362

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           +++  ++ ++ +I  + +L+ +R
Sbjct: 363 DNLSTVNEVVRAISRDSLLQMRR 385


>gi|449499718|ref|XP_004160896.1| PREDICTED: xyloglucan galactosyltransferase KATAMARI1 homolog
           [Cucumis sativus]
          Length = 471

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 44/147 (29%), Positives = 60/147 (40%), Gaps = 38/147 (25%)

Query: 35  DPVGKLKVYVYELPSKYNKKLLLK----DP---RC------------------------- 62
           DP    +V+VY+LP  +NK LL      DP   RC                         
Sbjct: 67  DPCNSGRVFVYDLPPFFNKDLLQNCESLDPWTSRCDDVSNGGFGRQATELNGVVPDGLTP 126

Query: 63  ---LTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMR 119
               +  +  E  MH  +L+   RTL+PE A  FY P Y    +    L L + +    R
Sbjct: 127 AWFWSEQYMLEPIMHNRILNYKCRTLDPESATAFYIPFYAGLSIGRY-LWLNYTTSDRDR 185

Query: 120 SAIQLIS--SNWPYWNRTEGADHFFVV 144
            + +LI    N PYWNR+ G DHF  +
Sbjct: 186 DSEKLIEWVQNEPYWNRSNGGDHFITL 212


>gi|260826381|ref|XP_002608144.1| hypothetical protein BRAFLDRAFT_91377 [Branchiostoma floridae]
 gi|229293494|gb|EEN64154.1| hypothetical protein BRAFLDRAFT_91377 [Branchiostoma floridae]
          Length = 595

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 26/76 (34%), Positives = 41/76 (53%)

Query: 269 AIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLD 328
           A FC+   G       L +A++ GCIPVI  D  V+PF++ + W+   + + EED+P + 
Sbjct: 194 ATFCMILRGARMGQSALSDAMMAGCIPVIAIDTYVMPFSEVLDWKRAAVILREEDLPDVH 253

Query: 329 TILTSIPPEVILRKQR 344
            +L  I  E I   +R
Sbjct: 254 NVLRRISQERITNMRR 269


>gi|74201817|dbj|BAE28510.1| unnamed protein product [Mus musculus]
          Length = 701

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 44/83 (53%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC    G       L + +  GC+PV+IAD  +LPF++ + W+   + V E
Sbjct: 291 YPQVLQEATFCTVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 350

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +  + +IL +IP   I   QR
Sbjct: 351 EKMSDVYSILQNIPQRQIEEMQR 373


>gi|110637313|ref|YP_677520.1| hypothetical protein CHU_0899 [Cytophaga hutchinsonii ATCC 33406]
 gi|110279994|gb|ABG58180.1| conserved hypothetical protein [Cytophaga hutchinsonii ATCC 33406]
          Length = 330

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 30/73 (41%), Positives = 44/73 (60%), Gaps = 3/73 (4%)

Query: 261 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
           TY ED+  + F LCP G A +S R++E +  G +PVIIAD+ V PF  +I  +   + +A
Sbjct: 180 TYLEDILSSYFVLCPRGIASYSHRIIETMALGSVPVIIADEWV-PF--SIEEDNYYVRIA 236

Query: 321 EEDVPKLDTILTS 333
           E DV  +  IL +
Sbjct: 237 ESDVENIYAILKA 249


>gi|302809300|ref|XP_002986343.1| hypothetical protein SELMODRAFT_425371 [Selaginella moellendorffii]
 gi|300145879|gb|EFJ12552.1| hypothetical protein SELMODRAFT_425371 [Selaginella moellendorffii]
          Length = 421

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 42/143 (29%), Positives = 70/143 (48%), Gaps = 16/143 (11%)

Query: 260 TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV 319
           T++ +    +IFCL P G  P S RL +A+V GCIPVI++D++  PF   + + ++ +F 
Sbjct: 281 TSWSQRHTFSIFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELEPPFEGLVDYRKVALF- 339

Query: 320 AEEDVPKLDTI----LTSIPPEVILRKQRLLANPSMK--RAMLFPQPAQ---PGDAFHQI 370
               VP + T     L S    +  R+  +L    ++  R   +  PAQ   P D   Q 
Sbjct: 340 ----VPSVKTTEKGWLVSYLRAITARQLSMLRGHMLEFSRHFQYSSPAQQLGPEDLTWQT 395

Query: 371 LNGLAR--KLPHDKSTYLKPGGK 391
           + G  +  +L   ++  L  GG+
Sbjct: 396 VAGKLQSIRLHIRRAQRLVDGGR 418



 Score = 44.7 bits (104), Expect = 0.089,   Method: Compositional matrix adjust.
 Identities = 42/151 (27%), Positives = 63/151 (41%), Gaps = 33/151 (21%)

Query: 25  SGSAGDV---LEDDPVGK--LKVYVYELPSKYNKKLL----------------------L 57
           +GSA D+   +ED       L+VYVY++P K+  +LL                      L
Sbjct: 78  TGSAEDLDRGMEDRARSSQVLRVYVYDMPEKFTLQLLRLFRDTYKETANLTSNGSPVHRL 137

Query: 58  KDPRCLTHMFAAEIFM--HRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSP 115
                + +   A++     + LL S  R LNP EAD FY P + T       L    +  
Sbjct: 138 IGQHSIDYWLYADLLAPESQRLLKSVKRVLNPTEADIFYIPFFTTISYF---LMEKQQCK 194

Query: 116 RMMRSAIQLISSNWPYWNRTEGADHFFVVPH 146
           ++ R A+  + +N   W R+ G DH   V H
Sbjct: 195 QLYREALSWV-TNQAAWKRSGGRDHVLPVHH 224


>gi|417404013|gb|JAA48784.1| Putative glycosyl transferase family 64 domain protein [Desmodus
           rotundus]
          Length = 701

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 44/83 (53%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC+   G       L + +  GC+PV+IAD  +LPF++ + W+   + V E
Sbjct: 291 YPQVLQEATFCVVLRGARLGQAVLSDILRAGCVPVVIADSYILPFSEVLDWKRASVVVPE 350

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +  +  IL SIP   I   QR
Sbjct: 351 EKMSDVYGILQSIPQRQIGEMQR 373


>gi|226442845|ref|NP_034293.2| exostosin-2 [Mus musculus]
 gi|341940670|sp|P70428.2|EXT2_MOUSE RecName: Full=Exostosin-2; AltName:
           Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
           acetylglucosaminyl-proteoglycan
           4-alpha-N-acetylglucosaminyltransferase; AltName:
           Full=Multiple exostoses protein 2 homolog
 gi|74216635|dbj|BAE37749.1| unnamed protein product [Mus musculus]
 gi|148695682|gb|EDL27629.1| exostoses (multiple) 2, isoform CRA_a [Mus musculus]
          Length = 718

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 44/83 (53%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC    G       L + +  GC+PV+IAD  +LPF++ + W+   + V E
Sbjct: 308 YPQVLQEATFCTVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +  + +IL +IP   I   QR
Sbjct: 368 EKMSDVYSILQNIPQRQIEEMQR 390


>gi|13879260|gb|AAH06597.1| Exostoses (multiple) 2 [Mus musculus]
          Length = 718

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 44/83 (53%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC    G       L + +  GC+PV+IAD  +LPF++ + W+   + V E
Sbjct: 308 YPQVLQEATFCTVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +  + +IL +IP   I   QR
Sbjct: 368 EKMSDVYSILQNIPQRQIEEMQR 390


>gi|255537445|ref|XP_002509789.1| Xyloglucan galactosyltransferase KATAMARI1, putative [Ricinus
           communis]
 gi|223549688|gb|EEF51176.1| Xyloglucan galactosyltransferase KATAMARI1, putative [Ricinus
           communis]
          Length = 567

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 83/346 (23%), Positives = 126/346 (36%), Gaps = 87/346 (25%)

Query: 42  VYVYELPSKYNKKLL---------------------------LKDPRCLTHMFAAEIFMH 74
           +YVY+LPSK+NK LL                           L     LTH ++ E   H
Sbjct: 187 IYVYDLPSKFNKDLLGQCREMIPWTDFCKYFDNEAFGKPIEKLGKGWYLTHQYSLEPIFH 246

Query: 75  RFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKS-PRMMRSAIQLISSNW---- 129
             +L  P R  N  EA  FY P Y   D+    L   FK+    ++  + L    W    
Sbjct: 247 SRILKHPCRVYNENEAKLFYVPYYGGLDI----LRWHFKNVSNDVKDTLALELLKWLESR 302

Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGS 189
             W +  G DH FV+       F  + + +     L L Q    V+   +R    +N+  
Sbjct: 303 KTWLQNSGKDHVFVLGK-ISWDFRRKIDSSWGTRFLQLQQMQNPVKLLIERQPWDVNDIG 361

Query: 190 ITIPPYAPPQK---MQAHQIP-----------------PDTPRSIFVYFRGLFYDVNNDP 229
           I  P +  P     + A Q+                  PD P SI    R +   +N   
Sbjct: 362 IPHPTFFHPHSDDDIVAWQLKIIRTTRKNLLTFAGAARPDQPESI----RSIL--INQCT 415

Query: 230 EGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAV 289
             G   +      + N K+        D P T  E    + FCL P G +P    + +++
Sbjct: 416 SAGDKCK------FLNCKSG-----GCDRPETIIELFAESEFCLQPPGDSPTRKSVFDSL 464

Query: 290 VFGCIPVIIADDIVLPFAD--AIPWE------EIGMFVAEEDVPKL 327
           + GCIPV+       PF      PW       +  +F+ +E+V ++
Sbjct: 465 ISGCIPVLFN-----PFTAYYQYPWHLPEDHSKYSVFIDQEEVRQM 505


>gi|74214457|dbj|BAE31083.1| unnamed protein product [Mus musculus]
          Length = 690

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 44/83 (53%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC    G       L + +  GC+PV+IAD  +LPF++ + W+   + V E
Sbjct: 412 YPQVLQEATFCTVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 471

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +  + +IL +IP   I   QR
Sbjct: 472 EKMSDVYSILQNIPQRQIEEMQR 494


>gi|198416199|ref|XP_002119284.1| PREDICTED: similar to exostoses (multiple) 2, partial [Ciona
           intestinalis]
          Length = 659

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 41/134 (30%), Positives = 60/134 (44%), Gaps = 9/134 (6%)

Query: 256 TDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEI 315
            D    Y E +Q   FCL           L+E++  GCIPV   D  +LPF++ + W   
Sbjct: 292 NDEQKKYPEILQEGTFCLLLPTSRLGQSALMESMQAGCIPVFACDTYILPFSEVLDWSRA 351

Query: 316 GMFVAEEDVPKLDTILTSIPPE--VILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNG 373
            + + E+ +P +  IL  IP E  V+++KQ      S    +    PA       QI+N 
Sbjct: 352 SVLIREDSLPDIMNILRRIPHEQVVLMKKQVEFLYTSYFTNI----PAITMTTL-QIIND 406

Query: 374 LARKLPHDKSTYLK 387
             R  PH  +TY K
Sbjct: 407 --RVFPHVATTYEK 418


>gi|188528923|ref|NP_001120887.1| exostosin 2 [Xenopus (Silurana) tropicalis]
 gi|183986340|gb|AAI66243.1| ext2 protein [Xenopus (Silurana) tropicalis]
          Length = 718

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 32/111 (28%), Positives = 56/111 (50%), Gaps = 9/111 (8%)

Query: 234 YARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGC 293
           Y  GA A     ++N     +  D+P    + +Q + FC+   G       L + +  GC
Sbjct: 289 YTDGAAAHRKRCYRN-----VVYDYP----QILQESTFCIVLRGARLGQSVLSDVLQAGC 339

Query: 294 IPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQR 344
           +PVIIAD  VLPF++ + W+   + + EE + ++ +IL ++P   +   QR
Sbjct: 340 VPVIIADSYVLPFSEVLDWKRASVVIPEEKMFEMYSILQAVPQRQLEEMQR 390


>gi|242018853|ref|XP_002429885.1| Exostosin-1, putative [Pediculus humanus corporis]
 gi|212514919|gb|EEB17147.1| Exostosin-1, putative [Pediculus humanus corporis]
          Length = 725

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 55/235 (23%), Positives = 99/235 (42%), Gaps = 24/235 (10%)

Query: 110 LPFKSPRMMRSAIQLISSNWP-YWNRTEGADH-FFVVPHDFGACFHYQEEKAIERGILPL 167
           LP+ +         L S  WP Y     G D+ F ++     +  H++    +    +PL
Sbjct: 165 LPYWNKGRNHVIFNLYSGTWPDYAEENLGFDYGFSILAKASMSTVHFRPNFDVS---IPL 221

Query: 168 LQRATLVQTFGQRNHVCLNEGSITIPPY--APPQKMQAHQIPPDTPRSIFVYFRGLFYDV 225
             +    +  G+  +V  N   + I  Y  A   K   H I  +T  S+        Y +
Sbjct: 222 FHKNHR-EKGGESGYVSSNNFPV-IKKYVLAFKGKRYVHGIGSETRNSL--------YHL 271

Query: 226 NNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLGWAPWSPR 284
           +N+ +            W+  K+    + + ++    YE + Q + FCL P G    S R
Sbjct: 272 HNEKDMVLVTTCKHGKSWKEMKDERCDEDNQEYDRYDYEVLLQNSTFCLVPRGRRLGSFR 331

Query: 285 LVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVI 339
            +EA+  GCIPV++++  VLPF + I W +  ++  E        +L +I P+++
Sbjct: 332 FLEALQAGCIPVLLSNGWVLPFQEVIDWTKAAVWADER------LLLQAIVPDIV 380


>gi|363742350|ref|XP_003642625.1| PREDICTED: exostosin-1b-like, partial [Gallus gallus]
          Length = 456

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 31/103 (30%), Positives = 54/103 (52%), Gaps = 1/103 (0%)

Query: 243 WENFKNNPLFDISTDHPT-TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 301
           WE  K+      + D+    Y E +  + FC+ P G    S R +EA+   CIPV+++D 
Sbjct: 20  WERHKDTRCDQDNVDYEKFDYQELLHNSTFCIVPRGRRLGSFRFLEALQAACIPVLLSDG 79

Query: 302 IVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQR 344
             LPF++AI W +  +  +E  + ++ + +  I PE +L  Q+
Sbjct: 80  WELPFSEAIDWGKAAVVGSERLLLQIPSAVRCIRPERVLAFQQ 122


>gi|126322251|ref|XP_001370057.1| PREDICTED: exostosin-1 [Monodelphis domestica]
          Length = 746

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y V+N  +            W+  K++     +T++    Y +M   A FCL P G
Sbjct: 280 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDKDNTEYEKYDYREMLHNATFCLVPRG 339

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+   C+PV++++   LPF++ I W +  +   E  + ++ + + SI  +
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWNQAAIIGDERLLLQIPSTIRSIHQD 399

Query: 338 VIL 340
            IL
Sbjct: 400 KIL 402


>gi|395512319|ref|XP_003760388.1| PREDICTED: exostosin-1 [Sarcophilus harrisii]
          Length = 746

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y V+N  +            W+  K++     +T++    Y +M   A FCL P G
Sbjct: 280 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDKDNTEYEKYDYREMLHNATFCLVPRG 339

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+   C+PV++++   LPF++ I W +  +   E  + ++ + + SI  +
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWNQAAIIGDERLLLQIPSTIRSIHQD 399

Query: 338 VIL 340
            IL
Sbjct: 400 KIL 402


>gi|74213823|dbj|BAE29346.1| unnamed protein product [Mus musculus]
          Length = 786

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 44/83 (53%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC    G       L + +  GC+PV+IAD  +LPF++ + W+   + V E
Sbjct: 376 YPQVLQEATFCTVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 435

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +  + +IL +IP   I   QR
Sbjct: 436 EKMSDVYSILQNIPQRQIEEMQR 458


>gi|355686834|gb|AER98200.1| exostoses 2 [Mustela putorius furo]
          Length = 717

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 43/83 (51%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC+   G       L + +  GC+PVI+AD  VLPF++ + W+   + V E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAALSDVLRAGCVPVIVADSYVLPFSEVLDWKRASVVVPE 367

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +  +  IL  IP   I   QR
Sbjct: 368 EKLSDVYGILQGIPRRQIEEMQR 390


>gi|449500758|ref|XP_004161187.1| PREDICTED: xyloglucan galactosyltransferase KATAMARI1 homolog
           [Cucumis sativus]
          Length = 618

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 80/318 (25%), Positives = 111/318 (34%), Gaps = 75/318 (23%)

Query: 38  GKLKVYVYELPSKYNKKLL---------------------------LKDPRCLTHMFAAE 70
           G   +YVYELP+K+NK+L+                           L D    T+ +A E
Sbjct: 222 GGRGIYVYELPAKFNKELVGQCGEMVPWMNFCKYFNNEGLGEKIPELGDGWYNTNQYALE 281

Query: 71  IFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKS-------PRMMRSAIQ 123
              H  +L  P R  N EEA  FY P Y   D+    L   FK+         +    IQ
Sbjct: 282 PIFHSRVLKHPCRVYNQEEAKLFYVPYYGGLDI----LRWHFKNNVTYELKDSLGLELIQ 337

Query: 124 LISSNWPYWNRTEGADHFFV---VPHDF-----GACFHYQEEK-AIERGILPLLQRATLV 174
            +S+  P W +  G DH FV   +  DF     G    Y     +     L L Q    +
Sbjct: 338 WLSAQKP-WAKNSGKDHVFVLGKISWDFRRNANGNINGYNSNNPSWGTKFLELHQLQNPI 396

Query: 175 QTFGQRNHVCLNEGSITIPPYAPPQK---MQAHQIPPDTPRSIFVY-------------F 218
           +   +R     N+  I  P +  P     + A Q      R  ++               
Sbjct: 397 KLLIERQPWHQNDVGIPHPTFFHPHSDDDIFAWQWKAIRSRRKYLVGFAGGARPESSENI 456

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGW 278
           R L  D     EGG   R        N K         D P    E    + FCL P G 
Sbjct: 457 RSLLIDHCTTTEGGRLCRHL------NCKKG-----DCDRPKAVIELFLESEFCLQPPGD 505

Query: 279 APWSPRLVEAVVFGCIPV 296
           +P    + ++++ GCIPV
Sbjct: 506 SPTRKSVFDSLISGCIPV 523


>gi|321463593|gb|EFX74608.1| hypothetical protein DAPPUDRAFT_307216 [Daphnia pulex]
          Length = 724

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 76/321 (23%), Positives = 130/321 (40%), Gaps = 62/321 (19%)

Query: 38  GKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIF-MHRFLLSSPVRTLNPEEADWFYTP 96
           G++ VYVY +      + + KD   +T    +E + + + +  S   T +P EA  F   
Sbjct: 110 GRIGVYVYPVV-----EFVDKDKVPITKKITSEFYNILKAITKSMYFTSDPNEACLFI-- 162

Query: 97  IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADH--FFVVPHDFGACFHY 154
             PT DL            R+    +    S+ PYWN  EG +H  F ++P   G+   Y
Sbjct: 163 --PTIDL--------LNQNRIRPKDVGKALSSLPYWN--EGRNHLIFNILP---GSMPEY 207

Query: 155 QEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG-SITIPPYAPPQKMQAHQIPPDTPRS 213
                ++ G          + T G  + +    G  +++P Y+P       Q   D  R+
Sbjct: 208 HPFLEVDIGYA--------MMTSGGFSSLTYRSGFDVSVPVYSPLAAELRTQSSQD--RN 257

Query: 214 IFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDI--------------STDHP 259
             V        +++ P   Y     R  + E    +P F +                D  
Sbjct: 258 WLV--------ISSQP---YIHEDFREVISEMAAEHPGFLVLNSCGSSPLDTKLRCRDEE 306

Query: 260 TTYYED-MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
           T  + D +  A FCL   G     P L+E++  GCIPV+++D  VLP+ + I W+   + 
Sbjct: 307 TYNFPDILMNATFCLVVRGARLGQPTLMESLAAGCIPVVVSDSYVLPYEEVIDWKSAVLQ 366

Query: 319 VAEEDVPKLDTILTSIPPEVI 339
           + E+D+ K+  +L  +  + I
Sbjct: 367 LYEDDLSKMMDLLRGVSSDRI 387


>gi|148695683|gb|EDL27630.1| exostoses (multiple) 2, isoform CRA_b [Mus musculus]
          Length = 706

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 44/83 (53%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC    G       L + +  GC+PV+IAD  +LPF++ + W+   + V E
Sbjct: 372 YPQVLQEATFCTVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 431

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +  + +IL +IP   I   QR
Sbjct: 432 EKMSDVYSILQNIPQRQIEEMQR 454


>gi|449449896|ref|XP_004142700.1| PREDICTED: xyloglucan galactosyltransferase KATAMARI1 homolog
           [Cucumis sativus]
          Length = 618

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 80/318 (25%), Positives = 111/318 (34%), Gaps = 75/318 (23%)

Query: 38  GKLKVYVYELPSKYNKKLL---------------------------LKDPRCLTHMFAAE 70
           G   +YVYELP+K+NK+L+                           L D    T+ +A E
Sbjct: 222 GGRGIYVYELPAKFNKELVGQCGEMVPWMNFCKYFNNEGLGEKIPELGDGWYNTNQYALE 281

Query: 71  IFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKS-------PRMMRSAIQ 123
              H  +L  P R  N EEA  FY P Y   D+    L   FK+         +    IQ
Sbjct: 282 PIFHSRVLKHPCRVYNQEEAKLFYVPYYGGLDI----LRWHFKNNVTYELKDSLGLELIQ 337

Query: 124 LISSNWPYWNRTEGADHFFV---VPHDF-----GACFHYQEEK-AIERGILPLLQRATLV 174
            +S+  P W +  G DH FV   +  DF     G    Y     +     L L Q    +
Sbjct: 338 WLSAQKP-WAKNSGKDHVFVLGKISWDFRRNANGNINGYNSNNPSWGTKFLELHQLQNPI 396

Query: 175 QTFGQRNHVCLNEGSITIPPYAPPQK---MQAHQIPPDTPRSIFVY-------------F 218
           +   +R     N+  I  P +  P     + A Q      R  ++               
Sbjct: 397 KLLIERQPWHQNDVGIPHPTFFHPHSDDDIFAWQWKAIRSRRKYLVGFAGGARPESSENI 456

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGW 278
           R L  D     EGG   R        N K         D P    E    + FCL P G 
Sbjct: 457 RSLLIDHCTTTEGGRLCRHL------NCKKG-----DCDRPKAVIELFLESEFCLQPPGD 505

Query: 279 APWSPRLVEAVVFGCIPV 296
           +P    + ++++ GCIPV
Sbjct: 506 SPTRKSVFDSLISGCIPV 523


>gi|406700155|gb|EKD03339.1| hypothetical protein A1Q2_02368 [Trichosporon asahii var. asahii
           CBS 8904]
          Length = 1041

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 32/106 (30%), Positives = 50/106 (47%), Gaps = 2/106 (1%)

Query: 261 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
           TY + +    FC  P G A W+ RL + V  GCIPV++ D   + +     W    + V 
Sbjct: 891 TYLDLLGDTRFCPIPFGTAGWTYRLSDVVYAGCIPVLVGDQSHMTYWSMFDWSLFSVQVF 950

Query: 321 EEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDA 366
           E ++  L+ IL ++  E   RKQ  L    ++ A L+P      +A
Sbjct: 951 EHELDHLERILGAVTEEDAQRKQDALM--LVREAFLYPSEGHVEEA 994


>gi|291238132|ref|XP_002738985.1| PREDICTED: tout-velu-like, partial [Saccoglossus kowalevskii]
          Length = 403

 Score = 55.1 bits (131), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 29/78 (37%), Positives = 45/78 (57%), Gaps = 2/78 (2%)

Query: 268 RAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKL 327
            A FC+   G       L +A+  GCIPV+++D  VLPF+  + W+   + + EED+ ++
Sbjct: 1   EATFCIVLRGARLGQTALSDALKAGCIPVVLSDTYVLPFSQVLDWKRAAIRLNEEDLDQV 60

Query: 328 DTILTSIPPEVI--LRKQ 343
            ++L SI P  I  LRKQ
Sbjct: 61  ASVLRSISPTRINSLRKQ 78


>gi|412986477|emb|CCO14903.1| predicted protein [Bathycoccus prasinos]
          Length = 574

 Score = 55.1 bits (131), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 50/187 (26%), Positives = 78/187 (41%), Gaps = 18/187 (9%)

Query: 140 HFFVVPHDFG---ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYA 196
           HF V   + G   A F  Q++ A+   +   L R  + +     +H   NE         
Sbjct: 328 HFIVSNSEIGVENAPFDAQKDVAVIPSLSFYLPREAVSRGLVDLSHFLRNEEEYASVGLE 387

Query: 197 PPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDIST 256
            P K++A        R+I + FRG     NN   G    +  R  +      + +     
Sbjct: 388 LP-KLEAE-------RNIRLMFRG-----NN--RGPLREKVFRYLIENGSPEDSIETTGV 432

Query: 257 DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG 316
             P  Y   M+ + +CL   G    SPRL+E ++FGC+PVI+AD   LP A  + W +  
Sbjct: 433 ASPQAYMSLMEHSKYCLHVRGTRVMSPRLIELMLFGCVPVIVADAYELPLAWFLDWTKFS 492

Query: 317 MFVAEED 323
           + V E +
Sbjct: 493 IRVPESE 499


>gi|187469677|gb|AAI66748.1| Ext2 protein [Rattus norvegicus]
          Length = 718

 Score = 55.1 bits (131), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 44/83 (53%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC    G       L + +  GC+PV+IAD  +LPF++ + W+   + + E
Sbjct: 308 YPQVLQEATFCTVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVIPE 367

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +  + +IL +IP   I   QR
Sbjct: 368 EKMSDVYSILQNIPQRQIEEMQR 390


>gi|166183792|gb|ABY84155.1| exostosin 1 (predicted) [Callithrix jacchus]
          Length = 746

 Score = 55.1 bits (131), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y V+N  +            W+  K++     +T++    Y +M   A FCL P G
Sbjct: 280 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 339

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+   C+PV++++   LPF++ I W +  +   E  + ++ + + SI  +
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 399

Query: 338 VIL 340
            IL
Sbjct: 400 KIL 402


>gi|157820411|ref|NP_001101221.1| exostosin-2 [Rattus norvegicus]
 gi|149022701|gb|EDL79595.1| exostoses (multiple) 2 (predicted) [Rattus norvegicus]
          Length = 670

 Score = 55.1 bits (131), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 44/83 (53%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC    G       L + +  GC+PV+IAD  +LPF++ + W+   + + E
Sbjct: 308 YPQVLQEATFCTVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVIPE 367

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +  + +IL +IP   I   QR
Sbjct: 368 EKMSDVYSILQNIPQRQIEEMQR 390


>gi|426235700|ref|XP_004011818.1| PREDICTED: exostosin-1 [Ovis aries]
          Length = 746

 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y V+N  +            W+  K++     +T++    Y +M   A FCL P G
Sbjct: 280 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 339

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+   C+PV++++   LPF++ I W +  +   E  + ++ + + SI  +
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 399

Query: 338 VIL 340
            IL
Sbjct: 400 KIL 402


>gi|148697308|gb|EDL29255.1| exostoses (multiple) 1, isoform CRA_b [Mus musculus]
          Length = 566

 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y V+N  +            W+  K++     +T++    Y +M   A FCL P G
Sbjct: 294 RNALYHVHNGEDVLLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 353

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+   C+PV++++   LPF++ I W +  +   E  + ++ + + SI  +
Sbjct: 354 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 413

Query: 338 VIL 340
            IL
Sbjct: 414 KIL 416


>gi|73974379|ref|XP_539145.2| PREDICTED: exostosin-1 isoform 1 [Canis lupus familiaris]
          Length = 746

 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y V+N  +            W+  K++     +T++    Y +M   A FCL P G
Sbjct: 280 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 339

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+   C+PV++++   LPF++ I W +  +   E  + ++ + + SI  +
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 399

Query: 338 VIL 340
            IL
Sbjct: 400 KIL 402


>gi|197101852|ref|NP_001125538.1| exostosin-1 [Pongo abelii]
 gi|75042009|sp|Q5RBC3.1|EXT1_PONAB RecName: Full=Exostosin-1; AltName:
           Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
           acetylglucosaminyl-proteoglycan
           4-alpha-N-acetylglucosaminyltransferase; AltName:
           Full=Multiple exostoses protein 1 homolog
 gi|55728386|emb|CAH90937.1| hypothetical protein [Pongo abelii]
          Length = 746

 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y V+N  +            W+  K++     +T++    Y +M   A FCL P G
Sbjct: 280 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 339

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+   C+PV++++   LPF++ I W +  +   E  + ++ + + SI  +
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 399

Query: 338 VIL 340
            IL
Sbjct: 400 KIL 402


>gi|74144073|dbj|BAE22143.1| unnamed protein product [Mus musculus]
          Length = 435

 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 44/83 (53%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC    G       L + +  GC+PV+IAD  +LPF++ + W+   + V E
Sbjct: 25  YPQVLQEATFCTVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 84

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +  + +IL +IP   I   QR
Sbjct: 85  EKMSDVYSILQNIPQRQIEEMQR 107


>gi|46370066|ref|NP_000118.2| exostosin-1 [Homo sapiens]
 gi|20141422|sp|Q16394.2|EXT1_HUMAN RecName: Full=Exostosin-1; AltName:
           Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
           acetylglucosaminyl-proteoglycan
           4-alpha-N-acetylglucosaminyltransferase; AltName:
           Full=Multiple exostoses protein 1; AltName:
           Full=Putative tumor suppressor protein EXT1
 gi|12654671|gb|AAH01174.1| Exostoses (multiple) 1 [Homo sapiens]
 gi|119612378|gb|EAW91972.1| exostoses (multiple) 1 [Homo sapiens]
 gi|123981930|gb|ABM82794.1| exostoses (multiple) 1 [synthetic construct]
 gi|123996761|gb|ABM85982.1| exostoses (multiple) 1 [synthetic construct]
 gi|189053697|dbj|BAG35949.1| unnamed protein product [Homo sapiens]
 gi|261859910|dbj|BAI46477.1| exostoses (multiple) 1 [synthetic construct]
          Length = 746

 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y V+N  +            W+  K++     +T++    Y +M   A FCL P G
Sbjct: 280 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 339

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+   C+PV++++   LPF++ I W +  +   E  + ++ + + SI  +
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 399

Query: 338 VIL 340
            IL
Sbjct: 400 KIL 402


>gi|147902262|ref|NP_001091564.1| exostosin-1 [Bos taurus]
 gi|238686639|sp|A5D7I4.1|EXT1_BOVIN RecName: Full=Exostosin-1; AltName:
           Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
           acetylglucosaminyl-proteoglycan
           4-alpha-N-acetylglucosaminyltransferase; AltName:
           Full=Multiple exostoses protein 1 homolog
 gi|146186820|gb|AAI40569.1| EXT1 protein [Bos taurus]
 gi|296480555|tpg|DAA22670.1| TPA: exostosin-1 [Bos taurus]
          Length = 746

 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y V+N  +            W+  K++     +T++    Y +M   A FCL P G
Sbjct: 280 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 339

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+   C+PV++++   LPF++ I W +  +   E  + ++ + + SI  +
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 399

Query: 338 VIL 340
            IL
Sbjct: 400 KIL 402


>gi|403283504|ref|XP_003933159.1| PREDICTED: exostosin-1 [Saimiri boliviensis boliviensis]
          Length = 746

 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y V+N  +            W+  K++     +T++    Y +M   A FCL P G
Sbjct: 280 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 339

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+   C+PV++++   LPF++ I W +  +   E  + ++ + + SI  +
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 399

Query: 338 VIL 340
            IL
Sbjct: 400 KIL 402


>gi|281183009|ref|NP_001162444.1| exostosin-1 [Papio anubis]
 gi|384475833|ref|NP_001245062.1| exostosin 1 [Macaca mulatta]
 gi|114621433|ref|XP_001141496.1| PREDICTED: exostosin-1 isoform 1 [Pan troglodytes]
 gi|397505670|ref|XP_003823375.1| PREDICTED: exostosin-1 [Pan paniscus]
 gi|238687365|sp|A9X1C8.1|EXT1_PAPAN RecName: Full=Exostosin-1; AltName:
           Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
           acetylglucosaminyl-proteoglycan
           4-alpha-N-acetylglucosaminyltransferase; AltName:
           Full=Multiple exostoses protein 1 homolog
 gi|163781066|gb|ABY40823.1| exostoses 1 (predicted) [Papio anubis]
 gi|355698182|gb|EHH28730.1| Exostosin-1 [Macaca mulatta]
 gi|383410151|gb|AFH28289.1| exostosin-1 [Macaca mulatta]
 gi|387541472|gb|AFJ71363.1| exostosin-1 [Macaca mulatta]
 gi|410224816|gb|JAA09627.1| exostosin 1 [Pan troglodytes]
 gi|410259738|gb|JAA17835.1| exostosin 1 [Pan troglodytes]
 gi|410302704|gb|JAA29952.1| exostosin 1 [Pan troglodytes]
 gi|410353629|gb|JAA43418.1| exostosin 1 [Pan troglodytes]
          Length = 746

 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y V+N  +            W+  K++     +T++    Y +M   A FCL P G
Sbjct: 280 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 339

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+   C+PV++++   LPF++ I W +  +   E  + ++ + + SI  +
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 399

Query: 338 VIL 340
            IL
Sbjct: 400 KIL 402


>gi|311253457|ref|XP_001925015.2| PREDICTED: exostosin-1 [Sus scrofa]
          Length = 746

 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y V+N  +            W+  K++     +T++    Y +M   A FCL P G
Sbjct: 280 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 339

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+   C+PV++++   LPF++ I W +  +   E  + ++ + + SI  +
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 399

Query: 338 VIL 340
            IL
Sbjct: 400 KIL 402


>gi|301774588|ref|XP_002922714.1| PREDICTED: exostosin-1-like [Ailuropoda melanoleuca]
          Length = 746

 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y V+N  +            W+  K++     +T++    Y +M   A FCL P G
Sbjct: 280 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 339

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+   C+PV++++   LPF++ I W +  +   E  + ++ + + SI  +
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 399

Query: 338 VIL 340
            IL
Sbjct: 400 KIL 402


>gi|395521813|ref|XP_003765009.1| PREDICTED: exostosin-like 1 [Sarcophilus harrisii]
          Length = 694

 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 32/144 (22%), Positives = 60/144 (41%), Gaps = 8/144 (5%)

Query: 203 AHQIPPDTPRSIFVYFRGLF------YDVNNDPEGGYYARGARAAVWENFKNNPLFDIST 256
            H IPP  P    + F G        + +  D               +++++ P     T
Sbjct: 219 VHLIPP--PGEDLLAFEGQLDMWGTGFSIQQDQNSILNPSDHHGETIKSYRDGPCKQDHT 276

Query: 257 DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG 316
              +     +  + FC  P      S RL++A+  GC+PV+++    LPFA+ I W    
Sbjct: 277 IEQSEMQNGLHNSTFCFIPSSCHAGSFRLLQALKAGCVPVLLSRGWELPFAEVIDWGTAA 336

Query: 317 MFVAEEDVPKLDTILTSIPPEVIL 340
           + + E  + ++ ++L  +PP  +L
Sbjct: 337 VIIDERHLLQIKSVLQGLPPARVL 360


>gi|1168162|gb|AAB62283.1| putative tumour suppressor/hereditary multiple exostoses candidate
           gene [Homo sapiens]
 gi|1586817|prf||2204384A EXT1 gene
          Length = 746

 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y V+N  +            W+  K++     +T++    Y +M   A FCL P G
Sbjct: 280 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 339

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+   C+PV++++   LPF++ I W +  +   E  + ++ + + SI  +
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 399

Query: 338 VIL 340
            IL
Sbjct: 400 KIL 402


>gi|397643551|gb|EJK75938.1| hypothetical protein THAOC_02324 [Thalassiosira oceanica]
          Length = 505

 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 37/141 (26%), Positives = 60/141 (42%), Gaps = 32/141 (22%)

Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
           +D   A FCLCP G +  + R   +++ GCIPV+++  IVLPF   I +    +FVA +D
Sbjct: 351 QDTFEATFCLCPAGDSDVARRFFTSILAGCIPVVMSQHIVLPFESLIDYSTFVVFVAFDD 410

Query: 324 VPK---------------------------LDTILTSIPPEVILRKQRLLANPSMKRAML 356
                                          D +L     EV+ R++ LL    ++   +
Sbjct: 411 TENAEKNILPTVGDKDEGSTVLRVSNFESVYDALLHMTEEEVLTRRRNLLC---VRDHFV 467

Query: 357 FPQP--AQPGDAFHQILNGLA 375
           + +     PGDA   I+  +A
Sbjct: 468 YRREPGGHPGDAVDTIVAEMA 488


>gi|168013242|ref|XP_001759310.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689623|gb|EDQ75994.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 392

 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 66/277 (23%), Positives = 107/277 (38%), Gaps = 23/277 (8%)

Query: 64  THMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQ 123
           TH +A E+  H  ++    RT +P  A+ FY P Y   D+      L   +    R A+ 
Sbjct: 65  THQYALELVSHARIMKYKCRTEDPNLANLFYIPYYGGLDVIRWHFDL--NATNTNRDALG 122

Query: 124 LISSNW----PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQ 179
                W    P W R  G DH  V+       F  Q        +L   +   +++   +
Sbjct: 123 WKLVRWLEKQPSWRRRGGLDHLLVLGK-ISWDFRRQLRGNWGSRLLEFPEIQNMMRVMIE 181

Query: 180 RNHVCLNEGSITIPPYAPPQKMQA----HQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYA 235
           RN    N+  +  P Y  P+         Q      R+  V F G   +  NDP     A
Sbjct: 182 RNPWSKNDIGVPHPTYFHPKSASDIDTWLQHVKSQERTSLVAFVG--KERRNDPTNVRSA 239

Query: 236 --RGARAAVWENFKNNPLFDISTD---HPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVV 290
             R  R A  E        +   D   HP    +    + FC+ P+G +P    + ++++
Sbjct: 240 LVRQCRGASSEAVCR--FVECKKDLCQHPVFVTKTFVTSQFCMQPVGDSPTRRSVFDSLI 297

Query: 291 FGCIPVIIA-DDIVLPFADAIPWEEI--GMFVAEEDV 324
            GCIPV+       L +A  +P  E    ++++E++V
Sbjct: 298 AGCIPVLFHPATAYLQYAWHLPRNESSWSVYISEDEV 334


>gi|356523730|ref|XP_003530488.1| PREDICTED: xyloglucan galactosyltransferase KATAMARI1-like [Glycine
           max]
          Length = 437

 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 66/273 (24%), Positives = 108/273 (39%), Gaps = 36/273 (13%)

Query: 46  ELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTP 105
           ++  K N+K+LLK+    T+ F+ E+  H  L      T +   A   Y P Y   D+  
Sbjct: 73  KVSKKSNEKVLLKESFYATNQFSLEVIFHNTLKHYKCLTNDSSLASAIYVPYYAGLDVVQ 132

Query: 106 ---TGLPLPFK--SPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAI 160
               G  +  +  SP+ +   +    +  P W R  G DHF VV    G+ F  + E   
Sbjct: 133 YLWGGFNVSIRDASPKELVKWL----AQQPEWKRMWGRDHFMVVGR-IGSDFRRRTENND 187

Query: 161 ERGI-LPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPP----------QKMQAHQIP-- 207
           + G  L LL  A  +      +    NE SI  P Y  P          +KM+  + P  
Sbjct: 188 DWGTKLMLLPEARNMSILSIESGSKENEFSIPYPTYFHPSKDKEVFQWQKKMRKVKRPYL 247

Query: 208 ---PDTPRSIFVYFRGLFY-DVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYY 263
                 PR  + Y   +   ++  + +     +        N+ N+P+      H T  +
Sbjct: 248 FSFAGAPRPYYNYLSSIIRNEIIKECQSSRSCKLLNCNAGHNYCNDPV------HVTKVF 301

Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPV 296
              Q ++FCL P G +       ++++ GCIPV
Sbjct: 302 ---QSSVFCLQPPGDSFTRRSTFDSILAGCIPV 331


>gi|372266629|ref|ZP_09502677.1| exostosin [Alteromonas sp. S89]
          Length = 328

 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 47/86 (54%), Gaps = 3/86 (3%)

Query: 251 LFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAI 310
           L +I       Y E ++   FCLCP G  P S RL EA+ FGC+PV+++DD+ L    A 
Sbjct: 215 LKEIKRREEREYVEVLKETKFCLCPSGSGPNSIRLWEAIKFGCVPVLLSDDLDLRGLSA- 273

Query: 311 PWEEIGMFVAEEDVPKLDTILTSIPP 336
             + + +  ++E+V KL   L +  P
Sbjct: 274 --KALRVTESKEEVEKLPARLANTDP 297


>gi|358256597|dbj|GAA50186.1| glucuronyl/N-acetylglucosaminyl transferase EXT1 [Clonorchis
           sinensis]
          Length = 802

 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 35/94 (37%), Positives = 52/94 (55%), Gaps = 4/94 (4%)

Query: 253 DISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 312
           D++T     Y E M  + FCL P G    S R +EA+   CIPV++++D  LPF++ I W
Sbjct: 342 DMATYDQYDYNELMHNSTFCLVPRGRRLGSYRFLEALEASCIPVMLSNDWELPFSEVIDW 401

Query: 313 EEIGMFVAEEDVP-KLDTILTSIPPEVI--LRKQ 343
            +  +  A+E +P  L  +L  IP   I  LR+Q
Sbjct: 402 SK-AVIWADEHLPLTLSLMLRRIPDYRIVQLRQQ 434


>gi|348588241|ref|XP_003479875.1| PREDICTED: exostosin-1 [Cavia porcellus]
          Length = 746

 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y V+N  +            W+  K++     +T++    Y +M   A FCL P G
Sbjct: 280 RNALYHVHNGEDVLLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 339

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+   C+PV++++   LPF++ I W +  +   E  + ++ + + SI  +
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 399

Query: 338 VIL 340
            IL
Sbjct: 400 KIL 402


>gi|149721624|ref|XP_001496484.1| PREDICTED: exostosin-1 [Equus caballus]
          Length = 746

 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y V+N  +            W+  K++     +T++    Y +M   A FCL P G
Sbjct: 280 RNALYHVHNGEDVLLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 339

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+   C+PV++++   LPF++ I W +  +   E  + ++ + + SI  +
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 399

Query: 338 VIL 340
            IL
Sbjct: 400 KIL 402


>gi|91076424|ref|XP_976077.1| PREDICTED: similar to exostosin-2 isoform 2 [Tribolium castaneum]
 gi|270002589|gb|EEZ99036.1| hypothetical protein TcasGA2_TC004910 [Tribolium castaneum]
          Length = 706

 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 29/89 (32%), Positives = 46/89 (51%), Gaps = 4/89 (4%)

Query: 256 TDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEI 315
            D+P   +E    + FCL   G       L+EA+   CIPV++ D  VLPF++ I W+  
Sbjct: 297 VDYPAILHE----STFCLVFRGERIGQFALLEAMAANCIPVVVMDGAVLPFSNVIDWKRA 352

Query: 316 GMFVAEEDVPKLDTILTSIPPEVILRKQR 344
            +F+ E  +  L  +L  I P+ I + Q+
Sbjct: 353 AVFIMENYLHTLVDVLEKISPQRIKQMQK 381


>gi|112807209|ref|NP_034292.2| exostosin-1 [Mus musculus]
 gi|283837898|ref|NP_001124012.1| exostosin 1 [Rattus norvegicus]
 gi|3023731|sp|P97464.1|EXT1_MOUSE RecName: Full=Exostosin-1; AltName:
           Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
           acetylglucosaminyl-proteoglycan
           4-alpha-N-acetylglucosaminyltransferase; AltName:
           Full=Multiple exostoses protein 1 homolog
 gi|1813640|gb|AAB41728.1| Ext1 [Mus musculus]
 gi|13435765|gb|AAH04741.1| Exostoses (multiple) 1 [Mus musculus]
 gi|74205633|dbj|BAE21106.1| unnamed protein product [Mus musculus]
 gi|148697307|gb|EDL29254.1| exostoses (multiple) 1, isoform CRA_a [Mus musculus]
 gi|149066396|gb|EDM16269.1| similar to Ext1 [Rattus norvegicus]
          Length = 746

 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y V+N  +            W+  K++     +T++    Y +M   A FCL P G
Sbjct: 280 RNALYHVHNGEDVLLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 339

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+   C+PV++++   LPF++ I W +  +   E  + ++ + + SI  +
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 399

Query: 338 VIL 340
            IL
Sbjct: 400 KIL 402


>gi|291388448|ref|XP_002710791.1| PREDICTED: exostosin 1 [Oryctolagus cuniculus]
          Length = 746

 Score = 54.7 bits (130), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y V+N  +            W+  K++     +T++    Y +M   A FCL P G
Sbjct: 280 RNALYHVHNGEDVLLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 339

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+   C+PV++++   LPF++ I W +  +   E  + ++ + + SI  +
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 399

Query: 338 VIL 340
            IL
Sbjct: 400 KIL 402


>gi|383847108|ref|XP_003699197.1| PREDICTED: exostosin-1-like [Megachile rotundata]
          Length = 711

 Score = 54.7 bits (130), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 33/124 (26%), Positives = 57/124 (45%), Gaps = 1/124 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y ++N  +  +         W   ++      + ++    YE +   A FCL P G
Sbjct: 254 RNALYHLHNGKDLVFVTTCRHGKAWRELQDEHCQQDNQEYDMYDYEILLMNATFCLVPRG 313

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+  GCIPVI+++   LPF + I W +  +F  E  + ++  I+ S+   
Sbjct: 314 RRLGSFRFLEALRAGCIPVILSNGWALPFHERIDWTQAVIFSDERLLLQIPDIVRSVSNV 373

Query: 338 VILR 341
            IL+
Sbjct: 374 QILK 377


>gi|358058807|dbj|GAA95770.1| hypothetical protein E5Q_02427 [Mixia osmundae IAM 14324]
          Length = 492

 Score = 54.7 bits (130), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 29/71 (40%), Positives = 39/71 (54%), Gaps = 8/71 (11%)

Query: 40  LKVYVYELPSKYNKKLLLKDPR--------CLTHMFAAEIFMHRFLLSSPVRTLNPEEAD 91
            K+YVY+LP K+   L+             C+   F AEI MHR+LL S  RT + EEAD
Sbjct: 109 FKIYVYDLPQKHTTDLVESVMSLNYFCTWDCVHAQFTAEIQMHRYLLQSCARTDDIEEAD 168

Query: 92  WFYTPIYPTCD 102
            +Y P+Y T +
Sbjct: 169 LYYLPVYVTAE 179



 Score = 51.2 bits (121), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 45/158 (28%), Positives = 67/158 (42%), Gaps = 30/158 (18%)

Query: 242 VWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 301
            +E F  N   D   DHP      M  + FC  P G  P + RL  A+  GCIPVI +D+
Sbjct: 338 TYERFGLNLTRD---DHP------MFHSAFCAVPAGDTPTTRRLFNAIFAGCIPVIFSDE 388

Query: 302 IVLPFADA-IPWE-----------EIGMFVAEEDVPKLDTILTSIPPEVILRKQR----- 344
           +VLPF  + IP+E           E G+    +D+ ++    T+   +  L   R     
Sbjct: 389 LVLPFHRSQIPYEDMLLRYPMDQDEAGLHKIYDDLHRMQLDGTADKMQRALNHHRRKLEF 448

Query: 345 ---LLANPSMK-RAMLFPQPAQPGDAFHQILNGLARKL 378
                  P +   +   P  +Q GDAF   +  LA ++
Sbjct: 449 ELPFEMTPGVNYTSFRGPDRSQTGDAFDYGMRELAMRV 486


>gi|221506379|gb|EEE32014.1| conserved hypothetical protein [Toxoplasma gondii VEG]
          Length = 1692

 Score = 54.7 bits (130), Expect = 9e-05,   Method: Composition-based stats.
 Identities = 35/92 (38%), Positives = 46/92 (50%), Gaps = 4/92 (4%)

Query: 258  HPTTYYEDMQRAIFCLCPL-GWAPWSPRLVEAVVFGCIPVIIA--DDIVLPFADAIPWEE 314
            H  T    + RA FCLCP  GW P S  + EAV   CIPVII   +++ LP      W +
Sbjct: 1529 HAATKSFFISRATFCLCPQEGWLP-SMCIFEAVAHACIPVIIGGEEELWLPGGCLFDWIQ 1587

Query: 315  IGMFVAEEDVPKLDTILTSIPPEVILRKQRLL 346
            + +FV     P    IL   P   +L KQ++L
Sbjct: 1588 MAVFVPLSRAPYTSLILALTPDNQVLEKQQML 1619


>gi|237844077|ref|XP_002371336.1| hypothetical protein TGME49_017690 [Toxoplasma gondii ME49]
 gi|211969000|gb|EEB04196.1| hypothetical protein TGME49_017690 [Toxoplasma gondii ME49]
          Length = 1692

 Score = 54.7 bits (130), Expect = 9e-05,   Method: Composition-based stats.
 Identities = 35/92 (38%), Positives = 46/92 (50%), Gaps = 4/92 (4%)

Query: 258  HPTTYYEDMQRAIFCLCPL-GWAPWSPRLVEAVVFGCIPVIIA--DDIVLPFADAIPWEE 314
            H  T    + RA FCLCP  GW P S  + EAV   CIPVII   +++ LP      W +
Sbjct: 1529 HAATKSFFISRATFCLCPQEGWLP-SMCIFEAVAHACIPVIIGGEEELWLPGGCLFDWIQ 1587

Query: 315  IGMFVAEEDVPKLDTILTSIPPEVILRKQRLL 346
            + +FV     P    IL   P   +L KQ++L
Sbjct: 1588 MAVFVPLSRAPYTSLILALTPDNQVLEKQQML 1619


>gi|183637125|gb|ACC64545.1| exostosin 1 (predicted) [Rhinolophus ferrumequinum]
          Length = 746

 Score = 54.7 bits (130), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y V+N  +            W+  K++     +T++    Y +M   A FCL P G
Sbjct: 280 RNALYHVHNGEDVLLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 339

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+   C+PV++++   LPF++ I W +  +   E  + ++ + + SI  +
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 399

Query: 338 VIL 340
            IL
Sbjct: 400 KIL 402


>gi|350537983|ref|NP_001233696.1| exostosin-1 [Cricetulus griseus]
 gi|20138355|sp|Q9JK82.1|EXT1_CRIGR RecName: Full=Exostosin-1; AltName: Full=Heparan sulfate
           copolymerase; AltName: Full=Multiple exostoses protein 1
           homolog
 gi|7960285|gb|AAF71276.1|AF252858_1 exostosin 1 [Cricetulus griseus]
          Length = 746

 Score = 54.7 bits (130), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y V+N  +            W+  K++     +T++    Y +M   A FCL P G
Sbjct: 280 RNALYHVHNGEDVLLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 339

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+   C+PV++++   LPF++ I W +  +   E  + ++ + + SI  +
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 399

Query: 338 VIL 340
            IL
Sbjct: 400 KIL 402


>gi|168067686|ref|XP_001785740.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162662615|gb|EDQ49446.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 392

 Score = 54.7 bits (130), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 73/308 (23%), Positives = 122/308 (39%), Gaps = 34/308 (11%)

Query: 64  THMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKS-PRMMRSAI 122
           T  F  EI  H  L   P RT N EEA  FY P Y   DL        F +  R+    I
Sbjct: 66  TDQFTIEITFHERLKIHPCRTTNTEEASLFYIPFYHGIDLIKNLYNTDFVARDRLTLLFI 125

Query: 123 QLISSNWPYWNRTEGADHFFVVPH---DFGACFHYQEEKAIERGILPLLQRATLVQTFGQ 179
           + + S  P W R +G  H  V+     DF     Y ++K     +L   +   + +   +
Sbjct: 126 KWLRSQKP-WQRYQGKRHVLVLGRIVWDF--IRDYSKDKTWGSSLLTHPELTNVTKLLIE 182

Query: 180 RNHVCLNEGSITIP---PYAPPQK--MQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYY 234
           R+     + ++ +P    + P  +  ++A Q      R++  + R  F  +         
Sbjct: 183 RD--IWKDDTLGVPYPTSFHPSSESDLRAWQ------RTVRTFKRHKFVSLAGATRDNKL 234

Query: 235 ARGARAAVWENFKNNP-LFDISTD------HPTTYYEDMQRAIFCLCPLGWAPWSPRLVE 287
               R AV+E   N+     I+ +      +P    +    ++FCL P G +P    + +
Sbjct: 235 TGLIRDAVFEQCANSSRCHSIACNDGWCKRNPQVIVQMGLESVFCLQPPGDSPTRKGIFD 294

Query: 288 AVVFGCIPVII-ADDIVLPFADAIP--WEEIGMFVAEEDVPKLD----TILTSIPPEVIL 340
           ++  GCIPV+       L +   +P    +  + V+EEDV   +      L+ IP   + 
Sbjct: 295 SLQTGCIPVVFNRQQAALQYLSHLPGNHSDYSVIVSEEDVCDHNYDIMNHLSRIPLAEVA 354

Query: 341 RKQRLLAN 348
           R Q  + N
Sbjct: 355 RMQANVVN 362


>gi|380027202|ref|XP_003697318.1| PREDICTED: exostosin-1-like [Apis florea]
          Length = 711

 Score = 54.7 bits (130), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 33/124 (26%), Positives = 57/124 (45%), Gaps = 1/124 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y ++N  +  +         W   ++      + ++    YE +   A FCL P G
Sbjct: 254 RNALYHLHNGKDLVFVTTCRHGKAWRELQDEHCQQDNQEYDMYDYEILLMNATFCLVPRG 313

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+  GCIPVI+++   LPF + I W +  +F  E  + ++  I+ S+   
Sbjct: 314 RRLGSFRFLEALRAGCIPVILSNGWALPFHERIDWTQAVIFSDERLLLQIPDIVRSVSNV 373

Query: 338 VILR 341
            IL+
Sbjct: 374 HILK 377


>gi|1235559|emb|CAA65443.1| ext1 [Mus musculus]
          Length = 745

 Score = 54.3 bits (129), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y V+N  +            W+  K++     +T++    Y +M   A FCL P G
Sbjct: 280 RNALYHVHNGEDVLLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 339

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+   C+PV++++   LPF++ I W +  +   E  + ++ + + SI  +
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 399

Query: 338 VIL 340
            IL
Sbjct: 400 KIL 402


>gi|326918032|ref|XP_003205297.1| PREDICTED: exostosin-1-like, partial [Meleagris gallopavo]
          Length = 535

 Score = 54.3 bits (129), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 32/123 (26%), Positives = 59/123 (47%), Gaps = 1/123 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y V+N  +            W+  K++     +T++    Y +M   A FCL P G
Sbjct: 118 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 177

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+   C+PV++++   LPF++ I W++  +   E  + ++ + + SI  +
Sbjct: 178 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWKQAAVIGDERLLLQIPSTIRSIHQD 237

Query: 338 VIL 340
            IL
Sbjct: 238 KIL 240


>gi|66517433|ref|XP_391845.2| PREDICTED: exostosin-1 [Apis mellifera]
          Length = 711

 Score = 54.3 bits (129), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 33/124 (26%), Positives = 57/124 (45%), Gaps = 1/124 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y ++N  +  +         W   ++      + ++    YE +   A FCL P G
Sbjct: 254 RNALYHLHNGKDLVFVTTCRHGKAWRELQDEHCQQDNQEYDMYDYEILLMNATFCLVPRG 313

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+  GCIPVI+++   LPF + I W +  +F  E  + ++  I+ S+   
Sbjct: 314 RRLGSFRFLEALRAGCIPVILSNGWALPFHERIDWTQAVIFSDERLLLQIPDIVRSVSNV 373

Query: 338 VILR 341
            IL+
Sbjct: 374 HILK 377


>gi|355686831|gb|AER98199.1| exostoses 1 [Mustela putorius furo]
          Length = 702

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y V+N  +            W+  K++     +T++    Y +M   A FCL P G
Sbjct: 236 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 295

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+   C+PV++++   LPF++ I W +  +   E  + ++ + + SI  +
Sbjct: 296 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 355

Query: 338 VIL 340
            IL
Sbjct: 356 KIL 358


>gi|190402228|gb|ACE77644.1| exostosin 1 (predicted) [Sorex araneus]
          Length = 746

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y V+N  +            W+  K++     +T++    Y +M   A FCL P G
Sbjct: 280 RNALYHVHNGEDILLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 339

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+   C+PV++++   LPF++ I W +  +   E  + ++ + + SI  +
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 399

Query: 338 VIL 340
            IL
Sbjct: 400 KIL 402


>gi|226526925|gb|ACO71282.1| exostoses 1 (predicted) [Dasypus novemcinctus]
          Length = 744

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y V+N  +            W+  K++     +T++    Y +M   A FCL P G
Sbjct: 278 RNALYHVHNGEDILLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 337

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+   C+PV++++   LPF++ I W +  +   E  + ++ + + SI  +
Sbjct: 338 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 397

Query: 338 VIL 340
            IL
Sbjct: 398 KIL 400


>gi|440797580|gb|ELR18663.1| exostosin, putative [Acanthamoeba castellanii str. Neff]
          Length = 423

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 33/121 (27%), Positives = 52/121 (42%), Gaps = 9/121 (7%)

Query: 227 NDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLV 286
           +DP+ G   +    +V E     P  ++       Y + +    F LCP G A +S R  
Sbjct: 251 HDPDNGVIIQ---TSVEEQIGGKPSVEVE------YLDTLFHTQFTLCPRGRALYSYRTT 301

Query: 287 EAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLL 346
           EA+  G IPVI+ D    P+ + I W    + + E     +  +L S   E I R +R +
Sbjct: 302 EAIAAGAIPVILGDGYAFPYNELIDWRSFAVILPESSWETMMDVLRSFTSEEIARMRRNM 361

Query: 347 A 347
            
Sbjct: 362 G 362


>gi|260786761|ref|XP_002588425.1| hypothetical protein BRAFLDRAFT_153069 [Branchiostoma floridae]
 gi|229273587|gb|EEN44436.1| hypothetical protein BRAFLDRAFT_153069 [Branchiostoma floridae]
          Length = 107

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 31/79 (39%), Positives = 44/79 (55%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y   +  + FCL P G    S R +EA+   CIPVI A+   LPF++ I W+   +   E
Sbjct: 3   YNTLLHNSTFCLVPRGRRLGSFRFLEALQAACIPVIEANGWELPFSEVIEWDRATITADE 62

Query: 322 EDVPKLDTILTSIPPEVIL 340
             + +L +IL +IPPE IL
Sbjct: 63  RLLFQLPSILRAIPPEKIL 81


>gi|449495258|ref|XP_004159780.1| PREDICTED: xyloglucan galactosyltransferase KATAMARI1-like [Cucumis
           sativus]
          Length = 490

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 75/358 (20%), Positives = 126/358 (35%), Gaps = 45/358 (12%)

Query: 52  NKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPT----G 107
           ++ + L +    T+ F  E+  H  +      T +   A   Y P Y   D++       
Sbjct: 120 SRGVFLNNSWFKTNQFLLEVIFHNKMKQYECLTNDSAMASAVYVPFYAGLDISHYLWNPS 179

Query: 108 LPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGI-LP 166
           + +   S R   S+I    S  P W R  G DHFFV        F  Q ++  + G  L 
Sbjct: 180 ITIRDSSARDFLSSI----SEKPEWKRMFGRDHFFVAGR-ISWDFRRQTDEVSDWGSKLR 234

Query: 167 LLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQA----HQIPPDTPRSIFVYFRGLF 222
            L  +  +           N+ +I  P Y  P K+        +     R     F G  
Sbjct: 235 FLSESHNMTMLSVEASSWKNDFAIPYPTYFHPSKLSEIVEWQSLMRARQRQHLFTFTGAP 294

Query: 223 YDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWS 282
                D   G      R +    F +     ++ D PTT  E  Q +IFCL P G +   
Sbjct: 295 RPDLTDSIRGMVIEQCRGSSLCKFIDCSSDGVNCDDPTTTMEAFQSSIFCLQPPGDSYTR 354

Query: 283 PRLVEAVVFGCIPVII------------------ADDIVLPFADAIPWEEIGMFVAEEDV 324
             + ++++ GCIPV                    A  + +P  +   W+E          
Sbjct: 355 RSIFDSILAGCIPVFFHPGTAYSQYLWHFPKNQTAYSVFIPVRNVKKWDE---------- 404

Query: 325 PKLDTILTSIPP--EVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPH 380
             ++ IL+ I    E  +R++ +   PS+       +     DAF   + G+  ++ +
Sbjct: 405 -SIEGILSGISKDRESSMREEVIRVIPSIVYGDPRSKIGNLEDAFDLAVKGILERVEN 461


>gi|159477445|ref|XP_001696821.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158275150|gb|EDP00929.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 191

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 47/158 (29%), Positives = 66/158 (41%), Gaps = 23/158 (14%)

Query: 181 NHVCLNEGSITIPPYAPPQKMQAHQ---IPPDTPRSIFVYFRGLFYDVNNDP-EGGYYAR 236
            H C   G   + P   P    AH      P   R I +Y RG     +  P    +Y+R
Sbjct: 37  GHPCYTPGKDLVIPLFKPPGHYAHSPLLGAPPLQRDILLYLRG-----DTGPYRAHWYSR 91

Query: 237 GARAAVWENFKNNPLFDISTDH----------PTTYYEDMQRAIFCLCPLGWAPWSPRLV 286
           G R  +    K   + D + +H          P TY E + R+IFC+   G   +S R  
Sbjct: 92  GIRQRL---AKLAYMHDWAEEHRIFVGEQFMIPGTYSEHLARSIFCVVAPG-DGYSGRGE 147

Query: 287 EAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDV 324
           +AV+ GCIP+II D +   F   I W    + +AE  V
Sbjct: 148 DAVLHGCIPLIIMDGVHAVFESIIDWSAFSIRIAESAV 185


>gi|380799435|gb|AFE71593.1| exostosin-1, partial [Macaca mulatta]
          Length = 542

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y V+N  +            W+  K++     +T++    Y +M   A FCL P G
Sbjct: 76  RNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 135

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+   C+PV++++   LPF++ I W +  +   E  + ++ + + SI  +
Sbjct: 136 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 195

Query: 338 VIL 340
            IL
Sbjct: 196 KIL 198


>gi|294881106|ref|XP_002769247.1| hypothetical protein Pmar_PMAR007658 [Perkinsus marinus ATCC 50983]
 gi|239872525|gb|EER01965.1| hypothetical protein Pmar_PMAR007658 [Perkinsus marinus ATCC 50983]
          Length = 75

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 21/53 (39%), Positives = 34/53 (64%)

Query: 261 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE 313
           TY  +M+ + FC  P G  PW+ R+ +A++ GCIPV++++ IV PF   + W 
Sbjct: 12  TYDAEMKDSTFCFIPRGNTPWTRRIFDAIISGCIPVVLSNAIVFPFESLLDWS 64


>gi|219129557|ref|XP_002184953.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217403738|gb|EEC43689.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 982

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 64/286 (22%), Positives = 106/286 (37%), Gaps = 45/286 (15%)

Query: 67  FAAEIFMHRFLLS--SPVRTLNPEEADWFYTPIYPTCD------------LTPTGLP-LP 111
           +A E+ +   +++  S +RT NP+EA  FY P  P+ +            L+P G   L 
Sbjct: 606 YATELLVREVMINPKSCLRTYNPDEATLFYVPYLPSVEHHKGSKYINDMALSPYGNAILD 665

Query: 112 FKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGIL------ 165
                   +          YW R  GADH  V        +H ++ +     I       
Sbjct: 666 ILDKDNYTAWENTFGLTAKYWKRHGGADHILVFSEPMHGLWHPRQRRGNYHFIHSQKQLH 725

Query: 166 -PLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQ----KMQAHQIPPDTP--RSIFVYF 218
            P++    L  TF +    C  +  +   P    +    K  +  +   T    S+ V  
Sbjct: 726 PPIVISVELSTTFVKMYPKCAAKNILMPYPNTDGRWFNGKHHSEAVKASTAWNASLKVSI 785

Query: 219 RGL----------------FYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTY 262
             L                FY   N        +   +   +   ++ LF  +    ++Y
Sbjct: 786 AALPEEQLLGQEPARPIAQFYGAGNHGTCKQLRQAMASDYSQCALSSKLFKQNV-KISSY 844

Query: 263 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
              M  A FC CP G +P + R+ +AV+ GCIP+I++ D V PF +
Sbjct: 845 VIGMNLASFCPCPGGDSPSAKRMFDAVLAGCIPIILSQDFVWPFTN 890


>gi|77748248|gb|AAI05840.1| Ext1 protein [Rattus norvegicus]
          Length = 557

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y V+N  +            W+  K++     +T++    Y +M   A FCL P G
Sbjct: 91  RNALYHVHNGEDVLLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 150

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+   C+PV++++   LPF++ I W +  +   E  + ++ + + SI  +
Sbjct: 151 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 210

Query: 338 VIL 340
            IL
Sbjct: 211 KIL 213


>gi|118781611|ref|XP_311558.3| AGAP010388-PA [Anopheles gambiae str. PEST]
 gi|116130033|gb|EAA07205.3| AGAP010388-PA [Anopheles gambiae str. PEST]
          Length = 744

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 29/99 (29%), Positives = 52/99 (52%), Gaps = 1/99 (1%)

Query: 243 WENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 301
           W + ++    + + ++    YE + Q + FCL P G    S R +E +  GCIPV++++ 
Sbjct: 298 WRDLQDARCDEDNREYDRYDYETLLQNSTFCLVPRGRRLGSFRFLEVLQAGCIPVLLSNS 357

Query: 302 IVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVIL 340
            VLPF   I W++  ++  E  + ++  I+ SI    IL
Sbjct: 358 WVLPFQSKIDWKQAAIWADERLLLQVPDIVRSISTSRIL 396


>gi|449662559|ref|XP_002154865.2| PREDICTED: exostosin-2-like [Hydra magnipapillata]
          Length = 729

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 23/59 (38%), Positives = 35/59 (59%)

Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDV 324
           +Q   FCL   G+   S  L++A++ GCIPVI+ +D VLPF + I W    + V E+ +
Sbjct: 349 LQEGTFCLLLPGYYYGSSLLLDAMMMGCIPVIMMNDYVLPFNEVIDWSRAAIIVREQQI 407


>gi|241997492|ref|XP_002433395.1| exostosin-2, putative [Ixodes scapularis]
 gi|215490818|gb|EEC00459.1| exostosin-2, putative [Ixodes scapularis]
          Length = 714

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 39/139 (28%), Positives = 64/139 (46%), Gaps = 14/139 (10%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +    FCL           L +A++ GC+PVI+AD+ +LPF++ + W+   + + E
Sbjct: 300 YPDILAEGKFCLVVRAARLGQSVLSDALMAGCVPVIVADEYILPFSEVLDWKRAAIQIRE 359

Query: 322 EDVPKLDTILTSIPPEVI--LRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
           +D+  L T+L  +    +  +R Q LL      R+M     A+       ++N   R  P
Sbjct: 360 DDLEDLVTVLKGVSKARLFEMRSQALLLWDRYFRSM-----AKIALTTLDVIND--RVFP 412

Query: 380 HDKSTY-----LKPGGKFL 393
           H   TY     L PG  F 
Sbjct: 413 HIAKTYEDWNDLPPGVSFF 431


>gi|357481629|ref|XP_003611100.1| Xyloglucan galactosyltransferase KATAMARI1 [Medicago truncatula]
 gi|355512435|gb|AES94058.1| Xyloglucan galactosyltransferase KATAMARI1 [Medicago truncatula]
          Length = 547

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 69/309 (22%), Positives = 112/309 (36%), Gaps = 44/309 (14%)

Query: 25  SGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCL--------------------- 63
           S +  +   DD  G   +YVY+LPSK+NK L+ +    L                     
Sbjct: 151 SENKNNATCDDGKG---IYVYDLPSKFNKDLVGQCSDMLPWQDFCRYISNEGFGEPISKL 207

Query: 64  ------THMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKS-PR 116
                 TH ++ E+  H  +L  P R  N  +A  FY P Y   D+    L   F++   
Sbjct: 208 GKGWYKTHQYSLELIFHSKVLKHPCRVYNENDAKLFYVPFYGGLDV----LRWHFQNVSN 263

Query: 117 MMRSAIQLISSNW----PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRAT 172
            ++ ++ L    W      W R  G DH FV+       F    +      +L L +   
Sbjct: 264 DVKDSLSLELVKWLERQVNWKRNLGKDHVFVLGK-ISWDFRRTSDSPWGTRLLELEKLQN 322

Query: 173 LVQTFGQRNHVCLNEGSITIPPYAPPQK----MQAHQIPPDTPRSIFVYFRGLFYDVNND 228
            ++   +R    +N+  I  P Y  P+     +        + R   V F G   D  +D
Sbjct: 323 PIKLLIERQPWHVNDIGIPHPTYFHPKSDNDIIDWQLKIIRSNRRNLVSFAGAARDHADD 382

Query: 229 PEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEA 288
                      +      K         + P +  E    + FCL P G +P    + ++
Sbjct: 383 HIRSILINQCSSESDGKCKFLNCSSAKCNEPESIIELFVESEFCLQPPGDSPTRKSVFDS 442

Query: 289 VVFGCIPVI 297
           ++ GCIPV+
Sbjct: 443 LISGCIPVL 451


>gi|405969796|gb|EKC34747.1| Exostosin-2 [Crassostrea gigas]
          Length = 698

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/84 (32%), Positives = 46/84 (54%)

Query: 261 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
           TY + +Q + FC+           L +A+  GCIPVI+AD  VLPF++ + W+   + + 
Sbjct: 289 TYPDILQDSKFCMVLRSARLGHTALSDALRTGCIPVIVADGYVLPFSEVLDWKRAAVVIR 348

Query: 321 EEDVPKLDTILTSIPPEVILRKQR 344
           EE++  +  +L S   E I + +R
Sbjct: 349 EENLKDVVEVLKSYSMERIYQMRR 372


>gi|390332053|ref|XP_003723408.1| PREDICTED: LOW QUALITY PROTEIN: exostosin-2-like
           [Strongylocentrotus purpuratus]
          Length = 707

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 29/80 (36%), Positives = 45/80 (56%), Gaps = 2/80 (2%)

Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
           +Q A FC+           L +A+  GCIPVII+D  +LPF++ I W+   + V E+ +P
Sbjct: 301 LQDATFCIVLRRTRLGQAALSDALQAGCIPVIISDAYILPFSEVIDWKRASLVVREDRIP 360

Query: 326 KLDTILTSIPPEVI--LRKQ 343
            L  IL ++  E I  +R+Q
Sbjct: 361 DLPDILHAVELEHIYEMRQQ 380


>gi|390338044|ref|XP_783281.3| PREDICTED: exostosin-1b-like [Strongylocentrotus purpuratus]
          Length = 702

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 79/329 (24%), Positives = 130/329 (39%), Gaps = 69/329 (20%)

Query: 40  LKVYVYELPSKYNK-KLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
            KVY+Y  P  Y K   L  D   LT + A+  F           T N +EA  F T I 
Sbjct: 63  FKVYIY--PVSYQKPSQLYSD--ILTSIRASRYF-----------TTNADEACLFVTSI- 106

Query: 99  PTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
            T D       L     + + S+I  +S    +WN   G +H     +  G    Y E+ 
Sbjct: 107 DTLDRDK----LSSTYVKNIESSISKLS----HWN--NGKNHLIFNLYS-GTWPDYSEDL 155

Query: 159 AIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAP--PQK------MQAHQIPPDT 210
               G   +L +A+  + + + N        I++P +A   PQK      +Q +  P   
Sbjct: 156 GFNIG-EAILAKASFSEQYFRPNF------DISLPLFAKTHPQKGGNSGDLQGNNFP--V 206

Query: 211 PRSIFVYFRG--LFYDVNNDPEGGYY-----------ARGARAAVWENFKNNPLFDISTD 257
            R   + F+G    + + +D     Y                   W+  K     D+  D
Sbjct: 207 QRKYLLAFKGKRYLFGIGSDTRNALYHLHNGVDIILLTTCKHGKNWQKHK-----DVRCD 261

Query: 258 HPTTYYED------MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
           H    ++       +  + FCL P G    S R +E++   CIP+++++   LPF++ I 
Sbjct: 262 HDNAEFDKFDYQVLLHNSTFCLVPRGRRLGSFRFLESLQAACIPMLLSNGWELPFSEVID 321

Query: 312 WEEIGMFVAEEDVPKLDTILTSIPPEVIL 340
           W +  +F  E  + ++ +I+ SI  E IL
Sbjct: 322 WSKAVVFGDERLLLQVPSIVRSITAEQIL 350


>gi|387015838|gb|AFJ50038.1| Exostosin-1-like [Crotalus adamanteus]
          Length = 751

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 32/123 (26%), Positives = 57/123 (46%), Gaps = 1/123 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y V+N  +            W+  K++     + ++    Y +M   A FCL P G
Sbjct: 285 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDRDNAEYEKYDYREMLHNATFCLVPRG 344

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+   CIPV++++   LPF++ I W +  +   E  + ++ + + SI  +
Sbjct: 345 RRLGSFRFLEALQAACIPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 404

Query: 338 VIL 340
            IL
Sbjct: 405 KIL 407


>gi|363731065|ref|XP_003640902.1| PREDICTED: exostosin-1-like [Gallus gallus]
          Length = 740

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 31/123 (25%), Positives = 58/123 (47%), Gaps = 1/123 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y V+N  +            W+  K++     + ++    Y +M   A FCL P G
Sbjct: 274 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDRDNAEYEKYDYREMLHNATFCLVPRG 333

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+   C+PV++++   LPF++ I W++  +   E  + ++ + + SI  +
Sbjct: 334 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWKQAAVIGDERLLLQIPSTIRSIHQD 393

Query: 338 VIL 340
            IL
Sbjct: 394 KIL 396


>gi|255633864|gb|ACU17293.1| unknown [Glycine max]
          Length = 57

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 23/26 (88%), Positives = 26/26 (100%)

Query: 22 ERISGSAGDVLEDDPVGKLKVYVYEL 47
          ERISGSAGDVLE+DPVG+LKV+VYEL
Sbjct: 32 ERISGSAGDVLENDPVGRLKVFVYEL 57


>gi|224046654|ref|XP_002200457.1| PREDICTED: exostosin-1 [Taeniopygia guttata]
          Length = 741

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 31/123 (25%), Positives = 57/123 (46%), Gaps = 1/123 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y V+N  +            W+  K++     + ++    Y +M   A FCL P G
Sbjct: 275 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDRDNAEYEKYDYREMLHNATFCLVPRG 334

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+   C+PV++++   LPF++ I W +  +   E  + ++ + + SI  +
Sbjct: 335 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWNQAAIIGDERLLLQIPSTIRSIHQD 394

Query: 338 VIL 340
            IL
Sbjct: 395 KIL 397


>gi|452825321|gb|EME32318.1| glucuronyl/N-acetylglucosaminyl transferase EXT1 [Galdieria
           sulphuraria]
          Length = 354

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 56/207 (27%), Positives = 90/207 (43%), Gaps = 47/207 (22%)

Query: 130 PYWNRTEGADHFFVVPHDF--GACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNE 187
           PYWN   G +H  +   DF  G C     E   + G   LL+       F  R++    +
Sbjct: 144 PYWNL--GINHMVL---DFLDGKC-----EGQHDVGFAALLKTHLSSDIF--RSNF---D 188

Query: 188 GSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFK 247
            S+ +PP       + H    + P+   + F+G  Y +        +  G   ++  NF 
Sbjct: 189 KSLPLPP-------RQHFRILEHPKKFLITFKGTSYPL--------HTAGGIRSILNNFH 233

Query: 248 N--------NPLFDISTDHPT--TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVI 297
           N         P F + T      +Y E ++++ FCL P G +P + RL+E +  GCIPV 
Sbjct: 234 NGKDIIFVTKPQFSVPTPEYDLWSYEELVEQSKFCLVPQGRSPSTFRLLEVMSAGCIPVF 293

Query: 298 IADDI----VLPFADAIPWEEIGMFVA 320
           + D      ++P +D I WE+I  F+A
Sbjct: 294 LFDTTAAKYIMPISDEIAWEQIS-FIA 319


>gi|327280286|ref|XP_003224883.1| PREDICTED: exostosin-1-like [Anolis carolinensis]
          Length = 753

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 31/123 (25%), Positives = 57/123 (46%), Gaps = 1/123 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y V+N  +            W+  K++     + ++    Y +M   A FCL P G
Sbjct: 287 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDRDNAEYEKYDYREMLHNATFCLVPRG 346

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+   C+PV++++   LPF++ I W +  +   E  + ++ + + SI  +
Sbjct: 347 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWNQAAVIGDERLLLQIPSTIRSIHQD 406

Query: 338 VIL 340
            IL
Sbjct: 407 KIL 409


>gi|395818023|ref|XP_003782438.1| PREDICTED: exostosin-1 [Otolemur garnettii]
 gi|197215619|gb|ACH53015.1| exostosin 1 (predicted) [Otolemur garnettii]
          Length = 746

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 31/123 (25%), Positives = 57/123 (46%), Gaps = 1/123 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y V+N  +            W+  K++     + ++    Y +M   A FCL P G
Sbjct: 280 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDRDNMEYEKYDYREMLHNATFCLVPRG 339

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+   C+PV++++   LPF++ I W +  +   E  + ++ + + SI  +
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 399

Query: 338 VIL 340
            IL
Sbjct: 400 KIL 402


>gi|357628255|gb|EHJ77645.1| hypothetical protein KGM_04618 [Danaus plexippus]
          Length = 655

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 27/99 (27%), Positives = 53/99 (53%), Gaps = 1/99 (1%)

Query: 243 WENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 301
           W++ ++    + + ++    YE +   + FCL   G    S R +EA+  GC+PV++++ 
Sbjct: 203 WKDLRDERCDEDNREYDKFDYEQLLANSTFCLVARGRRLGSYRFLEALAAGCVPVLLSNG 262

Query: 302 IVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVIL 340
             LPF + I W    ++  E  + ++  ++ S+PPE IL
Sbjct: 263 WRLPFDERIDWRRAVIWADERLLLQVPELVRSVPPERIL 301


>gi|298713685|emb|CBJ48876.1| Glycosyltransferase, family GT47 [Ectocarpus siliculosus]
          Length = 226

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 42/166 (25%), Positives = 68/166 (40%), Gaps = 17/166 (10%)

Query: 180 RNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGAR 239
           RN     +G++ +  +   ++M    +  +    I        +D +  PE   Y R   
Sbjct: 39  RNFFLTTKGTLYLSGHGSGERMSLVPLQDEANGVIMSLHCFEIHDEHLLPENVEYCRA-- 96

Query: 240 AAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIA 299
                      L D   DH    YE +  + F L P G +P + RL E +  G IPV + 
Sbjct: 97  -----------LSDRYGDHS---YESLMNSTFGLVPAGRSPGTYRLGEVMSAGSIPVFVG 142

Query: 300 DDIVLPFADAIPWEEIGMFVAEEDV-PKLDTILTSIPPEVILRKQR 344
            D+VLPF + I W       A + V P++   L ++P   + + QR
Sbjct: 143 WDLVLPFKELIDWSSFSFAFAPDQVGPQMVRTLRAVPRAELEQMQR 188


>gi|1619954|gb|AAB17006.1| multiple exostosis 2 protein [Mus musculus]
          Length = 718

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 44/83 (53%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q A FC            L + +  GC+PV+IAD  +LPF++ + W++  + V E
Sbjct: 308 YPQVLQEATFCTVLRRARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKKASVVVPE 367

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E +  + +IL +IP   I   QR
Sbjct: 368 EKMSDVYSILQNIPQRQIEEMQR 390


>gi|302807981|ref|XP_002985685.1| galactosyltransferase-like protein [Selaginella moellendorffii]
 gi|300146594|gb|EFJ13263.1| galactosyltransferase-like protein [Selaginella moellendorffii]
          Length = 500

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 89/396 (22%), Positives = 151/396 (38%), Gaps = 73/396 (18%)

Query: 35  DPVGKLKVYVYELPSKYNKKLLLK---------------------------------DPR 61
           D     +V++YELP ++N ++L K                                 DP 
Sbjct: 97  DSCEGRRVFMYELPRRFNLEVLEKCDKMVSWLTFCDHFINHGFGKALAGANSSWYATDPY 156

Query: 62  CLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSA 121
            L  +F   +  +R L++SP       EAD F+ P Y   D     L   + +  + R  
Sbjct: 157 MLEVIFHERMHRYRCLVNSP------READAFFIPYYAGLD----ALRFLYGADNLNRHE 206

Query: 122 -----IQLISSNWPY-WNRTEGADHFFVVPH---DFGACFHYQEEKAIERGI-LPLLQRA 171
                ++ + +N+ + W R  G DHF V      DF +   Y+ +     G  L LL++ 
Sbjct: 207 QGVDLVEFLEANYSWSWTRNLGHDHFMVTGRTAWDFAS---YRGKSGSSWGTSLRLLKQM 263

Query: 172 TLVQTF-GQRNHVCLNEGSITIP-PYAPPQKMQAH---QIPPDTPRSIFVYFRGLFYDVN 226
             V T   +R      E +I  P  + P  K +     +    +PR+ F+ F G      
Sbjct: 264 ENVTTLVMERRPWDRTEQAIPYPTSFHPATKSELQAWIERVKASPRANFMSFAGAPRPQQ 323

Query: 227 NDP-EGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRL 285
           N+   G  + +  ++   E    + L      +P    E +  +IFCL P G        
Sbjct: 324 NESIRGILFEQCRKSRSCEAVNCSKL--RCAHNPLPIAEKLLSSIFCLQPQGDTSTRRSS 381

Query: 286 VEAVVFGCIPVII-ADDIVLPFADAIPWEE--IGMFVAEEDVPK-----LDTILTSIPPE 337
            +++V GCIPV   AD     +   +P E     +F+ EED+ +      + + +    +
Sbjct: 382 FDSLVCGCIPVFFHADSAYTQYTWHLPRERESYSVFIPEEDIRRDGLEVEEFLRSKFSSQ 441

Query: 338 VILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNG 373
            I   QR +    + R +   +P   GD     L+G
Sbjct: 442 RIGELQRNI-RKIIPRLLYTGKPWSSGDGGRDSLDG 476


>gi|168038843|ref|XP_001771909.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162676860|gb|EDQ63338.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 308

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 63/244 (25%), Positives = 99/244 (40%), Gaps = 37/244 (15%)

Query: 127 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLN 186
           S  PYWN+  G +H  V+   F   +H +       G   ++       T+     V   
Sbjct: 67  STLPYWNK--GLNHVLVI---FADKWHQRGPHQASIGNASVMASDMHETTYRPGFDV--- 118

Query: 187 EGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYA----RGAR--- 239
             SI +P     ++ Q+  + P   R     FRGL Y +    EG + +    RG     
Sbjct: 119 --SIPLPGRHHVRRFQS--VKP-LERKYLATFRGLRY-LGPKGEGVFRSLDSFRGMHNGN 172

Query: 240 ----AAVWENFKNN------PLFDISTD-----HPTTYYEDMQRAIFCLCPLGWAPWSPR 284
               A   E+  NN      P   +  D     H    ++D+    F L P G  P S R
Sbjct: 173 DVIVATSCEHMINNMIRKKEPTLGVHCDEDLLIHKNYTFDDLMNTTFGLVPAGVQPASYR 232

Query: 285 LVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPP-EVILRKQ 343
            +E +  G IPV+IAD+ V PF   I W +  +     ++ ++   L ++ P EV +R++
Sbjct: 233 FIEVLSAGAIPVLIADNYVKPFDTLILWYKCLLQFPTTEMHRIVGTLRAMKPEEVKIRQE 292

Query: 344 RLLA 347
             LA
Sbjct: 293 NCLA 296


>gi|55742053|ref|NP_001006730.1| exostosin 1 [Xenopus (Silurana) tropicalis]
 gi|49523045|gb|AAH75481.1| exostoses (multiple) 1 [Xenopus (Silurana) tropicalis]
          Length = 738

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 31/123 (25%), Positives = 56/123 (45%), Gaps = 1/123 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y V+N  +            W+  K+      + ++    Y +M   A FCL P G
Sbjct: 272 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDARCDKDNAEYEKYDYREMLHNATFCLVPRG 331

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+   C+PV++++   LPF++ I W +  +   E  + ++ + + SI  +
Sbjct: 332 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWNQAAVIGDERLLLQIPSTIRSIHQD 391

Query: 338 VIL 340
            IL
Sbjct: 392 KIL 394


>gi|298713683|emb|CBJ48874.1| Glycosyltransferase, family GT47 [Ectocarpus siliculosus]
          Length = 335

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 44/169 (26%), Positives = 72/169 (42%), Gaps = 23/169 (13%)

Query: 180 RNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNND---PEGGYYAR 236
           RN     +GS+ +  +   ++M    + P    +  V      ++++N+   PE   Y R
Sbjct: 148 RNFFLTTKGSLYLSRHGSGERMS---LVPLQDEANGVIMSLHCFELHNEHLLPENVEYCR 204

Query: 237 GARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPV 296
                         L D   DH    YE +  + F L P G +P + RL E +  G IPV
Sbjct: 205 A-------------LSDRYGDHS---YESLMNSTFGLVPAGRSPGTFRLGEVMSAGSIPV 248

Query: 297 IIADDIVLPFADAIPWEEIGMFVAEEDV-PKLDTILTSIPPEVILRKQR 344
            +  D+VLPF + I W       A + V P++   L ++P   + + QR
Sbjct: 249 FVGWDLVLPFKELIDWSSFSFAFAPDQVGPQMVRTLRAVPRAELEQMQR 297


>gi|224058539|ref|XP_002299537.1| predicted protein [Populus trichocarpa]
 gi|222846795|gb|EEE84342.1| predicted protein [Populus trichocarpa]
          Length = 573

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 77/333 (23%), Positives = 123/333 (36%), Gaps = 60/333 (18%)

Query: 42  VYVYELPSKYNKKLL--------------------LKDPRC-------LTHMFAAEIFMH 74
           VYVY+LPSK+NK L+                    L +P          TH ++ E   H
Sbjct: 192 VYVYDLPSKFNKDLIGQCGDMMPWTDFCKYFNNEALGEPIANLGKGWYHTHQYSLEPIFH 251

Query: 75  RFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKS-PRMMRSAIQLISSNWPY-- 131
             +LS P R  N  EA  FY P Y   D+    L   FK+    ++ A+ +    W    
Sbjct: 252 SRILSHPCRVYNESEAKLFYVPYYGGLDI----LRWHFKNVSDDVKDALAMDLMKWLEHR 307

Query: 132 --WNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGS 189
             W +  G DH FV+       F  +   +     L L Q    ++   +R    +N+ +
Sbjct: 308 RPWVQNSGTDHVFVLG-KISWDFRRKNYTSWGTRFLELEQMQNPIKLLIERQPWEVNDIA 366

Query: 190 ITIPPYAPPQK----MQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWEN 245
           I  P +  P      +   Q   +T R   V F G       D      +         +
Sbjct: 367 IPHPTFFHPHSDDDIVAWQQKIIETTRKNLVSFAGA---ARPDQPESIRSTLINQCTSTS 423

Query: 246 FKNNPLFDIST---DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 302
                  D  +   + P +  +    + FCL P G +P    + +++V GCIPV+     
Sbjct: 424 SDKCQFLDCKSGGCNQPESVTKLFLESEFCLQPPGDSPTRKSVFDSLVSGCIPVLFD--- 480

Query: 303 VLPFAD--AIPWE------EIGMFVAEEDVPKL 327
             PF      PW       +  +F+ +E+V ++
Sbjct: 481 --PFTAYYQYPWHLPEDHGKYSVFIDQEEVRQM 511


>gi|297725119|ref|NP_001174923.1| Os06g0638350 [Oryza sativa Japonica Group]
 gi|255677258|dbj|BAH93651.1| Os06g0638350 [Oryza sativa Japonica Group]
          Length = 257

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 71/269 (26%), Positives = 104/269 (38%), Gaps = 63/269 (23%)

Query: 58  KDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPI--------YPTCDLTPTGLP 109
           + PR LT  +A+E +  + +  S  RT + E+A  F+ PI         P+  L  T   
Sbjct: 20  QTPRKLTGKYASEGYFFQNIRESRFRTDDLEKAHLFFVPISPHKMRGKVPSSLLLVTYAW 79

Query: 110 LPFKSPRMMRSAI--------QLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIE 161
           L        RS +         L SS   +W    GADHFFV  HD G        +A E
Sbjct: 80  LILHIRSYDRSILFLDLYWWCPLCSSFRGHWG--VGADHFFVTCHDVGV-------RAFE 130

Query: 162 RGILPLLQRATLVQTFGQRNHVC---LNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYF 218
              LP + + ++         VC    N G I     A PQ +Q   +P           
Sbjct: 131 G--LPFIIKNSI-------RVVCSPSYNAGYIPHKDVALPQILQPFALPAGGN------- 174

Query: 219 RGLFYDVNNDPEGGYYA-------RGARAAVWEN-----FKNNPLFDISTDHPTTYYEDM 266
                D+ N    G++A       R   A +WEN       NN +     +    Y +  
Sbjct: 175 -----DIENRTILGFWAGHRNSKIRVILARIWENDTELAISNNRINRAIGN--LVYQKHF 227

Query: 267 QRAIFCLCPLGWAPWSPRLVEAVVFGCIP 295
            R  FC+CP G    S R+ +++ +GC+P
Sbjct: 228 FRTKFCVCPGGSQVNSARISDSIHYGCMP 256


>gi|148234643|ref|NP_001083782.1| exostosin 1 [Xenopus laevis]
 gi|62871603|gb|AAH94398.1| XEXT1 protein [Xenopus laevis]
          Length = 738

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 31/123 (25%), Positives = 56/123 (45%), Gaps = 1/123 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y V+N  +            W+  K+      + ++    Y +M   A FCL P G
Sbjct: 272 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDARCDKDNAEYEKYDYREMLHNATFCLVPRG 331

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+   C+PV++++   LPF++ I W +  +   E  + ++ + + SI  +
Sbjct: 332 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWNQAAVIGDERLLLQIPSTIRSIHQD 391

Query: 338 VIL 340
            IL
Sbjct: 392 KIL 394


>gi|148229160|ref|NP_001083108.1| uncharacterized protein LOC398750 [Xenopus laevis]
 gi|37805422|gb|AAH60367.1| MGC68803 protein [Xenopus laevis]
          Length = 718

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 26/83 (31%), Positives = 44/83 (53%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y + +Q + FC+   G       L + +  GC+PVIIAD  VLPF++ + W+   + + E
Sbjct: 308 YPQILQESTFCIVLRGARLGQGVLSDVLQAGCVPVIIADSYVLPFSEVLDWKRASVVIPE 367

Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
           E + ++ +IL  I    +   QR
Sbjct: 368 EKMFEMYSILQGISQRQVEEMQR 390


>gi|123457129|ref|XP_001316295.1| Exostosin family protein [Trichomonas vaginalis G3]
 gi|121898997|gb|EAY04072.1| Exostosin family protein [Trichomonas vaginalis G3]
          Length = 350

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 61/271 (22%), Positives = 97/271 (35%), Gaps = 67/271 (24%)

Query: 65  HMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY--------PTCDLTPTGLPLPFKSPR 116
           H F  E  +HR  +    R     +AD FY PIY          CD     LPL      
Sbjct: 28  HTFYFEPVLHRHFMKGYNRVSYESDADLFYVPIYLGLFNMQREKCDFDRCVLPL------ 81

Query: 117 MMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQ---RATL 173
                   + S   Y++R    DH FV             +       +P  Q   +  +
Sbjct: 82  --------VRSQGDYYDRFGSVDHVFV-------------QMLFSHNNVPFTQHHQKVIV 120

Query: 174 VQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIP---------PDTPRSIFVYFRGLFYD 224
            QT        + + +  +  + P Q  +   +P           + + I  +  G    
Sbjct: 121 AQT-------TIGDINWNLSIFEPRQMTRFTVMPYNSNFDFYESSSKQCITAFLTGQMTI 173

Query: 225 VNNDPEGGYYARGARAAVWE---NFKNNPLFDISTDH---PTTYYED---MQRAIFCLCP 275
            + D      AR  R A+ E   N KN  + +           Y++    M+ + FC  P
Sbjct: 174 ASFDKR----ARNIRQALKEEMRNTKNTAVIETKRKSHFIAAEYFQIESLMRNSEFCPVP 229

Query: 276 LGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 306
            G  P S RL +++  GCIP++++D+I  PF
Sbjct: 230 HGDGPQSKRLYDSMRTGCIPIVLSDEIRFPF 260


>gi|19909908|dbj|BAB87180.1| XEXT1 [Xenopus laevis]
          Length = 735

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 31/123 (25%), Positives = 56/123 (45%), Gaps = 1/123 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y V+N  +            W+  K+      + ++    Y +M   A FCL P G
Sbjct: 269 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDARCDKDNAEYEKYDYREMLHNATFCLVPRG 328

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+   C+PV++++   LPF++ I W +  +   E  + ++ + + SI  +
Sbjct: 329 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWNQAAVIGDERLLLQIPSTIRSIHQD 388

Query: 338 VIL 340
            IL
Sbjct: 389 KIL 391


>gi|156361936|ref|XP_001625539.1| predicted protein [Nematostella vectensis]
 gi|156212377|gb|EDO33439.1| predicted protein [Nematostella vectensis]
          Length = 881

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 45/175 (25%), Positives = 78/175 (44%), Gaps = 23/175 (13%)

Query: 190 ITIPPYAPPQK----MQAHQIPPDTPRSIFVYFRGLFY-------DVNND------PEGG 232
           + +PP   P       QA Q  P T R  F+ F G F+       +V+ D       E  
Sbjct: 310 VIVPPLIGPTSGEVWAQAGQQLP-TLRKYFLTFEGSFHLSNEGAEEVSKDDLVFLSKESS 368

Query: 233 YYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSP--RLVEAVV 290
                     ++   N+  +++   H       ++++ F L P+G + +    RL+EA+ 
Sbjct: 369 DLFIAVECVTYDKISNSAGWNLCGSHKNRTVR-LKKSTFSLVPVGNSGFVTHVRLIEALQ 427

Query: 291 FGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVI--LRKQ 343
            G IPVI+    +LP A+ I W  + + +    + +L+TIL ++  E I  LR+Q
Sbjct: 428 TGAIPVILGTTNMLPLAEFIDWRSVSITLTPARIMELNTILRTVSSEDIHNLRQQ 482


>gi|50416406|gb|AAH77234.1| XEXT1 protein [Xenopus laevis]
          Length = 735

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 31/123 (25%), Positives = 56/123 (45%), Gaps = 1/123 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y V+N  +            W+  K+      + ++    Y +M   A FCL P G
Sbjct: 269 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDARCDKDNAEYEKYDYREMLHNATFCLVPRG 328

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+   C+PV++++   LPF++ I W +  +   E  + ++ + + SI  +
Sbjct: 329 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWNQAAVIGDERLLLQIPSTIRSIHQD 388

Query: 338 VIL 340
            IL
Sbjct: 389 KIL 391


>gi|302847713|ref|XP_002955390.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
 gi|300259232|gb|EFJ43461.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
          Length = 267

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 33/104 (31%), Positives = 52/104 (50%), Gaps = 6/104 (5%)

Query: 282 SPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILR 341
            PR+ +AV  GC+PV+I+DD++  F   + W   G+ +AE D+P++  +  +I PE    
Sbjct: 11  GPRM-QAVQAGCVPVVISDDVLEAFEPFLDWNTFGVRLAEADIPRMHEVREAISPEEYAH 69

Query: 342 KQRLLANPSMKRAMLFPQPAQPG-----DAFHQILNGLARKLPH 380
           K+ LL   +   A      +  G     DAF  +L  L  K  H
Sbjct: 70  KEVLLRCAAQHMAFSTVTGSYIGESGRYDAFETLLEILRAKAAH 113


>gi|225457534|ref|XP_002270383.1| PREDICTED: xyloglucan galactosyltransferase KATAMARI1 homolog
           [Vitis vinifera]
          Length = 511

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 41/136 (30%), Positives = 56/136 (41%), Gaps = 39/136 (28%)

Query: 42  VYVYELPSKYNKKLLLKDPRCL----------------------------THMFAAEIFM 73
           VYVYELP K+N  LL K+ R L                            TH F AE+  
Sbjct: 125 VYVYELPPKFNIGLL-KECRRLNVYTDMCPHVANCGLGQPILEMGSSWFATHQFIAEMIF 183

Query: 74  HRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRS---AIQLIS--SN 128
           H  + + P RT +PE+AD FY P Y     +       F+   +      A++L+     
Sbjct: 184 HARMENHPCRTRDPEKADLFYVPFYGGLHASSK-----FRESNLAARDALAVELVEYIHR 238

Query: 129 WPYWNRTEGADHFFVV 144
             +W R  GADHF  +
Sbjct: 239 QRWWRRNHGADHFLAL 254


>gi|256088012|ref|XP_002580154.1| exostosin-1 [Schistosoma mansoni]
 gi|353230126|emb|CCD76297.1| putative exostosin-1 [Schistosoma mansoni]
          Length = 766

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 23/60 (38%), Positives = 36/60 (60%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y+E M  + FCL P G    S R +E +  GCIPV++++D+ LPF++ I W    ++  E
Sbjct: 322 YWELMYNSTFCLVPRGRRLGSYRFLEVLQAGCIPVMLSNDLELPFSEVIDWNRAVIWADE 381


>gi|224144348|ref|XP_002336132.1| predicted protein [Populus trichocarpa]
 gi|222873674|gb|EEF10805.1| predicted protein [Populus trichocarpa]
          Length = 421

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 41/142 (28%), Positives = 57/142 (40%), Gaps = 38/142 (26%)

Query: 38  GKLKVYVYELPSKYNKKLLLKDPRCL---------------------------------T 64
           G + VY+Y++P+++NK LL KD   L                                 T
Sbjct: 24  GGMSVYLYDMPAEFNKGLL-KDCSHLNAYTDMCPHVANRGLGQPLSYMAESAVATTWFAT 82

Query: 65  HMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPT--GLPLPFKSPRMMRSAI 122
           H F AE+  H  + + P R L+P  A  FY P Y   D +       L  +    +R A 
Sbjct: 83  HQFIAEMIFHARMENHPCRVLDPINAKLFYVPFYGGLDASSKFHDANLTARDELAVRLAD 142

Query: 123 QLISSNWPYWNRTEGADHFFVV 144
            L S   P+W R  G DHF V+
Sbjct: 143 YLRSK--PWWERHHGKDHFLVL 162


>gi|224061383|ref|XP_002300452.1| predicted protein [Populus trichocarpa]
 gi|222847710|gb|EEE85257.1| predicted protein [Populus trichocarpa]
          Length = 400

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 41/142 (28%), Positives = 57/142 (40%), Gaps = 38/142 (26%)

Query: 38  GKLKVYVYELPSKYNKKLLLKDPRCL---------------------------------T 64
           G + VY+Y++P+++NK LL KD   L                                 T
Sbjct: 3   GGMSVYLYDMPAEFNKGLL-KDCSHLNPYTDMCPHVANRGLGQPLSYMAESAVATTWFAT 61

Query: 65  HMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPT--GLPLPFKSPRMMRSAI 122
           H F AE+  H  + + P R L+P  A  FY P Y   D +       L  +    +R A 
Sbjct: 62  HQFIAEMIFHARMENHPCRVLDPINAKLFYVPFYGGLDASSKFHDANLTARDELAVRLAD 121

Query: 123 QLISSNWPYWNRTEGADHFFVV 144
            L S   P+W R  G DHF V+
Sbjct: 122 YLRSK--PWWERHHGKDHFLVL 141


>gi|345480513|ref|XP_001603840.2| PREDICTED: exostosin-1-like isoform 1 [Nasonia vitripennis]
 gi|345480515|ref|XP_003424163.1| PREDICTED: exostosin-1-like isoform 2 [Nasonia vitripennis]
          Length = 713

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 27/72 (37%), Positives = 41/72 (56%)

Query: 269 AIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLD 328
           + FCL P G    S R +EA+  GCIPVI+++   LPF D I W +  ++  E  + ++ 
Sbjct: 307 STFCLVPRGRRLGSFRFLEALRAGCIPVILSNGWALPFHDRIDWFQAVIYADERLLFQVP 366

Query: 329 TILTSIPPEVIL 340
            IL S+  E I+
Sbjct: 367 DILRSVVEEKIM 378


>gi|296227312|ref|XP_002759355.1| PREDICTED: exostosin-1 [Callithrix jacchus]
          Length = 475

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 27/85 (31%), Positives = 46/85 (54%)

Query: 256 TDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEI 315
           T+H   Y E +  A FCL P G    S R +EA+   C+PV++++   LPF++ I W + 
Sbjct: 47  TEHWYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQA 106

Query: 316 GMFVAEEDVPKLDTILTSIPPEVIL 340
            +   E  + ++ + + SI  + IL
Sbjct: 107 AVIGDERLLLQIPSTIRSIHQDKIL 131


>gi|159465527|ref|XP_001690974.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
 gi|158279660|gb|EDP05420.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
          Length = 341

 Score = 52.0 bits (123), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 36/143 (25%), Positives = 62/143 (43%), Gaps = 14/143 (9%)

Query: 204 HQIPPDTPRSIFVYFRGLFYDVNN--DPEGGY----------YARGARAAVWENFKNNPL 251
           H + P  PR+    F G     ++  DPE  +          Y++GAR  V  +    P 
Sbjct: 72  HPLAPRIPRNGTFLFSGRICGDHSLPDPEKPWPHCATRRSPGYSQGARQLVHFHHSQRPG 131

Query: 252 FDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
           + +   +   Y  D+    +CL P G      R V     GC+PV+++D ++ PF   + 
Sbjct: 132 YKVMVKN-KEYQVDLINYKWCLAPSG-GGHGHRQVLVAAMGCLPVVVSDLVMQPFEPEMD 189

Query: 312 WEEIGMFVAEEDVPKLDTILTSI 334
           W    + V ++DVP L   + ++
Sbjct: 190 WSAFSLRVEQKDVPTLHEAIEAV 212


>gi|310831249|ref|YP_003969892.1| putative exostosin family protein [Cafeteria roenbergensis virus
           BV-PW1]
 gi|309386433|gb|ADO67293.1| putative exostosin family protein [Cafeteria roenbergensis virus
           BV-PW1]
          Length = 300

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 33/108 (30%), Positives = 56/108 (51%), Gaps = 3/108 (2%)

Query: 231 GGYYARGARAAVWENFKNNPLFDISTDHPTTYYED-MQRAIFCLCPLGWAPWSPRLVEAV 289
           G ++      +  +N K     D S  + T  Y + +  +IF LCP G  P S R  E++
Sbjct: 172 GNWHFNKIVYSTKQNAKQELNIDNSHINNTKNYNNILLDSIFSLCPSGSGPNSIRFWESL 231

Query: 290 VFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
             G IPV+++D + LP    I W++I + + E+++  +D IL +I  E
Sbjct: 232 AVGSIPVLLSDKLDLP--KNINWQDIIIIIPEKEISNIDIILNNISVE 277


>gi|302785233|ref|XP_002974388.1| galactosyltransferase-like protein [Selaginella moellendorffii]
 gi|300157986|gb|EFJ24610.1| galactosyltransferase-like protein [Selaginella moellendorffii]
          Length = 500

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 81/382 (21%), Positives = 145/382 (37%), Gaps = 59/382 (15%)

Query: 41  KVYVYELPSKYNKKLLLKDPRCLT----------HMFAA-----------------EIFM 73
           +V++YELP K+N ++L K  + ++          H F                   E+  
Sbjct: 105 RVFMYELPRKFNLEVLEKCDKMVSWLTFCDHFINHGFGKALAGANSSWYATDPYMLEVIF 164

Query: 74  HRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSA-----IQLISSN 128
           H  +        +P EAD F+ P Y   D     L   + +  + R       ++ + +N
Sbjct: 165 HERMRRYRCLVNSPREADAFFIPYYAGLD----ALRFLYGADNLNRHEQGVDLVKFLEAN 220

Query: 129 WPY-WNRTEGADHFFVVPH---DFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVC 184
           + + W R  G DHF V      DF +   Y+ + +    +  L Q   +     +R    
Sbjct: 221 YSWSWRRNLGHDHFMVTGRTAWDFAS---YRGKSSWGTSLRLLKQMENVTTLVMERRPWD 277

Query: 185 LNEGSITIP-PYAPPQKMQAH---QIPPDTPRSIFVYFRGLFYDVNNDP-EGGYYARGAR 239
             E +I  P  + P  K +     +    +PR+ F+ F G      N+   G  + +  +
Sbjct: 278 RTEQAIPYPTSFHPATKSELQAWIERVKASPRTNFMSFAGAPRPQQNESIRGILFEQCRK 337

Query: 240 AAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII- 298
           +   E    + L      +P    E +  +IFCL P G         +++V GCIPV   
Sbjct: 338 SRSCEAVNCSKL--RCAHNPLPIAEKLLSSIFCLQPQGDTSTRRSSFDSLVCGCIPVFFH 395

Query: 299 ADDIVLPFADAIPWEE--IGMFVAEEDVPK-----LDTILTSIPPEVILRKQRLLANPSM 351
           AD     +   +P E     +F+ EE++ +      + + +    + I   QR +    +
Sbjct: 396 ADSAYTQYTWHLPRERESYSVFIPEEEIRRDGLEVEEFLRSKFSSQRIGELQRNI-RKII 454

Query: 352 KRAMLFPQPAQPGDAFHQILNG 373
            R +   +P   GD     L+G
Sbjct: 455 PRLLYTGKPWSSGDGGRDSLDG 476


>gi|427796253|gb|JAA63578.1| Putative tout-velu, partial [Rhipicephalus pulchellus]
          Length = 835

 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 25/78 (32%), Positives = 42/78 (53%)

Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
           Y   M+ A FCL P G    S R +E++  GC+PV++A+   LPF +++ WE   +   E
Sbjct: 394 YGSLMENATFCLVPRGRRLGSFRFLESLQAGCVPVLLANGWELPFGESLRWEGAALRADE 453

Query: 322 EDVPKLDTILTSIPPEVI 339
             + ++   L S+P   +
Sbjct: 454 RLLLQVPDTLRSMPRRRV 471


>gi|348519387|ref|XP_003447212.1| PREDICTED: exostosin-1b-like [Oreochromis niloticus]
          Length = 740

 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 49/195 (25%), Positives = 80/195 (41%), Gaps = 19/195 (9%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y V+N  +            W+  K+      + ++    Y++M   + FCL P G
Sbjct: 274 RNALYHVHNAEDVVLLTTCKHGKDWQKHKDARCDKDNAEYDKYDYKEMLHNSTFCLVPRG 333

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S R +EA+   C+PV++++   LPF++ I W    +   E  + ++ T + SI  +
Sbjct: 334 RRLGSFRFLEALQAACVPVMLSNGWELPFSEIIDWNTAAVIGDERLLLQIPTTVRSIHQD 393

Query: 338 VILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLA----RKLPHDKSTYL----KPG 389
            IL         S+++   F   A        +L  L     R L H   + L     PG
Sbjct: 394 KIL---------SLRQQTQFLWEAYFSSVEKIVLTTLEIIQDRVLEHASRSSLMWNSHPG 444

Query: 390 GKF-LNWTSGPVGDL 403
           G F L   SG +GD 
Sbjct: 445 GLFALPQYSGYLGDF 459


>gi|156362595|ref|XP_001625861.1| predicted protein [Nematostella vectensis]
 gi|156212714|gb|EDO33761.1| predicted protein [Nematostella vectensis]
          Length = 728

 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 37/135 (27%), Positives = 62/135 (45%), Gaps = 12/135 (8%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKN------NPLFDISTDHPTTYYEDMQRAIFC 272
           R   Y ++N  +            W   K+      N L+D       +Y E +  A FC
Sbjct: 267 RNALYHIHNGRDIVLLTTCKHGKAWHKHKDERCDGDNALYD-----RYSYDELLLNATFC 321

Query: 273 LCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILT 332
           L P G    S R +E++  GCIP +++D   LPFA+ I W++  +  +E  + ++  I+ 
Sbjct: 322 LVPRGRRLGSFRFLESLKVGCIPFLLSDGWELPFAEVIDWKKAVIDGSERLLMQVPGIVR 381

Query: 333 SIP-PEVILRKQRLL 346
           S    +V+  KQ+ L
Sbjct: 382 SYSRSQVLAMKQQSL 396


>gi|340377881|ref|XP_003387457.1| PREDICTED: exostosin-1b-like [Amphimedon queenslandica]
          Length = 729

 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 33/128 (25%), Positives = 55/128 (42%), Gaps = 11/128 (8%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENF------KNNPLFDISTDHPTTYYEDMQRAIFC 272
           R   Y ++N  +            W  +      K+N  F+        Y   +  + FC
Sbjct: 262 RNAIYHLDNGDDVIMLTTCKHGKFWSRYRDEKCKKDNIFFE-----KYNYNHLISNSTFC 316

Query: 273 LCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILT 332
           L P G    S R +E++  GCIPV +++ +VLPF + I W +      E  + ++  +L 
Sbjct: 317 LVPRGRRLGSFRFLESLHAGCIPVSLSNGLVLPFHELIDWSKALFVFDERQLFQVPHMLR 376

Query: 333 SIPPEVIL 340
            IP + IL
Sbjct: 377 HIPEDKIL 384


>gi|441647993|ref|XP_004090847.1| PREDICTED: LOW QUALITY PROTEIN: exostosin-1 [Nomascus leucogenys]
          Length = 746

 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 31/123 (25%), Positives = 57/123 (46%), Gaps = 1/123 (0%)

Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
           R   Y V+N  +            W+  K++     +T++    Y +M   A FCL P G
Sbjct: 280 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 339

Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
               S   +EA+   C+PV++++   LPF++ I W +  +   E  + ++ + + SI  +
Sbjct: 340 RRLGSFIFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 399

Query: 338 VIL 340
            IL
Sbjct: 400 KIL 402


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.323    0.140    0.457 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,271,486,090
Number of Sequences: 23463169
Number of extensions: 328051390
Number of successful extensions: 619690
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1149
Number of HSP's successfully gapped in prelim test: 160
Number of HSP's that attempted gapping in prelim test: 616856
Number of HSP's gapped (non-prelim): 1973
length of query: 406
length of database: 8,064,228,071
effective HSP length: 145
effective length of query: 261
effective length of database: 8,957,035,862
effective search space: 2337786359982
effective search space used: 2337786359982
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 78 (34.7 bits)