BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 038885
(406 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224053575|ref|XP_002297880.1| predicted protein [Populus trichocarpa]
gi|222845138|gb|EEE82685.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 780 bits (2015), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/408 (91%), Positives = 387/408 (94%), Gaps = 2/408 (0%)
Query: 1 MWVTAIALLLAFVSADNTPKI--ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLK 58
+WV A+ L+ FV ++ ERISGSAGDVL+DDPVG+LKVYVYELPSKYNKKLL K
Sbjct: 5 LWVFAVVLVFGFVDGKEIERLRTERISGSAGDVLDDDPVGRLKVYVYELPSKYNKKLLQK 64
Query: 59 DPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMM 118
DPRCLTHMFAAEIFMHRFLLSSPVRTLNP+EADWFY+PIYPTCDLTP GLPLPFKSPRMM
Sbjct: 65 DPRCLTHMFAAEIFMHRFLLSSPVRTLNPDEADWFYSPIYPTCDLTPMGLPLPFKSPRMM 124
Query: 119 RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG 178
RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQR+TLVQTFG
Sbjct: 125 RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRSTLVQTFG 184
Query: 179 QRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGA 238
QRNHVCLNEGSITIPP+APPQKMQAHQIPPD PRSIFVYFRGLFYDVNNDPEGGYYARGA
Sbjct: 185 QRNHVCLNEGSITIPPFAPPQKMQAHQIPPDIPRSIFVYFRGLFYDVNNDPEGGYYARGA 244
Query: 239 RAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 298
RAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII
Sbjct: 245 RAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 304
Query: 299 ADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFP 358
ADDIVLPFADAIPWEEIG+FVAEEDVP LDTILTSIPPEVILRKQRLLANPSMKRAMLFP
Sbjct: 305 ADDIVLPFADAIPWEEIGVFVAEEDVPNLDTILTSIPPEVILRKQRLLANPSMKRAMLFP 364
Query: 359 QPAQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
QPAQPGDAFHQILNGLARKLPHD+S YLK G LNWT+GPVGDLKPW
Sbjct: 365 QPAQPGDAFHQILNGLARKLPHDRSVYLKSGQNILNWTAGPVGDLKPW 412
>gi|60657598|gb|AAX33320.1| secondary cell wall-related glycosyltransferase family 47 [Populus
tremula x Populus tremuloides]
Length = 412
Score = 775 bits (2000), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 370/408 (90%), Positives = 386/408 (94%), Gaps = 2/408 (0%)
Query: 1 MWVTAIALLLAFVSADNTPKI--ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLK 58
+WV A+ L+ FV ++ ERISGSAGDVL+DDPVG+LKVYVYELPSKYNKKLL K
Sbjct: 5 LWVFALVLVFGFVDGKKIERLRTERISGSAGDVLDDDPVGRLKVYVYELPSKYNKKLLQK 64
Query: 59 DPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMM 118
DPRCLTHMFAAEIFMHRFLLSSPVRTLNP+EADWFY+PIYPTCDLTP GLPLPFKSPRMM
Sbjct: 65 DPRCLTHMFAAEIFMHRFLLSSPVRTLNPDEADWFYSPIYPTCDLTPMGLPLPFKSPRMM 124
Query: 119 RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG 178
RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFH QEEKA+ERGILPLLQR+TLVQTFG
Sbjct: 125 RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHCQEEKAVERGILPLLQRSTLVQTFG 184
Query: 179 QRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGA 238
+RNHVCLNEGSITIPP+APPQKMQAHQIPPD PRSIFVYFRGLFYDVNNDPEGGYYARGA
Sbjct: 185 RRNHVCLNEGSITIPPFAPPQKMQAHQIPPDIPRSIFVYFRGLFYDVNNDPEGGYYARGA 244
Query: 239 RAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 298
RAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII
Sbjct: 245 RAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 304
Query: 299 ADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFP 358
ADDIVLPFADAIPWEEIG+FVAEEDVP LDTILTSIPPEVILRKQRLLANPSMKRAMLFP
Sbjct: 305 ADDIVLPFADAIPWEEIGVFVAEEDVPNLDTILTSIPPEVILRKQRLLANPSMKRAMLFP 364
Query: 359 QPAQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
QPAQPGDAFHQILNGLARKLPHD+S YLK G LNWT+GPVGDLKPW
Sbjct: 365 QPAQPGDAFHQILNGLARKLPHDRSVYLKSGQNILNWTAGPVGDLKPW 412
>gi|15223522|ref|NP_174064.1| putative glycosyltransferase [Arabidopsis thaliana]
gi|75272002|sp|Q9FZJ1.1|IRX10_ARATH RecName: Full=Probable beta-1,4-xylosyltransferase IRX10; AltName:
Full=Glucuronoxylan glucuronosyltransferase 1;
Short=AtGUT1; AltName: Full=Glucuronoxylan
glucuronosyltransferase 2; AltName: Full=Protein
IRREGULAR XYLEM 10; AltName: Full=Xylan
xylosyltransferase IRX10
gi|9802541|gb|AAF99743.1|AC004557_22 F17L21.23 [Arabidopsis thaliana]
gi|63003872|gb|AAY25465.1| At1g27440 [Arabidopsis thaliana]
gi|98960979|gb|ABF58973.1| At1g27440 [Arabidopsis thaliana]
gi|332192709|gb|AEE30830.1| putative glycosyltransferase [Arabidopsis thaliana]
Length = 412
Score = 773 bits (1996), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/406 (92%), Positives = 385/406 (94%), Gaps = 2/406 (0%)
Query: 3 VTAIALLLAFV--SADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDP 60
++AI L L F SA + ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLL KDP
Sbjct: 7 LSAILLFLFFSASSAKQNVRTERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLQKDP 66
Query: 61 RCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRS 120
RCLTHMFAAEIFMHRFLLSSPVRT NP+EADWFYTPIYPTCDLTPTGLPLPFKSPRMMRS
Sbjct: 67 RCLTHMFAAEIFMHRFLLSSPVRTRNPDEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRS 126
Query: 121 AIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR 180
+IQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR
Sbjct: 127 SIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR 186
Query: 181 NHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARA 240
NHVCL+EGSITIPP+APPQKMQAH IPPD PRSIFVYFRGLFYDVNNDPEGGYYARGARA
Sbjct: 187 NHVCLDEGSITIPPFAPPQKMQAHFIPPDIPRSIFVYFRGLFYDVNNDPEGGYYARGARA 246
Query: 241 AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 300
AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD
Sbjct: 247 AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 306
Query: 301 DIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQP 360
DIVLPFADAIPWEEIG+FVAE+DVP+LDTILTSIP EVILRKQRLLANPSMKRAMLFPQP
Sbjct: 307 DIVLPFADAIPWEEIGVFVAEKDVPELDTILTSIPTEVILRKQRLLANPSMKRAMLFPQP 366
Query: 361 AQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
AQPGDAFHQILNGLARKLPHDKS YLK G K LNWT+GPV DLKPW
Sbjct: 367 AQPGDAFHQILNGLARKLPHDKSIYLKTGEKALNWTAGPVADLKPW 412
>gi|224075447|ref|XP_002304637.1| predicted protein [Populus trichocarpa]
gi|222842069|gb|EEE79616.1| predicted protein [Populus trichocarpa]
Length = 413
Score = 772 bits (1994), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/393 (93%), Positives = 380/393 (96%)
Query: 14 SADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFM 73
+AD + ERISGSAGDVLED+PVG+LKV+VYELPSKYNKKLL KDPRCLTHMFAAEIFM
Sbjct: 21 NADQRLRTERISGSAGDVLEDNPVGRLKVFVYELPSKYNKKLLQKDPRCLTHMFAAEIFM 80
Query: 74 HRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWN 133
HRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPF SPRMMRSAIQL+SSNWPYWN
Sbjct: 81 HRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFNSPRMMRSAIQLLSSNWPYWN 140
Query: 134 RTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIP 193
RTEGADHFFVVPHDFGACFHYQEEKAIERGILPLL+RATLVQTFGQRNHVCLNEGSITIP
Sbjct: 141 RTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQRNHVCLNEGSITIP 200
Query: 194 PYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFD 253
PYAPPQKMQAHQIP DTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFD
Sbjct: 201 PYAPPQKMQAHQIPLDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFD 260
Query: 254 ISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE 313
ISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE
Sbjct: 261 ISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE 320
Query: 314 EIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNG 373
EIG+FVAEEDVP LDT LTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNG
Sbjct: 321 EIGVFVAEEDVPHLDTFLTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNG 380
Query: 374 LARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
LARKLPHD++ YLKPG K LNWT+GP GDLKPW
Sbjct: 381 LARKLPHDRTVYLKPGQKVLNWTAGPAGDLKPW 413
>gi|297851174|ref|XP_002893468.1| hypothetical protein ARALYDRAFT_472948 [Arabidopsis lyrata subsp.
lyrata]
gi|297339310|gb|EFH69727.1| hypothetical protein ARALYDRAFT_472948 [Arabidopsis lyrata subsp.
lyrata]
Length = 412
Score = 772 bits (1994), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/406 (91%), Positives = 387/406 (95%), Gaps = 2/406 (0%)
Query: 3 VTAIALLLAFV--SADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDP 60
++AI L L F S++ + ERISGSAGDVLED+PVGKLKV+VYELPSKYNKKLL KDP
Sbjct: 7 LSAILLFLFFSASSSEQNVRTERISGSAGDVLEDNPVGKLKVFVYELPSKYNKKLLQKDP 66
Query: 61 RCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRS 120
RCLTHMFAAEIFMHRFLLSSPVRT NP+EADWFYTPIYPTCDLTPTGLPLPFKSPRMMRS
Sbjct: 67 RCLTHMFAAEIFMHRFLLSSPVRTRNPDEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRS 126
Query: 121 AIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR 180
AIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR
Sbjct: 127 AIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR 186
Query: 181 NHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARA 240
NHVCL+EGSITIPP+APPQKMQAH IPPD PRSIFVYFRGLFYDVNNDPEGGYYARGARA
Sbjct: 187 NHVCLDEGSITIPPFAPPQKMQAHFIPPDIPRSIFVYFRGLFYDVNNDPEGGYYARGARA 246
Query: 241 AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 300
AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD
Sbjct: 247 AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 306
Query: 301 DIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQP 360
DIVLPFADAIPWEEIG+FVAE+DVP+LDTILTSIP EVILRKQRLLANPSMKRAMLFPQP
Sbjct: 307 DIVLPFADAIPWEEIGVFVAEKDVPELDTILTSIPTEVILRKQRLLANPSMKRAMLFPQP 366
Query: 361 AQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
AQPGDAFHQILNGLARKLPHDKS YLK G K LNWT+GPVGDLKPW
Sbjct: 367 AQPGDAFHQILNGLARKLPHDKSIYLKAGEKELNWTAGPVGDLKPW 412
>gi|359473830|ref|XP_003631364.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like [Vitis
vinifera]
Length = 412
Score = 759 bits (1960), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 365/408 (89%), Positives = 380/408 (93%), Gaps = 2/408 (0%)
Query: 1 MWVTAIALL--LAFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLK 58
+WV + + D P ERISGSAGDVLEDDPVG+LKV+VYELPSKYNKK+L K
Sbjct: 5 IWVLVFGFVWGIGAHGLDRHPHTERISGSAGDVLEDDPVGRLKVFVYELPSKYNKKILQK 64
Query: 59 DPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMM 118
DPRCL HMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY TCDLTP GLPLPFKSPRMM
Sbjct: 65 DPRCLNHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYTTCDLTPNGLPLPFKSPRMM 124
Query: 119 RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG 178
RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG
Sbjct: 125 RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG 184
Query: 179 QRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGA 238
QRNHVCLNEGSITIPPYAPPQKMQAH IP +TPRSIFVYFRGLFYDVNNDPEGGYYARGA
Sbjct: 185 QRNHVCLNEGSITIPPYAPPQKMQAHLIPQETPRSIFVYFRGLFYDVNNDPEGGYYARGA 244
Query: 239 RAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 298
RAAVWENFK+NPLFDIST+HPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAV+FGCIPVII
Sbjct: 245 RAAVWENFKDNPLFDISTEHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVIFGCIPVII 304
Query: 299 ADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFP 358
ADDIVLPFADAIPWEEIG+FVAEEDVP LDTILTSIPPEVILRKQRLLANPSMK+AMLFP
Sbjct: 305 ADDIVLPFADAIPWEEIGVFVAEEDVPNLDTILTSIPPEVILRKQRLLANPSMKQAMLFP 364
Query: 359 QPAQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
QPAQ GDAFHQILNGLARKLPH+K+ YLKPG LNWT+GPVGDLKPW
Sbjct: 365 QPAQSGDAFHQILNGLARKLPHNKNVYLKPGENALNWTAGPVGDLKPW 412
>gi|255540029|ref|XP_002511079.1| catalytic, putative [Ricinus communis]
gi|223550194|gb|EEF51681.1| catalytic, putative [Ricinus communis]
Length = 417
Score = 755 bits (1949), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/403 (89%), Positives = 379/403 (94%), Gaps = 3/403 (0%)
Query: 7 ALLLAFVSA---DNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCL 63
A L+ SA ++ + ERISGSAGDVLEDDPVG+LKV+VYELPSKYNKK+L KDPRCL
Sbjct: 15 AFFLSSSSAYKMGSSQRTERISGSAGDVLEDDPVGRLKVFVYELPSKYNKKILQKDPRCL 74
Query: 64 THMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQ 123
THMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP+Y TCDLTP GLPLPFKSPRMMRSAIQ
Sbjct: 75 THMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQ 134
Query: 124 LISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHV 183
LISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILP+LQRATLVQTFGQRNHV
Sbjct: 135 LISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPMLQRATLVQTFGQRNHV 194
Query: 184 CLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVW 243
CL EGSITIPPYAPPQKMQAH IP TPRSIFVYFRGLFYDV NDPEGGYYARGARAAVW
Sbjct: 195 CLKEGSITIPPYAPPQKMQAHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVW 254
Query: 244 ENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 303
ENFK+NPLFDIST+HPTTYYEDMQRA+FCLCPLGWAPWSPRLVEAV+FGCIPVIIADDIV
Sbjct: 255 ENFKDNPLFDISTEHPTTYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIV 314
Query: 304 LPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP 363
LPFADAIPWEEIG+FV E+DVP LDTILTSIPPEVILRKQRLLANPSMK+AMLFPQPAQ
Sbjct: 315 LPFADAIPWEEIGVFVDEKDVPNLDTILTSIPPEVILRKQRLLANPSMKQAMLFPQPAQA 374
Query: 364 GDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
GDAFHQ+LNGLARKLPHD+S YLKPG K LNWT+GPVGDLKPW
Sbjct: 375 GDAFHQVLNGLARKLPHDRSVYLKPGEKILNWTAGPVGDLKPW 417
>gi|449484890|ref|XP_004157009.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10-like [Cucumis
sativus]
Length = 417
Score = 754 bits (1948), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/406 (88%), Positives = 378/406 (93%)
Query: 1 MWVTAIALLLAFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDP 60
+WV AI L+ + N ERISGSAGDVLED+PVG+LKVYVY+LPSKYNKK L KDP
Sbjct: 12 LWVFAIFLVSRIAAVPNAVPTERISGSAGDVLEDNPVGRLKVYVYDLPSKYNKKTLQKDP 71
Query: 61 RCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRS 120
RCLTHMFAAEI+MHRFLL+SPVRTLNP+EADWFYTPIY TCDLTP GLPLPFKSPRMMRS
Sbjct: 72 RCLTHMFAAEIYMHRFLLNSPVRTLNPDEADWFYTPIYVTCDLTPNGLPLPFKSPRMMRS 131
Query: 121 AIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR 180
AIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAI+RGILPLLQRATLVQTFGQR
Sbjct: 132 AIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIDRGILPLLQRATLVQTFGQR 191
Query: 181 NHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARA 240
NHVCLNEGSITIPPY PPQKM+ H IP +TPRSIFVYFRGLFYDVNNDPEGGYYARGARA
Sbjct: 192 NHVCLNEGSITIPPYCPPQKMKTHLIPSETPRSIFVYFRGLFYDVNNDPEGGYYARGARA 251
Query: 241 AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 300
AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD
Sbjct: 252 AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 311
Query: 301 DIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQP 360
DIVLPFADAIPWEEIG+FV E+DV LDTILTSIPP+VILRKQRLLANPSMKRAM+FPQP
Sbjct: 312 DIVLPFADAIPWEEIGVFVDEKDVSNLDTILTSIPPDVILRKQRLLANPSMKRAMMFPQP 371
Query: 361 AQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
AQ GDAFHQILNGLARKLPHDK YLKPG + LNWT+GPVGDLKPW
Sbjct: 372 AQSGDAFHQILNGLARKLPHDKGVYLKPGERILNWTAGPVGDLKPW 417
>gi|225456511|ref|XP_002284685.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L [Vitis
vinifera]
gi|147860849|emb|CAN83159.1| hypothetical protein VITISV_022554 [Vitis vinifera]
gi|297734521|emb|CBI15768.3| unnamed protein product [Vitis vinifera]
Length = 416
Score = 754 bits (1947), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/411 (88%), Positives = 382/411 (92%), Gaps = 6/411 (1%)
Query: 2 WVTAIALLLAF---VSADNT---PKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKL 55
WV + LL+A V A+ + ERISGSAGDVLED+PVG+LKV+VYELPSKYNKK+
Sbjct: 6 WVFCVLLLIASSLRVGAEQLRRGQRTERISGSAGDVLEDNPVGRLKVFVYELPSKYNKKI 65
Query: 56 LLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSP 115
L KDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP+Y TCDLTP GLPLPFKSP
Sbjct: 66 LQKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSP 125
Query: 116 RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ 175
RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGIL LLQRATLVQ
Sbjct: 126 RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILHLLQRATLVQ 185
Query: 176 TFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYA 235
TFGQRNHVCL EGSITIPPYAPPQKMQ+H IPPDTPRSIFVYFRGLFYDV NDPEGGYYA
Sbjct: 186 TFGQRNHVCLKEGSITIPPYAPPQKMQSHLIPPDTPRSIFVYFRGLFYDVGNDPEGGYYA 245
Query: 236 RGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIP 295
RGARAAVWENFK+NPLFDIST+HP TYYEDMQRA+FCLCPLGWAPWSPRLVE V+FGCIP
Sbjct: 246 RGARAAVWENFKDNPLFDISTEHPMTYYEDMQRAVFCLCPLGWAPWSPRLVEGVIFGCIP 305
Query: 296 VIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAM 355
VIIADDIVLPFADAIPWEEIG+FVAEEDVP LDTILTSIPPEVILRKQRLLANPSMK+AM
Sbjct: 306 VIIADDIVLPFADAIPWEEIGVFVAEEDVPNLDTILTSIPPEVILRKQRLLANPSMKQAM 365
Query: 356 LFPQPAQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
LFPQPAQPGDAFHQILNGLARKLPH ++ YLKPG K LNWT+GPVGDLKPW
Sbjct: 366 LFPQPAQPGDAFHQILNGLARKLPHGRNIYLKPGEKILNWTAGPVGDLKPW 416
>gi|356508876|ref|XP_003523179.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
[Glycine max]
Length = 416
Score = 749 bits (1934), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 357/406 (87%), Positives = 374/406 (92%)
Query: 1 MWVTAIALLLAFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDP 60
+ A A + V ERISGSAGDVLEDDPVG+LKV+VYELPSKYNKK+L KDP
Sbjct: 11 LLCAASAFAIGAVELGRHQPTERISGSAGDVLEDDPVGRLKVFVYELPSKYNKKILQKDP 70
Query: 61 RCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRS 120
RCL HMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP+Y TCDLTP GLPLPFKSPRMMRS
Sbjct: 71 RCLNHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRS 130
Query: 121 AIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR 180
AIQLISSNWPYWNRTEGADHFFV PHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR
Sbjct: 131 AIQLISSNWPYWNRTEGADHFFVTPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR 190
Query: 181 NHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARA 240
NHVCL EGSITIPPYAPPQKM H IP TPRSIFVYFRGLFYDV NDPEGGYYARGARA
Sbjct: 191 NHVCLKEGSITIPPYAPPQKMHTHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARA 250
Query: 241 AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 300
AVWENFK+NPLFDIST+HPTTYYEDMQRA+FCLCPLGWAPWSPRLVEAVVFGCIPVIIAD
Sbjct: 251 AVWENFKDNPLFDISTEHPTTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 310
Query: 301 DIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQP 360
DIVLPFADAIPWEEIG+FV E+DVP+LDTILTSIPPEVILRKQRLLANPSMK+AMLFPQP
Sbjct: 311 DIVLPFADAIPWEEIGVFVDEKDVPQLDTILTSIPPEVILRKQRLLANPSMKQAMLFPQP 370
Query: 361 AQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
AQPGDAFHQ+LNGLARKLPHD+S +LKPG K LNWT+GPVGDLKPW
Sbjct: 371 AQPGDAFHQVLNGLARKLPHDRSVFLKPGEKALNWTAGPVGDLKPW 416
>gi|224119858|ref|XP_002318180.1| predicted protein [Populus trichocarpa]
gi|224122230|ref|XP_002318783.1| predicted protein [Populus trichocarpa]
gi|222858853|gb|EEE96400.1| predicted protein [Populus trichocarpa]
gi|222859456|gb|EEE97003.1| predicted protein [Populus trichocarpa]
Length = 417
Score = 749 bits (1933), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 353/406 (86%), Positives = 373/406 (91%)
Query: 1 MWVTAIALLLAFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDP 60
+W L V + ERISGSAGDVLEDDPVG+LKV+VYELP KYNKK+L KDP
Sbjct: 12 LWYAGFVLSTGAVELGRSQHTERISGSAGDVLEDDPVGRLKVFVYELPRKYNKKILQKDP 71
Query: 61 RCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRS 120
RCLTHMFAAEIFMHRFLLSSPVRTLNP+EADWFYTP+Y TCDLT GLPLPFKSPRMMRS
Sbjct: 72 RCLTHMFAAEIFMHRFLLSSPVRTLNPDEADWFYTPVYTTCDLTTNGLPLPFKSPRMMRS 131
Query: 121 AIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR 180
AIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR
Sbjct: 132 AIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR 191
Query: 181 NHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARA 240
NHVCL +GSIT+PPYAPPQKMQ H IP TPRSIFVYFRGLFYDV NDPEGGYYARGARA
Sbjct: 192 NHVCLKDGSITVPPYAPPQKMQTHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARA 251
Query: 241 AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 300
AVWENFK+NPLFDIST+HP TYYEDMQRA+FCLCPLGWAPWSPRLVEAV+FGCIPVIIAD
Sbjct: 252 AVWENFKDNPLFDISTEHPATYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGCIPVIIAD 311
Query: 301 DIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQP 360
DIVLPFADAIPWEEIG++V EEDVP LDTILTSIPPEVILRKQRLLANPSMK+AMLFPQP
Sbjct: 312 DIVLPFADAIPWEEIGVYVDEEDVPNLDTILTSIPPEVILRKQRLLANPSMKQAMLFPQP 371
Query: 361 AQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
AQPGDAFHQ+LNGLARKLPHD+S YL+PG K LNWT+GPVGDLKPW
Sbjct: 372 AQPGDAFHQVLNGLARKLPHDRSVYLRPGEKILNWTAGPVGDLKPW 417
>gi|356516468|ref|XP_003526916.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
[Glycine max]
Length = 416
Score = 748 bits (1930), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 354/385 (91%), Positives = 369/385 (95%)
Query: 22 ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP 81
ERISGSAGDVLEDDPVG+LKV+VYELPSKYNKK+L KDPRCL HMFAAEIFMHRFLLSSP
Sbjct: 32 ERISGSAGDVLEDDPVGRLKVFVYELPSKYNKKILQKDPRCLNHMFAAEIFMHRFLLSSP 91
Query: 82 VRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 141
VRTLNPEEADWFYTP+Y TCDLTP GLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF
Sbjct: 92 VRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 151
Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
FV PHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCL EGSITIPPYAPPQKM
Sbjct: 152 FVTPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLKEGSITIPPYAPPQKM 211
Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
H IP TPRSIFVYFRGLFYDV NDPEGGYYARGARAAVWENFK+NPLFDIST+HPTT
Sbjct: 212 HTHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENFKDNPLFDISTEHPTT 271
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
YYEDMQRA+FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG+FV E
Sbjct: 272 YYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVDE 331
Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHD 381
+DVP+LDTILTSIPPEVILRKQRLLANPSMK+AMLFPQPAQPGDAFHQ+LNGLARKLPHD
Sbjct: 332 KDVPQLDTILTSIPPEVILRKQRLLANPSMKQAMLFPQPAQPGDAFHQVLNGLARKLPHD 391
Query: 382 KSTYLKPGGKFLNWTSGPVGDLKPW 406
+S +LKPG K LNWT+GPVGDLKPW
Sbjct: 392 RSIFLKPGEKALNWTAGPVGDLKPW 416
>gi|317106607|dbj|BAJ53114.1| JHL07K02.4 [Jatropha curcas]
Length = 417
Score = 747 bits (1928), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 358/409 (87%), Positives = 377/409 (92%), Gaps = 6/409 (1%)
Query: 4 TAIALLLAFVSADNTPKI------ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLL 57
I L A VS+ K+ ERISGSAGDVLEDDPVG+LKV+VYELPSKYNKK+L
Sbjct: 9 VGILLCAATVSSIGAVKLGRSQPTERISGSAGDVLEDDPVGRLKVFVYELPSKYNKKILQ 68
Query: 58 KDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRM 117
KD RCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP+Y TCDLTP GLPLPFKSPRM
Sbjct: 69 KDSRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRM 128
Query: 118 MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTF 177
MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTF
Sbjct: 129 MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTF 188
Query: 178 GQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARG 237
GQRNHVCL +GSITIPPYAPPQKMQAH IP TPRSIFVYFRGLFYDV NDPEGGYYARG
Sbjct: 189 GQRNHVCLKDGSITIPPYAPPQKMQAHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARG 248
Query: 238 ARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVI 297
ARAAVWENFK+NPLFDIST+HP+TYYEDMQRA+FCLCPLGWAPWSPRLVEAV+FGCIPVI
Sbjct: 249 ARAAVWENFKDNPLFDISTEHPSTYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGCIPVI 308
Query: 298 IADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLF 357
IADDIVLPFADAIPWE+IG+FV E+DVP LD ILTSIPPEVILRKQRLLANPSMK+AMLF
Sbjct: 309 IADDIVLPFADAIPWEQIGVFVDEKDVPNLDAILTSIPPEVILRKQRLLANPSMKQAMLF 368
Query: 358 PQPAQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
PQP QPGDAFHQ+LNGLARKLPHD+S YLKPG K LNWT+GPVGDLKPW
Sbjct: 369 PQPVQPGDAFHQVLNGLARKLPHDRSIYLKPGEKILNWTAGPVGDLKPW 417
>gi|307135997|gb|ADN33853.1| glucuronoxylan glucuronosyltransferase [Cucumis melo subsp. melo]
Length = 416
Score = 745 bits (1923), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 353/411 (85%), Positives = 377/411 (91%), Gaps = 6/411 (1%)
Query: 2 WVTAIALLLAFVSA------DNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKL 55
W + L AF++ + + + ERISGSAGDVLED+PVG+LKV+VYELPSKYNKK+
Sbjct: 6 WAIFVPLFAAFLTTTDAIVRERSQRTERISGSAGDVLEDNPVGRLKVFVYELPSKYNKKI 65
Query: 56 LLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSP 115
L KDPRCL HMFAAEIFMHRFLL+SPVRTLNPEEADWFYTP+Y TCDLTP GLPLPFKSP
Sbjct: 66 LQKDPRCLNHMFAAEIFMHRFLLTSPVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSP 125
Query: 116 RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ 175
RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ
Sbjct: 126 RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ 185
Query: 176 TFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYA 235
TFGQRNHVCL EGSITIPPYAPPQKM AH IP TPRSIFVYFRGLFYDV NDPEGGYYA
Sbjct: 186 TFGQRNHVCLKEGSITIPPYAPPQKMHAHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYA 245
Query: 236 RGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIP 295
RGARAAVWENFK+NPLFDIST+HPTTYYEDMQRA+FCLCPLGWAPWSPRLVEAV+FGCIP
Sbjct: 246 RGARAAVWENFKDNPLFDISTEHPTTYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGCIP 305
Query: 296 VIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAM 355
VIIADDIVLPFADAIPWEEIG+F+ E+DV LDTILTSIP E+ILRKQRLLANPSMK+AM
Sbjct: 306 VIIADDIVLPFADAIPWEEIGVFLDEKDVANLDTILTSIPLEMILRKQRLLANPSMKQAM 365
Query: 356 LFPQPAQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
LFPQPAQPGDAFHQ+LNGLARKLPHD+S YLK G K LNWT+GPVGDLKPW
Sbjct: 366 LFPQPAQPGDAFHQVLNGLARKLPHDRSVYLKAGEKILNWTAGPVGDLKPW 416
>gi|449440484|ref|XP_004138014.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
[Cucumis sativus]
gi|449528307|ref|XP_004171146.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
[Cucumis sativus]
Length = 416
Score = 744 bits (1922), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/411 (85%), Positives = 377/411 (91%), Gaps = 6/411 (1%)
Query: 2 WVTAIALLLAFVSA------DNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKL 55
W + L AF++ + + + ERISGSAGDVLED+PVG+LKV+VYELPSKYNKK+
Sbjct: 6 WAIFVPLFAAFLTTTDAIVRERSQRTERISGSAGDVLEDNPVGRLKVFVYELPSKYNKKI 65
Query: 56 LLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSP 115
L KDPRCL HMFAAEIFMHRFLL+SPVRTLNPEEADWFYTP+Y TCDLTP GLPLPFKSP
Sbjct: 66 LQKDPRCLNHMFAAEIFMHRFLLTSPVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSP 125
Query: 116 RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ 175
RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ
Sbjct: 126 RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ 185
Query: 176 TFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYA 235
TFGQRNHVCL EGSITIPPYAPPQKM AH IP TPRSIFVYFRGLFYDV NDPEGGYYA
Sbjct: 186 TFGQRNHVCLKEGSITIPPYAPPQKMHAHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYA 245
Query: 236 RGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIP 295
RGARAAVWENFK+NPLFDIST+HPTTYYEDMQRA+FCLCPLGWAPWSPRLVEAV+FGCIP
Sbjct: 246 RGARAAVWENFKDNPLFDISTEHPTTYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGCIP 305
Query: 296 VIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAM 355
VIIADDIVLPFADAIPWEEIG+F+ E+DV LDTILTSIP E+ILRKQRLLANPSMK+AM
Sbjct: 306 VIIADDIVLPFADAIPWEEIGVFLDEKDVANLDTILTSIPLEMILRKQRLLANPSMKQAM 365
Query: 356 LFPQPAQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
LFPQPAQPGDAFHQ+LNGLARKLPHD+S Y+K G K LNWT+GPVGDLKPW
Sbjct: 366 LFPQPAQPGDAFHQVLNGLARKLPHDRSVYIKAGEKILNWTAGPVGDLKPW 416
>gi|356512936|ref|XP_003525170.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
[Glycine max]
Length = 416
Score = 744 bits (1921), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/385 (91%), Positives = 367/385 (95%)
Query: 22 ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP 81
ERISGSAGDVLEDDPVG+LKV+VYELPSKYNKK+L KDPRCL HMFAAEIFMHRFLLSSP
Sbjct: 32 ERISGSAGDVLEDDPVGRLKVFVYELPSKYNKKILQKDPRCLNHMFAAEIFMHRFLLSSP 91
Query: 82 VRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 141
VRTLNPEEADWFYTP+Y TCDLTP GLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF
Sbjct: 92 VRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 151
Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
FVVPHDFGACFHYQEEKAIERGIL LL+RATLVQTFGQRNHVCL EGSITIPPYAPPQKM
Sbjct: 152 FVVPHDFGACFHYQEEKAIERGILTLLRRATLVQTFGQRNHVCLKEGSITIPPYAPPQKM 211
Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
H IP TPRSIFVYFRGLFYDV NDPEGGYYARGARAAVWENFK+N LFDIST+HPTT
Sbjct: 212 HTHLIPDKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENFKDNLLFDISTEHPTT 271
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
YYEDMQRA+FCLCPLGWAPWSPRLVEAV+FGCIPVIIADDIVLPFADAIPWEEIG+FV E
Sbjct: 272 YYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPFADAIPWEEIGVFVDE 331
Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHD 381
EDVPKLDTILTSIPPEVILRKQRLLANPSMK+AMLFPQPAQPGDAFHQ+LNGLARKLPHD
Sbjct: 332 EDVPKLDTILTSIPPEVILRKQRLLANPSMKQAMLFPQPAQPGDAFHQVLNGLARKLPHD 391
Query: 382 KSTYLKPGGKFLNWTSGPVGDLKPW 406
+ +LKPG K LNWT+GPVGDLKPW
Sbjct: 392 NTVFLKPGEKILNWTAGPVGDLKPW 416
>gi|18424516|ref|NP_568941.1| Exostosin family protein [Arabidopsis thaliana]
gi|75163931|sp|Q940Q8.1|IX10L_ARATH RecName: Full=Probable beta-1,4-xylosyltransferase IRX10L; AltName:
Full=Glucuronoxylan glucuronosyltransferase 1;
Short=AtGUT1; AltName: Full=Glucuronoxylan
glucuronosyltransferase 2; Short=AtGUT2; AltName:
Full=Protein IRREGULAR XYLEM 10-like; AltName:
Full=Xylan xylosyltransferase IRX10L
gi|15809826|gb|AAL06841.1| AT5g61840/mac9_140 [Arabidopsis thaliana]
gi|17978869|gb|AAL47406.1| AT5g61840/mac9_140 [Arabidopsis thaliana]
gi|332010141|gb|AED97524.1| Exostosin family protein [Arabidopsis thaliana]
Length = 415
Score = 739 bits (1907), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 347/385 (90%), Positives = 367/385 (95%)
Query: 22 ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP 81
ERISGSAGDVLEDDPVG+LKV+VYELPSKYNKK+L KDPRCL HMFAAEI+M RFLLSSP
Sbjct: 31 ERISGSAGDVLEDDPVGRLKVFVYELPSKYNKKILQKDPRCLNHMFAAEIYMQRFLLSSP 90
Query: 82 VRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 141
VRTLNPEEADWFY P+Y TCDLTP GLPLPFKSPRMMRSAIQLI+SNWPYWNRTEGADHF
Sbjct: 91 VRTLNPEEADWFYVPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIASNWPYWNRTEGADHF 150
Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
FVVPHDFGACFHYQEEKAI RGILPLLQRATLVQTFGQRNHVCL EGSIT+PPYAPPQKM
Sbjct: 151 FVVPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQRNHVCLKEGSITVPPYAPPQKM 210
Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
Q+H IP TPRSIFVYFRGLFYDV NDPEGGYYARGARAAVWENFK+NPLFDIST+HPTT
Sbjct: 211 QSHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENFKDNPLFDISTEHPTT 270
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
YYEDMQRAIFCLCPLGWAPWSPRLVEAV+FGCIPVIIADDIVLPFADAIPWE+IG+FV E
Sbjct: 271 YYEDMQRAIFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPFADAIPWEDIGVFVDE 330
Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHD 381
+DVP LDTILTSIPPEVILRKQRLLANPSMK+AMLFPQPAQPGDAFHQ+LNGLARKLPH+
Sbjct: 331 KDVPYLDTILTSIPPEVILRKQRLLANPSMKQAMLFPQPAQPGDAFHQVLNGLARKLPHE 390
Query: 382 KSTYLKPGGKFLNWTSGPVGDLKPW 406
+S YL+PG K LNWT+GPV DLKPW
Sbjct: 391 RSVYLRPGEKLLNWTAGPVADLKPW 415
>gi|297797147|ref|XP_002866458.1| hypothetical protein ARALYDRAFT_496352 [Arabidopsis lyrata subsp.
lyrata]
gi|297312293|gb|EFH42717.1| hypothetical protein ARALYDRAFT_496352 [Arabidopsis lyrata subsp.
lyrata]
Length = 417
Score = 738 bits (1906), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 350/404 (86%), Positives = 377/404 (93%), Gaps = 1/404 (0%)
Query: 3 VTAIALLLAFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRC 62
+T ++ + AF + + P ERISGSAGDVLED+PVG+LKV+VYELPSKYNKK+L KDPRC
Sbjct: 15 LTFLSSISAFRLSRSQPT-ERISGSAGDVLEDNPVGRLKVFVYELPSKYNKKILQKDPRC 73
Query: 63 LTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAI 122
L HMFAAEI+M RFLLSSPVRTLNPEEADWFY P+Y TCDLTP GLPLPFKSPRMMRSAI
Sbjct: 74 LNHMFAAEIYMQRFLLSSPVRTLNPEEADWFYVPVYTTCDLTPNGLPLPFKSPRMMRSAI 133
Query: 123 QLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNH 182
QLI+SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAI RGILPLLQRATLVQTFGQRNH
Sbjct: 134 QLIASNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQRNH 193
Query: 183 VCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAV 242
VCL EGSIT+PPYAPPQKMQ+H IP TPRSIFVYFRGLFYDV NDPEGGYYARGARAAV
Sbjct: 194 VCLKEGSITVPPYAPPQKMQSHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAV 253
Query: 243 WENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 302
WENFK+NPLFDIST+HPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAV+FGCIPVIIADDI
Sbjct: 254 WENFKDNPLFDISTEHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDI 313
Query: 303 VLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQ 362
VLPFADAIPWE+IG+FV E+DVP LDTILTSIPPEVILRKQRLLANPSMK+AMLFPQPAQ
Sbjct: 314 VLPFADAIPWEDIGVFVDEKDVPYLDTILTSIPPEVILRKQRLLANPSMKQAMLFPQPAQ 373
Query: 363 PGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
PGDAFHQ+LNGLARKLPH++S YL+PG K LNWT+GPV DLKPW
Sbjct: 374 PGDAFHQVLNGLARKLPHERSVYLRPGEKLLNWTAGPVADLKPW 417
>gi|21592991|gb|AAM64940.1| unknown [Arabidopsis thaliana]
Length = 415
Score = 736 bits (1900), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 346/385 (89%), Positives = 366/385 (95%)
Query: 22 ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP 81
ERISGSAGDVLEDDPVG+LKV+VYELPSKYNKK+L KDPRCL HMFAAEI+M RFLLSSP
Sbjct: 31 ERISGSAGDVLEDDPVGRLKVFVYELPSKYNKKILQKDPRCLNHMFAAEIYMQRFLLSSP 90
Query: 82 VRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 141
VRTLNPEEADWFY P+Y TCDLTP GLPLPFKSPRMMRSAIQLI+SNWPYWNRTEGADHF
Sbjct: 91 VRTLNPEEADWFYVPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIASNWPYWNRTEGADHF 150
Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
FVVPHDF ACFHYQEEKAI RGILPLLQRATLVQTFGQRNHVCL EGSIT+PPYAPPQKM
Sbjct: 151 FVVPHDFRACFHYQEEKAIGRGILPLLQRATLVQTFGQRNHVCLKEGSITVPPYAPPQKM 210
Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
Q+H IP TPRSIFVYFRGLFYDV NDPEGGYYARGARAAVWENFK+NPLFDIST+HPTT
Sbjct: 211 QSHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENFKDNPLFDISTEHPTT 270
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
YYEDMQRAIFCLCPLGWAPWSPRLVEAV+FGCIPVIIADDIVLPFADAIPWE+IG+FV E
Sbjct: 271 YYEDMQRAIFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPFADAIPWEDIGVFVDE 330
Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHD 381
+DVP LDTILTSIPPEVILRKQRLLANPSMK+AMLFPQPAQPGDAFHQ+LNGLARKLPH+
Sbjct: 331 KDVPYLDTILTSIPPEVILRKQRLLANPSMKQAMLFPQPAQPGDAFHQVLNGLARKLPHE 390
Query: 382 KSTYLKPGGKFLNWTSGPVGDLKPW 406
+S YL+PG K LNWT+GPV DLKPW
Sbjct: 391 RSVYLRPGEKLLNWTAGPVADLKPW 415
>gi|116793780|gb|ABK26875.1| unknown [Picea sitchensis]
Length = 420
Score = 735 bits (1898), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 346/387 (89%), Positives = 366/387 (94%)
Query: 20 KIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLS 79
+ ERI+GSAGDVLEDDPVG+LKV++YELPSKYNKK+L KDPRCLTHMFA EIFM+RFLL
Sbjct: 34 RSERIAGSAGDVLEDDPVGRLKVFIYELPSKYNKKILAKDPRCLTHMFATEIFMNRFLLG 93
Query: 80 SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGAD 139
SPVRTLNPEEADWFYTP+Y TCDLTP GLPLPFKSPRMMRSAIQ IS+NWPYWNRTEGAD
Sbjct: 94 SPVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQYISTNWPYWNRTEGAD 153
Query: 140 HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQ 199
HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR HVCL +GSIT+PPYAPPQ
Sbjct: 154 HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRYHVCLKKGSITVPPYAPPQ 213
Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
KMQAH IPP TPRSIFVYFRGLFYDV NDPEGGYYARGARA+VWENFK+NPLFDIST+HP
Sbjct: 214 KMQAHLIPPSTPRSIFVYFRGLFYDVGNDPEGGYYARGARASVWENFKDNPLFDISTEHP 273
Query: 260 TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV 319
TYYEDMQRAIFCLCPLGWAPWSPRLVE V+FGCIPVIIADDIVLPFADAIPWEEIG+FV
Sbjct: 274 ATYYEDMQRAIFCLCPLGWAPWSPRLVEGVIFGCIPVIIADDIVLPFADAIPWEEIGVFV 333
Query: 320 AEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
AE+DVP LDTILTSIPPEVILRKQRLLANP+MK+AMLFPQPAQPGDAFHQILNGLARKLP
Sbjct: 334 AEKDVPNLDTILTSIPPEVILRKQRLLANPAMKQAMLFPQPAQPGDAFHQILNGLARKLP 393
Query: 380 HDKSTYLKPGGKFLNWTSGPVGDLKPW 406
HD S YLKP K LNWT+GP GDLKPW
Sbjct: 394 HDSSIYLKPNQKILNWTAGPPGDLKPW 420
>gi|75159221|sp|Q8S1X7.1|GT15_ORYSJ RecName: Full=Probable glucuronosyltransferase Os01g0926700
gi|20160728|dbj|BAB89670.1| P0482D04.17 [Oryza sativa Japonica Group]
gi|20805226|dbj|BAB92893.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
gi|125528942|gb|EAY77056.1| hypothetical protein OsI_05014 [Oryza sativa Indica Group]
gi|125573178|gb|EAZ14693.1| hypothetical protein OsJ_04618 [Oryza sativa Japonica Group]
Length = 417
Score = 734 bits (1894), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 345/387 (89%), Positives = 369/387 (95%)
Query: 20 KIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLS 79
+ ERISGSAGDVLEDDPVG+LKVYVY+LPSKYNKKLL KDPRCL HMFAAEIFMHRFLLS
Sbjct: 31 QTERISGSAGDVLEDDPVGRLKVYVYDLPSKYNKKLLKKDPRCLNHMFAAEIFMHRFLLS 90
Query: 80 SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGAD 139
S VRT NPEEADWFYTP+Y TCDLTP+GLPLPFKSPRMMRSAI+LI++NWPYWNR+EGAD
Sbjct: 91 SAVRTFNPEEADWFYTPVYTTCDLTPSGLPLPFKSPRMMRSAIELIATNWPYWNRSEGAD 150
Query: 140 HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQ 199
HFFV PHDFGACFHYQEEKAI RGILPLLQRATLVQTFGQ+NHVCL +GSITIPPYAPPQ
Sbjct: 151 HFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLKDGSITIPPYAPPQ 210
Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
KMQAH IPPDTPRSIFVYFRGLFYD +NDPEGGYYARGARA+VWENFKNNPLFDISTDHP
Sbjct: 211 KMQAHLIPPDTPRSIFVYFRGLFYDTSNDPEGGYYARGARASVWENFKNNPLFDISTDHP 270
Query: 260 TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV 319
TYYEDMQR++FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG+FV
Sbjct: 271 PTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFV 330
Query: 320 AEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
AEEDVPKLD+ILTSIP +VILRKQRLLANPSMK+AMLFPQPAQ GDAFHQILNGLARKLP
Sbjct: 331 AEEDVPKLDSILTSIPTDVILRKQRLLANPSMKQAMLFPQPAQAGDAFHQILNGLARKLP 390
Query: 380 HDKSTYLKPGGKFLNWTSGPVGDLKPW 406
H ++ +LKPG + LNWT+GPVGDLKPW
Sbjct: 391 HGENVFLKPGERALNWTAGPVGDLKPW 417
>gi|212276027|ref|NP_001130448.1| uncharacterized protein LOC100191546 precursor [Zea mays]
gi|194689154|gb|ACF78661.1| unknown [Zea mays]
gi|413946366|gb|AFW79015.1| putative Secondary cell wall glycosyltransferase family 47 [Zea
mays]
Length = 415
Score = 729 bits (1883), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 345/397 (86%), Positives = 369/397 (92%), Gaps = 3/397 (0%)
Query: 10 LAFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAA 69
+A V+A T ERI GSAGDVLEDDPVG+LKVYVY+LPSKYNKKLL KDPRCL HMFAA
Sbjct: 22 VAVVTAQET---ERIEGSAGDVLEDDPVGRLKVYVYDLPSKYNKKLLKKDPRCLNHMFAA 78
Query: 70 EIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNW 129
EIFMHRFLLSS VRT NPEEADWFYTP+Y TCDLTP GLPLPFKSPRMMRSAIQLI++NW
Sbjct: 79 EIFMHRFLLSSAVRTFNPEEADWFYTPVYTTCDLTPKGLPLPFKSPRMMRSAIQLIATNW 138
Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGS 189
PYWNR+EGADHFFV PHDFGACFHYQEEKAI RGILPLLQRATLVQTFGQ+NHVCL +GS
Sbjct: 139 PYWNRSEGADHFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLKDGS 198
Query: 190 ITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN 249
ITIPPYAPPQKMQ H IPPDTPRSIFVYFRGLFYD NDPEGGYYARGARA+VWENFKNN
Sbjct: 199 ITIPPYAPPQKMQTHLIPPDTPRSIFVYFRGLFYDTGNDPEGGYYARGARASVWENFKNN 258
Query: 250 PLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 309
PLFDISTDHP TYYEDMQR++FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA
Sbjct: 259 PLFDISTDHPPTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 318
Query: 310 IPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQ 369
IPWEEIG+FVAE+DVP+LD+ILTSIP +V+LRKQRLLANPSMK+AMLFPQPAQPGDAFHQ
Sbjct: 319 IPWEEIGVFVAEDDVPQLDSILTSIPTDVVLRKQRLLANPSMKQAMLFPQPAQPGDAFHQ 378
Query: 370 ILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
ILNGLARKLPH + +LKPG + LNWT+GP GDLKPW
Sbjct: 379 ILNGLARKLPHGSNVFLKPGERALNWTAGPPGDLKPW 415
>gi|242088703|ref|XP_002440184.1| hypothetical protein SORBIDRAFT_09g027440 [Sorghum bicolor]
gi|241945469|gb|EES18614.1| hypothetical protein SORBIDRAFT_09g027440 [Sorghum bicolor]
Length = 415
Score = 728 bits (1880), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 344/397 (86%), Positives = 368/397 (92%), Gaps = 3/397 (0%)
Query: 10 LAFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAA 69
+A V+A T ERI GSAGDVLED+PVG+LKVYVY+LPSKYNKKLL KDPRCL HMFAA
Sbjct: 22 IAVVTAQET---ERIEGSAGDVLEDNPVGRLKVYVYDLPSKYNKKLLKKDPRCLNHMFAA 78
Query: 70 EIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNW 129
EIFMHRFLLSS VRT NPEEADWFYTP+Y TCDLTP GLPLPFKSPRMMRSAIQLI++NW
Sbjct: 79 EIFMHRFLLSSAVRTFNPEEADWFYTPVYTTCDLTPKGLPLPFKSPRMMRSAIQLIATNW 138
Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGS 189
PYWNR+EGADHFFV PHDFGACFHYQEEKAI RGILPLLQRATLVQTFGQ+NHVCL +GS
Sbjct: 139 PYWNRSEGADHFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLKDGS 198
Query: 190 ITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN 249
ITIPPYAPPQKMQ H IP DTPRSIFVYFRGLFYD NDPEGGYYARGARA+VWENFKNN
Sbjct: 199 ITIPPYAPPQKMQTHLIPADTPRSIFVYFRGLFYDTGNDPEGGYYARGARASVWENFKNN 258
Query: 250 PLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 309
PLFDISTDHP TYYEDMQR++FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA
Sbjct: 259 PLFDISTDHPPTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 318
Query: 310 IPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQ 369
IPWEEIG+FVAEEDVP+LD+ILTSIP +V+LRKQRLLANPSMK+AMLFPQPAQPGDAFHQ
Sbjct: 319 IPWEEIGVFVAEEDVPQLDSILTSIPTDVVLRKQRLLANPSMKQAMLFPQPAQPGDAFHQ 378
Query: 370 ILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
ILNGLARKLPH + +LKPG + LNWT+GP GDLKPW
Sbjct: 379 ILNGLARKLPHGSNVFLKPGERVLNWTAGPPGDLKPW 415
>gi|242059743|ref|XP_002459017.1| hypothetical protein SORBIDRAFT_03g044530 [Sorghum bicolor]
gi|241930992|gb|EES04137.1| hypothetical protein SORBIDRAFT_03g044530 [Sorghum bicolor]
Length = 420
Score = 726 bits (1874), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 341/387 (88%), Positives = 366/387 (94%)
Query: 20 KIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLS 79
+ ERISGSAGDVLED+PVG+LKVYVY+LPSKYNKKLL KDPRCL HMFAAEIFMHRFLLS
Sbjct: 34 QTERISGSAGDVLEDNPVGRLKVYVYDLPSKYNKKLLKKDPRCLNHMFAAEIFMHRFLLS 93
Query: 80 SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGAD 139
S VRT NPEEADWFYTP+Y TCDLTP+GLPLPFKSPRMMRSAI+LI++NWPYWNR+EGAD
Sbjct: 94 SAVRTFNPEEADWFYTPVYTTCDLTPSGLPLPFKSPRMMRSAIELIATNWPYWNRSEGAD 153
Query: 140 HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQ 199
HFFV PHDFGACFHYQEEKAI RGILPLLQRATLVQTFGQ+NHVCL +GSITIPP+APPQ
Sbjct: 154 HFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLKDGSITIPPFAPPQ 213
Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
KMQAH IP DTPRSIFVYFRGLFYD +NDPEGGYYARGARA+VWENFKNNPLFDISTDHP
Sbjct: 214 KMQAHLIPADTPRSIFVYFRGLFYDTSNDPEGGYYARGARASVWENFKNNPLFDISTDHP 273
Query: 260 TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV 319
TYYEDMQR++FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG+FV
Sbjct: 274 PTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFV 333
Query: 320 AEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
AEEDVPKLD+ILTSIP +VILRKQRLLANPSMK+AMLFPQPAQ GDAFHQILNGLARKLP
Sbjct: 334 AEEDVPKLDSILTSIPTDVILRKQRLLANPSMKQAMLFPQPAQAGDAFHQILNGLARKLP 393
Query: 380 HDKSTYLKPGGKFLNWTSGPVGDLKPW 406
H + +LKPG + LNWT+GP GDLKPW
Sbjct: 394 HGDNVFLKPGERVLNWTAGPPGDLKPW 420
>gi|226499780|ref|NP_001150738.1| secondary cell wall-related glycosyltransferase family 47 precursor
[Zea mays]
gi|195641374|gb|ACG40155.1| secondary cell wall-related glycosyltransferase family 47 [Zea
mays]
gi|223974207|gb|ACN31291.1| unknown [Zea mays]
Length = 415
Score = 724 bits (1868), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 339/394 (86%), Positives = 365/394 (92%)
Query: 13 VSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIF 72
++A + ERI GSAGDVLED+PVG+LKV+VY+LPSKYNKKLL KDPRCL HMFAAEIF
Sbjct: 22 IAAVTAQETERIEGSAGDVLEDNPVGRLKVFVYDLPSKYNKKLLKKDPRCLNHMFAAEIF 81
Query: 73 MHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYW 132
MHRFLLSS VRT NPEEADWFYTP+Y TCDLTP GLPLPFKSPRMMRSAIQLI++NWPYW
Sbjct: 82 MHRFLLSSAVRTFNPEEADWFYTPVYTTCDLTPKGLPLPFKSPRMMRSAIQLIATNWPYW 141
Query: 133 NRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITI 192
NR+EGADHFFV PHDFGACFHYQEEKAI RGILPLLQRATLVQTFGQ+NHVCL +GSITI
Sbjct: 142 NRSEGADHFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLKDGSITI 201
Query: 193 PPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLF 252
PPYAPPQKMQ H IP DTPRSIFVYFRGLFYD NDPEGGYYARGARA+VWENFKNNPLF
Sbjct: 202 PPYAPPQKMQTHLIPADTPRSIFVYFRGLFYDTGNDPEGGYYARGARASVWENFKNNPLF 261
Query: 253 DISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 312
DISTDHP TYYEDMQR++FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW
Sbjct: 262 DISTDHPPTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 321
Query: 313 EEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILN 372
EEIG+FVAEEDVP+LD+ILTSIP +V+LRKQRLLANPSMK+AMLFPQPAQ GDAFHQILN
Sbjct: 322 EEIGVFVAEEDVPRLDSILTSIPTDVVLRKQRLLANPSMKQAMLFPQPAQAGDAFHQILN 381
Query: 373 GLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
GLARKLPH + +LKPG + LNWT+GP GDLKPW
Sbjct: 382 GLARKLPHGSNVFLKPGERALNWTAGPPGDLKPW 415
>gi|357126596|ref|XP_003564973.1| PREDICTED: probable glucuronosyltransferase Os01g0926600-like
[Brachypodium distachyon]
Length = 411
Score = 723 bits (1865), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 343/394 (87%), Positives = 366/394 (92%), Gaps = 3/394 (0%)
Query: 13 VSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIF 72
VSA +T ERI GSAGDVLEDDPVG+LKVYVYELP+KYNKK++ KD RCL+HMFAAEIF
Sbjct: 21 VSAQDT---ERIEGSAGDVLEDDPVGRLKVYVYELPTKYNKKMVAKDSRCLSHMFAAEIF 77
Query: 73 MHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYW 132
MHRFLLSS +RT+NPEEADWFYTP+Y TCDLTP G PLPFKSPR+MRSAIQ ISS+WPYW
Sbjct: 78 MHRFLLSSAIRTMNPEEADWFYTPVYTTCDLTPWGHPLPFKSPRIMRSAIQFISSHWPYW 137
Query: 133 NRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITI 192
NRT GADHFFVVPHDFGACFHYQEEKAIERGILPLL+RATLVQTFGQ++HVCL EGSI I
Sbjct: 138 NRTAGADHFFVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQKDHVCLKEGSINI 197
Query: 193 PPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLF 252
PPYAPPQKM+ H +PP+TPRSIFVYFRGLFYD NDPEGGYYARGARA+VWENFKNNPLF
Sbjct: 198 PPYAPPQKMKTHLVPPETPRSIFVYFRGLFYDTANDPEGGYYARGARASVWENFKNNPLF 257
Query: 253 DISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 312
DISTDHP TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW
Sbjct: 258 DISTDHPPTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 317
Query: 313 EEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILN 372
+EIG+FVAE+DVPKLDTILTSIP EVILRKQRLLANPSMK+AMLFPQPAQPGDAFHQILN
Sbjct: 318 DEIGVFVAEDDVPKLDTILTSIPMEVILRKQRLLANPSMKQAMLFPQPAQPGDAFHQILN 377
Query: 373 GLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
GLARKLPH+K YL PG K LNWT GP GDLKPW
Sbjct: 378 GLARKLPHNKGVYLTPGQKALNWTQGPKGDLKPW 411
>gi|115441967|ref|NP_001045263.1| Os01g0926600 [Oryza sativa Japonica Group]
gi|75159222|sp|Q8S1X8.1|GT14_ORYSJ RecName: Full=Probable glucuronosyltransferase Os01g0926600
gi|20160727|dbj|BAB89669.1| P0482D04.16 [Oryza sativa Japonica Group]
gi|20805225|dbj|BAB92892.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
gi|113534794|dbj|BAF07177.1| Os01g0926600 [Oryza sativa Japonica Group]
gi|125528941|gb|EAY77055.1| hypothetical protein OsI_05013 [Oryza sativa Indica Group]
gi|125573177|gb|EAZ14692.1| hypothetical protein OsJ_04617 [Oryza sativa Japonica Group]
gi|215687152|dbj|BAG90922.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 415
Score = 722 bits (1863), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 340/385 (88%), Positives = 363/385 (94%)
Query: 22 ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP 81
ERI GSAGDVLEDDPVG+LKVYVYELP+KYNKK++ KD RCL+HMFAAEIFMHRFLLSS
Sbjct: 31 ERIEGSAGDVLEDDPVGRLKVYVYELPTKYNKKMVAKDSRCLSHMFAAEIFMHRFLLSSA 90
Query: 82 VRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 141
+RTLNPEEADWFYTP+Y TCDLTP G PLPFKSPR+MRSAIQ ISS+WPYWNRT+GADHF
Sbjct: 91 IRTLNPEEADWFYTPVYTTCDLTPWGHPLPFKSPRIMRSAIQFISSHWPYWNRTDGADHF 150
Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
FVVPHDFGACFHYQEEKAIERGILPLL+RATLVQTFGQ++HVCL EGSITIPPYAPPQKM
Sbjct: 151 FVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQKDHVCLKEGSITIPPYAPPQKM 210
Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
+ H +PP+TPRSIFVYFRGLFYD NDPEGGYYARGARA+VWENFKNNPLFDISTDHP T
Sbjct: 211 KTHLVPPETPRSIFVYFRGLFYDTANDPEGGYYARGARASVWENFKNNPLFDISTDHPPT 270
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
YYEDMQR+IFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW+EIG+FVAE
Sbjct: 271 YYEDMQRSIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWDEIGVFVAE 330
Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHD 381
+DVPKLDTILTSIP +VILRKQRLLANPSMK+AMLFPQPAQPGDAFHQILNGL RKLPH
Sbjct: 331 DDVPKLDTILTSIPMDVILRKQRLLANPSMKQAMLFPQPAQPGDAFHQILNGLGRKLPHP 390
Query: 382 KSTYLKPGGKFLNWTSGPVGDLKPW 406
KS YL PG K LNWT GPVGDLKPW
Sbjct: 391 KSVYLDPGQKVLNWTQGPVGDLKPW 415
>gi|413951430|gb|AFW84079.1| secondary cell wall glycosyltransferase family 47 [Zea mays]
Length = 421
Score = 721 bits (1862), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 339/387 (87%), Positives = 365/387 (94%)
Query: 20 KIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLS 79
+ ERISGSAGDVLED+PVG+LKVYVY+LPSKYNKKL+ KDPRCL HMFAAEIFMHRFLLS
Sbjct: 35 QTERISGSAGDVLEDNPVGRLKVYVYDLPSKYNKKLVKKDPRCLNHMFAAEIFMHRFLLS 94
Query: 80 SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGAD 139
S VRT NPEEADWFYTP+Y TCDLTP+GLPLPFKSPRMMRSAI+LI++NWPYWNR+EGAD
Sbjct: 95 SAVRTFNPEEADWFYTPVYATCDLTPSGLPLPFKSPRMMRSAIELIATNWPYWNRSEGAD 154
Query: 140 HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQ 199
HFFV PHDFGACFHYQEEKAI RGILPLLQRATLVQTFGQ+NHVCL GSITIPP+APPQ
Sbjct: 155 HFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLKGGSITIPPFAPPQ 214
Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
KMQAH IP DTPRSIFVYFRGLFYD +NDPEGGYYARGARA+VWENFKNNPLFDISTDHP
Sbjct: 215 KMQAHLIPLDTPRSIFVYFRGLFYDTSNDPEGGYYARGARASVWENFKNNPLFDISTDHP 274
Query: 260 TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV 319
TYYEDMQR++FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG+FV
Sbjct: 275 PTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFV 334
Query: 320 AEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
AEEDVPKLD+ILTSIP +VILRKQRLLANP+MK+AMLFPQPAQ GDAFHQILNGLARKLP
Sbjct: 335 AEEDVPKLDSILTSIPTDVILRKQRLLANPAMKQAMLFPQPAQAGDAFHQILNGLARKLP 394
Query: 380 HDKSTYLKPGGKFLNWTSGPVGDLKPW 406
H + +LKPG + LNWT+GP GDLKPW
Sbjct: 395 HGDNVFLKPGERVLNWTAGPPGDLKPW 421
>gi|301072484|gb|ADK56172.1| glycosyltransferase 47 [Triticum aestivum]
Length = 415
Score = 721 bits (1861), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 339/385 (88%), Positives = 361/385 (93%)
Query: 22 ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP 81
ERI GSAGDVLEDDPVG+LKVYVYELP+KYNKK++ KD RCL+HMFAAEIFMHRFLLSS
Sbjct: 31 ERIEGSAGDVLEDDPVGRLKVYVYELPTKYNKKMVAKDSRCLSHMFAAEIFMHRFLLSSA 90
Query: 82 VRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 141
+RT+NPEEADWFYTP+Y TCDLTP G PLPFKSPR+MRSA+Q ISS+WPYWNRT GADHF
Sbjct: 91 IRTMNPEEADWFYTPVYTTCDLTPWGHPLPFKSPRIMRSAVQFISSHWPYWNRTAGADHF 150
Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
FVVPHDFGACFHYQEEKAIERGILPLL+RATLVQTFGQ++HVCL EGSI IPPYAPPQKM
Sbjct: 151 FVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQKDHVCLKEGSINIPPYAPPQKM 210
Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
+ H +PP+TPRSIFVYFRGLFYD NDPEGGYYARGARA+VWENFKNNPLFDISTDHP T
Sbjct: 211 KTHLVPPETPRSIFVYFRGLFYDTANDPEGGYYARGARASVWENFKNNPLFDISTDHPPT 270
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW+EIG+FVAE
Sbjct: 271 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWDEIGVFVAE 330
Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHD 381
+DVPKLDTILTSIP EVILRKQRLLANPSMK+AMLFPQPAQPGDAFHQILNGLARKLPH
Sbjct: 331 DDVPKLDTILTSIPMEVILRKQRLLANPSMKQAMLFPQPAQPGDAFHQILNGLARKLPHS 390
Query: 382 KSTYLKPGGKFLNWTSGPVGDLKPW 406
KS YL PG K LNWT GP GDLKPW
Sbjct: 391 KSVYLTPGQKALNWTQGPKGDLKPW 415
>gi|195619204|gb|ACG31432.1| secondary cell wall-related glycosyltransferase family 47 [Zea
mays]
Length = 419
Score = 721 bits (1860), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 341/404 (84%), Positives = 367/404 (90%), Gaps = 2/404 (0%)
Query: 5 AIALLL--AFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRC 62
A++LL V+ ERI GSAGDVLEDDPVG+LKVYVYELP+KYNKK++ KD RC
Sbjct: 16 AVSLLACSGAVAVARAQDTERIEGSAGDVLEDDPVGRLKVYVYELPTKYNKKMVAKDSRC 75
Query: 63 LTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAI 122
L+HMFAAEIFMHRFLLSS +RTLNPEEADWFYTP+Y TCDLTP G PLPFKSPR+MRSAI
Sbjct: 76 LSHMFAAEIFMHRFLLSSAIRTLNPEEADWFYTPVYTTCDLTPWGHPLPFKSPRIMRSAI 135
Query: 123 QLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNH 182
Q IS+ WPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERG+LPLL+RATLVQTFGQ++H
Sbjct: 136 QFISNRWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGVLPLLRRATLVQTFGQKDH 195
Query: 183 VCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAV 242
VCL EGSITIPPYAPPQKM+ H +PP TPRSIFVYFRGLFYD NDPEGGYYARGARA+V
Sbjct: 196 VCLKEGSITIPPYAPPQKMKTHLVPPGTPRSIFVYFRGLFYDTANDPEGGYYARGARASV 255
Query: 243 WENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 302
WENFKNNPLFDISTDHP TYYEDMQRA+FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI
Sbjct: 256 WENFKNNPLFDISTDHPPTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 315
Query: 303 VLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQ 362
VLPFADAIPWEEIG+FVAE+DVP+LDTILTSIP EVILRKQRLLANPSMK+AMLFPQPAQ
Sbjct: 316 VLPFADAIPWEEIGVFVAEDDVPRLDTILTSIPMEVILRKQRLLANPSMKQAMLFPQPAQ 375
Query: 363 PGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
GDAFHQILNGLARKLPH + +LKPG K LNWT GP GDLKPW
Sbjct: 376 AGDAFHQILNGLARKLPHGEDVFLKPGQKVLNWTEGPRGDLKPW 419
>gi|326511078|dbj|BAJ91886.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 415
Score = 720 bits (1859), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 339/385 (88%), Positives = 360/385 (93%)
Query: 22 ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP 81
ERI GSAGDVLEDDPVG+LKVYVYELP+KYNKK++ KD RCL+HMFAAEIFMHRFLLSS
Sbjct: 31 ERIEGSAGDVLEDDPVGRLKVYVYELPTKYNKKMVAKDSRCLSHMFAAEIFMHRFLLSSA 90
Query: 82 VRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 141
+RT+NPEEADWFYTP+Y TCDLTP G PLPFKSPR+MRSA+Q ISS+WPYWNRT GADHF
Sbjct: 91 IRTMNPEEADWFYTPVYTTCDLTPWGHPLPFKSPRIMRSAVQFISSHWPYWNRTAGADHF 150
Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
FVVPHDFGACFHYQEEKAIERGILPLL+RATLVQTFGQ++HVCL EGSI IPPYAPPQKM
Sbjct: 151 FVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQKDHVCLKEGSINIPPYAPPQKM 210
Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
+ H +PP+TPRSIFVYFRGLFYD NDPEGGYYARGARA+VWENFKNNPLFDISTDHP T
Sbjct: 211 KTHLVPPETPRSIFVYFRGLFYDTANDPEGGYYARGARASVWENFKNNPLFDISTDHPPT 270
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW+EIGMFVAE
Sbjct: 271 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWDEIGMFVAE 330
Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHD 381
+DVPKLDTILTSIP EVILRKQRLLANPSMK+AMLFPQPAQ GDAFHQILNGLARKLPH
Sbjct: 331 DDVPKLDTILTSIPMEVILRKQRLLANPSMKQAMLFPQPAQAGDAFHQILNGLARKLPHS 390
Query: 382 KSTYLKPGGKFLNWTSGPVGDLKPW 406
KS YL PG K LNWT GP GDLKPW
Sbjct: 391 KSVYLTPGQKALNWTQGPKGDLKPW 415
>gi|242059741|ref|XP_002459016.1| hypothetical protein SORBIDRAFT_03g044520 [Sorghum bicolor]
gi|241930991|gb|EES04136.1| hypothetical protein SORBIDRAFT_03g044520 [Sorghum bicolor]
Length = 417
Score = 720 bits (1859), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 340/394 (86%), Positives = 363/394 (92%)
Query: 13 VSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIF 72
V+ ERI GSAGDVLEDDPVG+LKVYVYELP+KYNKK++ KD RCL+HMFAAEIF
Sbjct: 24 VAVARAQDTERIEGSAGDVLEDDPVGRLKVYVYELPTKYNKKMVAKDSRCLSHMFAAEIF 83
Query: 73 MHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYW 132
MHRFLLSS +RTLNPEEADWFYTP+Y TCDLTP G PLPFKSPR+MRSAIQ IS+ WPYW
Sbjct: 84 MHRFLLSSAIRTLNPEEADWFYTPVYTTCDLTPWGHPLPFKSPRIMRSAIQFISNRWPYW 143
Query: 133 NRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITI 192
NRTEGADHFFVVPHDFGACFHYQEEKAIERG+LPLL+RATLVQTFGQ++HVCL EGSITI
Sbjct: 144 NRTEGADHFFVVPHDFGACFHYQEEKAIERGVLPLLRRATLVQTFGQKDHVCLKEGSITI 203
Query: 193 PPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLF 252
PPYAPPQKM+ H +PP TPRSIFVYFRGLFYD NDPEGGYYARGARA+VWENFKNNPLF
Sbjct: 204 PPYAPPQKMKTHLVPPGTPRSIFVYFRGLFYDTANDPEGGYYARGARASVWENFKNNPLF 263
Query: 253 DISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 312
DISTDHP TYYEDMQRA+FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW
Sbjct: 264 DISTDHPPTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 323
Query: 313 EEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILN 372
EEIG+FVAE+DVPKLDTILTSIP EVILRKQRLLANPSMK+AMLFPQPAQPGDAFHQILN
Sbjct: 324 EEIGVFVAEDDVPKLDTILTSIPMEVILRKQRLLANPSMKQAMLFPQPAQPGDAFHQILN 383
Query: 373 GLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
GLARKLPH + +LKPG K LNWT GP GDLKPW
Sbjct: 384 GLARKLPHGEDVFLKPGLKVLNWTEGPRGDLKPW 417
>gi|363543255|ref|NP_001241842.1| uncharacterized protein LOC100857042 precursor [Zea mays]
gi|194704652|gb|ACF86410.1| unknown [Zea mays]
gi|224034207|gb|ACN36179.1| unknown [Zea mays]
gi|414878896|tpg|DAA56027.1| TPA: hypothetical protein ZEAMMB73_615997 [Zea mays]
Length = 418
Score = 720 bits (1859), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 337/388 (86%), Positives = 366/388 (94%), Gaps = 1/388 (0%)
Query: 20 KIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLS 79
+ ERISGSAGDVLED+PVG+LKVYVY+LPSKYNKKLL KDPRCL+HMFAAEIFMHRFLLS
Sbjct: 31 QTERISGSAGDVLEDNPVGRLKVYVYDLPSKYNKKLLKKDPRCLSHMFAAEIFMHRFLLS 90
Query: 80 SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGAD 139
S VRT NPEEADWFYTP+Y TCDLTP+GLPLPFKSPRMMRSAI+L+++NWPYWNR+EGAD
Sbjct: 91 SAVRTFNPEEADWFYTPVYTTCDLTPSGLPLPFKSPRMMRSAIELVATNWPYWNRSEGAD 150
Query: 140 HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQ 199
HFFV PHDFGACFHYQEEKAI RGILPLLQRATLVQTFGQ+NHVCL +GSITIPP+APPQ
Sbjct: 151 HFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLKDGSITIPPFAPPQ 210
Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
KMQAH IP DTPRSIFVYFRGLFYD +NDPEGGYYARGARA+VWENFKNNPLFDISTDHP
Sbjct: 211 KMQAHLIPADTPRSIFVYFRGLFYDTSNDPEGGYYARGARASVWENFKNNPLFDISTDHP 270
Query: 260 TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV 319
TYYEDMQR++FCLCPLGWAPWSPRLVEAVVFGCIPVI+ADDIVLPFADAIPWE+IG+FV
Sbjct: 271 ATYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIVADDIVLPFADAIPWEDIGVFV 330
Query: 320 AEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
AEEDVP+LDTILTSIP +V+LRKQRLLANPSMK+AMLFPQPAQ GDAFHQILNGLARKLP
Sbjct: 331 AEEDVPRLDTILTSIPTDVVLRKQRLLANPSMKQAMLFPQPAQAGDAFHQILNGLARKLP 390
Query: 380 HDKSTYLKPGG-KFLNWTSGPVGDLKPW 406
H +LKPG + LNWT+GP GDLKPW
Sbjct: 391 HGDGVFLKPGDERVLNWTAGPPGDLKPW 418
>gi|194697530|gb|ACF82849.1| unknown [Zea mays]
Length = 418
Score = 719 bits (1857), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 337/388 (86%), Positives = 365/388 (94%), Gaps = 1/388 (0%)
Query: 20 KIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLS 79
+ ERISGSAGDVLED+PVG+LKVYVY+LPSKYNKKLL KDPRCL HMFAAEIFMHRFLLS
Sbjct: 31 QTERISGSAGDVLEDNPVGRLKVYVYDLPSKYNKKLLKKDPRCLNHMFAAEIFMHRFLLS 90
Query: 80 SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGAD 139
S VRT NPEEADWFYTP+Y TCDLTP+GLPLPFKSPRMMRSAI+L+++NWPYWNR+EGAD
Sbjct: 91 SAVRTFNPEEADWFYTPVYTTCDLTPSGLPLPFKSPRMMRSAIELVATNWPYWNRSEGAD 150
Query: 140 HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQ 199
HFFV PHDFGACFHYQEEKAI RGILPLLQRATLVQTFGQ+NHVCL +GSITIPP+APPQ
Sbjct: 151 HFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLKDGSITIPPFAPPQ 210
Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
KMQAH IP DTPRSIFVYFRGLFYD +NDPEGGYYARGARA+VWENFKNNPLFDISTDHP
Sbjct: 211 KMQAHLIPADTPRSIFVYFRGLFYDTSNDPEGGYYARGARASVWENFKNNPLFDISTDHP 270
Query: 260 TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV 319
TYYEDMQR++FCLCPLGWAPWSPRLVEAVVFGCIPVI+ADDIVLPFADAIPWE+IG+FV
Sbjct: 271 ATYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIVADDIVLPFADAIPWEDIGVFV 330
Query: 320 AEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
AEEDVP+LDTILTSIP +V+LRKQRLLANPSMK+AMLFPQPAQ GDAFHQILNGLARKLP
Sbjct: 331 AEEDVPRLDTILTSIPTDVVLRKQRLLANPSMKQAMLFPQPAQAGDAFHQILNGLARKLP 390
Query: 380 HDKSTYLKPGG-KFLNWTSGPVGDLKPW 406
H +LKPG + LNWT+GP GDLKPW
Sbjct: 391 HGDGVFLKPGDERVLNWTAGPPGDLKPW 418
>gi|226503833|ref|NP_001149319.1| secondary cell wall-related glycosyltransferase family 47 precursor
[Zea mays]
gi|195626366|gb|ACG35013.1| secondary cell wall-related glycosyltransferase family 47 [Zea
mays]
Length = 421
Score = 719 bits (1856), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 338/387 (87%), Positives = 364/387 (94%)
Query: 20 KIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLS 79
+ ERISGSAGDVLED+PVG+LKVYVY+LPSKYNKKL+ KDPRCL HMFAAEIFMHRFLLS
Sbjct: 35 QTERISGSAGDVLEDNPVGRLKVYVYDLPSKYNKKLVKKDPRCLNHMFAAEIFMHRFLLS 94
Query: 80 SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGAD 139
S VRT NPEEADWFYTP+Y TCDLTP+GLPLPFKSPRMMRSAI+LI++NWPYWNR+EGAD
Sbjct: 95 SAVRTFNPEEADWFYTPVYATCDLTPSGLPLPFKSPRMMRSAIELIATNWPYWNRSEGAD 154
Query: 140 HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQ 199
HFFV PHDFGACFHYQEEKAI RGILPLLQRATLVQTFGQ+NHVCL GSI IPP+APPQ
Sbjct: 155 HFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLKGGSIXIPPFAPPQ 214
Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
KMQAH IP DTPRSIFVYFRGLFYD +NDPEGGYYARGARA+VWENFKNNPLFDISTDHP
Sbjct: 215 KMQAHLIPLDTPRSIFVYFRGLFYDTSNDPEGGYYARGARASVWENFKNNPLFDISTDHP 274
Query: 260 TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV 319
TYYEDMQR++FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG+FV
Sbjct: 275 PTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFV 334
Query: 320 AEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
AEEDVPKLD+ILTSIP +VILRKQRLLANP+MK+AMLFPQPAQ GDAFHQILNGLARKLP
Sbjct: 335 AEEDVPKLDSILTSIPTDVILRKQRLLANPAMKQAMLFPQPAQAGDAFHQILNGLARKLP 394
Query: 380 HDKSTYLKPGGKFLNWTSGPVGDLKPW 406
H + +LKPG + LNWT+GP GDLKPW
Sbjct: 395 HGDNVFLKPGERVLNWTAGPPGDLKPW 421
>gi|326509565|dbj|BAJ86998.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326509967|dbj|BAJ87200.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 415
Score = 718 bits (1854), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 338/385 (87%), Positives = 360/385 (93%)
Query: 22 ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP 81
ERI GSAGDVLEDDPVG+LKVYVYELP+KYNKK++ KD RCL+HMFAAEIFMHRFLLSS
Sbjct: 31 ERIEGSAGDVLEDDPVGRLKVYVYELPTKYNKKMVAKDSRCLSHMFAAEIFMHRFLLSSA 90
Query: 82 VRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 141
+RT+NPEEADWFYTP+Y TCDLTP G PLPFKSPR+MRSA+Q ISS+WPYWNRT GADHF
Sbjct: 91 IRTMNPEEADWFYTPVYTTCDLTPWGHPLPFKSPRIMRSAVQFISSHWPYWNRTAGADHF 150
Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
FVVPHDFGACFHYQEEKAIERGILPLL+RATLVQTFGQ++HVCL EGSI IPPYAPPQKM
Sbjct: 151 FVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQKDHVCLKEGSINIPPYAPPQKM 210
Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
+ H +PP+TPRSIFVYFRGLFYD NDPEGGYYARGARA+VWENFKNNPLFDISTDHP T
Sbjct: 211 KTHLVPPETPRSIFVYFRGLFYDTANDPEGGYYARGARASVWENFKNNPLFDISTDHPPT 270
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW+EIG+FVAE
Sbjct: 271 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWDEIGVFVAE 330
Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHD 381
+DVPKLDTILTSIP EVILRKQRLLANPSMK+AMLFPQPAQ GDAFHQILNGLARKLPH
Sbjct: 331 DDVPKLDTILTSIPMEVILRKQRLLANPSMKQAMLFPQPAQAGDAFHQILNGLARKLPHS 390
Query: 382 KSTYLKPGGKFLNWTSGPVGDLKPW 406
KS YL PG K LNWT GP GDLKPW
Sbjct: 391 KSVYLTPGQKALNWTQGPKGDLKPW 415
>gi|414878899|tpg|DAA56030.1| TPA: secondary cell wall glycosyltransferase family 47 isoform 1
[Zea mays]
gi|414878900|tpg|DAA56031.1| TPA: secondary cell wall glycosyltransferase family 47 isoform 2
[Zea mays]
Length = 419
Score = 718 bits (1853), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 340/404 (84%), Positives = 366/404 (90%), Gaps = 2/404 (0%)
Query: 5 AIALLL--AFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRC 62
A++LL V+ ERI GSAGDVLEDDPVG+LKVYVYELP+KYNKK++ KD RC
Sbjct: 16 AVSLLACSGAVAVARAQDTERIEGSAGDVLEDDPVGRLKVYVYELPTKYNKKMVAKDSRC 75
Query: 63 LTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAI 122
L+HMFAAEIFMHRFLLSS +RTLNPE ADWFYTP+Y TCDLTP G PLPFKSPR+MRSAI
Sbjct: 76 LSHMFAAEIFMHRFLLSSAIRTLNPEVADWFYTPVYTTCDLTPWGHPLPFKSPRIMRSAI 135
Query: 123 QLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNH 182
Q IS+ WPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERG+LPLL+RATLVQTFGQ++H
Sbjct: 136 QFISNRWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGVLPLLRRATLVQTFGQKDH 195
Query: 183 VCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAV 242
VCL EGSITIPPYAPPQKM+ H +PP TPRSIFVYFRGLFYD NDPEGGYYARGARA+V
Sbjct: 196 VCLKEGSITIPPYAPPQKMKTHLVPPGTPRSIFVYFRGLFYDTANDPEGGYYARGARASV 255
Query: 243 WENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 302
WENFKNNPLFDISTDHP TYYEDMQRA+FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI
Sbjct: 256 WENFKNNPLFDISTDHPPTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 315
Query: 303 VLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQ 362
VLPFADAIPWEEIG+FVAE+DVP+LDTILTSIP EVILRKQRLLANPSMK+AMLFPQPAQ
Sbjct: 316 VLPFADAIPWEEIGVFVAEDDVPRLDTILTSIPMEVILRKQRLLANPSMKQAMLFPQPAQ 375
Query: 363 PGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
GDAFHQILNGLARKLPH + +LKPG K LNWT GP GDLKPW
Sbjct: 376 AGDAFHQILNGLARKLPHGEDVFLKPGQKVLNWTEGPRGDLKPW 419
>gi|226499284|ref|NP_001142343.1| uncharacterized protein LOC100274514 precursor [Zea mays]
gi|194708308|gb|ACF88238.1| unknown [Zea mays]
Length = 419
Score = 717 bits (1850), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 339/404 (83%), Positives = 366/404 (90%), Gaps = 2/404 (0%)
Query: 5 AIALLL--AFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRC 62
A++LL V+ ERI GSAGDVLEDDPVG+LKVYVYELP+KYNKK++ KD RC
Sbjct: 16 AVSLLACSGAVAVARAQDTERIEGSAGDVLEDDPVGRLKVYVYELPTKYNKKMVAKDSRC 75
Query: 63 LTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAI 122
L+HMFAAEIFMHRFLLSS +RTLNPE ADWFYTP+Y TCDLTP G PLPF+SPR+MRSAI
Sbjct: 76 LSHMFAAEIFMHRFLLSSAIRTLNPEVADWFYTPVYTTCDLTPWGHPLPFRSPRIMRSAI 135
Query: 123 QLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNH 182
Q IS+ WPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERG+LPLL+RATLVQTFGQ++H
Sbjct: 136 QFISNRWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGVLPLLRRATLVQTFGQKDH 195
Query: 183 VCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAV 242
VCL EGSITIPPYAPPQKM+ H +PP TPRSIFVYFRGLFYD NDPEGGYYARGARA+V
Sbjct: 196 VCLKEGSITIPPYAPPQKMKTHLVPPGTPRSIFVYFRGLFYDTANDPEGGYYARGARASV 255
Query: 243 WENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 302
WENFKNNPLFDISTDHP TYYEDMQRA+FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI
Sbjct: 256 WENFKNNPLFDISTDHPPTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 315
Query: 303 VLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQ 362
VLPFADAIPWEEIG+FVAE+DVP+LDTILTSIP EVILRKQRLLANPSMK+AMLFPQPAQ
Sbjct: 316 VLPFADAIPWEEIGVFVAEDDVPRLDTILTSIPMEVILRKQRLLANPSMKQAMLFPQPAQ 375
Query: 363 PGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
GDAFHQILNGLARKLPH + +LKPG K LNWT GP GDLKPW
Sbjct: 376 AGDAFHQILNGLARKLPHGEDVFLKPGQKVLNWTEGPRGDLKPW 419
>gi|357126610|ref|XP_003564980.1| PREDICTED: probable glucuronosyltransferase Os01g0926700-like
[Brachypodium distachyon]
Length = 418
Score = 716 bits (1848), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 335/387 (86%), Positives = 364/387 (94%)
Query: 20 KIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLS 79
+ ERISGSAGDVL+D+PVG+LKVY+Y+LP KYNKKLL KDPRCL HMFAAEIFMHRFLLS
Sbjct: 32 QTERISGSAGDVLDDNPVGRLKVYIYDLPGKYNKKLLKKDPRCLNHMFAAEIFMHRFLLS 91
Query: 80 SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGAD 139
S VRT NPEEADWFYTP+Y TCDLTP+GLPLPFKSPRM+RSAIQLI+ WPYWNR+EGAD
Sbjct: 92 SAVRTFNPEEADWFYTPVYATCDLTPSGLPLPFKSPRMVRSAIQLIAEKWPYWNRSEGAD 151
Query: 140 HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQ 199
HFFV PHDFGACFHYQEEKAI RGILPLLQRATLVQTFGQ+NHVCL +GSITIPPYAPPQ
Sbjct: 152 HFFVAPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLKDGSITIPPYAPPQ 211
Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
KMQ H IP DTPRSIFVYFRGLFYD +NDPEGGYYARGARA+VWENFKNNPLFDISTDHP
Sbjct: 212 KMQNHLIPGDTPRSIFVYFRGLFYDTSNDPEGGYYARGARASVWENFKNNPLFDISTDHP 271
Query: 260 TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV 319
TYYEDMQR++FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE+IG+FV
Sbjct: 272 PTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEDIGVFV 331
Query: 320 AEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
+E+DVPKLD+ILTSIP +VILRKQRLLANPSMK+AMLFPQPA+ GDAFHQILNGLARKLP
Sbjct: 332 SEDDVPKLDSILTSIPTDVILRKQRLLANPSMKQAMLFPQPAEAGDAFHQILNGLARKLP 391
Query: 380 HDKSTYLKPGGKFLNWTSGPVGDLKPW 406
H ++ +LKPG K LNWT+GPVGDLKPW
Sbjct: 392 HGENVFLKPGQKVLNWTAGPVGDLKPW 418
>gi|326488903|dbj|BAJ98063.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 421
Score = 716 bits (1847), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 336/387 (86%), Positives = 362/387 (93%)
Query: 20 KIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLS 79
+ ERISGSAGDVL+DDPVG+LKV++Y+LP KYNKKLL KDPRCL HMFAAEIFMHRFLLS
Sbjct: 35 QTERISGSAGDVLDDDPVGRLKVFIYDLPGKYNKKLLKKDPRCLNHMFAAEIFMHRFLLS 94
Query: 80 SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGAD 139
S VRT NPEEADWFYTP+YPTCDLTP+GLPLPFKSPRMMRSAI+LI++ WPYWNR+EGAD
Sbjct: 95 SAVRTTNPEEADWFYTPVYPTCDLTPSGLPLPFKSPRMMRSAIELIATKWPYWNRSEGAD 154
Query: 140 HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQ 199
HFFV PHDFGACFHYQEEKAI RGILPLLQRATLVQTFGQ+NHVCL +GSITIPP+APPQ
Sbjct: 155 HFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLKDGSITIPPFAPPQ 214
Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
KMQ H IP +TPRSIFVYFRGLFYD NDPEGGYYARGARA+VWENFKNNPLFDISTDHP
Sbjct: 215 KMQNHLIPGETPRSIFVYFRGLFYDTGNDPEGGYYARGARASVWENFKNNPLFDISTDHP 274
Query: 260 TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV 319
TYYEDMQR++FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG+FV
Sbjct: 275 PTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFV 334
Query: 320 AEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
EEDVP+LD+ILTSIP E ILRKQRLLANPSMK+AMLFPQPAQ GDAFHQILNGLARKLP
Sbjct: 335 PEEDVPRLDSILTSIPTEDILRKQRLLANPSMKQAMLFPQPAQAGDAFHQILNGLARKLP 394
Query: 380 HDKSTYLKPGGKFLNWTSGPVGDLKPW 406
H +S +LKPG LNWT+GPVGDLKPW
Sbjct: 395 HGESVFLKPGQARLNWTAGPVGDLKPW 421
>gi|242061766|ref|XP_002452172.1| hypothetical protein SORBIDRAFT_04g021180 [Sorghum bicolor]
gi|241932003|gb|EES05148.1| hypothetical protein SORBIDRAFT_04g021180 [Sorghum bicolor]
Length = 432
Score = 716 bits (1847), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 333/385 (86%), Positives = 362/385 (94%)
Query: 22 ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP 81
ERI+GSAGDVLED+P+G+LKV++Y+LPSKYNK+++ KDPRCL HMFAAEIFMHRFLLSS
Sbjct: 48 ERIAGSAGDVLEDNPIGRLKVFIYDLPSKYNKRIVTKDPRCLHHMFAAEIFMHRFLLSSA 107
Query: 82 VRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 141
VRTLNPEEADWFYTP+Y TCDLTP GLPLPFKSPRMMRSAI+ IS+ WP+WNRT+GADHF
Sbjct: 108 VRTLNPEEADWFYTPVYTTCDLTPAGLPLPFKSPRMMRSAIKFISNKWPFWNRTDGADHF 167
Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
FVVPHDFGACFHYQEEKA ERGILP+L+RATLVQTFGQ+NHVCL EGSI IPPYAPPQKM
Sbjct: 168 FVVPHDFGACFHYQEEKATERGILPMLRRATLVQTFGQKNHVCLKEGSIIIPPYAPPQKM 227
Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
QAH +PPDTPRSIFVYFRGLFYD NDPEGGYYARGARA++WENFKNNPLFDISTDHP T
Sbjct: 228 QAHLVPPDTPRSIFVYFRGLFYDNGNDPEGGYYARGARASLWENFKNNPLFDISTDHPVT 287
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
YYEDMQRA+FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE+IG+FV E
Sbjct: 288 YYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEDIGVFVDE 347
Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHD 381
EDVPKLD+ILTSIP E ILRKQRLLANPSMK+AMLFPQPAQP DAFHQILNGLARKLPH
Sbjct: 348 EDVPKLDSILTSIPIENILRKQRLLANPSMKKAMLFPQPAQPRDAFHQILNGLARKLPHM 407
Query: 382 KSTYLKPGGKFLNWTSGPVGDLKPW 406
+S YL+PG K LNWT+GP GDLKPW
Sbjct: 408 QSVYLQPGEKHLNWTAGPAGDLKPW 432
>gi|301072486|gb|ADK56173.1| glycosyltransferase 47 [Triticum aestivum]
Length = 422
Score = 715 bits (1845), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 336/387 (86%), Positives = 361/387 (93%)
Query: 20 KIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLS 79
+ ERISGSAGDVL+DDPVG+LKV++Y+LP KYNKKLL KDPRCL HMFAAEIFMHRFLLS
Sbjct: 36 QTERISGSAGDVLDDDPVGRLKVFIYDLPGKYNKKLLKKDPRCLNHMFAAEIFMHRFLLS 95
Query: 80 SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGAD 139
S VRT NPEEADWFYTP+YPTCDLTP+GLPLPFKSPRMMRSAI+LI++ WPYWNR+EGAD
Sbjct: 96 SAVRTTNPEEADWFYTPVYPTCDLTPSGLPLPFKSPRMMRSAIELIATKWPYWNRSEGAD 155
Query: 140 HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQ 199
HFFV PHDFGACFHYQEEKAI RGILPLLQRATLVQTFGQ+NHVCL EGSITIPP+APPQ
Sbjct: 156 HFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLKEGSITIPPFAPPQ 215
Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
KMQ H IP +TPRSIFVYFRGLFYD NDPEGGYYARGARA+VWENFKNNPLFDISTDHP
Sbjct: 216 KMQNHLIPGETPRSIFVYFRGLFYDTGNDPEGGYYARGARASVWENFKNNPLFDISTDHP 275
Query: 260 TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV 319
TYYEDMQR++FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG+FV
Sbjct: 276 PTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFV 335
Query: 320 AEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
EEDVP+LD+ILTSIP E ILRKQRLLANPSMK+AMLFPQPAQ GDAFHQILNGLARKLP
Sbjct: 336 PEEDVPRLDSILTSIPTEDILRKQRLLANPSMKQAMLFPQPAQAGDAFHQILNGLARKLP 395
Query: 380 HDKSTYLKPGGKFLNWTSGPVGDLKPW 406
H + +LKPG LNWT+GPVGDLKPW
Sbjct: 396 HGEDVFLKPGQARLNWTAGPVGDLKPW 422
>gi|212275360|ref|NP_001130059.1| uncharacterized protein LOC100191151 precursor [Zea mays]
gi|194688192|gb|ACF78180.1| unknown [Zea mays]
Length = 418
Score = 714 bits (1844), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 335/388 (86%), Positives = 364/388 (93%), Gaps = 1/388 (0%)
Query: 20 KIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLS 79
+ ERISGSAGDVLED+PVG+LKVYVY+LPSKYNKKLL KDPRCL+HMFAAEIFMHRFLLS
Sbjct: 31 QTERISGSAGDVLEDNPVGRLKVYVYDLPSKYNKKLLKKDPRCLSHMFAAEIFMHRFLLS 90
Query: 80 SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGAD 139
S VRT NPEEADWFYTP+Y TCDLTP+GLPLPFKSPRMMRSAI+ +++NWPYWNR+EGAD
Sbjct: 91 SAVRTFNPEEADWFYTPVYTTCDLTPSGLPLPFKSPRMMRSAIERVATNWPYWNRSEGAD 150
Query: 140 HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQ 199
HFFV PHDFGACFHYQEEKAI RGI PLLQRATLVQTFGQ+NHVCL +GSITIPP+APPQ
Sbjct: 151 HFFVTPHDFGACFHYQEEKAIGRGIPPLLQRATLVQTFGQKNHVCLKDGSITIPPFAPPQ 210
Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
KMQAH IP DTPRSIFVYFRGLFYD +NDPEGGYYARGARA+VWENFKNNPLFDISTDHP
Sbjct: 211 KMQAHLIPADTPRSIFVYFRGLFYDTSNDPEGGYYARGARASVWENFKNNPLFDISTDHP 270
Query: 260 TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV 319
TYYEDMQR++FCLCPLGWAPWSPRLVEAVVFGCIPVI+ADDIVLPFADAIPWE+IG+FV
Sbjct: 271 ATYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIVADDIVLPFADAIPWEDIGVFV 330
Query: 320 AEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
AEEDVP+LDTILTSIP +V+LRKQRLLANPSMK+AMLFPQPAQ GDAFHQILNGLARKLP
Sbjct: 331 AEEDVPRLDTILTSIPTDVVLRKQRLLANPSMKQAMLFPQPAQAGDAFHQILNGLARKLP 390
Query: 380 HDKSTYLKPGG-KFLNWTSGPVGDLKPW 406
H +LKPG + LNWT+GP GDLKPW
Sbjct: 391 HGDGVFLKPGDERVLNWTAGPPGDLKPW 418
>gi|150246869|emb|CAM91796.1| putative glycosyltransferase family 47 [Triticum aestivum]
Length = 420
Score = 714 bits (1843), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 336/385 (87%), Positives = 360/385 (93%)
Query: 22 ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP 81
ERISGSAGDVL+DDPVG+LKV++Y+LP KYNKKLL KDPRCL HMFAAEIFMHRFLLSS
Sbjct: 36 ERISGSAGDVLDDDPVGRLKVFIYDLPGKYNKKLLKKDPRCLNHMFAAEIFMHRFLLSSA 95
Query: 82 VRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 141
VRT NPEEADWFYTP+YPTCDLTP+GLPLPFKSPRMMRSAI+LI++ WPYWNR+EGADHF
Sbjct: 96 VRTTNPEEADWFYTPVYPTCDLTPSGLPLPFKSPRMMRSAIELIATKWPYWNRSEGADHF 155
Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
FV PHDFGACFHYQEEKAI RGILPLLQRATLVQTFGQ+NHVCL EGSITIPP+APPQKM
Sbjct: 156 FVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLKEGSITIPPFAPPQKM 215
Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
Q H IP +TPRSIFVYFRGLFYD NDPEGGYYARGARA+VWENFKNNPLFDISTDHP T
Sbjct: 216 QNHLIPGETPRSIFVYFRGLFYDTGNDPEGGYYARGARASVWENFKNNPLFDISTDHPPT 275
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
YYEDMQR++FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG+FV E
Sbjct: 276 YYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVPE 335
Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHD 381
EDVP+LD+ILTSIP E ILRKQRLLANPSMK+AMLFPQPAQ GDAFHQILNGLARKLPH
Sbjct: 336 EDVPRLDSILTSIPTEDILRKQRLLANPSMKQAMLFPQPAQAGDAFHQILNGLARKLPHG 395
Query: 382 KSTYLKPGGKFLNWTSGPVGDLKPW 406
+ +LKPG LNWT+GPVGDLKPW
Sbjct: 396 EDVFLKPGQVRLNWTAGPVGDLKPW 420
>gi|115458146|ref|NP_001052673.1| Os04g0398600 [Oryza sativa Japonica Group]
gi|75143148|sp|Q7XLG3.2|GT42_ORYSJ RecName: Full=Probable glucuronosyltransferase Os04g0398600
gi|38347167|emb|CAE05157.2| OSJNBa0039C07.13 [Oryza sativa Japonica Group]
gi|113564244|dbj|BAF14587.1| Os04g0398600 [Oryza sativa Japonica Group]
gi|116309465|emb|CAH66536.1| H0209H04.3 [Oryza sativa Indica Group]
gi|125548129|gb|EAY93951.1| hypothetical protein OsI_15724 [Oryza sativa Indica Group]
gi|125590243|gb|EAZ30593.1| hypothetical protein OsJ_14642 [Oryza sativa Japonica Group]
gi|215706942|dbj|BAG93402.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215741563|dbj|BAG98058.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 420
Score = 711 bits (1835), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 336/385 (87%), Positives = 359/385 (93%)
Query: 22 ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP 81
ERISGSAGDVLED+PVG+LKV++Y+LP KYNKK++ KDPRCL HMFAAEIFMHRFLLSS
Sbjct: 36 ERISGSAGDVLEDNPVGRLKVFIYDLPRKYNKKMVNKDPRCLNHMFAAEIFMHRFLLSSA 95
Query: 82 VRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 141
VRTLNP+EADWFYTP+Y TCDLTP GLPLPFKSPR+MRSAIQ IS WP+WNRT+GADHF
Sbjct: 96 VRTLNPKEADWFYTPVYTTCDLTPAGLPLPFKSPRVMRSAIQYISHKWPFWNRTDGADHF 155
Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQ NHVCL EGSITIPPYAPPQKM
Sbjct: 156 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQENHVCLKEGSITIPPYAPPQKM 215
Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
QAH IPPDTPRSIFVYFRGLFYD NDPEGGYYARGARA++WENFKNNPLFDISTDHP T
Sbjct: 216 QAHLIPPDTPRSIFVYFRGLFYDTGNDPEGGYYARGARASLWENFKNNPLFDISTDHPPT 275
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
YYEDMQRA+FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG+FV E
Sbjct: 276 YYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVEE 335
Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHD 381
+DVPKLDTILTS+P + ILRKQRLLANPSMK+AMLFPQPAQP DAFHQILNGLARKLPH
Sbjct: 336 KDVPKLDTILTSMPIDDILRKQRLLANPSMKQAMLFPQPAQPRDAFHQILNGLARKLPHP 395
Query: 382 KSTYLKPGGKFLNWTSGPVGDLKPW 406
+ YL+P K LNWT+GPVGDLK W
Sbjct: 396 EGVYLQPSDKRLNWTAGPVGDLKAW 420
>gi|125582313|gb|EAZ23244.1| hypothetical protein OsJ_06939 [Oryza sativa Japonica Group]
Length = 434
Score = 711 bits (1835), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 334/385 (86%), Positives = 360/385 (93%)
Query: 22 ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP 81
ERISGSAGDVLED+PVG+LKV+VY+LPSKYNK+++ KDPRCL HMFAAEIFMHRFLLSS
Sbjct: 49 ERISGSAGDVLEDNPVGRLKVFVYDLPSKYNKRIVAKDPRCLNHMFAAEIFMHRFLLSSA 108
Query: 82 VRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 141
VRTLNPE+ADWFY P+Y TCDLT GLPLPFKSPRMMRSAIQ +S WP+WNRT+GADHF
Sbjct: 109 VRTLNPEQADWFYAPVYTTCDLTHAGLPLPFKSPRMMRSAIQFLSRKWPFWNRTDGADHF 168
Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
FVVPHDFGACFHYQEEKAIERGILPLL+RATLVQTFGQ+NHVCL EGSITIPPYAPPQKM
Sbjct: 169 FVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQKNHVCLKEGSITIPPYAPPQKM 228
Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
QAH IPPDTPRSIFVYFRGLFYD NDPEGGYYARGARA++WENFKNNPLFDIST+HP T
Sbjct: 229 QAHLIPPDTPRSIFVYFRGLFYDNGNDPEGGYYARGARASLWENFKNNPLFDISTEHPAT 288
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
YYEDMQR++FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW+EIG+FV E
Sbjct: 289 YYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWDEIGVFVDE 348
Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHD 381
EDVP+LD+ILTSIP + ILRKQRLLANPSMK+AMLFPQPAQP DAFHQILNGLARKLPH
Sbjct: 349 EDVPRLDSILTSIPIDDILRKQRLLANPSMKQAMLFPQPAQPRDAFHQILNGLARKLPHP 408
Query: 382 KSTYLKPGGKFLNWTSGPVGDLKPW 406
S YLKPG K LNWT+GPV DLKPW
Sbjct: 409 DSVYLKPGEKHLNWTAGPVADLKPW 433
>gi|75122905|sp|Q6H4N0.1|GT21_ORYSJ RecName: Full=Probable glucuronosyltransferase Os02g0520750
gi|49389081|dbj|BAD26319.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
gi|215769122|dbj|BAH01351.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 434
Score = 710 bits (1833), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 334/385 (86%), Positives = 360/385 (93%)
Query: 22 ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP 81
ERISGSAGDVLED+PVG+LKV+VY+LPSKYNK+++ KDPRCL HMFAAEIFMHRFLLSS
Sbjct: 49 ERISGSAGDVLEDNPVGRLKVFVYDLPSKYNKRIVAKDPRCLNHMFAAEIFMHRFLLSSA 108
Query: 82 VRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 141
VRTLNPE+ADWFY P+Y TCDLT GLPLPFKSPRMMRSAIQ +S WP+WNRT+GADHF
Sbjct: 109 VRTLNPEQADWFYAPVYTTCDLTHAGLPLPFKSPRMMRSAIQFLSRKWPFWNRTDGADHF 168
Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
FVVPHDFGACFHYQEEKAIERGILPLL+RATLVQTFGQ+NHVCL EGSITIPPYAPPQKM
Sbjct: 169 FVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQKNHVCLKEGSITIPPYAPPQKM 228
Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
QAH IPPDTPRSIFVYFRGLFYD NDPEGGYYARGARA++WENFKNNPLFDIST+HP T
Sbjct: 229 QAHLIPPDTPRSIFVYFRGLFYDNGNDPEGGYYARGARASLWENFKNNPLFDISTEHPAT 288
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
YYEDMQR++FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW+EIG+FV E
Sbjct: 289 YYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWDEIGVFVDE 348
Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHD 381
EDVP+LD+ILTSIP + ILRKQRLLANPSMK+AMLFPQPAQP DAFHQILNGLARKLPH
Sbjct: 349 EDVPRLDSILTSIPIDDILRKQRLLANPSMKQAMLFPQPAQPRDAFHQILNGLARKLPHP 408
Query: 382 KSTYLKPGGKFLNWTSGPVGDLKPW 406
S YLKPG K LNWT+GPV DLKPW
Sbjct: 409 DSVYLKPGEKHLNWTAGPVADLKPW 433
>gi|302823891|ref|XP_002993593.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300138521|gb|EFJ05285.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 437
Score = 709 bits (1829), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 336/384 (87%), Positives = 354/384 (92%)
Query: 22 ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP 81
ERISGSAGDVLEDDPVG+LKV+VYELP KYN+K+L KDPRCLTHMFA EIF+H FLL S
Sbjct: 25 ERISGSAGDVLEDDPVGRLKVFVYELPVKYNRKVLEKDPRCLTHMFATEIFVHEFLLGSA 84
Query: 82 VRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 141
VRTLNPEEADWFYTP+Y TCDLTP GLPLPFKSPRMMRS IQ IS+ WPYWNRTEGADHF
Sbjct: 85 VRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSVIQYISNQWPYWNRTEGADHF 144
Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
FVVPHDFGACFHYQEEKAIERGIL LLQRATLVQTFGQR HVCL EGSI +PPY PPQKM
Sbjct: 145 FVVPHDFGACFHYQEEKAIERGILHLLQRATLVQTFGQRYHVCLKEGSIVVPPYCPPQKM 204
Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
QAH IPP PRSIFVYFRGLFYD NDPEGGYYARGARAAVWENFK+NPLFDIST+HP T
Sbjct: 205 QAHLIPPSIPRSIFVYFRGLFYDYGNDPEGGYYARGARAAVWENFKDNPLFDISTEHPIT 264
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
YYEDMQRAIFCLCPLGWAPWSPRLVE V+FGCIPVIIADDIVLPFADAIPWEEIG+FVAE
Sbjct: 265 YYEDMQRAIFCLCPLGWAPWSPRLVEGVIFGCIPVIIADDIVLPFADAIPWEEIGVFVAE 324
Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHD 381
+DVPKLDTILTSIPPEVIL+KQRLLA P+MK+AMLFPQPAQPGDAFHQILNGLARKLPHD
Sbjct: 325 KDVPKLDTILTSIPPEVILKKQRLLATPAMKQAMLFPQPAQPGDAFHQILNGLARKLPHD 384
Query: 382 KSTYLKPGGKFLNWTSGPVGDLKP 405
+ YL PG K LNW+SGP GDLKP
Sbjct: 385 RRIYLNPGEKALNWSSGPPGDLKP 408
>gi|357163140|ref|XP_003579636.1| PREDICTED: probable glucuronosyltransferase Os04g0398600-like
[Brachypodium distachyon]
Length = 429
Score = 707 bits (1824), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 332/385 (86%), Positives = 358/385 (92%)
Query: 22 ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP 81
ERISGSAGDVLED+PVG+LKV++Y+LP KYNKK++ KDPRCL HMFAAEIFMHRFLLSS
Sbjct: 45 ERISGSAGDVLEDNPVGRLKVFIYDLPRKYNKKMVAKDPRCLNHMFAAEIFMHRFLLSSA 104
Query: 82 VRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 141
VRTLNP+EADWFY P+Y TCDLTP GLPLPFKSPR+MRSAIQ IS+ WP+WN+T+GADHF
Sbjct: 105 VRTLNPKEADWFYAPVYTTCDLTPAGLPLPFKSPRVMRSAIQYISNKWPFWNKTDGADHF 164
Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQ NHVCL EGSI IPP+APPQKM
Sbjct: 165 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQENHVCLKEGSIIIPPFAPPQKM 224
Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
QAH IPPDTPRSIFVYFRGLFYD NDPEGGYYARGARA++WENFKNNPLFDIST+HP T
Sbjct: 225 QAHLIPPDTPRSIFVYFRGLFYDTGNDPEGGYYARGARASLWENFKNNPLFDISTEHPAT 284
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
YYEDMQRA+FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG+F+ E
Sbjct: 285 YYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFIEE 344
Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHD 381
+DVPKLDTILTS+P E ILRKQRLLANPSMK+AMLFPQPAQ DAFHQILNGLARKLPH
Sbjct: 345 KDVPKLDTILTSMPIEDILRKQRLLANPSMKQAMLFPQPAQARDAFHQILNGLARKLPHP 404
Query: 382 KSTYLKPGGKFLNWTSGPVGDLKPW 406
+ YL PG K LNWT+GPVGDLKPW
Sbjct: 405 EGVYLPPGEKHLNWTAGPVGDLKPW 429
>gi|302783378|ref|XP_002973462.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300159215|gb|EFJ25836.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 437
Score = 706 bits (1822), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 335/384 (87%), Positives = 353/384 (91%)
Query: 22 ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP 81
ERISGSAGDVLEDDPVG+LKV+VYELP KYN+K+L KDPRCLTHMFA EIF+H FLL S
Sbjct: 25 ERISGSAGDVLEDDPVGRLKVFVYELPVKYNRKVLEKDPRCLTHMFATEIFVHEFLLGSA 84
Query: 82 VRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 141
VRTLNPEEADWFYTP+Y TCDLT GLPLPFKSPRMMRS IQ IS+ WPYWNRTEGADHF
Sbjct: 85 VRTLNPEEADWFYTPVYTTCDLTRNGLPLPFKSPRMMRSVIQYISNQWPYWNRTEGADHF 144
Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
FVVPHDFGACFHYQEEKAIERGIL LLQRATLVQTFGQR HVCL EGSI +PPY PPQKM
Sbjct: 145 FVVPHDFGACFHYQEEKAIERGILHLLQRATLVQTFGQRYHVCLKEGSIVVPPYCPPQKM 204
Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
QAH IPP PRSIFVYFRGLFYD NDPEGGYYARGARAAVWENFK+NPLFDIST+HP T
Sbjct: 205 QAHLIPPSIPRSIFVYFRGLFYDYGNDPEGGYYARGARAAVWENFKDNPLFDISTEHPIT 264
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
YYEDMQRAIFCLCPLGWAPWSPRLVE V+FGCIPVIIADDIVLPFADAIPWEEIG+FVAE
Sbjct: 265 YYEDMQRAIFCLCPLGWAPWSPRLVEGVIFGCIPVIIADDIVLPFADAIPWEEIGVFVAE 324
Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHD 381
+DVPKLDTILTSIPPEVIL+KQRLLA P+MK+AMLFPQPAQPGDAFHQILNGLARKLPHD
Sbjct: 325 KDVPKLDTILTSIPPEVILKKQRLLATPAMKQAMLFPQPAQPGDAFHQILNGLARKLPHD 384
Query: 382 KSTYLKPGGKFLNWTSGPVGDLKP 405
+ YL PG K LNW+SGP GDLKP
Sbjct: 385 RRIYLNPGEKALNWSSGPPGDLKP 408
>gi|242072854|ref|XP_002446363.1| hypothetical protein SORBIDRAFT_06g014730 [Sorghum bicolor]
gi|241937546|gb|EES10691.1| hypothetical protein SORBIDRAFT_06g014730 [Sorghum bicolor]
Length = 430
Score = 705 bits (1819), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 331/385 (85%), Positives = 360/385 (93%)
Query: 22 ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP 81
ERISGSAGDVLED+PVG+LKV++Y+LP KYNKK++ KDPRCL+HMFAAEIFMHRFLLSS
Sbjct: 46 ERISGSAGDVLEDNPVGRLKVFIYDLPRKYNKKMVTKDPRCLSHMFAAEIFMHRFLLSSA 105
Query: 82 VRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 141
VRTLNP+EADWFYTP+Y TCDLT GLPLPFKSPR+MRSAIQ IS+ WP+WNRT+GADHF
Sbjct: 106 VRTLNPKEADWFYTPVYTTCDLTNAGLPLPFKSPRVMRSAIQYISNKWPFWNRTDGADHF 165
Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
FVVPHDF ACFHYQEEKAIERGILPLL+RATLVQTFGQ NHVCL EGSI IPP+APPQKM
Sbjct: 166 FVVPHDFAACFHYQEEKAIERGILPLLRRATLVQTFGQENHVCLKEGSIIIPPFAPPQKM 225
Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
QAH I PDTPRSIFVYFRGLFYD NDPEGGYYARGARA++WENFK+NPLFDISTDHP T
Sbjct: 226 QAHLISPDTPRSIFVYFRGLFYDTGNDPEGGYYARGARASLWENFKSNPLFDISTDHPAT 285
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
YYEDMQRA+FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG+FV E
Sbjct: 286 YYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVEE 345
Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHD 381
+DVPKLDTILTS+P + ILRKQRLLANPSMK+AMLFPQPAQP DAFHQILNGLARKLPH
Sbjct: 346 KDVPKLDTILTSMPIDDILRKQRLLANPSMKQAMLFPQPAQPRDAFHQILNGLARKLPHP 405
Query: 382 KSTYLKPGGKFLNWTSGPVGDLKPW 406
+ TYL+PG + LNWT+GPVGDLKPW
Sbjct: 406 EGTYLQPGDQHLNWTAGPVGDLKPW 430
>gi|357149222|ref|XP_003575040.1| PREDICTED: probable glucuronosyltransferase Os02g0520750-like
[Brachypodium distachyon]
Length = 428
Score = 704 bits (1818), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 331/391 (84%), Positives = 359/391 (91%)
Query: 16 DNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHR 75
D ERISGSAGDVLED+PVGKLKV+VY+LPSKYNK ++ KDPRCL+HMFAAEIFMHR
Sbjct: 38 DAQRNTERISGSAGDVLEDNPVGKLKVFVYDLPSKYNKMIVTKDPRCLSHMFAAEIFMHR 97
Query: 76 FLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRT 135
FL SS VRT+NPEEADWFYTP+Y TCDLT GLPLPFKSPRMMRSAIQ IS+ WP+WNRT
Sbjct: 98 FLFSSAVRTVNPEEADWFYTPVYTTCDLTRAGLPLPFKSPRMMRSAIQFISNKWPFWNRT 157
Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
+G DHFFVVPHDF ACFHYQEE AI RGILPLL+ ATLVQTFGQ+NHVCL +GSITIPPY
Sbjct: 158 DGGDHFFVVPHDFAACFHYQEENAIARGILPLLRHATLVQTFGQKNHVCLKDGSITIPPY 217
Query: 196 APPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDIS 255
APPQKMQAH IPPDTPRSIFVYFRGLFYD NDPEGGYYARGARA++WENFKNNPLFDIS
Sbjct: 218 APPQKMQAHLIPPDTPRSIFVYFRGLFYDNGNDPEGGYYARGARASLWENFKNNPLFDIS 277
Query: 256 TDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEI 315
T+HP TYYEDMQR++FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW+EI
Sbjct: 278 TEHPATYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWDEI 337
Query: 316 GMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLA 375
G+FV EEDVPKLD+ILTSIP + ILRKQRLLANPSMK+AMLFPQPAQP DAFHQILNGLA
Sbjct: 338 GVFVDEEDVPKLDSILTSIPIDDILRKQRLLANPSMKKAMLFPQPAQPRDAFHQILNGLA 397
Query: 376 RKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
RKLPH +S YL+PG K +NWT+GPVGDLKPW
Sbjct: 398 RKLPHPESVYLQPGEKHVNWTAGPVGDLKPW 428
>gi|226499262|ref|NP_001149267.1| secondary cell wall-related glycosyltransferase family 47 precursor
[Zea mays]
gi|195625922|gb|ACG34791.1| secondary cell wall-related glycosyltransferase family 47 [Zea
mays]
Length = 427
Score = 702 bits (1812), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 330/385 (85%), Positives = 359/385 (93%)
Query: 22 ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP 81
ERISGSAGDVLED+PVG+LKV++Y+LP KYNKK++ KD RCL+HMFAAEIFMHRFLLSS
Sbjct: 43 ERISGSAGDVLEDNPVGRLKVFIYDLPRKYNKKMVTKDSRCLSHMFAAEIFMHRFLLSSA 102
Query: 82 VRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 141
VRTLNP+EADWFYTP+Y TCDLT GLPLPFKSPR+MRSAIQ IS+ WP+WNRT+GADHF
Sbjct: 103 VRTLNPKEADWFYTPVYTTCDLTNAGLPLPFKSPRVMRSAIQYISNKWPFWNRTDGADHF 162
Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
FVVPHDF ACFHYQEEKAIERGILPLL+RATLVQTFGQ NHVCL EGSI IPPYAPPQKM
Sbjct: 163 FVVPHDFAACFHYQEEKAIERGILPLLRRATLVQTFGQENHVCLKEGSIIIPPYAPPQKM 222
Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
QAH I PDTPRSIFVYFRGLFYD NDPEGGYYARGARA++WENFK+NPLFDISTDHP T
Sbjct: 223 QAHLISPDTPRSIFVYFRGLFYDTGNDPEGGYYARGARASLWENFKSNPLFDISTDHPAT 282
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
YYEDMQRA+FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG+FV E
Sbjct: 283 YYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVEE 342
Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHD 381
+DVPKLDTILTS+P + ILRKQRLLANPSMK+AMLFPQPAQP DAFHQILNGLARKLPH
Sbjct: 343 KDVPKLDTILTSMPIDDILRKQRLLANPSMKQAMLFPQPAQPRDAFHQILNGLARKLPHP 402
Query: 382 KSTYLKPGGKFLNWTSGPVGDLKPW 406
+ T+L+PG + LNWT+GPVGDLKPW
Sbjct: 403 EGTFLQPGDQRLNWTAGPVGDLKPW 427
>gi|194700342|gb|ACF84255.1| unknown [Zea mays]
gi|414587472|tpg|DAA38043.1| TPA: secondary cell wall glycosyltransferase family 47 [Zea mays]
Length = 427
Score = 702 bits (1811), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 330/385 (85%), Positives = 359/385 (93%)
Query: 22 ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP 81
ERISGSAGDVLED+PVG+LKV++Y+LP KYNKK++ KD RCL+HMFAAEIFMHRFLLSS
Sbjct: 43 ERISGSAGDVLEDNPVGRLKVFIYDLPRKYNKKMVTKDSRCLSHMFAAEIFMHRFLLSSA 102
Query: 82 VRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 141
VRTLNP+EADWFYTP+Y TCDLT GLPLPFKSPR+MRSAIQ IS+ WP+WNRT+GADHF
Sbjct: 103 VRTLNPKEADWFYTPVYTTCDLTNAGLPLPFKSPRVMRSAIQYISNKWPFWNRTDGADHF 162
Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
FVVPHDF ACFHYQEEKAIERGILPLL+RATLVQTFGQ NHVCL EGSI IPPYAPPQKM
Sbjct: 163 FVVPHDFAACFHYQEEKAIERGILPLLRRATLVQTFGQENHVCLKEGSIIIPPYAPPQKM 222
Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
QAH I PDTPRSIFVYFRGLFYD NDPEGGYYARGARA++WENFK+NPLFDISTDHP T
Sbjct: 223 QAHLISPDTPRSIFVYFRGLFYDTGNDPEGGYYARGARASLWENFKSNPLFDISTDHPAT 282
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
YYEDMQRA+FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG+FV E
Sbjct: 283 YYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVEE 342
Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHD 381
+DVPKLDTILTS+P + ILRKQRLLANPSMK+AMLFPQPAQP DAFHQILNGLARKLPH
Sbjct: 343 KDVPKLDTILTSMPIDDILRKQRLLANPSMKQAMLFPQPAQPRDAFHQILNGLARKLPHP 402
Query: 382 KSTYLKPGGKFLNWTSGPVGDLKPW 406
+ T+L+PG + LNWT+GPVGDLKPW
Sbjct: 403 EGTFLQPGDQRLNWTAGPVGDLKPW 427
>gi|296085534|emb|CBI29266.3| unnamed protein product [Vitis vinifera]
Length = 390
Score = 702 bits (1811), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 344/408 (84%), Positives = 358/408 (87%), Gaps = 24/408 (5%)
Query: 1 MWVTAIALL--LAFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLK 58
+WV + + D P ERISGSAGDVLEDDPVG+LKV+VYELPSKYNKK+L K
Sbjct: 5 IWVLVFGFVWGIGAHGLDRHPHTERISGSAGDVLEDDPVGRLKVFVYELPSKYNKKILQK 64
Query: 59 DPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMM 118
DPRCL HMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY TCDLTP GLPLPFKSPRMM
Sbjct: 65 DPRCLNHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYTTCDLTPNGLPLPFKSPRMM 124
Query: 119 RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG 178
RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG
Sbjct: 125 RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG 184
Query: 179 QRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGA 238
QRNHVCLNEGSITIPPYAPPQKMQAH IP +TPRSIFVYFRGLFYDVNNDPEGGYYARGA
Sbjct: 185 QRNHVCLNEGSITIPPYAPPQKMQAHLIPQETPRSIFVYFRGLFYDVNNDPEGGYYARGA 244
Query: 239 RAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 298
RAAVWENFK+NPLFDIST+HPTTYYEDMQRAIFCLCPL
Sbjct: 245 RAAVWENFKDNPLFDISTEHPTTYYEDMQRAIFCLCPL---------------------- 282
Query: 299 ADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFP 358
ADDIVLPFADAIPWEEIG+FVAEEDVP LDTILTSIPPEVILRKQRLLANPSMK+AMLFP
Sbjct: 283 ADDIVLPFADAIPWEEIGVFVAEEDVPNLDTILTSIPPEVILRKQRLLANPSMKQAMLFP 342
Query: 359 QPAQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
QPAQ GDAFHQILNGLARKLPH+K+ YLKPG LNWT+GPVGDLKPW
Sbjct: 343 QPAQSGDAFHQILNGLARKLPHNKNVYLKPGENALNWTAGPVGDLKPW 390
>gi|413932967|gb|AFW67518.1| hypothetical protein ZEAMMB73_420245 [Zea mays]
Length = 386
Score = 701 bits (1808), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 329/384 (85%), Positives = 359/384 (93%), Gaps = 1/384 (0%)
Query: 23 RISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPV 82
++ GSAGDVLED+PVG+LKVYV++LPSKYNKKL+ KDPRCL HMFAAEIFMHR LLSS V
Sbjct: 4 KVEGSAGDVLEDNPVGRLKVYVHDLPSKYNKKLVKKDPRCLNHMFAAEIFMHRLLLSSAV 63
Query: 83 RTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFF 142
RT NPEEADWFYTP+Y TCDLTP+GLPLPFKSPRMM SAI+LI++NWPYWNR+EGADHFF
Sbjct: 64 RTFNPEEADWFYTPVYATCDLTPSGLPLPFKSPRMMLSAIELIATNWPYWNRSEGADHFF 123
Query: 143 VVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQ 202
V PHDFGACFHYQ+EKAI RGILPLLQ ATLVQTFGQ+NHVCL GSITIPP+APPQKMQ
Sbjct: 124 VTPHDFGACFHYQDEKAIGRGILPLLQHATLVQTFGQKNHVCLKGGSITIPPFAPPQKMQ 183
Query: 203 AHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTY 262
AH IP DTPRSIFVYFRGLFYD +NDPEGGYYARGARA+VWENFKNNPLFDISTDHP+TY
Sbjct: 184 AHLIPADTPRSIFVYFRGLFYDTSNDPEGGYYARGARASVWENFKNNPLFDISTDHPSTY 243
Query: 263 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
YEDM+R++FCLCPLGWAPWSPRLVEAVVFGCIP+IIA DIVLPFADAIPWEEIG+FVAEE
Sbjct: 244 YEDMERSVFCLCPLGWAPWSPRLVEAVVFGCIPLIIA-DIVLPFADAIPWEEIGVFVAEE 302
Query: 323 DVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDK 382
DVPKLD+ILTSIP +VILRKQRLLANPSMK+AMLFPQPAQ GDAFHQILNGLARKLPH
Sbjct: 303 DVPKLDSILTSIPTDVILRKQRLLANPSMKQAMLFPQPAQAGDAFHQILNGLARKLPHGD 362
Query: 383 STYLKPGGKFLNWTSGPVGDLKPW 406
S +LKPG + LNWT+GP GDLKPW
Sbjct: 363 SVFLKPGERVLNWTAGPPGDLKPW 386
>gi|242088705|ref|XP_002440185.1| hypothetical protein SORBIDRAFT_09g027450 [Sorghum bicolor]
gi|241945470|gb|EES18615.1| hypothetical protein SORBIDRAFT_09g027450 [Sorghum bicolor]
Length = 416
Score = 681 bits (1758), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 324/397 (81%), Positives = 352/397 (88%), Gaps = 3/397 (0%)
Query: 10 LAFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAA 69
+A V+A T ERI GSAGDVLEDDPVG+LKVYVYELP KYNK +L KD RCL+HMFA
Sbjct: 23 VAVVAAQET---ERIQGSAGDVLEDDPVGRLKVYVYELPPKYNKNILAKDSRCLSHMFAT 79
Query: 70 EIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNW 129
EIFMHRFLL+S VRTLNP+EADWFYTP+Y TCDLTP G PL KSPRMMRSAIQ IS W
Sbjct: 80 EIFMHRFLLTSAVRTLNPDEADWFYTPVYTTCDLTPWGHPLTTKSPRMMRSAIQYISKRW 139
Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGS 189
PYWNRTEGADHFFV PHDFGACF++QEE AI+RG+LP+L+RATLVQTFGQ++HVCL EGS
Sbjct: 140 PYWNRTEGADHFFVTPHDFGACFYFQEETAIQRGVLPVLRRATLVQTFGQKHHVCLKEGS 199
Query: 190 ITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN 249
ITIPPYAPP K++ H +PP+TPRSIFVYFRGLFYD NDPEGGYYARGARA+VWENFKNN
Sbjct: 200 ITIPPYAPPHKIRTHIVPPETPRSIFVYFRGLFYDTANDPEGGYYARGARASVWENFKNN 259
Query: 250 PLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 309
LFDIST+HP TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA
Sbjct: 260 ALFDISTEHPPTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 319
Query: 310 IPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQ 369
IPWEEI +FVAE+DV KLDTILTSIP E ILRKQRLLANPSMK+AMLFPQPA+P DAFHQ
Sbjct: 320 IPWEEIAVFVAEDDVLKLDTILTSIPMEEILRKQRLLANPSMKQAMLFPQPAEPRDAFHQ 379
Query: 370 ILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
+LNGLARKLPH K +LKPG K LNWT G DLKPW
Sbjct: 380 VLNGLARKLPHGKGVFLKPGQKVLNWTEGTREDLKPW 416
>gi|363543513|ref|NP_001241766.1| secondary cell wall-related glycosyltransferase family 47 precursor
[Zea mays]
gi|195634597|gb|ACG36767.1| secondary cell wall-related glycosyltransferase family 47 [Zea
mays]
Length = 417
Score = 680 bits (1755), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 322/397 (81%), Positives = 354/397 (89%), Gaps = 3/397 (0%)
Query: 10 LAFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAA 69
+A V+A + IERI GSAGDVLEDDPVG+LKVYVYELP KYNK ++ KD RCL+HMFA
Sbjct: 24 VAVVTAQD---IERIQGSAGDVLEDDPVGRLKVYVYELPPKYNKNVVAKDSRCLSHMFAT 80
Query: 70 EIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNW 129
EIFMHRFLL+S VRTLNP+EADWFYTP+Y TCDLTP G PL KSPRMMRSAIQ +S W
Sbjct: 81 EIFMHRFLLASAVRTLNPDEADWFYTPVYTTCDLTPWGHPLTVKSPRMMRSAIQYVSKRW 140
Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGS 189
PYWNRTEGADHFFV PHDFGACF++QEEKAI+RG+LP+L+RATLVQTFGQ+NHVCL EGS
Sbjct: 141 PYWNRTEGADHFFVTPHDFGACFYFQEEKAIQRGVLPVLRRATLVQTFGQKNHVCLREGS 200
Query: 190 ITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN 249
ITIPPYAPP K++AH +PP+TPRSIFVYFRGLFYD NDPEGGYYARGARA+VWENFKNN
Sbjct: 201 ITIPPYAPPHKIRAHIVPPETPRSIFVYFRGLFYDTANDPEGGYYARGARASVWENFKNN 260
Query: 250 PLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 309
LFDIST+HP TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA
Sbjct: 261 ALFDISTEHPPTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 320
Query: 310 IPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQ 369
IPWEEI +FV E+DV +LDTILTSIP + ILRKQRLLANPSMK+AMLFPQPA+P DAFHQ
Sbjct: 321 IPWEEIAVFVPEDDVLRLDTILTSIPMDEILRKQRLLANPSMKQAMLFPQPAEPRDAFHQ 380
Query: 370 ILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
+LNGLARKLPH K +LKPG K LNWT G DLKPW
Sbjct: 381 VLNGLARKLPHAKDVFLKPGQKVLNWTEGTQDDLKPW 417
>gi|226529361|ref|NP_001147714.1| LOC100281324 precursor [Zea mays]
gi|195613258|gb|ACG28459.1| secondary cell wall-related glycosyltransferase family 47 [Zea
mays]
Length = 418
Score = 680 bits (1754), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 322/397 (81%), Positives = 354/397 (89%), Gaps = 3/397 (0%)
Query: 10 LAFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAA 69
+A V+A + IERI GSAGDVLEDDPVG+LKVYVYELP KYNK ++ KD RCL+HMFA
Sbjct: 25 VAVVTAQD---IERIQGSAGDVLEDDPVGRLKVYVYELPPKYNKNVVAKDSRCLSHMFAT 81
Query: 70 EIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNW 129
EIFMHRFLL+S VRTLNP+EADWFYTP+Y TCDLTP G PL KSPRMMRSAIQ +S W
Sbjct: 82 EIFMHRFLLASAVRTLNPDEADWFYTPVYTTCDLTPWGHPLTVKSPRMMRSAIQYVSKRW 141
Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGS 189
PYWNRTEGADHFFV PHDFGACF++QEEKAI+RG+LP+L+RATLVQTFGQ+NHVCL EGS
Sbjct: 142 PYWNRTEGADHFFVTPHDFGACFYFQEEKAIQRGVLPVLRRATLVQTFGQKNHVCLREGS 201
Query: 190 ITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN 249
ITIPPYAPP K++AH +PP+TPRSIFVYFRGLFYD NDPEGGYYARGARA+VWENFKNN
Sbjct: 202 ITIPPYAPPHKIRAHIVPPETPRSIFVYFRGLFYDTANDPEGGYYARGARASVWENFKNN 261
Query: 250 PLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 309
LFDIST+HP TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA
Sbjct: 262 ALFDISTEHPPTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 321
Query: 310 IPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQ 369
IPWEEI +FV E+DV +LDTILTSIP + ILRKQRLLANPSMK+AMLFPQPA+P DAFHQ
Sbjct: 322 IPWEEIAVFVPEDDVLRLDTILTSIPMDEILRKQRLLANPSMKQAMLFPQPAEPRDAFHQ 381
Query: 370 ILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
+LNGLARKLPH K +LKPG K LNWT G DLKPW
Sbjct: 382 VLNGLARKLPHAKDVFLKPGQKVLNWTEGTQDDLKPW 418
>gi|194706890|gb|ACF87529.1| unknown [Zea mays]
Length = 418
Score = 679 bits (1753), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 322/397 (81%), Positives = 354/397 (89%), Gaps = 3/397 (0%)
Query: 10 LAFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAA 69
+A V+A + IERI GSAGDVLEDDPVG+LKVYVYELP KYNK ++ KD RCL+HMFA
Sbjct: 25 VAVVTAQD---IERIQGSAGDVLEDDPVGRLKVYVYELPPKYNKNVVAKDSRCLSHMFAT 81
Query: 70 EIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNW 129
EIFMHRFLL+S VRTLNP+EADWFYTP+Y TCDLTP G PL KSPRMMRSAIQ +S W
Sbjct: 82 EIFMHRFLLASAVRTLNPDEADWFYTPVYTTCDLTPWGHPLTVKSPRMMRSAIQYVSKRW 141
Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGS 189
PYWNRTEGADHFFV PHDFGACF++QEEKAI+RG+LP+L+RATLVQTFGQ+NHVCL EGS
Sbjct: 142 PYWNRTEGADHFFVTPHDFGACFYFQEEKAIQRGVLPVLRRATLVQTFGQKNHVCLREGS 201
Query: 190 ITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN 249
ITIPPYAPP K++AH +PP+TPRSIFVYFRGLFYD NDPEGGYYARGARA+VWENFKNN
Sbjct: 202 ITIPPYAPPHKIRAHIVPPETPRSIFVYFRGLFYDTANDPEGGYYARGARASVWENFKNN 261
Query: 250 PLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 309
LFDIST+HP TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA
Sbjct: 262 ALFDISTEHPPTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 321
Query: 310 IPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQ 369
IPWEEI +FV E+DV +LDTILTSIP + ILRKQRLLANPSMK+AMLFPQPA+P DAFHQ
Sbjct: 322 IPWEEIAVFVPEDDVLRLDTILTSIPMDEILRKQRLLANPSMKQAMLFPQPAEPRDAFHQ 381
Query: 370 ILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
+LNGLARKLPH K +LKPG K LNWT G DLKPW
Sbjct: 382 VLNGLARKLPHPKDVFLKPGQKVLNWTEGAQDDLKPW 418
>gi|449469226|ref|XP_004152322.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10-like [Cucumis
sativus]
Length = 388
Score = 679 bits (1753), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 331/406 (81%), Positives = 349/406 (85%), Gaps = 29/406 (7%)
Query: 1 MWVTAIALLLAFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDP 60
+WV AI L+ + N ERISGSAGDVLED+PVG+LKVYVY+LPSKYNKK L KDP
Sbjct: 12 LWVFAIFLVSRIAAVPNAVPTERISGSAGDVLEDNPVGRLKVYVYDLPSKYNKKTLQKDP 71
Query: 61 RCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRS 120
RCLTHMFAAEI+MHRFLL+SPVRTLNP+EADWFYTPIY TCDLTP GLPLPFKSPRMMRS
Sbjct: 72 RCLTHMFAAEIYMHRFLLNSPVRTLNPDEADWFYTPIYVTCDLTPNGLPLPFKSPRMMRS 131
Query: 121 AIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR 180
AIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAI+RGI
Sbjct: 132 AIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIDRGI---------------- 175
Query: 181 NHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARA 240
PY PPQKM+ H IP +TPRSIFVYFRGLFYDVNNDPEGGYYARGARA
Sbjct: 176 -------------PYCPPQKMKTHLIPSETPRSIFVYFRGLFYDVNNDPEGGYYARGARA 222
Query: 241 AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 300
AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD
Sbjct: 223 AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 282
Query: 301 DIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQP 360
DIVLPFADAIPWEEIG+FV E+DV LDTILTSIPP+VILRKQRLLANPSMKRAM+FPQP
Sbjct: 283 DIVLPFADAIPWEEIGVFVDEKDVSNLDTILTSIPPDVILRKQRLLANPSMKRAMMFPQP 342
Query: 361 AQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
AQ GDAFHQILNGLARKLPHDK YLKPG + LNWT+GPVGDLKPW
Sbjct: 343 AQSGDAFHQILNGLARKLPHDKGVYLKPGERILNWTAGPVGDLKPW 388
>gi|413946367|gb|AFW79016.1| secondary cell wall glycosyltransferase family 47 [Zea mays]
Length = 418
Score = 679 bits (1752), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 322/397 (81%), Positives = 354/397 (89%), Gaps = 3/397 (0%)
Query: 10 LAFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAA 69
+A V+A + IERI GSAGDVLEDDPVG+LKVYVYELP KYNK ++ KD RCL+HMFA
Sbjct: 25 VAVVTAQD---IERIQGSAGDVLEDDPVGRLKVYVYELPPKYNKNVVAKDSRCLSHMFAT 81
Query: 70 EIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNW 129
EIFMHRFLL+S VRTLNP+EADWFYTP+Y TCDLTP G PL KSPRMMRSAIQ +S W
Sbjct: 82 EIFMHRFLLASAVRTLNPDEADWFYTPVYTTCDLTPWGHPLTVKSPRMMRSAIQYVSKRW 141
Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGS 189
PYWNRTEGADHFFV PHDFGACF++QEEKAI+RG+LP+L+RATLVQTFGQ+NHVCL EGS
Sbjct: 142 PYWNRTEGADHFFVTPHDFGACFYFQEEKAIQRGVLPVLRRATLVQTFGQKNHVCLREGS 201
Query: 190 ITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN 249
ITIPPYAPP K++AH +PP+TPRSIFVYFRGLFYD NDPEGGYYARGARA+VWENFKNN
Sbjct: 202 ITIPPYAPPHKIRAHIVPPETPRSIFVYFRGLFYDTANDPEGGYYARGARASVWENFKNN 261
Query: 250 PLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 309
LFDIST+HP TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA
Sbjct: 262 ALFDISTEHPPTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 321
Query: 310 IPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQ 369
IPWEEI +FV E+DV +LDTILTSIP + ILRKQRLLANPSMK+AMLFPQPA+P DAFHQ
Sbjct: 322 IPWEEIAVFVPEDDVLRLDTILTSIPMDEILRKQRLLANPSMKQAMLFPQPAEPRDAFHQ 381
Query: 370 ILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
+LNGLARKLPH K +LKPG K LNWT G DLKPW
Sbjct: 382 VLNGLARKLPHPKDVFLKPGQKVLNWTEGAQDDLKPW 418
>gi|242090769|ref|XP_002441217.1| hypothetical protein SORBIDRAFT_09g022500 [Sorghum bicolor]
gi|241946502|gb|EES19647.1| hypothetical protein SORBIDRAFT_09g022500 [Sorghum bicolor]
Length = 422
Score = 671 bits (1731), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 315/387 (81%), Positives = 350/387 (90%), Gaps = 2/387 (0%)
Query: 22 ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP 81
ERISGS GDVLEDDPVGKLKV+VY+LP+KYN K + KDPRCLTHMFA EIF+HR LLSS
Sbjct: 36 ERISGSGGDVLEDDPVGKLKVFVYDLPAKYNTKPVEKDPRCLTHMFATEIFVHRSLLSSA 95
Query: 82 VRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 141
VRTL+PEEADWFYTP+Y TCDLT +G P+PF SPRMMRSAI+LI+ WPYWNR+EGADHF
Sbjct: 96 VRTLDPEEADWFYTPVYTTCDLTASGHPMPFDSPRMMRSAIRLIADRWPYWNRSEGADHF 155
Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG--SITIPPYAPPQ 199
FV PHDFGACFH+QEEKA+ RGILP+L+RATLVQTFGQRNHVCL +G SITIPPYAPP
Sbjct: 156 FVTPHDFGACFHFQEEKAMARGILPVLRRATLVQTFGQRNHVCLKDGGGSITIPPYAPPW 215
Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
KM+A +PP TPRSIFVYFRGLFYD NDPEGGYYARGARA+VWENFK+NPLFDIST HP
Sbjct: 216 KMEAQLLPPATPRSIFVYFRGLFYDAGNDPEGGYYARGARASVWENFKSNPLFDISTAHP 275
Query: 260 TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV 319
TTYY+DMQRA+FCLCPLGWAPWSPRLVEAVVFGCIPV+IADDIVLPFADAIPW +IG+FV
Sbjct: 276 TTYYQDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVVIADDIVLPFADAIPWADIGVFV 335
Query: 320 AEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
AE+DVP+LDTILTSIP EV+LRKQRLLA+P+MKRA+LFPQPAQPGDAFHQILNGLARKLP
Sbjct: 336 AEDDVPRLDTILTSIPVEVVLRKQRLLASPAMKRAVLFPQPAQPGDAFHQILNGLARKLP 395
Query: 380 HDKSTYLKPGGKFLNWTSGPVGDLKPW 406
H +L+ G LNWT+GP DLKPW
Sbjct: 396 HGDDAFLRNGQTVLNWTAGPPRDLKPW 422
>gi|23821292|dbj|BAC20928.1| pectin-glucuronyltransferase [Nicotiana plumbaginifolia]
Length = 341
Score = 666 bits (1718), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 314/341 (92%), Positives = 328/341 (96%)
Query: 66 MFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLI 125
MFAAEI+MHRFLLSS VRT NPEEADWFYTP+Y TCDLTP GLPLPFKSPRMMRSAIQLI
Sbjct: 1 MFAAEIYMHRFLLSSAVRTFNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLI 60
Query: 126 SSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCL 185
+SNWPYWNRTEGADHFF+ PHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCL
Sbjct: 61 ASNWPYWNRTEGADHFFITPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCL 120
Query: 186 NEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWEN 245
+GSITIPPYAPPQKMQ+H IPPDTPRSIFVYFRGLFYDV NDPEGGYYARGARAAVWEN
Sbjct: 121 KDGSITIPPYAPPQKMQSHLIPPDTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWEN 180
Query: 246 FKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
FK+NPLFDIST+HPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAV+FGCIPVIIADDIVLP
Sbjct: 181 FKDNPLFDISTEHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLP 240
Query: 306 FADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
FADAIPWE+IG+FVAE+DVP LDTILTSIPPE ILRKQRLLANPSMK+AMLFPQPAQ GD
Sbjct: 241 FADAIPWEDIGVFVAEKDVPNLDTILTSIPPEEILRKQRLLANPSMKQAMLFPQPAQSGD 300
Query: 366 AFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
AFHQILNGLARKLPHDKSTYLKPG FLNWT+GPVGDLKPW
Sbjct: 301 AFHQILNGLARKLPHDKSTYLKPGENFLNWTAGPVGDLKPW 341
>gi|168000965|ref|XP_001753186.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695885|gb|EDQ82227.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 449
Score = 664 bits (1714), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 316/408 (77%), Positives = 354/408 (86%), Gaps = 4/408 (0%)
Query: 3 VTAIALLLAFV--SADNTPKI--ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLK 58
+ ++ LLL F+ D T ++ +RISGSAGDVLED+PVG+LKV++Y++PSKYN L K
Sbjct: 38 LRSVVLLLLFIYFVQDVTAEMGHQRISGSAGDVLEDNPVGRLKVFIYDIPSKYNTDWLKK 97
Query: 59 DPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMM 118
DPRCLTHMFA E ++H FL SPVRTLNPEEADWFYTP+Y TCDLTP GLPLPFKSPR+M
Sbjct: 98 DPRCLTHMFAVEEYLHDFLTESPVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRVM 157
Query: 119 RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG 178
RSAI ISS+WPYWNRT+GADHFFVVPHDF ACFHYQEEKAIERGILPLL+RATL+QTFG
Sbjct: 158 RSAISYISSHWPYWNRTDGADHFFVVPHDFAACFHYQEEKAIERGILPLLKRATLIQTFG 217
Query: 179 QRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGA 238
Q +HVCL E SI IPPYAPP++MQ PP TPRSIF YFRGLFYD NDPEGGYYARGA
Sbjct: 218 QNHHVCLKEDSIVIPPYAPPERMQTRLNPPSTPRSIFAYFRGLFYDPGNDPEGGYYARGA 277
Query: 239 RAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 298
RAA+WENFK+NPLFDIST+HP TYYEDMQRAIFCLCPLGWAPWSPRLVE V+FGCIPVII
Sbjct: 278 RAAIWENFKDNPLFDISTEHPATYYEDMQRAIFCLCPLGWAPWSPRLVEGVIFGCIPVII 337
Query: 299 ADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFP 358
ADDIVLPFADAIPWE+IG+FV E+DVP LD IL +I E +L KQRLLANP+MK+AMLFP
Sbjct: 338 ADDIVLPFADAIPWEKIGVFVEEKDVPILDKILCTINHEEVLEKQRLLANPAMKQAMLFP 397
Query: 359 QPAQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
+PA+PGDAFHQILNGLARKLPHD S YL+PG FLNWT GP GDL PW
Sbjct: 398 RPAKPGDAFHQILNGLARKLPHDPSIYLQPGQSFLNWTEGPPGDLYPW 445
>gi|326490197|dbj|BAJ94172.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326526653|dbj|BAK00715.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 422
Score = 661 bits (1706), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 308/384 (80%), Positives = 345/384 (89%)
Query: 23 RISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPV 82
RISG+AGDVLEDDPVGKLKV+VYE+P KYN LL KD RCL HMFAAEIFMH+FLL+S V
Sbjct: 39 RISGNAGDVLEDDPVGKLKVFVYEMPRKYNHYLLEKDNRCLYHMFAAEIFMHQFLLASAV 98
Query: 83 RTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFF 142
RT NPEEADWFYTP+Y TCDLT G PLPF++PR+MRSAIQ I++ WPYWNRTEGADHFF
Sbjct: 99 RTKNPEEADWFYTPVYVTCDLTQQGFPLPFRAPRIMRSAIQYIAATWPYWNRTEGADHFF 158
Query: 143 VVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQ 202
+ PHDFGACFHYQEE+AIERGILPLL+RATLVQTFGQRNHVC+ EGSITIPPYA PQKMQ
Sbjct: 159 LAPHDFGACFHYQEERAIERGILPLLRRATLVQTFGQRNHVCMQEGSITIPPYANPQKMQ 218
Query: 203 AHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTY 262
AH I P TPRSIF YFRGLFYD+ NDPEGGYYARGARA+VWENFK+NPLFD+S++HP+TY
Sbjct: 219 AHLISPGTPRSIFAYFRGLFYDMGNDPEGGYYARGARASVWENFKDNPLFDMSSEHPSTY 278
Query: 263 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE+I +FVAE
Sbjct: 279 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEQISVFVAEA 338
Query: 323 DVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDK 382
DVP+LD+IL S+ PE +LRKQRLLA+P+MK+A+LF QPA+PGDAF Q+LNGLARKLPH
Sbjct: 339 DVPRLDSILASVAPEDVLRKQRLLASPAMKQAVLFHQPARPGDAFDQVLNGLARKLPHPD 398
Query: 383 STYLKPGGKFLNWTSGPVGDLKPW 406
+LKPG K L+W +G DLKPW
Sbjct: 399 GAFLKPGQKVLDWNAGLDSDLKPW 422
>gi|357126594|ref|XP_003564972.1| PREDICTED: probable glucuronosyltransferase Os01g0926400-like
[Brachypodium distachyon]
Length = 423
Score = 660 bits (1704), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 303/384 (78%), Positives = 347/384 (90%)
Query: 23 RISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPV 82
RISG+AGDVLED+PVGKLKV+VYE+P KYN+ LL KD RCL HMFAAEIFMH+FLL+S V
Sbjct: 40 RISGNAGDVLEDNPVGKLKVFVYEMPRKYNQYLLEKDNRCLYHMFAAEIFMHQFLLASAV 99
Query: 83 RTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFF 142
RT++PEEADWFYTP+Y TCDLT G PLPF++PRMMRSA+Q I++ WPYWNRTEGADHFF
Sbjct: 100 RTMDPEEADWFYTPVYTTCDLTQQGFPLPFRAPRMMRSAVQYIAATWPYWNRTEGADHFF 159
Query: 143 VVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQ 202
+ PHDFGACFHYQEE+AIERGILP+L+RATLVQTFGQRNHVC+ EGSITIPPYA PQKMQ
Sbjct: 160 LAPHDFGACFHYQEERAIERGILPVLRRATLVQTFGQRNHVCMQEGSITIPPYANPQKMQ 219
Query: 203 AHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTY 262
AH I P TPRSIF YFRGLFYD+ NDPEGGYYARGARA+VWENFK+NPLFD+ST+HP+TY
Sbjct: 220 AHLISPGTPRSIFAYFRGLFYDMGNDPEGGYYARGARASVWENFKDNPLFDMSTEHPSTY 279
Query: 263 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW++I +FV E
Sbjct: 280 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWDQISVFVPEA 339
Query: 323 DVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDK 382
DVP+LD+IL S+PPE +LRKQRLLA+P+MK+A+LF QPAQP DAF Q+LNGLARKLPH +
Sbjct: 340 DVPRLDSILASVPPEDVLRKQRLLASPAMKQAVLFHQPAQPRDAFDQVLNGLARKLPHRE 399
Query: 383 STYLKPGGKFLNWTSGPVGDLKPW 406
+L+PG K ++W +G DLKPW
Sbjct: 400 GVFLRPGQKVIDWNAGQYDDLKPW 423
>gi|115441965|ref|NP_001045262.1| Os01g0926400 [Oryza sativa Japonica Group]
gi|75159223|sp|Q8S1X9.1|GT13_ORYSJ RecName: Full=Probable glucuronosyltransferase Os01g0926400;
AltName: Full=OsGT47D
gi|20160726|dbj|BAB89668.1| P0482D04.15 [Oryza sativa Japonica Group]
gi|20805224|dbj|BAB92891.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
gi|113534793|dbj|BAF07176.1| Os01g0926400 [Oryza sativa Japonica Group]
gi|215686656|dbj|BAG88909.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215707279|dbj|BAG93739.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 422
Score = 657 bits (1696), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 305/394 (77%), Positives = 348/394 (88%)
Query: 13 VSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIF 72
+A T K R+SG GDVLEDDP GKLKV+VYE+P KYN LL KD RCL HMFAAEIF
Sbjct: 29 AAATTTQKHGRMSGKGGDVLEDDPTGKLKVFVYEMPRKYNLNLLAKDSRCLQHMFAAEIF 88
Query: 73 MHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYW 132
MH+FLLSSPVRTL+PEEADWFYTP Y TCDLTP G PLPF++PR+MRSA++ +++ WPYW
Sbjct: 89 MHQFLLSSPVRTLDPEEADWFYTPAYTTCDLTPQGFPLPFRAPRIMRSAVRYVAATWPYW 148
Query: 133 NRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITI 192
NRT+GADHFF+ PHDFGACFHYQEE+AIERGILP+L+RATLVQTFGQR+H CL GSIT+
Sbjct: 149 NRTDGADHFFLAPHDFGACFHYQEERAIERGILPVLRRATLVQTFGQRHHPCLQPGSITV 208
Query: 193 PPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLF 252
PPYA P+KM+AH+I P TPRSIFVYFRGLFYD+ NDPEGGYYARGARA+VWENFK+NPLF
Sbjct: 209 PPYADPRKMEAHRISPATPRSIFVYFRGLFYDMGNDPEGGYYARGARASVWENFKDNPLF 268
Query: 253 DISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 312
DIST+HP TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW
Sbjct: 269 DISTEHPATYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 328
Query: 313 EEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILN 372
EI +FVAEEDVP+LDTIL S+P + ++RKQRLLA+P+MK+A+LF QPA+PGDAFHQILN
Sbjct: 329 GEISVFVAEEDVPRLDTILASVPLDEVIRKQRLLASPAMKQAVLFHQPARPGDAFHQILN 388
Query: 373 GLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
GLARKLPH K +L+PG K ++W G DLKPW
Sbjct: 389 GLARKLPHPKGVFLEPGEKGIDWDQGLENDLKPW 422
>gi|21671946|gb|AAM74308.1|AC083944_26 Unknown protein similar to exostosin-2 [Oryza sativa Japonica
Group]
gi|110288700|gb|ABB46946.2| exostosin family protein, putative, expressed [Oryza sativa
Japonica Group]
Length = 401
Score = 656 bits (1693), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 310/406 (76%), Positives = 346/406 (85%), Gaps = 12/406 (2%)
Query: 6 IALLLAF-----VSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDP 60
+A+LL F V+A IERI EDDPVGKLKVYVYELP KYNK ++ KD
Sbjct: 3 LAVLLLFACSSGVAAAAAQGIERIK-------EDDPVGKLKVYVYELPPKYNKNIVAKDS 55
Query: 61 RCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRS 120
RCL+HMFA EIFMHRFLLSS +RT NP+EADWFYTP+Y TCDLTP G PL KSPRMMRS
Sbjct: 56 RCLSHMFATEIFMHRFLLSSAIRTSNPDEADWFYTPVYTTCDLTPWGHPLTTKSPRMMRS 115
Query: 121 AIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR 180
AI+ IS WPYWNRTEGADHFFVVPHDF ACF++QE KAIERGILP+L+RATLVQTFGQ+
Sbjct: 116 AIKFISKYWPYWNRTEGADHFFVVPHDFAACFYFQEAKAIERGILPVLRRATLVQTFGQK 175
Query: 181 NHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARA 240
NH CL +GSIT+PPY P K++AH +PP+TPRSIFVYFRGLFYD +NDPEGGYYARGARA
Sbjct: 176 NHACLKDGSITVPPYTPAHKIRAHLVPPETPRSIFVYFRGLFYDTSNDPEGGYYARGARA 235
Query: 241 AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 300
+VWENFKNNP+FDISTDHP TYYEDMQRA+FCLCPLGWAPWSPRLVEAVVFGCIPVIIAD
Sbjct: 236 SVWENFKNNPMFDISTDHPQTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 295
Query: 301 DIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQP 360
DIVLPF+DAIPWEEI +FVAE+DVP+LDTILTSIP EVILRKQ +LA PSMK+ MLFPQP
Sbjct: 296 DIVLPFSDAIPWEEIAVFVAEDDVPQLDTILTSIPTEVILRKQAMLAEPSMKQTMLFPQP 355
Query: 361 AQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
A+PGD FHQ++N LARKLPH + +LKPG K LNWT G DLKPW
Sbjct: 356 AEPGDGFHQVMNALARKLPHGRDVFLKPGQKVLNWTEGTREDLKPW 401
>gi|125528940|gb|EAY77054.1| hypothetical protein OsI_05011 [Oryza sativa Indica Group]
Length = 422
Score = 655 bits (1690), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 304/394 (77%), Positives = 347/394 (88%)
Query: 13 VSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIF 72
+A T K +SG GDVLEDDP GKLKV+VYE+P KYN LL KD RCL HMFAAEIF
Sbjct: 29 AAATTTQKHGSMSGKGGDVLEDDPTGKLKVFVYEMPRKYNLNLLAKDSRCLQHMFAAEIF 88
Query: 73 MHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYW 132
MH+FLLSSPVRTL+PEEADWFYTP Y TCDLTP G PLPF++PR+MRSA++ +++ WPYW
Sbjct: 89 MHQFLLSSPVRTLDPEEADWFYTPAYTTCDLTPQGFPLPFRAPRIMRSAVRYVAATWPYW 148
Query: 133 NRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITI 192
NRT+GADHFF+ PHDFGACFHYQEE+AIERGILP+L+RATLVQTFGQR+H CL GSIT+
Sbjct: 149 NRTDGADHFFLAPHDFGACFHYQEERAIERGILPVLRRATLVQTFGQRHHPCLQPGSITV 208
Query: 193 PPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLF 252
PPYA P+KM+AH+I P TPRSIFVYFRGLFYD+ NDPEGGYYARGARA+VWENFK+NPLF
Sbjct: 209 PPYADPRKMEAHRISPATPRSIFVYFRGLFYDMGNDPEGGYYARGARASVWENFKDNPLF 268
Query: 253 DISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 312
DIST+HP TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW
Sbjct: 269 DISTEHPATYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 328
Query: 313 EEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILN 372
EI +FVAEEDVP+LDTIL S+P + ++RKQRLLA+P+MK+A+LF QPA+PGDAFHQILN
Sbjct: 329 GEISVFVAEEDVPRLDTILASVPLDEVIRKQRLLASPAMKQAVLFHQPARPGDAFHQILN 388
Query: 373 GLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
GLARKLPH K +L+PG K ++W G DLKPW
Sbjct: 389 GLARKLPHPKGVFLEPGEKGIDWDQGLENDLKPW 422
>gi|218184216|gb|EEC66643.1| hypothetical protein OsI_32904 [Oryza sativa Indica Group]
gi|222612524|gb|EEE50656.1| hypothetical protein OsJ_30886 [Oryza sativa Japonica Group]
Length = 400
Score = 655 bits (1689), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 309/406 (76%), Positives = 346/406 (85%), Gaps = 13/406 (3%)
Query: 6 IALLLAF-----VSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDP 60
+A+LL F V+A IERI +DDPVGKLKVYVYELP KYNK ++ KD
Sbjct: 3 LAVLLLFACSSGVAAAAAQGIERI--------KDDPVGKLKVYVYELPPKYNKNIVAKDS 54
Query: 61 RCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRS 120
RCL+HMFA EIFMHRFLLSS +RT NP+EADWFYTP+Y TCDLTP G PL KSPRMMRS
Sbjct: 55 RCLSHMFATEIFMHRFLLSSAIRTSNPDEADWFYTPVYTTCDLTPWGHPLTTKSPRMMRS 114
Query: 121 AIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR 180
AI+ IS WPYWNRTEGADHFFVVPHDF ACF++QE KAIERGILP+L+RATLVQTFGQ+
Sbjct: 115 AIKFISKYWPYWNRTEGADHFFVVPHDFAACFYFQEAKAIERGILPVLRRATLVQTFGQK 174
Query: 181 NHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARA 240
NH CL +GSIT+PPY P K++AH +PP+TPRSIFVYFRGLFYD +NDPEGGYYARGARA
Sbjct: 175 NHACLKDGSITVPPYTPAHKIRAHLVPPETPRSIFVYFRGLFYDTSNDPEGGYYARGARA 234
Query: 241 AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 300
+VWENFKNNP+FDISTDHP TYYEDMQRA+FCLCPLGWAPWSPRLVEAVVFGCIPVIIAD
Sbjct: 235 SVWENFKNNPMFDISTDHPQTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 294
Query: 301 DIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQP 360
DIVLPF+DAIPWEEI +FVAE+DVP+LDTILTSIP EVILRKQ +LA PSMK+ MLFPQP
Sbjct: 295 DIVLPFSDAIPWEEIAVFVAEDDVPQLDTILTSIPTEVILRKQAMLAEPSMKQTMLFPQP 354
Query: 361 AQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
A+PGD FHQ++N LARKLPH + +LKPG K LNWT G DLKPW
Sbjct: 355 AEPGDGFHQVMNALARKLPHGRDVFLKPGQKVLNWTEGTREDLKPW 400
>gi|10176877|dbj|BAB10084.1| unnamed protein product [Arabidopsis thaliana]
gi|23821294|dbj|BAC20929.1| NpGUT1 homolog [Arabidopsis thaliana]
Length = 341
Score = 654 bits (1687), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 308/341 (90%), Positives = 325/341 (95%)
Query: 66 MFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLI 125
MFAAEI+M RFLLSSPVRTLNPEEADWFY P+Y TCDLTP GLPLPFKSPRMMRSAIQLI
Sbjct: 1 MFAAEIYMQRFLLSSPVRTLNPEEADWFYVPVYTTCDLTPNGLPLPFKSPRMMRSAIQLI 60
Query: 126 SSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCL 185
+SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAI RGILPLLQRATLVQTFGQRNHVCL
Sbjct: 61 ASNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQRNHVCL 120
Query: 186 NEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWEN 245
EGSIT+PPYAPPQKMQ+H IP TPRSIFVYFRGLFYDV NDPEGGYYARGARAAVWEN
Sbjct: 121 KEGSITVPPYAPPQKMQSHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWEN 180
Query: 246 FKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
FK+NPLFDIST+HPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAV+FGCIPVIIADDIVLP
Sbjct: 181 FKDNPLFDISTEHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLP 240
Query: 306 FADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
FADAIPWE+IG+FV E+DVP LDTILTSIPPEVILRKQRLLANPSMK+AMLFPQPAQPGD
Sbjct: 241 FADAIPWEDIGVFVDEKDVPYLDTILTSIPPEVILRKQRLLANPSMKQAMLFPQPAQPGD 300
Query: 366 AFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
AFHQ+LNGLARKLPH++S YL+PG K LNWT+GPV DLKPW
Sbjct: 301 AFHQVLNGLARKLPHERSVYLRPGEKLLNWTAGPVADLKPW 341
>gi|115481310|ref|NP_001064248.1| Os10g0180000 [Oryza sativa Japonica Group]
gi|122212636|sp|Q33AH8.2|GT101_ORYSJ RecName: Full=Probable glucuronosyltransferase GUT1; AltName:
Full=Glucuronoxylan glucuronosyltransferase 1;
Short=OsGUT1
gi|110288701|gb|ABB46947.2| exostosin family protein, putative, expressed [Oryza sativa
Japonica Group]
gi|113638857|dbj|BAF26162.1| Os10g0180000 [Oryza sativa Japonica Group]
Length = 417
Score = 654 bits (1687), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 309/406 (76%), Positives = 346/406 (85%), Gaps = 13/406 (3%)
Query: 6 IALLLAF-----VSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDP 60
+A+LL F V+A IERI +DDPVGKLKVYVYELP KYNK ++ KD
Sbjct: 20 LAVLLLFACSSGVAAAAAQGIERI--------KDDPVGKLKVYVYELPPKYNKNIVAKDS 71
Query: 61 RCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRS 120
RCL+HMFA EIFMHRFLLSS +RT NP+EADWFYTP+Y TCDLTP G PL KSPRMMRS
Sbjct: 72 RCLSHMFATEIFMHRFLLSSAIRTSNPDEADWFYTPVYTTCDLTPWGHPLTTKSPRMMRS 131
Query: 121 AIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR 180
AI+ IS WPYWNRTEGADHFFVVPHDF ACF++QE KAIERGILP+L+RATLVQTFGQ+
Sbjct: 132 AIKFISKYWPYWNRTEGADHFFVVPHDFAACFYFQEAKAIERGILPVLRRATLVQTFGQK 191
Query: 181 NHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARA 240
NH CL +GSIT+PPY P K++AH +PP+TPRSIFVYFRGLFYD +NDPEGGYYARGARA
Sbjct: 192 NHACLKDGSITVPPYTPAHKIRAHLVPPETPRSIFVYFRGLFYDTSNDPEGGYYARGARA 251
Query: 241 AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 300
+VWENFKNNP+FDISTDHP TYYEDMQRA+FCLCPLGWAPWSPRLVEAVVFGCIPVIIAD
Sbjct: 252 SVWENFKNNPMFDISTDHPQTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 311
Query: 301 DIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQP 360
DIVLPF+DAIPWEEI +FVAE+DVP+LDTILTSIP EVILRKQ +LA PSMK+ MLFPQP
Sbjct: 312 DIVLPFSDAIPWEEIAVFVAEDDVPQLDTILTSIPTEVILRKQAMLAEPSMKQTMLFPQP 371
Query: 361 AQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
A+PGD FHQ++N LARKLPH + +LKPG K LNWT G DLKPW
Sbjct: 372 AEPGDGFHQVMNALARKLPHGRDVFLKPGQKVLNWTEGTREDLKPW 417
>gi|242059739|ref|XP_002459015.1| hypothetical protein SORBIDRAFT_03g044510 [Sorghum bicolor]
gi|241930990|gb|EES04135.1| hypothetical protein SORBIDRAFT_03g044510 [Sorghum bicolor]
Length = 415
Score = 652 bits (1681), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 310/408 (75%), Positives = 352/408 (86%), Gaps = 4/408 (0%)
Query: 3 VTAIALLLAFVSA----DNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLK 58
V AI L+ V+A K RISG+A VL+DDPVGKLKV+VYE+P KYNK LL K
Sbjct: 8 VVAILLVATCVAAAPPQQQPRKPGRISGNAVGVLDDDPVGKLKVFVYEMPRKYNKMLLDK 67
Query: 59 DPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMM 118
D RCL HMFAAEIFMH+FLLSS VRTL+PEEADWFYTP+Y TCDLTP G PLPF++PR+M
Sbjct: 68 DSRCLHHMFAAEIFMHQFLLSSAVRTLDPEEADWFYTPVYTTCDLTPQGFPLPFRAPRIM 127
Query: 119 RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG 178
RSAI+ +++ WP+WNRT+GADHFF+ PHDFGACFHYQEE+A+ERGILPLL+RATLVQTFG
Sbjct: 128 RSAIRYVATTWPFWNRTDGADHFFLTPHDFGACFHYQEERAMERGILPLLRRATLVQTFG 187
Query: 179 QRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGA 238
QRNHVCL +GSIT+PPYA P KMQAH I P TPRSIFVYFRGLFYD+ NDPEGGYYARGA
Sbjct: 188 QRNHVCLQDGSITVPPYADPHKMQAHLISPGTPRSIFVYFRGLFYDMGNDPEGGYYARGA 247
Query: 239 RAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 298
RA+VWENFK+NPLFDIST+HP TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII
Sbjct: 248 RASVWENFKDNPLFDISTEHPYTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 307
Query: 299 ADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFP 358
ADDIVLPFADAIPWE+I +FVAE DVP+LD+ILTSIP ILR+QRLLA S+K+A+LF
Sbjct: 308 ADDIVLPFADAIPWEDISVFVAERDVPRLDSILTSIPLADILRRQRLLARESVKQALLFH 367
Query: 359 QPAQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
QPA+ GDAFHQ+LNGLARKLPH +L+PG K L+W +G DLKPW
Sbjct: 368 QPARTGDAFHQVLNGLARKLPHGDGVFLQPGEKVLDWDAGLDSDLKPW 415
>gi|18642697|gb|AAL76189.1|AC092173_1 Unknown protein [Oryza sativa Japonica Group]
gi|23821296|dbj|BAC20930.1| NpGUT1 homolog [Oryza sativa Japonica Group]
Length = 401
Score = 652 bits (1681), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 308/406 (75%), Positives = 344/406 (84%), Gaps = 12/406 (2%)
Query: 6 IALLLAF-----VSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDP 60
+A+LL F V+A IERI EDDPVGKLKVYVYELP KYNK ++ KD
Sbjct: 3 LAVLLLFACSSGVAAAAAQGIERIK-------EDDPVGKLKVYVYELPPKYNKNIVAKDS 55
Query: 61 RCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRS 120
RCL+HMFA EIFMHRFLLSS +RT NP+EADWFYTP+Y TCDLTP G PL KSPRMMRS
Sbjct: 56 RCLSHMFATEIFMHRFLLSSAIRTSNPDEADWFYTPVYTTCDLTPWGHPLTTKSPRMMRS 115
Query: 121 AIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR 180
AI+ IS WPYWNRTEGADHFFVVPHDF ACF++QE KAIERGILP+L+RATLVQTFGQ+
Sbjct: 116 AIKFISKYWPYWNRTEGADHFFVVPHDFAACFYFQEAKAIERGILPVLRRATLVQTFGQK 175
Query: 181 NHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARA 240
NH CL +GSIT+PPY P K++AH +PP+TPRSIFVYFRGLFYD +NDPEGGYYARGARA
Sbjct: 176 NHACLKDGSITVPPYTPAHKIRAHLVPPETPRSIFVYFRGLFYDTSNDPEGGYYARGARA 235
Query: 241 AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 300
+VWENFKNNP+FDISTDHP TYYEDMQRA+FCLCPLGWAPWSPRLVEAVVFGCIPVIIAD
Sbjct: 236 SVWENFKNNPMFDISTDHPQTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 295
Query: 301 DIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQP 360
DI LP +DAIPWEEI +FVAE+DVP+LDTILTSIP EVILRKQ +LA PSMK+ MLFPQP
Sbjct: 296 DIDLPLSDAIPWEEIAVFVAEDDVPQLDTILTSIPTEVILRKQAMLAEPSMKQTMLFPQP 355
Query: 361 AQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
A+PGD FHQ++N LARKLPH + +LKPG K LNWT G DLKPW
Sbjct: 356 AEPGDGFHQVMNALARKLPHGRDVFLKPGQKVLNWTEGTREDLKPW 401
>gi|195648072|gb|ACG43504.1| secondary cell wall-related glycosyltransferase family 47 [Zea
mays]
Length = 420
Score = 651 bits (1680), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 303/384 (78%), Positives = 345/384 (89%)
Query: 23 RISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPV 82
RISG A VL+DDPVGKLKV+VYE+P KYN+ LL KD RCL HMFAAEIFMH+FLLSS V
Sbjct: 37 RISGDAVGVLDDDPVGKLKVFVYEMPRKYNRMLLDKDSRCLHHMFAAEIFMHQFLLSSAV 96
Query: 83 RTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFF 142
RTL+PEEADWFYTP Y TCDLTP G PLPF++PR+MRSAI+ +++ WP+WNRT+GADHFF
Sbjct: 97 RTLDPEEADWFYTPAYTTCDLTPQGFPLPFRAPRIMRSAIRYVATTWPFWNRTDGADHFF 156
Query: 143 VVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQ 202
+ PHDFGACFHYQEE+A+ERGILPLL+RATLVQTFGQRNHVCL +GSIT+PPYA P+KMQ
Sbjct: 157 LTPHDFGACFHYQEERAMERGILPLLRRATLVQTFGQRNHVCLQDGSITVPPYADPRKMQ 216
Query: 203 AHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTY 262
AH I P TPRSIFVYFRGLFYD+ NDPEGGYYARGARA+VWENFK+NPLFDIST+HP+TY
Sbjct: 217 AHLISPGTPRSIFVYFRGLFYDMGNDPEGGYYARGARASVWENFKDNPLFDISTEHPSTY 276
Query: 263 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE+I +FVAE
Sbjct: 277 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEDISVFVAER 336
Query: 323 DVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDK 382
DVP+LD+ILTSIP ILR+QRLLA S+K+A+LF QPA+PGDAFHQ+LNGLARKLPH +
Sbjct: 337 DVPRLDSILTSIPLPDILRRQRLLARDSVKQALLFHQPARPGDAFHQVLNGLARKLPHGE 396
Query: 383 STYLKPGGKFLNWTSGPVGDLKPW 406
+L+PG K L+W +G DLKPW
Sbjct: 397 GVFLQPGEKALDWDAGLQSDLKPW 420
>gi|414878903|tpg|DAA56034.1| TPA: hypothetical protein ZEAMMB73_301237 [Zea mays]
Length = 393
Score = 650 bits (1678), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 305/388 (78%), Positives = 346/388 (89%), Gaps = 3/388 (0%)
Query: 22 ERISGSAGD---VLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLL 78
ERI G AGD VL+DDPVGKLKV+VYE+P KYN+ LL KD RCL HMFAAEIFMH+FLL
Sbjct: 6 ERIDGFAGDAVGVLDDDPVGKLKVFVYEMPRKYNRMLLDKDSRCLHHMFAAEIFMHQFLL 65
Query: 79 SSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGA 138
SS VRTL+PEEADWFYTP Y TCDLTP G PLPF++PR+MRSAI+ +++ WP+WNRT+GA
Sbjct: 66 SSAVRTLDPEEADWFYTPAYTTCDLTPQGFPLPFRAPRIMRSAIRYVATTWPFWNRTDGA 125
Query: 139 DHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPP 198
DHFF+ PHDFGACFHYQEE+A+ERGILPLL+RATLVQTFGQRNHVCL +GSIT+PPYA P
Sbjct: 126 DHFFLTPHDFGACFHYQEERAMERGILPLLRRATLVQTFGQRNHVCLQDGSITVPPYADP 185
Query: 199 QKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDH 258
KMQAH I P TPRSIFVYFRGLFYD+ NDPEGGYYARGARA+VWENFK+NPLFDIST+H
Sbjct: 186 GKMQAHLISPGTPRSIFVYFRGLFYDMGNDPEGGYYARGARASVWENFKDNPLFDISTEH 245
Query: 259 PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
P+TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE+I +F
Sbjct: 246 PSTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEDISVF 305
Query: 319 VAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKL 378
VAE DVP+LD+ILTSIP ILR+QRLLA S+K+A+LF QPA+PGDAFHQ+LNGLARKL
Sbjct: 306 VAERDVPRLDSILTSIPLPDILRRQRLLARDSVKQALLFHQPARPGDAFHQVLNGLARKL 365
Query: 379 PHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
PH + +L+PG K L+W +G DLKPW
Sbjct: 366 PHGEGVFLQPGEKALDWDAGLQSDLKPW 393
>gi|414878901|tpg|DAA56032.1| TPA: secondary cell wall glycosyltransferase family 47 [Zea mays]
Length = 417
Score = 649 bits (1675), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 303/384 (78%), Positives = 344/384 (89%)
Query: 23 RISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPV 82
RISG A VL+DDPVGKLKV+VYE+P KYN+ LL KD RCL HMFAAEIFMH+FLLSS V
Sbjct: 34 RISGDAVGVLDDDPVGKLKVFVYEMPRKYNRMLLDKDSRCLHHMFAAEIFMHQFLLSSAV 93
Query: 83 RTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFF 142
RTL+PEEADWFYTP Y TCDLTP G PLPF++PR+MRSAI+ +++ WP+WNRT+GADHFF
Sbjct: 94 RTLDPEEADWFYTPAYTTCDLTPQGFPLPFRAPRIMRSAIRYVATTWPFWNRTDGADHFF 153
Query: 143 VVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQ 202
+ PHDFGACFHYQEE+A+ERGILPLL+RATLVQTFGQRNHVCL +GSIT+PPYA P KMQ
Sbjct: 154 LTPHDFGACFHYQEERAMERGILPLLRRATLVQTFGQRNHVCLQDGSITVPPYADPGKMQ 213
Query: 203 AHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTY 262
AH I P TPRSIFVYFRGLFYD+ NDPEGGYYARGARA+VWENFK+NPLFDIST+HP+TY
Sbjct: 214 AHLISPGTPRSIFVYFRGLFYDMGNDPEGGYYARGARASVWENFKDNPLFDISTEHPSTY 273
Query: 263 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE+I +FVAE
Sbjct: 274 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEDISVFVAER 333
Query: 323 DVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDK 382
DVP+LD+ILTSIP ILR+QRLLA S+K+A+LF QPA+PGDAFHQ+LNGLARKLPH +
Sbjct: 334 DVPRLDSILTSIPLPDILRRQRLLARDSVKQALLFHQPARPGDAFHQVLNGLARKLPHGE 393
Query: 383 STYLKPGGKFLNWTSGPVGDLKPW 406
+L+PG K L+W +G DLKPW
Sbjct: 394 GVFLQPGEKALDWDAGLQSDLKPW 417
>gi|293336613|ref|NP_001170701.1| uncharacterized protein LOC100384778 precursor [Zea mays]
gi|238007038|gb|ACR34554.1| unknown [Zea mays]
gi|413951433|gb|AFW84082.1| hypothetical protein ZEAMMB73_582447 [Zea mays]
Length = 412
Score = 649 bits (1674), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 299/384 (77%), Positives = 343/384 (89%)
Query: 23 RISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPV 82
R SG+A VL+DDPVGKLKV+VYE+P KYNK L+ KD RCL HMFAAEIFMH+FLLSS V
Sbjct: 29 RTSGNAVGVLDDDPVGKLKVFVYEMPRKYNKMLVDKDSRCLQHMFAAEIFMHQFLLSSAV 88
Query: 83 RTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFF 142
RTL+PE ADWFYTP+Y TCDLTP G PLPF++PRMMRSAI+ +++ WP+WNRT+GADHFF
Sbjct: 89 RTLDPEAADWFYTPVYTTCDLTPQGFPLPFRAPRMMRSAIRYVAATWPFWNRTDGADHFF 148
Query: 143 VVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQ 202
+ PHDFGACFHYQEE+A+ERGILPLL+RATLVQTFGQRNHVCL +GSIT+PPYA P ++Q
Sbjct: 149 LTPHDFGACFHYQEERAVERGILPLLRRATLVQTFGQRNHVCLQDGSITVPPYASPHRLQ 208
Query: 203 AHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTY 262
AH + P TPRSIFVYFRGLFYD+ NDPEGGYYARGARA+VWENFK+NPLFDIST+HP TY
Sbjct: 209 AHLVGPGTPRSIFVYFRGLFYDMGNDPEGGYYARGARASVWENFKDNPLFDISTEHPATY 268
Query: 263 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE++ +FVAE
Sbjct: 269 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEDMSVFVAER 328
Query: 323 DVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDK 382
DVP+LD+ILTSIP ILR+QRLLA S+KRA+LF QPA+PGDAFHQ+LNGLARKLPH +
Sbjct: 329 DVPRLDSILTSIPLPDILRRQRLLARDSVKRALLFHQPARPGDAFHQVLNGLARKLPHRE 388
Query: 383 STYLKPGGKFLNWTSGPVGDLKPW 406
+L+PG K L+W +G DLKPW
Sbjct: 389 GVFLQPGEKALDWDAGLESDLKPW 412
>gi|223974777|gb|ACN31576.1| unknown [Zea mays]
Length = 412
Score = 646 bits (1667), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 298/384 (77%), Positives = 341/384 (88%)
Query: 23 RISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPV 82
R SG A VL+DDPVGKLKV+VYE+P KYNK L+ D RCL HMFAAEIFMH+FLLSS V
Sbjct: 29 RTSGDAVGVLDDDPVGKLKVFVYEMPRKYNKMLVDNDSRCLQHMFAAEIFMHQFLLSSAV 88
Query: 83 RTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFF 142
RTL+PE ADWFYTP+Y TCDLTP G PLPF++PRMMRSAI+ +++ WP+WNRT+GADHFF
Sbjct: 89 RTLDPEAADWFYTPVYTTCDLTPQGFPLPFRAPRMMRSAIRYVAATWPFWNRTDGADHFF 148
Query: 143 VVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQ 202
+ PHDFGACFHYQEE+A+ERGILPLL+RATLVQTFGQRNHVCL +GSIT+PPYA P ++Q
Sbjct: 149 LTPHDFGACFHYQEERAVERGILPLLRRATLVQTFGQRNHVCLQDGSITVPPYASPHRLQ 208
Query: 203 AHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTY 262
AH + P TPRSIFVYFRGLFYD+ NDPEGGYYARGARA+VWENFK+NPLFDIST+HP TY
Sbjct: 209 AHLVGPGTPRSIFVYFRGLFYDMGNDPEGGYYARGARASVWENFKDNPLFDISTEHPATY 268
Query: 263 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE++ +FVAE
Sbjct: 269 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEDMSVFVAER 328
Query: 323 DVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDK 382
DVP+LD+ILTSIP ILR+QRLLA S+KRA+LF QPA+PGDAFHQ+LNGLARKLPH +
Sbjct: 329 DVPRLDSILTSIPLPDILRRQRLLARDSVKRALLFHQPARPGDAFHQVLNGLARKLPHGE 388
Query: 383 STYLKPGGKFLNWTSGPVGDLKPW 406
+L+PG K L+W +G DLKPW
Sbjct: 389 DVFLQPGEKALDWDAGLESDLKPW 412
>gi|414878902|tpg|DAA56033.1| TPA: hypothetical protein ZEAMMB73_301237 [Zea mays]
Length = 387
Score = 642 bits (1657), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 299/380 (78%), Positives = 340/380 (89%)
Query: 27 SAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLN 86
A VL+DDPVGKLKV+VYE+P KYN+ LL KD RCL HMFAAEIFMH+FLLSS VRTL+
Sbjct: 8 DAVGVLDDDPVGKLKVFVYEMPRKYNRMLLDKDSRCLHHMFAAEIFMHQFLLSSAVRTLD 67
Query: 87 PEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPH 146
PEEADWFYTP Y TCDLTP G PLPF++PR+MRSAI+ +++ WP+WNRT+GADHFF+ PH
Sbjct: 68 PEEADWFYTPAYTTCDLTPQGFPLPFRAPRIMRSAIRYVATTWPFWNRTDGADHFFLTPH 127
Query: 147 DFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQI 206
DFGACFHYQEE+A+ERGILPLL+RATLVQTFGQRNHVCL +GSIT+PPYA P KMQAH I
Sbjct: 128 DFGACFHYQEERAMERGILPLLRRATLVQTFGQRNHVCLQDGSITVPPYADPGKMQAHLI 187
Query: 207 PPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM 266
P TPRSIFVYFRGLFYD+ NDPEGGYYARGARA+VWENFK+NPLFDIST+HP+TYYEDM
Sbjct: 188 SPGTPRSIFVYFRGLFYDMGNDPEGGYYARGARASVWENFKDNPLFDISTEHPSTYYEDM 247
Query: 267 QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK 326
QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE+I +FVAE DVP+
Sbjct: 248 QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEDISVFVAERDVPR 307
Query: 327 LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDKSTYL 386
LD+ILTSIP ILR+QRLLA S+K+A+LF QPA+PGDAFHQ+LNGLARKLPH + +L
Sbjct: 308 LDSILTSIPLPDILRRQRLLARDSVKQALLFHQPARPGDAFHQVLNGLARKLPHGEGVFL 367
Query: 387 KPGGKFLNWTSGPVGDLKPW 406
+PG K L+W +G DLKPW
Sbjct: 368 QPGEKALDWDAGLQSDLKPW 387
>gi|125539672|gb|EAY86067.1| hypothetical protein OsI_07437 [Oryza sativa Indica Group]
Length = 342
Score = 633 bits (1633), Expect = e-179, Method: Compositional matrix adjust.
Identities = 299/341 (87%), Positives = 318/341 (93%)
Query: 66 MFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLI 125
MFAAEIFMHRFLLSS VRTLNPE+ADWFY P+Y TCDLT GLPLPFKSPRMMRSAIQ +
Sbjct: 1 MFAAEIFMHRFLLSSAVRTLNPEQADWFYAPVYTTCDLTHAGLPLPFKSPRMMRSAIQFL 60
Query: 126 SSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCL 185
S WP+WNRT+GADHFFVVPHDFGACFHYQEEKAIERGILPLL+RATLVQTFGQ+NHVCL
Sbjct: 61 SRKWPFWNRTDGADHFFVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQKNHVCL 120
Query: 186 NEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWEN 245
EGSITIPPYAPPQKMQAH IPPDTPRSIFVYFRGLFYD NDPEGGYYARGARA++WEN
Sbjct: 121 KEGSITIPPYAPPQKMQAHLIPPDTPRSIFVYFRGLFYDNGNDPEGGYYARGARASLWEN 180
Query: 246 FKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
FKNNPLFDIST+HP TYYEDMQR++FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP
Sbjct: 181 FKNNPLFDISTEHPATYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 240
Query: 306 FADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
FADAIPW+EIG+FV EEDVP+LD+ILTSIP + ILRKQRLLANPSMK+AMLFPQPAQP D
Sbjct: 241 FADAIPWDEIGVFVDEEDVPRLDSILTSIPIDDILRKQRLLANPSMKQAMLFPQPAQPRD 300
Query: 366 AFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
AFHQILNGLARKLPH S YLKPG K LNWT+GPV DLKPW
Sbjct: 301 AFHQILNGLARKLPHPDSVYLKPGEKHLNWTAGPVADLKPW 341
>gi|386576430|gb|AFJ12119.1| glycosyltransferase, partial [Nicotiana tabacum]
Length = 317
Score = 621 bits (1602), Expect = e-175, Method: Compositional matrix adjust.
Identities = 293/317 (92%), Positives = 306/317 (96%)
Query: 66 MFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLI 125
MFAAEI+MHRFLLSS VRT NPEEADWFYTP+Y TCDLTP GLPLPFKSPRMMRSAIQLI
Sbjct: 1 MFAAEIYMHRFLLSSAVRTFNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLI 60
Query: 126 SSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCL 185
+SNWPYWNRTEGADHFF+ PHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCL
Sbjct: 61 ASNWPYWNRTEGADHFFITPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCL 120
Query: 186 NEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWEN 245
+GSITIPPYAPPQKMQ+H IPPDTPRSIFVYFRGLFYDV NDPEGGYYARGARAAVWEN
Sbjct: 121 KDGSITIPPYAPPQKMQSHLIPPDTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWEN 180
Query: 246 FKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
FK+NPLFDIST+HPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAV+FGCIPVIIADDIVLP
Sbjct: 181 FKDNPLFDISTEHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLP 240
Query: 306 FADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
FADAIPWE+IG+FVAE+DVP LDTILTSIPPE ILRKQRLLANPSMK+AMLFPQPAQ GD
Sbjct: 241 FADAIPWEDIGVFVAEKDVPNLDTILTSIPPEEILRKQRLLANPSMKQAMLFPQPAQSGD 300
Query: 366 AFHQILNGLARKLPHDK 382
AFHQILNGLARKLPHDK
Sbjct: 301 AFHQILNGLARKLPHDK 317
>gi|39725600|dbj|BAD04923.1| pectin-glucuronyltransferase [Nicotiana tabacum]
Length = 317
Score = 612 bits (1579), Expect = e-173, Method: Compositional matrix adjust.
Identities = 290/317 (91%), Positives = 303/317 (95%)
Query: 66 MFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLI 125
MFAAEI+MHRFLLSS VRT NPEEADWFYTP+Y TCDLTP GLPLPFKSPRMMRSAIQLI
Sbjct: 1 MFAAEIYMHRFLLSSAVRTFNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLI 60
Query: 126 SSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCL 185
+SNWPYWNRTEGADHFF+ PHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCL
Sbjct: 61 ASNWPYWNRTEGADHFFITPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCL 120
Query: 186 NEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWEN 245
+GSITIPPYAPPQKMQ+H IPPDTPRSIFVYFRGLFYDV NDPEGGYYARGARAAVWEN
Sbjct: 121 KDGSITIPPYAPPQKMQSHLIPPDTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWEN 180
Query: 246 FKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
FK+NP FDIST+HPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAV+FGCIPVIIADDIVLP
Sbjct: 181 FKDNPPFDISTEHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLP 240
Query: 306 FADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
FADAIPWE+IG+FVAE+DVP LDTILTSIPPE ILRKQRLLA PSMK+AML PQPAQ GD
Sbjct: 241 FADAIPWEDIGVFVAEKDVPTLDTILTSIPPEEILRKQRLLAKPSMKQAMLSPQPAQSGD 300
Query: 366 AFHQILNGLARKLPHDK 382
AFHQILNGLARKLPHDK
Sbjct: 301 AFHQILNGLARKLPHDK 317
>gi|414878904|tpg|DAA56035.1| TPA: hypothetical protein ZEAMMB73_301237 [Zea mays]
Length = 360
Score = 612 bits (1578), Expect = e-173, Method: Compositional matrix adjust.
Identities = 284/360 (78%), Positives = 323/360 (89%)
Query: 47 LPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPT 106
+P KYN+ LL KD RCL HMFAAEIFMH+FLLSS VRTL+PEEADWFYTP Y TCDLTP
Sbjct: 1 MPRKYNRMLLDKDSRCLHHMFAAEIFMHQFLLSSAVRTLDPEEADWFYTPAYTTCDLTPQ 60
Query: 107 GLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILP 166
G PLPF++PR+MRSAI+ +++ WP+WNRT+GADHFF+ PHDFGACFHYQEE+A+ERGILP
Sbjct: 61 GFPLPFRAPRIMRSAIRYVATTWPFWNRTDGADHFFLTPHDFGACFHYQEERAMERGILP 120
Query: 167 LLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVN 226
LL+RATLVQTFGQRNHVCL +GSIT+PPYA P KMQAH I P TPRSIFVYFRGLFYD+
Sbjct: 121 LLRRATLVQTFGQRNHVCLQDGSITVPPYADPGKMQAHLISPGTPRSIFVYFRGLFYDMG 180
Query: 227 NDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLV 286
NDPEGGYYARGARA+VWENFK+NPLFDIST+HP+TYYEDMQRAIFCLCPLGWAPWSPRLV
Sbjct: 181 NDPEGGYYARGARASVWENFKDNPLFDISTEHPSTYYEDMQRAIFCLCPLGWAPWSPRLV 240
Query: 287 EAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLL 346
EAVVFGCIPVIIADDIVLPFADAIPWE+I +FVAE DVP+LD+ILTSIP ILR+QRLL
Sbjct: 241 EAVVFGCIPVIIADDIVLPFADAIPWEDISVFVAERDVPRLDSILTSIPLPDILRRQRLL 300
Query: 347 ANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
A S+K+A+LF QPA+PGDAFHQ+LNGLARKLPH + +L+PG K L+W +G DLKPW
Sbjct: 301 ARDSVKQALLFHQPARPGDAFHQVLNGLARKLPHGEGVFLQPGEKALDWDAGLQSDLKPW 360
>gi|222619798|gb|EEE55930.1| hypothetical protein OsJ_04615 [Oryza sativa Japonica Group]
Length = 401
Score = 598 bits (1541), Expect = e-168, Method: Compositional matrix adjust.
Identities = 284/394 (72%), Positives = 327/394 (82%), Gaps = 21/394 (5%)
Query: 13 VSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIF 72
+A T K R+SG GDVLEDDP GKLKV+VYE+P KYN LL KD RCL HMFAAEIF
Sbjct: 29 AAATTTQKHGRMSGKGGDVLEDDPTGKLKVFVYEMPRKYNLNLLAKDSRCLQHMFAAEIF 88
Query: 73 MHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYW 132
MH+FLLSSPVRTL+PEEADWFYTP Y TCDLTP G PLPF++PR+MRSA++ +++ WPYW
Sbjct: 89 MHQFLLSSPVRTLDPEEADWFYTPAYTTCDLTPQGFPLPFRAPRIMRSAVRYVAATWPYW 148
Query: 133 NRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITI 192
NRT+GADHFF+ PHDFGACFHYQEE+AIERGILP+L+RATLVQTFGQR+H CL GSIT+
Sbjct: 149 NRTDGADHFFLAPHDFGACFHYQEERAIERGILPVLRRATLVQTFGQRHHPCLQPGSITV 208
Query: 193 PPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLF 252
PPYA P+KM+AH+I P TPRSIFVYFRGLFYD+ NDPEGGYYARGARA+VWENFK+NPLF
Sbjct: 209 PPYADPRKMEAHRISPATPRSIFVYFRGLFYDMGNDPEGGYYARGARASVWENFKDNPLF 268
Query: 253 DISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 312
DIST+HP T PRLVEAVVFGCIPVIIADDIVLPFADAIPW
Sbjct: 269 DISTEHPAT---------------------PRLVEAVVFGCIPVIIADDIVLPFADAIPW 307
Query: 313 EEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILN 372
EI +FVAEEDVP+LDTIL S+P + ++RKQRLLA+P+MK+A+LF QPA+PGDAFHQILN
Sbjct: 308 GEISVFVAEEDVPRLDTILASVPLDEVIRKQRLLASPAMKQAVLFHQPARPGDAFHQILN 367
Query: 373 GLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
GLARKLPH K +L+PG K ++W G DLKPW
Sbjct: 368 GLARKLPHPKGVFLEPGEKGIDWDQGLENDLKPW 401
>gi|110288702|gb|ABB46945.2| exostosin family protein, putative, expressed [Oryza sativa
Japonica Group]
gi|215766610|dbj|BAG98714.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 341
Score = 595 bits (1535), Expect = e-168, Method: Compositional matrix adjust.
Identities = 274/341 (80%), Positives = 304/341 (89%)
Query: 66 MFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLI 125
MFA EIFMHRFLLSS +RT NP+EADWFYTP+Y TCDLTP G PL KSPRMMRSAI+ I
Sbjct: 1 MFATEIFMHRFLLSSAIRTSNPDEADWFYTPVYTTCDLTPWGHPLTTKSPRMMRSAIKFI 60
Query: 126 SSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCL 185
S WPYWNRTEGADHFFVVPHDF ACF++QE KAIERGILP+L+RATLVQTFGQ+NH CL
Sbjct: 61 SKYWPYWNRTEGADHFFVVPHDFAACFYFQEAKAIERGILPVLRRATLVQTFGQKNHACL 120
Query: 186 NEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWEN 245
+GSIT+PPY P K++AH +PP+TPRSIFVYFRGLFYD +NDPEGGYYARGARA+VWEN
Sbjct: 121 KDGSITVPPYTPAHKIRAHLVPPETPRSIFVYFRGLFYDTSNDPEGGYYARGARASVWEN 180
Query: 246 FKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
FKNNP+FDISTDHP TYYEDMQRA+FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP
Sbjct: 181 FKNNPMFDISTDHPQTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 240
Query: 306 FADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
F+DAIPWEEI +FVAE+DVP+LDTILTSIP EVILRKQ +LA PSMK+ MLFPQPA+PGD
Sbjct: 241 FSDAIPWEEIAVFVAEDDVPQLDTILTSIPTEVILRKQAMLAEPSMKQTMLFPQPAEPGD 300
Query: 366 AFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
FHQ++N LARKLPH + +LKPG K LNWT G DLKPW
Sbjct: 301 GFHQVMNALARKLPHGRDVFLKPGQKVLNWTEGTREDLKPW 341
>gi|386576428|gb|AFJ12118.1| glycosyltransferase, partial [Nicotiana tabacum]
Length = 294
Score = 574 bits (1480), Expect = e-161, Method: Compositional matrix adjust.
Identities = 271/294 (92%), Positives = 284/294 (96%)
Query: 68 AAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISS 127
AAEI+MHRFLLSS VRT NPEEADWFYTP+Y TCDLTP GLPLPFKSPRMMRSAIQLI+S
Sbjct: 1 AAEIYMHRFLLSSAVRTFNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIAS 60
Query: 128 NWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNE 187
NWPYWNRTEGADHFF+ PHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCL +
Sbjct: 61 NWPYWNRTEGADHFFITPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLKD 120
Query: 188 GSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFK 247
GSITIPPYAPPQKMQ+H IPPDTPRSIFVYFRGLFYDV NDPEGGYYARGARAAVWENFK
Sbjct: 121 GSITIPPYAPPQKMQSHLIPPDTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENFK 180
Query: 248 NNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFA 307
+NPLFDIST+HPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAV+FGCIPVIIADDIVLPFA
Sbjct: 181 DNPLFDISTEHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPFA 240
Query: 308 DAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPA 361
DAIPWE+IG+FVAE+DVP LDTILTSIPPE ILRKQRLLANPSMK+AMLFPQPA
Sbjct: 241 DAIPWEDIGVFVAEKDVPNLDTILTSIPPEEILRKQRLLANPSMKQAMLFPQPA 294
>gi|224136520|ref|XP_002322350.1| predicted protein [Populus trichocarpa]
gi|222869346|gb|EEF06477.1| predicted protein [Populus trichocarpa]
Length = 343
Score = 544 bits (1402), Expect = e-152, Method: Compositional matrix adjust.
Identities = 255/294 (86%), Positives = 272/294 (92%)
Query: 112 FKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRA 171
F+SPRM+RSA QLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQ A
Sbjct: 50 FQSPRMIRSATQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQHA 109
Query: 172 TLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEG 231
TLVQTFGQRNHVCL +GSIT+P YAPPQKMQ H IP TPRSIFVYFRGLFYDV NDPEG
Sbjct: 110 TLVQTFGQRNHVCLKDGSITVPSYAPPQKMQTHLIPEKTPRSIFVYFRGLFYDVGNDPEG 169
Query: 232 GYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVF 291
GYYARGARAAVWENFK+NPLFDIST+HPTTYYEDMQ+A+FCLCPL WAPWSPRLVEA++F
Sbjct: 170 GYYARGARAAVWENFKDNPLFDISTEHPTTYYEDMQQAVFCLCPLSWAPWSPRLVEALIF 229
Query: 292 GCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSM 351
G IPVII DDIVLPFADAIPWEEIG+FV E+DVP LDTILTSIPPEVILRKQRLLANPSM
Sbjct: 230 GYIPVIIVDDIVLPFADAIPWEEIGVFVDEKDVPNLDTILTSIPPEVILRKQRLLANPSM 289
Query: 352 KRAMLFPQPAQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKP 405
K+AMLFPQ AQ GDAFHQ+LNGLARKL HD++ YLK G K LNWT+G VGDLKP
Sbjct: 290 KQAMLFPQLAQAGDAFHQVLNGLARKLSHDRNVYLKSGEKILNWTAGLVGDLKP 343
>gi|326499682|dbj|BAJ86152.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 304
Score = 536 bits (1382), Expect = e-150, Method: Compositional matrix adjust.
Identities = 253/283 (89%), Positives = 266/283 (93%)
Query: 124 LISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHV 183
+ISS+WPYWNRT GADHFFVVPHDFGACFHYQEEKAIERGILPLL+RATLVQTFGQ++HV
Sbjct: 22 VISSHWPYWNRTAGADHFFVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQKDHV 81
Query: 184 CLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVW 243
CL EGSI IPPYAPPQKM+ H +PP+TPRSIFVYFRGLFYD NDPEGGYYARGARA+VW
Sbjct: 82 CLKEGSINIPPYAPPQKMKTHLVPPETPRSIFVYFRGLFYDTANDPEGGYYARGARASVW 141
Query: 244 ENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 303
ENFKNNPLFDISTDHP TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV
Sbjct: 142 ENFKNNPLFDISTDHPPTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 201
Query: 304 LPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP 363
LPFADAIPW+EIG+FVAE+DVPKLDTILTSIP EVILRKQRLLANPSMK+AMLFPQPAQ
Sbjct: 202 LPFADAIPWDEIGVFVAEDDVPKLDTILTSIPMEVILRKQRLLANPSMKQAMLFPQPAQA 261
Query: 364 GDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
GDAFHQILNGLARKLPH KS YL PG K LNWT GP GDLKPW
Sbjct: 262 GDAFHQILNGLARKLPHSKSVYLTPGQKALNWTQGPKGDLKPW 304
>gi|255543228|ref|XP_002512677.1| catalytic, putative [Ricinus communis]
gi|223548638|gb|EEF50129.1| catalytic, putative [Ricinus communis]
Length = 253
Score = 493 bits (1270), Expect = e-137, Method: Compositional matrix adjust.
Identities = 236/252 (93%), Positives = 242/252 (96%)
Query: 155 QEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSI 214
QEEKAIERGI PLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQA QIPP+TPRSI
Sbjct: 2 QEEKAIERGIFPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQARQIPPETPRSI 61
Query: 215 FVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLC 274
FVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDIST+HPTTYYEDMQRAIFCLC
Sbjct: 62 FVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTEHPTTYYEDMQRAIFCLC 121
Query: 275 PLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSI 334
PLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG+FVAEEDVP LDTILTSI
Sbjct: 122 PLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVAEEDVPNLDTILTSI 181
Query: 335 PPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLN 394
P +V+LRKQRLLANPSMKRAMLFPQPAQ GDAFHQILNGLARKLPHD S YLKPG K LN
Sbjct: 182 PTQVVLRKQRLLANPSMKRAMLFPQPAQSGDAFHQILNGLARKLPHDNSIYLKPGEKVLN 241
Query: 395 WTSGPVGDLKPW 406
WT+GPV DLKPW
Sbjct: 242 WTAGPVKDLKPW 253
>gi|414866628|tpg|DAA45185.1| TPA: hypothetical protein ZEAMMB73_313698 [Zea mays]
Length = 588
Score = 459 bits (1182), Expect = e-127, Method: Compositional matrix adjust.
Identities = 218/252 (86%), Positives = 232/252 (92%)
Query: 155 QEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSI 214
+EEKAI RGILPLLQRATLVQTFGQ+NHVCL GSITIPP+APPQKMQAH IP DTPRSI
Sbjct: 337 KEEKAIGRGILPLLQRATLVQTFGQKNHVCLKGGSITIPPFAPPQKMQAHLIPVDTPRSI 396
Query: 215 FVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLC 274
FVYFRGLFYD +NDPEGGYYARGA A+VWENFKNNPLFDISTDHP TYYEDMQR++FCLC
Sbjct: 397 FVYFRGLFYDTSNDPEGGYYARGACASVWENFKNNPLFDISTDHPPTYYEDMQRSVFCLC 456
Query: 275 PLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSI 334
PLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG+FVAEEDVPKLD+IL SI
Sbjct: 457 PLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVAEEDVPKLDSILMSI 516
Query: 335 PPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLN 394
P +VILRKQRLLANPSMK+AMLFPQPAQ GDAFHQILNGLARKL H +LKPG + L
Sbjct: 517 PTDVILRKQRLLANPSMKQAMLFPQPAQAGDAFHQILNGLARKLLHGDGVFLKPGERVLK 576
Query: 395 WTSGPVGDLKPW 406
WT+GP GDLKPW
Sbjct: 577 WTAGPPGDLKPW 588
>gi|289166870|gb|ADC84485.1| glycosyltransferase family 47A [Salix sachalinensis]
Length = 215
Score = 436 bits (1121), Expect = e-119, Method: Compositional matrix adjust.
Identities = 207/215 (96%), Positives = 212/215 (98%)
Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
FVVPHDFGACFHYQEEKA+ERGILPLLQR+TLVQTFGQRNHVCLNEGSI IPP+APPQKM
Sbjct: 1 FVVPHDFGACFHYQEEKAVERGILPLLQRSTLVQTFGQRNHVCLNEGSIAIPPFAPPQKM 60
Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT
Sbjct: 61 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 120
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCI VIIADDIVLPFADAIPWEEIG+FVAE
Sbjct: 121 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCISVIIADDIVLPFADAIPWEEIGVFVAE 180
Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAML 356
EDVP LDTILT+IPPEVILRKQRLLANPSMKRAML
Sbjct: 181 EDVPNLDTILTTIPPEVILRKQRLLANPSMKRAML 215
>gi|289166872|gb|ADC84486.1| glycosyltransferase family 47A [Salix miyabeana]
Length = 215
Score = 416 bits (1069), Expect = e-113, Method: Compositional matrix adjust.
Identities = 197/215 (91%), Positives = 206/215 (95%)
Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCL +GSITIPPYAPPQKM
Sbjct: 1 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLKDGSITIPPYAPPQKM 60
Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
Q H IP TPRSIFVYFRGLFYDV NDPEGGYYARGARAAVWENFK+NPLFDIST+HP T
Sbjct: 61 QTHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENFKDNPLFDISTEHPAT 120
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
YYEDMQRA+FCLCPLGWAPWSPRLVEAV+FGCIPVIIADDIVLPFADAIPWE+IG++V E
Sbjct: 121 YYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPFADAIPWEDIGVYVDE 180
Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAML 356
EDVP LDTILTSIPPEVILRKQRLLANPSMK+AML
Sbjct: 181 EDVPNLDTILTSIPPEVILRKQRLLANPSMKQAML 215
>gi|414866629|tpg|DAA45186.1| TPA: hypothetical protein ZEAMMB73_313698 [Zea mays]
Length = 206
Score = 373 bits (957), Expect = e-100, Method: Compositional matrix adjust.
Identities = 178/206 (86%), Positives = 189/206 (91%)
Query: 201 MQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPT 260
MQAH IP DTPRSIFVYFRGLFYD +NDPEGGYYARGA A+VWENFKNNPLFDISTDHP
Sbjct: 1 MQAHLIPVDTPRSIFVYFRGLFYDTSNDPEGGYYARGACASVWENFKNNPLFDISTDHPP 60
Query: 261 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
TYYEDMQR++FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG+FVA
Sbjct: 61 TYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVA 120
Query: 321 EEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPH 380
EEDVPKLD+IL SIP +VILRKQRLLANPSMK+AMLFPQPAQ GDAFHQILNGLARKL H
Sbjct: 121 EEDVPKLDSILMSIPTDVILRKQRLLANPSMKQAMLFPQPAQAGDAFHQILNGLARKLLH 180
Query: 381 DKSTYLKPGGKFLNWTSGPVGDLKPW 406
+LKPG + L WT+GP GDLKPW
Sbjct: 181 GDGVFLKPGERVLKWTAGPPGDLKPW 206
>gi|224093354|ref|XP_002334839.1| predicted protein [Populus trichocarpa]
gi|222875141|gb|EEF12272.1| predicted protein [Populus trichocarpa]
Length = 175
Score = 328 bits (841), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 153/171 (89%), Positives = 164/171 (95%)
Query: 236 RGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIP 295
RGARAAVWENFK+NPLFDIST+HP TYYEDMQRA+FCLCPLGWAPWSPRLVEAV+FGCIP
Sbjct: 5 RGARAAVWENFKDNPLFDISTEHPATYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGCIP 64
Query: 296 VIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAM 355
VIIADDIVLPFADAIPWEEIG++V EEDVP LDTILTSIPPEVILRKQRLLANPSMK+AM
Sbjct: 65 VIIADDIVLPFADAIPWEEIGVYVDEEDVPNLDTILTSIPPEVILRKQRLLANPSMKQAM 124
Query: 356 LFPQPAQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
LFPQPAQPGDAFHQ+LNGLARKLPHD+S YL+P K LNWT+GPVGDLKPW
Sbjct: 125 LFPQPAQPGDAFHQVLNGLARKLPHDRSVYLRPSEKILNWTAGPVGDLKPW 175
>gi|224106838|ref|XP_002314302.1| predicted protein [Populus trichocarpa]
gi|222850710|gb|EEE88257.1| predicted protein [Populus trichocarpa]
Length = 462
Score = 315 bits (808), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 160/384 (41%), Positives = 233/384 (60%), Gaps = 11/384 (2%)
Query: 11 AFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAE 70
A + N+ I++ +L+D LKVY+YELPSKYN L + RC H+FA+E
Sbjct: 84 ALFESSNSTFIQQSKNINRGLLKD-----LKVYIYELPSKYNTDWL-ANERCSNHLFASE 137
Query: 71 IFMHRFLLSS-PVRTLNPEEADWFYTPIYPTCDL-TPTGLPLPFKSPRMMRSAIQLISSN 128
+ +H+ L +S +RT +P EAD+F+ P+Y +C+ T G P + ++ SA+QLISSN
Sbjct: 138 VAIHKALSNSLDIRTFDPYEADFFFVPVYVSCNFSTVNGFPAIGHARSLLSSAVQLISSN 197
Query: 129 WPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR-NHVCLNE 187
+P+WNR++G+DH FV HD+GACFH EE+A+E GI L+R+ ++QTFG + NH C +
Sbjct: 198 YPFWNRSQGSDHVFVASHDYGACFHAMEERAMEDGIPEFLKRSIILQTFGVKFNHPCQDV 257
Query: 188 GSITIPPYAPPQKMQA--HQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWEN 245
++ IPPY P++++ P + R I+ +FRG + G YY++ R +W
Sbjct: 258 ENVVIPPYISPERVRTTLENYPLNGRRDIWAFFRGKMEVHPKNISGRYYSKKVRTVIWRK 317
Query: 246 FKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
+ + F + Y ++ R++FCLCPLGWAPWSPRLVE+V GC+PVIIAD I LP
Sbjct: 318 YSGDRRFYLQRHRFAGYQSEIVRSVFCLCPLGWAPWSPRLVESVALGCVPVIIADGIRLP 377
Query: 306 FADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
F A+ W EI + VAE+DV L T+L + + Q+ L +P ++RA+LF Q GD
Sbjct: 378 FPTAVRWSEISLTVAEKDVANLGTLLDHVAATNLSAIQKNLWDPDVRRALLFNDRVQEGD 437
Query: 366 AFHQILNGLARKLPHDKSTYLKPG 389
A Q+L LARKL T P
Sbjct: 438 ATWQVLYALARKLDRSYRTVRLPN 461
>gi|42568020|ref|NP_197685.2| FRA8-like protein [Arabidopsis thaliana]
gi|75127070|sp|Q6NMM8.1|F8H_ARATH RecName: Full=Probable glucuronoxylan glucuronosyltransferase F8H;
AltName: Full=FRA8 homolog; AltName: Full=Protein
FRAGILE FIBER 8 homolog
gi|44681390|gb|AAS47635.1| At5g22940 [Arabidopsis thaliana]
gi|48958521|gb|AAT47813.1| At5g22940 [Arabidopsis thaliana]
gi|332005716|gb|AED93099.1| FRA8-like protein [Arabidopsis thaliana]
Length = 469
Score = 315 bits (807), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 157/388 (40%), Positives = 231/388 (59%), Gaps = 6/388 (1%)
Query: 4 TAIALLLAFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCL 63
++I LL ++ + P I SA +K+YVY+LP+ YN + RC
Sbjct: 73 SSIRLLSNHKTSSSLPSRALIESSAIKTTSIGLFTGMKIYVYDLPASYNDDWVTASDRCA 132
Query: 64 THMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDL-TPTGLPLPFKSPRMMRSAI 122
+H+FAAE+ +HR LLSS VRTL+P+EAD+F+ P+Y +C+ T G P + ++ SA+
Sbjct: 133 SHLFAAEVAIHRALLSSDVRTLDPDEADYFFVPVYVSCNFSTSNGFPSLSHARSLLSSAV 192
Query: 123 QLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR-N 181
+S ++P+WNR++G+DH FV HDFGACFH E+ AIE GI ++R+ ++QTFG +
Sbjct: 193 DFLSDHYPFWNRSQGSDHVFVASHDFGACFHAMEDMAIEEGIPKFMKRSIILQTFGVKYK 252
Query: 182 HVCLNEGSITIPPYAPPQKMQA--HQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGAR 239
H C + IPPY PP+ +Q + P + R I+ +FRG + G +Y++G R
Sbjct: 253 HPCQEVEHVVIPPYIPPESVQKAIEKAPVNGRRDIWAFFRGKMEVNPKNISGRFYSKGVR 312
Query: 240 AAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIA 299
A+ + F F ++ Y ++ R++FCLCPLGWAPWSPRLVE+ V GC+PV+IA
Sbjct: 313 TAILKKFGGRRRFYLNRHRFAGYRSEIVRSVFCLCPLGWAPWSPRLVESAVLGCVPVVIA 372
Query: 300 DDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQ 359
D I LPF++ + W EI + VAE+DV L +L + + QR L P KRA+L+
Sbjct: 373 DGIQLPFSETVQWPEISLTVAEKDVRNLRKVLEHVAATNLSAIQRNLHEPVFKRALLYNV 432
Query: 360 PAQPGDAFHQILNGLARKLPHDKSTYLK 387
P + GDA IL L RKL D +Y +
Sbjct: 433 PMKEGDATWHILESLWRKL--DDRSYRR 458
>gi|60657602|gb|AAX33322.1| secondary cell wall-related glycosyltransferase family 47 [Populus
tremula x Populus tremuloides]
Length = 442
Score = 315 bits (807), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 161/379 (42%), Positives = 231/379 (60%), Gaps = 11/379 (2%)
Query: 11 AFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAE 70
A + N+ I++ +L+D LKVY+YELPSKYN L + RC H+FA+E
Sbjct: 64 ALFESSNSTFIQQSRNINRGLLKD-----LKVYIYELPSKYNTDWL-ANERCSNHLFASE 117
Query: 71 IFMHRFLLSS-PVRTLNPEEADWFYTPIYPTCDL-TPTGLPLPFKSPRMMRSAIQLISSN 128
+ +H+ L SS VRT +P EAD+F+ P+Y +C+ T G P + ++ SA+QLISSN
Sbjct: 118 VAIHKALSSSLDVRTFDPYEADFFFVPVYVSCNFSTVNGFPAIGHARSLLSSAVQLISSN 177
Query: 129 WPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR-NHVCLNE 187
+P+WNR++G+DH FV HD+GACFH EE+A+E GI L+R+ ++QTFG + NH C +
Sbjct: 178 YPFWNRSQGSDHVFVASHDYGACFHAMEERAMEDGIPEFLKRSIILQTFGVKFNHPCQDV 237
Query: 188 GSITIPPYAPPQKMQA--HQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWEN 245
++ IPPY P ++ + P R I+ +FRG + G YY++ R +W
Sbjct: 238 ENVVIPPYISPGSVRTTLEKYPLTGRRDIWAFFRGKMEVHPKNISGRYYSKKVRTVIWRK 297
Query: 246 FKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
+ + F + Y ++ R++FCLCPLGWAPWSPRLVE+V GC+PVIIAD I LP
Sbjct: 298 YSGDRRFYLQRHRFAGYQSEIVRSVFCLCPLGWAPWSPRLVESVALGCVPVIIADGIRLP 357
Query: 306 FADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
F A+ W EI + VAE+DV L T+L + + Q+ L +P ++RA+LF P Q GD
Sbjct: 358 FPTAVRWSEISLTVAEKDVANLGTLLDQVAATNLSAIQKNLWDPDVRRALLFNDPVQGGD 417
Query: 366 AFHQILNGLARKLPHDKST 384
A Q+L LA+KL T
Sbjct: 418 ATWQVLYALAQKLDRSYRT 436
>gi|297808283|ref|XP_002872025.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297317862|gb|EFH48284.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 465
Score = 315 bits (807), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 159/388 (40%), Positives = 231/388 (59%), Gaps = 6/388 (1%)
Query: 4 TAIALLLAFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCL 63
++I LL ++ + P I SA +K+YVY+LP+ +N + RC
Sbjct: 74 SSIRLLSNHKTSSSLPSRALIESSAIKTNSFGLFSGMKIYVYDLPASFNDDWVTASDRCA 133
Query: 64 THMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDL-TPTGLPLPFKSPRMMRSAI 122
TH+FAAE+ +HR LLSS VRTL+PEEAD+F+ P+Y +C+ T G P + ++ SA+
Sbjct: 134 THLFAAEVAIHRALLSSDVRTLDPEEADFFFVPVYVSCNFSTSNGFPSLSHARSLLSSAV 193
Query: 123 QLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR-N 181
+S ++P+WNRT+G+DH FV HDFGACFH E+ AIE GI ++++ ++QTFG +
Sbjct: 194 DFLSDHYPFWNRTQGSDHVFVASHDFGACFHAMEDMAIEEGIPEFMKKSIILQTFGVKYK 253
Query: 182 HVCLNEGSITIPPYAPPQKMQA--HQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGAR 239
H C + IPPY PP+ +Q + P + R I+ +FRG + G +Y++G R
Sbjct: 254 HPCQEVEHVVIPPYIPPESVQRAIEKAPANGRRDIWAFFRGKMEVNPKNISGRFYSKGVR 313
Query: 240 AAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIA 299
A+ + F F ++ Y ++ R++FCLCPLGWAPWSPRLVE+ V GC+PV+IA
Sbjct: 314 TAILKKFGGRRRFYLNRHRFAGYRSEIVRSVFCLCPLGWAPWSPRLVESAVLGCVPVVIA 373
Query: 300 DDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQ 359
D I LPF++ + W EI + VAE+DV L IL + + QR L P KRA+L+
Sbjct: 374 DGIKLPFSETVRWPEISLTVAEKDVRSLRKILEHVAATNLSVIQRNLHGPVFKRALLYNV 433
Query: 360 PAQPGDAFHQILNGLARKLPHDKSTYLK 387
P + GDA IL L RKL D +Y +
Sbjct: 434 PMKEGDATWHILESLWRKL--DDRSYRR 459
>gi|297721259|ref|NP_001172992.1| Os02g0520750 [Oryza sativa Japonica Group]
gi|255670949|dbj|BAH91721.1| Os02g0520750 [Oryza sativa Japonica Group]
Length = 213
Score = 314 bits (804), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 148/171 (86%), Positives = 160/171 (93%)
Query: 236 RGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIP 295
RGARA++WENFKNNPLFDIST+HP TYYEDMQR++FCLCPLGWAPWSPRLVEAVVFGCIP
Sbjct: 42 RGARASLWENFKNNPLFDISTEHPATYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIP 101
Query: 296 VIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAM 355
VIIADDIVLPFADAIPW+EIG+FV EEDVP+LD+ILTSIP + ILRKQRLLANPSMK+AM
Sbjct: 102 VIIADDIVLPFADAIPWDEIGVFVDEEDVPRLDSILTSIPIDDILRKQRLLANPSMKQAM 161
Query: 356 LFPQPAQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
LFPQPAQP DAFHQILNGLARKLPH S YLKPG K LNWT+GPV DLKPW
Sbjct: 162 LFPQPAQPRDAFHQILNGLARKLPHPDSVYLKPGEKHLNWTAGPVADLKPW 212
>gi|114325715|gb|ABI64067.1| glycosyltransferase GT47C [Populus tremula x Populus alba]
Length = 442
Score = 313 bits (802), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 161/379 (42%), Positives = 232/379 (61%), Gaps = 11/379 (2%)
Query: 11 AFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAE 70
A + N+ I++ +L+D LKVY+YELPSKYN L ++ RC H+FA+E
Sbjct: 64 ALFESSNSTFIQQSRNINRGLLKD-----LKVYIYELPSKYNTDWL-ENERCSNHLFASE 117
Query: 71 IFMHRFLLSS-PVRTLNPEEADWFYTPIYPTCDL-TPTGLPLPFKSPRMMRSAIQLISSN 128
+ +H+ L SS VRT +P EAD+F+ P+Y +C+ T G P + ++ SA+QLISSN
Sbjct: 118 VAIHKALSSSLDVRTFDPYEADFFFVPVYVSCNFSTVNGFPAIGHARSLLSSAVQLISSN 177
Query: 129 WPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR-NHVCLNE 187
+P+WNR++G+DH FV HD+GACFH EE+A+E GI L+R+ ++QTFG + NH C +
Sbjct: 178 YPFWNRSQGSDHVFVASHDYGACFHAMEERAMEDGIPEFLKRSIILQTFGVKFNHPCQDV 237
Query: 188 GSITIPPYAPPQKMQA--HQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWEN 245
++ IPPY P ++A + P R I+ +FRG + G YY++ R +
Sbjct: 238 ENVVIPPYISPGSVRATLEKYPLTGRRDIWAFFRGKMEVHPKNISGRYYSKKVRTVILRK 297
Query: 246 FKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
+ + F + Y ++ R++FCLCPLGWAPWSPRLVE+V GC+PVIIAD I LP
Sbjct: 298 YSGDRRFYLQRHRFAGYQSEIVRSVFCLCPLGWAPWSPRLVESVALGCVPVIIADGIRLP 357
Query: 306 FADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
F A+ W EI + VAE+DV L T+L + + Q+ L +P ++RA+LF P Q GD
Sbjct: 358 FPTAVRWSEISLTVAEKDVANLGTLLDQVAATNLSAIQKNLWDPDVRRALLFNDPVQGGD 417
Query: 366 AFHQILNGLARKLPHDKST 384
A Q+L LA+KL T
Sbjct: 418 ATWQVLYALAQKLDRSYRT 436
>gi|10177241|dbj|BAB10615.1| unnamed protein product [Arabidopsis thaliana]
Length = 498
Score = 313 bits (801), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 150/352 (42%), Positives = 219/352 (62%), Gaps = 6/352 (1%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
+K+YVY+LP+ YN + RC +H+FAAE+ +HR LLSS VRTL+P+EAD+F+ P+Y
Sbjct: 138 MKIYVYDLPASYNDDWVTASDRCASHLFAAEVAIHRALLSSDVRTLDPDEADYFFVPVYV 197
Query: 100 TCDL-TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
+C+ T G P + ++ SA+ +S ++P+WNR++G+DH FV HDFGACFH E+
Sbjct: 198 SCNFSTSNGFPSLSHARSLLSSAVDFLSDHYPFWNRSQGSDHVFVASHDFGACFHAMEDM 257
Query: 159 AIERGILPLLQRATLVQTFGQR-NHVCLNEGSITIPPYAPPQKMQA--HQIPPDTPRSIF 215
AIE GI ++R+ ++QTFG + H C + IPPY PP+ +Q + P + R I+
Sbjct: 258 AIEEGIPKFMKRSIILQTFGVKYKHPCQEVEHVVIPPYIPPESVQKAIEKAPVNGRRDIW 317
Query: 216 VYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCP 275
+FRG + G +Y++G R A+ + F F ++ Y ++ R++FCLCP
Sbjct: 318 AFFRGKMEVNPKNISGRFYSKGVRTAILKKFGGRRRFYLNRHRFAGYRSEIVRSVFCLCP 377
Query: 276 LGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIP 335
LGWAPWSPRLVE+ V GC+PV+IAD I LPF++ + W EI + VAE+DV L +L +
Sbjct: 378 LGWAPWSPRLVESAVLGCVPVVIADGIQLPFSETVQWPEISLTVAEKDVRNLRKVLEHVA 437
Query: 336 PEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDKSTYLK 387
+ QR L P KRA+L+ P + GDA IL L RKL D +Y +
Sbjct: 438 ATNLSAIQRNLHEPVFKRALLYNVPMKEGDATWHILESLWRKL--DDRSYRR 487
>gi|225436482|ref|XP_002275679.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase IRX7
[Vitis vinifera]
gi|297734915|emb|CBI17149.3| unnamed protein product [Vitis vinifera]
Length = 460
Score = 308 bits (790), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 156/350 (44%), Positives = 218/350 (62%), Gaps = 5/350 (1%)
Query: 33 EDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADW 92
D + LKVY+Y+LPSKYN L + RC H+FA+E+ +H+ L S VRT +P EAD+
Sbjct: 92 HQDLLKDLKVYIYDLPSKYNVDWL-SNERCSNHLFASEVALHKALQESDVRTFDPWEADF 150
Query: 93 FYTPIYPTCDL-TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGAC 151
F+ P+Y +C+ T G P + ++ SAIQ IS+ P+WNR+ GADH FV HD+GAC
Sbjct: 151 FFVPVYVSCNFSTVNGFPAIGHARPLLASAIQHISTQLPFWNRSLGADHVFVASHDYGAC 210
Query: 152 FHYQEEKAIERGILPLLQRATLVQTFG-QRNHVCLNEGSITIPPYAPPQKMQA--HQIPP 208
FH E+ A GI L+++ ++QTFG + H C + ++ IPPY P+K+Q+ P
Sbjct: 211 FHAMEDVARADGIPEFLKKSIILQTFGVKHQHPCQDVENVLIPPYVSPEKVQSTLDSAPA 270
Query: 209 DTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQR 268
+ R I+V+FRG + G +Y++ R A+W+ + N F + Y ++ R
Sbjct: 271 NGQRDIWVFFRGKMEVHPKNISGRFYSKAVRTAIWQKYGGNRKFYLKRHRFAGYQSEIVR 330
Query: 269 AIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLD 328
++FCLCPLGWAPWSPRLVE+VV GC+PVIIAD I LPF++AI W EI + VAE+DV KL
Sbjct: 331 SVFCLCPLGWAPWSPRLVESVVLGCVPVIIADGIRLPFSEAIRWPEISLTVAEKDVGKLG 390
Query: 329 TILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKL 378
IL + + Q+ L +P KRA+LF Q GDA Q+LN L KL
Sbjct: 391 MILEDVAATNLSTIQKNLWDPENKRALLFNNQVQEGDATWQVLNALWHKL 440
>gi|302786830|ref|XP_002975186.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300157345|gb|EFJ23971.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 405
Score = 308 bits (789), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 150/342 (43%), Positives = 216/342 (63%), Gaps = 5/342 (1%)
Query: 38 GKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPI 97
+KVY+Y+LPS YN L+ D RC +H+FAAE+ +H+ LL SPVRTL+P+EAD+F+ P+
Sbjct: 53 SNIKVYIYDLPSSYNTDWLV-DSRCSSHLFAAEVAIHQNLLRSPVRTLDPDEADFFFMPV 111
Query: 98 YPTCDLTP-TGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQE 156
Y +C+ T +G P F + ++++A+ L+S N P+W+R +G DH FV HDFGACFH E
Sbjct: 112 YVSCNFTSRSGFPTLFHASDILQAAVGLVSRNMPFWDRHQGRDHVFVATHDFGACFHAME 171
Query: 157 EKAIERGILPLLQRATLVQTFGQRN-HVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIF 215
+ A+ GI L+ + ++QTFG++N H C N I IPPY P K P R I
Sbjct: 172 DLAVTMGIPQFLRNSIILQTFGEKNKHPCQNVDHIQIPPYVVPAKKLPD--PRSQRRKIL 229
Query: 216 VYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCP 275
+FRG + G Y+RG R +W F ++ F I Y +M R++FCLCP
Sbjct: 230 AFFRGKMEIHPKNVSGHMYSRGVRTTIWRRFSHDRRFFIKRKRSDNYKAEMLRSVFCLCP 289
Query: 276 LGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIP 335
LGWAPWSPR+VE+V+ GCIPVIIAD+I LP++ I W +I + VAE DV KLD IL+ +
Sbjct: 290 LGWAPWSPRIVESVIQGCIPVIIADNIQLPYSHVIDWRKISVTVAERDVHKLDRILSRVA 349
Query: 336 PEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARK 377
+ Q L +++A+++ QP GDA Q+L+ L+++
Sbjct: 350 ATNVSMIQANLWRDEVRQALVYNQPLVRGDATWQVLDLLSKR 391
>gi|302791649|ref|XP_002977591.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300154961|gb|EFJ21595.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 345
Score = 308 bits (788), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 150/342 (43%), Positives = 216/342 (63%), Gaps = 5/342 (1%)
Query: 38 GKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPI 97
+KVY+Y+LPS YN L+ D RC +H+FAAE+ +H+ LL SPVRTL+P+EAD+F+ P+
Sbjct: 4 SNIKVYIYDLPSSYNTDWLV-DSRCSSHLFAAEVAIHQNLLRSPVRTLDPDEADFFFMPV 62
Query: 98 YPTCDLTP-TGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQE 156
Y +C+ T +G P F + ++++A+ L+S N P+W+R +G DH FV HDFGACFH E
Sbjct: 63 YVSCNFTSRSGFPTLFHASDILQAAVGLVSRNMPFWDRHQGRDHVFVATHDFGACFHAME 122
Query: 157 EKAIERGILPLLQRATLVQTFGQRN-HVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIF 215
+ A+ GI L+ + ++QTFG++N H C N I IPPY P K P R I
Sbjct: 123 DLAVAMGIPQFLRNSIILQTFGEKNKHPCQNVDHIQIPPYVVPAKKLPD--PRGQRRKIL 180
Query: 216 VYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCP 275
+FRG + G Y+RG R +W F ++ F I Y +M R++FCLCP
Sbjct: 181 AFFRGKMEIHPKNVSGHMYSRGVRTTIWRRFSHDRRFFIKRKRSDNYKAEMLRSVFCLCP 240
Query: 276 LGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIP 335
LGWAPWSPR+VE+V+ GCIPVIIAD+I LP++ I W +I + VAE DV KLD IL+ +
Sbjct: 241 LGWAPWSPRIVESVIQGCIPVIIADNIQLPYSHVIDWRKISVTVAERDVHKLDRILSKVA 300
Query: 336 PEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARK 377
+ Q L +++A+++ QP GDA Q+L+ L+++
Sbjct: 301 ATNVSMIQANLWRDEVRQALVYNQPLVRGDATWQVLDLLSKR 342
>gi|357472125|ref|XP_003606347.1| Secondary cell wall-related glycosyltransferase family [Medicago
truncatula]
gi|355507402|gb|AES88544.1| Secondary cell wall-related glycosyltransferase family [Medicago
truncatula]
Length = 427
Score = 304 bits (779), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 146/358 (40%), Positives = 224/358 (62%), Gaps = 5/358 (1%)
Query: 25 SGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRT 84
+ S+ L+ + LK++VY+LP KYNK L K+PRC TH+FA+E+ +HR LL+S VRT
Sbjct: 54 TTSSPQTLQKTSLKNLKIFVYDLPPKYNKNWL-KNPRCKTHLFASEVAIHRALLTSDVRT 112
Query: 85 LNPEEADWFYTPIYPTCDL-TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFV 143
+P EAD+F+ P+Y +C+ T G P + ++ SA++LIS+ +P+WNR+ G+DH FV
Sbjct: 113 FDPYEADFFFVPVYVSCNFSTVNGFPAIGHARSLISSAVKLISTEYPFWNRSTGSDHVFV 172
Query: 144 VPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG-QRNHVCLNEGSITIPPYAPPQKMQ 202
HDFG+CFH E+ A++ G+ +++ + ++QTFG +H C + IPP+ P+ ++
Sbjct: 173 ASHDFGSCFHTLEDVAMKDGVPEIMKNSIVLQTFGVTYDHPCQKVEHVVIPPFVSPESVR 232
Query: 203 A--HQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPT 260
P + R I+V+FRG + G +Y++ R +W+ F + F +
Sbjct: 233 NTLENFPVNGRRDIWVFFRGKMEVHPKNVSGRFYSKKVRTVIWKKFNGDRRFYLRRHRFA 292
Query: 261 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
Y ++ R++FCLCPLGWAPWSPRLVE+V GC+PVIIAD I LPF+ A+ W EI + VA
Sbjct: 293 GYQSEIARSVFCLCPLGWAPWSPRLVESVALGCVPVIIADSIRLPFSSAVNWPEISVTVA 352
Query: 321 EEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKL 378
E+DV +L IL + + QR L +P ++A+LF GDA Q+L+ L+ K+
Sbjct: 353 EKDVWRLGEILEKVAATNLSIIQRNLWDPRTRKALLFNSRVHEGDATWQVLHSLSEKV 410
>gi|42570324|ref|NP_850113.2| exostosin-like protein [Arabidopsis thaliana]
gi|75216857|sp|Q9ZUV3.1|IRX7_ARATH RecName: Full=Probable glucuronoxylan glucuronosyltransferase IRX7;
AltName: Full=Protein FRAGILE FIBER 8; AltName:
Full=Protein IRREGULAR XYLEM 7
gi|4063747|gb|AAC98455.1| hypothetical protein [Arabidopsis thaliana]
gi|77022037|gb|ABA60868.1| putative glucuronyltransferase [Arabidopsis thaliana]
gi|77022039|gb|ABA60869.1| putative glucuronyltransferase [Arabidopsis thaliana]
gi|330252987|gb|AEC08081.1| exostosin-like protein [Arabidopsis thaliana]
Length = 448
Score = 303 bits (775), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 149/353 (42%), Positives = 217/353 (61%), Gaps = 7/353 (1%)
Query: 32 LEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLS--SPVRTLNPEE 89
++ D LK+YVY+LPSK+NK L D RC H+FAAE+ +H+ LS VRT +P E
Sbjct: 86 IKTDVFNNLKIYVYDLPSKFNKDWLAND-RCTNHLFAAEVALHKAFLSLEGDVRTEDPYE 144
Query: 90 ADWFYTPIYPTCDL-TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDF 148
AD+F+ P+Y +C+ T G P + ++ AI+L+S+ +P+WNRT G+DH F HDF
Sbjct: 145 ADFFFVPVYVSCNFSTINGFPAIGHARSLINDAIKLVSTQYPFWNRTSGSDHVFTATHDF 204
Query: 149 GACFHYQEEKAIERGILPLLQRATLVQTFGQR-NHVCLNEGSITIPPYAPPQKMQAHQ-- 205
G+CFH E++AI G+ L+ + ++QTFG NH C ++ IPPY P+ + Q
Sbjct: 205 GSCFHTMEDRAIADGVPIFLRNSIILQTFGVTFNHPCQEVENVVIPPYISPESLHKTQKN 264
Query: 206 IPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYED 265
IP R I+V+FRG + G +Y++ R +W ++ + F + Y +
Sbjct: 265 IPVTKERDIWVFFRGKMELHPKNISGRFYSKRVRTNIWRSYGGDRRFYLQRQRFAGYQSE 324
Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
+ R++FCLCPLGWAPWSPRLVE+V GC+PVIIAD I LPF + W +I + VAE DV
Sbjct: 325 IARSVFCLCPLGWAPWSPRLVESVALGCVPVIIADGIRLPFPSTVRWPDISLTVAERDVG 384
Query: 326 KLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKL 378
KL IL + + QR L +PS++RA++F P++ GDA Q+L L++KL
Sbjct: 385 KLGDILEHVAATNLSVIQRNLEDPSVRRALMFNVPSREGDATWQVLEALSKKL 437
>gi|255565439|ref|XP_002523710.1| transferase, putative [Ricinus communis]
gi|223537014|gb|EEF38650.1| transferase, putative [Ricinus communis]
Length = 461
Score = 301 bits (771), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 144/350 (41%), Positives = 220/350 (62%), Gaps = 6/350 (1%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP-VRTLNPEEADWFYTPIY 98
LK+Y+YELPSKYN+ L + RC H+FA+E+ +H+ + +S +RT +P EAD+F+ P+Y
Sbjct: 107 LKIYIYELPSKYNRDWL-SNKRCSNHLFASEVAIHKAISNSDDIRTFDPYEADFFFVPVY 165
Query: 99 PTCDL-TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEE 157
+C+ T G P + ++ SA+ IS+N+P+WNR++GADH FV HDFG+CFH EE
Sbjct: 166 VSCNFSTINGFPAIGHARSLLSSAVTFISTNYPFWNRSQGADHVFVASHDFGSCFHTLEE 225
Query: 158 KAIERGILPLLQRATLVQTFGQR-NHVCLNEGSITIPPYAPPQKMQA--HQIPPDTPRSI 214
+A++ G+ L+++ ++QTFG + +H C ++ IPPY P +++ + P R I
Sbjct: 226 RAMQDGVPEFLKKSIILQTFGVKYDHPCQQVENVVIPPYISPVSVRSTLKKAPLTGRRDI 285
Query: 215 FVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLC 274
+V+FRG + G +Y++ R +W F + F + Y ++ R++FCLC
Sbjct: 286 WVFFRGKMEVHPKNVSGRFYSKKVRTEIWRRFNGDRRFYLQRHRFAGYQSEIARSVFCLC 345
Query: 275 PLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSI 334
PLGWAPWSPRLVE+V GC+PVIIAD I LPF A+PW I + VAE+DV KL IL +
Sbjct: 346 PLGWAPWSPRLVESVALGCVPVIIADGIRLPFPSAVPWPAISLTVAEKDVAKLGRILEDV 405
Query: 335 PPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDKST 384
+ Q+ + +P+++RA+LF + GDA Q+L L +KL + T
Sbjct: 406 AATNLTLIQKNIWDPTVRRALLFNDQIEEGDATWQVLYALTKKLDRSRRT 455
>gi|356539378|ref|XP_003538175.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
IRX7-like [Glycine max]
Length = 460
Score = 298 bits (763), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 145/348 (41%), Positives = 213/348 (61%), Gaps = 5/348 (1%)
Query: 35 DPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFY 94
+ + KLKV+VY+LP KYN L + RC H+FA+E+ +HR LL+S VRT +P +AD+F+
Sbjct: 97 ESLKKLKVFVYDLPPKYNTDWLTNE-RCSKHLFASEVAIHRALLTSEVRTFDPYDADFFF 155
Query: 95 TPIYPTCDL-TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFH 153
P+Y +C+ T G P + ++ SA+ LISS +P+WNR+ G+DH FV HDFG+CFH
Sbjct: 156 VPVYVSCNFSTVNGFPAIGHARSLIASAVNLISSEYPFWNRSRGSDHVFVASHDFGSCFH 215
Query: 154 YQEEKAIERGILPLLQRATLVQTFGQR-NHVCLNEGSITIPPYAPPQKMQ--AHQIPPDT 210
E+ A+ G+ +++ + ++QTFG +H C + IPPY P+ ++ P D
Sbjct: 216 TLEDVAMADGVPEIVRNSIVLQTFGVVFDHPCQKVEHVVIPPYVSPESVRDTMENFPVDG 275
Query: 211 PRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAI 270
R I+ +FRG + G +Y++ R +W F + F + Y ++ R++
Sbjct: 276 RRDIWAFFRGKMEVHPKNVSGRFYSKEVRTVIWRKFNGDRRFYLQRHRFAGYQSEIARSV 335
Query: 271 FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTI 330
FCLCPLGWAPWSPRLVE+V GC+PV+IAD I LPF A+ W EI + VAE+DV +L I
Sbjct: 336 FCLCPLGWAPWSPRLVESVALGCVPVVIADGIRLPFVSAVKWSEISVTVAEKDVGRLAEI 395
Query: 331 LTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKL 378
L + + QR L +P+ +RA+LF Q GDA Q+L L+ KL
Sbjct: 396 LERVAATNLSTIQRNLWDPATRRALLFNSQVQVGDATWQVLRALSEKL 443
>gi|297822463|ref|XP_002879114.1| hypothetical protein ARALYDRAFT_481698 [Arabidopsis lyrata subsp.
lyrata]
gi|297324953|gb|EFH55373.1| hypothetical protein ARALYDRAFT_481698 [Arabidopsis lyrata subsp.
lyrata]
Length = 452
Score = 297 bits (760), Expect = 8e-78, Method: Compositional matrix adjust.
Identities = 145/353 (41%), Positives = 216/353 (61%), Gaps = 7/353 (1%)
Query: 32 LEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLS--SPVRTLNPEE 89
+ D LK+YVY+LPSK+NK L D RC H+FAAE+ +H+ LS +RT +P E
Sbjct: 90 IRTDVFNNLKIYVYDLPSKFNKDWLAND-RCSNHLFAAEVALHKAFLSLEGDIRTEDPYE 148
Query: 90 ADWFYTPIYPTCDL-TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDF 148
AD+F+ P+Y +C+ T G P + ++ AI+ +S+ +P+WNR G+DH F HDF
Sbjct: 149 ADFFFVPVYVSCNFSTINGFPAIGHARTLINDAIKFVSTQYPFWNRNNGSDHVFTATHDF 208
Query: 149 GACFHYQEEKAIERGILPLLQRATLVQTFGQR-NHVCLNEGSITIPPYAPPQKMQA--HQ 205
G+CFH E++AI G+ +L+ + ++QTFG NH C ++ IPPY P+ +
Sbjct: 209 GSCFHTMEDRAIADGVPKILRSSIVLQTFGVTFNHPCQEVENVVIPPYISPESLHKTLKN 268
Query: 206 IPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYED 265
IP + R I+ +FRG + G +Y++ R +W ++ + F + + Y +
Sbjct: 269 IPVNKERDIWAFFRGKMELHPKNISGRFYSKRVRTKIWRSYGGDRRFYLQRQRFSGYQLE 328
Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
+ R++FCLCPLGWAPWSPRLVE+V GC+PVIIAD I LPF A+ W +I + VAE DV
Sbjct: 329 IARSVFCLCPLGWAPWSPRLVESVALGCVPVIIADGIRLPFPSAVRWPDISLTVAERDVG 388
Query: 326 KLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKL 378
KL IL + + QR L +PS++RA++F P++ GDA Q+L L++KL
Sbjct: 389 KLGDILEHVVATNLSVIQRNLEDPSVRRALMFNVPSREGDATWQVLEALSKKL 441
>gi|449459136|ref|XP_004147302.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase F8H-like
[Cucumis sativus]
Length = 447
Score = 296 bits (758), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 151/342 (44%), Positives = 213/342 (62%), Gaps = 6/342 (1%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
LKV+VY+LP ++N L D RC H+FA+E+ +H+ LL+S VRTL+P EAD+F+ P+Y
Sbjct: 97 LKVFVYDLPPEFNANWL-SDARCGGHLFASEVAIHKALLTSHVRTLDPSEADFFFVPVYV 155
Query: 100 TCDLTP-TGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
+C+ + G P +P ++ SA+ +IS +P+WNR+ G DH FV HD+GACFH E+
Sbjct: 156 SCNFSSFNGFPAIAHAPSLLASAVDVISGQFPFWNRSRGFDHVFVASHDYGACFHSLEDM 215
Query: 159 AIERGILPLLQRATLVQTFGQR-NHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVY 217
AI GI L+ + ++QTFG + H C + +I IPPY P+ M+ + R IF +
Sbjct: 216 AIANGIPEFLKNSIILQTFGVKYKHPCQDVENILIPPYISPEFMEP-AVVDGRRRDIFAF 274
Query: 218 FRGLFYDVNNDPEGG-YYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPL 276
FRG +VN GG +Y + R +W+ F + F + Y ++ R++FCLCPL
Sbjct: 275 FRGKM-EVNPKNVGGRFYGKRVRTTIWKKFHRDRRFYLRRHRFAGYRSEIARSVFCLCPL 333
Query: 277 GWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPP 336
GWAPWSPRLVE+V GC+PVIIAD I LPF A+ W I + VAE+DV KL IL +
Sbjct: 334 GWAPWSPRLVESVALGCVPVIIADGIRLPFPSAVDWPGISLTVAEKDVGKLRKILERVAA 393
Query: 337 EVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKL 378
+ Q+ L +P +RA+LF P QP DA Q+L+ LA KL
Sbjct: 394 TNLTAIQKNLWDPKNRRALLFHNPTQPQDATWQVLSALAEKL 435
>gi|302826405|ref|XP_002994684.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300137154|gb|EFJ04250.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 332
Score = 295 bits (755), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 146/342 (42%), Positives = 212/342 (61%), Gaps = 18/342 (5%)
Query: 38 GKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPI 97
+KVY+Y+LPS YN L+ D RC +H+FAAE+ +H+ LL SPVRTL+P+EAD+F+ P+
Sbjct: 4 SNIKVYIYDLPSSYNTDWLV-DSRCSSHLFAAEVAIHQNLLRSPVRTLDPDEADFFFMPV 62
Query: 98 YPTCDLTP-TGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQE 156
Y +C+ T +G P F + ++++A+ L+S N P+W+R +G DH FV HDFGACFH +
Sbjct: 63 YVSCNFTSRSGFPTLFHASDILQAAVGLVSRNMPFWDRHQGRDHVFVATHDFGACFHAMD 122
Query: 157 EKAIERGILPLLQRATLVQTFGQRN-HVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIF 215
A+ GI L+ + ++QTFG++N H C N I IPPY R I
Sbjct: 123 -LAVTMGIPQFLRNSIILQTFGEKNKHPCQNVDHIQIPPY--------------VRRKIL 167
Query: 216 VYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCP 275
+FRG + G Y+RG R +W F ++ F I Y +M R++FCLCP
Sbjct: 168 AFFRGKMEIHPKNVSGHMYSRGVRTTIWRRFSHDRRFFIKRKRSDNYKAEMLRSVFCLCP 227
Query: 276 LGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIP 335
LGWAPWSPR+VE+V+ GCIPVIIAD+I LP++ I W +I + VAE DV KLD IL+ +
Sbjct: 228 LGWAPWSPRIVESVIQGCIPVIIADNIQLPYSHVIDWRKISVTVAERDVHKLDRILSRVA 287
Query: 336 PEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARK 377
+ Q L +++A+++ QP GDA Q+L+ L+++
Sbjct: 288 ATNVSMIQANLWRDEVRQALVYNQPLVRGDATWQVLDLLSKR 329
>gi|168003473|ref|XP_001754437.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694539|gb|EDQ80887.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 353
Score = 294 bits (752), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 147/343 (42%), Positives = 213/343 (62%), Gaps = 5/343 (1%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
+++YVY+LP+K+N+ L D RC H+FAAE+ +H+ L++SP+RTL+P EAD+F+ P+Y
Sbjct: 5 IRIYVYDLPAKFNEDWL-ADERCSNHLFAAEVAIHKVLMTSPIRTLDPCEADFFFIPVYV 63
Query: 100 TCDLTP-TGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
+C TP TG P ++ + M +A+ +S+ +WNR+ G DH FV HD+GACFH E +
Sbjct: 64 SCKFTPKTGFPWLGQARKFMEAAVNHVSTRMEFWNRSGGRDHIFVASHDYGACFHTLETE 123
Query: 159 AIERGILPLLQRATLVQTFG-QRNHVCLNEGSITIPPYAPPQKMQAH-QIPPD-TPRSIF 215
AI GI ++++ ++QTFG Q H C I IPPY P ++ + PP+ R+IF
Sbjct: 124 AIAHGIPEFMRKSLILQTFGVQDFHPCQAAEHIQIPPYVSPSVAASYIKDPPERQKRNIF 183
Query: 216 VYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCP 275
+FRG + G Y+RG R +++ F +N F + Y +M R+ FCLCP
Sbjct: 184 AFFRGKMEINPKNVSGLVYSRGVRTVLYKKFSHNRRFLLKRHRTDNYQLEMLRSTFCLCP 243
Query: 276 LGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIP 335
+GWAPWSPR+VEAVV GC+PVIIAD+I LP++ AI W I + V E DVPKLD IL ++
Sbjct: 244 VGWAPWSPRIVEAVVHGCVPVIIADNISLPYSHAIDWTGISLSVREHDVPKLDKILLNVA 303
Query: 336 PEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKL 378
+ Q L +RA+LF P GDA + + L+ KL
Sbjct: 304 ATNLSTIQHNLWKEENRRALLFTDPLVKGDATWHVFDRLSTKL 346
>gi|168059925|ref|XP_001781950.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666596|gb|EDQ53246.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 356
Score = 293 bits (751), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 145/343 (42%), Positives = 212/343 (61%), Gaps = 5/343 (1%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
++VYVY+LP K+NK L+ D RC H+FA+E+ +H+ LLSSP++TLNP EAD+F+ P+Y
Sbjct: 5 IRVYVYDLPQKFNKDWLV-DERCSNHLFASEVAIHKILLSSPIKTLNPYEADFFFMPVYF 63
Query: 100 TCDLTP-TGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
+C + TG P +P++M A+ +SS +WNR+ G DH FV HDFGACFH E +
Sbjct: 64 SCKFSSKTGFPRLGHAPKLMEDAVNHVSSMMEFWNRSGGKDHVFVAAHDFGACFHSLESE 123
Query: 159 AIERGILPLLQRATLVQTFGQRN-HVCLNEGSITIPPYAPPQKMQAH--QIPPDTPRSIF 215
AI GI ++Q + ++QTFG H C +I IPPY P + ++ + P + R+IF
Sbjct: 124 AIAHGIPEIVQSSLILQTFGVHGFHPCQAAENIQIPPYISPSTVFSYVKKPPEEQRRNIF 183
Query: 216 VYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCP 275
+FRG + G Y+RG R +++ F N F + Y D+ R+ FCLCP
Sbjct: 184 AFFRGKMEINPKNVSGLVYSRGVRTYIYKKFSRNRRFFLKRHRADNYQLDLLRSTFCLCP 243
Query: 276 LGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIP 335
LGWAPWSPR+VEAV +GC+PVIIAD+I LP++ AI W + + + E DV KL IL ++
Sbjct: 244 LGWAPWSPRIVEAVAYGCVPVIIADNIRLPYSHAIDWSNMSLNIREHDVHKLYKILLNVA 303
Query: 336 PEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKL 378
+ + Q+ L +R +LF +P GDA +L+ L+ K+
Sbjct: 304 AKNLSSIQKNLWKEENRRVLLFMEPLAKGDATWHMLDRLSTKV 346
>gi|356542668|ref|XP_003539788.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
IRX7-like [Glycine max]
Length = 461
Score = 293 bits (750), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 143/346 (41%), Positives = 210/346 (60%), Gaps = 5/346 (1%)
Query: 37 VGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP 96
+ LKV+VY+LP KYN L + RC H+FA+E+ +HR LL+S VRT +P +AD+F+ P
Sbjct: 100 LKNLKVFVYDLPQKYNTDWL-SNERCSKHLFASEVAIHRALLTSEVRTFDPYDADFFFVP 158
Query: 97 IYPTCDL-TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQ 155
+Y +C+ T G P + ++ SA+ L+SS +P+WNR+ G+DH FV HDFG+CFH
Sbjct: 159 VYVSCNFSTVNGFPAIGHARSLIASAVSLVSSEYPFWNRSRGSDHVFVASHDFGSCFHTL 218
Query: 156 EEKAIERGILPLLQRATLVQTFGQ-RNHVCLNEGSITIPPYAPPQKMQ--AHQIPPDTPR 212
E+ A+ G+ +++ + ++QTFG +H C + + IPPY P+ ++ P + R
Sbjct: 219 EDVAMADGVPEIMRNSIVLQTFGVVYDHPCQSVEHVVIPPYVSPESVRDTMENFPVNGRR 278
Query: 213 SIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFC 272
I+ +FRG + G +Y++ R +W F + F + Y ++ R++FC
Sbjct: 279 DIWAFFRGKMELHPKNVSGRFYSKKVRTVIWRKFNGDRRFYLQRQRFAGYQSEIARSVFC 338
Query: 273 LCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILT 332
LCPLGWAPWSPRLVE+V GC+PVIIAD I LPF A+ W EI + VAE+DV +L IL
Sbjct: 339 LCPLGWAPWSPRLVESVALGCVPVIIADGIRLPFISAVKWPEISITVAEKDVGRLAEILE 398
Query: 333 SIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKL 378
+ + QR L +P + A+LF Q GDA QIL L+ KL
Sbjct: 399 RVAATNLSTIQRNLWDPVTRSALLFNSQVQKGDATWQILRALSEKL 444
>gi|414864293|tpg|DAA42850.1| TPA: hypothetical protein ZEAMMB73_024068 [Zea mays]
Length = 434
Score = 293 bits (749), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 145/356 (40%), Positives = 212/356 (59%), Gaps = 4/356 (1%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
++++YVY+LP+++N+ D RC H+FAAE+ +H LL+ R PE+AD F+ P+Y
Sbjct: 77 RVRIYVYDLPARFNRDWAAADARCSRHLFAAEVAVHEALLAYAGRAARPEDADLFFVPVY 136
Query: 99 PTCDL-TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEE 157
+C+ TP G P + ++ A+ L+ + PYWNR+ GADH FV HDFGACFH E+
Sbjct: 137 VSCNFSTPNGFPSLSHARGLLADAVDLVRARMPYWNRSAGADHVFVASHDFGACFHPMED 196
Query: 158 KAIERGILPLLQRATLVQTFG-QRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFV 216
AI GI L+R+ L+QTFG Q +HVC + IPP+ PP+ P R IF
Sbjct: 197 VAIADGIPEFLKRSILLQTFGVQGHHVCQEVEHVVIPPHVPPEVAHELPEPEKAQRDIFA 256
Query: 217 YFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPL 276
+FRG + G +Y++ R + +++ N F + Y +M R++FCLCPL
Sbjct: 257 FFRGKMEVHPKNISGRFYSKKVRTELLQHYGRNRKFYLKRKRFDNYRSEMARSLFCLCPL 316
Query: 277 GWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPP 336
GWAPWSPRLVE+V+ GCIPVIIADDI LPF + W+EI + VAE+D+ L +L +
Sbjct: 317 GWAPWSPRLVESVLLGCIPVIIADDIRLPFPPVLQWQEISLQVAEKDIASLGMVLDHVVA 376
Query: 337 EVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDKSTYLKPGGKF 392
+ Q+ L +P +RA++F +P + GDA Q+L L L D+S + G +
Sbjct: 377 TNLTVIQKNLWDPVKRRALVFNRPMEAGDATWQVLRELEVLL--DQSRRRRSVGSY 430
>gi|356547155|ref|XP_003541982.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
IRX7-like [Glycine max]
Length = 458
Score = 291 bits (746), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 147/364 (40%), Positives = 223/364 (61%), Gaps = 13/364 (3%)
Query: 21 IERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSS 80
+E S + G VL++ +KV+VYELP KYN L + RC H+FA+E+ +HR LL+S
Sbjct: 85 VESTSNTLG-VLKN-----MKVFVYELPPKYNTDWLANE-RCSNHLFASEVAIHRALLTS 137
Query: 81 PVRTLNPEEADWFYTPIYPTCDLTP-TGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGAD 139
VRT +P EAD+F+ P+Y +C+ + G P + ++ SA+ L+S+ +P+WNR+ G+D
Sbjct: 138 EVRTFDPYEADFFFVPVYVSCNFSAVNGFPAIGHARTLISSAVNLVSTEYPFWNRSRGSD 197
Query: 140 HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRN-HVCLNEGSITIPPYAPP 198
H FV HDFGACFH E+ A+ GI +L+ + ++QTFG + H C + ++ IPPY P
Sbjct: 198 HVFVASHDFGACFHTLEDVAMADGIPKILKNSIVLQTFGVIHPHPCQDVENVVIPPYVAP 257
Query: 199 QKMQA--HQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDIST 256
+ +++ + P + R I+ +FRG + G +Y++ R +W F + F +
Sbjct: 258 ESVRSTLEKFPVNGRRDIWAFFRGKMEVHPKNVSGQFYSKRVRTEIWRKFNGDRRFYLQR 317
Query: 257 DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG 316
Y ++ R++FCLCPLGWAPWSPRLVE+V GC+PV+IAD I LPF+ A+ W EI
Sbjct: 318 RRFAGYQLEIARSVFCLCPLGWAPWSPRLVESVALGCVPVVIADGIRLPFSSAVRWSEIS 377
Query: 317 MFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLF--PQPAQPGDAFHQILNGL 374
+ VAE DV KL IL + + Q+ L +P +RA+LF + + GDA Q++ L
Sbjct: 378 LTVAERDVGKLGKILERVAATNLSVIQKSLWDPGTRRALLFNNNKKVEEGDATWQVMVSL 437
Query: 375 ARKL 378
+ KL
Sbjct: 438 SEKL 441
>gi|356541948|ref|XP_003539434.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
IRX7-like [Glycine max]
Length = 459
Score = 290 bits (743), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 144/345 (41%), Positives = 209/345 (60%), Gaps = 6/345 (1%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
+KV+VYELP KYN L + RC +H+FA+E+ +HR LL+S VRT +P EAD+F+ P+Y
Sbjct: 99 NMKVFVYELPPKYNTDWLANE-RCSSHLFASEVAIHRALLTSEVRTFDPYEADFFFVPVY 157
Query: 99 PTCDLTPTG-LPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEE 157
+C+ + P + ++ SA+ L+S+ +P+WNR+ G+DH FV HDFGACFH E+
Sbjct: 158 VSCNFSAVNDFPAIGHARTLISSAVNLVSTEYPFWNRSRGSDHVFVASHDFGACFHTLED 217
Query: 158 KAIERGILPLLQRATLVQTFGQ-RNHVCLNEGSITIPPYAPPQKMQA--HQIPPDTPRSI 214
A+ GI +L+ + ++QTFG H C ++ IPPY P+ +++ + P R I
Sbjct: 218 VAMADGIPIILKNSIVLQTFGVIHQHPCQEVENVVIPPYVSPESVRSTLEKFPVTGRRDI 277
Query: 215 FVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLC 274
F +FRG + +Y++ R +W F + F + Y ++ R++FCLC
Sbjct: 278 FAFFRGKMEVHPKNVSRRFYSKRVRTEIWRKFNGDRRFYLQRHRFAGYQLEIARSVFCLC 337
Query: 275 PLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSI 334
PLGWAPWSPRLVE+V GC+PV+IAD I LPF+ A+ W EI + VAE DV KL IL +
Sbjct: 338 PLGWAPWSPRLVESVALGCVPVVIADGIQLPFSSAVRWSEISLSVAERDVGKLGKILERV 397
Query: 335 PPEVILRKQRLLANPSMKRAMLF-PQPAQPGDAFHQILNGLARKL 378
+ QR L +P +RA+LF Q GDA Q+L L+ KL
Sbjct: 398 AATNLSVIQRNLWDPRTRRALLFNNNKVQEGDATWQVLVSLSEKL 442
>gi|168049543|ref|XP_001777222.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162671450|gb|EDQ58002.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 368
Score = 290 bits (741), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 145/343 (42%), Positives = 208/343 (60%), Gaps = 5/343 (1%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
+++YVY+LP K+N+ L+ D RC H+FA+E+ +H+ LL+SP+RTL+P EAD+F+ P+Y
Sbjct: 20 IRIYVYDLPPKFNEDWLV-DERCSNHLFASEVAIHKILLTSPIRTLDPYEADFFFMPVYV 78
Query: 100 TCDLTP-TGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
+C +P TG P +P++M++A+ +S+ +WNR+ G DH FV HD+GACFH E +
Sbjct: 79 SCKFSPKTGFPWLGHAPKLMQAAVNHVSTKMEFWNRSWGRDHIFVAAHDYGACFHTLETQ 138
Query: 159 AIERGILPLLQRATLVQTFGQRN-HVCLNEGSITIPPYAPPQKMQAHQIPP--DTPRSIF 215
AI +GI ++ + ++QTFG + H C I IPPY P ++ P R IF
Sbjct: 139 AIAQGIPQFMRNSLILQTFGVKGFHPCQAAEHIQIPPYISPSVAVSYVKDPLEHQQRDIF 198
Query: 216 VYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCP 275
YFRG + G Y++G R +++ F N F + ++M R+ FCLCP
Sbjct: 199 AYFRGKMEINPKNVSGLLYSKGIRTVLYKRFSRNKRFVLKRHRVDNSQQEMLRSTFCLCP 258
Query: 276 LGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIP 335
LGWAPWSPR+VEAV +GCIPVIIAD+I LP++ I W I + V E DVPKLD IL +
Sbjct: 259 LGWAPWSPRIVEAVTYGCIPVIIADNISLPYSHTIDWSSISLTVPEHDVPKLDKILIGVA 318
Query: 336 PEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKL 378
+ QR L +RA+LF P GDA I + L+ L
Sbjct: 319 VTNLTAIQRNLWRGENRRALLFTDPLVQGDATWHIFDLLSSTL 361
>gi|293334733|ref|NP_001169191.1| uncharacterized protein LOC100383044 [Zea mays]
gi|223975431|gb|ACN31903.1| unknown [Zea mays]
gi|413957212|gb|AFW89861.1| hypothetical protein ZEAMMB73_311893 [Zea mays]
Length = 428
Score = 290 bits (741), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 144/355 (40%), Positives = 212/355 (59%), Gaps = 4/355 (1%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
+++YVY+LP+++N+ + D RC H+FAAE+ +H LL+ R PE+AD F+ P+Y
Sbjct: 74 VRIYVYDLPARFNRDWAVADARCARHLFAAEVAVHEALLAYTGRAARPEDADLFFVPVYV 133
Query: 100 TCDL-TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
+C+ TP G P + M+ A+ L+ + PYWNR+ GADH FV HDFGACFH E+
Sbjct: 134 SCNFSTPNGFPSLSHARGMLADAVDLVQAGMPYWNRSAGADHVFVASHDFGACFHPMEDV 193
Query: 159 AIERGILPLLQRATLVQTFG-QRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVY 217
AI GI L+R+ L+QTFG Q +H C + IPP+ PP+ P R IF +
Sbjct: 194 AIADGIPEFLKRSILLQTFGVQGHHTCQEVEHVVIPPHVPPEVEHELPEPEKAQRDIFAF 253
Query: 218 FRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLG 277
FRG + G +Y++ R + +++ N F + Y +M R++FCLCPLG
Sbjct: 254 FRGKMEVHPKNISGRFYSKKVRTELLQHYGRNRKFYLKRKRFDNYRSEMARSLFCLCPLG 313
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
WAPWSPRLVE+V+ GCIPVIIAD+I +PF + W EI + VAE+DV L+ +L +
Sbjct: 314 WAPWSPRLVESVLLGCIPVIIADNIRMPFPSVLQWPEISLQVAEKDVANLEVVLDHVVAT 373
Query: 338 VILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDKSTYLKPGGKF 392
+ Q+ L +P ++A++F +P + GDA Q+L L L D+S + GG +
Sbjct: 374 NLSVIQKNLWDPVKRKALVFNRPMEVGDATWQVLRELEVLL--DQSQRRRYGGSW 426
>gi|449456052|ref|XP_004145764.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
IRX7-like [Cucumis sativus]
Length = 459
Score = 288 bits (738), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 147/372 (39%), Positives = 219/372 (58%), Gaps = 11/372 (2%)
Query: 11 AFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAE 70
A + NT + ++ + G +LE+ +KV+VY+LP KYN + L + RC H+FA+E
Sbjct: 83 ALIETSNTTFLRQVQQNQG-LLEE-----VKVFVYDLPPKYNVEWL-SNERCSNHLFASE 135
Query: 71 IFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDL-TPTGLPLPFKSPRMMRSAIQLISSNW 129
+ +HR LL+S RT +P EAD+F+ P+Y +C+ T G P + ++ SA+ ISS++
Sbjct: 136 VAIHRALLNSDYRTFDPLEADFFFVPVYVSCNFSTVNGFPAIGHARSLISSAVSHISSHY 195
Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR-NHVCLNEG 188
+WNRT G+DH FV HDF +CFH E AI G+ L+ + ++QTFG + H C +
Sbjct: 196 SFWNRTNGSDHVFVASHDFASCFHTMEHVAIADGVPSFLKNSIILQTFGVKYKHPCQDVE 255
Query: 189 SITIPPYAPPQKMQA--HQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENF 246
+ IPPY P+ ++ + P R IF +FRG + G +Y++ R +W F
Sbjct: 256 HVVIPPYISPESIENTLERSPVTGRRDIFAFFRGKMEMNPKNVSGRFYSKKVRTMIWRKF 315
Query: 247 KNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 306
+ F + Y ++ R++FCLCPLGWAPWSPRLVE+V GC+PVIIAD I LPF
Sbjct: 316 NGDRRFYLQRHRFPGYQSEIVRSVFCLCPLGWAPWSPRLVESVALGCVPVIIADGIRLPF 375
Query: 307 ADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDA 366
A+ W EI + VAE+D+ KL IL + + Q+ L +P +RA+LF + GDA
Sbjct: 376 PSAVNWPEISITVAEKDIGKLGRILDHVAASNLTTIQKNLWDPRNRRALLFHNQVEDGDA 435
Query: 367 FHQILNGLARKL 378
Q++ L+ KL
Sbjct: 436 TWQVIGALSEKL 447
>gi|115450193|ref|NP_001048697.1| Os03g0107900 [Oryza sativa Japonica Group]
gi|122247627|sp|Q10SX7.1|GT31_ORYSJ RecName: Full=Probable glucuronosyltransferase Os03g0107900
gi|108705764|gb|ABF93559.1| exostosin family protein, putative, expressed [Oryza sativa
Japonica Group]
gi|113547168|dbj|BAF10611.1| Os03g0107900 [Oryza sativa Japonica Group]
gi|125542077|gb|EAY88216.1| hypothetical protein OsI_09667 [Oryza sativa Indica Group]
gi|215766485|dbj|BAG98793.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222624042|gb|EEE58174.1| hypothetical protein OsJ_09104 [Oryza sativa Japonica Group]
Length = 427
Score = 288 bits (737), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 140/341 (41%), Positives = 203/341 (59%), Gaps = 2/341 (0%)
Query: 36 PVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYT 95
P L++YVY+LP+++N+ + D RC TH+FAAE+ +H LL+ R P++A F+
Sbjct: 70 PRPALRIYVYDLPARFNRHWVAADARCATHLFAAEVALHEALLAYAGRAARPDDATLFFV 129
Query: 96 PIYPTCDL-TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHY 154
P+Y +C+ T G P + ++ A+ L+ + PYWNR+ GADH FV HDFGACFH
Sbjct: 130 PVYVSCNFSTDNGFPSLSHARALLADAVDLVRAQMPYWNRSAGADHVFVASHDFGACFHP 189
Query: 155 QEEKAIERGILPLLQRATLVQTFG-QRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRS 213
E+ AI GI L+R+ L+QTFG Q HVC + IPP+ PP+ P R
Sbjct: 190 MEDVAIADGIPEFLKRSILLQTFGVQGTHVCQEADHVVIPPHVPPEVALELPEPEKAQRD 249
Query: 214 IFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCL 273
IF +FRG + G +Y++ R + + + N F + Y +M R++FCL
Sbjct: 250 IFAFFRGKMEVHPKNISGRFYSKKVRTELLQKYGRNRKFYLKRKRYGNYRSEMARSLFCL 309
Query: 274 CPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTS 333
CPLGWAPWSPRLVE+V+ GCIPVIIADDI LPF + W +I + VAE+DV L+ +L
Sbjct: 310 CPLGWAPWSPRLVESVLLGCIPVIIADDIRLPFPSVLQWLDISLQVAEKDVASLEMVLDH 369
Query: 334 IPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+ + Q+ L +P ++A++F +P + GDA Q+L L
Sbjct: 370 VVATNLTVIQKNLWDPVKRKALVFNRPMEEGDATWQVLREL 410
>gi|449496198|ref|XP_004160070.1| PREDICTED: LOW QUALITY PROTEIN: probable glucuronoxylan
glucuronosyltransferase IRX7-like [Cucumis sativus]
Length = 459
Score = 288 bits (737), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 147/372 (39%), Positives = 220/372 (59%), Gaps = 11/372 (2%)
Query: 11 AFVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAE 70
A + NT + ++ + ++LE+ +KV+VY+LP KYN + L + RC H+FA+E
Sbjct: 83 ALIETSNTTFLRQVQQNQ-ELLEE-----VKVFVYDLPPKYNVEWL-SNERCSNHLFASE 135
Query: 71 IFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDL-TPTGLPLPFKSPRMMRSAIQLISSNW 129
+ +HR LL+S RT +P EAD+F+ P+Y +C+ T G P + ++ SA+ ISS++
Sbjct: 136 VAIHRALLNSHYRTFDPLEADFFFVPVYVSCNFSTVNGFPAIGHARSLISSAVSHISSHY 195
Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR-NHVCLNEG 188
+WNRT G+DH FV HDF +CFH E AI G+ L+ + ++QTFG + H C +
Sbjct: 196 SFWNRTNGSDHVFVASHDFASCFHTMEHVAIADGVPSFLKNSIILQTFGVKYKHPCQDVE 255
Query: 189 SITIPPYAPPQKMQA--HQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENF 246
+ IPPY PP+ ++ + P R IF +FRG + G +Y++ R +W F
Sbjct: 256 HVVIPPYIPPESIENTLERSPVTGRRDIFAFFRGKMEMNPKNVSGRFYSKKVRTMIWRKF 315
Query: 247 KNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 306
+ F + Y ++ R++FCLCPLGWAPWSPRLVE+V GC+PVIIAD I LPF
Sbjct: 316 NGDRRFYLQRHRFPGYQSEIVRSVFCLCPLGWAPWSPRLVESVALGCVPVIIADGIRLPF 375
Query: 307 ADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDA 366
A+ W EI + VAE+D+ KL IL + + Q+ L +P +RA+LF + GDA
Sbjct: 376 PSAVNWPEISITVAEKDIGKLGRILDHVAGSNLTTIQKNLWDPRNRRALLFHNQVEDGDA 435
Query: 367 FHQILNGLARKL 378
Q++ L+ KL
Sbjct: 436 TWQVIGALSEKL 447
>gi|242042569|ref|XP_002468679.1| hypothetical protein SORBIDRAFT_01g050110 [Sorghum bicolor]
gi|241922533|gb|EER95677.1| hypothetical protein SORBIDRAFT_01g050110 [Sorghum bicolor]
Length = 429
Score = 287 bits (735), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 139/337 (41%), Positives = 203/337 (60%), Gaps = 2/337 (0%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
+++YVY+LP+++N+ D RC H+FAAE+ +H LL+ R PE+AD F+ P+Y
Sbjct: 75 VRIYVYDLPARFNRDWAAADARCARHLFAAEVAVHEALLAYAGRAARPEDADLFFVPVYV 134
Query: 100 TCDL-TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
+C+ TP G P + ++ A+ L+ PYWNR+ GADH FV HDFGACFH E+
Sbjct: 135 SCNFSTPNGFPSLSHARGLLAEAVDLVRVRMPYWNRSAGADHVFVASHDFGACFHPMEDV 194
Query: 159 AIERGILPLLQRATLVQTFG-QRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVY 217
AI GI L+R+ L+QTFG Q +HVC + IPP+ PP+ P R IF +
Sbjct: 195 AIADGIPEFLKRSILLQTFGVQGHHVCQEVEHVVIPPHVPPEVAHELPEPEKAQRDIFAF 254
Query: 218 FRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLG 277
FRG + G +Y++ R + +++ N F + Y +M R++FCLCPLG
Sbjct: 255 FRGKMEVHPKNISGRFYSKKVRTELLQHYGRNRKFYLKRKRFDNYRSEMARSLFCLCPLG 314
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
WAPWSPRLVE+V+ GCIPVIIAD+I LPF + W EI + VAE+D+ L+ +L +
Sbjct: 315 WAPWSPRLVESVLLGCIPVIIADNIRLPFPSVLQWPEISLQVAEKDIANLEMVLDHVVAT 374
Query: 338 VILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+ Q+ L +P ++A++F +P + GDA Q+L L
Sbjct: 375 NLTMIQKNLWDPVKRKALVFNRPMEVGDATWQVLREL 411
>gi|357121010|ref|XP_003562215.1| PREDICTED: probable glucuronosyltransferase Os03g0107900-like
[Brachypodium distachyon]
Length = 441
Score = 283 bits (724), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 142/349 (40%), Positives = 206/349 (59%), Gaps = 3/349 (0%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
+++YVY+LP ++N+ L DPRC H+FAAE+ +H LL + PE+AD F+ P+Y
Sbjct: 89 VRIYVYDLPRRFNRDWLAADPRCARHLFAAEVALHEALLGH-YSAVRPEDADLFFVPVYV 147
Query: 100 TCDL-TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
+C+ TP G P + ++ A+ L+ PYWNR+ GADH FV HDFGACFH E+
Sbjct: 148 SCNFSTPNGFPSLSHARGLLADAVDLVRREAPYWNRSAGADHVFVASHDFGACFHPMEDV 207
Query: 159 AIERGILPLLQRATLVQTFG-QRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVY 217
AI GI L+R+ L+QTFG Q HVC + IPP+ PP+ T R IF +
Sbjct: 208 AIADGIPDFLKRSILLQTFGVQGPHVCQEAEHVVIPPHVPPEVALEILELEKTRRDIFAF 267
Query: 218 FRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLG 277
FRG + G +Y++ R + + + N F + Y +M R++FCLCPLG
Sbjct: 268 FRGKMEVHPKNISGRFYSKKVRTELLQRYGRNSKFYLKRKRYDNYRSEMARSLFCLCPLG 327
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
WAPWSPRLVE+V+ GCIPVIIAD+I LPF + W +I + VAE+DV L+ +L +
Sbjct: 328 WAPWSPRLVESVLLGCIPVIIADNIRLPFPSVLRWSDISLQVAEKDVASLEKVLDHVVAT 387
Query: 338 VILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDKSTYL 386
+ Q+ L +P ++A++F + + GDA Q+L L L +S+Y+
Sbjct: 388 NLTVIQKNLWDPVKRKALVFNRRLEEGDATWQVLRELEVLLDRSQSSYV 436
>gi|357141002|ref|XP_003572039.1| PREDICTED: probable glucuronosyltransferase Os03g0107900-like
[Brachypodium distachyon]
Length = 465
Score = 282 bits (721), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 142/343 (41%), Positives = 204/343 (59%), Gaps = 3/343 (0%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
+++YVY+LP ++N+ + D RC H+FAAE+ +H LL +R PEEAD F P+Y
Sbjct: 111 VRIYVYDLPPRFNRAWVSADARCGRHLFAAEVAVHEALLRRHLRA-RPEEADLFLVPVYV 169
Query: 100 TCDL-TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
+C+ TPTGLP + ++ A++L+ + PYWNR+ G DH FV HDFGACFH E+
Sbjct: 170 SCNFSTPTGLPSLKHARGLLAEAVELVRRDMPYWNRSAGTDHVFVASHDFGACFHAMEDV 229
Query: 159 AIERGILPLLQRATLVQTFG-QRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVY 217
AI GI L+R+ L+QTFG Q H C + IPP+ P+ + P + R IF +
Sbjct: 230 AIAGGIPEFLKRSILLQTFGVQGRHTCQEVEHVVIPPHVLPEVARELPEPEKSHRDIFAF 289
Query: 218 FRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLG 277
FRG + G +Y + R + + + +N F + Y +M R++FCLCPLG
Sbjct: 290 FRGKMEVHPKNMSGRFYGKKVRTKLLQLYGHNRKFYLKRKQHDGYRLEMARSLFCLCPLG 349
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
WAPWSPRLVE+V+ GCIPVIIAD+I LPF + W +I + VAE D+ L+ +L +
Sbjct: 350 WAPWSPRLVESVLLGCIPVIIADNIRLPFPGVLRWPDISLQVAERDIANLEAMLDHVAST 409
Query: 338 VILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPH 380
+ Q L +P ++A++F QP + GDA Q+L L KL H
Sbjct: 410 NLTTIQGNLWDPVKRKALVFNQPMEEGDATWQVLKELEAKLGH 452
>gi|24476038|gb|AAN62780.1| Unknown protein [Oryza sativa Japonica Group]
Length = 449
Score = 278 bits (712), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 140/359 (38%), Positives = 203/359 (56%), Gaps = 24/359 (6%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
L++YVY+LP+++N+ + D RC TH+FAAE+ +H LL+ R P++A F+ P+Y
Sbjct: 74 LRIYVYDLPARFNRHWVAADARCATHLFAAEVALHEALLAYAGRAARPDDATLFFVPVYV 133
Query: 100 TCDL-TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFH----- 153
+C+ T G P + ++ A+ L+ + PYWNR+ GADH FV HDFGACFH
Sbjct: 134 SCNFSTDNGFPSLSHARALLADAVDLVRAQMPYWNRSAGADHVFVASHDFGACFHPMELF 193
Query: 154 -----------------YQEEKAIERGILPLLQRATLVQTFG-QRNHVCLNEGSITIPPY 195
QE+ AI GI L+R+ L+QTFG Q HVC + IPP+
Sbjct: 194 VIIHFELGVNAKSNLALGQEDVAIADGIPEFLKRSILLQTFGVQGTHVCQEADHVVIPPH 253
Query: 196 APPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDIS 255
PP+ P R IF +FRG + G +Y++ R + + + N F +
Sbjct: 254 VPPEVALELPEPEKAQRDIFAFFRGKMEVHPKNISGRFYSKKVRTELLQKYGRNRKFYLK 313
Query: 256 TDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEI 315
Y +M R++FCLCPLGWAPWSPRLVE+V+ GCIPVIIADDI LPF + W +I
Sbjct: 314 RKRYGNYRSEMARSLFCLCPLGWAPWSPRLVESVLLGCIPVIIADDIRLPFPSVLQWLDI 373
Query: 316 GMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+ VAE+DV L+ +L + + Q+ L +P ++A++F +P + GDA Q+L L
Sbjct: 374 SLQVAEKDVASLEMVLDHVVATNLTVIQKNLWDPVKRKALVFNRPMEEGDATWQVLREL 432
>gi|326514612|dbj|BAJ96293.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 428
Score = 273 bits (698), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 142/349 (40%), Positives = 202/349 (57%), Gaps = 4/349 (1%)
Query: 34 DDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRT-LNPEEADW 92
+P +++Y Y+LP ++N++ D RC H+FAAE+ +H LL R L PEEAD
Sbjct: 67 SEPRPGVRIYAYDLPPRFNRRWAAADARCSRHLFAAEVAVHEALLLRQRRAGLRPEEADL 126
Query: 93 FYTPIYPTCDL-TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGAC 151
F P+Y C+ TPTGLP + ++ A+ L+ + P+WNR+ GADH FV HDFGAC
Sbjct: 127 FLVPVYACCNFSTPTGLPSLAHARGLLADAVGLVRAQMPFWNRSAGADHVFVASHDFGAC 186
Query: 152 FHYQEEKAIERGILPLLQRATLVQTFG-QRNHVCLNEGSITIPPYAPPQKMQAHQIPPDT 210
FH E+ A+ GI L+ + L+QTFG Q H C + + IPPY PP+ P+
Sbjct: 187 FHPMEDVAMAAGIPEFLKGSILLQTFGVQGRHPCQDVEHVVIPPYVPPELAPRELPEPEK 246
Query: 211 P-RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRA 269
R IF +FRG + G +Y+R R + + N F + Y +M R+
Sbjct: 247 AHRDIFAFFRGKMEVHPKNISGHFYSRKVRTELLRLYGRNRKFYLKRKRNDGYRSEMARS 306
Query: 270 IFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDT 329
+FC+CPLGWAPWSPRLVE+V+ GCIPV+IADDI LPF + W +I + VAE DV L+
Sbjct: 307 LFCICPLGWAPWSPRLVESVLLGCIPVVIADDIRLPFPGVLRWPDISLQVAERDVAGLEA 366
Query: 330 ILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKL 378
+L + + Q L +P ++A++F +P + GDA Q+L L KL
Sbjct: 367 VLDHVAATNLTTIQGNLWDPVKRKALVFNRPMEEGDATWQVLKELEAKL 415
>gi|388503508|gb|AFK39820.1| unknown [Lotus japonicus]
Length = 141
Score = 268 bits (686), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 126/141 (89%), Positives = 136/141 (96%)
Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
MQRA+FCLCPLGWAPWSPRLVEAVVFGCIPVI+ADDIVLPFADAIPWE+IG+FV EEDVP
Sbjct: 1 MQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIVADDIVLPFADAIPWEDIGVFVDEEDVP 60
Query: 326 KLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDKSTY 385
KLDTILTSIPPE+ILRKQRLLANPSMK+AMLFPQPAQPGDAFHQ+LNGLARKLPHD+S +
Sbjct: 61 KLDTILTSIPPEIILRKQRLLANPSMKQAMLFPQPAQPGDAFHQVLNGLARKLPHDRSVF 120
Query: 386 LKPGGKFLNWTSGPVGDLKPW 406
LK G K LNWT+GPVGDLKPW
Sbjct: 121 LKAGDKILNWTAGPVGDLKPW 141
>gi|388510202|gb|AFK43167.1| unknown [Medicago truncatula]
Length = 216
Score = 266 bits (679), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 125/141 (88%), Positives = 131/141 (92%)
Query: 22 ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP 81
ERISGSAGDVLEDDPVG+LKV+VYELPSKYNKK+L KDPRCLTHMFAAEIFMHRFLLSS
Sbjct: 32 ERISGSAGDVLEDDPVGRLKVFVYELPSKYNKKILQKDPRCLTHMFAAEIFMHRFLLSSA 91
Query: 82 VRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 141
VRTLNPEEADWFYTP+Y TCDLTP GLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF
Sbjct: 92 VRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 151
Query: 142 FVVPHDFGACFHYQEEKAIER 162
FVVPHDFGACF + E +R
Sbjct: 152 FVVPHDFGACFLPRREGNRKR 172
>gi|255633862|gb|ACU17292.1| unknown [Glycine max]
Length = 141
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 126/141 (89%), Positives = 136/141 (96%)
Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
MQRA+FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG+FV E+DVP
Sbjct: 1 MQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVDEKDVP 60
Query: 326 KLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDKSTY 385
+LDTILTSIPPEVILRKQRLLANP MK+AMLFPQPAQPGDAFHQ+LNGLARKLPHD+S +
Sbjct: 61 QLDTILTSIPPEVILRKQRLLANPFMKQAMLFPQPAQPGDAFHQVLNGLARKLPHDRSVF 120
Query: 386 LKPGGKFLNWTSGPVGDLKPW 406
LKPG + LNWT+GPVGDLKPW
Sbjct: 121 LKPGEEALNWTAGPVGDLKPW 141
>gi|255543226|ref|XP_002512676.1| conserved hypothetical protein [Ricinus communis]
gi|223548637|gb|EEF50128.1| conserved hypothetical protein [Ricinus communis]
Length = 145
Score = 241 bits (614), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 116/138 (84%), Positives = 124/138 (89%), Gaps = 3/138 (2%)
Query: 2 WVTAIALLLAFVS---ADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLK 58
W+ + L+ F++ A P+ ERISGSAGDVLEDDPVG+LKVYVYELPSKYNKKLL K
Sbjct: 6 WIFLLLLVSCFINRTFAHQKPRTERISGSAGDVLEDDPVGRLKVYVYELPSKYNKKLLQK 65
Query: 59 DPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMM 118
DPRCLTHMFAAEIFMHRFLLSSPVRTLNP+EADWFYTPIY TCDLTPTGLPLPFKSPRMM
Sbjct: 66 DPRCLTHMFAAEIFMHRFLLSSPVRTLNPDEADWFYTPIYTTCDLTPTGLPLPFKSPRMM 125
Query: 119 RSAIQLISSNWPYWNRTE 136
RSAIQLISSNWPYWNRTE
Sbjct: 126 RSAIQLISSNWPYWNRTE 143
>gi|289166880|gb|ADC84490.1| glycosyltransferase family 47C [Salix miyabeana]
Length = 252
Score = 223 bits (569), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 105/244 (43%), Positives = 153/244 (62%), Gaps = 3/244 (1%)
Query: 117 MMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQT 176
++ SA+ LISSN P+WNR+ G+DH FV HD+GACFH EE+A E GI L+R+ ++QT
Sbjct: 5 LLSSAVNLISSNHPFWNRSRGSDHVFVASHDYGACFHAMEERAAEDGIPEFLKRSIILQT 64
Query: 177 FGQR-NHVCLNEGSITIPPYAPPQKMQA--HQIPPDTPRSIFVYFRGLFYDVNNDPEGGY 233
FG + +H C + ++ IPP+ P+ +Q + P R I+V+FRG + G Y
Sbjct: 65 FGVKFDHPCQDVENVVIPPFITPESVQTTLEKYPLTGRRDIWVFFRGKMEVHPKNISGRY 124
Query: 234 YARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGC 293
Y++ R +W + +P F + Y ++ R++FCLCPLGWAPWSPRLVE++ GC
Sbjct: 125 YSKKVRTVIWRKYSGDPRFYLRRHRFAGYQSEIARSVFCLCPLGWAPWSPRLVESIALGC 184
Query: 294 IPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKR 353
+PVIIAD I LPF A+ W +I + VAE+DV L T+L + + Q+ L P ++R
Sbjct: 185 VPVIIADGIRLPFPAAVRWSDISLTVAEKDVADLGTLLDHVAASNLSAIQKNLWAPDVRR 244
Query: 354 AMLF 357
A+LF
Sbjct: 245 ALLF 248
>gi|289166878|gb|ADC84489.1| glycosyltransferase family 47C [Salix sachalinensis]
Length = 252
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 105/244 (43%), Positives = 153/244 (62%), Gaps = 3/244 (1%)
Query: 117 MMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQT 176
++ SA+ LISSN P+WNR+ G+DH FV HD+GACFH EE+A E GI L+R+ ++QT
Sbjct: 5 LLSSAVNLISSNHPFWNRSRGSDHVFVASHDYGACFHAMEERAAEDGIPEFLKRSIILQT 64
Query: 177 FGQR-NHVCLNEGSITIPPYAPPQKMQA--HQIPPDTPRSIFVYFRGLFYDVNNDPEGGY 233
FG + +H C + ++ IPP+ P+ +Q + P R I+V+FRG + G Y
Sbjct: 65 FGVKFDHPCQDVENVVIPPFITPESVQTTLEKYPLTGRRDIWVFFRGKMEVHPKNISGRY 124
Query: 234 YARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGC 293
Y++ R +W + +P F + Y ++ R++FCLCPLGWAPWSPRLVE++ GC
Sbjct: 125 YSKKVRTVIWRKYSGDPRFYLRRHRFAGYQSEIARSVFCLCPLGWAPWSPRLVESIALGC 184
Query: 294 IPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKR 353
+PVIIAD I LPF A+ W +I + VAE+DV L T+L + + Q+ L P ++R
Sbjct: 185 VPVIIADGIRLPFPAAVRWSDISLTVAEKDVADLRTLLDHVAASNLSAIQKNLWAPDVRR 244
Query: 354 AMLF 357
A+LF
Sbjct: 245 ALLF 248
>gi|62319307|dbj|BAD94554.1| hypothetical protein [Arabidopsis thaliana]
Length = 114
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 99/114 (86%), Positives = 108/114 (94%)
Query: 293 CIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMK 352
CIPVIIADDIVLPFADAIPWE+IG+FV E+DVP LDTILTSIPPEVILRKQRLLANPSMK
Sbjct: 1 CIPVIIADDIVLPFADAIPWEDIGVFVDEKDVPYLDTILTSIPPEVILRKQRLLANPSMK 60
Query: 353 RAMLFPQPAQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSGPVGDLKPW 406
+AMLFPQPAQPGDAFHQ+LNGLARKLPH++S YL+PG K LNWT+GPV DLKPW
Sbjct: 61 QAMLFPQPAQPGDAFHQVLNGLARKLPHERSVYLRPGEKLLNWTAGPVADLKPW 114
>gi|118481039|gb|ABK92473.1| unknown [Populus trichocarpa]
Length = 239
Score = 189 bits (479), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 96/237 (40%), Positives = 138/237 (58%), Gaps = 3/237 (1%)
Query: 156 EEKAIERGILPLLQRATLVQTFGQR-NHVCLNEGSITIPPYAPPQKMQA--HQIPPDTPR 212
EE+A+E GI L+R+ ++QTFG + NH C + ++ IPPY P++++ P + R
Sbjct: 2 EERAMEDGIPEFLKRSIILQTFGVKFNHPCQDVENVVIPPYISPERVRTTLENYPLNGRR 61
Query: 213 SIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFC 272
I+ +FRG + G YY++ R +W + + F + Y ++ R++FC
Sbjct: 62 DIWAFFRGKMEVHPKNISGRYYSKKVRTVIWRKYSGDRRFYLQRHRFAGYQSEIVRSVFC 121
Query: 273 LCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILT 332
LCPLGWAPWSPRLVE+V GC+PVIIAD I LPF A+ W EI + VAE+DV L T+L
Sbjct: 122 LCPLGWAPWSPRLVESVALGCVPVIIADGIRLPFPTAVRWSEISLTVAEKDVANLGTLLD 181
Query: 333 SIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDKSTYLKPG 389
+ + Q+ L +P ++RA+LF Q GDA Q+L LARKL T P
Sbjct: 182 HVAATNLSAIQKNLWDPDVRRALLFNDRVQEGDATWQVLYALARKLDRSYRTVRLPN 238
>gi|412991334|emb|CCO16179.1| predicted protein [Bathycoccus prasinos]
Length = 558
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 113/361 (31%), Positives = 185/361 (51%), Gaps = 32/361 (8%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
K++VY LP K++ ++L K+ RC+T + EI +H ++ S + TL+P EA++FY P+Y
Sbjct: 208 FKIFVYNLPPKFHVEMLKKNKRCVTDQYGTEIRIHANIMQSKMYTLDPLEAEFFYVPVYG 267
Query: 100 TCDLTPTGLPLPFKSPRMMRSA-----IQLISSNWPYWNRTEGADHFFVVPHDFGACFHY 154
C L L K +A ++L++ +P+WNRT+G DH F G
Sbjct: 268 ECKLFENIATLGAKKGLQETNAWWLEAMKLVTDQYPFWNRTQGRDHVFTFAGARGPHIFK 327
Query: 155 QEEKAIERGIL--PLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQI--PPDT 210
++ I++ I P R +L + F I IP P + + + +
Sbjct: 328 DWKRHIKKSIFLTPEGDR-SLSEQFNTWK-------DIVIPGLEPEKAFWSGSLRKQKEV 379
Query: 211 PRS-IFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP----TTYYED 265
R+ F YFRG + G Y++G R + E FK+ + H T Y E+
Sbjct: 380 KRAKTFAYFRGTI----ANKLGKQYSKGIRIKMKEAFKDIKDVVFTEQHSSCDKTCYREE 435
Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
M+ + FCLCP GW+PW+ R +A++ GCIPVIIAD+I P+ ++ W ++ + + E+
Sbjct: 436 MRASTFCLCPRGWSPWTLRAYQALMVGCIPVIIADEIEFPYENSFDWRQVSIKIPEKRHL 495
Query: 326 KLDTILTSIPPEVILRKQRLLAN--PSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDKS 383
+ IL S+P +V+ RK++ +A PS + + +PA DAFH ++ L RK KS
Sbjct: 496 ETIDILRSVPDDVVERKRKAMAKFWPS----VAWKKPAADDDAFHLVMKELERKKRGFKS 551
Query: 384 T 384
+
Sbjct: 552 S 552
>gi|293336592|ref|NP_001167834.1| uncharacterized protein LOC100381534 [Zea mays]
gi|223944319|gb|ACN26243.1| unknown [Zea mays]
Length = 241
Score = 177 bits (449), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 94/238 (39%), Positives = 136/238 (57%), Gaps = 3/238 (1%)
Query: 156 EEKAIERGILPLLQRATLVQTFG-QRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSI 214
E+ AI GI L+R+ L+QTFG Q +HVC + IPP+ PP+ P R I
Sbjct: 2 EDVAIADGIPEFLKRSILLQTFGVQGHHVCQEVEHVVIPPHVPPEVAHELPEPEKAQRDI 61
Query: 215 FVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLC 274
F +FRG + G +Y++ R + +++ N F + Y +M R++FCLC
Sbjct: 62 FAFFRGKMEVHPKNISGRFYSKKVRTELLQHYGRNRKFYLKRKRFDNYRSEMARSLFCLC 121
Query: 275 PLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSI 334
PLGWAPWSPRLVE+V+ GCIPVIIADDI LPF + W+EI + VAE+D+ L +L +
Sbjct: 122 PLGWAPWSPRLVESVLLGCIPVIIADDIRLPFPPVLQWQEISLQVAEKDIASLGMVLDHV 181
Query: 335 PPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDKSTYLKPGGKF 392
+ Q+ L +P +RA++F +P + GDA Q+L L L D+S + G +
Sbjct: 182 VATNLTVIQKNLWDPVKRRALVFNRPMEAGDATWQVLRELEVLL--DQSRRRRSVGSY 237
>gi|388496634|gb|AFK36383.1| unknown [Medicago truncatula]
Length = 316
Score = 172 bits (435), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 84/217 (38%), Positives = 136/217 (62%), Gaps = 5/217 (2%)
Query: 23 RISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPV 82
+ + S+ L+ + LK++VY+LP KYNK L K+PRC TH+FA+E+ +HR LL+S V
Sbjct: 90 KKTTSSPQTLQKTSLKNLKIFVYDLPPKYNKNWL-KNPRCKTHLFASEVAIHRALLTSDV 148
Query: 83 RTLNPEEADWFYTPIYPTCDL-TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHF 141
RT +P EAD+F+ P+Y +C+ T G P + ++ SA++LIS+ +P+WNR+ G+DH
Sbjct: 149 RTFDPYEADFFFVPVYVSCNFSTVNGFPAIGHARSLISSAVKLISTEYPFWNRSTGSDHV 208
Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG-QRNHVCLNEGSITIPPYAPPQK 200
FV HDFG+CFH E+ A++ G+ + + + ++QTFG +H C + IPP+ P+
Sbjct: 209 FVASHDFGSCFHTLEDVAMKDGVPEITKNSIVLQTFGVTYDHPCQKVEHVVIPPFVSPES 268
Query: 201 MQA--HQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYA 235
++ P + R I+V+FRG + G +Y+
Sbjct: 269 VRNTLENFPVNGRRDIWVFFRGKMEVHPKNVSGRFYS 305
>gi|412993472|emb|CCO13983.1| predicted protein [Bathycoccus prasinos]
Length = 614
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 110/352 (31%), Positives = 175/352 (49%), Gaps = 25/352 (7%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
K++VY+L ++N L PRC T + EI H LL V T +PEEA++F+ PIY
Sbjct: 260 FKIFVYDLKPEFNADLARDQPRCRTDQYGTEIRFHENLLHHSVLTNDPEEAEFFFVPIYG 319
Query: 100 TC----DLTPTGLPLPFKSPRM-MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHY 154
C + +G K + R A++ I + +PYWNRT+G DH + P G
Sbjct: 320 ECYLFRETQNSGTNNAMKVTNLWYRDALKTIQTEYPYWNRTDGRDHVWSFPGARGPHIFR 379
Query: 155 QEEKAIERGIL--PLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTP- 211
+K I++ I P R +FG++ + + I IP P + ++ +
Sbjct: 380 DWKKLIKKSIFLTPEGDR-----SFGEQFNTWKD---IVIPGLEPDSEFIDGKLRKQSSL 431
Query: 212 -RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT----YYEDM 266
+ IF +FRG + N Y+RG R + FK + + + P+ Y +++
Sbjct: 432 KKDIFAFFRGT---ILNKAGILAYSRGIRPKMEAAFKKHKDVIFTEEIPSCDRDCYRKEL 488
Query: 267 QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK 326
+++ FCLCP GW+PW+ R +A++ GCIPVIIAD+I LP+ +++ W ++ + +AE D K
Sbjct: 489 RKSTFCLCPRGWSPWTLRAYQAMMVGCIPVIIADEIELPYENSLDWTKLSVKIAEVDAEK 548
Query: 327 LDTILTSIPPEVILRKQRLLANPSMKRAM-LFPQPAQPGDAFHQILNGLARK 377
IL I I KQ+ + A P+ P DA +L+ L RK
Sbjct: 549 TIDILKQISKSEIRNKQKAIEKVWKSVAWGSNPKKLDPMDAMECVLHELGRK 600
>gi|297598226|ref|NP_001045264.2| Os01g0926700 [Oryza sativa Japonica Group]
gi|255674019|dbj|BAF07178.2| Os01g0926700, partial [Oryza sativa Japonica Group]
Length = 113
Score = 161 bits (407), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 75/83 (90%), Positives = 79/83 (95%)
Query: 154 YQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRS 213
+QEEKAI RGILPLLQRATLVQTFGQ+NHVCL +GSITIPPYAPPQKMQAH IPPDTPRS
Sbjct: 31 FQEEKAIGRGILPLLQRATLVQTFGQKNHVCLKDGSITIPPYAPPQKMQAHLIPPDTPRS 90
Query: 214 IFVYFRGLFYDVNNDPEGGYYAR 236
IFVYFRGLFYD +NDPEGGYYAR
Sbjct: 91 IFVYFRGLFYDTSNDPEGGYYAR 113
>gi|428181360|gb|EKX50224.1| hypothetical protein GUITHDRAFT_104038 [Guillardia theta CCMP2712]
Length = 723
Score = 150 bits (380), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 111/385 (28%), Positives = 189/385 (49%), Gaps = 67/385 (17%)
Query: 42 VYVYELPSKYNKKLL-LKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPT 100
V+VY++ ++N +L L +C ++AAE+++H LL S TL+P AD F+ P+Y
Sbjct: 346 VFVYDIHPQWNSAMLSLNMQQCRNSIYAAEVYIHEQLLLSDSLTLDPGAADLFFIPLYAA 405
Query: 101 CDLTPTGL-PLPF---------KSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGA 150
C L+ + P P K+ + ++ ++ + +P+++R+ GADH V+ D+G+
Sbjct: 406 CFLSSHFVRPGPGWPDNDVDIGKTYQAVQLVLEHVRQTYPFFDRSAGADHVLVLSSDWGS 465
Query: 151 CFHYQEEKAIERGILPLLQRATLVQTFGQR-------------NHVCLNE---------- 187
C +G L + L+ T G R +H+ +E
Sbjct: 466 C----------QGPFLELHNSILLVTSGDRTLVRPAWYAARAADHMGSSEEFAVRSRLPC 515
Query: 188 ----GSITIPPYAPPQKMQAHQIPPDT-PRSIFVYFRG--------LFYDVNNDPEGGYY 234
+ IPP P + A + T R I VYFRG L Y N D Y
Sbjct: 516 FQLFKDVVIPPLVPHPALTASYMGERTRGRDILVYFRGTAAGSVKALLY--NKD-----Y 568
Query: 235 ARGARAAVWENFKNNPLFDISTD-HPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGC 293
+ G R + + + +S + ++Y++++ R++FCL P GW WS R EA++ GC
Sbjct: 569 SLGIRQLLLRRYSRVRGWVVSDRINSSSYHDELLRSVFCLAPAGWELWSVRFFEAILLGC 628
Query: 294 IPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKR 353
IPV++ DD+ LPF + + + V + + +L++IL+SI VI RKQ L + +
Sbjct: 629 IPVLLTDDVQLPFQQRLDYSRFTVKVEQRRILELESILSSINETVIRRKQEGL--KEVWK 686
Query: 354 AMLFPQPAQPGDAFHQILNGLARKL 378
M + +P + GDAF I++ LAR++
Sbjct: 687 RMTYQRPPEDGDAFTGIMDELARRV 711
>gi|361069335|gb|AEW08979.1| Pinus taeda anonymous locus CL2637Contig1_04 genomic sequence
gi|361069337|gb|AEW08980.1| Pinus taeda anonymous locus CL2637Contig1_04 genomic sequence
gi|376339236|gb|AFB34142.1| hypothetical protein CL2637Contig1_04, partial [Pinus cembra]
gi|376339238|gb|AFB34143.1| hypothetical protein CL2637Contig1_04, partial [Pinus cembra]
gi|376339240|gb|AFB34144.1| hypothetical protein CL2637Contig1_04, partial [Pinus cembra]
gi|376339242|gb|AFB34145.1| hypothetical protein CL2637Contig1_04, partial [Pinus cembra]
gi|376339244|gb|AFB34146.1| hypothetical protein CL2637Contig1_04, partial [Pinus mugo]
gi|376339246|gb|AFB34147.1| hypothetical protein CL2637Contig1_04, partial [Pinus mugo]
gi|376339248|gb|AFB34148.1| hypothetical protein CL2637Contig1_04, partial [Pinus mugo]
gi|376339250|gb|AFB34149.1| hypothetical protein CL2637Contig1_04, partial [Pinus mugo]
gi|383157350|gb|AFG60998.1| Pinus taeda anonymous locus CL2637Contig1_04 genomic sequence
gi|383157351|gb|AFG60999.1| Pinus taeda anonymous locus CL2637Contig1_04 genomic sequence
gi|383157352|gb|AFG61000.1| Pinus taeda anonymous locus CL2637Contig1_04 genomic sequence
gi|383157353|gb|AFG61001.1| Pinus taeda anonymous locus CL2637Contig1_04 genomic sequence
gi|383157354|gb|AFG61002.1| Pinus taeda anonymous locus CL2637Contig1_04 genomic sequence
gi|383157355|gb|AFG61003.1| Pinus taeda anonymous locus CL2637Contig1_04 genomic sequence
gi|383157356|gb|AFG61004.1| Pinus taeda anonymous locus CL2637Contig1_04 genomic sequence
gi|383157357|gb|AFG61005.1| Pinus taeda anonymous locus CL2637Contig1_04 genomic sequence
gi|383157358|gb|AFG61006.1| Pinus taeda anonymous locus CL2637Contig1_04 genomic sequence
gi|383157359|gb|AFG61007.1| Pinus taeda anonymous locus CL2637Contig1_04 genomic sequence
gi|383157360|gb|AFG61008.1| Pinus taeda anonymous locus CL2637Contig1_04 genomic sequence
gi|383157361|gb|AFG61009.1| Pinus taeda anonymous locus CL2637Contig1_04 genomic sequence
gi|383157362|gb|AFG61010.1| Pinus taeda anonymous locus CL2637Contig1_04 genomic sequence
gi|383157363|gb|AFG61011.1| Pinus taeda anonymous locus CL2637Contig1_04 genomic sequence
gi|383157364|gb|AFG61012.1| Pinus taeda anonymous locus CL2637Contig1_04 genomic sequence
gi|383157365|gb|AFG61013.1| Pinus taeda anonymous locus CL2637Contig1_04 genomic sequence
gi|383157366|gb|AFG61014.1| Pinus taeda anonymous locus CL2637Contig1_04 genomic sequence
gi|383157367|gb|AFG61015.1| Pinus taeda anonymous locus CL2637Contig1_04 genomic sequence
Length = 79
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 68/79 (86%), Positives = 75/79 (94%)
Query: 37 VGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP 96
VG+LKV++YELPSKYN+K+L KDPRCLTHMFA EIFM+RFLLSSPVRTLNPEEADWFYTP
Sbjct: 1 VGRLKVFIYELPSKYNRKILAKDPRCLTHMFATEIFMNRFLLSSPVRTLNPEEADWFYTP 60
Query: 97 IYPTCDLTPTGLPLPFKSP 115
+Y TCDLTP GLPLPFKSP
Sbjct: 61 VYTTCDLTPNGLPLPFKSP 79
>gi|376339230|gb|AFB34139.1| hypothetical protein CL2637Contig1_04, partial [Abies alba]
gi|376339232|gb|AFB34140.1| hypothetical protein CL2637Contig1_04, partial [Abies alba]
gi|376339234|gb|AFB34141.1| hypothetical protein CL2637Contig1_04, partial [Abies alba]
Length = 79
Score = 148 bits (373), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 67/79 (84%), Positives = 75/79 (94%)
Query: 37 VGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP 96
VG+LKV++YELPSKYN+K+L KDPRCLTHMFA EIFM+RFLL+SPVRTLNPEEADWFYTP
Sbjct: 1 VGRLKVFIYELPSKYNRKILAKDPRCLTHMFATEIFMNRFLLASPVRTLNPEEADWFYTP 60
Query: 97 IYPTCDLTPTGLPLPFKSP 115
+Y TCDLTP GLPLPFKSP
Sbjct: 61 VYTTCDLTPNGLPLPFKSP 79
>gi|412994079|emb|CCO14590.1| predicted protein [Bathycoccus prasinos]
Length = 553
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 103/364 (28%), Positives = 166/364 (45%), Gaps = 22/364 (6%)
Query: 29 GDVLEDDPVGKL-KVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP--VRTL 85
+V+E++ KV+VY++P + +L + RC + EI+ HR VRT+
Sbjct: 186 ANVIEEEESSSFPKVFVYDVPKQLTSELAKRYGRCERDQYGTEIWFHRNFRDDKNGVRTM 245
Query: 86 NPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSA-------IQLISSNWPYWNRTEGA 138
NPEEAD F+ P Y C L + + M + + PY+NRT+G
Sbjct: 246 NPEEADLFFVPQYGECFLWSREMLRHENQGQAMEETNEYFLEVLSHVKGKLPYFNRTDGR 305
Query: 139 DHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPP 198
DH FV G +K I I + + F + + Y
Sbjct: 306 DHIFVFAGARGPTIFRDWQKEIPHSIYLTPEGDRTLPQFDTWKDIVIPGLEYDKRMYLEE 365
Query: 199 QKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN--NPLFD--I 254
+ + PP R I FRG + P G Y++G R + + F+N + ++D I
Sbjct: 366 HRNELVTNPP--KRKILAMFRGTI----DHPAGFAYSKGLRPKLKKIFQNATDVIYDTKI 419
Query: 255 STDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEE 314
Y +M ++FCL PLGW PW+ R +AV+ CIP+IIAD+I PF I + E
Sbjct: 420 KDCDRDCYVREMTESVFCLNPLGWTPWTLRFYQAVMTRCIPIIIADNIEFPFESEINYSE 479
Query: 315 IGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+ + E+DV + + +P E R++R + + + + +PA+ GDA++ + L
Sbjct: 480 FALKIPEKDVSDILETMRHMPEEERERRRRYM--DKIWKQFTYQRPAEIGDAYYSTVKEL 537
Query: 375 ARKL 378
ARK+
Sbjct: 538 ARKV 541
>gi|224142563|ref|XP_002324625.1| predicted protein [Populus trichocarpa]
gi|222866059|gb|EEF03190.1| predicted protein [Populus trichocarpa]
Length = 336
Score = 134 bits (337), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 92/309 (29%), Positives = 151/309 (48%), Gaps = 29/309 (9%)
Query: 79 SSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKS----PRMMRSAIQLISSNWPYWNR 134
SP +P+EA F PI ++ P+ S R+++ + +++ +PYWNR
Sbjct: 41 KSPFAASHPDEAHMFLLPISVAYIISYVYKPIVTYSRDELQRLVQDYVGVVADKYPYWNR 100
Query: 135 TEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPP 194
++GADHF V HD+ + + + +L A + F R V + E +I
Sbjct: 101 SKGADHFLVSCHDWAPDIS-GANPDLYKNFIRVLCNANTSERFEPRRDVSIPEINIPNGK 159
Query: 195 YAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN----P 250
PP K +PP + RSIF +F G + GY R + EN+K+
Sbjct: 160 LGPPHK----GLPP-SKRSIFAFFAGGAH--------GY----IRKVLLENWKDKDDEIQ 202
Query: 251 LFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAI 310
+ + T Y+E M ++ FCLCP G+ SPR+V A+ GC+PV I+D+ LPF+D +
Sbjct: 203 VHEYLDKKGTDYFELMGQSKFCLCPSGYEVASPRVVTAIQLGCVPVTISDNYTLPFSDVL 262
Query: 311 PWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQI 370
W + + + E +P++ TIL I P+ L Q + ++R +PA+P D H +
Sbjct: 263 DWSKFSVHIPSEKIPEIKTILKKISPQRYLMMQMRVIQ--VQRHFELNRPARPYDLLHML 320
Query: 371 LNGL-ARKL 378
L+ + R+L
Sbjct: 321 LHSVWVRRL 329
>gi|224120362|ref|XP_002331029.1| predicted protein [Populus trichocarpa]
gi|222872959|gb|EEF10090.1| predicted protein [Populus trichocarpa]
Length = 336
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 94/305 (30%), Positives = 145/305 (47%), Gaps = 29/305 (9%)
Query: 79 SSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKS----PRMMRSAIQLISSNWPYWNR 134
SP +P+EA F PI + PL S R+++ + +++ +PYWNR
Sbjct: 41 KSPFAASHPDEAHTFLLPISVAYIIHYIYRPLVTFSRVELQRLVQDYVTVVAGKYPYWNR 100
Query: 135 TEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPP 194
TEGADHF V HD+ + + + + +L A + F R V + E +I
Sbjct: 101 TEGADHFLVSCHDWAPDISRANPR-LYKNFIRVLCNANTSERFEPRRDVSIPEINIPFGK 159
Query: 195 YAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGY-YARGARAAVWENFKNNPLFD 253
+ PP K +PP + RSIF +F G G + Y R W++ +
Sbjct: 160 FGPPGK----GLPP-SKRSIFAFFAG----------GAHGYIRKLLLEHWKDKDDEIQVH 204
Query: 254 ISTDH--PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
DH Y++ M ++ FCLCP G+ SPR+V A+ GCIPV I+D+ LPF+D +
Sbjct: 205 EYLDHNKKNDYFKLMGQSKFCLCPSGYEVASPRVVTAIQSGCIPVTISDNYTLPFSDVLD 264
Query: 312 WEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMK--RAMLFPQPAQPGDAFHQ 369
W + + + E +P++ TIL I R+ +L +K R +PAQP D H
Sbjct: 265 WSKFSVNIPSEKIPEIKTILKKIS----FRRYLILQGRVIKIRRHFKLNRPAQPYDMLHM 320
Query: 370 ILNGL 374
IL+ +
Sbjct: 321 ILHSI 325
>gi|224143183|ref|XP_002324873.1| predicted protein [Populus trichocarpa]
gi|222866307|gb|EEF03438.1| predicted protein [Populus trichocarpa]
Length = 335
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 92/299 (30%), Positives = 137/299 (45%), Gaps = 20/299 (6%)
Query: 80 SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSA----IQLISSNWPYWNRT 135
SP NP+EA F+ P+ L P S + ++ I LISS +PYWNR+
Sbjct: 42 SPFSAKNPDEALAFFIPVSIASILHFIYRPYVTYSRKQIQDIAEDYIGLISSKYPYWNRS 101
Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
GADHF + HD+ + R + +L A + F V L E +
Sbjct: 102 SGADHFMISCHDWAPDVS-AANPDLYRNFIRVLCNANSSEGFKPARDVSLPEFKLPRGKL 160
Query: 196 APPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDIS 255
P +Q P D RSI +F G G R W+ N+
Sbjct: 161 EPEHILQ----PCDNNRSILAFFAG-------GSHGS--VRKILFKHWKEKDNDIQVYKY 207
Query: 256 TDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEI 315
Y E M ++ +CLCP GW SPR+VEA+ GC+PVII+D VLPF+D + W +
Sbjct: 208 LPETLNYTEQMSKSRYCLCPSGWEVASPRVVEAIYSGCVPVIISDYYVLPFSDVLDWIKF 267
Query: 316 GMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+ + +P++ TIL SIP E L KQ+ + +++ +PA+P D H +++ +
Sbjct: 268 SVHIPVSGIPEIKTILQSIPVEEYLEKQKRVLQ--VQQHFKLHRPAKPFDVVHMVMHSV 324
>gi|356565368|ref|XP_003550913.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 534
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 107/351 (30%), Positives = 171/351 (48%), Gaps = 43/351 (12%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAE-IFMHRFLLSSPVRTLNPEEADWFYTP-- 96
LKVY+Y S+ + K P L ++A+E FM + T +PE+A FY P
Sbjct: 198 LKVYIYRDGSR---PIFHKPP--LKGIYASEGWFMKLMEENKQFVTKDPEKAHLFYLPYS 252
Query: 97 ---------IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHD 147
+ + DL P + +R + I++ +P+WNRT+G+DHF V HD
Sbjct: 253 ARQMGLTLYVPGSHDLKPLSI--------FLRDYVNKIAAKYPFWNRTQGSDHFLVACHD 304
Query: 148 FGACFHYQEEKAIERGILPLLQRATLVQ-TFGQRNHVCLNEGSITIPPYAPPQKMQAHQI 206
+G + + ++R + L A L + F V L E +I P P + + +++
Sbjct: 305 WGP-YTVTGHEELKRNTIKALCNADLSEGVFVAGRDVSLPETTIR-APRRPLRYLGGNRV 362
Query: 207 PPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM 266
+ R I +F G + P Y G + + +K PL TY + M
Sbjct: 363 ---SLRPILAFFAGSMHG-RVRPTLLTYWGGGKDEDMKIYKRLPL---RVSQRMTYIQHM 415
Query: 267 QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK 326
+ + +C+CP+G+ SPR+VEA+ + C+PVIIAD+ VLPF++ + W + VAE+D+P+
Sbjct: 416 KSSKYCVCPMGFEVNSPRIVEAIYYECVPVIIADNFVLPFSEVLDWSAFSVVVAEKDIPR 475
Query: 327 LDTILTSIPPEVILRKQRLLANP---SMKRAMLFPQPAQPGDAFHQILNGL 374
L IL SIP LRK + N K + P+P + D FH IL+ +
Sbjct: 476 LKEILLSIP----LRKYLTMQNNVKMVQKHFLWNPRPIR-YDLFHMILHSI 521
>gi|357476811|ref|XP_003608691.1| hypothetical protein MTR_4g100730 [Medicago truncatula]
gi|355509746|gb|AES90888.1| hypothetical protein MTR_4g100730 [Medicago truncatula]
Length = 535
Score = 131 bits (330), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 107/351 (30%), Positives = 167/351 (47%), Gaps = 43/351 (12%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAE-IFMHRFLLSSPVRTLNPEEADWFYTPIY 98
LKVY+Y S+ + +P L ++A+E FM + T +PE A FY P
Sbjct: 204 LKVYIYRDGSR----PIFHNP-SLKGIYASEGWFMKLMQENKQFVTKDPERAHLFYLPYS 258
Query: 99 P-----------TCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHD 147
+ DL P + +R + I++ +P+WNRT G+DHF V HD
Sbjct: 259 ARQMEVTLYVPGSHDLKPLSI--------FLRDYVNKIAAKYPFWNRTHGSDHFLVACHD 310
Query: 148 FGACFHYQEEKAIERGILPLLQRATLVQ-TFGQRNHVCLNEGSITIPPYAPPQKMQAHQI 206
+G + E + + R L L A L + F + V L E +I P P + + ++
Sbjct: 311 WGP-YTVTEHEELARNTLKALCNADLSERIFIEGRDVSLPETTIR-APRRPLRYLGGNRA 368
Query: 207 PPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM 266
+ R I +F G + P Y G + + +K PL TY + M
Sbjct: 369 ---SLRPILAFFAGSMHG-RVRPTLLKYWGGEKYEDMKIYKRLPL---RVSKKMTYIQHM 421
Query: 267 QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK 326
+ + +CLCP+G+ SPR+VEA+ + C+PVIIAD+ VLP ++ + W + VAE+D+P+
Sbjct: 422 KSSKYCLCPMGFEVNSPRIVEAIYYECVPVIIADNFVLPLSEVLDWSAFSVVVAEKDIPR 481
Query: 327 LDTILTSIPPEVILRKQRLLANP---SMKRAMLFPQPAQPGDAFHQILNGL 374
L IL SIP +RK + N K + P+P + D FH IL+ +
Sbjct: 482 LKDILLSIP----MRKYVAMQNNVKMVQKHFLWNPKPIR-YDLFHMILHSI 527
>gi|302829555|ref|XP_002946344.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300268090|gb|EFJ52271.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 785
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 99/374 (26%), Positives = 163/374 (43%), Gaps = 73/374 (19%)
Query: 42 VYVYELPSKYNKKLL----------------LKDPRCLTHMFAAEIFMHRFLLSSPVRTL 85
+YVY++P Y+ ++L D L+ ++ E+++H +L S RT
Sbjct: 342 IYVYDMPPAYHSRMLQYRIGSDACMWRRFSEANDTYLLSMTYSVEVYLHEMMLQSEHRTF 401
Query: 86 NPEEADWFYTPIYPTCDL-------------TPTGLPLPFKSPRMMRSAIQLISSNWPYW 132
+PEEAD+FY P+Y TC + P P M+ A + +S+ +P+W
Sbjct: 402 DPEEADFFYVPMYITCFMWPVMGWADFPWWHAPLAHTRPMHVSNMILEAYEWLSTTFPWW 461
Query: 133 NRTEGADHFFVVPHDFGACF--------------------HYQEEKAIERGI----LPLL 168
NR G DH +++ D GAC+ ++ A ++ I LP+
Sbjct: 462 NRRGGRDHIWLMAPDEGACYMPTVVYNSSIILTHWGRMDPDHKSGSAFDQDIYDKDLPVA 521
Query: 169 QRA---TLVQTFGQRNHVCLN-EGSITIPPYAPPQKMQAHQI--PPDTPRSIFVYFRGLF 222
Q L R H+C N E + IP + P Q + P R I +YFRG
Sbjct: 522 QFKGWRGLDWMEKSRPHLCYNPEKDLVIPAFKSPDHFQESPLLGAPPLERDILLYFRGDV 581
Query: 223 YDVNNDPEGGYYARGARA-----AVWENFKNNPLFDISTDHPT--TYYEDMQRAIFCLCP 275
+ D +Y+RG R A W + I T +Y E + R+ FCL
Sbjct: 582 GEGRRD----HYSRGIRQKLFQFAHWGKWAEKYKIYIGTGETIGGSYSEHLARSKFCLVA 637
Query: 276 LGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED--VPKLDTILTS 333
G WS R +A++ GC+P+++ D + F + W+ + + E++ + + +LT+
Sbjct: 638 PG-DGWSARAEDAILHGCVPLVVMDGVHAVFESILDWDSFSIRIREDNQALQAIPELLTA 696
Query: 334 IPPEVILRKQRLLA 347
I PE + + QR LA
Sbjct: 697 ISPERLAKMQRNLA 710
>gi|320164161|gb|EFW41060.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 409
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 92/327 (28%), Positives = 146/327 (44%), Gaps = 61/327 (18%)
Query: 39 KLKVYVYELPSKYNKKLL---------LKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEE 89
L+++VY LP+++N +++ ++DP C T +++E H+ LL SP+RTLNP +
Sbjct: 24 SLRIFVYPLPAEFNTRVIEHNLAHPPDMRDPVCTTSFYSSEWAFHQLLLDSPLRTLNPRD 83
Query: 90 ADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSN---------WPY------WNR 134
AD++Y P+Y TC P S + +A+ I+S+ W Y WN
Sbjct: 84 ADYYYVPVYGTCHGFNRMAVQPNASAELFSAALDWITSHGSIPRDTLPWRYDPYSPDWNS 143
Query: 135 --------TEG---------ADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTF 177
T G DH ++ GA G ++ A +
Sbjct: 144 LGTIEQVATRGEYPPFPAFAQDHLWLFSQGHGAKLF---------GDYSRIKNAVFLTAN 194
Query: 178 GQRNHVCLN-EGSITIPP----YAPP-----QKMQAHQIPPDTPRSIFVYFRGLFYDV-N 226
GQ + +TIPP Y P + + ++ R F G
Sbjct: 195 GQLSAAEFTLAKDVTIPPRLTHYVPTPIYANKSVDELEVILTGQRPTLACFGGTKLPCFV 254
Query: 227 NDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLV 286
ND G ++RG R + E F +P F I + Y + ++ + FCLCP GW W+PR+
Sbjct: 255 NDARGSCHSRGVRPYLKETFSKHPDFRILGIRSSGYEKALRSSTFCLCPEGWHAWTPRVF 314
Query: 287 EAVVFGCIPVIIADDIVLPFADAIPWE 313
EA++ GCIPV+I+DD+ LPF I ++
Sbjct: 315 EAILSGCIPVLISDDLALPFESLIDYD 341
>gi|102140005|gb|ABF70140.1| exostosin family protein [Musa balbisiana]
Length = 533
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 102/344 (29%), Positives = 167/344 (48%), Gaps = 30/344 (8%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLN-PEEADWFYTPIY 98
LKVY+YE K L P L ++A+E + + + + +N P +A FY P Y
Sbjct: 201 LKVYIYE----DGPKPLCHTPH-LDGIYASEGWFMKLMEENTQFVVNDPNKAHLFYLP-Y 254
Query: 99 PTCDLTPTGLPLPFKSPR----MMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHY 154
+ L +S R +R + IS+ +P+WNRT GADHF V HD+ A +
Sbjct: 255 SSRQLRTHLYVAGSRSMRPLSIFLRDYVNSISAKYPFWNRTRGADHFLVACHDW-ATYTT 313
Query: 155 QEEKAIERGILPLLQRATLVQ-TFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRS 213
+ + + + ++ A + + F + V L E + P P+K + P + RS
Sbjct: 314 NLHEELRKNTIKVVCNADVSEGVFVRGKDVSLAETYVRTP--NSPRKAIGGR--PASRRS 369
Query: 214 IFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDH---PTTYYEDMQRAI 270
I +F G + P Y RG ++ ++++ D +Y E M+ +
Sbjct: 370 ILAFFAGQMHG-RVRPILLRYWRGRD-------RDMRIYEVLPDEIAAKMSYIEHMKSSK 421
Query: 271 FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTI 330
FC+CP+G+ SPR+VEA+ + C+PVIIA++ VLPF + + W + VAE+D+PKL I
Sbjct: 422 FCICPMGYEVNSPRIVEAIYYDCVPVIIANNFVLPFEEVLDWGAFSVVVAEKDIPKLKQI 481
Query: 331 LTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
L I +R QR + +++ L+ D FH IL+ +
Sbjct: 482 LLGISGRRYVRMQRNVRR--LRKHFLWNDKPVKYDLFHMILHSI 523
>gi|359484704|ref|XP_002264922.2| PREDICTED: probable glycosyltransferase At5g20260-like [Vitis
vinifera]
Length = 467
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 91/302 (30%), Positives = 141/302 (46%), Gaps = 39/302 (12%)
Query: 86 NPEEADWFYTPIY----------PTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRT 135
+P+EA FY PI P D + L R++ I ++S +PYWNR+
Sbjct: 181 HPDEAHVFYIPISVTRIAHYIYSPPVDYSGHMLQ------RLVTDYIYVVSDKYPYWNRS 234
Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
GADHF V HD+ + + + +L A + F + L E +I
Sbjct: 235 NGADHFLVSCHDWAPEISIVTPD-LYKHFIRVLCNANTSERFQPIRDISLPEVNIPKGKL 293
Query: 196 APPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFK-NNPLFDI 254
PP H P R I +F G E GY R ++ ++K N+ +
Sbjct: 294 GPP-----HLDKPPNQRHILAFFAGR--------ESGY----MRTLLFRSWKENDDEVQV 336
Query: 255 STDHPTT--YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 312
P+ Y + M + FCLCP GW SPR+VEA+ GC+PVII D VLPF++ + W
Sbjct: 337 YEHLPSNRDYAKSMGDSKFCLCPSGWEVASPRVVEAIAAGCVPVIICDYYVLPFSEVLVW 396
Query: 313 EEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILN 372
+ + + + +P++ IL ++P E LR Q+ + ++R + +PAQP D H IL+
Sbjct: 397 SKFSINITSDKIPEIKKILKAVPNERYLRMQKRVKQ--VQRHFVINRPAQPYDMLHMILH 454
Query: 373 GL 374
+
Sbjct: 455 SV 456
>gi|79326862|ref|NP_001031828.1| putative glycosyltransferase [Arabidopsis thaliana]
gi|292630778|sp|Q9FFN2.2|GLYT3_ARATH RecName: Full=Probable glycosyltransferase At5g03795
gi|332003272|gb|AED90655.1| putative glycosyltransferase [Arabidopsis thaliana]
Length = 518
Score = 128 bits (322), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 105/359 (29%), Positives = 154/359 (42%), Gaps = 55/359 (15%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP-- 96
+ K+YVY K + L D C + F++ + RT NP++A FY P
Sbjct: 189 QFKIYVY----KEGEPPLFHDGPCKSIYSMEGSFIYEIETDTRFRTNNPDKAHVFYLPFS 244
Query: 97 -------IYP--TCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHD 147
+Y + D +P ++ I L+ +PYWNR+ GADHF + HD
Sbjct: 245 VVKMVRYVYERNSRDFSPIR--------NTVKDYINLVGDKYPYWNRSIGADHFILSCHD 296
Query: 148 FGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNE-----GSITIPPYAPPQKMQ 202
+G + I L A + F R V + E GS+T
Sbjct: 297 WGPEASFSHPHLGHNSIRALCN-ANTSERFKPRKDVSIPEINLRTGSLT----------G 345
Query: 203 AHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYA--RGARAAVWENFKNNPLFDISTDHPT 260
P + R I +F GG + R WEN N+ T
Sbjct: 346 LVGGPSPSSRPILAFF-----------AGGVHGPVRPVLLQHWENKDNDIRVHKYLPRGT 394
Query: 261 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
+Y + M+ + FC+CP G+ SPR+VEA+ GC+PV+I V PF+D + W + V+
Sbjct: 395 SYSDMMRNSKFCICPSGYEVASPRIVEALYSGCVPVLINSGYVPPFSDVLNWRSFSVIVS 454
Query: 321 EEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL-ARKL 378
ED+P L TILTSI P LR R + ++R PA+ D FH IL+ + R+L
Sbjct: 455 VEDIPNLKTILTSISPRQYLRMYRRVLK--VRRHFEVNSPAKRFDVFHMILHSIWVRRL 511
>gi|297810461|ref|XP_002873114.1| hypothetical protein ARALYDRAFT_487158 [Arabidopsis lyrata subsp.
lyrata]
gi|297318951|gb|EFH49373.1| hypothetical protein ARALYDRAFT_487158 [Arabidopsis lyrata subsp.
lyrata]
Length = 519
Score = 128 bits (322), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 104/351 (29%), Positives = 151/351 (43%), Gaps = 39/351 (11%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP-- 96
+ K+YVY K + L D C + F++ + RT NP++A FY P
Sbjct: 190 QFKIYVY----KEGEPPLFHDGPCKSIYSMEGSFIYEMETDTRFRTNNPDKAHAFYLPFS 245
Query: 97 IYPTCDLTPTGLPLPFKSPR-MMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQ 155
+ F R +R I L+ +PYWNR+ GADHF + HD+G +
Sbjct: 246 VVKMVRYVYERNSRDFSPIRNTVRDYINLVGDKYPYWNRSIGADHFILSCHDWGPEASFS 305
Query: 156 EEKAIERGILPLLQRATLVQTFGQRNHVCLNE-----GSITIPPYAPPQKMQAHQIPPDT 210
I L A + F R V + E GS+T P +
Sbjct: 306 HPHLGHNSIRALCN-ANTSEKFKPRKDVSIPEINLRTGSLTGLVGGPSP----------S 354
Query: 211 PRSIFVYFRGLFYDVNNDPEGGYYA--RGARAAVWENFKNNPLFDISTDHPTTYYEDMQR 268
R I +F GG + R WEN N+ T+Y + M+
Sbjct: 355 SRPILAFF-----------AGGVHGPVRPVLLEHWENKDNDIRVHKYLPRGTSYSDMMRN 403
Query: 269 AIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLD 328
+ FC+CP G+ SPR+VEA+ GC+PV+I V PF+D + W + V+ ED+P L
Sbjct: 404 SKFCICPSGYEVASPRIVEALYSGCVPVLINSGYVPPFSDVLNWRSFSVIVSVEDIPNLK 463
Query: 329 TILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL-ARKL 378
TILT+I P LR R + ++R PA+ D FH IL+ + R+L
Sbjct: 464 TILTAISPRQYLRMYRRVLK--VRRHFEVNSPAKRFDVFHMILHSIWVRRL 512
>gi|359484708|ref|XP_002264880.2| PREDICTED: probable glycosyltransferase At5g20260-like [Vitis
vinifera]
Length = 467
Score = 128 bits (322), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 81/262 (30%), Positives = 129/262 (49%), Gaps = 23/262 (8%)
Query: 116 RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ 175
R++ I ++S+ +PYWNR+ GADHF V HD+ + + + +L A +
Sbjct: 215 RLVTDYIYVVSNKYPYWNRSNGADHFLVSCHDWAPEISIVTPD-LYKHFIRVLCNANTSE 273
Query: 176 TFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYA 235
F + L E +I PP H P R I +F G E GY
Sbjct: 274 RFQPIRDISLPEVNIPKGKLGPP-----HLDKPPNQRHILAFFSGR--------ESGY-- 318
Query: 236 RGARAAVWENFK-NNPLFDISTDHPTT--YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFG 292
R ++ ++K N+ + P+ Y + M + FCLCP GW SPR+VEA+ G
Sbjct: 319 --MRTLLFRSWKENDDEVQVYEHLPSNRDYAKSMVDSKFCLCPSGWEVASPRVVEAIAAG 376
Query: 293 CIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMK 352
C+PVII D VLPF++ + W + + + + +P++ IL ++P E LR Q+ + ++
Sbjct: 377 CVPVIICDYYVLPFSEVLDWSKFSINITSDKIPEIKKILKAVPNERYLRMQKRVKQ--VQ 434
Query: 353 RAMLFPQPAQPGDAFHQILNGL 374
R + +PAQP D H IL+ +
Sbjct: 435 RHFVINRPAQPYDMLHMILHSV 456
>gi|356533401|ref|XP_003535253.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 523
Score = 128 bits (321), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 99/353 (28%), Positives = 160/353 (45%), Gaps = 52/353 (14%)
Query: 39 KLKVYVYE---LPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYT 95
+ KV+VYE LP + D C + F+H ++ RT +P++A+ F+
Sbjct: 185 QFKVFVYEEGELP-------VFHDGPCSSIYSTEGSFIHAIEMNEHFRTRDPKKANVFFL 237
Query: 96 P-----------IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVV 144
P I + D P R +R + +I++ +PYWNR+ GADHF +
Sbjct: 238 PFSIAWMVRYVYIRNSYDFGPI--------KRTVRDYVNVIATRYPYWNRSLGADHFMLS 289
Query: 145 PHDFGACFHYQEEKAI---ERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
HD+G + K+I + + +L A + F + + S P K
Sbjct: 290 CHDWGP----ETSKSIPYLRKNSIRVLCNANTSEGFDP-----IKDASFPEINLQPGLKD 340
Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
PP + RSI +F G N+ P R WEN + +
Sbjct: 341 SFVGGPPASKRSILAFFAG----GNHGP-----IRPILLEHWENKDEDIQVHKYLPKGVS 391
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
YY ++ + FCLCP G+ SPR+VEA+ GC+PV+I++ V PF+D + W+ + V+
Sbjct: 392 YYGMLRNSKFCLCPSGYEVASPRVVEAIYTGCVPVLISEHYVPPFSDVLNWKMFSVNVSV 451
Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+++P L ILTSI P +R Q+ + ++R P + D FH IL+ +
Sbjct: 452 KEIPNLKDILTSISPRQYIRMQKRVGQ--IRRHFEVHSPPKRYDVFHMILHSV 502
>gi|297802132|ref|XP_002868950.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314786|gb|EFH45209.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 428
Score = 128 bits (321), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 107/351 (30%), Positives = 159/351 (45%), Gaps = 52/351 (14%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
+ KVY+Y P + + PR +T +A+E + + + S RTL+PEEAD F+ PI
Sbjct: 106 RFKVYIY--PDG-DPNTFYQTPRKVTGKYASEGYFFQNIRESRFRTLDPEEADLFFIPI- 161
Query: 99 PTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
+C ++++ + + + +PYWNRT GADHFFV HD G +
Sbjct: 162 -SCHKMRGKGTSYENMTVIVQNYVDGLIAKYPYWNRTLGADHFFVTCHDVGV-------R 213
Query: 159 AIERGILPLLQRATLVQTFGQRNHVC---LNEGSITIPPYAPPQKMQAHQIPPDTPRSIF 215
A E G L L++ V VC N G I A PQ +Q +P
Sbjct: 214 AFE-GSLLLIKNTIRV--------VCSPSYNVGFIPHKDVALPQVLQPFALPAGGN---- 260
Query: 216 VYFRGLFYDVNNDPEGGYYA-------RGARAAVWEN-----FKNNPLFDISTDHPTTYY 263
DV N G++A R A VWEN NN + + +T H Y
Sbjct: 261 --------DVENRTTLGFWAGHRNSKIRVILARVWENDTELDISNNRI-NRATGH-LVYQ 310
Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
+ R FC+CP G S R+ +++ +GCIPVI++D LPF D + W + + + E+D
Sbjct: 311 KRFYRTKFCICPGGSQVNSARITDSIHYGCIPVILSDYYDLPFNDILNWRKFAVVLREQD 370
Query: 324 VPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
V L IL +IP + L +++ + P DAFH I+ L
Sbjct: 371 VYNLKQILKNIPHSEFISLHNNLV--KVQKHFQWNSPPVKFDAFHMIMYEL 419
>gi|9758008|dbj|BAB08605.1| unnamed protein product [Arabidopsis thaliana]
Length = 408
Score = 128 bits (321), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 104/351 (29%), Positives = 151/351 (43%), Gaps = 39/351 (11%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP-- 96
+ K+YVY K + L D C + F++ + RT NP++A FY P
Sbjct: 79 QFKIYVY----KEGEPPLFHDGPCKSIYSMEGSFIYEIETDTRFRTNNPDKAHVFYLPFS 134
Query: 97 IYPTCDLTPTGLPLPFKSPR-MMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQ 155
+ F R ++ I L+ +PYWNR+ GADHF + HD+G +
Sbjct: 135 VVKMVRYVYERNSRDFSPIRNTVKDYINLVGDKYPYWNRSIGADHFILSCHDWGPEASFS 194
Query: 156 EEKAIERGILPLLQRATLVQTFGQRNHVCLNE-----GSITIPPYAPPQKMQAHQIPPDT 210
I L A + F R V + E GS+T P +
Sbjct: 195 HPHLGHNSIRALCN-ANTSERFKPRKDVSIPEINLRTGSLT----------GLVGGPSPS 243
Query: 211 PRSIFVYFRGLFYDVNNDPEGGYYA--RGARAAVWENFKNNPLFDISTDHPTTYYEDMQR 268
R I +F GG + R WEN N+ T+Y + M+
Sbjct: 244 SRPILAFF-----------AGGVHGPVRPVLLQHWENKDNDIRVHKYLPRGTSYSDMMRN 292
Query: 269 AIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLD 328
+ FC+CP G+ SPR+VEA+ GC+PV+I V PF+D + W + V+ ED+P L
Sbjct: 293 SKFCICPSGYEVASPRIVEALYSGCVPVLINSGYVPPFSDVLNWRSFSVIVSVEDIPNLK 352
Query: 329 TILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL-ARKL 378
TILTSI P LR R + ++R PA+ D FH IL+ + R+L
Sbjct: 353 TILTSISPRQYLRMYRRVL--KVRRHFEVNSPAKRFDVFHMILHSIWVRRL 401
>gi|296084508|emb|CBI25529.3| unnamed protein product [Vitis vinifera]
Length = 388
Score = 128 bits (321), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 81/262 (30%), Positives = 129/262 (49%), Gaps = 23/262 (8%)
Query: 116 RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ 175
R++ I ++S+ +PYWNR+ GADHF V HD+ + + + +L A +
Sbjct: 136 RLVTDYIYVVSNKYPYWNRSNGADHFLVSCHDWAPEISIVTPD-LYKHFIRVLCNANTSE 194
Query: 176 TFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYA 235
F + L E +I PP H P R I +F G E GY
Sbjct: 195 RFQPIRDISLPEVNIPKGKLGPP-----HLDKPPNQRHILAFFSGR--------ESGY-- 239
Query: 236 RGARAAVWENFK-NNPLFDISTDHPTT--YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFG 292
R ++ ++K N+ + P+ Y + M + FCLCP GW SPR+VEA+ G
Sbjct: 240 --MRTLLFRSWKENDDEVQVYEHLPSNRDYAKSMVDSKFCLCPSGWEVASPRVVEAIAAG 297
Query: 293 CIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMK 352
C+PVII D VLPF++ + W + + + + +P++ IL ++P E LR Q+ + ++
Sbjct: 298 CVPVIICDYYVLPFSEVLDWSKFSINITSDKIPEIKKILKAVPNERYLRMQKRVKQ--VQ 355
Query: 353 RAMLFPQPAQPGDAFHQILNGL 374
R + +PAQP D H IL+ +
Sbjct: 356 RHFVINRPAQPYDMLHMILHSV 377
>gi|302837063|ref|XP_002950091.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300264564|gb|EFJ48759.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 593
Score = 127 bits (320), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 123/424 (29%), Positives = 175/424 (41%), Gaps = 96/424 (22%)
Query: 42 VYVYELPSKYNKKLL-LKDPRCLTH-----MFAAEIFMHRFLLSSPVRTLNPEEADWFYT 95
+YVY+L Y KLL + P T ++AA+ +H LL S RT +PEEAD+FY
Sbjct: 183 IYVYDLEPLYQSKLLQYRWPDNGTQWTDVWVYAADTLLHESLLISEHRTFDPEEADFFYV 242
Query: 96 PIYPTCDLTPTGL--------------------------------PLPF-KSPR------ 116
P Y +C P G+ LP + PR
Sbjct: 243 PHYASCLPFPIGVWADAPWFPEPGARAHHHLCEVLPATRVITVTCRLPVNRGPRVKQMVN 302
Query: 117 MMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGI-LPLLQRATLVQ 175
M+R + I+S +PYW R G DH ++ HD GAC+ AI I L R L
Sbjct: 303 MVRDTLDWINSTFPYWRRRGGRDHIWLFTHDEGACW---APTAINASIWLTHWGRTELNH 359
Query: 176 TFGQ------------------------RNHVCLN-EGSITIPPYAPPQKMQAH--QIPP 208
T + H C N E + IP + P A Q P
Sbjct: 360 TSNTAFLADKYDSDFAGPLQPEGFVKYIKGHPCFNPEKDLVIPAFKAPSHYHASPLQGNP 419
Query: 209 DTPRSIFVYFRGLFYDV--NNDPEGGYYARGARAAVWENFKNNPLFDI-------STDHP 259
R + +FRG DV N P Y+RG R +++ K + +D
Sbjct: 420 ARERDLLFFFRG---DVGKNRLPN---YSRGVRQQIYKMAKEGGWAEKYRFYIGDGSDVE 473
Query: 260 TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV 319
Y E + RAIFCL G WS R+ +AV+ GCIPV+IAD + F + + + + +
Sbjct: 474 GDYSEMLSRAIFCLVAPG-DGWSARMEDAVLHGCIPVVIADGVEAVFENVLELDAFALRL 532
Query: 320 AEEDVPKLDTILTSIPPEVILRKQRLLANPSMK-RAMLFPQPAQPGDAFHQILNGLARKL 378
+E VP+L +L ++P I KQ L + R P + DAF I+ L ++
Sbjct: 533 PQEAVPRLLDVLRAVPQRAIRSKQAHLGRVWQRYRWASLP---KLDDAFATIMQWLHSRI 589
Query: 379 PHDK 382
PH +
Sbjct: 590 PHTR 593
>gi|255575457|ref|XP_002528630.1| catalytic, putative [Ricinus communis]
gi|223531919|gb|EEF33733.1| catalytic, putative [Ricinus communis]
Length = 574
Score = 127 bits (320), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 92/316 (29%), Positives = 139/316 (43%), Gaps = 40/316 (12%)
Query: 72 FMHRFLLSSPVRTLNPEEADWFYTP-----------IYPTCDLTPTGLPLPFKSPRMMRS 120
F+HR + RT +PE+A ++ P + + D P R +R
Sbjct: 268 FIHRMEIDDQFRTKDPEKAHVYFLPFSVAMMVQFVYVRDSHDFGPI--------KRTVRD 319
Query: 121 AIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR 180
+ L++ +PYWNR+ GADHF + HD+G + + I L A + F
Sbjct: 320 YVNLVAGKYPYWNRSLGADHFMLACHDWGPETSFSLPDLAKNSIRALCN-ANTSERFNPI 378
Query: 181 NHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYA--RGA 238
V E I + + P + RSI +F GG + R
Sbjct: 379 KDVSFPE--INLQTGTTKGFIGG---PSPSKRSILAFF-----------AGGLHGPIRPI 422
Query: 239 RAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 298
WEN N+ +YYE M+++ FCLCP G+ SPR+VEA+ GC+PV+I
Sbjct: 423 LLEHWENKDNDMKVHRYLPKGVSYYEMMRKSKFCLCPSGYEVASPRVVEALYTGCVPVLI 482
Query: 299 ADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFP 358
+D V PF+D + W+ + V D+P L ILTSI +R QR + ++R
Sbjct: 483 SDHYVPPFSDVLNWKSFSVEVPVSDIPNLKRILTSISSRQYIRMQRRVLQ--VRRHFEVN 540
Query: 359 QPAQPGDAFHQILNGL 374
P + D FH IL+ +
Sbjct: 541 SPPKRYDVFHMILHSI 556
>gi|19699001|gb|AAL91236.1| unknown protein [Arabidopsis thaliana]
gi|30023662|gb|AAP13364.1| At2g28110 [Arabidopsis thaliana]
Length = 214
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 58/125 (46%), Positives = 83/125 (66%), Gaps = 4/125 (3%)
Query: 32 LEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLS--SPVRTLNPEE 89
++ D LK+YVY+LPSK+NK L D RC H+FAAE+ +H+ LS VRT +P E
Sbjct: 86 IKTDVFNNLKIYVYDLPSKFNKDWLAND-RCTNHLFAAEVALHKAFLSLEGDVRTEDPYE 144
Query: 90 ADWFYTPIYPTCDL-TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDF 148
AD+F+ P+Y +C+ T G P + ++ AI+L+S+ +P+WNRT G+DH F HDF
Sbjct: 145 ADFFFVPVYVSCNFSTINGFPAIGHARSLINDAIKLVSTQYPFWNRTSGSDHVFTATHDF 204
Query: 149 GACFH 153
G+CFH
Sbjct: 205 GSCFH 209
>gi|359484710|ref|XP_002266490.2| PREDICTED: probable glycosyltransferase At5g20260 [Vitis vinifera]
Length = 480
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 83/295 (28%), Positives = 136/295 (46%), Gaps = 26/295 (8%)
Query: 86 NPEEADWFYTPIYPTCDLTPTGLPLPFKS---PRMMRSAIQLISSNWPYWNRTEGADHFF 142
+P+EA+ FY P+ T + P + PR++ I ++ +PYWNR++GADHF
Sbjct: 195 HPDEANAFYIPMSLTRVVHFIYEPPHYHGKWIPRLVTDYINFVADKYPYWNRSKGADHFL 254
Query: 143 VVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQ 202
V HD+ + A++ + RA +R H + I+IP P+
Sbjct: 255 VSCHDWAP-----DVSALKPDLYKHFIRALCNANTSERFHPIRD---ISIPEINIPRGKL 306
Query: 203 A--HQIPPDTPRSIFVYFRGLFYDVNNDPEGGY-YARGARAAVWENFKNNPLFDISTDHP 259
H P R I +F G G + Y R W+ +
Sbjct: 307 GPPHLDQPPNKRPILAFFAG----------GAHGYVRSVLFKYWKEKDDEVQVFERLPRN 356
Query: 260 TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV 319
Y + M + FCLCP G+ SPR+VEA+ GC+P+II D LPF+D + W + +++
Sbjct: 357 RNYSKSMGDSKFCLCPSGYEVASPRIVEAIAAGCVPMIICDHYSLPFSDVLDWSKFSIYI 416
Query: 320 AEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+ +P++ IL ++P E L Q+ + ++R +PA+P D H IL+ +
Sbjct: 417 TSDKIPEIKKILKAVPTETYLEMQKRVKQ--VQRHFAINRPARPYDMLHMILHSV 469
>gi|15233650|ref|NP_195517.1| Exostosin family protein [Arabidopsis thaliana]
gi|4467110|emb|CAB37544.1| putative protein [Arabidopsis thaliana]
gi|7270787|emb|CAB80469.1| putative protein [Arabidopsis thaliana]
gi|15293111|gb|AAK93666.1| unknown protein [Arabidopsis thaliana]
gi|21280961|gb|AAM45007.1| unknown protein [Arabidopsis thaliana]
gi|332661466|gb|AEE86866.1| Exostosin family protein [Arabidopsis thaliana]
Length = 425
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 106/351 (30%), Positives = 159/351 (45%), Gaps = 52/351 (14%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
+ KVY+Y P + + PR +T +A+E + + + S RTL+P+EAD F+ PI
Sbjct: 103 RFKVYIY--PDG-DPNTFYQTPRKVTGKYASEGYFFQNIRESRFRTLDPDEADLFFIPI- 158
Query: 99 PTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
+C ++++ + + + +PYWNRT GADHFFV HD G +
Sbjct: 159 -SCHKMRGKGTSYENMTVIVQNYVDGLIAKYPYWNRTLGADHFFVTCHDVGV-------R 210
Query: 159 AIERGILPLLQRATLVQTFGQRNHVC---LNEGSITIPPYAPPQKMQAHQIPPDTPRSIF 215
A E PLL + T+ VC N G I A PQ +Q +P
Sbjct: 211 AFEGS--PLLIKNTI-------RVVCSPSYNVGFIPHKDVALPQVLQPFALPAGGN---- 257
Query: 216 VYFRGLFYDVNNDPEGGYYA-------RGARAAVWEN-----FKNNPLFDISTDHPTTYY 263
DV N G++A R A VWEN NN + + +T H Y
Sbjct: 258 --------DVENRTTLGFWAGHRNSKIRVILAHVWENDTELDISNNRI-NRATGH-LVYQ 307
Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
+ R FC+CP G S R+ +++ +GCIPVI++D LPF D + W + + + E+D
Sbjct: 308 KRFYRTKFCICPGGSQVNSARITDSIHYGCIPVILSDYYDLPFNDILNWRKFAVVLREQD 367
Query: 324 VPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
V L IL +IP + L +++ + P DAFH I+ L
Sbjct: 368 VYNLKQILKNIPHSEFVSLHNNLV--KVQKHFQWNSPPVKFDAFHMIMYEL 416
>gi|302753728|ref|XP_002960288.1| glycosylransferase-like protein [Selaginella moellendorffii]
gi|300171227|gb|EFJ37827.1| glycosylransferase-like protein [Selaginella moellendorffii]
Length = 372
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 97/350 (27%), Positives = 169/350 (48%), Gaps = 34/350 (9%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAE-IFMHRFLLSSPVRTLNPEEADWFYTPI- 97
KVYVY+ + +L+ K P LT ++++E F+H +S T +P+EA F+ P
Sbjct: 32 FKVYVYK---EGEPRLVHKGP--LTGIYSSEGRFIHEMNQNSRFVTHDPQEAHMFFLPYS 86
Query: 98 --YPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQ 155
+ DL G ++ + LI+S P+WN T G+DHFF HD+G +
Sbjct: 87 VAHMVLDLYVPGSHTMLPLATFIKDYVNLIASKHPFWNLTRGSDHFFTSCHDWGPAT-AR 145
Query: 156 EEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIF 215
+ + + + ++ + L + F L E + P K+ P + R I
Sbjct: 146 DHPELRKNSVKVVCNSDLTEEFVPDKDASLPE--TYLHAVKLPTKLGG---PGPSKRPIL 200
Query: 216 VYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP------TTYYEDMQRA 269
+F G + R A W++ + +P I P T+Y + M+ +
Sbjct: 201 AFFAGQMHG---------RVRPALIKHWKD-RGDPDMRIYEVLPPEVARRTSYVQHMKSS 250
Query: 270 IFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDT 329
FC+C +G+ SPR+VE++ + C+PV+IAD+ VLPF+D + W + V+E+DVP+L
Sbjct: 251 KFCICAMGFEVNSPRIVESIYYDCVPVLIADNFVLPFSDVLNWGSFSLTVSEKDVPRLKE 310
Query: 330 ILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL-ARKL 378
+L ++ + + Q L +++ L+ A+ D FH IL+ + R+L
Sbjct: 311 LLLAVSEDRYRKMQSRLKK--VRKHFLWHDSAERFDMFHMILHSVWTRRL 358
>gi|296084509|emb|CBI25530.3| unnamed protein product [Vitis vinifera]
Length = 387
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 83/295 (28%), Positives = 136/295 (46%), Gaps = 26/295 (8%)
Query: 86 NPEEADWFYTPIYPTCDLTPTGLPLPFKS---PRMMRSAIQLISSNWPYWNRTEGADHFF 142
+P+EA+ FY P+ T + P + PR++ I ++ +PYWNR++GADHF
Sbjct: 102 HPDEANAFYIPMSLTRVVHFIYEPPHYHGKWIPRLVTDYINFVADKYPYWNRSKGADHFL 161
Query: 143 VVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQ 202
V HD+ + A++ + RA +R H + I+IP P+
Sbjct: 162 VSCHDWAP-----DVSALKPDLYKHFIRALCNANTSERFHPIRD---ISIPEINIPRGKL 213
Query: 203 A--HQIPPDTPRSIFVYFRGLFYDVNNDPEGGY-YARGARAAVWENFKNNPLFDISTDHP 259
H P R I +F G G + Y R W+ +
Sbjct: 214 GPPHLDQPPNKRPILAFFAG----------GAHGYVRSVLFKYWKEKDDEVQVFERLPRN 263
Query: 260 TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV 319
Y + M + FCLCP G+ SPR+VEA+ GC+P+II D LPF+D + W + +++
Sbjct: 264 RNYSKSMGDSKFCLCPSGYEVASPRIVEAIAAGCVPMIICDHYSLPFSDVLDWSKFSIYI 323
Query: 320 AEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+ +P++ IL ++P E L Q+ + ++R +PA+P D H IL+ +
Sbjct: 324 TSDKIPEIKKILKAVPTETYLEMQKRVKQ--VQRHFAINRPARPYDMLHMILHSV 376
>gi|384490457|gb|EIE81679.1| hypothetical protein RO3G_06384 [Rhizopus delemar RA 99-880]
Length = 793
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 98/370 (26%), Positives = 162/370 (43%), Gaps = 46/370 (12%)
Query: 34 DDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLL--SSPVR----TLNP 87
+ V K K+YVY+ P ++ +C + +EI +H L +SP+ T NP
Sbjct: 429 EQAVCKPKIYVYKAPESIQVPEQIQKEKCHESNYNSEIILHNQLTDPTSPIYEHYVTENP 488
Query: 88 EEADWFYTPIYPTCDLTPTGLPLPFK-------SPRMMRSAIQLISSNWPYWNRTEGADH 140
EEAD+F+ P + +C L + + + + ++ +PYWN+T G +H
Sbjct: 489 EEADFFFIPFFGSCYLYNCWYENKWNWDERCEVDAKYVDPLMDMVIQEYPYWNKTGGRNH 548
Query: 141 FFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG----SITIPPYA 196
+ P D F Y + P Q A ++T G + + ++ I IP
Sbjct: 549 IMIHPMD--KTFTYYQSN-------PRFQSAIFLKTVGDKRNKWMSRHRYHRDIVIPSAT 599
Query: 197 ---------PPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFK 247
P + A P R IF F+G DV E Y+ G R+ + +F
Sbjct: 600 RMIHHLRANPLDYLNAQGQPKSGKRDIFALFQGCCPDVQPTDE---YSNGIRSLFFNHFA 656
Query: 248 NNPLFDISTD-HPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 306
+ P ++I Y E + RA + L P+GW + R+ E + FG +PV+IAD I+ PF
Sbjct: 657 HYPGYEIGQSVADEEYLEKLSRAKYGLSPMGWTLDTTRIWEFMAFGVVPVVIADGIIEPF 716
Query: 307 ADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDA 366
+ W++ + + ++V +LD IL SI + KQ+ L + + DA
Sbjct: 717 EFDVDWDKFIVRIRRDEVHRLDEILKSIDDKTYEYKQKNLWEFGRRVGLEM-------DA 769
Query: 367 FHQILNGLAR 376
+H I+ L R
Sbjct: 770 WHFIVRELCR 779
>gi|159485716|ref|XP_001700890.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158281389|gb|EDP07144.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 704
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 108/369 (29%), Positives = 162/369 (43%), Gaps = 69/369 (18%)
Query: 42 VYVYELPSKYNKKLL---LKDPRCLTH--------------MFAAEIFMHRFLLSSPVRT 84
+YVY+L YN +LL + C TH + E +H +LL S RT
Sbjct: 269 IYVYDLDPLYNARLLQYRIVQTWC-THRHYHTGNTSTWSATFYGLEAALHEYLLISEHRT 327
Query: 85 LNPEEADWFYTPIYPTCDLTPT------------GLPLPFKSPRMMRSAIQLISSNWPYW 132
+PEEAD+FY P Y C + P G P + M+R ++ I +P+W
Sbjct: 328 FDPEEADYFYVPFYGACMIYPVAGWADYPWFWTPGGPRVMQVINMIREIVEWIDKQYPFW 387
Query: 133 NRTEGADHFFVVPHDFGACF------------HYQE---EKAIERGILPLLQRATLV--- 174
R G DH ++ HD GAC+ H+ E + +V
Sbjct: 388 KRRGGRDHIWLFTHDEGACWAPSVIKDSVWLTHWGRLDPEHTSNTAFVGDNYTHDMVNWR 447
Query: 175 QTFGQ----RNHVCLN-EGSITIPPY-APPQKMQA-HQIPPDTPRSIFVYFRGLFYDVNN 227
Q G + H C + + + +P + +PP +++ Q P PR IF +F+G DV
Sbjct: 448 QPEGYIKYIKGHPCYDPQKDLVVPNFKSPPHYVRSPLQSTPSKPRDIFFFFKG---DVGK 504
Query: 228 DPEGGYYARGARAAV--------WENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWA 279
+Y+RG R + W N + + + D H Y + + R++FCL G
Sbjct: 505 H-RLSHYSRGIRQKIYKMAMEQDWANTQKSLIGDGGNVH-GDYSDLLSRSLFCLVAPG-D 561
Query: 280 PWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVI 339
WSPRL +AV+ GCIPVIIAD + F + + + VAE DVP++ IL ++ I
Sbjct: 562 GWSPRLEDAVLHGCIPVIIADRVHAVFESVLDIDSFAVRVAEADVPRVMDILRAVSDIKI 621
Query: 340 LRKQRLLAN 348
KQ L
Sbjct: 622 RLKQSRLGQ 630
>gi|302768010|ref|XP_002967425.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300165416|gb|EFJ32024.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 372
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 97/350 (27%), Positives = 169/350 (48%), Gaps = 34/350 (9%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAE-IFMHRFLLSSPVRTLNPEEADWFYTPI- 97
KVYVY+ + +L+ K P LT ++++E F+H +S T +P+EA F+ P
Sbjct: 32 FKVYVYK---EGEPRLVHKGP--LTGIYSSEGRFIHEMNQNSRFVTHDPQEAHMFFLPYS 86
Query: 98 --YPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQ 155
+ DL G ++ + LI+S P+WN T G+DHFF HD+G +
Sbjct: 87 VAHMVLDLYVPGSHSMLPLATFIKDYVNLIASKHPFWNLTRGSDHFFASCHDWGPAT-AR 145
Query: 156 EEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIF 215
+ + + + ++ + L + F L E + P K+ P + R I
Sbjct: 146 DHPELRKNSVKVVCNSDLTEEFVPDKDASLPE--TYLHAVKLPTKLGG---PGPSKRPIL 200
Query: 216 VYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP------TTYYEDMQRA 269
+F G + R A W++ + +P I P T+Y + M+ +
Sbjct: 201 AFFAGQMHG---------RVRPALIKHWKD-RGDPDMRIYEVLPPDVARRTSYVQHMKSS 250
Query: 270 IFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDT 329
FC+C +G+ SPR+VE++ + C+PV+IAD+ VLPF+D + W + V+E+DVP+L
Sbjct: 251 KFCICAMGFEVNSPRIVESIYYDCVPVLIADNFVLPFSDVLNWGSFSLTVSEKDVPRLKE 310
Query: 330 ILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL-ARKL 378
+L ++ + + Q L +++ L+ A+ D FH IL+ + R+L
Sbjct: 311 LLLAVSEDRYRKMQSRLKK--VRKHFLWHDSAERFDMFHMILHSVWTRRL 358
>gi|255579651|ref|XP_002530666.1| catalytic, putative [Ricinus communis]
gi|223529799|gb|EEF31735.1| catalytic, putative [Ricinus communis]
Length = 528
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 110/348 (31%), Positives = 162/348 (46%), Gaps = 37/348 (10%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSS-PVRTLNPEEADWFYTPIY 98
LKVY+Y P K+ + P L ++A+E + +F+ + T +PE+A FY P Y
Sbjct: 192 LKVYIY--PD--GKRPIFHVPH-LNGIYASEGWFMKFMEENRQFVTRDPEKAHLFYLP-Y 245
Query: 99 PTCDLTPTGLPLPFKS-----PRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFH 153
L L +P MR +I++ +P+WNRT G DHF V HD+G
Sbjct: 246 SARQLQ-MALYVPNSHNLRPLSIFMRDYANMIATKYPFWNRTHGRDHFLVACHDWGPYTL 304
Query: 154 YQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRS 213
E+ + I L F V L E +I IP P K I + R
Sbjct: 305 TMHEELTKNTIKALCNADASEGIFDPTKDVSLPETTIRIP--RRPLKNVGGGIRV-SQRP 361
Query: 214 IFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPL-----FDISTDHPTTYYEDMQR 268
I +F G + R W+N K+ L Y + M+
Sbjct: 362 ILAFFAGNMHG---------RVRPTLLQYWQN-KDEDLKIYGPLPARISRKMNYVQHMKS 411
Query: 269 AIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLD 328
+ +C+CP+G SPR+VEA+ + C+PVIIAD+ VLPF+D + W + VAE+D+PKL
Sbjct: 412 SRYCICPMGHEVNSPRIVEAIYYECVPVIIADNFVLPFSDVLDWSAFSIVVAEKDIPKLK 471
Query: 329 TILTSIPPEVILRKQRLLANPSM-KRAMLF-PQPAQPGDAFHQILNGL 374
IL +IP + R +L N M +R L+ P+P + D FH IL+ +
Sbjct: 472 EILLAIP---LRRYLTMLTNLKMLQRHFLWNPRPLR-YDLFHMILHSI 515
>gi|255582309|ref|XP_002531945.1| catalytic, putative [Ricinus communis]
gi|223528391|gb|EEF30427.1| catalytic, putative [Ricinus communis]
Length = 453
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 80/293 (27%), Positives = 135/293 (46%), Gaps = 22/293 (7%)
Query: 87 PEEADWFYTPIYPTCDLTPTGLPLPFKS----PRMMRSAIQLISSNWPYWNRTEGADHFF 142
PEEA F+ P+ + P+ S R++ + +++ +PYWNR+ GADHF
Sbjct: 166 PEEAQVFFLPVSVANIINFIYKPIVTYSRDQLQRLVTDYVGTVANKYPYWNRSSGADHFL 225
Query: 143 VVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQ 202
V HD+ + + R + +L A + F + V + E +I PP
Sbjct: 226 VSCHDWAPDISTANPE-LYRNFIRVLCNANTSERFNPKRDVSIPEINIPSGKLGPP---- 280
Query: 203 AHQIPPDTPRSIFVYFRGLFYDVNNDPEGGY-YARGARAAVWENFKNNPLFDISTDHPTT 261
HQ + R+I +F G G + Y R W+ + D
Sbjct: 281 LHQASSPSNRTILAFFAG----------GSHGYIRKLLLEHWKGKDSEIQVHEYLDKKQN 330
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y++ M ++ FCLCP G+ SPR+V A+ GC+PV I+D+ LPF+D + W + + +
Sbjct: 331 YFKLMGQSRFCLCPSGYEVASPRVVTAIQLGCVPVTISDNYTLPFSDILDWSKFSVHIPS 390
Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+ ++ TIL I P L+ + + ++R + +PAQP D H +L+ +
Sbjct: 391 GKIQEIKTILKGISPRQYLKMHKRVM--LVRRHFMLNRPAQPFDMIHMMLHSI 441
>gi|296084507|emb|CBI25528.3| unnamed protein product [Vitis vinifera]
Length = 862
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 83/294 (28%), Positives = 135/294 (45%), Gaps = 26/294 (8%)
Query: 87 PEEADWFYTPIYPTCDLTPTGLPLPFKS---PRMMRSAIQLISSNWPYWNRTEGADHFFV 143
P+EA+ FY P+ T + P + PR++ I ++ +PYWNR++GADHF V
Sbjct: 578 PDEANAFYIPMSLTKIVHFIYEPPHYYGKWIPRLVTDYINFVADKYPYWNRSKGADHFLV 637
Query: 144 VPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQA 203
HD+ + A++ + RA +R H + I+IP P+
Sbjct: 638 SCHDWAP-----DVSALKPDLYKHFIRALCNANTSERFHPIRD---ISIPEINIPRGKLG 689
Query: 204 --HQIPPDTPRSIFVYFRGLFYDVNNDPEGGY-YARGARAAVWENFKNNPLFDISTDHPT 260
H P R I +F G G + Y R W+ +
Sbjct: 690 PPHLDQPPNKRPILAFFAG----------GAHGYVRSVLFKYWKEKDDEVQVFERLPGNR 739
Query: 261 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
Y + M + FCLCP G+ SPR+VEA+ GC+P+II D LPF+D + W + +++
Sbjct: 740 NYSKSMGDSKFCLCPSGYEVASPRIVEAIAAGCVPMIICDHYSLPFSDVLDWSKFSIYIT 799
Query: 321 EEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+ +P++ IL ++P E L Q+ + ++R +PA+P D H IL+ +
Sbjct: 800 SDKIPEIKKILKAVPTETYLEMQKRVK--QVQRHFAINRPARPYDMLHMILHSV 851
>gi|224071129|ref|XP_002303362.1| predicted protein [Populus trichocarpa]
gi|222840794|gb|EEE78341.1| predicted protein [Populus trichocarpa]
Length = 386
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 107/354 (30%), Positives = 167/354 (47%), Gaps = 49/354 (13%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSS-PVRTLNPEEADWFYTP-- 96
LKVY+Y P K + P L ++A+E + +F+ +S + +PE+A FY P
Sbjct: 50 LKVYIY--PD--GDKPIFHQPH-LYGIYASEGWFMKFMEASREFVSRDPEKAHLFYLPYS 104
Query: 97 -------IY--PTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHD 147
+Y + +L P + MR +I++ +PYWNRT G DHF V HD
Sbjct: 105 ARQLEVAVYVPNSHNLRPLSI--------FMRDYANMIAAKYPYWNRTHGRDHFLVACHD 156
Query: 148 FGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIP 207
+G E+ + + L F V L E +I P P + I
Sbjct: 157 WGPYALTMHEELTKNTMKALCNADVSEGIFTAGQDVSLPETTIRSP--KRPLRNVGGGIR 214
Query: 208 PDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN-----PLFDISTDHPTTY 262
+ R I +F G + R W N ++ PL I TY
Sbjct: 215 V-SQRPILAFFAGNLHG---------RVRPTLLKYWHNKDDDMKIYGPL-PIGISRKMTY 263
Query: 263 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
+ M+ + +C+CP+G+ SPR+VEA+ + C+PVIIAD+ VLPF + + W + VAE+
Sbjct: 264 VQHMKSSKYCICPMGYEVNSPRIVEAIYYECVPVIIADNFVLPFNEVLDWSAFSVVVAEK 323
Query: 323 DVPKLDTILTSIPPEVILRKQRLLAN-PSMKRAMLF-PQPAQPGDAFHQILNGL 374
D+PKL IL +IP + R +LAN ++++ L+ P+P + D FH IL+ +
Sbjct: 324 DIPKLKEILLAIP---LRRYLTMLANLKTVQKHFLWNPRPLR-YDLFHMILHSI 373
>gi|357138738|ref|XP_003570946.1| PREDICTED: probable glycosyltransferase At5g11130-like
[Brachypodium distachyon]
Length = 405
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 101/351 (28%), Positives = 162/351 (46%), Gaps = 54/351 (15%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
KVY+Y P + K + PR LT +A+E + + + S RT +P+ AD F+ PI P
Sbjct: 84 FKVYIY--PDG-DPKTFYQTPRKLTGKYASEGYFFQNIRESRFRTEDPDSADLFFVPISP 140
Query: 100 TCDLTPTGLPLPFKSPRMM-RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
G +++ ++ + ++ + + +PYWNRT GADHFFV HD G +
Sbjct: 141 H---KMRGKGTSYENMTIIVKDYVEGLINKYPYWNRTLGADHFFVTCHDVGV-------R 190
Query: 159 AIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY---APPQKMQAHQIPPDTPRSIF 215
A E LP + + ++ VC ++ P+ A PQ +Q +P
Sbjct: 191 AFEG--LPFMVKNSI-------RVVCSPSYNVDFIPHKDVALPQVLQPFALPKGGN---- 237
Query: 216 VYFRGLFYDVNNDPEGGYYA-------RGARAAVWEN-----FKNNPLFDISTDHPTTYY 263
DV N G++A R A VWEN NN + + Y
Sbjct: 238 --------DVENRTNLGFWAGHRNSKIRVILARVWENDTELAISNNRISRAIGE--LVYQ 287
Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
+ R FC+CP G S R+ +++ +GC+PVI++D LPF D + W++ + + E D
Sbjct: 288 KQFYRTKFCICPGGSQVNSARISDSIHYGCVPVILSDYYDLPFNDVLDWKKFAIVLKERD 347
Query: 324 VPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
V +L +IL SI E + L +++ ++ P P DAFH ++ L
Sbjct: 348 VYELKSILKSISQEEFVALHNSLVQ--IQKHFVWHSPPIPYDAFHMVMYEL 396
>gi|357125316|ref|XP_003564340.1| PREDICTED: probable glycosyltransferase At5g03795-like
[Brachypodium distachyon]
Length = 553
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 85/268 (31%), Positives = 129/268 (48%), Gaps = 33/268 (12%)
Query: 118 MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ-T 176
+++ I +IS+ +PYWNRT+GADHFFV HD+G + + + R + L A L +
Sbjct: 297 VKNYIDMISAKFPYWNRTKGADHFFVACHDWGP-YTTKLHDELRRNTIKALCNADLSEGV 355
Query: 177 FGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRG----------LFYDVN 226
F + V L E + +P + ++ P T RSI +F G L Y
Sbjct: 356 FIRGRDVSLPETFVR----SPRRPLRDIGGKPATERSILAFFAGQMHGRVRPILLQYWGG 411
Query: 227 NDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLV 286
D + Y R R Y + M+ + +C+CP+G+ SPR+V
Sbjct: 412 KDTDMRIYGRLPRRITRR---------------MNYVQHMKSSKYCICPMGYEVNSPRIV 456
Query: 287 EAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLL 346
EA+ + CIPVIIAD+ VLPF DA+ W + V E+DVP+L IL IP + Q +
Sbjct: 457 EAIYYECIPVIIADNFVLPFDDALDWSTFSVVVPEKDVPRLKEILLRIPESRYITMQSNV 516
Query: 347 ANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+++ L+ D FH IL+ +
Sbjct: 517 KK--VQKHFLWHAKPVKYDIFHMILHSV 542
>gi|159470363|ref|XP_001693329.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158277587|gb|EDP03355.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 626
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 111/369 (30%), Positives = 167/369 (45%), Gaps = 48/369 (13%)
Query: 39 KLKVYVYELP------SKYNKKLLLKDPRCLTHMFAA-EIFMHRFLLSSPVRTLNPEEAD 91
KLK+Y+Y+LP YN +DP M++A E+FM FL + RT NP EA+
Sbjct: 242 KLKIYMYDLPWDVAFPGAYNDGFFGRDP-----MYSAYELFMRYFLQDNVTRTENPWEAN 296
Query: 92 WFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGAC 151
FY P+ + +P AI + S WP+W+R+ G DHF+ + D G C
Sbjct: 297 LFYVPMLLYFYIGNVRDAVP-----QTAWAINHVRSRWPFWDRSGGRDHFYFMTGDRGTC 351
Query: 152 FHYQE--EKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPD 209
+E ++AI+ + T +++ C+ + PP M A +P D
Sbjct: 352 HLPRELQDQAIKVVHWGMQVAGTDWIGLDNKDYACIQLKRDLV---VPPINMFAEILPTD 408
Query: 210 TPRSI-FVYFRG--------LFYDVNNDPEGGYYARGARAAVWE--------NFKNNPLF 252
T + V G LF+ + Y+ G R A+ E N + P
Sbjct: 409 TVKHYQTVVANGGQDFGRTLLFFFAGGIAQSMEYSGGTRQAIKELLTSVHIANGNSTPAD 468
Query: 253 DISTDHPTTYYEDMQ-RAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
+ + T Y+ + + FC+ P G+ W RLV+A+ FGCIPVII D + F D +P
Sbjct: 469 VVFVEGRTQEYKKLLLTSKFCIAPYGFG-WGLRLVQAIEFGCIPVIIQDHVYQAFEDFLP 527
Query: 312 WEEIGMFVAEEDVPKLDTILTSIPPE--VILRKQRLLANPSMKRAMLFPQPAQPGDAFHQ 369
+EE + + DVP+L IL S PE LR L RA ++ + G+A+
Sbjct: 528 YEEFSVRLPLRDVPRLLDILRSYSPEQQAALR----LGMAKYYRAFVWHREFG-GEAYEW 582
Query: 370 ILNGLARKL 378
L GL R+L
Sbjct: 583 TLAGLQRRL 591
>gi|147775379|emb|CAN73458.1| hypothetical protein VITISV_022180 [Vitis vinifera]
Length = 333
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 83/296 (28%), Positives = 135/296 (45%), Gaps = 30/296 (10%)
Query: 87 PEEADWFYTPIYPTCDLTPTGLPLPFKS---PRMMRSAIQLISSNWPYWNRTEGADHFFV 143
P+EA+ FY P+ T + P + PR++ I ++ +PYWNR++GADHF V
Sbjct: 49 PDEANAFYIPMSLTKIVHFIYEPPHYYGKWIPRLVTDYINFVADKYPYWNRSKGADHFLV 108
Query: 144 VPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNH----VCLNEGSITIPPYAPPQ 199
HD+ + A++ + RA +R H + + E +I PP
Sbjct: 109 SCHDWAP-----DVSALKPDLYKHFIRALCNANTSERFHPIRDISIPEINIPXGKLGPPH 163
Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGY-YARGARAAVWENFKNNPLFDISTDH 258
Q P R I +F G G + Y R W+ +
Sbjct: 164 LDQ-----PPNKRPILAFFAG----------GAHGYVRSVLFKYWKEKDDEVQVFERLPG 208
Query: 259 PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
Y + M + FCLCP G+ SPR+VEA+ GC+P+II D LPF+D + W + ++
Sbjct: 209 NRNYSKSMGDSKFCLCPSGYEVASPRIVEAIAAGCVPMIICDHYSLPFSDVLDWSKFSIY 268
Query: 319 VAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+ + +P++ IL ++P E L Q+ + ++R +PA+P D H IL+ +
Sbjct: 269 ITSDKIPEIKKILKAVPTETYLEMQKRVKQ--VQRHFAINRPARPYDMLHMILHSV 322
>gi|449531189|ref|XP_004172570.1| PREDICTED: probable glycosyltransferase At5g03795-like, partial
[Cucumis sativus]
Length = 344
Score = 124 bits (312), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 105/348 (30%), Positives = 167/348 (47%), Gaps = 38/348 (10%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAE-IFMHRFLLSSPVRTLNPEEADWFYTPIY 98
LKVY+Y S+ + P L ++A+E FM + T +PE+A FY Y
Sbjct: 9 LKVYIYRDGSR----PIFHTPH-LRGIYASEGWFMKLMEENRQFVTKDPEKAHLFYLA-Y 62
Query: 99 PTCDLTPTGLPLPFKSPRM------MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
+ L T L +P S M +R + I+ +PYWNRT G DHF V HD+G +
Sbjct: 63 SSRQLQ-TALYVP-DSHNMKPLSIYLRDHVNWIAGKYPYWNRTHGYDHFLVACHDWGP-Y 119
Query: 153 HYQEEKAIERGILPLLQRATLVQ-TFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTP 211
E + + + + L A L + F V L E +I P P + + ++ +
Sbjct: 120 TVNEHRELSQHTIKALCNADLSEGVFKLGKDVSLPETTIRTPR-KPLRNVGGKRV---SQ 175
Query: 212 RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPL-----FDISTDHPTTYYEDM 266
R I +F G + R W N K++ + + TY + M
Sbjct: 176 RPILAFFAGNMHG---------RVRPILLKHW-NDKDDDIRVYGPLPLRVSRKMTYIQHM 225
Query: 267 QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK 326
+ + +C+CP+G+ SPR++EA+ + C+PVIIAD+ VLPF++ + W + VAE+D+PK
Sbjct: 226 KSSKYCICPMGYEVNSPRIIEAIYYECVPVIIADNFVLPFSEFLDWSAFSVVVAEKDIPK 285
Query: 327 LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
L ILT+IP + L Q + K + P+P + D FH +L+ +
Sbjct: 286 LKEILTAIPLKRYLTMQ-INVKMVQKHFLWNPKPLK-YDLFHMVLHSI 331
>gi|449469432|ref|XP_004152424.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
sativus]
Length = 472
Score = 124 bits (312), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 105/348 (30%), Positives = 167/348 (47%), Gaps = 38/348 (10%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAE-IFMHRFLLSSPVRTLNPEEADWFYTPIY 98
LKVY+Y S+ + P L ++A+E FM + T +PE+A FY Y
Sbjct: 137 LKVYIYRDGSR----PIFHTPH-LRGIYASEGWFMKLMEENRQFVTKDPEKAHLFYLA-Y 190
Query: 99 PTCDLTPTGLPLPFKSPRM------MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
+ L T L +P S M +R + I+ +PYWNRT G DHF V HD+G +
Sbjct: 191 SSRQLQ-TALYVP-DSHNMKPLSIYLRDHVNWIAGKYPYWNRTHGYDHFLVACHDWGP-Y 247
Query: 153 HYQEEKAIERGILPLLQRATLVQ-TFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTP 211
E + + + + L A L + F V L E +I P P + + ++ +
Sbjct: 248 TVNEHRELSQHTIKALCNADLSEGVFKLGKDVSLPETTIRTPR-KPLRNVGGKRV---SQ 303
Query: 212 RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPL-----FDISTDHPTTYYEDM 266
R I +F G + R W N K++ + + TY + M
Sbjct: 304 RPILAFFAGNMHG---------RVRPILLKHW-NDKDDDIRVYGPLPLRVSRKMTYIQHM 353
Query: 267 QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK 326
+ + +C+CP+G+ SPR++EA+ + C+PVIIAD+ VLPF++ + W + VAE+D+PK
Sbjct: 354 KSSKYCICPMGYEVNSPRIIEAIYYECVPVIIADNFVLPFSEFLDWSAFSVVVAEKDIPK 413
Query: 327 LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
L ILT+IP + L Q + K + P+P + D FH +L+ +
Sbjct: 414 LKEILTAIPLKRYLTMQ-INVKMVQKHFLWNPKPLK-YDLFHMVLHSI 459
>gi|359484706|ref|XP_002266646.2| PREDICTED: probable glycosyltransferase At5g20260-like [Vitis
vinifera]
Length = 480
Score = 124 bits (312), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 83/294 (28%), Positives = 135/294 (45%), Gaps = 26/294 (8%)
Query: 87 PEEADWFYTPIYPTCDLTPTGLPLPFKS---PRMMRSAIQLISSNWPYWNRTEGADHFFV 143
P+EA+ FY P+ T + P + PR++ I ++ +PYWNR++GADHF V
Sbjct: 196 PDEANAFYIPMSLTKIVHFIYEPPHYYGKWIPRLVTDYINFVADKYPYWNRSKGADHFLV 255
Query: 144 VPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQA 203
HD+ + A++ + RA +R H + I+IP P+
Sbjct: 256 SCHDWAP-----DVSALKPDLYKHFIRALCNANTSERFHPIRD---ISIPEINIPRGKLG 307
Query: 204 --HQIPPDTPRSIFVYFRGLFYDVNNDPEGGY-YARGARAAVWENFKNNPLFDISTDHPT 260
H P R I +F G G + Y R W+ +
Sbjct: 308 PPHLDQPPNKRPILAFFAG----------GAHGYVRSVLFKYWKEKDDEVQVFERLPGNR 357
Query: 261 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
Y + M + FCLCP G+ SPR+VEA+ GC+P+II D LPF+D + W + +++
Sbjct: 358 NYSKSMGDSKFCLCPSGYEVASPRIVEAIAAGCVPMIICDHYSLPFSDVLDWSKFSIYIT 417
Query: 321 EEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+ +P++ IL ++P E L Q+ + ++R +PA+P D H IL+ +
Sbjct: 418 SDKIPEIKKILKAVPTETYLEMQKRVKQ--VQRHFAINRPARPYDMLHMILHSV 469
>gi|15233924|ref|NP_195005.1| Exostosin family protein [Arabidopsis thaliana]
gi|3063691|emb|CAA18582.1| putative protein [Arabidopsis thaliana]
gi|7270226|emb|CAB79996.1| putative protein [Arabidopsis thaliana]
gi|332660719|gb|AEE86119.1| Exostosin family protein [Arabidopsis thaliana]
Length = 593
Score = 124 bits (311), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 102/349 (29%), Positives = 167/349 (47%), Gaps = 42/349 (12%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEI-FMHRFLLSSPVRTLNPEEADWFYTPI 97
KLKVYVY + K+ +L P L ++A+E FM + S T +P +A FY P
Sbjct: 266 KLKVYVY----REGKRPVLHKP-VLKGIYASEGWFMKQLKSSRTFVTKDPRKAHLFYLPF 320
Query: 98 YPTCDLTPTGLPLPFKSP-----RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
+ + L +P + +++ + +ISS + +WN+T G+DHF V HD+
Sbjct: 321 --SSKMLEETLYVPGSHSDKNLIQFLKNYLDMISSKYSFWNKTGGSDHFLVACHDWAPS- 377
Query: 153 HYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPR 212
E + + L + + + F V L E +I +P + ++A P + R
Sbjct: 378 ---ETRQYMAKCIRALCNSDVSEGFVFGKDVALPETTILVPR----RPLRALGGKPVSQR 430
Query: 213 SIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN--PLFDISTDHPTT-----YYED 265
I +F G + GY R + +N+ N P I ++ P + Y E
Sbjct: 431 QILAFFAGGMH--------GY----LRPLLLQNWGGNRDPDMKIFSEIPKSKGKKSYMEY 478
Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
M+ + +C+CP G SPR+VEA+ + C+PVII+D+ V PF + + WE +FV E+D+P
Sbjct: 479 MKSSKYCICPKGHEVNSPRVVEALFYECVPVIISDNFVPPFFEVLNWESFAVFVLEKDIP 538
Query: 326 KLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
L IL SI E R+ ++ K + +P + D FH IL+ +
Sbjct: 539 DLKNILVSITEER-YREMQMRVKMVQKHFLWHSKPER-FDIFHMILHSI 585
>gi|297804498|ref|XP_002870133.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297315969|gb|EFH46392.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 540
Score = 124 bits (311), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 102/346 (29%), Positives = 160/346 (46%), Gaps = 33/346 (9%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAE-IFMHRFLLSSPVRTLNPEEADWFYTPIY 98
LKVY+Y P +K + P L ++A+E FM ++ T NPE A FY P
Sbjct: 198 LKVYIY--PD--GEKPIFHQPH-LNGIYASEGWFMKLMESNTQFVTKNPERAHLFYMPYS 252
Query: 99 PTCDLTPTGLPLPFKSPRM---MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQ 155
T +P + +R + ++S+ +P+WNRT G+DHF V HD+G +
Sbjct: 253 VKQLQTSIFVPGSHNIKPLSIFLRDYVNMLSTKYPFWNRTHGSDHFLVACHDWGP-YTVN 311
Query: 156 EEKAIERGILPLLQRATLVQ-TFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSI 214
E + R + L A L F V L E SI +++ + R I
Sbjct: 312 EHPELRRNTIKALCNADLADGIFIPGKDVSLPETSIRNAGKPLRNIGNGNRV---SQRPI 368
Query: 215 FVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN-----PLFDISTDHPTTYYEDMQRA 269
+F G + R W N ++ PL + TY + M+ +
Sbjct: 369 LAFFAGNLHG---------RVRPKLLKHWRNKDDDMKIYGPLPH-NVARKMTYVQHMKSS 418
Query: 270 IFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDT 329
+CLCP+G+ SPR+VEA+ + C+PV+IAD+ +LPF+D + W + V E+++P+L
Sbjct: 419 KYCLCPMGYEVNSPRIVEAIYYECVPVVIADNFMLPFSDVLDWSAFSVVVPEKEIPRLKE 478
Query: 330 ILTSIPPEVILRKQRLLANPSM-KRAMLFPQPAQPGDAFHQILNGL 374
IL IP L+ Q +N M +R L+ + D FH IL+ +
Sbjct: 479 ILLEIPMRRYLKMQ---SNVKMVQRHFLWSPKPRKYDVFHMILHSI 521
>gi|413947133|gb|AFW79782.1| putative homeodomain-like transcription factor superfamily protein
[Zea mays]
Length = 544
Score = 124 bits (311), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 83/261 (31%), Positives = 123/261 (47%), Gaps = 19/261 (7%)
Query: 118 MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTF 177
+++ I LIS+ +PYWNRT+GADHFFV HD+G + + + + + L A L + F
Sbjct: 290 VKNYIDLISAKYPYWNRTKGADHFFVACHDWGP-YTTKLHDELRKNTIKALCNADLSEGF 348
Query: 178 GQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARG 237
R + P P + + P R+I +F G + R
Sbjct: 349 FIRGKDVSLPETFLRSPRRPLRDIGGR---PAAQRTILAFFAGQMHG---------RVRP 396
Query: 238 ARAAVWENFKNNPLFDISTDHPTT----YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGC 293
W N ++ H T Y + M+ + +C+CP+G+ SPR+VEA+ + C
Sbjct: 397 VLLKYWGNKDDDMRIYSRLPHRITRKRNYVQHMKSSKYCICPMGYEVNSPRIVEAIYYEC 456
Query: 294 IPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKR 353
+PVIIAD VLPF A+ W + V E DVPKL IL +IP + Q + K
Sbjct: 457 VPVIIADHFVLPFDAALNWSTFSVVVPESDVPKLKQILLAIPESRYITLQANVKR-VQKH 515
Query: 354 AMLFPQPAQPGDAFHQILNGL 374
M P P + D FH IL+ +
Sbjct: 516 FMWHPNPVK-YDIFHMILHSV 535
>gi|297802718|ref|XP_002869243.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297315079|gb|EFH45502.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 583
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 103/350 (29%), Positives = 166/350 (47%), Gaps = 44/350 (12%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEI-FMHRFLLSSPVRTLNPEEADWFYTPI 97
KLKVYVY + K+ +L P L ++A+E FM + S T +P +A FY P
Sbjct: 256 KLKVYVY----REGKRPVLHKP-VLKGIYASEGWFMKQLKSSRTFVTKDPRKAHLFYLPF 310
Query: 98 YPTCDLTPTGLPLPFKSP-----RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
+ + L +P + +++ + +ISS + +WN+T G+DHF V HD+
Sbjct: 311 --SSKMLEETLYVPGSHSDQNLIQFLKNYLDMISSKYNFWNKTGGSDHFLVACHDWAPS- 367
Query: 153 HYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPR 212
E + + L + + + F V L E +I +P + ++A P + R
Sbjct: 368 ---ETRQYMAKCIRALCNSDVSEGFVFGKDVALPETTILVPR----RPLRALGGKPVSQR 420
Query: 213 SIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN--PLFDISTDHPTT-----YYED 265
I +F G + GY R + N+ N P I ++ P + Y E
Sbjct: 421 QILAFFAGGMH--------GY----LRPLLLRNWGGNRDPDMKIFSEIPKSKGKKSYMEY 468
Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
M+ + FC+CP G SPR+VEA+ + C+PVII+D+ V PF + + WE +FV E+D+P
Sbjct: 469 MKSSKFCICPKGHEVNSPRVVEALFYECVPVIISDNFVPPFFEVLNWEAFAVFVLEKDIP 528
Query: 326 KLDTILTSIPPEVILRKQRLLANPSM-KRAMLFPQPAQPGDAFHQILNGL 374
L IL SI E R + + M ++ L+ + D FH IL+ +
Sbjct: 529 DLKNILVSITEE---RYREMQTRVKMVQKHFLWHSKPERFDIFHMILHSI 575
>gi|384251887|gb|EIE25364.1| exostosin-like glycosyltransferase [Coccomyxa subellipsoidea C-169]
Length = 705
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 102/366 (27%), Positives = 162/366 (44%), Gaps = 70/366 (19%)
Query: 42 VYVYELPSKYNKKLL---LKDPRCLTHMFAA-------------EIFMHRFLLSSPVRTL 85
+Y+ ELP YN ++L + C F + E +H LL SP RTL
Sbjct: 276 IYMVELPPIYNSRMLQYRIHKDTCTWRGFDSGNASFITDWTYQIEPALHEMLLQSPHRTL 335
Query: 86 NPEEADWFYTPIYPTCDLTPT------------GLPLPFKSPRMMRSAIQLISSNWPYWN 133
+PE AD+FY P+Y +C + P G P + MM A + + + PYWN
Sbjct: 336 DPEAADFFYVPVYTSCFIHPVYGWADTPWFHNPGSPRVMHAATMMLEAKRWLETELPYWN 395
Query: 134 RTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQT----------------- 176
RT G DH +++ HD G+C+ E ++ IL R L
Sbjct: 396 RTGGRDHIWLISHDEGSCWAPSEIRS--SIILSHWGRKALDHESYSAYPFDNYSDNAVHP 453
Query: 177 ----FGQRN----HVCLN-EGSITIPPYAPPQKMQAHQIPP--DTPRSIFVYFRGLFYDV 225
G R+ H C + + + IP + PP ++ + + PR + ++FRG +
Sbjct: 454 EWRPHGWRHIIEGHPCYDPDKDLIIPAFVPPARIVPSPLTGAREDPRPLLLFFRGDV-GL 512
Query: 226 NNDPEGGYYARGARAAVW-----ENFKNNPLFDIST--DHPTTYYEDMQRAIFCLCPLGW 278
N P +Y+RG R ++ + ++ I T D P Y E + + FCL G
Sbjct: 513 NRRP---HYSRGIRQRIYALSKEQRWREKYRIWIGTKEDTPGGYSELLSSSKFCLVVPG- 568
Query: 279 APWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEV 338
WSPR +A++ GC+PV++ D + F + WEE + + E ++ L IL SI P
Sbjct: 569 DGWSPRAEDAMLHGCVPVVVNDGVDQVFETLLDWEEFAVRIPEREMEFLPEILLSISPSR 628
Query: 339 ILRKQR 344
+ + Q+
Sbjct: 629 LQQLQK 634
>gi|357440259|ref|XP_003590407.1| hypothetical protein MTR_1g061760 [Medicago truncatula]
gi|355479455|gb|AES60658.1| hypothetical protein MTR_1g061760 [Medicago truncatula]
Length = 508
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 91/322 (28%), Positives = 151/322 (46%), Gaps = 51/322 (15%)
Query: 72 FMHRFLLSSPVRTLNPEEADWFYTP-----------IYPTCDLTPTGLPLPFKSPRMMRS 120
F+H L+ RT +P++A ++ P + + D P + +
Sbjct: 206 FIHAIELNDQFRTRDPQKAHVYFLPFSVVMLVRFVYLRDSRDFGPIR--------KTVTD 257
Query: 121 AIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR 180
I +I+ +PYWNR+ GADHF + HD+G + +P L + ++
Sbjct: 258 YINVIAGKYPYWNRSLGADHFMLACHDWGPETSFS---------VPYLHKNSI------- 301
Query: 181 NHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARG--- 237
V N T + P + + +I T SI + GL + P ++A G
Sbjct: 302 -RVLCNAN--TSERFNPAKDVSFPEINLQTG-SINGFLGGL--SASKRPILAFFAGGLHG 355
Query: 238 -ARAAVWENFKNNPLFDISTDH----PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFG 292
RA + E+++NN D+ +YYE ++++ FCLCP G+ SPR+VEA+ G
Sbjct: 356 HIRAILLEHWENNKDQDMMIQKYLPKGVSYYEMLRKSKFCLCPSGYEVASPRIVEAIYTG 415
Query: 293 CIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMK 352
C+PV+I+D V PF+D + W+ + ++ ED+PKL IL I P +R QR + ++
Sbjct: 416 CVPVLISDHYVPPFSDVLNWKSFSVEISVEDIPKLKDILMRISPTQYIRMQRRVVQ--IR 473
Query: 353 RAMLFPQPAQPGDAFHQILNGL 374
R P + D FH IL+ +
Sbjct: 474 RHFEVHSPPKRFDVFHMILHSV 495
>gi|125538390|gb|EAY84785.1| hypothetical protein OsI_06153 [Oryza sativa Indica Group]
Length = 332
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 101/355 (28%), Positives = 163/355 (45%), Gaps = 56/355 (15%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
KVY+Y P + K + PR LT +A+E + + + S RT +P++A F+ PI P
Sbjct: 5 FKVYMY--PDG-DPKTFYQTPRKLTGKYASEGYFFQNIRESRFRTGDPDKAHLFFVPISP 61
Query: 100 TCDLTPTGLPLPFKSPRMM-RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
G +++ ++ + ++ + + +PYWNRT GADHFFV HD G +
Sbjct: 62 H---KMRGKGTSYENMTIIVKDYVEGLINKYPYWNRTLGADHFFVTCHDVGV-------R 111
Query: 159 AIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY---APPQKMQAHQIPPDTPRSIF 215
A E LP + + ++ VC ++ P+ A PQ +Q +P
Sbjct: 112 AFEG--LPFMVKNSI-------RVVCSPSYNVDFIPHKDIALPQVLQPFALPEGGN---- 158
Query: 216 VYFRGLFYDVNNDPEGGYYA-------RGARAAVWEN-----FKNNPLFDISTDHPTTYY 263
DV N G++A R A VWEN NN + + Y
Sbjct: 159 --------DVENRTILGFWAGHRNSKIRVILAQVWENDTELAISNNRISRAIGE--LVYQ 208
Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
+ R FC+CP G S R+ +++ +GC+PVI++D LPF D + W + + + E D
Sbjct: 209 KQFYRTKFCICPGGSQVNSARISDSIHYGCVPVILSDYYDLPFNDILDWRKFAVVLKERD 268
Query: 324 VPKLDTILTSIPPEVILRKQRLLAN----PSMKRAMLFPQPAQPGDAFHQILNGL 374
V +L +IL SI E + + L P +++ ++ P P DAFH ++ L
Sbjct: 269 VYQLKSILKSISQEEFVELHKSLVQNISCPQVQKHFVWHSPPLPYDAFHMVMYEL 323
>gi|147789666|emb|CAN71921.1| hypothetical protein VITISV_031190 [Vitis vinifera]
Length = 310
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 82/295 (27%), Positives = 135/295 (45%), Gaps = 26/295 (8%)
Query: 86 NPEEADWFYTPIYPTCDLTPTGLPLPFKS---PRMMRSAIQLISSNWPYWNRTEGADHFF 142
+P+EA+ FY P+ T + P + PR++ I ++ +PYWNR++GADHF
Sbjct: 25 HPDEANAFYIPMSLTRVVHFIYEPPHYXGKWIPRLVXDYINFVADKYPYWNRSKGADHFL 84
Query: 143 VVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQ 202
V HD+ + A++ + RA +R H + I+IP P+
Sbjct: 85 VSCHDWAP-----DVSALKPDLYKHFIRALCNANTSERFHPIRD---ISIPEINIPRGKL 136
Query: 203 A--HQIPPDTPRSIFVYFRGLFYDVNNDPEGGY-YARGARAAVWENFKNNPLFDISTDHP 259
H P R I +F G G + Y R W+ +
Sbjct: 137 GPPHLDQPPNKRPILAFFAG----------GAHGYVRSVLFKYWKEKDDEVQVFERLPRN 186
Query: 260 TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV 319
Y + M + FCLCP G+ SPR+V+A+ GC+P+II D LPF+D + W + +++
Sbjct: 187 RNYSKSMGDSKFCLCPSGYEVASPRIVKAIAAGCVPMIICDHYSLPFSDXLDWSKFSIYI 246
Query: 320 AEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+ +P++ IL ++P E L Q+ + ++R +PA P D H IL+ +
Sbjct: 247 TSDKIPEIKKILKAVPTETYLEMQKRVKQ--VQRHFAINRPAXPYDMLHMILHSV 299
>gi|115434052|ref|NP_001041784.1| Os01g0107700 [Oryza sativa Japonica Group]
gi|52076197|dbj|BAD44851.1| pectin-glucuronyltransferase -like [Oryza sativa Japonica Group]
gi|52076236|dbj|BAD44890.1| pectin-glucuronyltransferase -like [Oryza sativa Japonica Group]
gi|113531315|dbj|BAF03698.1| Os01g0107700 [Oryza sativa Japonica Group]
gi|215697737|dbj|BAG91731.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222617591|gb|EEE53723.1| hypothetical protein OsJ_00064 [Oryza sativa Japonica Group]
Length = 550
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 80/259 (30%), Positives = 129/259 (49%), Gaps = 15/259 (5%)
Query: 118 MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ-T 176
++ I IS+ +PYWNRT+GADHFFV HD+G + + + + + L A L +
Sbjct: 294 VKKYIDFISTKFPYWNRTKGADHFFVACHDWGP-YTTKLHDELRKNTIKALCNADLSEGV 352
Query: 177 FGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYA- 235
F V L E + +P + ++ P RSI +F G + Y+
Sbjct: 353 FIHGRDVSLPETFLR----SPRRPLRGIGGKPAAERSILAFFAGQMHGRVRPVLLQYWGG 408
Query: 236 RGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIP 295
+ A +++ + Y + M+ + +C+CP+G+ SPR+VEA+ + C+P
Sbjct: 409 KDADMRIYDRLPHR------ITRRMNYIQHMKSSKYCICPMGYEVNSPRIVEAIYYECVP 462
Query: 296 VIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAM 355
VIIAD+ VLPF DA+ W + + E+DVPKL IL +IP + + Q + K +
Sbjct: 463 VIIADNFVLPFDDALNWSAFSVVIPEKDVPKLKQILLAIPDDQYMAMQSNVQR-VQKHFI 521
Query: 356 LFPQPAQPGDAFHQILNGL 374
P P + D FH IL+ +
Sbjct: 522 WHPNPIK-YDIFHMILHSI 539
>gi|224068931|ref|XP_002326234.1| predicted protein [Populus trichocarpa]
gi|222833427|gb|EEE71904.1| predicted protein [Populus trichocarpa]
Length = 328
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 95/316 (30%), Positives = 141/316 (44%), Gaps = 40/316 (12%)
Query: 72 FMHRFLLSSPVRTLNPEEADWFYTP---------IYP--TCDLTPTGLPLPFKSPRMMRS 120
F+HR + RT +P++A ++ P +Y + D P R +
Sbjct: 33 FIHRMEIDGHFRTKDPDKAHVYFLPFSVAMMVRFVYERESRDFGPIR--------RTVSD 84
Query: 121 AIQLISSNWPYWNRTEGADHFFVVPHDFG--ACFHYQEEKAIERGILPLLQRATLVQTFG 178
I LIS +P+WNR+ GADHF + HD+G A F I + L A + F
Sbjct: 85 YINLISGKYPFWNRSLGADHFMLACHDWGPEASFSVPHLGKIS---IRALCNANTSEKFN 141
Query: 179 QRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGA 238
V L E ++ K + P + RSI +F G + R
Sbjct: 142 PIKDVSLPEINLRTGSI----KGFVGGLSP-SKRSILAFFAGRLHGP---------IRPV 187
Query: 239 RAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 298
WEN ++ +YYE M+ + FCLCP G+ SPR+VEA+ GC+PV+I
Sbjct: 188 VLEHWENKDDDIKVHQQLPKGVSYYEMMRGSKFCLCPSGYEVASPRIVEALYAGCVPVLI 247
Query: 299 ADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFP 358
+D V PF+D + W+ + V D+P L ILTSI P +R QR + ++R
Sbjct: 248 SDHYVPPFSDVLNWKSFSVEVPVSDIPSLKKILTSISPRQYIRMQRRVLQ--VRRHFEVN 305
Query: 359 QPAQPGDAFHQILNGL 374
P + D FH IL+ +
Sbjct: 306 SPPKRFDVFHMILHSI 321
>gi|357478261|ref|XP_003609416.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
gi|355510471|gb|AES91613.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
Length = 380
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 85/306 (27%), Positives = 141/306 (46%), Gaps = 33/306 (10%)
Query: 80 SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLP-FKSPRMMRSAI---QLISSNWPYWNRT 135
SP T NP+EA F P+ T + PL + ++M I +I+ + YWNR+
Sbjct: 86 SPFSTHNPDEAHVFMLPLSVTNMVHYLYNPLTTYSRDQIMHVTIDYTNIIAHKYRYWNRS 145
Query: 136 EGADHFFVVPHDFGACFHYQEE-KAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPP 194
+GADH V HD+ + K + + ++ +L A + F + V +
Sbjct: 146 KGADHLLVSCHDWAPEISRESSGKELFKNLIKVLCNANTSEGFDPKRDVSM--------- 196
Query: 195 YAPPQKMQAHQIPPDTP------RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN 248
P +Q +++ P RSI +F G E G R W+
Sbjct: 197 --PEMNLQGYKLSSPIPSKESNNRSILAFFAG--------GEHGM-IRKTLLDQWKGKDK 245
Query: 249 NPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
L Y++ M ++ FCLCP G+ SPRLVE++ GC+PVI++D+ LPF+D
Sbjct: 246 EVLVYEYLPKKLKYFKLMGKSKFCLCPSGYEVASPRLVESINTGCVPVIVSDNYQLPFSD 305
Query: 309 AIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFH 368
+ W + + + + + ++ TIL S+P L+ R + ++R + PA+P D FH
Sbjct: 306 VLDWSKFSLHIPSKRISEIKTILKSVPHARYLKLHRRVLK--VQRHFVLNPPAKPFDVFH 363
Query: 369 QILNGL 374
IL+ +
Sbjct: 364 MILHSI 369
>gi|159466264|ref|XP_001691329.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158279301|gb|EDP05062.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 815
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 104/375 (27%), Positives = 160/375 (42%), Gaps = 82/375 (21%)
Query: 42 VYVYELPSKYNKKLL-LKDPR--CLTHMFAA-------------EIFMHRFLLSSPVRTL 85
+YVY+LP+ YN ++L ++ R C F + E+ MH LL S RT
Sbjct: 370 IYVYDLPAAYNSRMLQYRNDRGMCTWRAFGSRNHTETFAWTYGLEVLMHEMLLQSEHRTF 429
Query: 86 NPEEADWFYTPIYPTCDL-------------TPTGLPLPFKSPRMMRSAIQLISSNWPYW 132
+PE AD+FY P+Y +C + P+G P MM LI ++PYW
Sbjct: 430 DPEAADFFYVPVYGSCFIFPLHCYADGPWWYAPSG-PRVMHVTNMMLEVRDLIRKHFPYW 488
Query: 133 NRTEGADHFFVVPHDFGACF-------------HYQEEKAIERG-------------ILP 166
+R G DH +++ HD GAC+ H+ + + + P
Sbjct: 489 DRRGGRDHIWLMTHDEGACYAPSDIYGSSIFLTHWGRRDSDHKSNTAFTPDNYTQEYVHP 548
Query: 167 LLQRATLVQTFGQRNHVCLNEG-SITIPPYAPPQKMQAHQI--PPDTPRSIFVYFRGLFY 223
+ + L G H C G + +P P + P PR I +Y RG
Sbjct: 549 MQDKGWLHLIEG---HPCYTPGKDLIVPALKLPHHFSGSPLLFHPPRPRDILLYLRG--- 602
Query: 224 DVNND--PEGGYYARGARAAV--------WENFKNNPLFDISTDHPTTYYEDMQRAIFCL 273
DV P Y+RG R + W+N K N + +D P Y E + + FC+
Sbjct: 603 DVGKHRLPN---YSRGIRQRLYKLWKDHDWQN-KYNAMIGDGSDVPGGYSEHLASSKFCV 658
Query: 274 CPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV--AEEDVPKLDTIL 331
G WS RL +AV+ GC+PVI+ D++ F + + + + V AE ++ L L
Sbjct: 659 VAPG-DGWSARLEDAVLHGCVPVIVMDNVSAVFEEQLDFNSFSIRVGEAEAELAYLPERL 717
Query: 332 TSIPPEVILRKQRLL 346
S+PP ++ Q+ L
Sbjct: 718 KSVPPRILEGMQKKL 732
>gi|212721192|ref|NP_001132213.1| exostosin-like protein [Zea mays]
gi|194693780|gb|ACF80974.1| unknown [Zea mays]
gi|413935957|gb|AFW70508.1| exostosin-like protein [Zea mays]
Length = 403
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 100/350 (28%), Positives = 159/350 (45%), Gaps = 52/350 (14%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
KVY+Y P + K + PR LT +A+E + + + S RT +P++A F+ PI P
Sbjct: 82 FKVYIY--PDG-DPKTFYQTPRKLTGKYASEGYFFQNIRESRFRTDDPDQAHLFFVPISP 138
Query: 100 TCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKA 159
+ G +++ ++ + + +PYWNRT GADHFFV HD G +A
Sbjct: 139 H-KMRGKGTSYE-NMTVIVKDYVEGLINKYPYWNRTLGADHFFVTCHDVGV-------RA 189
Query: 160 IERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY---APPQKMQAHQIPPDTPRSIFV 216
E LP + + ++ VC ++ P+ A PQ +Q +P
Sbjct: 190 FEG--LPFMVKNSI-------RVVCSPSYNVDFIPHKDIALPQVLQPFALPEGGN----- 235
Query: 217 YFRGLFYDVNNDPEGGYYA-------RGARAAVWEN-----FKNNPLFDISTDHPTTYYE 264
D+ N G++A R A VWEN NN + + Y +
Sbjct: 236 -------DIENRTTLGFWAGHRNSKIRVILARVWENDTELAISNNRISRAIGE--LVYQK 286
Query: 265 DMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDV 324
R FC+CP G S R+ +++ +GC+PVI++D LPF DA+ W + + + E DV
Sbjct: 287 QFYRTKFCICPGGSQVNSARISDSIHYGCVPVILSDYYDLPFNDALDWRKFAVILRERDV 346
Query: 325 PKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+L IL SI E + + L +++ ++ P DAFH I+ L
Sbjct: 347 YQLKNILKSISQEEFVSLHKSLVQ--VQKHFVWHSPPVSYDAFHMIMYEL 394
>gi|359484716|ref|XP_002263848.2| PREDICTED: probable glycosyltransferase At5g20260-like [Vitis
vinifera]
gi|296084516|emb|CBI25537.3| unnamed protein product [Vitis vinifera]
Length = 477
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 85/304 (27%), Positives = 137/304 (45%), Gaps = 28/304 (9%)
Query: 80 SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSP---------RMMRSAIQLISSNWP 130
SP +P+EA F+ P+ + L LP SP R++ +++++ +P
Sbjct: 182 SPFIAKHPDEAHAFFLPL--SVVKVVQFLYLPITSPEDYSRKRLQRIVTDYVKVVADKYP 239
Query: 131 YWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSI 190
YWNR+ GADHF V HD+ Y + + + +L A + F V L E ++
Sbjct: 240 YWNRSGGADHFMVSCHDWAPSVSYANPELF-KNFIRVLCNANSSEGFRPGRDVSLPEVNL 298
Query: 191 TIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNP 250
PP Q P R + +F G + R W++ N
Sbjct: 299 PAGELGPPHLGQ-----PSNNRPVLAFFAGRAHG---------NIRKILFEHWKDQDNEV 344
Query: 251 LFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAI 310
L Y + M ++ FCLCP G+ SPR+VEA+ GC+PVII+++ LPF D +
Sbjct: 345 LVHERLHKGQNYAKLMGQSKFCLCPSGYEVASPRVVEAIHAGCVPVIISNNYSLPFNDVL 404
Query: 311 PWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQI 370
W + + + +P++ TIL I L+ Q + ++R + +PA+P D H I
Sbjct: 405 DWSQFSIQIPVAKIPEIKTILLGISKNKYLKMQERVLR--VRRHFVLNRPARPFDIIHMI 462
Query: 371 LNGL 374
L+ L
Sbjct: 463 LHSL 466
>gi|195624236|gb|ACG33948.1| exostosin-like [Zea mays]
Length = 403
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 100/350 (28%), Positives = 159/350 (45%), Gaps = 52/350 (14%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
KVY+Y P + K + PR LT +A+E + + + S RT +P++A F+ PI P
Sbjct: 82 FKVYIY--PDG-DPKTFYQTPRKLTGKYASEGYFFQNIRESRFRTDDPDQAHLFFVPISP 138
Query: 100 TCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKA 159
+ G +++ ++ + + +PYWNRT GADHFFV HD G +A
Sbjct: 139 H-KMRGKGTSYE-NMTVIVKDYVEGLINKYPYWNRTLGADHFFVTCHDVGV-------RA 189
Query: 160 IERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY---APPQKMQAHQIPPDTPRSIFV 216
E LP + + ++ VC ++ P+ A PQ +Q +P
Sbjct: 190 FEG--LPFMVKNSI-------RVVCSPSYNVDFIPHKDIALPQVLQPFALPEGGN----- 235
Query: 217 YFRGLFYDVNNDPEGGYYA-------RGARAAVWEN-----FKNNPLFDISTDHPTTYYE 264
D+ N G++A R A VWEN NN + + Y +
Sbjct: 236 -------DIENRTTLGFWAGHRNSKIRVILARVWENDTELAISNNRISRAIGE--LVYQK 286
Query: 265 DMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDV 324
R FC+CP G S R+ +++ +GC+PVI++D LPF DA+ W + + + E DV
Sbjct: 287 QFYRTKFCICPGGSQVNSARISDSIHYGCVPVILSDYYDLPFNDALDWRKFAVILRERDV 346
Query: 325 PKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+L IL SI E + + L +++ ++ P DAFH I+ L
Sbjct: 347 YQLKNILKSISQEEFVSLHKSLVQ--VQKHFVWHSPPVSYDAFHMIMYEL 394
>gi|356570806|ref|XP_003553575.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 537
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 93/356 (26%), Positives = 153/356 (42%), Gaps = 58/356 (16%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP-- 96
+ KV+VYE + + + C + F+H ++ RT +PEEA F+ P
Sbjct: 207 QFKVFVYE----EGEPPVFHNGPCKSIYSMEGNFIHAIEMNDQFRTRDPEEAHVFFLPFS 262
Query: 97 ---------IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHD 147
+ + D P + + + +I +PYWNR+ GADHF++ HD
Sbjct: 263 VAMLVQFVYVRDSHDFGPI--------KKTVTDYVNVIGGRYPYWNRSLGADHFYLACHD 314
Query: 148 FGACFHYQEEKAI---ERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAH 204
+G + ++I + + +L A + F V P +Q
Sbjct: 315 WGP----ETSRSIPNLNKNSIRVLCNANTSEGFKPSKDVSF-----------PEINLQTG 359
Query: 205 QI------PPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDH 258
I P + R + +F G + R WEN +
Sbjct: 360 SINGFIGGPSASRRPLLAFFAGGLHGP---------IRPVLLEHWENKDEDIQVHKYLPK 410
Query: 259 PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
+YYE ++++ FCLCP G+ SPR+VEA+ GC+PV+I+D V PF D + W+ +
Sbjct: 411 GVSYYEMLRKSKFCLCPSGYEVASPRVVEAIYTGCVPVLISDHYVPPFNDVLNWKSFSVE 470
Query: 319 VAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
V+ +D+P+L IL SI P +R QR + ++R P + D FH IL+ +
Sbjct: 471 VSVKDIPRLKEILLSISPRQYIRMQRRVG--QVRRHFEVHSPPKRYDVFHMILHSV 524
>gi|326519558|dbj|BAK00152.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 564
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 83/266 (31%), Positives = 126/266 (47%), Gaps = 30/266 (11%)
Query: 118 MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTF 177
+++ I +IS+ +PYWNRT+GADHFFV HD+G ++ + I L F
Sbjct: 307 VKNYIDMISAKFPYWNRTKGADHFFVACHDWGPYTTKLHDELRKNTIKALCNADVSEGVF 366
Query: 178 GQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRG---------LFYDVNND 228
+ V L E I + + ++ P RSI +F G L Y D
Sbjct: 367 IRGRDVSLPETYIK----SARRPVRDIGGKPAAERSILAFFAGQMHGRVRPVLKYWGGKD 422
Query: 229 PEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEA 288
+ Y+R R Y + M+ + +C+CP+G+ SPR+VEA
Sbjct: 423 TDMRIYSRIPRQITRR---------------MNYAKHMKSSKYCICPMGYEVNSPRIVEA 467
Query: 289 VVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLAN 348
+ + C+PVIIAD+ VLPF DA+ W + VAE+DVPKL IL +IP + + +
Sbjct: 468 IYYECVPVIIADNFVLPFDDALDWSAFSVVVAEKDVPKLKAILLAIPESRYITMRSNVKK 527
Query: 349 PSMKRAMLFPQPAQPGDAFHQILNGL 374
++R L+ D FH IL+ +
Sbjct: 528 --VQRHFLWHAKPVKYDIFHMILHSV 551
>gi|218187361|gb|EEC69788.1| hypothetical protein OsI_00072 [Oryza sativa Indica Group]
Length = 536
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 80/259 (30%), Positives = 127/259 (49%), Gaps = 15/259 (5%)
Query: 118 MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ-T 176
++ I IS+ +PYWNRT+GADHFFV HD+G + + + + + L A L +
Sbjct: 280 VKKYIDFISTKFPYWNRTKGADHFFVACHDWGP-YTTKLHDELRKNTIKALCNADLSEGV 338
Query: 177 FGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYA- 235
F V L E + +P + ++ P RSI +F G + Y+
Sbjct: 339 FIHGRDVSLPETFLR----SPRRPLRGIGGKPAAERSILAFFAGQMHGRVRPVLLQYWGG 394
Query: 236 RGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIP 295
+ A +++ + Y + M+ + +C+CP+G+ SPR+VEA+ + C+P
Sbjct: 395 KDADMRIYDRLPHR------ITRRMNYIQHMKSSKYCICPMGYEVNSPRIVEAIYYECVP 448
Query: 296 VIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAM 355
VIIAD+ VLPF DA W + + E+DVPKL IL IP + + Q + K +
Sbjct: 449 VIIADNFVLPFDDAFNWSAFSVVILEKDVPKLKQILLEIPDDQYMAMQSNVQR-VQKHFI 507
Query: 356 LFPQPAQPGDAFHQILNGL 374
P P + D FH IL+ +
Sbjct: 508 WHPNPIK-YDIFHMILHSI 525
>gi|359496494|ref|XP_002269459.2| PREDICTED: probable glycosyltransferase At5g03795-like [Vitis
vinifera]
Length = 554
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 93/306 (30%), Positives = 146/306 (47%), Gaps = 26/306 (8%)
Query: 84 TLNPEEADWFYTPIYPTCDLTPTGLPLPFKS-----PRMMRSAIQLISSNWPYWNRTEGA 138
T +P++A FY P Y L T L +P +R + +I++ +P+WNRT G+
Sbjct: 243 TRDPKKAHLFYLP-YSARQLE-TALYVPNSHNIRPLSIFLRDHVNMIAAKYPFWNRTHGS 300
Query: 139 DHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ-TFGQRNHVCLNEGSITIPPYAP 197
DHF V HD+G + E + + R + L A L + F V L E +I P P
Sbjct: 301 DHFLVACHDWGP-YTVNEHQELSRNTIKALCNADLSEGIFVAGKDVSLPETTIR-NPRRP 358
Query: 198 PQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYA-RGARAAVWENFKNNPLFDIST 256
+ + ++ + R I +F G + Y++ + ++ N
Sbjct: 359 LRNVGGRRV---SQRPILAFFAGNMHGRVRPTLLKYWSDKDEDMRIYGPLPNR------I 409
Query: 257 DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG 316
+Y + M+ + FC+CP+G+ SPR+VEA+ + C+PVIIAD+ V P D + W
Sbjct: 410 SRKMSYIQHMKSSRFCICPMGYEVNSPRIVEAIYYECVPVIIADNFVPPLNDVLDWTAFS 469
Query: 317 MFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSM--KRAMLFPQPAQPGDAFHQILNGL 374
+ VAE+D+PKL IL +IP L Q N M K + P+P + D FH IL+ +
Sbjct: 470 VIVAEKDIPKLKEILLAIPLRRYLVMQ---TNVKMVQKHFLWNPKPVR-YDLFHMILHSI 525
Query: 375 ARKLPH 380
L H
Sbjct: 526 WVVLKH 531
>gi|357485421|ref|XP_003612998.1| Exostosin-like protein [Medicago truncatula]
gi|355514333|gb|AES95956.1| Exostosin-like protein [Medicago truncatula]
Length = 415
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 101/352 (28%), Positives = 158/352 (44%), Gaps = 54/352 (15%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
K KVY+Y P + K + PR LT +A+E + + + S RTL+P+EA F+ PI
Sbjct: 93 KFKVYIY--PDG-DSKTFYQTPRKLTGKYASEGYFFQNIRESRFRTLDPDEAHLFFIPI- 148
Query: 99 PTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
+C ++++ ++ + S +PYWNRT GADHFFV HD G +
Sbjct: 149 -SCHKMRGKGTSYENMTIIVQNYVESLISKYPYWNRTLGADHFFVTCHDVGV-------R 200
Query: 159 AIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY---APPQKMQAHQIPPDTPRSIF 215
A E LPLL + ++ VC + P+ A PQ +Q +P
Sbjct: 201 ATEG--LPLLVKNSI-------RAVCSPSYDVGFIPHKDVALPQVLQPFALPAGGN---- 247
Query: 216 VYFRGLFYDVNNDPEGGYYA-------RGARAAVWENFKNNPLFDISTDHPT------TY 262
DV N G++A R A VWEN + DIS + + Y
Sbjct: 248 --------DVENRTSLGFWAGHRNSKIRVILARVWEN---DTELDISNNRISRATGHLVY 296
Query: 263 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
+ FC+CP G S R+ +++ +GCIPVI+++ LPF D + W + + + E
Sbjct: 297 QKRFYSTKFCICPGGSQVNSARIADSIHYGCIPVILSNYYDLPFNDILDWRKFAVVLKES 356
Query: 323 DVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
DV +L IL + + + L +++ + P DAFH ++ L
Sbjct: 357 DVYQLKQILKNKSQDEFVALHNNLV--KIQKHFQWNSPPVRYDAFHMVMYDL 406
>gi|224143179|ref|XP_002324871.1| predicted protein [Populus trichocarpa]
gi|222866305|gb|EEF03436.1| predicted protein [Populus trichocarpa]
Length = 334
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 88/301 (29%), Positives = 140/301 (46%), Gaps = 30/301 (9%)
Query: 83 RTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMR---SAIQLISSNWPYWNRTEGAD 139
+T +P A ++ P T + PL + M + ++++SS +P+WNRT+GAD
Sbjct: 45 KTRDPRRAHVYFMPFSVTWMVKYLYKPLTYDHTAMKQFVADYVRVVSSKYPFWNRTQGAD 104
Query: 140 HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSI---TIPP-- 194
HF + HD+G + I +L A + F R V L E + +PP
Sbjct: 105 HFMLSCHDWGPHASHGNPFLYNTSIR-VLCNANSSEGFSPRKDVSLPEIHLYGGNVPPKL 163
Query: 195 YAPPQKMQAHQIPPDTPRSIFVYFRGLFYD-VNNDPEGGYYARGARAAVWENFKNNPLFD 253
+PP P +PR +F G + + + R V+E +
Sbjct: 164 ISPP--------PATSPRPYLAFFSGGLHGPIRPILLDHWKGRDPDLQVYEYLPKD---- 211
Query: 254 ISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE 313
YY M R+ FCLCP G SPR+VEA+ C+PVI++D VLPF+D + WE
Sbjct: 212 ------LDYYSFMLRSKFCLCPSGHEVASPRIVEAIYAECVPVILSDHYVLPFSDVLRWE 265
Query: 314 EIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNG 373
+ V ++P+L +L S+P E R + L ++++ + QPA+ D FH IL+
Sbjct: 266 AFAIQVNVSEIPRLKEVLISVPEERYRRLKEGLR--AIRKHFVLNQPAKRFDVFHMILHS 323
Query: 374 L 374
+
Sbjct: 324 I 324
>gi|8953375|emb|CAB96648.1| putative protein [Arabidopsis thaliana]
Length = 336
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 89/301 (29%), Positives = 138/301 (45%), Gaps = 23/301 (7%)
Query: 79 SSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPF----KSPRMMRSAIQLISSNWPYWNR 134
+S + +PEEA FY P+ + P + +++ I LIS+ +PYWNR
Sbjct: 41 NSRFKAASPEEATVFYIPVGIVNIIRFVYRPYTSYARDRLQNIVKDYISLISNRYPYWNR 100
Query: 135 TEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPP 194
+ GADHFF+ HD+ + + + I L A + F V L E I IP
Sbjct: 101 SRGADHFFLSCHDWAPDVSAVDPELYKHFIRALCN-ANSSEGFTPMRDVSLPE--INIPH 157
Query: 195 YAPPQKMQAHQIPPDTPRSIFVYFRGLFY-DVNNDPEGGYYARGARAAVWENFKNNPLFD 253
Q H P R + +F G + DV + + V+EN
Sbjct: 158 ---SQLGFVHTGEPPQNRKLLAFFAGGSHGDVRKILFQHWKEKDKDVLVYENLPKT---- 210
Query: 254 ISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE 313
Y + M +A FCLCP GW SPR+VE++ GC+PVIIAD VLPF+D + W+
Sbjct: 211 ------MNYTKMMDKAKFCLCPSGWEVASPRIVESLYSGCVPVIIADYYVLPFSDVLNWK 264
Query: 314 EIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNG 373
+ + +P + IL +I E L QR + +++ + +P++P D H I++
Sbjct: 265 TFSVHIPISKMPDIKKILEAITEEEYLNMQRRVL--EVRKHFVINRPSKPYDMLHMIMHS 322
Query: 374 L 374
+
Sbjct: 323 I 323
>gi|334187609|ref|NP_196674.2| putative glycosyltransferase [Arabidopsis thaliana]
gi|292630779|sp|Q9LFP3.2|GLYT4_ARATH RecName: Full=Probable glycosyltransferase At5g11130
gi|332004254|gb|AED91637.1| putative glycosyltransferase [Arabidopsis thaliana]
Length = 480
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 89/301 (29%), Positives = 138/301 (45%), Gaps = 23/301 (7%)
Query: 79 SSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPF----KSPRMMRSAIQLISSNWPYWNR 134
+S + +PEEA FY P+ + P + +++ I LIS+ +PYWNR
Sbjct: 185 NSRFKAASPEEATVFYIPVGIVNIIRFVYRPYTSYARDRLQNIVKDYISLISNRYPYWNR 244
Query: 135 TEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPP 194
+ GADHFF+ HD+ + + + I L A + F V L E I IP
Sbjct: 245 SRGADHFFLSCHDWAPDVSAVDPELYKHFIRALCN-ANSSEGFTPMRDVSLPE--INIPH 301
Query: 195 YAPPQKMQAHQIPPDTPRSIFVYFRGLFY-DVNNDPEGGYYARGARAAVWENFKNNPLFD 253
Q H P R + +F G + DV + + V+EN
Sbjct: 302 ---SQLGFVHTGEPPQNRKLLAFFAGGSHGDVRKILFQHWKEKDKDVLVYENLPKT---- 354
Query: 254 ISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE 313
Y + M +A FCLCP GW SPR+VE++ GC+PVIIAD VLPF+D + W+
Sbjct: 355 ------MNYTKMMDKAKFCLCPSGWEVASPRIVESLYSGCVPVIIADYYVLPFSDVLNWK 408
Query: 314 EIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNG 373
+ + +P + IL +I E L QR + +++ + +P++P D H I++
Sbjct: 409 TFSVHIPISKMPDIKKILEAITEEEYLNMQRRVL--EVRKHFVINRPSKPYDMLHMIMHS 466
Query: 374 L 374
+
Sbjct: 467 I 467
>gi|357119191|ref|XP_003561329.1| PREDICTED: probable glycosyltransferase At5g03795-like
[Brachypodium distachyon]
Length = 569
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 95/347 (27%), Positives = 153/347 (44%), Gaps = 43/347 (12%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
K+YVY+ + + D C F+H + + +RT +P A F+ P
Sbjct: 237 FKIYVYD----EGEPPIYHDGPCHNIYSTEGRFIHAMEMENRMRTTDPGLAHVFFLPFSI 292
Query: 100 TCDLTPTGLPLPFKSPRMMRSA---IQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQE 156
+P + R+ I ++S+ PYWNR++GADHF + HD+G +
Sbjct: 293 AKMEKTIYVPGSHTMEPLRRTVFDYIDVLSTKHPYWNRSQGADHFMLSCHDWGPYVSSVD 352
Query: 157 EKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPD------T 210
I L T +EG I + P+ + D +
Sbjct: 353 GNLFSNSIRVLCNANT-------------SEGFIPSKDVSLPEINHLNDFKKDIGGPSAS 399
Query: 211 PRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN-NPLFDISTDHP--TTYYEDMQ 267
R I +F G N+ P R + +++K +P +S P +Y E M+
Sbjct: 400 GRPILAFFAG----GNHGP--------VRPLLLKHWKGKDPDVQVSEYLPAGVSYVETMR 447
Query: 268 RAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKL 327
R+ FCLCP G+ SPR+ EA+ C+PV+IADD VLPF+D + W + VA D+P +
Sbjct: 448 RSKFCLCPSGFEVASPRVAEAIYVECVPVVIADDYVLPFSDVLSWPAFSLRVAVRDIPDI 507
Query: 328 DTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
IL+++ P +R QR + +++R + Q D FH IL+ +
Sbjct: 508 KRILSAVSPRRYIRMQRRVR--AVRRHFMLNGVPQRYDVFHMILHSI 552
>gi|102139904|gb|ABF70053.1| exostosin family protein [Musa acuminata]
Length = 585
Score = 122 bits (305), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 99/342 (28%), Positives = 162/342 (47%), Gaps = 26/342 (7%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEI-FMHRFLLSSPVRTLNPEEADWFYTPIY 98
LKVY+YE K L P L ++A+E FM ++ +P +A FY P Y
Sbjct: 253 LKVYIYE----DGPKPLCHTPH-LDGIYASEGWFMKLMEENTQFVVKDPNKAHLFYLP-Y 306
Query: 99 PTCDLTPTGLPLPFKSPR----MMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHY 154
+ L +S + +R + I++ +P+WNRT GADHF V HD+ A +
Sbjct: 307 SSRQLRTHLYVAGSRSMQPLSIFLRDYVNSIAAKYPFWNRTRGADHFLVACHDW-ATYTT 365
Query: 155 QEEKAIERGILPLLQRATLVQ-TFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRS 213
+ + + + ++ A + + F + V L E + P P+K + P + RS
Sbjct: 366 NLHEDLRKNTIKVVCNADVSEGVFVRGKDVSLAETYVRTP--NSPRKAIGGR--PASRRS 421
Query: 214 IFVYFRGLFYD-VNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFC 272
I +F G + V + R ++E + +S Y E M+ + FC
Sbjct: 422 ILAFFAGQMHGRVRPILLRHWRGRDRDMRIYEVLPDEIAAKMS------YIEHMKSSKFC 475
Query: 273 LCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILT 332
+CP+G+ SPR+VEA+ + C+PVIIA++ VLPF + + W + VAE+D+PKL IL
Sbjct: 476 ICPMGYEVNSPRIVEAIYYDCVPVIIANNFVLPFEEVLDWGAFSVVVAEKDIPKLKQILL 535
Query: 333 SIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
I +R Q + +++ L+ D FH IL+ +
Sbjct: 536 GISGRRYVRMQTNVRR--LRKHFLWNDKPVKYDLFHMILHSI 575
>gi|359476148|ref|XP_002285239.2| PREDICTED: probable glycosyltransferase At5g11130 [Vitis vinifera]
Length = 472
Score = 121 bits (304), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 92/307 (29%), Positives = 139/307 (45%), Gaps = 32/307 (10%)
Query: 79 SSPVRTLNPEEADWFYTPI-------YPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPY 131
+ P +P+EA FY PI Y T + + ++ I +IS +P+
Sbjct: 176 NCPFSARHPDEAMLFYIPISVVNIIRYVYQPYTNSASYSRVRLQGLVADYIDVISRKYPF 235
Query: 132 WNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSIT 191
WNR+ GADHF V HD+ K R + +L A + F V L E +
Sbjct: 236 WNRSSGADHFMVSCHDWAPEISAANPKFF-RHFIRVLCNANTSEGFKPVRDVSLPEILVP 294
Query: 192 IPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPL 251
PP Q P T RSI +F G A G ++ ++
Sbjct: 295 YRMLGPPYLGQ-----PPTNRSILAFFAGG-------------AHGKVRSILFHYWKEKD 336
Query: 252 FDISTDH--PTT--YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFA 307
DI PTT Y E M R+ FCLCP G+ SPR+VE++ GC+PVII+D+ LPF+
Sbjct: 337 EDIQVHEYLPTTLNYTELMGRSKFCLCPSGFEVASPRVVESIYAGCVPVIISDNYSLPFS 396
Query: 308 DAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAF 367
D + W + + + +P+ TIL +IP + L KQ+ + ++R +PA+ D
Sbjct: 397 DVLDWSQFSVHIPIARIPETKTILQAIPIQEYLTKQKTVMQ--VQRHFTLNRPAKRFDVL 454
Query: 368 HQILNGL 374
H +L+ +
Sbjct: 455 HMVLHSI 461
>gi|30683790|ref|NP_567512.2| Exostosin family protein [Arabidopsis thaliana]
gi|19347795|gb|AAL86348.1| unknown protein [Arabidopsis thaliana]
gi|26983908|gb|AAN86206.1| unknown protein [Arabidopsis thaliana]
gi|332658395|gb|AEE83795.1| Exostosin family protein [Arabidopsis thaliana]
Length = 542
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 102/346 (29%), Positives = 162/346 (46%), Gaps = 33/346 (9%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP-VRTLNPEEADWFYTPIY 98
LKVY+Y P K + +P L ++A+E + + + S+ T NPE A FY P Y
Sbjct: 200 LKVYIY--PD--GDKPIFHEPH-LNGIYASEGWFMKLMESNKQFVTKNPERAHLFYMP-Y 253
Query: 99 PTCDLTPTGL--------PLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGA 150
L + PL +R + ++S +P+WNRT G+DHF V HD+G
Sbjct: 254 SVKQLQKSIFVPGSHNIKPLSI----FLRDYVNMLSIKYPFWNRTHGSDHFLVACHDWGP 309
Query: 151 CFHYQEEKAIERGILPLLQRATLVQ-TFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPD 209
+ E ++R + L A L F V L E SI +++
Sbjct: 310 -YTVNEHPELKRNAIKALCNADLSDGIFVPGKDVSLPETSIRNAGRPLRNIGNGNRV--- 365
Query: 210 TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRA 269
+ R I +F G + P+ + R E+ K + TY + M+ +
Sbjct: 366 SQRPILAFFAGNLHG-RVRPKLLKHWRNKD----EDMKIYGPLPHNVARKMTYVQHMKSS 420
Query: 270 IFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDT 329
+CLCP+G+ SPR+VEA+ + C+PV+IAD+ +LPF+D + W + V E+++P+L
Sbjct: 421 KYCLCPMGYEVNSPRIVEAIYYECVPVVIADNFMLPFSDVLDWSAFSVVVPEKEIPRLKE 480
Query: 330 ILTSIPPEVILRKQRLLANPSM-KRAMLFPQPAQPGDAFHQILNGL 374
IL IP L+ Q +N M +R L+ + D FH IL+ +
Sbjct: 481 ILLEIPMRRYLKMQ---SNVKMVQRHFLWSPKPRKYDVFHMILHSI 523
>gi|296082060|emb|CBI21065.3| unnamed protein product [Vitis vinifera]
Length = 402
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 92/307 (29%), Positives = 139/307 (45%), Gaps = 32/307 (10%)
Query: 79 SSPVRTLNPEEADWFYTPI-------YPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPY 131
+ P +P+EA FY PI Y T + + ++ I +IS +P+
Sbjct: 106 NCPFSARHPDEAMLFYIPISVVNIIRYVYQPYTNSASYSRVRLQGLVADYIDVISRKYPF 165
Query: 132 WNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSIT 191
WNR+ GADHF V HD+ K R + +L A + F V L E +
Sbjct: 166 WNRSSGADHFMVSCHDWAPEISAANPKFF-RHFIRVLCNANTSEGFKPVRDVSLPEILVP 224
Query: 192 IPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPL 251
PP Q P T RSI +F G A G ++ ++
Sbjct: 225 YRMLGPPYLGQ-----PPTNRSILAFFAGG-------------AHGKVRSILFHYWKEKD 266
Query: 252 FDISTDH--PTT--YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFA 307
DI PTT Y E M R+ FCLCP G+ SPR+VE++ GC+PVII+D+ LPF+
Sbjct: 267 EDIQVHEYLPTTLNYTELMGRSKFCLCPSGFEVASPRVVESIYAGCVPVIISDNYSLPFS 326
Query: 308 DAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAF 367
D + W + + + +P+ TIL +IP + L KQ+ + ++R +PA+ D
Sbjct: 327 DVLDWSQFSVHIPIARIPETKTILQAIPIQEYLTKQKTVMQ--VQRHFTLNRPAKRFDVL 384
Query: 368 HQILNGL 374
H +L+ +
Sbjct: 385 HMVLHSI 391
>gi|296083497|emb|CBI23466.3| unnamed protein product [Vitis vinifera]
Length = 585
Score = 121 bits (303), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 91/300 (30%), Positives = 144/300 (48%), Gaps = 26/300 (8%)
Query: 84 TLNPEEADWFYTPIYPTCDLTPTGLPLPFKS-----PRMMRSAIQLISSNWPYWNRTEGA 138
T +P++A FY P Y L T L +P +R + +I++ +P+WNRT G+
Sbjct: 243 TRDPKKAHLFYLP-YSARQLE-TALYVPNSHNIRPLSIFLRDHVNMIAAKYPFWNRTHGS 300
Query: 139 DHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ-TFGQRNHVCLNEGSITIPPYAP 197
DHF V HD+G + E + + R + L A L + F V L E +I P P
Sbjct: 301 DHFLVACHDWGP-YTVNEHQELSRNTIKALCNADLSEGIFVAGKDVSLPETTIR-NPRRP 358
Query: 198 PQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYA-RGARAAVWENFKNNPLFDIST 256
+ + ++ + R I +F G + Y++ + ++ N
Sbjct: 359 LRNVGGRRV---SQRPILAFFAGNMHGRVRPTLLKYWSDKDEDMRIYGPLPNR------I 409
Query: 257 DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG 316
+Y + M+ + FC+CP+G+ SPR+VEA+ + C+PVIIAD+ V P D + W
Sbjct: 410 SRKMSYIQHMKSSRFCICPMGYEVNSPRIVEAIYYECVPVIIADNFVPPLNDVLDWTAFS 469
Query: 317 MFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSM--KRAMLFPQPAQPGDAFHQILNGL 374
+ VAE+D+PKL IL +IP L Q N M K + P+P + D FH IL+ +
Sbjct: 470 VIVAEKDIPKLKEILLAIPLRRYLVMQ---TNVKMVQKHFLWNPKPVR-YDLFHMILHSI 525
>gi|242064274|ref|XP_002453426.1| hypothetical protein SORBIDRAFT_04g005890 [Sorghum bicolor]
gi|241933257|gb|EES06402.1| hypothetical protein SORBIDRAFT_04g005890 [Sorghum bicolor]
Length = 403
Score = 121 bits (303), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 99/350 (28%), Positives = 159/350 (45%), Gaps = 52/350 (14%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
KVY+Y P + K + PR LT +A+E + + + S RT +P++A F+ PI P
Sbjct: 82 FKVYIY--PDG-DPKTFYQTPRKLTGKYASEGYFFQNIRESRFRTDDPDQAHLFFVPISP 138
Query: 100 TCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKA 159
+ G +++ ++ + + +PYWNRT GADHFFV HD G +A
Sbjct: 139 H-KMRGKGTTYE-NMTVIVKDYVEGLINKYPYWNRTLGADHFFVTCHDVGV-------RA 189
Query: 160 IERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY---APPQKMQAHQIPPDTPRSIFV 216
E LP + + ++ VC ++ P+ A PQ +Q +P
Sbjct: 190 FEG--LPFMVKNSI-------RVVCSPSYNVDFIPHKDIALPQVLQPFALPEGGN----- 235
Query: 217 YFRGLFYDVNNDPEGGYYA-------RGARAAVWEN-----FKNNPLFDISTDHPTTYYE 264
D+ N G++A R A VWEN NN + + Y +
Sbjct: 236 -------DIENRTILGFWAGHRNSKIRVILARVWENDTELAISNNRISRAIGE--LVYQK 286
Query: 265 DMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDV 324
R FC+CP G S R+ +++ +GC+PVI++D LPF DA+ W + + + E DV
Sbjct: 287 QFYRTKFCICPGGSQVNSARISDSIHYGCVPVILSDYYDLPFNDALDWRKFAVILRERDV 346
Query: 325 PKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+L IL SI E + + L +++ ++ P DAFH ++ L
Sbjct: 347 YQLKNILKSISQEEFVSLHKSLVQ--VQKHFVWHSPPVSYDAFHMVMYEL 394
>gi|227202794|dbj|BAH56870.1| AT4G38040 [Arabidopsis thaliana]
Length = 407
Score = 121 bits (303), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 98/312 (31%), Positives = 145/312 (46%), Gaps = 50/312 (16%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
+ KVY+Y P + + PR +T +A+E + + + S RTL+P+EAD F+ PI
Sbjct: 103 RFKVYIY--PDG-DPNTFYQTPRKVTGKYASEGYFFQNIRESRFRTLDPDEADLFFIPI- 158
Query: 99 PTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
+C ++++ + + + +PYWNRT GADHFFV HD G +
Sbjct: 159 -SCHKMRGKGTSYENMTVIVQNYVDGLIAKYPYWNRTLGADHFFVTCHDVGV-------R 210
Query: 159 AIERGILPLLQRATLVQTFGQRNHVC---LNEGSITIPPYAPPQKMQAHQIPPDTPRSIF 215
A E PLL + T+ VC N G I A PQ +Q +P
Sbjct: 211 AFEGS--PLLIKNTI-------RVVCSPSYNVGFIPHKDVALPQVLQPFALPAGGN---- 257
Query: 216 VYFRGLFYDVNNDPEGGYYA-------RGARAAVWEN-----FKNNPLFDISTDHPTTYY 263
DV N G++A R A VWEN NN + + +T H Y
Sbjct: 258 --------DVENRTTLGFWAGHRNSKIRVILAHVWENDTELDISNNRI-NRATGH-LVYQ 307
Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
+ R FC+CP G S R+ +++ +GCIPVI++D LPF D + W + + + E+D
Sbjct: 308 KRFYRTKFCICPGGSQVNSARITDSIHYGCIPVILSDYYDLPFNDILNWRKFAVVLREQD 367
Query: 324 VPKLDTILTSIP 335
V L IL +IP
Sbjct: 368 VYNLKQILKNIP 379
>gi|168059393|ref|XP_001781687.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666856|gb|EDQ53500.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 376
Score = 121 bits (303), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 100/373 (26%), Positives = 172/373 (46%), Gaps = 41/373 (10%)
Query: 22 ERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRC-LTH------MFAAE-IFM 73
+R G+ G++L + L + YEL K K + KD R L H ++A+E F+
Sbjct: 4 KRNYGTRGNILTRSALVNL-LRSYELMEKVFKVYIYKDGRKPLVHSGPQLGIYASEGQFI 62
Query: 74 HRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLP-----LPFKSPRMMRSAIQLISSN 128
R +S T +P A F+ P +T +P LP + ++ ++ ++
Sbjct: 63 ERMEAASEFLTDDPSRAHMFFLPYSVYRMVTHLYVPNSRSMLPLAT--FIKDYVEALARQ 120
Query: 129 WPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG 188
+PYWNRT+GADHFFV HD+G ++ + + ++ A L + F L E
Sbjct: 121 YPYWNRTKGADHFFVSCHDWGPATA-RDHPTLRSNAVKVVCNADLTEEFVVGKDASLPEV 179
Query: 189 SITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN 248
+ P K+ P R +F G + R + +++K+
Sbjct: 180 YMHKSKTKAPIKLGG---PGYDERPYLAFFAGQMHG------------RVRPILLDHWKD 224
Query: 249 -NPLFDISTDHPT------TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 301
+P I P +Y + M+ + +C+C G+ SPR+VE++ + C+PVIIAD+
Sbjct: 225 KDPDLMIYGVLPKPIAKQISYVQHMKMSKYCICAAGYEVNSPRIVESIHYDCVPVIIADN 284
Query: 302 IVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPA 361
VLPF+D + W+ + + E D+PKL IL IP + Q L +++ ++ +
Sbjct: 285 FVLPFSDVLNWDAFSVTMPESDIPKLKAILNDIPEKTYRSMQIRLRK--IRQHFVWHKKP 342
Query: 362 QPGDAFHQILNGL 374
+ D FH IL+ +
Sbjct: 343 EKYDVFHMILHSV 355
>gi|125581086|gb|EAZ22017.1| hypothetical protein OsJ_05674 [Oryza sativa Japonica Group]
Length = 408
Score = 121 bits (303), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 100/354 (28%), Positives = 160/354 (45%), Gaps = 54/354 (15%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
KVY+Y P + K + PR LT +A+E + + + S RT +P++A F+ PI P
Sbjct: 81 FKVYMY--PDG-DPKTFYQTPRKLTGKYASEGYFFQNIRESRFRTGDPDKAHLFFVPISP 137
Query: 100 TCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKA 159
+ G +++ ++ + + +PYWNRT GADHFFV HD G +A
Sbjct: 138 H-KMRGKGTSYE-NMTIIVKDYVEGLINKYPYWNRTLGADHFFVTCHDVGV-------RA 188
Query: 160 IERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY---APPQKMQAHQIPPDTPRSIFV 216
E LP + + ++ VC ++ P+ A PQ +Q +P
Sbjct: 189 FEG--LPFMVKNSI-------RVVCSPSYNVDFIPHKDIALPQVLQPFALPEGGN----- 234
Query: 217 YFRGLFYDVNNDPEGGYYA-------RGARAAVWEN-----FKNNPLFDISTDHPTTYYE 264
DV N G++A R A VWEN NN + + Y +
Sbjct: 235 -------DVENRTILGFWAGHRNSKIRVILAQVWENDTELAISNNRISRAIGE--LVYQK 285
Query: 265 DMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDV 324
FC+CP G S R+ +++ +GC+PVI++D LPF D + W + + + E DV
Sbjct: 286 QFYHTKFCICPGGSQVNSARISDSIHYGCVPVILSDYYDLPFNDILDWRKFAVVLKERDV 345
Query: 325 PKLDTILTSIPPEVILRKQRLLAN----PSMKRAMLFPQPAQPGDAFHQILNGL 374
+L +IL SI E + + L P +++ ++ P P DAFH ++ L
Sbjct: 346 YQLKSILKSISQEEFVELHKSLVQNISCPQVQKHFVWHSPPLPYDAFHMVMYEL 399
>gi|449443279|ref|XP_004139407.1| PREDICTED: probable glycosyltransferase At3g42180-like [Cucumis
sativus]
gi|449494024|ref|XP_004159425.1| PREDICTED: probable glycosyltransferase At3g42180-like [Cucumis
sativus]
Length = 447
Score = 120 bits (302), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 88/303 (29%), Positives = 140/303 (46%), Gaps = 26/303 (8%)
Query: 80 SPVRTLNPEEADWFYTPIYPTCDL----TPTGLPLPFKSPRMMRSA---IQLISSNWPYW 132
SP R +P+EA F P+ T + P P + RM R I+++++ +PYW
Sbjct: 152 SPFRASHPDEAHVFLLPLSITNIIHFIYRPITSPADYNRDRMHRVTTDYIRVVANRYPYW 211
Query: 133 NRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITI 192
NR+ GADHF V HD+ + + + ++ A + + F + L E +I
Sbjct: 212 NRSNGADHFVVSCHDWAPEIS-DANPQLFKNFIRVVCNANITEGFRPNIDIPLPEINIHP 270
Query: 193 PPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGY-YARGARAAVWENFKNNPL 251
PP Q PP+ R I +F G G + Y R W+ N
Sbjct: 271 GTLGPPDLGQ----PPER-RPILAFFAG----------GAHGYIRKILIKHWKEKDNEVQ 315
Query: 252 FDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
Y + + + FCLCP G+ SPR+VEA+ GC+PVII+D+ LPF+D +
Sbjct: 316 VHEYLPKTQNYTKLIGESKFCLCPSGYEVASPRVVEAIYGGCVPVIISDNYSLPFSDVLD 375
Query: 312 WEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQIL 371
W + + + +P++ TIL +I E L+ + + +KR +PA+P D H +L
Sbjct: 376 WSRFSVQIPVQRIPEIKTILKAISEEKYLKLYKGVIK--VKRHFKINRPAKPFDVIHMLL 433
Query: 372 NGL 374
+ L
Sbjct: 434 HSL 436
>gi|296088784|emb|CBI38234.3| unnamed protein product [Vitis vinifera]
Length = 326
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 103/354 (29%), Positives = 161/354 (45%), Gaps = 58/354 (16%)
Query: 39 KLKVYVY--ELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP 96
K KV+VY P+ Y + PR LT +A+E + + + S RT +P++A F+ P
Sbjct: 4 KFKVFVYPDGDPNTY-----YQTPRKLTGKYASEGYFFQNIRQSQFRTDDPDQAHLFFIP 58
Query: 97 IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQE 156
I +C ++++ +Q + S +PYWNRT GADHFF+ HD G
Sbjct: 59 I--SCHKMRGKGTSYENMTIIVQNYVQSLMSKYPYWNRTLGADHFFLTCHDVGV------ 110
Query: 157 EKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY---APPQKMQAHQIPPDTPRS 213
+A E G+ L++ + V VC + P+ A PQ +Q +P
Sbjct: 111 -RATE-GVPLLVKNSIRV--------VCSPSYDVGFIPHKDVALPQVLQPFALPTGGR-- 158
Query: 214 IFVYFRGLFYDVNNDPEGGYYA-------RGARAAVWEN-----FKNNPLFDISTDHPTT 261
D+ N G++A R A +WEN KNN + + +T H
Sbjct: 159 ----------DIKNRTTLGFWAGHRNSKIRVILARIWENDTELDIKNNRI-NRATGH-LV 206
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y R FC+CP G S R+ +++ +GC+PVI++D LPF D + W + + + E
Sbjct: 207 YQNKFYRTKFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDILDWRKFSVILKE 266
Query: 322 EDVPKLDTILTSIP-PEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
DV +L IL IP E I L+ +++ + P DAFH ++ L
Sbjct: 267 RDVYRLKYILKDIPDAEFIALHDNLV---KVQKHFQWNTPPIKYDAFHMVMYEL 317
>gi|356518346|ref|XP_003527840.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 633
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 98/345 (28%), Positives = 158/345 (45%), Gaps = 37/345 (10%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEI-FMHRFLLSSPVRTLNPEEADWFYTPIY 98
LKVYVY + K ++ P L+ ++A+E FM S T +P+ A FY P
Sbjct: 305 LKVYVY----REGDKAIMHSP-ILSGLYASEGWFMKHMEASKQFVTTDPKNAHLFYLPF- 358
Query: 99 PTCDLTPTGLPLPFKS----PRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHY 154
+ L +P S + + + + +I+ +WNRT GADHF V HD+
Sbjct: 359 -SSQRLVDALWVPKSSYGNLIQYLSNYVDMIAGKHHFWNRTGGADHFLVACHDWAPA--- 414
Query: 155 QEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSI 214
E K L L A + Q F + L E ++ P P + + +Q+ + R
Sbjct: 415 -ETKQHMAKCLRALCNADVKQGFVFGKDMSLPE-TVVRSPRNPTRSIGGNQV---SKRKT 469
Query: 215 FVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT-----YYEDMQRA 269
+F G + Y R WEN +P I P + Y + M+ +
Sbjct: 470 LAFFAGQMHG---------YVRPILLQHWEN--KDPDMKIFGRLPKSKGNRNYIQYMKSS 518
Query: 270 IFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDT 329
+C+C G+ SPR+VEA+++ C+PVI++D+ V PF + + WE +FV E+D+P L
Sbjct: 519 KYCICAKGYEVNSPRVVEAILYECVPVILSDNFVPPFFEMLNWESFAVFVLEKDIPNLKN 578
Query: 330 ILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
IL SIP + L+ Q ++ + + + D FH IL+ +
Sbjct: 579 ILLSIPRKRYLQMQ-MMVRKVQQHFLWHNKSPVKYDIFHMILHSI 622
>gi|359481952|ref|XP_002284018.2| PREDICTED: probable glycosyltransferase At5g03795 [Vitis vinifera]
Length = 546
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 93/347 (26%), Positives = 152/347 (43%), Gaps = 40/347 (11%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP-- 96
+LKVYVY+ + L + C + F+H+ + S RT +PE+A F+ P
Sbjct: 213 QLKVYVYD----EGEPPLFHNGPCKSIYSMEGNFIHKMEMDSHFRTKDPEKAHLFFLPFS 268
Query: 97 ---------IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHD 147
+ + DL P + + + ++S+ +PYWNR+ GADHF + HD
Sbjct: 269 VAMLVRFVYVRDSHDLGPI--------KQTVIDYVNVVSTKYPYWNRSLGADHFMLACHD 320
Query: 148 FGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIP 207
+G + + + + +L A + F V E I + + + P
Sbjct: 321 WGPETSFSI-PYLHKNSIRVLCNANTSEGFNPSKDVSFPE--INLLTGSTDSFIGG---P 374
Query: 208 PDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQ 267
+ R++ +F G + R WEN + +YYE M+
Sbjct: 375 SPSHRTLLAFFAGGLHGP---------IRPILLEHWENKDEDVKVHKYLPKGVSYYEMMR 425
Query: 268 RAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKL 327
++ +CLCP G+ SPR+VEA+ GC+PV+I+D V PF+D + W+ + V ++P L
Sbjct: 426 KSKYCLCPSGYEVASPRVVEALYTGCVPVLISDHYVPPFSDVLNWKSFSVEVPVREIPNL 485
Query: 328 DTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
IL I P +R QR +R P + D FH IL+ L
Sbjct: 486 KRILMDISPRQYIRMQR--RGIQARRHFEVNSPPKRYDVFHMILHSL 530
>gi|297807149|ref|XP_002871458.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297317295|gb|EFH47717.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 336
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 88/303 (29%), Positives = 147/303 (48%), Gaps = 27/303 (8%)
Query: 79 SSPVRTLNPEEADWFYTPIYPTCDLTPTGLPL-PFKSPRM---MRSAIQLISSNWPYWNR 134
+S + +PEEA FY P+ + P + R+ ++ I LIS+ +PYWNR
Sbjct: 41 NSRYKAASPEEATVFYIPVGIVNIIRFVYRPYTSYARDRLQNIVKDYISLISNRYPYWNR 100
Query: 135 TEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPP 194
+ GADHFF+ HD+ + + + R + L A + F V L E +I P
Sbjct: 101 SRGADHFFLSCHDWAPDVSAVDPE-LYRHFIRALCNANASEGFTPMRDVSLPEINI---P 156
Query: 195 YAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDI 254
++ + + P + R + +F GG + R ++E +K +
Sbjct: 157 HSQLGFVHTGEAPQN--RKLLAFF-----------AGGSHGE-VRKILFEQWKEKDKDVL 202
Query: 255 STDH-PTT--YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
++ P T Y + M +A FCLCP GW SPR+VE++ GC+PVIIAD VLPF+D +
Sbjct: 203 VYEYLPKTMNYTKMMDKAKFCLCPSGWEVASPRIVESLYSGCVPVIIADSYVLPFSDVLN 262
Query: 312 WEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQIL 371
W+ + + +P + IL +I E L QR + +++ + +P++P D H I+
Sbjct: 263 WKTFSVHIPISKMPDIKKILEAISEEEYLEMQRRVL--EVRKHFVINRPSKPYDMLHMIM 320
Query: 372 NGL 374
+ +
Sbjct: 321 HSI 323
>gi|242056743|ref|XP_002457517.1| hypothetical protein SORBIDRAFT_03g008610 [Sorghum bicolor]
gi|241929492|gb|EES02637.1| hypothetical protein SORBIDRAFT_03g008610 [Sorghum bicolor]
Length = 521
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 82/258 (31%), Positives = 128/258 (49%), Gaps = 13/258 (5%)
Query: 118 MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ-T 176
+++ I LIS+ +PYWNRT+GADHFFV HD+G + + + + + L A L +
Sbjct: 267 VKNYIDLISAKYPYWNRTKGADHFFVACHDWGP-YTTKLHDELRKNTIKALCNADLSEGI 325
Query: 177 FGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYAR 236
F + V L E + +P + ++ P R+I +F G + Y+
Sbjct: 326 FIRGKDVSLPETFLR----SPRRPLRDIGGRPAAQRTILAFFAGQMHGRVRPVLLKYW-- 379
Query: 237 GARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPV 296
G + A + P Y + M+ + +C+CP+G+ SPR+VEA+ + C+PV
Sbjct: 380 GDKDADMRIYSRLPH---RITRKRNYVQHMKSSKYCICPMGYEVNSPRIVEAIYYECVPV 436
Query: 297 IIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAML 356
IIAD+ VLPF A+ W + V E DVPKL IL +IP + Q + K +
Sbjct: 437 IIADNFVLPFDAALNWSAFSVVVPESDVPKLKEILLAIPESRYITLQSNVKR-VQKHFLW 495
Query: 357 FPQPAQPGDAFHQILNGL 374
P P + D FH IL+ +
Sbjct: 496 HPNPVK-YDIFHMILHSV 512
>gi|449518133|ref|XP_004166098.1| PREDICTED: probable glycosyltransferase At5g20260-like, partial
[Cucumis sativus]
Length = 362
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 88/301 (29%), Positives = 142/301 (47%), Gaps = 22/301 (7%)
Query: 79 SSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLP-FKSPRMMR---SAIQLISSNWPYWNR 134
SP PEEA F+ PI + P+ + R++R ++++++ +PYWNR
Sbjct: 68 KSPFSAHEPEEAQVFFLPISIVYIVDYIYKPITTYARDRLVRIFTDYVRVVANKYPYWNR 127
Query: 135 TEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPP 194
T GADHF V HD+ +E+ + + + +L A + F L E I +PP
Sbjct: 128 TRGADHFMVSCHDWAPEV-TKEDPNLFKYFIRVLCNANTSEGFNPMRDASLPE--INLPP 184
Query: 195 YAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGY-YARGARAAVWENFKNNPLFD 253
+ Q P + RSI +F G G + + R W++ +
Sbjct: 185 TFHLNLPRLGQPPQN--RSILAFFAG----------GAHGFIRHILMQHWKDKDHEIQVH 232
Query: 254 ISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE 313
Y E + R+ FCLCP G+ SPRLVEA+ GC+PV+I+D LPF D + W
Sbjct: 233 EYLPPSQNYTELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVISDYYSLPFDDVLDWS 292
Query: 314 EIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNG 373
+ M + E +P++ TIL + + L+ QR + ++R +PA+ D FH +L+
Sbjct: 293 KFSMRIPSERIPEIKTILRGVSMKKYLKLQRGVMK--VQRHFEIHRPAKAFDMFHMVLHS 350
Query: 374 L 374
+
Sbjct: 351 V 351
>gi|449465629|ref|XP_004150530.1| PREDICTED: probable glycosyltransferase At5g20260-like [Cucumis
sativus]
Length = 299
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 88/301 (29%), Positives = 142/301 (47%), Gaps = 22/301 (7%)
Query: 79 SSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLP-FKSPRMMR---SAIQLISSNWPYWNR 134
SP PEEA F+ PI + P+ + R++R ++++++ +PYWNR
Sbjct: 5 KSPFSAHEPEEAQVFFLPISIVYIVDYIYKPITTYARDRLVRIFTDYVRVVANKYPYWNR 64
Query: 135 TEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPP 194
T GADHF V HD+ +E+ + + + +L A + F L E I +PP
Sbjct: 65 TRGADHFMVSCHDWAPEV-TKEDPNLFKYFIRVLCNANTSEGFNPMRDASLPE--INLPP 121
Query: 195 YAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGY-YARGARAAVWENFKNNPLFD 253
+ Q P + RSI +F G G + + R W++ +
Sbjct: 122 TFHLNLPRLGQPPQN--RSILAFFAG----------GAHGFIRHILMQHWKDKDHEIQVH 169
Query: 254 ISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE 313
Y E + R+ FCLCP G+ SPRLVEA+ GC+PV+I+D LPF D + W
Sbjct: 170 EYLPPSQNYTELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVISDYYSLPFDDVLDWS 229
Query: 314 EIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNG 373
+ M + E +P++ TIL + + L+ QR + ++R +PA+ D FH +L+
Sbjct: 230 KFSMRIPSERIPEIKTILRGVSMKKYLKLQRGVMK--VQRHFEIHRPAKAFDMFHMVLHS 287
Query: 374 L 374
+
Sbjct: 288 V 288
>gi|225464075|ref|XP_002268836.1| PREDICTED: probable glycosyltransferase At5g03795-like [Vitis
vinifera]
Length = 417
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 103/354 (29%), Positives = 161/354 (45%), Gaps = 58/354 (16%)
Query: 39 KLKVYVY--ELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP 96
K KV+VY P+ Y + PR LT +A+E + + + S RT +P++A F+ P
Sbjct: 95 KFKVFVYPDGDPNTY-----YQTPRKLTGKYASEGYFFQNIRQSQFRTDDPDQAHLFFIP 149
Query: 97 IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQE 156
I +C ++++ +Q + S +PYWNRT GADHFF+ HD G
Sbjct: 150 I--SCHKMRGKGTSYENMTIIVQNYVQSLMSKYPYWNRTLGADHFFLTCHDVGV------ 201
Query: 157 EKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY---APPQKMQAHQIPPDTPRS 213
+A E G+ L++ + V VC + P+ A PQ +Q +P
Sbjct: 202 -RATE-GVPLLVKNSIRV--------VCSPSYDVGFIPHKDVALPQVLQPFALPTGGR-- 249
Query: 214 IFVYFRGLFYDVNNDPEGGYYA-------RGARAAVWEN-----FKNNPLFDISTDHPTT 261
D+ N G++A R A +WEN KNN + + +T H
Sbjct: 250 ----------DIKNRTTLGFWAGHRNSKIRVILARIWENDTELDIKNNRI-NRATGH-LV 297
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y R FC+CP G S R+ +++ +GC+PVI++D LPF D + W + + + E
Sbjct: 298 YQNKFYRTKFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDILDWRKFSVILKE 357
Query: 322 EDVPKLDTILTSIP-PEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
DV +L IL IP E I L+ +++ + P DAFH ++ L
Sbjct: 358 RDVYRLKYILKDIPDAEFIALHDNLV---KVQKHFQWNTPPIKYDAFHMVMYEL 408
>gi|226498050|ref|NP_001147269.1| exostosin-like [Zea mays]
gi|195609318|gb|ACG26489.1| exostosin-like [Zea mays]
gi|413926327|gb|AFW66259.1| exostosin-like protein [Zea mays]
Length = 403
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 98/351 (27%), Positives = 161/351 (45%), Gaps = 54/351 (15%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
KVY+Y P + K + PR LT +A+E + + + S RT +P++A F+ PI P
Sbjct: 82 FKVYIY--PDG-DPKTFYQTPRKLTGKYASEGYFFQNIRESRFRTDDPDQAHLFFVPISP 138
Query: 100 TCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKA 159
+ G +++ ++ + + +PYWNRT GADHFFV HD G +A
Sbjct: 139 H-KMRGKGTSYE-NMTVIVKDYVEGLINKYPYWNRTLGADHFFVTCHDVGV-------RA 189
Query: 160 IERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY---APPQKMQAHQIPPDTPRSIFV 216
E G+ +++ + V VC ++ P+ A PQ +Q +
Sbjct: 190 FE-GLKFMVKNSIRV--------VCSPSYNVDFIPHKDIALPQVLQPFALHEGGN----- 235
Query: 217 YFRGLFYDVNNDPEGGYYA-------RGARAAVWENFKNNPLFDISTDHPT------TYY 263
D++N G++A R A VWEN + IS + + Y
Sbjct: 236 -------DIDNRVILGFWAGHRNSKIRVILARVWEN---DTELAISNNRISRAIGELVYQ 285
Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
+ R FC+CP G S R+ +++ +GC+PVI++D LPF DA+ W + + + E D
Sbjct: 286 KQFYRTKFCICPGGSQVNSARISDSIHYGCVPVILSDYYDLPFNDALDWRKFAVILRERD 345
Query: 324 VPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
V +L IL SI E + + L +++ ++ P DAFH ++ L
Sbjct: 346 VYQLKNILKSISQEEFISLHKSLVQ--VQKHFVWHSPPVSYDAFHMVMYEL 394
>gi|297740031|emb|CBI30213.3| unnamed protein product [Vitis vinifera]
Length = 337
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 93/347 (26%), Positives = 152/347 (43%), Gaps = 40/347 (11%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP-- 96
+LKVYVY+ + L + C + F+H+ + S RT +PE+A F+ P
Sbjct: 4 QLKVYVYD----EGEPPLFHNGPCKSIYSMEGNFIHKMEMDSHFRTKDPEKAHLFFLPFS 59
Query: 97 ---------IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHD 147
+ + DL P + + + ++S+ +PYWNR+ GADHF + HD
Sbjct: 60 VAMLVRFVYVRDSHDLGPI--------KQTVIDYVNVVSTKYPYWNRSLGADHFMLACHD 111
Query: 148 FGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIP 207
+G + + + + +L A + F V E I + + + P
Sbjct: 112 WGPETSFSI-PYLHKNSIRVLCNANTSEGFNPSKDVSFPE--INLLTGSTDSFIGG---P 165
Query: 208 PDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQ 267
+ R++ +F G + R WEN + +YYE M+
Sbjct: 166 SPSHRTLLAFFAGGLHGP---------IRPILLEHWENKDEDVKVHKYLPKGVSYYEMMR 216
Query: 268 RAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKL 327
++ +CLCP G+ SPR+VEA+ GC+PV+I+D V PF+D + W+ + V ++P L
Sbjct: 217 KSKYCLCPSGYEVASPRVVEALYTGCVPVLISDHYVPPFSDVLNWKSFSVEVPVREIPNL 276
Query: 328 DTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
IL I P +R QR +R P + D FH IL+ L
Sbjct: 277 KRILMDISPRQYIRMQR--RGIQARRHFEVNSPPKRYDVFHMILHSL 321
>gi|115444695|ref|NP_001046127.1| Os02g0187200 [Oryza sativa Japonica Group]
gi|46390046|dbj|BAD15422.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
gi|46390077|dbj|BAD15452.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
gi|113535658|dbj|BAF08041.1| Os02g0187200 [Oryza sativa Japonica Group]
gi|215694561|dbj|BAG89554.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 402
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 99/350 (28%), Positives = 159/350 (45%), Gaps = 52/350 (14%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
KVY+Y P + K + PR LT +A+E + + + S RT +P++A F+ PI P
Sbjct: 81 FKVYMY--PDG-DPKTFYQTPRKLTGKYASEGYFFQNIRESRFRTGDPDKAHLFFVPISP 137
Query: 100 TCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKA 159
+ G +++ ++ + + +PYWNRT GADHFFV HD G +A
Sbjct: 138 H-KMRGKGTSYE-NMTIIVKDYVEGLINKYPYWNRTLGADHFFVTCHDVGV-------RA 188
Query: 160 IERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY---APPQKMQAHQIPPDTPRSIFV 216
E LP + + ++ VC ++ P+ A PQ +Q +P
Sbjct: 189 FEG--LPFMVKNSI-------RVVCSPSYNVDFIPHKDIALPQVLQPFALPEGGN----- 234
Query: 217 YFRGLFYDVNNDPEGGYYA-------RGARAAVWEN-----FKNNPLFDISTDHPTTYYE 264
DV N G++A R A VWEN NN + + Y +
Sbjct: 235 -------DVENRTILGFWAGHRNSKIRVILAQVWENDTELAISNNRISRAIGE--LVYQK 285
Query: 265 DMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDV 324
FC+CP G S R+ +++ +GC+PVI++D LPF D + W + + + E DV
Sbjct: 286 QFYHTKFCICPGGSQVNSARISDSIHYGCVPVILSDYYDLPFNDILDWRKFAVVLKERDV 345
Query: 325 PKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+L +IL SI E + + L +++ ++ P P DAFH ++ L
Sbjct: 346 YQLKSILKSISQEEFVELHKSLVQ--VQKHFVWHSPPLPYDAFHMVMYEL 393
>gi|159478873|ref|XP_001697525.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158274404|gb|EDP00187.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 597
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 109/388 (28%), Positives = 157/388 (40%), Gaps = 88/388 (22%)
Query: 42 VYVYELPSKYNKKLLLK--DPRCLTH----------------MFAAEIFMHRFLLSSPVR 83
+YVY+L Y K+L PR H ++A + +H LL SP R
Sbjct: 209 IYVYDLEPMYQSKILQYRVTPRWCVHRWHVWPANHTEWTDLWVYAMDTLLHESLLGSPHR 268
Query: 84 TLNPEEADWFYTPIYPTCDLTPTGLPLPF------KSPR------MMRSAIQLISSNWPY 131
T +PEEAD+FY P +C P G F PR ++ +Q I++ +P+
Sbjct: 269 TFDPEEADFFYVPHQASCLPFPIGAWADFPWFPDAGGPRTRQMLNLVIDTVQWINATFPF 328
Query: 132 WNRTEGADHFFVVPHDFGACF------------HYQEEK------------AIERGILPL 167
W R G DH F HD GAC+ H+ + +R +
Sbjct: 329 WQRRGGRDHIFTFTHDEGACWAPNIVNNSIWLTHWGRTELNHTSNTAYLLDKYDRDTPTI 388
Query: 168 LQRATLVQTFGQRNHVCLN-EGSITIPPYAPPQKMQAHQI--PPDTPRSIFVYF------ 218
LQ V F + H C N E + IP + P + + P R + +F
Sbjct: 389 LQPDGFVHLF--KGHPCYNPEKDLVIPAFKAPGHYASSGLVGAPTRERDLLFFFSPPVSA 446
Query: 219 -----------RGLFYDVNNDPEGGYYARGARAAVWENFK------NNPLFDISTDHPTT 261
RG DV + Y+RG R A++ K + + D
Sbjct: 447 PGAASCPHASTRG---DVGKRRQPN-YSRGVRQAIYRAAKAGDWAAKHKFYIGGHDDVKG 502
Query: 262 YYEDM-QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
Y DM RA FCL G WS R+ +AV+ GCIPVIIAD + F + + G+ +
Sbjct: 503 EYSDMLSRAKFCLVAPG-DGWSARMEDAVLHGCIPVIIADGVHAVFESILDIDGFGLRIP 561
Query: 321 EEDVPKLDTILTSIPPEVILRKQRLLAN 348
+E VP++ IL ++PP I KQ L
Sbjct: 562 QEQVPRILDILLAVPPRAIRSKQAHLGR 589
>gi|225429942|ref|XP_002281263.1| PREDICTED: probable glycosyltransferase At5g03795 [Vitis vinifera]
Length = 675
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 106/396 (26%), Positives = 173/396 (43%), Gaps = 51/396 (12%)
Query: 2 WVTAI--ALLLAFVSADNTPKIERISGSAGDVLEDDPVGK---------LKVYVYELPSK 50
W +A+ LL A +N P I+ G + + V K LKVY Y +
Sbjct: 291 WSSAVDKELLYAKSQIENAPIIKNDPGLHASLYRNVSVFKRSYELMENTLKVYTY----R 346
Query: 51 YNKKLLLKDPRCLTHMFAAEI-FMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLP 109
++ + P + ++A+E FM + T N +A FY P + + L
Sbjct: 347 EGERPVFHQP-PIKGIYASEGWFMKLMQANKKFVTKNGRKAHLFYLPF--SSLMLEEALY 403
Query: 110 LPFKSPRM-----MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGI 164
+P R +++ + +I + +P+WNRT GADHF V HD+ E +
Sbjct: 404 VPNSHSRKNLEQYLKNYLDMIGAKYPFWNRTGGADHFLVACHDWAPS----ETLKLMANS 459
Query: 165 LPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYD 224
+ L + + + F V L E + IP P +++ P + R I +F G +
Sbjct: 460 IRALCNSDIREGFKLGKDVSLPETCVRIPQ-NPLRQLGG---KPPSQRRILAFFAGSMHG 515
Query: 225 VNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPT------TYYEDMQRAIFCLCPLGW 278
Y R WEN +P I P Y + M+ + +C+C G+
Sbjct: 516 ---------YVRPILLKYWEN--KDPDMKIYGRMPKAKKGTMNYIQHMKSSKYCICAKGY 564
Query: 279 APWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEV 338
SPR+VEA+ + C+PVII+D+ V PF + WE +F+ E+D+P L +IL SIP +
Sbjct: 565 EVNSPRVVEAIFYECVPVIISDNFVPPFFGVLNWESFAVFILEKDIPNLKSILLSIPEKS 624
Query: 339 ILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
L Q + +++ L+ D FH IL+ +
Sbjct: 625 YLEIQMRVKQ--VQQHFLWHAKPVKYDVFHMILHSV 658
>gi|356505300|ref|XP_003521429.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 534
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 93/353 (26%), Positives = 150/353 (42%), Gaps = 52/353 (14%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP-- 96
+ KV+VYE + + + C + F+H ++ RT +PE+A F+ P
Sbjct: 204 QFKVFVYE----EGEPPVFHNGPCKSIYSMEGNFIHAIEMNDQFRTRDPEKAHVFFLPFS 259
Query: 97 ---------IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHD 147
+ + D P + + + +I+ +PYWNR+ GADHF++ HD
Sbjct: 260 VAMLVQFVYVRDSHDFGPI--------KKTVTDYVNVIAGRYPYWNRSLGADHFYLACHD 311
Query: 148 FGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQI- 206
+G E I +L A + F V P +Q I
Sbjct: 312 WGPETSRSIPNLNENSIR-VLCNANTSEGFKPSKDVSF-----------PEINLQTGSIN 359
Query: 207 -----PPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT 261
P + R + +F G + R WEN + +
Sbjct: 360 GFIGGPSASGRPLLAFFAGGLHGP---------IRPVLLEHWENRDEDIQVHKYLPKGVS 410
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
YYE ++++ FCLCP G+ SPR+VEA+ GC+PV+I+D V PF D + W+ + V+
Sbjct: 411 YYEMLRKSRFCLCPSGYEVASPRVVEAIYTGCVPVLISDHYVPPFNDVLNWKSFSVEVSV 470
Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+D+P+L IL SI P +R QR + ++R P + D FH IL+ +
Sbjct: 471 KDIPRLKEILLSISPRHYIRMQRRVG--LVRRHFEVHSPPKRYDVFHMILHSV 521
>gi|358343604|ref|XP_003635890.1| Exostosin-like protein [Medicago truncatula]
gi|358344094|ref|XP_003636128.1| Exostosin-like protein [Medicago truncatula]
gi|355501825|gb|AES83028.1| Exostosin-like protein [Medicago truncatula]
gi|355502063|gb|AES83266.1| Exostosin-like protein [Medicago truncatula]
Length = 435
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 101/346 (29%), Positives = 156/346 (45%), Gaps = 58/346 (16%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
K KVY+Y P + K + PR LT +A+E + + + S RTL+P++A F+ PI
Sbjct: 129 KFKVYIY--PDG-DSKTFYQTPRKLTGKYASEGYFFQNIRESRFRTLDPDQAHLFFIPI- 184
Query: 99 PTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
+C ++++ ++ + S +PYWNRT GADHFFV HD G +
Sbjct: 185 -SCHKMRGKGTSYDNMTIIVQNYVESLISKYPYWNRTLGADHFFVTCHDVGV-------R 236
Query: 159 AIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY---APPQKMQAHQIPPDTPRSIF 215
A E LPLL + ++ VC + P+ A PQ +Q +P
Sbjct: 237 ATEG--LPLLVKNSI-------RAVCSPSYDVGFIPHKDVALPQVLQPFALPAGGN---- 283
Query: 216 VYFRGLFYDVNNDPEGGYYA-------RGARAAVWENFKNNPLFDISTDHPTTYYEDMQR 268
DV N G++A R A VWEN + DIS + T
Sbjct: 284 --------DVENRTSLGFWAGHRNSKIRVILARVWEN---DTELDISNNRIYTK------ 326
Query: 269 AIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLD 328
FC+CP G S R+ +++ +GCIPVI+++ LPF D + W + + E DV +L
Sbjct: 327 --FCICPGGSQVNSARIADSIHYGCIPVILSNYYDLPFNDILDWRKFAVVHNESDVYQLK 384
Query: 329 TILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
IL + + + L N +++ + P DAFH ++ L
Sbjct: 385 QILKNKSQDEFI----ALHNNLIQKHFQWNSPPVRYDAFHMVMYDL 426
>gi|307107123|gb|EFN55367.1| hypothetical protein CHLNCDRAFT_52572 [Chlorella variabilis]
Length = 815
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 111/424 (26%), Positives = 178/424 (41%), Gaps = 107/424 (25%)
Query: 42 VYVYELPSKYNKKLL---LKDPRCLTHMFA--------------AEIFMHRFLLSSPVRT 84
+YVYELP +N+ LL + C+ +F AE +H LL S RT
Sbjct: 375 IYVYELPPIFNQVLLQYRVDHGSCVHRLFTDGNGTNWEDSGGYLAETGLHEALLQSKHRT 434
Query: 85 LNPEEADWFYTPIYPTCDLTPT-------------GLPLPFKSPRMMRSAIQLISSNWPY 131
L+PEEAD+FY P+Y +C + P +P + M+ + ++ PY
Sbjct: 435 LDPEEADYFYIPVYSSCYMYPIHGFADTPFFHAFHKIPRVHATTNMLIEVYHWLRAHHPY 494
Query: 132 WNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQR-ATLVQTFGQ----------- 179
W+R+ G DH + HD G+C+ LP + R AT++ +G+
Sbjct: 495 WDRSGGRDHIILQSHDEGSCW------------LPAVLRPATMLTHWGRMDLGHTSSTGY 542
Query: 180 ---------RNHVCLNEGS---------------ITIPPYAPPQKMQAHQIPP--DTPRS 213
R+ + + EG+ + +PP P K + + R+
Sbjct: 543 IDDVYSRPARHPIYMPEGTEGKLGDFPCYDPAKDLVVPPMTSPLKYELSPLVGAFTRNRT 602
Query: 214 IFVYFRGLFYDVNNDPEGGYYARGARAAV---------WENFK-----NNPLFDISTDHP 259
+F+G NN P Y+RG R + W FK NP D
Sbjct: 603 TLAFFKGRTQQ-NNKP----YSRGIRQTLENLCRDKDWWGKFKIWIGEGNP-----PDMD 652
Query: 260 TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV 319
TY + + + FC G +SPR +AV GC+PVII D++ L F I + + + +
Sbjct: 653 RTYSQLLASSTFCFVLPG-DGFSPRFEDAVQHGCLPVIIQDEVHLAFESIIDYRKFVVRI 711
Query: 320 AEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
++D+ ++ IL +IPPE + Q+ LA K + +P + +L G RK+P
Sbjct: 712 QQKDMERVPEILGAIPPEKVQTMQKALATVWRKWSYTGYRPY--ANVTLDLLEGYRRKVP 769
Query: 380 HDKS 383
D+
Sbjct: 770 EDQQ 773
>gi|224142559|ref|XP_002324623.1| predicted protein [Populus trichocarpa]
gi|222866057|gb|EEF03188.1| predicted protein [Populus trichocarpa]
Length = 337
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 84/302 (27%), Positives = 134/302 (44%), Gaps = 24/302 (7%)
Query: 80 SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLP----FKSPRMMRSA---IQLISSNWPYW 132
SP R +P+EA F+ P+ + LP+ + R+ R + +++ +PYW
Sbjct: 42 SPFRAQDPDEAHVFFLPVSVASIVHFIYLPITAAADYSRDRLRRVVTDYVHIVAKKYPYW 101
Query: 133 NRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITI 192
NR+ GADHF V HD+ + + I +L A + F V L E +
Sbjct: 102 NRSNGADHFMVSCHDWAPDVSIANSELFNKFIR-VLCNANISIGFRPPRDVLLPEIYL-- 158
Query: 193 PPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLF 252
P++ Q P + P I +F G Y R W+N N
Sbjct: 159 -PFSGLGTTHMGQAPNNRP--ILAFFEG---------RAHGYIRQVLFKHWKNKDNEVQV 206
Query: 253 DISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 312
Y M ++ FCLCP G+ SPR+VEA+ GC+PVII+++ LPF+D + W
Sbjct: 207 HELLPKGKNYTRLMGQSKFCLCPSGFEVASPRVVEAIYQGCVPVIISNNYSLPFSDVLNW 266
Query: 313 EEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILN 372
+ + + E +P++ IL I LR + ++R + +PA+P D H +L+
Sbjct: 267 SQFSVQIPVEKIPEIKMILQRISNSKYLRMHERVKR--VQRHFVLNRPAKPFDVIHMVLH 324
Query: 373 GL 374
L
Sbjct: 325 SL 326
>gi|356548516|ref|XP_003542647.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
[Glycine max]
Length = 340
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 84/301 (27%), Positives = 136/301 (45%), Gaps = 23/301 (7%)
Query: 80 SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPR------MMRSAIQLISSNWPYWN 133
S R NP +A F P + ++ + P ++ I++I+ +PYWN
Sbjct: 44 SHFRARNPNQAHVFLIP-FSIVNIVQYVYNRNLRQPGSQSIQLLVEDYIRVIAHKYPYWN 102
Query: 134 RTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIP 193
RTEGADHF + HD+G Y K + + +L A + F V + E + +
Sbjct: 103 RTEGADHFLLSCHDWGPTISYANPKLF-KNFIRVLCNANTSEGFRPNKDVSIPE--VNLL 159
Query: 194 PYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFD 253
P Q P D R+I +F G + R W++ N+
Sbjct: 160 PRGTLGSPNRGQHPND--RTILAFFAGREHGA---------IRTILLNHWKDKDNDVQIY 208
Query: 254 ISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE 313
S Y + M ++ FCLCP G+ SPR+VEA+ GC+PV+I+ PF D + W
Sbjct: 209 ESLPKGKVYTKLMGQSKFCLCPSGYEVASPRVVEAIYAGCVPVLISSSYSPPFTDVLNWS 268
Query: 314 EIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNG 373
+ + + E +P++ TIL S+ P+ L+ Q + ++R +PA+P D H IL+
Sbjct: 269 QFSVEIPVEKIPEIKTILQSVSPKKYLKLQMNVL--RVQRHFTINRPAKPFDLMHMILHS 326
Query: 374 L 374
+
Sbjct: 327 I 327
>gi|147805375|emb|CAN76361.1| hypothetical protein VITISV_035438 [Vitis vinifera]
Length = 459
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 86/312 (27%), Positives = 138/312 (44%), Gaps = 35/312 (11%)
Query: 80 SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSP---------RMMRSAIQLISSNWP 130
SP +P+EA F+ P+ + L LP SP R++ +++++ +P
Sbjct: 155 SPFIAKHPDEAHAFFLPL--SVVKVVQFLYLPITSPEDYSRKRLQRVVTDYVKVVADKYP 212
Query: 131 YWNRTEGADHFFVVPHDFGACFHYQ--------EEKAIERGILPLLQRATLVQTFGQRNH 182
YWNR+ GADHF V HD+ + + + + + +L A + F
Sbjct: 213 YWNRSGGADHFMVSCHDWVSSSSLEPMAPSVSYANPELFKNFIRVLCNANSSEGFRPGRD 272
Query: 183 VCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAV 242
V L E ++ PP Q P R I +F G + R
Sbjct: 273 VSLPEVNLPAGELGPPHLGQ-----PSNNRPILAFFAGRAHG---------NIRKILFEH 318
Query: 243 WENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 302
W++ N L Y + M ++ FCLCP G+ SPR+VEA+ GC+PVII++
Sbjct: 319 WKDQDNEVLVHERLHKGQNYAKLMGQSKFCLCPSGYEVASPRVVEAIHAGCVPVIISNXY 378
Query: 303 VLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQ 362
LPF D + W + + + E +P++ TIL I L+ Q + ++R + +PA+
Sbjct: 379 SLPFNDVLDWSQFSIQIPEAKIPEIKTILLGISKNKYLKMQERVLR--VRRHFVLNRPAR 436
Query: 363 PGDAFHQILNGL 374
P D H IL+ L
Sbjct: 437 PFDIIHMILHSL 448
>gi|302852375|ref|XP_002957708.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300257002|gb|EFJ41257.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 600
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 112/384 (29%), Positives = 170/384 (44%), Gaps = 60/384 (15%)
Query: 36 PVGKLKVYVYELP------SKYNKKLLLKDPRCLTHMFAA-EIFMHRFLLSSPVRTLNPE 88
P KLK+Y+Y+LP +YN +D M+AA E FM FL VRT NP
Sbjct: 228 PRSKLKIYMYDLPWEVAFPYEYNDGHFGRDK-----MYAAYEYFMTYFLQDHAVRTENPY 282
Query: 89 EADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDF 148
EA+ FY P+ + P+P + A+ + + WP++NRT G DHF+ + D
Sbjct: 283 EANLFYIPMLAYFYIANVRNPVP-----QVTLALDYVRTKWPFYNRTGGRDHFYFLTGDR 337
Query: 149 GACF--HYQEEKAIERGILPLLQRATLVQTFGQRNHVCLN-EGSITIPPYA-----PPQK 200
GAC + ++ I+ L R + C+ + + IPP P +
Sbjct: 338 GACSTPRWLQDSCIKLVHFGLQGEELPGTGVPNREYGCVQVKRDLVIPPINLFTDLVPSE 397
Query: 201 MQAH------QIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDI 254
QA+ + D+ R + +F G V PE Y+ G R A+ L
Sbjct: 398 TQAYYKWLVSKKGYDSNRKLLFFFAG---GVGQVPE---YSGGVRQAI------KGLLSS 445
Query: 255 STDHPT----------TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL 304
T P Y E +Q + FC+ P G+ W RL++A+ +GCIP+II D +
Sbjct: 446 LTPKPEDVEFFEGRVHNYKELLQSSKFCIAPYGFG-WGLRLIQAIEYGCIPLIIQDHVYQ 504
Query: 305 PF---ADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPA 361
PF D +P+EE + + D+P + +L S E L + R L +A ++ +
Sbjct: 505 PFERPKDFLPYEEFSVRMGLVDIPYMIELLRSY-TEAQLAQLR-LGMAKYYQAFIWNR-E 561
Query: 362 QPGDAFHQILNGLARKLPHDKSTY 385
G A+ L GL R+L H S Y
Sbjct: 562 YGGLAYEWTLAGLERRLAHMNSEY 585
>gi|356562054|ref|XP_003549290.1| PREDICTED: probable glycosyltransferase At5g25310-like [Glycine
max]
Length = 482
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 92/302 (30%), Positives = 137/302 (45%), Gaps = 30/302 (9%)
Query: 83 RTLNPEEADWFYTPIYPTCDLTPTGLPLPFKS---PRMMRSAIQLISSNWPYWNRTEGAD 139
RT +P A F+ P T + PL F + + ++++S+ P+WN T GAD
Sbjct: 191 RTNDPNAAHVFFLPFSVTWMVKYLYTPLSFNVTPLKKFVSDYVRVVSTRHPFWNITHGAD 250
Query: 140 HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQ 199
HF + HD+G Q + + +L A + F R V L E I Y
Sbjct: 251 HFMLACHDWGP-HASQGNPFLYNTSIRVLCNANTSEGFNPRKDVSLPE----IHLYGGEV 305
Query: 200 KMQAHQIPPDT-PRSIFVYFRGLFYDVNNDPEGGYYA--RGARAAVWENFKNNPLFDIST 256
+ PPDT PR +F GG + R A W+N N + +
Sbjct: 306 SPKLLSPPPDTAPRRYLAFF-----------SGGLHGPIRPALLGHWKNHDENDVIRVYE 354
Query: 257 DHPTT--YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEE 314
P YY M + FCLCP G SPR+VEA+ C+PVI+++ VLPF+D + WE
Sbjct: 355 YLPKDLDYYSFMLTSKFCLCPSGHEVASPRIVEAIYAECVPVILSEYYVLPFSDVLQWEA 414
Query: 315 IGMFVAEEDVPKLDTILTSIPPEVILRKQRLLAN--PSMKRAMLFPQPAQPGDAFHQILN 372
+ V D+P+L IL++I + K R L +++R +PA+ D FH IL+
Sbjct: 415 FSVQVDVSDIPRLKEILSAISED----KYRKLKEGVKAVRRHFTLNRPAKRFDVFHMILH 470
Query: 373 GL 374
+
Sbjct: 471 SI 472
>gi|414881250|tpg|DAA58381.1| TPA: hypothetical protein ZEAMMB73_758303 [Zea mays]
Length = 474
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 95/342 (27%), Positives = 158/342 (46%), Gaps = 25/342 (7%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTL-NPEEADWFYTPIY 98
LKVY+Y+ ++ + P L+ ++A+E + + L S + + +A FY P Y
Sbjct: 137 LKVYIYQ----DGRRPIFHTP-PLSGIYASEGWFMKLLKESRRHVVADAGKAHLFYLP-Y 190
Query: 99 PTCDLTPTGLPLPFKSPR----MMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHY 154
+ L T + R +R+ ++ ++S +P+WNRT GADHF V HD+G
Sbjct: 191 SSQQLRLTLYEAGSHNLRPLAAYLRNFVRGLASKYPFWNRTRGADHFLVACHDWGPYTTT 250
Query: 155 QEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSI 214
+ I L + F V L E +I P + ++ P + RSI
Sbjct: 251 AHRDLRKNAIKALCNADSSEGIFTPGKDVSLPETTIR----TPRRPLRYVGGLPVSRRSI 306
Query: 215 FVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLC 274
+F G + ++ G ++ + L Y + M+ + FCLC
Sbjct: 307 LAFFAGNVHGRVRPVLLRHWGNGQD----DDMRVYSLLPSRVSRRMNYIQHMKNSRFCLC 362
Query: 275 PLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSI 334
P+G+ SPR+VEA+ + C+PVIIAD+ VLPF++ + W + +AE+D+P L IL I
Sbjct: 363 PMGYEVNSPRIVEALYYECVPVIIADNFVLPFSEVLDWSAFSVVIAEKDIPDLKKILKGI 422
Query: 335 PPEVILRKQRLLANPSMKRAMLFPQPAQP--GDAFHQILNGL 374
LR+ + + + F A+P D FH IL+ +
Sbjct: 423 S----LRRYVAMHDSVKRLQRHFLWHARPIKYDLFHMILHSI 460
>gi|356548353|ref|XP_003542567.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 505
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 94/352 (26%), Positives = 155/352 (44%), Gaps = 50/352 (14%)
Query: 39 KLKVYVYE---LPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYT 95
+ KV+VYE LP + + C + F+H ++ RT +P++A F+
Sbjct: 175 QFKVFVYEEGELP-------VFHEGPCASIYSTEGSFIHAIEMNEHFRTRDPKKAHVFFL 227
Query: 96 P-----------IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVV 144
P I + D P R +R I +I++ +PYWNR+ GADHF +
Sbjct: 228 PFSVVMMVRYVYIRDSHDFGPI--------KRTVRDYINVIAARYPYWNRSLGADHFMLS 279
Query: 145 PHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAH 204
HD+G + + + + +L A + F R V E ++ P
Sbjct: 280 CHDWGPEAS-KFSPYLRKNSIRVLCNANTSEGFDPRKDVSFPEINLQRGPIDGLLGG--- 335
Query: 205 QIPPDTPRSIFVYFRGLFYDVNNDPEGGYYA--RGARAAVWENFKNNPLFDISTDHPTTY 262
P + RSI +F GG + R WE + +Y
Sbjct: 336 --PSASQRSILAFF-----------AGGIHGPIRPILLEHWEKKDEDIQVHQYLPKGVSY 382
Query: 263 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
Y ++++ FCLCP G+ SPR+VEA+ GC+PV+I+D V PF+D + W+ + V+ +
Sbjct: 383 YGMLRKSKFCLCPSGYEVASPRVVEAIYTGCVPVLISDHYVPPFSDVLNWKMFSVEVSMK 442
Query: 323 DVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
++P L IL +I P +R Q+ + ++R P + D FH IL+ +
Sbjct: 443 EIPNLKDILMNISPRKYIRMQKRVRQ--IRRHFEVHSPPKRYDVFHMILHSV 492
>gi|302852377|ref|XP_002957709.1| hypothetical protein VOLCADRAFT_119761 [Volvox carteri f.
nagariensis]
gi|300257003|gb|EFJ41258.1| hypothetical protein VOLCADRAFT_119761 [Volvox carteri f.
nagariensis]
Length = 1481
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 102/377 (27%), Positives = 159/377 (42%), Gaps = 84/377 (22%)
Query: 42 VYVYELPSKYNKKLL---LKDPRCLTH---------------MFAAEIFMHRFLLSSPVR 83
+YVY+L Y K+L + P C+ ++AA+ +H LL S R
Sbjct: 307 IYVYDLEPLYQSKILQYRISPPWCVHRRHDLPGNQTVWSDGWVYAADTLLHELLLISEHR 366
Query: 84 TLNPEEADWFYTPIYPTCDLTPTGLPLPFKS------------PR------MMRSAIQLI 125
T +PEEAD+FY P +C LP P S PR M+R + I
Sbjct: 367 TFDPEEADFFYVPHSASC------LPFPMGSWADYPWFLGPGGPRIRQMVNMLREVVDWI 420
Query: 126 SSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGI-LPLLQRATLVQTFGQ----- 179
+P+W R G DH ++ HD GAC+ K +E L R L G
Sbjct: 421 DKTYPFWRRRGGRDHIWLFTHDEGACW---APKVLENSTWLTHWGRMGLEHRSGTAFLAD 477
Query: 180 -------------------RNHVCLNE-GSITIPPYAPPQKMQAHQI--PPDTPRSIFVY 217
+ H C + + +P + P+ ++ + R IF++
Sbjct: 478 KYDIDFVSPHQPEGFLTHIKGHPCYDSTKDLVVPAFKQPRHYRSSPLLGSATKQRDIFLF 537
Query: 218 FRGLFYDVNNDPEGGYYARGARAAVWE-----NFKNNPLFDISTDHPTTYYED-MQRAIF 271
FRG DV +Y+RG R +++ N+K+ + T Y D + R+ F
Sbjct: 538 FRG---DVGKH-RMAHYSRGVRQKLYKLSVENNWKSKNVLIGGTHEVRGEYSDLLSRSQF 593
Query: 272 CLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTIL 331
CL G WS RL +AV+ GCIPVI+ D++ + F + + + + E+ +P++ IL
Sbjct: 594 CLVAAG-DGWSARLEDAVLHGCIPVIVIDEVHVVFESILNVDSFAVRIDEQQLPQILDIL 652
Query: 332 TSIPPEVILRKQRLLAN 348
+IP I KQ L +
Sbjct: 653 AAIPERKIRAKQAHLGH 669
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 87/293 (29%), Positives = 129/293 (44%), Gaps = 36/293 (12%)
Query: 67 FAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTP------------TGLPLPFKS 114
+ AE +H +LL S RT +PEEAD+FY P Y TC + P TG P +
Sbjct: 1043 YGAESALHEYLLLSEHRTFDPEEADFFYVPYYGTCMIWPVLHWADFPYFHTTGGPRILQV 1102
Query: 115 PRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLV 174
M+ + I+ +P+W R G DH F+ PHD GAC+ +L AT +
Sbjct: 1103 INMLIDTVDWINKMYPFWGRRGGRDHIFLFPHDEGACW-----------APNVLVNATWL 1151
Query: 175 QTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYY 234
+G+ + + ++ S Y + P RG DP Y
Sbjct: 1152 THWGRTDMIHESKTSFDADNYT------RDYVGWRQPGGFVNLIRG---HPCYDPVKIY- 1201
Query: 235 ARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCI 294
R A+ W++ K+N L + D P Y + + R++FCL G WS R +AV+ GCI
Sbjct: 1202 -RLAKENNWQD-KHNILIGDAADVPGDYSDLLSRSLFCLVATG-DGWSARTEDAVLHGCI 1258
Query: 295 PVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLA 347
PVII D + + F +E + + E + ++ IL IP I Q L
Sbjct: 1259 PVIIIDGVHIKFETVFSVDEFSIRIPEANASRILEILKEIPKTKIRSIQAHLG 1311
>gi|15239475|ref|NP_198516.1| Exostosin family protein [Arabidopsis thaliana]
gi|332006752|gb|AED94135.1| Exostosin family protein [Arabidopsis thaliana]
Length = 547
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 92/308 (29%), Positives = 146/308 (47%), Gaps = 33/308 (10%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTL-NPEEADWFYTPI 97
KLK+YVY K K + P ++A+E + + + S+ + +P +A FY PI
Sbjct: 239 KLKIYVY----KEGGKPIFHTP-MPRGIYASEGWFMKLMESNKKFVVKDPRKAHLFYIPI 293
Query: 98 YPTCDLTPTGLPLPFKSPRMM----RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFH 153
+ + L L F++P+ + + + LI+ + +WNRT GADHF V HD+G
Sbjct: 294 --SIKALRSSLGLDFQTPKSLADHLKEYVDLIAGKYKFWNRTGGADHFLVACHDWGNKLT 351
Query: 154 YQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRS 213
+ K R L + + Q F L I AP + + + R
Sbjct: 352 TKTMKNSVRS----LCNSNVAQGFRIGTDTALPVTYIR-SSEAPLEYLGG---KTSSERK 403
Query: 214 IFVYFRGLFYDVNNDPEGGYYARGARAAVWENFK-NNPLFDISTDHPTT---YYEDMQRA 269
I +F G + Y R +WEN + + +F P + Y E M+ +
Sbjct: 404 ILAFFAGSMHG---------YLRPILVKLWENKEPDMKIFGPMPRDPKSKKQYREYMKSS 454
Query: 270 IFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDT 329
+C+C G+ +PR+VEA++ C+PVIIAD+ V PF + + WEE +FV E+D+P L
Sbjct: 455 RYCICARGYEVHTPRVVEAIINECVPVIIADNYVPPFFEVLNWEEFAVFVEEKDIPNLRN 514
Query: 330 ILTSIPPE 337
IL SIP +
Sbjct: 515 ILLSIPED 522
>gi|255548866|ref|XP_002515489.1| catalytic, putative [Ricinus communis]
gi|223545433|gb|EEF46938.1| catalytic, putative [Ricinus communis]
Length = 481
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 87/303 (28%), Positives = 149/303 (49%), Gaps = 34/303 (11%)
Query: 83 RTLNPEEADWFYTPIYPTCDLTPTGLPLPF-KSP--RMMRSAIQLISSNWPYWNRTEGAD 139
RT +P+ A ++ P T + PL + SP + + ++++S+ +P+WNRT GAD
Sbjct: 192 RTRDPKRAHVYFMPFSVTWMVKYLYKPLTYDHSPLRQFVADYVRVLSTKYPFWNRTHGAD 251
Query: 140 HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSI---TIPP-- 194
HF + HD+G + + + + +L A + F R V L E + +PP
Sbjct: 252 HFMLACHDWGP-HASRGDHLLYNTSIRVLCNANTSEGFNPRKDVSLPEIHLYGGNVPPQL 310
Query: 195 YAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYA--RGARAAVWENFKNN-PL 251
+PP P +T R +F GG + R W++ +++ +
Sbjct: 311 LSPP--------PANTTRPHLAFF-----------AGGLHGPIRPLLLKHWKDRESDLRV 351
Query: 252 FDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
F+ H YY M R+ FCLCP G SPR+VE++ C+PVI++D VLPF+D +
Sbjct: 352 FEYLPKH-LDYYSFMLRSKFCLCPSGHEVASPRIVESIYAECVPVILSDHYVLPFSDVLR 410
Query: 312 WEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQIL 371
W+ + + ++P+L+ +L S+P E R + L +++ + QPA+ D FH IL
Sbjct: 411 WDAFSIQLNVSEIPRLEEVLRSVPEEKYERLKEGLR--TVRTHFMLNQPAKRFDVFHMIL 468
Query: 372 NGL 374
+ +
Sbjct: 469 HSI 471
>gi|115470965|ref|NP_001059081.1| Os07g0188700 [Oryza sativa Japonica Group]
gi|34394611|dbj|BAC83913.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
gi|50508944|dbj|BAD31848.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
gi|113610617|dbj|BAF20995.1| Os07g0188700 [Oryza sativa Japonica Group]
gi|125599387|gb|EAZ38963.1| hypothetical protein OsJ_23384 [Oryza sativa Japonica Group]
gi|215707162|dbj|BAG93622.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 606
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 96/353 (27%), Positives = 158/353 (44%), Gaps = 50/353 (14%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP--- 96
KV+VYE + + D C + F++ + + +RT +P++A F+ P
Sbjct: 272 FKVFVYE----EGEPPVFHDGPCRSIYSTEGRFIYAMEMENRMRTRDPDQAHVFFLPFSV 327
Query: 97 ------IYP--TCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDF 148
IY + D+ P R + I ++S+ +P+WNR+ GADHF + HD+
Sbjct: 328 VKMVKMIYEPNSHDMDPL--------RRTISDYINVVSTKYPHWNRSLGADHFMLSCHDW 379
Query: 149 GACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPP 208
G I +L A + F V L E I + +++ P
Sbjct: 380 GPYVSSANGHLFSNSIR-VLCNANTSEGFDPSRDVSLPE--INLRSDVVDRQVGG---PS 433
Query: 209 DTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTD------HPTTY 262
+ R I +F G ++ P R + +++ DI H +Y
Sbjct: 434 ASHRPILAFFAG----GDHGP--------VRPLLLQHWGKGQDADIQVSEYLPRRHGMSY 481
Query: 263 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
+ M+R+ FCLCP G+ SPR+VEA+ C+PV+I DD LPFAD + W + VA
Sbjct: 482 TDMMRRSRFCLCPSGYEVASPRVVEAIYLECVPVVIGDDYTLPFADVLNWAAFSVRVAVG 541
Query: 323 DVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPA-QPGDAFHQILNGL 374
D+P+L IL ++ P +R QR + +++R + A + D FH IL+ +
Sbjct: 542 DIPRLKEILAAVSPRQYIRMQRRVR--AVRRHFMVSDGAPRRFDVFHMILHSI 592
>gi|356565159|ref|XP_003550812.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
At5g20260-like [Glycine max]
Length = 488
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 80/300 (26%), Positives = 137/300 (45%), Gaps = 35/300 (11%)
Query: 87 PEEADWFYTPIYPTCDLTPTGLPLP-FKSPRMMRSAI---QLISSNWPYWNRTEGADHFF 142
P+EA F PI T + PL + ++MR + +I+ +PYWNRT+GADHF
Sbjct: 201 PDEAHVFMLPISVTQIVRYVYNPLTTYSRDQLMRITVDYTNIIAHRYPYWNRTKGADHFL 260
Query: 143 VVPHDFGACFHYQEE-KAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
HD+ +E + + + I+ +L A + F V + P +
Sbjct: 261 ASCHDWAPDISREESGRELFKNIIRVLCNANTSEGFKPEKDVPM-----------PEMNL 309
Query: 202 QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAV-------WENFKNNPLFDI 254
Q ++ P +D+NN ++A GA + W++
Sbjct: 310 QGFKLSSPIPG----------FDLNNRSILAFFAGGAHGRIRKILLEHWKDKDEEVQVHE 359
Query: 255 STDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEE 314
Y M ++ FCLCP G+ SPR+VE++ GC+PVI++D LPF+D + W +
Sbjct: 360 YLPKGVDYQGLMGQSKFCLCPSGYEVASPRIVESINIGCVPVIVSDYYQLPFSDVLDWSK 419
Query: 315 IGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+ + + ++ TIL ++P L+ Q+ + ++R +PA+P D FH IL+ +
Sbjct: 420 FSLHIPSRRIAEIKTILKNVPHAKYLKLQKRVM--KVQRHFELNRPAKPFDVFHMILHSI 477
>gi|356565155|ref|XP_003550810.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
[Glycine max]
Length = 496
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 87/308 (28%), Positives = 135/308 (43%), Gaps = 36/308 (11%)
Query: 80 SPVRTLNPEEADWF-------------YTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLIS 126
S R +PEEA F Y PI D P L L ++ I +I
Sbjct: 200 SQFRARHPEEAHVFFLPFSIANVVHYVYKPILKQSDYEPVRLQL------LVEDYISVIE 253
Query: 127 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLN 186
+PYWNR++GADHF + HD+ + + I L A + F V +
Sbjct: 254 DKYPYWNRSKGADHFLLSCHDWAPKVSNGNPELFQSFIRALCN-ANTSEGFHPNRDVSIP 312
Query: 187 EGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENF 246
E + + PP Q P++ R+I +F G + R W++
Sbjct: 313 EVYLPVGKLGPPSLGQH----PNS-RTILAFFAGGVHG---------EIRKILLKHWKDK 358
Query: 247 KNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 306
N Y + M ++ FCLCP G SPR+VEA+ GC+PVII D+ LPF
Sbjct: 359 DNEVRVHEYLPKSQNYTKLMGQSKFCLCPSGHEVASPRVVEAIHAGCVPVIICDNYSLPF 418
Query: 307 ADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDA 366
+D + W + + V+ + +P++ +IL SI + LR + ++R + +PA+P D
Sbjct: 419 SDVLHWSQFSVKVSVQKIPEIKSILQSISRKKYLRLHMNVLR--VRRHFMINRPAKPFDM 476
Query: 367 FHQILNGL 374
H IL+ +
Sbjct: 477 MHMILHSI 484
>gi|363808264|ref|NP_001242494.1| uncharacterized protein LOC100777368 [Glycine max]
gi|255635742|gb|ACU18220.1| unknown [Glycine max]
Length = 409
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 99/353 (28%), Positives = 160/353 (45%), Gaps = 56/353 (15%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
K KVY+Y P + + PR LT +A+E + + + S RT NP+EA F+ PI
Sbjct: 87 KFKVYIY--PDG-DPNTFYQTPRKLTGKYASEGYFFQNIRDSRFRTENPDEAHLFFIPI- 142
Query: 99 PTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
+C ++++ ++ + S +PYWNRT GADHFFV HD G +
Sbjct: 143 -SCHKMRGKGTSYENMTIIVQNYVESLISKYPYWNRTLGADHFFVTCHDVGV-------R 194
Query: 159 AIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY---APPQKMQAHQIPPDTPRSIF 215
A E G+ L++ + VC + P+ A PQ +Q +P
Sbjct: 195 ATE-GLEFLVKNSIRA--------VCSPSYDVGFIPHKDVALPQVLQPFALPAGGN---- 241
Query: 216 VYFRGLFYDVNNDPEGGYYA-------RGARAAVWENFKNNPLFDISTDHPT------TY 262
D+ N G++A R A VWEN + DIS + + Y
Sbjct: 242 --------DIENRTTLGFWAGHRNSKIRVILARVWEN---DTELDISNNRISRATGHLVY 290
Query: 263 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
+ R+ FC+CP G S R+ +++ +GCIPV++++ LPF D + W + + + E
Sbjct: 291 QKRFYRSKFCICPGGSQVNSARIADSIHYGCIPVMLSNYYDLPFNDILDWNKFAVVLKES 350
Query: 323 DVPKLDTILTSIP-PEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
DV +L IL +I E + L+ +++ + P+ DAFH ++ L
Sbjct: 351 DVYQLKQILKNISDAEFVTLHNNLV---KVQKHFQWNSPSIRFDAFHLVMYDL 400
>gi|125557509|gb|EAZ03045.1| hypothetical protein OsI_25187 [Oryza sativa Indica Group]
Length = 601
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 97/355 (27%), Positives = 157/355 (44%), Gaps = 54/355 (15%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP--- 96
KV+VYE + + D C + F++ + + +RT +P +A F+ P
Sbjct: 267 FKVFVYE----EGEPPVFHDGPCHSIYSTEGRFIYAMEMENRMRTRDPNQAHVFFLPFSV 322
Query: 97 ------IYP--TCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDF 148
IY + D+ P R + I ++S+ +P+WNR+ GADHF + HD+
Sbjct: 323 VKMVKMIYEPNSHDMDPL--------RRTISDYINVVSTKYPHWNRSLGADHFMLSCHDW 374
Query: 149 GACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQI-- 206
G I +L A + F V L E ++ + A Q+
Sbjct: 375 GPYVSSANGHLFSNSIR-VLCNANTSEGFDPSRDVSLPEINLR-------SDVVARQVGG 426
Query: 207 PPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTD------HPT 260
P + R I +F G ++ P R + +++ DI H
Sbjct: 427 PSASHRPILAFFAG----GDHGP--------VRPLLLQHWGKGQDADIQVSEYLPRRHSM 474
Query: 261 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
+Y + M+R+ FCLCP G+ SPR+VEA+ C+PV+I DD LPFAD + W + VA
Sbjct: 475 SYTDMMRRSRFCLCPSGYEVASPRVVEAIYLECVPVVIGDDYALPFADVLNWAAFSVRVA 534
Query: 321 EEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQ-PGDAFHQILNGL 374
D+P+L IL ++ P +R QR + +++R + A D FH IL+ +
Sbjct: 535 VGDIPRLKEILAAVSPRQYIRMQRRVR--AVRRHFMVSDGAPWRFDVFHMILHSI 587
>gi|224142567|ref|XP_002324627.1| predicted protein [Populus trichocarpa]
gi|222866061|gb|EEF03192.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 84/306 (27%), Positives = 142/306 (46%), Gaps = 27/306 (8%)
Query: 79 SSPVRTLNPEEADWFYTPIYPTCDLTPTGLP-LPFKSPRMMR---SAIQLISSNWPYWNR 134
SP +P+EA F+ PI + P + F ++ R +++++ + YWNR
Sbjct: 41 KSPFIARHPDEAHAFFLPISVAYIIHYVYKPRITFARDQLQRLVTDYVRVVADKYTYWNR 100
Query: 135 TEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPP 194
T+GADHF V HD+ + + R + +L A + + F + V + E + +
Sbjct: 101 TQGADHFSVSCHDWAPDVS-RANPELFRYFVRVLCNANISEGFRPQRDVSIPEIFLPVGK 159
Query: 195 YAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGY-YARGARAAVWENFKNNP--- 250
PP++ P + RSI +F G G + + R W+ +
Sbjct: 160 LGPPREYTK----PPSKRSILAFFAG----------GAHGHIRKVLLTHWKEKDDEVQVH 205
Query: 251 --LFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
L + + Y+E M ++ FCLCP G SPR+V A+ GC+PV I+ + LPF+D
Sbjct: 206 EYLTQRNKKNTNLYFELMGQSKFCLCPSGHEVASPRVVTAIQLGCVPVTISANYSLPFSD 265
Query: 309 AIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFH 368
+ W + + + E +P++ TIL I L QR + ++R + +PAQP D H
Sbjct: 266 VLDWSKFSVDIPPEKIPEIKTILKGISSRRYLTMQRRVMQ--IQRHFMLNRPAQPYDMLH 323
Query: 369 QILNGL 374
IL+ +
Sbjct: 324 MILHSV 329
>gi|302804767|ref|XP_002984135.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300147984|gb|EFJ14645.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 376
Score = 118 bits (295), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 96/363 (26%), Positives = 156/363 (42%), Gaps = 30/363 (8%)
Query: 15 ADNTPKIERISGSAGDVLED--DPVGKLKVYVYEL-PSKYNKKLLLKDPRCLTHMFAAEI 71
ADN P R + L+D D V +KV+VY P + L + +A+E
Sbjct: 29 ADNGPYHSRRA-----FLDDYRDMVNTMKVFVYPCSPRDPFSHIFLPTSSAPSGNYASEA 83
Query: 72 FMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPY 131
+ + L S + T +P +AD F+ P T + + + P +R ++ IS WPY
Sbjct: 84 YFKKALAGSGMVTDDPSQADLFFMPFSITRLRNDPKVGVG-RMPAFVRDYVKNISHRWPY 142
Query: 132 WNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSIT 191
WNRT G+DHF+V H G + + I + VQ F V +
Sbjct: 143 WNRTGGSDHFYVACHSIGKVALEKAQHVKLNAIQVVCSSNYYVQGFIPHKDVAI------ 196
Query: 192 IPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPL 251
P P+ +I R + +F G +N P R W N + +
Sbjct: 197 --PQIWPRSESFREIKTIEQRKVLAFFAG----GSNSP-----VRANVVRTWRN--DTQI 243
Query: 252 FDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
+ +Y E + R+ FCL G+ + RL +A +GC+PV+IA+ LPF+ +
Sbjct: 244 HAYPSRIQGSYAEALLRSKFCLHVKGYEVNTARLGDAFFYGCVPVVIANHYDLPFSSVLN 303
Query: 312 WEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQIL 371
W+ + V ++PKL IL+ I E + RL+ + +R + P + DAF+ ++
Sbjct: 304 WKSFSVVVTTANIPKLKAILSGISREDYSQMHRLVLDA--RRHFQWHAPPREYDAFYMVM 361
Query: 372 NGL 374
L
Sbjct: 362 YQL 364
>gi|449431924|ref|XP_004133750.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
sativus]
Length = 412
Score = 117 bits (294), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 100/356 (28%), Positives = 158/356 (44%), Gaps = 54/356 (15%)
Query: 35 DPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFY 94
D K KVY+Y P + + PR LT +A+E + + + S RT +P++A F+
Sbjct: 86 DMESKFKVYIY--PDG-DPNTFYQTPRKLTGKYASEGYFFQNIRESRFRTEDPDQAHLFF 142
Query: 95 TPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHY 154
PI +C ++++ ++ + S +PYWNRT GADHFFV HD G
Sbjct: 143 IPI--SCHKMRGKGTSYENMTVIVQNYVEGLISKYPYWNRTLGADHFFVTCHDVGV---- 196
Query: 155 QEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY---APPQKMQAHQIPPDTP 211
+A E G+ L++ A V VC + P+ A PQ +Q +P
Sbjct: 197 ---RASE-GLPFLIKNAIRV--------VCSPSYDVGFIPHKDVALPQVLQPFALPAGGN 244
Query: 212 RSIFVYFRGLFYDVNNDPEGGYYA-------RGARAAVWENFKNNPLFDISTDHPTT--- 261
D N G++A R A VWEN + DIS + +
Sbjct: 245 ------------DTENRTTLGFWAGHRNSKIRVILARVWEN---DTELDISNNRISRATG 289
Query: 262 ---YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
Y + + FC+CP G S R+ +++ +GC+PVI++D LPF D + W + +
Sbjct: 290 HLLYQKRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDILDWRKFSVI 349
Query: 319 VAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
V E DV +L IL I ++ + L +++ + P DAFH ++ L
Sbjct: 350 VKERDVYQLKQILKDISDIEFIKLHKNLMQ--VQKHFQWNSPPIKYDAFHMVMYDL 403
>gi|224071347|ref|XP_002303415.1| predicted protein [Populus trichocarpa]
gi|222840847|gb|EEE78394.1| predicted protein [Populus trichocarpa]
Length = 326
Score = 117 bits (294), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 97/352 (27%), Positives = 158/352 (44%), Gaps = 54/352 (15%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
K+Y+Y P + + PR LT +A+E + + + S RTL+P++A F+ PI
Sbjct: 4 NFKIYIY--PDG-DPNTFYQTPRKLTGKYASEGYFFQNIRESRFRTLDPDQAHLFFIPI- 59
Query: 99 PTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
+C ++ + ++ + + + YWNRT GADHFFV HD G +
Sbjct: 60 -SCHKMRGKGTSYENMTVIVENYVESLIAKYSYWNRTLGADHFFVTCHDVGV-------R 111
Query: 159 AIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY---APPQKMQAHQIPPDTPRSIF 215
A E G+ L++ A V VC + P+ A PQ +Q +P
Sbjct: 112 ATE-GVPFLIKNAIRV--------VCSPSYDVGFIPHKDVALPQVLQPFALPAGGN---- 158
Query: 216 VYFRGLFYDVNNDPEGGYYA--RGAR-----AAVWENFKNNPLFDISTDHPT------TY 262
DV N G++A R +R A VWEN + DIS++ Y
Sbjct: 159 --------DVENRTTLGFWAGHRNSRIRVILARVWEN---DTELDISSNRINRATGHLVY 207
Query: 263 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
+ FC+CP G S R+ +++ +GC+PVI+++ LPF D + W + + + E+
Sbjct: 208 QKRFYGTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWHKFSVILKEQ 267
Query: 323 DVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
DV +L IL IP + + L +++ + P DAFH ++ L
Sbjct: 268 DVYQLKQILKDIPDNQFVSLHKNLI--KVQKHFQWNSPPIKNDAFHMVMYDL 317
>gi|224120366|ref|XP_002331030.1| predicted protein [Populus trichocarpa]
gi|222872960|gb|EEF10091.1| predicted protein [Populus trichocarpa]
Length = 339
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 88/303 (29%), Positives = 136/303 (44%), Gaps = 35/303 (11%)
Query: 86 NPEEADWFYTPIYPTCDLTPTGLP-LPFKSPRMMR---SAIQLISSNWPYWNRTEGADHF 141
+PEEA F P+ + P + F ++ R +++I+ PYWNRT GADHF
Sbjct: 48 HPEEAQAFLLPVSVAYIIHYVYRPRITFSRDQLQRLVTDYVRVIADKHPYWNRTHGADHF 107
Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
V HD+ + + + + + L A + F + V + E + + PPQ+
Sbjct: 108 SVSCHDWAPDVS-RADPGLFKYFIRALCNANTSEGFQPQRDVSIPEIFLPVGKLGPPQEY 166
Query: 202 QAHQIPPDTPRSIFVYFRGLFYD----------VNNDPEGGYYARGARAAVWENFKNNPL 251
P + RSI +F G + D E + R +N KNN L
Sbjct: 167 AQ----PPSKRSILAFFAGGAHGHIRKILLERWKEKDDEIQVHEYLTR----KNKKNNNL 218
Query: 252 FDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
Y+E M ++ FCLCP G SPR+V A+ GC+PV I+D+ LPF+D +
Sbjct: 219 ----------YFELMGQSKFCLCPSGHEVASPRVVTAIQLGCVPVTISDNYSLPFSDVLD 268
Query: 312 WEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQIL 371
W + + + E +P + IL I L QR + ++R +PAQP D H IL
Sbjct: 269 WSKFSVDIPSEKIPDIKIILKGISVRRYLTMQRRVMQ--IRRHFTLNRPAQPYDMLHMIL 326
Query: 372 NGL 374
+ +
Sbjct: 327 HSV 329
>gi|168062371|ref|XP_001783154.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162665352|gb|EDQ52040.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 351
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 77/258 (29%), Positives = 125/258 (48%), Gaps = 15/258 (5%)
Query: 122 IQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRN 181
++ I+S +PYWNRT GADHFFV HD+ ++ + + ++ A L F +
Sbjct: 87 VERITSKYPYWNRTRGADHFFVSCHDWAPLSTILHDE-LHNNSMKVVCNADLTANFDIQK 145
Query: 182 HVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAA 241
V + + ++ +PP R ++ G + + P + RG ++
Sbjct: 146 DVSIPQAVKG----GNQSELDIDNLPPGK-RDYLAFYAGQMHGLVR-PVLIQHWRGKDSS 199
Query: 242 --VWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIA 299
V+E +IS Y + M+R+ FCLCP G+ SPR+VEA++ GC+PVIIA
Sbjct: 200 MKVYEVLPPEIAKNIS------YAQHMKRSKFCLCPKGFEVNSPRIVEAILSGCVPVIIA 253
Query: 300 DDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQ 359
D+ VLPF++ + W + + V E+D+P L ILT++P Q L L Q
Sbjct: 254 DNFVLPFSNVLDWSKFSITVEEKDIPNLKRILTNVPDGTYRSMQSCLKYIRRHFVWLEDQ 313
Query: 360 PAQPGDAFHQILNGLARK 377
D+FH + + R+
Sbjct: 314 EDTQYDSFHMTMYSIWRQ 331
>gi|356562965|ref|XP_003549738.1| PREDICTED: probable glycosyltransferase At3g42180-like [Glycine
max]
Length = 473
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 90/311 (28%), Positives = 137/311 (44%), Gaps = 42/311 (13%)
Query: 80 SPVRTLNPEEADWF-------------YTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLIS 126
SP + +PE+A F Y P D P L R++ I +++
Sbjct: 176 SPFKATHPEQAHLFLLPYSVSKVIRYVYKPRRSRSDYDPDRLQ------RLVADYINILA 229
Query: 127 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLN 186
+ +PYWNR++GADHF V HD+G + + I L A + F V +
Sbjct: 230 NRYPYWNRSKGADHFLVSCHDWGPRISDANPELFKYFIRALCN-ANTSEGFQPNRDVSIP 288
Query: 187 EGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAV--WE 244
E + PP Q P+ R+I +F GG + + + + W+
Sbjct: 289 EVYLPSGKLGPPNMGQH----PNN-RTILAFF-----------AGGAHGKIRKKLLKRWK 332
Query: 245 NFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL 304
N Y + M + FCLCP G SPR+VEA+ GC+PVII D+ L
Sbjct: 333 NKDKEVQVHEYLPKGQDYTKLMGLSKFCLCPSGHEVASPRVVEAIYAGCVPVIICDNYSL 392
Query: 305 PFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLAN-PSMKRAMLFPQPAQP 363
PF D + W + M +A E +P++ TIL S+ + L L +N ++R + +PA+P
Sbjct: 393 PFIDVLNWRKFSMEIAVERMPEIKTILQSVSKDKYL---ELYSNVRRVRRHFVINRPAKP 449
Query: 364 GDAFHQILNGL 374
D H IL+ L
Sbjct: 450 FDLIHMILHSL 460
>gi|224087070|ref|XP_002308053.1| predicted protein [Populus trichocarpa]
gi|222854029|gb|EEE91576.1| predicted protein [Populus trichocarpa]
Length = 339
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 86/299 (28%), Positives = 135/299 (45%), Gaps = 27/299 (9%)
Query: 86 NPEEADWFYTPIYPTCDLTPTGLP-LPFKSPRMMR---SAIQLISSNWPYWNRTEGADHF 141
+PEEA F P+ + P + F ++ R +++I+ PYWNRT GADHF
Sbjct: 48 HPEEAQAFLLPVSVAYIIHYVYRPRITFSRDQLQRLVTDYVRVIADKHPYWNRTHGADHF 107
Query: 142 FVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKM 201
V HD+ + + + + + L A + F + V + E + + PPQ+
Sbjct: 108 SVSCHDWAPDVS-RADPGLFKYFIRALCNANTSEGFQPQRDVSIPEIFLPVGKLGPPQEY 166
Query: 202 QAHQIPPDTPRSIFVYFRGLFYD------VNNDPEGGYYARGARAAVWENFKNNPLFDIS 255
P + RSI +F G + + E + +N KNN L
Sbjct: 167 AQ----PPSKRSILAFFAGGAHGHIRKILLERWKEKDDEIQVHEYLTQKNKKNNNL---- 218
Query: 256 TDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEI 315
Y+E M ++ FCLCP G SPR+V A+ GC+PV I+D+ LPF+D + W +
Sbjct: 219 ------YFELMGQSKFCLCPSGHEVASPRVVTAIQLGCVPVTISDNYSLPFSDVLDWSKF 272
Query: 316 GMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+ + E +P + IL I L QR + ++R +PAQP D H IL+ +
Sbjct: 273 SVDIPSEKIPDIKIILKGISVRRYLTMQRRVMQ--IRRHFTLNRPAQPYDMLHMILHSV 329
>gi|9758523|dbj|BAB08970.1| unnamed protein product [Arabidopsis thaliana]
Length = 559
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 99/352 (28%), Positives = 166/352 (47%), Gaps = 44/352 (12%)
Query: 39 KLKVYVYELPSK--YNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTL-NPEEADWFYT 95
KLK+YVY+ K ++ + PR + +A+E + + + S+ + +P +A FY
Sbjct: 225 KLKIYVYKEGGKPIFHTPM----PRGI---YASEGWFMKLMESNKKFVVKDPRKAHLFYI 277
Query: 96 PIYPTCDLTPTGLPLPFKSPRMM----RSAIQLISSNWPYWNRTEGADHFFVVPHDFGAC 151
PI + + L L F++P+ + + + LI+ + +WNRT GADHF V HD+G
Sbjct: 278 PI--SIKALRSSLGLDFQTPKSLADHLKEYVDLIAGKYKFWNRTGGADHFLVACHDWGNK 335
Query: 152 FHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTP 211
+ K R L + + Q F L I AP + + +
Sbjct: 336 LTTKTMKNSVRS----LCNSNVAQGFRIGTDTALPVTYIR-SSEAPLEYLGG---KTSSE 387
Query: 212 RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFK-NNPLFDISTDHPTT---YYEDMQ 267
R I +F G + Y R +WEN + + +F P + Y E M+
Sbjct: 388 RKILAFFAGSMHG---------YLRPILVKLWENKEPDMKIFGPMPRDPKSKKQYREYMK 438
Query: 268 RAI-----FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
+ +C+C G+ +PR+VEA++ C+PVIIAD+ V PF + + WEE +FV E+
Sbjct: 439 SSSSHFNRYCICARGYEVHTPRVVEAIINECVPVIIADNYVPPFFEVLNWEEFAVFVEEK 498
Query: 323 DVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
D+P L IL SIP + + Q + ++++ L+ + D FH IL+ +
Sbjct: 499 DIPNLRNILLSIPEDRYIGMQARV--KAVQQHFLWHKKPVKFDQFHMILHSI 548
>gi|449482564|ref|XP_004156325.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
sativus]
Length = 516
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 92/356 (25%), Positives = 152/356 (42%), Gaps = 58/356 (16%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP-- 96
++K++VYE + L + C + F+H + S RT +P +A F+ P
Sbjct: 189 EMKIFVYE----EGEPPLFHNGPCKSIYSTEGNFIHAIEMDSQFRTKDPNKAHVFFLPLS 244
Query: 97 ---------IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHD 147
++ + D TP + I +I + +P+WNR+ GADHF + HD
Sbjct: 245 VAMLVRFVYVHDSHDFTPIR--------HTVVDYINVIGTKYPFWNRSLGADHFMLSCHD 296
Query: 148 FGACFHYQEEKAI---ERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAH 204
+G + K++ + + +L A + F V P +Q
Sbjct: 297 WGP----EASKSVPNLYKNSIRVLCNANTSEGFNPSKDVSF-----------PEINLQTG 341
Query: 205 QI------PPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDH 258
+ P + R I +F G + R WEN +
Sbjct: 342 HLTGFLGGPSPSHRPILAFFAGGLHGP---------IRPILIQQWENQDQDIQVHQYLPK 392
Query: 259 PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
+Y + M+++ FCLCP G+ SPR+VEA+ GC+PV+I+D V PF+D I W+ +
Sbjct: 393 GVSYIDMMRKSKFCLCPSGYEVASPRIVEAIYTGCVPVLISDHYVPPFSDVINWKSFSVE 452
Query: 319 VAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
V+ +D+P L TILT I LR R + ++R P + D +H IL+ +
Sbjct: 453 VSVDDIPNLKTILTGISTRQYLRMYRRVVK--VRRHFEVNSPPKRYDVYHMILHSV 506
>gi|449451243|ref|XP_004143371.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
sativus]
Length = 516
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 92/356 (25%), Positives = 152/356 (42%), Gaps = 58/356 (16%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP-- 96
++K++VYE + L + C + F+H + S RT +P +A F+ P
Sbjct: 189 EMKIFVYE----EGEPPLFHNGPCKSIYSTEGNFIHAIEMDSQFRTKDPNKAHVFFLPLS 244
Query: 97 ---------IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHD 147
++ + D TP + I +I + +P+WNR+ GADHF + HD
Sbjct: 245 VAMLVRFVYVHDSHDFTPIR--------HTVVDYINVIGTKYPFWNRSLGADHFMLSCHD 296
Query: 148 FGACFHYQEEKAI---ERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAH 204
+G + K++ + + +L A + F V P +Q
Sbjct: 297 WGP----EASKSVPNLYKNSIRVLCNANTSEGFNPSKDVSF-----------PEINLQTG 341
Query: 205 QI------PPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDH 258
+ P + R I +F G + R WEN +
Sbjct: 342 HLTGFLGGPSPSHRPIMAFFAGGLHGP---------IRPILIQRWENQDQDIQVHQYLPK 392
Query: 259 PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
+Y + M+++ FCLCP G+ SPR+VEA+ GC+PV+I+D V PF+D I W+ +
Sbjct: 393 GVSYIDMMRKSKFCLCPSGYEVASPRIVEAIYTGCVPVLISDHYVPPFSDVINWKSFSVE 452
Query: 319 VAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
V+ +D+P L TILT I LR R + ++R P + D +H IL+ +
Sbjct: 453 VSVDDIPNLKTILTGISTRQYLRMYRRVVK--VRRHFEVNSPPKRYDVYHMILHSV 506
>gi|302780894|ref|XP_002972221.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300159688|gb|EFJ26307.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 376
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 96/363 (26%), Positives = 156/363 (42%), Gaps = 30/363 (8%)
Query: 15 ADNTPKIERISGSAGDVLED--DPVGKLKVYVYEL-PSKYNKKLLLKDPRCLTHMFAAEI 71
ADN P R + L+D D V +KV+VY P + L + +A+E
Sbjct: 29 ADNGPYHSRRA-----FLDDYRDMVNTMKVFVYPCSPRDPFSHIFLPTSSAPSGNYASEA 83
Query: 72 FMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPY 131
+ + L S + T +P +AD F+ P T + + + P +R ++ IS WPY
Sbjct: 84 YFKKALAESGMVTDDPSQADLFFMPFSITRLRNDPKVGVG-RMPAFVRDYVKNISHRWPY 142
Query: 132 WNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSIT 191
WNRT G+DHF+V H G + + I + VQ F V +
Sbjct: 143 WNRTGGSDHFYVACHSIGKVALEKAQHVRLNAIQVVCSSNYYVQGFIPHKDVAM------ 196
Query: 192 IPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPL 251
P P+ +I R + +F G +N P R W N + +
Sbjct: 197 --PQIWPRSESFREIKTIEQRKVLAFFAG----GSNSP-----VRANVVRTWRN--DTQI 243
Query: 252 FDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
+ +Y E + R+ FCL G+ + RL +A +GC+PV+IA+ LPF+ +
Sbjct: 244 HAYPSRIQGSYAEALLRSKFCLHVKGYEVNTARLGDAFFYGCVPVVIANYYDLPFSSVLN 303
Query: 312 WEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQIL 371
W+ + V ++PKL IL+ I E + RL+ + +R + P + DAF+ ++
Sbjct: 304 WKSFSVVVTTANIPKLKAILSGISREDYSQMHRLVLDA--RRHFQWHAPPREYDAFYMVM 361
Query: 372 NGL 374
L
Sbjct: 362 YQL 364
>gi|326529697|dbj|BAK04795.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 348
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 75/259 (28%), Positives = 123/259 (47%), Gaps = 14/259 (5%)
Query: 118 MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTF 177
+R ++ +++ +P+WNRT GADHF V HD+G + I L + F
Sbjct: 90 LRDFVKGLAAKYPFWNRTRGADHFLVACHDWGPYTTTAHRDLSKNSIKALCNADSSEGIF 149
Query: 178 GQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARG 237
V L E +I P + ++ P + R I +F G + ++ +G
Sbjct: 150 TPGKDVSLPETTIR----TPKRPLRYVGGLPVSRRRILAFFAGNVHGRVRPVLLQHWGKG 205
Query: 238 ARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVI 297
++ + L Y + M+ + FCLCP+G+ SPR+VEA+ + C+PVI
Sbjct: 206 QD----DDMRVYALLPGRVSRTMNYIQHMKNSKFCLCPMGYEVNSPRIVEALYYECVPVI 261
Query: 298 IADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLAN--PSMKRAM 355
IAD+ VLPF+D + W + VAE+D+P+L IL I LR+ + + ++R
Sbjct: 262 IADNFVLPFSDVLDWSAFSVVVAEKDIPELKRILQGIS----LRRYVAMHDCVKRLQRHF 317
Query: 356 LFPQPAQPGDAFHQILNGL 374
L+ D FH IL+ +
Sbjct: 318 LWYDRPLRYDLFHMILHSI 336
>gi|302830910|ref|XP_002947021.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300268065|gb|EFJ52247.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 834
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 104/372 (27%), Positives = 156/372 (41%), Gaps = 78/372 (20%)
Query: 42 VYVYELPSKYNKKLL-LKDPRCL-----------THMFA----AEIFMHRFLLSSPVRTL 85
+YVY+LP YN ++L ++ + L T +FA E+ H LL S RT
Sbjct: 391 IYVYDLPPAYNARMLQYRNDKGLCTWRGFGSGNRTEIFAWTYGLEVLFHEMLLQSEHRTF 450
Query: 86 NPEEADWFYTPIYPTCDL-------------TPTGLPLPFKSPRMMRSAIQLISSNWPYW 132
+PE AD+FY P+Y +C + P+G P MM I ++PYW
Sbjct: 451 DPEAADYFYVPVYGSCFIFPLHCYADGPWWHVPSG-PRVMHVTNMMLEVRDWIRKHFPYW 509
Query: 133 NRTEGADHFFVVPHDFGACF--------------------HYQEEKAI------ERGILP 166
+R G DH +++ HD GAC+ H+ A + + P
Sbjct: 510 DRRGGRDHIWLMTHDEGACYAPTEIYNSSIFLTHWGRIDKHHASNTAFTPDNYTQEYVHP 569
Query: 167 LLQRATLVQTFGQRNHVCLNEGS-ITIPPYAPPQKMQAHQI--PPDTPRSIFVYFRGLFY 223
L G H C G + +P P + + P R I +Y RG
Sbjct: 570 EQPGGWLHLIDG---HPCYTPGKDLVVPALKLPHHFRQSPLLFHPPRQRDILLYLRG--- 623
Query: 224 DVNND--PEGGYYARGARAAVWENFKN-------NPLFDISTDHPTTYYEDMQRAIFCLC 274
DV P Y+RG R ++ +++ N + +D P Y E + R+ FCL
Sbjct: 624 DVGKHRLPN---YSRGIRQRLYRLWRDQQWLQGYNVMIGDGSDVPGDYSEHLSRSKFCLV 680
Query: 275 PLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSI 334
G WSPRL +AV+ GC+PVII D + + D + E + V E+++ L L +
Sbjct: 681 VPG-DGWSPRLEDAVLHGCVPVIIMDGVHGVWEDQLELERFSIRVGEDELEGLPQQLAVV 739
Query: 335 PPEVILRKQRLL 346
P V+ QR L
Sbjct: 740 PQRVLEDMQRKL 751
>gi|115456850|ref|NP_001052025.1| Os04g0109900 [Oryza sativa Japonica Group]
gi|38345576|emb|CAE01775.2| OSJNBa0027H06.12 [Oryza sativa Japonica Group]
gi|113563596|dbj|BAF13939.1| Os04g0109900 [Oryza sativa Japonica Group]
gi|125589112|gb|EAZ29462.1| hypothetical protein OsJ_13536 [Oryza sativa Japonica Group]
Length = 441
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 98/347 (28%), Positives = 163/347 (46%), Gaps = 30/347 (8%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP------VRTLNPEEADW 92
+ KVYVYE + +L + C F+ + L SP VRT +P A
Sbjct: 103 RFKVYVYE----EGEPPILHEGPCKNIYTIEGSFIEQLELMSPSDAGGGVRTWDPTRAHA 158
Query: 93 FYTPIYPTCDLT-----PTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHD 147
F+ P + + P+ P ++ ++ ++++++ P+WNR+ GADHF + HD
Sbjct: 159 FFLPFSVSQMVKFVYRPPSQDRAPLRA--IVADYVRVVAARHPFWNRSAGADHFMLSCHD 216
Query: 148 FGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIP 207
+G + + + + + L A + F V + E I + P+++ A P
Sbjct: 217 WGP-YASRGQPELYTNAIRALCNANTSEGFRPGKDVSVPE--INLYDGDMPRELLA-PAP 272
Query: 208 PDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQ 267
R + +F G + D ++ +G AA + P+++ YY M+
Sbjct: 273 GLESRPLLAFFAGGRHGHVRDLLLRHW-KGRDAATF------PVYEYDLPAAGDYYSFMR 325
Query: 268 RAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKL 327
RA FCLCP G SPR+VEA+ C+PV+IAD LPFAD + WE + VA D+P+L
Sbjct: 326 RARFCLCPSGHEVASPRVVEAIQAECVPVVIADGYALPFADVLRWEAFSVAVAVGDIPRL 385
Query: 328 DTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
L IP + R +R + +KR ++ QP + D F+ IL+ +
Sbjct: 386 RERLERIPAAEVERLRRGVR--LVKRHLMLQQPPRRLDMFNMILHSV 430
>gi|159480086|ref|XP_001698117.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158273916|gb|EDO99702.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 821
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 108/386 (27%), Positives = 157/386 (40%), Gaps = 85/386 (22%)
Query: 36 PVGKLK--VYVYELPSKYNKKLLLKDPRCLTH------------------MFAAEIFMHR 75
P G+ + +YVY++P KY ++L R L H ++ E H
Sbjct: 342 PGGRRRPLIYVYDVPPKYTSRMLQY--RLLAHACLWRRWLDGNNTELTGWTYSVETMFHE 399
Query: 76 FLLSSPVRTLNPEEADWFYTPIYPTCDLTPT-------GLPLPFKSPRMMRSAIQL---- 124
LL S RT +PEEAD+FY P Y TC P P+ R M A L
Sbjct: 400 LLLQSEHRTFDPEEADFFYVPHYVTCYFWPIMGWADAPWWHAPYVDSRPMHGANMLTELH 459
Query: 125 --ISSNWPYWNRTEGADHFFVVPHDFGACF-------------HYQEEKAIERGILPLLQ 169
+ +N PYW+R G DH +++ D GAC+ H+ +A LQ
Sbjct: 460 GWLRTNLPYWDRRGGRDHIWLMAADEGACWMPTAIYNTSIVLTHWGRLEANHTSGTAYLQ 519
Query: 170 RATLVQTFG-QR-----------NHVCLN-EGSITIPPYAPPQKMQAHQI--PPDTPRSI 214
+G QR H C + + + IP + PP + P R I
Sbjct: 520 DVYDRPVYGFQRWPGVDYHHDIEGHPCFDPKKDLVIPAFKPPFHFARSPLLGAPPLQRDI 579
Query: 215 FVYFRGLFYDVNNDPEGGY----YARGARAAVWENFKNNPLFD---ISTDHPTT----YY 263
+YFRG G + Y+RG R + + F+ I H Y
Sbjct: 580 LLYFRG--------DSGAFRLPQYSRGIRQRITDLSNRQDWFNRYKIVISHGGMVGGDYS 631
Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
E + R+ FCL G WSPR +A++ GCIPV++ D + F + W+ + + E+D
Sbjct: 632 EHLARSKFCLVAPG-DGWSPRAEDAILHGCIPVVVMDGVQAVFESILDWDSFSLRIREDD 690
Query: 324 --VPKLDTILTSIPPEVILRKQRLLA 347
+ L +L SI PE + QR LA
Sbjct: 691 AALEALPQLLASISPERLAHMQRHLA 716
>gi|297808029|ref|XP_002871898.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297317735|gb|EFH48157.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 610
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 94/351 (26%), Positives = 159/351 (45%), Gaps = 48/351 (13%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTL-NPEEADWFYTP-- 96
LKVYVY K + + P L ++A+E + + + + T+ +P +A +Y P
Sbjct: 281 LKVYVY----KEGNRPIFHTP-ILKGLYASEGWFMKLMEGNKQYTVKDPRKAHLYYMPFS 335
Query: 97 -------IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFG 149
+Y T L + ++ + ISS +P++NRT+GADHF V HD+
Sbjct: 336 ARMLEYTLYVRNSHNRTNLR------QFLKEYTEHISSKYPFFNRTDGADHFLVACHDWA 389
Query: 150 ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPD 209
Y+ +E I L A + F + L E + A ++ P
Sbjct: 390 P---YETRHHMEHCIKALCN-ADVTAGFKIGRDISLPETYVR----AAKNPLRDLGGKPP 441
Query: 210 TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP------TTYY 263
+ R ++ G + Y R W++ +P I P Y
Sbjct: 442 SQRRTLAFYAGSMHG---------YLRQILLQHWKD--KDPDMKIFGRMPFGVASKMNYI 490
Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
E M+ + +C+CP G+ SPR+VE++ + C+PVII+D+ V PF + + W + VAE+D
Sbjct: 491 EQMKSSKYCICPKGYEVNSPRVVESIFYECVPVIISDNFVPPFFEVLDWSAFSVIVAEKD 550
Query: 324 VPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+P+L IL+SIP E ++ Q +A +R L+ + D FH +L+ +
Sbjct: 551 IPRLKDILSSIPEEKYVKMQ--MAVRKAQRHFLWHAKPEKYDLFHMVLHSI 599
>gi|255582307|ref|XP_002531944.1| catalytic, putative [Ricinus communis]
gi|223528390|gb|EEF30426.1| catalytic, putative [Ricinus communis]
Length = 363
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 84/332 (25%), Positives = 143/332 (43%), Gaps = 53/332 (15%)
Query: 79 SSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPF----KSPRMMRSAIQLISSNWPYWNR 134
SP + +EA F+ P+ + LP+ K R+ + +++++ +P+WNR
Sbjct: 40 ESPFIARHADEAHVFFLPVSVAHIVEYIYLPITSYDRDKLIRVFKDYVKVVADKYPFWNR 99
Query: 135 TEGADHFFVVPHD------------------------FGACFHY-------QEEKAIERG 163
+ G+DHF + HD F FH Q + + +
Sbjct: 100 SSGSDHFMLSCHDWAMVHINSIPVELVRAFALGRNLKFEISFHSHRAPEISQHDPELYKN 159
Query: 164 ILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFY 223
++ ++ A + F V L E + IPP H P R I +F G
Sbjct: 160 LIRVICNANTSEGFLPTRDVTLPE--LNIPPGGFDH---VHHCLPSHKRRILAFFAG--- 211
Query: 224 DVNNDPEGGY-YARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWS 282
G + Y R W+N + Y + M ++ FCLCP G+ S
Sbjct: 212 -------GAHGYIRKILLHHWKNKDDEVQVHEYLSKDEDYRKLMGQSKFCLCPSGYEVAS 264
Query: 283 PRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRK 342
PR+VE++ GCIPVII+D LPF+D + W +I + + E +P++ TIL + + LR
Sbjct: 265 PRIVESIYAGCIPVIISDHYNLPFSDVLDWSQISVQIPVEKIPEIKTILKGVSNDKYLRM 324
Query: 343 QRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
Q+ + ++R +P++P D H +L+ +
Sbjct: 325 QKRVRR--VQRHFEINRPSKPFDVLHMVLHSV 354
>gi|356565157|ref|XP_003550811.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
[Glycine max]
Length = 411
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 83/284 (29%), Positives = 128/284 (45%), Gaps = 25/284 (8%)
Query: 92 WFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGAC 151
+ Y PI D P L R++ I +I++ +PYWNR+EGADHF + HD+G
Sbjct: 137 YVYKPIRKHSDYEPIRLQ------RLVEDYIGVIANKYPYWNRSEGADHFLLSCHDWGPK 190
Query: 152 FHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTP 211
Y K + + +L A + F V + E + PP Q P D
Sbjct: 191 VSYGNPKLF-KNFIRVLCNANTSEGFLPNKDVSIPEVYLPKGKLGPPNLGQR---PND-- 244
Query: 212 RSIFVYFRGLFY-DVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAI 270
RSI +F G + D+ R W+ N+ Y + M ++
Sbjct: 245 RSILAFFAGREHGDI----------RKILLNHWKGKDNDIQVHEYLPKGKNYTQLMGQSK 294
Query: 271 FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTI 330
FCLCP G+ SPR+VEA+ GC+PV+I+ PF D + W + + + E + ++ TI
Sbjct: 295 FCLCPSGYEVASPRVVEAIHAGCVPVLISSSYSPPFTDVLNWSQFSVEIPVEKISEIKTI 354
Query: 331 LTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
L SI LR + ++R + +PA+P D H IL+ +
Sbjct: 355 LQSISRNRYLRLHMNVLR--VRRHFMLNRPAKPFDLMHMILHSI 396
>gi|224084457|ref|XP_002307304.1| predicted protein [Populus trichocarpa]
gi|222856753|gb|EEE94300.1| predicted protein [Populus trichocarpa]
Length = 406
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 94/333 (28%), Positives = 158/333 (47%), Gaps = 51/333 (15%)
Query: 58 KDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRM 117
+ PR LT +A+E + + + S +T +P++A F+ PI +C G + +++ +
Sbjct: 100 QTPRKLTGKYASEGYFFQNIRESRFQTQDPDQAHLFFIPI--SCH-KMRGKGISYENMTI 156
Query: 118 M-RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQT 176
+ + ++ + S +PYWNRT GADHFFV HD G +A E G+ L++ A V
Sbjct: 157 IVDNYVESLKSKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRV-- 206
Query: 177 FGQRNHVCLNEGSITIPPY---APPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGY 233
VC + P+ A PQ +Q +P DV G+
Sbjct: 207 ------VCSPSYDVGFIPHKDIALPQVLQPFALPAGGN------------DVEKRTTLGF 248
Query: 234 YA--RGAR-----AAVWEN-----FKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPW 281
+A R +R A VWEN NN + + +T H Y + + +C+CP G
Sbjct: 249 WAGHRNSRIRVILARVWENDTELDISNNRI-NRATGH-LVYQKRFYGSKYCICPGGSQVN 306
Query: 282 SPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILR 341
S R+ +++ +GCIPVI+++ LPF D + W + + + E+DV +L IL IP ++
Sbjct: 307 SARIADSIHYGCIPVILSNYYDLPFNDILDWHKFSVILKEQDVYRLKQILKDIPDNKLVS 366
Query: 342 KQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+ L +++ + P DAFH ++ L
Sbjct: 367 LHKNLV--KVQKHFQWNSPPVKYDAFHMVMYDL 397
>gi|356575096|ref|XP_003555678.1| PREDICTED: probable glycosyltransferase At3g07620-like [Glycine
max]
Length = 493
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 84/309 (27%), Positives = 140/309 (45%), Gaps = 24/309 (7%)
Query: 71 IFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSA---IQLISS 127
IF++ ++S RT NP+EA ++ P L P+ + R+ + +IS
Sbjct: 194 IFINSLEINSQFRTQNPDEAHVYFLPFSVVMILEHLFHPVIRDKAVLERTIGDYVHIISH 253
Query: 128 NWPYWNRTEGADHFFVVPHDFG--ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCL 185
+ YWNR+ GADHF + HD+G A ++ +E I + +L A + + F +
Sbjct: 254 KYKYWNRSYGADHFMLSCHDWGPRATWYVKELYFIA---IRVLCNANISEHFNPKKDASF 310
Query: 186 NEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWEN 245
E ++ + PP R+I +F G + R WE
Sbjct: 311 PEINLV----NGETRGLIGGYPP-CNRTILAFFAGQMHG---------RIRPVLFQHWEG 356
Query: 246 FKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
+ L Y+E M+++ +C+CP G+ SPR+VEA+ C+PVII+ VLP
Sbjct: 357 KDKDVLVYEKLPDGVPYHETMKKSKYCICPSGFEVASPRIVEAIYAQCVPVIISQQYVLP 416
Query: 306 FADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
F+D + W+ + + DVPKL IL I + +R Q ++R + P + D
Sbjct: 417 FSDVLNWDSFSVQILVSDVPKLKEILLGISEDKYMRLQE--GVKQVQRHFVVNNPPKRYD 474
Query: 366 AFHQILNGL 374
FH I++ +
Sbjct: 475 VFHMIIHSI 483
>gi|297811281|ref|XP_002873524.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297319361|gb|EFH49783.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 546
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 103/350 (29%), Positives = 162/350 (46%), Gaps = 47/350 (13%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEI-FM------HRFLLSSPVRTLNPEEADW 92
LKVY+Y S+ ++ + + + ++A+E FM HRFL T +P A
Sbjct: 220 LKVYIY---SEGDRPIFHQPEAIMEGIYASEGWFMKLMESNHRFL------TKDPNIAHL 270
Query: 93 FYTPIYPTCDLTPTGLPLPFKSPR----MMRSAIQLISSNWPYWNRTEGADHFFVVPHDF 148
FY P + T L S R +++ + LI+SN+P+WNRT G+DHFF HD+
Sbjct: 271 FYLP-FSTRILQQKLYVHDSHSRRNLVKYLKNYLDLIASNYPFWNRTRGSDHFFTACHDW 329
Query: 149 GACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPP 208
E + + L A + F V L E I+ P + ++
Sbjct: 330 APA----ETRGPYINCIRSLCNADVGVDFVVGKDVSLPETKIS-SSQNPNGNIGGNR--- 381
Query: 209 DTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDIST----DHPTTYYE 264
+ R+I +F G + GY R + + + P D+ DH +Y
Sbjct: 382 PSKRTILAFFAGNLH--------GY----VRPILLNQWSSRPEPDMKIFNRIDH-KSYIR 428
Query: 265 DMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDV 324
M+R+ FC+C G+ SPR+VE+V++GC+PVII+D+ V PF + + WE +FV E+++
Sbjct: 429 YMKRSRFCVCAKGYEVNSPRVVESVLYGCVPVIISDNFVPPFLEILNWESFAVFVPEKEI 488
Query: 325 PKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
P L IL SIP + Q+ + K M D FH IL+ +
Sbjct: 489 PNLRKILISIPVRRYVEMQKRVMK-VQKHFMWHDGEPVRYDVFHMILHSV 537
>gi|242053777|ref|XP_002456034.1| hypothetical protein SORBIDRAFT_03g029210 [Sorghum bicolor]
gi|241928009|gb|EES01154.1| hypothetical protein SORBIDRAFT_03g029210 [Sorghum bicolor]
Length = 517
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 92/340 (27%), Positives = 153/340 (45%), Gaps = 21/340 (6%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTL-NPEEADWFYTPIY 98
LKVY+Y+ ++ + P L+ ++A+E + + L S + + +A FY P Y
Sbjct: 182 LKVYIYQ----DGRRPIFHTP-PLSGIYASEGWFMKLLKESRRHVVADAGKAHLFYLP-Y 235
Query: 99 PTCDLTPTGLPLPFKSPR----MMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHY 154
+ L T + R +R+ ++ +++ +P+WNRT GADHF V HD+G
Sbjct: 236 SSQQLRLTLYQADSHNLRPLAAYLRNFVRGLANKYPFWNRTRGADHFLVACHDWGPYTTA 295
Query: 155 QEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSI 214
+ I L + F V L E +I P + ++ P + RSI
Sbjct: 296 AHRDLRKNAIKALCNADSSEGIFTPGKDVSLPETTIR----NPRRPLRYVGGLPVSRRSI 351
Query: 215 FVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLC 274
+F G + ++ G + + L Y + M+ + FCLC
Sbjct: 352 LAFFAGNVHGRVRPVLLRHWGDGQD----DEMRVYSLLPNRVSRRMNYIQHMKNSRFCLC 407
Query: 275 PLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSI 334
P+G+ SPR+VEA + C+PVIIAD+ VLP ++ + W + VAE+D+P L IL I
Sbjct: 408 PMGYEVNSPRIVEAFYYECVPVIIADNFVLPLSEVLDWSAFSVVVAEKDIPDLKKILQGI 467
Query: 335 PPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
P + + ++R L+ D FH IL+ +
Sbjct: 468 SPRRYVAMHSCVKR--LQRHFLWHARPIKYDLFHMILHSI 505
>gi|449501885|ref|XP_004161484.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
LOC101226446 [Cucumis sativus]
Length = 859
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 93/350 (26%), Positives = 151/350 (43%), Gaps = 63/350 (18%)
Query: 42 VYVYELPSKYNKKLLLKDP---RCLTHMF-------------AAEIFMHRFLLSSPVRTL 85
+Y+Y+LP +N +LL C+ M+ AE+ + +L+SP RTL
Sbjct: 350 IYIYDLPPGFNSQLLQGRHWKFECVNRMYNERNATMWTDDLYGAEMAFYESILASPHRTL 409
Query: 86 NPEEADWFYTPIYPTCDLTPTGLP--LPFKSPRMMRSAIQL---------ISSNWPYWNR 134
N EEAD+F+ P+ +C +T L + +RS + L I +PYWNR
Sbjct: 410 NGEEADFFFVPVLDSCIITRADDAPHLSLRDYMGLRSFLTLDFYKKAHDHIVEQYPYWNR 469
Query: 135 TEGADHFFVVPHDFGACFH----YQEEKAIERGILPLLQRATLVQTFGQR---------- 180
+ G DH + D GAC+ + + G + +G
Sbjct: 470 SSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYWGDNWDNIPSSKRG 529
Query: 181 NHVCLN-EGSITIPPYAPPQKMQAHQ---IPPDTPRSIFVYFRGLF---YDVNNDPEGGY 233
NH C + E + +P + P + + P R F +F G Y+ PE
Sbjct: 530 NHPCFDPEKDLVVPAWKRPDGSRLSKKLWARPREERKTFFFFNGNLGPAYE-RGRPE-ST 587
Query: 234 YARGARAAVWENFKNNPLFD--ISTDHPT----------TYYEDMQRAIFCLCPLGWAPW 281
Y+ G R V E F ++P + + H Y+ED+ ++FC G W
Sbjct: 588 YSMGIRQKVAEEFGSSPNKEGKLGKQHAADVIVTPLRSENYHEDLASSVFCGVMPG-DGW 646
Query: 282 SPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTIL 331
S R+ ++++ GCIPVII D I LP+ + + ++ + + E+D+P L IL
Sbjct: 647 SGRMEDSILQGCIPVIIQDGIFLPYENVLNYDSFAVRIGEDDIPNLINIL 696
>gi|90265152|emb|CAH67778.1| H0201G08.5 [Oryza sativa Indica Group]
gi|125546968|gb|EAY92790.1| hypothetical protein OsI_14593 [Oryza sativa Indica Group]
Length = 441
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 98/345 (28%), Positives = 160/345 (46%), Gaps = 26/345 (7%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP------VRTLNPEEADW 92
+ KVYVYE + +L + C F+ + L SP VRT +P A
Sbjct: 103 RFKVYVYE----EGEPPILHEGPCKNIYTIEGSFIEQLELMSPSDAGGGVRTWDPTRAHA 158
Query: 93 FYTPIYPTCDLTPTGLPLPFKSP---RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFG 149
F+ P + + P P ++ ++++++ P+WNR+ GADHF + HD+G
Sbjct: 159 FFLPFSVSQMVKFVYRPPSQDRPPLRAIVADYVRVVAARHPFWNRSAGADHFMLSCHDWG 218
Query: 150 ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPD 209
+ + + + + L A + F V + E I + P+++ A P
Sbjct: 219 P-YASRGQPELYTNAIRALCNANTSEGFRPGKDVSVPE--INLYDGDMPRELLA-PAPGL 274
Query: 210 TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRA 269
R + +F G + D ++ +G AA + P+++ YY M+RA
Sbjct: 275 ESRPLLAFFAGGRHGHVRDLLLRHW-KGRDAATF------PVYEYDLPAAGDYYSFMRRA 327
Query: 270 IFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDT 329
FCLCP G SPR+VEA+ C+PV+IAD LPFAD + WE + VA D+P+L
Sbjct: 328 RFCLCPSGHEVASPRVVEAIQAECVPVVIADGYALPFADVLRWEAFSVAVAVGDIPRLRE 387
Query: 330 ILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
L IP + R +R + +KR ++ QP + D F+ IL+ +
Sbjct: 388 RLERIPAAEVERLRRGVR--LVKRHLMLQQPPRRLDMFNMILHSV 430
>gi|255548864|ref|XP_002515488.1| catalytic, putative [Ricinus communis]
gi|223545432|gb|EEF46937.1| catalytic, putative [Ricinus communis]
Length = 474
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 85/298 (28%), Positives = 138/298 (46%), Gaps = 33/298 (11%)
Query: 87 PEEADWFYTPIYPTCDLTPTGLPLP----FKSPRMMRSAIQLISSNWPYWNRTEGADHFF 142
P+EA F+ P+ + P + +++ + +ISS +P+WNR++GADHF
Sbjct: 185 PDEAHAFFLPVSIVSIVKYVYRPYSDYSRIRLQNVVKDYVGVISSKYPFWNRSDGADHFL 244
Query: 143 VVPHDFG---ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQ 199
HD+ + H + K R +L A + F V L E + +P
Sbjct: 245 TSCHDWAPDVSAGHPELYKYFTR----VLCNANTSEGFVPERDVSLPEIRLRDRKLSP-- 298
Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPL-FDISTDH 258
+ H +PP R I +F G E G+ R ++E++K +
Sbjct: 299 --EPHSLPPKD-RRILAFFAG--------GEHGH----VRTKLFEHWKGKDRDVQVYEYL 343
Query: 259 PTT--YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG 316
P T Y E M + FCLCP GW SPR+ EA+ GC+PVII+D LPF+D + W +
Sbjct: 344 PKTLNYTELMSHSKFCLCPSGWEVASPRVPEAIYSGCVPVIISDYYYLPFSDVLDWSKFS 403
Query: 317 MFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+ + +P++ T+L IP L Q+ + ++R +PA+P D H +L+ +
Sbjct: 404 VHIPVARIPEIKTVLQKIPMRKYLTMQKRVIQ--VQRHFKLNRPAKPYDVLHMVLHSI 459
>gi|255563758|ref|XP_002522880.1| catalytic, putative [Ricinus communis]
gi|223537865|gb|EEF39480.1| catalytic, putative [Ricinus communis]
Length = 406
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 97/351 (27%), Positives = 158/351 (45%), Gaps = 52/351 (14%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
KVY+Y P + + PR LT +A+E + + + S RT +P++A F+ PI
Sbjct: 84 NFKVYIY--PDG-DPNTFYQTPRKLTGKYASEGYFFQNIRESKFRTNDPDQAHLFFIPI- 139
Query: 99 PTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
+C ++++ ++ ++ +PYWNRT GADHFFV HD G +
Sbjct: 140 -SCHKMRGKGTSYENMTIIVQNYVESLAVKYPYWNRTLGADHFFVTCHDVGV-------R 191
Query: 159 AIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY---APPQKMQAHQIPPDTPRSIF 215
A E G+ L++ A V VC + P+ A PQ +Q +P
Sbjct: 192 ATE-GVPFLVKNAIRV--------VCSPSYDVGFIPHKDVALPQVLQPFALPAGGN---- 238
Query: 216 VYFRGLFYDVNNDPEGGYYA-------RGARAAVWEN-----FKNNPLFDISTDHPTTYY 263
D+ N G++A R A VWEN NN + + +T H Y
Sbjct: 239 --------DLENRTTLGFWAGHRNSKIRVILARVWENDTELDISNNRI-NRATGH-LVYQ 288
Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
+ R FC+CP G S R+ +++ +GC+PVI++D LPF D + W+ + + E+D
Sbjct: 289 KRFYRTKFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDILDWKRFSVILKEKD 348
Query: 324 VPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
V +L +L I + + L +++ + P DAFH ++ L
Sbjct: 349 VYRLKQVLKDISDDEFVALHENLV--EVQKHFQWNSPPIKYDAFHMVMFDL 397
>gi|449437619|ref|XP_004136589.1| PREDICTED: uncharacterized protein LOC101206674 [Cucumis sativus]
Length = 791
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 93/350 (26%), Positives = 151/350 (43%), Gaps = 63/350 (18%)
Query: 42 VYVYELPSKYNKKLLLKDP---RCLTHMF-------------AAEIFMHRFLLSSPVRTL 85
+Y+Y+LP +N +LL C+ M+ AE+ + +L+SP RTL
Sbjct: 350 IYIYDLPPGFNSQLLQGRHWKFECVNRMYNERNATMWTDDLYGAEMAFYESILASPHRTL 409
Query: 86 NPEEADWFYTPIYPTCDLTPTGLP--LPFKSPRMMRSAIQL---------ISSNWPYWNR 134
N EEAD+F+ P+ +C +T L + +RS + L I +PYWNR
Sbjct: 410 NGEEADFFFVPVLDSCIITRADDAPHLSLRDYMGLRSFLTLDFYKKAHDHIVEQYPYWNR 469
Query: 135 TEGADHFFVVPHDFGACFH----YQEEKAIERGILPLLQRATLVQTFGQR---------- 180
+ G DH + D GAC+ + + G + +G
Sbjct: 470 SSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYWGDNWDNIPSSKRG 529
Query: 181 NHVCLN-EGSITIPPYAPPQKMQAHQ---IPPDTPRSIFVYFRGLF---YDVNNDPEGGY 233
NH C + E + +P + P + + P R F +F G Y+ PE
Sbjct: 530 NHPCFDPEKDLVVPAWKRPDGSRLSKKLWARPREERKTFFFFNGNLGPAYE-RGRPE-ST 587
Query: 234 YARGARAAVWENFKNNPLFD--ISTDHPT----------TYYEDMQRAIFCLCPLGWAPW 281
Y+ G R V E F ++P + + H Y+ED+ ++FC G W
Sbjct: 588 YSMGIRQKVAEEFGSSPNKEGKLGKQHAADVIVTPLRSENYHEDLASSVFCGVMPG-DGW 646
Query: 282 SPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTIL 331
S R+ ++++ GCIPVII D I LP+ + + ++ + + E+D+P L IL
Sbjct: 647 SGRMEDSILQGCIPVIIQDGIFLPYENVLNYDSFAVRIGEDDIPNLINIL 696
>gi|413956860|gb|AFW89509.1| hypothetical protein ZEAMMB73_873038 [Zea mays]
Length = 783
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 91/357 (25%), Positives = 156/357 (43%), Gaps = 71/357 (19%)
Query: 42 VYVYELPSKYNKKLLLKDP---RCLT-------------HMFAAEIFMHRFLLSSPVRTL 85
+YVY+LP++++ LL +C+ ++ A+I +H +L+SP RTL
Sbjct: 342 IYVYDLPAEFDSHLLEGRHYRFQCVNRIYDDMNRTIWTQQLYGAQIALHESILASPHRTL 401
Query: 86 NPEEADWFYTPIYPTCDLTPTG-----------LPLPFKSPRMMRSAIQLISSNWPYWNR 134
N +EAD+FY P+ +C +T + + + R A I+ +PYWNR
Sbjct: 402 NGDEADYFYVPVLDSCLITRSDDAPHLLLPRDLRRRSYHALEYYRMAHGHIAQQYPYWNR 461
Query: 135 TEGADHFFVVPHDFGACFH----YQEEKAIERGILPLLQRATLVQTFGQR---------- 180
T G DH + D GAC+ ++ + G + + +
Sbjct: 462 TSGRDHIWFFSWDEGACYAPKEIWKSMMLVHWGNTNTKHKNSTTAYWADNWDDIPLDKRG 521
Query: 181 NHVCLN-EGSITIPPYAPPQ------KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGY 233
NH C + + +P + P K+ A P R+ YF G ++ + EGG
Sbjct: 522 NHPCFDPRKDLVLPAWKEPNPGAIWLKLWAR---PRNNRTTLFYFNG---NLGSAYEGGR 575
Query: 234 ----YARGARAAVWENFKNNP----------LFDISTDHPTT--YYEDMQRAIFCLCPLG 277
Y+ G R + F + P D++ + T YYE++ ++FC G
Sbjct: 576 PEDTYSMGIRQKLAAEFGSTPNKQGRLGRQHAADVTVTYLRTEKYYEELASSVFCGVLPG 635
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSI 334
WS R+ ++++ GCIPVII D I LP+ + + + + + E+D+P L + L I
Sbjct: 636 -DGWSGRMEDSMLQGCIPVIIQDGIFLPYENVLNYNSFAVRIQEDDIPGLISTLRGI 691
>gi|356546126|ref|XP_003541482.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
[Glycine max]
Length = 490
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 84/302 (27%), Positives = 133/302 (44%), Gaps = 24/302 (7%)
Query: 80 SPVRTLNPEEADWFYTPIYPTCDLTPTGLPL-------PFKSPRMMRSAIQLISSNWPYW 132
S R +PEEA F+ PI + P+ P + ++ I +I +PYW
Sbjct: 194 SQFRARHPEEAHVFFLPISIANVVHYVYKPILKQSDYEPVRLQHLVEDYIGVIQDKYPYW 253
Query: 133 NRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITI 192
NR+ GADHF + HD+G Y + + I L A + F V + E + +
Sbjct: 254 NRSIGADHFLLSCHDWGPKVSYGNPELFQTFIRALCN-ANTSEGFHPNRDVSIPEVYLPV 312
Query: 193 PPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLF 252
P Q P++ R+ +F G + R W++ N L
Sbjct: 313 GKLGPASLGQH----PNS-RTTLAFFAGGVHG---------EIRKILLKHWKDKDNEVLV 358
Query: 253 DISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 312
Y + M ++ FCLCP G SPR+VEA+ GC+PVII D+ LPF+D + W
Sbjct: 359 HEYLPKGQDYTKLMGQSKFCLCPSGHEVASPRVVEAIHAGCVPVIICDNYSLPFSDVLNW 418
Query: 313 EEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILN 372
+ + + E +P++ +IL SI LR + ++R + +P +P D H IL+
Sbjct: 419 SQFSVEIPVEKIPEIKSILQSISRNKYLRLH--MNVLRVRRHFMINRPTKPFDMMHMILH 476
Query: 373 GL 374
+
Sbjct: 477 SI 478
>gi|356518344|ref|XP_003527839.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 637
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 98/392 (25%), Positives = 169/392 (43%), Gaps = 44/392 (11%)
Query: 2 WVTAI--ALLLAFVSADNTPKIERISGSAGDVLEDDPVGK---------LKVYVYELPSK 50
W +A+ LL A + +N P + + + + K LKVYVY +
Sbjct: 260 WSSAVDQELLQARLEIENAPIVNNVENLYAPLFRNISRFKRSYELMEKTLKVYVY----R 315
Query: 51 YNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPL 110
K ++ P L + FM S T +P++A FY P +P
Sbjct: 316 EGDKPIMHSPYLLGIYASEGWFMRLMEASKQFVTKDPKKAHLFYLPFSSRMLEETLYVPN 375
Query: 111 PFKSPRM---MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPL 167
S + +++ + +I+ +WNRT GADHF V HD+ E + L
Sbjct: 376 SHSSRNLIQYLKNYVDMIAGKHRFWNRTGGADHFLVACHDWAPT----ETRQHMARCLRA 431
Query: 168 LQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNN 227
L A + + F + L E + P + + +++ + R +F G +
Sbjct: 432 LCNADVKEGFVLGKDISLPETYVR-NAQKPTRNIGGNRV---SKRKTLAFFAGGMHG--- 484
Query: 228 DPEGGYYARGARAAVWENFKNNPLFDI-----STDHPTTYYEDMQRAIFCLCPLGWAPWS 282
Y R WEN +P I + Y + M+ + +C+C G+ S
Sbjct: 485 ------YVRPILLQHWEN--KDPAMKIFGILPKSKGNRNYIQYMKSSKYCICAKGYEVNS 536
Query: 283 PRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRK 342
PR+VEA+++ C+PVI++D+ V PF + + WE +FV E+D+P L IL SIP + L+
Sbjct: 537 PRVVEAILYECVPVILSDNFVPPFFEMLNWESFAVFVLEKDIPNLKNILLSIPQKRYLQM 596
Query: 343 QRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
Q ++ +++ L+ + D FH +L+ +
Sbjct: 597 QMMVR--KVQQHFLWHRSPVKYDIFHMVLHSI 626
>gi|15239502|ref|NP_197954.1| Exostosin family protein [Arabidopsis thaliana]
gi|110738111|dbj|BAF00988.1| hypothetical protein [Arabidopsis thaliana]
gi|332006108|gb|AED93491.1| Exostosin family protein [Arabidopsis thaliana]
Length = 654
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 103/348 (29%), Positives = 160/348 (45%), Gaps = 40/348 (11%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVR--TLNPEEADWFYTPI 97
LKVY Y K K ++ P L ++A+E + + S+ + T +P +A FY P
Sbjct: 324 LKVYAY----KEGNKPIMHSP-ILRGIYASEGWFMNIIESNNNKFVTKDPAKAHLFYLP- 377
Query: 98 YPTCDLTPTGLPLPFKSPR----MMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFH 153
+ + L T S R ++ I IS+ +P+WNRT GADHF HD+
Sbjct: 378 FSSRMLEVTLYVQDSHSHRNLIKYLKDYIDFISAKYPFWNRTSGADHFLAACHDWAPS-- 435
Query: 154 YQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIP--PYAPPQKMQAHQIPPDTP 211
+ K + + I L + + + F L E + P P + A+Q
Sbjct: 436 -ETRKHMAKSIRALCN-SDVKEGFVFGKDTSLPETFVRDPKKPLSNMGGKSANQ------ 487
Query: 212 RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT-----YYEDM 266
R I +F G P+ GY R + W N K+ P I P T Y + M
Sbjct: 488 RPILAFFAG-------KPDHGYL-RPILLSYWGNNKD-PDLKIFGKLPRTKGNKNYLQFM 538
Query: 267 QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK 326
+ + +C+C G+ SPR+VEA+ + C+PVII+D+ V PF + + WE +F+ E+D+P
Sbjct: 539 KTSKYCICAKGFEVNSPRVVEAIFYDCVPVIISDNFVPPFFEVLNWESFAIFIPEKDIPN 598
Query: 327 LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
L IL SI PE R ++ K + +P + D FH IL+ +
Sbjct: 599 LKKILMSI-PESRYRSMQMRVKKVQKHFLWHAKPEK-YDMFHMILHSI 644
>gi|357462311|ref|XP_003601437.1| hypothetical protein MTR_3g080690 [Medicago truncatula]
gi|355490485|gb|AES71688.1| hypothetical protein MTR_3g080690 [Medicago truncatula]
Length = 450
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 81/309 (26%), Positives = 139/309 (44%), Gaps = 24/309 (7%)
Query: 71 IFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSA---IQLISS 127
IF++ ++ RT NP EA ++ P L P+ + R+ + +IS
Sbjct: 151 IFINLLENNTLFRTQNPNEAHVYFLPFSVVMILEHLFHPVIRDKAVLGRTIGDYVHIISH 210
Query: 128 NWPYWNRTEGADHFFVVPHDFG--ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCL 185
+ YWNR+ GADHF + HD+G A ++ +E I + +L A + + F +
Sbjct: 211 KYAYWNRSYGADHFMLSCHDWGPRATWYVKELYFIA---IRVLCNANISEHFNPKKDASF 267
Query: 186 NEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWEN 245
E I + + + P R+I +F G + R W+N
Sbjct: 268 PE--INLVSGETTGLLGGY---PTWNRTILAFFAG---------QMNGRIRPVLFQHWKN 313
Query: 246 FKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
+ L +Y+E M+ + +C+CP GW SPR+VEA+ C+P++I+ VLP
Sbjct: 314 KDKDVLVYEKLPEKISYHETMKMSKYCICPSGWEVASPRIVEAIYAECVPILISQQYVLP 373
Query: 306 FADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
F+D + W+ + + ++PKL IL I E +R Q ++R + P + D
Sbjct: 374 FSDVLNWDSFSVQIEVSEIPKLKEILLGISEEKYMRLQE--GVKQVQRHFVVNNPPKKYD 431
Query: 366 AFHQILNGL 374
FH I++ +
Sbjct: 432 VFHMIIHSI 440
>gi|334186614|ref|NP_001190743.1| Exostosin family protein [Arabidopsis thaliana]
gi|332658396|gb|AEE83796.1| Exostosin family protein [Arabidopsis thaliana]
Length = 589
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 93/314 (29%), Positives = 147/314 (46%), Gaps = 29/314 (9%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP-VRTLNPEEADWFYTPIY 98
LKVY+Y P K + +P L ++A+E + + + S+ T NPE A FY P Y
Sbjct: 200 LKVYIY--PD--GDKPIFHEPH-LNGIYASEGWFMKLMESNKQFVTKNPERAHLFYMP-Y 253
Query: 99 PTCDLTPTGL--------PLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGA 150
L + PL +R + ++S +P+WNRT G+DHF V HD+G
Sbjct: 254 SVKQLQKSIFVPGSHNIKPLSI----FLRDYVNMLSIKYPFWNRTHGSDHFLVACHDWGP 309
Query: 151 CFHYQEEKAIERGILPLLQRATLVQ-TFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPD 209
+ E ++R + L A L F V L E SI +++
Sbjct: 310 -YTVNEHPELKRNAIKALCNADLSDGIFVPGKDVSLPETSIRNAGRPLRNIGNGNRV--- 365
Query: 210 TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRA 269
+ R I +F G + P+ + R E+ K + TY + M+ +
Sbjct: 366 SQRPILAFFAGNLHG-RVRPKLLKHWRNKD----EDMKIYGPLPHNVARKMTYVQHMKSS 420
Query: 270 IFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDT 329
+CLCP+G+ SPR+VEA+ + C+PV+IAD+ +LPF+D + W + V E+++P+L
Sbjct: 421 KYCLCPMGYEVNSPRIVEAIYYECVPVVIADNFMLPFSDVLDWSAFSVVVPEKEIPRLKE 480
Query: 330 ILTSIPPEVILRKQ 343
IL IP L+ Q
Sbjct: 481 ILLEIPMRRYLKMQ 494
>gi|357478275|ref|XP_003609423.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
gi|355510478|gb|AES91620.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
Length = 398
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 89/320 (27%), Positives = 142/320 (44%), Gaps = 37/320 (11%)
Query: 58 KDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRM 117
K P TH A +F F +S +R + Y P D P L L +
Sbjct: 96 KSPFKATHPELAHVFFLPFSVSKVIR--------YVYKPRKSRSDYNPHRLQL------L 141
Query: 118 MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTF 177
+ I+++++ +PYWN ++GADHF + HD+G Y K + I L A + F
Sbjct: 142 VEDYIKIVANKYPYWNISQGADHFLLSCHDWGPRVSYANPKLFKHFIRALCN-ANTSEGF 200
Query: 178 GQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGY-YAR 236
V + + ++ + PP Q P+ R+I +F G G + R
Sbjct: 201 WPNRDVSIPQLNLPVGKLGPPNTDQH----PNN-RTILTFFAG----------GAHGKIR 245
Query: 237 GARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPV 296
W++ Y + M + FCLCP G SPR+VEA+ GC+PV
Sbjct: 246 KKLLKSWKDKDEEVQVHEYLPKGQDYTKLMGLSKFCLCPSGHEVASPRVVEAIYAGCVPV 305
Query: 297 IIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANP--SMKRA 354
II D+ LPF+D + W + M +A + +P++ TIL +I K R+L + +++
Sbjct: 306 IICDNYSLPFSDVLNWSQFSMEIAVDRIPEIKTILQNITET----KYRVLYSNVRRVRKH 361
Query: 355 MLFPQPAQPGDAFHQILNGL 374
+PA+P D H IL+ +
Sbjct: 362 FEMNRPAKPFDLIHMILHSV 381
>gi|168033546|ref|XP_001769276.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162679541|gb|EDQ65988.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 408
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 82/270 (30%), Positives = 128/270 (47%), Gaps = 39/270 (14%)
Query: 122 IQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRN 181
+ ++S +PYWNRT GADHFFV HD+ A + + ++ A L F
Sbjct: 147 VDTLASKYPYWNRTHGADHFFVSCHDW-APLSTMLHGELHTNSMKVVCNADLTVNFDIEK 205
Query: 182 HVCL-------NEGSITIPPYAPPQK----MQAHQIPPDTPRSIFVYFRGLFYDVNNDPE 230
V + N+ + + P ++ A Q+ + Y++G DP
Sbjct: 206 DVSIPQTLKGGNQSDLDVGSLGPEERDFLAFYAGQMHGTVRPVLLDYWKG------KDPT 259
Query: 231 GGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVV 290
V+E ++ +IS Y + M+R+ +CLCP G+ SPR+VEA++
Sbjct: 260 ---------MKVYEVLPSDIAVNIS------YAQHMKRSRYCLCPKGFEVNSPRIVEAIL 304
Query: 291 FGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPS 350
GC+PVIIAD+ VLP+ D + W + + V EED+P L IL+SI QR L
Sbjct: 305 SGCVPVIIADNFVLPYNDVLDWTKFSVTVPEEDIPDLKKILSSISNVTYRSMQRRLR--Y 362
Query: 351 MKRAMLF---PQPAQPGDAFHQILNGLARK 377
++R L+ P+ Q D+FH L + R+
Sbjct: 363 IRRHFLWLEDPEDTQ-YDSFHMTLYSIWRQ 391
>gi|159478058|ref|XP_001697121.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158274595|gb|EDP00376.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 802
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 101/362 (27%), Positives = 154/362 (42%), Gaps = 73/362 (20%)
Query: 42 VYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTC 101
+Y+Y+L K L L+ + A E ++H LL S RT +P+EAD+FY P+Y TC
Sbjct: 381 IYIYDLTDYTTKFLQLR-------LGAVEAYLHETLLQSEHRTFDPDEADFFYVPVYVTC 433
Query: 102 DLTPT-GLP------LPFKSPRMMRSAIQL------ISSNWPYWNRTEGADHFFVVPHDF 148
+ P G P R M + L ISS +P+WNR G DH +++ D
Sbjct: 434 YMWPILGWADGPWWYAPLAHTRTMHVSNMLSEVHAHISSTFPWWNRRGGRDHIWLMAADE 493
Query: 149 GACF--------------------HYQEEKAIERGILPLLQRATLVQTFG------QRNH 182
GAC+ +Q A ++ + G R H
Sbjct: 494 GACYMPTAIYNTSIILTHWGRMDLEHQSNTAYQQDNYNMAMPGEFKAWPGMDWQSRMRGH 553
Query: 183 VCLN-EGSITIPPYAPPQKMQAHQIPPDTP--RSIFVYFRGLFYDVNND--PEGGYYARG 237
C + + IP + + + P R + YFRG D+ P+ Y+RG
Sbjct: 554 PCYDPRKDLVIPAFKSVDHFRDSPLLGGAPLVRDLLCYFRG---DIGQARFPQ---YSRG 607
Query: 238 ARAAVWENFKNNPLFDISTDHPTT----------YYEDMQRAIFCLCPLGWAPWSPRLVE 287
R ++ + N D + H Y E + R+ FCL G WSPR +
Sbjct: 608 LRQKLFHLWHKN---DWAAKHKIYIGNGEMVRGPYSEHLLRSRFCLVLPG-DGWSPRAED 663
Query: 288 AVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED--VPKLDTILTSIPPEVILRKQRL 345
AV+ GCIPV+I D++ F + WE + + E+D + L +L ++PPE + + QR
Sbjct: 664 AVLHGCIPVVIMDNVHAVFESILDWESFSIRIREDDAALEALPQLLEAVPPERVAKMQRN 723
Query: 346 LA 347
LA
Sbjct: 724 LA 725
>gi|224120358|ref|XP_002331028.1| predicted protein [Populus trichocarpa]
gi|222872958|gb|EEF10089.1| predicted protein [Populus trichocarpa]
Length = 334
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 80/303 (26%), Positives = 142/303 (46%), Gaps = 27/303 (8%)
Query: 79 SSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPF----KSPRMMRSAIQLISSNWPYWNR 134
SP N +EA F+ PI + LP+ + R+ + + ++++ +PYWNR
Sbjct: 41 KSPFLARNHDEAHAFFLPISVAYIVEFVYLPITTYHRERLVRIFKDYVTVVANKYPYWNR 100
Query: 135 TEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPP 194
+ G DHF V HD+ +++ + + ++ ++ A + F R L E +
Sbjct: 101 SRGGDHFMVSCHDWAPQVS-RDDPELYKNLIRVMCNANTSEGFRPRRDATLPELN----- 154
Query: 195 YAPPQKMQ--AHQIPPDTPRSIFVYFRGLFY-DVNNDPEGGYYARGARAAVWENFKNNPL 251
PP K+ + P R IF +F G + D+ + + V E +
Sbjct: 155 -CPPLKLTPACRGLAPHE-RKIFAFFAGGAHGDIRKILLRHWKEKDDEIQVHEYLPKD-- 210
Query: 252 FDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
Y E M ++ FCLCP G+ SPR+ E++ GC+PVII+D LPF+D +
Sbjct: 211 --------QDYMELMGQSKFCLCPSGFEVASPRVAESIYSGCVPVIISDHYNLPFSDVLD 262
Query: 312 WEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQIL 371
W + + + E +P++ TIL I + L+ Q+ + ++R + +PA+P D H +L
Sbjct: 263 WSQFSVQIPVEKIPEIKTILRGISYDEYLKMQKGVMK--VQRHFVLNRPAKPYDVLHMVL 320
Query: 372 NGL 374
+ +
Sbjct: 321 HSV 323
>gi|356565153|ref|XP_003550809.1| PREDICTED: probable glycosyltransferase At5g20260-like [Glycine
max]
Length = 398
Score = 114 bits (285), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 78/304 (25%), Positives = 138/304 (45%), Gaps = 28/304 (9%)
Query: 80 SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKS-------PRMMRSAIQLISSNWPYW 132
SP + NP+EA F+ P+ + P ++ R++ I +++ +PYW
Sbjct: 101 SPFKARNPDEAHAFFLPLSVVNVVHYVYKPYMSQNDYSRDRLQRLVEDYIGVVADKYPYW 160
Query: 133 NRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITI 192
NR+ GADHF + HD+ + + + +L A + F + V + E + +
Sbjct: 161 NRSNGADHFLLSCHDWAPEISHANPDLF-KNFIRVLCNANNSEGFQPKRDVSIPEVYLPV 219
Query: 193 PPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYA--RGARAAVWENFKNNP 250
PP Q R+I +F GG + R W++ N+
Sbjct: 220 GKLGPPNLGQH-----PMNRTILAFF-----------SGGAHGDIRKLLLKHWKDKDNHV 263
Query: 251 LFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAI 310
Y E M + FCLCP G+ SPR+VEA+ GC+PVII+++ LPF+D +
Sbjct: 264 QVHEYLPKGQNYTELMGLSKFCLCPSGYEVASPRVVEAINAGCVPVIISENYSLPFSDVL 323
Query: 311 PWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQI 370
W + + ++ E++ + TIL ++ + + R + ++R + +PA+P D H I
Sbjct: 324 NWSQFSIQISVENISDIKTILQNVTQKKYKKLHRNVR--RVQRHFVMNRPAKPFDLMHMI 381
Query: 371 LNGL 374
L+ +
Sbjct: 382 LHSI 385
>gi|356537024|ref|XP_003537031.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 472
Score = 114 bits (285), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 95/358 (26%), Positives = 154/358 (43%), Gaps = 62/358 (17%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
+ KV+VYE + + + C + F+H ++ RT +P++A F+ P
Sbjct: 142 QFKVFVYE----EGETPVFHNGPCKSIYSMEGNFIHAIEMNDHFRTKDPKKAHVFFLPF- 196
Query: 99 PTCDLTPTGLPLPFKSPRMMRS----------AIQLISSNWPYWNRTEGADHFFVVPHDF 148
+ + F R R I LI++ + YWNR+ GADHF + HD+
Sbjct: 197 ------SVVMMVRFVYQRDSRDFGPIRKTVIDYINLIAARYSYWNRSLGADHFMLACHDW 250
Query: 149 GACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYA----PPQKMQAH 204
G E ++ LP L + ++ +C S P P +Q
Sbjct: 251 GP------EASLS---LPYLHKNSI-------RVLCNANTSEGFKPAKDVSFPEINLQTG 294
Query: 205 QI------PPDTPRSIFVYFRGLFYDVNNDPEGGYYA--RGARAAVWENFKNNPLFDIST 256
I P + RSI +F GG + R WEN +
Sbjct: 295 SINGFIGGPSASKRSILAFF-----------AGGVHGPIRPILLEHWENKDEDIQVHKYL 343
Query: 257 DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG 316
+YY+ ++ + FCLCP G+ SPR+VEA+ GC+PV+I++ V PF+D + W+
Sbjct: 344 PKGVSYYDKLRNSKFCLCPSGYEVASPRVVEAIYTGCVPVLISEHYVPPFSDVLNWKSFS 403
Query: 317 MFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+ ++ +D+P L IL SI P +R QR + ++R P + D FH IL+ +
Sbjct: 404 VELSVKDIPNLKDILMSISPRQYIRMQRRVIQ--IQRHFEVHSPPKRFDVFHMILHSV 459
>gi|297812207|ref|XP_002873987.1| hypothetical protein ARALYDRAFT_326421 [Arabidopsis lyrata subsp.
lyrata]
gi|297319824|gb|EFH50246.1| hypothetical protein ARALYDRAFT_326421 [Arabidopsis lyrata subsp.
lyrata]
Length = 654
Score = 114 bits (285), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 84/300 (28%), Positives = 134/300 (44%), Gaps = 22/300 (7%)
Query: 80 SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSA----IQLISSNWPYWNRT 135
SP PEEA F P+ + PL S + + +++ +PYWNR+
Sbjct: 361 SPFAASGPEEAHAFLLPVSIANVVHYLYRPLVTYSREQLHKVFLDYVNVVAHKYPYWNRS 420
Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
GADHFFV HD+ + + + ++ +L A + F + V + E +I
Sbjct: 421 LGADHFFVSCHDWAPDVSGSNPEMM-KNLIRVLCNANTSEGFMPQRDVSIPEINIPRGQL 479
Query: 196 APPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGY-YARGARAAVWENFKNNPLFDI 254
PPQ ++ R I +F G G + Y R W++
Sbjct: 480 GPPQLSRS----SGHDRPILAFFSG----------GSHGYIRKILLQHWKDKDEEVQVHE 525
Query: 255 STDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEE 314
+ Y++ M A FCLCP G+ SPR+V A+ GC+PVII+D LPF+D + W +
Sbjct: 526 YLTNNKDYFKLMATARFCLCPSGYEVASPRVVAAINLGCVPVIISDHYALPFSDVLDWTK 585
Query: 315 IGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+ V E +P++ TIL SI QR + ++R + +P+QP D +L+ +
Sbjct: 586 FTIHVPSEKIPEIKTILKSISWRRYKVLQRRVLQ--VQRHFVINRPSQPFDMLRMLLHSV 643
>gi|307111637|gb|EFN59871.1| hypothetical protein CHLNCDRAFT_133704 [Chlorella variabilis]
Length = 833
Score = 114 bits (285), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 99/359 (27%), Positives = 149/359 (41%), Gaps = 72/359 (20%)
Query: 42 VYVYELPSKYNKKLL----LKDPRCLTH---------------MFAAEIFMHRFLLSSPV 82
+YVYELP YN +L KD TH ++ E +H LL S
Sbjct: 371 IYVYELPPMYNALMLQYRVAKDD--CTHRVFNEQNESYTPFLWLYQPETGIHEMLLQSEH 428
Query: 83 RTLNPEEADWFYTPIYPTCDLTPT----------GLPLPFKS---PRMMRSAIQLISSNW 129
RTL+PEEAD+FY P+Y +C + P G P ++ M + S++
Sbjct: 429 RTLDPEEADYFYLPVYASCAIWPVLYSNDFPYFHGGPAAQRTHGATNMFMEVQSWVRSHF 488
Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGS 189
PYW+R G DH + HD G+C+ LP + R +V + R V G+
Sbjct: 489 PYWDRNGGRDHIVLTVHDEGSCW------------LPAVLRPAIVMSHWGRTDVNPPAGT 536
Query: 190 ----------ITIPPYAPPQKM-QAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGA 238
+ P + P + + + P P + G PE Y+RG
Sbjct: 537 GYDADTYSNEVRHPVWQPEGHLSKLGEFPCYDPSKVTYILGGRI-----QPENARYSRGT 591
Query: 239 RA---------AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAV 289
R W+ ++ + P Y E M R++FCL +G +S R +AV
Sbjct: 592 RQFLANISEAEGWWDKYRIHVGAGSPPGGPGDYSECMARSVFCLALMG-DGYSSRFDDAV 650
Query: 290 VFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLAN 348
+ GCIPVI+ D I L + + + V + D+ ++ IL ++P E I R Q LA
Sbjct: 651 LHGCIPVIVQDGIELTWHSLLDIPAYSLRVPQADMARIPQILQAVPQEDIARMQANLAK 709
>gi|218192182|gb|EEC74609.1| hypothetical protein OsI_10218 [Oryza sativa Indica Group]
Length = 737
Score = 114 bits (285), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 100/394 (25%), Positives = 168/394 (42%), Gaps = 68/394 (17%)
Query: 17 NTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLL------------LKDPRCLT 64
N P ++ IS S+ +V + +YVY+LP++++ LL + D + T
Sbjct: 271 NLPDLKNISSSSINVKVVAKKKRPLIYVYDLPAEFDSHLLEGRHYKFQCVNRIYDDKNRT 330
Query: 65 ----HMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMM-- 118
++ A+I ++ +L+SP RTLN +EAD+FY P +C +T + + PR +
Sbjct: 331 IWTQQLYGAQIALYESILASPHRTLNGDEADYFYVPALDSCLITRSDDAPHLQMPRDLRL 390
Query: 119 ---------RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFH----YQEEKAIERGIL 165
R I+ +PYWNRT G DH + D GAC+ + + G
Sbjct: 391 RSYHTLEYYRMTYDHIAQRYPYWNRTSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWGNT 450
Query: 166 PLLQRATLVQTFGQR----------NHVCLN-EGSITIPPYAPPQ------KMQAHQIPP 208
+ + + NH C + + +P + P K+ A
Sbjct: 451 NTKHKNSTTAYWADNWNYIPIDRRGNHPCFDPRKDLVLPAWKQPNPAAIWLKLWAR---T 507
Query: 209 DTPRSIFVYFRGLFYDVNNDPEG-GYYARGARAAVWENFKNNP----------LFDISTD 257
R+ YF G D Y+ G R + F + P +++
Sbjct: 508 RNNRTTLFYFNGNLGPAYKDGRHEDTYSMGIRQKLAAEFGSTPDKQGKLGRQHTANVTVT 567
Query: 258 HPTT--YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEI 315
+ T YYE++ +IFC G WS R+ ++++ GCIPVII D I+LP+ + + +
Sbjct: 568 YLRTEKYYEELASSIFCGVLPG-DGWSGRMEDSMLQGCIPVIIQDGILLPYENMLNYNSF 626
Query: 316 GMFVAEEDVPKLDTILTSIPP---EVILRKQRLL 346
+ + E+D+P L IL I E +LR R +
Sbjct: 627 AVRIQEDDIPNLIRILRGINETQVEFMLRNVRQI 660
>gi|224142972|ref|XP_002324801.1| predicted protein [Populus trichocarpa]
gi|222866235|gb|EEF03366.1| predicted protein [Populus trichocarpa]
Length = 398
Score = 114 bits (285), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 92/349 (26%), Positives = 157/349 (44%), Gaps = 43/349 (12%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAE-IFMHRFLLSSPVRTLNPEEADWFYTPIY 98
LKVY+Y+ + + +L + P L ++A+E FM + T +P+++ FY P
Sbjct: 68 LKVYIYK---EGERPILHQAP--LKGIYASEGWFMKLLETNKKFVTKDPKKSHLFYLPFS 122
Query: 99 PTCDLTPTGLPLPFKSPRM---MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQ 155
+P + +++ + +IS+ +P+WNRT GADHF V HD+ Q
Sbjct: 123 SRNLEVNLYVPNSHSHKNLIQYLKNYLDMISAKYPFWNRTRGADHFLVACHDWAPTETRQ 182
Query: 156 EEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIF 215
R + + V FG+ L E ++ P ++ P + RSI
Sbjct: 183 HMANCIRALCNSDAKGGFV--FGK--DAALPETTVR----TPQNLLRDLGGKPASKRSIL 234
Query: 216 VYFRGLFYDV----------NNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYED 265
+F G + N DP+ + + L + Y +
Sbjct: 235 AFFAGSMHGYLRPILLQHWGNKDPDVKVFGK--------------LPKVKGRGKMNYPQY 280
Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
M+ + +C+C G+ SPR+VEA+ + C+PVII+D+ V PF + + WE +FV E+D+P
Sbjct: 281 MKSSKYCICAKGFEVNSPRVVEAIFYECVPVIISDNFVPPFFEVLNWESFAVFVLEKDIP 340
Query: 326 KLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
L IL SI PE R+ ++ + + +P + D FH IL+ +
Sbjct: 341 NLKNILLSI-PENKYREMQMRVKKVQQHFLWHARPVK-YDIFHMILHSV 387
>gi|356496874|ref|XP_003517290.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 404
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 99/353 (28%), Positives = 158/353 (44%), Gaps = 56/353 (15%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
K KVY+Y P + + PR LT +A+E + + + S T NP+EA F+ PI
Sbjct: 82 KFKVYIY--PDG-DPNTFYQTPRKLTGKYASEGYFFQNIRESRFCTENPDEAHLFFIPI- 137
Query: 99 PTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
+C ++++ ++ + S +PYWNRT GADHFFV HD G +
Sbjct: 138 -SCHKMRGKGTSYENMTIIVQNYVESLISKYPYWNRTLGADHFFVTCHDVGV-------R 189
Query: 159 AIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY---APPQKMQAHQIPPDTPRSIF 215
A E G+ L++ + VC + P+ A PQ +Q +P
Sbjct: 190 ATE-GLEFLVKNSIRA--------VCSPSYDVGFIPHKDVALPQVLQPFALPAGGN---- 236
Query: 216 VYFRGLFYDVNNDPEGGYYA-------RGARAAVWENFKNNPLFDISTDHPT------TY 262
D+ N G++A R A VWEN + DIS + + Y
Sbjct: 237 --------DIENRTTLGFWAGHRNSKIRVILARVWEN---DTELDISNNRISRATGHLVY 285
Query: 263 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
+ R+ FC+CP G S R+ +++ +GCIPVI+++ LPF D + W + + + E
Sbjct: 286 QKRFYRSKFCICPGGSQVNSARIADSIHYGCIPVILSNYYDLPFNDILDWNKFAVVLKES 345
Query: 323 DVPKLDTILTSIP-PEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
DV +L IL +I E + L+ +++ + P DAFH ++ L
Sbjct: 346 DVYQLKQILKNISDAEFVTLHNNLV---KVQKHFQWNSPPIRFDAFHLVMYDL 395
>gi|356546040|ref|XP_003541440.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 491
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 95/354 (26%), Positives = 154/354 (43%), Gaps = 54/354 (15%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
+ KV+VYE + + + C + F+H ++ RT +P++A F+ P
Sbjct: 161 QFKVFVYE----EGETPVFHNGPCKSIYSMEGNFIHAIEMNDHFRTKDPKKAHVFFLPFS 216
Query: 99 PTC--------DLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFG- 149
D G P K + + + LI++ +PYWNR+ GADHF + HD+G
Sbjct: 217 VVMMVRFVYERDSRDFG---PIK--KTVIDYVNLIATRYPYWNRSLGADHFMLACHDWGP 271
Query: 150 -ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQI-- 206
A F + + + +L A + F V P +Q I
Sbjct: 272 EASFSL---PYLHKNSIRVLCNANTSEGFKPAKDVSF-----------PEINLQTGSING 317
Query: 207 ----PPDTPRSIFVYFRGLFYDVNNDPEGGYYA--RGARAAVWENFKNNPLFDISTDHPT 260
P + RSI +F GG + R WEN +
Sbjct: 318 FVGGPSASKRSILAFF-----------AGGVHGPIRPILLEHWENKDEDIQVHKYLPKGV 366
Query: 261 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
+YY ++++ FCLCP G+ SPR+VEA+ GC+PV+I++ V PF+D + W+ + ++
Sbjct: 367 SYYGMLRKSKFCLCPSGYEVASPRVVEAIYTGCVPVLISEHYVPPFSDVLNWKSFSVELS 426
Query: 321 EEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+D+P L IL SI P +R QR + ++R P + D FH IL+ +
Sbjct: 427 VKDIPILKDILMSISPRQHIRMQRRVGQ--IRRHFEVHSPPKRFDVFHMILHSV 478
>gi|302820946|ref|XP_002992138.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300140064|gb|EFJ06793.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 342
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 98/345 (28%), Positives = 149/345 (43%), Gaps = 61/345 (17%)
Query: 58 KDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKS--- 114
+ PR +T +A+E + + L S T NP +A F+ PI +C +P S
Sbjct: 20 QTPRKITGKYASEGYFFQNLRESKFVTKNPNKAHLFFIPI--SCHKMRGKVPYYLTSNWN 77
Query: 115 ----------PRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGI 164
+++ ++ + +PYWNRT GADHFFV HD GA +A + +
Sbjct: 78 KMQGISYEKMADIVQEYVEGLIVKYPYWNRTLGADHFFVTCHDVGA-------RATNK-V 129
Query: 165 LPLLQRATLVQTFGQRNHVC---LNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGL 221
L++ + V VC N I A PQ +Q +P RG
Sbjct: 130 ANLVKNSIRV--------VCSPSYNGDFIPHKDIAMPQVLQPFALP-----------RG- 169
Query: 222 FYDVNNDPEGGYYA-------RGARAAVWEN-----FKNNPLFDISTDHPTTYYEDMQRA 269
DV N G++A R A +WE NN + + + Y + R+
Sbjct: 170 GNDVRNRTILGFWAGHRNSKIRVVLAKLWEEDDVLAISNNRISRATGE--LVYQKQFYRS 227
Query: 270 IFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDT 329
FC+CP G S R+V+++ +GC+PVI++D LPF D + W+ + + E DV L
Sbjct: 228 KFCICPGGSQVNSARIVDSIHYGCVPVILSDHYDLPFNDVLDWKRFALLLRERDVGDLKL 287
Query: 330 ILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
L S+ E L R L R + P +P DAFH ++ L
Sbjct: 288 KLQSVSKEQYLSLHRGLVEVVQDR-FEWHTPPRPYDAFHMVVYEL 331
>gi|334187775|ref|NP_197468.2| Exostosin family protein [Arabidopsis thaliana]
gi|332005353|gb|AED92736.1| Exostosin family protein [Arabidopsis thaliana]
Length = 610
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 93/351 (26%), Positives = 158/351 (45%), Gaps = 48/351 (13%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTL-NPEEADWFYTP-- 96
LKVYVY K + + P L ++A+E + + + + T+ +P +A +Y P
Sbjct: 281 LKVYVY----KEGNRPIFHTP-ILKGLYASEGWFMKLMEGNKQYTVKDPRKAHLYYMPFS 335
Query: 97 -------IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFG 149
+Y T L + ++ + ISS +P++NRT+GADHF V HD+
Sbjct: 336 ARMLEYTLYVRNSHNRTNLR------QFLKEYTEHISSKYPFFNRTDGADHFLVACHDWA 389
Query: 150 ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPD 209
Y+ +E I L A + F + L E + A ++ P
Sbjct: 390 P---YETRHHMEHCIKALCN-ADVTAGFKIGRDISLPETYVR----AAKNPLRDLGGKPP 441
Query: 210 TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP------TTYY 263
+ R ++ G + Y R W++ +P I P Y
Sbjct: 442 SQRRTLAFYAGSMHG---------YLRQILLQHWKD--KDPDMKIFGRMPFGVASKMNYI 490
Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
E M+ + +C+CP G+ SPR+VE++ + C+PVII+D+ V PF + + W + VAE+D
Sbjct: 491 EQMKSSKYCICPKGYEVNSPRVVESIFYECVPVIISDNFVPPFFEVLDWSAFSVIVAEKD 550
Query: 324 VPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+P+L IL SIP + ++ Q +A +R L+ + D FH +L+ +
Sbjct: 551 IPRLKDILLSIPEDKYVKMQ--MAVRKAQRHFLWHAKPEKYDLFHMVLHSI 599
>gi|125585107|gb|EAZ25771.1| hypothetical protein OsJ_09611 [Oryza sativa Japonica Group]
Length = 779
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 100/394 (25%), Positives = 168/394 (42%), Gaps = 68/394 (17%)
Query: 17 NTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLL------------LKDPRCLT 64
N P ++ IS S+ +V + +YVY+LP++++ LL + D + T
Sbjct: 313 NLPDLKNISSSSINVKVVAKKKRPLIYVYDLPAEFDSHLLEGRHYKFQCVNRIYDDKNRT 372
Query: 65 ----HMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMM-- 118
++ A+I ++ +L+SP RTLN +EAD+FY P +C +T + + PR +
Sbjct: 373 IWTQQLYGAQIALYESILASPHRTLNGDEADYFYVPALDSCLITRSDDAPHLQMPRDLRL 432
Query: 119 ---------RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFH----YQEEKAIERGIL 165
R I+ +PYWNRT G DH + D GAC+ + + G
Sbjct: 433 RSYHTLEYYRMTYDHIAQRYPYWNRTSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWGNT 492
Query: 166 PLLQRATLVQTFGQR----------NHVCLN-EGSITIPPYAPPQ------KMQAHQIPP 208
+ + + NH C + + +P + P K+ A
Sbjct: 493 NTKHKNSTTAYWADNWNYIPIDRRGNHPCFDPRKDLVLPAWKQPNPAAIWLKLWART--- 549
Query: 209 DTPRSIFVYFRGLFYDVNNDPEG-GYYARGARAAVWENFKNNP----------LFDISTD 257
R+ YF G D Y+ G R + F + P +++
Sbjct: 550 RNNRTTLFYFNGNLGPAYKDGRHEDTYSMGIRQKLAAEFGSTPDKQGKLGRQHTANVTVT 609
Query: 258 HPTT--YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEI 315
+ T YYE++ +IFC G WS R+ ++++ GCIPVII D I+LP+ + + +
Sbjct: 610 YLRTEKYYEELASSIFCGVLPG-DGWSGRMEDSMLQGCIPVIIQDGILLPYENMLNYNSF 668
Query: 316 GMFVAEEDVPKLDTILTSIPP---EVILRKQRLL 346
+ + E+D+P L IL I E +LR R +
Sbjct: 669 AVRIQEDDIPNLIRILRGINETQVEFMLRNVRQI 702
>gi|297805260|ref|XP_002870514.1| hypothetical protein ARALYDRAFT_915839 [Arabidopsis lyrata subsp.
lyrata]
gi|297316350|gb|EFH46773.1| hypothetical protein ARALYDRAFT_915839 [Arabidopsis lyrata subsp.
lyrata]
Length = 536
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 100/346 (28%), Positives = 156/346 (45%), Gaps = 57/346 (16%)
Query: 39 KLKVYVYELPSK--YNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTL-NPEEADWFYT 95
KLKVYVY+ K ++K L PR + +A+E + + + S+ + +P +A FY
Sbjct: 227 KLKVYVYKEGGKPIFHKPL----PRGI---YASEGWFMKLMESNKKFVVRDPRKAHLFYI 279
Query: 96 PIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQ 155
PI ++ + LI+ + +WNRT GADHF V HD+G +
Sbjct: 280 PI------------------NHLKEYVDLIAGKYKFWNRTGGADHFIVACHDWGNKLTKK 321
Query: 156 EEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIF 215
K R L + + Q F L I P + + P + R I
Sbjct: 322 TMKNSVRA----LCNSNVAQGFRIGTDTAL--PVTYIRSAESPLEYLGGKTP--SKRKIL 373
Query: 216 VYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP------TTYYEDMQRA 269
+F G + Y R +WEN P I P T Y E M+ +
Sbjct: 374 AFFAGSMHG---------YLRPILVKLWEN--KEPDMKIVGPMPRDPESKTQYREYMKSS 422
Query: 270 IFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDT 329
+C+C G+ +PR+VEA++ C+PVIIAD+ V PF + + WEE +FV E+++ L
Sbjct: 423 KYCICARGYEVHTPRVVEAIINECVPVIIADNYVPPFFEILNWEEFAVFVEEKEIANLRN 482
Query: 330 ILTSIPPE-VILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
IL SI E I+ + R+ A +++ L+ + D FH IL+ +
Sbjct: 483 ILLSISEERYIVMQARVKA---VQQHFLWHKKPVKFDLFHMILHSI 525
>gi|449462352|ref|XP_004148905.1| PREDICTED: probable glycosyltransferase At5g25310-like [Cucumis
sativus]
gi|449523501|ref|XP_004168762.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
At5g25310-like [Cucumis sativus]
Length = 684
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 95/345 (27%), Positives = 160/345 (46%), Gaps = 37/345 (10%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP-VRTLNPEEADWFYTPIY 98
LKVY+Y ++ + + P L ++A+E + + L S+ T NP +A FY P +
Sbjct: 355 LKVYIYREGAR---PIFHQGP--LQSIYASEGWFMKILESNKKFVTKNPRKAHLFYLP-F 408
Query: 99 PTCDLTPTGLPLPFKSPR----MMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHY 154
+ L S + +++ + I++ +P+WNRT GADHF V HD+
Sbjct: 409 SSRQLEEVLYVRDSHSHKNLIQHLKNYLDFIAAKYPHWNRTGGADHFLVACHDWAPA--- 465
Query: 155 QEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSI 214
+ K + + I L + + + F V L E + + P + + + P + R I
Sbjct: 466 ETRKYMAKCIRALCN-SDVKEGFVFGKDVSLPETFVRVA-RNPLRDVGGN---PSSKRPI 520
Query: 215 FVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPT-----TYYEDMQRA 269
+F G + Y R WE + +P IS P Y M+ +
Sbjct: 521 LAFFAGSMHG---------YLRSTLLEYWE--RKDPDMKISGPMPKVKGSKNYLWHMKNS 569
Query: 270 IFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDT 329
+C+C G+ SPR+VE++++ C+PVII+D+ V P + + WE +FVAE+D+P L
Sbjct: 570 KYCICAKGYEVNSPRVVESILYECVPVIISDNFVPPLFEVLNWESFAVFVAEKDIPNLKK 629
Query: 330 ILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
IL SIP + Q + ++ L+ Q D FH IL+ +
Sbjct: 630 ILLSIPEKRYREMQMRVK--KLQPHFLWHAKPQKYDMFHMILHSI 672
>gi|297819890|ref|XP_002877828.1| hypothetical protein ARALYDRAFT_906539 [Arabidopsis lyrata subsp.
lyrata]
gi|297323666|gb|EFH54087.1| hypothetical protein ARALYDRAFT_906539 [Arabidopsis lyrata subsp.
lyrata]
Length = 107
Score = 113 bits (283), Expect = 1e-22, Method: Composition-based stats.
Identities = 48/105 (45%), Positives = 75/105 (71%), Gaps = 1/105 (0%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
+K+YVY+LP+ +N + RC TH+FAAE+ +HR LLSS VRTL+PEEAD+F+ P+Y
Sbjct: 1 MKIYVYDLPASFNDDWVTASDRCATHLFAAEVAIHRTLLSSDVRTLDPEEADFFFVPVYV 60
Query: 100 TCDLTPT-GLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFV 143
+C+ + + G P + + SA+ +S ++P+W+RT+G+DH FV
Sbjct: 61 SCNFSKSNGFPSLSHARSLFSSAVDFLSDHYPFWDRTQGSDHVFV 105
>gi|358348291|ref|XP_003638181.1| hypothetical protein MTR_121s0013 [Medicago truncatula]
gi|355504116|gb|AES85319.1| hypothetical protein MTR_121s0013 [Medicago truncatula]
Length = 550
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 98/350 (28%), Positives = 155/350 (44%), Gaps = 36/350 (10%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEI-FMHRFLLSSPVRTLNPEEADWFYTPI 97
KLKVY+Y + +K + P+ + ++A+E FM + +P++A FY P
Sbjct: 225 KLKVYIY----REGEKPIFHQPK-MRGIYASEGWFMKLMEGNKRFIVKDPKKAHLFYLPF 279
Query: 98 YPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEE 157
+ + L K + + + +I+ + +WNRT GADHF V HD+ + Q
Sbjct: 280 --SSQMLRANLSDNKKMEQYLDKYVNIIAGKYRFWNRTGGADHFLVACHDWASRITRQPM 337
Query: 158 KAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVY 217
K R L A + + F L + I P + A + P + R+I +
Sbjct: 338 KNCIRS----LCNANVAKGFQIGKDTTL--PATYIHSVMNPLRKIAGKHPSE--RTILAF 389
Query: 218 FRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDI------STDHPTTYYEDMQRAIF 271
F G + Y R WEN P I + Y + M + +
Sbjct: 390 FAGSMHG---------YLRPILLKHWEN--KEPDMKIFGAMARDAEGKRIYMDYMNSSKY 438
Query: 272 CLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTIL 331
C+C G+ +SPR+VEA+ C+PVII+D+ V PF + + WE +FV E DVP L +IL
Sbjct: 439 CICARGYEVYSPRIVEAIFSECVPVIISDNYVPPFFEVLKWEAFSVFVRERDVPNLRSIL 498
Query: 332 TSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL-ARKLPH 380
SI E L L +++ L+ + D FH IL+ + +L H
Sbjct: 499 LSITEEKYLALH--LGVKKVQQHFLWHKVPVKYDLFHMILHSIWNNRLSH 546
>gi|15239118|ref|NP_196722.1| Exostosin family protein [Arabidopsis thaliana]
gi|7573418|emb|CAB87721.1| putative protein [Arabidopsis thaliana]
gi|332004317|gb|AED91700.1| Exostosin family protein [Arabidopsis thaliana]
Length = 546
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 98/345 (28%), Positives = 160/345 (46%), Gaps = 37/345 (10%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSS-PVRTLNPEEADWFYTPIY 98
LKVYVY S+ ++ + + + ++A+E + + + SS T +P +A FY P
Sbjct: 220 LKVYVY---SEGDRPIFHQPEAIMEGIYASEGWFMKLMESSHRFLTKDPTKAHLFYIPF- 275
Query: 99 PTCDLTPTGLPLPFKSPR-----MMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFH 153
+ + L + R + + I LI+SN+P WNRT G+DHFF HD+
Sbjct: 276 -SSRILQQKLYVHDSHSRNNLVKYLGNYIDLIASNYPSWNRTCGSDHFFTACHDWAPT-- 332
Query: 154 YQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRS 213
E + + L A + F V L E ++ P K+ + + R+
Sbjct: 333 --ETRGPYINCIRALCNADVGIDFVVGKDVSLPETKVS-SLQNPNGKIGGSR---PSKRT 386
Query: 214 IFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDIST----DHPTTYYEDMQRA 269
I +F G + GY R + + + P D+ DH +Y M+R+
Sbjct: 387 ILAFFAGSLH--------GY----VRPILLNQWSSRPEQDMKIFNRIDH-KSYIRYMKRS 433
Query: 270 IFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDT 329
FC+C G+ SPR+VE++++GC+PVII+D+ V PF + + WE +FV E+++P L
Sbjct: 434 RFCVCAKGYEVNSPRVVESILYGCVPVIISDNFVPPFLEILNWESFAVFVPEKEIPNLRK 493
Query: 330 ILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
IL SIP + Q+ + K M D FH IL+ +
Sbjct: 494 ILISIPVRRYVEMQKRVLK-VQKHFMWHDGEPVRYDIFHMILHSV 537
>gi|26450497|dbj|BAC42362.1| unknown protein [Arabidopsis thaliana]
Length = 546
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 98/345 (28%), Positives = 160/345 (46%), Gaps = 37/345 (10%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSS-PVRTLNPEEADWFYTPIY 98
LKVYVY S+ ++ + + + ++A+E + + + SS T +P +A FY P
Sbjct: 220 LKVYVY---SEGDRPIFHQPEAIMEGIYASEGWFMKLMESSHRFLTKDPTKAHLFYIPF- 275
Query: 99 PTCDLTPTGLPLPFKSPR-----MMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFH 153
+ + L + R + + I LI+SN+P WNRT G+DHFF HD+
Sbjct: 276 -SSRILQQKLYVHDSHSRNNLVKYLGNYIDLIASNYPSWNRTCGSDHFFTACHDWAPT-- 332
Query: 154 YQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRS 213
E + + L A + F V L E ++ P K+ + + R+
Sbjct: 333 --ETRGPYINCIRALCNADVGIDFVVGKDVSLPETKVS-SLQNPNGKIGGSR---PSKRT 386
Query: 214 IFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDIST----DHPTTYYEDMQRA 269
I +F G + GY R + + + P D+ DH +Y M+R+
Sbjct: 387 ILAFFAGSLH--------GY----VRPILLNQWSSRPEQDMKIFNRIDH-KSYIRYMKRS 433
Query: 270 IFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDT 329
FC+C G+ SPR+VE++++GC+PVII+D+ V PF + + WE +FV E+++P L
Sbjct: 434 RFCVCAKGYEVNSPRVVESILYGCVPVIISDNFVPPFLEILNWESFAVFVPEKEIPNLRK 493
Query: 330 ILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
IL SIP + Q+ + K M D FH IL+ +
Sbjct: 494 ILISIPVRRYVEMQKRVLK-VQKHFMWHDGEPVRYDIFHMILHSV 537
>gi|218188734|gb|EEC71161.1| hypothetical protein OsI_03019 [Oryza sativa Indica Group]
Length = 499
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 99/349 (28%), Positives = 158/349 (45%), Gaps = 39/349 (11%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAE-IFMHRFLLSSPVRTLNPEEADWFYTPIY 98
LKVY+Y+ ++ + P L+ ++A+E FM S +P +A FY P Y
Sbjct: 164 LKVYIYQ----DGRRPIFHTP-PLSGIYASEGWFMKLLKESRRFAVTDPAKAHLFYLP-Y 217
Query: 99 PTCDLTPTGLPLPFKS-----PRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFH 153
+ L L +P +R ++ +++ +P+WNRT GADHF V HD+G+ +
Sbjct: 218 SSQQLR-ISLYVPDSHNLRPLAAYLRDFVKGLAAKYPFWNRTRGADHFLVACHDWGS-YT 275
Query: 154 YQEEKAIERGILPLLQRATLVQ-TFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPR 212
+ R + L A + F V L E +I P + ++ P + R
Sbjct: 276 TTAHGDLRRNTVKALCNADSSEGIFTPGRDVSLPETTIR----TPRRPLRYVGGLPVSRR 331
Query: 213 SIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPL-----FDISTDHPTTYYEDMQ 267
I +F G + R W + +++ + +Y + M+
Sbjct: 332 GILAFFAGNVHG---------RVRPVLLKHWGDGRDDDMRVYGPLPARVSRRMSYIQHMK 382
Query: 268 RAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKL 327
+ FCLCP+G+ SPR+VEA+ + C+PVIIAD+ VLP +D + W + VAE+DVP L
Sbjct: 383 NSRFCLCPMGYEVNSPRIVEALYYECVPVIIADNFVLPLSDVLDWSAFAVVVAEKDVPDL 442
Query: 328 DTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP--GDAFHQILNGL 374
IL I LRK + + F A+P D FH IL+ +
Sbjct: 443 KKILQGI----TLRKYVAMHGCVKRLQRHFLWHARPLRYDLFHMILHSI 487
>gi|356495238|ref|XP_003516486.1| PREDICTED: probable glycosyltransferase At3g07620-like [Glycine
max]
Length = 610
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 95/343 (27%), Positives = 150/343 (43%), Gaps = 35/343 (10%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEI-FMHRFLLSSPVRTLNPEEADWFYTPI 97
KLKV++Y + K + P+ + ++A+E FM + +P +A FY P
Sbjct: 285 KLKVFIY----REGAKPIFHQPK-MRGIYASEGWFMKLMEGNKRFIVKDPRKAHLFYLPF 339
Query: 98 YPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEE 157
+ + L P + + + ++LI+ + +WNRT+GADHF V HD+ + Q
Sbjct: 340 --SSQMLRVTLSNPKQMEQHLEKYVELIAGRYRFWNRTDGADHFLVACHDWASRITRQPM 397
Query: 158 KAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVY 217
K G + L + + + F L I P K A + P + RS +
Sbjct: 398 K----GCIRSLCNSNVAKGFQIGKDTTL--PVTYIHSVMDPLKECAGK--PPSERSALAF 449
Query: 218 FRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDI------STDHPTTYYEDMQRAIF 271
F G + Y R W N P I + Y E M + +
Sbjct: 450 FAGSMHG---------YLRPILLKHWAN--KEPDMKIFGPMPRDLEGKKMYMEYMNSSKY 498
Query: 272 CLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTIL 331
C+C G+ +PR++EA+ GC+PVII+D+ V P + + WE +FV E DVP L IL
Sbjct: 499 CICARGYEVHTPRIIEAIFSGCVPVIISDNYVPPLFEVLKWEAFSLFVRERDVPSLRDIL 558
Query: 332 TSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
SIP E L L +++ L+ + D FH IL+ +
Sbjct: 559 LSIPEEKYLALH--LGVKKVQQHFLWHKVPVKYDLFHMILHAI 599
>gi|357118563|ref|XP_003561022.1| PREDICTED: probable glycosyltransferase At5g25310-like
[Brachypodium distachyon]
Length = 479
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 99/375 (26%), Positives = 162/375 (43%), Gaps = 66/375 (17%)
Query: 44 VYELPSKYNKKLLLKDPRCLTHMFAAE------------------IFMHRFLLSSP---- 81
VY P+ +++ + + R H++A F+ + L SP
Sbjct: 112 VYRNPAAFHRSYVEMERRFKVHVYAEGEPPILHAGPCKNIYTIEGRFIEQLELMSPGGAG 171
Query: 82 VRTLNPEEADWFYTPIYPTCDLTPTGLPLPF-KSP--RMMRSAIQLISSNWPYWNRTEGA 138
VRT +PE A F+ P + + +PL + ++P ++ ++++++ +WNR+ GA
Sbjct: 172 VRTWDPERAHAFFLPFSVSQMVQFAYVPLSYDRAPLRALVADYVRVVAARHRFWNRSSGA 231
Query: 139 DHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPP 198
DHF + HD+G + + GI L A + F V + E ++
Sbjct: 232 DHFMLSCHDWGPEASRGDPELYGNGIRALCN-ANTSEGFRPGKDVSIPEINL-------- 282
Query: 199 QKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAV-------WENFK--NN 249
DTPR + + GL + P ++A G V W+ N
Sbjct: 283 -------YDGDTPRQLLLPAPGL----SERPYLAFFAGGRHGHVRDLLLREWKGRDPDNF 331
Query: 250 PLFDIS----------TDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIA 299
P+++ D YY MQRA FCLCP G SPR+VEA+ GC+PV+++
Sbjct: 332 PVYEYDLPTTTNTTGGGDKQHDYYSYMQRARFCLCPSGHEVASPRVVEAIHAGCVPVLVS 391
Query: 300 DDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQ 359
D PFAD + WE + V D+P+L +L SIP + R + +K+ Q
Sbjct: 392 DGYAPPFADVLRWEGFSVSVPVADIPRLREVLESIPAAEVERLRD--GGRLVKQHFTLRQ 449
Query: 360 PAQPGDAFHQILNGL 374
P + D FH IL+ +
Sbjct: 450 PPERLDMFHMILHSV 464
>gi|302761638|ref|XP_002964241.1| xylosyltransferase-like protein [Selaginella moellendorffii]
gi|300167970|gb|EFJ34574.1| xylosyltransferase-like protein [Selaginella moellendorffii]
Length = 332
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 86/296 (29%), Positives = 127/296 (42%), Gaps = 24/296 (8%)
Query: 84 TLNPEEADWFYTPIYPTCDLTPTGLPLPFKSP-----RMMRSAIQLISSNWPYWNRTEGA 138
T +PE+A F+ P + + T L P R R I +IS + WNR+ GA
Sbjct: 45 TSDPEKAHLFFLPF--SVAMMVTYLYTPGSHDMGPLGRFTRDYIDVISHRYSAWNRSRGA 102
Query: 139 DHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPP 198
DHF V HD+G + I L T +V + S+
Sbjct: 103 DHFMVSCHDWGPHISRAHPDLMANSIRVLCNANT------SEGYVPSKDASLPEIHLVGG 156
Query: 199 QKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDH 258
Q PP R +F G ++ P R W+ +
Sbjct: 157 QVPSVLGGPPPEERRYLAFFAG----GDHGP-----VRPVLFKYWKEKDEDVRVFEKLPS 207
Query: 259 PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
Y + M + +CLCP G+ SPR+VEA+ C+PV+IADD VLPF+D + W+ +
Sbjct: 208 RDAYLDYMSHSKYCLCPGGYEVNSPRIVEAIYNDCVPVVIADDFVLPFSDVLDWDAFSVK 267
Query: 319 VAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
V E D+P+L TIL +IP L Q ++ ++R F QP + D F+ IL+ +
Sbjct: 268 VLERDIPRLKTILQAIPTARYLEMQARVSK--VRRHFRFNQPPERYDVFNMILHSV 321
>gi|357466145|ref|XP_003603357.1| hypothetical protein MTR_3g106690 [Medicago truncatula]
gi|355492405|gb|AES73608.1| hypothetical protein MTR_3g106690 [Medicago truncatula]
Length = 393
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 85/288 (29%), Positives = 134/288 (46%), Gaps = 35/288 (12%)
Query: 92 WFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGAC 151
+ YTP + D+TP + + ++++S +P+WNRT GADHF + HD+G
Sbjct: 125 YLYTP--SSYDITPL--------TQFVSDYVRVVSMRYPFWNRTHGADHFMLACHDWGP- 173
Query: 152 FHYQEEK-AIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDT 210
H + + + +L A + F V L E I Y + +PP+
Sbjct: 174 -HASKGNPFLYNTSIRVLCNANTSEGFNPLKDVSLPE----IHLYGGEVSPKLLSLPPEN 228
Query: 211 -PRSIFVYFRGLFYDVNNDPEGGYYA--RGARAAVWENFKNNPLFDISTDHPTTYYEDMQ 267
PR +F GG + R W+N + L + YY M
Sbjct: 229 APRRYLAFF-----------AGGMHGPIRPILLQHWKNRDKDILVNEYLPKGIDYYSIML 277
Query: 268 RAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKL 327
+ FCLCP G+ SPR+VE++ C+PVI+++ VLPF+D + WE + V D+P+L
Sbjct: 278 SSKFCLCPSGFEVASPRIVESIYAECVPVILSNYYVLPFSDVLQWEAFSVQVDVSDIPRL 337
Query: 328 DTILTSIPPEVILR-KQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
IL++IP + KQ + A ++R QPA+ D FH IL+ +
Sbjct: 338 KEILSAIPESKYKKLKQGVRA---VRRHFTLNQPAKRFDVFHMILHSI 382
>gi|255567222|ref|XP_002524592.1| catalytic, putative [Ricinus communis]
gi|223536145|gb|EEF37800.1| catalytic, putative [Ricinus communis]
Length = 388
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 80/307 (26%), Positives = 137/307 (44%), Gaps = 20/307 (6%)
Query: 71 IFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAI---QLISS 127
+F+ + RTLNP+EA F+ P + P+ + R+ + ++IS
Sbjct: 88 LFLSLMETDTKFRTLNPDEAQVFFLPFSVVMIIEHLFHPIIRDKAVLERTVVDYVRIISH 147
Query: 128 NWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNE 187
+ YWNR+ GADHF + HD+G + E + I +L A + F + E
Sbjct: 148 KYTYWNRSLGADHFMLSCHDWGPRATWYERQLYFNSI-RVLCNANTSEYFNPKKDASFPE 206
Query: 188 GSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFK 247
++ A +PP R+I +F G + R W+
Sbjct: 207 INLITGEIAD----LTGGLPPSN-RTILAFFSGKMHGK---------LRPLLFQHWKEKD 252
Query: 248 NNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFA 307
+ L + +Y E M+++ +C+CP G SPR+VEA+ C+PV+I+ + V PF+
Sbjct: 253 KDVLVYETFPEGLSYQEMMKKSRYCICPSGHEVASPRIVEAIYAECVPVLISQNYVFPFS 312
Query: 308 DAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAF 367
D + WE + V+ D+ L IL IP + LR + + +++ L P + D F
Sbjct: 313 DVLNWESFSIQVSVSDISNLKNILLGIPEDQYLRMRERVKQ--VQQHFLINNPPKRYDVF 370
Query: 368 HQILNGL 374
H I++ +
Sbjct: 371 HMIIHSI 377
>gi|242043198|ref|XP_002459470.1| hypothetical protein SORBIDRAFT_02g005180 [Sorghum bicolor]
gi|241922847|gb|EER95991.1| hypothetical protein SORBIDRAFT_02g005180 [Sorghum bicolor]
Length = 584
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 100/355 (28%), Positives = 158/355 (44%), Gaps = 41/355 (11%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
+LKVYVYE + + D C + F+H ++ +RT +P +A F+ P
Sbjct: 237 QLKVYVYE----EGEPPVFHDGPCRSIYSTEGRFIHAMETATRLRTSDPSQAHVFFLPFS 292
Query: 99 PTCDLTPTGLP-----LPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFH 153
+ P P K R + +++IS +P+WNR+ GADHF + HD+G
Sbjct: 293 VVKMVKTIYEPGSHDMAPLK--RTVADYLRVISDKYPFWNRSAGADHFMLSCHDWGPYVS 350
Query: 154 YQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRS 213
+ I +L A + F V L E I + A +++ P R
Sbjct: 351 SANAELFGNSIR-VLCNANTSEGFDLAKDVSLPE--INLRSDAVERQVGG----PSASRR 403
Query: 214 IFVYFRGLFYDVNNDPEGGYYARGARAAVW---ENFKNNPLFDISTDHPTT--------- 261
F+ F F N+ P R A A W +++P +S PT
Sbjct: 404 PFLAF---FAGGNHGP-----VRPALLAHWGPGSGREDDPDVRVSEYLPTRGGRAGASAA 455
Query: 262 -YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD-DIVLPFADAIPWEEIGMFV 319
Y + M+R+ FCLCP G+ SPRL EAV C+PV++ D + LPFAD + W+ + V
Sbjct: 456 AYTDMMRRSRFCLCPGGYEVASPRLAEAVYLECVPVVVDDGEYALPFADVLNWDAFAVRV 515
Query: 320 AEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
DVP++ IL+++ P + + + + M+ P + DAFH IL+ +
Sbjct: 516 RVADVPRIKEILSAVSPRQYI-RMQRRVRMVRRHFMVHGGPPRRYDAFHMILHSV 569
>gi|334187794|ref|NP_197526.5| Exostosin family protein [Arabidopsis thaliana]
gi|332005439|gb|AED92822.1| Exostosin family protein [Arabidopsis thaliana]
Length = 458
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 83/302 (27%), Positives = 136/302 (45%), Gaps = 26/302 (8%)
Query: 80 SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSA----IQLISSNWPYWNRT 135
SP NPEEA F P+ + PL S + + +++ +PYWNR+
Sbjct: 166 SPFAANNPEEAHAFLLPVSVANIVHYLYRPLVTYSREQLHKVFLDYVDVVAHKYPYWNRS 225
Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
GADHF+V HD+ + + + ++ +L A + F + V + E +I
Sbjct: 226 LGADHFYVSCHDWAPDVSGSNPELM-KNLIRVLCNANTSEGFMPQRDVSIPEINIPGGHL 284
Query: 196 APPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGY-YARGARAAVWENFKNNPLFDI 254
PP+ ++ R I +F G G + Y R W++
Sbjct: 285 GPPRLSRS----SGHDRPILAFFAG----------GSHGYIRRILLQHWKDKDEEVQVHE 330
Query: 255 STDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEE 314
Y++ M A FCLCP G+ SPR+V A+ GC+PVII+D LPF+D + W +
Sbjct: 331 YLAKNKDYFKLMATARFCLCPSGYEVASPRVVAAINLGCVPVIISDHYALPFSDVLDWTK 390
Query: 315 IGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANP--SMKRAMLFPQPAQPGDAFHQILN 372
+ V + +P++ TIL SI R+ R+L ++R + +P+QP D +L+
Sbjct: 391 FTIHVPSKKIPEIKTILKSIS----WRRYRVLQRRVLQVQRHFVINRPSQPFDMLRMLLH 446
Query: 373 GL 374
+
Sbjct: 447 SV 448
>gi|292630785|sp|Q3E9A4.3|GLYT5_ARATH RecName: Full=Probable glycosyltransferase At5g20260
Length = 466
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 83/302 (27%), Positives = 136/302 (45%), Gaps = 26/302 (8%)
Query: 80 SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSA----IQLISSNWPYWNRT 135
SP NPEEA F P+ + PL S + + +++ +PYWNR+
Sbjct: 174 SPFAANNPEEAHAFLLPVSVANIVHYLYRPLVTYSREQLHKVFLDYVDVVAHKYPYWNRS 233
Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
GADHF+V HD+ + + + ++ +L A + F + V + E +I
Sbjct: 234 LGADHFYVSCHDWAPDVSGSNPELM-KNLIRVLCNANTSEGFMPQRDVSIPEINIPGGHL 292
Query: 196 APPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGY-YARGARAAVWENFKNNPLFDI 254
PP+ ++ R I +F G G + Y R W++
Sbjct: 293 GPPRLSRS----SGHDRPILAFFAG----------GSHGYIRRILLQHWKDKDEEVQVHE 338
Query: 255 STDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEE 314
Y++ M A FCLCP G+ SPR+V A+ GC+PVII+D LPF+D + W +
Sbjct: 339 YLAKNKDYFKLMATARFCLCPSGYEVASPRVVAAINLGCVPVIISDHYALPFSDVLDWTK 398
Query: 315 IGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANP--SMKRAMLFPQPAQPGDAFHQILN 372
+ V + +P++ TIL SI R+ R+L ++R + +P+QP D +L+
Sbjct: 399 FTIHVPSKKIPEIKTILKSIS----WRRYRVLQRRVLQVQRHFVINRPSQPFDMLRMLLH 454
Query: 373 GL 374
+
Sbjct: 455 SV 456
>gi|302815783|ref|XP_002989572.1| xylosyltransferase-like protein [Selaginella moellendorffii]
gi|300142750|gb|EFJ09448.1| xylosyltransferase-like protein [Selaginella moellendorffii]
Length = 332
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 86/296 (29%), Positives = 126/296 (42%), Gaps = 24/296 (8%)
Query: 84 TLNPEEADWFYTPIYPTCDLTPTGLPLPFKSP-----RMMRSAIQLISSNWPYWNRTEGA 138
T +PE+A F+ P + + T L P R R I +IS + WNR+ GA
Sbjct: 45 TSDPEKAHLFFLPF--SVAMMVTYLYTPGSHDMGPLGRFTRDYIDVISHRYSSWNRSRGA 102
Query: 139 DHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPP 198
DHF V HD+G + I L T +V + S+
Sbjct: 103 DHFMVSCHDWGPHISRAHPDLMANSIRVLCNANT------SEGYVPSKDASLPEIHLVGG 156
Query: 199 QKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDH 258
Q PP R +F G ++ P R W+ +
Sbjct: 157 QVPSVLGGPPPEERRYLAFFAG----GDHGP-----VRPVLFKYWKEKDEDVRVFEKLPS 207
Query: 259 PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
Y + M + +CLCP G+ SPR+VEA+ C+PV+IADD VLPF+D + W+ +
Sbjct: 208 RDAYLDYMSHSKYCLCPGGYEVNSPRIVEAIYNDCVPVVIADDFVLPFSDVLDWDAFSVK 267
Query: 319 VAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
V E D+P+L TIL +IP L Q + ++R F QP + D F+ IL+ +
Sbjct: 268 VLERDIPRLKTILQAIPTARYLEMQARASK--VRRHFRFNQPPERYDVFNMILHSV 321
>gi|302835519|ref|XP_002949321.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300265623|gb|EFJ49814.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 855
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 106/405 (26%), Positives = 170/405 (41%), Gaps = 90/405 (22%)
Query: 19 PKIERISGSAGDVLEDDPVGKLK--VYVYELPSKYNKKLLLKD------------PRCLT 64
P E +G+ V E G+++ +Y+Y++P ++ ++ P T
Sbjct: 379 PNSEGAAGTGSAVGEQHTPGRMRPLIYIYDMPPEFTSRMHQYKNVHEHCSYRRFIPSNRT 438
Query: 65 HMFA----AEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTG--LPLPF------ 112
++A E + H L SP RT +PEEAD+FY P+Y TC + P +PF
Sbjct: 439 ELYADTYSVEAYFHEVLSISPHRTFDPEEADFFYVPVYYTCWMWPINGWADMPFYGAPTS 498
Query: 113 -----KSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFH----YQEEKAIE-- 161
+ + A I SN+P+W+R G DH ++ HD GAC+ YQ +
Sbjct: 499 WHRYSNAANLWLKAKTWIQSNFPFWDRRGGRDHIWMTNHDEGACYMPTEIYQTSIMLTHW 558
Query: 162 ------------------------RGILPLLQRATLVQTFGQRNHVCLN-EGSITIPPYA 196
+G+L TL Q H C + + IP +
Sbjct: 559 GRMDLNHTSNTAYRPDNYSDGITWKGVLDGKDVKTLYQ-----GHPCYDPRKDLVIPAFK 613
Query: 197 PPQKMQAHQIPPDTPRS--IFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPL--- 251
P + PR I +Y RG DV E Y+RG R +++ +N
Sbjct: 614 TPDHFSQSPLLGSWPRQRDILLYLRG---DVGKHREPN-YSRGIRQKLYKLAVDNEWAKK 669
Query: 252 --------FDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 303
F+I +Y E + R++FC G +SPR +AV+ GC+P+II D+
Sbjct: 670 HRIFIGEQFEIQ----GSYGEHLSRSLFCAVVPG-DGYSPRFEDAVLHGCLPLIIVDNTH 724
Query: 304 LPFADAIPWEEIGMFVAEEDVPK-LDTILTSIPPEVILRKQRLLA 347
+ F I + + ++E + + L +LT+I P+ I R QR L+
Sbjct: 725 VLFESIIDVDSFSLRISEAALNEYLPHLLTAISPDQIARMQRRLS 769
>gi|109946595|gb|ABG48476.1| At3g42180 [Arabidopsis thaliana]
Length = 425
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 88/309 (28%), Positives = 141/309 (45%), Gaps = 36/309 (11%)
Query: 79 SSPVRTLNPEEADWFYTPIYPTCDLT-----PTGLPLPFKSPRMMR---SAIQLISSNWP 130
S R PEEA F+ P + ++ P P F R+ R + +++ P
Sbjct: 129 SGRFRASRPEEAHAFFLP-FSVANIVHYVYQPITSPADFNRARLHRIFNDYVDVVAHKHP 187
Query: 131 YWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSI 190
+WN++ GADHF V HD+ + + + + L T + G R ++ + I
Sbjct: 188 FWNQSNGADHFMVSCHDWAPDVPDSKPEFFKNFMRGLCNANT---SEGFRRNIDFSIPEI 244
Query: 191 TIPP--YAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN 248
IP PP Q P+ R+I +F G + GY R ++ ++K
Sbjct: 245 NIPKRKLKPPFMGQN----PEN-RTILAFFAGRAH--------GY----IREVLFSHWKG 287
Query: 249 NPLFDISTDHPT---TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
DH T Y+E + + FCLCP G+ SPR VEA+ GC+PV+I+D+ LP
Sbjct: 288 KDKDVQVYDHLTKGQNYHELIGHSKFCLCPSGYEVASPREVEAIYSGCVPVVISDNYSLP 347
Query: 306 FADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
F D + W + + + + +P + IL IP + LR R + ++R + +PAQP D
Sbjct: 348 FNDVLDWSKFSVEIPVDKIPDIKKILQEIPHDKYLRMYRNVM--KVRRHFVVNRPAQPFD 405
Query: 366 AFHQILNGL 374
H IL+ +
Sbjct: 406 VIHMILHSV 414
>gi|26451681|dbj|BAC42936.1| unknown protein [Arabidopsis thaliana]
Length = 270
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 78/266 (29%), Positives = 131/266 (49%), Gaps = 29/266 (10%)
Query: 116 RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ 175
+ +++ + +ISS + +WN+T G+DHF V HD+ E + + L + + +
Sbjct: 19 QFLKNYLDMISSKYSFWNKTGGSDHFLVACHDWAP----SETRQYMAKCIRALCNSDVSE 74
Query: 176 TFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYA 235
F V L E +I +P + ++A P + R I +F G + GY
Sbjct: 75 GFVFGKDVALPETTILVPR----RPLRALGGKPVSQRQILAFFAGGMH--------GY-- 120
Query: 236 RGARAAVWENFKNN--PLFDISTDHPTT-----YYEDMQRAIFCLCPLGWAPWSPRLVEA 288
R + +N+ N P I ++ P + Y E M+ + +C+CP G SPR+VEA
Sbjct: 121 --LRPLLLQNWGGNRDPDMKIFSEIPKSKGKKSYMEYMKSSKYCICPKGHEVNSPRVVEA 178
Query: 289 VVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLAN 348
+ + C+PVII+D+ V PF + + WE +FV E+D+P L IL SI E R+ ++
Sbjct: 179 LFYECVPVIISDNFVPPFFEVLNWESFAVFVLEKDIPDLKNILVSITEER-YREMQMRVK 237
Query: 349 PSMKRAMLFPQPAQPGDAFHQILNGL 374
K + +P + D FH IL+ +
Sbjct: 238 MVQKHFLWHSKPER-FDIFHMILHSI 262
>gi|6735291|emb|CAB68119.1| putative protein [Arabidopsis thaliana]
Length = 340
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 88/309 (28%), Positives = 141/309 (45%), Gaps = 36/309 (11%)
Query: 79 SSPVRTLNPEEADWFYTPIYPTCDLT-----PTGLPLPFKSPRMMR---SAIQLISSNWP 130
S R PEEA F+ P + ++ P P F R+ R + +++ P
Sbjct: 44 SGRFRASRPEEAHAFFLP-FSVANIVHYVYQPITSPADFNRARLHRIFNDYVDVVAHKHP 102
Query: 131 YWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSI 190
+WN++ GADHF V HD+ + + + + L T + G R ++ + I
Sbjct: 103 FWNQSNGADHFMVSCHDWAPDVPDSKPEFFKNFMRGLCNANT---SEGFRRNIDFSIPEI 159
Query: 191 TIPP--YAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN 248
IP PP Q P+ R+I +F G + GY R ++ ++K
Sbjct: 160 NIPKRKLKPPFMGQN----PEN-RTILAFFAGRAH--------GY----IREVLFSHWKG 202
Query: 249 NPLFDISTDHPT---TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
DH T Y+E + + FCLCP G+ SPR VEA+ GC+PV+I+D+ LP
Sbjct: 203 KDKDVQVYDHLTKGQNYHELIGHSKFCLCPSGYEVASPREVEAIYSGCVPVVISDNYSLP 262
Query: 306 FADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
F D + W + + + + +P + IL IP + LR R + ++R + +PAQP D
Sbjct: 263 FNDVLDWSKFSVEIPVDKIPDIKKILQEIPHDKYLRMYRNVM--KVRRHFVVNRPAQPFD 320
Query: 366 AFHQILNGL 374
H IL+ +
Sbjct: 321 VIHMILHSV 329
>gi|413952956|gb|AFW85605.1| hypothetical protein ZEAMMB73_586394 [Zea mays]
Length = 509
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 89/306 (29%), Positives = 138/306 (45%), Gaps = 36/306 (11%)
Query: 82 VRTLNPEEADWFYTPIYPTCDLTPTGLPLPF-KSP--RMMRSAIQLISSNWPYWNRTEGA 138
VRT P A F+ P + + P + K+P ++ +++++S PYWNR+ GA
Sbjct: 216 VRTSEPARAHAFFLPFSVSQMVQFAYRPNTYDKTPLRAIVADYVRVVASRHPYWNRSAGA 275
Query: 139 DHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPP 198
DHF + HD+G + GI L C S P+
Sbjct: 276 DHFMLACHDWGPEASTGHPELHANGIRAL----------------CNANSSEGFRPWQDV 319
Query: 199 QKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAV-------WENFKNN-- 249
+ D PR + G V + P ++A G + W+ +
Sbjct: 320 SVPDINLYDGDMPRQLLAPAPG----VTSRPFLAFFAGGRHGHIRDLLLRHWKGRDPDFF 375
Query: 250 PLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 309
P+++ + YY M+RA FCLCP G+ SPR+VE++ C+PVI++D VLPFAD
Sbjct: 376 PVYEHRHEDGFDYYSFMRRARFCLCPSGYQVASPRVVESIHAECVPVILSDGYVLPFADV 435
Query: 310 IPWEEIGMFVAEEDVPKLDTILTSIP-PEVILRKQRLLANPSMKRAMLFPQPAQPGDAFH 368
+ WE + V D+P+L +L IP PEV +Q + +KR + QP Q D F+
Sbjct: 436 LRWEAFSVAVPVADIPRLREVLERIPAPEVEKLQQGVRL---VKRHFMLNQPPQRLDMFN 492
Query: 369 QILNGL 374
IL+ +
Sbjct: 493 MILHSV 498
>gi|145339080|ref|NP_189804.4| putative glycosyltransferase [Arabidopsis thaliana]
gi|292630777|sp|Q3EAR7.2|GLYT2_ARATH RecName: Full=Probable glycosyltransferase At3g42180
gi|332644208|gb|AEE77729.1| putative glycosyltransferase [Arabidopsis thaliana]
Length = 470
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 88/309 (28%), Positives = 141/309 (45%), Gaps = 36/309 (11%)
Query: 79 SSPVRTLNPEEADWFYTPIYPTCDLT-----PTGLPLPFKSPRMMR---SAIQLISSNWP 130
S R PEEA F+ P + ++ P P F R+ R + +++ P
Sbjct: 174 SGRFRASRPEEAHAFFLP-FSVANIVHYVYQPITSPADFNRARLHRIFNDYVDVVAHKHP 232
Query: 131 YWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSI 190
+WN++ GADHF V HD+ + + + + L T + G R ++ + I
Sbjct: 233 FWNQSNGADHFMVSCHDWAPDVPDSKPEFFKNFMRGLCNANT---SEGFRRNIDFSIPEI 289
Query: 191 TIPP--YAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN 248
IP PP Q P+ R+I +F G + GY R ++ ++K
Sbjct: 290 NIPKRKLKPPFMGQN----PEN-RTILAFFAGRAH--------GY----IREVLFSHWKG 332
Query: 249 NPLFDISTDHPT---TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
DH T Y+E + + FCLCP G+ SPR VEA+ GC+PV+I+D+ LP
Sbjct: 333 KDKDVQVYDHLTKGQNYHELIGHSKFCLCPSGYEVASPREVEAIYSGCVPVVISDNYSLP 392
Query: 306 FADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
F D + W + + + + +P + IL IP + LR R + ++R + +PAQP D
Sbjct: 393 FNDVLDWSKFSVEIPVDKIPDIKKILQEIPHDKYLRMYRNVM--KVRRHFVVNRPAQPFD 450
Query: 366 AFHQILNGL 374
H IL+ +
Sbjct: 451 VIHMILHSV 459
>gi|302825130|ref|XP_002994198.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300137940|gb|EFJ04733.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 762
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 104/384 (27%), Positives = 164/384 (42%), Gaps = 67/384 (17%)
Query: 22 ERISGSAGDVLEDDPVGKLK--VYVYELPSKYNKKLLLKDP---RCLTHMF--------- 67
+R S S +++ V K + +Y+Y+LP++YN LL +C+T ++
Sbjct: 300 QRSSESDDQQVQEISVEKRRPLIYIYDLPAEYNSHLLEGRHFKFQCVTRVYDGVNATFWS 359
Query: 68 ----AAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPT------------GLPLP 111
AE+ LL+SP RT+N +EAD+F+ P+ C +T GL
Sbjct: 360 EYLEGAELAFLEGLLASPHRTMNGDEADYFFAPVLGACAITRADDAPHFSMEKHMGLRGY 419
Query: 112 FKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQE----EKAIERGILPL 167
F S + ++A I +P+WNR+ G DH ++ P D GAC +E + G
Sbjct: 420 F-SGELYKNAYMHIKEQYPFWNRSSGRDHIWLFPWDEGACSAPKEIWNGTMLVHWGNTNS 478
Query: 168 LQRATLVQTFG----------QRNHVCLNE-GSITIPPYAPPQKMQAHQ---IPPDTPRS 213
+ + F + +H C + I +P + P + P R
Sbjct: 479 KHKKSTTGYFADSWDDIPKEWRGDHPCYDPLKDIVLPAWKNPDPRSVAERFWSRPREERK 538
Query: 214 IFVYFRGLF---YDVNNDPEGGYYARGARAAVWENFKNNPLFD------------ISTDH 258
YF G YD PE Y+ G R V E F + P ++
Sbjct: 539 TLFYFNGNLGKGYDFGR-PE-DRYSMGIRQRVAEEFGSTPNNHGKLGRQAAPDVVVTPQR 596
Query: 259 PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
Y +++ + FC G WS R+ +AV+ GCIPVII D I LP+ + +E +
Sbjct: 597 SDDYAKELSSSRFCGVFPG-DGWSGRMEDAVLHGCIPVIIQDGIHLPYESLLDYESFTVR 655
Query: 319 VAEEDVPKLDTILTSIPPEVILRK 342
VAE+ +P+L TIL +I + K
Sbjct: 656 VAEDKIPELITILRNISNAEVESK 679
>gi|356554237|ref|XP_003545455.1| PREDICTED: probable glycosyltransferase At5g25310-like [Glycine
max]
Length = 483
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 90/302 (29%), Positives = 136/302 (45%), Gaps = 31/302 (10%)
Query: 83 RTLNPEEADWFYTPIYPTCDLTPTGLPLPFKS---PRMMRSAIQLISSNWPYWNRTEGAD 139
RT +P A ++ P T + PL F + + +++IS+ P+WN T GAD
Sbjct: 193 RTNDPNAAHVYFLPFSVTWMVKYLYTPLSFNVTPLKQFVSDYVRVISTRHPFWNITHGAD 252
Query: 140 HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQ 199
HF + HD+G Q + + +L A + F R V L E I Y
Sbjct: 253 HFMLACHDWGP-HASQGNPFLYNTSIRVLCNANTSEGFNPRKDVSLPE----IHLYGGEV 307
Query: 200 KMQAHQIPPDT-PRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDH 258
+ PPDT PR +F G + R A+ ++KN+ DI
Sbjct: 308 SPKLLSPPPDTAPRRYLAFFSGGLHG------------PIRPALLRHWKNDNDDDIRVYE 355
Query: 259 ----PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEE 314
YY M + FCLCP G SPR+VEA+ C+PVI+++ VLPF+D + WE
Sbjct: 356 YLPKDLDYYSFMLNSKFCLCPSGHEVASPRIVEAIYAECVPVILSEYYVLPFSDVLQWEA 415
Query: 315 IGMFVAEEDVPKLDTILTSIPPEVILRKQRLLAN--PSMKRAMLFPQPAQPGDAFHQILN 372
+ V D+P+L IL++I + K R L +++ +PA+ D FH IL+
Sbjct: 416 FSVQVDVSDIPRLKEILSAISED----KYRKLKEGVKAVRGHFTLNRPAKRFDVFHMILH 471
Query: 373 GL 374
+
Sbjct: 472 SI 473
>gi|224092292|ref|XP_002309546.1| predicted protein [Populus trichocarpa]
gi|222855522|gb|EEE93069.1| predicted protein [Populus trichocarpa]
Length = 417
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 102/391 (26%), Positives = 168/391 (42%), Gaps = 48/391 (12%)
Query: 4 TAIALLLAFVSADNTPKIERISGSAGDVLEDDPVGK---------LKVYVYELPSKYNKK 54
+++ +L A +N+P I+ + + + + + LKVYVY+ +K
Sbjct: 44 SSVEMLFAKSQIENSPLIKNETRLYAPIYRNVSMFRRSYELMEKMLKVYVYQ----DGEK 99
Query: 55 LLLKDPRCLTHMFAAE-IFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFK 113
+ P L ++A+E FM + T +P +A FY P + L L +
Sbjct: 100 PIFHQP-ILDGIYASEGWFMKHMEANENFVTKDPGKAHLFYLPF--SSRLLELTLYVRHS 156
Query: 114 SPRM-----MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLL 168
R MR+ +I++ + +WNRT GADHF HD+ E + + L
Sbjct: 157 HSRTNLIEYMRNYAGMIAAKYHFWNRTGGADHFVAACHDWAPA----ETRGPLLNCIRAL 212
Query: 169 QRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNND 228
A + F V L E + P + ++ + P + R I +F G +
Sbjct: 213 CNADIEVGFSIGKDVSLPETYVR-SAQNPLKNLEGN---PPSQRPILAFFAGNMHG---- 264
Query: 229 PEGGYYARGARAAVWENFKNNPLFDISTDHP-----TTYYEDMQRAIFCLCPLGWAPWSP 283
Y R W N +P I P T Y + M+ + FC+CP G SP
Sbjct: 265 -----YVRPVLLDYWGN--KDPDMKIFGPMPHVKGNTNYIQHMKSSKFCICPRGHEVNSP 317
Query: 284 RLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQ 343
R+VEA+ C+PVII+D+ V PF + + WE + V E+D+P L IL SI E +
Sbjct: 318 RIVEAIFLECVPVIISDNFVPPFFEVLDWESFAVIVLEKDIPNLKNILVSISEEKYIEMH 377
Query: 344 RLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+ + +++ L+ + D FH IL+ +
Sbjct: 378 KRVK--KVQQHFLWHSKPEKYDLFHMILHSV 406
>gi|357520911|ref|XP_003630744.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
gi|355524766|gb|AET05220.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
Length = 653
Score = 111 bits (277), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 97/351 (27%), Positives = 152/351 (43%), Gaps = 48/351 (13%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEI-FMHRFLLSSPVRTLNPEEADWFYTP-- 96
LKVY+Y K + P L ++A+E FM + +P +A FY P
Sbjct: 322 LKVYIY----MEGNKPIFHQP-ILKGLYASEGWFMKLMEENKQFVVKDPAKAHLFYMPFS 376
Query: 97 -------IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFG 149
+Y T L + ++ IS+ + Y+NRT GADHF V HD+
Sbjct: 377 SRMLEFSVYVRNSHNRTNLR------QYLKEYTDKISAKYRYFNRTGGADHFLVACHDWA 430
Query: 150 ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPD 209
Y+ +E I L + + Q F V L E + PQ+ + P
Sbjct: 431 P---YETRHHMEYCIKALCN-SDVTQGFKIGRDVSLPE--TMVRSVRNPQRDLGGK--PP 482
Query: 210 TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDI------STDHPTTYY 263
RSI ++ G + Y R W+ + +P I H Y
Sbjct: 483 QQRSILAFYAGNMHG---------YLRSILLKHWK--EKDPDMKIFGPMPHGVAHKMNYI 531
Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
E M+ + +C+CP G+ SPR+VEA+ + C+PVII+D+ V PF + + W+ + +AE+D
Sbjct: 532 EHMKSSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEVLNWDSFSLILAEKD 591
Query: 324 VPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+P L IL S+P E L+ Q L +++ L+ D FH L+ +
Sbjct: 592 IPNLKQILLSVPEEKYLKLQ--LGVRRVQKHFLWHTKPLKYDLFHMTLHSI 640
>gi|224090294|ref|XP_002308967.1| predicted protein [Populus trichocarpa]
gi|222854943|gb|EEE92490.1| predicted protein [Populus trichocarpa]
Length = 793
Score = 111 bits (277), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 101/397 (25%), Positives = 168/397 (42%), Gaps = 78/397 (19%)
Query: 28 AGDVLEDDPVGKLK---VYVYELPSKYNKKLLLKDP---RCLT-------------HMFA 68
G +++ + V K K +Y+Y+LP K+N LL C+ ++
Sbjct: 334 TGKLVDLNAVVKKKRPLIYIYDLPPKFNSLLLEGRHFKFECVNRLYNDNNATIWTDQLYG 393
Query: 69 AEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLP--LPFKSPRMMRSAIQL-- 124
A++ ++ +L+SP RTLN EEAD+F+ P+ +C +T L + +RS++ L
Sbjct: 394 AQMALYESILASPYRTLNGEEADFFFVPVLDSCIITRADDAPHLSMEQHLGLRSSLTLEF 453
Query: 125 -------ISSNWPYWNRTEGADHFFVVPHDFGACFHYQE--------------EKAIERG 163
I ++P+WNR+ G DH + D GAC+ +E K
Sbjct: 454 YRKAYDHIVEHYPFWNRSSGRDHLWSFSWDEGACYAPKEIWNSMMVVHWGNTNSKHNHST 513
Query: 164 ILPLLQRATLVQTFGQRNHVCLN-EGSITIPPYAPPQ------KMQAHQIPPDTPRSIFV 216
+ + + H C + + + +P + P K+ A P R
Sbjct: 514 TAYWADNWDKISSDRRGKHPCFDPDKDLVLPAWKRPDVNALSTKLWAR---PLEKRKTLF 570
Query: 217 YFRGLF--YDVNNDPEGGYYARGARAAVWENFKNNPLFD------------ISTDHPTTY 262
YF G +N PE Y+ G R + E F + P D +S +Y
Sbjct: 571 YFNGNLGPAYLNGRPE-ALYSMGIRQKLAEEFGSTPNKDGNLGKQHAENVIVSPLRSESY 629
Query: 263 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
+ED+ ++FC G WS R+ ++++ GCIPV+I D I LP+ + + +E + + E+
Sbjct: 630 HEDLASSVFCGVMPG-DGWSGRMEDSILQGCIPVVIQDGIYLPYENVLNYESFAVRILED 688
Query: 323 DVPKLDTILTSIPPEVILRK--------QRLLANPSM 351
++P L IL I K QR L SM
Sbjct: 689 EIPNLIKILQGFNETEIENKLTSVQKIGQRFLYRDSM 725
>gi|115451051|ref|NP_001049126.1| Os03g0174300 [Oryza sativa Japonica Group]
gi|108706455|gb|ABF94250.1| exostosin family protein, putative, expressed [Oryza sativa
Japonica Group]
gi|113547597|dbj|BAF11040.1| Os03g0174300 [Oryza sativa Japonica Group]
gi|215695509|dbj|BAG90700.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 576
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 95/376 (25%), Positives = 161/376 (42%), Gaps = 65/376 (17%)
Query: 17 NTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLL------------LKDPRCLT 64
N P ++ IS S+ +V + +YVY+LP++++ LL + D + T
Sbjct: 192 NLPDLKNISSSSINVKVVAKKKRPLIYVYDLPAEFDSHLLEGRHYKFQCVNRIYDDKNRT 251
Query: 65 ----HMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMM-- 118
++ A+I ++ +L+SP RTLN +EAD+FY P +C +T + + PR +
Sbjct: 252 IWTQQLYGAQIALYESILASPHRTLNGDEADYFYVPALDSCLITRSDDAPHLQMPRDLRL 311
Query: 119 ---------RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFH----YQEEKAIERGIL 165
R I+ +PYWNRT G DH + D GAC+ + + G
Sbjct: 312 RSYHTLEYYRMTYDHIAQRYPYWNRTSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWGNT 371
Query: 166 PLLQRATLVQTFGQR----------NHVCLN-EGSITIPPYAPPQ------KMQAHQIPP 208
+ + + NH C + + +P + P K+ A
Sbjct: 372 NTKHKNSTTAYWADNWNYIPIDRRGNHPCFDPRKDLVLPAWKQPNPAAIWLKLWART--- 428
Query: 209 DTPRSIFVYFRGLFYDVNNDPEG-GYYARGARAAVWENFKNNP----------LFDISTD 257
R+ YF G D Y+ G R + F + P +++
Sbjct: 429 RNNRTTLFYFNGNLGPAYKDGRHEDTYSMGIRQKLAAEFGSTPDKQGKLGRQHTANVTVT 488
Query: 258 HPTT--YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEI 315
+ T YYE++ +IFC G WS R+ ++++ GCIPVII D I+LP+ + + +
Sbjct: 489 YLRTEKYYEELASSIFCGVLPG-DGWSGRMEDSMLQGCIPVIIQDGILLPYENMLNYNSF 547
Query: 316 GMFVAEEDVPKLDTIL 331
+ + E+D+P L IL
Sbjct: 548 AVRIQEDDIPNLIRIL 563
>gi|255550904|ref|XP_002516500.1| catalytic, putative [Ricinus communis]
gi|223544320|gb|EEF45841.1| catalytic, putative [Ricinus communis]
Length = 456
Score = 111 bits (277), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 84/301 (27%), Positives = 136/301 (45%), Gaps = 33/301 (10%)
Query: 84 TLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPR-----MMRSAIQLISSNWPYWNRTEGA 138
T +P+EA FY P + L L + R M++ +I++ +P+W+RT GA
Sbjct: 168 TKDPKEAHLFYIPF--SSRLLELTLYVRKSHSRNNLIEYMKNYTDMIAAKYPFWSRTGGA 225
Query: 139 DHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPP 198
DHF HD+ E + + L A + F V L E + P
Sbjct: 226 DHFVAACHDWAPA----ETRGRMLNCIRALCNADIDVGFRIGKDVSLPETYVR-SAQNPL 280
Query: 199 QKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDH 258
+ + + P + R I +F G + + R WEN +P I
Sbjct: 281 KNLDGN---PPSQRPILAFFAGNVHG---------FVRPILLEYWEN--KDPEMKIFGPM 326
Query: 259 P-----TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE 313
P T Y + M+ + +C+CP G SPR+VE++ + C+PVII+D+ V PF + + WE
Sbjct: 327 PRVKGNTNYIQLMKSSKYCICPRGHEVNSPRIVESIFYECVPVIISDNYVPPFFEVLDWE 386
Query: 314 EIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNG 373
+FV E+D+P L IL SIP E + + + +++ L+ + D FH IL+
Sbjct: 387 SFAVFVLEKDIPNLKKILLSIPEETYVEMHKRVK--KVQQHFLWHSEPEKHDLFHMILHS 444
Query: 374 L 374
+
Sbjct: 445 V 445
>gi|449527873|ref|XP_004170933.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
sativus]
Length = 466
Score = 111 bits (277), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 89/318 (27%), Positives = 146/318 (45%), Gaps = 43/318 (13%)
Query: 67 FAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTC-----DLTPTGLPLPFKSPRMMRSA 121
+A+E + + L+ S T +P+EAD+F+ P T ++ +G+P +R
Sbjct: 169 YASESYFKKSLIKSHFITNDPKEADFFFLPFSITGLRNDRRVSVSGIP------NFIRDY 222
Query: 122 IQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRN 181
I +S +PYWNRT GADHF+V H G + +A ++++VQ +
Sbjct: 223 IFDVSHKYPYWNRTGGADHFYVACHSVGRSAMDKSSEA----------KSSIVQVVCSSS 272
Query: 182 HVCLNEGSITIPPYAPPQKMQAHQIPPD---TPRSIFVYFRGLFYDVNNDPEGGYYARGA 238
+ G I+ A PQ + P + + R+ +F G N P R A
Sbjct: 273 YFL--TGYISHKDAALPQIWPRKEDPSNLASSKRTRLAFFAGAM----NSP-----TRQA 321
Query: 239 RAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 298
VW K++ +F S T Y +++ R+ FCL G+ + R+ +++ +GC+PVII
Sbjct: 322 LVQVWG--KDSEIFAYSGRLKTPYADELLRSKFCLHVKGFEVNTARVGDSIFYGCVPVII 379
Query: 299 ADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFP 358
A+ LPF D + W+ + V D+P+L IL I E R L + +K F
Sbjct: 380 ANYYDLPFGDILNWKSFSIVVTTSDIPRLKEILKGINDEEYAR----LQSNVLKVRKHFK 435
Query: 359 QPAQP--GDAFHQILNGL 374
+ P D FH ++ L
Sbjct: 436 WHSSPVDYDTFHMVMYQL 453
>gi|240256346|ref|NP_197913.4| putative glycosyltransferase [Arabidopsis thaliana]
gi|292630776|sp|Q3E7Q9.2|GLYT6_ARATH RecName: Full=Probable glycosyltransferase At5g25310
gi|332006042|gb|AED93425.1| putative glycosyltransferase [Arabidopsis thaliana]
Length = 480
Score = 110 bits (276), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 99/354 (27%), Positives = 160/354 (45%), Gaps = 51/354 (14%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLS-----SPVRTLNPEEADWF 93
+ KVYVYE + L+ D C + ++A E RF+ + RT +P +A +
Sbjct: 150 RFKVYVYE----EGEPPLVHDGPCKS-VYAVE---GRFITEMEKRRTKFRTYDPNQAYVY 201
Query: 94 YTPIYPTCDL----TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFG 149
+ P T + P K+ + I+L+S+N P+WNRT GADHF + HD+G
Sbjct: 202 FLPFSVTWLVRYLYEGNSDAKPLKT--FVSDYIRLVSTNHPFWNRTNGADHFMLTCHDWG 259
Query: 150 ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPD 209
Q + + + ++ A + F V L E I + K++ +
Sbjct: 260 P-LTSQANRDLFNTSIRVMCNANSSEGFNPTKDVTLPE--IKLYGGEVDHKLRLSKTLSA 316
Query: 210 TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAV-------WENFK-NNPLFDISTDHPTT 261
+PR P G++A G V W+ + P+++ H
Sbjct: 317 SPR----------------PYLGFFAGGVHGPVRPILLKHWKQRDLDMPVYEYLPKH-LN 359
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
YY+ M+ + FC CP G+ SPR++EA+ CIPVI++ + VLPF D + WE + V
Sbjct: 360 YYDFMRSSKFCFCPSGYEVASPRVIEAIYSECIPVILSVNFVLPFTDVLRWETFSVLVDV 419
Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPS-MKRAMLFPQPAQPGDAFHQILNGL 374
++P+L IL SI E + + L +N ++R P Q DAFH L+ +
Sbjct: 420 SEIPRLKEILMSISNE---KYEWLKSNLRYVRRHFELNDPPQRFDAFHLTLHSI 470
>gi|159478515|ref|XP_001697348.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158274506|gb|EDP00288.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 620
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 111/397 (27%), Positives = 164/397 (41%), Gaps = 67/397 (16%)
Query: 28 AGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNP 87
AG +P G K+YVYE P +Y+ LL R L I + L+S +RT+NP
Sbjct: 145 AGQGYRPNPRGP-KIYVYEFPPQYHVWSLLWLDRPLN------IILWERLMSLGLRTVNP 197
Query: 88 EEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHD 147
EEAD+F+ P C FK ++ YWNR G DH D
Sbjct: 198 EEADYFF---LPGCGRGCNKWDQKFK---------YIMEHYAKYWNRRNGRDHLMTHAGD 245
Query: 148 FGACFHYQEEKAIERGILPLLQRATLVQTFG-------QRNHVCLN-----EGSITIPPY 195
+G C EKA P + T++ +G + H N + I +P
Sbjct: 246 WGRC-----EKAWGPDSAPFIANLTMLNHWGITVDRSQETEHDLFNACHIPDQDIQVPVL 300
Query: 196 AP---PQ--------KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPE----GGYYARGARA 240
PQ K +AH + +++ G N+ E YY+ G RA
Sbjct: 301 CGDLYPQFEHNVWHPKRRAHPVN----KTVLASVAGSICGWNSAEEPPCKNKYYSFGVRA 356
Query: 241 AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 300
A+W ++ P F I+ P + M + FC P G A R V +V GC+PVII+D
Sbjct: 357 ALWTTLRDKPGFHIAKRVPV-LGQSMAESEFCFAPTG-AGHGKRQVVSVTLGCMPVIISD 414
Query: 301 DIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAM----- 355
+ PF + W + G+++AE D+P ++ IL P+ K + L + A
Sbjct: 415 HVAQPFEPFLDWNDFGVWIAEADLPDVEAILRGFTPQQKAAKMKKLYCAARHVAYTTVFG 474
Query: 356 -LFPQPAQPGDAFHQILNGLARKLPHDKSTYL--KPG 389
LF DA I N L ++ D + ++ KPG
Sbjct: 475 GLFDGDTGEYDAIATIFNIL--RMDQDFADFMACKPG 509
>gi|356546130|ref|XP_003541484.1| PREDICTED: probable glycosyltransferase At5g20260-like [Glycine
max]
Length = 468
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 78/307 (25%), Positives = 140/307 (45%), Gaps = 34/307 (11%)
Query: 80 SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSP---------RMMRSAIQLISSNWP 130
SP + NP+EA F+ P + ++ P+ S R++ I +++ +P
Sbjct: 171 SPFKAKNPDEAHAFFLP-FSVVNVVHYAYK-PYMSQNDYRRDRLQRLVEDYIVVVADKYP 228
Query: 131 YWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSI 190
YWNR+ GADHF + HD+ + + + +L A + F + V + E +
Sbjct: 229 YWNRSNGADHFLLSCHDWAPEISHANPDLF-KNFIRVLCNANNSEGFQPKRDVSIPEVYL 287
Query: 191 TIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYA--RGARAAVWENFKN 248
++ PP Q R+I +F GG + R W++ N
Sbjct: 288 SVGKLGPPNLGQH-----PMNRTILAFF-----------SGGAHGDIRKLLLKHWKDKDN 331
Query: 249 NPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
Y E M + FCLCP G+ SPR+VEA+ C+PVII+++ LP +D
Sbjct: 332 QVQVHEYLPKGQNYTELMGLSKFCLCPSGYEVASPRVVEAINAVCVPVIISENYSLPLSD 391
Query: 309 AIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLAN-PSMKRAMLFPQPAQPGDAF 367
+ W + + ++ E++P + TIL ++ + + ++L N ++R + +PA+P D
Sbjct: 392 VLNWSQFSIQISVENIPDIKTILQNVTQK---KYKKLYRNVRRVRRHFVMHRPAKPFDLM 448
Query: 368 HQILNGL 374
H I++ +
Sbjct: 449 HMIIHSI 455
>gi|449461995|ref|XP_004148727.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
sativus]
gi|449501299|ref|XP_004161331.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
sativus]
Length = 664
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 98/351 (27%), Positives = 152/351 (43%), Gaps = 48/351 (13%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEI-FMHRFLLSSPVRTLNPEEADWFYTP-- 96
LK+YVY + KK + P L ++A+E FM + +P +A FY P
Sbjct: 335 LKIYVY----RDGKKPIFHQP-ILKGLYASEGWFMKLMEGNKRFVVKDPRKAHLFYMPFS 389
Query: 97 -------IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFG 149
+Y T L + ++ + I++ +PYWNRT GADHF HD+
Sbjct: 390 SRMLEYTLYVRNSHNRTNLR------QFLKEYAENIAAKYPYWNRTGGADHFLAGCHDWA 443
Query: 150 ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPD 209
Y+ +E I L A + F V L E + P + + P
Sbjct: 444 P---YETRHHMEHCIKALCN-ADVTVGFKIGRDVSLPETYVR-SARNPLRDLGG---KPA 495
Query: 210 TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP------TTYY 263
+ R I ++ G + Y R W++ NP I P Y
Sbjct: 496 SQRHILAFYAGNMHG---------YVRPILLKYWKD--KNPDMKIFGPMPPGVASKMNYI 544
Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
+ M+ + +C+CP G+ SPR+VEA+ + C+PVII+D+ V PF + + WE + VAE+D
Sbjct: 545 QHMKSSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEVLDWEAFSVIVAEKD 604
Query: 324 VPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+P L IL SIP + L Q L +++ L+ D FH L+ +
Sbjct: 605 IPNLQDILLSIPKDRYLEMQ--LRVRKVQKHFLWHAKPLKYDLFHMTLHSI 653
>gi|297818734|ref|XP_002877250.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323088|gb|EFH53509.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 422
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 86/309 (27%), Positives = 141/309 (45%), Gaps = 36/309 (11%)
Query: 79 SSPVRTLNPEEADWFYTPIYPTCDLT-----PTGLPLPFKSPRMMR---SAIQLISSNWP 130
S R PEEA F+ P + ++ P P F R+ R + +++ P
Sbjct: 126 SGRFRASRPEEAHAFFLP-FSVANIVHYVYQPITSPADFNRARLHRIFNDYVDVVARKHP 184
Query: 131 YWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSI 190
+WN++ GADHF V HD+ + + + + L T T G R ++ I
Sbjct: 185 FWNQSNGADHFMVSCHDWAPDVADSKPEFFKDFMRGLCNANT---TEGFRPNI-----DI 236
Query: 191 TIPPYAPPQKMQAHQIPPDTP--RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN 248
+IP P++ TP R+I +F G + GY R ++ ++K
Sbjct: 237 SIPEINIPKRKLKPPFMGQTPENRTILAFFAGRAH--------GY----IREVLFTHWKG 284
Query: 249 NPLFDISTDHPT---TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
DH T Y+E + + FCLCP G+ SPR VEA+ GC+PV+I+D+ LP
Sbjct: 285 KDKDVQVYDHLTKGQNYHELIGHSKFCLCPSGYEVASPREVEAIYSGCVPVVISDNYSLP 344
Query: 306 FADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
F D + W + + + + +P + IL IP + +R + + ++R + +PAQP D
Sbjct: 345 FNDVLDWSKFSVEIPVDKIPDIKKILQEIPHDKYIRMYQNVM--KVRRHFVVNRPAQPFD 402
Query: 366 AFHQILNGL 374
H IL+ +
Sbjct: 403 VIHMILHSV 411
>gi|357113818|ref|XP_003558698.1| PREDICTED: uncharacterized protein LOC100844507 [Brachypodium
distachyon]
Length = 781
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 98/381 (25%), Positives = 162/381 (42%), Gaps = 69/381 (18%)
Query: 17 NTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLL------------LKDPRCLT 64
N P + ++S S +V + +YVY+LP++++ LL + D + T
Sbjct: 316 NLPDLNKLSKSPINVNAVVKKKRPLIYVYDLPAEFDSHLLEGRHFKLQCVNRIYDDKNRT 375
Query: 65 ----HMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTC----------DLTPTGLPL 110
++ A++ ++ +L+SP RTLN +EAD+FY P+ +C LTP L L
Sbjct: 376 IWTEQLYGAQMALYESILASPHRTLNGDEADYFYVPVLDSCLITRSDDAPHLLTPEDLHL 435
Query: 111 -PFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFH----YQEEKAIERGIL 165
+ + R A IS + YWNRT G DH + D GAC+ + + G
Sbjct: 436 RSYHALEYYRKAYDHISQRYAYWNRTSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWGNT 495
Query: 166 PLLQRATLVQTFGQR----------NHVCLN-EGSITIPPYAPPQ------KMQAHQIPP 208
+ + NH C + + +P + P+ K+ A P
Sbjct: 496 NTKHENSTTAYWADNWDDIPLDRRGNHPCFDPRKDLVLPAWKVPEPGAIWLKLWAR---P 552
Query: 209 DTPRSIFVYFRGLF---YDVNNDPEGGYYARGARAAVWENFKNNPLFD------------ 253
R+ YF G Y+ PE Y+ G R + F + P +
Sbjct: 553 RINRTTLFYFNGNLGPAYE-QGRPEDT-YSMGIRQKLAAEFGSTPSKEGKLGRQHTANVT 610
Query: 254 ISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE 313
++ YYE++ ++FC G WS R+ ++++ GCIPVII D I LP+ + + +
Sbjct: 611 VTYLRSEKYYEELASSVFCGALPG-DGWSGRMEDSMLQGCIPVIIQDGIFLPYENVLNYN 669
Query: 314 EIGMFVAEEDVPKLDTILTSI 334
+ + E D+P L IL I
Sbjct: 670 SFAVRIQEHDIPNLIRILGGI 690
>gi|167517883|ref|XP_001743282.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163778381|gb|EDQ91996.1| predicted protein [Monosiga brevicollis MX1]
Length = 503
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 94/333 (28%), Positives = 143/333 (42%), Gaps = 46/333 (13%)
Query: 65 HMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRM--MRSAI 122
+ + E + + L SSP PEEA F P T ++ + S I
Sbjct: 196 YKYGVERRLPQVLASSPYAVQQPEEATHFLIPFQCTAHRYTVADRAGGQNAAEAGLASWI 255
Query: 123 QLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTF----- 177
IS+ +PYWNR+ GA+HF+V HD G+ Q +A ++ ++ L+ A F
Sbjct: 256 ASISAAYPYWNRSAGANHFYVCSHDMGSSAVAQLSRAAQQNLIGLVNTADRRDGFFNVHR 315
Query: 178 --------GQRNHVCLNEG---SITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVN 226
G CL G S+T +A TPR+ + G ++
Sbjct: 316 DLATAPHIGDGCDTCLQGGTRLSVTREAWA------------GTPRNRLAFMAG---NLQ 360
Query: 227 NDPEGGYYARGARAAVWENFKNNPLFDI--STDHPTTYYEDMQRAIFCLCPLGWAPWSPR 284
P R + + F +P F + T Y + + + FCL G+ W+PR
Sbjct: 361 RGP--------VRPRLRQFFDGDPDFLLVDGTLAAAHYRQALAESEFCLVVRGFRVWTPR 412
Query: 285 LVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQR 344
L++AV GCIPVIIAD LPF+ + W +FV E DVP+L IL + + L +
Sbjct: 413 LMDAVWSGCIPVIIADGYELPFSSLLHWPSFAVFVPEHDVPRLKDILLAKLSQAPLLRAN 472
Query: 345 LLANPSMKRAMLFPQPAQPGDAFHQILNGLARK 377
LLA + + + P DAF ++ LA +
Sbjct: 473 LLA---ASQYLTYHSNWVPLDAFDILMLQLAAR 502
>gi|115438799|ref|NP_001043679.1| Os01g0640600 [Oryza sativa Japonica Group]
gi|55297176|dbj|BAD68851.1| pectin-glucuronyltransferase-like [Oryza sativa Japonica Group]
gi|113533210|dbj|BAF05593.1| Os01g0640600 [Oryza sativa Japonica Group]
Length = 501
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 98/350 (28%), Positives = 155/350 (44%), Gaps = 39/350 (11%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAE-IFMHRFLLSSPVRTLNPEEADWFYTPIY 98
LKVY+Y+ ++ + P L+ ++A+E FM S +P +A FY P Y
Sbjct: 164 LKVYIYQ----DGRRPIFHTP-PLSGIYASEGWFMKLLKESRRFAVTDPAKAHLFYLP-Y 217
Query: 99 PTCDLTPTGLPLPFKS-----PRMMRSAIQLISSNWPYWNRTEGADHFFVVPHD-FGACF 152
+ L L +P +R ++ +++ +P+WNRT GADHF V HD +
Sbjct: 218 SSQQLR-ISLYVPDSHNLRPLAAYLRDFVKGLAAKYPFWNRTRGADHFLVACHDWLQGSY 276
Query: 153 HYQEEKAIERGILPLLQRATLVQ-TFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTP 211
+ R + L A + F V L E +I P + ++ P +
Sbjct: 277 TTTAHGDLRRNTVKALCNADSSEGIFTPGRDVSLPETTIR----TPRRPLRYVGGLPVSR 332
Query: 212 RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPL-----FDISTDHPTTYYEDM 266
R I +F G + R W + +++ + +Y + M
Sbjct: 333 RGILAFFAGNVHG---------RVRPVLLKHWGDGRDDDMRVYGPLPARVSRRMSYIQHM 383
Query: 267 QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK 326
+ + FCLCP+G+ SPR+VEA+ + C+PVIIAD+ VLP +D + W + VAE+DVP
Sbjct: 384 KNSRFCLCPMGYEVNSPRIVEALYYECVPVIIADNFVLPLSDVLDWSAFAVVVAEKDVPD 443
Query: 327 LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP--GDAFHQILNGL 374
L IL I LRK + + F A+P D FH IL+ +
Sbjct: 444 LKKILQGI----TLRKYVAMHGCVKRLQRHFLWHARPLRYDLFHMILHSI 489
>gi|449442160|ref|XP_004138850.1| PREDICTED: probable glycosyltransferase At5g25310-like [Cucumis
sativus]
gi|449530249|ref|XP_004172108.1| PREDICTED: probable glycosyltransferase At5g25310-like [Cucumis
sativus]
Length = 465
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 86/310 (27%), Positives = 145/310 (46%), Gaps = 47/310 (15%)
Query: 83 RTLNPEEADWFYTPI-------YPTCDLTPTGLPLPFKSPRMMRS-AIQLISSNWPYWNR 134
RT++P A + P Y D + PL RM S ++++S +P+WN+
Sbjct: 175 RTVDPSRAHVLFMPFSVAWMVKYLYKDGSYDQTPL-----RMFVSDYVEVVSKKYPFWNK 229
Query: 135 TEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNE-----GS 189
T GADHF + HD+G + + + + +L A + F + V L E G
Sbjct: 230 TNGADHFILACHDWGPIA-TEGNRFLYNTSIRVLCNANSSEGFNPQKDVSLPEIHLYDGE 288
Query: 190 ITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN 249
I+ P+ + A P F GG + R + ++KN
Sbjct: 289 IS------PKLLSASNSHHHRPHLAFF-------------AGGLHG-PIRPILLNHWKNR 328
Query: 250 PLFDISTDH----PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
+I YY++M ++ FCLCP G+ SPR+VEA+ C+PVII++ VLP
Sbjct: 329 THTNIHVYEYLPKQLDYYDEMLQSRFCLCPSGYEVASPRIVEAIYAECVPVIISERYVLP 388
Query: 306 FADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLAN-PSMKRAMLFPQPAQPG 364
F+D + WE + V+ ++P+L+ IL + E R ++L+ ++++ + +PA+
Sbjct: 389 FSDVLRWEGFSIEVSVSEIPRLEEILMGVSEE---RYEKLIQGLRTVRKHFVLNRPAKRF 445
Query: 365 DAFHQILNGL 374
DAFH IL+ +
Sbjct: 446 DAFHMILHSV 455
>gi|296090519|emb|CBI40850.3| unnamed protein product [Vitis vinifera]
Length = 416
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 96/350 (27%), Positives = 154/350 (44%), Gaps = 50/350 (14%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
KVY+Y P + + PR LT +A+E + + + S RT +P++A F+ PI
Sbjct: 94 NFKVYIY--PDG-DPNTFYQTPRKLTGKYASEGYFFQNIRDSRFRTNDPDQAHLFFIPI- 149
Query: 99 PTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
+C ++++ + + S +PYWNRT GADHFFV HD G +
Sbjct: 150 -SCHKMRGKGTSYENMTVIVQNYVGSLISKYPYWNRTLGADHFFVTCHDVGV-------R 201
Query: 159 AIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY---APPQKMQAHQIPPDTPRSIF 215
A E G+ L++ + V VC + P+ A PQ +Q +P
Sbjct: 202 ATE-GVPFLVKNSIRV--------VCSPSYDVGFIPHKDVALPQVLQPFALPAGGN---- 248
Query: 216 VYFRGLFYDVNNDPEGGYYA-------RGARAAVWENFKNNPLFDISTDHPT---TYYED 265
D+ N G++A R A VWEN + + + Y +
Sbjct: 249 --------DIENRTTLGFWAGHRNSKIRVILARVWENDTELDIMNNRINRAAGELLYQKR 300
Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
R FC+CP G S R+ +++ +GC+PVI+++ LPF D + W + + + E DV
Sbjct: 301 FYRTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFSVVLKELDVY 360
Query: 326 KLDTILTSIP-PEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+L IL IP E + L+ +++ + P DAFH I+ L
Sbjct: 361 QLKQILKDIPDAEFVALHNNLV---QVQKHFQWNSPPIRYDAFHMIMYEL 407
>gi|443705810|gb|ELU02170.1| hypothetical protein CAPTEDRAFT_209273 [Capitella teleta]
Length = 473
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 89/339 (26%), Positives = 142/339 (41%), Gaps = 34/339 (10%)
Query: 16 DNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHR 75
DN+ I LE K+ +Y EL K N+ + K + E +
Sbjct: 158 DNSEAIAHKDNFGAPYLEMMSSLKIFMYTSELDDKVNRGVHWK--------YGVESLFIK 209
Query: 76 FLLSSPVRTLNPEEADWFYTPIYPTC--DLTPTGLPLPFKSPRMMRSAIQLISSNWPYWN 133
L S T + EEA +F+ P ++ + ++ + ++ ISS + YW+
Sbjct: 210 LLSKSSFVTKDAEEAHFFFLPFQCATYRNVIRDRAAAQNFTENLVSNILKDISSRYTYWD 269
Query: 134 RTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIP 193
R+ GADHF+V HD GA + +++ + L+ A F + I +P
Sbjct: 270 RSLGADHFYVCAHDMGASSVAAADANLQKNAIALVNTADYADPF------YVPHKDIALP 323
Query: 194 PYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYA----RGARAAVWENFKNN 249
P+ K I +S P +YA G V++++ N+
Sbjct: 324 PHPAHGKGSLPDIGRGGGKS------------TERPNLAFYAGNLDSGQLRPVFKDWLND 371
Query: 250 PLFDISTDH--PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFA 307
I H Y +++Q A FCL P G WSP +++AV GC+PVII+D LP
Sbjct: 372 SDIHIHHGHMSDNVYIKNLQSAKFCLVPRGHRVWSPVVMDAVWTGCVPVIISDYYDLPLH 431
Query: 308 DAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLL 346
I W +F+ E++V L + L SIP E + R Q +
Sbjct: 432 GLIDWTHFAVFLKEKEVLSLKSKLKSIPEEKLRRMQSYI 470
>gi|222618931|gb|EEE55063.1| hypothetical protein OsJ_02778 [Oryza sativa Japonica Group]
Length = 482
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 98/344 (28%), Positives = 155/344 (45%), Gaps = 46/344 (13%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAE-IFMHRFLLSSPVRTLNPEEADWFYTPIY 98
LKVY+Y+ ++ + P L+ ++A+E FM S +P +A FY P Y
Sbjct: 164 LKVYIYQ----DGRRPIFHTP-PLSGIYASEGWFMKLLKESRRFAVTDPAKAHLFYLP-Y 217
Query: 99 PTCDLTPTGLPLPFKS-----PRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFH 153
+ L L +P +R ++ +++ +P+WNRT GADHF V HD+G+ +
Sbjct: 218 SSQQLR-ISLYVPDSHNLRPLAAYLRDFVKGLAAKYPFWNRTRGADHFLVACHDWGS-YT 275
Query: 154 YQEEKAIERGILPLLQRATLVQ-TFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPR 212
+ R + L A + F V L E +I P + ++ P + R
Sbjct: 276 TTAHGDLRRNTVKALCNADSSEGIFTPGRDVSLPETTIR----TPRRPLRYVGGLPVSRR 331
Query: 213 SIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFC 272
I +F G + R W + +++ + M+ + FC
Sbjct: 332 GILAFFAGNVHG---------RVRPVLLKHWGDGRDDDM------------RHMKNSRFC 370
Query: 273 LCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILT 332
LCP+G+ SPR+VEA+ + C+PVIIAD+ VLP +D + W + VAE+DVP L IL
Sbjct: 371 LCPMGYEVNSPRIVEALYYECVPVIIADNFVLPLSDVLDWSAFAVVVAEKDVPDLKKILQ 430
Query: 333 SIPPEVILRKQRLLANPSMKRAMLFPQPAQP--GDAFHQILNGL 374
I LRK + + F A+P D FH IL+ +
Sbjct: 431 GI----TLRKYVAMHGCVKRLQRHFLWHARPLRYDLFHMILHSI 470
>gi|359495862|ref|XP_002266299.2| PREDICTED: probable glycosyltransferase At5g03795 [Vitis vinifera]
Length = 594
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 95/350 (27%), Positives = 153/350 (43%), Gaps = 50/350 (14%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
KVY+Y + + PR LT +A+E + + + S RT +P++A F+ PI
Sbjct: 272 NFKVYIYP---DGDPNTFYQTPRKLTGKYASEGYFFQNIRDSRFRTNDPDQAHLFFIPI- 327
Query: 99 PTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
+C ++++ + + S +PYWNRT GADHFFV HD G +
Sbjct: 328 -SCHKMRGKGTSYENMTVIVQNYVGSLISKYPYWNRTLGADHFFVTCHDVGV-------R 379
Query: 159 AIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY---APPQKMQAHQIPPDTPRSIF 215
A E G+ L++ + V VC + P+ A PQ +Q +P
Sbjct: 380 ATE-GVPFLVKNSIRV--------VCSPSYDVGFIPHKDVALPQVLQPFALPAGGN---- 426
Query: 216 VYFRGLFYDVNNDPEGGYYA-------RGARAAVWENFKNNPLFDISTDHPTT---YYED 265
D+ N G++A R A VWEN + + + Y +
Sbjct: 427 --------DIENRTTLGFWAGHRNSKIRVILARVWENDTELDIMNNRINRAAGELLYQKR 478
Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
R FC+CP G S R+ +++ +GC+PVI+++ LPF D + W + + + E DV
Sbjct: 479 FYRTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFSVVLKELDVY 538
Query: 326 KLDTILTSIP-PEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+L IL IP E + L+ +++ + P DAFH I+ L
Sbjct: 539 QLKQILKDIPDAEFVALHNNLV---QVQKHFQWNSPPIRYDAFHMIMYEL 585
>gi|326511980|dbj|BAJ95971.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 784
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 96/355 (27%), Positives = 155/355 (43%), Gaps = 67/355 (18%)
Query: 42 VYVYELPSKYNKKLL------------LKDPRCLT----HMFAAEIFMHRFLLSSPVRTL 85
+YVY+LP++++ LL + D + T ++ A++ ++ +L+SP RTL
Sbjct: 341 IYVYDLPAEFDSHLLEGRHYKLECVNRIYDEKNRTIWTRQLYGAQMALYESILASPHRTL 400
Query: 86 NPEEADWFYTPIYPTCDLT----------PTGLPL-PFKSPRMMRSAIQLISSNWPYWNR 134
N +EAD+FY P+ +C +T P L L + + R A I+ +PYWNR
Sbjct: 401 NGDEADYFYVPVLDSCLITRSDDAPHLRMPEDLRLRSYHTLEYYRKAYDHIAQRYPYWNR 460
Query: 135 TEGADHFFVVPHDFGACFHYQE-------------EKAIERGILPLLQRATLVQTFGQR- 180
T G DH + D GAC+ +E E+ F +R
Sbjct: 461 TSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWGNTNTKHEKSTTAYWADNWDDIPFDRRG 520
Query: 181 NHVCLN-EGSITIPPYAPPQ------KMQAHQIPPDTPRSIFVYFRGLFYDVNND--PEG 231
NH C + + +P + P K+ A P R+ YF G + PE
Sbjct: 521 NHPCFDPRKDLVLPAWKEPNPGAIWLKLWAR---PKINRTTLFYFNGNLGPAYEEGRPED 577
Query: 232 GYYARGARAAVWENFKNNP-----LFDISTDHPTT-------YYEDMQRAIFCLCPLGWA 279
Y+ G R + F + P L T + T YYE++ +IFC G
Sbjct: 578 T-YSMGIRQKLAAEFGSTPNKQGKLGRQQTANVTVTYLKSEMYYEELASSIFCGVLPG-D 635
Query: 280 PWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSI 334
WS R+ ++++ GCIPVII D I LP+ + + + + + E+D+P L +L +
Sbjct: 636 GWSGRMEDSMLQGCIPVIIQDGIFLPYENVLNYNSFSVRIQEDDIPNLIKVLQGL 690
>gi|242094892|ref|XP_002437936.1| hypothetical protein SORBIDRAFT_10g005120 [Sorghum bicolor]
gi|241916159|gb|EER89303.1| hypothetical protein SORBIDRAFT_10g005120 [Sorghum bicolor]
Length = 412
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 90/307 (29%), Positives = 142/307 (46%), Gaps = 29/307 (9%)
Query: 83 RTLNPEEADWFYTPIYPTCDLTPTGLPLPF-KSP--RMMRSAIQLISSNWPYWNRTEGAD 139
RT +P A F+ P + + P + K+P ++ +++++S PYWNR+ GAD
Sbjct: 109 RTSDPARAHAFFLPFSVSKMVQFAYRPDTYDKTPLRAIVADYVRVVASRHPYWNRSAGAD 168
Query: 140 HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQ 199
HF + HD+G + GI L A + F V + E I + P+
Sbjct: 169 HFMLSCHDWGPEASRGHPELHANGIRALCN-ANTSEGFRPGQDVSVPE--INLYTGDMPR 225
Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN--PLFDISTD 257
++ A PP R +F G + + R W+ + P+++ +
Sbjct: 226 QLLAPPAPPLASRPFLAFFAGGRHG---------HVRDLLLRRWKGHDPDVFPVYEHEHE 276
Query: 258 H----------PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFA 307
H P YY M+RA FCLCP G+ SPR+VEA+ C+PVI++D LPFA
Sbjct: 277 HSHGRRQQDGAPLDYYWYMRRARFCLCPSGYEVASPRVVEAIHAECVPVILSDGYALPFA 336
Query: 308 DAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAF 367
D + WE + V D+P+L +L IP + R QR + +KR + QP + D F
Sbjct: 337 DVLRWEAFSVAVPVADIPRLREVLERIPAPEVERLQRGVR--LVKRHFMLHQPPERLDMF 394
Query: 368 HQILNGL 374
+ IL+ +
Sbjct: 395 NMILHSV 401
>gi|449446746|ref|XP_004141132.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
sativus]
Length = 466
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 89/318 (27%), Positives = 145/318 (45%), Gaps = 43/318 (13%)
Query: 67 FAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTC-----DLTPTGLPLPFKSPRMMRSA 121
+A+E + + L+ S T +P+EAD+F P T ++ +G+P +R
Sbjct: 169 YASESYFKKSLIKSHFITNDPKEADFFSLPFSITGLRNDRRVSVSGIP------NFIRDY 222
Query: 122 IQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRN 181
I +S +PYWNRT GADHF+V H G + +A ++++VQ +
Sbjct: 223 IFDVSHKYPYWNRTGGADHFYVACHSVGRSAMDKSSEA----------KSSIVQVVCSSS 272
Query: 182 HVCLNEGSITIPPYAPPQKMQAHQIPPD---TPRSIFVYFRGLFYDVNNDPEGGYYARGA 238
+ G I+ A PQ + P + + R+ +F G N P R A
Sbjct: 273 YFL--TGYISHKDAALPQIWPRKEDPSNLASSKRTRLAFFAGAM----NSP-----TRQA 321
Query: 239 RAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 298
VW K++ +F S T Y +++ R+ FCL G+ + R+ +++ +GC+PVII
Sbjct: 322 LVQVWG--KDSEIFAYSGRLKTPYADELLRSKFCLHVKGFEVNTARVGDSIFYGCVPVII 379
Query: 299 ADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFP 358
A+ LPF D + W+ + V D+P+L IL I E R L + +K F
Sbjct: 380 ANYYDLPFGDILNWKSFSIVVTTSDIPRLKEILKGINDEEYAR----LQSNVLKVRKHFK 435
Query: 359 QPAQP--GDAFHQILNGL 374
+ P D FH ++ L
Sbjct: 436 WHSSPVDYDTFHMVMYQL 453
>gi|255567220|ref|XP_002524591.1| catalytic, putative [Ricinus communis]
gi|223536144|gb|EEF37799.1| catalytic, putative [Ricinus communis]
Length = 507
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 90/348 (25%), Positives = 149/348 (42%), Gaps = 42/348 (12%)
Query: 39 KLKVYVYEL--PSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP 96
+ K+YVYE P Y+ D C + + F+H RTL+P+EA ++ P
Sbjct: 176 QFKIYVYEEGGPPMYH------DGPCKSIYSSEGRFIHELEKGKLYRTLDPDEALVYFLP 229
Query: 97 IYPTCDLTPTGLPLPFKSPRMMRSA---IQLISSNWPYWNRTEGADHFFVVPHDFGACFH 153
+ +P ++ + R+ I +IS+ P+WNR+ GADHF + HD+G
Sbjct: 230 FSVVMMVEYLYVPDSHETNAIGRAIVDYIHVISNKHPFWNRSLGADHFMLSCHDWGPRAS 289
Query: 154 YQEEKAIERGILPLLQRATLVQTFGQRN-------HVCLNEGSITIPPYAPPQKMQAHQI 206
I +L A + F H+ E S + +P +
Sbjct: 290 SYVPHLFNSSIR-VLCNANTSEGFNPSKDASFPEIHLKTGEISGLLGGVSPSR------- 341
Query: 207 PPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM 266
RSI +F G + + R W+N + + +Y +
Sbjct: 342 -----RSILAFFAGRLHG---------HIRQILLEQWKNKDEDVQVYDQMPNGVSYESML 387
Query: 267 QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK 326
+ + FCLCP G+ SPR+VEA+ C+PV+I+D+ V PF+D + W+ + + D+PK
Sbjct: 388 KTSRFCLCPSGYEVASPRIVEAIYTECVPVLISDNYVPPFSDVLNWKAFSVQIQVRDIPK 447
Query: 327 LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+ IL I LR QR L ++R + P + D FH ++ +
Sbjct: 448 IKEILMGISQRQYLRMQRRLKQ--VQRHFVVNGPPKRFDMFHMTIHSI 493
>gi|297808545|ref|XP_002872156.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297317993|gb|EFH48415.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 334
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 77/261 (29%), Positives = 121/261 (46%), Gaps = 30/261 (11%)
Query: 122 IQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRN 181
I+L+S+N P+WNRT GADHF + HD+G Q + + + ++ A + F
Sbjct: 86 IRLVSTNHPFWNRTNGADHFMLACHDWGP-LTSQADNDLFNTSIRVMCNANSSEGFNPSK 144
Query: 182 HVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAA 241
V L E + P K++ + +PR P G++A G
Sbjct: 145 DVTLPEIKLYGGEVDP--KLRLSKTLSASPR----------------PYLGFFAGGVHGP 186
Query: 242 VWENFKNN--------PLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGC 293
V N+ P+++ H YY+ M+ + FC CP G+ SPR++EA+ C
Sbjct: 187 VRPILLNHWKQRDPDMPVYEYLPKH-LNYYDFMRSSKFCFCPSGYEVASPRVIEAIYSEC 245
Query: 294 IPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKR 353
IPVI++ + VLPF D + WE + V ++P+L IL SI E +R L ++R
Sbjct: 246 IPVILSVNFVLPFTDVLRWETFSVLVDVSEIPRLKEILMSISDEKYEWLKRNLR--YVRR 303
Query: 354 AMLFPQPAQPGDAFHQILNGL 374
P + DAFH L+ +
Sbjct: 304 HFELNDPPKRFDAFHLTLHSI 324
>gi|75223627|gb|ABA18110.1| exostosin family protein [Arabidopsis arenosa]
Length = 340
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 88/310 (28%), Positives = 142/310 (45%), Gaps = 38/310 (12%)
Query: 79 SSPVRTLNPEEADWFYTPIYPTCDLT-----PTGLPLPFKSPRMMR---SAIQLISSNWP 130
S R PEEA F+ P + ++ P P F R+ R + +++ P
Sbjct: 44 SGRFRASRPEEAHAFFLP-FSVANIVHYVYQPITSPADFNRARLHRIFNDYVDVVACKHP 102
Query: 131 YWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSI 190
+WN++ GADHF V HD+ + + + + L T T G R ++ + I
Sbjct: 103 FWNQSNGADHFMVSCHDWAPDVADSKPEFFKDFMRGLCNANT---TEGFRPNIDFSIPEI 159
Query: 191 TIPP--YAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN 248
IP PP Q P+ R+I +F G + GY R ++ ++K
Sbjct: 160 NIPKRKLKPPFMGQT----PEN-RTILAFFAGRAH--------GY----IREVLFTHWKG 202
Query: 249 NPLFDISTDHPT---TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
DH T Y+E + + FCLCP G+ SPR VEA+ GC+PV+I+D+ LP
Sbjct: 203 KDKDVQVYDHLTKGQNYHELIGHSKFCLCPSGYEVASPREVEAIYSGCVPVVISDNYSLP 262
Query: 306 FADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANP-SMKRAMLFPQPAQPG 364
F D + W + + + + +P + IL IP + + +R+ N ++R + +PAQP
Sbjct: 263 FKDVLDWSKFSVEIPVDKIPDIKKILQEIPHD---KYRRMYQNVMKVRRHFVVNRPAQPF 319
Query: 365 DAFHQILNGL 374
D H IL+ +
Sbjct: 320 DVIHMILHSV 329
>gi|159470095|ref|XP_001693195.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158277453|gb|EDP03221.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 659
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 103/390 (26%), Positives = 156/390 (40%), Gaps = 76/390 (19%)
Query: 42 VYVYELPSKYNKKLL---LKDPRCLTHMF--------------AAEIFMHRFLLSSPVRT 84
+YVY++ YN K+L L C TH F E +H +LL S RT
Sbjct: 297 IYVYDMEPLYNAKMLQYRLASAWC-THRFYTSGNGTAYSPWCYGVESGLHEYLLLSEHRT 355
Query: 85 LNPEEADWFYTPIYPTCDLTP------------TGLPLPFKSPRMMRSAIQLISSNWPYW 132
+PEEAD+FY P+Y +C + P G + M+ A I +N+P+W
Sbjct: 356 FDPEEADFFYVPVYISCLIWPVLNYADFPVFYSNGGTRVMHAVNMLSEARDWIDANYPFW 415
Query: 133 NRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITI 192
R G DH + PHD GAC+ ++V + + L
Sbjct: 416 KRRGGRDHIWTFPHDEGACW----------------APNSIVSS------IWLTHWGRMD 453
Query: 193 PPYAPPQKMQAHQIPPD--TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNP 250
P + A D +PR Y + DP+ Y + A W K N
Sbjct: 454 PDHTSKSSFDADNYTRDFVSPRQPKGYTHLIQGHGCYDPKKIYNM--SIANNWRQ-KYNV 510
Query: 251 LFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAI 310
L D Y + + R++FCL G WS R +AV+ GCIPV++ D + + F
Sbjct: 511 LVGDGQDVQGDYSDLLSRSLFCLVATG-DGWSARTEDAVLHGCIPVVVIDGVHMKFETLF 569
Query: 311 PWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLL------ANPSMKRAMLFPQPAQP- 363
+ + + E DV + TIL ++P E + Q L A+P ++ A + A+P
Sbjct: 570 DVDSFSIRIPEADVANILTILKALPEERVRAMQANLGQASNAADPLVREAAELSKSAEPR 629
Query: 364 -----------GDAFHQILNGLARKLPHDK 382
DAF I+ L ++P +
Sbjct: 630 FPRPFRGDPAVDDAFATIIQWLHSRIPDTR 659
>gi|168029668|ref|XP_001767347.1| predicted protein [Physcomitrella patens subsp. patens]
gi|168029742|ref|XP_001767384.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162681411|gb|EDQ67838.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162681448|gb|EDQ67875.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 333
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 92/348 (26%), Positives = 153/348 (43%), Gaps = 42/348 (12%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAE-IFMHRFLLSSPVRTLNPEEADWFYTPI 97
+ KVYVY S+ + L+ P ++A E F+ +P T +P+ A ++ P
Sbjct: 4 RFKVYVY---SEGEEPLVHNGP--CKEIYAVEGRFIQELQGDNPFVTHDPDNAHVYFLPF 58
Query: 98 Y-----------PTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPH 146
+ D+ P L + + ++ +P+WNR+ GADHF + H
Sbjct: 59 SVAMMVAYLYEKESGDMDPLRL--------FVGDYVDVLMHKYPFWNRSGGADHFMLSCH 110
Query: 147 DFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQI 206
D+G + R I +L A + + V L E I + P ++
Sbjct: 111 DWGPLITRENMNLGTRSIR-VLCNANSSEGYVPWKDVSLPE--IHLVGGHIPAELGG--- 164
Query: 207 PPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM 266
PP R +F G ++ P R WE ++ + Y+E M
Sbjct: 165 PPAKDRPHLAFFAGR----DHGP-----VRPQLFKHWEGKDDDVIVYQWLPAHLKYHELM 215
Query: 267 QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK 326
+ + +C+CP G+ SPR+VEA+ C+PVIIAD +LPF+D + WE + V E D+P
Sbjct: 216 KTSRYCICPGGYEVNSPRIVEAIYNECVPVIIADSFILPFSDVLNWESFSLHVKESDIPN 275
Query: 327 LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
L +IL ++ E Q ++ ++R + QP + D FH IL+ +
Sbjct: 276 LKSILQNVTMETYTSMQERVS--QVQRHFVLHQPPKRYDVFHMILHSV 321
>gi|224142181|ref|XP_002324437.1| predicted protein [Populus trichocarpa]
gi|222865871|gb|EEF03002.1| predicted protein [Populus trichocarpa]
Length = 334
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 92/343 (26%), Positives = 152/343 (44%), Gaps = 32/343 (9%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAE-IFMHRFLLSSPVRTLNPEEADWFYTPIY 98
LKVYVY K +K + + + ++A+E FM + +P +A FY P
Sbjct: 5 LKVYVY----KEGEKPIFHQSK-MRGIYASEGWFMKLIEGNKKFVVRDPRKAHLFYLPFS 59
Query: 99 PTCDLTPTGLPLPFKSPRM---MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQ 155
P T + +++ + L++ + +WNRT G DHF V HD+ + Q
Sbjct: 60 PHMLRTALFDHNSLNQKELAEFLKNYVDLVAKKYSFWNRTGGTDHFLVGCHDWAS----Q 115
Query: 156 EEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIF 215
+ R + +L + + + F L I P K + P + P I
Sbjct: 116 MTRHHMRNCIRVLCNSNVAKGFKIGKDTTL--PVTYIRSVENPLKELGGKSPSERP--IL 171
Query: 216 VYFRGLFYDVNNDPEGGYYARGARAAVWENFK--NNPLFDISTD--HPTTYYEDMQRAIF 271
+F G + Y R WEN + L +S D Y E M+R+ +
Sbjct: 172 AFFAGNMHG---------YLRPILLEYWENKEPDMKILGPMSRDIAGKRRYREYMKRSKY 222
Query: 272 CLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTIL 331
C+C G+ +PR+VE++ + C+PVII+D+ V P + + WE +F+ E+D+P L IL
Sbjct: 223 CICARGYEVHTPRVVESIFYECVPVIISDNYVPPLFEVLNWEAFSVFIQEKDIPNLRNIL 282
Query: 332 TSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
SIP E + Q L +++ L+ + D FH IL+ +
Sbjct: 283 LSIPQEKYVAMQ--LGVKKVQQHFLWHKKPVKYDLFHMILHSV 323
>gi|302837496|ref|XP_002950307.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300264312|gb|EFJ48508.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 368
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 100/374 (26%), Positives = 159/374 (42%), Gaps = 43/374 (11%)
Query: 39 KLKVYVYELPSKYNKKLLL-KDPRCLTHMFAA-EIFMHRFLLSSPVRTLNPEEADWFYTP 96
+ K+Y+YELP + L +D +++ A E FM FL+ VRT NP EA+ FY P
Sbjct: 10 EFKIYMYELPWEIAFPYELGEDVHTRDNIYTAYEEFMKYFLVDDMVRTQNPYEANLFYVP 69
Query: 97 IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQ- 155
LT + I+ I + WP++NRT G DHF D +C H+Q
Sbjct: 70 A-----LTYFYATNVRNGQWQAEAVIEYIRTKWPFYNRTGGRDHFVFFTGDRASC-HFQR 123
Query: 156 --EEKAIERGILPLLQRATLVQTFGQRNHVCL-NEGSITIPPYA------------PPQK 200
++ I+ + R R++ C+ N+ + +PP P K
Sbjct: 124 WIQDSVIKVVHFGMQHRNLTWNEISNRDYACIQNKRDLVVPPRTVNLGPLLPSFSTPYYK 183
Query: 201 MQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLF--DIS--- 255
D R++ +F G D G Y+ G R A+ + + D+
Sbjct: 184 WLVSNQGYDGNRTLLFFFAGGVAD-------GEYSGGVRLAIKQMLSSITHLPADVKFVE 236
Query: 256 ---TDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 312
Y+ ++ + FC+ P G W RLV+AV GC+PVII D + F D +P+
Sbjct: 237 GRVGGGEDEYFAMIRASKFCIAPYG-HGWGNRLVQAVHLGCVPVIIQDYVYQAFEDFLPY 295
Query: 313 EEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILN 372
E+ + + DVP + +L S + R + LA RA ++ + G A+ L
Sbjct: 296 EDFSVRMRLADVPHMIDLLRSYSEADLARLRLGLAR--YYRAFIWEREYD-GLAYEWTLA 352
Query: 373 GLARKLPHDKSTYL 386
GL R+ + + Y
Sbjct: 353 GLQRRAYNLAAEYF 366
>gi|357130589|ref|XP_003566930.1| PREDICTED: probable glycosyltransferase At5g03795-like
[Brachypodium distachyon]
Length = 510
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 76/271 (28%), Positives = 126/271 (46%), Gaps = 31/271 (11%)
Query: 118 MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQ------EEKAIERGILPLLQRA 171
+R +Q +++ +P+WNR GADHF V HD+ ++ Q + + R + L A
Sbjct: 245 LRDFVQGLAAKYPFWNRNRGADHFLVACHDWRKFYYLQGPYTTTAHRDLRRNSIKALCNA 304
Query: 172 -TLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPE 230
+ + F V L E +I P + ++ P + R I +F G +
Sbjct: 305 DSSERIFSPGKDVSLPETTIR----TPKRPLRYVGGLPVSRRRILAFFAGNVHG------ 354
Query: 231 GGYYARGARAAVWENFKNNPL-----FDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRL 285
R W + +++ + +Y + M+ + FCLCP+G SPR+
Sbjct: 355 ---RVRPVLLKHWGDGRDDDMRVYGPLPNRVSRQMSYIQHMKNSRFCLCPMGHEVNSPRI 411
Query: 286 VEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRL 345
VEA+ + C+PV+IAD+ VLPF+D + W + VAE+D+P L IL I LR+
Sbjct: 412 VEALYYECVPVVIADNFVLPFSDVLDWTAFSVVVAEKDIPDLKKILQGIS----LRRYVA 467
Query: 346 LANPSMKRAMLFPQPAQP--GDAFHQILNGL 374
+ + + F A+P D FH IL+ +
Sbjct: 468 MHDCVKRLQRHFLWHARPLRYDLFHMILHSI 498
>gi|443732873|gb|ELU17436.1| hypothetical protein CAPTEDRAFT_228334 [Capitella teleta]
Length = 506
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 91/324 (28%), Positives = 147/324 (45%), Gaps = 42/324 (12%)
Query: 70 EIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPR----MMRSAIQLI 125
E+F+H L SSP T +P EA +F+ P C + ++ R M+ + I
Sbjct: 202 ELFIH-LLKSSPYITQDPSEATFFFLPF--RCFAYRKTISDRDRAQRFTEEMVSKILYEI 258
Query: 126 SSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQR--ATLVQTFGQRNHV 183
SN+ +W+RT GADHF+V HDFG AI G P L + +V T +
Sbjct: 259 KSNYSFWDRTLGADHFYVCAHDFGP--------AIVAGSDPFLHKNAIAMVNTADYEHIY 310
Query: 184 CLNEGSITIPPYAPPQKMQ-------AHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYAR 236
+ I++PP+ K H + P + R++ ++ G N D R
Sbjct: 311 YVPHKDISLPPHPSHGKNSLANIGKGGHGLNP-SDRTVLAFYAG-----NLD-------R 357
Query: 237 G-ARAAVWENFKNNPLFDISTDHPT--TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGC 293
G R ++ + + + F I H T Y ++ + FCL G WSP L++A+ FGC
Sbjct: 358 GRIRPSIKDFWSTDIDFRIFMGHLTDERYQHYLKTSKFCLILRGNEAWSPCLMDAIWFGC 417
Query: 294 IPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKR 353
+PVII+D LP + W + + + E V L IL ++ P+ + Q L +
Sbjct: 418 VPVIISDYYDLPLHGMLDWNQFAVVIRESKVKSLKEILLAVSPQKLTSMQEKLKQ--VYG 475
Query: 354 AMLFPQPAQPGDAFHQILNGLARK 377
++ P +P DAF ++ L ++
Sbjct: 476 HFVWNDPPRPYDAFQSVMYQLWKR 499
>gi|75214755|gb|ABA18105.1| exostosin family protein [Capsella rubella]
Length = 423
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 85/308 (27%), Positives = 140/308 (45%), Gaps = 34/308 (11%)
Query: 79 SSPVRTLNPEEADWFYTPIYPTCDLT-----PTGLPLPFKSPRMMR---SAIQLISSNWP 130
SS R + PEEA F+ P + ++ P P F R+ R + +++ +P
Sbjct: 127 SSRFRAVRPEEAHVFFLP-FSVANIVHYVYKPIASPADFNRARLHRIFNDYVDVVARKYP 185
Query: 131 YWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSI 190
+W ++ GADHF V HD+ + + + L A + F + E +I
Sbjct: 186 FWKQSNGADHFMVSCHDWAPDVP-DSKPEFFKDFMRGLCNANTSEGFKPSIDFSIPEINI 244
Query: 191 TIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNP 250
PP Q P+ R+I +F G + GY R ++ ++K
Sbjct: 245 PKGKLKPPFMGQN----PEN-RTILAFFAGRAH--------GY----IREVLFTHWKGKD 287
Query: 251 LFDISTDHPT---TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFA 307
DH T Y+E + FCLCP G+ SPR VEA+ GC+PV+I+D+ LPF+
Sbjct: 288 KDVQVYDHLTKGQNYHELTGHSKFCLCPSGYEVASPREVEAIYSGCVPVVISDNYSLPFS 347
Query: 308 DAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRK-QRLLANPSMKRAMLFPQPAQPGDA 366
D + W + + + + +P + IL IP + +R Q +L +++ + +PAQP D
Sbjct: 348 DVLDWSKFSVEIPVDRIPDIKNILQEIPHDKYIRMYQNVL---KVRKHFVVNRPAQPFDV 404
Query: 367 FHQILNGL 374
H IL+ +
Sbjct: 405 IHMILHSV 412
>gi|356510025|ref|XP_003523741.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 619
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 99/346 (28%), Positives = 150/346 (43%), Gaps = 39/346 (11%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEI-FMHRFLLSSPVRTLNPEEADWFYTPIY 98
LKVY+Y+ + + +P L ++A+E FM + T +P +A FY P
Sbjct: 291 LKVYIYQ----DGDRPIFHEP-LLDGIYASEGWFMKLMEANKQFVTRDPGKAHLFYIPF- 344
Query: 99 PTCDLTPTGLPLPFKSPR-----MMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFH 153
+ L L + R M++ + +I+ +P+WNRT GADHF V HD+
Sbjct: 345 -SSRLLQQTLYVRNSHRRSNLIEYMKNYVDMIAGKYPFWNRTSGADHFVVACHDWAPA-- 401
Query: 154 YQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRS 213
E + + L A + F V L E I P + + P + R
Sbjct: 402 --ETRGRMLSCIRALCNADIEVGFKIGKDVSLPETYIR-SSENPVKNIGG---DPPSKRP 455
Query: 214 IFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP-----TTYYEDMQR 268
I +F G + Y R WEN P IS P Y + M+
Sbjct: 456 ILAFFAGGLHG---------YVRPILLKHWEN--KEPDMKISGPLPHVRGNVNYIQLMKS 504
Query: 269 AIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLD 328
+ FC+C G SPR+VEA+ CIPVII+D+ + PF + + WE +FV EE++P L
Sbjct: 505 SKFCICARGHEVNSPRVVEAIFHECIPVIISDNFIPPFFEILNWESFAVFVKEEEIPNLR 564
Query: 329 TILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
IL SI E L + A + + +P + D FH +L+ +
Sbjct: 565 NILLSISEERYLEMHK-RAKKVQEHFLWHAEPVK-YDLFHMLLHSI 608
>gi|75214619|gb|ABA18091.1| exostosin family protein [Olimarabidopsis pumila]
Length = 343
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 87/309 (28%), Positives = 139/309 (44%), Gaps = 36/309 (11%)
Query: 79 SSPVRTLNPEEADWFYTPIYPTCDLT-----PTGLPLPFKSPRMMR---SAIQLISSNWP 130
S R PEEA F+ P + ++ P P F R+ R +++++ P
Sbjct: 45 SGRFRAGRPEEAHAFFLP-FSVANIVHYVSQPIASPADFNRARLHRIFNDYVEVVARKHP 103
Query: 131 YWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSI 190
+WN++ GADHF V HD+ + + + I L T + G R + + I
Sbjct: 104 FWNQSNGADHFMVSCHDWAPDVAGSKPEFFKDFIRGLCNANT---SEGFRPSIDFSIPEI 160
Query: 191 TIPP--YAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN 248
IP PP Q P+ R+I +F G + GY R ++ ++K
Sbjct: 161 NIPKGKLKPPFMGQT----PEN-RTILAFFAGRAH--------GY----IREVLFTHWKG 203
Query: 249 NPLFDISTDHPT---TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
DH T Y+E + + FCLCP G+ SPR VEA+ GC+PV+I+D+ LP
Sbjct: 204 KDKDVQVYDHLTKGQNYHELIGHSKFCLCPSGYEVASPREVEAIYSGCVPVVISDNYSLP 263
Query: 306 FADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
F D + W + + + + +P + IL IP E ++ + + R + +PAQP D
Sbjct: 264 FNDVLDWSKFSVEIPVDKIPDIKKILQEIPHEKYIKMYHNVM--KVGRHFVVNRPAQPFD 321
Query: 366 AFHQILNGL 374
H IL+ +
Sbjct: 322 VIHMILHSV 330
>gi|255546379|ref|XP_002514249.1| catalytic, putative [Ricinus communis]
gi|223546705|gb|EEF48203.1| catalytic, putative [Ricinus communis]
Length = 334
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 92/343 (26%), Positives = 156/343 (45%), Gaps = 32/343 (9%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAE-IFMHRFLLSSPVRTLNPEEADWFYTPIY 98
LKVYVY K +K + + + ++A+E FM + +P +A F+ P
Sbjct: 5 LKVYVY----KEGEKPIFHQSK-MRGIYASEGWFMKHIEGNKKFLVKDPRKAHLFFLPFS 59
Query: 99 PTCDLTPT-GLPLPFKS--PRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQ 155
P T G L + +++ + L++ + +WNRT G DHF V HD+ + +
Sbjct: 60 PQMLRTVIFGQKLQSQKDLEEYLKNYVDLVARKYSFWNRTGGTDHFLVGCHDWAS----R 115
Query: 156 EEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIF 215
+ + + +L A + + F L I P K + P + R
Sbjct: 116 ITRKYMQNCIRVLCNANVAKGFKIGKDTTL--PVTYIRSAENPLKDVGGKHPSE--RYTL 171
Query: 216 VYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN-PLFD---ISTDHPTTYYEDMQRAIF 271
+F G + Y R WEN +++ +F + Y E M+ + +
Sbjct: 172 AFFAGGMHG---------YLRPILVQFWENKESDMKIFGPMPRDIEGKRLYREYMKSSKY 222
Query: 272 CLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTIL 331
C+C G+ +PR+VEA+++ C+PVII+D+ V PF + + WE +FV E+D+P L +IL
Sbjct: 223 CICARGYEVHTPRIVEAILYECVPVIISDNYVPPFFEVLNWEAFSVFVQEKDIPNLRSIL 282
Query: 332 TSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
SIP E L Q L +++ L+ + D FH IL+ +
Sbjct: 283 LSIPEEKYLEMQ--LRVKMVQQHFLWHKNPVKYDLFHMILHSV 323
>gi|302799042|ref|XP_002981280.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300150820|gb|EFJ17468.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 352
Score = 108 bits (269), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 94/336 (27%), Positives = 152/336 (45%), Gaps = 40/336 (11%)
Query: 53 KKLLLKDPRCLTHMFAAE-IFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLP 111
K L+ + P+ T ++A+E +F+ +P P+ A F+ P + +
Sbjct: 35 KPLMHEGPK--TGIYASEGLFIATMERGNPFAVTEPKIATMFFIP-FSLKQMVDYMYDTN 91
Query: 112 FKSPRMMRSAI----QLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPL 167
S + ++S I + ++S +PY N T G DHFFV HD+ E++ +R I+ +
Sbjct: 92 SHSMKNIQSYIAGYLRRLASKYPYMNATNGIDHFFVSCHDWA--LMALEKQDCQRNIVKV 149
Query: 168 LQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTP---RSIFVYFRGLFYD 224
+ A + F V L E + ++P I DT R +F G +
Sbjct: 150 VCNADSSRGFNTSRDVSLPETRVRQGKHSP--------IIRDTSGMDRPYLAFFAGQMHG 201
Query: 225 VNNDPEGGYYARGARAAVWENFKNNPLFDI------STDHPTTYYEDMQRAIFCLCPLGW 278
R A W++ +P I S +Y E M+ + +C+C G+
Sbjct: 202 K---------LRPVLLAHWKD--KDPEMKIYEVLPPSVAERISYSEHMRLSKYCICAAGF 250
Query: 279 APWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEV 338
SPRLVEA+V C+PVI+AD+ VLPF++ I W+ I + VAE+DV L IL IP
Sbjct: 251 EVNSPRLVEAIVNECVPVILADNFVLPFSEVINWDSISVTVAEKDVANLKAILAGIPLRT 310
Query: 339 ILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
Q L + +KR + + D F+ I++ L
Sbjct: 311 YKEMQARLKH--VKRHFEWKNSPEKYDIFNMIVHSL 344
>gi|356507416|ref|XP_003522463.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
At3g07620-like [Glycine max]
Length = 471
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 81/310 (26%), Positives = 135/310 (43%), Gaps = 40/310 (12%)
Query: 72 FMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPY 131
F+H RT +P+EA +Y P + + P GL +++ IQ+I+ P+
Sbjct: 183 FIHEMEKGRYYRTYDPDEAFVYYLP-FSGVYVDPLGL--------VVKDYIQVIAHKHPF 233
Query: 132 WNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNE---- 187
WNR+ G DHF + HD+G + I +L A + + F V E
Sbjct: 234 WNRSLGYDHFMLSCHDWGPLVSSYVDHFYNNAIR-VLCNANVSEGFKPAKDVSFPEIKLI 292
Query: 188 -GSIT--IPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWE 244
G +T + Y P Q R+I +F G + Y R + W+
Sbjct: 293 KGEVTNLVGGYPPSQ------------RTILAFFAGHQHG---------YIRXLLQSTWK 331
Query: 245 NFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL 304
N + +YY ++ + FCLCP G+ SPR+V+A+ C+PV+I+D V
Sbjct: 332 NKDQDMQIYEELPEGISYYTKLRSSKFCLCPSGYEVASPRVVKAIFAECVPVLISDGYVP 391
Query: 305 PFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPG 364
PF+D + W + V +D+P + IL I LR + + ++R + +P +
Sbjct: 392 PFSDVLNWNSFSVQVDVKDIPNIKKILMGISERQYLRMYKRVKQ--VQRHFVPNEPPKRY 449
Query: 365 DAFHQILNGL 374
D FH ++ +
Sbjct: 450 DMFHMTVHSI 459
>gi|357478279|ref|XP_003609425.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
gi|355510480|gb|AES91622.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
Length = 617
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 77/303 (25%), Positives = 135/303 (44%), Gaps = 26/303 (8%)
Query: 80 SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSP-------RMMRSAIQLISSNWPYW 132
SP + +P EA F+ P + P+ K RM+ + +++ +PYW
Sbjct: 320 SPFKAKHPNEAHIFFLPFSVANVVQYVYKPIMSKKDFNRDRLHRMVEDYVNVVAHKYPYW 379
Query: 133 NRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITI 192
NR+ GADHF + HD+ + + +L A + F + V + E + +
Sbjct: 380 NRSNGADHFLLSCHDWAPEIS-DANPNLFKNFTRVLCNANTSEGFQPKRDVSIPEVYLPV 438
Query: 193 PPYAPPQKMQAHQIPPDTPRSIFVYFRGLFY-DVNNDPEGGYYARGARAAVWENFKNNPL 251
PP Q+ P + R+I +F G + D+ + + A+ V E
Sbjct: 439 GKLGPPNLGQS---PLN--RTILAFFSGGAHGDIRKLLLNHWKNKDAQVQVHEYLPKG-- 491
Query: 252 FDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
Y E M + FCLCP G+ SPR+VEA+ GC+PVII+ + LPF D +
Sbjct: 492 --------QNYTELMGLSKFCLCPSGYEVASPRIVEAINAGCVPVIISQNYSLPFNDVLN 543
Query: 312 WEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQIL 371
W E + + E + ++ IL ++ + ++ + +++ + +PA+P D H IL
Sbjct: 544 WSEFSVEIPVEKIVEIKNILQNVTKDKYMKLHMNVM--KVQKHFVMNRPAKPFDVMHMIL 601
Query: 372 NGL 374
+ +
Sbjct: 602 HSI 604
>gi|356563757|ref|XP_003550126.1| PREDICTED: probable glycosyltransferase At5g03795 [Glycine max]
Length = 645
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 91/344 (26%), Positives = 153/344 (44%), Gaps = 35/344 (10%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEI-FMHRFLLSSPVRTLNPEEADWFYTPIY 98
LKVYVY + + ++ P T ++A+E FM + + T +P +A FY P
Sbjct: 317 LKVYVY----REGARPIMHSP-FFTGLYASEGWFMKQMEANKRFLTRDPNKAHLFYLPFS 371
Query: 99 PTCDLTPTGLPLPFKSPRMMR---SAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQ 155
+ +++ + +++I+ + +WNRT GADHF V HD+
Sbjct: 372 SRMLEETLYVQNSHNHKNLVQYLHNYVEMIAGKYTFWNRTGGADHFLVGCHDWAPG---- 427
Query: 156 EEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIF 215
E K + L A + + F L E + P + + + + R+
Sbjct: 428 ETKVDMANCIRSLCNADVKEGFVFGKDASLPETYVR-DAKIPTKDLSGNSA---SKRTTL 483
Query: 216 VYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTT-----YYEDMQRAI 270
+F G + Y R WEN +P I P + Y + M+ +
Sbjct: 484 AFFAGSMHG---------YVRPILLQHWEN--KDPDMKIFGRLPKSKGNRNYIQYMKSSK 532
Query: 271 FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTI 330
+C+C G+ SPR+VEA+ + C+PVII+D+ V PF + + WE + V E+D+P L I
Sbjct: 533 YCICAKGYEVNSPRVVEAIFYECVPVIISDNFVPPFLEVLNWESFAVIVLEKDIPNLKNI 592
Query: 331 LTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
L SIP + LR Q + +++ L+ + D FH IL+ +
Sbjct: 593 LLSIPEKQYLRLQMRVK--KVQQHFLWHKNPVKYDIFHMILHSV 634
>gi|359484345|ref|XP_002280595.2| PREDICTED: probable glycosyltransferase At5g03795-like [Vitis
vinifera]
gi|297738776|emb|CBI28021.3| unnamed protein product [Vitis vinifera]
Length = 665
Score = 107 bits (268), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 106/390 (27%), Positives = 163/390 (41%), Gaps = 53/390 (13%)
Query: 8 LLLAFVSADNTPKIERISGSAGDVLEDDPVGK---------LKVYVYELPSKYNKKLLLK 58
LL A N P I G V + + K LK+Y+Y + +K +
Sbjct: 295 LLSARSEIQNAPVIRNTPGLYASVYRNVSMFKRSYELMERVLKIYIY----REGEKPIFH 350
Query: 59 DPRCLTHMFAAEI-FMHRFLLSSPVRTLNPEEADWFYTPI---------YPTCDLTPTGL 108
PR L ++A+E FM + +P +A FY P Y TP L
Sbjct: 351 QPR-LRGIYASEGWFMKLIEGNKRFVVRDPRKAHLFYVPFSSKMLRTVFYEQNSSTPRDL 409
Query: 109 PLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLL 168
FK+ + LI+ + +WNRT GADH V HD+ Q + L
Sbjct: 410 EKYFKN------YVGLIAGKYRFWNRTGGADHLIVACHDWAPRITRQ----CSWNSIRAL 459
Query: 169 QRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNND 228
+ + F L I P K + P P I +F G +
Sbjct: 460 CNSNIASGFKIGKDTTL--PVTYIRKSEDPLKYLGGKPPSQRP--ILAFFAGSMHG---- 511
Query: 229 PEGGYYARGARAAVWENFKNN-PLFD-ISTDH--PTTYYEDMQRAIFCLCPLGWAPWSPR 284
Y R WEN + + +F +S D + Y + M+ + +C+C G+ +PR
Sbjct: 512 -----YLRPILLQYWENKEQDIKIFGPMSRDDGGKSRYRDHMKSSKYCICARGYEVHTPR 566
Query: 285 LVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQR 344
+VEA+ + C+PVII+D+ V PF + + WE +F+ E+DVP L IL SIP E L+ Q
Sbjct: 567 VVEAIFYECVPVIISDNYVPPFFEILNWEAFAVFILEKDVPNLRNILLSIPEEKYLQMQ- 625
Query: 345 LLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+ +++ L+ + D FH IL+ +
Sbjct: 626 -MRVKMVQQHFLWHKKPVKYDLFHMILHSV 654
>gi|302829362|ref|XP_002946248.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300269063|gb|EFJ53243.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 656
Score = 107 bits (268), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 103/388 (26%), Positives = 169/388 (43%), Gaps = 62/388 (15%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTH-MFAAEIFMHRFLLS--SPVRTLNPEEADWFYT 95
LK+Y+Y++P +D H + + + L+ S +RT NP EA+ FY
Sbjct: 290 SLKIYMYDIPPNIVGPHQFEDGNGGIHPQYESFLRFQGLFLNDVSGIRTENPHEANLFYI 349
Query: 96 PI---YPTCDLT-PTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGAC 151
P Y + +L PTG + A+ +++ +P++NRT G DHF ++ D GAC
Sbjct: 350 PAFTYYSSSNLGDPTGAAV---------RAVNWVAATFPFFNRTGGRDHFVLLSGDRGAC 400
Query: 152 FHYQEEKAIERGILPLLQRATLVQTFG--------------QRNHVCLNEG-SITIPPY- 195
+ LP + V FG + C G + +PPY
Sbjct: 401 YLK---------TLPQTENLIRVTHFGYERPNITDMGPLVTNTEYGCFKAGRDVVMPPYV 451
Query: 196 ----APPQKMQAHQIPPDTPRSIFVYFRGLFY---DV-NNDPEGGYYARGARAAVWENFK 247
A Q ++A P ++ L + D+ +N+PE Y+ G R A+
Sbjct: 452 KSNVAGIQGVRAKLEEPGGAEALLAGKDTLLFFSGDIRHNEPE---YSGGVRQALALLLA 508
Query: 248 NNPLFDI------STDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 301
N D+ Y ++R+ FCL P G W RL+ A+ CIPVII D
Sbjct: 509 NTSYPDVVFKGGYMMMGMGEYESLLRRSKFCLAPYGHG-WGIRLIHAITHACIPVIIQDK 567
Query: 302 IVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPA 361
+ PF D + + + + V++ ++P+L IL ++P +LR + N + RA L+ QP
Sbjct: 568 VRQPFEDILHYPDFSVRVSKAELPRLVEILRAVPEPDLLRM--IKENSRVYRAFLW-QPE 624
Query: 362 QPGDAFHQILNGLARKLPHDKSTYLKPG 389
G A++ + L R+L H + + G
Sbjct: 625 LGGLAYNITIASLRRRLSHVRGDLYEAG 652
>gi|147846684|emb|CAN80640.1| hypothetical protein VITISV_016911 [Vitis vinifera]
Length = 1363
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 95/368 (25%), Positives = 159/368 (43%), Gaps = 68/368 (18%)
Query: 28 AGDVLEDDPVGKLK---VYVYELPSKYNKKLL------------LKDPRCLTH----MFA 68
+G ++ D V K K +YVY+LP ++N LL + D R T+ ++
Sbjct: 351 SGSLVNLDAVVKKKRPLIYVYDLPPEFNSLLLEGRHFKFECVNRIYDDRNATYWTEQLYG 410
Query: 69 AEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLP--LPFKSPRMMRSAIQL-- 124
A++ ++ +L+SP RTL+ EEAD+F+ P+ +C + L + +RS++ L
Sbjct: 411 AQMAIYESILASPHRTLDGEEADFFFVPVLDSCIIVRADDAPHLNMHAHGGLRSSLTLEF 470
Query: 125 -------ISSNWPYWNRTEGADHFFVVPHDFGACFHYQE--------------EKAIERG 163
I +P+WNR+ G DH + D GAC+ +E K
Sbjct: 471 YKTAYDHIVEQYPFWNRSSGRDHIWFFSWDEGACYAPKEIWDSMMLVHWGNTNSKHNHST 530
Query: 164 ILPLLQRATLVQTFGQRNHVCLN-EGSITIPPYAPPQKMQAHQ---IPPDTPRSIFVYFR 219
V + + NH C + + +P + P + P R YF
Sbjct: 531 TAYWADNWDSVSSDRRGNHPCFDPYKDLVLPAWKRPDVVSLSSKLWSRPREQRKTLFYFN 590
Query: 220 GLFYDVNNDPEGGY----YARGARAAVWENFKNNPLFD--ISTDHPT----------TYY 263
G ++ EGG Y+ G R V E F ++P + + H Y+
Sbjct: 591 G---NLGPAYEGGRPETTYSMGIRQKVAEEFGSSPNKEGKLGKQHAEDVIVTPLRSGNYH 647
Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
E + ++FC G WS R ++++ GCIPV+I D I LPF + + +E + + E++
Sbjct: 648 ESLASSVFCGVMPG-DGWSGRFEDSILQGCIPVVIQDGIFLPFENMLNYESFAVRIREDE 706
Query: 324 VPKLDTIL 331
+P L IL
Sbjct: 707 IPNLIKIL 714
>gi|224092294|ref|XP_002309547.1| predicted protein [Populus trichocarpa]
gi|222855523|gb|EEE93070.1| predicted protein [Populus trichocarpa]
Length = 471
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 93/349 (26%), Positives = 151/349 (43%), Gaps = 43/349 (12%)
Query: 40 LKVYVY---ELPSKYNKKLLLKDPRCLTHMFAAE-IFMHRFLLSSPVRTLNPEEADWFYT 95
LKVY+Y E+P + P L ++A+E FM + T + ++A FY
Sbjct: 141 LKVYIYKEGEMP-------IFHQP-LLNGIYASEGWFMKLLEGNKKFVTKDSKKAHLFYL 192
Query: 96 PIYPTCDLTPTGLPLPFKSPRM---MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
P +P + ++ + +IS +P+WNRT+GADHF HD+
Sbjct: 193 PFSSRYLEIRLYVPNSHSHKNLIEYLKKYLDMISEKYPFWNRTQGADHFLAACHDWAPS- 251
Query: 153 HYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPR 212
E + + L + + F L E + + P + + ++ + R
Sbjct: 252 ---ETRQHMANCIRALCNSDAKEDFVYGKDASLPE-TYVLTQENPLRDLGGNRA---SKR 304
Query: 213 SIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPT-------TYYED 265
SI +F G + Y R WEN +P I P Y
Sbjct: 305 SILAFFAGSMHG---------YLRPILLQHWEN--KDPDMKIFGRLPKVKGRGKMNYARY 353
Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
M+ + +C+C G+ SPR+VEA+ + C+PVII+D+ V PF + + WE +FV E+D+P
Sbjct: 354 MKSSKYCICAKGYEVNSPRVVEAIFYECVPVIISDNFVPPFLEVLNWESFAVFVLEKDIP 413
Query: 326 KLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
L IL SIP + R Q + +++ L+ D FH IL+ +
Sbjct: 414 NLKKILLSIPAKKYRRMQMRVKR--VQQHFLWHARPVKYDVFHMILHSI 460
>gi|159479930|ref|XP_001698039.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158273838|gb|EDO99624.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 810
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 96/374 (25%), Positives = 150/374 (40%), Gaps = 73/374 (19%)
Query: 42 VYVYELPSKYNKKLLLK---DPRCL-------------THMFAAEIFMHRFLLSSPVRTL 85
+YVY++P Y ++L CL + + E MH +L S RT
Sbjct: 373 IYVYDVPPDYTSRMLQYRNFGDTCLWRRWHDGNFTGITGYTYGIETLMHELMLQSEHRTF 432
Query: 86 NPEEADWFYTPIYPTCDL-------------TPTGLPLPFKSPRMMRSAIQLISSNWPYW 132
+PEEAD+FY P+Y TC P GL + M+ + + PYW
Sbjct: 433 DPEEADFFYVPMYITCYFWPILGWADGPWWHAPNGLRV-MHGANMITELHDWLRTKLPYW 491
Query: 133 NRTEGADHFFVVPHDFGACF-------------HYQEEKAIERGILPLLQRATLVQ---- 175
+R G DH +++ D GAC+ H+ + LQ +
Sbjct: 492 DRRGGRDHIWLMAADEGACWMPKAVYDTSIVLTHWGRLDPEHKSNTAYLQDNYTAKPESA 551
Query: 176 -------TFGQR--NHVCLN-EGSITIPPYAPPQKMQAHQI--PPDTPRSIFVYFRGLFY 223
FG R H C + + +P + P + P R + ++FRG
Sbjct: 552 FDAWRGVDFGDRIKGHPCFDPRKDLVVPAFKSPNHFPRSPLIGAPPLERDLLLFFRG--- 608
Query: 224 DVNNDPEGGYYARGARAAVW-----ENFKNNPLFDISTDHP--TTYYEDMQRAIFCLCPL 276
DV +Y+RG R ++ ++ N I + Y E + R+ FCL
Sbjct: 609 DVGAS-RLPHYSRGIRQRLFHLAHKHDWYNRFKIAIGSGDSLKGDYSEQLARSKFCLVAP 667
Query: 277 GWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED--VPKLDTILTSI 334
G WSPR +A++ GCIPV++ D + F + W+ + + E+D + L +L SI
Sbjct: 668 G-DGWSPRAEDAILHGCIPVVVMDGVQAVFESILDWDSFSLRIREDDAALEALPQLLASI 726
Query: 335 PPEVILRKQRLLAN 348
PE + QR LA
Sbjct: 727 SPERLAHMQRHLAR 740
>gi|224087074|ref|XP_002308055.1| predicted protein [Populus trichocarpa]
gi|222854031|gb|EEE91578.1| predicted protein [Populus trichocarpa]
Length = 338
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 77/268 (28%), Positives = 123/268 (45%), Gaps = 29/268 (10%)
Query: 122 IQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRN 181
+++I+ + YWNRT GADHF + HD+G + + I L A + F +
Sbjct: 88 VRVIADKYTYWNRTNGADHFSISCHDWGPDISRTNPELFKYFIRALCN-ANTSEGFQPQR 146
Query: 182 HVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYF--------RGLFYDVNNDPEGGY 233
V + E + + P++ P + R I +F R + D +G
Sbjct: 147 DVSVPEIFLHVGKLGLPREGAQ----PPSKRPILAFFAGGAHGRIRKVLLKRWKDKDGEI 202
Query: 234 YARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGC 293
V + KNN L Y++ M ++ FCLCP G SPR+V A+ GC
Sbjct: 203 ---QVHEYVTQRKKNNNL----------YFKLMGQSKFCLCPSGHEVASPRVVTAIQLGC 249
Query: 294 IPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKR 353
+PVII+D+ LPF+D + W + + + E + ++ TIL I + L QR + +R
Sbjct: 250 VPVIISDNYSLPFSDVLDWSKFSVNIPSEKIQEIKTILKGISHKRYLTMQRRVIQA--QR 307
Query: 354 AMLFPQPAQPGDAFHQILNGL-ARKLPH 380
+PA+P D H IL+ + R+L H
Sbjct: 308 HFTLNRPAKPYDMIHMILHSIWLRRLNH 335
>gi|226496701|ref|NP_001141657.1| uncharacterized protein LOC100273782 [Zea mays]
gi|194705444|gb|ACF86806.1| unknown [Zea mays]
Length = 497
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 105/362 (29%), Positives = 158/362 (43%), Gaps = 58/362 (16%)
Query: 38 GKLKVYVYELPSKYNKKLL--LKDP-RCLTHMFAAEIFMHRF------------------ 76
G L+VYVYE+PSK+ LL +D R ++ + +HR
Sbjct: 109 GLLRVYVYEMPSKFTYDLLRLFRDSYRDTDNLTSNGSPVHRLIEQHSIDYWLWADLIALD 168
Query: 77 ---LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWN 133
LL S +R EEAD FY P + T L + + R A++ ++ P W
Sbjct: 169 SQRLLKSVIRVQQQEEADIFYVPFFTTISYF---LLEKQECKALYREALKWVTDQ-PAWQ 224
Query: 134 RTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIP 193
R+EG DH V H + +++++ I LL + + V L + I
Sbjct: 225 RSEGRDHVIPVHHPWS---FKSVRRSVKKAIW-LLPDMDSTGNWYKPGQVYLEKD--VIL 278
Query: 194 PYAPPQKMQAHQ--IPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPL 251
PY P + H+ + + RSI ++FRG GG R+ + E K+
Sbjct: 279 PYVPNVDLCDHKCVLETQSKRSILLFFRGRL----KRNAGG----KIRSKLVEELKSAK- 329
Query: 252 FDISTDHPTT-------YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL 304
DI + +T + M+++ FCL P G P S RL +A+V GCIPVII+D++ L
Sbjct: 330 -DIVIEEGSTGAQGKAAAQDGMRKSFFCLSPAGDTPSSARLFDAIVTGCIPVIISDELEL 388
Query: 305 PFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPA 361
PF + + EI +FV+ D + L L I + I Q L S R L+ PA
Sbjct: 389 PFEGILDYREIALFVSASDAVQPGWLLKYLRGINAKRIREIQSNLVKYS--RHFLYSSPA 446
Query: 362 QP 363
QP
Sbjct: 447 QP 448
>gi|359484343|ref|XP_002283936.2| PREDICTED: uncharacterized protein LOC100268163 [Vitis vinifera]
Length = 738
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 96/351 (27%), Positives = 153/351 (43%), Gaps = 48/351 (13%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEI-FMHRFLLSSPVRTLNPEEADWFYTP-- 96
LKVYVY K +K + P L ++A+E FM + +P +A FY P
Sbjct: 409 LKVYVY----KDGEKPIFHQP-ILKGLYASEGWFMKLMERNKHFVVKDPRQAQLFYMPFS 463
Query: 97 -------IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFG 149
+Y T L + ++ + I++ + +WNRT GADHF V HD+
Sbjct: 464 SRMLEYKLYVRNSHNRTNLR------QYLKQYSEKIAAKYRFWNRTGGADHFLVACHDWA 517
Query: 150 ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPD 209
Y+ +E+ I L A + F V L E + P + + P
Sbjct: 518 P---YETRHHMEQCIKALCN-ADVTAGFKIGRDVSLPETYVR-SARNPLRDLGG---KPP 569
Query: 210 TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP------TTYY 263
+ R I ++ G + Y R W++ +P I P Y
Sbjct: 570 SERHILAFYAGNMHG---------YLRPILLKYWKD--KDPDMKIYGPMPPGVASKMNYI 618
Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
+ M+ + FC+CP G+ SPR+VEA+ + C+PVII+D+ V PF D + W + +AE+D
Sbjct: 619 QHMKSSKFCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFDVLDWGAFSIILAEKD 678
Query: 324 VPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+P L +L SIP + L+ Q L +++ L+ D FH L+ +
Sbjct: 679 IPNLKDVLLSIPNDKYLQMQ--LGVRKVQKHFLWHAKPLKYDLFHMTLHSI 727
>gi|225441752|ref|XP_002277596.1| PREDICTED: uncharacterized protein LOC100267584 [Vitis vinifera]
Length = 794
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 101/410 (24%), Positives = 172/410 (41%), Gaps = 71/410 (17%)
Query: 28 AGDVLEDDPVGKLK---VYVYELPSKYNKKLL------------LKDPRCLTH----MFA 68
+G ++ D V K K +YVY+LP ++N LL + D R T+ ++
Sbjct: 335 SGSLVNLDAVVKKKRPLIYVYDLPPEFNSLLLEGRHFKFECVNRIYDDRNATYWTEQLYG 394
Query: 69 AEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLP--LPFKSPRMMRSAIQL-- 124
A++ ++ +L+SP RTL+ EEAD+F+ P+ +C + L + +RS++ L
Sbjct: 395 AQMAIYESILASPHRTLDGEEADFFFVPVLDSCIIVRADDAPHLNMHAHGGLRSSLTLEF 454
Query: 125 -------ISSNWPYWNRTEGADHFFVVPHDFGACFHYQE--------------EKAIERG 163
I +P+WNR+ G DH + D GAC+ +E K
Sbjct: 455 YKTAYDHIVEQYPFWNRSSGRDHIWFFSWDEGACYAPKEIWDSMMLVHWGNTNSKHNHST 514
Query: 164 ILPLLQRATLVQTFGQRNHVCLN-EGSITIPPYAPPQKMQAHQ---IPPDTPRSIFVYFR 219
V + + NH C + + +P + P + P R YF
Sbjct: 515 TAYWADNWDSVSSDRRGNHPCFDPYKDLVLPAWKRPDVVSLSSKLWSRPREQRKTLFYFN 574
Query: 220 GLFYDVNNDPEGGY----YARGARAAVWENFKNNPLFD--ISTDHPT----------TYY 263
G ++ EGG Y+ G R V E F ++P + + H Y+
Sbjct: 575 G---NLGPAYEGGRPETTYSMGIRQKVAEEFGSSPNKEGKLGKQHAEDVIVTPLRSGNYH 631
Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
E + ++FC G WS R ++++ GCIPV+I D I LPF + + +E + + E++
Sbjct: 632 ESLASSVFCGVMPG-DGWSGRFEDSILQGCIPVVIQDGIFLPFENMLNYESFAVRIREDE 690
Query: 324 VPKLDTILTSIPP---EVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQI 370
+P L IL + E L R + + R + + + AF +
Sbjct: 691 IPNLIKILRGMNETEIEFKLENVRKIWQRFLYRDSILLEAERQKTAFGNV 740
>gi|414880038|tpg|DAA57169.1| TPA: hypothetical protein ZEAMMB73_490377 [Zea mays]
Length = 497
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 105/362 (29%), Positives = 158/362 (43%), Gaps = 58/362 (16%)
Query: 38 GKLKVYVYELPSKYNKKLL--LKDP-RCLTHMFAAEIFMHRF------------------ 76
G L+VYVYE+PSK+ LL +D R ++ + +HR
Sbjct: 109 GLLRVYVYEMPSKFTYDLLRLFRDSYRDTDNLTSNGSPVHRLIEQHSIDYWLWADLIALD 168
Query: 77 ---LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWN 133
LL S +R EEAD FY P + T L + + R A++ ++ P W
Sbjct: 169 SQRLLKSVIRVQQQEEADIFYVPFFTTISYF---LLEKQECKALYREALKWVTDQ-PAWQ 224
Query: 134 RTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIP 193
R+EG DH V H + +++++ I LL + + V L + I
Sbjct: 225 RSEGRDHVIPVHHPWS---FKSVRRSVKKAIW-LLPDMDSTGNWYKPGQVYLEKD--VIL 278
Query: 194 PYAPPQKMQAHQ--IPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPL 251
PY P + H+ + + RSI ++FRG GG R+ + E K+
Sbjct: 279 PYVPNVDLCDHKCVLETQSKRSILLFFRGRL----KRNAGG----KIRSKLVEELKSAK- 329
Query: 252 FDISTDHPTT-------YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL 304
DI + +T + M+++ FCL P G P S RL +A+V GCIPVII+D++ L
Sbjct: 330 -DIVIEEGSTGAQGKAAAQDGMRKSFFCLSPAGDTPSSARLFDAIVSGCIPVIISDELEL 388
Query: 305 PFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPA 361
PF + + EI +FV+ D + L L I + I Q L S R L+ PA
Sbjct: 389 PFEGILDYREIALFVSASDAVQPGWLLKYLRGINAKRIREIQSNLVKYS--RHFLYSSPA 446
Query: 362 QP 363
QP
Sbjct: 447 QP 448
>gi|297739695|emb|CBI29877.3| unnamed protein product [Vitis vinifera]
Length = 822
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 101/410 (24%), Positives = 172/410 (41%), Gaps = 71/410 (17%)
Query: 28 AGDVLEDDPVGKLK---VYVYELPSKYNKKLL------------LKDPRCLTH----MFA 68
+G ++ D V K K +YVY+LP ++N LL + D R T+ ++
Sbjct: 363 SGSLVNLDAVVKKKRPLIYVYDLPPEFNSLLLEGRHFKFECVNRIYDDRNATYWTEQLYG 422
Query: 69 AEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLP--LPFKSPRMMRSAIQL-- 124
A++ ++ +L+SP RTL+ EEAD+F+ P+ +C + L + +RS++ L
Sbjct: 423 AQMAIYESILASPHRTLDGEEADFFFVPVLDSCIIVRADDAPHLNMHAHGGLRSSLTLEF 482
Query: 125 -------ISSNWPYWNRTEGADHFFVVPHDFGACFHYQE--------------EKAIERG 163
I +P+WNR+ G DH + D GAC+ +E K
Sbjct: 483 YKTAYDHIVEQYPFWNRSSGRDHIWFFSWDEGACYAPKEIWDSMMLVHWGNTNSKHNHST 542
Query: 164 ILPLLQRATLVQTFGQRNHVCLN-EGSITIPPYAPPQKMQAHQ---IPPDTPRSIFVYFR 219
V + + NH C + + +P + P + P R YF
Sbjct: 543 TAYWADNWDSVSSDRRGNHPCFDPYKDLVLPAWKRPDVVSLSSKLWSRPREQRKTLFYFN 602
Query: 220 GLFYDVNNDPEGGY----YARGARAAVWENFKNNPLFD--ISTDHPT----------TYY 263
G ++ EGG Y+ G R V E F ++P + + H Y+
Sbjct: 603 G---NLGPAYEGGRPETTYSMGIRQKVAEEFGSSPNKEGKLGKQHAEDVIVTPLRSGNYH 659
Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
E + ++FC G WS R ++++ GCIPV+I D I LPF + + +E + + E++
Sbjct: 660 ESLASSVFCGVMPG-DGWSGRFEDSILQGCIPVVIQDGIFLPFENMLNYESFAVRIREDE 718
Query: 324 VPKLDTILTSIPP---EVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQI 370
+P L IL + E L R + + R + + + AF +
Sbjct: 719 IPNLIKILRGMNETEIEFKLENVRKIWQRFLYRDSILLEAERQKTAFGNV 768
>gi|147799433|emb|CAN76867.1| hypothetical protein VITISV_012309 [Vitis vinifera]
Length = 1908
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 96/351 (27%), Positives = 152/351 (43%), Gaps = 48/351 (13%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEI-FMHRFLLSSPVRTLNPEEADWFYTP-- 96
LKVYVY K +K + P L ++A+E FM + +P +A FY P
Sbjct: 1065 LKVYVY----KDGEKPIFHQP-ILKGLYASEGWFMKLMERNKXFVVKDPRQAQLFYMPFS 1119
Query: 97 -------IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFG 149
+Y T L + ++ + I++ + +WNRT G DHF V HD+
Sbjct: 1120 SRMLEYKLYVRNSHNRTNLR------QYLKQYSEKIAAKYRFWNRTGGXDHFLVACHDWA 1173
Query: 150 ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPD 209
Y+ +E+ I L A + F V L E + P + + P
Sbjct: 1174 P---YETRHHMEQCIKALCN-ADVTAGFKIGRDVSLPETYVR-SARNPLRDLGG---KPP 1225
Query: 210 TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP------TTYY 263
+ R I ++ G + Y R W++ +P I P Y
Sbjct: 1226 SERHILAFYAGNMHG---------YLRPILLKYWKD--KDPDMKIYGPMPPGVASKMNYI 1274
Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
+ M+ + FC+CP G+ SPR+VEA+ + C+PVII+D+ V PF D + W + +AE+D
Sbjct: 1275 QHMKSSKFCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFDVLDWGAFSIILAEKD 1334
Query: 324 VPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+P L +L SIP E L+ Q L +++ L+ D FH L+ +
Sbjct: 1335 IPNLKDVLLSIPNEKYLQMQ--LGVRKVQKHFLWHAKPLKYDLFHMTLHSI 1383
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 108/430 (25%), Positives = 171/430 (39%), Gaps = 85/430 (19%)
Query: 8 LLLAFVSADNTPKIERISGSAGDVLEDDPVGK---------LKVYVYELPSKYNKKLLLK 58
LL A N P I G V + + K LK+Y+Y + +K +
Sbjct: 295 LLSARSEIQNAPVIRNTPGLYASVYRNVSMFKRSYELMERVLKIYIY----REGEKPIFH 350
Query: 59 DPRCLTHMFAAEI-FMHRFLLSSPVRTLNPEEADWFYTPI---------YPTCDLTPTGL 108
PR L ++A+E FM + +P +A FY P Y TP L
Sbjct: 351 QPR-LRGIYASEGWFMKLIEGNKRFVVRDPRKAHLFYVPFSSKMLRTVFYEQNSSTPRDL 409
Query: 109 PLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIE------- 161
FK+ + LI+ + +WNRT GADH V HD+ + I
Sbjct: 410 EKYFKN------YVGLIAGKYRFWNRTGGADHLIVACHDWNPIYRTISTNTIRIKSQAIT 463
Query: 162 ----------------------RGILPLLQRATLVQTFGQRNHVCLNE-------GSITI 192
+G R T ++ +C + G T
Sbjct: 464 MPPFIFVGGESTYDLVSGTFSNKGFNSQAPRITRQCSWNSIRALCNSNIASGFKIGKDTT 523
Query: 193 PPYAPPQK----MQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN 248
P +K ++ P + R I +F G + Y R WEN +
Sbjct: 524 LPVTYIRKSEDPLKYLGGKPPSQRPILAFFAGSMHG---------YLRPILLQYWENKEQ 574
Query: 249 N-PLFD-ISTDH--PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL 304
+ +F +S D + Y + M+ + +C+C G+ +PR+VEA+ + C+PVII+D+ V
Sbjct: 575 DIKIFGPMSRDDGGKSRYRDHMKSSKYCICARGYEVHTPRVVEAIFYECVPVIISDNYVP 634
Query: 305 PFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPG 364
PF + + WE +F+ E+DVP L IL SIP E L+ Q + +++ L+ +
Sbjct: 635 PFFEILNWEAFAVFILEKDVPNLRNILLSIPEEKYLQMQ--MRVKMVQQHFLWHKKPVKY 692
Query: 365 DAFHQILNGL 374
D FH IL+ +
Sbjct: 693 DLFHMILHSV 702
>gi|302772499|ref|XP_002969667.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300162178|gb|EFJ28791.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 352
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 91/333 (27%), Positives = 152/333 (45%), Gaps = 34/333 (10%)
Query: 53 KKLLLKDPRCLTHMFAAE-IFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLP 111
K L+ + P+ T ++A+E +F+ +P P+ A F+ P + +
Sbjct: 35 KPLMHEGPK--TGIYASEGLFIATMERGNPFAVTEPKIATMFFIP-FSLKQMVDYMYDTN 91
Query: 112 FKSPRMMRSAI----QLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPL 167
S + ++S I + ++S +PY N T G DHFFV HD+ E++ +R I+ +
Sbjct: 92 SHSMKNIQSYIAGYLRRLASKYPYMNATNGIDHFFVSCHDWA--LMALEKQDCQRNIVKV 149
Query: 168 LQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNN 227
+ A + F V L E + ++P + + R +F G +
Sbjct: 150 VCNADSSRGFNTSRDVSLPETRVRQGKHSPIIRDIS-----GMDRPYLAFFAGQMHG--- 201
Query: 228 DPEGGYYARGARAAVWENFKNNPLFDI------STDHPTTYYEDMQRAIFCLCPLGWAPW 281
R A W++ +P I S +Y E M+ + +C+C G+
Sbjct: 202 ------KLRPVLLAHWKD--KDPEMKIYEVLPPSVAERISYSEHMRLSKYCICAAGFEVN 253
Query: 282 SPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILR 341
SPRLVEA+V C+PVI+AD+ VLPF++ I W+ I + VAE+DV L IL IP
Sbjct: 254 SPRLVEAIVNECVPVILADNFVLPFSEVINWDSISVTVAEKDVANLKAILAGIPLRRYKE 313
Query: 342 KQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
Q L + +KR ++ + D F+ I++ L
Sbjct: 314 MQARLKH--VKRHFVWKNSPEKYDIFNMIVHSL 344
>gi|384252594|gb|EIE26070.1| exostosin-domain-containing protein [Coccomyxa subellipsoidea
C-169]
Length = 898
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 101/391 (25%), Positives = 160/391 (40%), Gaps = 92/391 (23%)
Query: 42 VYVYELPSKYNKKLL---LKDPRCLTHMFA--------------AEIFMHRFL--LSSPV 82
+YVY+LP+++N ++L L C+ +F F R L LSS
Sbjct: 448 IYVYDLPAEFNSRMLQYRLNKRECVWRLFKDNNESYINSWTYGIESAFHERLLQALSSEH 507
Query: 83 RTLNPEEADWFYTPIYPTCDLTP---------------------------TGLPLPFKSP 115
RTL+PEEAD+F+ P+Y +C L P TG ++
Sbjct: 508 RTLDPEEADFFFMPVYTSCFLHPVWGYVDHPWYYGPTIDCRRDGDMQICQTGANRVMQAM 567
Query: 116 RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ 175
M+ A + + N P+W R G DH +++ HD G+C+ +E + IL R +
Sbjct: 568 FMLLEAQKWVEVNHPWWRRKGGRDHIWLITHDEGSCWAPKEIRL--SIILSHWGRKDVNH 625
Query: 176 TFGQ--------------------------RNHVCLNE-GSITIPPYAPPQKMQAHQIP- 207
T + H C + + IP P + A+ P
Sbjct: 626 TSNSAFKPWDNYTQEVIHPEWWPEGYTHHIKGHACYDPIKDLIIPNLKHPAEF-ANFSPL 684
Query: 208 ---PDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNP-------LFDISTD 257
P PR I FRG DV +Y+RG R ++ + + L D
Sbjct: 685 VGHPQPPRDILFLFRG---DVGKH-RLPHYSRGIRQRLFALAQEHDWAGRHAILIGDRDD 740
Query: 258 HPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGM 317
Y E + R+ FCL G +SPR +A++ GC+PV++ D++ F+ + W +
Sbjct: 741 VAGDYSELLTRSKFCLVAPG-DGFSPRAEDAILHGCVPVVVMDEVDPVFSSILDWSAFSL 799
Query: 318 FVAEEDVPKLDTILTSIPPEVILRKQRLLAN 348
+AE D+ +L IL ++P + QR L N
Sbjct: 800 RIAEADIEQLPQILLAVPEARLQAMQRSLRN 830
>gi|449466083|ref|XP_004150756.1| PREDICTED: probable glycosyltransferase At3g07620-like [Cucumis
sativus]
Length = 341
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 95/352 (26%), Positives = 154/352 (43%), Gaps = 51/352 (14%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTL-NPEEADWFYTPIY 98
LKVYVY+ +K + P LT ++A+E + + L + + +PE+A FY P
Sbjct: 13 LKVYVYD----EGEKPIFHQP-ILTGIYASEGWFMKLLEDNKKFVVKDPEKAHLFYLPF- 66
Query: 99 PTCDLTPTGLPLPFKSPRMM----RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHY 154
+ + F++ R + ++ I +I + +WN+ G+DHF V HD+
Sbjct: 67 -SSQFLRSAFGNKFRNKRDLQKPLKNYIDVIGKKYRFWNKNGGSDHFLVACHDWAPKLTK 125
Query: 155 QEEKAIER--------GILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQI 206
+ K R G + + +L TF V E IT PP
Sbjct: 126 RLVKNCIRALCNANGAGDFEIGKDTSLPVTF-----VHSTEDLITKIGGKPP-------- 172
Query: 207 PPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFD----ISTDHPTTY 262
+ R+ +F G + Y R WEN + + + S + Y
Sbjct: 173 ---SERTTLAFFAGSMHG---------YLRPILLHYWENKEPDMMIVGPMPNSIEGKNAY 220
Query: 263 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
E M+ + +C+C G+ SPR++EA++ CIPVII+D+ V P + + WE +FV E
Sbjct: 221 MEQMKSSKYCICARGYQVHSPRVIEAILNECIPVIISDNYVPPLFEVLNWESFSVFVKER 280
Query: 323 DVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
++P L IL SIP E R + + +PA+ DAFH IL+ +
Sbjct: 281 EIPNLRDILLSIPEEN-YRAMHSRVKMVQQHFLWHEKPAK-YDAFHMILHSI 330
>gi|255546377|ref|XP_002514248.1| catalytic, putative [Ricinus communis]
gi|223546704|gb|EEF48202.1| catalytic, putative [Ricinus communis]
Length = 676
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 95/351 (27%), Positives = 151/351 (43%), Gaps = 48/351 (13%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEI-FMHRFLLSSPVRTLNPEEADWFYTP-- 96
LKVY+Y K KK + P + ++A+E FM + +P A FY P
Sbjct: 347 LKVYIY----KDGKKPIFHLP-IMKGLYASEGWFMKLMQGNKHFLVKDPRRAHLFYMPFS 401
Query: 97 -------IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFG 149
+Y T L + ++ + I++ +P+WNRT+GADHF V HD+
Sbjct: 402 SRMLEYTLYVRNSHNRTNLR------QYLKDYSEKIAAKYPFWNRTDGADHFLVACHDWA 455
Query: 150 ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPD 209
Y+ +E I L A + F + L E + + ++ P
Sbjct: 456 P---YETRHHMEHCIKALCN-ADVTAGFKIGRDISLPETYVR----SARNPLRDLGGKPP 507
Query: 210 TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP------TTYY 263
+ R I ++ G + Y R W++ +P I P Y
Sbjct: 508 SQRHILAFYAGSMHG---------YLRPILLKYWKD--KDPSMKIFGPMPPGVASKMNYI 556
Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
+ M+ + +C+CP G+ SPR+VEA+ + C+PVII+D+ V PF + W + +AE+D
Sbjct: 557 QHMKSSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEVFNWGAFSLILAEKD 616
Query: 324 VPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+P L IL SIP E L Q L K + P P + D F+ L+ +
Sbjct: 617 IPNLKEILLSIPEEKYLEMQ-LGVRKVQKHFLWHPSPMK-YDLFYMTLHAI 665
>gi|302851968|ref|XP_002957506.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300257148|gb|EFJ41400.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 706
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 93/375 (24%), Positives = 150/375 (40%), Gaps = 71/375 (18%)
Query: 39 KLKVYVYELPSKYN-KKLLLKDPRCLTHMFAAEI-FMHRFLLSSPVRTLNPEEADWFYTP 96
KLK+YVY+LP + + +K +P L M+ AE+ FM + L VRT NP EA+ F P
Sbjct: 250 KLKIYVYDLPERVSYRKPWHDEPALLDTMYLAELLFMEQLLGDWSVRTENPWEANLFVLP 309
Query: 97 IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGAC----- 151
Y G P + + + SN+P+WN T G +H +D G C
Sbjct: 310 TYTIYYTGNIGFP-----AKHFANVFNYVRSNYPFWNLTGGRNHVAFATNDRGCCDLYKL 364
Query: 152 ---------------FHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYA 196
F + A+E + A + G +G T+
Sbjct: 365 ARSNPELQHPIKVVHFSQTSDPALEAAFREFKRLAVHQTSGGAEGEAMEEDGGGTVMQEL 424
Query: 197 PPQKMQAHQIPPDTPRSIFVYFRGLF-------------------------YD-----VN 226
P ++ PP T ++ + F YD V+
Sbjct: 425 PEALVRFRGFPPYTLEALRMEREPCFRPEQDVAVPNYLERGWIGRLQEAYAYDREGNAVH 484
Query: 227 NDPE-------GGY------YARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCL 273
D + GY Y+ G R + + N D++ + E M ++ FCL
Sbjct: 485 RDKQRPYLFYFNGYSKPDMAYSGGVRQGLLSMYHNLTRGDVAINPGCCTAEYMLQSRFCL 544
Query: 274 CPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTS 333
CPLG+ W RL +A+ GC+PVI+ D F D +P+E+ + + ++ +L +L +
Sbjct: 545 CPLGYG-WGIRLTQAMQSGCVPVIVQDHTYSAFWDLLPYEKFSVRINRHNLHRLFDLLDA 603
Query: 334 IPPEVILRKQRLLAN 348
+ PE + Q+ LA+
Sbjct: 604 VTPEQLKDLQKGLAD 618
>gi|255571564|ref|XP_002526728.1| catalytic, putative [Ricinus communis]
gi|223533917|gb|EEF35642.1| catalytic, putative [Ricinus communis]
Length = 728
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 86/313 (27%), Positives = 138/313 (44%), Gaps = 51/313 (16%)
Query: 65 HMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLP--LPFKSPRMMRSAI 122
H++ A++ ++ LL+SP RTLN EEAD+F+ PI +C +T L + +RS++
Sbjct: 326 HLYGAQMALYESLLASPYRTLNGEEADFFFVPILDSCIITRADDAPHLSMQDHMGLRSSL 385
Query: 123 QL---------ISSNWPYWNRTEGADHFFVVPHDFGACFHYQE--------------EKA 159
L I ++PYWNRT G DH + D GAC+ +E K
Sbjct: 386 TLEYYRKAYDHIVEHYPYWNRTSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWGNTNSKH 445
Query: 160 IERGILPLLQRATLVQTFGQRNHVCLN-EGSITIPPYAPPQ------KMQAHQIPPDTPR 212
+ + + H C + + + +P + P K+ A P R
Sbjct: 446 NHSTTAYWADNWDKISSDRRGRHPCFDPDKDLVLPAWKRPDVSALSTKLWAR---PLERR 502
Query: 213 SIFVYFRGLFYDV--NNDPEGGYYARGARAAVWENFKNNPLFD--ISTDH---------- 258
+F G N PE Y+ G R + E F ++P D + H
Sbjct: 503 KTLFFFNGNLGPAYPNGRPELS-YSMGIRQKLAEEFGSSPNKDGKLGKQHAEDVIVTPLR 561
Query: 259 PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
Y+ED+ +IFC G WS R+ ++++ GCIPVII D I LP+ + + +E +
Sbjct: 562 SENYHEDLASSIFCGVLPG-DGWSGRMEDSILQGCIPVIIQDGIFLPYENVLNYESFAVR 620
Query: 319 VAEEDVPKLDTIL 331
+ E+++ L IL
Sbjct: 621 IREDEISNLLKIL 633
>gi|15231488|ref|NP_187419.1| putative glycosyltransferase [Arabidopsis thaliana]
gi|75207507|sp|Q9SSE8.1|GLYT1_ARATH RecName: Full=Probable glycosyltransferase At3g07620
gi|6466945|gb|AAF13080.1|AC009176_7 hypothetical protein [Arabidopsis thaliana]
gi|332641055|gb|AEE74576.1| putative glycosyltransferase [Arabidopsis thaliana]
Length = 470
Score = 105 bits (261), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 99/359 (27%), Positives = 158/359 (44%), Gaps = 65/359 (18%)
Query: 40 LKVYVYEL--PSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPI 97
K+YVYE P ++ L KD + +F FM +L RT +P++A ++ P
Sbjct: 143 FKIYVYEEGDPPIFHYGLC-KDIYSMEGLFLN--FMENDVLK--YRTRDPDKAHVYFLPF 197
Query: 98 YPTCDLTPTGLPLPFKSP---RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHY 154
L P+ R++ +Q+IS +PYWN ++G DHF + HD+G
Sbjct: 198 SVVMILHHLFDPVVRDKAVLERVIADYVQIISKKYPYWNTSDGFDHFMLSCHDWG----- 252
Query: 155 QEEKAIERGILPLLQRAT--LVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPR 212
RAT + + F V N I Y P+K D P
Sbjct: 253 --------------HRATWYVKKLFFNSIRVLCNAN---ISEYFNPEK--------DAP- 286
Query: 213 SIFVYFRGLFYDVNNDPEG---------GYYARGARAAV-------WENFKNNPLFDIST 256
F L D+NN G ++A + + W+ + L +
Sbjct: 287 --FPEINLLTGDINNLTGGLDPISRTTLAFFAGKSHGKIRPVLLNHWKEKDKDILVYENL 344
Query: 257 DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG 316
Y E M+++ FC+CP G SPR+ EA+ GC+PV+I+++ VLPF+D + WE+
Sbjct: 345 PDGLDYTEMMRKSRFCICPSGHEVASPRVPEAIYSGCVPVLISENYVLPFSDVLNWEKFS 404
Query: 317 MFVAEEDVPKLDTILTSIPPEVILRKQRLLAN-PSMKRAMLFPQPAQPGDAFHQILNGL 374
+ V+ +++P+L IL IP E R RL +KR +L P + D F+ I++ +
Sbjct: 405 VSVSVKEIPELKRILMDIPEE---RYMRLYEGVKKVKRHILVNDPPKRYDVFNMIIHSI 460
>gi|168019297|ref|XP_001762181.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686585|gb|EDQ72973.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 344
Score = 105 bits (261), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 98/354 (27%), Positives = 152/354 (42%), Gaps = 57/354 (16%)
Query: 39 KLKVYVY--ELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP 96
K KVYVY P Y + PR LT +++E + + L S T + AD F+ P
Sbjct: 21 KFKVYVYPDGDPETY-----YQTPRKLTGKYSSEGYFFQNLRESRFVTNDSAAADLFFLP 75
Query: 97 IYPTC-DLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQ 155
+ +C + GL K ++R+ ++ + +P+WNRT GADHFFV HD G +
Sbjct: 76 V--SCHKMRGKGLSYE-KMADIVRAYVESLIIKYPFWNRTVGADHFFVTCHDVGVRATAK 132
Query: 156 EEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIF 215
E ++ I R ++ N I A PQ +Q +P
Sbjct: 133 VEHLVKNSI-----RVVCSPSY--------NGSFIPHKDVALPQVLQPFPLPAGGD---- 175
Query: 216 VYFRGLFYDVNNDPEGGYYARGARAAVWENF------------KNNPLFDISTDHPTTYY 263
D++N G++A + V N NN L + D+ Y
Sbjct: 176 --------DIHNRTVLGFWAGHRNSKVRVNLADAWQYDPILFVANNRLNRSTGDY--IYQ 225
Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIP-VIIADDIVLPFADAIPWEEIGMFVAEE 322
R+ FC+CP G S R+ E++ +GC+P VI+AD LPF D + W + + V E
Sbjct: 226 NQFYRSKFCICPAGSQVNSARIAESIHYGCVPEVIMADFYDLPFNDILDWRKFSLVVRER 285
Query: 323 DVPKLDTILTSIPPEVILRKQRLL--ANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+ L IL + V ++K R+L ++R + P DAFH ++ L
Sbjct: 286 EYDNLKKILQA----VTVQKYRMLHAGVRQVRRHFEWHSPPIKYDAFHMVMYEL 335
>gi|356518828|ref|XP_003528079.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 505
Score = 105 bits (261), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 79/315 (25%), Positives = 135/315 (42%), Gaps = 38/315 (12%)
Query: 72 FMHRFLLSSPVRTLNPEEADWFYTP---------IY---PTCDLTPTGLPLPFKSPRMMR 119
F+H RT +P+EA +Y P +Y +L P GL +++
Sbjct: 205 FIHEMEKGRYYRTYDPDEAFVYYLPFSVVMLVEYVYDRGSNYNLDPLGL--------VVK 256
Query: 120 SAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQ 179
IQ+I+ P+WNR+ G DH + HD+G + I +L A + F
Sbjct: 257 DYIQIIAHKHPFWNRSLGHDHVMLSCHDWGPLVSSYVDHLYNNAI-RVLCNANTSEGFKP 315
Query: 180 RNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGAR 239
V E + +++ P + R+I +F G + Y R
Sbjct: 316 AKDVSFPEIKLI------KGEVKGLGGYPPSQRTILAFFAGHLHG---------YIRYLL 360
Query: 240 AAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIA 299
+ W+N + +YY ++ + FCLCP G+ SPR+VEA+ C+PV+I+
Sbjct: 361 LSTWKNKDQDMQIYEELPEGISYYTKLRSSKFCLCPSGYEVASPRVVEAIFAECVPVLIS 420
Query: 300 DDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQ 359
D V PF+D + W + V +D+P + IL I + LR + + ++R + +
Sbjct: 421 DSYVPPFSDVLNWNSFSVQVNVKDIPNIKRILMEISEKQYLRMHKRVKQ--VQRHFVPNE 478
Query: 360 PAQPGDAFHQILNGL 374
P + D FH ++ +
Sbjct: 479 PPKRYDMFHMTVHSI 493
>gi|449511061|ref|XP_004163851.1| PREDICTED: probable glycosyltransferase At3g07620-like [Cucumis
sativus]
Length = 341
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 94/354 (26%), Positives = 157/354 (44%), Gaps = 55/354 (15%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTL-NPEEADWFYTPIY 98
LKVYVY+ +K + P LT ++A+E + + L + + +PE+A FY P
Sbjct: 13 LKVYVYD----EGEKPIFHQP-ILTGIYASEGWFMKLLEDNKKFVVKDPEKAHLFYLPF- 66
Query: 99 PTCDLTPTGLPLPFKSPRMM----RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHY 154
+ + F++ R + ++ I +I + +WN+ G+DHF V HD+
Sbjct: 67 -SSQFLRSAFGNKFRNKRDLQKPLKNYIDVIGKKYRFWNKNGGSDHFLVACHDWAPKLTK 125
Query: 155 QEEKAIER--------GILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQI 206
+ K R G + + +L TF V E IT PP
Sbjct: 126 RLVKNCIRALCNANGAGDFEIGKDTSLPVTF-----VHSTEDLITKIGGKPP-------- 172
Query: 207 PPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFD----ISTDHPTTY 262
+ R+ +F G + Y R WEN + + + S + Y
Sbjct: 173 ---SERTTLAFFAGSMHG---------YLRPILLHYWENKEPDMMIVGPMPNSIEGKNAY 220
Query: 263 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
E M+ + +C+C G+ SPR++EA++ CIPVII+D+ V P + + WE +FV E
Sbjct: 221 MEQMKSSKYCICARGYQVHSPRVIEAILNECIPVIISDNYVPPLFEVLNWESFSVFVKER 280
Query: 323 DVPKLDTILTSIPPEVILRKQRLLAN--PSMKRAMLFPQPAQPGDAFHQILNGL 374
++P L IL SIP E R++ + +++ L+ + DAFH IL+ +
Sbjct: 281 EIPYLRDILLSIPEE----NYRVMHSRVKMVQQHFLWHEKPAKYDAFHMILHSI 330
>gi|159490314|ref|XP_001703124.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158270754|gb|EDO96589.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 490
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 99/368 (26%), Positives = 143/368 (38%), Gaps = 51/368 (13%)
Query: 42 VYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTC 101
VYVYELP K TH I ++ LL S R + ++ADW+Y PI
Sbjct: 105 VYVYELPPDLTTWTNTKRLDRSTH-----IHFYQRLLGSGARIADGDKADWYYIPIRQRM 159
Query: 102 DLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIE 161
R + A+ IS+ +P+WNRT G+ HF + D GA E +
Sbjct: 160 T----------ADSRFLSEAVAYISATYPWWNRTGGSRHFVIHTGDLGA----DETQLGA 205
Query: 162 RGILPLLQ-----RATLVQTFGQRNHVCLNEGSITIPPYAPP--------QKMQAHQIPP 208
R P + T+ + F + + IP + P ++ H +
Sbjct: 206 RLQAPNITWLTHWGLTMDKVFSGWKKAHRPDKDVVIPVFLTPGHFKHFGLERTPLHPLMD 265
Query: 209 DTPRSIFVYFRGLFYDVNNDPEGGY----------YARGARAAVWENFKNNPLFDISTDH 258
R+ +F G P+ G Y+ G R V + + P F +
Sbjct: 266 KQERTTTFFFAGRICGDRKPPKTGSWPNCGPRSPGYSAGVRQLVHHHHWDPPGFKVVLHE 325
Query: 259 PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
P Y + + FCL PLG R + GC+PV IADD+ PF W + G+
Sbjct: 326 PN-YGAALGSSKFCLAPLG-GGHGQRQIIVSFMGCLPVCIADDVYEPFEPQYNWTQFGVR 383
Query: 319 VAEEDVPKLDTILTSIPPEVILRKQRLLANPSMK------RAMLFPQPAQPGDAFHQILN 372
AE D+P+L TIL S+ + KQR L + LF + + DAF L
Sbjct: 384 PAESDIPELHTILESVSAKEYAAKQRALRCAAQHFVYSSIVGGLFGEDGRY-DAFETTLE 442
Query: 373 GLARKLPH 380
L K H
Sbjct: 443 VLRVKAEH 450
>gi|147784355|emb|CAN72733.1| hypothetical protein VITISV_033460 [Vitis vinifera]
Length = 321
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 91/324 (28%), Positives = 141/324 (43%), Gaps = 50/324 (15%)
Query: 68 AAEIFMHRFLLSSPVRTL-NPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLIS 126
++F+ FL P L N F+ + CD T ++++ +Q +
Sbjct: 22 GTKLFLRFFLDIPPCNNLMNFPSLVVFFLSLSNLCDATVLQGTSYENMTIIVQNYVQSLM 81
Query: 127 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLN 186
S +PYWNRT GADHFF+ HD G +A E +PLL + ++ VC
Sbjct: 82 SKYPYWNRTLGADHFFLTCHDXGV-------RATEG--VPLLVKNSI-------RVVCSP 125
Query: 187 EGSITIPPY---APPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYA-------R 236
+ P+ A PQ +Q +P D+ N G++A R
Sbjct: 126 SYDVGFIPHKDVALPQVLQPFALPTGGR------------DIKNRTTLGFWAGHRNSKIR 173
Query: 237 GARAAVWEN-----FKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVF 291
A +WEN KNN + + +T H Y R FC+CP G S R+ +++ +
Sbjct: 174 VILARIWENDTELDIKNNRI-NRATGH-LVYQNKFYRTKFCICPGGSQVNSARIADSIHY 231
Query: 292 GCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIP-PEVILRKQRLLANPS 350
GC+PVI++D LPF D + W + + + E DV +L IL IP E I L+
Sbjct: 232 GCVPVILSDYYDLPFNDILDWRKFSVILKERDVYRLKYILKDIPDAEFIALHDNLV---K 288
Query: 351 MKRAMLFPQPAQPGDAFHQILNGL 374
+++ + P DAFH ++ L
Sbjct: 289 VQKHFQWNTPPIKYDAFHMVMYEL 312
>gi|242059031|ref|XP_002458661.1| hypothetical protein SORBIDRAFT_03g037670 [Sorghum bicolor]
gi|241930636|gb|EES03781.1| hypothetical protein SORBIDRAFT_03g037670 [Sorghum bicolor]
Length = 499
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 103/360 (28%), Positives = 155/360 (43%), Gaps = 58/360 (16%)
Query: 40 LKVYVYELPSKYNKKLL--LKDP-RCLTHMFAAEIFMHRF-------------------- 76
L+VYVYE+P K+ LL +D R ++ + +HR
Sbjct: 113 LRVYVYEMPGKFTYDLLRLFRDSYRDTDNLTSNGSPVHRLIEQHSIDYWLWADLIAPESQ 172
Query: 77 -LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRT 135
LL S VR EEAD FY P + T L + + R A++ ++ P W R+
Sbjct: 173 RLLKSVVRVQRQEEADIFYVPFFTTISYF---LLEKQECKALYREALKWVTDQ-PAWQRS 228
Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
EG DH V H + ++ + + + LL + + V L + I PY
Sbjct: 229 EGRDHVIPVHHPWS----FKSVRRFVKKAIWLLPDMDSTGNWYKPGQVYLEKD--VILPY 282
Query: 196 APPQKMQAHQIPPDT--PRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFD 253
P + H+ +T RSI ++FRG GG R+ + E K+ D
Sbjct: 283 VPNVDLCDHKCVLETQFKRSILLFFRGRL----KRNAGG----KIRSKLVEELKSAE--D 332
Query: 254 ISTDHPTTYY-------EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 306
I + + + M++++FCL P G P S RL +A+V GCIPVII+D++ LPF
Sbjct: 333 IVIEEGSAGAQGKAAAQDGMRKSLFCLSPAGDTPSSARLFDAIVSGCIPVIISDELELPF 392
Query: 307 ADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP 363
+ + EI +FV+ D + L L I + I Q L S R L+ PAQP
Sbjct: 393 EGILDYREIALFVSSSDAVQPGWLVKYLRGIDAKRIREIQSNLVKYS--RHFLYSSPAQP 450
>gi|297805042|ref|XP_002870405.1| hypothetical protein ARALYDRAFT_493575 [Arabidopsis lyrata subsp.
lyrata]
gi|297316241|gb|EFH46664.1| hypothetical protein ARALYDRAFT_493575 [Arabidopsis lyrata subsp.
lyrata]
Length = 509
Score = 104 bits (259), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 84/305 (27%), Positives = 134/305 (43%), Gaps = 30/305 (9%)
Query: 80 SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLP----FKSPRMMR---SAIQLISSNWPYW 132
S R +PE+A F+ P + P+ F R+ R + ++++ YW
Sbjct: 214 SRFRADHPEDAHVFFIPFSVAKVIHFVYKPITSVEGFSRARLHRLIEDYVDVVATKHRYW 273
Query: 133 NRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITI 192
NR++G DHF V HD+ K E+ I L T + G R +V ++I
Sbjct: 274 NRSKGGDHFMVSCHDWAPDVIDGNPKLFEKFIRALCNANT---SEGFRPNV-----DVSI 325
Query: 193 PPYAPPQKMQAHQIPPDTPR--SIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNP 250
P P+ +PR SI +F G + R W+ N
Sbjct: 326 PEIYLPKGKLGPSFLGKSPRIRSILAFFAGRSHG---------EIRKILFKHWKEMDNEV 376
Query: 251 LFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAI 310
Y + M + FCLCP GW SPR VEA+ GC+PVII+D+ LPF+D +
Sbjct: 377 QVYDRLPPGKDYTKTMGMSKFCLCPSGWEVASPREVEAIYAGCVPVIISDNYSLPFSDVL 436
Query: 311 PWEEIGMFVAEEDVPKLDTILTSIPPEVILRK-QRLLANPSMKRAMLFPQPAQPGDAFHQ 369
W+ + + +P++ TIL S+ L+ +R+L +K+ + +PA+P D H
Sbjct: 437 NWDSFSIQIPVSRIPEIKTILQSVSLVRYLKMYKRVL---EVKQHFVLNRPAKPYDVMHM 493
Query: 370 ILNGL 374
+L+ +
Sbjct: 494 MLHSI 498
>gi|42568129|ref|NP_198314.2| xylogalacturonan beta-1,3-xylosyltransferase [Arabidopsis thaliana]
gi|292630939|sp|Q94AA9.2|XGD1_ARATH RecName: Full=Xylogalacturonan beta-1,3-xylosyltransferase;
AltName: Full=Protein XYLOGALACTURONAN DEFICIENT 1
gi|332006509|gb|AED93892.1| xylogalacturonan beta-1,3-xylosyltransferase [Arabidopsis thaliana]
Length = 500
Score = 104 bits (259), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 74/262 (28%), Positives = 120/262 (45%), Gaps = 23/262 (8%)
Query: 116 RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ 175
R++ + ++++ PYWNR++G DHF V HD+ K E+ I L T
Sbjct: 248 RLIEDYVDVVATKHPYWNRSQGGDHFMVSCHDWAPDVIDGNPKLFEKFIRGLCNANT--- 304
Query: 176 TFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPR--SIFVYFRGLFYDVNNDPEGGY 233
+ G R +V ++IP P+ +PR SI +F G +
Sbjct: 305 SEGFRPNV-----DVSIPEIYLPKGKLGPSFLGKSPRVRSILAFFAGRSHG--------- 350
Query: 234 YARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGC 293
R W+ N Y + M + FCLCP GW SPR VEA+ GC
Sbjct: 351 EIRKILFQHWKEMDNEVQVYDRLPPGKDYTKTMGMSKFCLCPSGWEVASPREVEAIYAGC 410
Query: 294 IPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRK-QRLLANPSMK 352
+PVII+D+ LPF+D + W+ + + + ++ TIL S+ L+ +R+L +K
Sbjct: 411 VPVIISDNYSLPFSDVLNWDSFSIQIPVSRIKEIKTILQSVSLVRYLKMYKRVL---EVK 467
Query: 353 RAMLFPQPAQPGDAFHQILNGL 374
+ + +PA+P D H +L+ +
Sbjct: 468 QHFVLNRPAKPYDVMHMMLHSI 489
>gi|414883870|tpg|DAA59884.1| TPA: hypothetical protein ZEAMMB73_690759, partial [Zea mays]
Length = 577
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 98/367 (26%), Positives = 153/367 (41%), Gaps = 64/367 (17%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP------VRTLNPEEADW 92
+LKVYVYE + + D C + F+H + +RT +P A
Sbjct: 233 QLKVYVYE----EGEPPVFHDGPCRSIYSTEGRFIHSMETETEAEEGRRLRTRDPARAHV 288
Query: 93 FYTPIYPTCDLTPTGLP-----LPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHD 147
F+ P + P P K R + ++++SS +PYWNR+ GADHF + HD
Sbjct: 289 FFLPFSVVKMVQTIYEPGSRDMAPLK--RTVADYVRVLSSKYPYWNRSLGADHFMLSCHD 346
Query: 148 FGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGS---------ITIPPYAPP 198
+G P + A Q FG V N + +++P
Sbjct: 347 WG----------------PYVSSAN-AQLFGNSIRVLCNANTSEGFDPARDVSLPQVNLR 389
Query: 199 QKMQAHQI--PPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPL---FD 253
Q+ P + R + +F G N+ P R A A W
Sbjct: 390 SDAVERQVGGPSASRRPVLAFFAG----GNHGP-----VRPALLAHWGPGGRRGGDPDVR 440
Query: 254 ISTDHPT-----TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD-DIVLPFA 307
+S P +Y + M+R+ FCLCP G+ SPRL EA+ GC+PV++ D + LPFA
Sbjct: 441 VSEYLPRGGGAPSYADMMRRSRFCLCPGGYEVASPRLAEALYLGCVPVVVDDGEYALPFA 500
Query: 308 DAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAF 367
D + W+ + + D+P+L IL ++ P + + + + ML P + DAF
Sbjct: 501 DVLDWDAFALRLRVADIPRLKEILAAVSPRQYI-RMQRRVRMVRRHFMLHGGPPRRYDAF 559
Query: 368 HQILNGL 374
H IL+ +
Sbjct: 560 HMILHSV 566
>gi|356500491|ref|XP_003519065.1| PREDICTED: uncharacterized protein LOC100783624 [Glycine max]
Length = 795
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 94/376 (25%), Positives = 161/376 (42%), Gaps = 65/376 (17%)
Query: 15 ADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLL------------LKDPRC 62
AD+T E++ V + P+ VYVY+LP ++N LL + D
Sbjct: 331 ADDTHANEKMINLNAVVAKKRPL----VYVYDLPPEFNSLLLEGRHFKLECVNRIYDGNN 386
Query: 63 LT----HMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLP--LPFKSPR 116
+T ++ A+I ++ LL+SP RTLN EEAD+F+ P+ +C +T L +
Sbjct: 387 ITVWTDQLYGAQIALYESLLASPHRTLNGEEADFFFVPVLDSCIITRADDAPHLSMQEHM 446
Query: 117 MMRSAIQL---------ISSNWPYWNRTEGADHFFVVPHDFGACFH----YQEEKAIERG 163
+RS++ L I +PYWNR+ G DH + D GAC+ + + G
Sbjct: 447 GLRSSLTLEYYKKAYIHIVEQYPYWNRSSGRDHVWSFSWDEGACYAPKEIWNSMMLVHWG 506
Query: 164 ILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQ--AHQIP------------PD 209
+ + ++ T P + P + + A ++P
Sbjct: 507 NTNTKHNHSTTAYWADNWDKISSDKRGTHPCFDPDKDLVLPAWKVPDANVLTSKLWAWSH 566
Query: 210 TPRSIFVYFRGLFYDV--NNDPEGGYYARGARAAVWENFKNNPLFD------------IS 255
R YF G + PE Y+ G R + E F ++P D ++
Sbjct: 567 EKRKTLFYFNGNLGPAYPHGRPE-DTYSMGIRQKLAEEFGSSPNKDGKLGKQHAKDVIVT 625
Query: 256 TDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEI 315
+ Y+ D+ ++FC G WS R+ ++++ GCIPV+I D I LP+ + + ++
Sbjct: 626 PERSENYHLDLASSVFCGVFPG-DGWSGRMEDSILQGCIPVVIQDGIFLPYENVLNYDSF 684
Query: 316 GMFVAEEDVPKLDTIL 331
+ + E ++P L IL
Sbjct: 685 AVRIPEAEIPNLIKIL 700
>gi|159470401|ref|XP_001693348.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158277606|gb|EDP03374.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 356
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 91/331 (27%), Positives = 146/331 (44%), Gaps = 40/331 (12%)
Query: 68 AAEIFMHRFLLSSPVRTLNPEEADWFYTP---IYPTCDLTPTGLPLPFKSPRMMRSAIQL 124
A E F+ F+ + VRT NP EA FY P + + +L P L + +
Sbjct: 27 AYEYFLKYFITDNIVRTENPYEAHLFYVPALNFFYSGNLRPPEYHL--------EAVMDH 78
Query: 125 ISSNWPYWNRTEGADHFFVVPHDFGACFHYQE-EKAIERGILPLLQRATLVQTFGQRN-- 181
+ + WP++NR+ G DHF + D GAC ++ + ++ + + +Q+ L T + N
Sbjct: 79 VKTAWPFYNRSGGRDHFIFLTGDRGACHMPRDMQDSMIKVVHFGMQKQGLNWTSMEHNKE 138
Query: 182 HVCLN-EGSITIPPYAPPQKMQAHQIPPDTPRSIFVYF------------RGLFYDVNND 228
+ C+ + +PP+ K P P YF R + +
Sbjct: 139 YGCIRMRQDLVVPPHPNDHK-------PLWPVGAAAYFQRIAAAGGHDAGRNITFLFAGG 191
Query: 229 PEGGYYARGARAAVWENFKN--NPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLV 286
G Y+ G R AV N +P Y + + R+ FCL G W R++
Sbjct: 192 VGEGEYSGGTRQAVRALLLNITDPAIMFVEGRRDDYVDLLWRSQFCLAAYGHG-WGIRVM 250
Query: 287 EAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLL 346
+++ FGCIPVII D + F D +P+EE + + DVP+L +L S PE + + L
Sbjct: 251 QSIQFGCIPVIIQDHVYQAFEDFLPYEEFSVRLPLRDVPRLLELLRSYSPEQLAALR--L 308
Query: 347 ANPSMKRAMLFPQPAQPGDAFHQILNGLARK 377
RA ++ + Q G+AF L GL R+
Sbjct: 309 GMAKYFRAFIWNRD-QGGEAFEWTLAGLQRR 338
>gi|297820564|ref|XP_002878165.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297324003|gb|EFH54424.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 792
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 96/368 (26%), Positives = 158/368 (42%), Gaps = 68/368 (18%)
Query: 42 VYVYELPSKYNKKLL------------LKDPRCLT----HMFAAEIFMHRFLLSSPVRTL 85
+Y+Y+LP +N L+ + D R T +++ +++ + +L++ RTL
Sbjct: 349 IYIYDLPPDFNSLLIEGRHFKLECVNRIYDERNATVWTDYLYGSQMAFYENILATAHRTL 408
Query: 86 NPEEADWFYTPIYPTC-----DLTP-------TGLPLPFKSPRMMRSAIQLISSNWPYWN 133
N EEAD+F+ P+ +C D P TGL F + + A + I +PYWN
Sbjct: 409 NGEEADFFFVPVLDSCIINRADDAPHINMQNHTGLRSSF-TLEFYKRAYEHIVEKYPYWN 467
Query: 134 RTEGADHFFVVPHDFGACFH----YQEEKAIERGILPLLQRATLVQTFGQR--------- 180
R+ G DH + D GAC+ + + G + +G
Sbjct: 468 RSAGRDHIWFFSWDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYWGDNWDDISDERR 527
Query: 181 -NHVCLN-EGSITIPPYAPPQ--KMQA-HQIPPDTPRSIFVYFRGLF---YDVNNDPEGG 232
+H C + + IP + P M+A + P R YF G Y+ PE
Sbjct: 528 GDHPCFDPRKDLVIPAWKVPDPYSMRANYWARPREKRKTLFYFNGNLGPAYE-KGRPEDS 586
Query: 233 YYARGARAAVWENFKNNPLFD--ISTDH----------PTTYYEDMQRAIFCLCPLGWAP 280
Y+ G R + E F ++P + + H Y++D+ +IFC G
Sbjct: 587 -YSMGIRQKLAEEFGSSPNKEGKLGKQHAEDVIVTPLRSDNYHKDIANSIFCGAFPG-DG 644
Query: 281 WSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVIL 340
WS R+ ++++ GC+PVII D I LP+ + + +E + V+E+D+P L L I
Sbjct: 645 WSGRMEDSILQGCVPVIIQDGIYLPYENMLNYESFAVRVSEDDIPNLINTLRGFSETEI- 703
Query: 341 RKQRLLAN 348
Q LAN
Sbjct: 704 --QFRLAN 709
>gi|302824287|ref|XP_002993788.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300138384|gb|EFJ05154.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 353
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 99/359 (27%), Positives = 152/359 (42%), Gaps = 58/359 (16%)
Query: 29 GDVLEDD---PVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTL 85
DV ++D +LKV+VY P Y K L +A+E + R L+ S T
Sbjct: 31 ADVFQEDYEQMEQQLKVFVYPDPVVYTK---------LAGKYASEGYFFRNLMESRFVTT 81
Query: 86 NPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVP 145
+PEEA F+ PI L GL S + S ++ + + +PYWNRT GADHFFV
Sbjct: 82 DPEEAQLFFVPI-SCARLKEEGLDHDEISDNVA-SFVESVIAKFPYWNRTMGADHFFVTC 139
Query: 146 HDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQ 205
H+ G RAT +N + + S P+ P + + Q
Sbjct: 140 HEIGT-------------------RATSKVAHLVKNSIRVVCASSYSGPFIPHKDVALPQ 180
Query: 206 I--PPDTPRSIFVYFRGLFYDVNNDPEGGYYARGA----RAAVWENFKNNPLFDISTDHP 259
I P +PR D G++A A R + + ++ + IST H
Sbjct: 181 ILQPFPSPRG--------GDDTEKRETLGFWAGPANSKTRILLTKTWQEDSDMVISTKHV 232
Query: 260 TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIP----VIIADDIVLPFADAIPWEEI 315
+ R+ FC+CP G + R+VE++ FGC+P +I++D LPF D + W +
Sbjct: 233 GM--QQFYRSKFCICPSGTRVSTARIVESIHFGCVPGKSYLILSDHYDLPFNDVLDWRKF 290
Query: 316 GMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+ + E+D L L + P L + L A + + P D FH ++ L
Sbjct: 291 AVILPEQDAGTLKDAL-ELAPYATLHRNLLQAQAHFE----WHSPPIKYDTFHMVMYEL 344
>gi|15146187|gb|AAK83577.1| AT5g33290/F19N2_10 [Arabidopsis thaliana]
gi|18252279|gb|AAL62020.1| AT5g33290/F19N2_10 [Arabidopsis thaliana]
Length = 341
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 74/262 (28%), Positives = 120/262 (45%), Gaps = 23/262 (8%)
Query: 116 RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ 175
R++ + ++++ PYWNR++G DHF V HD+ K E+ I L T
Sbjct: 89 RLIEDYVDVVATKHPYWNRSQGGDHFMVSCHDWAPDEIDGNPKLFEKFIRGLCNANT--- 145
Query: 176 TFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPR--SIFVYFRGLFYDVNNDPEGGY 233
+ G R +V ++IP P+ +PR SI +F G +
Sbjct: 146 SEGFRPNV-----DVSIPEIYLPKGKLGPSFLGKSPRVRSILAFFAGRSHG--------- 191
Query: 234 YARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGC 293
R W+ N Y + M + FCLCP GW SPR VEA+ GC
Sbjct: 192 EIRKILFQHWKEMDNEVQVYDRLPPGKDYTKTMGMSKFCLCPSGWEVASPREVEAIYAGC 251
Query: 294 IPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRK-QRLLANPSMK 352
+PVII+D+ LPF+D + W+ + + + ++ TIL S+ L+ +R+L +K
Sbjct: 252 VPVIISDNYSLPFSDVLNWDSFSIQIPVSRIKEIKTILQSVSLVRYLKMYKRVL---EVK 308
Query: 353 RAMLFPQPAQPGDAFHQILNGL 374
+ + +PA+P D H +L+ +
Sbjct: 309 QHFVLNRPAKPYDVMHMMLHSI 330
>gi|356518348|ref|XP_003527841.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
At5g03795-like [Glycine max]
Length = 549
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 88/322 (27%), Positives = 139/322 (43%), Gaps = 33/322 (10%)
Query: 63 LTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRM---MR 119
L ++A+E + + + ++ T +P +A FY P + + + M+
Sbjct: 240 LDGIYASEGWFMKLMEANKXVTGDPGKAHLFYIPFSSRLLQQTLYVRNSHRHSNLIEYMK 299
Query: 120 SAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQ 179
+ +++I+ +P+WNRT GADHF V HD+ E + + L A + F
Sbjct: 300 NYVKMIAGKYPFWNRTSGADHFVVACHDWAPA----ETRGRMLSSIRALCNADIEVGFKI 355
Query: 180 RNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGAR 239
V L E I P + ++ P + R I +F G + Y
Sbjct: 356 GKDVSLPETYIR-SSENPVKNIEG---DPPSQRPILAFFAGGLH---------VYVXPIL 402
Query: 240 AAVWENFKNNPLFDISTDHP-----TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCI 294
WEN P IS P Y + M+ + FC+ G SPR+VEA+ CI
Sbjct: 403 LKHWEN--KEPDMKISGPLPHVRGNVNYIQFMKSSKFCIHARGHEVNSPRVVEAIFHECI 460
Query: 295 PVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRA 354
PVII+D+ + PF + + WE +FV EE++P L IL SI E L + + K
Sbjct: 461 PVIISDNFIPPFFEILNWESFAVFVTEEEIPNLRNILLSISEERYLEMHKRVK----KVQ 516
Query: 355 MLFPQPAQP--GDAFHQILNGL 374
FP A+P D H +L+ +
Sbjct: 517 EHFPWHAEPVKDDLSHMLLHSI 538
>gi|224129246|ref|XP_002320537.1| predicted protein [Populus trichocarpa]
gi|222861310|gb|EEE98852.1| predicted protein [Populus trichocarpa]
Length = 395
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 76/314 (24%), Positives = 139/314 (44%), Gaps = 34/314 (10%)
Query: 71 IFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSA---IQLISS 127
+F+ ++ RT NP+EA ++ P + P+ + R+ +++IS
Sbjct: 95 VFLSLMETNTKFRTSNPDEAHVYFLPFSVVMIIEHLFHPIIRDKAVLERTVSDYVRIISH 154
Query: 128 NWPYWNRTEGADHFFVVPHDFG--ACFHYQEEKAIERGILPLLQRATLVQTFGQRN---- 181
+ YWNR+ GADHF + HD+G A ++ ++ + + +L A + F +
Sbjct: 155 KYLYWNRSLGADHFMLSCHDWGPRATWYVRQ---LYYNSIRVLCNANTSEYFNPKKDASF 211
Query: 182 -HVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARA 240
+ L G IT +PP R++ +F G + R A
Sbjct: 212 PEINLKTGEIT---------GLTGGLPPSN-RTVLAFFAGKMHGK---------LRPALL 252
Query: 241 AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 300
W + + +Y+E M+++ +C+CP G SPR+ EA+ C+PV+I+
Sbjct: 253 QHWMGKDKDVQVYETLPQGISYHEMMKKSKYCICPSGHEVASPRIAEAIYAECVPVLISQ 312
Query: 301 DIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQP 360
+ PF+D + W+ + V ++P L IL IP + LR Q + ++R + P
Sbjct: 313 HYIFPFSDVLNWDSFTIQVPVTEIPNLKNILEGIPEDQYLRMQERVR--QVQRHFVVNNP 370
Query: 361 AQPGDAFHQILNGL 374
+ D FH I++ +
Sbjct: 371 PRRYDVFHMIIHSI 384
>gi|356546124|ref|XP_003541481.1| PREDICTED: probable glycosyltransferase At5g20260-like, partial
[Glycine max]
Length = 290
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 79/298 (26%), Positives = 141/298 (47%), Gaps = 28/298 (9%)
Query: 86 NPEEADWFYTPIYPTCDLTPTGLPLP-FKSPRMMRSAI---QLISSNWPYWNRTEGADHF 141
NP+EA F PI + PL + ++M I +I+ +PYWNRT GADHF
Sbjct: 1 NPDEAHVFMLPISVAQIVRYVYNPLTTYSRDQLMWITIDYTNIIAHRYPYWNRTRGADHF 60
Query: 142 FVVPHDFG--ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQ 199
HD+ + K + + I+ +L A + F V + E ++ + P
Sbjct: 61 LASCHDWAPPDISRAESGKELFKNIIRVLYNANKSEGFKPEKDVPMPEVNLQGFKLSSP- 119
Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN-NPLFDISTDH 258
+ P+ RSI +F GG + R R + +++K+ + +
Sbjct: 120 ---ILGLDPNN-RSILAFF-----------AGGVHGR-IREILLQHWKDKDEEVQVHEYL 163
Query: 259 P--TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG 316
P Y+ M ++ FCLCP G+ SPR+VE++ GC+PVI++D LPF+D + +
Sbjct: 164 PKGVDYHGLMGQSKFCLCPSGYEVASPRIVESINIGCVPVIVSDYYQLPFSDVLDRSKFS 223
Query: 317 MFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+ + + ++ T+L ++P L+ Q+ + ++R + +PA+ + FH IL+ +
Sbjct: 224 LHIPSRRIAEIKTMLKNVPHAKYLKLQKRVM--KVQRHFVLNRPAKSFNVFHMILHSI 279
>gi|357142838|ref|XP_003572711.1| PREDICTED: probable glycosyltransferase At5g20260-like
[Brachypodium distachyon]
Length = 484
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 85/312 (27%), Positives = 138/312 (44%), Gaps = 52/312 (16%)
Query: 86 NPEEADWFYTPIYPTCDL-------TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGA 138
+P+EA+ F PI C+L T P + +++ + +IS PYWNR+ GA
Sbjct: 191 HPDEANVFLLPI-SVCNLVHYVYRLNTTAHLAPLR--KLLADYVAVISDKHPYWNRSGGA 247
Query: 139 DHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRN-----HVCLNEGSITIP 193
DH V HD+ A + + + +L A + + F R V L +G + +P
Sbjct: 248 DHVLVSCHDW-APLVSEGSPELRDNAIRVLCNANVSEGFVPRKDATLPEVNLADGVLRLP 306
Query: 194 PYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWE-NFKNNPLF 252
P++ R+ +F GG RA + + + +P
Sbjct: 307 TQGLPRQN----------RTTLAFF-----------AGGMLGEIRRALLEQWAGREDPEM 345
Query: 253 DISTDHP--------TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL 304
D+ P Y+ M RA FCLCP G+ SPR+VE+V GC+PVII++ L
Sbjct: 346 DVHEYLPPHGGGPGYDDYHALMGRARFCLCPSGFEVASPRVVESVFAGCVPVIISEGYPL 405
Query: 305 PFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE--VILRKQRLLANPSMKRAMLFPQPAQ 362
PF D + W ++ + V +P+L IL + +LR + L A +R + +PA+
Sbjct: 406 PFGDVLDWSKMSVAVPAARIPELKAILRGVSERRYRVLRARVLQA----QRHFVLHRPAR 461
Query: 363 PGDAFHQILNGL 374
D H +L+ +
Sbjct: 462 RFDMIHMVLHSI 473
>gi|356546741|ref|XP_003541781.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 410
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 92/356 (25%), Positives = 152/356 (42%), Gaps = 55/356 (15%)
Query: 39 KLKVYVY--ELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP 96
+ KV+VY P Y PR LT +A+E + + + S T +P A F+ P
Sbjct: 87 EFKVFVYPDGDPETY-----FHTPRKLTGKYASEGYFFKNIRESRFFTDDPRRAHLFFLP 141
Query: 97 IYPTC-DLTPTGLPLPFKSPRMMRSA---IQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
I +C + GL + RM+ ++ + +PYWNRT GADHFFV HD G
Sbjct: 142 I--SCHKMRGRGLT----NERMIDEVEKYVEHLKFEYPYWNRTLGADHFFVTCHDIGV-- 193
Query: 153 HYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPR 212
+G+ +++ + V + + ++G I PQ PP
Sbjct: 194 ------KATKGVPHMMKNSIRVICSSRYD----DDGYIPHKDVTLPQVQLPFFHPPGGN- 242
Query: 213 SIFVYFRGLFYDVNNDPEGGYYARGARA-------AVWENFK----NNPLFDISTDHPTT 261
D+ N ++A + + A+W+N N D+ P
Sbjct: 243 -----------DIKNRNTLAFWAGRSDSRLKEDLIAIWDNDTEIDIQNSRVDLRATGPVV 291
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y E + ++ FCLCP G S R+ +++ FGC+PVI++ LPF D + W + + + E
Sbjct: 292 YMEKLYKSKFCLCPHG-PIGSSRIADSIHFGCVPVIMSKYYDLPFNDILDWTQFSIVLKE 350
Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARK 377
DV +L L SI + + + +++ + P DAFH ++ L R+
Sbjct: 351 TDVYQLKYTLRSISEKHFITLNHNIVK--IQKHFKWNTPPVRQDAFHMVMYELWRR 404
>gi|320163884|gb|EFW40783.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
Length = 478
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 91/399 (22%), Positives = 152/399 (38%), Gaps = 83/399 (20%)
Query: 37 VGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP 96
+ L++Y+Y++ + L+ D + AE L +S T P++A+ F+ P
Sbjct: 103 LDSLRIYMYDIALGREMRWLVDDK------YGAEQLFINLLATSAFHTTAPDKANMFFMP 156
Query: 97 IYPTCDLTPTGLPL--PFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHY 154
T + + + ++ + + +WN + G DHF++ HD G
Sbjct: 157 FRCTAYRRSVQERVLGDIVAKNVTAQYFDVVMNKYRWWNVSSGTDHFYICGHDMGTAVTA 216
Query: 155 QEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPP-------------------- 194
A+ + + LV T + + I++PP
Sbjct: 217 LSHPALVKNAI------GLVNTADYDDARYIPHKDISLPPNIDVLPSAHVATEEEITADL 270
Query: 195 ----------YAPPQKMQAH-----------------------QIPPDTPRSIFVYFRGL 221
Y ++ AH I P R+ YF G
Sbjct: 271 IRLEMARDRLYRATRQKVAHPDMNFEPLMERRMGKLVQYGLGGLIHPREKRTKLAYFAGP 330
Query: 222 FYDVNNDPEGGYYARGARAAVWENFKNNP---LFDISTDHPTTYYEDMQRAIFCLCPLGW 278
+ Y R R V + F N+ LF+ P YY ++ + FCL G+
Sbjct: 331 LH----------YGR-VRPKVRDAFANDTDIVLFEGRHAQPILYYNELATSKFCLFLRGY 379
Query: 279 APWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEV 338
WSPRL++AV GCIPVII+D LP + W E + + E +P+L L ++
Sbjct: 380 RAWSPRLMDAVFMGCIPVIISDHYDLPLGQLLDWSEFSITIPEARIPRLKQTLLAVSDAQ 439
Query: 339 ILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARK 377
+ R Q LA + + ++ P +P DAFH +L L R+
Sbjct: 440 LSRMQNRLAE--VYQHFVWNDPPKPFDAFHMVLWQLWRR 476
>gi|168059257|ref|XP_001781620.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666934|gb|EDQ53576.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 460
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 102/403 (25%), Positives = 161/403 (39%), Gaps = 74/403 (18%)
Query: 34 DDPVGK----LKVYVYELPSKYNKKLLLKDPRC-----------------LTHMFAAEIF 72
DD G LK+Y+YELP K+N +L KD + + E +
Sbjct: 57 DDSTGSYSCPLKIYMYELPRKFNMGMLKKDKNQEIPWTNHVAPPWKQKFEVNKQHSVEYW 116
Query: 73 MHRFLL---------SSPVRTLNPEEADWFYTPIYPTCDLTPTG---------------L 108
+ +LL ++ VR ++PE+AD F+ P + G +
Sbjct: 117 LMVYLLDGWDRKDGKTAAVRVIDPEQADVFFVPFFSALSFNSHGHGMSEGAAADKRLQIV 176
Query: 109 PLPF-----KSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERG 163
L F S ++++ + I S +W ++G DH V H A HY++
Sbjct: 177 LLTFGRHVNASCHLVQAGLVDILSKSKWWQASQGRDHILVAHHP-NALRHYRDM------ 229
Query: 164 ILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAP--PQKMQAHQIPPDTPRSIFVYFRGL 221
L Q +V FG+ + + PY P Q + P + R ++F+G
Sbjct: 230 ---LNQSIFIVADFGRYDKTVARLSKDVVAPYVHVLPSYDQDNPADPFSLRKTLLFFQGR 286
Query: 222 FYDVNNDPEGGYYARGARAAVWENFKNNPLFD--ISTDHPTTYYEDMQRAIFCLCPLGWA 279
+ +G R A + N + D S + T M+ + FCL P G
Sbjct: 287 IH-----RKGDGIVRTKLAELLANNSDVHYVDSLASAEAIATSTAGMRTSRFCLHPAGDT 341
Query: 280 PWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPP 336
P S RL +A+V C+PVII+D I LPF D + +++ +F + E+ K L L SI
Sbjct: 342 PSSCRLFDAIVSHCVPVIISDRIELPFEDDLNYKDFSIFFSSEESVKPGHLLRTLRSITR 401
Query: 337 EVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
E LR L S + P + DA + I + K+P
Sbjct: 402 ERWLRMWNALKTVS--HHFEYQHPPKKDDAVNMIFKQVQHKVP 442
>gi|356511373|ref|XP_003524401.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 643
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 94/347 (27%), Positives = 152/347 (43%), Gaps = 40/347 (11%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTL-NPEEADWFYTPIY 98
LKVY+Y K K + P + ++A+E + + + + L +P +A FY P
Sbjct: 314 LKVYIY----KDGNKPIFHQP-IMKGLYASEGWFMKLMEENKHFVLKDPAKAHLFYMPF- 367
Query: 99 PTCDLTPTGLPLPFKSPR-----MMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFH 153
+ + L + R ++ IS+ + Y+NRT GADHF V HD+
Sbjct: 368 -SSRMLEHALYVRNSHNRTNLRQFLKDYTDKISAKYRYFNRTGGADHFLVACHDWAP--- 423
Query: 154 YQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRS 213
Y+ +E I L A + Q F V L E + PQ+ + P P
Sbjct: 424 YETRHHMEYCIKALCN-ADVTQGFKIGRDVSLPEAYVR--SVRDPQRDLGGKPPHQRP-- 478
Query: 214 IFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP------TTYYEDMQ 267
I ++ G + Y R W++ +P I P Y M+
Sbjct: 479 ILAFYAGNMHG---------YLRPILLKHWKD--KDPDMKIYGPMPHGAASKMNYINHMK 527
Query: 268 RAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKL 327
+ +C+CP G+ SPR+VEA+ + C+PVII+D+ V PF + + W+ + +AE+D+P L
Sbjct: 528 NSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEVLNWDAFSIILAEKDIPNL 587
Query: 328 DTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
IL S+ E L+ Q L + K +P + D FH L+ +
Sbjct: 588 KQILLSVSQEKYLKLQ-LGVRKAQKHFFWHVKPLKY-DLFHMTLHSI 632
>gi|302823230|ref|XP_002993269.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300138939|gb|EFJ05690.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 353
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 98/359 (27%), Positives = 152/359 (42%), Gaps = 58/359 (16%)
Query: 29 GDVLEDD---PVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTL 85
DV ++D +LKV+VY P Y K L +A+E + R L+ S T
Sbjct: 31 ADVFQEDYEQMEQQLKVFVYPDPVVYTK---------LAGKYASEGYFFRNLMESRFVTT 81
Query: 86 NPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVP 145
+PE+A F+ PI L GL S + S ++ + + +PYWNRT GADHFFV
Sbjct: 82 DPEKAQLFFVPI-SCARLREEGLDHDEISDNVA-SFVESVIAKFPYWNRTMGADHFFVTC 139
Query: 146 HDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQ 205
H+ G RAT +N + + S P+ P + + Q
Sbjct: 140 HEIGT-------------------RATSKVAHLVKNSIRVVCASSYSGPFIPHKDVALPQ 180
Query: 206 I--PPDTPRSIFVYFRGLFYDVNNDPEGGYYARGA----RAAVWENFKNNPLFDISTDHP 259
I P +PR D G++A A R + + ++ + IST H
Sbjct: 181 ILQPFPSPRG--------GDDTEKRETLGFWAGPANSKTRILLTKTWQEDSDMVISTKHV 232
Query: 260 TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIP----VIIADDIVLPFADAIPWEEI 315
+ R+ FC+CP G + R+VE++ FGC+P +I++D LPF D + W +
Sbjct: 233 GM--QQFYRSKFCICPSGTRVSTARIVESIHFGCVPGKSFLILSDHYDLPFNDVLDWRKF 290
Query: 316 GMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+ + E+D L L + P L + L A + + P D FH ++ L
Sbjct: 291 AVILPEQDAGTLKDAL-ELAPYATLHRNLLQAQAHFE----WHSPPIKYDTFHMVMYEL 344
>gi|224284379|gb|ACN39924.1| unknown [Picea sitchensis]
Length = 787
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 92/353 (26%), Positives = 148/353 (41%), Gaps = 63/353 (17%)
Query: 42 VYVYELPSKYNKKLL------------LKDPRCLT----HMFAAEIFMHRFLLSSPVRTL 85
+Y+Y+LP ++N LL + D R + ++ +++ ++ LL+SP RTL
Sbjct: 340 IYIYDLPPEFNIHLLEGRHFRFQCVNRIYDDRNKSLWTDQLYGSQMALYESLLASPYRTL 399
Query: 86 NPEEADWFYTPIYPTCDLTPTGLP--LPFKSPRMMRSAIQL---------ISSNWPYWNR 134
N EEAD+FY P+ C +T L K+ +RS + L I ++ YWNR
Sbjct: 400 NGEEADYFYVPVLDACLITRADDAPHLSMKNHMGLRSYLTLDFYKKAYDHIMEHYTYWNR 459
Query: 135 TEGADHFFVVPHDFGACFHYQE--------------EKAIERGILPLLQRATLVQTFGQR 180
+ G DH + D GAC+ +E K L + +
Sbjct: 460 SSGHDHIWFFAWDEGACYAPKEIWNSMMLVHWGNTNSKHNHSTTAYLADNWDHIPIERRG 519
Query: 181 NHVCLN-EGSITIPPYAPPQKMQAHQ---IPPDTPRSIFVYFRGLFYDV--NNDPEGGYY 234
H C + E + +P + P R YF G NN PE Y
Sbjct: 520 RHPCFDPEKDLVLPAWKRPDPYNVKARFWARSRRERFTLFYFNGNLGASFKNNRPEPT-Y 578
Query: 235 ARGARAAVWENFKNNP-------------LFDISTDHPTTYYEDMQRAIFCLCPLGWAPW 281
+ G R + F + P + +S P YY ++ ++FC G W
Sbjct: 579 SLGIRQKLAAEFASEPNKEGKFGRQSTKDVIVVSQKSP-NYYSELGSSLFCGVFPG-DGW 636
Query: 282 SPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSI 334
S R+ ++V+ GCIPVII D I + + + + ++ + +AE+D+P L IL I
Sbjct: 637 SGRMEDSVLQGCIPVIIQDGIQVAYENVLNYDSFAVRIAEDDIPHLVQILRGI 689
>gi|302845214|ref|XP_002954146.1| hypothetical protein VOLCADRAFT_118607 [Volvox carteri f.
nagariensis]
gi|300260645|gb|EFJ44863.1| hypothetical protein VOLCADRAFT_118607 [Volvox carteri f.
nagariensis]
Length = 401
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 91/363 (25%), Positives = 155/363 (42%), Gaps = 62/363 (17%)
Query: 77 LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTE 136
LLSS VRT + + AD+F+ P+ + +S + + + I +WP+W R
Sbjct: 4 LLSSGVRTADGDAADFFFIPLV---------MRTKGQSANHLTAVVSYIQQHWPWWGRYG 54
Query: 137 GAD-HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGS------ 189
G H VVP D G + + +L L++ T + +G +H +EG+
Sbjct: 55 GGHRHLLVVPADLG-------RRMLPEELLKLVENVTFLTHWG--SHTNHSEGAWVESHR 105
Query: 190 ----ITIPP---------YAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYAR 236
I +PP ++P + + + R+ ++F G ++P G
Sbjct: 106 PGKDIVVPPLHNADEPIVFSPLHTLHSKR---RRQRTSGLFFSGRICSDGSEPHRGRCRT 162
Query: 237 GA----RAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFG 292
+ R V ++ N + ++T Y + FCL P G + R V+A V G
Sbjct: 163 NSQGNVRHKVLKHHWNRTTWTLTT-RAKAYASALSSHTFCLSP-GGGGYGRRSVQAAVMG 220
Query: 293 CIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVI--LRKQRLLANPS 350
C+PV+I D + PF + W + M V E+D+P L TIL S+ I +++Q A
Sbjct: 221 CVPVLIGDGLHQPFEPELDWSQFSMSVPEQDIPHLHTILESMNSSTIAAMQEQLRCAAQH 280
Query: 351 MKRAMLFPQP-AQPG--DAFHQILNGLARKLPH---DKSTYLKPGGKFLNW-------TS 397
+ + F + + G DAF ++ L + H D S Y + +F ++ T
Sbjct: 281 LYYSTTFGEVMGEDGRYDAFETLMEVLRMRRDHPSLDPSEYARTDSRFADFINCKLEPTG 340
Query: 398 GPV 400
GPV
Sbjct: 341 GPV 343
>gi|159470093|ref|XP_001693194.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158277452|gb|EDP03220.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 673
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 93/347 (26%), Positives = 146/347 (42%), Gaps = 70/347 (20%)
Query: 42 VYVYELPSKYNKKLL---LKDPRCLTH---------------MFAAEIFMHRFLLSSPVR 83
+YVY+L Y ++L + P C+ ++AA+ +H LL SP R
Sbjct: 254 IYVYDLEPLYQARILQYRVSPPWCVFRRHDLPANRTEWSDMWVYAADTLLHELLLVSPHR 313
Query: 84 TLNPEEADWFYTPIYPTCDLTPTGLPLP---------FKSP---------RMMRSAIQLI 125
T +PEEAD+FY P +C LP P FK P M+ I
Sbjct: 314 TFDPEEADFFYVPHQASC------LPFPIGNWADWPWFKGPGGPRIRQMLNMIMETRDWI 367
Query: 126 SSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCL 185
++P+W R G DH + HD GAC+ P + ++ T R
Sbjct: 368 DQHYPFWKRRGGRDHIWTFTHDEGACW------------APNVLNTSIWLTHWGRMDPDH 415
Query: 186 NEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGL-FYDVNNDPEGGYYARGARAAVWE 244
+ +P A+Q P V+ +G Y D + R +
Sbjct: 416 TSNTAFVPDRYDRDFKSAYQ-----PEGYRVHMQGHPCYRPGQDLVIPAFKRP------D 464
Query: 245 NFKNNPLFDIST---DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 301
+++ +PL ++ + P Y + + R++FCL G WS RL +AV+ GCIPVII D+
Sbjct: 465 HYRASPLAAATSKPRELPGDYSDMLSRSLFCLVAAG-DGWSARLEDAVLHGCIPVIIIDN 523
Query: 302 IVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLAN 348
+ + F + + + +AE DV ++ IL +IP I KQ L +
Sbjct: 524 VHVVFESILDIDSFSVRIAEADVDRILEILQAIPERKIRFKQAHLGH 570
>gi|413926326|gb|AFW66258.1| hypothetical protein ZEAMMB73_148778 [Zea mays]
Length = 267
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 76/274 (27%), Positives = 126/274 (45%), Gaps = 49/274 (17%)
Query: 117 MMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQT 176
+++ ++ + + +PYWNRT GADHFFV HD G +A E G+ +++ + V
Sbjct: 18 IVKDYVEGLINKYPYWNRTLGADHFFVTCHDVGV-------RAFE-GLKFMVKNSIRV-- 67
Query: 177 FGQRNHVCLNEGSITIPPY---APPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGY 233
VC ++ P+ A PQ +Q + D++N G+
Sbjct: 68 ------VCSPSYNVDFIPHKDIALPQVLQPFALHEGGN------------DIDNRVILGF 109
Query: 234 YA-------RGARAAVWENFKNNPLFDISTDHPT------TYYEDMQRAIFCLCPLGWAP 280
+A R A VWEN + IS + + Y + R FC+CP G
Sbjct: 110 WAGHRNSKIRVILARVWEN---DTELAISNNRISRAIGELVYQKQFYRTKFCICPGGSQV 166
Query: 281 WSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVIL 340
S R+ +++ +GC+PVI++D LPF DA+ W + + + E DV +L IL SI E +
Sbjct: 167 NSARISDSIHYGCVPVILSDYYDLPFNDALDWRKFAVILRERDVYQLKNILKSISQEEFI 226
Query: 341 RKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+ L +++ ++ P DAFH ++ L
Sbjct: 227 SLHKSLV--QVQKHFVWHSPPVSYDAFHMVMYEL 258
>gi|357125540|ref|XP_003564451.1| PREDICTED: probable glucuronosyltransferase GUT1-like [Brachypodium
distachyon]
Length = 495
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 98/360 (27%), Positives = 156/360 (43%), Gaps = 58/360 (16%)
Query: 40 LKVYVYELPSKYNKKLL--LKDP-RCLTHMFAAEIFMHRF-------------------- 76
L+VYVYE+P K+ LL +D R T++ + +HR
Sbjct: 109 LRVYVYEMPRKFTYDLLRLFRDSYRDTTNLTSNGSPVHRLIEQHSIDYWLWADLIAPESQ 168
Query: 77 -LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRT 135
LL + +R EEAD FY P + T L K + R A++ ++ P W R+
Sbjct: 169 RLLKNVIRVQQQEEADIFYVPFFTTISYF---LLEKQKCKALYREALKWVTDQ-PAWQRS 224
Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
EG DH V H + ++ + + + LL + + V L + I PY
Sbjct: 225 EGRDHIIPVHHPWS----FKSVRRFVKKAIWLLPDMDSTGNWYKPGQVYLEKD--VILPY 278
Query: 196 APPQKMQAHQIPPDT--PRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFD 253
P + ++ +T RS+ ++FRG GG R+ + K+ D
Sbjct: 279 VPNVDLCDYKCASETQSKRSMLLFFRGRL----KRNAGG----KVRSKLVTELKDAE--D 328
Query: 254 ISTDHPTTYYED-------MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 306
+ + T E M++++FCL P G P S RL +A+V GCIPVII+D++ LPF
Sbjct: 329 VVIEEGTAGAEGKVAAQNGMRKSLFCLNPAGDTPSSARLFDAIVSGCIPVIISDELELPF 388
Query: 307 ADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP 363
+ + +I +FV+ D + L L I + + Q L S R ++ +PAQP
Sbjct: 389 EGILDYRKIALFVSSSDALQPGWLVKYLRGIDAKRVREMQSNLVKYS--RHFIYSKPAQP 446
>gi|302790752|ref|XP_002977143.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300155119|gb|EFJ21752.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 342
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 92/347 (26%), Positives = 147/347 (42%), Gaps = 65/347 (18%)
Query: 58 KDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKS--- 114
+ PR +T +A+E + + L S T NP +A F+ PI +C +P S
Sbjct: 20 QTPRKITGKYASEGYFFQNLRESKFVTKNPNKAHLFFIPI--SCHKMRGKVPYYLTSNWN 77
Query: 115 ----------PRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGI 164
+++ ++ + +PYWNRT GADHFFV HD GA +A + +
Sbjct: 78 KMQGISYEKMADIVQEYVEGLIVKYPYWNRTLGADHFFVTCHDVGA-------RATNK-V 129
Query: 165 LPLLQRATLVQTFGQRNHVC---LNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGL 221
L++ + V VC N I A PQ +Q +P
Sbjct: 130 ANLVKNSIRV--------VCSPSYNGDFIPHKDIAMPQVLQPFALPRGGN---------- 171
Query: 222 FYDVNNDPEGGYYA-------RGARAAVWEN-----FKNNPLFDISTDHPTTYYEDMQRA 269
DV N G++A R A +WE NN + + + Y + R+
Sbjct: 172 --DVRNRTILGFWAGHRNSKIRVVLAKLWEEDDVLAISNNRISRATGE--LVYQKQFYRS 227
Query: 270 IFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDT 329
FC+CP G S R+V+++ +GC+PVI++D LPF D + W+ + + E DV L
Sbjct: 228 KFCICPGGSQVNSARIVDSIHYGCVPVILSDHYDLPFNDVLDWKRFALLLRERDVGDLKL 287
Query: 330 ILTSIPPEVILRKQRLLAN--PSMKRAMLFPQPAQPGDAFHQILNGL 374
S + L+A+ ++ + P +P DAFH ++ L
Sbjct: 288 FFFSFFLFS---SRVLIASLFRQVQDRFEWHTPPRPYDAFHMVVYEL 331
>gi|159489064|ref|XP_001702517.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158280539|gb|EDP06296.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 489
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 85/335 (25%), Positives = 139/335 (41%), Gaps = 63/335 (18%)
Query: 40 LKVYVYELPSKYNK-KLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
+K+YVYELP N + + R L ++ + R +LSS VRT + + AD+++ P+
Sbjct: 119 VKIYVYELPPVANTWTYIARIDRPL-----VQVLLQR-MLSSGVRTTDGDSADYYFIPLL 172
Query: 99 PTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGAD-HFFVVPHDFGACFHYQEE 157
+ + + + + + +WP+W+RT G H V P D G
Sbjct: 173 ---------MRTRTHTVNHLAAVVHYVRKHWPWWDRTGGGHRHLLVAPGDIG-------R 216
Query: 158 KAIERGILPLLQRATLVQTFG-QRNHVCLN-------EGSITIPPYAPPQK------MQA 203
+ + +L + + T + +G RNH N I +PP PP + + A
Sbjct: 217 RILTPELLHMTENCTFLTHWGLHRNHSGGNWLASHRPGKDIVVPPLTPPDEPIVYSPLHA 276
Query: 204 HQIPPDTPRSIFVYFRGLFYDVNNDPEGGY-------YARGARAAVWENFKNNPLFDIST 256
R ++F G N P G Y+ G R V + N P + I+T
Sbjct: 277 TLKKNRKSRLGELFFAGRICGDNQKPTDGKCSEKRQDYSAGTRQQVAHHHWNRPNWTITT 336
Query: 257 DHPTTYYEDMQRAIFCLCPLGWA-----------------PWSPRLVEAVVFGCIPVIIA 299
H Y E + IFCL P G A + R V++++ GCIPV +
Sbjct: 337 -HTPAYAEALSTHIFCLSPTGTARLSHARLDLSTLQRPGGGYGRRSVQSLLMGCIPVTVT 395
Query: 300 DDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSI 334
D + PF + W + V E+D+ +L +LT +
Sbjct: 396 DHVHQPFEPEVDWARFSVPVREDDIAQLHHVLTGL 430
>gi|449520748|ref|XP_004167395.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
IRX7-like [Cucumis sativus]
Length = 517
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 101/367 (27%), Positives = 156/367 (42%), Gaps = 63/367 (17%)
Query: 36 PVGK-LKVYVYELPSKYNKKLL---LKDPRCLTHMFAAEIFMHRF--------------- 76
P+G ++VYVYE+P K+ LL R +++ + +HR
Sbjct: 123 PLGSPIRVYVYEMPWKFTYDLLWTFRNTYRETSNLTSNGSPVHRLIEQHSIDYWLWADLI 182
Query: 77 ------LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWP 130
LL VR EEAD FY P + T L + + R A++ ++ P
Sbjct: 183 APESERLLKGVVRVYRQEEADLFYIPFFTTISFF---LLEKQQCKALYREALKWVTDQ-P 238
Query: 131 YWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSI 190
W R+EG DH V H + ++ + + + LL + + V L + I
Sbjct: 239 AWKRSEGRDHILPVHHPWS----FKTVRKFMKNAIWLLPDMDSTGNWYKPGQVFLEKDLI 294
Query: 191 TIPPYAP-----PQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYAR------GAR 239
PY P +K ++Q + RSI ++FRG + + G A+ GA
Sbjct: 295 L--PYVPNVELCDRKCLSYQ---QSKRSILLFFRG---RLKRNAGGKIRAKLGGELSGAD 346
Query: 240 AAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIA 299
+ E M+++IFCL P G P S RL +A+V GCIPVI++
Sbjct: 347 DVLIEEGTAG------EGGKAAAQTGMRKSIFCLSPAGDTPSSARLFDAIVSGCIPVIVS 400
Query: 300 DDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAML 356
D++ LPF + + +I +FV+ D K L T L S I R Q+ LA + R +
Sbjct: 401 DELELPFEGILDYRKIALFVSSSDALKSGWLLTYLRSFSAADIRRLQQNLAK--LSRHFI 458
Query: 357 FPQPAQP 363
+ PAQP
Sbjct: 459 YSSPAQP 465
>gi|297833558|ref|XP_002884661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297330501|gb|EFH60920.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 470
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 98/360 (27%), Positives = 157/360 (43%), Gaps = 67/360 (18%)
Query: 40 LKVYVYEL--PSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPI 97
K+YVYE P ++ L KD + +F FM +L RT +P++A ++ P
Sbjct: 143 FKIYVYEEGDPPIFHYGLC-KDIYSMEGLFLN--FMENDVLK--YRTRDPDKAHVYFLPF 197
Query: 98 YPTCDLTPTGLPLPFKSPRMMRSAI----QLISSNWPYWNRTEGADHFFVVPHDFGACFH 153
L P+ + ++ S I Q+IS +PYWN ++G DHF + HD+G
Sbjct: 198 SVVMILHHLFDPV-VRDKAVLESVIADYVQIISKKYPYWNTSDGFDHFMLSCHDWG---- 252
Query: 154 YQEEKAIERGILPLLQRAT--LVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTP 211
RAT + + F V N I Y P+K D P
Sbjct: 253 ---------------HRATWYVKKLFFNSIRVLCNAN---ISEYFNPEK--------DAP 286
Query: 212 RSIFVYFRGLFYDVNNDPEG---------GYYARGARAAV-------WENFKNNPLFDIS 255
F L ++NN G ++A + + W+ + L +
Sbjct: 287 ---FPEINLLTGEINNLTGGLDPISRKTLAFFAGKSHGKIRPVLLNHWKEKDKDILVYEN 343
Query: 256 TDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEI 315
Y E M++ FC+CP G SPR+ EA+ GC+PV+I+++ VLPF+D + WE+
Sbjct: 344 LPEDLNYTEMMRKTRFCICPSGHEVASPRIPEAIYSGCVPVLISENYVLPFSDVLNWEKF 403
Query: 316 GMFVAEEDVPKLDTILTSIPPEVILRKQRLLAN-PSMKRAMLFPQPAQPGDAFHQILNGL 374
+ V+ +++P+L IL IP E R RL +K +L P + D F+ I++ +
Sbjct: 404 SVSVSVKEIPELKRILMDIPEE---RYMRLYEGVKKVKMHILVNDPPKRYDVFNMIIHSI 460
>gi|297807715|ref|XP_002871741.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297317578|gb|EFH48000.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 510
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 96/360 (26%), Positives = 153/360 (42%), Gaps = 49/360 (13%)
Query: 36 PVG-KLKVYVYELPSKYNKKLL----------------------LKDPRCLTHMFAAEIF 72
PVG +KVYVYE+P K+ LL L + + + A++
Sbjct: 112 PVGFPIKVYVYEMPKKFTFDLLWLFHNTYKETSNATSNGSPVHRLIEQHSIDYWLWADLI 171
Query: 73 M--HRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWP 130
L S VR ++AD+FY P + T L + + R A++ ++ P
Sbjct: 172 SPESERRLKSVVRVHKQQDADFFYVPFFTTISFF---LLEKQQCKALYREALKWVTDQ-P 227
Query: 131 YWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSI 190
W R+EG DH F + H + ++ + + + LL + + V L + I
Sbjct: 228 AWKRSEGRDHIFPIHHPWS----FKSVRKFVKNAIWLLPDMDSTGNWYKPGQVSLEKDLI 283
Query: 191 TIPPYAPPQKM-QAHQIPPDTP-RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN 248
PY P + A + P R+ ++FRG G R A K
Sbjct: 284 L--PYVPNVDICDAKCLSESAPMRTTLLFFRGRL-----KRNAGGKIRAKLGAELSGVKG 336
Query: 249 NPLFDISTDHPTTYYED--MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 306
+ + + M+R++FCLCP G P S RL +A+V GCIPVI++D++ PF
Sbjct: 337 VIISEGTAGEGGKLAAQGGMRRSLFCLCPAGDTPSSARLFDAIVSGCIPVIVSDELEFPF 396
Query: 307 ADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP 363
+ ++++ + V+ DV + L L S+ P I Q+ LA S R L+ PAQP
Sbjct: 397 EGILDYKKVAVLVSSNDVVQPGWLVNHLRSLTPFQIKELQKNLAQYS--RHFLYSSPAQP 454
>gi|224053619|ref|XP_002297901.1| predicted protein [Populus trichocarpa]
gi|222845159|gb|EEE82706.1| predicted protein [Populus trichocarpa]
Length = 457
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 82/289 (28%), Positives = 127/289 (43%), Gaps = 37/289 (12%)
Query: 65 HMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQL 124
HMF LL SP+ T PE+A +F+ P + K + +
Sbjct: 152 HMFKVS------LLHSPLLTATPEKAHFFFLPFSINDLRNDPRVHSEAKISQFVAQYTSS 205
Query: 125 ISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVC 184
ISS++ +WN + GADHF+V H G +E + G+ + T ++ QR ++
Sbjct: 206 ISSSFRFWNASGGADHFYVCCHSVG-----REAPSRHHGLRNNAIQLTCCSSYFQRFYLS 260
Query: 185 LNEGSITIPPYAPPQKMQAHQIPPDTP-----------RSIFVYFRGLFYDVNNDPEGGY 233
+ + P P+ Q PP R VYF G V N
Sbjct: 261 HKDVGL---PQVWPRTDQTALNPPHASVCYLDVNTYRCRHRLVYFAG---RVQNSQ---- 310
Query: 234 YARGARAAVWENFKNNPLFDISTDHPTTYYED-MQRAIFCLCPLGWAPWSPRLVEAVVFG 292
R +W N+ FDI +PT YE+ +R+ FCL G+ + R+ +A+ +G
Sbjct: 311 -VRQQLVNLW---GNDTQFDIFNGNPTFPYEEGFKRSKFCLHVKGYEVNTARVSDAIHYG 366
Query: 293 CIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILR 341
CIPVII++ LPFA+ + W + + + + D+ L T L SI E+ LR
Sbjct: 367 CIPVIISNYYDLPFANVLDWSKFSVVINQRDIAFLKTKLLSIKREMYLR 415
>gi|42566010|ref|NP_191322.3| exostosin family protein [Arabidopsis thaliana]
gi|44917463|gb|AAS49056.1| At3g57630 [Arabidopsis thaliana]
gi|46931284|gb|AAT06446.1| At3g57630 [Arabidopsis thaliana]
gi|332646159|gb|AEE79680.1| exostosin family protein [Arabidopsis thaliana]
Length = 793
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 92/367 (25%), Positives = 156/367 (42%), Gaps = 68/367 (18%)
Query: 29 GDVLEDDPVGKLK---VYVYELPSKYNKKLL------------LKDPRCLT----HMFAA 69
G+++ V K K +Y+Y+LP +N L+ + D R T +++ +
Sbjct: 334 GNLINLSAVVKKKRPLIYIYDLPPDFNSLLIEGRHFKFECVNRIYDERNATVWTDYLYGS 393
Query: 70 EIFMHRFLLSSPVRTLNPEEADWFYTPIYPTC-----DLTP-------TGLPLPFKSPRM 117
++ + +L++ RT+N EEAD+F+ P+ +C D P TGL +
Sbjct: 394 QMAFYENILATAHRTMNGEEADFFFVPVLDSCIINRADDAPHINMQNHTGLRSSL-TLEF 452
Query: 118 MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFH----YQEEKAIERGILPLLQRATL 173
+ A + I +PYWNR+ G DH + D GAC+ + + G +
Sbjct: 453 YKRAYEHIVEKYPYWNRSAGRDHIWFFSWDEGACYAPKEIWNSMMLVHWGNTNSKHNHST 512
Query: 174 VQTFGQR----------NHVCLN-EGSITIPPYAPP---QKMQAHQIPPDTPRSIFVYFR 219
FG +H C + + IP + P + + P R YF
Sbjct: 513 TAYFGDNWDDISDERRGDHPCFDPRKDLVIPAWKVPDPYSMRKNYWERPREKRKTLFYFN 572
Query: 220 GLF---YDVNNDPEGGYYARGARAAVWENFKNNPLFD--ISTDH----------PTTYYE 264
G Y+ PE Y + G R + E F ++P + + H Y++
Sbjct: 573 GNLGPAYE-KGRPEDSY-SMGIRQKLAEEFGSSPNKEGKLGKQHAEDVIVTPLRSDNYHK 630
Query: 265 DMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDV 324
D+ +IFC G WS R+ ++++ GC+PVII D I LP+ + + +E + V E+D+
Sbjct: 631 DIANSIFCGAFPG-DGWSGRMEDSILQGCVPVIIQDGIYLPYENMLNYESFAVRVNEDDI 689
Query: 325 PKLDTIL 331
P L L
Sbjct: 690 PNLINTL 696
>gi|42572713|ref|NP_974452.1| exostosin family protein [Arabidopsis thaliana]
gi|110740929|dbj|BAE98560.1| hypothetical protein [Arabidopsis thaliana]
gi|332646160|gb|AEE79681.1| exostosin family protein [Arabidopsis thaliana]
Length = 791
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 92/367 (25%), Positives = 156/367 (42%), Gaps = 68/367 (18%)
Query: 29 GDVLEDDPVGKLK---VYVYELPSKYNKKLL------------LKDPRCLT----HMFAA 69
G+++ V K K +Y+Y+LP +N L+ + D R T +++ +
Sbjct: 332 GNLINLSAVVKKKRPLIYIYDLPPDFNSLLIEGRHFKFECVNRIYDERNATVWTDYLYGS 391
Query: 70 EIFMHRFLLSSPVRTLNPEEADWFYTPIYPTC-----DLTP-------TGLPLPFKSPRM 117
++ + +L++ RT+N EEAD+F+ P+ +C D P TGL +
Sbjct: 392 QMAFYENILATAHRTMNGEEADFFFVPVLDSCIINRADDAPHINMQNHTGLRSSL-TLEF 450
Query: 118 MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFH----YQEEKAIERGILPLLQRATL 173
+ A + I +PYWNR+ G DH + D GAC+ + + G +
Sbjct: 451 YKRAYEHIVEKYPYWNRSAGRDHIWFFSWDEGACYAPKEIWNSMMLVHWGNTNSKHNHST 510
Query: 174 VQTFGQR----------NHVCLN-EGSITIPPYAPP---QKMQAHQIPPDTPRSIFVYFR 219
FG +H C + + IP + P + + P R YF
Sbjct: 511 TAYFGDNWDDISDERRGDHPCFDPRKDLVIPAWKVPDPYSMRKNYWERPREKRKTLFYFN 570
Query: 220 GLF---YDVNNDPEGGYYARGARAAVWENFKNNPLFD--ISTDH----------PTTYYE 264
G Y+ PE Y + G R + E F ++P + + H Y++
Sbjct: 571 GNLGPAYE-KGRPEDSY-SMGIRQKLAEEFGSSPNKEGKLGKQHAEDVIVTPLRSDNYHK 628
Query: 265 DMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDV 324
D+ +IFC G WS R+ ++++ GC+PVII D I LP+ + + +E + V E+D+
Sbjct: 629 DIANSIFCGAFPG-DGWSGRMEDSILQGCVPVIIQDGIYLPYENMLNYESFAVRVNEDDI 687
Query: 325 PKLDTIL 331
P L L
Sbjct: 688 PNLINTL 694
>gi|296085067|emb|CBI28482.3| unnamed protein product [Vitis vinifera]
Length = 634
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 78/306 (25%), Positives = 141/306 (46%), Gaps = 42/306 (13%)
Query: 83 RTLNPEEADWFYTPIYPTCDLTPTGLPL---PFKSPRMMRSAIQLISSNWPYWNRTEGAD 139
RT +P+EA ++ P + P+ + ++ +++IS + YWNR+ GAD
Sbjct: 348 RTQDPDEAHVYFLPFSVVMIIHHLFDPIVRDKYVMKHVVSDYVKVISQKYRYWNRSLGAD 407
Query: 140 HFFVVPHDFG--ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNE------GSIT 191
HF + HD+G A ++ + + + LL A + F R + E +I
Sbjct: 408 HFMLSCHDWGPRATWYVPQ---LYYNSIRLLCNANTSECFNPRKDASIPEINLIDGETIG 464
Query: 192 IPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFK-NNP 250
+ PP K R+I +F GG + R R A+ +++K +
Sbjct: 465 LTGGLPPSK-----------RTILAFF-----------AGGLHGR-IRPALLQHWKEKDE 501
Query: 251 LFDISTDHPT--TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
+ P +Y + M+++ +C+CP G SPR+VEA+ C+PV+I+ VLPF+D
Sbjct: 502 QVQVYETLPEGLSYPDLMKKSKYCICPSGHEVASPRIVEAIYAECVPVLISQHYVLPFSD 561
Query: 309 AIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFH 368
+ W + V+ ++P L IL IP + +R Q + +++ + P + D FH
Sbjct: 562 VLDWGSFSIQVSVNEIPNLKKILLGIPQDRYIRMQERVKQ--VQQHFVVNNPPKRFDVFH 619
Query: 369 QILNGL 374
I++ +
Sbjct: 620 MIIHSI 625
>gi|359485860|ref|XP_002264111.2| PREDICTED: probable glycosyltransferase At3g07620-like [Vitis
vinifera]
Length = 410
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 78/304 (25%), Positives = 138/304 (45%), Gaps = 38/304 (12%)
Query: 83 RTLNPEEADWFYTPIYPTCDLTPTGLPL---PFKSPRMMRSAIQLISSNWPYWNRTEGAD 139
RT +P+EA ++ P + P+ + ++ +++IS + YWNR+ GAD
Sbjct: 124 RTQDPDEAHVYFLPFSVVMIIHHLFDPIVRDKYVMKHVVSDYVKVISQKYRYWNRSLGAD 183
Query: 140 HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNE------GSITIP 193
HF + HD+G + + I LL A + F R + E +I +
Sbjct: 184 HFMLSCHDWGPRATWYVPQLYYNSI-RLLCNANTSECFNPRKDASIPEINLIDGETIGLT 242
Query: 194 PYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFK-NNPLF 252
PP K R+I +F GG + R R A+ +++K +
Sbjct: 243 GGLPPSK-----------RTILAFF-----------AGGLHGR-IRPALLQHWKEKDEQV 279
Query: 253 DISTDHPT--TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAI 310
+ P +Y + M+++ +C+CP G SPR+VEA+ C+PV+I+ VLPF+D +
Sbjct: 280 QVYETLPEGLSYPDLMKKSKYCICPSGHEVASPRIVEAIYAECVPVLISQHYVLPFSDVL 339
Query: 311 PWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQI 370
W + V+ ++P L IL IP + +R Q + +++ + P + D FH I
Sbjct: 340 DWGSFSIQVSVNEIPNLKKILLGIPQDRYIRMQERVKQ--VQQHFVVNNPPKRFDVFHMI 397
Query: 371 LNGL 374
++ +
Sbjct: 398 IHSI 401
>gi|223943427|gb|ACN25797.1| unknown [Zea mays]
Length = 380
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 90/302 (29%), Positives = 134/302 (44%), Gaps = 40/302 (13%)
Query: 77 LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTE 136
LL S +R EEAD FY P + T L + + R A++ ++ P W R+E
Sbjct: 55 LLKSVIRVQQQEEADIFYVPFFTTISYF---LLEKQECKALYREALKWVTDQ-PAWQRSE 110
Query: 137 GADHFFVVPHDFGACFHYQEEKAIERGI---LPLLQRATLVQTFGQRNHVCLNEGSITIP 193
G DH V H + K++ R + + LL + + V L + I
Sbjct: 111 GRDHVIPVHHPWSF-------KSVRRSVKKAIWLLPDMDSTGNWYKPGQVYLEKD--VIL 161
Query: 194 PYAPPQKMQAHQ--IPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPL 251
PY P + H+ + + RSI ++FRG GG R+ + E K+
Sbjct: 162 PYVPNVDLCDHKCVLETQSKRSILLFFRGRL----KRNAGG----KIRSKLVEELKSAK- 212
Query: 252 FDISTDHPTT-------YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL 304
DI + +T + M+++ FCL P G P S RL +A+V GCIPVII+D++ L
Sbjct: 213 -DIVIEEGSTGAQGKAAAQDGMRKSFFCLSPAGDTPSSARLFDAIVSGCIPVIISDELEL 271
Query: 305 PFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPA 361
PF + + EI +FV+ D + L L I + I Q L S R L+ PA
Sbjct: 272 PFEGILDYREIALFVSASDAVQPGWLLKYLRGINAKRIREIQSNLVKYS--RHFLYSSPA 329
Query: 362 QP 363
QP
Sbjct: 330 QP 331
>gi|449453962|ref|XP_004144725.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
IRX7-like [Cucumis sativus]
Length = 518
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 101/367 (27%), Positives = 155/367 (42%), Gaps = 63/367 (17%)
Query: 36 PVGK-LKVYVYELPSKYNKKLL---LKDPRCLTHMFAAEIFMHRF--------------- 76
P+G ++VYVYE+P K+ LL R +++ + +HR
Sbjct: 123 PLGSPIRVYVYEMPWKFTYDLLWTFRNTYRETSNLTSNGSPVHRLIEQHSIDYWLWADLI 182
Query: 77 ------LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWP 130
LL VR EEAD FY P + T L + + R A++ ++ P
Sbjct: 183 APESERLLKGVVRVYRQEEADLFYIPFFTTISFF---LLEKQQCKALYREALKWVTDQ-P 238
Query: 131 YWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSI 190
W R+EG DH V H + ++ + + + LL + + V L + I
Sbjct: 239 AWKRSEGRDHILPVHHPWS----FKTVRKFMKNAIWLLPDMDSTGNWYKPGQVFLEKDLI 294
Query: 191 TIPPYAP-----PQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYAR------GAR 239
PY P K ++Q + RSI ++FRG + + G A+ GA
Sbjct: 295 L--PYVPNVELCDSKCLSYQ---QSKRSILLFFRG---RLKRNAGGKIRAKLGGELSGAD 346
Query: 240 AAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIA 299
+ E M+++IFCL P G P S RL +A+V GCIPVI++
Sbjct: 347 DVLIEEGTAG------EGGKAAAQTGMRKSIFCLSPAGDTPSSARLFDAIVSGCIPVIVS 400
Query: 300 DDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAML 356
D++ LPF + + +I +FV+ D K L T L S I R Q+ LA + R +
Sbjct: 401 DELELPFEGILDYRKIALFVSSSDALKSGWLLTYLRSFSAADIRRLQQNLAK--LSRHFI 458
Query: 357 FPQPAQP 363
+ PAQP
Sbjct: 459 YSSPAQP 465
>gi|242065760|ref|XP_002454169.1| hypothetical protein SORBIDRAFT_04g025885 [Sorghum bicolor]
gi|241934000|gb|EES07145.1| hypothetical protein SORBIDRAFT_04g025885 [Sorghum bicolor]
Length = 338
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 81/312 (25%), Positives = 136/312 (43%), Gaps = 43/312 (13%)
Query: 80 SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSP--------RMMRSAIQLISSNWPY 131
+P +P EA F P+ C+L P ++ R + + +++ +PY
Sbjct: 42 NPFAARHPGEAHAFLLPV-SVCNLVQYIYPFYRRNTTAYMAHMRRALADYVDVVAGRYPY 100
Query: 132 WNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRN-----HVCLN 186
WNR+ GADH V HD+ + + + +L A ++F R V L
Sbjct: 101 WNRSRGADHVMVSCHDWAPLVS-EANGELYANAIRVLCNANTSESFRPRKDATLPEVNLG 159
Query: 187 EGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENF 246
+G + P + +PP+ R+ +F G + + R A W
Sbjct: 160 DGLLRRPTFG---------MPPEN-RTTLAFFAGGMHG---------HIRKALLGYWLG- 199
Query: 247 KNNPLFDISTDHPT--TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL 304
+ +P DI P Y+ M A FCLCP G+ SPR+VE+V GC+PVII+D
Sbjct: 200 RKDPDMDIHEYLPKGQDYHALMASARFCLCPSGFEVASPRVVESVFSGCVPVIISDGYPP 259
Query: 305 PFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE--VILRKQRLLANPSMKRAMLFPQPAQ 362
PF+D + W ++ + V +P+L IL + +LR + L A +R + +P+Q
Sbjct: 260 PFSDVLDWSKMSVTVPPARIPELKDILKGVSERRYRVLRARVLQA----QRHFVVHRPSQ 315
Query: 363 PGDAFHQILNGL 374
D +++ +
Sbjct: 316 RFDMIRMVMHSI 327
>gi|242059273|ref|XP_002458782.1| hypothetical protein SORBIDRAFT_03g040220 [Sorghum bicolor]
gi|241930757|gb|EES03902.1| hypothetical protein SORBIDRAFT_03g040220 [Sorghum bicolor]
Length = 426
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 90/360 (25%), Positives = 148/360 (41%), Gaps = 42/360 (11%)
Query: 40 LKVYVYELPSKYNKKLLLKD-------PRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADW 92
L++YVY + LL+ CL + ++ +H+FLL S RT N ++A+
Sbjct: 53 LRIYVYAEDEIQGLRALLRGRDGTITAATCLKGQWGTQVKIHQFLLKSRFRTFNKDQANL 112
Query: 93 FYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
F+ P Y C L S + + + S PY+ R+ G DH FV P GA
Sbjct: 113 FFVPSYVKCVRMTGAL-----SDKEINQTYVKVLSQMPYFRRSGGRDHIFVFPSGAGA-- 165
Query: 153 HYQEEKAIERGILPLLQRATLVQTFG----QRNHVCLNEGSITIPPYAPPQKM-----QA 203
+ R L R+ ++ G +R N I P M +A
Sbjct: 166 ------HLFRSWATFLNRSIILTPEGDRTDKRGTSAFNTWKDIIIPGNVDDSMVKSDARA 219
Query: 204 HQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP---T 260
Q P T R F G + G A + + +P+ +S +
Sbjct: 220 VQPIPLTKRKYLANFLG-----RAQGKAGRLQLVELAKQYPDKLESPVLKLSGPNKLGRI 274
Query: 261 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
Y++ ++ A FCL P G + W+ R E+ C+PVI++D++ LPF + I + EI +
Sbjct: 275 EYFKHLRNAKFCLAPRGESSWTLRFYESFFVECVPVILSDEVELPFQNVIDYGEISIKWP 334
Query: 321 EEDV-PKLDTILTSIPPEVILRKQRLLANPSMKRAM-LFPQPAQPGDAFHQILNGLARKL 378
+ P+L L SI E R + ++ + R + ++ +P A IL L +K+
Sbjct: 335 SSRIGPELLEYLESISDE---RIEEMIGHGRQMRCLWVYAADTEPCSAMSGILTELQKKV 391
>gi|413937765|gb|AFW72316.1| hypothetical protein ZEAMMB73_504926 [Zea mays]
Length = 472
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 81/304 (26%), Positives = 134/304 (44%), Gaps = 30/304 (9%)
Query: 80 SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKS-----PRMMRSAIQLISSNWPYWNR 134
+P +P EA F P+ C+L L + R + + +++ +PYWNR
Sbjct: 179 NPFAARDPGEAHAFLLPV-SVCNLVHYIYRLNTTAYMAHMRRALADYVDVVAHKYPYWNR 237
Query: 135 TEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPP 194
+ GADH V HD+ + + + + +L A + F R L E ++
Sbjct: 238 SRGADHVIVSCHDWAPLVS-EANRELYANAIRVLCNANTSEGFRPRKDATLPEVNLADGL 296
Query: 195 YAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDI 254
P +PP+ R+ +F G + + R A W + +P DI
Sbjct: 297 LRRPTL----GLPPEN-RTTLAFFAGGMHG---------HIRRALLGYWLG-RKDPDMDI 341
Query: 255 STDHPT--TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 312
P Y+ M RA FCLCP G+ SPR+VE+V GC+PVII+D PF+D + W
Sbjct: 342 HEYLPAGQDYHALMARARFCLCPSGFEVASPRVVESVFTGCVPVIISDGYPPPFSDVLDW 401
Query: 313 EEIGMFVAEEDVPKLDTILTSIPPE--VILRKQRLLANPSMKRAMLFPQPAQPGDAFHQI 370
++ + V +P+L +L + +LR + L A +R + +PA+ D +
Sbjct: 402 SKMSVTVPPARIPELKAVLKGVSERRYRVLRARVLQA----QRHFVVHRPARRFDMIRMV 457
Query: 371 LNGL 374
L+ +
Sbjct: 458 LHSI 461
>gi|307104488|gb|EFN52741.1| hypothetical protein CHLNCDRAFT_138314 [Chlorella variabilis]
Length = 647
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 91/343 (26%), Positives = 147/343 (42%), Gaps = 64/343 (18%)
Query: 64 THMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPT--------------GLP 109
T ++ E + +L S RTL+PEEAD+FY P++P+C + P
Sbjct: 231 TWTYSLEFGLLEMMLQSEHRTLDPEEADFFYVPVFPSCFIWPVRSTADSLRDFYYGWAQS 290
Query: 110 LPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQ 169
+ ++ A + +++PYW+R G DH ++V HD +C+ K+ IL
Sbjct: 291 RVQGAANLLLEAYHWLRAHYPYWDRRGGRDHIWLVTHDEASCYVPAAIKSASI-ILSHWG 349
Query: 170 RATLVQTFGQ---RNHVCLN--------EGS-----ITIPPYAPPQKM---------QAH 204
R T G N LN EGS ++ P + P + + H
Sbjct: 350 RKDPNHTSGTGFPGNVYHLNVSHPHWEPEGSMAKLDLSQPCHDPVKDLVLPLMKTPDHYH 409
Query: 205 QIP----PDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFD----IST 256
Q P P R+ + RG + + PE Y+RG R +W + + D +
Sbjct: 410 QSPLVGAPTRNRTWLAFHRGRQHKTDA-PE---YSRGVRQRLWSASQEHGWLDKYGILLG 465
Query: 257 DHPTT-----------YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
++P++ Y + + +IFCL G WS R+ +A + GCIPVI+ D++ +
Sbjct: 466 ENPSSPGAEEVKLAGDYSQLLASSIFCLVLPG-DGWSARMDDATLHGCIPVIVMDEVDVS 524
Query: 306 FADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLAN 348
F I ++ + VA+ DV +L IL I E QR L
Sbjct: 525 FESVIDLQQFTVRVAQADVERLPEILLEISQERRQEMQRALGR 567
>gi|225461772|ref|XP_002285599.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
IRX7-like [Vitis vinifera]
Length = 513
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 96/366 (26%), Positives = 155/366 (42%), Gaps = 47/366 (12%)
Query: 40 LKVYVYELPSKYNKKLL----------------------LKDPRCLTHMFAAEIFM--HR 75
++VYVYE+P+K+ LL L + + + A++
Sbjct: 128 IRVYVYEMPAKFTYDLLWLFRNTYKETSNRTSNGSPVHRLIEQHSIDYWLWADLTAPESE 187
Query: 76 FLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRT 135
LL + VR EEAD FY P + T L P + + R A++ ++ P W R+
Sbjct: 188 RLLKNVVRVHRQEEADLFYIPFFTTISFF---LLEPEQWKPLYREALKWVTDQ-PAWKRS 243
Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
EG DH V H + ++ + + + LL + + V L + I PY
Sbjct: 244 EGRDHILPVHHPWS----FKTVRKSMKNAIWLLPDMDSTGNWYKPGQVSLEKDLIL--PY 297
Query: 196 APPQKMQAHQIPPDT--PRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFD 253
P + + ++ R ++FRG + + G A+ ++
Sbjct: 298 VPNVDLCDAKCSSESESKRKTLLFFRG---RLKRNAGGKIRAKLMAELSGDDGVVIQEGT 354
Query: 254 ISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE 313
M+++IFCL P G P S RL +A+V GCIPVI++D++ LPF + +
Sbjct: 355 AGEGGKEAAQRGMRKSIFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEGILDYR 414
Query: 314 EIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQ---PGDAF 367
+I +FV+ D + L T L SI P I QR LA S R ++ PAQ P D
Sbjct: 415 KIALFVSSSDAMQPGWLLTFLKSISPAQIKEMQRNLAKYS--RHFVYSSPAQLLGPEDLV 472
Query: 368 HQILNG 373
+++ G
Sbjct: 473 WRMMAG 478
>gi|356557595|ref|XP_003547101.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
At5g03795-like [Glycine max]
Length = 412
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 83/327 (25%), Positives = 140/327 (42%), Gaps = 48/327 (14%)
Query: 60 PRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTC-DLTPTGLPLPFKSPRMM 118
PR LT +A+E + + + S T +P A F+ PI +C + GL + RM+
Sbjct: 107 PRKLTGKYASEGYFFKNIRESRFFTDDPRRAHLFFLPI--SCHKMRGRGLTIE----RMI 160
Query: 119 RSA---IQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ 175
++ + +PYWNRT GADHFFV HD G +G+ L + + V
Sbjct: 161 DEVEKYVEHLKLKYPYWNRTLGADHFFVTCHDIGV--------KATKGVPHLTKNSIRVA 212
Query: 176 TFGQRN------HVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDP 229
+ H + + +P + PP + R+ F ++ G
Sbjct: 213 CSSSYDDDDYVPHKDVTLPQVQLPFFHPPGENDIKN------RNTFAFWAG--------- 257
Query: 230 EGGYYARGARAAVWENFK----NNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRL 285
+ A+W+N N D+ P Y E + ++ FCLCP G P L
Sbjct: 258 RSDSRLKDDLMAMWDNDTELDIQNXRVDLRATGPVVYMEKLYKSKFCLCPHG--PVGNSL 315
Query: 286 V-EAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQR 344
+ +++ FGC+PVI+ + LPF D + W + + + E ++ L IL SI + + R
Sbjct: 316 IADSIHFGCVPVIMPNYYDLPFNDILDWSQFSVVLKETNIYLLKDILRSISEKHFISLNR 375
Query: 345 LLANPSMKRAMLFPQPAQPGDAFHQIL 371
+ +++ + P DAFH ++
Sbjct: 376 NIV--KIQKHFKWNTPPVRQDAFHMVM 400
>gi|224129250|ref|XP_002320538.1| predicted protein [Populus trichocarpa]
gi|222861311|gb|EEE98853.1| predicted protein [Populus trichocarpa]
Length = 332
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 82/343 (23%), Positives = 141/343 (41%), Gaps = 34/343 (9%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
K+Y+Y K + D C + + F+H T +P+EA ++ P
Sbjct: 5 FKIYIY----KEGDPPMFHDGPCKSIYSSEGRFIHELEKGKSFTTTDPDEALVYFLPFSV 60
Query: 100 TCDLTPTGLPLPFKSPRMMRSA---IQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQE 156
+ +P + + + I +I+ +P+WNR+ GADHF + HD+G
Sbjct: 61 VMLVQYLYVPGSHEIDAIGNTVVDYINVIADKYPFWNRSLGADHFILSCHDWGPRTSSYV 120
Query: 157 EKAIERGILPLLQRATLVQTFGQRN-----HVCLNEGSITIPPYAPPQKMQAHQIPPDTP 211
I +L A + F + + L G IT P +
Sbjct: 121 PHLFNNSI-RVLCNANTSEGFNPKKDASFPEIHLRTGEITGLVGGPSP----------SR 169
Query: 212 RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIF 271
RSI +F G + + R W++ + + +Y ++ + F
Sbjct: 170 RSILAFFAGRLHG---------HIRRLLLEQWKDKDQDVQVHDQLRNGMSYDSMLKNSRF 220
Query: 272 CLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTIL 331
CLCP G+ SPR+VEA+ C+PV+I+D V PF+D + W+ + V +D+PK+ IL
Sbjct: 221 CLCPSGYEVASPRIVEAIYAECVPVLISDGYVPPFSDVLNWKAFSIQVQVKDIPKIKDIL 280
Query: 332 TSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
I LR QR + ++R + + D FH ++ +
Sbjct: 281 MGISQRQYLRMQRRVKQ--VQRHFVVNGIPKRFDVFHMTIHSI 321
>gi|168051833|ref|XP_001778357.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162670236|gb|EDQ56808.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 406
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 94/383 (24%), Positives = 160/383 (41%), Gaps = 67/383 (17%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRC------------------LTHMFAAEIFMHRFLLS-- 79
LKVY+Y++P K+N L+ D + + + E +M +LL
Sbjct: 31 LKVYMYDIPRKFNFGLMTMDNKNEDLPWGNHAAPPWSQQWEVNKQHSVEYWMTVYLLDGW 90
Query: 80 -------SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLP---FKSPRMMRSAIQLISSNW 129
+ +R +P +AD F+ P + + G + + + ++ + I N
Sbjct: 91 DRKDGRRAAIRVRDPYQADVFFVPFFASLSFNNYGYGMEGPGAELDKNLQECVVNILLNS 150
Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATL-VQTFGQ--RNHVCLN 186
+W ++G DH V+ H A HY+ LL + L V FG+ + CL
Sbjct: 151 KWWKASQGRDHVIVLHHP-NAFRHYRH----------LLNSSMLIVADFGRFSTDVACL- 198
Query: 187 EGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENF 246
+ I P Q + R I +YF+G ++ +G A+ A+A + E
Sbjct: 199 QKDIVAPYEHVVQSYVDDHSNSFSQRHILLYFQG---RIHRKADGIVRAKLAKALMNEKD 255
Query: 247 KNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 306
+ + S++ M+ + FCL P G P S RL +A+V C+PVI++D I LPF
Sbjct: 256 VHYMDSEASSEALAEATSGMRSSRFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDRIELPF 315
Query: 307 ADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLAN----------PSMKRAML 356
D I + E +F + E+ ++ P+ +LR R + ++
Sbjct: 316 EDDIDYNEFSLFFSSEE---------AVRPQYLLRILRGINETKWTQMWTKLKAVSHHFE 366
Query: 357 FPQPAQPGDAFHQILNGLARKLP 379
F PA+ DA + I + RKLP
Sbjct: 367 FQHPAKKDDAVNMIFKQVQRKLP 389
>gi|356503460|ref|XP_003520526.1| PREDICTED: uncharacterized protein LOC100775594 [Glycine max]
Length = 761
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 92/383 (24%), Positives = 157/383 (40%), Gaps = 74/383 (19%)
Query: 15 ADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLK---DPRCLT------- 64
ADN E++ + V + P+ +YVY+LP +N LL C+
Sbjct: 296 ADNQHFDEKVINAKAVVAKKRPL----IYVYDLPPVFNSLLLEGRHFKQNCVNRLYDVYN 351
Query: 65 ------HMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTC--DLTPTGLPLPFKSPR 116
++ A+I ++ +L+SP RTLN +EAD+F+ P+ +C D L ++
Sbjct: 352 ATIWTDELYGAQIALYESILASPHRTLNGDEADFFFVPVLDSCLIDRADHAPHLSTQNHE 411
Query: 117 MMRSAIQL---------ISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPL 167
+RS + L I +PYWN + G DH + D GAC+ K I ++ +
Sbjct: 412 GLRSFLTLDFYKNAYNHIVEQYPYWNCSSGRDHIWFFSWDEGACY---APKEIWSSMMLV 468
Query: 168 LQRATLVQTFGQRN-----------------HVCLN-EGSITIPPYAPPQKMQAHQIP-- 207
T + + H C + E + IP + + H +
Sbjct: 469 HWGNTNTKHYHSTTAYCPDNWDGIPSDRRGFHPCFDPEKDLVIPAW---KVTHVHVLSSK 525
Query: 208 ----PDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFD---------- 253
P R YF G +Y+ G R + E F + P +
Sbjct: 526 LWAWPLEKRKTLFYFNGNLGPAYPYGRNEWYSMGIRQKLAEEFGSKPNKEGKLGKQRAKD 585
Query: 254 --ISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
++ + Y ++ ++FC G WS R+ ++V+ GCIPVII D I LP+ + +
Sbjct: 586 VVVTAERSENYEVELASSVFCGVLPG-DGWSGRMEDSVLQGCIPVIIQDGIFLPYENVLN 644
Query: 312 WEEIGMFVAEEDVPKLDTILTSI 334
++ + + E+++P L IL I
Sbjct: 645 YDSFAVRIPEDEIPNLIKILRGI 667
>gi|356533217|ref|XP_003535163.1| PREDICTED: uncharacterized protein LOC100807663 [Glycine max]
Length = 795
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 96/383 (25%), Positives = 161/383 (42%), Gaps = 79/383 (20%)
Query: 15 ADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLL------------LKDPRC 62
AD+T E++ V + P+ VYVY+LP ++N LL + D
Sbjct: 331 ADDTHANEKMINLNAVVAKKRPL----VYVYDLPPEFNSLLLEGRHYKLECVNRIYDDNN 386
Query: 63 LT----HMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLP--LPFKSPR 116
+T ++ A+I ++ LL+SP RTLN EEAD+F+ P+ +C +T L +
Sbjct: 387 ITVWTDQLYGAQIALYESLLASPHRTLNGEEADFFFVPVLDSCIITRADDAPHLSMQEHM 446
Query: 117 MMRSAIQL---------ISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPL 167
+RS++ L I +PYW+ + G DH + D GAC+ +E I
Sbjct: 447 GLRSSLTLEYYKNTYTHIVEQYPYWSHSSGRDHIWSFSWDEGACYAPKE-------IWNS 499
Query: 168 LQRATLVQTFGQRNHVC----------LNEGSITIPPYAPPQK---MQAHQIP------- 207
+ T + NH ++ I P P K + A ++P
Sbjct: 500 MMLVHWGNTNTKHNHSTTAYWADNWDKISSDRRGIHPCFDPDKDLVLPAWKVPDAYVLTS 559
Query: 208 -----PDTPRSIFVYFRGLFYDV--NNDPEGGYYARGARAAVWENFKNNPLFD------- 253
R YF G + PE Y+ G R + E F ++P D
Sbjct: 560 KLWARSHEKRKTLFYFNGNLGPAYPHGRPEDT-YSMGIRQKLAEEFGSSPNKDGKLGKQH 618
Query: 254 -----ISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
++ + Y+ D+ ++FC G WS R+ ++++ GCIPV+I D I LP+ +
Sbjct: 619 AKDVIVTPERSEDYHMDLASSVFCGVFPG-DGWSGRMEDSILQGCIPVVIQDGIFLPYEN 677
Query: 309 AIPWEEIGMFVAEEDVPKLDTIL 331
+ ++ + + E ++P L L
Sbjct: 678 VLNYDSFAVRIPEAEIPNLIKTL 700
>gi|168033234|ref|XP_001769121.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162679650|gb|EDQ66095.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 765
Score = 97.8 bits (242), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 102/383 (26%), Positives = 161/383 (42%), Gaps = 75/383 (19%)
Query: 16 DNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLL------------LKDPRCL 63
D + + RIS A ++++ P+ VYVY+LP+++ + L L D
Sbjct: 305 DVSDETNRIS-VASNIMKKRPL----VYVYDLPAEFTTQFLQGRHFKFECVNRLYDVDNA 359
Query: 64 T----HMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTC---------DLTPTGLPL 110
T +++ A I ++ LL+S RT N +EAD+FY P C LT G +
Sbjct: 360 TIWTENLYGAGIALYESLLASEHRTTNGDEADFFYVPFLQACIVEQGDAAPHLTFQGKYM 419
Query: 111 PFKSPRMMRSAIQL---ISSNWPYWNRTEGADHFFVVPHDFGAC------------FHYQ 155
+ + Q+ I N+PYWNR+ G DH + P D GAC H+
Sbjct: 420 GLRQYFAGDYSKQIYFHIQQNYPYWNRSAGRDHIWFFPWDEGACSAPKEIWNSMMLSHWG 479
Query: 156 EEKAIERGILPLLQRAT--LVQTFGQRNHVCLN-EGSITIP------PYAPPQKMQAHQI 206
A + + L+ + +H C + + +P PY Q + +
Sbjct: 480 NTNAKHKASTTAYRADNWDLIPPEWRGDHPCYDPAKDLVLPAWKFPDPYPIVQNLSSRH- 538
Query: 207 PPDTPRSIFVYFRGLF---YDVNNDPEGGYYARGARAAVWENFKNNPLFD---------- 253
R YF G YD N PE G Y+ G R + F + P
Sbjct: 539 --RQDRPTLFYFNGNLGSAYD-NGRPEPG-YSMGIRQKLAAEFGSQPNKKGLLGRQAVDD 594
Query: 254 --ISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
+ Y ++ ++ FC G WS R+ ++++ GCIPVII D I LPF + +
Sbjct: 595 VVVQAQRSPQYKLELSKSRFCGVLPG-DGWSGRMEDSILSGCIPVIIQDGIHLPFENVLD 653
Query: 312 WEEIGMFVAEEDVPKLDTILTSI 334
+E + VAE+++ L TIL +I
Sbjct: 654 YESFTVRVAEDNIHNLITILKAI 676
>gi|147775380|emb|CAN73459.1| hypothetical protein VITISV_022181 [Vitis vinifera]
Length = 319
Score = 97.8 bits (242), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 44/113 (38%), Positives = 69/113 (61%), Gaps = 2/113 (1%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + M + FCLCP GW SPR+VEA+ GC+PVII D VLPF++ + W + + +
Sbjct: 198 YAKSMGDSKFCLCPSGWEVASPRVVEAIAAGCVPVIICDYYVLPFSEVLDWSKFSINITS 257
Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+ +P++ IL ++P E LR Q+ + ++R + +PAQP D H IL+ +
Sbjct: 258 DKIPEIKKILKAVPNERYLRMQKRVKQ--VQRHFVINRPAQPYDMLHMILHSV 308
Score = 43.9 bits (102), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 24/37 (64%)
Query: 116 RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
R++ I ++S +PYWNR+ GADHF V HD+ F
Sbjct: 82 RLVTDYIYVVSDKYPYWNRSNGADHFLVSCHDWVQSF 118
>gi|297597828|ref|NP_001044591.2| Os01g0811400 [Oryza sativa Japonica Group]
gi|255673802|dbj|BAF06505.2| Os01g0811400 [Oryza sativa Japonica Group]
Length = 497
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 96/358 (26%), Positives = 153/358 (42%), Gaps = 54/358 (15%)
Query: 40 LKVYVYELPSKYNKKLL--LKDP-RCLTHMFAAEIFMHRF-------------------- 76
L+VYVYE+PS++ LL +D R +++ + +HR
Sbjct: 113 LRVYVYEMPSRFTYDLLRLFRDSYRETSNLTSNGSPVHRLVEQHSIDYWLWADLIAPESQ 172
Query: 77 -LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRT 135
LL + +R EEAD FY P + T L + + R A++ ++ P W R+
Sbjct: 173 RLLKNVIRVRRQEEADIFYVPFFTTISYF---LLEKQECKALYREALKWVTDQ-PAWQRS 228
Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
EG DH V H + ++ + + + LL + + V L + I PY
Sbjct: 229 EGRDHVIPVHHPWS----FKSVRRFVKKAIWLLPDMDSTGNWYKPGQVYLEKD--VILPY 282
Query: 196 APPQKMQAHQIPPDTP--RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFD 253
P + + +T RS ++FRG GG R+ + K+
Sbjct: 283 VPNVDLCDSKCVSETQSRRSTLLFFRGRL----RRNAGG----KIRSKLVTELKDAEGII 334
Query: 254 I-----STDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
I D M++++FCL P G P S RL +A+V GCIPVI++D++ LPF
Sbjct: 335 IEEGTAGADGKAAAQNGMRKSLFCLNPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEG 394
Query: 309 AIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP 363
+ + +I +FV+ D + L L SI + I + Q L S R L+ PA+P
Sbjct: 395 ILDYRKIALFVSSNDAVQPGWLVKYLRSIDAKRIRQMQSNLLKYS--RHFLYSSPARP 450
>gi|212722910|ref|NP_001131485.1| uncharacterized protein LOC100192822 [Zea mays]
gi|194691662|gb|ACF79915.1| unknown [Zea mays]
Length = 426
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 91/359 (25%), Positives = 144/359 (40%), Gaps = 40/359 (11%)
Query: 40 LKVYVYELPSKYNKKLLLKD-------PRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADW 92
L++YVY + LL+ CL + ++ +H+FLL S RT N + A+
Sbjct: 53 LRIYVYAEDEIQGLRALLRGRDGTINAATCLKGQWGTQVKVHQFLLKSRFRTFNKDHANL 112
Query: 93 FYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
F+ P Y C L S + + + S PY+ R+ G DH FV P GA
Sbjct: 113 FFVPSYVKCVRMTGAL-----SDKEINQTYVKVLSQMPYFRRSGGRDHIFVFPSGAGA-- 165
Query: 153 HYQEEKAIERGILPLLQRATLVQTFG----QRNHVCLNEGSITIPPYAPPQKMQAHQIP- 207
H AI L R+ ++ G +R N I P M P
Sbjct: 166 HLFRSWAI------FLNRSIILTPEGDRTDKRGTSAFNTWKDIIIPGNVDDSMVKSDAPA 219
Query: 208 ----PDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP---T 260
P T R F G + G A + + +P +S +
Sbjct: 220 VQPIPLTKRKYLANFLG-----RAQGKAGRLQLVELAKQYPDKLESPELKLSGPNKLGRI 274
Query: 261 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
Y++ ++ A FCL P G + W+ R E+ C+PVI++D++ LPF + I + EI +
Sbjct: 275 EYFKHLRNAKFCLAPRGESSWTLRFYESFFVECVPVILSDEVELPFQNVIDYSEISIKWP 334
Query: 321 EEDV-PKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKL 378
+ P+L L SI E I ++ + M+ ++ +P A IL L +K+
Sbjct: 335 SSRIGPELLEYLESISDERI--EEMIGHGREMRCLWVYAADTEPCSAMSGILTELQKKV 391
>gi|55297487|dbj|BAD68203.1| exostosin family protein-like [Oryza sativa Japonica Group]
gi|55297674|dbj|BAD68245.1| exostosin family protein-like [Oryza sativa Japonica Group]
Length = 512
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 96/358 (26%), Positives = 153/358 (42%), Gaps = 54/358 (15%)
Query: 40 LKVYVYELPSKYNKKLL--LKDP-RCLTHMFAAEIFMHRF-------------------- 76
L+VYVYE+PS++ LL +D R +++ + +HR
Sbjct: 113 LRVYVYEMPSRFTYDLLRLFRDSYRETSNLTSNGSPVHRLVEQHSIDYWLWADLIAPESQ 172
Query: 77 -LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRT 135
LL + +R EEAD FY P + T L + + R A++ ++ P W R+
Sbjct: 173 RLLKNVIRVRRQEEADIFYVPFFTTISYF---LLEKQECKALYREALKWVTDQ-PAWQRS 228
Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
EG DH V H + ++ + + + LL + + V L + I PY
Sbjct: 229 EGRDHVIPVHHPWS----FKSVRRFVKKAIWLLPDMDSTGNWYKPGQVYLEKD--VILPY 282
Query: 196 APPQKMQAHQIPPDTP--RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFD 253
P + + +T RS ++FRG GG R+ + K+
Sbjct: 283 VPNVDLCDSKCVSETQSRRSTLLFFRGRL----RRNAGG----KIRSKLVTELKDAEGII 334
Query: 254 I-----STDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
I D M++++FCL P G P S RL +A+V GCIPVI++D++ LPF
Sbjct: 335 IEEGTAGADGKAAAQNGMRKSLFCLNPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEG 394
Query: 309 AIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP 363
+ + +I +FV+ D + L L SI + I + Q L S R L+ PA+P
Sbjct: 395 ILDYRKIALFVSSNDAVQPGWLVKYLRSIDAKRIRQMQSNLLKYS--RHFLYSSPARP 450
>gi|356553158|ref|XP_003544925.1| PREDICTED: probable glucuronosyltransferase Os01g0926700-like
[Glycine max]
Length = 440
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 97/378 (25%), Positives = 156/378 (41%), Gaps = 43/378 (11%)
Query: 40 LKVYVYELPSKYNKKLLLK-------DPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADW 92
LK+YVY+ K LL+ D CL + +++ +H+ LL S RT EEAD
Sbjct: 67 LKIYVYQEDEIDGLKELLRGRDAKITDEACLKGQWGSQVKIHKLLLQSKQRTWKKEEADL 126
Query: 93 FYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
F+ P Y C GL + + + S + S PY+ + G +H FV P GA
Sbjct: 127 FFVPSYVKCARMMGGL-----NDKEINSTYVKVISQMPYFRLSGGRNHIFVFPSGAGAHL 181
Query: 153 HYQEEKAIERGIL--PLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDT 210
I R I+ P R +R+ N I P M + T
Sbjct: 182 FKSWATYINRSIILTPEGDRTD------KRDTSAFNTWKDIIIPGNIDDGMT--KTGDTT 233
Query: 211 PRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLF----DISTDHP-----TT 261
+ + + R + +G + R + E K P D+ P
Sbjct: 234 VQPLPLSKRKYLANYLGRAQG----KAGRLKLIELSKQFPEKLECPDLKFSGPDKLGRKE 289
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGM-FVA 320
Y+E ++ + FCL P G + W+ R E+ C+PVI++D I LPF + I + +I + + +
Sbjct: 290 YFEHLRNSKFCLAPRGESSWTLRFYESFFVECVPVILSDQIELPFQNVIDYSQISIKWPS 349
Query: 321 EEDVPKLDTILTSIPPEVILRKQRLLANPSMKRA-MLFPQPAQPGDAFHQILNGLARKLP 379
+ P+L L SIP E I ++++A R ++ ++ A I+ L RK+
Sbjct: 350 SQIGPELLQYLESIPDEEI---EKIIARGRQVRCWWVYASDSESCSAMRGIMWELQRKVR 406
Query: 380 ---HDKSTYLKPGGKFLN 394
H T+ G +N
Sbjct: 407 QFHHSAETFWLHNGSIVN 424
>gi|168004467|ref|XP_001754933.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694037|gb|EDQ80387.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 378
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 94/386 (24%), Positives = 155/386 (40%), Gaps = 80/386 (20%)
Query: 44 VYELPSKYNKKLLLKD------------------PRCLTHMFAAEIFMHRFLLS------ 79
+YELP KYN LL +D + + + E ++ +LLS
Sbjct: 1 MYELPRKYNLGLLQRDNPDQELPWTSDVIPPWKMEFEVNNQHSVEYWLMVYLLSGRDRKK 60
Query: 80 ---SPVRTLNPEEADWFYTPIYPTCDLTPTG----LPLPFKSPRMMRSAIQLISSNWPYW 132
+ VR +PE+A+ F+ P + + G P K + ++++S N +W
Sbjct: 61 GNMAAVRVKDPEQAEVFFVPFFASLSFNSFGRNMAAPNAAKDRELQEGVVEMLS-NSKWW 119
Query: 133 NRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITI 192
+++G DH V+ H F Y + + Q +V FG+ N +
Sbjct: 120 QKSQGRDHIIVIHH--PNAFRYYRDM--------MNQSMFIVADFGRYNQTVARLKKDIV 169
Query: 193 PPYAP--PQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNP 250
PYA P + + P + R ++F+G V +G A+
Sbjct: 170 APYAHVVPSYNEDNPSDPFSARKTLLFFQG---RVRRKADGVIRAK-------------- 212
Query: 251 LFDISTDHPTTYYED--------------MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPV 296
L + + YYED M+ + FCL P G P S RL +A+V C+PV
Sbjct: 213 LGKLLMNQTDVYYEDSLARTEAIAMSTQGMRFSRFCLHPAGDTPSSCRLFDAIVSHCVPV 272
Query: 297 IIADDIVLPFADAIPWEEIGMFVAEEDV---PKLDTILTSIPPEVILRKQRLLANPSMKR 353
I++D I LPF D + + E +F + ++ L L SI E L+ L ++
Sbjct: 273 IVSDRIELPFEDDLDYSEFSIFFSAKEAIIPGHLLGTLRSITRERWLQMWNKLK--AISH 330
Query: 354 AMLFPQPAQPGDAFHQILNGLARKLP 379
+ P++ DA + I + RKLP
Sbjct: 331 HFEYQNPSKEDDAVNLIFKQVQRKLP 356
>gi|125528116|gb|EAY76230.1| hypothetical protein OsI_04166 [Oryza sativa Indica Group]
Length = 513
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 96/358 (26%), Positives = 153/358 (42%), Gaps = 54/358 (15%)
Query: 40 LKVYVYELPSKYNKKLL--LKDP-RCLTHMFAAEIFMHRF-------------------- 76
L+VYVYE+PS++ LL +D R +++ + +HR
Sbjct: 114 LRVYVYEMPSRFTYDLLRLFRDSYRETSNLTSNGSPVHRLVEQHSIDYWLWADLIAPESQ 173
Query: 77 -LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRT 135
LL + +R EEAD FY P + T L + + R A++ ++ P W R+
Sbjct: 174 RLLKNVIRVRRQEEADIFYVPFFTTISYF---LLEKQECKALYREALKWVTDQ-PAWQRS 229
Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
EG DH V H + ++ + + + LL + + V L + I PY
Sbjct: 230 EGRDHVIPVHHPWS----FKSVRRFVKKAIWLLPDMDSTGNWYKPGQVYLEKD--VILPY 283
Query: 196 APPQKMQAHQIPPDTP--RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFD 253
P + + +T RS ++FRG GG R+ + K+
Sbjct: 284 VPNVDLCDSKCVSETQSRRSTLLFFRGRL----RRNAGGKI----RSKLVTELKDAEGII 335
Query: 254 I-----STDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
I D M++++FCL P G P S RL +A+V GCIPVI++D++ LPF
Sbjct: 336 IEEGTAGADGKAAAQNGMRKSLFCLNPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEG 395
Query: 309 AIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP 363
+ + +I +FV+ D + L L SI + I + Q L S R L+ PA+P
Sbjct: 396 ILDYRKIALFVSSNDAVQPGWLVKYLRSIDAKRIRQMQSNLLKYS--RHFLYSSPARP 451
>gi|326517848|dbj|BAK03842.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 451
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 92/366 (25%), Positives = 148/366 (40%), Gaps = 53/366 (14%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP---VRTLNPEEADWFYT 95
+ KVYVYE + +L P + LL+ P VRT + + A F+
Sbjct: 98 RFKVYVYE---EGEPPILHTGPCKDIYTIEGRFIEQLELLAPPAPGVRTRDADRAHAFFL 154
Query: 96 PIYPTCDLTPTGLPLPF-KSP--RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
P + L + + P ++ +++++S P+WNR+ GADHF + HD+G
Sbjct: 155 PFSVAQMMQFAYRQLSYDRGPLLSLVGDYVRVVASRHPFWNRSAGADHFMLSCHDWGPDA 214
Query: 153 HYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPR 212
+ + GI L A + F V + E I + P+++ P + R
Sbjct: 215 SKGDPELYANGIRALCN-ANTSEGFRPGKDVSIPE--INLYDGDTPRQLLGPS-PGLSAR 270
Query: 213 SIFVYFRG------------------------LFYDVNNDPEGGYYARGARAAVWENFKN 248
+F G YD+ + G R R
Sbjct: 271 PYLAFFAGGRHGHVRDLLLRHWKGRDPATFPVYEYDIPSTTGGNSSGRHNRRG------- 323
Query: 249 NPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
D + Y+ M R+ FCLCP G SPR+VEA+ C+PV++++ PFAD
Sbjct: 324 -------RDRQSDYFAYMHRSRFCLCPSGHEVASPRVVEAIHAECVPVLVSEGYAPPFAD 376
Query: 309 AIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFH 368
+ WE + V D+P+L +L IP + R + + +KR QP + D FH
Sbjct: 377 VLRWESFSVSVPVVDIPRLKEVLEGIPMAEVERLREGVR--LVKRHFTLRQPPERLDMFH 434
Query: 369 QILNGL 374
IL+ +
Sbjct: 435 MILHSV 440
>gi|358054164|dbj|GAA99700.1| hypothetical protein E5Q_06404 [Mixia osmundae IAM 14324]
Length = 420
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 88/331 (26%), Positives = 136/331 (41%), Gaps = 56/331 (16%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVR---TLNPEEADWFYT 95
LK+YVY+LP +D +C +++E+ + L S+P T +P EA++F
Sbjct: 78 SLKIYVYDLPEHLRLGRAQED-KCRWSAYSSELHLTHMLASTPAARYVTHDPSEANFFLV 136
Query: 96 PIYPTCDL----TPTGLPLPFK---SPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDF 148
P++P C L G + ++++ A+ I S P W R G DH P DF
Sbjct: 137 PLFPACYLFHCWDTAGWNRDLRCNVDEKLIQPAMAYIQSQ-PSWQRHSGRDHVMFHPMDF 195
Query: 149 GACFHYQEEKAIERGILPLLQRA---TLVQTFGQRNHVCLNEGSITIPPYA--PPQKMQA 203
G ++ + + + R + + T + R V + + + Y P +
Sbjct: 196 GDTYYSTDSRVMMRQMSYFVTNGDARTNGTIYRPRKDVVIPSATYLLHSYRYHPRDYLDE 255
Query: 204 H-----QIPPD-----------------------------TPRSIFVYFRGLFYDVNNDP 229
H Q+P T R+I YFRGL DV D
Sbjct: 256 HGHPLSQMPKRPKDGTAVSDHVNIYEPTRARKWFASKRDPTGRTILAYFRGLGADVQPDD 315
Query: 230 EGGYYARG---ARAAVWENFKNNPLFDISTDHPTTYYE-DMQRAIFCLCPLGWAPWSPRL 285
E R R + F + P FD+S + Y ++ A + L P G S R+
Sbjct: 316 EYSLGVRSLFYGRKGSHDGFASLPGFDVSVESENAEYAVELAHARYGLTPPGHTLDSTRI 375
Query: 286 VEAVVFGCIPVII-ADDIVLPFADAIPWEEI 315
E + FG +PVII AD +VLPFA +PW E+
Sbjct: 376 WEYLAFGVVPVIIGADGLVLPFAQHLPWSEM 406
>gi|302142837|emb|CBI20132.3| unnamed protein product [Vitis vinifera]
Length = 379
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 85/305 (27%), Positives = 134/305 (43%), Gaps = 23/305 (7%)
Query: 77 LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTE 136
LL + VR EEAD FY P + T L P + + R A++ ++ P W R+E
Sbjct: 55 LLKNVVRVHRQEEADLFYIPFFTTISFF---LLEPEQWKPLYREALKWVTDQ-PAWKRSE 110
Query: 137 GADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYA 196
G DH V H + ++ + + + LL + + V L + I PY
Sbjct: 111 GRDHILPVHHPWS----FKTVRKSMKNAIWLLPDMDSTGNWYKPGQVSLEKD--LILPYV 164
Query: 197 PPQKMQAHQIPPDT--PRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDI 254
P + + ++ R ++FRG + + G A+ ++
Sbjct: 165 PNVDLCDAKCSSESESKRKTLLFFRG---RLKRNAGGKIRAKLMAELSGDDGVVIQEGTA 221
Query: 255 STDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEE 314
M+++IFCL P G P S RL +A+V GCIPVI++D++ LPF + + +
Sbjct: 222 GEGGKEAAQRGMRKSIFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEGILDYRK 281
Query: 315 IGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQ---PGDAFH 368
I +FV+ D + L T L SI P I QR LA S R ++ PAQ P D
Sbjct: 282 IALFVSSSDAMQPGWLLTFLKSISPAQIKEMQRNLAKYS--RHFVYSSPAQLLGPEDLVW 339
Query: 369 QILNG 373
+++ G
Sbjct: 340 RMMAG 344
>gi|358054165|dbj|GAA99701.1| hypothetical protein E5Q_06403 [Mixia osmundae IAM 14324]
Length = 419
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 88/331 (26%), Positives = 136/331 (41%), Gaps = 56/331 (16%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVR---TLNPEEADWFYT 95
LK+YVY+LP +D +C +++E+ + L S+P T +P EA++F
Sbjct: 77 SLKIYVYDLPEHLRLGRAQED-KCRWSAYSSELHLTHMLASTPAARYVTHDPSEANFFLV 135
Query: 96 PIYPTCDL----TPTGLPLPFK---SPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDF 148
P++P C L G + ++++ A+ I S P W R G DH P DF
Sbjct: 136 PLFPACYLFHCWDTAGWNRDLRCNVDEKLIQPAMAYIQSQ-PSWQRHSGRDHVMFHPMDF 194
Query: 149 GACFHYQEEKAIERGILPLLQRA---TLVQTFGQRNHVCLNEGSITIPPYA--PPQKMQA 203
G ++ + + + R + + T + R V + + + Y P +
Sbjct: 195 GDTYYSTDSRVMMRQMSYFVTNGDARTNGTIYRPRKDVVIPSATYLLHSYRYHPRDYLDE 254
Query: 204 H-----QIPPD-----------------------------TPRSIFVYFRGLFYDVNNDP 229
H Q+P T R+I YFRGL DV D
Sbjct: 255 HGHPLSQMPKRPKDGTAVSDHVNIYEPTRARKWFASKRDPTGRTILAYFRGLGADVQPDD 314
Query: 230 EGGYYARG---ARAAVWENFKNNPLFDISTDHPTTYYE-DMQRAIFCLCPLGWAPWSPRL 285
E R R + F + P FD+S + Y ++ A + L P G S R+
Sbjct: 315 EYSLGVRSLFYGRKGSHDGFASLPGFDVSVESENAEYAVELAHARYGLTPPGHTLDSTRI 374
Query: 286 VEAVVFGCIPVII-ADDIVLPFADAIPWEEI 315
E + FG +PVII AD +VLPFA +PW E+
Sbjct: 375 WEYLAFGVVPVIIGADGLVLPFAQHLPWSEM 405
>gi|357474305|ref|XP_003607437.1| Cation proton exchanger [Medicago truncatula]
gi|355508492|gb|AES89634.1| Cation proton exchanger [Medicago truncatula]
Length = 1198
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 104/360 (28%), Positives = 151/360 (41%), Gaps = 58/360 (16%)
Query: 40 LKVYVYELPSKYNKKLLL---KDPRCLTHMFAAEIFMHRF-------------------- 76
LKVYVY +P+K+ LLL R +++ + +HR
Sbjct: 112 LKVYVYNMPNKFTYDLLLLFRNTYRDTSNLTSNGSPVHRLIEQHSIDYWLWADLIAPESE 171
Query: 77 -LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRT 135
LL S VR EEAD+FY P + T L + + R A++ I+ + P W R+
Sbjct: 172 RLLKSVVRVHRQEEADFFYIPFFTTISFF---LLEKQQCKALYREALKWIT-DQPAWKRS 227
Query: 136 EGADHFFVVPH--DFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIP 193
G DH V H F Y + KAI +LP + GQ E + +P
Sbjct: 228 GGRDHILPVHHPWSFKTVRRYVK-KAI--WLLPDMDSTGNWYKPGQ----VYLEKDLILP 280
Query: 194 PYAPPQKMQAHQIPPDTP-RSIFVYFRGLFYDVNNDPEGGYYA------RGARAAVWENF 246
A A + P R+ ++FRG + + G + RGA V E
Sbjct: 281 YVANVDFCDATCLSEINPKRNTLLFFRG---RLKRNAGGKIRSKLVDQLRGADGVVIEEG 337
Query: 247 KNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 306
+ M++++FCL P G P S RL +A+V GCIPVI++D++ LPF
Sbjct: 338 TSGE------GGKEAAQNGMRKSLFCLNPAGDTPSSARLFDAIVSGCIPVIVSDELELPF 391
Query: 307 ADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP 363
+ + +I +FV+ D K L L I I Q+ LA S R L+ PAQP
Sbjct: 392 EGILDYRKIALFVSSNDALKPSWLLKYLKDIRSAHIKEMQQNLAKYS--RHFLYSSPAQP 449
>gi|15237870|ref|NP_197191.1| Exostosin family protein [Arabidopsis thaliana]
gi|9755690|emb|CAC01702.1| putative protein [Arabidopsis thaliana]
gi|15810401|gb|AAL07088.1| unknown protein [Arabidopsis thaliana]
gi|23296585|gb|AAN13125.1| unknown protein [Arabidopsis thaliana]
gi|332004972|gb|AED92355.1| Exostosin family protein [Arabidopsis thaliana]
Length = 511
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 89/355 (25%), Positives = 149/355 (41%), Gaps = 48/355 (13%)
Query: 40 LKVYVYELPSKYNKKLL----------------------LKDPRCLTHMFAAEIFM--HR 75
+KVYVYE+P K+ LL L + + + A++
Sbjct: 118 IKVYVYEMPKKFTFDLLWLFHNTYKETSNATSNGSPVHRLIEQHSIDYWLWADLISPESE 177
Query: 76 FLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRT 135
L S VR ++AD+FY P + T L + + R A++ ++ P W R+
Sbjct: 178 RRLKSVVRVQKQQDADFFYVPFFTTISFF---LLEKQQCKALYREALKWVTDQ-PAWKRS 233
Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
EG DH F + H + ++ + + + LL + + V L + I PY
Sbjct: 234 EGRDHIFPIHHPWS----FKSVRKFVKNAIWLLPDMDSTGNWYKPGQVSLEKDLIL--PY 287
Query: 196 APPQKMQAHQIPPDTP--RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFD 253
P + + ++ R+ ++FRG G R A K+ + +
Sbjct: 288 VPNVDICDTKCLSESAPMRTTLLFFRGRL-----KRNAGGKIRAKLGAELSGIKDIIISE 342
Query: 254 ISTDH--PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
+ M+R++FCLCP G P S RL +A+V GCIPVI++D++ PF +
Sbjct: 343 GTAGEGGKLAAQRGMRRSLFCLCPAGDTPSSARLFDAIVSGCIPVIVSDELEFPFEGILD 402
Query: 312 WEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP 363
++++ + V+ D + L L S+ P + Q LA S R L+ PAQP
Sbjct: 403 YKKVAVLVSSSDAIQPGWLVNHLRSLTPFQVKGLQNNLAQYS--RHFLYSSPAQP 455
>gi|297850568|ref|XP_002893165.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297339007|gb|EFH69424.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 462
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 97/392 (24%), Positives = 151/392 (38%), Gaps = 71/392 (18%)
Query: 40 LKVYVYELPSKYNKKLLL-------KDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADW 92
LK+YVY+ K LL K CL + +++ +H+ LL S RT+ +EAD
Sbjct: 89 LKIYVYDENEIDGLKELLYGRDGSVKTTACLKGQWGSQVKIHKLLLESKFRTIKKDEADL 148
Query: 93 FYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
F+ P Y C GL + + + + S PY+ R+ G DH FV P GA
Sbjct: 149 FFVPAYVKCVRMLGGL-----NDKEINQTYVKVLSQMPYFRRSGGRDHIFVFPSGAGAHL 203
Query: 153 HYQEEKAIERGIL--PLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIP--- 207
I R I+ P R +++ N I P M + P
Sbjct: 204 FRSWSTFINRSIILTPEADRTD------KKDTTAFNTWKDIIIPGNVDDAMTKNGQPDVQ 257
Query: 208 --PDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP------ 259
P + R + G A+G + L D+S +P
Sbjct: 258 PLPLSKRKYLANYLG-------------RAQGKAGRL-------KLIDLSKQYPDKLECP 297
Query: 260 ------------TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFA 307
TTY+E ++ A FCL P G + W+ R E+ C+PV+++D LPF
Sbjct: 298 DLKFSGTEKFGRTTYFEHLRNAKFCLAPRGESSWTLRFYESFFVECVPVLLSDHAELPFQ 357
Query: 308 DAIPWEEIGMFVAEEDV-PKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQP-AQPGD 365
+ I + ++ + + + L SI + I + ++A R + P + P
Sbjct: 358 NVIDYAQVSIKWPSTRIGAEFLDYLASISDKDI---EGMIARGREIRCLFVYGPDSAPCS 414
Query: 366 AFHQILNGLARKLPH---DKSTYLKPGGKFLN 394
A IL L RK+ H T+ G F+N
Sbjct: 415 AVKGILWELQRKVRHFQQSTETFWLHNGSFVN 446
>gi|357474287|ref|XP_003607428.1| Exostosin family protein-like protein [Medicago truncatula]
gi|355508483|gb|AES89625.1| Exostosin family protein-like protein [Medicago truncatula]
Length = 502
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 106/360 (29%), Positives = 153/360 (42%), Gaps = 58/360 (16%)
Query: 40 LKVYVYELPSKYNKKLLL---KDPRCLTHMFAAEIFMHRF-------------------- 76
LKVYVY +P+K+ LLL R +++ + +HR
Sbjct: 112 LKVYVYNMPNKFTYDLLLLFRNTYRDTSNLTSNGSPVHRLIEQHSIDYWLWADLIAPESE 171
Query: 77 -LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRT 135
LL S VR EEAD+FY P + T L + + R A++ I+ + P W R+
Sbjct: 172 RLLKSVVRVHRQEEADFFYIPFFTTISFF---LLEKQQCKALYREALKWIT-DQPAWKRS 227
Query: 136 EGADHFFVVPH--DFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIP 193
G DH V H F Y + KAI +LP + GQ V L E + +P
Sbjct: 228 GGRDHILPVHHPWSFKTVRRYVK-KAI--WLLPDMDSTGNWYKPGQ---VYL-EKDLILP 280
Query: 194 PYAPPQKMQAHQIPPDTP-RSIFVYFRGLFYDVNNDPEGGYYA------RGARAAVWENF 246
A A + P R+ ++FRG + + G + RGA V E
Sbjct: 281 YVANVDFCDATCLSEINPKRNTLLFFRG---RLKRNAGGKIRSKLVDQLRGADGVVIEEG 337
Query: 247 KNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 306
+ M++++FCL P G P S RL +A+V GCIPVI++D++ LPF
Sbjct: 338 TSGE------GGKEAAQNGMRKSLFCLNPAGDTPSSARLFDAIVSGCIPVIVSDELELPF 391
Query: 307 ADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP 363
+ + +I +FV+ D K L L I I Q+ LA S R L+ PAQP
Sbjct: 392 EGILDYRKIALFVSSNDALKPSWLLKYLKDIRSAHIKEMQQNLAKYS--RHFLYSSPAQP 449
>gi|326522384|dbj|BAK07654.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 496
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 94/355 (26%), Positives = 151/355 (42%), Gaps = 48/355 (13%)
Query: 40 LKVYVYELPSKYNKKLL--LKDP-RCLTHMFAAEIFMHRF-------------------- 76
L+VYVYE+P K+ LL +D R T++ + +HR
Sbjct: 110 LRVYVYEMPRKFTYDLLRLFRDSYRDTTNLTSNGSPVHRLIEQHSIDYWLWADLIAPESQ 169
Query: 77 -LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRT 135
LL + +R EEAD FY P + T L + + R A++ ++ P W R+
Sbjct: 170 RLLKNVIRVERQEEADIFYVPFFTTISYF---LLEKQECKALYREALKWVTDQ-PAWQRS 225
Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
EG DH V H + ++ + + + LL + + V L + I PY
Sbjct: 226 EGRDHVIPVHHPWS----FKSVRRFVKKAIWLLPDMDSTGNWYKPGQVYLEKDVIL--PY 279
Query: 196 APPQKMQAHQIPPDT--PRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFD 253
P + ++ +T RS ++FRG G R +N ++ + +
Sbjct: 280 VPNVDLCDYKCVSETQSKRSTLLFFRGRL-----KRNAGGKIRSKLVTELQNIEDIIIEE 334
Query: 254 IST--DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
S M++++FCL P G P S RL +A+V GCIPVII+D++ LPF +
Sbjct: 335 GSAGAKGKVAALTGMRKSLFCLNPAGDTPSSARLFDAIVSGCIPVIISDELELPFEGILD 394
Query: 312 WEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP 363
+ +I +FV+ D + L L + + + Q L S R L+ PAQP
Sbjct: 395 YSKIALFVSSTDAVQPGWLVKYLRGVDGKRVREMQSNLLKYS--RHFLYSSPAQP 447
>gi|224133768|ref|XP_002321656.1| predicted protein [Populus trichocarpa]
gi|222868652|gb|EEF05783.1| predicted protein [Populus trichocarpa]
Length = 407
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 94/366 (25%), Positives = 156/366 (42%), Gaps = 57/366 (15%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
LKV+VY P K + L +A+E + L + T NP+EA F+ P+
Sbjct: 65 LKVFVY--PGGNPKTCYHSIDKKLKSNYASEHYFFMNLRNGSFLTENPDEAHLFFIPL-- 120
Query: 100 TCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFG----ACFHYQ 155
+C LP ++++ ++ ++ +PYWNRT GADHFFV H G A F +
Sbjct: 121 SCQPMEDQDALPRYKEMVIQNYVRALTIKYPYWNRTLGADHFFVSCHGIGNRATAAFPFL 180
Query: 156 EEKAIERGILP----------------LLQRATLVQTFGQRNHVCLNEGSITIPPYAP-P 198
+ AI P +L+ + + G N + I + P P
Sbjct: 181 LKNAIRLVCSPSYDSNYIPHKDVSLPQILELSFPPEGDGMWNDSTMESLPIQLSPVETHP 240
Query: 199 QKMQ----AHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDI 254
+ + A + +++ V+++GL + E + RA V + F+
Sbjct: 241 SRTKLCFWAGSPNSEVRKNLRVHYKGL-----EEFEIHFVENVKRALVLDTFQ------- 288
Query: 255 STDHPTTYYEDMQRAIFCLCPLGWAPWSPR-LVEAVVFGCIPVIIADDIVLPFADAIPWE 313
+++ R+ FC+CP G L E++ FGC+PVI++D LPF D + W
Sbjct: 289 ---------KEIHRSKFCICPRGKTQVGGVCLAESMAFGCVPVIMSDYYDLPFNDILDWN 339
Query: 314 EIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP--GDAFHQIL 371
+ + E DVP + IL I PE + K R +K + F +P D FH ++
Sbjct: 340 AFSVILKEHDVPIMGEILKGI-PEDMFEKMR---QNVLKVSKYFKWHFRPVKYDEFHMVM 395
Query: 372 NGLARK 377
L ++
Sbjct: 396 YELWKR 401
>gi|307106650|gb|EFN54895.1| hypothetical protein CHLNCDRAFT_135016 [Chlorella variabilis]
Length = 584
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 91/345 (26%), Positives = 147/345 (42%), Gaps = 35/345 (10%)
Query: 68 AAEIFMHRFLLSSPVRTLNPEEADWFYTPI----YPTCDLTPTGLPLPFKSPRMM--RSA 121
A + F+ +FLLS PVRT +P EA+ FY P+ Y P+ P MM ++
Sbjct: 247 AYQQFLEQFLLS-PVRTEDPSEANLFYIPMLLYGYSGTPGGPSRAPQVDSLCNMMPGQAH 305
Query: 122 IQL----ISSNWPYWNRTEGADHFFVVPHDFGACFHY----QEEKAIERGILPLLQRATL 173
I L I+ WPYWNRT G DHF+ P D GAC+H Q K G+ L
Sbjct: 306 IDLVLDQIAHKWPYWNRTRGRDHFYWAPADRGACYHKGLAEQAIKVSHFGLHATNNSIDL 365
Query: 174 VQTFGQR----NHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRG----LFYDV 225
+ +H C + + P P +K+ A + + +G ++
Sbjct: 366 GDLYSHNQMSPDHGCYHPLRDVVAP--PFEKLAASWLNTTLRLGLDGNIKGKNATFYFSG 423
Query: 226 NNDPEGGYYARGARAAVWENFK--NNPLFDISTD--HPTTYYEDMQRAIFCLCPLGWAPW 281
N Y+ G R + K ++P F Y + ++ + FCL P G +
Sbjct: 424 NVQGINLMYSGGTRQKLQALIKQWDDPEFGFVEGRLQEGAYEQRIRESRFCLAPYGHG-Y 482
Query: 282 SPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILR 341
RL + + G IPVI+ + + P D +P+E + + +D+P+L IL I
Sbjct: 483 GMRLGQCIFAGSIPVIVQEHVFQPLEDVLPYEAFSIRLTNDDLPQLREILRGITEA---- 538
Query: 342 KQRLLANPSMKRAMLFP-QPAQPGDAFHQILNGLARKLPHDKSTY 385
+ R L ++ ++ + G AF ++ L R+ + KS +
Sbjct: 539 QYRELMTGLLRYSLALSWDTSLGGTAFDYTISALRRRYMNLKSLH 583
>gi|302835858|ref|XP_002949490.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300265317|gb|EFJ49509.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 499
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 73/302 (24%), Positives = 125/302 (41%), Gaps = 40/302 (13%)
Query: 70 EIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNW 129
++ + LL S R + + ADW++ P+ F +R A+ I ++
Sbjct: 93 HVYFWQRLLGSGARVADGDLADWYFIPVRQR----------SFSDSWFLREALSYIRTHH 142
Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQ--RATLVQTFGQRNHVCLNE 187
P+WNRTEG H + D+G ++ + + + L +T + E
Sbjct: 143 PWWNRTEGHRHMVLHTGDWGLGEVAKDVRQMSLNVTWLTHWGLSTDRPNIQRWTRAFRPE 202
Query: 188 GSITIPPYAPP--------QKMQAHQIPPDTPRSI-------FVYFRG-LFYDVN----- 226
+ IP Y P + H + + R+ ++F G + +D
Sbjct: 203 RDVVIPVYISPGHFVHFGINRSPLHPVTAASRRTAARPRNESLLFFAGRICHDAKRPNPD 262
Query: 227 -----NDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPW 281
D +Y G R + + N F + P Y M R++FCL P G A
Sbjct: 263 TFPACGDDTAEWYGGGVREKFFVSHWNRSGFHVVRSEPR-YSHYMSRSVFCLAPPG-AGH 320
Query: 282 SPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILR 341
R ++A+ GC+PV +AD + PF A+ WEE G+ +AE+D+P+ +L + E +
Sbjct: 321 GQRQIQALFMGCVPVTVADGVYEPFEPALSWEEWGLRIAEQDIPRAHELLGGLTREQLAE 380
Query: 342 KQ 343
KQ
Sbjct: 381 KQ 382
>gi|18394994|ref|NP_564141.1| exostosin-like protein [Arabidopsis thaliana]
gi|332191985|gb|AEE30106.1| exostosin-like protein [Arabidopsis thaliana]
Length = 462
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 97/396 (24%), Positives = 151/396 (38%), Gaps = 79/396 (19%)
Query: 40 LKVYVYELPSKYNKKLLL-------KDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADW 92
LK+YVY+ K LL K CL + +++ +H+ LL S RT+ +EAD
Sbjct: 89 LKIYVYDENEIDGLKELLYGRDGSVKTTACLKGQWGSQVKIHKLLLESKFRTIKKDEADL 148
Query: 93 FYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
F+ P Y C GL + + + + S PY+ R+ G DH FV P GA
Sbjct: 149 FFVPAYVKCVRMLGGL-----NDKEINQTYVKVLSQMPYFRRSGGRDHIFVFPSGAGAHL 203
Query: 153 HYQEEKAIERGIL--PLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIP--- 207
I R I+ P R +++ N I P M + P
Sbjct: 204 FRSWSTFINRSIILTPEADRTD------KKDTTAFNSWKDIIIPGNVDDAMTKNGQPDVQ 257
Query: 208 --PDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP------ 259
P + R + G A+G + L D+S P
Sbjct: 258 PLPLSKRKYLANYLG-------------RAQGKAGRL-------KLIDLSKQFPDKLECP 297
Query: 260 ------------TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFA 307
TTY+E ++ A FCL P G + W+ R E+ C+PV+++D LPF
Sbjct: 298 DLKFSGTEKFGRTTYFEHLRNAKFCLAPRGESSWTLRFYESFFVECVPVLLSDHAELPFQ 357
Query: 308 DAIPWEEIGM-----FVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQP-A 361
+ I + ++ + + E + L +I +I R +++ R + P +
Sbjct: 358 NVIDYAQVSIKWPSTRIGSEFLDYLASISDRDIEGMIARGRKI-------RCLFVYGPDS 410
Query: 362 QPGDAFHQILNGLARKLPH---DKSTYLKPGGKFLN 394
P A IL L RK+ H T+ G F+N
Sbjct: 411 APCSAVKGILWELQRKVRHFQQSTETFWLHNGSFVN 446
>gi|359481095|ref|XP_002265438.2| PREDICTED: probable glycosyltransferase At5g03795-like [Vitis
vinifera]
Length = 336
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 85/314 (27%), Positives = 136/314 (43%), Gaps = 35/314 (11%)
Query: 67 FAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLIS 126
+A+E + + L+ S T +P +AD F+ P + L +R I IS
Sbjct: 33 YASESYFKKVLMKSHFITKDPSKADLFFLP-FSIARLRHDPRVGVGGIQDFIRDYIFNIS 91
Query: 127 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLN 186
N+PYWN+T GADHF+V H G EKA E + +Q ++
Sbjct: 92 QNYPYWNQTGGADHFYVACHSIGRS---AMEKADE-------VKLNAIQVVCSSSYFL-- 139
Query: 187 EGSITIPPYAPPQKMQAHQIPPD---TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVW 243
G I + PQ PPD + R +F G N P R +
Sbjct: 140 SGYIAHKDASLPQIWPRQGDPPDLALSERKKLAFFAGSI----NSP--------VRERLL 187
Query: 244 ENFKNNPLFDISTDHPTT-YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 302
+ ++N+ + TT Y +++ + FCL G+ + R+ +++ +GC+PVIIA+
Sbjct: 188 QVWRNDSEISVHFGRLTTPYADELLGSKFCLHVKGFEINTARIADSLYYGCVPVIIANHY 247
Query: 303 VLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQ 362
LPFAD + W+ + VA D+P L +L I L + +L + +K F
Sbjct: 248 DLPFADILNWKSFSIVVATLDIPLLKQVLKGIS----LNEYLMLQSNVLKVRNHFQWHVS 303
Query: 363 P--GDAFHQILNGL 374
P DAF+ ++ L
Sbjct: 304 PVDYDAFYMVMYEL 317
>gi|21537341|gb|AAM61682.1| unknown [Arabidopsis thaliana]
Length = 462
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 97/396 (24%), Positives = 151/396 (38%), Gaps = 79/396 (19%)
Query: 40 LKVYVYELPSKYNKKLLL-------KDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADW 92
LK+YVY+ K LL K CL + +++ +H+ LL S RT+ +EAD
Sbjct: 89 LKIYVYDENEIDGLKELLYGRDGSVKTTACLKGQWGSQVKIHKLLLESKFRTIKKDEADL 148
Query: 93 FYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
F+ P Y C GL + + + + S PY+ R+ G DH FV P GA
Sbjct: 149 FFVPAYVKCVRMLGGL-----NDKEINQTYVKVLSQMPYFRRSGGRDHIFVFPSGAGAHL 203
Query: 153 HYQEEKAIERGIL--PLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIP--- 207
I R I+ P R +++ N I P M + P
Sbjct: 204 FRSWSTFINRSIILTPEADRTD------KKDTTAFNSWKDIIIPGNVDDAMTKNGQPDVQ 257
Query: 208 --PDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP------ 259
P + R + G A+G + L D+S P
Sbjct: 258 PLPLSKRKYLANYLG-------------RAQGKAGRL-------KLIDLSKQFPDKLECP 297
Query: 260 ------------TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFA 307
TTY+E ++ A FCL P G + W+ R E+ C+PV+++D LPF
Sbjct: 298 DLKFSGTEKFGRTTYFEHLRNAKFCLAPRGESSWTLRFYESFFVECVPVLLSDHAELPFQ 357
Query: 308 DAIPWEEIGM-----FVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQP-A 361
+ I + ++ + + E + L +I +I R +++ R + P +
Sbjct: 358 NVIDYAQVSIKWPSTRIGSEFLDYLASISDRDIEGMIARGRKI-------RCLFVYGPDS 410
Query: 362 QPGDAFHQILNGLARKLPH---DKSTYLKPGGKFLN 394
P A IL L RK+ H T+ G F+N
Sbjct: 411 APCSAVKGILWELQRKVRHFQQSTETFWLHNGSFVN 446
>gi|18650643|gb|AAL75891.1| At1g21480/F24J8_23 [Arabidopsis thaliana]
Length = 462
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 97/396 (24%), Positives = 151/396 (38%), Gaps = 79/396 (19%)
Query: 40 LKVYVYELPSKYNKKLLL-------KDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADW 92
LK+YVY+ K LL K CL + +++ +H+ LL S RT+ +EAD
Sbjct: 89 LKIYVYDENEIDGLKELLYGRDGSVKTTACLKGQWGSQVKIHKLLLESKFRTIKKDEADL 148
Query: 93 FYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
F+ P Y C GL + + + + S PY+ R+ G DH FV P GA
Sbjct: 149 FFVPAYVKCVRMLGGL-----NDKEINQTYVKVLSQMPYFRRSGGRDHIFVFPSGAGAHL 203
Query: 153 HYQEEKAIERGIL--PLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIP--- 207
I R I+ P R +++ N I P M + P
Sbjct: 204 FRSWSTFINRSIILTPEADRTD------KKDTTAFNSWKDIIIPGNVDDAMTKNGQPDVQ 257
Query: 208 --PDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP------ 259
P + R + G A+G + L D+S P
Sbjct: 258 PLPLSKRKYLANYLG-------------RAQGKAGRL-------KLIDLSKQFPDKLECP 297
Query: 260 ------------TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFA 307
TTY+E ++ A FCL P G + W+ R E+ C+PV+++D LPF
Sbjct: 298 DLKFSGTEKFGRTTYFEHLRNAKFCLAPRGESSWTLRFYESFFVECVPVLLSDHAELPFQ 357
Query: 308 DAIPWEEIGM-----FVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQP-A 361
+ I + ++ + + E + L +I +I R +++ R + P +
Sbjct: 358 NVIDYAQVSIKWPSTRIGSEFLDYLASISDRDIEGMIARGRKI-------RCLFVYGPDS 410
Query: 362 QPGDAFHQILNGLARKLPH---DKSTYLKPGGKFLN 394
P A IL L RK+ H T+ G F+N
Sbjct: 411 APCSAVKGILWELQRKVRHFQQSTETFWLHNGSFVN 446
>gi|168059172|ref|XP_001781578.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666988|gb|EDQ53629.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 344
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 82/342 (23%), Positives = 145/342 (42%), Gaps = 26/342 (7%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCL-THMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPI 97
KL+++VY K ++ + + + +A+E F + LL+S T EAD+F+ P+
Sbjct: 17 KLRIFVYPHDRKDPFHMIFESGNKVPSGNYASEEFFQQSLLTSTFLTKTASEADFFFMPV 76
Query: 98 YPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEE 157
T + + + I + S W YWNR+ GADHF++ H +
Sbjct: 77 SITKARMDKRINVG-GLQSFCANYITDVRSQWSYWNRSNGADHFYLSCHSIARNAMDRVP 135
Query: 158 KAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVY 217
+ I L + + ++ + ++P P + ++ T R +
Sbjct: 136 DVRQNAIQLLCPASYFLPSY-------ITHKDASVPQIWPRLGKEPEEVRTITQRKRLAF 188
Query: 218 FRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLG 277
F G N P R W N ++ + P Y E + FCL G
Sbjct: 189 FAGAL----NSP-----VRKDLERTWAN--DSKILVHKGRVPYPYSEALLTTKFCLHAKG 237
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
+ + RL +A+ +GC+PV+IA+ LPF D + W + + V+ D+P L L ++ E
Sbjct: 238 FEVNTARLGDAMYYGCVPVVIANYYDLPFQDILDWTKFSIVVSSLDIPLLKKTLEAVTDE 297
Query: 338 --VILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARK 377
L +Q LLA ++ + P + DAFH ++ L ++
Sbjct: 298 QYAELHRQVLLA----RKHFQWHAPPEEYDAFHTVMYELWKR 335
>gi|255558009|ref|XP_002520033.1| catalytic, putative [Ricinus communis]
gi|223540797|gb|EEF42357.1| catalytic, putative [Ricinus communis]
Length = 507
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 94/359 (26%), Positives = 147/359 (40%), Gaps = 56/359 (15%)
Query: 40 LKVYVYELPSKYNKKLL---LKDPRCLTHMFAAEIFMHRF-------------------- 76
++VYVYE+P+K+ LL R ++ + +HR
Sbjct: 119 IRVYVYEMPNKFTYDLLWLFRNTYRDTVNLTSNGSPVHRLIEQHSIDYWLWADLIAPETE 178
Query: 77 -LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRT 135
LL S VR EEAD FY P + T L + + R A++ ++ P W R+
Sbjct: 179 RLLKSVVRVYRQEEADLFYIPFFTTISFF---LLEKQQCKALYREALKWVTDQ-PAWKRS 234
Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
G DH V H + ++ + + + LL + + V L + I PY
Sbjct: 235 GGRDHILPVHHPWS----FKSVRRYVKNAIWLLPDMDSTGNWYKPGQVFLEKDLIL--PY 288
Query: 196 APPQKMQAHQIPPD--TPRSIFVYFRGLFYDVNNDPEGGYYAR------GARAAVWENFK 247
P + + + + R+ ++FRG + + G A+ GA V E
Sbjct: 289 VPNVDLCDAKCASENESKRTTLLFFRG---RLKRNAGGKIRAKLVAELSGAEGVVVEEGT 345
Query: 248 NNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFA 307
M+++IFCL P G P S RL +A+V GCIPVI++D++ LPF
Sbjct: 346 AGE------GGKAAAQTGMRKSIFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELELPFE 399
Query: 308 DAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP 363
+ + +I +FV+ D + L L + P QR L S R L+ PAQP
Sbjct: 400 GILDYRKIAVFVSSSDAIQPGWLIKFLKDVSPAQTREMQRNLVKYS--RHFLYSSPAQP 456
>gi|47497631|dbj|BAD19700.1| exostosin-like protein [Oryza sativa Japonica Group]
Length = 345
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 77/300 (25%), Positives = 133/300 (44%), Gaps = 24/300 (8%)
Query: 86 NPEEADWFYTPIYPTCDL-------TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGA 138
P++A F PI C+L TG P + ++ +++++ +PYWNR+ GA
Sbjct: 48 RPDDAHAFLLPI-SVCNLVHYVYRLNATGDLAPLRG--LVADYVRVVAERYPYWNRSRGA 104
Query: 139 DHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPP 198
DH V HD+ A + + + +L A + F R L E ++ P
Sbjct: 105 DHVIVSCHDW-APMVTSAHRQLYGNAIRVLCNANTSEGFRPRKDATLPEVNLADGVLRRP 163
Query: 199 QKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDH 258
+PP+ R+ +F G + + ++ G + + + +
Sbjct: 164 TA----GLPPEN-RTTLAFFAGGRHGHIRESLLRHWLIGNKGGAAADGDGDGDMRVHEYL 218
Query: 259 PT--TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG 316
P Y+ M A FCLCP G+ SPR+VE+V GC+PVII++ PF D + W ++
Sbjct: 219 PAGEDYHAQMAAARFCLCPSGFEVASPRVVESVFAGCVPVIISEGYPPPFGDVLDWGKMS 278
Query: 317 MFVAEEDVPKLDTILTSIPPE--VILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+ V +P+L IL + +LR + L A +R + +PA+ D H +L+ +
Sbjct: 279 VAVPAARIPELRAILRRVSERRYRVLRARVLQA----QRHFVLHRPARRFDMIHMVLHSI 334
>gi|359485862|ref|XP_002264076.2| PREDICTED: probable glycosyltransferase At5g03795-like [Vitis
vinifera]
Length = 484
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 83/342 (24%), Positives = 142/342 (41%), Gaps = 32/342 (9%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
K+YVYE + + + C + F+H S RT +P++A ++ P
Sbjct: 157 FKIYVYE----EGEPPMFHNGPCKSIYSTEGRFIHEMEKGSVYRTTDPDQALLYFLPFSV 212
Query: 100 TCDLTPTGLPLPFKSPRMMRSAI---QLISSNWPYWNRTEGADHFFVVPHDFG----ACF 152
+ +P + + ++ I LIS N P+WNR+ GADHF + HD+G
Sbjct: 213 VMMVQYLYVPDSHEIHAIEKTVIDYINLISHNHPFWNRSLGADHFMLSCHDWGPRASTSV 272
Query: 153 HYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPR 212
Y +I +L A + F V E + + P + P + R
Sbjct: 273 PYLYNNSIR-----VLCNANTSEGFNPSKDVSFPEIHLRTGEMSGP----LGGLSP-SRR 322
Query: 213 SIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFC 272
I +F G + + R W++ + + +Y ++++ FC
Sbjct: 323 PILGFFAGRLHG---------HIRYLLLEQWKDKDKDLQVYDQLPNGLSYDSMLKKSRFC 373
Query: 273 LCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILT 332
LCP G+ SPR+VEA+ C+PV+I+D+ V PF D + W+ + V D+ + IL
Sbjct: 374 LCPSGYEVASPRVVEAIYAECVPVLISDNYVPPFNDVLNWKSFAVQVQVRDIANIKRILM 433
Query: 333 SIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
I LR R + ++R + Q D FH ++ +
Sbjct: 434 GISQTQYLRMYRRVKQ--VQRHFMVNAAPQRFDVFHMTIHSI 473
>gi|159470791|ref|XP_001693540.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158283043|gb|EDP08794.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 510
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 100/383 (26%), Positives = 152/383 (39%), Gaps = 71/383 (18%)
Query: 41 KVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPT 100
+VY+Y+LP + D R + F+ L ++ R +P ADWFY P+
Sbjct: 121 RVYIYDLPPELTT--WRNDDR--LDRWTTRHFLE-MLTATGARVGDPAAADWFYLPVRLR 175
Query: 101 CDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAI 160
++R A++ + + P++N T G DHF + D G
Sbjct: 176 SS----------SDGHVLRRALEYVQAAQPWFNATGGKDHFVLAVGDMG-------RLES 218
Query: 161 ERGILPLLQRATLVQTFG--------------QRNHVCLNEGSITIPPYAPPQKMQAHQI 206
ERG PL T V +G + +H N I +P Y +K+Q I
Sbjct: 219 ERG--PLSANVTFVSHWGLYRSKAEQLQSPHWRASH--RNATDIVLPVYLTLRKLQKFGI 274
Query: 207 -------------PPDTPRS---IFVYFRGLFYDVN--------NDPEGGYYARGARAAV 242
PPD +F + + D + N P+ Y+ R AV
Sbjct: 275 LGSRHHPKFATVAPPDVRERNGPLFWFAGRVCQDSSPPRTDVWPNCPKAMGYSAMTRQAV 334
Query: 243 WENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 302
+ + N F + Y + M A FC P+G R +A + GC+PV+I D +
Sbjct: 335 YFHHWNRTGFAVLRGD-KQYAKHMLTAKFCFGPMG-GGHGQRQFQAALAGCVPVVIGDGV 392
Query: 303 VLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQ 362
+ + + W + G+ VAE D+P+L TIL +I PE RK R L + A A
Sbjct: 393 LEAWEPYLDWNDFGVRVAEADIPRLHTILGAIGPEEYARKVRSLRCAAQHMAFSSVTGAY 452
Query: 363 PG-----DAFHQILNGLARKLPH 380
G DAF +L LA + H
Sbjct: 453 MGESGRFDAFETLLAVLAARARH 475
>gi|159489402|ref|XP_001702686.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158280708|gb|EDP06465.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 615
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 83/338 (24%), Positives = 135/338 (39%), Gaps = 52/338 (15%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
+K+YVYELP +K H+ +F R L+S+ +RT+N +EAD+F+ P+
Sbjct: 178 SIKIYVYELPPNVTSWFNIKRLDRPLHL----LFWQR-LMSAGLRTVNGDEADYFFIPLN 232
Query: 99 PTCDLTPTGLP--LPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQE 156
+ P LP+ I + WPYW+R G H + D G ++
Sbjct: 233 TRTLMAPEQAAWILPY------------IRNTWPYWDRDNGHRHLIIHTGDMGL---HEL 277
Query: 157 EKAIERGILPLLQRATLVQTFGQRNH-------------------VCLNEGSITIPPYAP 197
+ R + L T + +G + V + + P P
Sbjct: 278 PLGLRRKMNETLSNITWLTHWGLHTYHPIGTWFPAHRPGKDIVIPVMITTPGFQLSPLNP 337
Query: 198 PQKMQAHQIPPDTPRSIFVYFRGLFYDVNN---------DPEGGYYARGARAAVWENFKN 248
+A + R +F G P+ Y+ R V+ + N
Sbjct: 338 AVAEKAAKRGRPYTREQTFFFAGRICGDRKPPDPLTHECAPKRTDYSASVRQRVYFHHHN 397
Query: 249 NPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
F + T + Y +++ FCL P G R V + GCIPV I D + PF
Sbjct: 398 RTGFKVLTG-TSKYMQEITSHKFCLAPTGGG-HGKRQVLVALMGCIPVTITDGVYQPFEP 455
Query: 309 AIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLL 346
+PW + + VAE+D+P+L +L ++PPE + + Q L
Sbjct: 456 ELPWADFSVPVAEDDIPRLHEVLEALPPEQVEQMQSRL 493
>gi|296085068|emb|CBI28483.3| unnamed protein product [Vitis vinifera]
Length = 416
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 83/342 (24%), Positives = 142/342 (41%), Gaps = 32/342 (9%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
K+YVYE + + + C + F+H S RT +P++A ++ P
Sbjct: 89 FKIYVYE----EGEPPMFHNGPCKSIYSTEGRFIHEMEKGSVYRTTDPDQALLYFLPFSV 144
Query: 100 TCDLTPTGLPLPFKSPRMMRSAI---QLISSNWPYWNRTEGADHFFVVPHDFG----ACF 152
+ +P + + ++ I LIS N P+WNR+ GADHF + HD+G
Sbjct: 145 VMMVQYLYVPDSHEIHAIEKTVIDYINLISHNHPFWNRSLGADHFMLSCHDWGPRASTSV 204
Query: 153 HYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPR 212
Y +I +L A + F V E + + P + P + R
Sbjct: 205 PYLYNNSIR-----VLCNANTSEGFNPSKDVSFPEIHLRTGEMSGPLG----GLSP-SRR 254
Query: 213 SIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFC 272
I +F G + + R W++ + + +Y ++++ FC
Sbjct: 255 PILGFFAGRLHG---------HIRYLLLEQWKDKDKDLQVYDQLPNGLSYDSMLKKSRFC 305
Query: 273 LCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILT 332
LCP G+ SPR+VEA+ C+PV+I+D+ V PF D + W+ + V D+ + IL
Sbjct: 306 LCPSGYEVASPRVVEAIYAECVPVLISDNYVPPFNDVLNWKSFAVQVQVRDIANIKRILM 365
Query: 333 SIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
I LR R + ++R + Q D FH ++ +
Sbjct: 366 GISQTQYLRMYRRVKQ--VQRHFMVNAAPQRFDVFHMTIHSI 405
>gi|125540270|gb|EAY86665.1| hypothetical protein OsI_08049 [Oryza sativa Indica Group]
Length = 482
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 77/300 (25%), Positives = 133/300 (44%), Gaps = 24/300 (8%)
Query: 86 NPEEADWFYTPIYPTCDL-------TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGA 138
P++A F PI C+L TG P + ++ +++++ +PYWNR+ GA
Sbjct: 185 RPDDAHAFLLPI-SVCNLVHYVYRLNATGDLAPLRG--LVADYVRVVAERYPYWNRSRGA 241
Query: 139 DHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPP 198
DH V HD+ A + + + +L A + F R L E ++ P
Sbjct: 242 DHVIVSCHDW-APMVTSAHRQLYGNAIRVLCNANTSEGFRPRKDATLPEVNLADGVLRRP 300
Query: 199 QKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDH 258
+PP+ R+ +F G + + ++ G + + + +
Sbjct: 301 TA----GLPPEN-RTTLAFFAGGRHGHIRESLLRHWLIGNKGGAAADGDGDGDMRVHEYL 355
Query: 259 PT--TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG 316
P Y+ M A FCLCP G+ SPR+VE+V GC+PVII++ PF D + W ++
Sbjct: 356 PAGEDYHAQMAAARFCLCPSGFEVASPRVVESVFAGCVPVIISEGYPPPFGDVLDWGKMS 415
Query: 317 MFVAEEDVPKLDTILTSIPPE--VILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
+ V +P+L IL + +LR + L A +R + +PA+ D H +L+ +
Sbjct: 416 VAVPAARIPELRAILRRVSERRYRVLRARVLQA----QRHFVLHRPARRFDMIHMVLHSI 471
>gi|307103001|gb|EFN51266.1| hypothetical protein CHLNCDRAFT_141180 [Chlorella variabilis]
Length = 404
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 102/379 (26%), Positives = 155/379 (40%), Gaps = 63/379 (16%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRF---LLSSPVRTLNPEEADWFYT 95
K+Y+YEL ++ L L H+ A H+F LL SPVRT +P EA+ F+
Sbjct: 56 NFKIYMYELSAQLAYDLDLASGPEEDHIHLAH---HKFIEQLLMSPVRTEDPSEANLFFV 112
Query: 96 PI----YPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGAC 151
P Y L L L I+S++PYWNR++G DH F + +D GAC
Sbjct: 113 PALSWSYGGNALNAVHLDL----------VADHIASHYPYWNRSQGRDHIFWLTNDRGAC 162
Query: 152 FHYQEEKAIERGILPLLQRATLVQTFGQRN---HVCLNEGSITIPPYAPPQKMQAHQIPP 208
+ + A + FG V G+ T P A ++ PP
Sbjct: 163 -----------ALTGRTEAAIKLTHFGLNTINISVGWGPGAATNPENACYNPLRDVVAPP 211
Query: 209 --DTPRSIFVYFRGLFYD---------------VNNDPEGGYYARGARAAVWENFK--NN 249
D R + R L + V+ND E Y+ R + E K N+
Sbjct: 212 FDDMARELMEVSRKLSVEDIIAAKTSLFFFSGAVSNDSE---YSGNTRQLLRELVKRWND 268
Query: 250 PLFDISTDHPT---TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 306
P T+ T Y + ++ + FC G+ + RL+ V G +P++I + + P
Sbjct: 269 PEIIFETEGDTGLGDYVKRLRASKFCPAVFGYG-FGMRLLTCVFSGSVPLVIQERVAQPL 327
Query: 307 ADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDA 366
D +P+E + + +P L IL SI + + QRL+ R +PA G A
Sbjct: 328 EDLLPYETFSLRLNNGHLPDLPRILRSITDQ---QYQRLVQGLVRYRDAFHWEPAAGGKA 384
Query: 367 FHQILNGLARKLPHDKSTY 385
F + L R+ + KS Y
Sbjct: 385 FEYTIASLRRRHLNFKSLY 403
>gi|296089301|emb|CBI39073.3| unnamed protein product [Vitis vinifera]
Length = 467
Score = 94.7 bits (234), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 85/314 (27%), Positives = 136/314 (43%), Gaps = 35/314 (11%)
Query: 67 FAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLIS 126
+A+E + + L+ S T +P +AD F+ P + L +R I IS
Sbjct: 152 YASESYFKKVLMKSHFITKDPSKADLFFLP-FSIARLRHDPRVGVGGIQDFIRDYIFNIS 210
Query: 127 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLN 186
N+PYWN+T GADHF+V H G EKA E + +Q ++
Sbjct: 211 QNYPYWNQTGGADHFYVACHSIGRS---AMEKADE-------VKLNAIQVVCSSSYFL-- 258
Query: 187 EGSITIPPYAPPQKMQAHQIPPD---TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVW 243
G I + PQ PPD + R +F G N P R +
Sbjct: 259 SGYIAHKDASLPQIWPRQGDPPDLALSERKKLAFFAGSI----NSP--------VRERLL 306
Query: 244 ENFKNNPLFDISTDHPTT-YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 302
+ ++N+ + TT Y +++ + FCL G+ + R+ +++ +GC+PVIIA+
Sbjct: 307 QVWRNDSEISVHFGRLTTPYADELLGSKFCLHVKGFEINTARIADSLYYGCVPVIIANHY 366
Query: 303 VLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQ 362
LPFAD + W+ + VA D+P L +L I L + +L + +K F
Sbjct: 367 DLPFADILNWKSFSIVVATLDIPLLKQVLKGIS----LNEYLMLQSNVLKVRNHFQWHVS 422
Query: 363 P--GDAFHQILNGL 374
P DAF+ ++ L
Sbjct: 423 PVDYDAFYMVMYEL 436
>gi|60657600|gb|AAX33321.1| secondary cell wall-related glycosyltransferase family 47 [Populus
tremula x Populus tremuloides]
Length = 509
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 93/380 (24%), Positives = 152/380 (40%), Gaps = 62/380 (16%)
Query: 40 LKVYVYELPSKYNKKLLLKDPR-----------------CLTHMFAAEIFMHRFLLS--- 79
LKV++Y+LPS+++ LL P+ L + E ++ LL+
Sbjct: 119 LKVFMYDLPSEFHFGLLDWKPQGGSVWPDLRAKVPAYPGGLNLQHSIEYWLTMDLLASEV 178
Query: 80 -------SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPF----KSPRMMRSAIQLISSN 128
S VR N EAD + P + + P K+ + ++ ++S
Sbjct: 179 PGIPRAGSAVRVQNSSEADVIFVPFFSSISYNRYSKVNPHQKKSKNKSLEEKLVKFVTSQ 238
Query: 129 WPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG 188
W R+ G DH + H + + L ++ FG+ + N G
Sbjct: 239 -KEWKRSGGRDHIILAHHPNSMLY----------ARMKLWTAMFILADFGRYSPNIANVG 287
Query: 189 SITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN 248
I PY K A+ R +YF+G Y +GG+ AR ++ K+
Sbjct: 288 KDVIAPYKHVIKSYANDSSNFDSRPTLLYFQGAIY----RKDGGF----ARQELFYALKD 339
Query: 249 NP-----LFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 303
+ D + + M + FCL G P S RL +A+ C+PVII+DDI
Sbjct: 340 EKDVHFQFGSVQKDGVSKASQGMHSSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDDIE 399
Query: 304 LPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRK-QRLLANPSMKRAMLFPQ 359
LP+ D + + + +FV D + L ++ SI + R QRL ++ F
Sbjct: 400 LPYEDVLDYSQFCIFVRTSDAVREKFLINLVRSIKKDEWTRMWQRL---KEVENFFEFQY 456
Query: 360 PAQPGDAFHQILNGLARKLP 379
P++ GDA I +ARK+P
Sbjct: 457 PSKEGDAVQMIWQAVARKVP 476
>gi|218189647|gb|EEC72074.1| hypothetical protein OsI_05008 [Oryza sativa Indica Group]
Length = 174
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 42/54 (77%), Positives = 46/54 (85%)
Query: 28 AGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSP 81
GDVLEDDP GKLKV+VYE+P KYN LL KD RCL HMFAAEIFMH+FLLS+P
Sbjct: 110 GGDVLEDDPTGKLKVFVYEMPRKYNLNLLAKDSRCLQHMFAAEIFMHQFLLSAP 163
>gi|356544337|ref|XP_003540609.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase F8H-like
[Glycine max]
Length = 494
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 98/356 (27%), Positives = 148/356 (41%), Gaps = 50/356 (14%)
Query: 40 LKVYVYELPSKYNKKLL--LKDP-RCLTHMFAAEIFMHRF-------------------- 76
L+VYVY++P K+ LL K+ R +++ + +HR
Sbjct: 104 LRVYVYDMPPKFTHDLLWLFKNTYRDTSNLTSNGSPVHRLIEQHSIDYWLWADLIAPQSE 163
Query: 77 -LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRT 135
LL+S VR EEAD FY P + T L + + R A++ I+ P W R+
Sbjct: 164 RLLTSVVRVHRQEEADLFYIPFFTTISFF---LMEKQQCKALYREALKWITDQ-PAWKRS 219
Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
G DH V H + ++ + + + LL + + V L + I PY
Sbjct: 220 GGRDHILPVHHPWS----FKSVRRYVKNAIWLLPDMDSTGNWYKPGQVYLEKDLIL--PY 273
Query: 196 APPQKMQAHQIPPDT--PRSIFVYFRGLFYDVNND---PEGGYYARGARAAVWENFKNNP 250
P + + +T RS ++FRG + G GA V E
Sbjct: 274 VPNVDLCDAKCLSETNPKRSTLLFFRGRLKRNAGGKIRSKLGAELSGADGVVIEEGT--- 330
Query: 251 LFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAI 310
M++++FCL P G P S RL +A+V GCIPVII+D++ LPF +
Sbjct: 331 ---AGEGGKEAAQRGMRKSLFCLSPAGDTPSSARLFDAIVSGCIPVIISDELELPFEGIL 387
Query: 311 PWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP 363
+ +I +F++ D K L L I P I Q+ LA S R L+ PA P
Sbjct: 388 DYRKIAVFISSNDAVKPGWLLKYLKGIRPAHIKEMQQNLAKYS--RHFLYSSPALP 441
>gi|224123938|ref|XP_002319201.1| predicted protein [Populus trichocarpa]
gi|222857577|gb|EEE95124.1| predicted protein [Populus trichocarpa]
Length = 450
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 93/380 (24%), Positives = 153/380 (40%), Gaps = 62/380 (16%)
Query: 40 LKVYVYELPSKYNKKLLLKDPR-----------------CLTHMFAAEIFMHRFLLS--- 79
LKV++Y+LPS+++ LL P+ L + E ++ LL+
Sbjct: 80 LKVFMYDLPSEFHFGLLDWKPQGGSVWPDLRAKVPAYPGGLNLQHSIEYWLTMDLLASEI 139
Query: 80 -------SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPF----KSPRMMRSAIQLISSN 128
S VR N EAD + P + + P K+ + ++ ++S
Sbjct: 140 PGIPRAGSAVRVQNSSEADVIFVPFFSSISYNRYSKVNPHQKKSKNKSLEEKLVKFVTSQ 199
Query: 129 WPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG 188
W R+ G DH + H ++ + L ++ FG+ + N G
Sbjct: 200 -KEWKRSGGRDHIILAHH----------PNSMLYARMKLWTAMFILADFGRYSPNIANVG 248
Query: 189 SITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN 248
I PY K A+ R +YF+G Y +GG+ AR ++ K+
Sbjct: 249 KDVIAPYKHVIKSYANDSSNFDSRPTLLYFQGAIY----RKDGGF----ARQELFYALKD 300
Query: 249 NP-----LFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 303
+ D + + M + FCL G P S RL +A+ C+PVII+DDI
Sbjct: 301 EKDVHFQFGSVQKDGVSKASQGMHSSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDDIE 360
Query: 304 LPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRK-QRLLANPSMKRAMLFPQ 359
LP+ D + + + +FV D + L ++ SI + R QRL ++ F
Sbjct: 361 LPYEDVLDYSQFCIFVRTSDAVREKFLINLVRSIKKDEWTRMWQRL---KEVENFFEFQY 417
Query: 360 PAQPGDAFHQILNGLARKLP 379
P++ GDA I +ARK+P
Sbjct: 418 PSKEGDAVQMIWQAVARKVP 437
>gi|302812737|ref|XP_002988055.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
gi|300144161|gb|EFJ10847.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
Length = 459
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 99/397 (24%), Positives = 159/397 (40%), Gaps = 66/397 (16%)
Query: 34 DDPVGKLKVYVYELPSKY----------------NKKLLLKDPRCLTHMFAAEIFMHRFL 77
DD V LKVY+Y+LP+++ N L + P L + E ++ L
Sbjct: 79 DDRV--LKVYMYDLPAEFHFGMLDAAISGGSWPRNISSLPRYPGGLYQQHSPEYWLTADL 136
Query: 78 LSSP-----------VRTLNPEEADWFYTPIYPTCDLTP--------TGLPLPFKSPRMM 118
LSS VR +P AD F+ P + + G ++ R+
Sbjct: 137 LSSADPSSRKSPCSAVRVADPATADIFFVPFFSSLSYNRYCRTGHRFQGGRGCVENDRLE 196
Query: 119 RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG 178
+ ++ + W R G DH V+ H+ + R +L + +V FG
Sbjct: 197 KRLVEFLRGQ-ELWRRNGGVDHVIVM--------HHPNSLMVARSLLK--EAMFVVADFG 245
Query: 179 QRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGA 238
+ + N + PY A R ++F+G EGG
Sbjct: 246 RFSRAVANMRKDIVAPYKHVIPSFARDATTFESRETLLFFQGAIVR----KEGGII---- 297
Query: 239 RAAVWENFKNNPLFDIST-----DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGC 293
R ++E K++P T D + M+ A FCL G P S RL +A+ C
Sbjct: 298 RQKLYEILKDSPGVHFVTGNTQKDGIRSATAGMRNAKFCLHLAGDTPSSNRLFDAIASHC 357
Query: 294 IPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTI---LTSIPPEVILRKQRLLANPS 350
+PVII+D+I LPF D + + + +FV + + + L I + RK +L S
Sbjct: 358 VPVIISDEIELPFEDELDYSQFCVFVESDKALRKGFVVRALERIGRDEWTRKWAML--KS 415
Query: 351 MKRAMLFPQPAQPGDAFHQILNGLARKLPHDKSTYLK 387
++R + P+ P DA H G+A+++P KST K
Sbjct: 416 VERHFEYQHPSLPEDAVHMTWRGIAKRVPALKSTAHK 452
>gi|2245029|emb|CAB10449.1| limonene cyclase like protein [Arabidopsis thaliana]
gi|7268424|emb|CAB78716.1| limonene cyclase like protein [Arabidopsis thaliana]
Length = 1024
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 44/115 (38%), Positives = 71/115 (61%), Gaps = 4/115 (3%)
Query: 261 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
TY + M+ + +CLCP+G+ SPR+VEA+ + C+PV+IAD+ +LPF+D + W + V
Sbjct: 894 TYVQHMKSSKYCLCPMGYEVNSPRIVEAIYYECVPVVIADNFMLPFSDVLDWSAFSVVVP 953
Query: 321 EEDVPKLDTILTSIPPEVILRKQRLLANPSM-KRAMLFPQPAQPGDAFHQILNGL 374
E+++P+L IL IP L+ Q +N M +R L+ + D FH IL+ +
Sbjct: 954 EKEIPRLKEILLEIPMRRYLKMQ---SNVKMVQRHFLWSPKPRKYDVFHMILHSI 1005
>gi|307111406|gb|EFN59640.1| hypothetical protein CHLNCDRAFT_133110 [Chlorella variabilis]
Length = 767
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 92/338 (27%), Positives = 140/338 (41%), Gaps = 65/338 (19%)
Query: 67 FAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTP-----TGLPLPFKSP------ 115
+ E L+ S RTL+PEEAD+FY P++ +C + P + L F +P
Sbjct: 372 YGLESAFLEMLMQSEHRTLDPEEADFFYVPVFTSCFIYPVRSGASSLQDWFYAPVQNRVQ 431
Query: 116 ---RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF-------------------- 152
M+ A I S+ P+W R G DH ++V HD G+C+
Sbjct: 432 GAANMLLEAFHWIQSHHPWWERRGGRDHIWLVTHDEGSCWVPAAIRPSIILSHWGRMDLN 491
Query: 153 -----HYQEE---KAIERGILPLLQRATLVQ--TFGQR--NHVCLNE-GSITIPPYAPPQ 199
Y E+ +A R + Q+ L + F Q+ H C + + +P P
Sbjct: 492 HSSTTGYWEDDYRQANARKLQHAEQQQHLFEPDGFQQKIAGHACYDPVKDLVVPLIKTPN 551
Query: 200 KMQAHQI--PPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAV--------WENFKNN 249
+ + + P R+ + RG VN+ E Y+RG R V W +
Sbjct: 552 RNKHSPLFGAPTRNRTWLAFHRGR---VNH--EFPRYSRGVRQRVDNASREHQWLENYGS 606
Query: 250 PLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 309
D S Y E + +IFCL G WS R+ +A+ GCIPV+I DD+ + F
Sbjct: 607 KFGDESLQ--GDYSELLASSIFCLVLQG-DGWSARMDDAMSHGCIPVVIIDDVHVSFESV 663
Query: 310 IPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLA 347
+ + + V DV +L IL ++ E QR LA
Sbjct: 664 LDLSQFSLRVKSADVERLPEILQAVSQERREELQRNLA 701
>gi|356538648|ref|XP_003537813.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase F8H-like
[Glycine max]
Length = 502
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 96/355 (27%), Positives = 148/355 (41%), Gaps = 48/355 (13%)
Query: 40 LKVYVYELPSKYNKKLL--LKDP-RCLTHMFAAEIFMHRF-------------------- 76
L+VYVY++P K+ LL K+ R +++ + +HR
Sbjct: 112 LRVYVYDMPPKFTHDLLWLFKNTYRDTSNLTSNGSPVHRLIEQHSIDYWLWADLIAPQSE 171
Query: 77 -LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRT 135
LL+S VR EEAD FY P + T L + + R A++ I+ P W R+
Sbjct: 172 RLLTSVVRVHRQEEADLFYIPFFTTISFF---LMEKQQCKALYREALKWITDQ-PAWKRS 227
Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
G DH V H + ++ + + + LL + + V L + I PY
Sbjct: 228 GGRDHILPVHHPWS----FKSVRRYVKNAIWLLPDMDSTGNWYKPGQVYLEKDLIL--PY 281
Query: 196 APPQKMQAHQIPPDT--PRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFD 253
P + + +T RS ++FRG G R A + +
Sbjct: 282 VPNVDLCDAKCLSETNPKRSTLLFFRGRL-----KRNAGGKIRSKLGAELSGVDGVVIEE 336
Query: 254 ISTDH--PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
+ M++++FCL P G P S RL +A+V GCIPVII+D++ LPF +
Sbjct: 337 GTAGDGGKEAAQRGMRKSLFCLSPAGDTPSSARLFDAIVSGCIPVIISDELELPFEGILD 396
Query: 312 WEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP 363
+ +I +F++ D K L L I P I Q+ L S R L+ PAQP
Sbjct: 397 YRKIAVFISSIDAVKPGWLLKYLKGIRPAHIKAMQQNLVKYS--RHFLYSSPAQP 449
>gi|302790387|ref|XP_002976961.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300155439|gb|EFJ22071.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 408
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 74/312 (23%), Positives = 134/312 (42%), Gaps = 24/312 (7%)
Query: 34 DDPVGKLKVYVYELPSKYNKKLLLKDP----RCLTHMFAAEIFMHRFLLSSPVRTLNPEE 89
++ V KVYVY + ++ L P R L + F+ +F LL+S T +P E
Sbjct: 75 EEMVKSFKVYVYPFGNSDYSQVFLPHPDPYDRKLGNFFSEHMFKIN-LLNSTFATRDPGE 133
Query: 90 ADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFG 149
A F+ P + + S ++ IS + +WNRTEG DHF+V H G
Sbjct: 134 AHLFFMPFSINAMRNHPRIRSEAMISSFVESYVEEISQRYKFWNRTEGVDHFYVGCHSVG 193
Query: 150 ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPD 209
+ + R + + T + Q+ +V + ++ P P+ + +PP+
Sbjct: 194 -----RNAASNSRALQQNAIQVTCSANYYQKLYVPHKDVAL---PQVWPRPLDTFIVPPE 245
Query: 210 TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRA 269
R+ +F G + R +W N + +F + +Y + + R+
Sbjct: 246 -KRTKLAFFSG--------RAQNSHLRETLLKLWSNDSDMDIFAGTMQ--GSYEDALSRS 294
Query: 270 IFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDT 329
FCL G+ + R+ +A+ FGC+PVII++ LP ++ + W + ++ +P L
Sbjct: 295 KFCLHVKGYEVNTARISDALHFGCVPVIISNQYDLPLSNVLNWRSFSIVLSYTQIPALKA 354
Query: 330 ILTSIPPEVILR 341
L S+ + R
Sbjct: 355 KLQSVTHDEYAR 366
>gi|125547211|gb|EAY93033.1| hypothetical protein OsI_14833 [Oryza sativa Indica Group]
Length = 429
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 81/320 (25%), Positives = 136/320 (42%), Gaps = 37/320 (11%)
Query: 36 PVGKLKVYVYELPSKYNKKLLLKD-------PRCLTHMFAAEIFMHRFLLSSPVRTLNPE 88
P L++YVY + LL+ C+ + ++ +H+ LLSS RT + +
Sbjct: 51 PHRDLRIYVYAEDEVDGLRALLRGRNDDVTAATCIKGQWGTQVKIHQLLLSSRFRTFDKD 110
Query: 89 EADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDF 148
EAD F+ P Y C + TG + ++ ++++ S PY+ R+ G DH FV P
Sbjct: 111 EADLFFVPTYVKC-VRMTG---KLNDKEINQTYVKVVLSQMPYFRRSGGRDHIFVFPSGA 166
Query: 149 GACFHYQEEKAIERGILPLLQRATLVQTFGQRNH----VCLNE-GSITIPPYAPPQKMQA 203
GA + R L R+ ++ G R N I IP +++
Sbjct: 167 GA--------HLFRSWATFLNRSIILTPEGDRTDKRGISAFNTWKDIIIPGNVDDSMVKS 218
Query: 204 HQIP----PDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
++ P T R F G + G A + + +P +S
Sbjct: 219 DRLAVKPIPLTKRKYLANFLG-----RAQGKVGRLQLVKLAKQYPDKLESPELKLSGPDK 273
Query: 260 ---TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG 316
Y++ ++ A FCL P G + W+ R E+ C+PVI++D++ LPF + I + E+
Sbjct: 274 LGRIDYFKHLRNAKFCLAPRGESSWTLRFYESFFVECVPVILSDEVELPFQNVIDYTEVS 333
Query: 317 M-FVAEEDVPKLDTILTSIP 335
+ + A + P L L SIP
Sbjct: 334 IKWPASKIGPGLLEYLESIP 353
>gi|255569522|ref|XP_002525728.1| catalytic, putative [Ricinus communis]
gi|223535028|gb|EEF36711.1| catalytic, putative [Ricinus communis]
Length = 336
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 79/321 (24%), Positives = 135/321 (42%), Gaps = 49/321 (15%)
Query: 67 FAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRM--------M 118
+A+E + + L+ S T +P +AD F+ P + PR+ +
Sbjct: 33 YASESYFKKVLMKSHFITKDPTKADLFFLPF---------SIARLRHDPRIGVEGIQDFI 83
Query: 119 RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG 178
R+ + IS +PYWNRT G DHF+V H G + E+ I + + +
Sbjct: 84 RAYVYNISQKYPYWNRTGGTDHFYVACHSIGRTAMEKAEEVKFNAIQVVCSSSYYL---- 139
Query: 179 QRNHVCLNEGSITIPPYAPPQKMQAHQIPPD---TPRSIFVYFRGLFYDVNNDPEGGYYA 235
G I + PQ PP+ + R +F G N P
Sbjct: 140 --------SGYIAHKDASLPQVWPRQGDPPNLASSERQKLAFFAGSI----NSP-----V 182
Query: 236 RGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIP 295
R VW N ++ ++ T+Y +++ + FCL G+ + R+ +++ +GC+P
Sbjct: 183 RERLLQVWRN--DSEIYVHYGRLNTSYADELLGSKFCLHVKGFEVNTARIADSLYYGCVP 240
Query: 296 VIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE--VILRKQRLLANPSMKR 353
+IIA+ LPF D + WE + VA D+ L IL + + V+L+ L +++
Sbjct: 241 IIIANHYDLPFTDILNWESFSVVVATLDILYLKKILQGVSSDRYVMLQSNVL----KVRK 296
Query: 354 AMLFPQPAQPGDAFHQILNGL 374
+ P DAFH ++ L
Sbjct: 297 HFQWHFPPVDYDAFHMVMYEL 317
>gi|302782213|ref|XP_002972880.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
gi|300159481|gb|EFJ26101.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
Length = 459
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 99/397 (24%), Positives = 159/397 (40%), Gaps = 66/397 (16%)
Query: 34 DDPVGKLKVYVYELPSKY----------------NKKLLLKDPRCLTHMFAAEIFMHRFL 77
DD V LKVY+Y+LP+++ N L + P L + E ++ L
Sbjct: 79 DDRV--LKVYMYDLPAEFHFGMLDAAISGRPWPRNISSLPRYPGGLYQQHSPEYWLTADL 136
Query: 78 LSS-----------PVRTLNPEEADWFYTPIYPTCDLTP--------TGLPLPFKSPRMM 118
LSS VR +P AD F+ P + + G ++ R+
Sbjct: 137 LSSTDPSSRKSPCSAVRVADPATADIFFVPFFSSLSYNRYCRTGHRFQGGRGCVENDRLE 196
Query: 119 RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG 178
+ ++ + W R GADH V+ H+ + R +L + +V FG
Sbjct: 197 KRLVEFLRGQ-ELWRRNGGADHVIVM--------HHPNSLMVARSLLK--EAMFVVADFG 245
Query: 179 QRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGA 238
+ + N + PY A R ++F+G EGG
Sbjct: 246 RFSRAVANMRKDIVAPYKHVIPSFARDATTFESRETLLFFQGAIVR----KEGGII---- 297
Query: 239 RAAVWENFKNNPLFDIST-----DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGC 293
R ++E K++P T D + M+ A FCL G P S RL +A+ C
Sbjct: 298 RQKLYEILKDSPGVHFVTGNTQKDGIRSATAGMRNAKFCLHLAGDTPSSNRLFDAIASHC 357
Query: 294 IPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTI---LTSIPPEVILRKQRLLANPS 350
+PVII+D+I LPF D + + + +FV + + + L I + RK +L S
Sbjct: 358 VPVIISDEIELPFEDELDYSQFCVFVESDKALRKGFVVRALERIGRDEWTRKWAML--KS 415
Query: 351 MKRAMLFPQPAQPGDAFHQILNGLARKLPHDKSTYLK 387
++R + P+ P DA H G+A+++P KS K
Sbjct: 416 VERHFEYQHPSLPEDAVHMTWRGIAKRVPALKSMAHK 452
>gi|42571577|ref|NP_973879.1| exostosin-like protein [Arabidopsis thaliana]
gi|332191986|gb|AEE30107.1| exostosin-like protein [Arabidopsis thaliana]
Length = 410
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 78/310 (25%), Positives = 120/310 (38%), Gaps = 63/310 (20%)
Query: 40 LKVYVYELPSKYNKKLLL-------KDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADW 92
LK+YVY+ K LL K CL + +++ +H+ LL S RT+ +EAD
Sbjct: 89 LKIYVYDENEIDGLKELLYGRDGSVKTTACLKGQWGSQVKIHKLLLESKFRTIKKDEADL 148
Query: 93 FYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
F+ P Y C GL + + + + S PY+ R+ G DH FV P GA
Sbjct: 149 FFVPAYVKCVRMLGGL-----NDKEINQTYVKVLSQMPYFRRSGGRDHIFVFPSGAGAHL 203
Query: 153 HYQEEKAIERGIL--PLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIP--- 207
I R I+ P R +++ N I P M + P
Sbjct: 204 FRSWSTFINRSIILTPEADRTD------KKDTTAFNSWKDIIIPGNVDDAMTKNGQPDVQ 257
Query: 208 --PDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP------ 259
P + R + G A+G + L D+S P
Sbjct: 258 PLPLSKRKYLANYLG-------------RAQGKAGRL-------KLIDLSKQFPDKLECP 297
Query: 260 ------------TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFA 307
TTY+E ++ A FCL P G + W+ R E+ C+PV+++D LPF
Sbjct: 298 DLKFSGTEKFGRTTYFEHLRNAKFCLAPRGESSWTLRFYESFFVECVPVLLSDHAELPFQ 357
Query: 308 DAIPWEEIGM 317
+ I + ++ +
Sbjct: 358 NVIDYAQVSI 367
>gi|51477380|gb|AAU04753.1| EXO [Cucumis melo]
Length = 343
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 82/345 (23%), Positives = 149/345 (43%), Gaps = 38/345 (11%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
LK+YVY K + + C + F+H + T +P++A ++ P
Sbjct: 16 LKIYVY----KEGEPPMFHGGPCKSIYSTEGRFIHEMEKGNLYTTNDPDQALLYFLPFSV 71
Query: 100 TCDLTPTGLPLPFKSPRMMRSA---IQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQE 156
+ +P + + R+ I +IS P+W+R+ GADHF + HD+G
Sbjct: 72 VNLVQYLYVPNSHEVNAIGRAITDYINVISKKHPFWDRSLGADHFMLSCHDWGP------ 125
Query: 157 EKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQ-KMQAHQIP------PD 209
+PLL + ++ N ++EG + + P+ ++ +I
Sbjct: 126 ---RTTSYVPLLFNNS-IRVLCNAN---VSEGFLPSKDASFPEIHLRTGEIDGLIGGLSP 178
Query: 210 TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRA 269
+ RS+ +F G + + R W+ + L +Y ++++
Sbjct: 179 SRRSVLAFFAGRLHG---------HIRYLLLQEWKEKDEDVLVYEELPSGISYNSMLKKS 229
Query: 270 IFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDT 329
FCLCP G+ SPR+VEA+ C+PV+I++ V PF+D + W+ + + +D+P +
Sbjct: 230 RFCLCPSGYEVASPRVVEAIYAECVPVLISESYVPPFSDVLNWKSFSVQIQVKDIPNIKK 289
Query: 330 ILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
IL I LR QR + ++R + DAFH IL+ +
Sbjct: 290 ILKGISQTQYLRMQRRVKQ--VQRHFALNGTPKRFDAFHMILHSI 332
>gi|388493832|gb|AFK34982.1| unknown [Lotus japonicus]
Length = 229
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 44/132 (33%), Positives = 75/132 (56%), Gaps = 2/132 (1%)
Query: 243 WENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 302
WEN + S +YY+ ++++ FCLCP G+ SPR+VEA+ GC+PV+I++
Sbjct: 87 WENKDGDMQVYSSLPKGVSYYDMLRKSKFCLCPSGYEVASPRVVEAIYTGCVPVLISEHY 146
Query: 303 VLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQ 362
V PF+D + W+ + V+ +D+P+L IL S+ +R QR + ++R P +
Sbjct: 147 VPPFSDVLNWKSFSLEVSVKDIPRLKEILLSVNTRQYIRMQRRVGQ--IRRHFEIHSPPK 204
Query: 363 PGDAFHQILNGL 374
D FH +L+ +
Sbjct: 205 RFDVFHMVLHSV 216
>gi|22330483|ref|NP_176908.2| exostosin-like protein [Arabidopsis thaliana]
gi|115311405|gb|ABI93883.1| At1g67410 [Arabidopsis thaliana]
gi|332196520|gb|AEE34641.1| exostosin-like protein [Arabidopsis thaliana]
Length = 430
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 92/379 (24%), Positives = 156/379 (41%), Gaps = 53/379 (13%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRC-------------------LTHMFAAEIFMHRFLLS- 79
L+V++Y+LP K+N + + DP + + E ++ LL+
Sbjct: 53 LRVFMYDLPRKFN--IAMMDPHSSDVEPITGKNLPSWPQTSGIKRQHSVEYWLMASLLNG 110
Query: 80 -----SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLP---FKSPRMMRSAIQLISSNWPY 131
+R +P+ AD FY P + + G + + R+++ + N Y
Sbjct: 111 GEDENEAIRVFDPDLADVFYVPFFSSLSFNTHGKNMTDPDTEFDRLLQVELMEFLENSKY 170
Query: 132 WNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSIT 191
WNR+ G DH V+P F + ++ + IL +V FG+ +
Sbjct: 171 WNRSGGKDH--VIPMTHPNAFRFLRQQ-VNASIL-------IVVDFGRYSKDMARLSKDV 220
Query: 192 IPPY-----APPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENF 246
+ PY + ++ P R+ +YFRG + EG R + +
Sbjct: 221 VSPYVHVVESLNEEGDDGMGDPFEARTTLLYFRG---NTVRKDEGKIRLRLEKLLAGNSD 277
Query: 247 KNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 306
+ +T + E M+ + FCL P G P S RL +A+V CIPVII+D I LPF
Sbjct: 278 VHFEKSVATTQNIKVSTEGMRSSKFCLHPAGDTPSSCRLFDAIVSHCIPVIISDKIELPF 337
Query: 307 ADAIPWEEIGMFVAEEDVPKLDTILTSI---PPEVILRKQRLLANPSMKRAMLFPQPAQP 363
D I + E +F + ++ + IL ++ P E L + L N S F P +
Sbjct: 338 EDEIDYSEFSLFFSIKESLEPGYILNNLRQFPKEKWLEMWKRLKNVS--HHFEFQYPPKR 395
Query: 364 GDAFHQILNGLARKLPHDK 382
DA + + + K+P+ K
Sbjct: 396 EDAVNMLWRQVKHKIPYVK 414
>gi|326491429|dbj|BAJ94192.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 400
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 87/354 (24%), Positives = 147/354 (41%), Gaps = 57/354 (16%)
Query: 40 LKVYVYELPSKYNKKLLLKD-------PRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADW 92
L+VYVY + LL+ CL + ++ +H+ LL S RTL+ +EA+
Sbjct: 54 LRVYVYAEDEVDGLRALLRGRDGAVSAATCLKGQWGTQVKIHQLLLRSRYRTLDKDEANL 113
Query: 93 FYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
F+ P Y C GL + + + + S PY+ R+ G DH FV P GA
Sbjct: 114 FFVPSYVKCVRMTGGL-----TDKEINQTYVKVLSQMPYFRRSGGRDHIFVFPSGAGAHL 168
Query: 153 HYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPR 212
R L R+ ++ G R ++ I
Sbjct: 169 F--------RSWATFLNRSIILTPEGDRT----DKRGI---------------------- 194
Query: 213 SIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN---NPLFDISTDHP---TTYYEDM 266
S F ++ + N D G R + + + + +P +S Y++ +
Sbjct: 195 SAFNTWKDIIIPGNVDDSMGKVGRLKLVELAKQYPDKLESPELKLSGPDKLGRIDYFKHL 254
Query: 267 QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGM-FVAEEDVP 325
+ A FCL P G + W+ R E+ C+PV+++D++ LPF + I + +I + + A + P
Sbjct: 255 RNAKFCLAPRGESSWTLRFYESFFVECVPVLLSDEVELPFQNVIDYTKISIKWPASKIGP 314
Query: 326 KLDTILTSIPPEVILRKQRLLANPSMKRAM-LFPQPAQPGDAFHQILNGLARKL 378
+L L SIP E R + ++A R M ++ +P A I+ L RK+
Sbjct: 315 ELFQYLESIPEE---RIEEMIARGREVRCMWVYALDTEPCSAMTAIMWELQRKV 365
>gi|255076909|ref|XP_002502118.1| glycosyltransferase family 47 protein [Micromonas sp. RCC299]
gi|226517383|gb|ACO63376.1| glycosyltransferase family 47 protein [Micromonas sp. RCC299]
Length = 717
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 102/413 (24%), Positives = 171/413 (41%), Gaps = 81/413 (19%)
Query: 41 KVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPT 100
++Y+Y LP K+N + L+ LT ++ R LLSS R NPE+AD F+ PI PT
Sbjct: 227 RIYIYPLPPKFNGHVDLR----LTDRPLEQMIYER-LLSSHHRVANPEDADLFFLPI-PT 280
Query: 101 CDLTPTGLPLPFKSPRM---MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEE 157
GL P + AI+ + + W + + E + V D+G C + E+
Sbjct: 281 RAAFRGGLDNVGGWPGVNDFFHEAIEYVDNTWEWSKKHEWRNTIMVFTGDWGPCEWFSEK 340
Query: 158 KAIE-------------RGILPLLQRATLVQTFGQR--NHVCLNEG-------SITIPPY 195
+ + R I ++ A ++ +G + + L G + IPP
Sbjct: 341 YSKKENDPDYEAFWKKRRRINEVIANAIVLTHWGLTIADDLYLGGGPCFDPAKDVLIPPV 400
Query: 196 AP--------------------------------PQKMQAHQIPPDTPRSIFVYFRGLFY 223
P P + D PR ++F G +
Sbjct: 401 NPHFGLGPFDPDGWKAPMGTRRIEFDVGLRGSDVPFGSERAMTEQDEPRRWLLFFAGAWV 460
Query: 224 DVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYE-DMQRAIFCLCPLGWAPWS 282
D + Y R A A + + + H +YE + + FC+ P G + W
Sbjct: 461 D-----KPAYADRRAIAEAMAGREQEGIHVVQ--HAGQFYEKNYASSTFCIAPTG-SGWG 512
Query: 283 PRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVI--L 340
R+ A GCIPVI+ D+I P+ D +P++E + VA+ D+PK+ I+ +I PE + +
Sbjct: 513 RRMNLATQSGCIPVIVQDNIAAPYDDVLPYDEFSVRVAKADIPKIPDIVKAITPEKLDRM 572
Query: 341 RKQRLLANPSMKRAMLFPQPAQPG---DAFHQILNGLARKLPHDKSTYLKPGG 390
R+Q A +++ + + G DAF ++ L +L T++KP G
Sbjct: 573 RQQLACAARALQWSSILGSDFGEGGENDAFALLMLTLQHRL----VTHVKPEG 621
>gi|356577103|ref|XP_003556667.1| PREDICTED: probable glucuronosyltransferase Os02g0520750-like
[Glycine max]
Length = 459
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 86/320 (26%), Positives = 133/320 (41%), Gaps = 51/320 (15%)
Query: 82 VRTLNPEEADWFYTPIYPTCDL---------TPTGLPLPFKSPRMMRSAIQLISSNWPYW 132
VR +PEEAD F+ P + + L + +GL P S + A+ YW
Sbjct: 136 VRVADPEEADLFFVPFFSSLSLIVNPVRPPGSNSGLEKPVYSDEENQEALVEWLEKQEYW 195
Query: 133 NRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITI 192
R G DH V + A+ R I + LV FG+ + ++GS+ +
Sbjct: 196 KRNNGRDHVIVA----------SDPNAMYRVIDRVRNAVLLVSDFGR---LRPDQGSL-V 241
Query: 193 PPYAPPQKMQAHQIPPDT---PRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN 249
P + P D R ++F G Y EGG R + EN K
Sbjct: 242 KDVVVPYSHRIRTYPGDVGVEDRKTLLFFMGNRYR----KEGGK-IRDLLFQILENEK-- 294
Query: 250 PLFDISTDHPTTYYED-------MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 302
D+ H E M + FCL P G P + RL +A+V CIPVI++D+I
Sbjct: 295 ---DVIIKHGAQSRESRRAASHGMHTSKFCLHPAGDTPSACRLFDAIVSLCIPVIVSDNI 351
Query: 303 VLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQ 359
LPF D I + +I +FV K L + L ++ P+ +L Q+ +K + +
Sbjct: 352 ELPFEDTIDYRKIAVFVETSSAIKPGHLLSKLRAVTPDRVLEYQK-----KLKEVKRYFE 406
Query: 360 PAQPGDAFHQILNGLARKLP 379
+P ++I +++KLP
Sbjct: 407 YEEPDGTINEIWRQVSKKLP 426
>gi|302848153|ref|XP_002955609.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300259018|gb|EFJ43249.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 572
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 93/337 (27%), Positives = 139/337 (41%), Gaps = 63/337 (18%)
Query: 41 KVYVYELPSKYN-KKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
K+YVYE+P ++ K+ + K R HM M R +LS RT +PE+AD+FY P
Sbjct: 140 KIYVYEIPPDFHVKRDIHKVDRPPLHM----ALMER-ILSGGHRTADPEKADFFYIPASA 194
Query: 100 TCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKA 159
DL K ++ + I WP WN+T GA H D G C
Sbjct: 195 R-DL---------KRAFLLEPLLNYIIEAWPIWNQTGGARHIMPAEGDVGTC-------E 237
Query: 160 IERGILPLLQRATLVQTFGQ-----------RNHV-CLNEGSITIPPYAPPQKMQAH-QI 206
+ I + T +Q +G N V C+ G + P+ M +H +
Sbjct: 238 LPMKIRNMTANVTWLQFWGMYDFHPHWTQIFHNRVPCMVPGRDIVVPF---MAMSSHDRF 294
Query: 207 PPDTP---------RSIFVYFRGLFYDVNND----PEGGYYAR-----GARAAVWENFKN 248
+TP R+ +F G N P YY + G R AV+ ++
Sbjct: 295 VIETPLHPRNQKRNRTNTFFFAGGICGSGNKRALPPHCTYYKQVRYSGGVRQAVYYHYHK 354
Query: 249 NPLFDI--STDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 306
P + + TD Y D +IFCL G W R + A ++GCIPV D + F
Sbjct: 355 RPGWRVVPGTD---DYARDYASSIFCLAAAG-GGWGKRGIVATMYGCIPVAATDMLYEAF 410
Query: 307 ADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQ 343
+ W G+ V++ +P+L +L + PE I + Q
Sbjct: 411 EPEMDWNRFGVRVSQAQIPQLGDMLEAFTPEQIRQMQ 447
>gi|224144703|ref|XP_002325382.1| predicted protein [Populus trichocarpa]
gi|222862257|gb|EEE99763.1| predicted protein [Populus trichocarpa]
Length = 505
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 93/359 (25%), Positives = 148/359 (41%), Gaps = 56/359 (15%)
Query: 40 LKVYVYELPSKYNKKLL---LKDPRCLTHMFAAEIFMHRF-------------------- 76
++VY+YE+PSK+ LL R ++ + +HR
Sbjct: 118 IRVYLYEMPSKFTYDLLWLFRNTYRNTDNLTSNGSPVHRLIEQHSVDYWLWADLIAPESE 177
Query: 77 -LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRT 135
LL S VR E+AD FY P + T L + + R A++ ++ P W R+
Sbjct: 178 RLLKSVVRVERQEDADLFYVPFFTTISFF---LLEKQQCKALYREALKWVTDQ-PAWKRS 233
Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
EG +H F + H + ++ + + + LL + + V L + I PY
Sbjct: 234 EGRNHIFPIHHPWS----FKSVRRYVKNAIWLLPDMDSTGNWYKPGQVFLEKDLIL--PY 287
Query: 196 APPQKMQAHQI--PPDTPRSIFVYFRGLFYDVNNDPEGGYYAR------GARAAVWENFK 247
P + + ++ RS +YFRG + + G A+ GA E
Sbjct: 288 VPNVNLCDTKCISESESKRSTLLYFRG---RLKRNAGGKIRAKLVAELSGAEGVFIEEGT 344
Query: 248 NNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFA 307
M+++IFCL P G P S RL +A+V GCIPV+++D++ LPF
Sbjct: 345 ------AGEGGKAAAQIGMRKSIFCLSPAGDTPSSARLFDAIVSGCIPVVVSDELELPFE 398
Query: 308 DAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP 363
+ + +I +FV+ D + L L I I QR LA S R ++ PA P
Sbjct: 399 GILDYRKIALFVSSSDAVQPGWLLKFLKGISLAQIRGMQRNLAKYS--RHFIYSSPALP 455
>gi|255565307|ref|XP_002523645.1| catalytic, putative [Ricinus communis]
gi|223537097|gb|EEF38731.1| catalytic, putative [Ricinus communis]
Length = 452
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 106/434 (24%), Positives = 182/434 (41%), Gaps = 87/434 (20%)
Query: 3 VTAIALLLAFVSADNT-------------PKIERISGSAGDVLEDDPVG---KLKVYVYE 46
+ +I L+LA + NT P+ +SG G V E++ K+K+++Y+
Sbjct: 13 IFSICLILALYAVFNTFFNPTTSSLLYPSPEDNSLSGFPGKVTENNDNNNINKVKIFMYD 72
Query: 47 LPSKYNKKLLL--------KDPRCLT---HMFAAEIFM--------HRFLLSSPVRTLNP 87
LP K+ ++ KD + H E ++ H + S V+ +P
Sbjct: 73 LPKKFTTGIIQQHALARGSKDTSNVKYPGHQHMGEWYLFSDLNRPEHGRIGSPVVKVDDP 132
Query: 88 EEADWFYTPIYPTCDL---------TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGA 138
+EAD FY P++ + L T GL + M ++ + YW R G
Sbjct: 133 DEADLFYVPVFSSLSLIVNPVRPAGTEPGLVQHYSDEEMQEQLVEWLEQQ-EYWKRNNGR 191
Query: 139 DHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSIT---IPPY 195
DH ++ D A + + ++ IL L+ FG+ V ++GS+ I PY
Sbjct: 192 DHV-IIAGDPNALYRVLDR--VKNAIL-------LLSDFGR---VRPDQGSLVKDIIVPY 238
Query: 196 APPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDIS 255
+ + I R+ ++F G Y +GG R +++ ++ D+
Sbjct: 239 SHRINVYNGDIGV-RDRNTLLFFMGNRYR----KDGG----KIRDLLFQMLESEE--DVV 287
Query: 256 TDHPTTYYED-------MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
H T E+ M + FCL P G P + RL +++V C+PVI++D I LPF D
Sbjct: 288 IKHGTQSRENRRAASRGMHTSKFCLNPAGDTPSACRLFDSIVSLCVPVIVSDSIELPFED 347
Query: 309 AIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
I + +I +FV D K L +L + E IL Q+ +K+ + +
Sbjct: 348 VIDYTKIAIFVETTDSLKPGYLVKLLREVTSERILEYQK-----ELKKVTRYFEYDNSNG 402
Query: 366 AFHQILNGLARKLP 379
++I +A+KLP
Sbjct: 403 TVNEIWRQVAQKLP 416
>gi|357508695|ref|XP_003624636.1| Exostosin-like protein [Medicago truncatula]
gi|87162615|gb|ABD28410.1| Exostosin-like [Medicago truncatula]
gi|116831751|gb|ABK28848.1| exostosin-like protein [Medicago truncatula]
gi|355499651|gb|AES80854.1| Exostosin-like protein [Medicago truncatula]
Length = 486
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 75/310 (24%), Positives = 133/310 (42%), Gaps = 32/310 (10%)
Query: 67 FAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLIS 126
+A+E + + L+ S T +P +AD F+ P + L +R +Q +
Sbjct: 187 YASESYFKKALMKSHFITKDPTKADLFFMP-FSIASLRHDRRVGVGGIQDFIRDYVQNMI 245
Query: 127 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLN 186
+PYWNRT GADHF+V H G + I + + + + CL
Sbjct: 246 HKYPYWNRTNGADHFYVACHSIGRSAMDKAPDVKFNAIQVVCSSSYFLSGYIAHKDACL- 304
Query: 187 EGSITIPPYAPPQKMQAHQIPPD---TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVW 243
PQ ++ PP+ + R +F G +VN+ R W
Sbjct: 305 -----------PQIWPRNENPPNLVSSNRKKLAFFAG---EVNSP------VRINLVETW 344
Query: 244 ENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 303
+N + +F + T Y +++ + FC G+ + R+ +++ +GC+PVIIA+
Sbjct: 345 KN--DTEIFVHNGRLKTPYGDELLGSKFCFHVRGYEVNTARIGDSLYYGCVPVIIANYYD 402
Query: 304 LPFADAIPWEEIGMFVAEEDVPKLDTILTSI--PPEVILRKQRLLANPSMKRAMLFPQPA 361
LPFAD + W+ + V D+P L IL I E ++ ++ +L ++ + P
Sbjct: 403 LPFADVLNWKSFSVVVTTLDIPLLKKILKGIVNSGEYLMLQKNVLK---VREHFQWHSPP 459
Query: 362 QPGDAFHQIL 371
DAF+ ++
Sbjct: 460 IDFDAFYMVM 469
>gi|356546128|ref|XP_003541483.1| PREDICTED: LOW QUALITY PROTEIN: xylogalacturonan
beta-1,3-xylosyltransferase-like [Glycine max]
Length = 252
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 82/321 (25%), Positives = 131/321 (40%), Gaps = 78/321 (24%)
Query: 58 KDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRM 117
K P TH A +F+ F +S +R + Y P D P L R+
Sbjct: 4 KSPFKATHPEQAHLFLLPFSVSKVIR--------YVYKPRRSRSDYDPDRLQ------RL 49
Query: 118 MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTF 177
+ I ++++ +P WNR+ GADHF V HD+ + + + L A + F
Sbjct: 50 VLDYINIVANRYPNWNRSRGADHFLVSFHDW-----LDANPEVFKYFIRALCNANTSEGF 104
Query: 178 GQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARG 237
V + E + +P +++Q H+ P +GL
Sbjct: 105 QPSRDVSITE--VYLPSRKLDKEVQVHEYLP----------KGL---------------- 136
Query: 238 ARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVI 297
Y + M + FCLCP SPR+VEA+ GC+PVI
Sbjct: 137 -----------------------EYTKLMGQRKFCLCP-SXQVASPRVVEAIYVGCVPVI 172
Query: 298 IADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLAN-PSMKRAML 356
I D+ LPF+D + + F+A E +P+ TIL ++ + + L +N ++R +
Sbjct: 173 ICDNYSLPFSDVVKRSK---FIAVERIPETKTILQNVSKD---KYMELYSNVKRVRRHFV 226
Query: 357 FPQPAQPGDAFHQILNGLARK 377
+PA+P D H IL+ L K
Sbjct: 227 INRPAKPFDLIHMILHSLRNK 247
>gi|356571729|ref|XP_003554026.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 487
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 87/342 (25%), Positives = 138/342 (40%), Gaps = 66/342 (19%)
Query: 67 FAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRM--------- 117
+ +E + + L+ S T +P EAD F+ LPF R+
Sbjct: 185 YTSESYFKKVLMKSHFITKDPPEADLFF---------------LPFSMARLWHDRRVGVG 229
Query: 118 -----MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRAT 172
+R I IS +PYWN T GADHF+V H G + I + +
Sbjct: 230 GIQDFIRDYIHNISHRYPYWNNTGGADHFYVACHSIGRSAMDKAPDEKFNAIQVVCSSSY 289
Query: 173 LVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPD---TPRSIFVYFRGLFYDVNNDP 229
+ + CL PQ PP+ + R +F G N P
Sbjct: 290 FLTGYFAHKDACL------------PQIWPRKGNPPNLVSSKRKRLAFFAGGV----NSP 333
Query: 230 EGGYYARGARAAVWENFKNNPLFDISTDH---PTTYYEDMQRAIFCLCPLGWAPWSPRLV 286
R + E +KN+ +I H T Y +++ + FCL G+ + R+
Sbjct: 334 --------VRVKLLETWKNDS--EIFVHHGRLKTPYADELLGSKFCLHVKGFEVNTARIG 383
Query: 287 EAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLL 346
+++ +GC+PVIIA+ LPFAD + W+ + V D+P L IL I + K +L
Sbjct: 384 DSLYYGCVPVIIANYYDLPFADVLNWKSFSVVVTTLDIPLLKKILKDI---ISSNKYLML 440
Query: 347 ANPSMKRAMLFP--QPAQPGDAFHQILNGLARKLPHDKSTYL 386
+ +K F P Q DAF+ ++ L + K+T++
Sbjct: 441 QSNVLKVRKHFQWHSPPQDFDAFYMVMYELWLRRSSIKNTWV 482
>gi|260813939|ref|XP_002601673.1| hypothetical protein BRAFLDRAFT_94550 [Branchiostoma floridae]
gi|229286975|gb|EEN57685.1| hypothetical protein BRAFLDRAFT_94550 [Branchiostoma floridae]
Length = 786
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 116/468 (24%), Positives = 179/468 (38%), Gaps = 102/468 (21%)
Query: 13 VSADNTPKIERISGSAGDV-------LEDDPVGK---------------------LKVYV 44
V D + + R G AG + L+DD G+ L+V+V
Sbjct: 317 VKDDESSPLPRREGQAGHISGKRKVYLKDDTTGQPSPKPVKRPESAGNAAQTSSPLRVFV 376
Query: 45 YELPSKYNKKLL----LKDPRCL---------------------THMFAAEIFMHRFLLS 79
Y+LPS++N L+ +K+ RC THMF+ E+ +H+ LLS
Sbjct: 377 YDLPSEFNSGLVHCIQVKN-RCYQLQDYGMGLEFARYGNVSFRSTHMFSLEVILHQKLLS 435
Query: 80 SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLP-FKSPRMMRSAIQLISSNWPYWNRTEGA 138
S RTL+PE+AD FY P YP L P+ SP + R Q I+SN+PY+ + +
Sbjct: 436 STFRTLDPEKADVFYIPYYPA--LAAACEPVSTIDSPALDRELWQFITSNYPYFQQGK-- 491
Query: 139 DHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNH---------VCLNEGS 189
PH +E + GIL R + TF H + +
Sbjct: 492 ------PHMMALGRIEREHADVTGGILK--TRESRSVTFVAIEHESDPKTLKFIRRSGLP 543
Query: 190 ITIPPYAPPQKM----------QAHQIPPDTPRSIFVYFRG---LFYDVNNDPEGGYYAR 236
+ + PY + ++ + D PR + V F G + +D+
Sbjct: 544 MVVAPYPSCGHLLSDNKFGGESKSERTQLDIPRDVLVLFAGSRRMSHDIRRILSQQLRPT 603
Query: 237 GARAAVWENF-KNNPLFDISTDHPTTYYED----MQRAIFCLCPLGWAPWSPRLVEAVVF 291
+ + K N F ++ E+ M ++FCL P G +P +AV
Sbjct: 604 SEKYDATSSLNKQNVWFITQECRDRSWQENLVEWMHHSVFCLQPPGDSPTRKSFFDAVQC 663
Query: 292 GCIPVIIADDI--VLPFADAIPWEEIGMFVAEEDV----PKLDTILTSIPPEVILRKQRL 345
GCIPVI D V PF D + + + + V + D + IL IP VI K+
Sbjct: 664 GCIPVIFKLDHEPVYPFDDVLDYSKFTVKVTDGDFFQEKRSIVDILQDIPEAVIAAKRAE 723
Query: 346 LANPSMKRAMLFP--QPAQPGDAFHQILNGLARKLPHDKSTYLKPGGK 391
L + +P DAF I+ + R + + G K
Sbjct: 724 LRQVTPLLQYSYPPLPETHVQDAFDMIMQEIGRTRGGRSNVRRRYGSK 771
>gi|222635067|gb|EEE65199.1| hypothetical protein OsJ_20321 [Oryza sativa Japonica Group]
Length = 551
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 52/124 (41%), Positives = 70/124 (56%), Gaps = 6/124 (4%)
Query: 253 DISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 312
D + YY M+R+ FCLCP G SPR+VEA+ GC+PV++AD PFAD + W
Sbjct: 420 DAGGEGGNPYYWYMRRSRFCLCPSGHEVASPRVVEAIHAGCVPVVVADGYAPPFADVLRW 479
Query: 313 EEIGMFVAEEDVPKLDTILTSIP-PEVILRKQRLLANPSM-KRAMLFPQPAQPGDAFHQI 370
E + VA DVP+L +L IP PEV +RL + KR + QP + D FH I
Sbjct: 480 EAFSVAVAVADVPRLRELLERIPAPEV----ERLRDGVRLVKRHFMLHQPPERLDMFHMI 535
Query: 371 LNGL 374
L+ +
Sbjct: 536 LHSV 539
Score = 43.1 bits (100), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 40/160 (25%), Positives = 76/160 (47%), Gaps = 22/160 (13%)
Query: 10 LAFVSADNTPKIERI-------SGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRC 62
L+F + ++ER+ +G + +E + + KVYVYE + + + C
Sbjct: 106 LSFFGDADHARLERVYRNPRRSTGGSRSYVEMER--RFKVYVYE----EGEPPIAHEGPC 159
Query: 63 LTHMFAAE-IFMHRFLLSSP----VRTLNPEEADWFYTPIYPTCDLTPTGLPLPFK-SP- 115
+++A E F+ L +P VRT +P A + P+ + + PL + SP
Sbjct: 160 -KNIYAVEGRFIEELELMAPPLGGVRTWDPARAHALFLPLSVSQMVQLAYRPLSYDLSPL 218
Query: 116 -RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHY 154
++ + +++S +WNR+ GADHF + HD+ + H+
Sbjct: 219 RAIVADYVAVVASRHRFWNRSAGADHFMLSCHDWASTNHF 258
>gi|218197695|gb|EEC80122.1| hypothetical protein OsI_21883 [Oryza sativa Indica Group]
Length = 545
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 52/124 (41%), Positives = 70/124 (56%), Gaps = 6/124 (4%)
Query: 253 DISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 312
D + YY M+R+ FCLCP G SPR+VEA+ GC+PV++AD PFAD + W
Sbjct: 414 DAGGEGGNPYYWYMRRSRFCLCPSGHEVASPRVVEAIHAGCVPVVVADGYAPPFADVLRW 473
Query: 313 EEIGMFVAEEDVPKLDTILTSIP-PEVILRKQRLLANPSM-KRAMLFPQPAQPGDAFHQI 370
E + VA DVP+L +L IP PEV +RL + KR + QP + D FH I
Sbjct: 474 EAFSVAVAVADVPRLRELLERIPAPEV----ERLRDGVRLVKRHFMLHQPPERLDMFHMI 529
Query: 371 LNGL 374
L+ +
Sbjct: 530 LHSV 533
Score = 42.0 bits (97), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 61/124 (49%), Gaps = 13/124 (10%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAE-IFMHRFLLSSP----VRTLNPEEADWF 93
+ KVYVYE + + + C +++A E F+ L +P VRT +P A
Sbjct: 137 RFKVYVYE----EGEPPIAHEGPC-KNIYAVEGRFIEELELMAPPLGGVRTWDPARAHAL 191
Query: 94 YTPIYPTCDLTPTGLPLPFK-SP--RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGA 150
+ P+ + + PL + SP ++ + +++S +WNR+ GADHF + HD+ +
Sbjct: 192 FLPLSVSQMVQLAYRPLSYDLSPLRAIVADYVAVVASRHRFWNRSAGADHFMLSCHDWAS 251
Query: 151 CFHY 154
H+
Sbjct: 252 TNHF 255
>gi|52075645|dbj|BAD44815.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
gi|52075653|dbj|BAD44823.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
Length = 514
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 52/124 (41%), Positives = 70/124 (56%), Gaps = 6/124 (4%)
Query: 253 DISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 312
D + YY M+R+ FCLCP G SPR+VEA+ GC+PV++AD PFAD + W
Sbjct: 383 DAGGEGGNPYYWYMRRSRFCLCPSGHEVASPRVVEAIHAGCVPVVVADGYAPPFADVLRW 442
Query: 313 EEIGMFVAEEDVPKLDTILTSIP-PEVILRKQRLLANPSM-KRAMLFPQPAQPGDAFHQI 370
E + VA DVP+L +L IP PEV +RL + KR + QP + D FH I
Sbjct: 443 EAFSVAVAVADVPRLRELLERIPAPEV----ERLRDGVRLVKRHFMLHQPPERLDMFHMI 498
Query: 371 LNGL 374
L+ +
Sbjct: 499 LHSV 502
Score = 38.9 bits (89), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 33/119 (27%), Positives = 58/119 (48%), Gaps = 13/119 (10%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAE-IFMHRFLLSSP----VRTLNPEEADWF 93
+ KVYVYE + + + C +++A E F+ L +P VRT +P A
Sbjct: 137 RFKVYVYE----EGEPPIAHEGPC-KNIYAVEGRFIEELELMAPPLGGVRTWDPARAHAL 191
Query: 94 YTPIYPTCDLTPTGLPLPFK-SP--RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFG 149
+ P+ + + PL + SP ++ + +++S +WNR+ GADHF + HD+
Sbjct: 192 FLPLSVSQMVQLAYRPLSYDLSPLRAIVADYVAVVASRHRFWNRSAGADHFMLSCHDWA 250
>gi|413915952|gb|AFW55884.1| secondary cell wall glycosyltransferase family 47 [Zea mays]
Length = 484
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 96/396 (24%), Positives = 157/396 (39%), Gaps = 63/396 (15%)
Query: 28 AGDVLEDDPVGKLKVYVYELPSKYNKKLLL--------------------KDPRCLTHMF 67
A V+ DP ++V++Y++P +++ LL + P L
Sbjct: 95 AAGVVRCDPRDAVRVFMYDMPPEFHFGLLGWSPPSPDSVWPDVTAASPPPRYPGGLNQQH 154
Query: 68 AAEIFMHRFLLSSP--------VRTLNPEEADWFYTPIYPTCDLTPTGLPLP---FKSPR 116
+ E ++ LLSS VR + +AD + P + + P+P R
Sbjct: 155 SVEYWLTLDLLSSSPPCGRHSAVRVSDSRDADLVFVPFFASLSYNRHYRPVPPEKVSRDR 214
Query: 117 MMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQT 176
+++ + + P W R GADH +V H + H + +L A V +
Sbjct: 215 VLQEKLVRYLAARPEWRRYGGADHV-IVAHHPNSLLHARA----------VLHPAVFVLS 263
Query: 177 -FGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGG--- 232
FG+ + I PY K A+ R +YFRG Y EGG
Sbjct: 264 DFGRYPPRVASLEKDVIAPYKHMAKTYANDSAGFDDRPTLLYFRGAIYR----KEGGSIR 319
Query: 233 ---YYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAV 289
YY V+ +F + + + + M + FCL G P S RL +A+
Sbjct: 320 QELYYMLKEEKDVYFSFGS-----VQDHGASKASQGMHSSKFCLNIAGDTPSSNRLFDAI 374
Query: 290 VFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLL 346
V C+PVII+DDI LP+ D + + + +FV D K L +L+ + + + L
Sbjct: 375 VTHCVPVIISDDIELPYEDVLDYSKFSIFVRSSDAVKKGYLMRLLSGVSKQQWTKMWDRL 434
Query: 347 ANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDK 382
+ + + P+Q DA I L+RK+P K
Sbjct: 435 KE--VDKHFEYQYPSQKDDAVQMIWQALSRKVPSIK 468
>gi|224144815|ref|XP_002325425.1| predicted protein [Populus trichocarpa]
gi|222862300|gb|EEE99806.1| predicted protein [Populus trichocarpa]
Length = 332
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 78/316 (24%), Positives = 133/316 (42%), Gaps = 43/316 (13%)
Query: 67 FAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRM--------M 118
+A+E + + L+ S T +P +AD F+ P + T L PR+ +
Sbjct: 33 YASESYFKKALMKSHFITKDPAKADLFFLP------FSITRLR---HDPRVGVGGIQDFI 83
Query: 119 RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG 178
R I IS +P+WNRT GADHF+ H G + E+ I + + + +
Sbjct: 84 RDYILNISRKYPFWNRTGGADHFYAACHSIGRSAMEKSEEVKFNAIQVVCSSSYFLSGY- 142
Query: 179 QRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGA 238
+ ++ P Q+ R +F G N P R
Sbjct: 143 ------IAHKDVSFP------GCHLSQVVKCDYRKKLAFFAGSI----NSP-----VRER 181
Query: 239 RAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 298
W N ++ +F T Y +++ + FCL G+ + R+ +++ +GC+PVII
Sbjct: 182 LLHSWRN--DSEIFAHFGRLTTPYADELLGSKFCLHVKGFEVNTARIGDSLYYGCVPVII 239
Query: 299 ADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFP 358
A+ LPFAD + W+ + VA D+P L IL I + L Q+ + +++ +
Sbjct: 240 ANHYDLPFADILNWKSFSVVVATLDIPLLKKILKGISSDQYLMFQKKVL--EVRKHFQWH 297
Query: 359 QPAQPGDAFHQILNGL 374
P DAF+ ++ L
Sbjct: 298 CPPVDYDAFYMVMYEL 313
>gi|359472749|ref|XP_002276440.2| PREDICTED: probable glycosyltransferase At5g11130-like [Vitis
vinifera]
gi|147815974|emb|CAN68074.1| hypothetical protein VITISV_007510 [Vitis vinifera]
gi|297737986|emb|CBI27187.3| unnamed protein product [Vitis vinifera]
Length = 461
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 96/383 (25%), Positives = 149/383 (38%), Gaps = 68/383 (17%)
Query: 38 GKLKVYVYELPSKYN--------------KKLLLKDPRCLT---HMFAAEIFMHRFLLSS 80
G +KVY+Y+LP K+ +K+ + D L H + E ++ L+
Sbjct: 70 GPVKVYMYDLPRKFTYGVIESYAVARGGLEKVPVDDVSSLKYPGHQHSGEWYLFSDLIRE 129
Query: 81 P------VRTLNPEEADWFYTPIYPTCDLTPT--------GLPLPFKSPRMMRSAIQLIS 126
VR +PEEAD FY + + L G + M S ++ +
Sbjct: 130 DRGRRYVVRVSDPEEADLFYVSFFSSLSLVVNPIRPANGEGAGTGYSDEEMQESLMEWLE 189
Query: 127 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLN 186
YW R G DH F+ Q+ A+ + + LV FG+ +
Sbjct: 190 QQ-EYWKRNNGRDHVFIC----------QDPNALHLIVDRVKNGVLLVSDFGRLRSDTAS 238
Query: 187 EGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENF 246
I PYA K + +I + +S+ ++F G Y EGG R +++
Sbjct: 239 LVKDVILPYAHRIKSYSGEIGVENRKSL-LFFMGNRYR----KEGG----KIRDLLFQIL 289
Query: 247 KNNPLFDISTDHPTTYYED-------MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIA 299
+ D+ H E M + FCL P G P + RL +A+V C+PVI++
Sbjct: 290 EQEE--DVIIKHGAQSRESRRMASQGMHSSKFCLHPAGDTPSACRLFDAIVSLCVPVIVS 347
Query: 300 DDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAML 356
D I LPF D I + +I +FV K L L I E IL QR M+
Sbjct: 348 DQIELPFEDVIDYRKIAIFVDSTSAVKPGFLVKNLRKITRERILEYQR-----EMQEVTR 402
Query: 357 FPQPAQPGDAFHQILNGLARKLP 379
+ + +I ++ KLP
Sbjct: 403 YFEYEDTNGTVSEIWRQVSMKLP 425
>gi|307111456|gb|EFN59690.1| hypothetical protein CHLNCDRAFT_133230 [Chlorella variabilis]
Length = 569
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 91/356 (25%), Positives = 144/356 (40%), Gaps = 38/356 (10%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP-- 96
K+Y+YEL +++ D + FM FL SPVRT +P EA FY P
Sbjct: 243 NFKIYMYELNTQW-----AYDNDHHIAWTGHDPFMEEFL-ESPVRTEDPSEASLFYIPAF 296
Query: 97 IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQE 156
+Y G + + I + WPYW+R G DHF VP D G C
Sbjct: 297 LYSYSGNMAGG-------DEHTQLLLDHIRATWPYWDRHGGRDHFLFVPADRGTCPWGSR 349
Query: 157 EKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQI--PPDTPRSI 214
+ R I+ +T H +G Y P + + A P P +
Sbjct: 350 FSDLIR-IVHFGMHSTRTNHNPHFGH----QGHPEFGCYNPLRDIVAAGTGAPLSLPWAG 404
Query: 215 FVYFRGLFYDVNNDPEGGYYAR-----GARAAVWENFKNNPLFDISTDHPTTYYEDMQRA 269
+++F F + Y R A W N+P F S + Y + A
Sbjct: 405 WLFF---FAGSIRTDDNVYSGRTRLILSELVAQW----NDPEFSFSGGYVNNYPAGFREA 457
Query: 270 IFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDT 329
FCL P G+ + RL ++++ GC+PV+I + + P+ + +P+E + ++ ED+P+L
Sbjct: 458 KFCLAPWGYG-FGMRLHQSILGGCVPVVIQEHVFQPYEEVLPYETFSLRLSNEDLPQLRE 516
Query: 330 ILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDKSTY 385
L S+ E + + LL + + G AF + L R+ + S Y
Sbjct: 517 TLRSVTDE---QYRELLEGVVRYKEAFSWERHLGGRAFDYTIASLRRRWLNSLSLY 569
>gi|302773760|ref|XP_002970297.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
gi|300161813|gb|EFJ28427.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
Length = 437
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 95/404 (23%), Positives = 169/404 (41%), Gaps = 65/404 (16%)
Query: 13 VSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKD------------- 59
+ +++ I R SG KLKV++YELP KYN L +D
Sbjct: 39 IRSESRLGISRTSGKNA---------KLKVFMYELPRKYNFGLFNRDGPAQEIPWKNLSN 89
Query: 60 -----PRCLTHMFAAEIFMHRFLLS-------SPVRTLNPEEADWFYTPIYPTCDLTPTG 107
+ L + E +M LL + R +P EAD F+ P + + G
Sbjct: 90 LPGPHTQGLKKQHSVEYWMTLDLLDEGGREFRAAQRVSDPGEADVFFVPYFASLSFNVFG 149
Query: 108 L----PLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERG 163
+ P ++ I+ +S + P++ R+ G DH V+ H F + + +R
Sbjct: 150 VSMRDPETEHDKKLQVGMIEYLSKS-PWYQRSGGRDHVLVLHHP--NAFRFLK----DRL 202
Query: 164 ILPLLQRATLVQTFGQ--RNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGL 221
L LL +V FG+ + L++ + + P P R+ ++F+G
Sbjct: 203 NLSLL----VVADFGRFPKGVAALHKDVVAPYSHMVPTYNGDDGTDPFEERTTLLFFQG- 257
Query: 222 FYDVNNDPEGGYYARGARAAVWEN-----FKNNPLFDISTDHPTTYYEDMQRAIFCLCPL 276
V +G R AA+ EN F+ + + + + M+ + FCL P
Sbjct: 258 --RVKRKDDG--VVRTQLAAILENQPRVHFEEGIATNFTVEQAM---QGMRSSRFCLHPA 310
Query: 277 GWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPP 336
G P S RL +A+V C+PVI++D I LPF D + + E +F + ++ + +L ++
Sbjct: 311 GDTPSSCRLFDAIVSHCVPVIVSDKIELPFEDELDYSEFSLFFSVDEAVRPGFLLGALEK 370
Query: 337 EVILRKQRLLAN-PSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
R ++ + R + P+Q DA + + + + +K+P
Sbjct: 371 FSKRRWMKMWRRLKQVTRHFEYQHPSQRDDAVNMLWSQIHKKVP 414
>gi|159481468|ref|XP_001698801.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158273512|gb|EDO99301.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 967
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 99/402 (24%), Positives = 158/402 (39%), Gaps = 98/402 (24%)
Query: 32 LEDDPVGKLKVYVYELPSKYNKKLLLKDPRC-----------------LTHMFAAEIFMH 74
L +D G + LP + K+L L RC ++ E++ H
Sbjct: 472 LTNDHCGYRQSSHTPLPFNFPKRLCLCSARCDHCGYRQFSGPSNSSELFADGYSVEVYFH 531
Query: 75 RFLLSSPVRTLNPEEADWFYTPIYPTCDL-------------TPTGLPLPFKSPRMMRSA 121
L SP RT +P+EAD+FY P+Y TC + PT P + + +A
Sbjct: 532 EVLAISPHRTFDPDEADFFYLPVYYTCWMWPVNGWADTPFWGAPTSWHRPSNAANLWLAA 591
Query: 122 IQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQE--EKAIERGILPLLQRATLVQT--- 176
+ I ++PYW+R G DH ++ HD GAC+ E +I +L R L T
Sbjct: 592 KRWIQQHFPYWDRRGGRDHIWMTNHDEGACYMPTEIYNSSI---MLTHWGRLDLNHTSNT 648
Query: 177 -FGQRN----------------------HVCLN-EGSITIPPYAPPQKMQAHQIP--PDT 210
+G N H C + + + IP + PP+ + + P
Sbjct: 649 AYGPDNYSTGLTWPDINGGRDVTELWAGHPCYDPKKDLVIPGFKPPEHYKRSPLLGFPPY 708
Query: 211 PRSIFVYFRGLFYDVNND--PEGGYYARGARAAVWENFKNNPLFDISTDHP--------- 259
R I +Y RG DV P Y+RG R + +K + +H
Sbjct: 709 QRDILLYLRG---DVGKHRLPN---YSRGTRQKL---YKLSQAHGWIAEHRIFIGEKYEL 759
Query: 260 -TTYYEDMQRAIFCLCPLGWAP------------WSPRLVEAVVFGCIPVIIADDIVLPF 306
Y + + R++FC P ++ R +AV+ GC+P++I D F
Sbjct: 760 VGDYSDHLARSVFCAVVPAPPPTSKPPNLQPGDGYAMRFEDAVLHGCLPLVIMDRTHAVF 819
Query: 307 ADAIPWEEIGMFVAEEDVPK-LDTILTSIPPEVILRKQRLLA 347
+ + + ++E + + L +L +I PE I R QR LA
Sbjct: 820 ESILDIDGFSLRISEAALDEHLPALLKAIAPEQIERMQRRLA 861
>gi|224120296|ref|XP_002318294.1| predicted protein [Populus trichocarpa]
gi|222858967|gb|EEE96514.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 85/323 (26%), Positives = 136/323 (42%), Gaps = 50/323 (15%)
Query: 78 LSSPVRTLN-PEEADWFYTPIYPTCDLTPTGLPL-------PFKSPRMMRSAIQLISSNW 129
+ SPV +N PEEAD FY P++ + LT + + P S M+ +
Sbjct: 53 VGSPVVKVNDPEEADLFYVPVFSSLSLTVNPVQVGKVPVSDPVYSDEKMQDELVEWLEKQ 112
Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGS 189
YW R G DH + A+ R + + L+ FG+ V ++GS
Sbjct: 113 EYWRRNNGRDHVLFAG----------DPNALYRVLDRVKNAVLLLSDFGR---VRSDQGS 159
Query: 190 IT---IPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENF 246
+ I PYA + I D R ++F G Y +GG R +++
Sbjct: 160 LVKDVIVPYAHRINVYNGDIGVD-ERKTLLFFMGNRYR----KDGG----KIRDMLFQLL 210
Query: 247 KNNPLFDISTDHPTTYYED-------MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIA 299
+ D+ H T E M + FCL P G P + RL +++V C+P+I++
Sbjct: 211 EKEE--DVLISHGTQSRESRRTATLGMHTSKFCLNPAGDTPSACRLFDSIVSLCVPLIVS 268
Query: 300 DDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAML 356
D I LPF D I + +I +FV E K L +L ++ E IL Q+ M+
Sbjct: 269 DSIELPFEDVIDYRKIAIFVDTESSLKPGYLVKLLRAVSTERILEYQK-----EMREVKR 323
Query: 357 FPQPAQPGDAFHQILNGLARKLP 379
+ + + P ++I + +KLP
Sbjct: 324 YFEYSDPNGTVNEIWREIGQKLP 346
>gi|357126548|ref|XP_003564949.1| PREDICTED: probable glycosyltransferase At3g42180-like
[Brachypodium distachyon]
Length = 432
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 85/311 (27%), Positives = 133/311 (42%), Gaps = 33/311 (10%)
Query: 82 VRTLNPEEADWFYTPIYPTCDLTPTGLPLP---FKSPRMMRSAIQLISSNWP--YWNRTE 136
VR +PE A+ F+ P + + G + ++ R+++ ++LI W YW R+
Sbjct: 124 VRVADPEAAEAFFVPFFSSLSFNVHGRNMTDPDTEADRLLQ--VELIDVLWKSKYWQRSA 181
Query: 137 GADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYA 196
G DH V+P F + + + +L +V FG+ + + PY
Sbjct: 182 GRDH--VIPMHHPNAFRFLRD-MVNASVL-------IVADFGRYTQELASLRKDVVAPYV 231
Query: 197 PPQKMQAHQIPPD--TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDI 254
+ PPD R ++FRG EG A+ A+ K+ F+
Sbjct: 232 HVVDSFINDDPPDPFEARPTLLFFRG---RTVRKAEGKIRAKLAKIL---KDKDGVRFED 285
Query: 255 ST---DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
S + T E M+ + FCL P G P S RL +A+V CIPVI++ I LPF D I
Sbjct: 286 SLATGEGINTSTEGMRSSKFCLHPAGDTPSSCRLFDAIVSHCIPVIVSSRIELPFEDEID 345
Query: 312 WEEIGMFVAEEDVPKLDTILTSI---PPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFH 368
+ E +F + E+ K D +L + P E + L N S F P + GDA +
Sbjct: 346 YSEFSLFFSVEEALKPDYLLDQLRQMPKEKWVEMWSKLKNVSSHYE--FQYPTRKGDAVN 403
Query: 369 QILNGLARKLP 379
I + K+P
Sbjct: 404 MIWRQVRHKIP 414
>gi|159473729|ref|XP_001694986.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158276365|gb|EDP02138.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 703
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 101/415 (24%), Positives = 158/415 (38%), Gaps = 91/415 (21%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTH-MFAAEIFMHRFLLSS-PVRTLNPEEADWFYTP 96
K+KVYVY+LPS + D L M+ AE+ LL VRT NP EA FY P
Sbjct: 272 KIKVYVYDLPSNVVHRREFHDQWSLIDLMYNAELEFTELLLGDWGVRTENPWEAALFYVP 331
Query: 97 IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQE 156
+ G P +++ A + + N P++N T G +H F +D G C
Sbjct: 332 TFTYWYTGNVGHPY-----FVIQHATKWLQENSPFFNLTGGRNHIFWATNDRGVCKLQMA 386
Query: 157 EKAIERGI----------LPLLQRATLVQTFGQRNHVCLN-------------------- 186
++ I P R + QT H+ L
Sbjct: 387 PPEMQHSIKLVHFGQSPRRPYAARMKVGQTTDPTAHLALRGALPQPGTRFPEFPAEFTAM 446
Query: 187 ------------EGSITIPPYAPPQKMQAH---------QIP-----------PDTP-RS 213
E + P Y P ++ H Q+P PD P R+
Sbjct: 447 DILEESEICIRPEKDVVTPNYIMPTWVEPHNYNQVWNVTQLPDGTRQVTRKRGPDVPVRN 506
Query: 214 IFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYE--------- 264
+ +YF G P+ Y ++G R + + F +D + Y+
Sbjct: 507 LTLYFAGF-----TKPQMSY-SQGVRQLIHKLFGPGGKYDPKGPNARPDYKVGGPGGGEA 560
Query: 265 --DMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
MQ++ FCL P+G + W RLVEA++ GC+PVII D + D +P+ E + V
Sbjct: 561 ATYMQQSRFCLAPMG-SGWGIRLVEAMISGCVPVIIQDHVYQAHWDVVPFPEFSIRVGRH 619
Query: 323 DVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARK 377
D+ +L +L + P+ + + L A F G A++ + L ++
Sbjct: 620 DLHRLVELLDDVAPQEL---EELQAGIERYHRAFFWDAQWGGLAYNYTIQALKQR 671
>gi|449487654|ref|XP_004157734.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
[Cucumis sativus]
Length = 464
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 102/412 (24%), Positives = 161/412 (39%), Gaps = 70/412 (16%)
Query: 12 FVSADNTPKIERISGSAGDVLEDD--PVGKLKVYVYELPSKYNKKLL------------- 56
F SA++ + S D+L + +K+Y+Y++P+++ ++
Sbjct: 43 FSSANSVIVSDEFSSQDTDLLNSSGKSLSPVKIYLYDVPTRFTYGVIENHGIARGGKPVP 102
Query: 57 -LKDPRCLTHMFAAEIFMHRFLL--------SSPVRTLNPEEADWFYTPIYPTCDLTPTG 107
+ D + H AE F+ LL S+ VR +PEEAD FY P + + L
Sbjct: 103 DVTDLKYPGHQHMAEWFLFTDLLRPESERIGSAVVRVFDPEEADLFYVPFFSSLSLIVNP 162
Query: 108 LPLPFKSPRMMRSAI------QLISSNW----PYWNRTEGADHFFVVPHDFGACFHYQEE 157
+ S + R + Q W YW R+ G DH + Q+
Sbjct: 163 IRPATGSDQQQRKLVYSDEETQDAFMEWLEKQEYWKRSNGRDHVIIA----------QDP 212
Query: 158 KAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVY 217
A+ R I + LV FG+ + I PY+ I + R ++
Sbjct: 213 NALYRLIDRVKNSILLVSDFGRLRADQASLVKDVIVPYSHRINTYTGDIGVEN-RKTLLF 271
Query: 218 FRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYED-------MQRAI 270
F G Y EGG R ++ + D+ H T E M +
Sbjct: 272 FMGNRYR----KEGG----KIRDMLFNILELEQ--DVIIKHGTQSRESRRAATHGMHTSK 321
Query: 271 FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK---L 327
FCL P G P + RL ++VV C+PVI++D I LPF D I + +I +F K L
Sbjct: 322 FCLNPAGDTPSACRLFDSVVSLCVPVIVSDSIELPFEDVIDYSKIAVFFDSVSAVKPEFL 381
Query: 328 DTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
+ L I E IL QR MK+ + + ++I +++KLP
Sbjct: 382 ISKLRRISEERILDYQR-----EMKKIKRYFEYTDSNGTVNEIWRQVSQKLP 428
>gi|116787500|gb|ABK24530.1| unknown [Picea sitchensis]
Length = 446
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 92/378 (24%), Positives = 151/378 (39%), Gaps = 56/378 (14%)
Query: 40 LKVYVYELPSKYNKKLLLKDPR------------------CLTHMFAAEIFMHRFLLS-- 79
LKVY+Y+LP ++N +L K+ L + E +M +LL
Sbjct: 66 LKVYMYDLPRRFNLGMLKKNSSDLDLPWTSSKIPPWPQRSGLKKQHSIEYWMMVYLLGQH 125
Query: 80 -------SPVRTLNPEEADWFYTPIYPTCDLTPTGL----PLPFKSPRMMRSAIQLISSN 128
+ VR +P++AD FY P + + G P ++ + ++ +
Sbjct: 126 VGEEGERTAVRVADPDQADVFYVPFFASLSFNTHGQNMRDPETEFDKKLQIEVVDMLKRS 185
Query: 129 WPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQ--RNHVCLN 186
W R+ G DH V+ H F E A +V FG+ R+ L
Sbjct: 186 -KSWQRSGGRDHVIVIHHPNAFRFLRDEVNA----------SIFVVADFGRYPRSVSFLR 234
Query: 187 EGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENF 246
+ + + + P R++ +YFRG EG + R A + N
Sbjct: 235 KDVVAPYVHVVDTYVNDDSSDPFESRTMLLYFRG---RTKRKDEG--FVRLKLAKILGNH 289
Query: 247 KNNPLFD--ISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL 304
K D +T+ + M+ + FCL P G P S RL +A+V C+PVI++D I L
Sbjct: 290 KRVHFEDSLATTEGFEVAKQGMRSSRFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDRIEL 349
Query: 305 PFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPA 361
PF D I ++E +F + ++ + L L + P E L+ L + + P
Sbjct: 350 PFEDEIDYQEFSLFFSVKEALRPGYLMQKLETFPKEKWLKMWNKLKQ--VAHHFEYQYPP 407
Query: 362 QPGDAFHQILNGLARKLP 379
DA + + + RKLP
Sbjct: 408 IKDDAVNMLWRQIHRKLP 425
>gi|302793388|ref|XP_002978459.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
gi|300153808|gb|EFJ20445.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
Length = 437
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 92/404 (22%), Positives = 167/404 (41%), Gaps = 65/404 (16%)
Query: 13 VSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKD------------- 59
+ +++ I R SG KLKV++YELP KYN L +D
Sbjct: 39 IRSESRLGISRTSGKNA---------KLKVFMYELPRKYNFGLFDRDGPAQEIPWKNLSN 89
Query: 60 -----PRCLTHMFAAEIFMHRFLLS-------SPVRTLNPEEADWFYTPIYPTCDLTPTG 107
+ L + E +M LL + R +P EAD F+ P + + G
Sbjct: 90 LPGPHTQGLKKQHSVEYWMTLDLLDEGGREFRAAQRVSDPGEADVFFVPYFASLSFNVFG 149
Query: 108 L----PLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERG 163
+ P ++ I+ +S + P++ R+ G DH V+ H F + +++
Sbjct: 150 VSMRDPETEHDKKLQVGMIEYLSKS-PWYQRSGGRDHVLVLHHP--NAFRFLKDR----- 201
Query: 164 ILPLLQRATLVQTFGQ--RNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGL 221
L +V FG+ + L++ + + P P R+ ++F+G
Sbjct: 202 ---LNSSLLVVADFGRFPKGVAALHKDVVAPYSHMVPTYNGDDGSDPFEERTTLLFFQG- 257
Query: 222 FYDVNNDPEGGYYARGARAAVWEN-----FKNNPLFDISTDHPTTYYEDMQRAIFCLCPL 276
V +G R AA+ EN F+ + + + + M+ + FCL P
Sbjct: 258 --RVKRKDDG--VVRTQLAAILENQPRVHFEEGIATNFTVEQAM---QGMRSSRFCLHPA 310
Query: 277 GWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPP 336
G P S RL +A+V C+PVI++D I LPF D + + E +F + ++ + +L ++
Sbjct: 311 GDTPSSCRLFDAIVSHCVPVIVSDKIELPFEDELDYSEFSLFFSVDEAVRPGFLLGALEK 370
Query: 337 EVILRKQRLLAN-PSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
R ++ + R + P+Q DA + + + + +K+P
Sbjct: 371 FSKRRWMKMWRRLKQVTRHFEYQHPSQRDDAVNMLWSQIHKKVP 414
>gi|242046000|ref|XP_002460871.1| hypothetical protein SORBIDRAFT_02g036640 [Sorghum bicolor]
gi|241924248|gb|EER97392.1| hypothetical protein SORBIDRAFT_02g036640 [Sorghum bicolor]
Length = 500
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 100/378 (26%), Positives = 152/378 (40%), Gaps = 77/378 (20%)
Query: 36 PVGKLKVYVYELPSKYNKKLLL--------------------KDPRCLTHMFAAEIFMHR 75
P +K+Y+Y+LP+K+ ++ + R H +AE ++ +
Sbjct: 100 PPAPVKIYLYDLPAKFTYGVVRSYTATRAPSGSADAAAALPDEQLRYPGHQHSAEWWLFK 159
Query: 76 FLL-----SSPV-RTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSP----------RMMR 119
LL PV R +P +AD FY P + + L P +SP
Sbjct: 160 DLLRRGPRDRPVARVDDPSDADLFYVPFFSSLSLVVN----PIRSPPAANASGAAAAYSD 215
Query: 120 SAIQLISSNW----PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ 175
A+Q W PYW R G DH F+ Q+ A+ R I + LV
Sbjct: 216 DAMQEELLEWLERQPYWRRHMGRDHVFIC----------QDPNALYRVIDRISNAVLLVS 265
Query: 176 TFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYA 235
FG+ + I PY+ ++ D R + ++F G Y EGG
Sbjct: 266 DFGRLRSDQASLVKDVILPYSHRINSFKGEVGVDG-RPLLLFFMGNRYR----KEGG--- 317
Query: 236 RGARAAVWENFKNNPLFDISTDHPTTYYED-------MQRAIFCLCPLGWAPWSPRLVEA 288
R A+++ +N D++ H T E M + FCL P G P + RL +A
Sbjct: 318 -KVRDALFQILENED--DVTIKHGTQSRESRRAARQGMHSSKFCLHPAGDTPSACRLFDA 374
Query: 289 VVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRL 345
+V C+PVI++D I LPF D I + +I +FV + L ++L I E IL QR
Sbjct: 375 LVSLCVPVIVSDYIELPFEDIIDYNKISIFVGTSKAVQPGYLTSMLRRISSERILEYQR- 433
Query: 346 LANPSMKRAMLFPQPAQP 363
+KR + P P
Sbjct: 434 -ETKKVKRYFEYEDPNGP 450
>gi|224062946|ref|XP_002300942.1| predicted protein [Populus trichocarpa]
gi|222842668|gb|EEE80215.1| predicted protein [Populus trichocarpa]
Length = 460
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 88/355 (24%), Positives = 137/355 (38%), Gaps = 32/355 (9%)
Query: 40 LKVYVYELPSKYNKKLLLKD-------PRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADW 92
LK+YVYE K LL+ CL + ++ +H LL S RT EEAD
Sbjct: 87 LKIYVYEEDEIDGLKELLRGRDGKISADACLKGQWGTQVKIHGLLLESRFRTRKKEEADL 146
Query: 93 FYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
F+ P Y C GL + + + + S PY+ R+ G DH FV P GA
Sbjct: 147 FFVPAYVKCVRMMGGL-----NDKEINHTYVKVLSQMPYFRRSGGRDHIFVFPSGAGAHL 201
Query: 153 HYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIP----- 207
I R I+ T +++ N I P M +I
Sbjct: 202 FRSWATYINRSII----LTTEADRTDKKDTSAFNTWKDIIIPGNVEDGMTKRRIAMVQPL 257
Query: 208 PDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP---TTYYE 264
P + R + G + G A + + +P S Y++
Sbjct: 258 PLSKRKYLANYLG-----RAQGKVGRLKLIELAKQYPDKLESPELKFSGPGKFGRMEYFQ 312
Query: 265 DMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDV 324
++ A FCL P G + W+ R E+ C+PVI++D PF + I + +I + +
Sbjct: 313 HLRNAKFCLAPRGESSWTLRFYESFFVECVPVILSDQAEFPFQNVIDYTQISIKWPSTRI 372
Query: 325 P-KLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKL 378
+L L SIP E I +Q + A ++ ++ + A I+ L RK+
Sbjct: 373 GLELLEYLESIPDENI--EQMIAAGRQIRCLWVYAPEFESCSAMQGIMWELQRKV 425
>gi|302845052|ref|XP_002954065.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300260564|gb|EFJ44782.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 600
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 89/338 (26%), Positives = 137/338 (40%), Gaps = 53/338 (15%)
Query: 40 LKVYVYELPSKYNKKL-LLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
+KVY+YELPS ++ R + MF L+SS +RTL+ +AD+FY PI
Sbjct: 163 VKVYIYELPSNMTSWYPFMRMDRPVHLMFWQR------LMSSGMRTLDGNKADYFYIPIN 216
Query: 99 PTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
T TG + + + I +P+W++ G H + D G
Sbjct: 217 -----TRTG----SLAREELEWTLPYIKKTYPWWSKDNGNRHLIIHTGDMGI---NDFPL 264
Query: 159 AIERGILPLLQRATLVQTFGQRNH-------------------VCLNEGSITIPPYAPPQ 199
A R + L T + +G + V + + P P
Sbjct: 265 ATRRELNESLSNITWLTHWGLHEYHPIAKWYPAHRPGKDIVIPVMIMTQGFHLSPMNP-- 322
Query: 200 KMQAHQIPPDTPR--SIFVYFRGLFYDVNN--DPEGGY-------YARGARAAVWENFKN 248
+M+A PR + ++F G + DP+ G Y+ G R AV+ +N
Sbjct: 323 RMEAEIKAQGAPRLRNGTLFFAGRICGDRDLPDPKTGKCGPGHEDYSFGVRQAVYLQHRN 382
Query: 249 NPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
F I +TY ED+ FCL P+G R + GC+PV+I D ++ PF
Sbjct: 383 VKGFRIVA-WTSTYLEDISSHKFCLAPVG-GGHGKRNILVAFMGCLPVLIGDHVLQPFEP 440
Query: 309 AIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLL 346
I W + V E D+P L IL ++P + KQ+ L
Sbjct: 441 EIDWSRFSISVPEADIPDLPRILANVPASEVASKQKRL 478
>gi|412986347|emb|CCO14773.1| predicted protein [Bathycoccus prasinos]
Length = 445
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 102/379 (26%), Positives = 154/379 (40%), Gaps = 37/379 (9%)
Query: 28 AGDVLEDDPVGKLKVYVYELPSKYNKKLLL--------KDPRCLTHMFAAEIFMHRFLLS 79
A + ++DD +K+YVY KKLL K C M+ +++ + FLL
Sbjct: 73 AEESIKDDISPPIKIYVYNDAD--VKKLLFPGKETQAYKSGVCGMKMYGSQVHIADFLLK 130
Query: 80 SP-VRTLNPEEADWFYTPIYPTC--DLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTE 136
S +RT NP +AD+F+ P +P C D P G L + + + + PY ++
Sbjct: 131 SKELRTENPSDADFFFLPGWPKCMLDAPPNGAGL---TDDELAKRLNGVIEKLPYIKKSG 187
Query: 137 GADHFFVVPHDFGACFHYQEEKAIERGILPLLQR------ATLVQTFGQRNHVCLNEGSI 190
G DH FV P G + I I + TL F V L
Sbjct: 188 GRDHVFVWPSGRGPTLYKNWRCKIPNSIFLTPEGFYTDPYRTLAPYFDPWKDVVL----- 242
Query: 191 TIPPYAPPQKMQAHQIPPDTP-RSIFVYFRGLFYDVNNDPEGGYYARGA-RAAVWENFKN 248
P + +K + T R+ F G D + + R + + K
Sbjct: 243 --PGFMDGRKDSYLETNKRTSKRTKLASFAGTVPDGQALKGDEKHVKAHPRERLLKLSKK 300
Query: 249 NP--LFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 306
P L IS P Y E + + FC+ P G +PW+ R E GC+PVII+D + LPF
Sbjct: 301 YPDDLLAISGRTPK-YAEILGDSKFCIVPRGLSPWTLRTYETFFAGCVPVIISDSVRLPF 359
Query: 307 ADAIPWEEIGMFVAEEDVPK-LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
+ + W I + E + + L T L SIP E I + R ++ + A +
Sbjct: 360 QEFLDWSLISIKWPEAKIDESLLTYLKSIPDEEIEKIVR--RGEQVRCVFAYQADATKCN 417
Query: 366 AFHQILNGLARKLPHDKST 384
AF I+ L+ K + ST
Sbjct: 418 AFSAIMWALSLKDRNVAST 436
>gi|359476146|ref|XP_002285237.2| PREDICTED: probable glycosyltransferase At5g25310-like [Vitis
vinifera]
gi|296082059|emb|CBI21064.3| unnamed protein product [Vitis vinifera]
Length = 478
Score = 87.8 bits (216), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 42/113 (37%), Positives = 66/113 (58%), Gaps = 2/113 (1%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
YY M ++ +CLCP G SPR+VEA+ C+PVI++D VLPF+D + WE + V
Sbjct: 358 YYSLMLQSKYCLCPSGHEVASPRIVEAIYSECVPVILSDHYVLPFSDVLRWEAFSVKVEA 417
Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
++P+L +L +I E R + + +++R QPA+ D FH IL+ +
Sbjct: 418 SEIPRLKEVLQAISEEKYTRLKEGVR--AVRRHFELNQPAKRFDVFHMILHSV 468
>gi|297841429|ref|XP_002888596.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297334437|gb|EFH64855.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 429
Score = 87.8 bits (216), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 91/379 (24%), Positives = 151/379 (39%), Gaps = 53/379 (13%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRC-------------------LTHMFAAEIFMHRFLLS- 79
L+V++Y+LP K+N ++ DP + + E ++ LL+
Sbjct: 52 LRVFMYDLPRKFNVAMM--DPHSSDVEPLTGKNLPSWPQTSGIKRQHSVEYWLMASLLNG 109
Query: 80 -----SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLP---FKSPRMMRSAIQLISSNWPY 131
+R +P+ AD FY P + + G + + R ++ + Y
Sbjct: 110 GDDDNEAIRVFDPDLADAFYVPFFSSLSFNTHGKNMTDPDTEFDRQLQVELMEFLEGSEY 169
Query: 132 WNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSIT 191
WNR+ G DH V+P F + ++ + IL +V FG+
Sbjct: 170 WNRSGGKDH--VIPMTHPNAFRFLRQQ-VNASIL-------IVVDFGRYAKDMARLSKDV 219
Query: 192 IPPYAPPQKMQAHQ-----IPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENF 246
+ PY + + P R+ +YFRG + EG R + +
Sbjct: 220 VSPYVHVVESLNEEDDDGLTDPFEARTTLLYFRG---NTVRKDEGKIRLRLEKLLAGNSD 276
Query: 247 KNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 306
+ +T + E M+ + FCL P G P S RL +A+V CIPVII+D I LPF
Sbjct: 277 VHFEKSVATTQNIKVSTEGMRSSKFCLHPAGDTPSSCRLFDAIVSHCIPVIISDKIELPF 336
Query: 307 ADAIPWEEIGMFVAEEDVPKLDTILTSI---PPEVILRKQRLLANPSMKRAMLFPQPAQP 363
D I + E +F + ++ + IL + P E L + L N S F P +
Sbjct: 337 EDEIDYSEFSLFFSIKESLEPGYILNKLRQFPKEKWLEMWKRLKNVS--HHFEFQYPPKR 394
Query: 364 GDAFHQILNGLARKLPHDK 382
DA + + + K+P+ K
Sbjct: 395 EDAVNMLWRQVKHKIPNVK 413
>gi|359473867|ref|XP_002268876.2| PREDICTED: probable glycosyltransferase At5g03795-like [Vitis
vinifera]
gi|296085575|emb|CBI29307.3| unnamed protein product [Vitis vinifera]
Length = 449
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 81/342 (23%), Positives = 147/342 (42%), Gaps = 33/342 (9%)
Query: 37 VGKLKVYVYELPSKYNK-----KLLLKDPRCLTHMFAAEIFMHRF---LLSSPVRTLNPE 88
+ KLK++VY S ++ L +P A H F LL S + T +P+
Sbjct: 114 LHKLKIFVYPDASMNQSSSPFARVFLPNPNPFHPKLANYFSEHMFKVALLRSSLLTPHPQ 173
Query: 89 EADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDF 148
+A +F+ P + + IS + +WN + G DHF++ H
Sbjct: 174 DAHFFFLPFSVNTLRNDPRVHSEASISDFVTQYTTRISWEYKFWNASRGTDHFYICCHSV 233
Query: 149 GACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY--APPQKMQAHQI 206
G +E + + + T ++ QR + ++ + +P PP+K+ +
Sbjct: 234 G-----REAASKHHDLHNNAIQVTCSSSYFQR--LYISHKDVGLPQVWPRPPEKL----L 282
Query: 207 PPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM 266
P R V+F G V N + R AVW N + LF S P Y E +
Sbjct: 283 NPPELRHKLVFFAG---RVQNS-----HIRQELMAVWGNDTDIDLFSGSP--PFPYEEGL 332
Query: 267 QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK 326
+++ +CL G+ + R+ +A+ +GCIPVI+++ LPF++ + W + + ++ + +
Sbjct: 333 RKSKYCLHVKGYEVNTARVCDAIHYGCIPVIVSNYYDLPFSNVLDWSKFSVIISHKSIAT 392
Query: 327 LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFH 368
L IL SI + L + L ++R + + D+FH
Sbjct: 393 LKKILLSISKQKYLSMYQNLC--LVRRHFAWHTTPRGYDSFH 432
>gi|125572389|gb|EAZ13904.1| hypothetical protein OsJ_03829 [Oryza sativa Japonica Group]
Length = 504
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 89/346 (25%), Positives = 150/346 (43%), Gaps = 38/346 (10%)
Query: 40 LKVYVYELPSKYNKKLL--LKDP-RCLTHMFAAEIFMHRFLLSSPVR------TLNPEEA 90
L+VYVYE+PS++ LL +D R +++ + +HR + + + PE
Sbjct: 113 LRVYVYEMPSRFTYDLLRLFRDSYRETSNLTSNGSPVHRLVEQHSIDYWLWADLIAPESQ 172
Query: 91 DWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLI---SSNWPYWNRTEGADHFFVVPHD 147
I + D + +S R++++ I+ ++ P W R+EG DH V H
Sbjct: 173 RLLKNVIRHSIDYWLWADLIAPESQRLLKNVIREALKWVTDQPAWQRSEGRDHVIPVHHP 232
Query: 148 FGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIP 207
+ ++ + + + LL + + V L + I PY P + +
Sbjct: 233 WS----FKSVRRFVKKAIWLLPDMDSTGNWYKPGQVYLEKD--VILPYVPNVDLCDSKCV 286
Query: 208 PDTP--RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDI-----STDHPT 260
+T RS ++FRG GG R+ + K+ I D
Sbjct: 287 SETQSRRSTLLFFRGRL----RRNAGGKI----RSKLVTELKDAEGIIIEEGTAGADGKA 338
Query: 261 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
M++++FCL P G P S RL +A+V GCIPVI++D++ LPF + + +I +FV+
Sbjct: 339 AAQNGMRKSLFCLNPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEGILDYRKIALFVS 398
Query: 321 EEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP 363
D + L L SI + I + Q L S R L+ PA+P
Sbjct: 399 SNDAVQPGWLVKYLRSIDAKRIRQMQSNLLKYS--RHFLYSSPARP 442
>gi|449455387|ref|XP_004145434.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
[Cucumis sativus]
Length = 464
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 101/412 (24%), Positives = 160/412 (38%), Gaps = 70/412 (16%)
Query: 12 FVSADNTPKIERISGSAGDVLEDD--PVGKLKVYVYELPSKYNKKLL------------- 56
F SA++ + S D+L + +K+Y+Y++P+++ ++
Sbjct: 43 FSSANSVIVSDEFSSQDTDLLNSSGKSLSPVKIYLYDVPTRFTYGVIENHGIARGGKPVP 102
Query: 57 -LKDPRCLTHMFAAEIFMHRFLL--------SSPVRTLNPEEADWFYTPIYPTCDLTPTG 107
+ D + H AE F+ LL S+ VR +PE AD FY P + + L
Sbjct: 103 DVTDLKYPGHQHMAEWFLFTDLLRPESERIGSAVVRVFDPEVADLFYVPFFSSLSLIVNP 162
Query: 108 LPLPFKSPRMMRSAI------QLISSNW----PYWNRTEGADHFFVVPHDFGACFHYQEE 157
+ S + R + Q W YW R+ G DH + Q+
Sbjct: 163 IRPATGSDQQQRKLVYSDEETQDAFMEWLEKQEYWKRSNGRDHVIIA----------QDP 212
Query: 158 KAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVY 217
A+ R I + LV FG+ + I PY+ I + R ++
Sbjct: 213 NALYRLIDRVKNSILLVSDFGRLRADQASLVKDVIVPYSHRINTYTGDIGVEN-RKTLLF 271
Query: 218 FRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYED-------MQRAI 270
F G Y EGG R ++ + D+ H T E M +
Sbjct: 272 FMGNRYR----KEGG----KIRDMLFNILEQEQ--DVIIKHGTQSRESRRAATHGMHTSK 321
Query: 271 FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK---L 327
FCL P G P + RL ++VV C+PVI++D I LPF D I + +I +F K L
Sbjct: 322 FCLNPAGDTPSACRLFDSVVSLCVPVIVSDSIELPFEDVIDYSKIAVFFDSVSAVKPEFL 381
Query: 328 DTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
+ L I E IL QR MK+ + + ++I +++KLP
Sbjct: 382 ISKLRRISEERILDYQR-----EMKKIKRYFEYTDSNGTVNEIWRQVSQKLP 428
>gi|168012366|ref|XP_001758873.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162690010|gb|EDQ76379.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 426
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 87/356 (24%), Positives = 144/356 (40%), Gaps = 49/356 (13%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCL---THMFAAEIFMHRF-------------------- 76
L +YVYE+P+K+ LL L ++ + +HR
Sbjct: 38 LNIYVYEMPAKFTTDLLWLFHNSLDQTVNLTSNGSPVHRLIQQHSVDFWLFSDLMTREDK 97
Query: 77 -LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRT 135
LL + R + E+AD +Y P + T P L +S + R A++ I+ W R+
Sbjct: 98 RLLKTFRRVSHQEQADVYYVPFFTTI---PFFLLSRVQSRTLYREAVKWITRQ-AAWQRS 153
Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
G DH V H + H + + + LL + + V L E + +P
Sbjct: 154 GGRDHVLAVHHPWSMKSH----RRFLKSAIWLLSDLDSSGNWYKEGEVSL-EKDVIMPYV 208
Query: 196 APPQKMQAHQIPPDTP-RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFD- 253
A + + P R ++F+G + R AAV + K +F
Sbjct: 209 ANVDACDDNCLATSKPSRKTLLFFQGRIVRGSAGK-----VRSRLAAVLRDEKERIVFQE 263
Query: 254 --ISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
+ T M+ ++FCL P G P S RL +A+V GCIPV+++D++ LPF +
Sbjct: 264 GFSGAEGKATAQHGMRSSVFCLSPAGDTPSSARLFDAIVSGCIPVVVSDELELPFEGILD 323
Query: 312 WEEIGMFV----AEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP 363
+ ++ +FV A + + + P +V +QRL R + PAQP
Sbjct: 324 YRQVALFVPAARAAQKGWLVAHLRNKTPQDVAAMQQRL---AQYGRHFRYGTPAQP 376
>gi|51535578|dbj|BAD37522.1| pectin-glucuronyltransferase-like [Oryza sativa Japonica Group]
Length = 342
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 93/348 (26%), Positives = 144/348 (41%), Gaps = 65/348 (18%)
Query: 58 KDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPI--------YPTCDLTPTGLP 109
+ PR LT +A+E + + + S RT + E+A F+ PI P+ L T
Sbjct: 20 QTPRKLTGKYASEGYFFQNIRESRFRTDDLEKAHLFFVPISPHKMRGKVPSSLLLVTYAW 79
Query: 110 LPFKSPRMMRSAIQL--------ISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIE 161
L RS + L SS +W GADHFFV HD G +A E
Sbjct: 80 LILHIRSYDRSILFLDLYWWCPLCSSFRGHWG--VGADHFFVTCHDVGV-------RAFE 130
Query: 162 RGILPLLQRATLVQTFGQRNHVC---LNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYF 218
LP + + ++ VC N G I A PQ +Q +P
Sbjct: 131 G--LPFIIKNSI-------RVVCSPSYNAGYIPHKDVALPQILQPFALPAGGN------- 174
Query: 219 RGLFYDVNNDPEGGYYA-------RGARAAVWEN-----FKNNPLFDISTDHPTTYYEDM 266
D+ N G++A R A +WEN NN + + Y +
Sbjct: 175 -----DIENRTILGFWAGHRNSKIRVILARIWENDTELAISNNRINRAIGN--LVYQKHF 227
Query: 267 QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK 326
R FC+CP G S R+ +++ +GC+PVI++D L F+ + W + + + E DV +
Sbjct: 228 FRTKFCVCPGGSQVNSARISDSIHYGCMPVILSDYYDLRFSGILNWRKFAVVLKESDVYE 287
Query: 327 LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
L +IL S+ + + + L +++ + P P DAFH I+ L
Sbjct: 288 LKSILKSLSQKEFVSLHKSLVQ--VQKHFEWHSPPVPYDAFHMIMYEL 333
>gi|225457795|ref|XP_002265632.1| PREDICTED: probable glucuronosyltransferase Os03g0107900 [Vitis
vinifera]
gi|302142755|emb|CBI19958.3| unnamed protein product [Vitis vinifera]
Length = 459
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 92/387 (23%), Positives = 153/387 (39%), Gaps = 39/387 (10%)
Query: 40 LKVYVYELPSKYNKKLLL--KDPR-----CLTHMFAAEIFMHRFLLSSPVRTLNPEEADW 92
LK+YVYE K LL +D C+T + ++ +HR LL S RT EEAD
Sbjct: 86 LKIYVYEEDEIDGLKSLLYGRDGSIPTEVCVTGQWGTQVKIHRLLLKSRFRTRRKEEADL 145
Query: 93 FYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
F+ P Y C GL + + + + S PY+ + G +H FV P G
Sbjct: 146 FFVPTYIKCVRMKGGL-----NDKEIDQMYVKVLSQMPYFRLSGGRNHIFVFPSGAGPHL 200
Query: 153 HYQEEKAIERGIL--PLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAH-----Q 205
+ R I+ P R +++ N I P +M + Q
Sbjct: 201 FKSWATYLNRSIILTPEGDRTD------KKDTSAFNTWKDIIIPGNVADEMTTNGATFVQ 254
Query: 206 IPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP---TTY 262
P + R F G + G A + + +P S Y
Sbjct: 255 PLPLSKRKFLANFLG-----RAQRKLGRLQLIELAKQYPDKLESPELQFSGPDKLGRIEY 309
Query: 263 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGM-FVAE 321
+ ++ A FC P G + W+ R E+ C+PVI++D + LPF + I + ++ + + +
Sbjct: 310 FHHLRNAKFCFAPRGESSWTLRFYESFFVECVPVILSDQVELPFQNVIDYTQVSIKWPSS 369
Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP-- 379
+ P+L L SIP +VI ++ + ++ ++ ++P A I+ L RK+
Sbjct: 370 QIGPQLLEYLESIPDKVI--EEMISRGREVRCWWVYASESEPCSAMRGIMWELQRKVRKF 427
Query: 380 -HDKSTYLKPGGKFLNWTSGPVGDLKP 405
T+ G +N + KP
Sbjct: 428 HQSTETFWLHNGSIINRNLEQFSNWKP 454
>gi|357157638|ref|XP_003577864.1| PREDICTED: probable glycosyltransferase At3g07620-like
[Brachypodium distachyon]
Length = 485
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 75/309 (24%), Positives = 123/309 (39%), Gaps = 27/309 (8%)
Query: 80 SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLP---FKSPRMMRSAIQLISSNWPYWNRTE 136
+ R + ++AD + P + + P+P R ++ + + P W R+
Sbjct: 175 AAARVADAQDADVVFVPFFASLSYNRHSKPVPPEKVSRDRALQEKLVRYLAARPEWKRSG 234
Query: 137 GADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYA 196
GADH +V H + H + L ++ FG+ + + + PY
Sbjct: 235 GADHV-IVAHHPNSLLHARSA---------LFPAVFVLSDFGRYHPRVASLEKDLVAPYR 284
Query: 197 PPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNP----LF 252
K + R +YFRG Y EGG R ++ K+ F
Sbjct: 285 HMAKTFVNDTAGFDDRPTLLYFRGAIYR----KEGG----NIRQELYNMLKDEKDVFFSF 336
Query: 253 DISTDHPTTY-YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
DH + + M + FCL G P S RL +A+V C+PVII+DDI LP+ D +
Sbjct: 337 GSVQDHGVSKASQGMHSSKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDDIELPYEDVLD 396
Query: 312 WEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLAN-PSMKRAMLFPQPAQPGDAFHQI 370
+ + +FV D K ++ I R R+ + + + P++ DA I
Sbjct: 397 YSKFSIFVRSSDAVKRGYLMKLIRGVTKHRWTRMWKRLKEVDKHFEYQFPSRKDDAVQMI 456
Query: 371 LNGLARKLP 379
LARK+P
Sbjct: 457 WQALARKVP 465
>gi|242055419|ref|XP_002456855.1| hypothetical protein SORBIDRAFT_03g044100 [Sorghum bicolor]
gi|241928830|gb|EES01975.1| hypothetical protein SORBIDRAFT_03g044100 [Sorghum bicolor]
Length = 432
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 93/371 (25%), Positives = 149/371 (40%), Gaps = 49/371 (13%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRC-----------LTHMFAAEIFMHRFLLSSP------- 81
L+V++Y+LP++++ ++ D + + E +M L
Sbjct: 61 LRVFMYDLPARFHVAMMGADDGAGFPAWPPSAGGIRRQHSVEYWMMASLQDGAAGPDGGR 120
Query: 82 --VRTLNPEEADWFYTPIYPTCDLTPTGLPLP---FKSPRMMRSAIQLISSNWPYWNRTE 136
VR +P+ AD F+ P + + G + ++ R+++ I I YW R+
Sbjct: 121 EAVRVRDPDAADAFFVPFFSSLSFNVHGRNMTDPDTEADRLLQVEIVDILWKSKYWQRSA 180
Query: 137 GADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYA 196
G DH V+P F + + IL +V FG+ + + PY
Sbjct: 181 GRDH--VIPMHHPNAFRFLRA-MVNASIL-------IVSDFGRYTKELASLRKDVVAPYV 230
Query: 197 PPQKMQAHQIPPD--TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDI 254
PPD R ++FRG V D EG A+ + K F+
Sbjct: 231 HVVDSFLDDDPPDPFEARHTLLFFRG--RTVRKD-EGKIRAKLGKVL---KGKEGVRFED 284
Query: 255 ST---DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
S D E M+ + FCL P G P S RL +A+V C+PVI++ I LPF D I
Sbjct: 285 SIATGDGIKISTEGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSSRIELPFEDEID 344
Query: 312 WEEIGMFVAEEDVPKLDTILT---SIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFH 368
+ E +F + E+ + D +L IP + + L N S F P + GDA +
Sbjct: 345 YSEFSLFFSVEEALRPDYLLNQLRQIPKKKWVDMWSKLKNVSHHYE--FQYPPRKGDAVN 402
Query: 369 QILNGLARKLP 379
I + K+P
Sbjct: 403 MIWRQVRHKIP 413
>gi|168043245|ref|XP_001774096.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674642|gb|EDQ61148.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 351
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 86/353 (24%), Positives = 152/353 (43%), Gaps = 37/353 (10%)
Query: 34 DDPVGKLKVYVYELPSKYN--KKLLLKDPRCLTHMFAAEI------FMHRFLLSSPVRTL 85
+D L++Y+Y YN + +P + E F + + S T
Sbjct: 15 NDMAKNLRIYLYPASQNYNFTQYEYGMNPSEMVSELGVETSSTTDTFFNLLVESKRFVTD 74
Query: 86 NPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVP 145
+ + A ++ PI G P K +R +Q + + + W+ + GADHF+
Sbjct: 75 DADGAHLYFLPISIDRVWAAVG---PAKVGEHLRHYLQWLRNTYKLWDLSLGADHFYFSS 131
Query: 146 HDFGACFHYQEEKAIERGILPLLQRATLVQTFG-QRNHVCLNEGSITIPPYAPPQKMQAH 204
H + H R L L + A V + +RN I++P Y +
Sbjct: 132 HAYDPINH--------RNNLELTKNAIQVASSPLRRNQNFFPHKDISLPSYKSQHIAEVQ 183
Query: 205 QIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDI-STDHPTTYY 263
+ + R V+ D+ DP A+V + + ++ F + S D P+ +
Sbjct: 184 NLVGASQRPKLVFVSSPPEDI--DP--------IVASVIQKWTSDSDFHVESADQPSPPF 233
Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV--LPFADAIPWEEIGMFVAE 321
E + + FC+ A + +V+++ GC+PV+IAD I+ LPF D + W+E + +
Sbjct: 234 EKLLSSRFCVSVSPQAMLN--VVDSLRLGCVPVLIADSIIYDLPFQDVLNWKEFSVVLGV 291
Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
++ P L T+L+SI + RK + L + + K M + P +P DAFH L+ L
Sbjct: 292 KESPNLKTLLSSISTDE-YRKMQYLGHQASKH-MEWNDPPKPWDAFHMTLHEL 342
>gi|302847711|ref|XP_002955389.1| hypothetical protein VOLCADRAFT_96311 [Volvox carteri f.
nagariensis]
gi|300259231|gb|EFJ43460.1| hypothetical protein VOLCADRAFT_96311 [Volvox carteri f.
nagariensis]
Length = 1222
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 94/387 (24%), Positives = 149/387 (38%), Gaps = 80/387 (20%)
Query: 41 KVYVYELP---SKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPI 97
+VYVY+LP + + + L P +F R + ++ VR + + ADW++ P+
Sbjct: 576 RVYVYDLPPSMTTWRNDMRLDRP-------TLRLFQER-ITATGVRVGDGDTADWYFIPV 627
Query: 98 YPT--CDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQ 155
CD T + +A+ I P+WNRT+G HF + D G
Sbjct: 628 VLRHFCDAT------------TLSAAVSYIREMHPWWNRTQGHRHFVIATGDMG------ 669
Query: 156 EEKAIERGILPLLQRATLVQTFG----------QRNHVCLNEGSITIPPYAPPQKMQAHQ 205
ERG L T V +G + +H N I +P + K+
Sbjct: 670 -RSESERG--HLTANVTFVSYWGLHAPKLSSGWRASH--RNATDIVLPVFLGSPKLSRMG 724
Query: 206 I------------PPDTPRS----IFVYFRGLFYDVNN-----------DPEGGYYARGA 238
I P R IF + + D + P Y+ G
Sbjct: 725 IFTSRLHPKFATKAPHELRERNGPIFFFAGRICGDHSKPQVDGVWPNCKSPHNMGYSGGT 784
Query: 239 RAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 298
R + + N + I Y + + + FC P G R ++AV GC+PV+I
Sbjct: 785 RQKIHFHHWNRTGYFIQLG-DRHYAKHLLTSKFCFGPTG-GGHGQRQMQAVQAGCVPVVI 842
Query: 299 ADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFP 358
+DD++ F + W G+ +AE D+P++ +L +I PE RK+ LL + A
Sbjct: 843 SDDVLEAFEPFLDWNTFGVRLAEADIPRMHEVLEAISPEEYARKEVLLRCAAQHMAFSTV 902
Query: 359 QPAQPG-----DAFHQILNGLARKLPH 380
+ G DAF +L L K H
Sbjct: 903 TGSYIGESGRYDAFETLLEILRAKAAH 929
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 60/124 (48%), Gaps = 6/124 (4%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + + + +C P G R ++AV GC+PV+I+DD++ F + W G+ +AE
Sbjct: 255 YAKHLLTSKYCFGPTG-GGHGQRQMQAVQAGCVPVVISDDVLEAFEPFLDWNTFGVRLAE 313
Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPG-----DAFHQILNGLAR 376
D+P++ +L +I PE K+ LL + A + G DAF +L L
Sbjct: 314 ADIPRMHEVLEAISPEEYAHKEVLLRCAAQHMAFSTVTGSYIGESGRYDAFETLLEILRA 373
Query: 377 KLPH 380
K H
Sbjct: 374 KAAH 377
Score = 46.6 bits (109), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 32/112 (28%), Positives = 49/112 (43%), Gaps = 21/112 (18%)
Query: 41 KVYVYELPSKYNKKLL-LKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPI-- 97
+VYVY+LP L ++ R T +F + ++ VR + + ADW++ P+
Sbjct: 1037 RVYVYDLPPSMTTWLNDMRLDRPTTRLFQERV------TATGVRVGDGDTADWYFIPVTL 1090
Query: 98 YPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFG 149
TCD T + A+ I P+WNRT G HF + D G
Sbjct: 1091 RHTCDAT------------TLSGAMSYIREMHPWWNRTHGHRHFVIATGDGG 1130
>gi|224136688|ref|XP_002322391.1| predicted protein [Populus trichocarpa]
gi|222869387|gb|EEF06518.1| predicted protein [Populus trichocarpa]
Length = 457
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 98/381 (25%), Positives = 166/381 (43%), Gaps = 70/381 (18%)
Query: 39 KLKVYVYELPSKYNKKL-----LLKDPRCLT------HMFAAEIFMHRFL-------LSS 80
K++V++Y+LP K+ + L + L+ H E +M+ L + S
Sbjct: 71 KVRVFMYDLPKKFTTGIIENHALARGSSDLSKVSYPGHQHMGEWYMYLDLSRPDLDRVGS 130
Query: 81 PVRTLN-PEEADWFYTPIYPTCDL--------TPTGLPLPFKSPRMMRSAIQLISSNWPY 131
PV +N PEEAD FY P++ + L T G + +M ++ + Y
Sbjct: 131 PVVKVNDPEEADLFYVPVFSSLSLIVNPARAGTVPGSDPVYSDEKMQEELVEWLEEQ-EY 189
Query: 132 WNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSI- 190
W R G DH V D A + + ++R +L L+ FG+ V ++GS+
Sbjct: 190 WRRNNGRDHV-VFAGDPNALY-----RVLDRVKNVVL----LLSDFGR---VRSDQGSLI 236
Query: 191 --TIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN 248
I PY+ + I + R ++F G Y +GG R +++ +
Sbjct: 237 KDVIVPYSHRINVYNGDIGVE-ERKTLLFFMGNRYR----KDGG----KIRDLLFQMLEK 287
Query: 249 NPLFDISTDHPTTYYED-------MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 301
D+ H T E+ M + FCL P G P + RL +++V C+P+I++D
Sbjct: 288 EE--DVVIRHGTQSRENRRTATRGMHTSKFCLNPAGDTPSACRLFDSIVSLCVPLIVSDS 345
Query: 302 IVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFP 358
I LPF D I + +I +FV E K L +L ++ E IL Q+ + +KR ++
Sbjct: 346 IELPFEDVIDYRKIAIFVDTESSLKPGYLVRMLRAVSTEKILEYQKQMRE--VKRYFVY- 402
Query: 359 QPAQPGDAFHQILNGLARKLP 379
+ ++I +A+KLP
Sbjct: 403 --SDSNGTVNEIWREVAQKLP 421
>gi|302838103|ref|XP_002950610.1| hypothetical protein VOLCADRAFT_104742 [Volvox carteri f.
nagariensis]
gi|300264159|gb|EFJ48356.1| hypothetical protein VOLCADRAFT_104742 [Volvox carteri f.
nagariensis]
Length = 1119
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 97/374 (25%), Positives = 144/374 (38%), Gaps = 76/374 (20%)
Query: 40 LKVYVYELPSKYNKKLLLKDP-RCLTHMFAAEI-FMHRFLLSSPVRTLNPEEADWFYTPI 97
+K+YVY+LPS + +D + M+ AE+ F R L VRT NP EA FY P
Sbjct: 298 IKIYVYDLPSIVLHRREFQDQWSYIDLMYNAELEFTERLLGDWSVRTENPWEAALFYVPT 357
Query: 98 YPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEE 157
+ G P + + + + P++N T G +H +D GAC +
Sbjct: 358 FTYWFTGNVGHPT-----FIFQHVVHHLQHLAPFFNLTGGRNHVMWATNDRGACKLQMSQ 412
Query: 158 -------KAIERGILP---LLQRATL------VQTFGQRNHV------------------ 183
K + G P LQ+A + + T G +
Sbjct: 413 LELQHPIKMVHFGQAPRHAYLQKAHMMGSGHGIDTLGGVASLLGALPHPGHRFEEFPEFT 472
Query: 184 ---CLNEGSITIPP---YAPPQKMQAHQIPPDTPRSI----FVYFRGLFYDVNNDPE--- 230
L E I + P P + + P R + FV + +D E
Sbjct: 473 AADILEEHEICLRPEKDVVTPNVLHQGWVEPGAYRQVWDVKFVDGERVVTRQQDDLEKRT 532
Query: 231 -----GGY------YARGARAAVWENFKNNPLFD----------ISTDHPTTYYEDMQRA 269
GGY Y++G R A+ F +D + TD + M R+
Sbjct: 533 YTLFFGGYTKSIMAYSQGVRQALRSMFGPGGKYDPNGPNARSDFLVTDPRHDAIDLMARS 592
Query: 270 IFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDT 329
FCL P+G A W RL EA+V GC+PV+I D + P D +P+EE + + DV L
Sbjct: 593 KFCLAPMG-AGWGIRLAEAMVRGCVPVVIQDHVYQPLWDVVPFEEFSLRFSRRDVADLVD 651
Query: 330 ILTSIPPEVILRKQ 343
L + E + R Q
Sbjct: 652 HLDDVTSEQLARLQ 665
>gi|297826983|ref|XP_002881374.1| hypothetical protein ARALYDRAFT_482474 [Arabidopsis lyrata subsp.
lyrata]
gi|297327213|gb|EFH57633.1| hypothetical protein ARALYDRAFT_482474 [Arabidopsis lyrata subsp.
lyrata]
Length = 446
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 98/397 (24%), Positives = 164/397 (41%), Gaps = 78/397 (19%)
Query: 31 VLEDDPVG------KLKVYVYELPSKYNKKLL---------LKDP--RCLTHMFAAEIFM 73
+ EDD G K++VY+Y LP ++ L+ +K P T + M
Sbjct: 44 IREDDDEGRAPIQPKVRVYMYNLPKRFTYGLIDQHSIARGGIKKPVDDVTTLKYPGHQHM 103
Query: 74 HRFLLSSP-------------VRTLNPEEADWFYTPIYPTCDL-TPTGLPLP----FKSP 115
H + L S VR L+P++AD FY P++ + L G P+ +
Sbjct: 104 HEWYLFSDLNRPEVDRSGSPIVRVLDPDDADLFYVPVFSSLSLIVNAGRPVEPGSGYSDE 163
Query: 116 RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ 175
+M ++ + +W R G DH V+P + A+ R + + LV
Sbjct: 164 KMQEGLMEWLEGQ-EWWRRNGGRDH--VIPAG--------DPNALYRILDRVKNSVLLVA 212
Query: 176 TFGQRNHVCLNEGSIT---IPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGG 232
FG+ H ++GS + PY+ + +I R+ ++F G Y +GG
Sbjct: 213 DFGRLRH---DQGSFVKDVVIPYSHRVNLFNGEIGV-QDRNTLLFFMGNRYR----KDGG 264
Query: 233 YYARGARAAVWENFKNNPLFDISTDHPTTYYED-------MQRAIFCLCPLGWAPWSPRL 285
R +++ + D++ H T E+ M + FCL P G P + RL
Sbjct: 265 ----KVRDLLFQVLEKED--DVTIKHGTQSRENRRAATKGMHTSKFCLNPAGDTPSACRL 318
Query: 286 VEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRK 342
+++V C+PVI++D I LPF D I + + +FV + L +L I + IL
Sbjct: 319 FDSIVSLCVPVIVSDSIELPFEDVIDYRKFSIFVEANAALQPGFLVQMLRKIKTKKILEY 378
Query: 343 QRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
QR M+ + P A +I +++KLP
Sbjct: 379 QR-----EMQPVRRYFDYDNPNGAVKEIWRQVSQKLP 410
>gi|356527767|ref|XP_003532479.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
At5g03795-like [Glycine max]
Length = 467
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 64/216 (29%), Positives = 96/216 (44%), Gaps = 25/216 (11%)
Query: 125 ISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVC 184
IS+ PY+NRT GADHF HD+ Y+ +E I L A + Q F V
Sbjct: 223 ISAKIPYFNRTGGADHFLAACHDWAP---YETRHHMEYCIKALCN-ADVTQGFKIGRDVS 278
Query: 185 LNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWE 244
L E + PQ+ + P P I + G + Y W+
Sbjct: 279 LPEAYVR--SVRDPQRDLGGKPPHQXP--ILAFHAGNMHG---------YLHPILLKHWK 325
Query: 245 NFKNNPLFDISTDHP------TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 298
+ +P I P Y M+ + +C+CP G+ SPR+VEA+ C+PVII
Sbjct: 326 D--KDPDMKIYGPMPHGVTSKMNYINHMKNSKYCICPKGYEVNSPRMVEAIFXECVPVII 383
Query: 299 ADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSI 334
+D+ V F + + W+ + +AE+D+P L IL S+
Sbjct: 384 SDNFVPHFFEVLNWDVFSIILAEKDIPNLKQILLSV 419
>gi|357168395|ref|XP_003581626.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10-like
[Brachypodium distachyon]
Length = 348
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 85/344 (24%), Positives = 141/344 (40%), Gaps = 61/344 (17%)
Query: 73 MHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYW 132
+H+ LL S RTL+ +EA F+ P Y C L + + + + S PY+
Sbjct: 42 IHQLLLKSRFRTLDKDEAHLFFVPSYVKCVRMTGAL-----TDKEINQTYVKVLSQMPYF 96
Query: 133 NRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITI 192
R+ G DH FV P GA + R L R+ ++ G R ++ I
Sbjct: 97 RRSGGRDHIFVFPSGAGA--------HLFRSWATFLNRSIILTPEGDRT----DKRGI-- 142
Query: 193 PPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN---N 249
S F ++ + N D G R + + + + +
Sbjct: 143 --------------------SAFNTWKDIIIPGNVDDSMGKAGRLKLVELAKQYPDKLES 182
Query: 250 PLFDISTDHP---TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 306
P +S Y++ ++ A FCL P G + W+ R E+ C+PVI++D++ LPF
Sbjct: 183 PELKLSGPDKLGRIDYFKHLRNAKFCLAPRGESSWTLRFYESFFVECVPVILSDEVELPF 242
Query: 307 ADAIPWEEIGM-FVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQP-AQPG 364
+ I + EI + + + + P+L L SIP E R + ++A R + P +P
Sbjct: 243 QNMIDYTEISIKWPSSKISPELFEYLESIPEE---RIEEMIARGREVRCLWVYAPDLEPC 299
Query: 365 DAFHQILNGLARKLP-----------HDKSTYLKPGGKFLNWTS 397
A IL L RK+ H++ST + +F WT+
Sbjct: 300 SAMAAILWELQRKVRRFHQSAQTFWLHNRSTVNRDLVEFHQWTT 343
>gi|224142185|ref|XP_002324439.1| predicted protein [Populus trichocarpa]
gi|222865873|gb|EEF03004.1| predicted protein [Populus trichocarpa]
Length = 208
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 43/114 (37%), Positives = 64/114 (56%), Gaps = 2/114 (1%)
Query: 261 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
Y MQR+ +C+CP G+ SPR+VEA+ + C+PVII+D+ V PF D + W + +A
Sbjct: 92 NYIHHMQRSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFDVLDWGAFSLILA 151
Query: 321 EEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
E+D+ L IL SIP E L+ Q L +R L+ D F+ L+ +
Sbjct: 152 EKDISNLKEILLSIPKEKYLQMQ--LGVRKAQRHFLWHASPMKYDLFYMTLHSI 203
>gi|255550902|ref|XP_002516499.1| catalytic, putative [Ricinus communis]
gi|223544319|gb|EEF45840.1| catalytic, putative [Ricinus communis]
Length = 601
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 41/114 (35%), Positives = 68/114 (59%), Gaps = 2/114 (1%)
Query: 261 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
Y + M+ + +C+C G+ SPR+VEA+++ C+PVII+D+ V PF + + WE +FV
Sbjct: 479 NYVQHMKSSRYCICARGYEVNSPRIVEAILYECVPVIISDNYVPPFFEVLNWESFAVFVL 538
Query: 321 EEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
E+D+P L IL SI PE R+ ++ + + +P + D FH IL+ +
Sbjct: 539 EKDIPNLKNILLSI-PEKRYREMQMRVKMVQQHFLWHARPVK-YDLFHMILHSV 590
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 40/132 (30%), Positives = 63/132 (47%), Gaps = 25/132 (18%)
Query: 30 DVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAE-IFMHRFLLSSPVRTLNPE 88
+++E++ LKVY+Y K +K +L P L ++A+E FM + S T
Sbjct: 319 EIMEEN----LKVYIY----KEGEKPILHQP-VLKGIYASEGWFMKQLEASKKFVTKKSR 369
Query: 89 EADWFYTPIYPTCDLTPTGLPLPFKSP---------RMMRSAIQLISSNWPYWNRTEGAD 139
+A FY P + L L P + +++ + LI + +P+WNRTEG D
Sbjct: 370 KAHLFYLP------FSSRNLELQLYVPDSHSRKNLIKYLKNYLDLIVAKYPFWNRTEGVD 423
Query: 140 HFFVVPHDFGAC 151
HF V HD+ A
Sbjct: 424 HFLVACHDWAAS 435
>gi|357122399|ref|XP_003562903.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
[Brachypodium distachyon]
Length = 498
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 102/387 (26%), Positives = 154/387 (39%), Gaps = 72/387 (18%)
Query: 40 LKVYVYELPSKYNKKLLL--------------------KDPRCLTHMFAAEIFMHRFLL- 78
+K+Y+Y+LPSK+ ++ ++ R H +AE ++ + L
Sbjct: 101 VKIYLYDLPSKFTYGVVRSYMSARAPPGSADAAATLPDEELRYPGHQHSAEWWLFKDLRR 160
Query: 79 ----SSPV-RTLNPEEADWFYTPIYPTCDLTPTGL-PL-----------PFKSPRMMRSA 121
PV R +P EAD FY P + + L + PL P S M+
Sbjct: 161 RGPRERPVARVDDPAEADLFYVPFFSSLSLVVNPIRPLAAANASVAAAEPAYSDEAMQDE 220
Query: 122 IQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRN 181
+ PYW R G DH F+ Q+ A+ R + + LV FG+
Sbjct: 221 LVEWLERQPYWRRHRGRDHVFIC----------QDPNALYRVVDRISNAVLLVSDFGRLR 270
Query: 182 HVCLNEGSITIPPYAPPQKMQAHQIPP-DTPRSIFVYFRGLFYDVNNDPEGGYYARGARA 240
+ I PY+ H+I P SI LF+ N + G R
Sbjct: 271 GDQASLVKDVILPYS-------HRINPFQGDVSIEARPALLFFMGNRYRKEGGKVRDTLF 323
Query: 241 AVWEN-----FKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIP 295
V EN K+ +S T + M + FCL P G P + RL +A+V C+P
Sbjct: 324 QVLENEGDVIIKHGTQSRVSRRMAT---QGMHSSKFCLHPAGDTPSACRLFDALVSLCVP 380
Query: 296 VIIADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMK 352
VII+D I LPF D I + I +FV + L ++L + E IL QR +K
Sbjct: 381 VIISDHIELPFEDVIDYSNISIFVDTSKAVQPGFLTSMLRRVSSERILEYQR-----EIK 435
Query: 353 RAMLFPQPAQPGDAFHQILNGLARKLP 379
R + + P +QI + ++ K P
Sbjct: 436 RVKHYFEYEDPNGPVNQIWHQVSMKAP 462
>gi|302783725|ref|XP_002973635.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
gi|300158673|gb|EFJ25295.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
Length = 452
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 90/335 (26%), Positives = 135/335 (40%), Gaps = 67/335 (20%)
Query: 40 LKVYVYELPSKYNKKLL---LKDPRC----------LTHMFAAEIFMHRFLLSSP----- 81
L+VYV +LP +++ LL + C L +AE ++ R LL SP
Sbjct: 56 LRVYVADLPREFHHGLLESYCRSQNCCSTGEYPTNPLLKQHSAEFWLLRDLLDSPSKKKE 115
Query: 82 --VRTLNPEEADWFYTPIYPTCDLT---PTGLPLPFKSPR-------MMRSAIQLISSNW 129
VR + AD + P + G F+ R ++L++S+
Sbjct: 116 NFVRVWDSRLADVVFVPFFAALSAQIQLRGGHRGEFRKKSSKNSDFDRQRRVVELVTSSL 175
Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG----QRNHVCL 185
W R+ G DH FV+ D A +H +E+ I + LV FG + L
Sbjct: 176 E-WRRSNGVDHVFVLA-DPMAMWHVREQ--ISTAMF-------LVVDFGGWYLEDAKNKL 224
Query: 186 NEGSITIPPYAPPQKM----QAHQIPP-----DTPRSIFVYFRGLFYDVNNDPEGGYYAR 236
N +I P K H +PP D R++ +YFRG + G
Sbjct: 225 NSSTIIQHSQVSPIKDVIIPHTHLLPPLKIADDQHRTVLLYFRG----ARHRHRSGL--- 277
Query: 237 GARAAVWENFKNNPLFDISTDHP-----TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVF 291
R +W+ N P + P M+ + FCL P G P S RL +A+
Sbjct: 278 -VREKLWKILDNEPEVLLEKGLPDDAGLAEATRGMRSSEFCLTPAGDTPSSCRLYDAIAS 336
Query: 292 GCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK 326
CIPVI++DDI LPF + +EE +FV+ D +
Sbjct: 337 LCIPVIVSDDIQLPFEGFVNYEEFCVFVSTRDATQ 371
>gi|302787839|ref|XP_002975689.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
gi|300156690|gb|EFJ23318.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
Length = 452
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 90/335 (26%), Positives = 135/335 (40%), Gaps = 67/335 (20%)
Query: 40 LKVYVYELPSKYNKKLL---LKDPRC----------LTHMFAAEIFMHRFLLSSP----- 81
L+VYV +LP +++ LL + C L +AE ++ R LL SP
Sbjct: 56 LRVYVADLPREFHHGLLESYCRSQNCCSTGEYPTNPLLKQHSAEFWLLRDLLDSPSKKKE 115
Query: 82 --VRTLNPEEADWFYTPIYPTCDLT---PTGLPLPFKSPR-------MMRSAIQLISSNW 129
VR + AD + P + G F+ R ++L++S+
Sbjct: 116 NFVRVWDSRLADVVFVPFFAALSAQIQLRGGHRGEFRKRSSKNSDFDRQRRVVELVTSSL 175
Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG----QRNHVCL 185
W R+ G DH FV+ D A +H +E+ I + LV FG + L
Sbjct: 176 E-WRRSNGVDHVFVLA-DPMAMWHVREQ--ISTAVF-------LVVDFGGWYLEDAKNKL 224
Query: 186 NEGSITIPPYAPPQKM----QAHQIPP-----DTPRSIFVYFRGLFYDVNNDPEGGYYAR 236
N +I P K H +PP D R++ +YFRG + G
Sbjct: 225 NSSTIIQHSQVSPIKDVIIPHTHLLPPLKIADDQHRTVLLYFRG----ARHRHRSGL--- 277
Query: 237 GARAAVWENFKNNPLFDISTDHP-----TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVF 291
R +W+ N P + P M+ + FCL P G P S RL +A+
Sbjct: 278 -VREKLWKILDNEPEVLLEEGLPDDAGLAEATRGMRSSEFCLTPAGDTPSSCRLYDAIAS 336
Query: 292 GCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK 326
CIPVI++DDI LPF + +EE +FV+ D +
Sbjct: 337 LCIPVIVSDDIQLPFEGFVNYEEFCVFVSARDATQ 371
>gi|356565097|ref|XP_003550781.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
[Glycine max]
Length = 472
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 83/329 (25%), Positives = 131/329 (39%), Gaps = 42/329 (12%)
Query: 80 SPVRTLNPEEADWFYTPIYPTCDL---TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTE 136
+ +R N +AD + P + + + RM++ + + W R+
Sbjct: 166 TAIRVQNSRQADVVFVPFFSSLSYNRHSKIHGKEKVSVNRMLQQRLVQLLMEREEWKRSG 225
Query: 137 GADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY- 195
G DH V H +I R L ++ FG+ N I PY
Sbjct: 226 GRDHVIVAHH----------PNSILRARRKLGSAMLVLADFGRYPSQLANIKKDIIAPYR 275
Query: 196 -----APPQKMQAHQIPPDTPRSIFVYFRGLFY--DVNNDPEGGYYARGARAAVWENFKN 248
P + +++ RS +YF+G Y D + YY V F +
Sbjct: 276 HLVSTVPRAESASYE-----ERSTLLYFQGAIYRKDGGAIRQKLYYLLKDEKDVHFAFGS 330
Query: 249 NPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
I + + M + FCL G P S RL +A+V C+PVII+D+I LPF D
Sbjct: 331 -----IRKNGINQASQGMALSKFCLNVAGDTPSSNRLFDAIVSHCVPVIISDEIELPFED 385
Query: 309 AIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
+ + E G+FV D + L +L SI PE + L + + + + P+QPGD
Sbjct: 386 VLDYSEFGLFVHASDAVRKGYLLNLLRSIKPEKWTQMWERLKD--ITQHFEYQYPSQPGD 443
Query: 366 AFHQILNGLARKLP------HDKSTYLKP 388
A + I +A K+ H K+ Y +P
Sbjct: 444 AVNMIWEEVAHKISSLQFNLHRKNRYQRP 472
>gi|302843170|ref|XP_002953127.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300261514|gb|EFJ45726.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 616
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 87/298 (29%), Positives = 118/298 (39%), Gaps = 58/298 (19%)
Query: 77 LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQL---ISSNWPYWN 133
++SS RTLN EAD+FY +P+ F+ M A ++ + + WP+WN
Sbjct: 231 IISSGHRTLNGGEADYFY-------------IPVDFR--HMFSEAFEVLEYVQATWPFWN 275
Query: 134 RTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG-QRNH------VCLN 186
T G D G C E K + + L A + +G R H C
Sbjct: 276 ATGG---------DLGGC----EGKQLML-LRNLTSAAIWLTPWGLTRKHPRVWWPGCHR 321
Query: 187 EG-SITIPPYAPPQKMQAHQIPPDT---PRSIFVYFRGLFYDVNNDPEGG---------- 232
G I IP A M + P PR+ YF G N DP+
Sbjct: 322 PGHDIVIPVLAQANNMIMTPLNPKVKPLPRNTTFYFAGKICGDNKDPKEDTSSWPICQTP 381
Query: 233 ---YYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAV 289
Y+ G R V+ F + ++Y DM + FCL P G R V
Sbjct: 382 RNPLYSAGVRQLVYFYHSKRDGF-VVRPRSSSYVRDMSTSKFCLAPTG-GGHGKRQVLVG 439
Query: 290 VFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLA 347
+GCIPV I D ++ PF + W + V EEDVP L TIL +I + QR LA
Sbjct: 440 RYGCIPVPITDYVLQPFEPELDWPAFSVTVKEEDVPNLHTILAAINDTKLAEMQRALA 497
>gi|159483641|ref|XP_001699869.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158281811|gb|EDP07565.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 427
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 85/314 (27%), Positives = 123/314 (39%), Gaps = 68/314 (21%)
Query: 72 FMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPY 131
FM R +LS RT NPEEAD+FY P G K +++ + IS+ WP+
Sbjct: 125 FMER-ILSGGHRTHNPEEADFFYIP----------GSSRDLKKAFLLQPLLAYISTTWPF 173
Query: 132 WNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQR-----ATLVQTFGQ------- 179
WN T GA H D G C LPL R T +Q +G
Sbjct: 174 WNATGGARHIMPAEGDVGTC------------ELPLKVRLFTANVTWLQFWGMYDFHPHW 221
Query: 180 ----RNHV-CLNEGSITIPPYAPPQKMQAH-QIPPDTP---------RSIFVYFRGLFYD 224
N + C+ G + P+ M +H + +TP R+ +F G
Sbjct: 222 TQIFHNRIPCMVPGRDIVVPF---MAMSSHDRFVIETPLHPRNQKRNRTNTFFFAGGVCG 278
Query: 225 VNND----PEGGYYAR-----GARAAVWENFKNNPLFDI--STDHPTTYYEDMQRAIFCL 273
N P YY + G R AV+ +F N + + TD Y D + FCL
Sbjct: 279 SGNKRALPPHCTYYKQVRYSGGVRQAVYLHFHNRTGWRVVPGTD---DYARDYASSRFCL 335
Query: 274 CPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTS 333
G W R + A ++GCIPV D + F + W G+ + + ++P+L L +
Sbjct: 336 AAAG-GGWGKRGIVAAMYGCIPVAATDMLYEAFEPEMDWGRFGVRITQAEIPQLADKLEA 394
Query: 334 IPPEVILRKQRLLA 347
+ R Q A
Sbjct: 395 YSEAEVARMQERTA 408
>gi|226529660|ref|NP_001151894.1| secondary cell wall-related glycosyltransferase family 47 [Zea
mays]
gi|195650683|gb|ACG44809.1| secondary cell wall-related glycosyltransferase family 47 [Zea
mays]
Length = 488
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 95/400 (23%), Positives = 155/400 (38%), Gaps = 67/400 (16%)
Query: 28 AGDVLEDDPVGKLKVYVYELPSKYNKKLLL--------------------KDPRCLTHMF 67
A + DP ++V++Y++P +++ LL + P L
Sbjct: 95 AAGAVRCDPRDAVRVFMYDMPPEFHFGLLGWSPPSPDSVWPDVTAASPPPRYPGGLNQQH 154
Query: 68 AAEIFMHRFLLS------------SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLP---F 112
+ E ++ LLS S VR + +AD + P + + P+P
Sbjct: 155 SVEYWLTLDLLSSSSSSSPPCGRHSAVRVSDSRDADLVFVPFFASLSYNRHYRPVPPEKV 214
Query: 113 KSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRAT 172
R ++ + + P W R GADH +V H + H + +L A
Sbjct: 215 SRDRALQEKLVRYLAARPEWRRFGGADH-VIVAHHPNSLLHARA----------VLHPAV 263
Query: 173 LVQT-FGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEG 231
V + FG+ + I PY K A+ R +YFRG Y EG
Sbjct: 264 FVLSDFGRYPPRVASLEKDVIAPYKHMAKTYANDSAGFDDRPTLLYFRGAIY----RKEG 319
Query: 232 G------YYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRL 285
G YY V+ +F + + + + M + FCL G P S RL
Sbjct: 320 GSIRQELYYMLKEEKDVYFSFGS-----VQDHGASKASQGMHSSKFCLNIAGDTPSSNRL 374
Query: 286 VEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRK 342
+A+V C+PVII+DDI LP+ D + + + +FV D K L +L+ + + +
Sbjct: 375 FDAIVTHCVPVIISDDIELPYEDVLDYSKFSIFVRSSDAVKKGYLMRLLSGVSKQQWTKM 434
Query: 343 QRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDK 382
L + + + P+Q DA I L+RK+P K
Sbjct: 435 WDRLKE--VDKHFEYQYPSQKDDAVQMIWQALSRKVPSIK 472
>gi|412988024|emb|CCO19420.1| exostosin-like glycosyltransferase [Bathycoccus prasinos]
Length = 909
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 86/342 (25%), Positives = 137/342 (40%), Gaps = 70/342 (20%)
Query: 66 MFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSA---- 121
++ E +H FLL S RT+NPE AD F+ P Y TC P P S ++++
Sbjct: 502 FYSLENTLHEFLLRSAHRTINPENADVFFIPQYGTCYRLAYQTPSPQVSLSLIKTKPGDR 561
Query: 122 ----------IQLISSNWP-------------YWNRTEGADHFFVVPHDFGACFHYQEEK 158
+ N P Y++R EG DH + +D GA H+ +
Sbjct: 562 SHAANLFLERVTEYVRNIPFNVINNEKGEIQSYFDRNEGRDHAVIAAYDEGA-VHFPD-- 618
Query: 159 AIERGIL------------------------PLLQRATLVQTFGQRNHVCLNEGSITIPP 194
+I I L+++ + + N I PP
Sbjct: 619 SIANAIFITHWGNTGYPRNSSHTAYSPDKWDELVKQGVVTGAWRAYNR----NKDIVAPP 674
Query: 195 YAPPQKMQAHQ------IPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN 248
++ P+ + + P T R+ F +F G P G Y+RG R V ++N
Sbjct: 675 WSQPKTNEVREPADVNSWTPATQRTTFCFFSGNL--GLEKPWGEDYSRGLRQKVARRWQN 732
Query: 249 NPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
FDI + H Y ++ + FCL G WS L + GCIPVI+ D + +P+
Sbjct: 733 VYGFDILS-HTDDYLGRIRSSKFCLALPG-DGWSGGLSVYIRNGCIPVIVQDGVDMPWEG 790
Query: 309 A-IPWEEIGMFVAEEDVP-KLDTILTSIPPEVILRKQRLLAN 348
+ + + + V E DV +L ++L ++ PE + Q L N
Sbjct: 791 TFLDYSKFSIRVREGDVENRLQSVLETVTPEELQNLQNGLKN 832
>gi|414882087|tpg|DAA59218.1| TPA: hypothetical protein ZEAMMB73_484283 [Zea mays]
Length = 479
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 77/320 (24%), Positives = 127/320 (39%), Gaps = 29/320 (9%)
Query: 80 SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLP---FKSPRMMRSAIQLISSNWPYWNRTE 136
S VR +P +AD + P + + PLP + ++ + + P W R
Sbjct: 170 SAVRVTDPRDADLVFVPFFASLSYNRHSRPLPPEKVGRDKALQEKLVGYLTARPEWRRFG 229
Query: 137 GADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYA 196
GADH +V H + H + A+ + ++ FG+ + I PY
Sbjct: 230 GADHV-IVAHHPNSLLHARA--ALSPAVF-------VLSDFGRYPPRVASLEKDVIAPYK 279
Query: 197 PPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGG------YYARGARAAVWENFKNNP 250
K + R +YFRG Y EGG YY V+ +F +
Sbjct: 280 HMAKTFVNDSAGFDDRPTLLYFRGAIYR----KEGGTIRQELYYMLKDEKDVYFSFGS-- 333
Query: 251 LFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAI 310
+ + + M + FCL G P S R+ +A+V C+PVII+DDI LP+ D +
Sbjct: 334 ---VQDHGASKASQGMHSSKFCLNIAGDTPSSNRMFDAIVSHCVPVIISDDIELPYEDVL 390
Query: 311 PWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLAN-PSMKRAMLFPQPAQPGDAFHQ 369
+ + +FV D + ++ + R + + + R + P+Q DA
Sbjct: 391 DYSKFSIFVRSSDAVEKGHLMRLLSGVSKQRWTEMWSRLREVDRHFEYQYPSQKDDAVQM 450
Query: 370 ILNGLARKLPHDKSTYLKPG 389
I L+RK+P K + G
Sbjct: 451 IWRSLSRKVPSIKLKVHRSG 470
>gi|357436207|ref|XP_003588379.1| Exostosin-2 [Medicago truncatula]
gi|355477427|gb|AES58630.1| Exostosin-2 [Medicago truncatula]
Length = 551
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 81/304 (26%), Positives = 133/304 (43%), Gaps = 55/304 (18%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEI-FMHRFLLSSPVRTLNPEEADWFYTPI 97
+LKVYVY + + +L P LT ++A+E FM + T NP++A FY P
Sbjct: 253 RLKVYVY----REGARPILHSP-FLTGIYASEGWFMKLMEANKRFVTKNPKKAHLFYLPF 307
Query: 98 YPTCDLTPTGLPLPFKSPRMMRSAIQ-------LISSNWPYWNRTEGADHFFVVPHDFGA 150
+ + L + K+ ++ IQ +I++ +WNRT GADHF V HD+
Sbjct: 308 --SSRMLEEALYV--KNSHSHKNLIQYLHDYVDMIAARHSFWNRTGGADHFLVGCHDWAP 363
Query: 151 CFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPD- 209
E K + L A + + F V + S+ P ++ QIP
Sbjct: 364 S----ETKLRLANCIRSLCNADVKEGF-----VFGKDASL------PETYVRNAQIPTRD 408
Query: 210 ------TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDI-----STDH 258
+ ++ +F G + Y R WEN +P I ++
Sbjct: 409 LGGNSFSKKTTLAFFAGSMHG---------YVRPILLKHWEN--KDPDMKIFGKLPNSKG 457
Query: 259 PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
+ Y M+ + +C+C G+ SPR+VEA+ + C+PVII+D+ V PF + + WE +
Sbjct: 458 NSNYIHYMKSSKYCICAKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEVLDWESFSVI 517
Query: 319 VAEE 322
V +
Sbjct: 518 VQKH 521
>gi|449452903|ref|XP_004144198.1| PREDICTED: probable glycosyltransferase At3g07620-like [Cucumis
sativus]
Length = 429
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 87/374 (23%), Positives = 149/374 (39%), Gaps = 51/374 (13%)
Query: 40 LKVYVYELPSKYNKKLLLKD---------------PR--CLTHMFAAEIFMHRFLL---- 78
L+VY+Y+LP ++N +L + PR L + E +M LL
Sbjct: 52 LRVYMYDLPRRFNVGILNRRNLDQTPVTASTWPPWPRNSGLKRQHSVEYWMMGSLLHEAT 111
Query: 79 ---SSPVRTLNPEEADWFYTPIYPTCDLTPTGLPL---PFKSPRMMRSAIQLISSNWPYW 132
VR ++PE AD F+ P + + G + + ++ + S YW
Sbjct: 112 GDGRDAVRVMDPENADAFFVPFFSSLSFNSHGRNMTDPATEVDHQLQIDLMKFLSESKYW 171
Query: 133 NRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITI 192
R++G DH V+P F + + + +V FG+ N G +
Sbjct: 172 QRSKGRDH--VIPMTHPNAFRFLRNQ--------VNASIQIVVDFGRYPKTMSNLGKDVV 221
Query: 193 PPYAPPQKMQAHQIPPD--TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNP 250
PY PPD R ++F+G + ++ R A + + + +
Sbjct: 222 APYVHVVSSFIDDNPPDPFESRPTLLFFQGKTFRKDDG-----IIRVKLAKILDGYDDVH 276
Query: 251 LFDISTDHPT--TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
+ + T + M+ + FCL P G P S RL +A+V C+PVI++D I LP+ D
Sbjct: 277 YERSAATEKSIKTSSQGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPYED 336
Query: 309 AIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
I + + +F + E+ + + L P E + + L S R F P + D
Sbjct: 337 EIDYSQFTLFFSFEEALQPGYMVEKLREFPKERWIEMWKQLKEIS--RHYEFQYPPKKED 394
Query: 366 AFHQILNGLARKLP 379
A + + + KLP
Sbjct: 395 AVNMLWRQVKHKLP 408
>gi|356519776|ref|XP_003528545.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
[Glycine max]
Length = 452
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 82/321 (25%), Positives = 136/321 (42%), Gaps = 53/321 (16%)
Query: 82 VRTLNPEEADWFYTPIYPTCDL---------TPTGLPLPFKSPRMMRSAIQLISSNWPYW 132
V +PEEAD F+ P + + L + +G P S + A+ YW
Sbjct: 130 VLVADPEEADLFFVPFFSSLSLIVNPVRPPGSNSGSEKPVYSDEENQEALVEWLEKQEYW 189
Query: 133 NRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSIT- 191
R G DH V + A+ R I + LV FG+ + ++GS+
Sbjct: 190 KRNSGRDHVIVA----------SDPNAMYRVIDRVRNAVLLVSDFGR---LRPDQGSLVK 236
Query: 192 --IPPYAPPQKMQAHQIPPDTP-RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN 248
+ PY+ +++ +Q R+ ++F G Y EGG R + EN K
Sbjct: 237 DVVVPYS--HRIRTYQGDAGVEDRNTLLFFMGNRYR----KEGGKI-RDILFKILENEK- 288
Query: 249 NPLFDISTDHPTTYYED-------MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 301
D+ H E M + FCL P G P + RL +A+V CIPVI++D+
Sbjct: 289 ----DVIIKHGAQSRESRRAASQGMHTSKFCLHPAGDTPSACRLFDAIVSLCIPVIVSDN 344
Query: 302 IVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFP 358
I LPF D I + ++ +F+ K L + L ++ P+ +L Q+ +K +
Sbjct: 345 IELPFEDTIDYRKLAVFIETSSAIKPGYLVSKLRALTPDRVLAYQK-----ELKEVKRYF 399
Query: 359 QPAQPGDAFHQILNGLARKLP 379
+ +P ++I +++KLP
Sbjct: 400 EYEEPDGTVNEIWRQVSKKLP 420
>gi|302835860|ref|XP_002949491.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300265318|gb|EFJ49510.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 554
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 101/357 (28%), Positives = 139/357 (38%), Gaps = 56/357 (15%)
Query: 42 VYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTC 101
VYVYELP L K TH+ +F R LL S R + + ADW+Y PI
Sbjct: 121 VYVYELPPHLFTWLNTKRLDRSTHL----MFYQR-LLGSGARVADGDLADWYYIPI---- 171
Query: 102 DLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIE 161
T F ++ AI+ I +P+WNRT GA HF + D GA +
Sbjct: 172 -RLRTATDSAF-----LKYAIEYIREAYPWWNRTGGARHFVIHTGDLGA-------DEVM 218
Query: 162 RGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVY-FRG 220
+ + T + +G +T+ +AH+ PD R+ + RG
Sbjct: 219 DDVYGMAANMTWLTHWG-----------LTVDKNTSGW-WKAHR--PDKARAGARWGTRG 264
Query: 221 LFY---DVNND--------PEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRA 269
+Y VN P + G R V + N F I T Y + + +
Sbjct: 265 GYYTRVSVNRRRGSHMWGPPSPAPHRAGVRQKVHFHHWNRTGFRIVT-FERNYGKALVSS 323
Query: 270 IFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDT 329
FCL PLG R + GCIPV IAD + PF W E + AE D+P+L
Sbjct: 324 KFCLAPLG-GGHGQRQIIVSYMGCIPVCIADGVYEPFEPQTDWTEFAVRPAEADIPRLHE 382
Query: 330 ILTSIPPEVILRKQ----RLLANPSMKRAMLFPQPAQPG--DAFHQILNGLARKLPH 380
IL I L + R A + +M+ + G DAF L L K H
Sbjct: 383 ILEGISAGNKLAEMQVALRCAAQHLLYSSMVGGLFGEDGRYDAFETTLEVLRVKAAH 439
>gi|449489243|ref|XP_004158257.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
At3g07620-like [Cucumis sativus]
Length = 429
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 87/374 (23%), Positives = 148/374 (39%), Gaps = 51/374 (13%)
Query: 40 LKVYVYELPSKYNKKLLLKD---------------PR--CLTHMFAAEIFMHRFLL---- 78
L+VY+Y+LP ++N +L + PR L + E +M LL
Sbjct: 52 LRVYMYDLPRRFNVGILNRRNLDQTPVTASTWPPWPRNSGLKRQHSVEYWMMGSLLHEAT 111
Query: 79 ---SSPVRTLNPEEADWFYTPIYPTCDLTPTGLPL---PFKSPRMMRSAIQLISSNWPYW 132
VR ++PE AD F+ P + + G + + ++ + S YW
Sbjct: 112 GDGRDAVRVMDPENADAFFVPFFSSLSFNSHGRNMTDPATEVDHQLQIDLMKFLSESKYW 171
Query: 133 NRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITI 192
R++G DH V+P F + + + +V FG+ N G +
Sbjct: 172 QRSKGRDH--VIPMTHPNAFRFLRNQ--------VNASIQIVVDFGRYPKTMSNLGKDVV 221
Query: 193 PPYAPPQKMQAHQIPPD--TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNP 250
PY PPD R ++F+G + ++ R A + + + +
Sbjct: 222 APYVHVVSSFIDDNPPDPFESRPTLLFFQGKTFRKDDG-----IIRVKLAKILDGYDDVH 276
Query: 251 LFDISTDHPT--TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
+ + T + M+ + FCL P G P S RL +A+V C+PVI++D I LP+ D
Sbjct: 277 YERSAATEKSIKTSSQGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPYED 336
Query: 309 AIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
I + + +F E+ + + L P E + + L S R F P + D
Sbjct: 337 EIDYSQFTLFFXFEEALQPGYMVEKLREFPKERWIEMWKQLKEIS--RHYEFQYPPKKED 394
Query: 366 AFHQILNGLARKLP 379
A + + + KLP
Sbjct: 395 AVNMLWRQVKHKLP 408
>gi|159490316|ref|XP_001703125.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158270755|gb|EDO96590.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 528
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 85/315 (26%), Positives = 133/315 (42%), Gaps = 49/315 (15%)
Query: 42 VYVYELPSK----YNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPI 97
VYVYELP + YN + L + THM +F R LLSS R + ++AD++Y P+
Sbjct: 181 VYVYELPPELSAWYNHRRLDRP----THM----LFAQR-LLSSGARIADGDQADYYYIPV 231
Query: 98 YPTCDLTPTGLPLP-FKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQE 156
+ P F S + +I + W R E + + + H+
Sbjct: 232 RQRSTMFALAAPCSSFMSAHTLVPGRFMIRTGCGDWGRAEMDPNLLQLTENMTWLTHWGL 291
Query: 157 EKAIERGILPLL-----QRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTP 211
IL + ++ +V F H + S+ P P K + P D
Sbjct: 292 TTDRHDKILRWVPSYRPEKDVVVPVFISPGH--FVKFSMIHTPLNPANKAK----PRDKA 345
Query: 212 RSIF---VYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQR 268
R F + F + V++ GY+ AR+ Y + + R
Sbjct: 346 RFFFAGRICFNSKWVFVSHWNRSGYHV--ARSE------------------KRYGQYLAR 385
Query: 269 AIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLD 328
+++CL P G A R ++A+ GC+PV IAD + PF A+ W + G+ VAE DVP++
Sbjct: 386 SLYCLAPPG-AGHGQRQIQALFMGCVPVTIADGVAEPFEPAVNWTDWGVRVAEADVPQMH 444
Query: 329 TILTSIPPEVILRKQ 343
T+L I PE + KQ
Sbjct: 445 TLLDDIGPEQLAVKQ 459
>gi|224093268|ref|XP_002309859.1| predicted protein [Populus trichocarpa]
gi|222852762|gb|EEE90309.1| predicted protein [Populus trichocarpa]
Length = 420
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 75/307 (24%), Positives = 129/307 (42%), Gaps = 59/307 (19%)
Query: 59 DPR-CLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRM 117
DPR L +A+E + + L+ S T +P A F P+ +C T + +
Sbjct: 101 DPRDKLKRKYASEHYFLKNLIPSSFFTDDPTVAHLFLIPL--SCKKTGG------REEKD 152
Query: 118 MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGA-----------------CFHYQEEKAI 160
+ + ++ + S++PYWNRT GADHF+ H + C + K I
Sbjct: 153 IENYVKSLISSYPYWNRTLGADHFYFSCHGIDSGTIEEVPLLMKNVIRLVCSPSYDSKYI 212
Query: 161 ERGILPLLQRATLVQTFGQ---RNHVCLNEGSITIPPYAPPQKMQ----AHQIPPDTPRS 213
+ L Q L G ++ + P P++ + A + D ++
Sbjct: 213 PHKDISLPQTLELSLHDGDDVWSRSTVMSRPLMIYPEMMLPRRTKLGFWAGSLNSDVRKN 272
Query: 214 IFVYFRGLFYDVNNDPEGGY--YARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIF 271
+ V+++G PE + + + +AA+ + ++N ++ + F
Sbjct: 273 LQVFYKGA-------PEFNFHFFDKMKKAAILDAYEN----------------ELYGSKF 309
Query: 272 CLCPLGWAP-WSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTI 330
C+CP G S L E++ FGC+PVI+ D PF D + W + + EE VP L+ I
Sbjct: 310 CICPRGNNHVGSVCLTESMTFGCVPVILHDYYDFPFNDVLDWNNFSVILKEEHVPDLEKI 369
Query: 331 LTSIPPE 337
L IP E
Sbjct: 370 LKGIPEE 376
>gi|226508962|ref|NP_001152546.1| exostosin-like [Zea mays]
gi|195657373|gb|ACG48154.1| exostosin-like [Zea mays]
gi|414878975|tpg|DAA56106.1| TPA: exostosin-like protein [Zea mays]
Length = 427
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 83/314 (26%), Positives = 133/314 (42%), Gaps = 39/314 (12%)
Query: 82 VRTLNPEEADWFYTPIYPTCDLTPTGLPLP---FKSPRMMRSAIQLISSNWP--YWNRTE 136
VR +P+ AD F+ P + + G + ++ R+++ ++L+ W YW R+
Sbjct: 118 VRVRDPDAADAFFVPFFSSLSFNVHGRNMTDPDTEADRLLQ--VELVDILWKSKYWQRSA 175
Query: 137 GADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYA 196
G DH V+P F + + IL +V FG+ + + PY
Sbjct: 176 GRDH--VIPMHHPNAFRFL-RAMVNASIL-------IVSDFGRYTKELASLRKDVVAPYV 225
Query: 197 PPQKMQAHQIPPD--TPRSIFVYFRGLFYDVNNDPEGGYYA------RGARAAVWEN-FK 247
PPD R ++FRG V D EG + +G +E+
Sbjct: 226 HVVGSFLDDDPPDPFEARHTLLFFRG--RTVRKD-EGKIRSKLEKILKGKEGVRFEDSIA 282
Query: 248 NNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFA 307
+IST E M+ + FCL P G P S RL +A+V C+PVI++ I LPF
Sbjct: 283 TGDGINIST-------EGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSSRIELPFE 335
Query: 308 DAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRK--QRLLANPSMKRAMLFPQPAQPGD 365
D I + E +F + E+ + D +L + +V RK L ++ F P + GD
Sbjct: 336 DEIDYSEFSLFFSVEEALRPDYLLNEL-RQVPKRKWVDMWLKLKNVSHHYEFQYPPRKGD 394
Query: 366 AFHQILNGLARKLP 379
A + I + K+P
Sbjct: 395 AVNMIWRQVRHKIP 408
>gi|326521044|dbj|BAJ92885.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 289
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 79/281 (28%), Positives = 118/281 (41%), Gaps = 51/281 (18%)
Query: 64 THMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLT----------PTGLPL-PF 112
+M A + ++ +L+SP RTLN +EAD+FY P+ +C +T P L L +
Sbjct: 4 NYMDTATMALYESILASPHRTLNGDEADYFYVPVLDSCLITRSDDAPHLRMPEDLRLRSY 63
Query: 113 KSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQE-------------EKA 159
+ R A I+ +PYWNRT G DH + D GAC+ +E
Sbjct: 64 HTLEYYRKAYDHIAQRYPYWNRTSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWGNTNTK 123
Query: 160 IERGILPLLQRATLVQTFGQR-NHVCLN-EGSITIPPYAPPQ------KMQAHQIPPDTP 211
E+ F +R NH C + + +P + P K+ A P
Sbjct: 124 HEKSTTAYWADNWDDIPFDRRGNHPCFDPRKDLVLPAWKEPNPGAIWLKLWAR---PKIN 180
Query: 212 RSIFVYFRGLFYDVNND--PEGGYYARGARAAVWENFKNNP-----LFDISTDHPTT--- 261
R+ YF G + PE Y+ G R + F + P L T + T
Sbjct: 181 RTTLFYFNGNLGPAYEEGRPEDT-YSMGIRQKLAAEFGSTPNKQGKLGRQQTANVTVTYL 239
Query: 262 ----YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 298
YYE++ +IFC G WS R+ ++++ GCIPVII
Sbjct: 240 KSEMYYEELASSIFCGVLPG-DGWSGRMEDSMLQGCIPVII 279
>gi|255543306|ref|XP_002512716.1| catalytic, putative [Ricinus communis]
gi|223548677|gb|EEF50168.1| catalytic, putative [Ricinus communis]
Length = 426
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 74/275 (26%), Positives = 116/275 (42%), Gaps = 26/275 (9%)
Query: 63 LTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAI 122
L + F+ IF L SS V TL+P +A +F+ P +
Sbjct: 126 LGNYFSEHIFKVALLRSSLV-TLDPAKALFFFLPFSINNLRNDPRFHSEESISEFVAHYT 184
Query: 123 QLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNH 182
IS + YWN + GADHF+V H G A+ + L ++ Q F
Sbjct: 185 TTISQRFSYWNASAGADHFYVCCHSVGRQA-ASRHPALHNNAIQLTCSSSYFQRF----- 238
Query: 183 VCLNEGSITIPPY--APPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARA 240
++ + +P PPQ + P R VYF G V N R
Sbjct: 239 -FVSHKDVGLPQVWPRPPQTA----LNPPHARHRLVYFAG---RVQNSQ-----VRRELV 285
Query: 241 AVWENFKNNPLFDISTDHPTTYYED-MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIA 299
+W N+ DI P+ YE+ +R+ +CL G+ + R+ +++ +GCIPVII+
Sbjct: 286 NLW---GNDTEMDIINGSPSFPYEEGFKRSKYCLHVKGYEVNTARVSDSIHYGCIPVIIS 342
Query: 300 DDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSI 334
+ LPFA + W + + + + D+P L T L +I
Sbjct: 343 NYYDLPFATVLDWSKFSVVINQADIPFLKTTLLAI 377
>gi|356570361|ref|XP_003553358.1| PREDICTED: probable glycosyltransferase At3g07620-like [Glycine
max]
Length = 520
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 75/311 (24%), Positives = 121/311 (38%), Gaps = 31/311 (9%)
Query: 80 SPVRTLNPEEADWFYTPIYPTC---DLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTE 136
S +R N EAD + P + + L+ TG ++++ + + W R+
Sbjct: 197 SVIRVRNSSEADVIFVPFFSSLCYNRLSKTGPHEKRSRNKLLQEKLVKYVTAQEEWKRSG 256
Query: 137 GADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYA 196
G DH + H ++ + L ++ FG+ N I PY
Sbjct: 257 GKDHVILAHH----------PNSMLDARMKLWPGTFILSDFGRYPTNIANVEKDVIAPYK 306
Query: 197 PPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN-----PL 251
+ R+ +YF+G Y +GG+ R ++ KN
Sbjct: 307 HVVGSYDNDQSSFDSRTTLLYFQGAIYR----KDGGH----VRHELYYLLKNEKDVHFSF 358
Query: 252 FDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
+ E M+ + FCL G P S RL +A+ C+PVII+DDI LP+ D +
Sbjct: 359 GSVQKGGVRKATEGMRSSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDDIELPYEDVLD 418
Query: 312 WEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFH 368
+ + +FV D K L + SI E R L ++ F P++ GDA
Sbjct: 419 YSQFCIFVRTRDALKKRYLINFIRSIGKEEWTRMWNRL--KEVESFFEFQFPSKEGDAVQ 476
Query: 369 QILNGLARKLP 379
I +ARK+P
Sbjct: 477 MIWKAIARKVP 487
>gi|357478277|ref|XP_003609424.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
gi|355510479|gb|AES91621.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
Length = 229
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 43/115 (37%), Positives = 67/115 (58%), Gaps = 6/115 (5%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + M + FCLCP G SPR+VEA+ GC+PVII D+ LPF+D + W + M +A
Sbjct: 102 YTKLMGLSKFCLCPSGHEVASPRVVEAIYAGCVPVIICDNYSLPFSDVLNWSQFSMEIAV 161
Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANP--SMKRAMLFPQPAQPGDAFHQILNGL 374
+ +P++ TIL +I K R+L + +++ +PA+P D H IL+ +
Sbjct: 162 DRIPEIKTILQNITET----KYRVLYSNVRRVRKHFEMNRPAKPFDLIHMILHSV 212
>gi|223975191|gb|ACN31783.1| unknown [Zea mays]
gi|414887161|tpg|DAA63175.1| TPA: exostosin-like protein [Zea mays]
Length = 500
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 97/386 (25%), Positives = 154/386 (39%), Gaps = 72/386 (18%)
Query: 40 LKVYVYELPSKYNKKLL--LKDPRCLT------------------HMFAAEIFMHRFLL- 78
+K+Y+Y+LP+K+ ++ K R + H +AE ++ + LL
Sbjct: 104 VKIYLYDLPAKFTYGVVRSYKAARATSGSADAAATLPDEQLRYPGHQHSAEWWLFKDLLR 163
Query: 79 ----SSPV-RTLNPEEADWFYTPIYPTCDL--TPTGLPLPFK--------SPRMMRSAIQ 123
PV R +P +AD FY P + + L P PL S ++ +
Sbjct: 164 RRPQDRPVARVDDPSDADLFYVPFFSSLSLVVNPIRSPLAANASGAGTAYSDEALQEELL 223
Query: 124 LISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHV 183
YW R G DH F+ Q+ A+ R + + LV FG+
Sbjct: 224 EWLERQLYWQRHRGRDHVFIC----------QDPNALYRVVDRISNAVLLVSDFGRLRSD 273
Query: 184 CLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVW 243
+ I PY+ ++ D R ++F G Y EGG R A++
Sbjct: 274 QASLVKDVILPYSHRINSFKGEVGVDG-RPSLLFFMGNRYR----KEGG----KVRDALF 324
Query: 244 ENFKNNPLFDISTDHPTTYYED-------MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPV 296
+ +N D++ H T E M + FCL P G P + RL +A+V C+PV
Sbjct: 325 QILENED--DVTIKHGTQSRESRREATQGMHSSKFCLHPAGDTPSACRLFDALVSLCVPV 382
Query: 297 IIADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKR 353
I +D I LPF D I + +I +FV + L + L I E IL QR +K+
Sbjct: 383 IASDYIELPFEDIIDYNKISIFVGTSKAVQPGYLTSTLRRISSERILEYQR-----EIKK 437
Query: 354 AMLFPQPAQPGDAFHQILNGLARKLP 379
+ + P ++I ++ K P
Sbjct: 438 VRHYFEYEDPNGPVNEIWRQVSLKAP 463
>gi|307110230|gb|EFN58466.1| hypothetical protein CHLNCDRAFT_140476 [Chlorella variabilis]
Length = 632
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 87/344 (25%), Positives = 134/344 (38%), Gaps = 53/344 (15%)
Query: 55 LLLKDPRCLTHM----FAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPT---- 106
+ L+ R H+ +A E LL S RTL+PEEAD+FY P++ +C + P
Sbjct: 220 VCLRAQRGAHHLWEWTYALEAGFLEMLLQSEHRTLDPEEADFFYVPVFTSCFINPVRDGA 279
Query: 107 -GLPLPF---------KSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQE 156
L F + M+ A + + +PYW R G DH ++V HD +C+
Sbjct: 280 DSLRDFFYGVSHNRVQGAANMLLEAYHWVQAMFPYWERRGGRDHIWLVTHDEASCWVPAA 339
Query: 157 EKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPP---QKMQAHQ-------- 205
++ IL R T G + +T P + P K+ Q
Sbjct: 340 IRSTSI-ILSHWGRMDAHHTSGTGYSADVYSNDVTHPQFEPDGFLGKLNLTQPCYDPVKD 398
Query: 206 --IP----------------PDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAV---WE 244
+P P R+ + RG N G R A+AA W
Sbjct: 399 LVVPLMKTPEHYRLSPLVGAPPRQRTWLAFHRGRVQADNPPYSRGIRQRLAKAAAEGGWL 458
Query: 245 NFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL 304
+ + T Y E + ++FC G WS R+ +A++ GCIPV+I D++ +
Sbjct: 459 EKHKIAVGEYDTLQ-GDYSELLASSVFCPVIPG-DGWSARMDDAMLHGCIPVLIMDEVQV 516
Query: 305 PFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLAN 348
F + + + E D KL IL ++ E QR LA
Sbjct: 517 SFESVVDLSTFTIRIPEADAEKLPDILQAVTQERREEMQRALAR 560
>gi|195611602|gb|ACG27631.1| exostosin-like [Zea mays]
Length = 456
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 94/351 (26%), Positives = 140/351 (39%), Gaps = 67/351 (19%)
Query: 40 LKVYVYELPSKY------------------NKKLLLKDP--RCLTHMFAAEIFMHRFLL- 78
+K+Y+Y+LP+K+ N L D R H +AE ++ + LL
Sbjct: 60 VKIYLYDLPAKFTYGVVRSYKAARATSGSANAAATLPDEQLRYPGHQHSAEWWLFKDLLR 119
Query: 79 ----SSPV-RTLNPEEADWFYTPIYPTCDL--TPTGLPLPFK--------SPRMMRSAIQ 123
PV R +P +AD FY P + + L P PL S ++ +
Sbjct: 120 RRPQDRPVARVDDPSDADLFYVPFFSSLSLVVNPIRSPLAANASGAGTAYSDEALQEELL 179
Query: 124 LISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHV 183
YW R G DH F+ Q+ A+ R + + LV FG+
Sbjct: 180 EWLERQLYWQRHRGRDHVFIC----------QDPNALYRVVDRISNAVLLVSDFGRLRSD 229
Query: 184 CLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVW 243
+ I PY+ ++ D R ++F G Y EGG R A++
Sbjct: 230 QASLVKDVILPYSHRINSFKGEVGVDG-RPSLLFFMGNRYR----KEGG----KVRDALF 280
Query: 244 ENFKNNPLFDISTDHPTTYYED-------MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPV 296
+ +N D++ H T E M + FCL P G P + RL +A+V C+PV
Sbjct: 281 QILENED--DVTIKHGTQSRESRRAATQGMHSSKFCLHPAGDTPSACRLFDALVSLCVPV 338
Query: 297 IIADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQR 344
I +D I LPF D I + +I +FV + L + L I E IL QR
Sbjct: 339 IASDYIELPFEDIIDYNKISIFVGTSKAVQPGYLTSTLRRISSERILEYQR 389
>gi|255580236|ref|XP_002530948.1| catalytic, putative [Ricinus communis]
gi|223529463|gb|EEF31420.1| catalytic, putative [Ricinus communis]
Length = 512
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 89/380 (23%), Positives = 149/380 (39%), Gaps = 62/380 (16%)
Query: 40 LKVYVYELPSKYNKKLL------------LKD-----PRCLTHMFAAEIFMHRFLLS--- 79
LKV++Y+LP +++ +LL L+ P L + E ++ LL+
Sbjct: 110 LKVFMYDLPPEFHFELLDWKAQGDSVWPDLRTKIPGYPGGLNLQHSIEYWLTLDLLASEI 169
Query: 80 -------SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPF----KSPRMMRSAIQLISSN 128
S +R N EAD + P + + P K+ + ++ ++S
Sbjct: 170 SGIPRAGSAIRVRNSSEADVIFVPFFSSLSYNRYSKVNPHQKRSKNKLLQEKLVRYVTSQ 229
Query: 129 WPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG 188
W R++G DH + H ++ + L ++ FG+ N
Sbjct: 230 ME-WKRSQGQDHIILAHH----------PNSMLDARMKLWPALFILADFGRYPPNIANVD 278
Query: 189 SITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN 248
I PY + A R +YF+G Y +GG+ AR ++ K+
Sbjct: 279 KDLIAPYKHVIRSYADDSSTFDSRPTLLYFQGAIY----RKDGGF----ARQELFYLLKD 330
Query: 249 NP-----LFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 303
+ D + M + FCL G P S RL +A+ C+PVII+DDI
Sbjct: 331 EKDVHFQFGSVQKDGINKASQGMHTSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDDIE 390
Query: 304 LPFADAIPWEEIGMFVAEEDVPKLDTILTSI----PPEVILRKQRLLANPSMKRAMLFPQ 359
LP+ D + + + +FV D K ++ I E Q+L ++R F
Sbjct: 391 LPYEDVLDYSQFCIFVRTSDAIKEKFLINLIRGIGKDEWTQMWQKL---KEVERFFEFQY 447
Query: 360 PAQPGDAFHQILNGLARKLP 379
P++ GDA I +ARK+P
Sbjct: 448 PSKEGDAVQMIWQAVARKVP 467
>gi|326432404|gb|EGD77974.1| hypothetical protein PTSG_09607 [Salpingoeca sp. ATCC 50818]
Length = 377
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 72/232 (31%), Positives = 96/232 (41%), Gaps = 22/232 (9%)
Query: 138 ADHFFVVPHDFGACFHYQEE----KAIERGILPLLQRATLVQTFGQRNHVCLNEGS-ITI 192
A V+ D G C ++ E + ++R I L+ ATL+ G + C G + I
Sbjct: 116 ARFLVVLSSDHGPCHNFNEHLTGRRLVKRWIDRSLENATLLMNDGSLVNKCYRPGKDVVI 175
Query: 193 PPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLF 252
PP P T R F +F G + R + N
Sbjct: 176 PPSTWIGNATFACSRPITDRKHFAFFAGAASSL------------IREYIINELGNEDWL 223
Query: 253 DISTD-HPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
I D Y +M A+FCL P G A WSPRLVEA+ GCIPVIIAD PF D +
Sbjct: 224 FIPHDLQHEEYMCEMGNAVFCLAPRGRAAWSPRLVEALEAGCIPVIIADMNHEPFHDVLD 283
Query: 312 WEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAML-FPQPAQ 362
+ + V E+ + L L SI + RL AN RA +P PA+
Sbjct: 284 YSTFTVQVHEDKLETLGEQLHSISSGQV---ARLHANGQRARAHFRYPPPAR 332
>gi|115487106|ref|NP_001066040.1| Os12g0124400 [Oryza sativa Japonica Group]
gi|77552918|gb|ABA95714.1| secondary cell wall-related glycosyltransferase family 47,
putative, expressed [Oryza sativa Japonica Group]
gi|113648547|dbj|BAF29059.1| Os12g0124400 [Oryza sativa Japonica Group]
gi|215678612|dbj|BAG92267.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218186350|gb|EEC68777.1| hypothetical protein OsI_37312 [Oryza sativa Indica Group]
Length = 475
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 75/310 (24%), Positives = 127/310 (40%), Gaps = 29/310 (9%)
Query: 80 SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLP----FKSPRMMRSAIQLISSNWPYWNRT 135
+ VR + +AD + P + + +P + + ++ + + P W R+
Sbjct: 166 AAVRVADSRDADVVFVPFFASLSYNRHSRVVPPEKVSRDKELQEKLVRYLMAQ-PEWKRS 224
Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
GADH +V H + H + + P++ ++ FG+ + + I PY
Sbjct: 225 GGADHV-IVAHHPNSLLHARSV------LFPVV---FVLSDFGRYHPRVASLEKDVIAPY 274
Query: 196 APPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNP----L 251
K + R +YFRG + EGG R ++ K+
Sbjct: 275 KHMAKTFVNDSAGFDDRPTLLYFRGAIFR----KEGG----NIRQELYYMLKDEKDVYFA 326
Query: 252 FDISTDH-PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAI 310
F DH + + M + FCL G P S RL +A+V C+PVII+DDI LP+ DA+
Sbjct: 327 FGSVQDHGASKASKGMHASKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDDIELPYEDAL 386
Query: 311 PWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLAN-PSMKRAMLFPQPAQPGDAFHQ 369
+ + +FV D K ++ I + R+ + + + P+Q DA
Sbjct: 387 DYSKFSIFVRSSDAVKKGYLMRLIRGVSKHQWTRMWNRLKEVDKHFEYQYPSQKDDAVQM 446
Query: 370 ILNGLARKLP 379
I LARK+P
Sbjct: 447 IWQALARKVP 456
>gi|302804107|ref|XP_002983806.1| hypothetical protein SELMODRAFT_445668 [Selaginella moellendorffii]
gi|300148643|gb|EFJ15302.1| hypothetical protein SELMODRAFT_445668 [Selaginella moellendorffii]
Length = 1068
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 95/349 (27%), Positives = 149/349 (42%), Gaps = 51/349 (14%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
+ KVYVY + + L +D + +F+ L S V T +PE AD F P
Sbjct: 55 RFKVYVYPMIQNASAPDL-RDGKAARPGSIDRVFVDSLLASGFV-TDDPEAADLFLLP-- 110
Query: 99 PTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
+ P P ++S IQ + +PYW R+ GADHFFV HD + +
Sbjct: 111 ASISAIWKKRPDPKGIAHSLKSYIQQLRDLYPYWQRSLGADHFFVSCHDITSDW------ 164
Query: 159 AIERGILPLLQRATLVQTFGQRNHVC---LNEGSITIPPYA----PPQKMQAHQIPPDTP 211
R +L L + A + F H L IT+PP PPQ+ +
Sbjct: 165 --SRNVLELKKNAIQIACFPLARHGAQEFLAHKDITMPPAGGSIDPPQRRR--------- 213
Query: 212 RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIF 271
+ YD ++ GY AR A+ W++ ++ ++ D + + F
Sbjct: 214 ------WNLAVYDSSSQ---GYAARDVPAS-WKSDESFVAGAVALD-----LQLLVTTRF 258
Query: 272 CLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV--LPFADAIPWEEIGMFVAEEDVPKLDT 329
CL LG + ++ AV GCIPVI + + LPF D + W + ++ + + +
Sbjct: 259 CLS-LGSSD-RHLVIPAVRSGCIPVIFSAGKLSDLPFQDILDWNSFAIVLSRDQLHQTKG 316
Query: 330 ILTSIPPEVILRKQRLLANPS-MKRAMLFPQPAQPGDAFHQILNGLARK 377
IL SI E ++ RL N + + M + P QP DAF+ +L L R+
Sbjct: 317 ILESIDEE---KRSRLQENGARAAKHMEWHSPPQPEDAFYMVLYQLWRR 362
>gi|3668093|gb|AAC61825.1| unknown protein [Arabidopsis thaliana]
Length = 460
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 90/378 (23%), Positives = 157/378 (41%), Gaps = 68/378 (17%)
Query: 39 KLKVYVYELPSKYNKKLL---------LKDP--RCLTHMFAAEIFMHRFLLSSP------ 81
+++VY+Y LP ++ L+ +K P T + MH + L S
Sbjct: 59 RVRVYMYNLPKRFTYGLIEQHSIARGGIKKPVGDVTTLKYPGHQHMHEWYLFSDLNQPEV 118
Query: 82 -------VRTLNPEEADWFYTPIYPTCDL-TPTGLPLP----FKSPRMMRSAIQLISSNW 129
VR +P +AD FY P++ + L G P+ + +M ++ +
Sbjct: 119 DRSGSPIVRVSDPADADLFYVPVFSSLSLIVNAGRPVEAGSGYSDEKMQEGLVEWLEGQ- 177
Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGS 189
+W R G DH V+P + A+ R + + LV FG+ + ++GS
Sbjct: 178 EWWRRNAGRDH--VIPAG--------DPNALYRILDRVKNAVLLVSDFGR---LRPDQGS 224
Query: 190 IT---IPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENF 246
+ PY+ + +I + R+ ++F G Y +GG R +++
Sbjct: 225 FVKDVVIPYSHRVNLFNGEIGVED-RNTLLFFMGNRYR----KDGG----KVRDLLFQVL 275
Query: 247 KNNPLFDISTDHPTTYYED-------MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIA 299
+ D++ H T E+ M + FCL P G P + RL +++V C+P+I++
Sbjct: 276 EKED--DVTIKHGTQSRENRRAATKGMHTSKFCLNPAGDTPSACRLFDSIVSLCVPLIVS 333
Query: 300 DDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAML 356
D I LPF D I + + +FV + L +L I + IL QR + + R L
Sbjct: 334 DSIELPFEDVIDYRKFSIFVEANAALQPGFLVQMLRKIKTKKILEYQREMKS-KFDRTKL 392
Query: 357 FPQPAQPGDAFHQILNGL 374
+P ++H LN L
Sbjct: 393 LMSMHKPNRSYHFHLNNL 410
>gi|302795426|ref|XP_002979476.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
gi|300152724|gb|EFJ19365.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
Length = 434
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 92/394 (23%), Positives = 163/394 (41%), Gaps = 63/394 (15%)
Query: 40 LKVYVYELPSKYN---------KKLLLKDPRCLTH-----------MFAAEIFMHRFLLS 79
L++++++LPSK+ + + ++ + L + + E R L
Sbjct: 62 LRIFMHDLPSKFTYGVVERYLRSRGIARNDKRLRYPGTQHSAEWWLFYDLEQGEDRRLSD 121
Query: 80 SPVRTLNPEEADWFYTPIYPTCDLT-PTGLPLPFKSPRMMRSAIQLISS---NWPYWNRT 135
S VR +NP+EAD FY P + + L G + P + + + W +
Sbjct: 122 SSVRVMNPQEADVFYVPFFSSLSLVVGNGKSEDDEDPYSDEDTQEELMAWLEEQESWKKN 181
Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
+G DH + Q+ A++R L L+ F + + + PY
Sbjct: 182 KGRDHVVIC----------QDPNALKRLRDRLKNTVLLLSDFERFKPDQASLVKDVVLPY 231
Query: 196 APPQKMQAHQIPPDT-PRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDI 254
++ ++ T R ++F G Y EGG R +++ P D+
Sbjct: 232 T--HRIDSYSNENVTLDRDTLLFFMGNRYR----KEGG----KIRDQLFQVLDVEP--DM 279
Query: 255 STDHPTTYYED-------MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFA 307
H T E MQ + FCL P G P + RL +A+V C+PVI++DDI LPF
Sbjct: 280 VMKHGTQSREGRRLAKVGMQTSKFCLHPAGDTPSACRLFDAIVSVCVPVIVSDDIELPFE 339
Query: 308 DAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPG 364
D + + E +FV + + L + L SI P+++ +KQ+ L + + + + G
Sbjct: 340 DELDYSEFAIFVPSINALEPGYLGSYLRSISPDLLKQKQQRL-----REVRKYFEYEEKG 394
Query: 365 DAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSG 398
A I + +KLP + T + + + +SG
Sbjct: 395 GAVEMIWLQVKKKLPFIR-TMINRDKRLVERSSG 427
>gi|359473738|ref|XP_002272591.2| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
At5g25310-like [Vitis vinifera]
Length = 437
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 93/383 (24%), Positives = 148/383 (38%), Gaps = 59/383 (15%)
Query: 38 GKLKVYVYELPSKYNKKLLLK------------------DPRCLTHMFAAEIFMHRFLL- 78
G L VY+Y+LP +++ +L + L + E +M LL
Sbjct: 51 GPLMVYMYDLPRRFHVGMLRRRSPADESPVTAENLPPWPSNSGLKKQHSVEYWMMASLLY 110
Query: 79 --------SSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLP---FKSPRMMRSAIQLISS 127
VR +PE AD F+ P + + G + + R ++ I I
Sbjct: 111 DGGGGNETREAVRVWDPEMADAFFVPFFSSLSFNTHGHNMTDPDTEFDRQLQIDILKILR 170
Query: 128 NWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNE 187
YW R+ G DH V+P F + E+ + IL +V FG+ N
Sbjct: 171 ESKYWQRSGGRDH--VIPMHHPNAFRFFREQ-VNTSIL-------IVADFGRYPKEISNL 220
Query: 188 GSITIPPYAPPQKMQAHQIPPD--TPRSIFVYFRGLFYDVNNDPEGGYYAR------GAR 239
+ PY PD R+ ++FRG + D EG + G
Sbjct: 221 RKDVVAPYVHVVDSFTDDNSPDPYESRTTLLFFRGR--TIRKD-EGIVRDKLVKLLAGXD 277
Query: 240 AAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIA 299
+ +F + T + M+ + FCL P G P S RL +A+V C+PVI++
Sbjct: 278 DYLQLHFHHRSYLSFLVXQST---QGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVS 334
Query: 300 DDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAML 356
D I LP+ D I + + +F ++++ + + L IP E + R L S
Sbjct: 335 DQIELPYEDEIDYTQFSIFFSDKEALEPGYMIEQLRQIPKERWVEMWRHLKYISHHYE-- 392
Query: 357 FPQPAQPGDAFHQILNGLARKLP 379
F P + GDA + + KLP
Sbjct: 393 FQYPPKKGDAIDMLWRQVKHKLP 415
>gi|302814814|ref|XP_002989090.1| hypothetical protein SELMODRAFT_447545 [Selaginella moellendorffii]
gi|300143191|gb|EFJ09884.1| hypothetical protein SELMODRAFT_447545 [Selaginella moellendorffii]
Length = 1522
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 94/348 (27%), Positives = 144/348 (41%), Gaps = 49/348 (14%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
+ KVYVY + + L +D + +F+ L S V T +PE AD F P
Sbjct: 55 RFKVYVYPMIQNASAPDL-RDGKAARPGSIDRVFVDSLLASGFV-TDDPEAADLFLLP-- 110
Query: 99 PTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
+ P P ++S IQ + +PYW R+ GADHFFV HD + +
Sbjct: 111 ASISAIWKKRPDPKGIAHSLKSYIQQLRDLYPYWQRSLGADHFFVSCHDITSDW------ 164
Query: 159 AIERGILPLLQRATLVQTFGQRNHVC---LNEGSITIPPYA----PPQKMQAHQIPPDTP 211
R +L L + A + F H L IT+PP PPQ+ + +
Sbjct: 165 --SRNVLELKKNAIQIACFPLARHGAQEFLAHKDITMPPAGGSIDPPQRRRWN------- 215
Query: 212 RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIF 271
+ VY D YA A W++ ++ + D + + F
Sbjct: 216 --LAVY----------DSSSQGYAASDVPASWKSDESFVAGAVKMD-----LQLLVTTRF 258
Query: 272 CLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV--LPFADAIPWEEIGMFVAEEDVPKLDT 329
CL LG + ++ AV GCIPVI + + LPF D + W + ++ + + +
Sbjct: 259 CLS-LGSSD-RHLVIPAVRSGCIPVIFSAGKLSDLPFQDILDWNSFAIVLSRDQLHQTKA 316
Query: 330 ILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARK 377
IL SI E + R Q A + + M + P QP DAF+ +L L R+
Sbjct: 317 ILESIDEEKLSRLQENGARAA--KHMEWHSPPQPEDAFYMVLYQLWRR 362
>gi|449465854|ref|XP_004150642.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
sativus]
Length = 312
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 73/305 (23%), Positives = 124/305 (40%), Gaps = 82/305 (26%)
Query: 69 AEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSN 128
AE ++ LL+SP T +P++A F+ P P S R + I+ + ++
Sbjct: 65 AESLFYKSLLNSPYTTHDPDQAHLFFIPFSPHI------------STRSLARLIRTLRTD 112
Query: 129 WPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG 188
PYWNRT GADHFF+ G +R ++ L + A V +F +
Sbjct: 113 LPYWNRTLGADHFFLSSSGIGYI--------SDRNVVELKKNAIQVSSFPVSPGKFIPHK 164
Query: 189 SITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN 248
+++PP + +PP TP
Sbjct: 165 DVSLPPVS--------TLPPRTP------------------------------------- 179
Query: 249 NPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV--LPF 306
+ Y + + ++ FCL S + EA+ FGC+PV+I+D + LP
Sbjct: 180 -----------SCYGDKLAKSDFCLFEYEGGDVSG-IGEALRFGCVPVVISDRWIQDLPL 227
Query: 307 ADAIPWEEIGMFVA-EEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
D + WEE+ +FVA + + +L + E + R ++L A + + ++ P QP D
Sbjct: 228 MDVVRWEEMAVFVAGGGGIEGVKKVLRRVDGERLDRMKKLGA--AAAQHFVWNSPPQPLD 285
Query: 366 AFHQI 370
AF+ +
Sbjct: 286 AFNTV 290
>gi|302792172|ref|XP_002977852.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
gi|300154555|gb|EFJ21190.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
Length = 434
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 94/399 (23%), Positives = 158/399 (39%), Gaps = 73/399 (18%)
Query: 40 LKVYVYELPSKYN---------KKLLLKDPRCLTH-----------MFAAEIFMHRFLLS 79
L++++Y+LPSK+ + + ++ + L + + E R L
Sbjct: 62 LRIFMYDLPSKFTYGVVERYLRSRGIARNDKRLRYPGTQHSAEWWLFYDLEQGEDRRLSD 121
Query: 80 SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNW----PYWNRT 135
+ VR +NP+EAD FY P + + L IQ W W +
Sbjct: 122 ASVRVMNPQEADVFYVPFFSSLSLVVGNGKSEDDEDPYSDEDIQEELMAWLEEQESWKKN 181
Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
+G DH + Q+ A++R L L+ F + + + PY
Sbjct: 182 KGRDHVVIC----------QDPNALKRLRDRLKNTVLLLSDFERFKPDQASLVKDVVLPY 231
Query: 196 APPQKMQAHQIPP------DTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN 249
H+I R ++F G Y EGG R +++
Sbjct: 232 T-------HRIDSYFNENVTLDRDTLLFFMGNRYR----KEGG----KIRDQLFQVLDVE 276
Query: 250 PLFDISTDHPTTYYED-------MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 302
P D+ H T E MQ + FCL P G P + RL +A+V C+PVI++DDI
Sbjct: 277 P--DMVMKHGTQSREGRRLAKVGMQTSKFCLHPAGDTPSACRLFDAIVSVCVPVIVSDDI 334
Query: 303 VLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQ 359
LPF D + + E +FV + + L + L SI P+++ +KQ+ L + + +
Sbjct: 335 ELPFEDELDYSEFAIFVPSINALEPGYLGSYLRSISPDLLKQKQQRL-----REVRKYFE 389
Query: 360 PAQPGDAFHQILNGLARKLPHDKSTYLKPGGKFLNWTSG 398
+ G A I + +KLP + T + + + +SG
Sbjct: 390 YEEKGGAVEMIWLEVKKKLPFIR-TMINRDKRLVERSSG 427
>gi|224093262|ref|XP_002309856.1| predicted protein [Populus trichocarpa]
gi|222852759|gb|EEE90306.1| predicted protein [Populus trichocarpa]
Length = 326
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 91/356 (25%), Positives = 143/356 (40%), Gaps = 63/356 (17%)
Query: 39 KLKVYVYE---------LPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEE 89
+ KV+VY+ LPSK+N + + +E + L SP T + E
Sbjct: 4 EFKVFVYQDRNITKHCDLPSKHNSR------------YESEEYFFSNLKMSPFLTDDAAE 51
Query: 90 ADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFG 149
A F+ PI+ T + + ++ + S +PYWNRT GADHFFV D
Sbjct: 52 AHLFFIPIFSQ---KMTKKRSEDERAIAVEDFVKSLISKYPYWNRTLGADHFFVTCADIN 108
Query: 150 ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQ--AHQIP 207
A R I L++ + V N + +++P PP + + I
Sbjct: 109 VT-------ATAR-IANLMKNSIKVMCTPSYNDEYVPHKDVSLPQRVPPLALTPAGNNI- 159
Query: 208 PDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPT------T 261
T R ++RGL NN R WEN + LF P+
Sbjct: 160 --TNRITLAFWRGL----NNSD-----IRQKLLEAWEN--DLELFIQKGRKPSLEQGDLV 206
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVE-AVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
++E + +C+CP G P R + A+ +GC+PVI++D LPF D + W + + +
Sbjct: 207 HHEAFNNSKYCICPGG--PELDRTIALAIHYGCVPVIMSDYYDLPFKDILDWRKFSIILE 264
Query: 321 EEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPG--DAFHQILNGL 374
E V L L E++ + R + ++ F P DAFH + L
Sbjct: 265 ESQVYYLREHL----KEMLEHEYRAMQTNTVMVRKHFQWNLVPAKYDAFHMTMYDL 316
>gi|3176669|gb|AAC18793.1| End is cut off [Arabidopsis thaliana]
Length = 440
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 97/407 (23%), Positives = 158/407 (38%), Gaps = 87/407 (21%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRC-------------------LTHMFAAEIFMHRFLLS- 79
L+V++Y+LP K+N + + DP + + E ++ LL+
Sbjct: 53 LRVFMYDLPRKFN--IAMMDPHSSDVEPITGKNLPSWPQTSGIKRQHSVEYWLMASLLNG 110
Query: 80 -----SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLP---FKSPRMMRSAIQLISSNWPY 131
+R +P+ AD FY P + + G + + R+++ + N Y
Sbjct: 111 GEDENEAIRVFDPDLADVFYVPFFSSLSFNTHGKNMTDPDTEFDRLLQVELMEFLENSKY 170
Query: 132 WNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSIT 191
WNR+ G DH V+P F + ++ + IL +V FG+ +
Sbjct: 171 WNRSGGKDH--VIPMTHPNAFRFLRQQ-VNASIL-------IVVDFGRYSKDMARLSKDV 220
Query: 192 IPPY-----APPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGAR------- 239
+ PY + ++ P R+ +YFRG + EG R +
Sbjct: 221 VSPYVHVVESLNEEGDDGMGDPFEARTTLLYFRG---NTVRKDEGKIRLRLEKLLAGNSD 277
Query: 240 ------AAVWENFKNNPLFDISTDHPTTYY-----------------EDMQRAIFCLCPL 276
A +N K + L + Y E M+ + FCL P
Sbjct: 278 VHFEKSVATTQNIKVSDLEQNRGRYLMLTYQNDSNCSTCVLFCYQSTEGMRSSKFCLHPA 337
Query: 277 GWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSI-- 334
G P S RL +A+V CIPVII+D I LPF D I + E +F + ++ + IL ++
Sbjct: 338 GDTPSSCRLFDAIVSHCIPVIISDKIELPFEDEIDYSEFSLFFSIKESLEPGYILNNLRQ 397
Query: 335 -PPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPH 380
P E L + L N S F P + DA +N L R++ H
Sbjct: 398 FPKEKWLEMWKRLKNVS--HHFEFQYPPKREDA----VNMLWRQVKH 438
>gi|224144904|ref|XP_002325457.1| predicted protein [Populus trichocarpa]
gi|222862332|gb|EEE99838.1| predicted protein [Populus trichocarpa]
Length = 463
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 86/378 (22%), Positives = 149/378 (39%), Gaps = 58/378 (15%)
Query: 40 LKVYVYELPSKYNKKLL----LKD-------------PRCLTHMFAAEIFMHRFLLS--- 79
LK+++Y+LPS+++ LL L D P L + E ++ LL+
Sbjct: 73 LKIFMYDLPSEFHFGLLDLKPLGDSVWPDLRAKVPEYPGGLNLQHSIEYWLTLDLLASEV 132
Query: 80 -------SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSP---RMMRSAIQLISSNW 129
S VR N EAD + P + + P + ++++ + ++
Sbjct: 133 PGIPRAGSAVRVRNSSEADVIFVPFFSSLCYNRYSKVNPHQKKSKDKLLQEKLVKFLTSQ 192
Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGS 189
W R+ G DH + H ++ + L ++ FG+ N
Sbjct: 193 KEWKRSGGRDHVLLAHH----------PNSMLDARVKLWPAIFILADFGRYPPNIANVAK 242
Query: 190 ITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN 249
I PY + + R +YF+G Y +GG+ AR ++ K+
Sbjct: 243 DVIAPYKHVIRSYVNDSSNFDSRPTLLYFQGAIY----RKDGGF----ARQELFYLLKDE 294
Query: 250 P-----LFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL 304
+ D + M + FCL G P S RL +A+ C+PVII+DDI L
Sbjct: 295 KEVHFQFGSVQKDGVGKASQGMHSSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDDIEL 354
Query: 305 PFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPA 361
P+ + + + + +FV D + L ++ SI + R + L ++ F P+
Sbjct: 355 PYENVLDYSQFCIFVRTSDAVREKFLVNLIRSIKKDEWTRMWKRL--KEVENFFEFQYPS 412
Query: 362 QPGDAFHQILNGLARKLP 379
+ GDA I +ARK+P
Sbjct: 413 REGDAVQMIWQAVARKVP 430
>gi|357475227|ref|XP_003607899.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
gi|355508954|gb|AES90096.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
Length = 481
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 97/387 (25%), Positives = 160/387 (41%), Gaps = 76/387 (19%)
Query: 40 LKVYVYELPSKYNKKL-----LLKDPRCLT------------HMFAAEIFMHRFLL---- 78
+KVY+Y+LP ++ + L + R T H AE ++ L
Sbjct: 92 VKVYLYDLPKRFTYGVIHHHSLARGSRANTDEKDVTSFKYPGHQHMAEWYLFSDLSRPDS 151
Query: 79 ----SSPVRTLNPEEADWFYTPIYPTCDLT-----PTGL-PLPFK---SPRMMRSAIQLI 125
S VR +PEEAD F+ P + + L P G +P K S + A+
Sbjct: 152 ERSGSPVVRVSDPEEADLFFVPFFSSLSLIVNPVRPAGSGSVPEKTAYSDEENQEALMEW 211
Query: 126 SSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCL 185
+W R++G DH V + A+ R + + LV FG+ +
Sbjct: 212 LEMQEFWKRSKGRDHVIVA----------SDPNAMYRVVDRVKNCVLLVSDFGR---LRP 258
Query: 186 NEGSIT---IPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAV 242
++GS+ I PY+ + I D R+ ++F G Y EGG R +
Sbjct: 259 DQGSLVKDVIVPYSHRIRTYDGGIGVD-KRNTLLFFMGNRYR----KEGG----KIRDTL 309
Query: 243 WENFKNNPLFDISTDHPTTYYED-------MQRAIFCLCPLGWAPWSPRLVEAVVFGCIP 295
++ + D+ H E M + FCL P G P + RL +A+V C+P
Sbjct: 310 FQILEKED--DVIIKHGAQSRESRRAASQGMHTSKFCLHPAGDTPSACRLFDAIVSLCVP 367
Query: 296 VIIADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMK 352
VI++D I LPF D I + +I +FV + L +IL + P+ I+ Q+ +K
Sbjct: 368 VIVSDSIELPFEDTIDYRKIAVFVETAAAIQPGYLVSILRGMAPDRIVEYQK-----ELK 422
Query: 353 RAMLFPQPAQPGDAFHQILNGLARKLP 379
+ + +P ++I +++KLP
Sbjct: 423 EVKRYFKYDEPDGTVNEIWRQISKKLP 449
>gi|449503277|ref|XP_004161922.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
sativus]
Length = 310
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 73/305 (23%), Positives = 123/305 (40%), Gaps = 82/305 (26%)
Query: 69 AEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSN 128
AE + LL+SP T +P++A F+ P P S R + I+ + ++
Sbjct: 63 AESLFFKSLLNSPYATHDPDQAHLFFIPFSPHI------------STRSLARLIRTLRTD 110
Query: 129 WPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG 188
PYWNRT GADHFF+ G +R ++ L + A V +F +
Sbjct: 111 LPYWNRTLGADHFFLSSSGIGYI--------SDRNVVELKKNAIQVSSFPVSPGKFIPHK 162
Query: 189 SITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN 248
+++PP + +PP TP
Sbjct: 163 DVSLPPVS--------TLPPRTP------------------------------------- 177
Query: 249 NPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV--LPF 306
+ Y + + ++ FCL S + EA+ FGC+PV+I+D + LP
Sbjct: 178 -----------SCYGDKLAKSDFCLFEYEGGDVSG-IGEALRFGCVPVVISDRWIQDLPL 225
Query: 307 ADAIPWEEIGMFVA-EEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
D + WEE+ +FVA + + +L + E + R ++L A + + ++ P QP D
Sbjct: 226 MDVVRWEEMAVFVAGGGGIEGVKKVLRRVDGERLDRMKKLGA--AAAQHFVWNSPPQPLD 283
Query: 366 AFHQI 370
AF+ +
Sbjct: 284 AFNTV 288
>gi|194707922|gb|ACF88045.1| unknown [Zea mays]
Length = 240
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 60/185 (32%), Positives = 87/185 (47%), Gaps = 24/185 (12%)
Query: 191 TIPPYAPPQKMQAHQ--IPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN 248
I PY P + H+ + + RSI ++FRG GG R+ + E K+
Sbjct: 19 VILPYVPNVDLCDHKCVLETQSKRSILLFFRGRL----KRNAGGKI----RSKLVEELKS 70
Query: 249 NPLFDISTDHPTT-------YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 301
DI + +T + M+++ FCL P G P S RL +A+V GCIPVII+D+
Sbjct: 71 AK--DIVIEEGSTGAQGKAAAQDGMRKSFFCLSPAGDTPSSARLFDAIVSGCIPVIISDE 128
Query: 302 IVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFP 358
+ LPF + + EI +FV+ D + L L I + I Q L S R L+
Sbjct: 129 LELPFEGILDYREIALFVSASDAVQPGWLLKYLRGINAKRIREIQSNLVKYS--RHFLYS 186
Query: 359 QPAQP 363
PAQP
Sbjct: 187 SPAQP 191
>gi|222616547|gb|EEE52679.1| hypothetical protein OsJ_35064 [Oryza sativa Japonica Group]
Length = 528
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 75/308 (24%), Positives = 126/308 (40%), Gaps = 29/308 (9%)
Query: 82 VRTLNPEEADWFYTPIYPTCDLTPTGLPLP----FKSPRMMRSAIQLISSNWPYWNRTEG 137
VR + +AD + P + + +P + + ++ + + P W R+ G
Sbjct: 221 VRVADSRDADVVFVPFFASLSYNRHSRVVPPEKVSRDKELQEKLVRYLMAQ-PEWKRSGG 279
Query: 138 ADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAP 197
ADH +V H + H + + P++ ++ FG+ + + I PY
Sbjct: 280 ADHV-IVAHHPNSLLHARSV------LFPVV---FVLSDFGRYHPRVASLEKDVIAPYKH 329
Query: 198 PQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNP----LFD 253
K + R +YFRG + EGG R ++ K+ F
Sbjct: 330 MAKTFVNDSAGFDDRPTLLYFRGAIFR----KEGG----NIRQELYYMLKDEKDVYFAFG 381
Query: 254 ISTDH-PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 312
DH + + M + FCL G P S RL +A+V C+PVII+DDI LP+ DA+ +
Sbjct: 382 SVQDHGASKASKGMHASKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDDIELPYEDALDY 441
Query: 313 EEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLAN-PSMKRAMLFPQPAQPGDAFHQIL 371
+ +FV D K ++ I + R+ + + + P+Q DA I
Sbjct: 442 SKFSIFVRSSDAVKKGYLMRLIRGVSKHQWTRMWNRLKEVDKHFEYQYPSQKDDAVQMIW 501
Query: 372 NGLARKLP 379
LARK+P
Sbjct: 502 QALARKVP 509
>gi|27817890|dbj|BAC55656.1| exostosin family protein-like protein [Oryza sativa Japonica Group]
Length = 453
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 94/393 (23%), Positives = 154/393 (39%), Gaps = 76/393 (19%)
Query: 36 PVGKLKVYVYELPSKYNKKLLLK--------------------DPRCLTHMFAAEIFMHR 75
P +K+Y+Y+LP+K+ ++ + R H +AE ++ +
Sbjct: 53 PPAPVKIYMYDLPAKFTYGVVRSYMAARARAGAAGAASAIPDDELRYPGHQHSAEWWLFK 112
Query: 76 FL-----LSSPV-RTLNPEEADWFYTPIYPTCDLTPTGL------------PLPFKSPRM 117
L L PV R +P +AD FY P + + L + P S
Sbjct: 113 DLRRRGPLDRPVARVDDPADADLFYVPFFSSLSLVVNPIRPSAAAANASDAAEPAYSDES 172
Query: 118 MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTF 177
+ + + PYW R +G DH F+ Q+ A+ R + + L+ F
Sbjct: 173 TQEELLVWLERQPYWRRHQGRDHVFIC----------QDPNALYRVVDRISNAVLLISDF 222
Query: 178 GQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDT-PRSIFVYFRGLFYDVNNDPEGGYYAR 236
G+ + I PYA ++ + Q R ++F G Y EGG
Sbjct: 223 GRLRSEQASLVKDVILPYA--HRINSFQGDVGVESRPSLLFFMGNRYR----KEGG---- 272
Query: 237 GARAAVWENFKNNPLFDISTDHPTTYYED-------MQRAIFCLCPLGWAPWSPRLVEAV 289
R +++ +N D+ H E M + FCL P G P + RL +A+
Sbjct: 273 KVRDTLFQVLENEA--DVIIKHGAQSRESRRMATRGMHSSKFCLHPAGDTPSACRLFDAL 330
Query: 290 VFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLL 346
V C+PVI++D I LPF D I + I +FV + L + L I + IL QR
Sbjct: 331 VSLCVPVIVSDYIELPFEDVIDYRNISIFVETSKAVQPGFLTSTLRGISSQRILEYQR-- 388
Query: 347 ANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
+K+ + + P +QI + ++ K P
Sbjct: 389 ---EIKKVKHYFEYEDPNGPVNQIWHQVSSKAP 418
>gi|222637299|gb|EEE67431.1| hypothetical protein OsJ_24780 [Oryza sativa Japonica Group]
Length = 500
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 94/393 (23%), Positives = 154/393 (39%), Gaps = 76/393 (19%)
Query: 36 PVGKLKVYVYELPSKYNKKLLLK--------------------DPRCLTHMFAAEIFMHR 75
P +K+Y+Y+LP+K+ ++ + R H +AE ++ +
Sbjct: 100 PPAPVKIYMYDLPAKFTYGVVRSYMAARARAGAADAASAIPDDELRYPGHQHSAEWWLFK 159
Query: 76 FL-----LSSPV-RTLNPEEADWFYTPIYPTCDLTPTGL------------PLPFKSPRM 117
L L PV R +P +AD FY P + + L + P S
Sbjct: 160 DLRRRGPLDRPVARVDDPADADLFYVPFFSSLSLVVNPIRPSAAAANASDAAEPAYSDES 219
Query: 118 MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTF 177
+ + + PYW R +G DH F+ Q+ A+ R + + L+ F
Sbjct: 220 TQEELLVWLERQPYWRRHQGRDHVFIC----------QDPNALYRVVDRISNAVLLISDF 269
Query: 178 GQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDT-PRSIFVYFRGLFYDVNNDPEGGYYAR 236
G+ + I PYA ++ + Q R ++F G Y EGG
Sbjct: 270 GRLRSEQASLVKDVILPYA--HRINSFQGDVGVESRPSLLFFMGNRYR----KEGG---- 319
Query: 237 GARAAVWENFKNNPLFDISTDHPTTYYED-------MQRAIFCLCPLGWAPWSPRLVEAV 289
R +++ +N D+ H E M + FCL P G P + RL +A+
Sbjct: 320 KVRDTLFQVLENEA--DVIIKHGAQSRESRRMATRGMHSSKFCLHPAGDTPSACRLFDAL 377
Query: 290 VFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLL 346
V C+PVI++D I LPF D I + I +FV + L + L I + IL QR
Sbjct: 378 VSLCVPVIVSDYIELPFEDVIDYRNISIFVETSKAVQPGFLTSTLRGISSQRILEYQR-- 435
Query: 347 ANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
+K+ + + P +QI + ++ K P
Sbjct: 436 ---EIKKVKHYFEYEDPNGPVNQIWHQVSSKAP 465
>gi|115472841|ref|NP_001060019.1| Os07g0567000 [Oryza sativa Japonica Group]
gi|113611555|dbj|BAF21933.1| Os07g0567000 [Oryza sativa Japonica Group]
Length = 500
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 94/393 (23%), Positives = 154/393 (39%), Gaps = 76/393 (19%)
Query: 36 PVGKLKVYVYELPSKYNKKLLLK--------------------DPRCLTHMFAAEIFMHR 75
P +K+Y+Y+LP+K+ ++ + R H +AE ++ +
Sbjct: 100 PPAPVKIYMYDLPAKFTYGVVRSYMAARARAGAAGAASAIPDDELRYPGHQHSAEWWLFK 159
Query: 76 FL-----LSSPV-RTLNPEEADWFYTPIYPTCDLTPTGL------------PLPFKSPRM 117
L L PV R +P +AD FY P + + L + P S
Sbjct: 160 DLRRRGPLDRPVARVDDPADADLFYVPFFSSLSLVVNPIRPSAAAANASDAAEPAYSDES 219
Query: 118 MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTF 177
+ + + PYW R +G DH F+ Q+ A+ R + + L+ F
Sbjct: 220 TQEELLVWLERQPYWRRHQGRDHVFIC----------QDPNALYRVVDRISNAVLLISDF 269
Query: 178 GQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDT-PRSIFVYFRGLFYDVNNDPEGGYYAR 236
G+ + I PYA ++ + Q R ++F G Y EGG
Sbjct: 270 GRLRSEQASLVKDVILPYA--HRINSFQGDVGVESRPSLLFFMGNRYR----KEGG---- 319
Query: 237 GARAAVWENFKNNPLFDISTDHPTTYYED-------MQRAIFCLCPLGWAPWSPRLVEAV 289
R +++ +N D+ H E M + FCL P G P + RL +A+
Sbjct: 320 KVRDTLFQVLENEA--DVIIKHGAQSRESRRMATRGMHSSKFCLHPAGDTPSACRLFDAL 377
Query: 290 VFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLL 346
V C+PVI++D I LPF D I + I +FV + L + L I + IL QR
Sbjct: 378 VSLCVPVIVSDYIELPFEDVIDYRNISIFVETSKAVQPGFLTSTLRGISSQRILEYQR-- 435
Query: 347 ANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
+K+ + + P +QI + ++ K P
Sbjct: 436 ---EIKKVKHYFEYEDPNGPVNQIWHQVSSKAP 465
>gi|255542540|ref|XP_002512333.1| catalytic, putative [Ricinus communis]
gi|223548294|gb|EEF49785.1| catalytic, putative [Ricinus communis]
Length = 434
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 82/373 (21%), Positives = 146/373 (39%), Gaps = 48/373 (12%)
Query: 40 LKVYVYELPSKYNKKLL-------------------------LKDPRCLTHMFAAEIFMH 74
LKVY+Y+LP +++ ++ L+ + + A +
Sbjct: 56 LKVYMYDLPRRFHVGMMDHGGDAKNDTPVTGENLPTWPKNSGLRKQHSVEYWLMASLLYE 115
Query: 75 RFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPL---PFKSPRMMRSAIQLISSNWPY 131
VR L+PE+AD F+ P + + G + + R ++ + + Y
Sbjct: 116 GADEREAVRVLDPEKADAFFVPFFSSLSFNTHGHTMTDPETEIDRQLQVDVIDMLYKSKY 175
Query: 132 WNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSIT 191
W ++ G DH V+P F + ++ + IL +V FG+
Sbjct: 176 WQKSGGRDH--VIPMTHPNAFRFLRQQ-LNASIL-------IVADFGRYPKSMSTLSKDV 225
Query: 192 IPPYAPP-QKMQAHQIP-PDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN 249
+ PY ++ P R+ ++FRG + EG A+ A+ + +
Sbjct: 226 VAPYVHVVDSFTDDEVSNPFESRTTLLFFRG---NTIRKDEGKVRAKLAKILTGYDDIHF 282
Query: 250 PLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 309
+ + E M+ + FCL P G P S RL +A+V C+PVI++D I LP+ D
Sbjct: 283 ERSSATAETIKASTEGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPYEDE 342
Query: 310 IPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDA 366
I + + +F + + + + L +P E L R L S+ F P + DA
Sbjct: 343 IDYSQFSVFFSVNEAIQPGYMVDQLRQLPKERWLEMWRKLK--SISHHFEFQYPPEKEDA 400
Query: 367 FHQILNGLARKLP 379
+ + KLP
Sbjct: 401 VDMLWREVKHKLP 413
>gi|356512371|ref|XP_003524893.1| PREDICTED: probable glycosyltransferase At3g07620-like [Glycine
max]
Length = 427
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 94/394 (23%), Positives = 153/394 (38%), Gaps = 54/394 (13%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHM-------------------FAAEIFMHRFLLS- 79
L+V++Y+LP ++N ++ D R M + E +M LL+
Sbjct: 52 LRVFMYDLPRRFNVGMI--DRRSAAEMPVTVEDWPAWPVNWGLKKQHSVEYWMMGSLLNV 109
Query: 80 ----SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSP-----RMMRSAIQLISSNWP 130
VR +PE A F+ P + + G + K P R ++ + +
Sbjct: 110 GGGREVVRVSDPELAQAFFVPFFSSLSFNTHGHTM--KDPATQIDRQLQVDLMELLKKSN 167
Query: 131 YWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSI 190
YW R+ G DH F + H F + ++ L + +V FG+ N
Sbjct: 168 YWQRSGGRDHVFPMTHP--NAFRFLRDQ--------LNESIQVVVDFGRYPRGMSNLNKD 217
Query: 191 TIPPYAPPQKMQAHQIPPD--TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN 248
+ PY P D RS ++FRG Y EG + A+ + +
Sbjct: 218 VVSPYVHVVDSFTDDEPQDPYESRSTLLFFRGRTY---RKDEGIVRVKLAKILAGYDDVH 274
Query: 249 NPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
+ ++ + M+ + FCL P G P S RL +A+V CIPVI++D I LPF D
Sbjct: 275 YERSVATEENIKASSKGMRSSKFCLHPAGDTPSSCRLFDAIVSHCIPVIVSDQIELPFED 334
Query: 309 AIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
I + + +F + ++ + + L P E R L S+ F P + D
Sbjct: 335 EIDYSQFSVFFSFKEALQPGYMIDQLRKFPKEKWTEMWRQLK--SISHHYEFRYPPKRED 392
Query: 366 AFHQILNGLARKLPHDK-STYLKPGGKFLNWTSG 398
A + + KLP K S + K +W G
Sbjct: 393 AVDMLWRQVKHKLPGVKLSVHRNRRLKIPDWWQG 426
>gi|168061581|ref|XP_001782766.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162665739|gb|EDQ52413.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 212
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 48/142 (33%), Positives = 71/142 (50%), Gaps = 10/142 (7%)
Query: 82 VRTLNPEEADWFYTPIYPTCDLTP-TGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADH 140
+ TL+P EAD+F+ +Y +C +P TG P ++ ++M +A+ +S+ +WNR+ G D
Sbjct: 50 IGTLDPYEADYFFILVYVSCKFSPKTGTPWLGRARKLMEAAVNHVSTKMEFWNRSGGRDQ 109
Query: 141 FFVVPHDFGACFHYQEEKAIERGIL-PLLQRATL-VQTFGQRNHVCLNEGSITIPPYAPP 198
F D CFH E +A I L + L VQ F H C I IPPY P
Sbjct: 110 IFAASQDNSVCFHTLETEAWNTRIYTKLFNPSNLGVQDF----HPCQAAEHIQIPPYVSP 165
Query: 199 QKMQAHQIPPDTPRSIFVYFRG 220
++ PR+IF +F G
Sbjct: 166 SVAASY---IKDPRNIFAFFSG 184
>gi|147818414|emb|CAN68950.1| hypothetical protein VITISV_039537 [Vitis vinifera]
Length = 488
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 91/382 (23%), Positives = 148/382 (38%), Gaps = 66/382 (17%)
Query: 40 LKVYVYELPSKYNKKLL------------LKD-----PRCLTHMFAAEIFMHRFLLSS-- 80
LKV++Y+LP++++ LL ++ P L + E ++ LLSS
Sbjct: 98 LKVFMYDLPAEFHFGLLDWAPAGESVWPDIRTKFPLYPGGLNLQHSIEYWLTLDLLSSEF 157
Query: 81 --------PVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPR------MMRSAIQLIS 126
+R N EAD + P + + G FK + +++ +
Sbjct: 158 PENQNARXAIRVHNSSEADVIFVPFFSSLSYNHFG---KFKGRQKKNENNLLQDKLVKFL 214
Query: 127 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLN 186
+ W R+EG DH + H ++ + L ++ FG+ N
Sbjct: 215 TAQEEWIRSEGRDHIIMAHH----------PNSMLDARMKLWPAIFILSDFGRYPPNIAN 264
Query: 187 EGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGY------YARGARA 240
G I PY K + R +YF+G Y +GG+ Y
Sbjct: 265 VGKDLIAPYKHVIKSFINDTSDFDSRPTLLYFQGAIYR----KDGGFIRQELFYLLKDEK 320
Query: 241 AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 300
V F N I+ + M + FCL G P S RL +A+ C+PVII+D
Sbjct: 321 DVHFAFGNTQGNGINKAS-----QGMHSSKFCLNIAGDTPSSNRLFDAIASHCVPVIISD 375
Query: 301 DIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLF 357
+I LP+ D + + + +FV D K L ++ SI + R R L ++ F
Sbjct: 376 EIELPYEDVLDYSQFCIFVRTSDALKDKFLXKLIRSIKKDEWTRMWRRL--KEVENFFEF 433
Query: 358 PQPAQPGDAFHQILNGLARKLP 379
P++ GDA I + RK+P
Sbjct: 434 QYPSKEGDAVQMIWQAITRKVP 455
>gi|297814922|ref|XP_002875344.1| hypothetical protein ARALYDRAFT_904882 [Arabidopsis lyrata subsp.
lyrata]
gi|297321182|gb|EFH51603.1| hypothetical protein ARALYDRAFT_904882 [Arabidopsis lyrata subsp.
lyrata]
Length = 64
Score = 79.0 bits (193), Expect = 4e-12, Method: Composition-based stats.
Identities = 34/63 (53%), Positives = 46/63 (73%)
Query: 268 RAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKL 327
R+IFCLCP GW WSPRLVE+ V GC+PV+IA+ I LPF++ + W EI + +A++D L
Sbjct: 2 RSIFCLCPFGWGIWSPRLVESAVSGCVPVVIANGIQLPFSEIVRWPEILLMMAKKDDMNL 61
Query: 328 DTI 330
I
Sbjct: 62 QKI 64
>gi|168060641|ref|XP_001782303.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666233|gb|EDQ52893.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 426
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 88/385 (22%), Positives = 155/385 (40%), Gaps = 62/385 (16%)
Query: 38 GKLKVYVYELPSKYNKKLLL------------------KDPRCLTHMFAAEIFMHRFLLS 79
+LKV++Y+LPS+++ ++ K P L + E ++ L++
Sbjct: 37 ARLKVFMYDLPSEFHYGMISEFTPKKNQIWPQNVSDIPKYPGGLYQQHSPEYWLISDLVT 96
Query: 80 SPV----------RTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSA---IQLIS 126
S + R ++A P + + + ++ R+ + LIS
Sbjct: 97 SDMPDRSTPCTVFRVKRWQDAGVILIPFFASLSYNKYSRAPLLRGKKLDRNQELQLNLIS 156
Query: 127 --SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVC 184
S+ P W +EG++H V+ H A H +E+ R ++ +V FG+
Sbjct: 157 FLSSQPAWRASEGSNHVVVIHHP-NAMLHTREKF---RSVM------FVVADFGRYGAEV 206
Query: 185 LNEGSITIPPYA---PPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAA 241
N + PY P R+ ++F+G EGG +
Sbjct: 207 ANMAKDVVAPYKHVIPNFDEDVDAALSFKSRTTLLFFQGAIAR----KEGGIIRQQLYEL 262
Query: 242 VWEN----FKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVI 297
+ E F N + T M+++ FCL G P S RL +AV C+P+I
Sbjct: 263 LGEEPNIIFSNGTTSNAGIRSATA---GMRQSKFCLHLAGDTPSSNRLFDAVASHCVPLI 319
Query: 298 IADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRA 354
I+++I LPF D + + E +FV D + + +L+++ + R L ++R
Sbjct: 320 ISNEIELPFEDVLNYSEFSLFVNSSDALRKGFVTDLLSNVGEKEWTRMHDRLRQ--VERH 377
Query: 355 MLFPQPAQPGDAFHQILNGLARKLP 379
+ PAQ GDA H +ARK+P
Sbjct: 378 FQYQLPAQIGDAVHMTWEAIARKVP 402
>gi|218201205|gb|EEC83632.1| hypothetical protein OsI_29363 [Oryza sativa Indica Group]
Length = 566
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 102/397 (25%), Positives = 148/397 (37%), Gaps = 87/397 (21%)
Query: 40 LKVYVYELPSKYNKKLL--------------LKDPRC-------LTHMFAAEIFMHRFLL 78
L+VY+Y+LP +++ +L DPR L + E ++ +L
Sbjct: 179 LRVYLYDLPPEFHFGMLGWDGKAAGAAWPDVAGDPRAVPRYPGGLNLQHSVEYWLTLDIL 238
Query: 79 SS-------------PVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLI 125
SS VR N AD F P + + RS QL
Sbjct: 239 SSTTSGDHRRRRPCTAVRVTNASLADVFLVPFFASLSYNRQSKSPHGGHGSGGRSDRQLQ 298
Query: 126 SSNWPY------WNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQ 179
Y W R GADH VVPH + + + +L FG+
Sbjct: 299 GELVRYLARREEWRRWGGADHL-VVPHHPNSMMDARRRLSAAMFVL---------SDFGR 348
Query: 180 RNHVCLNEGSITIPPYAPPQKMQAHQIPP----DTP----RSIFVYFRGLFYDVNNDPEG 231
N I PY H +P D+P R + YF+G + N G
Sbjct: 349 YPPDVANLRKDVIAPYK-------HVVPSLGDGDSPGFEQRPVLAYFQGAIHRKN----G 397
Query: 232 GYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAI-------FCLCPLGWAPWSPR 284
G R +++ K+ D+ + + ++RA FCL G P S R
Sbjct: 398 GR----VRQRLYQLIKDEK--DVHFTYGSVRQNGIRRATKGMASSKFCLNIAGDTPSSNR 451
Query: 285 LVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILR 341
L +A+V C+PVII+DDI LPF D + + + +FV D K L +L I E
Sbjct: 452 LFDAIVSHCVPVIISDDIELPFEDVLDYSDFCVFVRASDAVKRGFLLHLLRGISQEEWTA 511
Query: 342 KQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKL 378
R L + + P+QPGDA I +ARK+
Sbjct: 512 MWRRL--KEVAHHFEYQYPSQPGDAVQMIWGAVARKM 546
>gi|291242407|ref|XP_002741099.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 524
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 101/429 (23%), Positives = 161/429 (37%), Gaps = 84/429 (19%)
Query: 17 NTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYN------------------------ 52
N P + + G + P KVYVY+LP K+N
Sbjct: 75 NIPNVTFNTNKNGKHERNSP---YKVYVYDLPPKFNVNLSDCVKKVDGCFHLDEKMFGMG 131
Query: 53 KKLLLKDPRCL---THMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP--TCDLTPTG 107
+LL +D + TH F+ E+ +H +L S RT+NP+ AD FY P YP C +
Sbjct: 132 SRLLRRDSQFSYRNTHQFSLEVILHHKILHSRYRTMNPKHADIFYIPFYPGLAC-FCRSF 190
Query: 108 LPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPL 167
F + + ++ WP++ E PH +E + GIL
Sbjct: 191 QKSSFDLDLLHKELWHYLTEKWPFFEMRE--------PHAMALGKIEREHWSQRCGILKG 242
Query: 168 LQRATLVQTFG--------QRNHVCLNEGSITIPPYAP----PQKMQAHQIP----PDTP 211
+ A +Q G R++ N + + PY + +AH+
Sbjct: 243 NKYANRIQFIGIEEEYKTAYRSYFERNGQHVLVAPYPSYGHFIEGEEAHRNDFTKGAKYD 302
Query: 212 RSIFVYF----------RGLFYDVNNDPEGGY--YARGARA--AVW----ENFKNNPLFD 253
R++FV R + D Y Y VW E +NN ++
Sbjct: 303 RNVFVLMAASSRASHEVRKILQDQLTRTSKSYNTYVNTEEQYDGVWYVTPECGQNNAAWE 362
Query: 254 ISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD---IVLPFADAI 310
T+ T E M+ ++FCL P G +P ++V CIPVI + + PF +
Sbjct: 363 QGTELFTV--EWMRHSVFCLQPPGDSPTRKSFYDSVTAHCIPVIFLPEKARVKYPFQRLL 420
Query: 311 PWEEIGMFVAEE----DVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDA 366
+ E + E + P + +L IP + +++ Q L N S + +P DA
Sbjct: 421 NYSEFTVNFGLETFLLEKPDIVDLLRKIPEDYVIQLQNNLLNVSKRLQYSYPSNIDSDDA 480
Query: 367 FHQILNGLA 375
F IL+ L
Sbjct: 481 FQMILDELG 489
>gi|225439168|ref|XP_002268382.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase [Vitis
vinifera]
Length = 488
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 91/382 (23%), Positives = 148/382 (38%), Gaps = 66/382 (17%)
Query: 40 LKVYVYELPSKYNKKLL------------LKD-----PRCLTHMFAAEIFMHRFLLSS-- 80
LKV++Y+LP++++ LL ++ P L + E ++ LLSS
Sbjct: 98 LKVFMYDLPAEFHFGLLDWAPAGESVWPDIRTKFPLYPGGLNLQHSIEYWLTLDLLSSEF 157
Query: 81 --------PVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPR------MMRSAIQLIS 126
+R N EAD + P + + G FK + +++ +
Sbjct: 158 PENQNARAAIRVHNSSEADVIFVPFFSSLSYNHFG---KFKGRQKKNENNLLQDKLVKFL 214
Query: 127 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLN 186
+ W R+EG DH + H ++ + L ++ FG+ N
Sbjct: 215 TAQEEWIRSEGRDHIIMAHH----------PNSMLDARMKLWPAIFILSDFGRYPPNIAN 264
Query: 187 EGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGY------YARGARA 240
G I PY K + R +YF+G Y +GG+ Y
Sbjct: 265 VGKDVIAPYKHVIKSFINDTSDFDSRPTLLYFQGAIYR----KDGGFIRQELFYLLKDEK 320
Query: 241 AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 300
V F N I+ + M + FCL G P S RL +A+ C+PVII+D
Sbjct: 321 DVHFAFGNTQGNGINKAS-----QGMHSSKFCLNIAGDTPSSNRLFDAIASHCVPVIISD 375
Query: 301 DIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLF 357
+I LP+ D + + + +FV D K L ++ SI + R R L ++ F
Sbjct: 376 EIELPYEDVLDYSQFCIFVRTSDALKDKFLIKLIRSIKKDEWTRMWRRL--KEVENFFEF 433
Query: 358 PQPAQPGDAFHQILNGLARKLP 379
P++ GDA I + RK+P
Sbjct: 434 QYPSKEGDAVQMIWQAITRKVP 455
>gi|30686300|ref|NP_850241.1| Exostosin family protein [Arabidopsis thaliana]
gi|50253448|gb|AAT71926.1| At2g35100 [Arabidopsis thaliana]
gi|53828615|gb|AAU94417.1| At2g35100 [Arabidopsis thaliana]
gi|110737872|dbj|BAF00874.1| hypothetical protein [Arabidopsis thaliana]
gi|330253970|gb|AEC09064.1| Exostosin family protein [Arabidopsis thaliana]
Length = 447
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 91/383 (23%), Positives = 157/383 (40%), Gaps = 72/383 (18%)
Query: 39 KLKVYVYELPSKYNKKLL---------LKDP--RCLTHMFAAEIFMHRFLLSSP------ 81
+++VY+Y LP ++ L+ +K P T + MH + L S
Sbjct: 59 RVRVYMYNLPKRFTYGLIEQHSIARGGIKKPVGDVTTLKYPGHQHMHEWYLFSDLNQPEV 118
Query: 82 -------VRTLNPEEADWFYTPIYPTCDL-TPTGLPLP----FKSPRMMRSAIQLISSNW 129
VR +P +AD FY P++ + L G P+ + +M ++ +
Sbjct: 119 DRSGSPIVRVSDPADADLFYVPVFSSLSLIVNAGRPVEAGSGYSDEKMQEGLVEWLEGQ- 177
Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGS 189
+W R G DH V+P + A+ R + + LV FG+ + ++GS
Sbjct: 178 EWWRRNAGRDH--VIPAG--------DPNALYRILDRVKNAVLLVSDFGR---LRPDQGS 224
Query: 190 IT---IPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENF 246
+ PY+ + +I + R+ ++F G Y +GG R +++
Sbjct: 225 FVKDVVIPYSHRVNLFNGEIGVED-RNTLLFFMGNRYR----KDGG----KVRDLLFQVL 275
Query: 247 KNNPLFDISTDHPTTYYED-------MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIA 299
+ D++ H T E+ M + FCL P G P + RL +++V C+P+I++
Sbjct: 276 EKED--DVTIKHGTQSRENRRAATKGMHTSKFCLNPAGDTPSACRLFDSIVSLCVPLIVS 333
Query: 300 DDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAML 356
D I LPF D I + + +FV + L +L I + IL QR MK
Sbjct: 334 DSIELPFEDVIDYRKFSIFVEANAALQPGFLVQMLRKIKTKKILEYQR-----EMKSVRR 388
Query: 357 FPQPAQPGDAFHQILNGLARKLP 379
+ P A +I ++ KLP
Sbjct: 389 YFDYDNPNGAVKEIWRQVSHKLP 411
>gi|168000699|ref|XP_001753053.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695752|gb|EDQ82094.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 471
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 81/314 (25%), Positives = 131/314 (41%), Gaps = 39/314 (12%)
Query: 68 AAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTC---DLTPTGLPLPFKSPRMMRSAIQL 124
A++F R L S T + A F P DL PT + +R ++
Sbjct: 163 TADLFF-RLLTRSEFVTEKAKRAQLFLLPFSIDVLWVDLGPTQVA------EKLRRYLEK 215
Query: 125 ISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR-NHV 183
+ +N+PYW + GADHF++ H F E + R IL L + + R N
Sbjct: 216 VRTNYPYWESSLGADHFYLSCHAF-------EHNSKHRNILELGKNSIQAACAPLRHNQK 268
Query: 184 CLNEGSITIPPYAPPQKMQAHQ-IPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAV 242
+ P Y P + Q I R+ YF G DV ++
Sbjct: 269 FYPHKDVVFPQYKPVGEEDVRQAILGRRNRTSLAYFSGC-PDVTTPLLSAFHT------- 320
Query: 243 WENFKNNPLFDISTD---HPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIA 299
WE +P F + + H + Y ++ R+ FC+ L +S LV+A+ FGC+PV+++
Sbjct: 321 WE---TDPDFIVEANPSPHRLSVYRNLARSRFCVSVLPHDTFS--LVDALRFGCVPVLLS 375
Query: 300 DDIV--LPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLF 357
LPF + W + + + ED+P L IL ++ R+ + L + ++K + +
Sbjct: 376 KLTFHDLPFQGFLNWGQFAVVLGIEDLPNLKQILANV-SSTKHREMQYLGHQAIKH-LEW 433
Query: 358 PQPAQPGDAFHQIL 371
P DAFH L
Sbjct: 434 NNPPVAYDAFHMTL 447
>gi|297815694|ref|XP_002875730.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297321568|gb|EFH51989.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 479
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 84/344 (24%), Positives = 126/344 (36%), Gaps = 58/344 (16%)
Query: 83 RTLNPEEADWFYTPIYPTCDLTPTGLPLPF-KSPRMMRSAIQLIS--SNWPYWNRTEGAD 139
R N EAD + P + + P K+ R +L++ + W R+ G D
Sbjct: 158 RVYNSSEADVIFVPFFSSLSYNRFSKVNPHQKTSRNKDLQGKLVTFLTGQEEWKRSGGRD 217
Query: 140 HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQ 199
H + H ++ L ++ FG+ N I PY
Sbjct: 218 HVVLAHH----------PNSMLDARNKLFPAMFILSDFGRYPPTVANVEKDIIAPYKHVI 267
Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
K + R I +YF+G Y +GG F LF + D
Sbjct: 268 KAYENDTSGFDSRPILLYFQGAIYR----KDGG-------------FVRQELFYLLQDEK 310
Query: 260 TTYY--------------EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
++ + M + FCL G P S RL +A+ C+PVII+DDI LP
Sbjct: 311 DVHFSFGSVRNGGINKASQGMHNSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDDIELP 370
Query: 306 FADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQ 362
F D I + E +FV D K L ++ I E R L +++ F P++
Sbjct: 371 FEDVIDYSEFAVFVRTSDALKENFLVNLIRGISKEEWTRMWNRL--KEVEKYYEFHFPSK 428
Query: 363 PGDAFHQILNGLARKLP------HDKSTY---LKPGGKFLNWTS 397
DA I +ARK+P H Y + GK +W+S
Sbjct: 429 VDDAVQMIWQAIARKVPGVKMRIHKSRRYSGSVSDTGKESSWSS 472
>gi|374922021|gb|AFA26188.1| hypothetical protein, partial [Lolium perenne]
Length = 282
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 64/258 (24%), Positives = 107/258 (41%), Gaps = 44/258 (17%)
Query: 119 RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFH----YQEEKAIERGILPLLQRATLV 174
R A I+ +PYWNRT G DH + D GAC+ + + G +
Sbjct: 2 RMAYDHIAQRYPYWNRTSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWGNTNTKHENSTT 61
Query: 175 QTFGQR----------NHVCLN-EGSITIPPYAPPQ------KMQAHQIPPDTPRSIFVY 217
+ +H C + + +P + P K+ A P + R Y
Sbjct: 62 AYWADNWDNIPLDRRGDHPCFDPTKDLVLPAWKDPDPAAIWLKLWAR---PRSNRRTLFY 118
Query: 218 FRGLFYDVNNDPEGGY----YARGARAAVWENFKNNP----------LFDISTDH--PTT 261
F G ++ + E G Y+ G R + F + P + +++ H
Sbjct: 119 FNG---NLGSAYEQGRPEDTYSMGIRQKLAAEFGSTPNKQGKLGRQHVANVTVTHLRSEK 175
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
YYE++ +IFC G WS R+ ++++ GCIPVII D I LP+ + + + + + E
Sbjct: 176 YYEELASSIFCGVLPG-DGWSGRMEDSMLQGCIPVIIQDGIFLPYENVLNYNSFAVRIQE 234
Query: 322 EDVPKLDTILTSIPPEVI 339
+D+P L T+L + I
Sbjct: 235 DDIPNLITVLRGMNETQI 252
>gi|291237398|ref|XP_002738622.1| PREDICTED: tout-velu-like [Saccoglossus kowalevskii]
Length = 783
Score = 78.2 bits (191), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 84/352 (23%), Positives = 133/352 (37%), Gaps = 58/352 (16%)
Query: 41 KVYVYELPSKYNKKLL-------------LKD-------------PRCLTHMFAAEIFMH 74
K+YVY++P+ +N+ +L L+D TH FA E +H
Sbjct: 399 KIYVYDMPAAFNEDILDCVHTKVRGECIHLQDGGFGKMLWTDNNISYHFTHQFALEPIIH 458
Query: 75 RFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNR 134
LL+S RTLN +AD FY P Y P + + + S N P+
Sbjct: 459 HKLLNSTQRTLNASDADLFYLPYYAGLKCFCHDRYTPGVTAGDLNNKFWEYSLNLPF--- 515
Query: 135 TEGADHFFVV-----PHDFGAC--FHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNE 187
+ HF + H C K I ++ QR F + H E
Sbjct: 516 IKTKPHFMALGKIEREHCSSGCPLLRSAHSKHILYLMIEQEQRRRSRVAFKRDGH----E 571
Query: 188 GSITIPPYAPPQKMQAHQIPP--DTPRSIFVYF-------RGLFYDVNNDPEGGYYARGA 238
+ + PY P + RSI V + + D + A G
Sbjct: 572 DEVIVVPYPSYAHFTTEDAVPRFNVSRSILVLMCAGVRRTQSFRVKLRQDLQKEENATGR 631
Query: 239 RAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 298
V+ F + ++ + MQ+++FCL P G +P ++V+ GCIPV
Sbjct: 632 HRGVY--FHTRECMEETSRKVIDF---MQQSVFCLQPWGDSPTRKSFYDSVLSGCIPVRF 686
Query: 299 ADDIVLPFADAIPWEEIGMFVAEEDVPKLDT----ILTSIPPEVILRKQRLL 346
D++ PF D I ++E +FV + ++ +T L +P E I + Q L
Sbjct: 687 LKDVIYPFEDRINYDEFSLFVDKNELETTNTSIVDYLAKVPKERIEKMQDKL 738
>gi|356525152|ref|XP_003531191.1| PREDICTED: probable glycosyltransferase At3g07620-like isoform 1
[Glycine max]
Length = 427
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 88/372 (23%), Positives = 144/372 (38%), Gaps = 49/372 (13%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRC-----------------LTHMFAAEIFMHRFLLSS-- 80
L+V++Y+LP ++N ++ + L + E +M LL++
Sbjct: 52 LRVFMYDLPRRFNVGMIDRRSASETPVTVEDWPAWPVNWGLKKQHSVEYWMMGSLLNAGE 111
Query: 81 ---PVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSP-----RMMRSAIQLISSNWPYW 132
VR +PE A F+ P + + G + K P R ++ + + YW
Sbjct: 112 GREAVRVSDPELAQAFFVPFFSSLSFNTHGHTM--KDPATQIDRQLQVDLMELLKKSKYW 169
Query: 133 NRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITI 192
R+ G DH F + H F RG L + +V FG+ N +
Sbjct: 170 QRSGGRDHVFPMTHPNAFRF--------LRG--QLNESIQVVVDFGRYPRGMSNLNKDVV 219
Query: 193 PPYAPPQKMQAHQIPPD--TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNP 250
PY P D RS ++FRG Y EG + A+ + +
Sbjct: 220 SPYVHVVDSFTDDEPQDPYESRSTLLFFRGRTY---RKDEGIVRVKLAKILAGYDDVHYE 276
Query: 251 LFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAI 310
+ ++ + M+ + FCL P G P S RL +A+V C+PVI++D I LPF D I
Sbjct: 277 RSVATEENIKASSKGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPFEDDI 336
Query: 311 PWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAF 367
+ + +F + ++ + + L P E R L S+ F P + DA
Sbjct: 337 DYSQFSVFFSFKEALQPGYMIDQLRKFPKEKWTEMWRQLK--SISHHYEFEYPPKREDAV 394
Query: 368 HQILNGLARKLP 379
+ KLP
Sbjct: 395 DMLWRQAKHKLP 406
>gi|326490259|dbj|BAJ84793.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 498
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 102/394 (25%), Positives = 153/394 (38%), Gaps = 84/394 (21%)
Query: 40 LKVYVYELPSKY------------------NKKLLLKDP--RCLTHMFAAEIFMHRFLLS 79
+K+Y+Y+LPSK+ + +L D R H +AE ++ + L
Sbjct: 99 VKIYMYDLPSKFTYGVVRSYMSARGPSGSSDAAAVLADEELRYPGHQHSAEWWLFKDLRQ 158
Query: 80 S-----PV-RTLNPEEADWFYTPIYPTCDL--TPTGLPL------------PFKSPRMMR 119
PV R +P EAD FY P + + L P P P+ S M+
Sbjct: 159 RGPRDRPVARVDDPVEADLFYVPFFSSLSLVVNPIRPPAAANASEAAATVGPWYSDEAMQ 218
Query: 120 SAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQ 179
+ YW R G DH F+ Q+ A+ R + + LV FG+
Sbjct: 219 DELVEWLERQSYWRRYRGRDHVFIC----------QDPNALYRVVDRISNAVLLVSDFGR 268
Query: 180 RNHVCLNEGSITIPPYAPPQKMQAHQIPP------DTPRSIFVYFRGLFYDVNNDPEGGY 233
+ I PY+ H+I P R ++F G Y EGG
Sbjct: 269 LRGDQASLVKDVILPYS-------HRINPFKGDVNVDSRPALLFFMGNRYR----KEGGK 317
Query: 234 YARGARAAVWEN-----FKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEA 288
R V EN K+ +S T + M + FCL P G P + RL +A
Sbjct: 318 I-RDTLFQVLENEGDVIIKHGAQSRVSRRMAT---QGMHSSKFCLHPAGDTPSACRLFDA 373
Query: 289 VVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRL 345
+V C+PVI++D I LPF D I + I +FV + L ++L + E IL QR
Sbjct: 374 LVSLCVPVIVSDHIELPFEDVIDYSNISIFVDTSKAIQPGFLTSMLRKVSSERILEYQR- 432
Query: 346 LANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
++R + + P +QI ++ K P
Sbjct: 433 ----EIQRVKHYFEYEDPNGPVNQIWRQVSMKAP 462
>gi|115476598|ref|NP_001061895.1| Os08g0438600 [Oryza sativa Japonica Group]
gi|42408650|dbj|BAD09870.1| Exostosin family-like protein [Oryza sativa Japonica Group]
gi|42408898|dbj|BAD10156.1| Exostosin family-like protein [Oryza sativa Japonica Group]
gi|113623864|dbj|BAF23809.1| Os08g0438600 [Oryza sativa Japonica Group]
gi|215715329|dbj|BAG95080.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215740942|dbj|BAG97437.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 566
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 102/400 (25%), Positives = 151/400 (37%), Gaps = 93/400 (23%)
Query: 40 LKVYVYELPSKYNKKLL--------------LKDPRC-------LTHMFAAEIFMHRFLL 78
L+VY+Y+LP +++ +L DPR L + E ++ +L
Sbjct: 179 LRVYLYDLPPEFHFGMLGWDGKAAGAAWPDVAGDPRAVPRYPGGLNLQHSVEYWLTLDIL 238
Query: 79 SS-------------PVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLI 125
SS VR N AD F P + + RS QL
Sbjct: 239 SSTTSGDHRRRRPCTAVRVTNASLADVFLVPFFASLSYNRQSKSPHGGHGSGGRSDRQLQ 298
Query: 126 SSNWPY------WNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQ 179
Y W R GADH VVPH + + + +L FG+
Sbjct: 299 GELVRYLARREEWRRWGGADHL-VVPHHPNSMMDARRRLSAAMFVL---------SDFGR 348
Query: 180 RNHVCLNEGSITIPPYAPPQKMQAHQIPP----DTP----RSIFVYFRGLFYDVNNDPEG 231
N I PY H +P D+P R + YF+G + N G
Sbjct: 349 YPPDVANLRKDVIAPYK-------HVVPSLGDGDSPGFEQRPVLAYFQGAIHRKN----G 397
Query: 232 GYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAI-------FCLCPLGWAPWSPR 284
G R +++ K+ D+ + + ++RA FCL G P S R
Sbjct: 398 GR----VRQRLYQLIKDEK--DVHFTYGSVRQNGIRRATKGMASSKFCLNIAGDTPSSNR 451
Query: 285 LVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPE---V 338
L +A+V C+PVII+DDI LPF D + + +FV D K L +L I E
Sbjct: 452 LFDAIVSHCVPVIISDDIELPFEDVLDYSAFCVFVRASDAVKRGFLLHLLRGISQEEWTA 511
Query: 339 ILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKL 378
+ R+ + +A+ + P+QPGDA I +ARK+
Sbjct: 512 MWRRLKEVAHH-----FEYQYPSQPGDAVQMIWGAVARKM 546
>gi|15230692|ref|NP_190126.1| exostosin family protein [Arabidopsis thaliana]
gi|6967106|emb|CAB72489.1| putative protein [Arabidopsis thaliana]
gi|332644505|gb|AEE78026.1| exostosin family protein [Arabidopsis thaliana]
Length = 475
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 84/344 (24%), Positives = 125/344 (36%), Gaps = 58/344 (16%)
Query: 83 RTLNPEEADWFYTPIYPTCDLTPTGLPLPF-KSPRMMRSAIQLIS--SNWPYWNRTEGAD 139
R N EAD + P + + P K+ R +L++ + W R+ G D
Sbjct: 154 RVYNSSEADVIFVPFFSSLSYNRFSKVNPHQKTSRNKDLQGKLVTFLTAQEEWKRSGGRD 213
Query: 140 HFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQ 199
H + H ++ L ++ FG+ N I PY
Sbjct: 214 HVVLAHH----------PNSMLDARNKLFPAMFILSDFGRYPPTVANVEKDVIAPYKHVI 263
Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
K + R I +YF+G Y +GG F LF + D
Sbjct: 264 KAYENDTSGFDSRPILLYFQGAIYR----KDGG-------------FVRQELFYLLQDEK 306
Query: 260 TTYY--------------EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
++ + M + FCL G P S RL +A+ C+PVII+DDI LP
Sbjct: 307 DVHFSFGSVRNGGINKASQGMHNSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDDIELP 366
Query: 306 FADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQ 362
F D I + E +FV D K L ++ I E R L +++ F P++
Sbjct: 367 FEDVIDYSEFSVFVRTSDALKENFLVNLIRGITKEEWTRMWNRL--KEVEKYYEFHFPSK 424
Query: 363 PGDAFHQILNGLARKLP------HDKSTY---LKPGGKFLNWTS 397
DA I +ARK+P H Y + GK W+S
Sbjct: 425 VDDAVQMIWQAIARKVPGVKMRIHKSRRYSGSVSETGKESRWSS 468
>gi|302842849|ref|XP_002952967.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300261678|gb|EFJ45889.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 728
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 85/334 (25%), Positives = 139/334 (41%), Gaps = 62/334 (18%)
Query: 41 KVYVYELPSKYNK-KLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYP 99
++YVYELP + N + L + R L ++ + + LLS+ +R + AD+F PI
Sbjct: 232 RIYVYELPPEMNTFQNLDRLDRPLMYL------IWQRLLSAGLRVADAASADFFLVPIR- 284
Query: 100 TCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGAD-HFFVVPHDFGACFHYQEEK 158
+ + + S R+++ A+ I S WPYWN T G H FV D+G ++
Sbjct: 285 --------VRMAYDSDRVIQ-AVSYIRSVWPYWNATRGGGRHIFVHTGDWG------RDE 329
Query: 159 AIERGILPLLQRATLVQTFG-QRNHVCLN-------EGSITIPPYAPPQKMQAHQIPPDT 210
E L L + AT + +G R+H + +P + +Q+P +
Sbjct: 330 LSEDAQL-LTRNATWLTHWGLARDHEFAGWRQSHRPGKDVVLPLMLAASLLSTYQLPRAS 388
Query: 211 P---------RSIFVYFRGLFYDVNNDPE-GGYY---------------ARGARAAVWEN 245
P R+ ++F G P G Y A RA + +
Sbjct: 389 PLHPAGPRPERTTTLFFAGRICGSRATPSLNGTYPNCPNVLGSEDAYSAATRQRAYFYHH 448
Query: 246 FKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
+ N ++ P +M A FCL P G R V A + GC+PV + D ++ P
Sbjct: 449 GRANWKLVTASRAPAA---EMATAKFCLAPSGGGQ-GKRSVLAPLMGCVPVPVTDGLMQP 504
Query: 306 FADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVI 339
F + WE + V E D+P + +L + PE +
Sbjct: 505 FEPELRWERFAVGVRERDLPVMHELLDRLMPEQV 538
>gi|296085881|emb|CBI31205.3| unnamed protein product [Vitis vinifera]
Length = 455
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 87/356 (24%), Positives = 140/356 (39%), Gaps = 47/356 (13%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTL-NPEEADWFYTPIY 98
LKV++Y+LP++++ LL P A E P+ + N EAD + P +
Sbjct: 98 LKVFMYDLPAEFHFGLLDWAP-------AGESVWPDIRTKFPLYPVHNSSEADVIFVPFF 150
Query: 99 PTCDLTPTGLPLPFKSPR------MMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
+ G FK + +++ + + W R+EG DH + H
Sbjct: 151 SSLSYNHFG---KFKGRQKKNENNLLQDKLVKFLTAQEEWIRSEGRDHIIMAHH------ 201
Query: 153 HYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPR 212
++ + L ++ FG+ N G I PY K + R
Sbjct: 202 ----PNSMLDARMKLWPAIFILSDFGRYPPNIANVGKDVIAPYKHVIKSFINDTSDFDSR 257
Query: 213 SIFVYFRGLFYDVNNDPEGGY------YARGARAAVWENFKNNPLFDISTDHPTTYYEDM 266
+YF+G Y +GG+ Y V F N I+ + M
Sbjct: 258 PTLLYFQGAIYR----KDGGFIRQELFYLLKDEKDVHFAFGNTQGNGINKAS-----QGM 308
Query: 267 QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK 326
+ FCL G P S RL +A+ C+PVII+D+I LP+ D + + + +FV D K
Sbjct: 309 HSSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDEIELPYEDVLDYSQFCIFVRTSDALK 368
Query: 327 ---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
L ++ SI + R R L ++ F P++ GDA I + RK+P
Sbjct: 369 DKFLIKLIRSIKKDEWTRMWRRL--KEVENFFEFQYPSKEGDAVQMIWQAITRKVP 422
>gi|401888173|gb|EJT52138.1| hypothetical protein A1Q1_06676 [Trichosporon asahii var. asahii CBS
2479]
Length = 1189
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 81/328 (24%), Positives = 139/328 (42%), Gaps = 49/328 (14%)
Query: 64 THMFAAEIFMHRFL--LSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSA 121
TH F+A+ M+R + +S P EEAD P+Y C+ + ++
Sbjct: 868 THCFSADSMMYRAMRHISVPA-----EEADLIVLPVYQHCE----------GADFILHDV 912
Query: 122 IQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEE--KAIERGIL---PLLQRATLVQT 176
++ P N E A V+ HD+G C + E A + L PLL+ +
Sbjct: 913 WHHAQASIPGINTGEKAVSL-VMTHDWGICLAFTWEIWSARQDHRLYPDPLLKNTLVFSV 971
Query: 177 FGQRNHVCLNEGS-ITIPPYA--PPQKMQA----HQIPPDTPRSIFVYFRGLFYDVNND- 228
G C + +PP + P+ + + I P + R + + G ++
Sbjct: 972 MGDWFSPCYRPAQDVVVPPRSCTSPRLLASFPTVSHIKPVSERPRLISWAGTYWGSGKSE 1031
Query: 229 ------PEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWS 282
P GG G R + N + D Y E++ A FC P G A WS
Sbjct: 1032 RLRLACPRGGA---GMRELLPGAGPQNHI-----DKYDDYLEELNTARFCPQPRGIAGWS 1083
Query: 283 PRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRK 342
PR+ +A+ GCIPV+ ++D PFA I W +I + V ++ ++ +L SIP + R
Sbjct: 1084 PRVNDAIFAGCIPVLTSEDTHYPFAGLIDWSQISVRVHPTELDHVEELLASIP---LARL 1140
Query: 343 QRLLAN-PSMKRAMLFPQPAQPGDAFHQ 369
+++ AN +++ A ++P + + +
Sbjct: 1141 EQIQANIVAIRDAFMYPADGKAEEELQK 1168
>gi|218199855|gb|EEC82282.1| hypothetical protein OsI_26516 [Oryza sativa Indica Group]
Length = 393
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 79/320 (24%), Positives = 126/320 (39%), Gaps = 50/320 (15%)
Query: 83 RTLNPEEADWFYTPIYPTCDLTPTGL------------PLPFKSPRMMRSAIQLISSNWP 130
R +P +AD FY P + + L + P S + + + P
Sbjct: 66 RVDDPADADLFYVPFFSSLSLVVNPIRPSAAAANASDAAEPAYSDESTQEELLVWLERQP 125
Query: 131 YWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSI 190
YW R +G DH F+ Q+ A+ R + + L+ FG+ +
Sbjct: 126 YWRRHQGRDHVFIC----------QDPNALYRVVDRISNAVLLISDFGRLRSEQASLVKD 175
Query: 191 TIPPYAPPQKMQAHQIPPDT-PRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN 249
I PYA ++ + Q R ++F G Y EGG R +++ +N
Sbjct: 176 VILPYA--HRINSFQGDVGVESRPSLLFFMGNRYR----KEGG----KVRDTLFQVLENE 225
Query: 250 PLFDISTDHPTTYYED-------MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 302
D+ H E M + FCL P G P + RL +A+V C+PVI++D I
Sbjct: 226 A--DVIIKHGAQSRESRRMATRGMHSSKFCLHPAGDTPSACRLFDALVSLCVPVIVSDYI 283
Query: 303 VLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQ 359
LPF D I + I +FV + L + L I + IL QR +K+ + +
Sbjct: 284 ELPFEDVIDYRNISIFVETSKAVQPGFLTSTLRGISSQRILEYQR-----EIKKVKHYFE 338
Query: 360 PAQPGDAFHQILNGLARKLP 379
P +QI + ++ K P
Sbjct: 339 YEDPNGPVNQIWHQVSSKAP 358
>gi|168042843|ref|XP_001773896.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674740|gb|EDQ61244.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 373
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 66/274 (24%), Positives = 108/274 (39%), Gaps = 28/274 (10%)
Query: 113 KSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRAT 172
K+ + +Q + P W + G DH V+ H F + L++A
Sbjct: 103 KNQELQEKLMQFLEKQ-PAWQASGGVDHVIVIHHPNSGYF-----------MRDHLRKAM 150
Query: 173 -LVQTFGQRNHVCLNEGSITIPPYAPP-QKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPE 230
+V FG+ N G + PY +A R ++F+G E
Sbjct: 151 FVVADFGRYASDVANIGKDIVAPYKHVVNDFEAEATISYEKRKTLLFFQGAIMR----KE 206
Query: 231 GGYYARGARAAVWENFKNNPLFDISTDHPT-----TYYEDMQRAIFCLCPLGWAPWSPRL 285
GG R +++ P + T + E MQ + FCL G P S RL
Sbjct: 207 GGI----IRLQLYKLLNGEPDVHFEGGNTTNSAIRSASEGMQNSKFCLNLAGDTPSSNRL 262
Query: 286 VEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRL 345
+A+ C+PVII+DDI +PF D + + +F+ D K + I+ + + ++
Sbjct: 263 FDAIASHCVPVIISDDIEVPFEDTLNYSTFSIFIKSSDALKSNFIIDLLRGVSREKWTKM 322
Query: 346 LAN-PSMKRAMLFPQPAQPGDAFHQILNGLARKL 378
A ++ + P QP DA H +ARK+
Sbjct: 323 WATLKQVEHHFKYQYPTQPDDAVHMTWKAIARKI 356
>gi|356506196|ref|XP_003521873.1| PREDICTED: probable glucuronosyltransferase GUT1-like [Glycine max]
Length = 505
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 78/313 (24%), Positives = 117/313 (37%), Gaps = 34/313 (10%)
Query: 80 SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQ--LIS--SNWPYWNRT 135
S +R N E+D + P + + R M +Q L+ + W R+
Sbjct: 191 SVIRVRNSSESDVVFVPFFSSLCYNRFSSKTNPHEKRSMNKVLQEKLVKYVTEQEEWKRS 250
Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
G DH V H ++ + L ++ FG+ N I PY
Sbjct: 251 GGKDHVIVAHH----------PNSMLDARMKLWPGTFILSDFGRYPTNIANVEKDVIAPY 300
Query: 196 APPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGG------YYARGARAAVWENFKNN 249
+ R +YF+G Y +GG YY V +F N
Sbjct: 301 KHVVGSYDNDQSSFDSRPTLLYFQGAIYR----KDGGHVRHELYYLVKNEKDVHFSFGN- 355
Query: 250 PLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 309
+ E M+ + FCL G P S RL +A+ C+PVII+D+I LP+ D
Sbjct: 356 ----VEKGGVRNAAEGMRSSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDEIELPYEDV 411
Query: 310 IPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDA 366
I + + +FV D K L + SI E R L ++ F P++ GDA
Sbjct: 412 IDYSQFCVFVRTRDALKKRYLINFIRSIGKEEWTRMWNRL--KEVESFFEFQFPSKEGDA 469
Query: 367 FHQILNGLARKLP 379
I +ARK+P
Sbjct: 470 VQMIWKAVARKVP 482
>gi|115441893|ref|NP_001045226.1| Os01g0921300 [Oryza sativa Japonica Group]
gi|19386797|dbj|BAB86176.1| OJ1485_B09.5 [Oryza sativa Japonica Group]
gi|57899432|dbj|BAD88370.1| exostosin-like [Oryza sativa Japonica Group]
gi|113534757|dbj|BAF07140.1| Os01g0921300 [Oryza sativa Japonica Group]
gi|125573139|gb|EAZ14654.1| hypothetical protein OsJ_04578 [Oryza sativa Japonica Group]
gi|215741014|dbj|BAG97509.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767487|dbj|BAG99715.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 437
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 76/307 (24%), Positives = 129/307 (42%), Gaps = 25/307 (8%)
Query: 82 VRTLNPEEADWFYTPIYPTCDLTPTGLPL---PFKSPRMMRSAIQLISSNWPYWNRTEGA 138
VR +P+ A+ F+ P + + G + ++ R+++ + I YW R+ G
Sbjct: 128 VRVTDPDAAEAFFVPFFSSLSFNVHGRNMTDPETEADRLLQVELMEILWKSKYWQRSAGR 187
Query: 139 DHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAP- 197
DH V+P F + + + IL +V FG+ + + PY
Sbjct: 188 DH--VIPMHHPNAFRFLRD-MVNASIL-------IVADFGRYTKELASLRKDVVAPYVHV 237
Query: 198 -PQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDIST 256
+ P R ++FRG V D EG A+ A+ K+ F+ S
Sbjct: 238 VDSFLNDDPPDPFDDRPTLLFFRG--RTVRKD-EGKIRAKLAKIL---KGKDGVRFEDSL 291
Query: 257 ---DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE 313
+ T E M+ + FCL P G P S RL +A+V C+PVI++ I LPF D I +
Sbjct: 292 ATGEGIKTSTEGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSSRIELPFEDEIDYS 351
Query: 314 EIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLAN-PSMKRAMLFPQPAQPGDAFHQILN 372
E +F + E+ + D +L + + + + ++ F P + GDA + I
Sbjct: 352 EFSLFFSVEEALRPDYLLNQLRQIQKTKWVEIWSKLKNVSHHYEFQNPPRKGDAVNMIWR 411
Query: 373 GLARKLP 379
+ K+P
Sbjct: 412 QVKHKVP 418
>gi|219124193|ref|XP_002182394.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217406355|gb|EEC46295.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 573
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 84/376 (22%), Positives = 138/376 (36%), Gaps = 74/376 (19%)
Query: 60 PRCLTHMFAAEIFMHRFLL--SSPVRTLNPEEADWFYTPIYPTCDL--TPTGLPLPFKSP 115
P + F+ E+ + L S +RT +PE+A FY P P+ + G P FK+
Sbjct: 144 PFSMHAQFSTELLVREILTHPDSCLRTYDPEQASLFYVPYLPSMEFHAGARGRPPSFKTS 203
Query: 116 RMMRSAIQLISSNW-----------PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGI 164
+ + ++ + ++ YW R G+DH V H ++++ +
Sbjct: 204 KYANAILRALEGDYQPWTDHFGLTPKYWQRRNGSDHILVFSEPLQGLTHPKKKRGNYHFV 263
Query: 165 L-------PLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQ------------KMQAHQ 205
P++ L TF C + + P + +Q
Sbjct: 264 HTQKQLAPPIVVSVELSTTFVNMYPSCAQKNILMPYPITDGRYFNGDLDKEARWAIQNRS 323
Query: 206 IPPDTPRSIFVYFR-----GLFYDVNNDPEGGYYARGA-------RAAVWENFKNNPLFD 253
+ +S V G D P +Y G RAA+ +N+K P F
Sbjct: 324 LDSIDSKSSPVLVAEKDPVGTLADAR--PIAQWYRAGVHGECVPLRAALQQNYKCTPSFP 381
Query: 254 ISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD----- 308
PTTY M+ A FC CP G + R+ +AV+ GCIP+I++ D V P +D
Sbjct: 382 SFKRTPTTYPLGMRMATFCPCPGGDTASAKRMFDAVLAGCIPIILSHDFVWPLSDEFEPE 441
Query: 309 --------AIPWEEIGMFVAEED-------------VPKLDTILTSIPPEVILRKQRLLA 347
A+ W V + D +P + +L +IP I R +R L
Sbjct: 442 MLIKVSDFALRWNASNFVVRKFDNQCRPSVANTNYALPSVQELLEAIPASEIRRLRRGLR 501
Query: 348 NPSMKRAMLFPQPAQP 363
+ + P+ P
Sbjct: 502 HAQQAYSYYKPRKGFP 517
>gi|392575274|gb|EIW68408.1| hypothetical protein TREMEDRAFT_32575 [Tremella mesenterica DSM 1558]
Length = 1176
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 77/329 (23%), Positives = 132/329 (40%), Gaps = 54/329 (16%)
Query: 65 HMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQL 124
H F+A+ ++R + V + EEAD P+Y C F + A++
Sbjct: 839 HCFSADNILYRAMRHISV---SAEEADLIVLPVYQHC------TDEEFMLHDLTAFAVKT 889
Query: 125 ISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERG---ILP--LLQRATLVQTFGQ 179
I + FV+ HD+G C ++ E RG + P +L+ A + G
Sbjct: 890 IPGVL-----NQDKKLAFVLTHDWGICVNFAWEIWSARGESHLTPDWILRNALVWSVMGD 944
Query: 180 RNHVCLN-EGSITIPPYAPPQ-KMQAH-----QIPPDTPRSIFVYFRGLFYDVNND---- 228
N C I +PP ++ H + P R+ V + G ++ +
Sbjct: 945 YNSPCYRPHQDIVVPPRTCKSIDLREHFPEITNVTPMRQRTKLVTWSGTYWGTGKNMRLR 1004
Query: 229 ---PEGGYYAR-----GARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAP 280
GG G + W N++ Y +++ A FC P G A
Sbjct: 1005 LTCERGGAGKEELVPGGGPMSSWYNWE--------------YMKEISGARFCPQPTGIAG 1050
Query: 281 WSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVIL 340
WSPR+ +A+ GCIPV+ A+ PFAD + W + + + ++ +L+ IL++IP E +
Sbjct: 1051 WSPRINDAIYAGCIPVLTAEGTHYPFADFLDWSKFSIRIKPTELDQLERILSAIPLEQLE 1110
Query: 341 RKQRLLANPSMKRAMLFPQPAQPGDAFHQ 369
Q L ++ A ++ P D +
Sbjct: 1111 EMQANLM--LVREAFIYSTDENPEDELKR 1137
>gi|302855272|ref|XP_002959133.1| hypothetical protein VOLCADRAFT_121779 [Volvox carteri f.
nagariensis]
gi|300255495|gb|EFJ39797.1| hypothetical protein VOLCADRAFT_121779 [Volvox carteri f.
nagariensis]
Length = 771
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 78/330 (23%), Positives = 130/330 (39%), Gaps = 51/330 (15%)
Query: 41 KVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP-IYP 99
++Y+YE P + +L R L I + ++S +R ++P AD+F+ P
Sbjct: 263 RIYIYEFPPYMHMWGMLWLDRPLN------IIVWERIISMGLREVDPARADYFFIPGCGR 316
Query: 100 TCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKA 159
CD +++ YW + +G DH P D+G C H + A
Sbjct: 317 GCDKWDDKFHF-------------ILAHYGQYWTQNQGRDHIMTHPGDWGRCEHSWDAFA 363
Query: 160 IERGILPLLQRATLVQTFG-------QRNHVCLN-----EGSITIPP-----YAPPQKMQ 202
+ + T++Q +G + H N I +PP Y +
Sbjct: 364 DK-----FISNVTMLQHWGMTVDRSSEVEHNLFNTCHKPNQDILVPPMCGDLYPQFEYNI 418
Query: 203 AHQIPPDTP--RSIFVYFRGLFYDVNNDPE----GGYYARGARAAVWENFKNNPLFDIST 256
H + P ++ G N+ E YY+ G RAA+W+ ++ P F I+
Sbjct: 419 WHPNRKENPITKTNLASVAGSICGWNSVEEPPCRNRYYSLGVRAALWQ-LRDVPGFHIA- 476
Query: 257 DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG 316
+ M + FC P G A + R V A GC+PVII+D + P+ + W E G
Sbjct: 477 KRVAMMGQSMAESEFCFAPTG-AGYGKRNVMATTLGCMPVIISDHVAQPYEPFLNWNEFG 535
Query: 317 MFVAEEDVPKLDTILTSIPPEVILRKQRLL 346
+++ E ++ IL P+ K L
Sbjct: 536 VWIPESQAKDVEIILRGFTPQQKAEKMEKL 565
>gi|357463031|ref|XP_003601797.1| hypothetical protein MTR_3g085480 [Medicago truncatula]
gi|355490845|gb|AES72048.1| hypothetical protein MTR_3g085480 [Medicago truncatula]
Length = 425
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 82/371 (22%), Positives = 145/371 (39%), Gaps = 46/371 (12%)
Query: 40 LKVYVYELPSKYNKKLLLK-----------------DPRCLTHMFAAEIFM------HRF 76
L+VY+Y+LP ++N +L D + L + E +M
Sbjct: 50 LRVYMYDLPRRFNVGMLDGRNTTEAPVTIADYPLWPDNQGLRRQHSVEYWMMGSLLNGGG 109
Query: 77 LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPL---PFKSPRMMRSAIQLISSNWPYWN 133
S VR L+PE D ++ P + + G + + ++ + + YW
Sbjct: 110 NGSEAVRVLDPEVVDVYFVPFFSSLSFNTHGHHMRDPETEIDHQLQIDLMGLLGQSKYWQ 169
Query: 134 RTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIP 193
R+ G DH F + H F + ++ L + +V FG+ N +
Sbjct: 170 RSGGRDHIFPMTH--PNAFRFLRDQ--------LNESIQVVVDFGRYPKGVSNLNKDVVS 219
Query: 194 PYAPPQKMQAHQIPPD--TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPL 251
PY P D R+ ++FRG + +G A+ + + +
Sbjct: 220 PYVHFVDSYVDDEPHDPFESRTTLLFFRG---GTHRKDKGIVRAKFTKILAGFDDVHYER 276
Query: 252 FDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
+ ++ + M+ + FCL P G P S RL +A+V C+PVI++D I LPF + I
Sbjct: 277 SSATGENIKLSSKGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDKIELPFENEID 336
Query: 312 WEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFH 368
+ + +F + ++ + + L S P + R L N S F P + DA +
Sbjct: 337 YSQFSLFFSFKEALEPGYMINQLRSFPKQNWTEMWRQLKNISHHYE--FHYPPEREDAVN 394
Query: 369 QILNGLARKLP 379
+ + KLP
Sbjct: 395 MLWRQIKHKLP 405
>gi|168017876|ref|XP_001761473.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687479|gb|EDQ73862.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 88/386 (22%), Positives = 149/386 (38%), Gaps = 48/386 (12%)
Query: 40 LKVYVYELPSKYNKKLLLK-------DPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADW 92
L++Y+Y++ + LL C+ + ++ +H+ L+ S RTL+ A++
Sbjct: 16 LQIYIYDVDEIDGLRELLSGRDHSIASSACVRGQWGTQVKIHKLLMKSRYRTLDKSRANF 75
Query: 93 FYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
F+ P+Y C GL + ++ Q+ PY++R+ G DH FV P GA
Sbjct: 76 FFVPVYVKCVRIFGGLNEKEVNEHFLKILRQM-----PYFHRSGGRDHIFVFPSGAGA-- 128
Query: 153 HYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPR 212
+ +G L R+ + G R T P + P ++
Sbjct: 129 ------HLVKGWPNFLNRSIFLTPEGDRTDKKAFSSFNTWKDIIIPGNVDIINHPSNSAT 182
Query: 213 SIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNP--------LFDISTDHPTTYYE 264
S + + + G + R + E K P F S Y
Sbjct: 183 SPLPLSKRKYVA---NYLGRAQGKKGRLQLIELAKQFPAELDAPELAFQGSAKLGRIEYY 239
Query: 265 D-MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGM-FVAEE 322
+ ++ A FCL P G + W+ R EA C+PVI++D+I LP+ + + + + + +
Sbjct: 240 NRLRNAKFCLAPRGESSWTLRFYEAFFVECVPVILSDEIELPYQNVLDYSGFSIKWPSSR 299
Query: 323 DVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPG-DAFHQILNGLARKLP-- 379
+L L SIP I +R+L R + P G A IL L RK+
Sbjct: 300 TNEELLRYLRSIPEFEI---ERMLVLGRNIRCLFTYAPDSEGCTAMTGILWELQRKVRAF 356
Query: 380 ---------HDKSTYLKPGGKFLNWT 396
H+ S + KF W+
Sbjct: 357 HQYHHTFWLHNASIVNRDLAKFHEWS 382
>gi|357141481|ref|XP_003572240.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
[Brachypodium distachyon]
Length = 543
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 77/325 (23%), Positives = 131/325 (40%), Gaps = 37/325 (11%)
Query: 82 VRTLNPEEADWFYTPIYPTCDL---TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGA 138
VR N +AD F+ P + + + R++++ + + W R G
Sbjct: 235 VRVTNASQADVFFVPFFASLSYNRHSKLQGKEKMSRNRLLQAELVKYLARQEEWRRWGGK 294
Query: 139 DHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPP 198
DH VVPH + Q K + + ++ FG+ N + PY
Sbjct: 295 DHL-VVPHHPNSMM--QARKKLSAAMY-------VLSDFGRYPPDVANLKKDVVAPYKHV 344
Query: 199 -QKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTD 257
+ ++ + P R + YF+G + +GG R +++ K+ D+
Sbjct: 345 VRSLRDDESPTFDQRPVLAYFQGAIHR----KDGG----KVRQKLYQLLKDEK--DVHFT 394
Query: 258 HPTTYYEDMQRAI-------FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAI 310
+ + ++RA FCL G P S RL +A+V C+PV+I+DDI LPF D +
Sbjct: 395 YGSVRQNGIRRATKGMASSKFCLNIAGDTPSSNRLFDAIVSHCVPVMISDDIELPFEDVL 454
Query: 311 PWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRA---MLFPQPAQPGDAF 367
+ E +FV D + +L + I R + +K + P++P DA
Sbjct: 455 DYSEFCVFVRASDAVRKGFLLRLL--RGITRDEWNTMWERLKEVAHHFEYQYPSKPDDAV 512
Query: 368 HQILNGLARKLPHDKSTYLKPGGKF 392
I +ARK+ H L G+F
Sbjct: 513 QMIWGAVARKM-HSLKLQLHKSGRF 536
>gi|321261385|ref|XP_003195412.1| hypothetical protein CGB_G5240C [Cryptococcus gattii WM276]
gi|317461885|gb|ADV23625.1| Conserved hypothetical protein [Cryptococcus gattii WM276]
Length = 1130
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 77/321 (23%), Positives = 125/321 (38%), Gaps = 36/321 (11%)
Query: 64 THMFAAEIFMHRFL--LSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSA 121
T F+A+ M+R L +S P +EA+ P+Y CD T F M A
Sbjct: 806 TDCFSADSIMYRALRHMSVPA-----DEAELVVVPVYQQCDGTQ------FLLHDAMHHA 854
Query: 122 IQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEE---KAIERGILP--LLQRATLVQT 176
+ I + V+ HD+G C + + ER + P +L A +
Sbjct: 855 SETIQGV-----KNGEKKVALVLTHDWGICVDFAWDIWSARGERALHPDGILNNALVWSV 909
Query: 177 FGQRNHVCLN-EGSITIPPYAPPQKM------QAHQIPPDTPRSIFVYFRGLFYDVNNDP 229
G + C + IP I P RS + + G +
Sbjct: 910 MGDYDSPCYRPHQDVVIPARTCRSNTLRETFPNVEAIKPMRERSNLLMWSGTYSGTGKSE 969
Query: 230 EGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAV 289
A E K S + Y +D+ A FC P G A WSP+ +A+
Sbjct: 970 RIRLTCNRGGAGDRELIKGGG--KQSNFASSDYMKDLNNARFCAQPRGIAGWSPQTSDAI 1027
Query: 290 VFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLAN- 348
GCIPV I++ PFAD + W ++ + VA ++ K++ +L +IP + + + L AN
Sbjct: 1028 YAGCIPVFISEGTHYPFADFLDWSKLSVRVAPTELDKIEKVLAAIP---LSKVEELQANL 1084
Query: 349 PSMKRAMLFPQPAQPGDAFHQ 369
++ A L+ +P + +
Sbjct: 1085 VCVREAFLYSGDGKPEEELER 1105
>gi|255566500|ref|XP_002524235.1| conserved hypothetical protein [Ricinus communis]
gi|223536512|gb|EEF38159.1| conserved hypothetical protein [Ricinus communis]
Length = 337
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 78/304 (25%), Positives = 131/304 (43%), Gaps = 58/304 (19%)
Query: 77 LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTE 136
L +S TLNPE+A F+ P P+ L SPR + I+ + + +PYWNRT
Sbjct: 71 LQNSHFITLNPEQAHLFFIPF-------PSDL-----SPRSLARVIRDLRTEFPYWNRTL 118
Query: 137 GADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYA 196
GADHF++ G Y+ +R ++ L + + + F N + IT+PP
Sbjct: 119 GADHFYISCTGLG----YES----DRNLVELKKNSVQISCFPSPNGKFVPHKDITLPPLV 170
Query: 197 PPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDIST 256
P ++ P FV + +G RG + E+
Sbjct: 171 PSTIHKSSN--KRRPYKAFVKY-----------DGVEELRGDLEVLIES----------- 206
Query: 257 DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV--LPFADAIPWEE 314
P+ ++ R+ FCL +A + EA+ GC+P++I + + LP D + W+E
Sbjct: 207 -QPS---DEKTRSEFCL--FDYAANISGIGEALSSGCVPLVITERPIQDLPLMDVLRWQE 260
Query: 315 IGMFVAEED--VPKLDTIL--TSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQI 370
I + V D + +L T + R +RL A S + +++ + +P DAFH +
Sbjct: 261 IAVIVGSSDDGFKWVKRVLNGTCSRGDTCERMRRLGAGAS--QHLVWNETPEPYDAFHMV 318
Query: 371 LNGL 374
+ L
Sbjct: 319 MYQL 322
>gi|125544214|gb|EAY90353.1| hypothetical protein OsI_11935 [Oryza sativa Indica Group]
Length = 218
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 63/116 (54%), Gaps = 12/116 (10%)
Query: 261 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
+Y + M+ + FCLCP+G+ SPR+VEA+ + C+PVIIAD+ VL W + VA
Sbjct: 101 SYIQHMKNSRFCLCPMGYEVNSPRIVEALYYECVPVIIADNFVL------DWSAFAVVVA 154
Query: 321 EEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP--GDAFHQILNGL 374
E+DVP L IL I LRK + + F A+P D FH IL+ +
Sbjct: 155 EKDVPDLKKILQGIT----LRKYVAMHGCVKRLQRHFLWHARPLRYDLFHMILHSI 206
>gi|242041079|ref|XP_002467934.1| hypothetical protein SORBIDRAFT_01g036720 [Sorghum bicolor]
gi|241921788|gb|EER94932.1| hypothetical protein SORBIDRAFT_01g036720 [Sorghum bicolor]
Length = 462
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 92/396 (23%), Positives = 156/396 (39%), Gaps = 64/396 (16%)
Query: 26 GSAGDVLEDDPV-GKLKVYVYELPSKYNKKLL--------------LKD-----PRCLTH 65
G AG + DP L+V++Y+LP +++ LL ++D P L
Sbjct: 53 GGAGPGRKCDPAEALLRVFMYDLPPEFHFGLLDWKPPGFGGGVWPDVRDGVPEYPGGLNL 112
Query: 66 MFAAEIFMHRFLLSS---------PVRTLNPEEADWFYTPIYPTCDLTPTGLPLP----F 112
+ E ++ LL+S VR +AD + P + + +P
Sbjct: 113 QHSIEYWLTLDLLASEQGAPTPCAAVRVRRAADADVVFVPFFASLSFNRHSRVVPPARDS 172
Query: 113 KSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQR-- 170
+ + R ++ +++ P W RT G DH + H G+L R
Sbjct: 173 EDRALQRRLLEFLAAR-PEWRRTGGRDHVVLAHH--------------PNGMLDARYRFW 217
Query: 171 --ATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFY--DVN 226
++ FG+ N I PY A+ R +YF+G Y D
Sbjct: 218 PCVFVLCDFGRYPPSVANLDKDVIAPYRHLVANFANDTAGYDDRPTLLYFQGAIYRKDGG 277
Query: 227 NDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLV 286
+ + YY V +F + ++ + M+ + FCL G P S RL
Sbjct: 278 SIRQELYYLLKDEKDVHFSFGS-----VAGNGIEQSTHGMRSSKFCLNIAGDTPSSNRLF 332
Query: 287 EAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQ 343
+++V C+PVII+D+I LPF D + + + + V D K L +++T I E
Sbjct: 333 DSIVSHCVPVIISDEIELPFEDVLDYSKFSVIVRGADAVKKGFLMSLITGISQEEWAHMW 392
Query: 344 RLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
L +++ ++ P+Q DA I +ARK+P
Sbjct: 393 NKLKE--VEKHFVYQYPSQTDDAVQMIWKAIARKVP 426
>gi|168032656|ref|XP_001768834.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162679946|gb|EDQ66387.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 368
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 72/315 (22%), Positives = 126/315 (40%), Gaps = 40/315 (12%)
Query: 83 RTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSA---IQLIS--SNWPYWNRTEG 137
R N + AD P + + P +M R+ + L+S + P W + G
Sbjct: 57 RVKNWQIADVILIPFFASLSYNKYSRPAVRGRKKMDRNQELQVNLLSFLRSQPAWRASNG 116
Query: 138 ADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYA- 196
ADH ++ H + ++ ++ +V FG+ + N + PY
Sbjct: 117 ADHVLIIHHPNAMVYKREQFRSA----------MFVVADFGRYDAEVANIAKDVVAPYKH 166
Query: 197 --PPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDI 254
P + R+ ++F+G EGG R ++E ++ D+
Sbjct: 167 IIPNFDDDIDSVSSFNTRTTLLFFQGAIVR----KEGGII----RQKLYELLRDES--DV 216
Query: 255 STDHPTTYY-------EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFA 307
+ TT M+++ FCL G P S RL +AV C+P+I++DDI LPF
Sbjct: 217 VFVNGTTTSAGIRSATSGMRQSKFCLHMEGDTPSSNRLFDAVASHCVPLIVSDDIELPFE 276
Query: 308 DAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPG 364
D I + E +FV D + L +L + + R + +++ + P++ G
Sbjct: 277 DVINYTEFCLFVNSSDALRKGFLTNLLRNFGEKEWTRMHDRMR--EVQKHFEYQLPSEIG 334
Query: 365 DAFHQILNGLARKLP 379
DA +ARK+P
Sbjct: 335 DAVQMTWEAIARKVP 349
>gi|125528898|gb|EAY77012.1| hypothetical protein OsI_04969 [Oryza sativa Indica Group]
Length = 416
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 71/273 (26%), Positives = 117/273 (42%), Gaps = 22/273 (8%)
Query: 113 KSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRAT 172
++ R+++ + I YW R+ G DH V+P F + + + IL
Sbjct: 141 EADRLLQVELMEILWKSKYWQRSAGRDH--VIPMHHPNAFRFLRD-MVNASIL------- 190
Query: 173 LVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPD--TPRSIFVYFRGLFYDVNNDPE 230
+V FG+ + + PY + PPD R ++FRG V D E
Sbjct: 191 IVADFGRYTKELASLRKDVVAPYVHVVDSFLNDDPPDPFDARPTLLFFRG--RTVRKD-E 247
Query: 231 GGYYARGARAAVWENFKNNPLFDIST---DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVE 287
G A+ A+ K+ F+ S + T E M+ + FCL P G P S RL +
Sbjct: 248 GKIRAKLAKIL---KGKDGVRFEDSLATGEGIKTSTEGMRSSKFCLHPAGDTPSSCRLFD 304
Query: 288 AVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLA 347
A+V C+PVI++ I LPF D I + E +F + E+ + D +L + + + +
Sbjct: 305 AIVSHCVPVIVSSRIELPFEDEIDYSEFSLFFSVEEALRPDYLLNQLRQIQKTKWVEIWS 364
Query: 348 N-PSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
++ F P + GDA + I + K+P
Sbjct: 365 KLKNVSHHYEFQNPPRKGDAVNMIWRQVKHKVP 397
>gi|159472635|ref|XP_001694450.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158276674|gb|EDP02445.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 883
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 46/144 (31%), Positives = 70/144 (48%), Gaps = 29/144 (20%)
Query: 42 VYVYELPSKYNKKLL---LKDPRCLTHMFA--------------AEIFMHRFLLSSPVRT 84
+YVY++ S + LL ++ CL FA AE +H L+S RT
Sbjct: 497 IYVYDMKSDFGTDLLQYRIEGSHCLYRSFARANQTSFVGYNAYAAEPVLHELFLTSEHRT 556
Query: 85 LNPEEADWFYTPIYPTCDLT----------PTGL--PLPFKSPRMMRSAIQLISSNWPYW 132
L+PEEAD+F+ P+ C P GL P + M R A + +++ +PY+
Sbjct: 557 LDPEEADFFFVPVNVGCLFDVYGWNEIPRWPRGLLGPRTHGATMMQREAARWLNATFPYF 616
Query: 133 NRTEGADHFFVVPHDFGACFHYQE 156
R G DH ++ PHD GAC+ ++E
Sbjct: 617 ARRGGRDHIWLNPHDEGACYVWRE 640
>gi|159477447|ref|XP_001696822.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158275151|gb|EDP00930.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 1279
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 44/140 (31%), Positives = 68/140 (48%), Gaps = 29/140 (20%)
Query: 42 VYVYELPSKYNKKLL---LKDPRCL-------------THMFAAEIFMHRFLLSSPVRTL 85
+Y+Y+L +N ++L L CL TH ++ E + L++S RT
Sbjct: 1125 IYMYDLDPLFNTRMLQYRLVRTSCLHRMFRADNSSTFVTHGYSIETYFIELLMTSGHRTY 1184
Query: 86 NPEEADWFYTPIYPTCDLTPT---------GLPL----PFKSPRMMRSAIQLISSNWPYW 132
+PEEAD+F+ P+ TC P G+PL P + M R A + I +P+W
Sbjct: 1185 DPEEADFFFVPLLVTCYFWPVLGWSDHPWFGMPLAIVRPHQGAFMYRDAKRWIQKTYPWW 1244
Query: 133 NRTEGADHFFVVPHDFGACF 152
+R G DH ++ HD GAC+
Sbjct: 1245 DRRGGKDHIWMTAHDEGACW 1264
>gi|224129654|ref|XP_002320639.1| predicted protein [Populus trichocarpa]
gi|222861412|gb|EEE98954.1| predicted protein [Populus trichocarpa]
Length = 375
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 98/403 (24%), Positives = 153/403 (37%), Gaps = 82/403 (20%)
Query: 39 KLKVYVYELPSKYNKKLLLKD-------------------PRCLTHMFAAEIFMHRFLLS 79
KLKV+VY+LP +++ LL P L + E ++ LL+
Sbjct: 2 KLKVFVYDLPPEFHFGLLGWKGNTNQTWPNVDSHSRIPPYPGGLNLQHSVEYWLTLDLLA 61
Query: 80 S-------PVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSP----RMMRSAIQLISSN 128
S VR N +AD + P + + L K +M+++ + +
Sbjct: 62 SNTPKVGTAVRVQNSSQADIVFVPFFSSLSYNRHS-KLHGKEKVSVNKMLQTKLVQFLTA 120
Query: 129 WPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG 188
W R G DH +V H + H +++ L ++ FG+ N G
Sbjct: 121 RDEWKRFGGNDHL-IVAHHPNSMLHARKK---------LGSAMFVLADFGRYPVEIANLG 170
Query: 189 SITIPPYAPPQKMQAHQIPPDTP-----RSIFVYFRGLFYDVNNDPEGG------YYARG 237
I PY K IP R I ++F+G Y +GG YY
Sbjct: 171 KDIIAPY----KHVVRTIPSGESAQFDRRPILMHFQGAIYR----KDGGAIRQELYYLLK 222
Query: 238 ARAAV---WENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCI 294
V + ++ N + + + M + FCL G P S RL +A+ C+
Sbjct: 223 DEKDVHFTFGTYRGNGIKKAA--------QGMASSKFCLNIAGDTPSSNRLFDAIASHCV 274
Query: 295 PVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRA 354
PVII+DDI LPF D + + E +FV D K +L + I + Q +K
Sbjct: 275 PVIISDDIELPFEDVLDYSEFCLFVRASDAVKKGYLLDLL--RGIEKDQWTKLWERLKEI 332
Query: 355 ---MLFPQPAQPGDAFHQILNGLARKLP------HDKSTYLKP 388
+ P+QPGDA + + RK H K+ Y +P
Sbjct: 333 APHFEYSYPSQPGDAVDMVWKAVLRKTSSVQFKRHRKNRYARP 375
>gi|108863947|gb|ABA91286.2| secondary cell wall-related glycosyltransferase family 47,
putative, expressed [Oryza sativa Japonica Group]
gi|215769393|dbj|BAH01622.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 476
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 84/379 (22%), Positives = 146/379 (38%), Gaps = 56/379 (14%)
Query: 40 LKVYVYELPSKYNKKLLL------------------------KDPRCLTHMFAAEIFMHR 75
++V++Y+LP +++ LL + P L + E ++
Sbjct: 95 VRVFMYDLPPEFHFGLLGWSPPTDGAADAAMWPDVGSGAAAPRYPGGLNQQHSVEYWLTL 154
Query: 76 FL-------LSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLP----FKSPRMMRSAIQL 124
L + VR + +AD + P + + +P + + ++
Sbjct: 155 DLLSSSSPPCGAAVRVADSRDADVVFVPFFASLSYNRHSRVVPPEKVSRDKGLQERLVRY 214
Query: 125 ISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVC 184
+ + P W R+ GADH +V H + H + L ++ FG+ +
Sbjct: 215 LMAQ-PEWKRSGGADH-VIVAHHPNSLLHARSV---------LFPAVFVLSDFGRYHPRV 263
Query: 185 LNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWE 244
+ I PY K + R +YFRG + EGG + + +
Sbjct: 264 ASLEKDVIAPYKHMAKTFVNDSAGFDDRPTLLYFRGAIF----RKEGGNIRQELHYMLKD 319
Query: 245 NFKNNPLFDISTDH-PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 303
F DH + + M + FCL G P S RL +A+V C+PVII+DDI
Sbjct: 320 EKDVYFAFGSVQDHGASKASQGMHASKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDDIE 379
Query: 304 LPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMK---RAMLFPQP 360
LP+ DA+ + + +FV D K ++ I + + Q + +K + + P
Sbjct: 380 LPYEDALDYSKFSIFVRSSDAVKKGYLMRLI--RGVSKHQWTMMWRRLKEVDKHFEYQYP 437
Query: 361 AQPGDAFHQILNGLARKLP 379
+Q DA I LARK+P
Sbjct: 438 SQKDDAVQMIWQTLARKVP 456
>gi|297839341|ref|XP_002887552.1| F25A4.34 [Arabidopsis lyrata subsp. lyrata]
gi|297333393|gb|EFH63811.1| F25A4.34 [Arabidopsis lyrata subsp. lyrata]
Length = 458
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 77/325 (23%), Positives = 127/325 (39%), Gaps = 32/325 (9%)
Query: 79 SSPVRTLNPEEADWFYTPIYPTCDLTPTGL---PLPFKSPRMMRSAIQLISSNWPYWNRT 135
S+ +R N EAD + P + + R+++ + + W R
Sbjct: 142 SAAIRVKNSNEADIVFVPFFASLSYNRKSKLRGNETISGDRLLQERLVEFLKSQDEWKRF 201
Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQT-FGQRNHVCLNEGSITIPP 194
+G DH + H + L A V + FG+ + N I P
Sbjct: 202 DGKDHLIIAHHPNSLLYAKN-----------FLGSAMFVLSDFGRYSSANANLEKDIIAP 250
Query: 195 YAPPQK-MQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFD 253
Y K + ++ P R + YF+G Y +GG + + + + F
Sbjct: 251 YLHVVKTISNNESAPFEKRPVLAYFQGAIYR----KDGGTIRQELYNLLRDEKDVHFAFG 306
Query: 254 ISTDHPTTYY-EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 312
+ T + M + FCL G P S RL +A+V C+PVII+D I LPF D++ +
Sbjct: 307 TVRRNGTKQTGKGMASSKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDQIELPFEDSLDY 366
Query: 313 EEIGMFV-AEEDVPK--LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQ 369
+FV A E V K L +L I + +K L + + P+ PGD+ +
Sbjct: 367 SGFSVFVHASEAVKKGFLVNLLRGITEDQWKKKWGRL--KEVAGCFEYRFPSHPGDSVNM 424
Query: 370 ILNGLARKLP------HDKSTYLKP 388
I + ++ KL H K+ Y +
Sbjct: 425 IWSAVSHKLSSLQFDVHRKNRYRRS 449
>gi|260782087|ref|XP_002586123.1| hypothetical protein BRAFLDRAFT_105910 [Branchiostoma floridae]
gi|229271215|gb|EEN42134.1| hypothetical protein BRAFLDRAFT_105910 [Branchiostoma floridae]
Length = 551
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 96/416 (23%), Positives = 161/416 (38%), Gaps = 85/416 (20%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCL-------------------------THMFAAEIFMH 74
++Y+YELP+++N+ L+ R L THMFA E+ +H
Sbjct: 125 FRLYIYELPAEFNRNLVSCVVRELGGCFRLGSFGMGPEFARHGNMSYRHTHMFALEVILH 184
Query: 75 RFLLSSPVRTLNPEEADWFYTPIY--------PTCDLTPTGLPLP--------FKSPRMM 118
+ L SP RTL+P AD FY P Y C T P + +
Sbjct: 185 QKALYSPSRTLDPHSADAFYIPYYAGLCTRHHSGCSTTKNISPYAGLACLCPGLDATALN 244
Query: 119 RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQR---ATLVQ 175
R ++S +P++ R G H + G Q + LP +R A + Q
Sbjct: 245 RKLFSHVTSRYPFYFR--GRPHLMAL----GKIEREQWTQDCSLLTLPQARRVVFAGIEQ 298
Query: 176 TF--GQRNHVCLNEGSITIPPY--------APPQ-----KMQAHQIPPDT-PRSIFVYFR 219
F R H + + PY A Q M+A ++ DT PR +FV+
Sbjct: 299 EFSPALRAHFGRRGSPLIVAPYPAFGHVISAGSQGDVKSHMKAGEL--DTVPRDVFVFLA 356
Query: 220 GLFYDVNNDPEG---GYYARG----ARAAVWENFKNNPLFDISTDHPTTY----YEDMQR 268
+ + +G ++ G + A +P++ ++ + + E M+
Sbjct: 357 ASSRNAHKIRQGLRPQFHVTGQPYSSEEAARVRRDGSPVWLLTPECRGNWEGKVVEWMRH 416
Query: 269 AIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD--IVLPFADAIPWEEIGMFVAEEDVPK 326
++FCL P G +P +AV GC+PV + + PF + + + + + +DV
Sbjct: 417 SVFCLQPPGDSPTRKSFYDAVACGCVPVTFTLEHPVRYPFDQVLNYSDFSVIIDGKDVTD 476
Query: 327 LD----TILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKL 378
+ IL IP E I Q L + +P DAF +L +A+++
Sbjct: 477 RNITILNILRKIPSERIKMLQDNLKKVAPLLQYSYPSTVPSQDAFTMVLEEMAQRV 532
>gi|218185155|gb|EEC67582.1| hypothetical protein OsI_34941 [Oryza sativa Indica Group]
Length = 483
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 86/382 (22%), Positives = 150/382 (39%), Gaps = 55/382 (14%)
Query: 40 LKVYVYELPSKYNKKLLL------------------------KDPRCLTHMFAAEIFMHR 75
++V++Y+LP +++ LL + P L + E ++
Sbjct: 95 VRVFMYDLPPEFHFGLLGWSPPTDGAADAAMWPDVGSGAAAPRYPGGLNQQHSVEYWLTL 154
Query: 76 FL-------LSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLP----FKSPRMMRSAIQL 124
L + VR + +AD + P + + +P + + ++
Sbjct: 155 DLLSSSSPPCGAAVRVADSRDADVVFVPFFASLSYNRHSRVVPPEKVSRDKGLQERLVRY 214
Query: 125 ISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVC 184
+ + P W R+ GADH +V H + H + L ++ FG+ +
Sbjct: 215 LMAQ-PEWKRSGGADH-VIVAHHPNSLLHARSV---------LFPAVFVLSDFGRYHPRV 263
Query: 185 LNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFY--DVNNDPEGGYYARGARAAV 242
+ I PY K + R +YFRG + +V D G R +
Sbjct: 264 ASLEKDVIAPYKHMAKTFVNDSAGFDDRPTLLYFRGAIFRKEVKIDSWKGGNIRQELHYM 323
Query: 243 WENFKNNPL-FDISTDH-PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 300
++ K+ F DH + + M + FCL G P S RL +A+V C+PVII+D
Sbjct: 324 LKDEKDVYFAFGSVQDHGASKASQGMHASKFCLNIAGDTPSSNRLFDAIVSHCVPVIISD 383
Query: 301 DIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMK---RAMLF 357
DI LP+ DA+ + + +FV D K ++ I + + Q + +K + +
Sbjct: 384 DIELPYEDALDYSKFSIFVRSSDAVKKGYLMRLI--RGVSKHQWTMMWRRLKEVDKHFEY 441
Query: 358 PQPAQPGDAFHQILNGLARKLP 379
P+Q DA I LARK+P
Sbjct: 442 QYPSQKDDAVQMIWQTLARKVP 463
>gi|242079279|ref|XP_002444408.1| hypothetical protein SORBIDRAFT_07g021480 [Sorghum bicolor]
gi|241940758|gb|EES13903.1| hypothetical protein SORBIDRAFT_07g021480 [Sorghum bicolor]
Length = 539
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 97/426 (22%), Positives = 158/426 (37%), Gaps = 86/426 (20%)
Query: 34 DDPVGKLKVYVYELPSKYNKKLL-------------LKDPRCLTHMFAAEIFMHRF---- 76
D L+VYVY+LP++++ +L ++D R H H
Sbjct: 126 DADSAALRVYVYDLPAEFHFGMLGWDGKGKPAAWPDVRDARAAPHYPGGLNLQHSVAYWL 185
Query: 77 ---LLSSP---------------------VRTLNPEEADWFYTPIYPTCDL---TPTGLP 109
+LSS VR N AD F+ P + + +
Sbjct: 186 TLDILSSALPPGTGTGSDDVVIRDRPCVAVRVTNASLADVFFVPFFASLSYNRHSKLRRG 245
Query: 110 LPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQ 169
R +++ + W R G +H +VPH + +++ + +L
Sbjct: 246 EKVNRNRFLQAELVRYLMRKEEWRRWGGKNHL-IVPHHPNSMMEARKKLSAAMFVL---- 300
Query: 170 RATLVQTFGQRN-HVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYD---- 224
FG+ + HV + + P + P R I YF+G +
Sbjct: 301 -----SDFGRYSPHVANLKKDVIAPYMHVVRSFGDGDSPAFDQRPILAYFQGAIHRKAVR 355
Query: 225 ------VNNDP--EGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAI------ 270
V N P +GG R +++ K+ D+ + + ++RA
Sbjct: 356 ALCSVLVANRPAFQGG----KVRQKLYQLLKDER--DVHFTYGSVRQNGIRRATAGMSTS 409
Query: 271 -FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK--- 326
FCL G P S RL +A+V C+PVII+DDI LPF D + + E +FV D K
Sbjct: 410 KFCLNIAGDTPSSNRLFDAIVSHCVPVIISDDIELPFEDVLDYSEFCVFVRSADAAKRGF 469
Query: 327 LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDKSTYL 386
L +L I + + L + R + P++ GDA I + +ARK+ H L
Sbjct: 470 LLRLLRGISRDEWTKMWMRLKK--VTRHFEYQYPSRSGDAVQMIWSAVARKM-HSVQLQL 526
Query: 387 KPGGKF 392
G+F
Sbjct: 527 HKRGRF 532
>gi|326530125|dbj|BAK08342.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 88/380 (23%), Positives = 146/380 (38%), Gaps = 62/380 (16%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRC------------------LTHMFAAEIFMHRFLLSS- 80
L+VY+Y+LP +++ +L +P L + E ++ LL+S
Sbjct: 69 LRVYMYDLPLEFHFGMLDWEPGSGGGLWPDVRHGVPEYPGGLNLQHSIEYWLTLDLLASE 128
Query: 81 --------PVRTLNPEEADWFYTPIYPTCDLTPTGLPLP----FKSPRMMRSAIQLISSN 128
VR +P AD + P + + +P + + R I+ +++
Sbjct: 129 QGAPTPCNAVRVRDPARADVVFVPFFASLSFNRHSKVVPPARTSEDRTLQRRLIEFLAAR 188
Query: 129 WPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG 188
P W R+ G DH + H G + L ++ FG+ H N
Sbjct: 189 -PEWRRSGGRDHVVLAHHPNG----------MLDARYKLWPCVFVLCDFGRYPHSVANID 237
Query: 189 SITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGG------YYARGARAAV 242
I PY + R +YF+G Y +GG YY V
Sbjct: 238 KDVIAPYQHVVDDFLNDSTGYDDRPTLLYFQGAIYR----KDGGFIRQELYYLLKDEKDV 293
Query: 243 WENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 302
+F + ++ + M+ + FCL G P S RL +++V C+PVII+D+I
Sbjct: 294 HFSFGS-----VAGNGIEESTRGMRASKFCLNIAGDTPSSNRLFDSIVSHCVPVIISDEI 348
Query: 303 VLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQ 359
LPF D + + + + V D K L ++ I PE L ++ +
Sbjct: 349 ELPFEDMLDYSKFCIIVRGADAVKKGFLINLIKGISPEEWTSMWNKLRE--VEGHFEYQY 406
Query: 360 PAQPGDAFHQILNGLARKLP 379
P+QP DA I +ARK+P
Sbjct: 407 PSQPEDAVQMIWKTIARKVP 426
>gi|159477331|ref|XP_001696764.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158275093|gb|EDP00872.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 967
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 71/262 (27%), Positives = 104/262 (39%), Gaps = 45/262 (17%)
Query: 124 LISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR--- 180
L SS +W+R G DH F++ +D GAC+ QE +L R V G
Sbjct: 542 LDSSGKSFWDRRGGRDHIFMMLNDEGACWMPQEVYNTSI-VLTHWGRMDNVHVCGSAWGY 600
Query: 181 -----------------------NHVCLNEGS-ITIPPYAPPQKMQAHQI--PPDTPRSI 214
H C G + +P PP + + P R I
Sbjct: 601 DNYSAPLDSWKPYVDGDWRKEYDGHPCYTPGKDLVVPSLKPPSHYASSPLLGAPPLERDI 660
Query: 215 FVYFRGLFYDVNNDP-EGGYYARG-----ARAAVWENFKNNPLFDISTDH--PTTYYEDM 266
+Y RG + P +Y+RG A+ A N+ + I +Y E +
Sbjct: 661 LLYLRG-----DTGPYRAHWYSRGIRQRLAKLAYKHNWADKYRIYIGEGWQISGSYSEHL 715
Query: 267 QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK 326
R+ FC+ G WS R +A++ GCIP++I D + F + W+ + + EE V +
Sbjct: 716 ARSTFCVVAPG-DGWSARAEDAILHGCIPLVIMDGVHAVFESIVEWDAFAVRIREEAVNE 774
Query: 327 -LDTILTSIPPEVILRKQRLLA 347
L L S PE I R QR LA
Sbjct: 775 DLPKFLLSFSPEQIERMQRRLA 796
>gi|357508931|ref|XP_003624754.1| Secondary cell wall-related glycosyltransferase family [Medicago
truncatula]
gi|355499769|gb|AES80972.1| Secondary cell wall-related glycosyltransferase family [Medicago
truncatula]
Length = 510
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 71/309 (22%), Positives = 116/309 (37%), Gaps = 31/309 (10%)
Query: 82 VRTLNPEEADWFYTPIYPTCDL---TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGA 138
R N EAD + P + + + TG ++++ + N W R+ G
Sbjct: 187 TRVRNSTEADVIFVPFFSSLTYNRHSKTGPHERRSRNKVLQEKLVRYLMNQEEWKRSGGR 246
Query: 139 DHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPP 198
DH + H ++ + L ++ FG+ N I PY
Sbjct: 247 DHLILAHH----------PNSMLDARMKLWPATFILSDFGRYPPNIANVDKDVIAPYKHV 296
Query: 199 QKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN-----PLFD 253
R +YF+G Y +GGY AR ++ K
Sbjct: 297 IASYVDDQSTFDSRKTLLYFQGAIYR----KDGGY----ARQELFYLLKEEKDVHFSFGS 348
Query: 254 ISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE 313
+ M+ + FCL G P S RL +A+ C+PVII+D+I LP+ D + +
Sbjct: 349 VQKGGVRNATNGMRSSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDEIELPYEDVLDYS 408
Query: 314 EIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQI 370
+ +FV D K L + SI + R L +++ F P++ GDA I
Sbjct: 409 KFCVFVRTRDAVKKKYLINFIRSIGKDEWTRMWNRL--KEVEKFFEFQFPSKEGDAVEMI 466
Query: 371 LNGLARKLP 379
++RK+P
Sbjct: 467 WQAVSRKVP 475
>gi|299116802|emb|CBN74915.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 1052
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 76/304 (25%), Positives = 121/304 (39%), Gaps = 52/304 (17%)
Query: 38 GKLKVYVYELPSKYNKKLLL-------------KDPRCLTHMFAAEIFMHRFLLSSPVRT 84
GK+ V+VY+ + + L+ +D R EI++HR +L+ P R
Sbjct: 155 GKVSVFVYDGVPELDHSWLVPCYRQVRDGVSPWQDERADMAQDMGEIWLHRAMLAHPWRV 214
Query: 85 LNPEEADWFYTPIYPTCDLTPTGLPLPFKS-PRMMRSAIQLISSNWPYWNRTEGADHFFV 143
NPEEAD FY P+YP K+ ++ ++++ ++ + Y+ R GADH V
Sbjct: 215 ANPEEADLFYVPMYPVLSTKLGNNRCGGKTHDELINTSVEYLALSSVYFRRFGGADHTLV 274
Query: 144 VPHDFGACFHYQEEKAIERGILP-LLQRATLV----QTFGQRNHVCLNEGSITIPPYAPP 198
C + + A+ G P +L R T+V + C + +TIP A
Sbjct: 275 -------CAWWNCKSAL--GPKPRMLLRRTVVGINEKMLEWTRWGCGLDKMVTIPYTASS 325
Query: 199 QKMQAHQIPPDTP--RSIFVYFRGLFYD---------VNNDPEGGYYARGARAAVWENFK 247
+ I R I +F G V EG G + W
Sbjct: 326 VLTTSEMIGGRAAEDRDIPFFFVGTARGRPERQNLDVVTGMAEGSVMMLGDHQSDWGM-- 383
Query: 248 NNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI-VLPF 306
+ T Y + R+ FC CP G S R+ +AV GC P++ + VLPF
Sbjct: 384 ----------NSTQYAAHIARSRFCFCPRGDTESSRRIFDAVAAGCTPIVTEASVAVLPF 433
Query: 307 ADAI 310
++ +
Sbjct: 434 SEHV 437
>gi|356497977|ref|XP_003517832.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
[Glycine max]
Length = 509
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 86/378 (22%), Positives = 135/378 (35%), Gaps = 58/378 (15%)
Query: 40 LKVYVYELPSKYNKKLL-----------------------LKDPRCLTHMFAAEIFMHRF 76
LKV++Y+LP +++ LL L + +I F
Sbjct: 119 LKVFMYDLPPEFHFGLLDWKPSGNSVWPDVRTNIPGYPGGLNLQHSIEFWLTLDILASEF 178
Query: 77 LLSSPVRTL----NPEEADWFYTPIYPTCDLTPTGLPLPF---KSPRMMRSAIQLISSNW 129
+S RT+ N EAD + P + + P ++++ +
Sbjct: 179 PQASKARTVIRVQNSSEADIIFVPFFSSLSYNRYSKSKPHVKKSKNKILQEKLVTYLMAQ 238
Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGS 189
W R+ G DH + H ++ + L ++ FG+ N
Sbjct: 239 EEWKRSGGKDHLILAHH----------PNSMLDARMKLWPATFILSDFGRYPPNIANVEK 288
Query: 190 ITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN 249
I PY + R +YF+G Y +GG AR ++ K+
Sbjct: 289 DVIAPYKHLISSYVNDNSNFDSRPTLLYFQGAIY----RKDGGL----ARQELFYLLKDE 340
Query: 250 -----PLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL 304
I D E M+ + FCL G P S RL +A+ C+PVII+D I L
Sbjct: 341 KDVHFSFGSIGKDGIKKATEGMRASKFCLNIAGDTPSSNRLFDAIASHCVPVIISDKIEL 400
Query: 305 PFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPA 361
P+ D I + E +FV D K L + I E R L ++ F P+
Sbjct: 401 PYEDVIDYSEFCIFVRTSDAIKEKFLINFIRGIAKEEWTRMWNKL--KEVEHFFEFHFPS 458
Query: 362 QPGDAFHQILNGLARKLP 379
+ DA I +ARK+P
Sbjct: 459 KENDAVQMIWQAVARKVP 476
>gi|359359070|gb|AEV40977.1| putative exostosin family domain-containing protein [Oryza
punctata]
Length = 459
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 77/294 (26%), Positives = 129/294 (43%), Gaps = 34/294 (11%)
Query: 119 RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKA------IERGILPLLQRAT 172
R + ++++ P W R+ G DH FV+ D A +H ++E A ++ G L +
Sbjct: 166 REVVDRVTAH-PAWRRSGGRDHVFVLT-DPVAMWHVRKEIAPAILLVVDFGGWYKLDSNS 223
Query: 173 LVQTFGQR-NHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEG 231
F H ++ I PY + Q+ + R+ +YF+G + G
Sbjct: 224 ASSNFSHMIQHTQVSLLKDVIVPYT--HLLPTMQLSENKERTTLLYFKG----AKHRHRG 277
Query: 232 GYYARGARAAVWENFKNNPLFDISTDHPTTYYED-----MQRAIFCLCPLGWAPWSPRLV 286
G R +W+ N P + +P + M+ + FCL P G P S RL
Sbjct: 278 GL----VREKLWDLMVNEPDVVMEEGYPNATGREQSIKGMRTSEFCLHPAGDTPTSCRLF 333
Query: 287 EAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDV--PK-LDTILTSIPPEVILRKQ 343
+AV CIPVI++D+I LPF I + E +FV+ + PK L L ++P + +
Sbjct: 334 DAVASLCIPVIVSDEIELPFEGMIDYTEFVIFVSVSNAMRPKWLTNYLRNVPRQKKDEFR 393
Query: 344 RLLAN--PSMKRAMLFP----QPAQPGDAFHQILNGLARKLPHDKSTYLKPGGK 391
R +A+ P + ++P AQ G A + I + +KLP + ++ K
Sbjct: 394 RNMAHVQPIFEYDSIYPGRMTSAAQDG-AVNHIWKKIHQKLPMIQEAVIREKRK 446
>gi|297833038|ref|XP_002884401.1| hypothetical protein ARALYDRAFT_477609 [Arabidopsis lyrata subsp.
lyrata]
gi|297330241|gb|EFH60660.1| hypothetical protein ARALYDRAFT_477609 [Arabidopsis lyrata subsp.
lyrata]
Length = 495
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 77/322 (23%), Positives = 119/322 (36%), Gaps = 32/322 (9%)
Query: 80 SPVRTLNPEEADWFYTPIYPTCD---LTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTE 136
+ VR N EAD + P + + + + ++ + ++ W +
Sbjct: 192 AAVRVKNSSEADVVFVPFFSSLSYNRFSKVNQKQKKSQDKELQVNVVKYVTSQKEWKISG 251
Query: 137 GADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYA 196
G DH + H ++ L +V FG+ + N + PY
Sbjct: 252 GKDHVIMAHH----------PNSMSTARHKLYPAMFVVADFGRYSPHVANIDKDIVAPYK 301
Query: 197 PPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDIST 256
A+ R I +YF+G Y GG+ + V +F + +
Sbjct: 302 HLVPSYANDTSGFDGRPILLYFQGAIYR----KAGGFVRQELYKDVHFSFGS-----VRN 352
Query: 257 DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG 316
T E M+ + FCL G P S RL +A+ CIPVII+DDI LP+ D + + E
Sbjct: 353 HGITKAGEGMRSSKFCLNIAGDTPSSNRLFDAIASHCIPVIISDDIELPYEDVLNYNEFC 412
Query: 317 MFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD-AFHQILN 372
+FV D K L ++ SI + + L + FP GD A I
Sbjct: 413 LFVRSSDALKKGFLMGLVKSIGRDEYNKMWLRLKEVERYFDLRFPTKDDEGDYAVQMIWE 472
Query: 373 GLARKLP------HDKSTYLKP 388
+ARK P H Y KP
Sbjct: 473 AVARKAPLVKMKVHRFQRYTKP 494
>gi|5882750|gb|AAD55303.1|AC008263_34 F25A4.34 [Arabidopsis thaliana]
gi|12324821|gb|AAG52383.1|AC011765_35 unknown protein; 115857-117304 [Arabidopsis thaliana]
Length = 458
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 78/329 (23%), Positives = 127/329 (38%), Gaps = 40/329 (12%)
Query: 79 SSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSP---RMMRSAIQLISSNWPYWNRT 135
S+ +R N EAD + P + + ++ R+++ + + W R
Sbjct: 142 SAAIRVKNSNEADIVFVPFFASLSYNRKSKLRGNETSSDDRLLQERLVEFLKSQDEWKRF 201
Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQT-FGQRNHVCLNEGSITIPP 194
+G DH V H + L A V + FG+ + N I P
Sbjct: 202 DGKDHLIVAHHPNSLLYARN-----------FLGSAMFVLSDFGRYSSAIANLEKDIIAP 250
Query: 195 YAPPQK-MQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNP--- 250
Y K + ++ R + YF+G Y +GG R ++ K+
Sbjct: 251 YVHVVKTISNNESASFEKRPVLAYFQGAIYR----KDGG----TIRQELYNLLKDEKDVH 302
Query: 251 --LFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
+ + + M + FCL G P S RL +A+V C+PVII+D I LPF D
Sbjct: 303 FAFGTVRGNGTKQTGKGMASSKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDQIELPFED 362
Query: 309 AIPWEEIGMFV-AEEDVPK--LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
+ + +FV A E V K L IL I + +K L + + P+Q GD
Sbjct: 363 TLDYSGFSVFVHASEAVKKEFLVNILRGITEDQWKKKWGRL--KEVAGCFEYRFPSQVGD 420
Query: 366 AFHQILNGLARKLP------HDKSTYLKP 388
+ + I + ++ KL H K+ Y +
Sbjct: 421 SVNMIWSAVSHKLSSLQFDVHRKNRYRRS 449
>gi|326525254|dbj|BAK07897.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 440
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 77/309 (24%), Positives = 126/309 (40%), Gaps = 29/309 (9%)
Query: 82 VRTLNPEEADWFYTPIYPTCDLTPTGLPLP---FKSPRMMRSAIQLISSNWPYWNRTEGA 138
VR +P A+ F+ P + + G + ++ R+++ + I YW R+ G
Sbjct: 132 VRVRDPAAAEAFFVPFFSSLSFNVHGRNMTDPDTEADRLLQVELMDILGKSEYWQRSAGR 191
Query: 139 DHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAP- 197
DH V+P F + + + +L +V FG+ + + PY
Sbjct: 192 DH--VIPMHHPNAFRFMRD-MVNASVL-------IVSDFGRYTKELASLRKDVVAPYVHV 241
Query: 198 -PQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDIS- 255
+ + P ++FRG EG RG A + ++ ++ F+ S
Sbjct: 242 VDSFLDDNASDPFEADPTLLFFRGRPV---RKAEGKI--RGKLAKILKD-RDGVRFEDSL 295
Query: 256 --TDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE 313
D + M+ + FCL P G P S RL +A+V CIPVII+ I LPF D I +
Sbjct: 296 AIGDGIKISTDGMRSSKFCLHPAGDTPSSCRLFDAIVSHCIPVIISSRIELPFEDEIDYS 355
Query: 314 EIGMFVAEEDVPKLDTILTSI---PPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQI 370
E F + E+ + D +L + P E + L N S F P + DA + I
Sbjct: 356 EFSPFFSVEEALEPDYLLNQLRQMPKEKWVEMWSKLKNVSSHYE--FQYPPRKDDAVNMI 413
Query: 371 LNGLARKLP 379
+ K+P
Sbjct: 414 WRHVRHKIP 422
>gi|18410670|ref|NP_565089.1| exostosin-like protein [Arabidopsis thaliana]
gi|16209709|gb|AAL14411.1| At1g74680/F1M20_36 [Arabidopsis thaliana]
gi|27363220|gb|AAO11529.1| At1g74680/F1M20_36 [Arabidopsis thaliana]
gi|332197500|gb|AEE35621.1| exostosin-like protein [Arabidopsis thaliana]
Length = 461
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 78/329 (23%), Positives = 127/329 (38%), Gaps = 40/329 (12%)
Query: 79 SSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSP---RMMRSAIQLISSNWPYWNRT 135
S+ +R N EAD + P + + ++ R+++ + + W R
Sbjct: 145 SAAIRVKNSNEADIVFVPFFASLSYNRKSKLRGNETSSDDRLLQERLVEFLKSQDEWKRF 204
Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQT-FGQRNHVCLNEGSITIPP 194
+G DH V H + L A V + FG+ + N I P
Sbjct: 205 DGKDHLIVAHHPNSLLYARN-----------FLGSAMFVLSDFGRYSSAIANLEKDIIAP 253
Query: 195 YAPPQK-MQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNP--- 250
Y K + ++ R + YF+G Y +GG R ++ K+
Sbjct: 254 YVHVVKTISNNESASFEKRPVLAYFQGAIYR----KDGG----TIRQELYNLLKDEKDVH 305
Query: 251 --LFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
+ + + M + FCL G P S RL +A+V C+PVII+D I LPF D
Sbjct: 306 FAFGTVRGNGTKQTGKGMASSKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDQIELPFED 365
Query: 309 AIPWEEIGMFV-AEEDVPK--LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
+ + +FV A E V K L IL I + +K L + + P+Q GD
Sbjct: 366 TLDYSGFSVFVHASEAVKKEFLVNILRGITEDQWKKKWGRL--KEVAGCFEYRFPSQVGD 423
Query: 366 AFHQILNGLARKLP------HDKSTYLKP 388
+ + I + ++ KL H K+ Y +
Sbjct: 424 SVNMIWSAVSHKLSSLQFDVHRKNRYRRS 452
>gi|302832868|ref|XP_002947998.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300266800|gb|EFJ50986.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 638
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 78/322 (24%), Positives = 124/322 (38%), Gaps = 62/322 (19%)
Query: 41 KVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPT 100
++YVY++P K++ R +F +F R L S V + EEADWF+ P+
Sbjct: 150 RIYVYDIPHKFSS--WYNPTRVDRELFW--VFWERLLGSGAV-VADGEEADWFWLPVK-- 202
Query: 101 CDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAI 160
L R++ AIQ + + WP+++R +G HF + D G +
Sbjct: 203 -------LRSTTDGYRLL-EAIQYVRTEWPWYDRLQGHRHFVIHTGDTG-------RGEV 247
Query: 161 ERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYF-- 218
R I T + +G + AH+ P ++FV
Sbjct: 248 AREIRDATANMTWLHHWG------------LWEDWNASGWKAAHR-PGKIGLALFVSRCG 294
Query: 219 -RGLFYDVNNDPEG--GYYA-----------RGARAAVWENFKNNPLFDISTDHPTTYYE 264
GL D G G+ A R AV + N + I T +
Sbjct: 295 DSGLLGSPTRDVGGVDGWAAPPGAEIAEDGHTAVRGAVHYHHHNRTGYKIVTGDSRYPLD 354
Query: 265 DMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDV 324
+ C C L V GC+P+II+D ++ PF + W+ IG+ +A ED+
Sbjct: 355 LLTYKYVCDCVL-----------VSVMGCLPLIISDSVMQPFEPEMDWDRIGLRLAHEDI 403
Query: 325 PKLDTILTSIPPEVILRKQRLL 346
P L L +I E + R++ L
Sbjct: 404 PTLHERLAAISDEELDRRRAAL 425
>gi|255547405|ref|XP_002514760.1| catalytic, putative [Ricinus communis]
gi|223546364|gb|EEF47866.1| catalytic, putative [Ricinus communis]
Length = 490
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 96/388 (24%), Positives = 146/388 (37%), Gaps = 73/388 (18%)
Query: 38 GKLKVYVYELPSKYNKKLL------------LKD-------PRCLTHMFAAEIFMHRFLL 78
G LKVY+Y++P +++ LL + D P L + E ++ LL
Sbjct: 110 GLLKVYMYDMPPEFHFGLLGWKGKANQIWPNVDDLDHIPLYPGGLNLQHSIEYWLTLDLL 169
Query: 79 SS----------PVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSP----RMMRSAIQL 124
+S VR N +AD + P + + L K +M+++ +
Sbjct: 170 ASNRPKVVRPCGAVRVDNSSQADIIFVPYFSSLSYNRHS-KLHGKEKVSMNKMLQNRLVE 228
Query: 125 ISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVC 184
W R+ G DH V H+ R +L ++ FG+
Sbjct: 229 FLMGQDEWKRSGGRDHLIVA--------HHPNSMLDARKMLG--AAMFVLADFGRYPVEI 278
Query: 185 LNEGSITIPPYAPPQKMQAHQIPPDTP-----RSIFVYFRGLFYDVNNDPEGG------Y 233
N I PY K IP R I V+F+G Y +GG Y
Sbjct: 279 ANLKKDVIAPY----KHVVRTIPSGESAQFEERPILVFFQGAIYR----KDGGIIRQELY 330
Query: 234 YARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGC 293
Y V F + + + M + FCL G P S RL +A+V C
Sbjct: 331 YLLKDEKDVHFTFGT-----VRKNGVNKAGQGMASSKFCLNIAGDTPSSNRLFDAIVSHC 385
Query: 294 IPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKR 353
+PVII+DDI LPF D + + E +FV D K +L + + I R + + +K
Sbjct: 386 VPVIISDDIELPFEDVLDYSEFSVFVRASDAVKEGYLLNLL--QSIDRDKWTMMWERLKE 443
Query: 354 A---MLFPQPAQPGDAFHQILNGLARKL 378
+ P+Q GDA I ++RKL
Sbjct: 444 IAPHFEYQYPSQSGDAVDMIWQAVSRKL 471
>gi|15228598|ref|NP_187015.1| exostosin-like protein [Arabidopsis thaliana]
gi|6091757|gb|AAF03467.1|AC009327_6 hypothetical protein [Arabidopsis thaliana]
gi|30102722|gb|AAP21279.1| At3g03650 [Arabidopsis thaliana]
gi|110736553|dbj|BAF00242.1| hypothetical protein [Arabidopsis thaliana]
gi|332640446|gb|AEE73967.1| exostosin-like protein [Arabidopsis thaliana]
Length = 499
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 72/308 (23%), Positives = 115/308 (37%), Gaps = 22/308 (7%)
Query: 80 SPVRTLNPEEADWFYTPIYPTCD---LTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTE 136
+ +R N EAD + P + + + + ++ + ++ W +
Sbjct: 190 AAIRVKNSSEADVVFVPFFSSLSYNRFSKVNQKQKKSQDKELQENVVKYVTSQKEWKTSG 249
Query: 137 GADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYA 196
G DH + H ++ L +V FG+ + N + PY
Sbjct: 250 GKDHVIMAHH----------PNSMSTARHKLFPAMFVVADFGRYSPHVANVDKDIVAPYK 299
Query: 197 PPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDIST 256
+ R I +YF+G Y GG+ + + E + F
Sbjct: 300 HLVPSYVNDTSGFDGRPILLYFQGAIYR----KAGGFVRQELYNLLKEEKDVHFSFGSVR 355
Query: 257 DHPTTYY-EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEI 315
+H + E M+ + FCL G P S RL +A+ CIPVII+DDI LP+ D + + E
Sbjct: 356 NHGISKAGEGMRSSKFCLNIAGDTPSSNRLFDAIASHCIPVIISDDIELPYEDVLNYNEF 415
Query: 316 GMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD-AFHQIL 371
+FV D K L ++ SI E + L + FP GD A I
Sbjct: 416 CLFVRSSDALKKGFLMGLVRSIGREEYNKMWLRLKEVERYFDLRFPVKDDEGDYAVQMIW 475
Query: 372 NGLARKLP 379
+ARK P
Sbjct: 476 KAVARKAP 483
>gi|219363227|ref|NP_001136572.1| hypothetical protein precursor [Zea mays]
gi|194696226|gb|ACF82197.1| unknown [Zea mays]
gi|414866559|tpg|DAA45116.1| TPA: hypothetical protein ZEAMMB73_757616 [Zea mays]
Length = 453
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 86/381 (22%), Positives = 150/381 (39%), Gaps = 63/381 (16%)
Query: 40 LKVYVYELPSKYNKKLL--------------LKD-----PRCLTHMFAAEIFMHRFLLSS 80
L+V++Y+LP +++ LL ++D P L + E ++ LL+S
Sbjct: 60 LRVFMYDLPPEFHFGLLDWKPPGFGGGVWPDVRDGVPDYPGGLNLQHSIEYWLTLDLLAS 119
Query: 81 ---------PVRTLNPEEADWFYTPIYPTCDLTPTGLPLP----FKSPRMMRSAIQLISS 127
R + +AD + P + + +P + + R ++ +++
Sbjct: 120 EQGAPTPCAAARVRHAADADVVFVPFFASLSFNRHSRVVPPARNSEDRALQRRLLEFLAA 179
Query: 128 NWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQR----ATLVQTFGQRNHV 183
P W RT G DH + H G+L R ++ FG+
Sbjct: 180 R-PEWRRTGGRDHVVLAHH--------------PNGMLDARYRFWPCVFVLCDFGRYPPS 224
Query: 184 CLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFY--DVNNDPEGGYYARGARAA 241
N I PY A+ R +YF+G Y D + + YY
Sbjct: 225 VANLDKDIIAPYRHLVANFANDTAGYDDRPTLLYFQGAIYRKDGGSIRQELYYLLKDEKD 284
Query: 242 VWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 301
V +F + ++ + + M+ + FCL G P S RL +++V C+PVII+D+
Sbjct: 285 VHFSFGS-----VAGNGIEQATQGMRSSKFCLNIAGDTPSSNRLFDSIVSHCVPVIISDE 339
Query: 302 IVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFP 358
I LPF D + + + + V D K L +++ I E R L +++ +
Sbjct: 340 IELPFEDVLDYSKFSVIVRGADAVKKGFLKSLIKGISQEEWTRMWNKLKE--VEKHFEYQ 397
Query: 359 QPAQPGDAFHQILNGLARKLP 379
P+Q DA I +ARK+P
Sbjct: 398 YPSQTDDAVQMIWKAIARKVP 418
>gi|15241619|ref|NP_199306.1| Exostosin family protein [Arabidopsis thaliana]
gi|30694651|ref|NP_851132.1| Exostosin family protein [Arabidopsis thaliana]
gi|10177484|dbj|BAB10875.1| unnamed protein product [Arabidopsis thaliana]
gi|15081733|gb|AAK82521.1| AT5g44930/K21C13_11 [Arabidopsis thaliana]
gi|27363262|gb|AAO11550.1| At5g44930/K21C13_11 [Arabidopsis thaliana]
gi|332007793|gb|AED95176.1| Exostosin family protein [Arabidopsis thaliana]
gi|332007794|gb|AED95177.1| Exostosin family protein [Arabidopsis thaliana]
Length = 443
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 98/420 (23%), Positives = 168/420 (40%), Gaps = 75/420 (17%)
Query: 3 VTAIALLLAFVSADNTPKIERISGSAGDVLEDDPVG--------KLKVYVYELPSKYNKK 54
V ++ L+ FV+ P SGS L D K KVY+YELP+ +
Sbjct: 20 VLSVFLVFVFVNTFFYPSFYSDSGSIRRNLVDSRESFHFPGNFRKTKVYMYELPTNFTYG 79
Query: 55 LLL-----KDPRCLTHMFAAEIFMHRFLLSSP-------------VRTLNPEEADWFYTP 96
++ K + MH + L S VR +P EAD FY
Sbjct: 80 VIEQHGGEKSDDVTGLKYPGHQHMHEWYLYSDLTRPEVKRVGSPIVRVFDPAEADLFYVS 139
Query: 97 IYPTCDL-TPTGLP-LPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHY 154
+ + L +G P + M S + + S +W R G DH V
Sbjct: 140 AFSSLSLIVDSGRPGFGYSDEEMQESLVSWLESQ-EWWRRNNGRDHVIVAG--------- 189
Query: 155 QEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSIT---IPPYAPPQKMQAHQIPPDTP 211
+ A++R ++ ++ A L+ T R + ++GS+ I PY+ ++ A++
Sbjct: 190 -DPNALKR-VMDRVKNAVLLVTDFDR--LRADQGSLVKDVIIPYS--HRIDAYEGELGVK 243
Query: 212 -RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAI 270
R+ ++F G Y +GG R +++ + D+ T E+M RA+
Sbjct: 244 QRTNLLFFMGNRYR----KDGG----KVRDLLFKLLEKEE--DVVIKRGTQSRENM-RAV 292
Query: 271 --------FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
FCL G + RL +A+ C+PVI++D I LPF D I + + +F+ +
Sbjct: 293 KQGMHTSKFCLHLAGDTSSACRLFDAIASLCVPVIVSDGIELPFEDVIDYRKFSIFLRRD 352
Query: 323 DVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
K + L + P IL+ Q++ MK + + ++I + +K+P
Sbjct: 353 AALKPGFVVKKLRKVKPGKILKYQKV-----MKEVRRYFDYTHLNGSVNEIWRQVTKKIP 407
>gi|159471277|ref|XP_001693783.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158283286|gb|EDP09037.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 611
Score = 71.2 bits (173), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 78/344 (22%), Positives = 134/344 (38%), Gaps = 72/344 (20%)
Query: 41 KVYVYELPSKYN--KKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
+VY+Y+LP + + + D + H L +S R + +ADW++ P+
Sbjct: 157 RVYIYDLPPELTTWRSDINLDRWTIRHFL-------EMLTASGARVADGAKADWYFIPVR 209
Query: 99 PTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
+++ AI + P++N T G HF + D G
Sbjct: 210 LRSS----------SDAYVLQRAIHHLRHAHPWFNATGGGRHFVIAVGDMG-------RL 252
Query: 159 AIERGILPLLQRATLVQTFG---------------QRNHVCLNEGSITIPPYAPPQKMQA 203
ERG PL T V +G + +H N I +P Y +K++
Sbjct: 253 ESERG--PLSANVTFVSHWGLASSKKAERLNSSPWRASH--RNATDIVLPVYISLRKLEK 308
Query: 204 HQI-------------PPDTPRS---IFVYFRGLFYDVN---------NDPEGGYYARGA 238
+ I PP+ +F + + D + N P Y+
Sbjct: 309 YGITRSRHHPKFATMAPPEIRERNGPLFWFAGRICQDRSKPATDGVWPNCPGAMGYSAMT 368
Query: 239 RAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 298
R AV + + P + ++ Y + + FC +G R ++A + GC+PV+I
Sbjct: 369 RQAVHFHHWDRPGYFVAPG-DKNYSRHLLTSKFCFGAMGGG-HGQRQLQAALAGCVPVVI 426
Query: 299 ADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRK 342
D ++ + + W + G+ VAE D+P+L TIL +I PE RK
Sbjct: 427 GDGVLEAWEPYLDWNDFGVRVAEADIPRLHTILGAIGPEEYARK 470
>gi|159483551|ref|XP_001699824.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158281766|gb|EDP07520.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 740
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 44/133 (33%), Positives = 66/133 (49%), Gaps = 8/133 (6%)
Query: 248 NNPLF--DISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLP 305
N P F I+ H T M + FCL P GW W RL+EAV GC+PV++ D + P
Sbjct: 596 NRPDFVLSIAGRHSQTL---MASSRFCLAPSGWG-WGVRLLEAVACGCVPVVVQDQVYQP 651
Query: 306 FADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
D +P++E + + + +L +L ++ P + Q LA RA L+ + G
Sbjct: 652 LWDVVPYDEFAVVLPRSQLHRLPQLLDAVGPGQLAALQAGLAR--WHRAFLYRHHSPSGL 709
Query: 366 AFHQILNGLARKL 378
AF+ L L R+L
Sbjct: 710 AFNYTLAALRRRL 722
>gi|255075893|ref|XP_002501621.1| hypothetical protein MICPUN_99882 [Micromonas sp. RCC299]
gi|226516885|gb|ACO62879.1| hypothetical protein MICPUN_99882 [Micromonas sp. RCC299]
Length = 918
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 51/179 (28%), Positives = 81/179 (45%), Gaps = 50/179 (27%)
Query: 42 VYVYELPSKYNKKLL--LKDPR-CLT-------------HMFAAEIFMHRFLLSSPVRTL 85
+YVY+L K+ L +D R C+T +++ AE+ +H LL SP RT
Sbjct: 419 IYVYDLLPKFTTAQLQHRQDVRKCVTRFAEEGNATRFEDNLYGAEVALHELLLDSPHRTD 478
Query: 86 NPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSA-----------------------I 122
NPE AD+F+ P+Y C ++ P P S ++ +
Sbjct: 479 NPEIADFFFVPMYHFCFISRLQQPTPGHSQQLFSRTRGVGCDLRGSHVDAAFQHLFVPVL 538
Query: 123 QLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRN 181
+ + ++P+WNRT+GADH HD GAC+ +E ATL+ +G+R+
Sbjct: 539 EHLRRDYPWWNRTDGADHIVPFLHDEGACYAPRE-----------FGDATLLVHWGRRD 586
>gi|356516800|ref|XP_003527081.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
[Glycine max]
Length = 493
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 94/399 (23%), Positives = 150/399 (37%), Gaps = 78/399 (19%)
Query: 40 LKVYVYELPSKY-------------------NKKLLLKDPRCLTHMFAAEIFMHRFLLSS 80
LKV++Y+LP ++ N K + + P L + E ++ LLSS
Sbjct: 114 LKVFMYDLPPEFHFGLLGWKRSVNQTWPEVNNPKRIPRYPGGLNLQHSMEYWLTLDLLSS 173
Query: 81 PV-------RTLNPEEADWFYTPIYPTCDL---TPTGLPLPFKSPRMMRSAIQLISSNWP 130
V R + +AD + P + + + +M++ +
Sbjct: 174 KVGQPCTAIRVQDSSQADVIFVPFFSSLSYNRHSKLNGEEKVSLNKMLQDRLVQFLMGQK 233
Query: 131 YWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSI 190
W R+ G DH V H ++ L ++ FG+ N
Sbjct: 234 EWKRSGGKDHLIVAHH----------PNSLLDARRKLGAAMLVLADFGRYPTELANIKKD 283
Query: 191 TIPPYA------PPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGG------YYARGA 238
I PY P K + + R+ VYF+G Y +GG YY
Sbjct: 284 IIAPYRHLVSTIPKAKSASFE-----KRTTLVYFQGAIYR----KDGGAIRQELYYLLKD 334
Query: 239 RAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 298
V F + I + + M + FCL G P S RL +A+V C+PVII
Sbjct: 335 EKDVHFTFGS-----IGGNGINQASQGMAMSKFCLNIAGDTPSSNRLFDAIVSHCVPVII 389
Query: 299 ADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRK-QRLLANPSMKRA 354
+D+I LPF D + + + +FV D K L +L SI + + +RL +
Sbjct: 390 SDEIELPFEDVLDYSDFSIFVRASDSMKKGYLLNLLRSITQKEWSKMWERL---KQITHH 446
Query: 355 MLFPQPAQPGDAFHQILNGLARKLP------HDKSTYLK 387
+ P+QPGDA + I + RK+ H K+ Y +
Sbjct: 447 FEYQYPSQPGDAVNMIWQQVERKISSIRFNLHRKNRYQR 485
>gi|414587899|tpg|DAA38470.1| TPA: hypothetical protein ZEAMMB73_317055 [Zea mays]
Length = 326
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 38/74 (51%), Positives = 50/74 (67%), Gaps = 11/74 (14%)
Query: 20 KIERISGSAGDV---------LEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMF--A 68
+IE+I S ++ LED+P+GKLKVYV++L SKYNKKLL KDPRCL HMF +
Sbjct: 130 RIEKILKSGANIVLTTKRINGLEDNPIGKLKVYVHDLLSKYNKKLLQKDPRCLNHMFHVS 189
Query: 69 AEIFMHRFLLSSPV 82
+I LLS+P+
Sbjct: 190 QKISFSCPLLSAPI 203
>gi|224145217|ref|XP_002325567.1| predicted protein [Populus trichocarpa]
gi|222862442|gb|EEE99948.1| predicted protein [Populus trichocarpa]
Length = 476
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 88/350 (25%), Positives = 135/350 (38%), Gaps = 66/350 (18%)
Query: 83 RTLNPEEADWFYTPIYPT--CDLTPTGLPLPFKSP------RMMRSAIQLISSNWPYWNR 134
R + EAD + P + T ++ F+ R + + ++ N W R
Sbjct: 139 RVFDCNEADVVFVPFFATLSAEMELAKGKGSFRRKEGNEDYRRQKQVVDIVR-NSDAWKR 197
Query: 135 TEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG---------------- 178
+ G DH FV+ D A +H + E I IL LV FG
Sbjct: 198 SGGKDHVFVLT-DPVAMWHLRAE--IAPAIL-------LVVDFGGWYRLDSKSSNGSSSD 247
Query: 179 --QRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYAR 236
Q V L + I PY + Q+ + RS +YF+G + GG
Sbjct: 248 MIQHTQVSLLKDVIV--PYT--HLLPRLQLSENKKRSTLLYFKG----AKHRHRGGI--- 296
Query: 237 GARAAVWENFKNNPLFDISTDHPTTYYED-----MQRAIFCLCPLGWAPWSPRLVEAVVF 291
R +W+ N P I P + M+ + FCL P G P S RL +A+
Sbjct: 297 -VREKLWDLLVNEPGVIIEEGFPNATGREQSIRGMRSSEFCLHPAGDTPSSCRLFDAIQS 355
Query: 292 GCIPVIIADDIVLPFADAIPWEEIGMFVAEED-------VPKLDTILTSIPPEVILRKQR 344
CIPV+++D+I LPF + + E +FVA +D V +L +I S+ R+
Sbjct: 356 LCIPVVVSDNIELPFEGMVDYTEFAVFVAVDDALKPRWLVDRLRSI--SVKQRNEFRRNM 413
Query: 345 LLANPSMKRAMLFP---QPAQPGDAFHQILNGLARKLPHDKSTYLKPGGK 391
P ++ P P P A + I + +KLP K ++ K
Sbjct: 414 AKVQPILQYDNGHPGGIGPISPDGAVNHIWKKVLQKLPAIKEAVVRERRK 463
>gi|359359217|gb|AEV41121.1| putative exostosin family domain-containing protein [Oryza
officinalis]
Length = 460
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 77/288 (26%), Positives = 124/288 (43%), Gaps = 40/288 (13%)
Query: 116 RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKA------IERG---ILP 166
R R + ++++ P W R+ G DH FV+ D A +H ++E A ++ G L
Sbjct: 164 RRQREVVDRVTAH-PAWRRSGGRDHVFVLT-DPVAMWHVRKEIAPSILLVVDFGGWYKLD 221
Query: 167 LLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVN 226
++ V Q V L + I PY + Q+ + R +YF+G
Sbjct: 222 SNSASSNVSHMIQHTQVSLLKDVIV--PYT--HLLPTMQLSENKDRLTLLYFKG----AK 273
Query: 227 NDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYED-----MQRAIFCLCPLGWAPW 281
+ GG R +W+ N P + +P + M+ + FCL P G P
Sbjct: 274 HRHRGGL----VREKLWDLMVNEPDVVMEEGYPNATGREQSIKGMRTSEFCLHPAGDTPT 329
Query: 282 SPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEV 338
S RL +AV CIPVI++D+I LPF I + E +FV+ + + L L ++P +
Sbjct: 330 SCRLFDAVASLCIPVIVSDEIELPFEGMIDYTEFTIFVSVSNAMRPKWLTNYLRNVPRQQ 389
Query: 339 -------ILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
+ R Q + S+ R + AQ G A + I + +KLP
Sbjct: 390 KDEFRRNMARVQPIFEYDSIYRGRM-TSAAQDG-AVNHIWKKIYQKLP 435
>gi|359359170|gb|AEV41075.1| putative exostosin family domain-containing protein [Oryza minuta]
Length = 317
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 77/288 (26%), Positives = 124/288 (43%), Gaps = 40/288 (13%)
Query: 116 RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKA------IERG---ILP 166
R R + ++++ P W R+ G DH FV+ D A +H ++E A ++ G L
Sbjct: 21 RRQREVVDRVTAH-PAWRRSGGRDHVFVLT-DPVAMWHVRKEIAPSILLVVDFGGWYKLD 78
Query: 167 LLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVN 226
++ V Q V L + I PY + Q+ + R +YF+G
Sbjct: 79 SNSASSNVSHMIQHTQVSLLKDVIV--PYT--HLLPTMQLSENKDRLTLLYFKG----AK 130
Query: 227 NDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYED-----MQRAIFCLCPLGWAPW 281
+ GG R +W+ N P + +P + M+ + FCL P G P
Sbjct: 131 HRHRGGL----VREKLWDLMVNEPDVVMEEGYPNATGREQSIKGMRTSEFCLHPAGDTPT 186
Query: 282 SPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEV 338
S RL +AV CIPVI++D+I LPF I + E +FV+ + + L L ++P +
Sbjct: 187 SCRLFDAVASLCIPVIVSDEIELPFEGMIDYTEFTIFVSVSNAMRPKWLTNYLRNVPRQQ 246
Query: 339 -------ILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
+ R Q + S+ R + AQ G A + I + +KLP
Sbjct: 247 KDEFRRNMARVQPIFEYDSIYRGRM-TSAAQDG-AVNHIWKKIYQKLP 292
>gi|359359122|gb|AEV41028.1| putative exostosin family domain-containing protein [Oryza minuta]
Length = 459
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 64/232 (27%), Positives = 103/232 (44%), Gaps = 27/232 (11%)
Query: 119 RSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKA------IERGILPLLQRAT 172
R + ++++ P W R+ G DH FV+ D A +H ++E A ++ G L +
Sbjct: 166 REVVDRVTAH-PAWRRSGGRDHVFVLT-DPVAMWHVRKEIAPAILLVVDFGGWYKLDSNS 223
Query: 173 LVQTFGQR-NHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEG 231
F H ++ I PY + Q+ + R+ +YF+G + G
Sbjct: 224 ASSNFSHMIQHTQVSLLKDVIVPYT--HLLPTMQLSENKERTTLLYFKG----AKHRHRG 277
Query: 232 GYYARGARAAVWENFKNNPLFDISTDHPTTYYED-----MQRAIFCLCPLGWAPWSPRLV 286
G R +W+ N P + +P + M+ + FCL P G P S RL
Sbjct: 278 GL----VREKLWDLMVNEPDVVMEEGYPNATGREQSIKGMRTSEFCLHPAGDTPTSCRLF 333
Query: 287 EAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDV--PK-LDTILTSIP 335
+AV CIPVI++D+I LPF I + E +FV+ + PK L L ++P
Sbjct: 334 DAVASLCIPVIVSDEIELPFEGMIDYTEFVIFVSVSNAMRPKWLTNYLRNVP 385
>gi|357112427|ref|XP_003558010.1| PREDICTED: probable glycosyltransferase At3g07620-like
[Brachypodium distachyon]
Length = 464
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 73/313 (23%), Positives = 124/313 (39%), Gaps = 35/313 (11%)
Query: 80 SPVRTLNPEEADWFYTPIYPTCDLTPTGLPLP----FKSPRMMRSAIQLISSNWPYWNRT 135
+ VR +P AD + P + + +P + + R I+ +++ P W R+
Sbjct: 139 NAVRVRDPARADVVFVPFFASLSFNRHSKVVPPARTSEDRALQRRLIEFLAAR-PEWRRS 197
Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY 195
G DH + H G + L ++ FG+ H N I PY
Sbjct: 198 GGRDHVVLAHHPNG----------MLDARYKLWPCVFVLCDFGRYPHSVANIDKDVIAPY 247
Query: 196 APPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGG------YYARGARAAVWENFKNN 249
+ R +YF+G Y +GG YY V +F +
Sbjct: 248 LHVVGNFFNDSAGYDARPTLLYFQGAIYR----KDGGFIRQELYYLLKDEKDVHFSFGS- 302
Query: 250 PLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 309
++ + + M+ + FCL G P S RL +++V C+P+II+D+I LPF D
Sbjct: 303 ----VAGNGIEQSTQGMRASKFCLNIAGDTPSSNRLFDSIVSHCVPIIISDEIELPFEDV 358
Query: 310 IPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMK---RAMLFPQPAQPGDA 366
+ + + + V D K ++ I + I R++ +K R + P+Q DA
Sbjct: 359 LDYSKFCIIVRGVDAVKKGFLINLI--KGISRQEWTSMWNKLKEVERHFEYQYPSQHDDA 416
Query: 367 FHQILNGLARKLP 379
I +ARK+P
Sbjct: 417 VQMIWKTIARKVP 429
>gi|449528899|ref|XP_004171439.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
At3g07620-like, partial [Cucumis sativus]
Length = 173
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 44/143 (30%), Positives = 69/143 (48%), Gaps = 15/143 (10%)
Query: 240 AAVWENFKNNPLFDISTDHPTT------YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGC 293
A VWEN + DIS + + Y + + FC+CP G S R+ +++ +GC
Sbjct: 29 ARVWEN---DTELDISNNRISRATGHLLYQKRFYKTKFCICPGGSQVNSARIADSIHYGC 85
Query: 294 IPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKR 353
+PVI++D LPF D + W + + V E DV +L IL I ++ + L M+
Sbjct: 86 VPVILSDYYDLPFNDILDWRKFSVIVKERDVYQLKQILKDISDIEFIKLHKNL----MQV 141
Query: 354 AMLFPQPAQP--GDAFHQILNGL 374
F + P DAFH ++ L
Sbjct: 142 QKHFQWNSXPIKYDAFHMVMYDL 164
>gi|4678284|emb|CAB41192.1| putative protein [Arabidopsis thaliana]
Length = 736
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 76/320 (23%), Positives = 130/320 (40%), Gaps = 64/320 (20%)
Query: 29 GDVLEDDPVGKLK---VYVYELPSKYNKKLLLKDP---RCLTHMF---AAEIFMHRFLLS 79
G+++ V K K +Y+Y+LP +N L+ C+ ++ A + + +L+
Sbjct: 334 GNLINLSAVVKKKRPLIYIYDLPPDFNSLLIEGRHFKFECVNRIYDERNATMAFYENILA 393
Query: 80 SPVRTLNPEEADWFYTPIYPTC-----DLTP-------TGLPLPFKSPRMMRSAIQLISS 127
+ RT+N EEAD+F+ P+ +C D P TGL + + A + I
Sbjct: 394 TAHRTMNGEEADFFFVPVLDSCIINRADDAPHINMQNHTGLRSSL-TLEFYKRAYEHIVE 452
Query: 128 NWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR------- 180
+PYWNR+ G DH ++ + + G + FG
Sbjct: 453 KYPYWNRSAGRDHIWI----------WNSMMLVHWGNTNSKHNHSTTAYFGDNWDDISDE 502
Query: 181 ---NHVCLN-EGSITIPPYAPP---QKMQAHQIPPDTPRSIFVYFRGLF---YDVNNDPE 230
+H C + + IP + P + + P R YF G Y+ PE
Sbjct: 503 RRGDHPCFDPRKDLVIPAWKVPDPYSMRKNYWERPREKRKTLFYFNGNLGPAYE-KGRPE 561
Query: 231 GGYYARGARAAVWENFKNNPLFD--ISTDH----------PTTYYEDMQRAIFCLCPLGW 278
Y+ G R + E F ++P + + H Y++D+ +IFC G
Sbjct: 562 DS-YSMGIRQKLAEEFGSSPNKEGKLGKQHAEDVIVTPLRSDNYHKDIANSIFCGAFPG- 619
Query: 279 APWSPRLVEAVVFGCIPVII 298
WS R+ ++++ GC+PVII
Sbjct: 620 DGWSGRMEDSILQGCVPVII 639
>gi|168016669|ref|XP_001760871.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687880|gb|EDQ74260.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 378
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 69/267 (25%), Positives = 108/267 (40%), Gaps = 46/267 (17%)
Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGS 189
P W + G DH FV+ H +++ L +V FG+ + N
Sbjct: 123 PAWKASGGRDHVFVIHH----------PNSMQATRNRLRNSLFIVSDFGRYDSEVANIQK 172
Query: 190 ITIPPYAPPQKMQAHQIPP----DTP---RSIFVYFRGLFYDVNNDPEGGYYARGARAAV 242
+ PY H IP D+ R I ++F+G EGG R +
Sbjct: 173 DVVAPYK-------HVIPTFDFDDSSFHTRKILLFFQGAIVR----KEGG----KIRHEL 217
Query: 243 WENFKNNPLFDIST-----DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVI 297
+ K+ P +T D + M+ + FCL G P S RL +++V C+PVI
Sbjct: 218 YRLLKDKPGVRFTTGNTALDGFQSATIGMRSSKFCLNMAGDTPSSNRLFDSIVSHCVPVI 277
Query: 298 IADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPE--VILRKQRLLANPSMK 352
I+DDI LPF D + + +F+ K + +L ++ E L Q LL +
Sbjct: 278 ISDDIELPFEDTLDYSNFCIFINSSLALKPGYVINMLRNVSEEEWTQLWNQLLLVEHHFE 337
Query: 353 RAMLFPQPAQPGDAFHQILNGLARKLP 379
+ P + DA + + +ARKLP
Sbjct: 338 ----YQHPTRKNDAVNMVWKDIARKLP 360
>gi|357162592|ref|XP_003579459.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
[Brachypodium distachyon]
Length = 474
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 60/217 (27%), Positives = 92/217 (42%), Gaps = 24/217 (11%)
Query: 116 RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIE-------RGILPLL 168
R R + ++++ P W R+ G DH FV+ D A +H + E A G L
Sbjct: 178 RRQREVVDRVTAH-PAWRRSGGRDHVFVLT-DPMAMWHVRAEIAPAILLVVDFGGWYKLD 235
Query: 169 QRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNND 228
++ + H ++ I PY + Q+ + R +YF+G +
Sbjct: 236 SKSAGSNSSHMIQHTQVSLLKDVIIPYT--HLLPTLQLSENMDRPTLLYFKG----AKHR 289
Query: 229 PEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYED-----MQRAIFCLCPLGWAPWSP 283
GG R +W+ N P + P + M+ + FCL P G P S
Sbjct: 290 HRGGL----VREKLWDVMINEPGVVMEEGFPNATGREQSIKGMRTSEFCLHPAGDTPSSC 345
Query: 284 RLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
RL +AV CIPVI++DDI LPF I + E +FV+
Sbjct: 346 RLFDAVASLCIPVIVSDDIELPFEGMIDYTEFSIFVS 382
>gi|384251773|gb|EIE25250.1| hypothetical protein COCSUDRAFT_40557 [Coccomyxa subellipsoidea
C-169]
Length = 280
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 45/149 (30%), Positives = 71/149 (47%), Gaps = 7/149 (4%)
Query: 234 YARGARAAVWENFKNNPLFDI----STDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAV 289
Y+ G R + F +P F + T Y D R+ FCL G A W RL A+
Sbjct: 115 YSHGVRQTLMMLFSQHPGFKLIDTGERGGYTQYMADFGRSTFCLAATG-AGWGVRLKLAL 173
Query: 290 VFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANP 349
+ GCIPVIIAD++ +PF D +P+++ + V E + +L +L +I L K+ +
Sbjct: 174 MHGCIPVIIADNVQMPFEDVLPYQDFAVHVREHALYRLPEVLDAILSTEGLVKRMQINVS 233
Query: 350 SMKRAMLFPQPAQPGDAFHQILNGLARKL 378
+ R + P A ++ L RK+
Sbjct: 234 CIWRYFTWRDP--QARAIDALICSLRRKM 260
>gi|168010648|ref|XP_001758016.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162690893|gb|EDQ77258.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 481
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 85/391 (21%), Positives = 147/391 (37%), Gaps = 73/391 (18%)
Query: 38 GKLKVYVYELPSKYNKKLLLKD------------------PRCLTHMFAAEIFMHRFLLS 79
KLK+++Y+LP +++ +L++ P L + E ++ LL+
Sbjct: 97 AKLKLFMYDLPPEFHYSMLVEQAYTGGQIWPKNISDIPPYPGGLYQQHSPEYWLTNDLLT 156
Query: 80 SPV----------RTLNPEEADWFYTPIYP-------TCDLTPTGLPLPF---KSPRMMR 119
S + R + AD + P + T G L K+ ++
Sbjct: 157 SNMAGRQSACTAFRVNDWRAADLMFVPFFASLAYNRYTKSEHKVGGELDLVGDKNQKLQE 216
Query: 120 SAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRAT-LVQTFG 178
++ + P W + G+DH V+ H FH + +A +V FG
Sbjct: 217 KLLKFLEQQ-PAWQASGGSDHIVVIHHP--NSFH---------AMRNFFSKAIFIVADFG 264
Query: 179 QRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGA 238
+ N + PY P R I ++F+G +GG
Sbjct: 265 RYPSEVANLRKDVVAPYKHVIPSFVDDSTPFEEREILLFFQGTIVR----KQGGV----I 316
Query: 239 RAAVWENFKNNPLFDISTD-------HPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVF 291
R ++E KN H T M+R+ CL G P S RL +A+
Sbjct: 317 RQQLYEMLKNEKGVHFEEGSAGSAGIHSAT--TGMRRSKCCLNIAGDTPSSNRLFDAIAS 374
Query: 292 GCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSM 351
C+PVII+D+I LPF D + + +F+ D + ++ I + RK+ + +
Sbjct: 375 HCVPVIISDEIELPFEDELDYSGFSIFINSTDAVQEKFVINLI--RSVSRKEWMRLWKRL 432
Query: 352 KRAML---FPQPAQPGDAFHQILNGLARKLP 379
K L + P +P DA + + +A K+P
Sbjct: 433 KEVSLHFEYQHPTKPYDAVNMVWRAVAHKVP 463
>gi|32488405|emb|CAE02830.1| OSJNBa0043A12.35 [Oryza sativa Japonica Group]
gi|90265244|emb|CAH67697.1| H0624F09.5 [Oryza sativa Indica Group]
Length = 464
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 78/287 (27%), Positives = 126/287 (43%), Gaps = 38/287 (13%)
Query: 116 RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKA------IERG---ILP 166
R R + ++++ P W R+ G DH FV+ D A +H ++E A ++ G L
Sbjct: 168 RRQREVVDRVTAH-PAWRRSGGRDHVFVLT-DPVAMWHVRKEIAPAILLVVDFGGWYKLD 225
Query: 167 LLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVN 226
++ V Q V L + I PY + + + R +YF+G
Sbjct: 226 SNSASSNVSHMIQHTQVSLLKDVIV--PYT--HLLPTMHLSENKDRPTLLYFKG----AK 277
Query: 227 NDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYED-----MQRAIFCLCPLGWAPW 281
+ GG R +W+ N P + +P + M+ + FCL P G P
Sbjct: 278 HRHRGGL----VREKLWDLMVNEPDVVMEEGYPNATGREQSIKGMRTSEFCLHPAGDTPT 333
Query: 282 SPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDV--PK-LDTILTSIPPEV 338
S RL +AV CIPVI++D+I LPF I + E +FV+ + PK L L ++P +
Sbjct: 334 SCRLFDAVASLCIPVIVSDEIELPFEGMIDYTEFAIFVSVNNSMRPKWLTNYLRNVPRQQ 393
Query: 339 ILRKQRLLAN--PSMKRAMLFP----QPAQPGDAFHQILNGLARKLP 379
+R +A+ P + ++P AQ G A + I + +KLP
Sbjct: 394 KDEFRRNMAHVQPIFEYDSIYPGRMASAAQDG-AVNHIWKKIHQKLP 439
>gi|320164456|gb|EFW41355.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 380
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 84/336 (25%), Positives = 137/336 (40%), Gaps = 38/336 (11%)
Query: 36 PVGKLKVYVYELPSKYNKKLLLKDP---RC---LTHMFAAEIFMHRFLLSSPVRTLNPEE 89
P+G K+YVY K++L P C L M+ EI + ++ S T NPEE
Sbjct: 4 PIGIFKIYVY-------KRILNAMPTRDECRDSLQIMWYEEIELPYWIRDSIHHTDNPEE 56
Query: 90 ADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFG 149
A +FY P C F R +++ + + PY++R G DH + P
Sbjct: 57 AHFFYIPTMVKCLFNLN--RARFNETRQFLISVRHLHRS-PYFHRNNGHDHALLNP---- 109
Query: 150 ACFHYQEEKAIERGILPLLQRA------TLVQTFGQRNHVCLNEGSITIPPY---APPQK 200
Y ++ G + A T + T R + I +P Y A
Sbjct: 110 GGGSYNVTSSVLHGSYLFGRGAGHYSNVTKLLTEAYRPYAYFAGRDIIVPGYPDDAFFSY 169
Query: 201 MQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPT 260
+ +Q R +F+Y G+ + G A + ++ PL + T +
Sbjct: 170 QETYQDALRERRRLFLYTGGVQLSYQRR-QLGRLAELLKIPSAKSSFYAPLVLLQTRKVS 228
Query: 261 TYYEDMQRAI----FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG 316
+ + Q+ + FC P G +PW+ R +A + GCIPV+ + VLPF + I W+ I
Sbjct: 229 SNKFEYQQLVKDFTFCAAPRGTSPWTQRFYDAAIVGCIPVLFDRNFVLPFPNQIDWDSIV 288
Query: 317 MFVAEE---DVPKLDTILT-SIPPEVILRKQRLLAN 348
+ +E+ LD + S E I ++R LA+
Sbjct: 289 VRFSEDFSHSFSFLDHLYQLSQDVEAIRERRRKLAD 324
>gi|412988049|emb|CCO19445.1| exostosin-like glycosyltransferase [Bathycoccus prasinos]
Length = 663
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 71/134 (52%), Gaps = 11/134 (8%)
Query: 39 KLKVYVYE-LPSKYNKKLLLK-DPRCLTHMFA-AEIFMHRFLLSSPVRTLNPEEADWFYT 95
+ KV+VYE +P N L K +C + +A AE + + S V T +PE AD++
Sbjct: 184 RFKVFVYEDVPEALNADLRTKRSDKCKDNGYANAEWKIPELIAKSEVYTPDPELADFYVV 243
Query: 96 PIYPTC---DLTPTG----LPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDF 148
P++P C D G + K +M ++AI I+ N+PYW R+EG DH F+ P +
Sbjct: 244 PLFPECYVRDKLEKGGADYVTAVRKVNKMYQAAIDRIAGNYPYWRRSEGRDHVFIFPAEK 303
Query: 149 GACFHYQEEKAIER 162
GA + EK +ER
Sbjct: 304 GAE-NVLNEKTLER 316
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 31/125 (24%), Positives = 56/125 (44%), Gaps = 5/125 (4%)
Query: 265 DMQRAIFCLCP--LGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
+M+ + CL G WS L +A++ GC+P+I+ DD F+D + WE +
Sbjct: 527 EMRNSETCLITPHGGLEGWSTALSDAILSGCVPLIVHDDFEPYFSDVLDWENFSYKIPTR 586
Query: 323 DVPK--LDTIL-TSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
+ + D +L I R +R RA ++ Q GDA + L R++
Sbjct: 587 EALRNAKDALLHEKSKASSITRSKRENDLKIAARASVWNDKWQSGDALDALFESLRRRVR 646
Query: 380 HDKST 384
+ +++
Sbjct: 647 YHRNS 651
>gi|218195798|gb|EEC78225.1| hypothetical protein OsI_17865 [Oryza sativa Indica Group]
gi|222629749|gb|EEE61881.1| hypothetical protein OsJ_16573 [Oryza sativa Japonica Group]
Length = 452
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 78/287 (27%), Positives = 126/287 (43%), Gaps = 38/287 (13%)
Query: 116 RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKA------IERG---ILP 166
R R + ++++ P W R+ G DH FV+ D A +H ++E A ++ G L
Sbjct: 156 RRQREVVDRVTAH-PAWRRSGGRDHVFVLT-DPVAMWHVRKEIAPAILLVVDFGGWYKLD 213
Query: 167 LLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVN 226
++ V Q V L + I PY + + + R +YF+G
Sbjct: 214 SNSASSNVSHMIQHTQVSLLKDVIV--PYT--HLLPTMHLSENKDRPTLLYFKG----AK 265
Query: 227 NDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYED-----MQRAIFCLCPLGWAPW 281
+ GG R +W+ N P + +P + M+ + FCL P G P
Sbjct: 266 HRHRGGL----VREKLWDLMVNEPDVVMEEGYPNATGREQSIKGMRTSEFCLHPAGDTPT 321
Query: 282 SPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDV--PK-LDTILTSIPPEV 338
S RL +AV CIPVI++D+I LPF I + E +FV+ + PK L L ++P +
Sbjct: 322 SCRLFDAVASLCIPVIVSDEIELPFEGMIDYTEFAIFVSVNNSMRPKWLTNYLRNVPRQQ 381
Query: 339 ILRKQRLLAN--PSMKRAMLFP----QPAQPGDAFHQILNGLARKLP 379
+R +A+ P + ++P AQ G A + I + +KLP
Sbjct: 382 KDEFRRNMAHVQPIFEYDSIYPGRMASAAQDG-AVNHIWKKIHQKLP 427
>gi|115469550|ref|NP_001058374.1| Os06g0680900 [Oryza sativa Japonica Group]
gi|113596414|dbj|BAF20288.1| Os06g0680900 [Oryza sativa Japonica Group]
gi|125556494|gb|EAZ02100.1| hypothetical protein OsI_24187 [Oryza sativa Indica Group]
Length = 477
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 72/262 (27%), Positives = 106/262 (40%), Gaps = 26/262 (9%)
Query: 83 RTLNPEEADWFYTPIY--PTCDLTPTGLPLPFKSP--RMMRSAIQLISSNWPYWNRTEGA 138
R +P EA F+ PI + F P R++ + +++ +P+WNR+ GA
Sbjct: 182 RARHPGEAHAFFLPISVASIAGYVYRRDMIDFWDPQLRLVAGYVDGLAAMYPFWNRSRGA 241
Query: 139 DHFFVVPHDFGACFHYQEEKAIERG-ILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAP 197
DHF V H + KA RG + ++ A + F V L P
Sbjct: 242 DHFLVSCHQWAPIL--SAAKAELRGNAIRVMCDADMSDGFDPATDVAL-----------P 288
Query: 198 PQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTD 257
P A P P+ R + GG R A A WE + +
Sbjct: 289 PVVASARATP---PQGRVASERTVLAFFAAGGGGGGAVREALLARWEGRDDRVVVYGRLP 345
Query: 258 HPTTYYEDMQRAIFCLCPL----GWAPWSPRLVEAVVFGCIPVIIADDIVL-PFADAIPW 312
+ E M+RA FCLCP G A S R+VEA+ GC+PV++ D PF+D + W
Sbjct: 346 AGVDHGELMRRARFCLCPCGGGEGAAAASRRVVEAITAGCVPVLVDDGGYSPPFSDVLDW 405
Query: 313 EEIGMFVAEEDVPKLDTILTSI 334
+ V E V ++ IL +
Sbjct: 406 ARFSVAVPAERVGEIKDILGGV 427
>gi|58269252|ref|XP_571782.1| hypothetical protein [Cryptococcus neoformans var. neoformans JEC21]
gi|57228018|gb|AAW44475.1| conserved hypothetical protein [Cryptococcus neoformans var.
neoformans JEC21]
Length = 1132
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 37/109 (33%), Positives = 60/109 (55%), Gaps = 4/109 (3%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y D+ A FC P G WSP+ +A+ GCIPV I++ PFAD + W ++ + VA
Sbjct: 1002 YMNDLNNARFCPQPRGITGWSPQTNDAIYAGCIPVFISEGTHYPFADFLDWSKLSVRVAP 1061
Query: 322 EDVPKLDTILTSIPPEVILRKQRLLAN-PSMKRAMLFPQPAQPGDAFHQ 369
++ K++ IL +IP + + + L AN SM+ A L+ +P + +
Sbjct: 1062 TELDKIEKILAAIP---LSKVEELQANLVSMREAFLYSGDEKPEEELER 1107
>gi|134114275|ref|XP_774385.1| hypothetical protein CNBG3660 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50257020|gb|EAL19738.1| hypothetical protein CNBG3660 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 1132
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 37/109 (33%), Positives = 60/109 (55%), Gaps = 4/109 (3%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y D+ A FC P G WSP+ +A+ GCIPV I++ PFAD + W ++ + VA
Sbjct: 1002 YMNDLNNARFCPQPRGITGWSPQTNDAIYAGCIPVFISEGTHYPFADFLDWSKLSVRVAP 1061
Query: 322 EDVPKLDTILTSIPPEVILRKQRLLAN-PSMKRAMLFPQPAQPGDAFHQ 369
++ K++ IL +IP + + + L AN SM+ A L+ +P + +
Sbjct: 1062 TELDKIEKILAAIP---LSKVEELQANLVSMREAFLYSGDEKPEEELER 1107
>gi|168017794|ref|XP_001761432.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687438|gb|EDQ73821.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 377
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 65/282 (23%), Positives = 110/282 (39%), Gaps = 43/282 (15%)
Query: 113 KSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRAT 172
K+ ++ ++ + P W ++G DH V+ H H + R +L +L
Sbjct: 112 KNQKLQEKLLEYLKQQ-PAWQASDGCDHILVMHH--PNSMHAMRDSF--RNVLFVLA--- 163
Query: 173 LVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGL----------- 221
FG+ N + PY + R ++F+G
Sbjct: 164 ---DFGRYPPDVANVEKDVVAPYKHIIPSFDNDSSSFEDRETLLFFQGTIVRKQGGVIRQ 220
Query: 222 -FYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAP 280
Y++ D EG ++ G+ + H T M+ + FCL G P
Sbjct: 221 QLYEMLKDEEGVHFEEGSSGS-------------EGVHSAT--SGMRGSKFCLNIAGDTP 265
Query: 281 WSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVIL 340
S RL +++ C+PVII+DDI LPF D + + E +F+ ED K ++ + I
Sbjct: 266 SSNRLFDSIASHCVPVIISDDIELPFEDELDYSEFCVFIKSEDALKEKYVINLL--RSIT 323
Query: 341 RKQRLLANPSMK---RAMLFPQPAQPGDAFHQILNGLARKLP 379
R Q +K R + P +P DA + + +AR+ P
Sbjct: 324 RVQWTFLWKRLKAVARHFEYQHPTKPYDAVNMVWRAIARRAP 365
>gi|307106602|gb|EFN54847.1| hypothetical protein CHLNCDRAFT_134888 [Chlorella variabilis]
Length = 833
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 47/142 (33%), Positives = 66/142 (46%), Gaps = 31/142 (21%)
Query: 42 VYVYELPSKYNKKLL------------LKDPRCLT------HMFAAEIFMHRFLLSSPVR 83
+YVYELP YN +L D R T H++ E +H LL S R
Sbjct: 371 IYVYELPPFYNSVMLQYRVSREGCVHRFFDDRNATVFNDMMHLYNPEPGLHEALLQSEHR 430
Query: 84 TLNPEEADWFYTPIYPTCDLTPT----------GLPLPFKS---PRMMRSAIQLISSNWP 130
TL+P+EAD+FY P + +C L P G P+ +++ M I S++P
Sbjct: 431 TLDPDEADFFYIPAFVSCFLFPVLSATDFPYFHGGPVAWRTHAAANMFIEVYHWIRSHYP 490
Query: 131 YWNRTEGADHFFVVPHDFGACF 152
YW+R G DH HD G+C+
Sbjct: 491 YWDRNGGRDHIVGSFHDEGSCW 512
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/150 (24%), Positives = 68/150 (45%), Gaps = 15/150 (10%)
Query: 208 PDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAV---------WENFKNNPLFDISTDH 258
P R I +F+G N PE Y+RG R + W K + ++
Sbjct: 611 PTRERRILAFFKGRTQQSN--PE---YSRGIRQTLENLTREHDWWGKHKVHVGEEMPEGE 665
Query: 259 PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
+Y + +++FC +G +S R +A++ GCIPV+I D + +++ + +
Sbjct: 666 SDSYSAMLAQSVFCFALMG-DGFSSRTDDAIIHGCIPVLIQDGVEPTWSNLLDTGSYSVR 724
Query: 319 VAEEDVPKLDTILTSIPPEVILRKQRLLAN 348
+ ++D+ ++ IL +I E + R Q L
Sbjct: 725 ILQKDMERVPEILQAISKEDVARMQANLGK 754
>gi|224098485|ref|XP_002311191.1| predicted protein [Populus trichocarpa]
gi|222851011|gb|EEE88558.1| predicted protein [Populus trichocarpa]
Length = 345
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 67/247 (27%), Positives = 109/247 (44%), Gaps = 40/247 (16%)
Query: 77 LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTE 136
L +SP T NPEEA F+ P S R + I+ + +PYWNRT
Sbjct: 69 LQASPFVTQNPEEAHLFFVPFASNL------------STRSIARFIRDLRMEFPYWNRTL 116
Query: 137 GADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYA 196
GADHF+V G Y+ +R ++ L + + + F + I++PP A
Sbjct: 117 GADHFYVSCAGLG----YES----DRNLVELKKNSVQISCFPVPEGKFVPHKDISLPPLA 168
Query: 197 PPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDIST 256
+ +A P + R + Y V + GG + ++ A ++ L +
Sbjct: 169 --RITRASHAPGN---------RTVRYLVRH---GG--VKDSKLANELRNDSDFLMESEP 212
Query: 257 DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV--LPFADAIPWEE 314
+ T E + ++FCL G A S + EA+ FGC+PV++ D + LP D + W++
Sbjct: 213 SNEMTLVERLGSSMFCLFEDG-ADISG-IGEALRFGCVPVMVTDRPMQDLPLMDVLSWQK 270
Query: 315 IGMFVAE 321
I +FV
Sbjct: 271 IAVFVGS 277
>gi|224135879|ref|XP_002327326.1| predicted protein [Populus trichocarpa]
gi|222835696|gb|EEE74131.1| predicted protein [Populus trichocarpa]
Length = 449
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 76/295 (25%), Positives = 114/295 (38%), Gaps = 53/295 (17%)
Query: 128 NWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG--------- 178
N W R+ G DH FV+ D A +H + E I P + LV FG
Sbjct: 164 NSEAWKRSGGKDHVFVLT-DPVAMWHVRAE------IAPAI---LLVVDFGGWYRLDSKS 213
Query: 179 -------QRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEG 231
H ++ I PY + Q + R+ +YF+G + G
Sbjct: 214 SNGSSSDMIRHTQVSLLKDVIVPYT--HLLPRFQFSENKKRNTLLYFKG----AKHRHRG 267
Query: 232 GYYARGARAAVWENFKNNPLFDISTDHPTTYYED-----MQRAIFCLCPLGWAPWSPRLV 286
G R +W+ N P + P + M+ + FCL P G P S RL
Sbjct: 268 GI----VRENLWDLLVNEPGVIMEEGFPNATGRELSIRGMRTSEFCLHPAGDTPTSCRLF 323
Query: 287 EAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLL 346
+A+ CIPVI++D+I LPF + + E +FVA +D K ++ + I KQ+
Sbjct: 324 DAIQSLCIPVIVSDNIELPFEGILDYTEFSVFVAGDDALKPTWLMDHL--RSISEKQKEE 381
Query: 347 ANPSMKRAMLFPQ----------PAQPGDAFHQILNGLARKLPHDKSTYLKPGGK 391
+M + L Q P P A + I + KLP K ++ K
Sbjct: 382 LRRNMAKIQLIYQYENGHPGGIGPISPNGAVNHIWKKIHEKLPVIKEAIVREKRK 436
>gi|168014691|ref|XP_001759885.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689015|gb|EDQ75389.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 383
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 69/285 (24%), Positives = 115/285 (40%), Gaps = 49/285 (17%)
Query: 113 KSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRAT 172
K+ + +Q + + P W + G +H V+ H +A+ + L
Sbjct: 118 KNEELQAKLVQYLEKH-PAWKASNGKNHVMVIHHP-------NSMQAVRDRLRNALY--- 166
Query: 173 LVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDT-------PRSIFVYFRGLFYDV 225
+V FG+ + N + PY H +P T RS VYF+G
Sbjct: 167 VVSDFGRYENETANIRKDVVAPYK-------HVLPTFTDDSSSFHTRSTVVYFQGSIVR- 218
Query: 226 NNDPEGGYYARGARAAVWENFKNNPLFDISTD-------HPTTYYEDMQRAIFCLCPLGW 278
EGG R +++ K+ P +T H T M+ + FCL G
Sbjct: 219 ---KEGG----KIRHELYDLLKDEPDVHFTTGITASEGFHSAT--RGMRSSRFCLNLAGD 269
Query: 279 APWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK----LDTILTSI 334
P S RL +++ C+PVII+DD+ LPF D + + +F+ + ++ +
Sbjct: 270 TPSSNRLFDSIASHCVPVIISDDLELPFEDDLNYSSFCIFINSTRALQPGYVINLLRNVS 329
Query: 335 PPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
E L +RLL ++R + P+ DA + + +ARKLP
Sbjct: 330 SEEWTLMWERLLV---VERHFEYQFPSVANDAVNMVWKAIARKLP 371
>gi|449462180|ref|XP_004148819.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase F8H-like
[Cucumis sativus]
gi|449524512|ref|XP_004169266.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase F8H-like
[Cucumis sativus]
Length = 458
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 41/138 (29%), Positives = 70/138 (50%), Gaps = 8/138 (5%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGM-FVA 320
Y+E ++ A FCL P G + W+ R E+ C+PV+++D + LPF + I + +I + + +
Sbjct: 308 YFEHLRNAKFCLAPRGESSWTLRFYESFFVECVPVVLSDQVELPFQNVIDYSQISIKWPS 367
Query: 321 EEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAM-LFPQPAQPGDAFHQILNGLARKLP 379
E P+L L SIP E I +++A R + ++ ++P IL L RK+
Sbjct: 368 SEIGPQLLEYLESIPDETI---DKMIARGRRVRCLWVYASDSEPCSTMQGILWELQRKVR 424
Query: 380 ---HDKSTYLKPGGKFLN 394
T+ G F+N
Sbjct: 425 QFHQSTETFWLHNGSFVN 442
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 42/133 (31%), Positives = 60/133 (45%), Gaps = 12/133 (9%)
Query: 40 LKVYVYELPSKYNKKLLL--KDPR-----CLTHMFAAEIFMHRFLLSSPVRTLNPEEADW 92
LK+YVY+ K L+ +D + CL + ++ +HR LL S RT N EEAD+
Sbjct: 85 LKIYVYDETEIQGLKALMYGRDGKITAAACLKGQWGTQVKIHRLLLQSRFRTRNKEEADF 144
Query: 93 FYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
F+ P Y C GL + + + A + PY+ + G DH FV P GA
Sbjct: 145 FFVPAYVKCVRMLGGL-----NDKEINEAYIQVLGQMPYFRLSGGRDHIFVFPSGAGAHL 199
Query: 153 HYQEEKAIERGIL 165
I R I+
Sbjct: 200 FKSWATYINRSII 212
>gi|449444196|ref|XP_004139861.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
[Cucumis sativus]
Length = 478
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 70/278 (25%), Positives = 111/278 (39%), Gaps = 35/278 (12%)
Query: 83 RTLNPEEADWFYTPIYPT--CDLTPTGLPLPFKSP------RMMRSAIQLISSNWPYWNR 134
R EEAD + P + T ++ F+ R+ + + S W +
Sbjct: 141 RVFKAEEADVIFVPFFATMSAEMQLGMAKGAFRKKVGNEDYERQRNVMDFLKST-DAWKK 199
Query: 135 TEGADHFFVVPHDFGACFHYQEEKAIE-------RGILPLLQRATLVQTFGQRNHVCLNE 187
+ G DH FV+ D A +H + E A G L +++ + H ++
Sbjct: 200 SGGRDHVFVLT-DPVAMWHVKTEIAPAVLLVVDFGGWFRLDTKSSNGSSPDMIQHTQVSV 258
Query: 188 GSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFK 247
I PY + + + R +YF+G GG R +W+
Sbjct: 259 LKDVIVPYT--HLLPRLHLSANKKRQTLLYFKG----AKRRHRGGL----VREKLWDLLV 308
Query: 248 NNPLFDISTDHPTTYYED-----MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 302
N P + P ++ M+ + FCL P G P S RL +A+ CIPV+++D+I
Sbjct: 309 NEPDVIMEEGFPNATGKEQSIKGMRSSEFCLHPAGDTPTSCRLFDAIQSLCIPVVVSDNI 368
Query: 303 VLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPE 337
LPF D + + E +FVA D K L L +IP E
Sbjct: 369 ELPFEDMVDYSEFSVFVAVNDALKPNWLVKHLRTIPEE 406
>gi|326429052|gb|EGD74622.1| hypothetical protein PTSG_05987 [Salpingoeca sp. ATCC 50818]
Length = 452
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 40/117 (34%), Positives = 60/117 (51%), Gaps = 2/117 (1%)
Query: 265 DMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD-AIPWEEIGMFVAEED 323
+M + FCLCP G + R+ EAV+ GCIPVI++D PF A + + V E+D
Sbjct: 337 EMATSEFCLCPRGLTAGTRRIFEAVLVGCIPVIVSDGYTWPFPHLAAELDAASVRVPEKD 396
Query: 324 VPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPH 380
++ IL + + K+ LA+ + P P QPGDAF+ I+ +A H
Sbjct: 397 AARVLDILGHVSRRERVAKRVRLAHLAHNVTYHLPAP-QPGDAFYNIIRAIADTQQH 452
>gi|323456573|gb|EGB12440.1| hypothetical protein AURANDRAFT_70660 [Aureococcus anophagefferens]
Length = 1731
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 44/74 (59%)
Query: 260 TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV 319
Y E M+RA +C+ G++PWSPRL EAV GC+P ++ + P+A + W + +
Sbjct: 1512 ANYTESMRRAKYCVVTEGFSPWSPRLSEAVALGCVPCFLSPSLAPPYATVLDWSAFSVEI 1571
Query: 320 AEEDVPKLDTILTS 333
AE DV +L +L +
Sbjct: 1572 AEADVGRLPEVLAA 1585
>gi|297611147|ref|NP_001065633.2| Os11g0128000 [Oryza sativa Japonica Group]
gi|255679743|dbj|BAF27478.2| Os11g0128000, partial [Oryza sativa Japonica Group]
Length = 199
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 61/119 (51%), Gaps = 5/119 (4%)
Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
+ M + FCL G P S RL +A+V C+PVII+DDI LP+ DA+ + + +FV D
Sbjct: 63 QGMHASKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDDIELPYEDALDYSKFSIFVRSSD 122
Query: 324 VPKLDTILTSIPPEVILRKQRLLANPSMK---RAMLFPQPAQPGDAFHQILNGLARKLP 379
K ++ I + + Q + +K + + P+Q DA I LARK+P
Sbjct: 123 AVKKGYLMRLIRG--VSKHQWTMMWRRLKEVDKHFEYQYPSQKDDAVQMIWQTLARKVP 179
>gi|194701034|gb|ACF84601.1| unknown [Zea mays]
gi|413955857|gb|AFW88506.1| hypothetical protein ZEAMMB73_716681 [Zea mays]
Length = 462
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 86/385 (22%), Positives = 148/385 (38%), Gaps = 71/385 (18%)
Query: 40 LKVYVYELPSKYNKKLL--------------LKD-----PRCLTHMFAAEIFMHRFLLSS 80
L+V++Y+LP +++ LL ++D P L + E ++ LL+S
Sbjct: 68 LRVFMYDLPPEFHFGLLDWKPPGFGGGVWPDIRDGVPDYPGGLNLQHSIEYWLTLDLLAS 127
Query: 81 P---------VRTLNPEEADWFYTPIYPTCDLTPTGLPLP----FKSPRMMRSAIQLISS 127
R + +AD + P + + +P + + R ++ +++
Sbjct: 128 EQGAPTPCAVARVRHAADADVVFVPFFASLSFNRHSRVVPPARDSEDRALQRRLLEFLAA 187
Query: 128 NWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQR----ATLVQTFGQRNHV 183
P W RT G DH + H G+L R ++ FG+
Sbjct: 188 R-PEWRRTGGRDHVVLAHH--------------PNGMLDARYRFWPCVFVLCDFGRYPPS 232
Query: 184 CLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGG------YYARG 237
N I PY A+ R +YF+G Y +GG YY
Sbjct: 233 VANLDKDVIAPYRHLVANFANDTAGYDDRPTLLYFQGAIYR----KDGGFIRQELYYLLK 288
Query: 238 ARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVI 297
V +F + ++ + + M+ + FCL G P S RL +++V C+PV
Sbjct: 289 DEKDVHFSFGS-----VAGNGIEQATQGMRSSKFCLNIAGDTPSSNRLFDSIVSHCVPVT 343
Query: 298 IADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRA 354
I+D+I LPF D + + + + V D K L ++ I E R L +++
Sbjct: 344 ISDEIELPFEDVLDYSKFSVIVRGADAVKKGFLMNLIKGISREEWTRMWNRLKE--VEKH 401
Query: 355 MLFPQPAQPGDAFHQILNGLARKLP 379
+ P+Q DA I +ARK+P
Sbjct: 402 FEYQYPSQTDDAVQMIWKAIARKVP 426
>gi|343171902|gb|AEL98655.1| exostosin-like protein, partial [Silene latifolia]
Length = 250
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 59/213 (27%), Positives = 90/213 (42%), Gaps = 14/213 (6%)
Query: 173 LVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPD--TPRSIFVYFRGLFYDVNNDPE 230
+V FG+ + N + PY PD R+ ++FRG V D E
Sbjct: 26 IVADFGRYSKSMSNLRKDVVAPYVHVVDSYTDDDAPDPFKARTTLLFFRG--RTVRKD-E 82
Query: 231 GGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVV 290
G AR A+ V + S + + M+ + FCL P G P S RL +A+V
Sbjct: 83 GVVRARLAKILVDYEDVHYERSVASEESIKLSTQGMRSSKFCLDPAGDTPSSCRLFDAIV 142
Query: 291 FGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLA 347
C+PVI++D I LPF D + + +F + E+ K + L +P E L + L
Sbjct: 143 SHCVPVIVSDQIELPFEDELDYSNFSIFFSTEEALKPGYMVEELRKVPEEKWLNMYKKLK 202
Query: 348 NPSMKRAMLFPQPAQPGDAFHQILNGLARKLPH 380
N S +P + G +N L R++ H
Sbjct: 203 NISHHYEFQYPPKREDG------VNMLWRQIRH 229
>gi|343171904|gb|AEL98656.1| exostosin-like protein, partial [Silene latifolia]
Length = 250
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 59/213 (27%), Positives = 90/213 (42%), Gaps = 14/213 (6%)
Query: 173 LVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPD--TPRSIFVYFRGLFYDVNNDPE 230
+V FG+ + N + PY PD R+ ++FRG V D E
Sbjct: 26 IVADFGRYSKSMSNLRKDVVAPYVHVVDSYTDDDAPDPFKARTTLLFFRG--RTVRKD-E 82
Query: 231 GGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVV 290
G AR A+ V + S + + M+ + FCL P G P S RL +A+V
Sbjct: 83 GVVRARLAKILVDYEDIHYERSVASEESIKLSTQGMRSSKFCLDPAGDTPSSCRLFDAIV 142
Query: 291 FGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLA 347
C+PVI++D I LPF D + + +F + E+ K + L +P E L + L
Sbjct: 143 SHCVPVIVSDQIELPFEDELDYSNFSIFFSTEEALKPGYMVEELRKVPEEKWLNMYKKLK 202
Query: 348 NPSMKRAMLFPQPAQPGDAFHQILNGLARKLPH 380
N S +P + G +N L R++ H
Sbjct: 203 NISHHYEFQYPPKREDG------VNMLWRQIRH 229
>gi|222635945|gb|EEE66077.1| hypothetical protein OsJ_22095 [Oryza sativa Japonica Group]
Length = 265
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 40/140 (28%), Positives = 69/140 (49%), Gaps = 9/140 (6%)
Query: 240 AAVWEN-----FKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCI 294
A +WEN NN + + Y + R FC+CP G S R+ +++ +GC+
Sbjct: 121 ARIWENDTELAISNNRINRAIGN--LVYQKHFFRTKFCVCPGGSQVNSARISDSIHYGCM 178
Query: 295 PVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRA 354
PVI++D L F+ + W + + + E DV +L +IL S+ + + + L +++
Sbjct: 179 PVILSDYYDLRFSGILNWRKFAVVLKESDVYELKSILKSLSQKEFVSLHKSLVQ--VQKH 236
Query: 355 MLFPQPAQPGDAFHQILNGL 374
+ P P DAFH I+ L
Sbjct: 237 FEWHSPPVPYDAFHMIMYEL 256
>gi|196004286|ref|XP_002112010.1| hypothetical protein TRIADDRAFT_24279 [Trichoplax adhaerens]
gi|190585909|gb|EDV25977.1| hypothetical protein TRIADDRAFT_24279 [Trichoplax adhaerens]
Length = 668
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 83/330 (25%), Positives = 131/330 (39%), Gaps = 51/330 (15%)
Query: 35 DPVGKLKVYVY---ELPSKYNKKLLLKDP-RCLTHMFAAEIFMHRFLLSSPVRTLNPEEA 90
D +L VYVY LP Y + DP +T + M + ++ SP NP+ A
Sbjct: 35 DDHRRLTVYVYPRYLLPDLYG----ISDPVNTITREYRR---MLQAIIDSPFYVDNPKTA 87
Query: 91 DWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGA 150
C L P+ L + + + + L +N PYWN+ + F ++P G
Sbjct: 88 ----------CLLIPSYDVLNQRHVNLQGAGLTL--TNLPYWNQGKNHLLFNMLP---GQ 132
Query: 151 CFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG-SITIPPYAPPQKMQAHQIPPD 209
H+ RG RA L G + + G I+IP Y+ Q HQ D
Sbjct: 133 WPHFNTSLDARRG------RAMLAG--GGFSSLSYRYGFDISIPIYSLHQDDDYHQRSQD 184
Query: 210 TP---------RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFD-----IS 255
R + L YD D A R + + N S
Sbjct: 185 ENYHNNISHHHRPWLLLSSQLHYDRKFDNRLMNLANDNRILILQRCSVNSSHKAYYQRCS 244
Query: 256 TDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEI 315
+Y +Q + FCL G+ ++A+ FGCIPV+++D+ +LPF++ + W+
Sbjct: 245 HKQSISYPTILQDSTFCLMLRGYRLIQSNFLDALKFGCIPVVLSDEYILPFSEVLDWKRA 304
Query: 316 GMFVAEEDVPKLDTILTSIPPEVI--LRKQ 343
+ E+ + L +L+SI + LRKQ
Sbjct: 305 ALVFREDQLLSLPAVLSSISTKTRHNLRKQ 334
>gi|356576131|ref|XP_003556187.1| PREDICTED: probable glucuronosyltransferase Os01g0926400-like
[Glycine max]
Length = 481
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 71/297 (23%), Positives = 119/297 (40%), Gaps = 36/297 (12%)
Query: 112 FKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRA 171
+K R + A++ N WNR+ G DH FV+ D A +H ++E A ++
Sbjct: 183 YKRQRQVMDAVK----NTHAWNRSGGRDHVFVLT-DPVAMWHVKDEIAPAVLLVVDFGGW 237
Query: 172 TLVQTFGQRN--------HVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFY 223
+ + G N H ++ I PY + + + R +YF+G
Sbjct: 238 YRLDSRGGSNCSESDVIPHTQVSVIKDVIVPYT--HLLPRLDLSDNKERHQLLYFKG--- 292
Query: 224 DVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYED-----MQRAIFCLCPLGW 278
+ GG R +W+ + P + P + MQ + FCL P G
Sbjct: 293 -AKHRHRGGII----REKLWDLLVSEPGVIMEEGFPNATGREQSIKGMQTSEFCLHPAGD 347
Query: 279 APWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEV 338
P S RL +A+ CIPVI++D+I LPF + + E +F A D K +++ +
Sbjct: 348 TPTSCRLFDAIQSLCIPVIVSDNIELPFEGMVDYAEFSVFAAVSDALKPSWLVSHLQSFS 407
Query: 339 ILRKQRLLANPSMKRAMLFPQPAQPG--------DAFHQILNGLARKLPHDKSTYLK 387
+K R N + + + PG A + I + +KLP K ++
Sbjct: 408 KEQKDRFRQNMARVQPIFVYDNGHPGGIGPIPADGAVNHIWKKVHQKLPMIKEAIIR 464
>gi|212722274|ref|NP_001131151.1| uncharacterized protein LOC100192459 [Zea mays]
gi|194690716|gb|ACF79442.1| unknown [Zea mays]
gi|413955856|gb|AFW88505.1| hypothetical protein ZEAMMB73_716681 [Zea mays]
Length = 391
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 72/315 (22%), Positives = 121/315 (38%), Gaps = 43/315 (13%)
Query: 82 VRTLNPEEADWFYTPIYPTCDLTPTGLPLP----FKSPRMMRSAIQLISSNWPYWNRTEG 137
R + +AD + P + + +P + + R ++ +++ P W RT G
Sbjct: 67 ARVRHAADADVVFVPFFASLSFNRHSRVVPPARDSEDRALQRRLLEFLAAR-PEWRRTGG 125
Query: 138 ADHFFVVPHDFGACFHYQEEKAIERGILPLLQR----ATLVQTFGQRNHVCLNEGSITIP 193
DH + H G+L R ++ FG+ N I
Sbjct: 126 RDHVVLAHH--------------PNGMLDARYRFWPCVFVLCDFGRYPPSVANLDKDVIA 171
Query: 194 PYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGG------YYARGARAAVWENFK 247
PY A+ R +YF+G Y +GG YY V +F
Sbjct: 172 PYRHLVANFANDTAGYDDRPTLLYFQGAIYR----KDGGFIRQELYYLLKDEKDVHFSFG 227
Query: 248 NNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFA 307
+ ++ + + M+ + FCL G P S RL +++V C+PV I+D+I LPF
Sbjct: 228 S-----VAGNGIEQATQGMRSSKFCLNIAGDTPSSNRLFDSIVSHCVPVTISDEIELPFE 282
Query: 308 DAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPG 364
D + + + + V D K L ++ I E R L +++ + P+Q
Sbjct: 283 DVLDYSKFSVIVRGADAVKKGFLMNLIKGISREEWTRMWNRLK--EVEKHFEYQYPSQTD 340
Query: 365 DAFHQILNGLARKLP 379
DA I +ARK+P
Sbjct: 341 DAVQMIWKAIARKVP 355
>gi|224112673|ref|XP_002316258.1| predicted protein [Populus trichocarpa]
gi|222865298|gb|EEF02429.1| predicted protein [Populus trichocarpa]
Length = 339
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 73/303 (24%), Positives = 122/303 (40%), Gaps = 52/303 (17%)
Query: 77 LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTE 136
L +SP T NPEEA ++ P S R + I+ + +PYWNRT
Sbjct: 69 LQNSPFVTQNPEEAHLYFVPFSSNL------------STRSVARFIRDLRMEFPYWNRTL 116
Query: 137 GADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYA 196
GADHF+V G Y+ +R ++ L + + + F + IT PP+A
Sbjct: 117 GADHFYVSCAGLG----YES----DRNLVELKKNSVQISCFPTTEGRFVPHKDITFPPHA 168
Query: 197 PPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDIST 256
+ + + R Y + + + + F I +
Sbjct: 169 QGNRTAKY----------LGFVR--------------YNEVKESNLVNELRKDSDFLIES 204
Query: 257 D--HPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV--LPFADAIPW 312
+ + T + ++FCL G A S + EA+ FGC+PV++ D + LP D I W
Sbjct: 205 EPSNGMTLVGRLGSSVFCLFEYG-ADVSG-IGEALRFGCVPVMVMDRPMQDLPLMDVIGW 262
Query: 313 EEIGMFVAEE-DVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQIL 371
++I +FV V ++ L + +R L + + + P QP D+FH ++
Sbjct: 263 QKIAIFVGSRGGVKEVKRELDRTCKDDECAGRRRLGVVASQHFVWNHMP-QPYDSFHMVM 321
Query: 372 NGL 374
L
Sbjct: 322 YQL 324
>gi|320165867|gb|EFW42766.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 508
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 73/305 (23%), Positives = 119/305 (39%), Gaps = 46/305 (15%)
Query: 36 PVGKLKVYVYELPSKYNKKLLLKDPRCL---THMFAAEIFMHRFLLSSPVRTLNPEEADW 92
P+G LK+YVY+ K +++ +CL T M+ E+ + +++ S T NP+EA
Sbjct: 129 PIGTLKIYVYKEMMK-----VMEPHKCLQTVTPMWLEEVELPSWVVKSIHYTNNPDEAHM 183
Query: 93 FYTPIYPTC--DLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGA 150
F+ P C D T L + M + ++ + Y+ R G DHF + P G
Sbjct: 184 FFIPAMVRCILDFNRTQFHLTSEFTEM----VDVLHTKHDYYRRNHGHDHFIINP---GG 236
Query: 151 CFHYQEEKAIERGILPL-----LQRATLVQTFGQRNHVCLNEGSITIP---PYAPPQKMQ 202
+ + P+ AT + + R+ + IP Y + M
Sbjct: 237 GSMNVISSLLAGELHPVAANDWYSNATKLLSEAARSRAYFSGLDFVIPGSADYIFGKFMD 296
Query: 203 AHQIPPDTPRSIFVYFRG------------LFYDVNNDPEGGYYARGARAAVWENFKNNP 250
Q + +F+Y G L V D E + R + N ++P
Sbjct: 297 VSQKIEEERPMLFLYLGGTSLGDQRQALGRLRKLVQGDSEQAAFFRD--KVLIANKIDDP 354
Query: 251 LFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAI 310
+ P Y +Q FC P G +PW+ R ++++ GCIPV F D I
Sbjct: 355 V-------PELYSLRIQNFTFCAAPHGTSPWTQRFYDSLISGCIPVQFDRRFRFGFYDHI 407
Query: 311 PWEEI 315
W+ I
Sbjct: 408 DWDSI 412
>gi|227206252|dbj|BAH57181.1| AT3G03650 [Arabidopsis thaliana]
Length = 244
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 60/212 (28%), Positives = 87/212 (41%), Gaps = 9/212 (4%)
Query: 173 LVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGG 232
+V FG+ + N + PY + R I +YF+G Y GG
Sbjct: 21 VVADFGRYSPHVANVDKDIVAPYKHLVPSYVNDTSGFDGRPILLYFQGAIYR----KAGG 76
Query: 233 YYARGARAAVWENFKNNPLFDISTDHPTTYY-EDMQRAIFCLCPLGWAPWSPRLVEAVVF 291
+ + + E + F +H + E M+ + FCL G P S RL +A+
Sbjct: 77 FVRQELYNLLKEEKDVHFSFGSVRNHGISKAGEGMRSSKFCLNIAGDTPSSNRLFDAIAS 136
Query: 292 GCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLAN 348
CIPVII+DDI LP+ D + + E +FV D K L ++ SI E + L
Sbjct: 137 HCIPVIISDDIELPYEDVLNYNEFCLFVRSSDALKKGFLMGLVRSIGREEYNKMWLRLKE 196
Query: 349 PSMKRAMLFPQPAQPGD-AFHQILNGLARKLP 379
+ FP GD A I +ARK P
Sbjct: 197 VERYFDLRFPVKDDEGDYAVQMIWKAVARKAP 228
>gi|168043163|ref|XP_001774055.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674601|gb|EDQ61107.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 541
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 69/296 (23%), Positives = 122/296 (41%), Gaps = 49/296 (16%)
Query: 68 AAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISS 127
A++F R LL S T P++A F P G P + +R +Q + +
Sbjct: 178 TADLF-SRLLLRSKFSTELPQKAQLFLLPFSIDALRVDLG---PSRISDHLRRYVQNVRT 233
Query: 128 NWPYWNRTEGADHFFVVPHDFGACFHYQ-----EEKAIERGILPLLQRATLVQTFGQRNH 182
++PYWN + GA+HF++ F ++ E+ +I+ PL Q N
Sbjct: 234 SYPYWNLSLGANHFYLSSQAFENNNKHRNVLELEKNSIQAACAPLRQ-----------NQ 282
Query: 183 VCLNEGSITIPPYAPPQKMQAH-QIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAA 241
P Y P + + + + T R++ YF G D A
Sbjct: 283 NFYPHKDFIFPRYKPITQTEFYAALEGRTSRTVLAYFGGTLADT-----------PALVF 331
Query: 242 VWENFKNNPLFDISTD---HPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 298
+ + +K++P F++ D H + Y + R+ FC+ + V+A+ FGC+ V++
Sbjct: 332 ILDAWKSDPDFEVEVDPSPHRISVYRQLARSKFCVNVPSRDTFD--FVDAIRFGCVLVLL 389
Query: 299 ADDIV--LPFADAIPWEEIGMFVAEEDV--PKLDTILTSIPPEVILRKQRLLANPS 350
+ + LPF + W + F A+ V KL+ + +++LLAN S
Sbjct: 390 SKSVFLDLPFQGFLDWRQ---FAAQHQVWPSKLEQRHCG-----FMMREKLLANCS 437
>gi|34394612|dbj|BAC83914.1| limonene cyclase like protein [Oryza sativa Japonica Group]
gi|50508943|dbj|BAD31847.1| limonene cyclase like protein [Oryza sativa Japonica Group]
Length = 332
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 55/103 (53%), Gaps = 3/103 (2%)
Query: 273 LCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILT 332
L LG A + EA+ C+PV+I DD LPFAD + W + VA D+P+L IL
Sbjct: 218 LLELGLAKARATIREAIYLECVPVVIGDDYTLPFADVLNWAAFSVRVAVGDIPRLKEILA 277
Query: 333 SIPPEVILRKQRLLANPSMKRAMLFPQPA-QPGDAFHQILNGL 374
++ P +R QR + +++R + A + D FH IL+ +
Sbjct: 278 AVSPRQYIRMQRRV--RAVRRHFMVSDGAPRRFDVFHMILHSI 318
>gi|159483755|ref|XP_001699926.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158281868|gb|EDP07622.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 785
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 59/229 (25%), Positives = 100/229 (43%), Gaps = 34/229 (14%)
Query: 165 LPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQK-------MQAHQIPPD-------- 209
LP LQ+ +R E + PPY ++ +A+ PD
Sbjct: 565 LPALQQ--------EREQCYRPEHDVAFPPYLSDREGNWFSVMKEAYDYTPDGKATFKRG 616
Query: 210 TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRA 269
R YF G P+ Y A G R + F N+ D+S + + M R+
Sbjct: 617 VARDTLFYFNGF-----TKPDLAYSA-GVRQGLLALFGNSTRADLSINKGGGS-QRMLRS 669
Query: 270 IFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDT 329
FC P+G+ W RL +A++ GC+P+++ D + D +P+E+ + V+ ++ +L
Sbjct: 670 RFCFTPMGFG-WGIRLSQAMLTGCVPIMVHDHVWPTLWDVLPYEQFSIRVSRHNMYRLLD 728
Query: 330 ILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKL 378
L SI P+ + R Q +A +A ++ QP G A++ L L +L
Sbjct: 729 YLESITPQQLARLQDGVAQ--WHKAFVW-QPEVGGLAYNYTLTSLHHRL 774
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 56/119 (47%), Gaps = 7/119 (5%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTH-MFAAEI-FMHRFLLSSPVRTLNPEEADWFYTP 96
+LK+YVY+LP L D L H ++ AEI HR L + V T NP EA+ FY P
Sbjct: 373 ELKIYVYDLPQHVAYMRPLGDHWPLHHSIYLAEIELYHRLLGDTTVVTENPWEANLFYVP 432
Query: 97 IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQ 155
+ + G P ++ + + +P+WN T G +H +D G C Y+
Sbjct: 433 THTYYYIGNIGFP-----GKLYTAVFHHVRQQYPWWNLTAGRNHVVSNSNDRGCCDLYR 486
>gi|359491711|ref|XP_002284930.2| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase isoform 1
[Vitis vinifera]
Length = 498
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 57/192 (29%), Positives = 83/192 (43%), Gaps = 28/192 (14%)
Query: 212 RSIFVYFRGLFYDVNNDPEGG------YYARGARAAVWENFKNNPLFDISTDHPTTYYED 265
R + VYF+G Y +GG YY V F + + + E
Sbjct: 311 RPLLVYFQGAIYR----KDGGAIRQELYYLLRDEKDVHFTFGS-----VRGNGINGASEG 361
Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
M + FCL G P S RL +A+V C+PVII+D+I LPF D + + E +FV D
Sbjct: 362 MASSKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDEIELPFEDVLDYSEFCIFVRASDAV 421
Query: 326 K---LDTILTSIPPEVILRK-QRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP-- 379
K L +L I E + +RL + + P+Q GDA I ++RK+
Sbjct: 422 KNGFLLNLLRGIKREKWTKMWERL---KEIAHHFEYQYPSQAGDAVDMIWGAVSRKISSI 478
Query: 380 ----HDKSTYLK 387
H K+ Y +
Sbjct: 479 QNKLHRKNRYRR 490
>gi|255644422|gb|ACU22716.1| unknown [Glycine max]
Length = 189
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 43/134 (32%), Positives = 65/134 (48%), Gaps = 11/134 (8%)
Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
+ M + FCL G P S RL +A+V C+PVII+D+I LPF D + + + +FV D
Sbjct: 51 QGMAMSKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDEIELPFEDVLDYSDFSIFVRASD 110
Query: 324 VPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP- 379
K L +L SI + + L + + P+QPGDA + I + RK+
Sbjct: 111 SMKKGYLLNLLRSITQKEWSKMWERL--KQITHHFEYQYPSQPGDAVNMIWQQVERKISS 168
Query: 380 -----HDKSTYLKP 388
H K+ Y +
Sbjct: 169 IRFNLHRKNRYQRS 182
>gi|297733976|emb|CBI15223.3| unnamed protein product [Vitis vinifera]
Length = 455
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 57/192 (29%), Positives = 83/192 (43%), Gaps = 28/192 (14%)
Query: 212 RSIFVYFRGLFYDVNNDPEGG------YYARGARAAVWENFKNNPLFDISTDHPTTYYED 265
R + VYF+G Y +GG YY V F + + + E
Sbjct: 268 RPLLVYFQGAIYR----KDGGAIRQELYYLLRDEKDVHFTFGS-----VRGNGINGASEG 318
Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
M + FCL G P S RL +A+V C+PVII+D+I LPF D + + E +FV D
Sbjct: 319 MASSKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDEIELPFEDVLDYSEFCIFVRASDAV 378
Query: 326 K---LDTILTSIPPEVILRK-QRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP-- 379
K L +L I E + +RL + + P+Q GDA I ++RK+
Sbjct: 379 KNGFLLNLLRGIKREKWTKMWERL---KEIAHHFEYQYPSQAGDAVDMIWGAVSRKISSI 435
Query: 380 ----HDKSTYLK 387
H K+ Y +
Sbjct: 436 QNKLHRKNRYRR 447
>gi|145352087|ref|XP_001420390.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580624|gb|ABO98683.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 517
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 85/397 (21%), Positives = 145/397 (36%), Gaps = 69/397 (17%)
Query: 30 DVLEDDPVGKLKVYVYELPSKYNKKLLLKDPR-CLTHMFAAEIFMHRFLLSSPVRTLNPE 88
+ L D P G L++Y + P K + + E ++ R + P T + +
Sbjct: 140 EALNDLP-GTLRLYTLKTPRVTRLKYAFAEKIPTFNRGYGVETYLAREIERGPFATRSFK 198
Query: 89 EADWFYTPIYPTCDLTPTGLPLPFKSP----------------RMMRSAIQLISSNWPYW 132
+A + P+ P T P R+ R ++ + N W
Sbjct: 199 DATAIFIPVRPYLQRLLTAEAYVMAGPAKGSNNNGAIRQAIRDRLSRDIERVKAINAEAW 258
Query: 133 NRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLL--------------------QRAT 172
+ + D G + + G + + +RA
Sbjct: 259 TSKQKCARVVMTNIDIGLSAFDSSDDEVRHGAVVITGNSELPIQNSEDAKDDEARARRAN 318
Query: 173 LVQT-FGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEG 231
VQ F HV + G I + P + ++ + R+I V FRG
Sbjct: 319 AVQAGFDPARHVAIWFG---ISSHLPREVVRMGALKSTNVRTIEVSFRG----------- 364
Query: 232 GYYARGARAAVWENFKNNPL---FDIST---DHPTTYYEDMQRAIFCLCPLGWAPWSPRL 285
+ G R V+ K +D+ST D P Y + ++ +CL G + RL
Sbjct: 365 SMHRGGVRRVVFPTLKQAEAGRGWDLSTSGQDKPRDYMTMLSKSKYCLYVYGDRAHTARL 424
Query: 286 VEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRL 345
+ + FGC+PVI+AD LPF+ W + + V E+DV L +IL + LR++ +
Sbjct: 425 YDIITFGCVPVIVADGYDLPFSWLFDWSKFSVRVLEDDVATLPSILDRADYDS-LRRELV 483
Query: 346 LANPSMK---RAMLFPQPAQPGDAFHQILNGLARKLP 379
+ + R +F GDAF + G+ R+L
Sbjct: 484 KVHSFFQYHNRGSIF------GDAFWITMLGVRRQLA 514
>gi|405958592|gb|EKC24705.1| Xyloglucan galactosyltransferase KATAMARI1 [Crassostrea gigas]
Length = 497
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 82/335 (24%), Positives = 128/335 (38%), Gaps = 73/335 (21%)
Query: 19 PKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLL----LKDPRCL----------- 63
PK + SAG V + K+Y+Y+LP K+N ++L + RC
Sbjct: 56 PKTCTVLPSAG-VSTNGVKTPFKIYIYDLPKKFNLEILKIYDVWHARCYSFEFCGFGARL 114
Query: 64 -----------THMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPF 112
+H F+ E+ +H L SP RTL+PE+AD FY P Y GL +
Sbjct: 115 FNLESGVHVHDSHQFSLEVLVHHLLQLSPYRTLDPEQADLFYIPAY-------IGLQCLY 167
Query: 113 KSPRMMRSAIQLISSNWPYWNR----TEGADHFFVVPHDFGACFHYQEEKAIERGILPLL 168
S + + +LI+ + Y G HF + E + +G P L
Sbjct: 168 ASFDNVSATNKLINELFVYLQSQPYFASGKPHF--------SSLAKIEREMQSKGCCPYL 219
Query: 169 ---QRATLVQTF-----GQRNHVCLNEGSITIP----PYAPPQKMQAHQIPPDTPRSIFV 216
Q A + TF R LN+ IT+P + +Q +PR++F+
Sbjct: 220 LHPQSANI--TFLSIERETRYQSALNQRVITVPYPSYIHLDGSVTSRNQYLHSSPRNVFI 277
Query: 217 ----------YFRGLFYDVNNDPEGGYYARGARAAVWEN-FKNNPLFDISTDHPTTYYED 265
++R L D + Y W + F DH Y
Sbjct: 278 LLAAGTRRSNHYRSLILDQFREKTHLSYPEYTATNQWRSEFPMVMYITKECDHSAKYSTV 337
Query: 266 --MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 298
M +++FCL P G +P +A++ GC+PV+
Sbjct: 338 RWMLQSVFCLQPPGDSPTRKSFYDALLSGCVPVLF 372
>gi|242077606|ref|XP_002448739.1| hypothetical protein SORBIDRAFT_06g032380 [Sorghum bicolor]
gi|241939922|gb|EES13067.1| hypothetical protein SORBIDRAFT_06g032380 [Sorghum bicolor]
Length = 456
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 68/246 (27%), Positives = 104/246 (42%), Gaps = 45/246 (18%)
Query: 116 RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ 175
R R + ++S+ P W R+ G DH FV+ D A +H + E I P + LV
Sbjct: 160 RRQREVVDRVTSH-PAWRRSGGRDHVFVMT-DPVAMWHVRAE------IAPAI---LLVV 208
Query: 176 TFG----------QRN------HVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFR 219
FG RN H ++ I PY + + + R +YF+
Sbjct: 209 DFGGWYKVDSKSANRNSSRMIQHTQVSLLKDVIVPYT--HLLPTLLLSENKDRPTLLYFK 266
Query: 220 GLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYED-----MQRAIFCLC 274
G + GG R +W+ N P + P + MQ + FCL
Sbjct: 267 G----AKHRHRGGL----VREKLWDLLGNEPDVIMEEGFPNATGREQSIKGMQTSEFCLH 318
Query: 275 PLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDV--PK-LDTIL 331
P G P S RL +A+ CIPVI++D++ LP+ I + E +FV+ + PK L + L
Sbjct: 319 PAGDTPTSCRLFDAIASLCIPVIVSDEVELPYEGMIDYTEFSIFVSVRNAMRPKWLTSYL 378
Query: 332 TSIPPE 337
+IP +
Sbjct: 379 RNIPKQ 384
>gi|224097754|ref|XP_002311068.1| predicted protein [Populus trichocarpa]
gi|222850888|gb|EEE88435.1| predicted protein [Populus trichocarpa]
Length = 380
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 70/309 (22%), Positives = 122/309 (39%), Gaps = 29/309 (9%)
Query: 82 VRTLNPEEADWFYTPIYPTCDLTPTGL----PLPFKSPRMMRSAIQLISSNWPYWNRTEG 137
VR L+PE A+ ++ P + + G P K ++ I + + YW R+ G
Sbjct: 72 VRVLDPEIAEAYFVPFFSSLSFNTHGRNMTDPETEKDRQLQVDLIDFLQKS-KYWQRSGG 130
Query: 138 ADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAP 197
DH V+P F + + + IL +V FG+ + PY
Sbjct: 131 RDH--VIPMTHPNAFRFLRQ-LVNASIL-------IVADFGRYPKSLSTLSKDVVSPYV- 179
Query: 198 PQKMQAHQ----IPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFD 253
+ + + + P R ++FRG + +G A+ + +
Sbjct: 180 -HNVDSFKDDDLLDPFESRKTLLFFRG---NTVRKDKGKVRAKLEKILAGYDDVRYERSS 235
Query: 254 ISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE 313
+ + + M+ + FCL P G P S RL +A+V C+PVI++D I LP+ D I +
Sbjct: 236 PTAEAIQASTQGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDLIELPYEDEIDYS 295
Query: 314 EIGMFVAEEDVPKLDTI---LTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQI 370
+ +F + + + D + L P + + R L S F P DA + +
Sbjct: 296 QFSIFFSINEAIQPDYLVNQLRKFPKDRWIEMWRQLKKIS--HHFEFQYPPVKEDAVNML 353
Query: 371 LNGLARKLP 379
+ KLP
Sbjct: 354 WRQVKNKLP 362
>gi|255585230|ref|XP_002533317.1| catalytic, putative [Ricinus communis]
gi|223526861|gb|EEF29074.1| catalytic, putative [Ricinus communis]
Length = 478
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 71/300 (23%), Positives = 122/300 (40%), Gaps = 40/300 (13%)
Query: 116 RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQ-----R 170
+ + I+ + S+ W R+ G DH FV+ D A +H + E I +L ++ R
Sbjct: 182 KRQKEVIEFVKSS-DAWKRSGGKDHVFVLT-DPVAMWHVRAE--IAPAVLLVVDFGGWYR 237
Query: 171 ATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIP--PDTPRSIFVYFRGLFYDVNND 228
+ G +++ + + P Q+P + R +YF+G
Sbjct: 238 LDSKSSDGNSSNIIRHTQVSLLKDVIVPYTHLLPQLPLSENKKRQTLLYFKG-------- 289
Query: 229 PEGGYYARGA--RAAVWENFKNNPLFDISTDHPTTYYED-----MQRAIFCLCPLGWAPW 281
Y RG R +W+ N P + P + M+ + FCL P G P
Sbjct: 290 --AKYRHRGGMVREKLWDLLVNEPGVIMEEGFPNATGREQSIKGMRTSEFCLHPAGDTPT 347
Query: 282 SPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILR 341
S RL +A+ CIP+I++D+I LPF + + E +F+A +D K + ++ + + I +
Sbjct: 348 SCRLFDAIQSLCIPIIVSDNIELPFEGIVDYLEFSVFMAVDDALKPNWLVDHL--KSISK 405
Query: 342 KQRLLANPSMKRAMLFPQ----------PAQPGDAFHQILNGLARKLPHDKSTYLKPGGK 391
KQR M + P P A + I + +KLP K ++ K
Sbjct: 406 KQRDEFRQKMAEVQSIFEYDNGYAGGIGPVPPNGAVNHIWKKVHQKLPIIKEAIVREKRK 465
>gi|159473595|ref|XP_001694919.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158276298|gb|EDP02071.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 674
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 46/148 (31%), Positives = 75/148 (50%), Gaps = 11/148 (7%)
Query: 234 YARGARAAVWENFKNNPLFDISTDH---PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVV 290
Y+ G R + F N D+S + P+ M R+ FC P+G+ W RL +A +
Sbjct: 527 YSGGVRQGLLALFGNTTRPDVSINKGGGPSL----MLRSRFCFTPMGFG-WGVRLTQAAM 581
Query: 291 FGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPS 350
GC+PV++ D + D +P+E+ + V+ ++ +L IL SI E + Q LA+
Sbjct: 582 TGCVPVMVQDHVWPTLWDVLPYEKFSIRVSRHNLYRLFEILDSITAEELASLQAGLAH-- 639
Query: 351 MKRAMLFPQPAQPGDAFHQILNGLARKL 378
RA ++ QP G A++ L L R+L
Sbjct: 640 WHRAFVW-QPEFGGLAYNYTLESLQRRL 666
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 37/128 (28%), Positives = 55/128 (42%), Gaps = 7/128 (5%)
Query: 39 KLKVYVYELPSKYNKKLLLKD--PRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP 96
+LK+YVY+LP D P A F +R L RT NP EA+ FY P
Sbjct: 255 RLKIYVYDLPESVVHMRSHSDEWPLHFPIYLAEHEFFNRLLGDWATRTENPWEANLFYIP 314
Query: 97 IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQE 156
+ + G P ++ + + N+P+WN T G +H +D G C Y+
Sbjct: 315 TFTYYYIGNVGQP-----GKLFSRVVSYVRHNYPFWNMTGGRNHILTSVNDRGCCDIYRL 369
Query: 157 EKAIERGI 164
++R I
Sbjct: 370 GPDVQRPI 377
>gi|125576050|gb|EAZ17272.1| hypothetical protein OsJ_32791 [Oryza sativa Japonica Group]
Length = 506
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 61/119 (51%), Gaps = 5/119 (4%)
Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
+ M + FCL G P S RL +A+V C+PVII+DDI LP+ DA+ + + +FV D
Sbjct: 370 QGMHASKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDDIELPYEDALDYSKFSIFVRSSD 429
Query: 324 VPKLDTILTSIPPEVILRKQRLLANPSMK---RAMLFPQPAQPGDAFHQILNGLARKLP 379
K ++ I + + Q + +K + + P+Q DA I LARK+P
Sbjct: 430 AVKKGYLMRLI--RGVSKHQWTMMWRRLKEVDKHFEYQYPSQKDDAVQMIWQTLARKVP 486
>gi|302766093|ref|XP_002966467.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
gi|300165887|gb|EFJ32494.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
Length = 453
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 70/277 (25%), Positives = 111/277 (40%), Gaps = 62/277 (22%)
Query: 83 RTLNPEEADWFYTPIYP--TCDLTPTGLPLPFKSPR------MMRSAIQLISSNWPYWNR 134
R L+P++AD + P + + + F+ + R+ +++++S+ W R
Sbjct: 113 RVLDPDQADVVFVPFFAALSAEAQLRNGKGHFRHRKDNEDYERQRAVMEIVTSS-SRWQR 171
Query: 135 TEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG---------QRN---- 181
+ G DH FV+ D A +H++ E I IL LV FG RN
Sbjct: 172 SGGRDHVFVLT-DPMAMYHFRAE--IANSIL-------LVVDFGGWYMEDAKSSRNLSSP 221
Query: 182 ----HVCLNEGSITIPPYAPPQKMQAHQIPP------DTPRSIFVYFRGLFYDVNNDPEG 231
H ++ I PY H +P + RS +YF+G Y
Sbjct: 222 QPIYHTQVSLIKDVIVPYT-------HLLPTLALSQDNAVRSTLLYFKGARYR------- 267
Query: 232 GYYARGARAAVWENFKNNPLFDISTDHPT-----TYYEDMQRAIFCLCPLGWAPWSPRLV 286
+ R +W P + P + M+ + FCL P G P S RL
Sbjct: 268 -HRTGLVRDQLWSVLDGEPGVLLEEGFPNRTGQVQAVQGMRNSHFCLHPAGDTPSSCRLF 326
Query: 287 EAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
+AV CIPVI++D I LPF + + + +FV+ D
Sbjct: 327 DAVASLCIPVIVSDSIELPFEGMLDYTQFAIFVSVHD 363
>gi|222640628|gb|EEE68760.1| hypothetical protein OsJ_27458 [Oryza sativa Japonica Group]
Length = 518
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 43/121 (35%), Positives = 62/121 (51%), Gaps = 11/121 (9%)
Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
+ M + FCL G P S RL +A+V C+PVII+DDI LPF D + + +FV D
Sbjct: 383 KGMASSKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDDIELPFEDVLDYSAFCVFVRASD 442
Query: 324 VPK---LDTILTSIPPE---VILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARK 377
K L +L I E + R+ + +A+ + P+QPGDA I +ARK
Sbjct: 443 AVKRGFLLHLLRGISQEEWTAMWRRLKEVAHH-----FEYQYPSQPGDAVQMIWGAVARK 497
Query: 378 L 378
+
Sbjct: 498 M 498
>gi|405121937|gb|AFR96705.1| hypothetical protein CNAG_03480 [Cryptococcus neoformans var. grubii
H99]
Length = 1125
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 36/109 (33%), Positives = 59/109 (54%), Gaps = 4/109 (3%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y D+ A FC P G WSP+ +A+ GCIPV IA+ PFA + W ++ + VA
Sbjct: 995 YINDLNNARFCPQPRGITGWSPQTNDAIYAGCIPVFIAEGTHYPFAGFLDWSKLSVRVAP 1054
Query: 322 EDVPKLDTILTSIPPEVILRKQRLLAN-PSMKRAMLFPQPAQPGDAFHQ 369
++ K++ IL +IP + + + L AN S++ A L+ +P + +
Sbjct: 1055 TELDKIEKILAAIP---LSKVEELQANLVSVREAFLYSGDEKPEEELER 1100
>gi|219362519|ref|NP_001137073.1| uncharacterized protein LOC100217246 [Zea mays]
gi|194691480|gb|ACF79824.1| unknown [Zea mays]
gi|413919898|gb|AFW59830.1| hypothetical protein ZEAMMB73_270023 [Zea mays]
Length = 458
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 61/228 (26%), Positives = 94/228 (41%), Gaps = 46/228 (20%)
Query: 116 RMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQ 175
R R + ++S+ P W R+ G DH FV+ D A +H + E I P + LV
Sbjct: 162 RRQREVVDRVTSH-PAWRRSSGRDHIFVLT-DPVAMWHVRAE------IAPAI---LLVV 210
Query: 176 TFG------------------QRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVY 217
FG Q V L + I + P + + + R +Y
Sbjct: 211 DFGGWYKVDSKSSSKNSSRVIQHTQVSLLKDVIVPYTHLLPTLL----LSENKDRRTLLY 266
Query: 218 FRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYED-----MQRAIFC 272
F+G + GG R +W+ N P + P + ++ + FC
Sbjct: 267 FKG----AKHRHRGGL----VREKLWDLLGNEPDVIMEEGFPNATGREQSIKGLRTSEFC 318
Query: 273 LCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
L P G P S RL +A+ CIPVI++D++ LPF I + EI +FV+
Sbjct: 319 LHPAGDTPTSCRLFDAIASLCIPVIVSDEVELPFEGIIDYTEISIFVS 366
>gi|320165878|gb|EFW42777.1| hypothetical protein CAOG_07909 [Capsaspora owczarzaki ATCC 30864]
Length = 508
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 78/330 (23%), Positives = 126/330 (38%), Gaps = 23/330 (6%)
Query: 34 DDPVGKLKVYVY-ELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADW 92
D P+G LK++VY E+ +K L M+ E + ++++S T NPEEA
Sbjct: 126 DAPIGVLKIFVYTEMIKNMEPHNCVK---ALVPMWREEAELPTWVVNSIHYTTNPEEAHM 182
Query: 93 FYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPH--DFGA 150
F+ P C + P +S + +AI ++ Y+ R G DHF + P G
Sbjct: 183 FFIPAMGRCMIAIHDRPHVLQSDSFL-NAIDILHVKNDYFRRRYGYDHFIINPGGGSLGL 241
Query: 151 CFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQ---KMQAHQIP 207
+ I AT + + R TIP A + M+ H P
Sbjct: 242 ITDILWGSSSSATINTFYSNATKLLSESVRPRGYFAGRDFTIPGSADYRFGPYMKIHHQP 301
Query: 208 PDTPRSIFVYFRGLFYDVNNDPEG----GYYARGARAAVWENFKNNPLF--DISTDHPTT 261
+F++ G +AA F++ L I+ P+
Sbjct: 302 LADRPMLFMFLGDTGLREQRQALGRLKVALQGDSEQAAF---FRDKVLIASKINDPDPSL 358
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + Q FC P G +PW+ R ++++ GCIPV + D + W+ I +
Sbjct: 359 YPKRTQNFTFCAAPHGTSPWTQRFYDSLISGCIPVQFDRRFRFGYYDHVDWDAIVIRYPT 418
Query: 322 EDVPKLDTI----LTSIPPEVILRKQRLLA 347
V K + S+ E I +QR +A
Sbjct: 419 AQVDKFSFLEYLYKLSLDVEFIRERQRRIA 448
>gi|324508547|gb|ADY43609.1| Exostosin-1b, partial [Ascaris suum]
Length = 628
Score = 64.7 bits (156), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 45/79 (56%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y M + FCL P G S R +E++ GCIPVI++DD LPF++ I W + + E
Sbjct: 292 YETTMSNSTFCLTPRGRRLGSFRFLESLRLGCIPVILSDDWELPFSEIIDWSQAAVIAHE 351
Query: 322 EDVPKLDTILTSIPPEVIL 340
+ V + +L +IP E +L
Sbjct: 352 DTVLTISDVLNAIPLERVL 370
>gi|449502905|ref|XP_004161776.1| PREDICTED: probable glucuronosyltransferase Os01g0926700-like
[Cucumis sativus]
Length = 482
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 68/281 (24%), Positives = 109/281 (38%), Gaps = 37/281 (13%)
Query: 83 RTLNPEEADWFYTPIYPT--CDLTPTGLPLPFKSP------RMMRSAIQLISSNWPYWNR 134
R EEAD + P + T ++ F+ R+ + + S W +
Sbjct: 141 RVFEAEEADVIFVPFFATMSAEMQLGMAKGAFRKKVGNEDYERQRNVMDFLKST-DAWKK 199
Query: 135 TEGADHFFVVPHDFG---ACFHYQEEKAIE-------RGILPLLQRATLVQTFGQRNHVC 184
+ G DH H A +H + E A G L +++ + H
Sbjct: 200 SGGRDHVLFSLHSLTDPVAMWHVKAEIAPAVLLVVDFGGWFRLDTKSSNGSSPDMIQHTQ 259
Query: 185 LNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWE 244
++ I PY + + + R +YF+G + GG R +W+
Sbjct: 260 VSVLKDVIVPYT--HLLPRLHLSANKKRQTLLYFKG----AKHRHRGGL----VREKLWD 309
Query: 245 NFKNNPLFDISTDHPTTYYED-----MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIA 299
N P + P ++ M+ + FCL P G P S RL +A+ CIPV+++
Sbjct: 310 LLVNEPDVIMEEGFPNATGKEQSIKGMRSSEFCLHPAGDTPTSCRLFDAIQSLCIPVVVS 369
Query: 300 DDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPE 337
D+I LPF D + + E +FVA D K L L +IP E
Sbjct: 370 DNIELPFEDMVDYSEFSVFVAVNDALKPNWLVKHLRTIPEE 410
>gi|356507133|ref|XP_003522325.1| PREDICTED: probable glycosyltransferase At3g42180-like [Glycine
max]
Length = 484
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 93/398 (23%), Positives = 149/398 (37%), Gaps = 76/398 (19%)
Query: 40 LKVYVYELPSKYNKKLLL-------------------KDPRCLTHMFAAEIFMHRFLLSS 80
LKV++Y+LP +++ LL + P L + E ++ LLSS
Sbjct: 112 LKVFMYDLPPEFHFGLLGWKGSVNQTWPEVDNPERIPRYPGGLNLQHSMEYWLTLDLLSS 171
Query: 81 PV-------RTLNPEEADWFYTPIYPTCDLTP----TGLPLPFKSPRMMRSAIQLISSNW 129
V R + +AD + P + + G + R+ +Q +
Sbjct: 172 KVGQPCTAIRVQDSSQADVIFVPFFSSLSYNRHSKLNGQEKVSLNKRLQDRLVQFLMGR- 230
Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGS 189
W R+ G DH V H ++ L ++ FG+ N
Sbjct: 231 KEWKRSGGKDHLIVAHH----------PNSLLDARRRLGAAMLVLADFGRYPVELANIKK 280
Query: 190 ITIPPYAPPQKMQAHQIPPDTP-----RSIFVYFRGLFYDVNNDPEGGYYARGARAAVW- 243
I PY + IP R+ VYF+G Y +GG R ++
Sbjct: 281 DIIAPY----RHLVGTIPRAESASFEKRTTLVYFQGAIYR----KDGG----AIRQELYY 328
Query: 244 ----ENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIA 299
EN + I + + M + FCL G P S RL +A+V C+PVII+
Sbjct: 329 LLKDENDVHFTFGSIGGNGINQASQGMALSKFCLNIAGDTPSSNRLFDAIVSHCVPVIIS 388
Query: 300 DDIVLPFADAIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRK-QRLLANPSMKRAM 355
D+I LPF D + + + + V D K L +L SI + + +RL +
Sbjct: 389 DEIELPFEDDLDYSDFSIIVHASDAMKKGYLLNLLRSIKRDEWNKMWERL---KQITHHF 445
Query: 356 LFPQPAQPGDAFHQILNGLARKLP------HDKSTYLK 387
+ P+QPGDA + I + K+ H K+ Y +
Sbjct: 446 EYQYPSQPGDAVNMIWQQVEHKISSIRFNLHRKNRYQR 483
>gi|326920358|ref|XP_003206441.1| PREDICTED: exostosin-2-like [Meleagris gallopavo]
Length = 718
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 55/104 (52%), Gaps = 8/104 (7%)
Query: 241 AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 300
AV + NN +FD Y + +Q A FC+ G L + + GC+PVIIAD
Sbjct: 295 AVRKRCHNNQIFD--------YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVIIAD 346
Query: 301 DIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQR 344
+LPF++ + W+ + + EE +P++ +IL S+P I QR
Sbjct: 347 SYILPFSEVLDWKRASVVIPEEKMPEMYSILQSVPQRQIEEMQR 390
>gi|302760669|ref|XP_002963757.1| galactosyltransferase-like protein [Selaginella moellendorffii]
gi|300169025|gb|EFJ35628.1| galactosyltransferase-like protein [Selaginella moellendorffii]
Length = 411
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 93/411 (22%), Positives = 149/411 (36%), Gaps = 72/411 (17%)
Query: 26 GSAGDVLEDDPVGKL-KVYVYELPSKYNKKLLLK-------------------------- 58
S +LE+DPV K +++VY+LP ++N LL
Sbjct: 3 ASENAILEEDPVCKGGRIFVYDLPPRFNADLLANCSTLNPWLSLCDALSHGGLGKPMTTT 62
Query: 59 -------DPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLP 111
P T F+ E+ H +L P T + + A+ FY P Y D++ L P
Sbjct: 63 PWPSSKPSPWFYTEQFSGEVIFHTRILRHPCVTNDSDSANVFYVPFYAGLDVSRY-LWRP 121
Query: 112 FKSPRMMRSAIQLIS--SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQ 169
K+ +L+ S P W R G DHF ++ F EE A G+L + +
Sbjct: 122 SKAEDRDHLGHKLVEWLSTQPAWTRARGRDHFTMIGR-ITWDFRRPEENAWGSGLLNMAE 180
Query: 170 RATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRS-----IFVYFRGLFYD 224
+ + + N E + P PQ Q + R+ +F +
Sbjct: 181 MKNMTRLAIESNPWEGGEYGVPYPTSFHPQNEHQLQEWQEFVRNKERGLVFSFAGATRKR 240
Query: 225 VNNDPEGGYYA-----RGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWA 279
+ ND A RGA +A+ D + P + + FCL P G
Sbjct: 241 IPNDFRLELLAQCSDSRGACSAM-------DCSDSKCETPEPVVQLFLNSTFCLQPRGDG 293
Query: 280 PWSPRLVEAVVFGCIPVIIADDI-------VLPFADAIPWEEIGMFVAEEDV---PKLDT 329
+ ++V+ GCIPV + P D E +F+ EDV K+
Sbjct: 294 YTRRSIFDSVLAGCIPVFFWNQSSYWQYKWFFPEED----ESYSVFIDREDVRKGTKIME 349
Query: 330 ILTSIPPEVI--LRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKL 378
+L+ E + +R + A P + A DAF ++G+ R +
Sbjct: 350 VLSRFSQERVKAMRNTLIDALPKLVYATA-DHELSGADAFDTAIDGVLRSM 399
>gi|302800594|ref|XP_002982054.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
gi|300150070|gb|EFJ16722.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
Length = 453
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 77/304 (25%), Positives = 123/304 (40%), Gaps = 63/304 (20%)
Query: 83 RTLNPEEADWFYTPIYP--TCDLTPTGLPLPFKSPR-----MMRSAIQLISSNWPYWNRT 135
R L+P++AD + P + + + F+ + + A+ I ++ W R+
Sbjct: 113 RVLDPDQADVVFVPFFAALSAEAQLRNGKGHFRHRKDNEDYERQKAVMEIVTSSSRWQRS 172
Query: 136 EGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG---------QRN----- 181
G DH FV+ D A +H++ E I IL LV FG RN
Sbjct: 173 GGRDHVFVLT-DPMAMYHFRAE--IANSIL-------LVVDFGGWYMEDAKSSRNLSSPQ 222
Query: 182 ---HVCLNEGSITIPPYAPPQKMQAHQIPP------DTPRSIFVYFRGLFYDVNNDPEGG 232
H ++ I PY H +P + R+ +YF+G Y
Sbjct: 223 PIYHTQVSLIKDVIVPYT-------HLLPTLALSQDNAVRTTLLYFKGARYR-------- 267
Query: 233 YYARGARAAVWENFKNNPLFDISTDHPT-----TYYEDMQRAIFCLCPLGWAPWSPRLVE 287
+ R +W P + P + M+ + FCL P G P S RL +
Sbjct: 268 HRTGLVRDQLWSVLDGEPGVLLEEGFPNRTGQVQAVQGMRNSHFCLHPAGDTPSSCRLFD 327
Query: 288 AVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED--VPK-LDTILTSIPPEVILRKQR 344
AV CIPVI++D I LPF + + + +FV+ D +PK L L+S+ +V + +
Sbjct: 328 AVASLCIPVIVSDSIELPFEGMLDYTQFAIFVSVHDALLPKWLVRHLSSLSSKVRNQMRH 387
Query: 345 LLAN 348
LA+
Sbjct: 388 NLAS 391
>gi|443722437|gb|ELU11306.1| hypothetical protein CAPTEDRAFT_208188 [Capitella teleta]
Length = 449
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 44/133 (33%), Positives = 61/133 (45%), Gaps = 7/133 (5%)
Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD-DIVL--PFADAIPWE----EIGMF 318
MQRA+FCL P G +P +AV+ GCIPVI D D+ + PF + + EI
Sbjct: 317 MQRALFCLQPPGDSPTRKSFYDAVISGCIPVIFKDADVTVRYPFDSHLNYSAFCVEIDAS 376
Query: 319 VAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKL 378
D LD + + I QR L + FP P DAF ILN + +L
Sbjct: 377 AVRRDRTALDALRELVSQRNIQHMQRDLQTAAACLQYSFPFHHSPNDAFAMILNQIEVRL 436
Query: 379 PHDKSTYLKPGGK 391
++ S ++P G
Sbjct: 437 QNNGSVDVQPLGS 449
Score = 44.7 bits (104), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 40/89 (44%), Gaps = 23/89 (25%)
Query: 41 KVYVYELPSKYNKKL--LLKDPRCL---------------------THMFAAEIFMHRFL 77
++YVYELP +YN+ + + C T FA E+ +H +
Sbjct: 62 RIYVYELPGEYNRDIAQCFEGNECEKLGSCGYGPLIAQHGNLQVRNTWQFALEVIVHHRM 121
Query: 78 LSSPVRTLNPEEADWFYTPIYPTCDLTPT 106
L+SP RTL+ EA+ FY P Y D T
Sbjct: 122 LASPYRTLDINEANAFYLPYYSGLDCLCT 150
>gi|389744786|gb|EIM85968.1| hypothetical protein STEHIDRAFT_168993 [Stereum hirsutum FP-91666
SS1]
Length = 1111
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 77/331 (23%), Positives = 129/331 (38%), Gaps = 72/331 (21%)
Query: 65 HMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQL 124
H F A+ M + + + VR +EA+ + P Y D ++
Sbjct: 772 HCFTADTQMEKGVSAMGVRR---DEAEMTFIPSYQQWD-----------------GDHRM 811
Query: 125 ISSNWPYWNRT--EGADHFFVV---PHDFGACF-------HYQEEKAIERGILPLLQRAT 172
+ W Y NR +G D V+ HDFG C H +E + + + P ++ T
Sbjct: 812 LERTWEY-NRDAFDGFDASKVIIPFTHDFGQCLAFEWDVWHMRERQGLR--VHPFVRSTT 868
Query: 173 LVQTFGQRNHVCLN-EGSITIPPYA--PPQKMQAH----QIPPDTPRSIFVYFRGLFY-- 223
G + C + IPP P ++ + P R + V F G+ +
Sbjct: 869 AWSVMGDLHSACYRPHQDVIIPPRTCLSPSLFKSFPTVADVRPARDRRVLVAFNGVLWGT 928
Query: 224 ---------------DVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQR 268
D +++ +A G N K+ + T+ Y +
Sbjct: 929 GALNRNRLVCPRSHWDSDDNASRRLHASGP------NLKSL----VGTNGDYEYMSLLND 978
Query: 269 AIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLD 328
+FC P G W+ RLV+++ GCIPV+I PF D + W +I + V D+ +L+
Sbjct: 979 TVFCPQPAGTTGWATRLVDSMYAGCIPVLIGQASHFPFYDMLDWGKISIRVEPSDLAQLE 1038
Query: 329 TILTSIPPEVILRKQRLLANPSMKR-AMLFP 358
IL S + +RL AN + R A+++P
Sbjct: 1039 DILFS--RYSLDDIERLQANIMLVRDALVYP 1067
>gi|9454585|gb|AAF87908.1|AC015447_18 Hypothetical protein [Arabidopsis thaliana]
Length = 414
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 41/133 (30%), Positives = 59/133 (44%), Gaps = 12/133 (9%)
Query: 40 LKVYVYELPSKYNKKLLL-------KDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADW 92
LK+YVY+ K LL K CL + +++ +H+ LL S RT+ +EAD
Sbjct: 89 LKIYVYDENEIDGLKELLYGRDGSVKTTACLKGQWGSQVKIHKLLLESKFRTIKKDEADL 148
Query: 93 FYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
F+ P Y C GL + + + + S PY+ R+ G DH FV P GA
Sbjct: 149 FFVPAYVKCVRMLGGL-----NDKEINQTYVKVLSQMPYFRRSGGRDHIFVFPSGAGAHL 203
Query: 153 HYQEEKAIERGIL 165
I R I+
Sbjct: 204 FRSWSTFINRSII 216
>gi|387015840|gb|AFJ50039.1| Exostosin-2-like [Crotalus adamanteus]
Length = 718
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 33/83 (39%), Positives = 47/83 (56%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC+ G L +A+ GCIPV+IAD VLPF++ + W+ + + E
Sbjct: 308 YPQVLQEATFCIVLRGARLGQAVLSDALQAGCIPVVIADSYVLPFSEVLDWKRASVVIPE 367
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E +P+L IL SIP I QR
Sbjct: 368 EKMPELYNILQSIPQRQIEEMQR 390
>gi|296081850|emb|CBI20855.3| unnamed protein product [Vitis vinifera]
Length = 618
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 90/396 (22%), Positives = 148/396 (37%), Gaps = 88/396 (22%)
Query: 2 WVTAI--ALLLAFVSADNTPKIERISGSAGDVLEDDPVGK---------LKVYVYELPSK 50
W +A+ LL A +N P I+ G + + V K LKVY Y +
Sbjct: 263 WSSAVDKELLYAKSQIENAPIIKNDPGLHASLYRNVSVFKRSYELMENTLKVYTY----R 318
Query: 51 YNKKLLLKDPRCLTHMFAAEI-FMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLP 109
++ + P + ++A+E FM + T N +A FY P + + L
Sbjct: 319 EGERPVFHQP-PIKGIYASEGWFMKLMQANKKFVTKNGRKAHLFYLPF--SSLMLEEALY 375
Query: 110 LPFKSPRM-----MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGI 164
+P R +++ + +I + +P+WNRT GADHF V HD+ E +
Sbjct: 376 VPNSHSRKNLEQYLKNYLDMIGAKYPFWNRTGGADHFLVACHDWAPS----ETLKLMANS 431
Query: 165 LPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYD 224
+ L + + + F V L E + IP P +++ P + R I +F G +
Sbjct: 432 IRALCNSDIREGFKLGKDVSLPETCVRIPQ-NPLRQLGG---KPPSQRRILAFFAGSMHG 487
Query: 225 VNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPT------TYYEDMQRAIFCLCPLGW 278
Y R WEN +P I P Y + M+ + +C+C G+
Sbjct: 488 ---------YVRPILLKYWEN--KDPDMKIYGRMPKAKKGTMNYIQHMKSSKYCICAKGY 536
Query: 279 APWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEV 338
SPR +D+P L +IL SIP +
Sbjct: 537 EVNSPR-------------------------------------KDIPNLKSILLSIPEKS 559
Query: 339 ILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 374
L Q + +++ L+ D FH IL+ +
Sbjct: 560 YLEIQMRVKQ--VQQHFLWHAKPVKYDVFHMILHSV 593
>gi|18399194|ref|NP_564443.1| exostosin-like protein [Arabidopsis thaliana]
gi|5091619|gb|AAD39607.1|AC007454_6 F23M19.7 [Arabidopsis thaliana]
gi|15450928|gb|AAK96735.1| Unknown protein [Arabidopsis thaliana]
gi|20148711|gb|AAM10246.1| unknown protein [Arabidopsis thaliana]
gi|332193570|gb|AEE31691.1| exostosin-like protein [Arabidopsis thaliana]
Length = 477
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 59/214 (27%), Positives = 89/214 (41%), Gaps = 34/214 (15%)
Query: 128 NWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNE 187
N W R+ G DH FV+ D A +H +EE A+ IL LV FG +
Sbjct: 191 NTKAWKRSNGRDHVFVLT-DPVAMWHVREEIAL--SIL-------LVVDFGG----WFRQ 236
Query: 188 GSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLF----YDVNNDPEGGYYARGARAAVW 243
S + + P+++Q Q+ + + V + L N Y +GA+
Sbjct: 237 DSKSSNGTSLPERIQHTQV--SVIKDVIVPYTHLLPRLDLSQNQRRHSLLYFKGAKHRHR 294
Query: 244 ENFKNNPLFDISTDHPTTYYED--------------MQRAIFCLCPLGWAPWSPRLVEAV 289
L+D+ + P E+ M+ + FCL P G P S RL +A+
Sbjct: 295 GGLIREKLWDLLVNEPGVVMEEGFPNATGREQSIRGMRNSEFCLHPAGDTPTSCRLFDAI 354
Query: 290 VFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
CIPVI++D I LPF I + E +F + D
Sbjct: 355 QSLCIPVIVSDTIELPFEGIIDYSEFSVFASVSD 388
>gi|302845050|ref|XP_002954064.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300260563|gb|EFJ44781.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 1122
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 71/291 (24%), Positives = 108/291 (37%), Gaps = 46/291 (15%)
Query: 71 IFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWP 130
+F R L+S+ +RT++ +EAD+++ P+ +L P M+ + I +P
Sbjct: 636 LFWQR-LMSAGIRTVDGDEADYYFIPVNTRTELAPG----------MVEWVLSYIRRTYP 684
Query: 131 YWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSI 190
+W++ G H + D G LP R+ L F N L I
Sbjct: 685 WWSKDNGNRHLIIHTGDMGI------------ADLPADMRSRLKSAFS--NITWLTHWGI 730
Query: 191 TIPPYAPPQK-MQAHQIPPDTPRSIFVYFRGLFYD-VNNDPEGGYYARGARAAVWENF-- 246
Y P K AH+ D + V +G +N E R A F
Sbjct: 731 Y--QYHPVAKWYPAHRPGKDVVLPVMVTTQGFHLSPLNPRVEARARRRNQTMARSGTFFF 788
Query: 247 ----------KNNPLFDISTDHP----TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFG 292
+ D S P D+ FCL PLG R V + G
Sbjct: 789 AGRICGDRKPPDPATGDCSRTRPDYSGGVRQLDISSHKFCLAPLG-GGHGKRQVLVSLMG 847
Query: 293 CIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQ 343
C+PV+I + ++ PF I W + V E D+P L IL +I + + Q
Sbjct: 848 CVPVLIGNGVLQPFEPEIDWSRFSVSVPEADIPDLPRILANISDQRVADMQ 898
>gi|339247053|ref|XP_003375160.1| exostosin-1 [Trichinella spiralis]
gi|316971539|gb|EFV55297.1| exostosin-1 [Trichinella spiralis]
Length = 1019
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 33/84 (39%), Positives = 51/84 (60%)
Query: 261 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
Y E M + FCL P G S R +EA+ GCIPVI+++D VLPF++ I W++ +
Sbjct: 604 NYEELMANSKFCLVPRGRRLGSFRFLEALEKGCIPVILSNDWVLPFSEVIDWDQAVVRGD 663
Query: 321 EEDVPKLDTILTSIPPEVILRKQR 344
E + +L ++L + P VILR ++
Sbjct: 664 ERTLFQLPSLLRAYPESVILRMRQ 687
>gi|168017435|ref|XP_001761253.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687593|gb|EDQ73975.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 336
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 43/82 (52%)
Query: 263 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
YED+ F L P G P S R +E + G IPV+IAD+ V PF IPW +
Sbjct: 214 YEDLMNTTFALVPAGRQPSSYRFIEVLAAGSIPVLIADNYVKPFDSLIPWYTCAIQFPTT 273
Query: 323 DVPKLDTILTSIPPEVILRKQR 344
++ ++ L + PE L++QR
Sbjct: 274 EIKRIVNTLRKVSPEEKLKRQR 295
>gi|307108297|gb|EFN56537.1| hypothetical protein CHLNCDRAFT_51514 [Chlorella variabilis]
Length = 810
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/141 (30%), Positives = 64/141 (45%), Gaps = 30/141 (21%)
Query: 42 VYVYELPSKYNKKLLL---KDPRCL---------THM----FAAEIFMHRFLLSSPVRTL 85
+YVYE+P++Y ++ C+ TH+ +AAE LL S RTL
Sbjct: 340 IYVYEMPAEYVTLMIQYRWGGTACVPRYFDFHNETHLSEDLYAAESGFLEMLLQSEHRTL 399
Query: 86 NPEEADWFYTPIYPTCDLTPTGLPLP--------------FKSPRMMRSAIQLISSNWPY 131
+PEEAD+ Y P Y +C +TP + M+ A I ++ PY
Sbjct: 400 DPEEADFLYVPAYTSCLITPVQRTADSLRDMWYGVENLRVHAATHMLLEAYYWIKAHAPY 459
Query: 132 WNRTEGADHFFVVPHDFGACF 152
WNR G DH ++V D +C+
Sbjct: 460 WNRRGGWDHIWLVTFDEASCY 480
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 40/159 (25%), Positives = 67/159 (42%), Gaps = 5/159 (3%)
Query: 181 NHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSI--FVYFRGLFYDVNNDPEGGYYARGA 238
+H C + + +P PQ + TPRS + RG N G R A
Sbjct: 561 SHPCHAQQDLVVPLMKTPQTFYKSPLLGATPRSRTWLAFHRGRVQQDNPPYSRGLRQRLA 620
Query: 239 RAAVWENF--KNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPV 296
A+ + K+ + Y E + + FCL G WS R+ +A++ GCIPV
Sbjct: 621 NASAAGGWLEKHKIVMGEHDVVEGDYSELLASSTFCLVLPG-DGWSARMDDAMLHGCIPV 679
Query: 297 IIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIP 335
II D++ + F + + + + D +L +L ++P
Sbjct: 680 IIMDNVHVSFESILDLAAFTVRIPQADAERLPEVLAAVP 718
>gi|159473731|ref|XP_001694987.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158276366|gb|EDP02139.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 721
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 50/182 (27%), Positives = 77/182 (42%), Gaps = 22/182 (12%)
Query: 208 PDTP-RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFD-----------IS 255
PD P R+ +YF G + Y++G R + + F +D I
Sbjct: 524 PDAPARTTTLYFGGYTKPIMA------YSQGVRQTIHKMFGPGGKYDPEGPNARKDFVIG 577
Query: 256 TDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEI 315
+ M+ A FCL P+G A W RL EA+V GC+PVII D I D +P+EE
Sbjct: 578 GPAGGAAVDSMKLAKFCLAPMG-AGWGIRLAEAMVSGCVPVIIQDHIYQAHWDILPFEEF 636
Query: 316 GMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLA 375
+ + ++ +L IL + P+ + L A F G A++ + L
Sbjct: 637 SIRIGRNELHQLVDILDDVSPQQL---DSLQAGIERYHRAFFWDAHWGGLAYNYTVQALK 693
Query: 376 RK 377
R+
Sbjct: 694 RR 695
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 36/115 (31%), Positives = 55/115 (47%), Gaps = 7/115 (6%)
Query: 39 KLKVYVYELPSKYNKKLLLKDPRCLTH-MFAAEI-FMHRFLLSSPVRTLNPEEADWFYTP 96
++KVYVY+LP+ + D L M+ +E+ F + L VRT NP EA FY P
Sbjct: 300 EIKVYVYDLPNNVVHRREFNDQWALIDLMYNSELEFTDKLLGDWGVRTENPWEAALFYVP 359
Query: 97 IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGAC 151
+ G P +++ A + + N P++N T G +H F +D G C
Sbjct: 360 TFTYWFTGNVGHPY-----YVIQHATKWLQENSPFFNLTGGRNHIFWATNDRGVC 409
>gi|393907069|gb|EFO19790.2| exostosin-1 [Loa loa]
Length = 677
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 56/99 (56%), Gaps = 1/99 (1%)
Query: 243 WENFKNNPLFDISTDHPTTYYE-DMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 301
W+ +++ + + ++ YE M + FCL P G S R +EA+ GCIPV+++DD
Sbjct: 271 WKKYEDGRCEEDNVEYDHWDYEMTMANSTFCLTPRGRRLGSFRFLEALRLGCIPVVLSDD 330
Query: 302 IVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVIL 340
LPF++ I W + + E+ V + +L++IP + IL
Sbjct: 331 WELPFSEVIDWRQAVVIGHEDTVLTISDVLSAIPFDRIL 369
>gi|384245403|gb|EIE18897.1| hypothetical protein COCSUDRAFT_45043 [Coccomyxa subellipsoidea
C-169]
Length = 420
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 53/184 (28%), Positives = 76/184 (41%), Gaps = 45/184 (24%)
Query: 1 MWVTAIALLLAFVSADNTPKIERISGSAGDVLEDDPVGKLK--VYVYELPSKYNKKLLLK 58
M + A L+A VS +AG G+ K VYVY+LP K+NK + K
Sbjct: 1 MGIAAFLCLVALVS----------EAAAGGWRHTQGRGQAKEFVYVYDLPEKFNKDI--K 48
Query: 59 DPRCLTH--MFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLT--------PTGL 108
+ + H + + +H+ L S V T +P A FY P+Y P+
Sbjct: 49 ELPTIWHPEQYDIDQVLHKHLTHSEVNTKDPSVAKVFYIPVYLGRYFNAQWQRFSDPSDA 108
Query: 109 PLPFK---------------------SPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHD 147
L K + ++RSAI + N+PYWN + GADHF V +D
Sbjct: 109 WLINKECHGLDSVDCWAEKWKVAENATSDLVRSAIAHVKENYPYWNASNGADHFMVFSYD 168
Query: 148 FGAC 151
G C
Sbjct: 169 HGKC 172
>gi|312084451|ref|XP_003144281.1| exostosin-1 [Loa loa]
Length = 694
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 56/99 (56%), Gaps = 1/99 (1%)
Query: 243 WENFKNNPLFDISTDHPTTYYE-DMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 301
W+ +++ + + ++ YE M + FCL P G S R +EA+ GCIPV+++DD
Sbjct: 271 WKKYEDGRCEEDNVEYDHWDYEMTMANSTFCLTPRGRRLGSFRFLEALRLGCIPVVLSDD 330
Query: 302 IVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVIL 340
LPF++ I W + + E+ V + +L++IP + IL
Sbjct: 331 WELPFSEVIDWRQAVVIGHEDTVLTISDVLSAIPFDRIL 369
>gi|449439621|ref|XP_004137584.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
IRX7-like [Cucumis sativus]
Length = 494
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 91/408 (22%), Positives = 149/408 (36%), Gaps = 59/408 (14%)
Query: 17 NTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLL----LKD------------- 59
N K E + D+ D +L+V++Y+LP Y+ LL KD
Sbjct: 89 NFVKDEVSVENQSDLGCDPAKARLRVFMYDLPPLYHFGLLGWKGEKDQIWPYVSNRSQIP 148
Query: 60 --PRCLTHMFAAEIFMHRFLLSS----------PVRTLNPEEADWFYTPIYPTCDL---T 104
P L + E ++ LLSS VR + +AD + P + + +
Sbjct: 149 PYPGGLNLQHSMEYWLTLDLLSSNVPDMDHTCTAVRVKDSSQADVIFVPFFSSLSYNQHS 208
Query: 105 PTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGI 164
+ ++++ + W RT G +H + H K + +
Sbjct: 209 KSHGKEKINVNKILQQKLIDFLFGQKEWRRTGGKNHLVIAHHPNSML---DARKKLGSAM 265
Query: 165 LPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQK-MQAHQIPPDTPRSIFVYFRGLFY 223
L FG+ N I PY K + + + R I VYF+G Y
Sbjct: 266 FVLAD-------FGRYPAAIANIEKDIIAPYRHIVKTVPSSKSATFDERPILVYFQGAIY 318
Query: 224 DVNNDPEGG------YYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLG 277
+GG YY V F + + + + M + FCL G
Sbjct: 319 R----KDGGVVRQELYYLLKDEEDVHFTFGS-----VKGNGINKAGQGMASSKFCLNIAG 369
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
P S RL +++ C+PVII+DDI LP+ D + + E +FV D + +L +
Sbjct: 370 DTPSSNRLFDSIASHCVPVIISDDIELPYEDILDYSEFCVFVRAADSIRKGYLLNLLRGI 429
Query: 338 VILRKQRLLAN-PSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDKST 384
R ++ + + P+Q GDA I ++RK+ KS
Sbjct: 430 GRERWTKMWDRIKEIVHEFEYQYPSQSGDAVDMIWQAVSRKVSKIKSN 477
>gi|449531319|ref|XP_004172634.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
sativus]
Length = 307
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 38/115 (33%), Positives = 59/115 (51%), Gaps = 5/115 (4%)
Query: 35 DPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFY 94
D K KVY+Y + + PR LT +A+E + + + S RT +P++A F+
Sbjct: 146 DMESKFKVYIYP---DGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFRTEDPDQAHLFF 202
Query: 95 TPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFG 149
PI +C ++++ ++ + S +PYWNRT GADHFFV HD G
Sbjct: 203 IPI--SCHKMRGKGTSYENMTVIVQNYVEGLISKYPYWNRTLGADHFFVTCHDVG 255
>gi|224119530|ref|XP_002318097.1| predicted protein [Populus trichocarpa]
gi|222858770|gb|EEE96317.1| predicted protein [Populus trichocarpa]
Length = 243
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 33/83 (39%), Positives = 48/83 (57%), Gaps = 2/83 (2%)
Query: 68 AAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISS 127
A+E + L SP T NP+EA F+ I+ +C PLP R+++ ++ + S
Sbjct: 85 ASEHYFFMNLRDSPFLTKNPQEAHLFF--IFISCLPLSDEEPLPGYRERVIKRYVKGLIS 142
Query: 128 NWPYWNRTEGADHFFVVPHDFGA 150
+PYWNRT GADHFFV H+ G+
Sbjct: 143 TYPYWNRTLGADHFFVSCHNIGS 165
>gi|170583869|ref|XP_001896766.1| exostosin-1 [Brugia malayi]
gi|158595933|gb|EDP34387.1| exostosin-1, putative [Brugia malayi]
Length = 411
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 44/75 (58%)
Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
M + FCL P G S R +EA+ GCIPV+++DD LPF++ I W + + E+ V
Sbjct: 156 MSNSTFCLTPRGRRLGSFRFLEALRLGCIPVVLSDDWELPFSEVIDWRQAVIIGHEDTVL 215
Query: 326 KLDTILTSIPPEVIL 340
+ +L +IP + IL
Sbjct: 216 TISDVLNAIPLDRIL 230
>gi|402589373|gb|EJW83305.1| hypothetical protein WUBG_05784 [Wuchereria bancrofti]
Length = 689
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 45/75 (60%)
Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
M + FCL P G S R +EA+ GCIPV+++DD LPF++ I W + + E+ V
Sbjct: 296 MSNSTFCLTPRGRRLGSFRFLEALRLGCIPVVLSDDWELPFSEVIDWRQAVIIGHEDTVL 355
Query: 326 KLDTILTSIPPEVIL 340
+ +L++IP + IL
Sbjct: 356 TISDVLSAIPLDRIL 370
>gi|195486003|ref|XP_002091324.1| GE13596 [Drosophila yakuba]
gi|194177425|gb|EDW91036.1| GE13596 [Drosophila yakuba]
Length = 760
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 70/147 (47%), Gaps = 11/147 (7%)
Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
K H I +T S+F ++N + W ++N + + ++
Sbjct: 266 KRYVHGIGSETRNSLF--------HLHNGRDMVLVTTCRHGKSWRELQDNRCDEDNREYD 317
Query: 260 TTYYEDM-QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
YE + Q + FCL P G S R +EA+ GCIPV++++ VLPF I W++ ++
Sbjct: 318 RYDYETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIW 377
Query: 319 VAEEDVPKLDTILTSIPPEVI--LRKQ 343
E + ++ I+ SIP E I LR+Q
Sbjct: 378 ADERLLLQVPDIVRSIPAERIFALRQQ 404
>gi|3435314|gb|AAC32397.1| putative tumor suppressor homolog [Drosophila melanogaster]
Length = 760
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 70/147 (47%), Gaps = 11/147 (7%)
Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
K H I +T S+F ++N + W ++N + + ++
Sbjct: 266 KRYVHGIGSETRNSLF--------HLHNGRDMVLVTTCRHGKSWRELQDNRCDEDNREYD 317
Query: 260 TTYYEDM-QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
YE + Q + FCL P G S R +EA+ GCIPV++++ VLPF I W++ ++
Sbjct: 318 RYDYETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIW 377
Query: 319 VAEEDVPKLDTILTSIPPEVI--LRKQ 343
E + ++ I+ SIP E I LR+Q
Sbjct: 378 ADERLLLQVPDIVRSIPAERIFALRQQ 404
>gi|195583406|ref|XP_002081513.1| GD11060 [Drosophila simulans]
gi|194193522|gb|EDX07098.1| GD11060 [Drosophila simulans]
Length = 760
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 70/147 (47%), Gaps = 11/147 (7%)
Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
K H I +T S+F ++N + W ++N + + ++
Sbjct: 266 KRYVHGIGSETRNSLF--------HLHNGRDMVLVTTCRHGKSWRELQDNRCDEDNREYD 317
Query: 260 TTYYEDM-QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
YE + Q + FCL P G S R +EA+ GCIPV++++ VLPF I W++ ++
Sbjct: 318 RYDYETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIW 377
Query: 319 VAEEDVPKLDTILTSIPPEVI--LRKQ 343
E + ++ I+ SIP E I LR+Q
Sbjct: 378 ADERLLLQVPDIVRSIPAERIFALRQQ 404
>gi|255570432|ref|XP_002526175.1| catalytic, putative [Ricinus communis]
gi|223534552|gb|EEF36251.1| catalytic, putative [Ricinus communis]
Length = 453
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 62/119 (52%), Gaps = 5/119 (4%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y+E ++ A FCL P G + W+ R E+ C+PV+++D LPF + I + + +
Sbjct: 303 YFEHLRNAKFCLAPRGESSWTLRFYESFFVECVPVLLSDQAELPFQNVIDYTHVSIKWPS 362
Query: 322 EDV-PKLDTILTSIPPEVILRKQRLLANPSMKRAM-LFPQPAQPGDAFHQILNGLARKL 378
+ P+L L SIP E I +R++AN R + ++ ++ A I+ L RK+
Sbjct: 363 TKIGPELLEYLESIPDEDI---ERMIANGRQVRCLWVYAPESEQCSAMQGIMWELQRKV 418
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 39/133 (29%), Positives = 57/133 (42%), Gaps = 12/133 (9%)
Query: 40 LKVYVYELPSKYNKKLLLKD-------PRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADW 92
+K+YVYE K LL+ C+ + ++ +HR L +S RT EEAD
Sbjct: 80 MKIYVYEEKEIDGLKELLRGRDGKISADTCVKGQWGTQVKIHRLLQNSRYRTRKKEEADL 139
Query: 93 FYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
F+ P Y C GL + + + + S PY+ R+ G DH FV P GA
Sbjct: 140 FFVPAYVKCVRMLGGL-----NDKEINLTYVKVLSQMPYFRRSGGRDHIFVFPSGAGAHL 194
Query: 153 HYQEEKAIERGIL 165
I R ++
Sbjct: 195 FRSWATYINRSVI 207
>gi|17137330|ref|NP_477231.1| tout-velu, isoform A [Drosophila melanogaster]
gi|442623684|ref|NP_001260971.1| tout-velu, isoform B [Drosophila melanogaster]
gi|61212925|sp|Q9V730.1|EXT1_DROME RecName: Full=Exostosin-1; AltName: Full=Protein tout-velu
gi|7303172|gb|AAF58236.1| tout-velu, isoform A [Drosophila melanogaster]
gi|60678089|gb|AAX33551.1| LD10920p [Drosophila melanogaster]
gi|85701308|dbj|BAE78509.1| Tout-velu [Drosophila melanogaster]
gi|220960240|gb|ACL92656.1| ttv-PA [synthetic construct]
gi|440214386|gb|AGB93503.1| tout-velu, isoform B [Drosophila melanogaster]
Length = 760
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 70/147 (47%), Gaps = 11/147 (7%)
Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
K H I +T S+F ++N + W ++N + + ++
Sbjct: 266 KRYVHGIGSETRNSLF--------HLHNGRDMVLVTTCRHGKSWRELQDNRCDEDNREYD 317
Query: 260 TTYYEDM-QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
YE + Q + FCL P G S R +EA+ GCIPV++++ VLPF I W++ ++
Sbjct: 318 RYDYETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIW 377
Query: 319 VAEEDVPKLDTILTSIPPEVI--LRKQ 343
E + ++ I+ SIP E I LR+Q
Sbjct: 378 ADERLLLQVPDIVRSIPAERIFALRQQ 404
>gi|195334376|ref|XP_002033859.1| GM21554 [Drosophila sechellia]
gi|194125829|gb|EDW47872.1| GM21554 [Drosophila sechellia]
Length = 760
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 70/147 (47%), Gaps = 11/147 (7%)
Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
K H I +T S+F ++N + W ++N + + ++
Sbjct: 266 KRYVHGIGSETRNSLF--------HLHNGRDMVLVTTCRHGKSWRELQDNRCDEDNREYD 317
Query: 260 TTYYEDM-QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
YE + Q + FCL P G S R +EA+ GCIPV++++ VLPF I W++ ++
Sbjct: 318 RYDYETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIW 377
Query: 319 VAEEDVPKLDTILTSIPPEVI--LRKQ 343
E + ++ I+ SIP E I LR+Q
Sbjct: 378 ADERLLLQVPDIVRSIPAERIFALRQQ 404
>gi|124301269|gb|ABN04853.1| Exostosin-like [Medicago truncatula]
Length = 196
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 63/119 (52%), Gaps = 8/119 (6%)
Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
+ M + FCL P G P + RL +A+V C+PVI++D I LPF D I + +I +FV
Sbjct: 51 QGMHTSKFCLHPAGDTPSACRLFDAIVSLCVPVIVSDSIELPFEDTIDYRKIAVFVETAA 110
Query: 324 VPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
+ L +IL + P+ I+ Q+ +K + + +P ++I +++KLP
Sbjct: 111 AIQPGYLVSILRGMAPDRIVEYQK-----ELKEVKRYFKYDEPDGTVNEIWRQISKKLP 164
>gi|194883052|ref|XP_001975618.1| GG20465 [Drosophila erecta]
gi|190658805|gb|EDV56018.1| GG20465 [Drosophila erecta]
Length = 760
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 70/147 (47%), Gaps = 11/147 (7%)
Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
K H I +T S+F ++N + W ++N + + ++
Sbjct: 266 KRYVHGIGSETRNSLF--------HLHNGRDMVLVTTCRHGKSWRELQDNRCDEDNREYD 317
Query: 260 TTYYEDM-QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
YE + Q + FCL P G S R +EA+ GCIPV++++ VLPF I W++ ++
Sbjct: 318 RYDYETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIW 377
Query: 319 VAEEDVPKLDTILTSIPPEVI--LRKQ 343
E + ++ I+ SIP E I LR+Q
Sbjct: 378 ADERLLLQVPDIVRSIPAERIFALRQQ 404
>gi|71897059|ref|NP_001026520.1| exostosin-2 [Gallus gallus]
gi|60098757|emb|CAH65209.1| hypothetical protein RCJMB04_7p22 [Gallus gallus]
Length = 567
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 33/104 (31%), Positives = 55/104 (52%), Gaps = 8/104 (7%)
Query: 241 AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 300
AV + N+ +FD Y + +Q A FC+ G L + + GC+PVIIAD
Sbjct: 295 AVRKRCHNSQMFD--------YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVIIAD 346
Query: 301 DIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQR 344
+LPF++ + W+ + + E+ +P++ +IL S+P I QR
Sbjct: 347 SYILPFSEVLDWKRASVVIPEDKMPEMYSILQSVPQRQIEEMQR 390
>gi|332030617|gb|EGI70305.1| Exostosin-2 [Acromyrmex echinatior]
Length = 594
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 30/83 (36%), Positives = 46/83 (55%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q + FCL G L++A+ GCIPVIIAD +++PF D I W + +F+ E
Sbjct: 187 YPDSLQTSTFCLIIRGARLAQSALLDAMAAGCIPVIIADSLMMPFHDVIDWTKAAVFIRE 246
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
D+ +L I P+ I+ Q
Sbjct: 247 VDILLTIQLLKKISPQRIMDMQE 269
>gi|224128538|ref|XP_002320357.1| predicted protein [Populus trichocarpa]
gi|222861130|gb|EEE98672.1| predicted protein [Populus trichocarpa]
Length = 137
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 41/117 (35%), Positives = 58/117 (49%), Gaps = 9/117 (7%)
Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
M + FCL G P S RL +A+ C+PVII+D I LPF D + + E G+FV D
Sbjct: 1 MASSKFCLNIAGDTPSSNRLFDAIASQCVPVIISDGIELPFEDVLDYSEFGVFVRASDAV 60
Query: 326 K---LDTILTSIPPE--VILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARK 377
K L +L I + IL ++ P + + P+QPGDA + + RK
Sbjct: 61 KKGYLLYLLRGIKKDQWTILWERLKEIAPQFE----YRYPSQPGDAVDMVWEAVLRK 113
>gi|357508935|ref|XP_003624756.1| Cysteine synthase [Medicago truncatula]
gi|355499771|gb|AES80974.1| Cysteine synthase [Medicago truncatula]
Length = 407
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 39/120 (32%), Positives = 61/120 (50%), Gaps = 5/120 (4%)
Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
M+ + FCL G P S RL +A+ C+PVII+D+I LP+ D + + + +FV D
Sbjct: 45 MRSSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDEIELPYEDVLDYSKFCVFVRTRDAV 104
Query: 326 K---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPHDK 382
K L + SI + R L +++ F P++ GDA I ++RK+P K
Sbjct: 105 KKKYLINFIRSIGKDEWTRMWNRL--KEVEKFFEFQFPSKEGDAVEMIWQAVSRKVPFMK 162
>gi|449533617|ref|XP_004173769.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
IRX7-like, partial [Cucumis sativus]
Length = 330
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 70/318 (22%), Positives = 119/318 (37%), Gaps = 34/318 (10%)
Query: 80 SPVRTLNPEEADWFYTPIYPTCDL---TPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTE 136
+ VR + +AD + P + + + + ++++ + W RT
Sbjct: 17 TAVRVKDSSQADVIFVPFFSSLSYNQHSKSHGKEKINVNKILQQKLIDFLFGQKEWRRTG 76
Query: 137 GADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYA 196
G +H + H ++ L ++ FG+ N I PY
Sbjct: 77 GKNHLVIAHH----------PNSMLDARKKLGSAMFVLADFGRYPAAIANIEKDIIAPYR 126
Query: 197 PPQK-MQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGG------YYARGARAAVWENFKNN 249
K + + + R I VYF+G Y +GG YY V F +
Sbjct: 127 HIVKTVPSSKSATFDERPILVYFQGAIYR----KDGGVVRQELYYLLKDEEDVHFTFGS- 181
Query: 250 PLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 309
+ + + M + FCL G P S RL +++ C+PVII+DDI LP+ D
Sbjct: 182 ----VKGNGINKAGQGMASSKFCLNIAGDTPSSNRLFDSIASHCVPVIISDDIELPYEDI 237
Query: 310 IPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDA 366
+ + E +FV D + L +L I E + + + + P+Q GDA
Sbjct: 238 LDYSEFCVFVRAADSIRKGYLLNLLRGIGRERWTKMWDRI--KEIVHEFEYQYPSQSGDA 295
Query: 367 FHQILNGLARKLPHDKST 384
I ++RK+ KS
Sbjct: 296 VDMIWQAVSRKVSKIKSN 313
>gi|223975397|gb|ACN31886.1| unknown [Zea mays]
Length = 134
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 39/116 (33%), Positives = 60/116 (51%), Gaps = 3/116 (2%)
Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
M+ + FCL P G P S RL +A+V C+PVI++ I LPF D I + E +F + E+
Sbjct: 1 MRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSSRIELPFEDEIDYSEFSLFFSVEEAL 60
Query: 326 KLDTILTSIPPEVILRK--QRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
+ D +L + +V RK L ++ F P + GDA + I + K+P
Sbjct: 61 RPDYLLNEL-RQVPKRKWVDMWLKLKNVSHHYEFQYPPRKGDAVNMIWRQVRHKIP 115
>gi|255642058|gb|ACU21295.1| unknown [Glycine max]
Length = 340
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 41/133 (30%), Positives = 59/133 (44%), Gaps = 12/133 (9%)
Query: 40 LKVYVYELPSKYNKKLLLK-------DPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADW 92
LK+YVY+ K LL+ D CL + +++ +H+ LL S RT EEAD
Sbjct: 67 LKIYVYQENEIDGLKELLRGRDAKITDEACLKGQWGSQVKIHKLLLQSKQRTWKKEEADL 126
Query: 93 FYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
F+ P Y C GL + + + S + S PY+ + G +H FV P GA
Sbjct: 127 FFVPSYVKCARMMGGL-----NDKEINSTYVKVISQMPYFRLSGGRNHIFVFPSGAGAHL 181
Query: 153 HYQEEKAIERGIL 165
I R I+
Sbjct: 182 FKSWATYINRSII 194
>gi|298711085|emb|CBJ26480.1| Glycosyltransferase, family GT47 [Ectocarpus siliculosus]
Length = 561
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 33/97 (34%), Positives = 48/97 (49%), Gaps = 1/97 (1%)
Query: 248 NNPLFDIST-DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 306
N L D S D + YED+ F L P G +P + RL EA+ G +PV I D V PF
Sbjct: 424 NKGLCDASERDFDSVSYEDLMNTTFALLPAGRSPATYRLAEALSAGALPVFIHQDFVKPF 483
Query: 307 ADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQ 343
+PW E E+ P++ L ++P + + + Q
Sbjct: 484 PGQVPWSEFSFSFPPEEAPRILETLRAVPDKKLAQMQ 520
>gi|326932982|ref|XP_003212589.1| PREDICTED: exostosin-1c-like [Meleagris gallopavo]
Length = 734
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 74/286 (25%), Positives = 120/286 (41%), Gaps = 39/286 (13%)
Query: 79 SSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGA 138
SS T +PEEA F I T D P S +RS Q I S +P WN G
Sbjct: 130 SSRYYTPHPEEACLFVLSI-DTLDRDPL-------SAHYIRSVEQRIRS-FPLWN--SGR 178
Query: 139 DHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPP 198
+H + G +Y EE + G +L +A+ F +N + SI + P P
Sbjct: 179 NHLIFNLYA-GTWPNYTEELGFDIG-HAMLAKAS----FHSKNFRPGFDVSIPLFPREHP 232
Query: 199 QK------MQAHQIPPDTPRSIFVYFRGLFY-------------DVNNDPEGGYYARGAR 239
Q+ + + +PP + + F+G Y ++N +
Sbjct: 233 QRGGQSGWLHHNSVPPK--KKYLLVFKGKRYLTGIGSGTRNALHHIHNGRDIVSLTTCRH 290
Query: 240 AAVWENFKNNPLFDISTDHPT-TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII 298
WE K+ + D+ Y E + + FC+ P G S R +EA+ CIPV++
Sbjct: 291 GKDWERHKDTRCDQDNVDYEKFDYQELLHNSTFCIVPRGRRLGSFRFLEALQAACIPVLL 350
Query: 299 ADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQR 344
+D LPF++AI W + + +E + ++ + + I PE +L Q+
Sbjct: 351 SDGWELPFSEAIDWGKAAVVGSERLLLQIPSAVRCIRPERVLAFQQ 396
>gi|402218829|gb|EJT98904.1| hypothetical protein DACRYDRAFT_110241 [Dacryopinax sp. DJM-731 SS1]
Length = 1153
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 77/322 (23%), Positives = 123/322 (38%), Gaps = 58/322 (18%)
Query: 65 HMFAAEIFMHRFLLSSPVRTLN--PEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAI 122
H F+A++ M P+R ++ EEA+ + P Y A
Sbjct: 816 HCFSADVSMEL-----PLREVSVPAEEAEMIFMPFY-------------------QNRAQ 851
Query: 123 QLISSNWPYWNRT-EGAD-HFFVVP--HDFGACFHYQEEKAIERGILPLLQRATLV---- 174
L W + T G D H V+P HDFGAC + E ++ R RA
Sbjct: 852 WLFGEEWKFMRETIPGLDPHKVVIPFTHDFGACMWW--EHSVYRAREERFDRAKEARDSI 909
Query: 175 --QTFGQRNHVCLNE-GSITIPPY--APPQKMQA----HQIPPDTPRSIFVYFRGLFYDV 225
Q N C + +PP A PQ A ++ P R++ F+G ++
Sbjct: 910 AWQVMADMNTPCYAPLQDVVMPPRTCASPQLYAAFSDMARVKPARQRNVLATFKGSYWGT 969
Query: 226 NNDPEGGYYARGARAAVWENFKNNPL---------FDISTDHPTTYYEDMQRAIFCLCPL 276
+ R E+ L +D D+ +Y + I+C P
Sbjct: 970 GANTRRKLNCE-KRLRTLEDVATPRLETEQRLMTVWDSLGDY-ESYPAILNDTIWCPLPE 1027
Query: 277 GWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPP 336
G W+ RL + V GCIPV + PF D + W ++ + + +D+ +++ +L S
Sbjct: 1028 GVTGWATRLEDVVYGGCIPVFVGHASQYPFYDMLDWSKLSIAIERKDLQRIEEVLMSYTM 1087
Query: 337 EVILRKQRLLANPSMKRAMLFP 358
E I R Q L ++ A L+P
Sbjct: 1088 EEIERFQTNLM--LVRDAFLYP 1107
>gi|332017043|gb|EGI57842.1| Exostosin-1 [Acromyrmex echinatior]
Length = 911
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 59/124 (47%), Gaps = 1/124 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y ++N + + W F++ + ++ T YE + A FCL P G
Sbjct: 254 RNALYHLHNGKDLVFVTTCRHGKTWREFQDEHCQQDNQEYDTYDYEILLMNATFCLVPRG 313
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ GCIPVI+++ LPF + I W + +F E + ++ IL S+
Sbjct: 314 RRLGSFRFLEALRSGCIPVILSNSWALPFHERIDWNQAVIFSDERLLLQIPDILRSVSNV 373
Query: 338 VILR 341
IL+
Sbjct: 374 QILK 377
>gi|356560377|ref|XP_003548469.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
At5g03795-like [Glycine max]
Length = 334
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 71/268 (26%), Positives = 104/268 (38%), Gaps = 50/268 (18%)
Query: 67 FAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLIS 126
+A+E + + + S T +P EAD F+ P + L +R IQ IS
Sbjct: 95 YASESYFKKVPMKSHFITKDPTEADLFFLP-FSIARLRHNRRVGVGGKQDFIRDYIQNIS 153
Query: 127 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLN 186
+PYWNRT GADHF+V H G + +A V+ F VC +
Sbjct: 154 HKYPYWNRTGGADHFYVACHSIGR---------------SAMDKAPDVK-FNAIQVVCSS 197
Query: 187 E----GSITIPPYAPPQKMQAHQIPP---DTPRSIFVYFRGLFYDVNNDPEGGYYARGAR 239
G+I PQ PP + R +F G N P R
Sbjct: 198 SYFLTGNIAHKDTCLPQIWPRKGNPPILVSSKRKRLAFFAGGV----NSP--------VR 245
Query: 240 AAVWENFKNNPLFDISTDH---PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPV 296
+ E +KN+ +I H T Y +++ + F L G+ + R I V
Sbjct: 246 VKLLETWKNDS--EIFVHHGRLKTPYADELLGSKFGLHVKGFEVNTTR---------IGV 294
Query: 297 IIADDIVLPFADAIPWEEIGMFVAEEDV 324
IIA+ LPFAD + W+ + V D+
Sbjct: 295 IIANYYDLPFADVLNWKSFSVVVTTLDI 322
>gi|125586098|gb|EAZ26762.1| hypothetical protein OsJ_10674 [Oryza sativa Japonica Group]
Length = 364
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 59/119 (49%), Gaps = 5/119 (4%)
Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
+ M+ + FCL G P S RL +++V C+P+II+D+I LPF D + + + + V D
Sbjct: 213 QGMRASKFCLNIAGDTPSSNRLFDSIVSHCVPIIISDEIELPFEDVLDYSKFCIIVRGAD 272
Query: 324 VPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
K L ++ I E R L ++R + P+Q DA I +ARK P
Sbjct: 273 AVKKGFLMNLINGISREDWTRMWNRLK--EVERHFEYQYPSQNDDAVQMIWKAIARKAP 329
>gi|224051022|ref|XP_002199808.1| PREDICTED: exostosin-2 [Taeniopygia guttata]
Length = 718
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 54/104 (51%), Gaps = 8/104 (7%)
Query: 241 AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 300
AV + N +FD Y + +Q + FC+ G L + + GC+PVIIAD
Sbjct: 295 AVRKRCYKNQVFD--------YPQVLQESTFCVVLRGARLGQAVLSDVLQAGCVPVIIAD 346
Query: 301 DIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQR 344
+LPF++ + W+ + + EE +P++ +IL SIP I QR
Sbjct: 347 SYILPFSEVLDWKRASVVIPEEKMPEMYSILQSIPQRQIEEMQR 390
>gi|449270126|gb|EMC80844.1| Exostosin-2, partial [Columba livia]
Length = 714
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 33/104 (31%), Positives = 54/104 (51%), Gaps = 8/104 (7%)
Query: 241 AVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIAD 300
AV + N +FD Y + +Q + FC+ G L + + GC+PVIIAD
Sbjct: 291 AVRKRCHKNQVFD--------YPQVLQESTFCVVLRGARLGQAVLSDVLQAGCVPVIIAD 342
Query: 301 DIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQR 344
+LPF++ + W+ + + EE +P++ +IL S+P I QR
Sbjct: 343 SYILPFSEVLDWKRASVVIPEEKMPEMYSILQSVPQRQIEEMQR 386
>gi|327259699|ref|XP_003214673.1| PREDICTED: exostosin-2-like [Anolis carolinensis]
Length = 718
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 33/97 (34%), Positives = 51/97 (52%), Gaps = 8/97 (8%)
Query: 248 NNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFA 307
NN +FD Y + +Q A FC+ G L + + GCIPV+IAD +LPF+
Sbjct: 302 NNQVFD--------YPQVLQEATFCIVLRGARLGQAVLSDVLQAGCIPVVIADSYILPFS 353
Query: 308 DAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQR 344
+ + W+ + + EE + ++ +IL SIP I QR
Sbjct: 354 EVLDWKRASVVIPEEKMSEMYSILHSIPQRQIEEMQR 390
>gi|297738432|emb|CBI27633.3| unnamed protein product [Vitis vinifera]
Length = 441
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 60/119 (50%), Gaps = 5/119 (4%)
Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
+ M+ + FCL P G P S RL +A+V C+PVI++D I LP+ D I + + +F ++++
Sbjct: 303 QGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPYEDEIDYTQFSIFFSDKE 362
Query: 324 VPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
+ + L IP E + R L S F P + GDA + + KLP
Sbjct: 363 ALEPGYMIEQLRQIPKERWVEMWRHLKYISHHYE--FQYPPKKGDAIDMLWRQVKHKLP 419
>gi|388512347|gb|AFK44235.1| unknown [Lotus japonicus]
Length = 267
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 63/254 (24%), Positives = 102/254 (40%), Gaps = 21/254 (8%)
Query: 131 YWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSI 190
+W R+ G DH F + H F + L + +V FG R N
Sbjct: 9 HWQRSRGRDHVFPMTHPNAFRFLRNQ----------LNESIQVVVDFG-RYPKGSNLNKD 57
Query: 191 TIPPYAPPQKMQAHQIPPD--TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKN 248
+ PY P D R ++FRG + + EG A+ A+ + +
Sbjct: 58 VVSPYVHVVDSFTDDEPQDPYESRPTLLFFRGRTFRKD---EGIVRAKLAKILTGFDDVH 114
Query: 249 NPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
+ ++ + M+ + FCL P G P S RL +A+V C+PVI++D I LPF D
Sbjct: 115 YERSFATGENIKLSSQGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPFED 174
Query: 309 AIPWEEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD 365
I + + +F + ++ + + L P + R L N S F P + D
Sbjct: 175 EIDYSQFSLFFSFKEALQPGYMIDQLRKFPKDKWSEMWRQLKNISHHYE--FQYPPKKED 232
Query: 366 AFHQILNGLARKLP 379
A + + + KLP
Sbjct: 233 AVNMLWRQVKHKLP 246
>gi|156378657|ref|XP_001631258.1| predicted protein [Nematostella vectensis]
gi|156218295|gb|EDO39195.1| predicted protein [Nematostella vectensis]
Length = 563
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 23/66 (34%), Positives = 43/66 (65%)
Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
MQRA FCL G L+++++ GCIP++++DD +LPF++ + W+ + V+E ++
Sbjct: 234 MQRATFCLVIRGARLGQTALLDSLMMGCIPIVVSDDYILPFSEVLDWKRAAVVVSENEID 293
Query: 326 KLDTIL 331
++ IL
Sbjct: 294 RIPLIL 299
>gi|338711993|ref|XP_001489915.2| PREDICTED: LOW QUALITY PROTEIN: exostosin-2 [Equus caballus]
Length = 728
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 40/131 (30%), Positives = 66/131 (50%), Gaps = 9/131 (6%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC+ G L + + GC+PV+IAD +LPF++ + W+ + V E
Sbjct: 308 YPQVLQEATFCMVLRGARLGQAVLSDVLRAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 322 EDVPKLDTILTSIPPEVILRKQR-LLANPSMKRAMLFPQPAQ-----PGDAF-HQILNGL 374
E + + +IL SIP I QR L P+ + + L + PG+ + +I+N
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQRQLFQEPARRESWLVANHXRSSLIXPGEEWGSEIIND- 426
Query: 375 ARKLPHDKSTY 385
R P+ S+Y
Sbjct: 427 -RIYPYAASSY 436
>gi|356534007|ref|XP_003535549.1| PREDICTED: LOW QUALITY PROTEIN: xylogalacturonan
beta-1,3-xylosyltransferase-like [Glycine max]
Length = 373
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 72/295 (24%), Positives = 110/295 (37%), Gaps = 30/295 (10%)
Query: 83 RTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSP--RMMRSAIQLISSNWPYWNRTEGADH 140
R L+P AD + P + T + + R + + S WNR+ G DH
Sbjct: 48 RVLDPLLADVVFVPFFATLSANKGAFRKKHGNDDYKRQRQVVDAVKSTQ-VWNRSGGRDH 106
Query: 141 FFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSI---TIPPYAP 197
FV+ GA + G R G+ + V + S+ I PY
Sbjct: 107 VFVLT---GAFCKNPSFSFVPGGDFGGWSRGGGGSNCGESDVVPHTQVSVIKDVIVPYM- 162
Query: 198 PQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTD 257
+ + + R +YF+G + GG R +W+ + P +
Sbjct: 163 -HLLPRLDLSENKVRHQLLYFKG----AKHRHRGGII----REKLWDLLVSEPGVIMEEG 213
Query: 258 HPTTYYED-----MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 312
P + M+ + FCL P G P S RL +A+ CIPVI++D I LPF + +
Sbjct: 214 FPNATGREQSIKGMRTSEFCLHPAGDTPTSCRLFDAIQSLCIPVIVSDIIELPFEGMVDY 273
Query: 313 EEIGMFVAEEDVPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPG 364
E +F A D K L L S E +K R N + + + PG
Sbjct: 274 AEFSVFPAVNDARKPSWLGNHLQSFSKE---QKDRFRQNMAQVQPIFVYDNGHPG 325
>gi|297851878|ref|XP_002893820.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297339662|gb|EFH70079.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 478
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 57/214 (26%), Positives = 86/214 (40%), Gaps = 34/214 (15%)
Query: 128 NWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNE 187
N W R+ G DH FV+ D A +H +EE A+ LV FG +
Sbjct: 192 NTEAWKRSNGRDHVFVLT-DPVAMWHVREEIALS---------ILLVVDFGG----WFRQ 237
Query: 188 GSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLF----YDVNNDPEGGYYARGARAAVW 243
S + + P++++ Q+ + + V + L N Y +GA+
Sbjct: 238 DSKSSNGTSLPERIEHTQV--SVIKDVIVPYTHLLPSLDLSQNQRRHSLLYFKGAKHRHR 295
Query: 244 ENFKNNPLFDISTDHPTTYYED--------------MQRAIFCLCPLGWAPWSPRLVEAV 289
L+D+ D E+ M+ + FCL P G P S RL +A+
Sbjct: 296 GGLIREKLWDLLVDEQGIVMEEGFPNATGREQSIIGMRNSEFCLHPAGDTPTSCRLFDAI 355
Query: 290 VFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
CIPVI++D I LPF I + E +FV D
Sbjct: 356 QSLCIPVIVSDTIELPFEGIIDYSEFSVFVPVSD 389
>gi|326437066|gb|EGD82636.1| hypothetical protein PTSG_03293 [Salpingoeca sp. ATCC 50818]
Length = 366
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 38/113 (33%), Positives = 54/113 (47%), Gaps = 8/113 (7%)
Query: 40 LKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY- 98
L+VY+Y LP L+ P + +AAE R L +S T +PEEA F+ +
Sbjct: 106 LRVYMYPLPES------LQLPPTRDYKYAAEATFTRMLRASTFSTDSPEEAQLFFVRVSC 159
Query: 99 PTCDLTPTGLPLPFKSPRMMRSAI-QLISSNWPYWNRTEGADHFFVVPHDFGA 150
T ++ +A+ + +PYWNRT+G DHFFV HD GA
Sbjct: 160 AEARFTQRDREAGQRAADAHATAVLAHVQQRYPYWNRTQGRDHFFVCGHDMGA 212
Score = 43.1 bits (100), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 19/50 (38%), Positives = 29/50 (58%)
Query: 282 SPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTIL 331
+PRL++A+ GC+PV IAD P A + W + +F+AE D + L
Sbjct: 260 TPRLMDAIWAGCVPVFIADHYDPPLAKYVDWALLAVFIAEADAAHIKAHL 309
>gi|215686890|dbj|BAG89740.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215693850|dbj|BAG89049.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 264
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 52/189 (27%), Positives = 85/189 (44%), Gaps = 23/189 (12%)
Query: 205 QIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYE 264
+ + R +YF+G + GG R +W+ N P + +P
Sbjct: 60 HLSENKDRPTLLYFKG----AKHRHRGGL----VREKLWDLMVNEPDVVMEEGYPNATGR 111
Query: 265 D-----MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV 319
+ M+ + FCL P G P S RL +AV CIPVI++D+I LPF I + E +FV
Sbjct: 112 EQSIKGMRTSEFCLHPAGDTPTSCRLFDAVASLCIPVIVSDEIELPFEGMIDYTEFAIFV 171
Query: 320 AEEDVPK---LDTILTSIPPEVILRKQRLLAN--PSMKRAMLFP----QPAQPGDAFHQI 370
+ + + L L ++P + +R +A+ P + ++P AQ G A + I
Sbjct: 172 SVNNSMRPKWLTNYLRNVPRQQKDEFRRNMAHVQPIFEYDSIYPGRMASAAQDG-AVNHI 230
Query: 371 LNGLARKLP 379
+ +KLP
Sbjct: 231 WKKIHQKLP 239
>gi|307177266|gb|EFN66444.1| Exostosin-1 [Camponotus floridanus]
Length = 711
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 59/124 (47%), Gaps = 1/124 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y ++N + + W F++ + ++ T YE + A FCL P G
Sbjct: 254 RNALYHLHNGKDLVFVTTCRHGKAWREFQDEHCQQNNQEYDTYDYEILLMNATFCLVPRG 313
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ GCIPVI+++ LPF + I W + +F E + ++ I+ S+
Sbjct: 314 RRLGSFRFLEALRAGCIPVILSNGWALPFHERIDWTQAAIFSDERLLLQIPDIVRSVSNV 373
Query: 338 VILR 341
IL+
Sbjct: 374 QILK 377
>gi|345305619|ref|XP_001510258.2| PREDICTED: exostosin-2 [Ornithorhynchus anatinus]
Length = 615
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 47/83 (56%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC+ G L + + GC+PVI+AD +LPF++ + W+ + + E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQSVLSDVLQAGCVPVILADSYILPFSEVLDWKRASVVIPE 367
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E +P++ +IL SIP I QR
Sbjct: 368 EKMPEMYSILQSIPQRQIEEMQR 390
>gi|115452759|ref|NP_001049980.1| Os03g0324700 [Oryza sativa Japonica Group]
gi|108707908|gb|ABF95703.1| secondary cell wall-related glycosyltransferase family 47,
putative, expressed [Oryza sativa Japonica Group]
gi|113548451|dbj|BAF11894.1| Os03g0324700 [Oryza sativa Japonica Group]
gi|125543687|gb|EAY89826.1| hypothetical protein OsI_11372 [Oryza sativa Indica Group]
gi|215768347|dbj|BAH00576.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 468
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 59/119 (49%), Gaps = 5/119 (4%)
Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
+ M+ + FCL G P S RL +++V C+P+II+D+I LPF D + + + + V D
Sbjct: 317 QGMRASKFCLNIAGDTPSSNRLFDSIVSHCVPIIISDEIELPFEDVLDYSKFCIIVRGAD 376
Query: 324 VPK---LDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
K L ++ I E R L ++R + P+Q DA I +ARK P
Sbjct: 377 AVKKGFLMNLINGISREDWTRMWNRLKE--VERHFEYQYPSQNDDAVQMIWKAIARKAP 433
>gi|308491504|ref|XP_003107943.1| hypothetical protein CRE_12808 [Caenorhabditis remanei]
gi|308249890|gb|EFO93842.1| hypothetical protein CRE_12808 [Caenorhabditis remanei]
Length = 847
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/154 (24%), Positives = 66/154 (42%), Gaps = 14/154 (9%)
Query: 194 PYAPPQKMQAHQIPPDTPRSIFVYFRG-------------LFYDVNNDPEGGYYARGARA 240
PY + H P + R F+G L + ++N +
Sbjct: 148 PYQIESQRALHNEPKEEKRRYLASFKGKRYVYGIGSGTRNLVHHLHNGDDIVMVTTCKHN 207
Query: 241 AVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIA 299
W+ ++++ + ++ Y+D+ + FCL P G S R +E + GCIPV+I+
Sbjct: 208 NDWQVYQDDRCQRDNEEYDRWEYDDLLSNSTFCLVPRGRRLGSFRFLETLRSGCIPVVIS 267
Query: 300 DDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTS 333
D +LPF++ I W + VAE D + +L S
Sbjct: 268 DSWILPFSETIDWHSAAIVVAERDALSIPELLMS 301
>gi|115461230|ref|NP_001054215.1| Os04g0670600 [Oryza sativa Japonica Group]
gi|113565786|dbj|BAF16129.1| Os04g0670600, partial [Oryza sativa Japonica Group]
Length = 275
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 52/189 (27%), Positives = 85/189 (44%), Gaps = 23/189 (12%)
Query: 205 QIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYE 264
+ + R +YF+G + GG R +W+ N P + +P
Sbjct: 71 HLSENKDRPTLLYFKG----AKHRHRGGL----VREKLWDLMVNEPDVVMEEGYPNATGR 122
Query: 265 D-----MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV 319
+ M+ + FCL P G P S RL +AV CIPVI++D+I LPF I + E +FV
Sbjct: 123 EQSIKGMRTSEFCLHPAGDTPTSCRLFDAVASLCIPVIVSDEIELPFEGMIDYTEFAIFV 182
Query: 320 AEEDVPK---LDTILTSIPPEVILRKQRLLAN--PSMKRAMLFP----QPAQPGDAFHQI 370
+ + + L L ++P + +R +A+ P + ++P AQ G A + I
Sbjct: 183 SVNNSMRPKWLTNYLRNVPRQQKDEFRRNMAHVQPIFEYDSIYPGRMASAAQDG-AVNHI 241
Query: 371 LNGLARKLP 379
+ +KLP
Sbjct: 242 WKKIHQKLP 250
>gi|443712570|gb|ELU05824.1| hypothetical protein CAPTEDRAFT_92231 [Capitella teleta]
Length = 668
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 70/282 (24%), Positives = 118/282 (41%), Gaps = 46/282 (16%)
Query: 77 LLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTE 136
L S T NPEEA F P T D L + + M S IQ + PYWN +
Sbjct: 62 LRQSRYHTTNPEEACLF-VPAIDTLDRDK----LSAEYIQNMESKIQSL----PYWN--D 110
Query: 137 GADH--FFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPP 194
G +H F + P G HY E +L +A++ + + N I+ P
Sbjct: 111 GLNHIIFNLYP---GTWPHYDETDLGFNTGKAMLAKASVSDMWFRPNF------DISFPL 161
Query: 195 YAPPQKMQAHQ--------IPPDTPRSIFVYFRGL-------------FYDVNNDPEGGY 233
+ K + + +PP RS + F+G Y ++ND +
Sbjct: 162 FHKEHKFKGGEPGFLTENLVPP--LRSYTLSFKGKRYLTGIGSETRNSLYHIHNDDDIVM 219
Query: 234 YARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLGWAPWSPRLVEAVVFG 292
W++ K++ + ++ Y+ + + FCL P G S R +EA+
Sbjct: 220 LTTCKHGKSWKDMKDDRCERDNAEYEKYDYKILLHNSTFCLVPRGRRLGSYRFLEALQAA 279
Query: 293 CIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSI 334
CIPV ++++ VLPF++ I W + ++ E + ++ +I+ SI
Sbjct: 280 CIPVFLSNNWVLPFSEVIDWNQAAIWGDERLLLQIPSIVRSI 321
>gi|145355370|ref|XP_001421936.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582175|gb|ABP00230.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 490
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 60/131 (45%), Gaps = 13/131 (9%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y M+ + FCL G SPRL+E+++FGC+PVI+ADD LP + + W + + E
Sbjct: 350 YTAKMKNSRFCLYMRGTRVHSPRLIESMLFGCVPVILADDYELPLSWLVDWSAFSVMIPE 409
Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFP-----QPAQPGDAFHQILNGLAR 376
D + P+ + R M+ M+ P + GDAF G+ R
Sbjct: 410 RDFQTI--------PDALERANSDWDAMHMRLQMVLPLFLYRRRPLVGDAFWATALGVER 461
Query: 377 KLPHDKSTYLK 387
+L ++ K
Sbjct: 462 QLRRRRAECTK 472
>gi|157110617|ref|XP_001651176.1| exostosin-2 [Aedes aegypti]
gi|108878646|gb|EAT42871.1| AAEL005626-PA [Aedes aegypti]
Length = 711
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 48/80 (60%), Gaps = 2/80 (2%)
Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
++ FCL G P L++A+ GCIPVI+AD+++LPF+D + W+ I + + E ++
Sbjct: 307 LENGQFCLIARGVRLSQPTLMDALAAGCIPVIMADNLILPFSDILDWDLISIRIYENNLH 366
Query: 326 KLDTILTSIPPEVI--LRKQ 343
+ T L ++ E I LR Q
Sbjct: 367 SVITTLKAVSKERIQELRAQ 386
>gi|123504861|ref|XP_001328850.1| Exostosin family protein [Trichomonas vaginalis G3]
gi|121911798|gb|EAY16627.1| Exostosin family protein [Trichomonas vaginalis G3]
Length = 325
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 72/307 (23%), Positives = 123/307 (40%), Gaps = 33/307 (10%)
Query: 73 MHRFLLSSPVRTLNPEEADWFYTPIYP-TCDLTPTGLPLPFKSPRMMRSAIQLISSNWPY 131
+H LL+SP+ T N +AD FY P Y C +G P + + ++ Y
Sbjct: 6 VHEQLLASPILTNNTNDADLFYIPHYSRMC----SGFTPPEERWEELPDYLEKYGH---Y 58
Query: 132 WNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSIT 191
+ R DHF + H + AI+ P++ L + +
Sbjct: 59 FTRYSTVDHFMM--HSVPNYGDKPADIAIDDSRQPII--GVLDFKWSEMIKSPWTHAKSQ 114
Query: 192 IPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPL 251
I P+ + +I P R I V+ + N+ + R +++ KN+
Sbjct: 115 ILPFITLKS----KINPKAKRKIPVF---VAMSTNHLAKNSANLRKNLTEIFKKIKNSEF 167
Query: 252 FDISTDHPTTYYE-------DMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL 304
IS P + + M + FC+ P G AP S RL +A+ CIP+I+AD + L
Sbjct: 168 IKISRTSPKSVRDILAVLPTKMGSSDFCIIPPGDAPTSKRLYDAISHLCIPIIVADYMTL 227
Query: 305 PF-ADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAML---FPQP 360
PF +I + E + + +D+ K+ ++ + I ++ L + R M + P
Sbjct: 228 PFDGTSINYTECVIQIPSKDIEKIPDLVNNFDKNKIKEMRKKL---EIVREMFIWDYKNP 284
Query: 361 AQPGDAF 367
G AF
Sbjct: 285 PNAGQAF 291
>gi|359493064|ref|XP_002270238.2| PREDICTED: probable glucuronosyltransferase Os01g0926700-like
[Vitis vinifera]
Length = 483
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 66/264 (25%), Positives = 102/264 (38%), Gaps = 38/264 (14%)
Query: 83 RTLNPEEADWFYTPIYPT--CDLTPTGLPLPFKSPR------MMRSAIQLISSNWPYWNR 134
R + EAD + P + T ++ G F+ R ++ + W R
Sbjct: 146 RVFDVNEADVVFVPFFATISAEIQLGGGKGVFRKKEGNEDYERQRQVMEFVRGT-EAWKR 204
Query: 135 TEGADHFFVVPHDFGACFHYQEEKAIE-------RGILPLLQRATLVQTFGQRNHVCLNE 187
+ G DH FV+ D A +H + E A G L +A+ H ++
Sbjct: 205 SGGRDHVFVLT-DPVAMWHVKAEIAPAILLVVDFGGWYKLDSKASNNSLSEMIQHTQVSL 263
Query: 188 GSITIPPYA---PPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWE 244
I PY P + +QI R +YF+G + GG R +W+
Sbjct: 264 LKDVIVPYTHLLPRLHLSENQI-----RQTLLYFKG----AKHRHRGGL----VREKLWD 310
Query: 245 NFKNNPLFDISTDHPTTYYED-----MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIA 299
+ P + M+ + FCL P G P S RL +A+ CIPVI++
Sbjct: 311 LLVYEQGVIMEEGFPNATGREQSIKGMRTSEFCLHPAGDTPTSCRLFDAIQSLCIPVIVS 370
Query: 300 DDIVLPFADAIPWEEIGMFVAEED 323
D+I LPF + + E +FVA D
Sbjct: 371 DNIELPFEGMVDYSEFSVFVAVRD 394
>gi|440798055|gb|ELR19126.1| hypothetical protein ACA1_335890 [Acanthamoeba castellanii str.
Neff]
Length = 350
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 50/98 (51%)
Query: 251 LFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAI 310
L D S Y++ M F L G S RL EA+ +PVI+AD+ VLPF++A+
Sbjct: 221 LDDSSRGDSGDYHDLMLNTKFALIVQGNGLHSYRLTEAMRANAVPVILADNYVLPFSEAV 280
Query: 311 PWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLAN 348
W+EI +FV E + ++ I E + R + LA
Sbjct: 281 RWDEIAIFVPESQWASIPDVIGRIDDEALARMREKLAT 318
>gi|291190070|ref|NP_001167078.1| Exostosin-2 [Salmo salar]
gi|223647990|gb|ACN10753.1| Exostosin-2 [Salmo salar]
Length = 724
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 47/83 (56%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q + FC+ G L + + GC+PVI+AD +LPF++ + W+ + + E
Sbjct: 314 YPQILQESSFCVVLRGARLGQATLSDVLQAGCVPVILADSYILPFSEVLDWKRASVVIPE 373
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E +P++ TIL SIP + QR
Sbjct: 374 EKLPEMYTILKSIPHRQVEEMQR 396
>gi|303271929|ref|XP_003055326.1| glycosyltransferase family 47 protein [Micromonas pusilla CCMP1545]
gi|226463300|gb|EEH60578.1| glycosyltransferase family 47 protein [Micromonas pusilla CCMP1545]
Length = 595
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 38/142 (26%), Positives = 71/142 (50%), Gaps = 11/142 (7%)
Query: 248 NNPLFDISTDHPTT---YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL 304
N +D+ +D TT Y + + R+ FCL G ++PRLVEA++FGC+PVIIAD L
Sbjct: 450 NRTDWDLDSDGATTPSGYMKLLARSKFCLHVRGTRVYAPRLVEAMLFGCVPVIIADGYDL 509
Query: 305 PFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAM---LFPQPA 361
P + + W+ + + E + + + E++ + +++R + ++ P
Sbjct: 510 PLSWFLDWDAFSVRMTER-----EGVNATRAAEIVDAADWREKHEALRRVVGFFMYHDPP 564
Query: 362 QPGDAFHQILNGLARKLPHDKS 383
GDA G+ R++ ++
Sbjct: 565 VFGDALWATAAGIERQISRGRA 586
>gi|56606002|ref|NP_001008400.1| exostosin-2 [Danio rerio]
gi|55247892|gb|AAV48783.1| exostosin-2 [Danio rerio]
Length = 719
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 47/83 (56%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q + FC+ G L + + GC+PVI+AD +LPF++ + W+ + + E
Sbjct: 309 YPQILQESSFCVVLRGARLGQATLSDVLQAGCVPVIMADSYILPFSEVLDWKRASVVIPE 368
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E +P++ TIL SIP + QR
Sbjct: 369 EKLPEMYTILKSIPHRQVEEMQR 391
>gi|383847805|ref|XP_003699543.1| PREDICTED: exostosin-2-like [Megachile rotundata]
Length = 707
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 44/83 (53%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FCL G L+E + G IPVIIAD + +PF D I W +F+ E
Sbjct: 300 YPDVLQTATFCLVIRGARLGQSTLLECMATGSIPVIIADSLAMPFHDVIDWTRAAIFIRE 359
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
D+ + ++L + P+ I Q+
Sbjct: 360 VDILSVISVLKKVSPKRITELQK 382
>gi|341881773|gb|EGT37708.1| CBN-RIB-1 protein [Caenorhabditis brenneri]
Length = 383
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 29/74 (39%), Positives = 41/74 (55%), Gaps = 1/74 (1%)
Query: 263 YEDM-QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
YED+ + FCL P G S R +E + GCIPV+I+D +LPF + I W + VAE
Sbjct: 264 YEDLLTNSTFCLVPRGRRLGSFRFLETLRSGCIPVVISDSWILPFTETIDWSSAAIVVAE 323
Query: 322 EDVPKLDTILTSIP 335
D + +L S+
Sbjct: 324 RDALSIPELLMSMS 337
>gi|297688867|ref|XP_002821894.1| PREDICTED: exostosin-2 isoform 2 [Pongo abelii]
Length = 728
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 76/329 (23%), Positives = 127/329 (38%), Gaps = 52/329 (15%)
Query: 35 DPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFY 94
+P K+KVY+Y L KY + ++ + E+ M ++D++
Sbjct: 99 NPKNKIKVYIYAL-KKYVDDFGVSVSNTISREYN-ELLMA------------ISDSDYYT 144
Query: 95 TPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVV-----PHDFG 149
I C P+ L + R+ +A L + W+R G++H P D+
Sbjct: 145 DDINRACLFVPSIDVLNQNTLRIKETAQAL--AQLSRWDR--GSNHLLFNMLPGGPPDYN 200
Query: 150 ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG-SITIPPYAPPQKMQAHQIPP 208
++A+ G G + +G ++IP Y+P + A P
Sbjct: 201 TALDVPRDRALLAG--------------GGFSTWTYRQGYDVSIPVYSP---LSAEVDLP 243
Query: 209 DT---PRSIFVYFR--GLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPT--- 260
+ PR F+ GL + D E G V + N +S
Sbjct: 244 EKGPGPRQYFLLSSQVGLHPEYREDLEALQVKHGESVLVLDKCTNLSEGVLSVRKRCHKH 303
Query: 261 ---TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGM 317
Y + +Q A FC+ G L + + GC+PV+IAD +LPF++ + W+ +
Sbjct: 304 QVFDYPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASV 363
Query: 318 FVAEEDVPKLDTILTSIPPEVILRKQRLL 346
V EE + + +IL SIP I QR L
Sbjct: 364 VVPEEKMSDVYSILQSIPQRQIEEMQRQL 392
>gi|198429575|ref|XP_002120379.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 482
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 65/276 (23%), Positives = 108/276 (39%), Gaps = 39/276 (14%)
Query: 12 FVSADNTPKIERISGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEI 71
++ N +++ + S D E V LKVY+YE K+ +A E
Sbjct: 202 YIKQPNYKVLKKTAQSPNDYRE--MVNNLKVYIYE------TKIGTDHHPHRVGGYAVER 253
Query: 72 FMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLT--PTGLPLPFKSPRMMRSAIQLISSNW 129
L S RT +P A +F+ PI + + PT ++ R+ + + I + +
Sbjct: 254 VFQELLEKSNFRTQHPNLATFFFIPIRCSSYILDYPTEHEGLMEAKRVTANILHEIQTQY 313
Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGS 189
PYW+++ GA+HF++ HD GA + G++ LV T + +
Sbjct: 314 PYWSQSSGANHFYICSHDVGA--------KVAEGLMK--NAIGLVSTADYDDPYFIPHKD 363
Query: 190 ITIPPYAPPQKMQAHQIPPDTP------RSIFVYFRGLFYDVNNDPEGGYYARGARAAVW 243
I+IPP H I R+I +F G D G R W
Sbjct: 364 ISIPPTPSSGLSNIHLIGKGGALVDVRGRNILAFFAG-------DITSG----RIRPLAW 412
Query: 244 ENFKNNPLFDISTD--HPTTYYEDMQRAIFCLCPLG 277
+ ++ +I P+ Y E +++A FCL G
Sbjct: 413 RTWYSDQDIEIINRILKPSAYIEKLKKAKFCLIFRG 448
>gi|198457239|ref|XP_001360599.2| GA10084 [Drosophila pseudoobscura pseudoobscura]
gi|198135910|gb|EAL25174.2| GA10084 [Drosophila pseudoobscura pseudoobscura]
Length = 765
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 57/104 (54%), Gaps = 3/104 (2%)
Query: 243 WENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 301
W ++N + + ++ YE + Q + FCL P G S R +EA+ GCIPV++++
Sbjct: 305 WRELQDNRCDEDNREYDRYDYETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNA 364
Query: 302 IVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVI--LRKQ 343
VLPF I W++ ++ E + ++ I+ SI E I LR+Q
Sbjct: 365 WVLPFESKIDWKQAAIWADERLLLQVPDIVRSISAERIFALRQQ 408
>gi|195381971|ref|XP_002049706.1| GJ20607 [Drosophila virilis]
gi|194144503|gb|EDW60899.1| GJ20607 [Drosophila virilis]
Length = 757
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 69/147 (46%), Gaps = 11/147 (7%)
Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
K H I +T S+F ++N + W ++N + + ++
Sbjct: 262 KRYVHGIGSETRNSLF--------HLHNGRDMVMVTTCRHGKSWRELQDNRCDEDNREYD 313
Query: 260 TTYYEDM-QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
YE + Q + FCL P G S R +EA+ GCIPV++++ VLPF I W++ ++
Sbjct: 314 RYDYETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIW 373
Query: 319 VAEEDVPKLDTILTSIPPEVI--LRKQ 343
E + ++ I+ SI E I LR+Q
Sbjct: 374 ADERLLLQVPDIVRSISAERIFALRQQ 400
>gi|260825668|ref|XP_002607788.1| hypothetical protein BRAFLDRAFT_64151 [Branchiostoma floridae]
gi|229293137|gb|EEN63798.1| hypothetical protein BRAFLDRAFT_64151 [Branchiostoma floridae]
Length = 733
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 46/88 (52%)
Query: 257 DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG 316
D Y + +Q A FC+ G L +A++ GCIPVI D V+PF++ + W+
Sbjct: 320 DRVYNYPDILQEATFCMILRGARMGQSALSDAMMAGCIPVIAIDTYVMPFSEVLDWKRAA 379
Query: 317 MFVAEEDVPKLDTILTSIPPEVILRKQR 344
+ + EED+P + +L I E I +R
Sbjct: 380 VILREEDLPDVHNVLRRISQERITNMRR 407
>gi|195431447|ref|XP_002063753.1| GK15839 [Drosophila willistoni]
gi|194159838|gb|EDW74739.1| GK15839 [Drosophila willistoni]
Length = 776
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 69/147 (46%), Gaps = 11/147 (7%)
Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
K H I +T S+F ++N + W ++N + + ++
Sbjct: 273 KRYVHGIGSETRNSLF--------HLHNGRDMVLVTTCRHGKSWRELQDNRCDEDNREYD 324
Query: 260 TTYYEDM-QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
YE + Q + FCL P G S R +EA+ GCIPV++++ VLPF I W++ ++
Sbjct: 325 RYDYETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIW 384
Query: 319 VAEEDVPKLDTILTSIPPEVI--LRKQ 343
E + ++ I+ SI E I LR+Q
Sbjct: 385 ADERLLLQVPDIVRSISAERIFALRQQ 411
>gi|347963135|ref|XP_311070.5| AGAP000081-PA [Anopheles gambiae str. PEST]
gi|333467342|gb|EAA06302.6| AGAP000081-PA [Anopheles gambiae str. PEST]
Length = 761
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 41/135 (30%), Positives = 62/135 (45%), Gaps = 14/135 (10%)
Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
+ FCL G P L+EA+ GCIPV++AD+ VLPFAD + WE + + + E ++
Sbjct: 357 LTTGTFCLVARGVRLGQPALLEAMAAGCIPVVMADNYVLPFADLLDWELLAVRLPEANLH 416
Query: 326 KLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLA-RKLPHDKST 384
+ +L +I E + Q + R++ A +L L R PH T
Sbjct: 417 TIVPVLRAISAERVAEMQ------AQIRSVYRRYFASLDRIVLTVLEQLNDRIFPHRSRT 470
Query: 385 YLKPGGKFLNWTSGP 399
Y L+W GP
Sbjct: 471 Y-------LHWNVGP 478
>gi|332210817|ref|XP_003254509.1| PREDICTED: exostosin-2 isoform 2 [Nomascus leucogenys]
Length = 728
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 56/112 (50%), Gaps = 13/112 (11%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC+ G L + + GC+PV+IAD +LPF++ + W+ + V E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGD---AFHQI 370
E + + +IL SIP I QR LF +PA+ + A HQI
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQR----------QLFMEPARRENWSAANHQI 409
>gi|145337133|ref|NP_176534.2| root hair specific 8 [Arabidopsis thaliana]
gi|6633849|gb|AAF19708.1|AC008047_15 F2K11.17 [Arabidopsis thaliana]
gi|332195980|gb|AEE34101.1| root hair specific 8 [Arabidopsis thaliana]
Length = 664
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 70/299 (23%), Positives = 115/299 (38%), Gaps = 56/299 (18%)
Query: 42 VYVYELPSKYNKKLLLKDPRCL---------------------------THMFAAEIFMH 74
VYVY+LPSK+NK LL + + TH ++ E H
Sbjct: 284 VYVYDLPSKFNKDLLRECSDMVPWADFCNYFKNDAFGELMESMGKGWFRTHQYSLEPIFH 343
Query: 75 RFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKS-PRMMRSAIQLISSNW---- 129
+L P R N +A FY P Y D+ L FK+ ++ + + W
Sbjct: 344 SRILKHPCRVHNETQAKLFYVPFYGGMDV----LRWHFKNVSSDVKDVLPIEIVKWLGSK 399
Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGS 189
W + G DH FV+ F ++ + +L + + + +RN +N+ +
Sbjct: 400 KSWRKNSGKDHVFVLGK-ISWDFRRVDKYSWGSSLLEMQEMKNPTKLLIERNPWEVNDIA 458
Query: 190 ITIPPYAPPQKMQAHQIPPD----TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWEN 245
I P Y P+ I + PR + F G N PE R+ + +
Sbjct: 459 IPHPTYFHPKTDTDIAIWQNKILGKPRRSLISFAGAARPGN--PE------SIRSILIDQ 510
Query: 246 FKNNP-------LFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVI 297
+++P D D + E + + FCL P G +P + ++++ GCIPVI
Sbjct: 511 CRSSPNQCRFLNCTDGGCDKSESVIELFRDSEFCLQPPGDSPTRKSIFDSLILGCIPVI 569
>gi|195025057|ref|XP_001985992.1| GH20787 [Drosophila grimshawi]
gi|193901992|gb|EDW00859.1| GH20787 [Drosophila grimshawi]
Length = 754
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 69/147 (46%), Gaps = 11/147 (7%)
Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
K H I +T S+F ++N + W ++N + + ++
Sbjct: 260 KRYVHGIGSETRNSLF--------HLHNGRDMVMVTTCRHGKSWRELQDNRCDEDNREYD 311
Query: 260 TTYYEDM-QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
YE + Q + FCL P G S R +EA+ GCIPV++++ VLPF I W++ ++
Sbjct: 312 RYDYETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIW 371
Query: 319 VAEEDVPKLDTILTSIPPEVI--LRKQ 343
E + ++ I+ SI E I LR+Q
Sbjct: 372 ADERLLLQVPDIVRSISAERIFALRQQ 398
>gi|2251238|gb|AAB62718.1| multiple exostoses type II protein EXT2.I [Homo sapiens]
Length = 728
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 31/92 (33%), Positives = 49/92 (53%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC+ G L + + GC+PV+IAD +LPF++ + W+ + V E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 322 EDVPKLDTILTSIPPEVILRKQRLLANPSMKR 353
E + + +IL SIP I QR L ++R
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQRQLFMEPVRR 399
>gi|414879654|tpg|DAA56785.1| TPA: hypothetical protein ZEAMMB73_800597 [Zea mays]
Length = 265
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 35/118 (29%), Positives = 52/118 (44%), Gaps = 12/118 (10%)
Query: 40 LKVYVYELPSKYNKKLLLKD-------PRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADW 92
L++YVY + LL+ CL + ++ +H+FLL S RT N + A+
Sbjct: 53 LRIYVYAEDEIQGLRALLRGRDGTINAATCLKGQWGTQVKVHQFLLKSRFRTFNKDHANL 112
Query: 93 FYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGA 150
F+ P Y C L S + + + S PY+ R+ G DH FV P GA
Sbjct: 113 FFVPSYVKCVRMTGAL-----SDKEINQTYVKVLSQMPYFRRSGGRDHIFVFPSGAGA 165
>gi|332836218|ref|XP_003313041.1| PREDICTED: exostosin-2 isoform 1 [Pan troglodytes]
Length = 728
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 76/329 (23%), Positives = 126/329 (38%), Gaps = 52/329 (15%)
Query: 35 DPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFY 94
+P K+KVY+Y L KY + ++ + E+ M ++D++
Sbjct: 99 NPKNKIKVYIYAL-KKYVDDFGVSVSNAISREYN-ELLMA------------ISDSDYYT 144
Query: 95 TPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVV-----PHDFG 149
I C P+ L + R+ +A L + W+R G +H P D+
Sbjct: 145 DDINRACLFVPSIDVLNQNTLRIKETAQAL--AQLSRWDR--GTNHLLFNMLPGGPPDYN 200
Query: 150 ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG-SITIPPYAPPQKMQAHQIPP 208
++A+ G G + +G ++IP Y+P + A P
Sbjct: 201 TALDVPRDRALLAG--------------GGFSTWTYRQGYDVSIPVYSP---LSAEVDLP 243
Query: 209 DT---PRSIFVYFR--GLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPT--- 260
+ PR F+ GL + D E G V + N +S
Sbjct: 244 EKGPGPRQYFLLSSQVGLHPEYREDLEALQVKHGESVLVLDKCTNLSEGVLSVRKRCHKH 303
Query: 261 ---TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGM 317
Y + +Q A FC+ G L + + GC+PV+IAD +LPF++ + W+ +
Sbjct: 304 QVFDYPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASV 363
Query: 318 FVAEEDVPKLDTILTSIPPEVILRKQRLL 346
V EE + + +IL SIP I QR L
Sbjct: 364 VVPEEKMSDVYSILQSIPQRQIEEMQRQL 392
>gi|308491052|ref|XP_003107717.1| CRE-RIB-1 protein [Caenorhabditis remanei]
gi|308249664|gb|EFO93616.1| CRE-RIB-1 protein [Caenorhabditis remanei]
Length = 347
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 28/72 (38%), Positives = 41/72 (56%), Gaps = 1/72 (1%)
Query: 263 YEDM-QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y+D+ + FCL P G S R +E + GCIPV+I+D +LPF++ I W + VAE
Sbjct: 229 YDDLLSNSTFCLVPRGRRLGSFRFLETLRSGCIPVVISDSWILPFSETIDWHSAAIVVAE 288
Query: 322 EDVPKLDTILTS 333
D + +L S
Sbjct: 289 RDALSIPELLMS 300
>gi|414879653|tpg|DAA56784.1| TPA: hypothetical protein ZEAMMB73_800597 [Zea mays]
Length = 238
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 35/118 (29%), Positives = 52/118 (44%), Gaps = 12/118 (10%)
Query: 40 LKVYVYELPSKYNKKLLLKD-------PRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADW 92
L++YVY + LL+ CL + ++ +H+FLL S RT N + A+
Sbjct: 53 LRIYVYAEDEIQGLRALLRGRDGTINAATCLKGQWGTQVKVHQFLLKSRFRTFNKDHANL 112
Query: 93 FYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGA 150
F+ P Y C L S + + + S PY+ R+ G DH FV P GA
Sbjct: 113 FFVPSYVKCVRMTGAL-----SDKEINQTYVKVLSQMPYFRRSGGRDHIFVFPSGAGA 165
>gi|312378632|gb|EFR25154.1| hypothetical protein AND_09769 [Anopheles darlingi]
Length = 732
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 45/78 (57%)
Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
++ +FCL G P L+EA+ GCIPVI+AD++VLPF++ + WE + + V E +
Sbjct: 327 LESGVFCLIARGVRLAQPVLLEAMATGCIPVIVADNLVLPFSNILDWELLSVRVYESQLH 386
Query: 326 KLDTILTSIPPEVILRKQ 343
+ +L + + I Q
Sbjct: 387 SVLALLKRVSDQRIRELQ 404
>gi|91076108|ref|XP_968944.1| PREDICTED: similar to tout-velu CG10117-PA [Tribolium castaneum]
gi|270014706|gb|EFA11154.1| hypothetical protein TcasGA2_TC004758 [Tribolium castaneum]
Length = 719
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 39/143 (27%), Positives = 65/143 (45%), Gaps = 9/143 (6%)
Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
K H I DT R Y ++N + W++ K+ + ++
Sbjct: 238 KRYVHGIGSDT--------RNSLYHLHNRKDMIMVTTCRHGKSWKDMKDERCDQDNKEYD 289
Query: 260 TTYYEDM-QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
YE + Q + FCL P G S R +EA+ GCIPV++++ LPFA I W + ++
Sbjct: 290 KYDYEVLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNGWALPFAQKIDWSKAAIW 349
Query: 319 VAEEDVPKLDTILTSIPPEVILR 341
E + ++ I+ S+ P IL+
Sbjct: 350 ADERLLLQVPYIVRSLAPAKILQ 372
>gi|440897880|gb|ELR49485.1| Exostosin-2, partial [Bos grunniens mutus]
Length = 731
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 45/83 (54%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC+ G L + + GC+PVIIAD VLPF++ + W+ + V E
Sbjct: 321 YPQVLQEATFCMVLRGARLGQAVLSDVLRAGCVPVIIADSYVLPFSEVLDWKRASVVVPE 380
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + + +IL SIP I QR
Sbjct: 381 EKMSDVYSILQSIPRRQIEEMQR 403
>gi|20138161|sp|O77783.1|EXT2_BOVIN RecName: Full=Exostosin-2; AltName:
Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
acetylglucosaminyl-proteoglycan
4-alpha-N-acetylglucosaminyltransferase; AltName:
Full=HS-polymerase; Short=HS-POL; AltName: Full=Multiple
exostoses protein 2 homolog
gi|3599568|gb|AAC35386.1| glucuronyl/N-acetylglucosaminyl transferase [Bos taurus]
Length = 718
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 45/83 (54%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC+ G L + + GC+PVIIAD VLPF++ + W+ + V E
Sbjct: 308 YPQVLQEATFCMVLRGARLGQAVLSDVLRAGCVPVIIADSYVLPFSEVLDWKRASVVVPE 367
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + + +IL SIP I QR
Sbjct: 368 EKMSDVYSILQSIPRRQIEEMQR 390
>gi|195150443|ref|XP_002016164.1| GL11446 [Drosophila persimilis]
gi|194110011|gb|EDW32054.1| GL11446 [Drosophila persimilis]
Length = 676
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 57/104 (54%), Gaps = 3/104 (2%)
Query: 243 WENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 301
W ++N + + ++ YE + Q + FCL P G S R +EA+ GCIPV++++
Sbjct: 305 WRELQDNRCDEDNREYDRYDYETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNA 364
Query: 302 IVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVI--LRKQ 343
VLPF I W++ ++ E + ++ I+ SI E I LR+Q
Sbjct: 365 WVLPFESKIDWKQAAIWADERLLLQVPDIVRSISAERIFALRQQ 408
>gi|302780765|ref|XP_002972157.1| galactosyltransferase-like protein [Selaginella moellendorffii]
gi|300160456|gb|EFJ27074.1| galactosyltransferase-like protein [Selaginella moellendorffii]
Length = 470
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 76/319 (23%), Positives = 122/319 (38%), Gaps = 59/319 (18%)
Query: 64 THMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQ 123
T F+ EI HR +L RTL+P+ A F+ P Y D++ S + Q
Sbjct: 131 TDQFSGEIIFHRRMLDHRCRTLDPDGASAFFVPFYAGLDISRNLWASGKSSSDVDSLGEQ 190
Query: 124 L---ISSNWPYWNRTEGADHFFV---VPHDF-----------GACFHYQEEKAIERGILP 166
L + P++NR+ GADHF V + DF + FH E ++++R ++
Sbjct: 191 LLHWLQRQHPHFNRSGGADHFLVAGRISWDFRRMPSAAGEWGSSLFHQIEMRSVKRLVI- 249
Query: 167 LLQRATLVQTFGQRNHVCLNEGSITIP----PYAPPQKMQAHQIPPDTPRSIFVYFRGLF 222
+RN +E + P P + Q + +PR V F G
Sbjct: 250 ------------ERNPWDDSELGVPYPTSFHPSSDEDLAQWVEFVQGSPRPHLVAFAG-- 295
Query: 223 YDVNNDPEGGYYA----------RGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFC 272
P GY + R A+ + T P + ++FC
Sbjct: 296 -----SPRPGYRSDFRQVLLGQCRAAQRGISGCLDCTADTAGCTSDPLRVTKLFLSSVFC 350
Query: 273 LCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL-PFADAIPW--EEIGMFVAEEDVPK--- 326
L P G + L ++++ GCIPV+ + + +P EE +F+ + V
Sbjct: 351 LQPRGDSFTRKSLFDSLISGCIPVLFWNQSAYWQYELYLPRDPEEYSVFIPHQSVKNGTN 410
Query: 327 -LDTILTSIPPEVILRKQR 344
LD +L + E I R QR
Sbjct: 411 VLD-VLQGVSRERIGRMQR 428
>gi|117935049|ref|NP_803462.2| exostosin-2 [Bos taurus]
gi|117306659|gb|AAI26571.1| Exostoses (multiple) 2 [Bos taurus]
gi|296479641|tpg|DAA21756.1| TPA: exostosin-2 [Bos taurus]
Length = 718
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 45/83 (54%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC+ G L + + GC+PVIIAD VLPF++ + W+ + V E
Sbjct: 308 YPQVLQEATFCMVLRGARLGQAVLSDVLRAGCVPVIIADSYVLPFSEVLDWKRASVVVPE 367
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + + +IL SIP I QR
Sbjct: 368 EKMSDVYSILQSIPRRQIEEMQR 390
>gi|321468156|gb|EFX79142.1| hypothetical protein DAPPUDRAFT_304945 [Daphnia pulex]
Length = 729
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 53/235 (22%), Positives = 100/235 (42%), Gaps = 23/235 (9%)
Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFG----------Q 179
P+WN G + + G Y E+ ++ G L + + VQ +
Sbjct: 160 PHWN---GGQNHIIFNFYSGTWPDYTEDLGMDIGRAILAKASISVQNYRPSFDISLPLVH 216
Query: 180 RNHVCLNEGSITIPPYAP--PQKMQAHQIPPDTPRSIF---VYFRGLFYDVNNDPEGGYY 234
+ H L G +P YA P +++ + R ++ R Y ++N +
Sbjct: 217 KEH--LERGGDILPIYAENIPAASKSYLLAFKGKRYVYGIGSETRNSLYHLHNSRDVIMV 274
Query: 235 ARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLGWAPWSPRLVEAVVFGC 293
W+ K+ + + ++ YE + + FCL P G S R +E + GC
Sbjct: 275 TTCKHGKSWKELKDERCEEDNAEYDRYDYEILLHNSTFCLVPRGRRLGSFRFIEVLQAGC 334
Query: 294 IPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVI--LRKQRLL 346
IPV+++++ V+PF++ I W+ ++ E + ++ I+ SI E + LR+Q L
Sbjct: 335 IPVLLSNNWVIPFSEIIDWKTSAIWADERLLLQVPDIVRSIEAERVMALRQQSQL 389
>gi|322790909|gb|EFZ15575.1| hypothetical protein SINV_11283 [Solenopsis invicta]
Length = 676
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 44/83 (53%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FCL G L++A+ GCIP IIAD +++PF D I W + +F+ E
Sbjct: 269 YPDSLQTATFCLIIRGARLAQSVLLDAMAAGCIPAIIADSLMMPFHDVIDWTKAAVFIRE 328
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
D+ +L I + I+ Q
Sbjct: 329 VDILLTIQLLKKISHQRIMEMQE 351
>gi|426368047|ref|XP_004051026.1| PREDICTED: exostosin-2 isoform 2 [Gorilla gorilla gorilla]
Length = 728
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 46/85 (54%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC+ G L + + GC+PV+IAD +LPF++ + W+ + V E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 322 EDVPKLDTILTSIPPEVILRKQRLL 346
E + + +IL SIP I QR L
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQRQL 392
>gi|313235568|emb|CBY11023.1| unnamed protein product [Oikopleura dioica]
Length = 663
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 48/87 (55%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y ++ + FCL P G S R +E + C+PV++ADD VLPF++ I WE + E
Sbjct: 232 YISLLKNSTFCLVPRGRRLGSFRFIETLQQACVPVLLADDWVLPFSEVIDWERSTISWEE 291
Query: 322 EDVPKLDTILTSIPPEVILRKQRLLAN 348
+ + +L L + P +LR ++ AN
Sbjct: 292 KLLLELGQHLEDVSPADVLRMRQEGAN 318
>gi|296010875|ref|NP_001171554.1| exostosin-2 isoform 3 [Homo sapiens]
gi|119588476|gb|EAW68070.1| exostoses (multiple) 2, isoform CRA_c [Homo sapiens]
Length = 728
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 46/85 (54%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC+ G L + + GC+PV+IAD +LPF++ + W+ + V E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 322 EDVPKLDTILTSIPPEVILRKQRLL 346
E + + +IL SIP I QR L
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQRQL 392
>gi|449681334|ref|XP_002163001.2| PREDICTED: exostosin-1a-like [Hydra magnipapillata]
Length = 469
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 36/130 (27%), Positives = 66/130 (50%), Gaps = 2/130 (1%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPT-TYYEDMQRAIFCLCPLG 277
R Y ++N+ + W+ F ++ + ++++ Y E + + FCL P G
Sbjct: 140 RNSLYLIHNNEDIILLTTCIHEKNWQKFADSRCEEDNSNYDRFNYTELLANSTFCLIPRG 199
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIP-P 336
S R +EA+ +GCIPVI+++ LPF D I W + + + E + +L +IL I
Sbjct: 200 RRLASFRFLEAIQYGCIPVIMSNGWDLPFNDVIDWVKFSIVLDESLLLQLPSILRGISFD 259
Query: 337 EVILRKQRLL 346
+V+ KQ+ +
Sbjct: 260 QVLAMKQQTI 269
>gi|218198616|gb|EEC81043.1| hypothetical protein OsI_23835 [Oryza sativa Indica Group]
Length = 250
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 24/67 (35%), Positives = 42/67 (62%)
Query: 271 FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTI 330
FC+CP G S R+ +++ +GC+PVI++D LPF+ + W + + + E DV +L +I
Sbjct: 166 FCVCPGGSQVNSARISDSIHYGCVPVILSDYYDLPFSGILNWRKFAVVLKESDVYELKSI 225
Query: 331 LTSIPPE 337
L S+ +
Sbjct: 226 LKSLSQK 232
>gi|344281096|ref|XP_003412316.1| PREDICTED: LOW QUALITY PROTEIN: exostosin-2-like [Loxodonta
africana]
Length = 728
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 47/86 (54%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC+ G L + + GC+PV+IAD +LPF++ + W+ + V E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 322 EDVPKLDTILTSIPPEVILRKQRLLA 347
E + + +IL SIP I QR ++
Sbjct: 368 EKMSDMYSILQSIPQRQIEEMQRQVS 393
>gi|297688871|ref|XP_002821896.1| PREDICTED: exostosin-2 isoform 4 [Pongo abelii]
Length = 718
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 75/327 (22%), Positives = 126/327 (38%), Gaps = 52/327 (15%)
Query: 35 DPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFY 94
+P K+KVY+Y L KY + ++ + E+ M ++D++
Sbjct: 99 NPKNKIKVYIYAL-KKYVDDFGVSVSNTISREYN-ELLMA------------ISDSDYYT 144
Query: 95 TPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVV-----PHDFG 149
I C P+ L + R+ +A L + W+R G++H P D+
Sbjct: 145 DDINRACLFVPSIDVLNQNTLRIKETAQAL--AQLSRWDR--GSNHLLFNMLPGGPPDYN 200
Query: 150 ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG-SITIPPYAPPQKMQAHQIPP 208
++A+ G G + +G ++IP Y+P + A P
Sbjct: 201 TALDVPRDRALLAG--------------GGFSTWTYRQGYDVSIPVYSP---LSAEVDLP 243
Query: 209 DT---PRSIFVYFR--GLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPT--- 260
+ PR F+ GL + D E G V + N +S
Sbjct: 244 EKGPGPRQYFLLSSQVGLHPEYREDLEALQVKHGESVLVLDKCTNLSEGVLSVRKRCHKH 303
Query: 261 ---TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGM 317
Y + +Q A FC+ G L + + GC+PV+IAD +LPF++ + W+ +
Sbjct: 304 QVFDYPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASV 363
Query: 318 FVAEEDVPKLDTILTSIPPEVILRKQR 344
V EE + + +IL SIP I QR
Sbjct: 364 VVPEEKMSDVYSILQSIPQRQIEEMQR 390
>gi|395815565|ref|XP_003781296.1| PREDICTED: exostosin-2 [Otolemur garnettii]
Length = 718
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 30/83 (36%), Positives = 45/83 (54%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC+ G L E + GC+PV+IAD +LPF++ + W+ + V E
Sbjct: 308 YPQVLQDATFCVVLRGARLGQAVLSEVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + + +IL SIP I QR
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQR 390
>gi|17541994|ref|NP_502180.1| Protein RIB-1 [Caenorhabditis elegans]
gi|6434267|emb|CAB61014.1| Protein RIB-1 [Caenorhabditis elegans]
gi|130381603|dbj|BAF48989.1| heparan sulfate polymerase [Caenorhabditis elegans]
Length = 382
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 40/72 (55%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y E + + FCL P G S R +E + GC+PV+I+D +LPF++ I W + VAE
Sbjct: 262 YDELLANSTFCLVPRGRRLGSFRFLETLRSGCVPVVISDSWILPFSETIDWNSAAIVVAE 321
Query: 322 EDVPKLDTILTS 333
D + +L S
Sbjct: 322 RDALSIPELLMS 333
>gi|3023724|sp|O01704.1|EXT1_CAEEL RecName: Full=Multiple exostoses homolog 1; AltName: Full=Related
to mammalian RIB protein 1
gi|2058697|gb|AAC47509.1| multiple exostoses homolog 1 [Caenorhabditis elegans]
Length = 378
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 40/72 (55%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y E + + FCL P G S R +E + GC+PV+I+D +LPF++ I W + VAE
Sbjct: 258 YDELLANSTFCLVPRGRRLGSFRFLETLRSGCVPVVISDSWILPFSETIDWNSAAIVVAE 317
Query: 322 EDVPKLDTILTS 333
D + +L S
Sbjct: 318 RDALSIPELLMS 329
>gi|302832215|ref|XP_002947672.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300267020|gb|EFJ51205.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 790
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 43/131 (32%), Positives = 66/131 (50%), Gaps = 12/131 (9%)
Query: 210 TPRSIFVYFRG-LFYDVNNDPEGGYYARGARAAV--------WENFKNNPLFDISTDHPT 260
+ R++F++FRG L DPE Y+R R + W N L D ST H
Sbjct: 502 SERNVFLFFRGDLRLAPGQDPEC-KYSRCIRQTLYNLSISERWREKYNVLLGDTSTVH-G 559
Query: 261 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
Y + +++FCL G WSPRL +AV+ GCIPVII D++ F + + +
Sbjct: 560 DYSVLLSQSLFCLVAPGDG-WSPRLEDAVLHGCIPVIIMDEVQAVFESILDLPSFSVRIP 618
Query: 321 EEDVPKLDTIL 331
+ ++ ++ TIL
Sbjct: 619 QANMTQIVTIL 629
>gi|342320455|gb|EGU12395.1| Proteophosphoglycan ppg4 [Rhodotorula glutinis ATCC 204091]
Length = 1322
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 76/335 (22%), Positives = 124/335 (37%), Gaps = 38/335 (11%)
Query: 41 KVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFM--HRFLLSSPVRTLNPEEADWFYTPIY 98
K +V +LP + L K P C + FM H + + R++ +EA + P Y
Sbjct: 973 KGHVVQLPDTIDNHL--KSPACFDLERSPLPFMGDHYLVEAMKNRSVTLDEASFVLVPYY 1030
Query: 99 PTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVP--HDFGACFHYQE 156
C K + A I++N V+P HDFG+C + +
Sbjct: 1031 QGCYYNYLQENTFKKLADTVGFAETAIATNPAITG------DRIVIPFTHDFGSCTGWWQ 1084
Query: 157 EKAIERGILPL--LQRATLVQTFGQRNHVCLN-EGSITIPPYAPPQK--MQAHQIPPD-- 209
+ G P + +A Q G N C+ + + +P K + + P D
Sbjct: 1085 KLEDVLGHSPPSPMDQAVAWQVNGDYNTRCIKVDRDVVVPAVTKHTKALFETFKTPADVA 1144
Query: 210 --TPRSIFVYF----RGLFYDVNNDPEGGYYARGARAAV-WENFKNNPLFDISTDHPTTY 262
R +F RG G + +A+ ++ F Y
Sbjct: 1145 PVNSRKHLAFFAGGVRGFGAIARTKIGCGRTGQDPNSAILYQQFSPG----------QRY 1194
Query: 263 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
+ + FCL P G W R EA+ GCIP I D + PF D + + + + E
Sbjct: 1195 LGTLNASKFCLLPRGIPAWMTRTFEAIYAGCIPAFIVDRNLFPFQDILDYSRFSVTIPEA 1254
Query: 323 DVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLF 357
D +++ IL++ PE + Q L ++ A LF
Sbjct: 1255 DAHRIEEILSAYTPEQLSELQANLVK--VREAFLF 1287
>gi|332836216|ref|XP_508383.3| PREDICTED: exostosin-2 isoform 3 [Pan troglodytes]
Length = 718
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 75/327 (22%), Positives = 125/327 (38%), Gaps = 52/327 (15%)
Query: 35 DPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFY 94
+P K+KVY+Y L KY + ++ + E+ M ++D++
Sbjct: 99 NPKNKIKVYIYAL-KKYVDDFGVSVSNAISREYN-ELLMA------------ISDSDYYT 144
Query: 95 TPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVV-----PHDFG 149
I C P+ L + R+ +A L + W+R G +H P D+
Sbjct: 145 DDINRACLFVPSIDVLNQNTLRIKETAQAL--AQLSRWDR--GTNHLLFNMLPGGPPDYN 200
Query: 150 ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG-SITIPPYAPPQKMQAHQIPP 208
++A+ G G + +G ++IP Y+P + A P
Sbjct: 201 TALDVPRDRALLAG--------------GGFSTWTYRQGYDVSIPVYSP---LSAEVDLP 243
Query: 209 DT---PRSIFVYFR--GLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPT--- 260
+ PR F+ GL + D E G V + N +S
Sbjct: 244 EKGPGPRQYFLLSSQVGLHPEYREDLEALQVKHGESVLVLDKCTNLSEGVLSVRKRCHKH 303
Query: 261 ---TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGM 317
Y + +Q A FC+ G L + + GC+PV+IAD +LPF++ + W+ +
Sbjct: 304 QVFDYPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASV 363
Query: 318 FVAEEDVPKLDTILTSIPPEVILRKQR 344
V EE + + +IL SIP I QR
Sbjct: 364 VVPEEKMSDVYSILQSIPQRQIEEMQR 390
>gi|395543711|ref|XP_003773757.1| PREDICTED: exostosin-2 [Sarcophilus harrisii]
Length = 917
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 72/322 (22%), Positives = 131/322 (40%), Gaps = 48/322 (14%)
Query: 35 DPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFY 94
+P K+KVY+Y L KY + + ++ + LL++ ++D++
Sbjct: 99 NPKDKIKVYIYSL-KKYVDDVGVPVSNTISREYNE-------LLTAI------SDSDFYT 144
Query: 95 TPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPH---DFGAC 151
I C P+ L K+ R+ +A L + W+R F ++P D+
Sbjct: 145 DDINRACLFVPSIDVLNQKTLRIKETAQAL--AQLARWDRGTNQLLFNMLPGEPPDYNTA 202
Query: 152 FHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG-SITIPPYAPPQKMQAHQIPPDT 210
++A+ G G + +G ++IP Y+P + A P+
Sbjct: 203 LDVPRDRALLAG--------------GGFSTWTYRQGYDVSIPVYSP---LSAEVDLPER 245
Query: 211 PRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAI 270
+ + GL G +R R VW + + + D + ++ A
Sbjct: 246 GPGLVIRALGL---------GNEGSRPRRDLVWGLMELRDVMESMLDVESEGL-GLREAS 295
Query: 271 FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTI 330
FC+ G L + + GC+PVIIAD +LPF++ + W+ + + EE + + ++
Sbjct: 296 FCIVLRGARLGQAALSDVLQAGCVPVIIADSYILPFSEVLDWKRASVVIPEEKMRDMYSV 355
Query: 331 LTSIPPEVILRKQR-LLANPSM 351
L SIP I QR + A PS+
Sbjct: 356 LRSIPQRQIEEMQRQVRAGPSV 377
>gi|395742863|ref|XP_003777829.1| PREDICTED: exostosin-2 [Pongo abelii]
Length = 751
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 75/327 (22%), Positives = 126/327 (38%), Gaps = 52/327 (15%)
Query: 35 DPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFY 94
+P K+KVY+Y L KY + ++ + E+ M ++D++
Sbjct: 132 NPKNKIKVYIYAL-KKYVDDFGVSVSNTISREYN-ELLMA------------ISDSDYYT 177
Query: 95 TPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVV-----PHDFG 149
I C P+ L + R+ +A L + W+R G++H P D+
Sbjct: 178 DDINRACLFVPSIDVLNQNTLRIKETAQAL--AQLSRWDR--GSNHLLFNMLPGGPPDYN 233
Query: 150 ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG-SITIPPYAPPQKMQAHQIPP 208
++A+ G G + +G ++IP Y+P + A P
Sbjct: 234 TALDVPRDRALLAG--------------GGFSTWTYRQGYDVSIPVYSP---LSAEVDLP 276
Query: 209 DT---PRSIFVYFR--GLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPT--- 260
+ PR F+ GL + D E G V + N +S
Sbjct: 277 EKGPGPRQYFLLSSQVGLHPEYREDLEALQVKHGESVLVLDKCTNLSEGVLSVRKRCHKH 336
Query: 261 ---TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGM 317
Y + +Q A FC+ G L + + GC+PV+IAD +LPF++ + W+ +
Sbjct: 337 QVFDYPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASV 396
Query: 318 FVAEEDVPKLDTILTSIPPEVILRKQR 344
V EE + + +IL SIP I QR
Sbjct: 397 VVPEEKMSDVYSILQSIPQRQIEEMQR 423
>gi|297688865|ref|XP_002821893.1| PREDICTED: exostosin-2 isoform 1 [Pongo abelii]
Length = 731
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 75/327 (22%), Positives = 126/327 (38%), Gaps = 52/327 (15%)
Query: 35 DPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFY 94
+P K+KVY+Y L KY + ++ + E+ M ++D++
Sbjct: 112 NPKNKIKVYIYAL-KKYVDDFGVSVSNTISREYN-ELLMA------------ISDSDYYT 157
Query: 95 TPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVV-----PHDFG 149
I C P+ L + R+ +A L + W+R G++H P D+
Sbjct: 158 DDINRACLFVPSIDVLNQNTLRIKETAQAL--AQLSRWDR--GSNHLLFNMLPGGPPDYN 213
Query: 150 ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG-SITIPPYAPPQKMQAHQIPP 208
++A+ G G + +G ++IP Y+P + A P
Sbjct: 214 TALDVPRDRALLAG--------------GGFSTWTYRQGYDVSIPVYSP---LSAEVDLP 256
Query: 209 DT---PRSIFVYFR--GLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPT--- 260
+ PR F+ GL + D E G V + N +S
Sbjct: 257 EKGPGPRQYFLLSSQVGLHPEYREDLEALQVKHGESVLVLDKCTNLSEGVLSVRKRCHKH 316
Query: 261 ---TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGM 317
Y + +Q A FC+ G L + + GC+PV+IAD +LPF++ + W+ +
Sbjct: 317 QVFDYPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASV 376
Query: 318 FVAEEDVPKLDTILTSIPPEVILRKQR 344
V EE + + +IL SIP I QR
Sbjct: 377 VVPEEKMSDVYSILQSIPQRQIEEMQR 403
>gi|332836220|ref|XP_003313042.1| PREDICTED: exostosin-2 isoform 2 [Pan troglodytes]
Length = 751
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 75/327 (22%), Positives = 125/327 (38%), Gaps = 52/327 (15%)
Query: 35 DPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFY 94
+P K+KVY+Y L KY + ++ + E+ M ++D++
Sbjct: 132 NPKNKIKVYIYAL-KKYVDDFGVSVSNAISREYN-ELLMA------------ISDSDYYT 177
Query: 95 TPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVV-----PHDFG 149
I C P+ L + R+ +A L + W+R G +H P D+
Sbjct: 178 DDINRACLFVPSIDVLNQNTLRIKETAQAL--AQLSRWDR--GTNHLLFNMLPGGPPDYN 233
Query: 150 ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG-SITIPPYAPPQKMQAHQIPP 208
++A+ G G + +G ++IP Y+P + A P
Sbjct: 234 TALDVPRDRALLAG--------------GGFSTWTYRQGYDVSIPVYSP---LSAEVDLP 276
Query: 209 DT---PRSIFVYFR--GLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPT--- 260
+ PR F+ GL + D E G V + N +S
Sbjct: 277 EKGPGPRQYFLLSSQVGLHPEYREDLEALQVKHGESVLVLDKCTNLSEGVLSVRKRCHKH 336
Query: 261 ---TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGM 317
Y + +Q A FC+ G L + + GC+PV+IAD +LPF++ + W+ +
Sbjct: 337 QVFDYPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASV 396
Query: 318 FVAEEDVPKLDTILTSIPPEVILRKQR 344
V EE + + +IL SIP I QR
Sbjct: 397 VVPEEKMSDVYSILQSIPQRQIEEMQR 423
>gi|410973605|ref|XP_003993238.1| PREDICTED: exostosin-2 [Felis catus]
Length = 718
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 46/83 (55%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC+ G L + + GC+PV+IAD VLPF++ + W+ + V E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLRAGCVPVVIADSYVLPFSEVLDWKRASVVVPE 367
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + ++ +IL S+P I QR
Sbjct: 368 EKMSEVYSILQSVPQRQIEEMQR 390
>gi|322795454|gb|EFZ18199.1| hypothetical protein SINV_05251 [Solenopsis invicta]
Length = 711
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 58/124 (46%), Gaps = 1/124 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y ++N + + W ++ + ++ T YE + A FCL P G
Sbjct: 254 RNALYHLHNGKDLVFVTTCRHGKAWRELQDEHCQQDNQEYDTYDYEILLMNATFCLVPRG 313
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ GCIPVI+++ LPF + I W + +F E + ++ IL S+
Sbjct: 314 RRLGSFRFLEALRSGCIPVILSNGWALPFHERIDWTQAVIFSDERLLLQIPDILRSVSNV 373
Query: 338 VILR 341
IL+
Sbjct: 374 QILK 377
>gi|397473541|ref|XP_003808267.1| PREDICTED: exostosin-2 [Pan paniscus]
gi|410305316|gb|JAA31258.1| exostosin 2 [Pan troglodytes]
Length = 751
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 75/327 (22%), Positives = 125/327 (38%), Gaps = 52/327 (15%)
Query: 35 DPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFY 94
+P K+KVY+Y L KY + ++ + E+ M ++D++
Sbjct: 132 NPKNKIKVYIYAL-KKYVDDFGVSVSNAISREYN-ELLMA------------ISDSDYYT 177
Query: 95 TPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVV-----PHDFG 149
I C P+ L + R+ +A L + W+R G +H P D+
Sbjct: 178 DDINRACLFVPSIDVLNQNTLRIKETAQAL--AQLSRWDR--GTNHLLFNMLPGGPPDYN 233
Query: 150 ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG-SITIPPYAPPQKMQAHQIPP 208
++A+ G G + +G ++IP Y+P + A P
Sbjct: 234 TALDVPRDRALLAG--------------GGFSTWTYRQGYDVSIPVYSP---LSAEVDLP 276
Query: 209 DT---PRSIFVYFR--GLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPT--- 260
+ PR F+ GL + D E G V + N +S
Sbjct: 277 EKGPGPRQYFLLSSQVGLHPEYREDLEALQVKHGESVLVLDKCTNLSEGVLSVRKRCHKH 336
Query: 261 ---TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGM 317
Y + +Q A FC+ G L + + GC+PV+IAD +LPF++ + W+ +
Sbjct: 337 QVFDYPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASV 396
Query: 318 FVAEEDVPKLDTILTSIPPEVILRKQR 344
V EE + + +IL SIP I QR
Sbjct: 397 VVPEEKMSDVYSILQSIPQRQIEEMQR 423
>gi|296088686|emb|CBI38136.3| unnamed protein product [Vitis vinifera]
Length = 291
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 57/207 (27%), Positives = 84/207 (40%), Gaps = 29/207 (14%)
Query: 132 WNRTEGADHFFVVPHDFGACFHYQEEKAIE-------RGILPLLQRATLVQTFGQRNHVC 184
W R+ G DH FV+ D A +H + E A G L +A+ H
Sbjct: 10 WKRSGGRDHVFVLT-DPVAMWHVKAEIAPAILLVVDFGGWYKLDSKASNNSLSEMIQHTQ 68
Query: 185 LNEGSITIPPYA---PPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAA 241
++ I PY P + +QI R +YF+G + GG R
Sbjct: 69 VSLLKDVIVPYTHLLPRLHLSENQI-----RQTLLYFKG----AKHRHRGGL----VREK 115
Query: 242 VWENFKNNPLFDISTDHPTTYYED-----MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPV 296
+W+ + P + M+ + FCL P G P S RL +A+ CIPV
Sbjct: 116 LWDLLVYEQGVIMEEGFPNATGREQSIKGMRTSEFCLHPAGDTPTSCRLFDAIQSLCIPV 175
Query: 297 IIADDIVLPFADAIPWEEIGMFVAEED 323
I++D+I LPF + + E +FVA D
Sbjct: 176 IVSDNIELPFEGMVDYSEFSVFVAVRD 202
>gi|302791425|ref|XP_002977479.1| galactosyltransferase-like protein [Selaginella moellendorffii]
gi|300154849|gb|EFJ21483.1| galactosyltransferase-like protein [Selaginella moellendorffii]
Length = 466
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 78/323 (24%), Positives = 122/323 (37%), Gaps = 67/323 (20%)
Query: 64 THMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQ 123
T F+ EI HR +L RTL+P+ A F+ P Y D++ S + Q
Sbjct: 131 TDQFSGEIIFHRRMLDHRCRTLDPDGASAFFVPFYAGLDISRNLWASGKSSSDVDSLGEQ 190
Query: 124 L---ISSNWPYWNRTEGADHFFV---VPHDF-----------GACFHYQEEKAIERGILP 166
L + P++NR+ GADHF V + DF + FH E ++++R ++
Sbjct: 191 LLHWLQRQHPHFNRSGGADHFLVAGRISWDFRRMPSAAGEWGSSLFHQIEMRSVKRLVI- 249
Query: 167 LLQRATLVQTFGQRNHVCLNEGSITIP----PYAPPQKMQAHQIPPDTPRSIFVYFRGLF 222
+RN +E + P P + Q + +PR V F G
Sbjct: 250 ------------ERNPWDDSELGVPYPTSFHPSSDEDLAQWVEFVQGSPRPHLVAFAG-- 295
Query: 223 YDVNNDPEGGYYA----------RGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFC 272
P GY + R A + T P + ++FC
Sbjct: 296 -----SPRPGYRSDFRQVLLGQCRAAPRGISRCLDCTADTAGCTSDPLRVTKLFLSSVFC 350
Query: 273 LCPLGWAPWSPRLVEAVVFGCIPVIIAD-------DIVLPFADAIPWEEIGMFVAEEDVP 325
L P G + L ++++ GCIPV+ + ++ LP EE +F+ + V
Sbjct: 351 LQPRGDSFTRKSLFDSLISGCIPVLFWNQSAYWQYELYLPRDP----EEYSVFIPHQSVK 406
Query: 326 K----LDTILTSIPPEVILRKQR 344
LD +L I E I R QR
Sbjct: 407 NGTNVLD-VLQGISRERIGRMQR 428
>gi|222629835|gb|EEE61967.1| hypothetical protein OsJ_16742 [Oryza sativa Japonica Group]
Length = 196
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 56/115 (48%), Gaps = 11/115 (9%)
Query: 36 PVGKLKVYVYELPSKYNKKLLLKD-------PRCLTHMFAAEIFMHRFLLSSPVRTLNPE 88
P L++YVY + LL+ C+ + ++ +H+ LLSS RT + +
Sbjct: 51 PHRDLRIYVYAEDEVDGLRALLRGRNDDVTAATCIKGQWGTQVKIHQLLLSSRFRTFDKD 110
Query: 89 EADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFV 143
EAD F+ P Y C + TG + ++ ++++ S PY+ R+ G DH FV
Sbjct: 111 EADLFFVPTYVKC-VRMTG---KLNDKEINQTYVKVVLSQMPYFRRSGGRDHIFV 161
>gi|431915722|gb|ELK16055.1| Exostosin-2 [Pteropus alecto]
Length = 1849
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 45/83 (54%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC+ G L + + GC+PV+IAD +LPF++ + W+ + V E
Sbjct: 1439 YPQVLQEATFCVVLRGARLGQAVLSDVLRAGCVPVVIADSYILPFSEVLDWKRASVVVPE 1498
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + + +IL SIP I Q+
Sbjct: 1499 EKISDVYSILQSIPQRQIEEMQK 1521
>gi|47230478|emb|CAF99671.1| unnamed protein product [Tetraodon nigroviridis]
Length = 722
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 47/83 (56%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q + FC+ G L + + GC+PVI+AD +LPF++ + W+ +F+ E
Sbjct: 309 YPQILQESSFCVVLRGARLGQAVLSDVLQAGCVPVILADSYILPFSEVLDWKRASVFIPE 368
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + ++ +IL SIP + QR
Sbjct: 369 EKLSEMYSILKSIPHRQVEEMQR 391
>gi|307176975|gb|EFN66281.1| Exostosin-2 [Camponotus floridanus]
Length = 563
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 34/95 (35%), Positives = 47/95 (49%), Gaps = 10/95 (10%)
Query: 250 PLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 309
P FDIS + A FCL G L++A+ GCIPVIIAD + +PF D
Sbjct: 154 PGFDIS----------LPSATFCLIIRGARLAQSSLLDAMAAGCIPVIIADSLTMPFHDV 203
Query: 310 IPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQR 344
I W + +FV E D+ + +L I + I+ Q
Sbjct: 204 IDWTKAAVFVREVDILLIIQLLKKISHQRIMEMQE 238
>gi|348558689|ref|XP_003465149.1| PREDICTED: exostosin-2-like isoform 2 [Cavia porcellus]
Length = 669
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 45/83 (54%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC+ G L + + GC+PVIIAD +LPF++ + W+ + V E
Sbjct: 307 YPQVLQEATFCVVLRGARLGQAALSDVLQAGCVPVIIADSYILPFSEVLDWKRASVVVPE 366
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + + +IL SIP + QR
Sbjct: 367 EKIADVYSILQSIPRRQMEEMQR 389
>gi|348558687|ref|XP_003465148.1| PREDICTED: exostosin-2-like isoform 1 [Cavia porcellus]
Length = 717
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 45/83 (54%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC+ G L + + GC+PVIIAD +LPF++ + W+ + V E
Sbjct: 307 YPQVLQEATFCVVLRGARLGQAALSDVLQAGCVPVIIADSYILPFSEVLDWKRASVVVPE 366
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + + +IL SIP + QR
Sbjct: 367 EKIADVYSILQSIPRRQMEEMQR 389
>gi|196014406|ref|XP_002117062.1| hypothetical protein TRIADDRAFT_1026 [Trichoplax adhaerens]
gi|190580284|gb|EDV20368.1| hypothetical protein TRIADDRAFT_1026 [Trichoplax adhaerens]
Length = 657
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/79 (35%), Positives = 48/79 (60%), Gaps = 1/79 (1%)
Query: 263 YEDMQ-RAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y+D+ + FCL P G S R +EA+ FGCIP+++++ VLPF++ I W++ + + E
Sbjct: 228 YQDLLINSTFCLVPRGRRLGSFRFLEALQFGCIPIVLSNGWVLPFSEVIDWKKACVQIDE 287
Query: 322 EDVPKLDTILTSIPPEVIL 340
+ + ++ SI E IL
Sbjct: 288 RQLFDVPELIESISDEKIL 306
>gi|426245345|ref|XP_004016473.1| PREDICTED: exostosin-2 [Ovis aries]
Length = 718
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 45/83 (54%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC+ G L + + GC+PVIIAD VLPF++ + W+ + V E
Sbjct: 308 YPQVLQEATFCMVLRGARLGQAVLSDVLRAGCVPVIIADSYVLPFSEVLDWKRASVVVPE 367
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + + +IL SIP I QR
Sbjct: 368 EKMLDVYSILQSIPRRQIEEMQR 390
>gi|355752199|gb|EHH56319.1| Exostosin-2, partial [Macaca fascicularis]
Length = 733
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 45/83 (54%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC+ G L + + GC+PV+IAD +LPF++ + W+ + V E
Sbjct: 323 YPQVLQEATFCVVLRGARLGQAMLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 382
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + + +IL SIP I QR
Sbjct: 383 EKMSDVYSILQSIPQRQIEEMQR 405
>gi|332210815|ref|XP_003254508.1| PREDICTED: exostosin-2 isoform 1 [Nomascus leucogenys]
Length = 718
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 75/327 (22%), Positives = 125/327 (38%), Gaps = 52/327 (15%)
Query: 35 DPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFY 94
+P K+KVY+Y L KY + ++ + E+ M ++D++
Sbjct: 99 NPKNKIKVYIYAL-KKYVDDFGVSVSNTISREYN-ELLMA------------ISDSDYYT 144
Query: 95 TPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVV-----PHDFG 149
I C P+ L + R+ +A L + W+R G +H P D+
Sbjct: 145 DDINRACLFVPSIDVLNQNTLRIKETAQAL--AQLSRWDR--GMNHLLFNMLPGGPPDYN 200
Query: 150 ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG-SITIPPYAPPQKMQAHQIPP 208
++A+ G G + +G ++IP Y+P + A P
Sbjct: 201 TALDVPRDRALLAG--------------GGFSTWTYRQGYDVSIPVYSP---LSAEVDLP 243
Query: 209 DT---PRSIFVYFR--GLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPT--- 260
+ PR F+ GL + D E G V + N +S
Sbjct: 244 EKGPGPRQYFLLSSQVGLHPEYREDLEALQVKHGESVLVLDKCTNLSEGVLSVRKRCHKH 303
Query: 261 ---TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGM 317
Y + +Q A FC+ G L + + GC+PV+IAD +LPF++ + W+ +
Sbjct: 304 QVFDYPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASV 363
Query: 318 FVAEEDVPKLDTILTSIPPEVILRKQR 344
V EE + + +IL SIP I QR
Sbjct: 364 VVPEEKMSDVYSILQSIPQRQIEEMQR 390
>gi|224085069|ref|XP_002307477.1| predicted protein [Populus trichocarpa]
gi|222856926|gb|EEE94473.1| predicted protein [Populus trichocarpa]
Length = 460
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 40/133 (30%), Positives = 57/133 (42%), Gaps = 12/133 (9%)
Query: 40 LKVYVYELPSKYNKKLLLKDPR-------CLTHMFAAEIFMHRFLLSSPVRTLNPEEADW 92
LK+YVYE K LL+ C+ + ++ +HR LL S RT EA+
Sbjct: 87 LKIYVYEEDEIDGLKELLRGREGKISADACVKGQWGTQVKIHRLLLQSRFRTRKKGEANL 146
Query: 93 FYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
F+ P Y C GL + + ++ + S PY+ R+ G DH FV P GA
Sbjct: 147 FFVPAYAKCVRMMGGL----NDKEINHTYVKAL-SQMPYFRRSGGRDHIFVFPSGAGAHL 201
Query: 153 HYQEEKAIERGIL 165
I R I+
Sbjct: 202 FRSWATYINRSII 214
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 33/119 (27%), Positives = 58/119 (48%), Gaps = 5/119 (4%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y++ ++ A FC+ P G + W+ R E+ C+PVI++D PF + I + +I +
Sbjct: 310 YFQHLRNAKFCVAPRGESSWTLRFYESFFVECVPVILSDQAEFPFQNVIDYTQISIKWPS 369
Query: 322 EDVP-KLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPA-QPGDAFHQILNGLARKL 378
+ +L L SIP E + +R++A R + P + A I+ L RK+
Sbjct: 370 TRIGLELLEYLESIPDEDV---ERMIAAGRQVRCLWVYAPELELCSAMQGIMWELQRKV 425
>gi|221041480|dbj|BAH12417.1| unnamed protein product [Homo sapiens]
Length = 337
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 45/83 (54%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC+ G L + + GC+PV+IAD +LPF++ + W+ + V E
Sbjct: 189 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 248
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + + +IL SIP I QR
Sbjct: 249 EKMSDVYSILQSIPQRQIEEMQR 271
>gi|118482536|gb|ABK93189.1| unknown [Populus trichocarpa]
Length = 230
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 40/133 (30%), Positives = 57/133 (42%), Gaps = 12/133 (9%)
Query: 40 LKVYVYELPSKYNKKLLLKDPR-------CLTHMFAAEIFMHRFLLSSPVRTLNPEEADW 92
LK+YVYE K LL+ C+ + ++ +HR LL S RT EA+
Sbjct: 87 LKIYVYEEDEIDGLKELLRGREGRISADACVKGQWGTQVKIHRLLLQSRFRTRKKGEANL 146
Query: 93 FYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
F+ P Y C GL + + ++ + S PY+ R+ G DH FV P GA
Sbjct: 147 FFVPAYAKCVRMMGGL----NDKEINHTYVKAL-SQMPYFRRSGGRDHIFVFPSGAGAHL 201
Query: 153 HYQEEKAIERGIL 165
I R I+
Sbjct: 202 FRSWATYINRSII 214
>gi|46370069|ref|NP_997005.1| exostosin-2 isoform 2 [Homo sapiens]
gi|3023739|sp|Q93063.1|EXT2_HUMAN RecName: Full=Exostosin-2; AltName:
Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
acetylglucosaminyl-proteoglycan
4-alpha-N-acetylglucosaminyltransferase; AltName:
Full=Multiple exostoses protein 2; AltName:
Full=Putative tumor suppressor protein EXT2
gi|1518042|gb|AAB07008.1| EXT2 [Homo sapiens]
gi|1519605|gb|AAC51219.1| multiple exostosis 2 [Homo sapiens]
gi|1621113|gb|AAC50764.1| hereditary multiple exostoses gene 2 protein [Homo sapiens]
gi|14603196|gb|AAH10058.1| Exostoses (multiple) 2 [Homo sapiens]
gi|119588474|gb|EAW68068.1| exostoses (multiple) 2, isoform CRA_b [Homo sapiens]
gi|189065454|dbj|BAG35293.1| unnamed protein product [Homo sapiens]
gi|325463523|gb|ADZ15532.1| exostoses (multiple) 2 [synthetic construct]
Length = 718
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 45/83 (54%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC+ G L + + GC+PV+IAD +LPF++ + W+ + V E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + + +IL SIP I QR
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQR 390
>gi|332210819|ref|XP_003254510.1| PREDICTED: exostosin-2 isoform 3 [Nomascus leucogenys]
Length = 751
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 75/327 (22%), Positives = 125/327 (38%), Gaps = 52/327 (15%)
Query: 35 DPVGKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFY 94
+P K+KVY+Y L KY + ++ + E+ M ++D++
Sbjct: 132 NPKNKIKVYIYAL-KKYVDDFGVSVSNTISREYN-ELLMA------------ISDSDYYT 177
Query: 95 TPIYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVV-----PHDFG 149
I C P+ L + R+ +A L + W+R G +H P D+
Sbjct: 178 DDINRACLFVPSIDVLNQNTLRIKETAQAL--AQLSRWDR--GMNHLLFNMLPGGPPDYN 233
Query: 150 ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG-SITIPPYAPPQKMQAHQIPP 208
++A+ G G + +G ++IP Y+P + A P
Sbjct: 234 TALDVPRDRALLAG--------------GGFSTWTYRQGYDVSIPVYSP---LSAEVDLP 276
Query: 209 DT---PRSIFVYFR--GLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPT--- 260
+ PR F+ GL + D E G V + N +S
Sbjct: 277 EKGPGPRQYFLLSSQVGLHPEYREDLEALQVKHGESVLVLDKCTNLSEGVLSVRKRCHKH 336
Query: 261 ---TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGM 317
Y + +Q A FC+ G L + + GC+PV+IAD +LPF++ + W+ +
Sbjct: 337 QVFDYPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASV 396
Query: 318 FVAEEDVPKLDTILTSIPPEVILRKQR 344
V EE + + +IL SIP I QR
Sbjct: 397 VVPEEKMSDVYSILQSIPQRQIEEMQR 423
>gi|444707556|gb|ELW48821.1| Exostosin-2 [Tupaia chinensis]
Length = 740
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 45/83 (54%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FCL G L + + GC+PVI+AD +LPF++ + W+ + V E
Sbjct: 330 YPQVLQEATFCLVLRGARLGQAVLSDVLQAGCVPVIVADSYILPFSEVLDWKRASVAVPE 389
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + + +IL S+P I QR
Sbjct: 390 EKLSDVYSILQSVPQRQIEEMQR 412
>gi|268536482|ref|XP_002633376.1| C. briggsae CBR-RIB-1 protein [Caenorhabditis briggsae]
Length = 349
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 32/116 (27%), Positives = 57/116 (49%), Gaps = 1/116 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R L + ++N + + W+ ++++ + ++ Y+D+ + FCL P G
Sbjct: 186 RNLVHHLHNGEDIVMVTTCKHNSDWQAYQDDRCQGDNNEYDRWEYDDLLANSTFCLVPRG 245
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTS 333
S R +E + GCIPV+I+D VLPF++ W + VAE D + +L S
Sbjct: 246 RRLGSFRFLETLRSGCIPVVISDSWVLPFSETTDWNSAVIVVAERDALSIPELLMS 301
>gi|426368045|ref|XP_004051025.1| PREDICTED: exostosin-2 isoform 1 [Gorilla gorilla gorilla]
Length = 718
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 45/83 (54%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC+ G L + + GC+PV+IAD +LPF++ + W+ + V E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + + +IL SIP I QR
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQR 390
>gi|390470425|ref|XP_002755247.2| PREDICTED: exostosin-2 isoform 1 [Callithrix jacchus]
Length = 718
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 45/83 (54%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC+ G L + + GC+PV+IAD +LPF++ + W+ + V E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + + +IL SIP I QR
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQR 390
>gi|46250443|gb|AAH68545.1| Exostoses (multiple) 2 [Homo sapiens]
Length = 718
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 45/83 (54%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC+ G L + + GC+PV+IAD +LPF++ + W+ + V E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + + +IL SIP I QR
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQR 390
>gi|296010873|ref|NP_000392.3| exostosin-2 isoform 1 [Homo sapiens]
gi|119588477|gb|EAW68071.1| exostoses (multiple) 2, isoform CRA_d [Homo sapiens]
Length = 751
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 45/83 (54%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC+ G L + + GC+PV+IAD +LPF++ + W+ + V E
Sbjct: 341 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 400
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + + +IL SIP I QR
Sbjct: 401 EKMSDVYSILQSIPQRQIEEMQR 423
>gi|73982548|ref|XP_533196.2| PREDICTED: exostosin-2 isoform 2 [Canis lupus familiaris]
Length = 718
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 45/83 (54%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC+ G L + + GC+PV+IAD +LPF++ + W+ + V E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLRAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + + +IL SIP I QR
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQR 390
>gi|403254631|ref|XP_003920065.1| PREDICTED: exostosin-2 [Saimiri boliviensis boliviensis]
Length = 718
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 45/83 (54%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC+ G L + + GC+PV+IAD +LPF++ + W+ + V E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + + +IL SIP I QR
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQR 390
>gi|432090368|gb|ELK23794.1| Exostosin-2 [Myotis davidii]
Length = 718
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 45/83 (54%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC+ G L + + GC+PV+IAD +LPF++ + W+ + V E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLRAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + + +IL SIP I QR
Sbjct: 368 EKMADVYSILQSIPQRQIEEMQR 390
>gi|297268065|ref|XP_001106468.2| PREDICTED: exostosin-2 [Macaca mulatta]
Length = 718
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 45/83 (54%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC+ G L + + GC+PV+IAD +LPF++ + W+ + V E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAILSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + + +IL SIP I QR
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQR 390
>gi|119588473|gb|EAW68067.1| exostoses (multiple) 2, isoform CRA_a [Homo sapiens]
gi|119588475|gb|EAW68069.1| exostoses (multiple) 2, isoform CRA_a [Homo sapiens]
Length = 731
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 45/83 (54%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC+ G L + + GC+PV+IAD +LPF++ + W+ + V E
Sbjct: 321 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 380
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + + +IL SIP I QR
Sbjct: 381 EKMSDVYSILQSIPQRQIEEMQR 403
>gi|291384884|ref|XP_002708900.1| PREDICTED: exostosin 2 [Oryctolagus cuniculus]
Length = 765
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 44/83 (53%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC+ G L + + GC+PV+IAD +LPF++ + W+ + V E
Sbjct: 355 YPQVLQEATFCVVLRGARLGQAALSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 414
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + + IL SIP I QR
Sbjct: 415 EKMADVYKILQSIPQRQIQEMQR 437
>gi|126332604|ref|XP_001362716.1| PREDICTED: exostosin-2 [Monodelphis domestica]
Length = 718
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 46/88 (52%)
Query: 257 DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG 316
D Y + +Q A FC+ G L + + GC+PVIIAD +LPF++ + W+
Sbjct: 303 DQVFDYPQVLQEASFCMVLRGARLGQAVLSDVLQAGCVPVIIADSYILPFSEVLDWKRAS 362
Query: 317 MFVAEEDVPKLDTILTSIPPEVILRKQR 344
+ + EE + + ++L SIP I QR
Sbjct: 363 VVIPEEKMRDMYSVLRSIPQRQIEEMQR 390
>gi|402893736|ref|XP_003910045.1| PREDICTED: exostosin-2-like isoform 1 [Papio anubis]
Length = 535
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 45/83 (54%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC+ G L + + GC+PV+IAD +LPF++ + W+ + V E
Sbjct: 341 YPQVLQEATFCVVLRGARLGQAILSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 400
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + + +IL SIP I QR
Sbjct: 401 EKMSDVYSILQSIPQRQIEEMQR 423
>gi|432851754|ref|XP_004067068.1| PREDICTED: exostosin-2-like [Oryzias latipes]
Length = 719
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 46/83 (55%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q + FC+ G L + + GC+PVI+AD +LPF++ + W+ + + E
Sbjct: 309 YPQILQESSFCVVLRGARLGQATLSDVLQAGCVPVILADSYILPFSEVLDWKRASVVIPE 368
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + ++ TIL SIP + QR
Sbjct: 369 EKLSEMYTILKSIPHRQVEEMQR 391
>gi|402893738|ref|XP_003910046.1| PREDICTED: exostosin-2-like isoform 2 [Papio anubis]
Length = 515
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 45/83 (54%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC+ G L + + GC+PV+IAD +LPF++ + W+ + V E
Sbjct: 321 YPQVLQEATFCVVLRGARLGQAILSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 380
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + + +IL SIP I QR
Sbjct: 381 EKMSDVYSILQSIPQRQIEEMQR 403
>gi|194698238|gb|ACF83203.1| unknown [Zea mays]
gi|413919897|gb|AFW59829.1| hypothetical protein ZEAMMB73_270023 [Zea mays]
Length = 264
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/120 (29%), Positives = 56/120 (46%), Gaps = 13/120 (10%)
Query: 206 IPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYED 265
+ + R +YF+G + GG R +W+ N P + P +
Sbjct: 61 LSENKDRRTLLYFKG----AKHRHRGGL----VREKLWDLLGNEPDVIMEEGFPNATGRE 112
Query: 266 -----MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
++ + FCL P G P S RL +A+ CIPVI++D++ LPF I + EI +FV+
Sbjct: 113 QSIKGLRTSEFCLHPAGDTPTSCRLFDAIASLCIPVIVSDEVELPFEGIIDYTEISIFVS 172
>gi|168277694|dbj|BAG10825.1| exostosin-2 [synthetic construct]
Length = 746
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 45/83 (54%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC+ G L + + GC+PV+IAD +LPF++ + W+ + V E
Sbjct: 336 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 395
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + + +IL SIP I QR
Sbjct: 396 EKMSDVYSILQSIPQRQIEEMQR 418
>gi|148235505|ref|NP_001080448.1| exostosin 2 [Xenopus laevis]
gi|27881752|gb|AAH44703.1| Ext2-prov protein [Xenopus laevis]
Length = 718
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 57/111 (51%), Gaps = 9/111 (8%)
Query: 234 YARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGC 293
Y+ GA AA+ + N ++D Y + +Q + FC+ G L + + GC
Sbjct: 289 YSDGA-AALRKRCYKNTVYD--------YPQILQESTFCIVLRGARLGQGLLSDVLQAGC 339
Query: 294 IPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQR 344
+PVIIAD VLPF++ + W+ + + EE + ++ +IL IP + QR
Sbjct: 340 VPVIIADSYVLPFSEVLDWKRASVVIPEEKMFEMYSILQGIPQRQVEEMQR 390
>gi|348509623|ref|XP_003442347.1| PREDICTED: exostosin-2-like [Oreochromis niloticus]
Length = 719
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 46/83 (55%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q + FC+ G L + + GC+PVI+AD +LPF++ + W+ + + E
Sbjct: 309 YPQILQESSFCVVLRGARLGQAALSDVLQAGCVPVILADSYILPFSEVLDWKRASVVIPE 368
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + ++ TIL SIP + QR
Sbjct: 369 EKLSEMYTILKSIPHRQVEEMQR 391
>gi|299472640|emb|CBN78292.1| Glycosyltransferase, family GT47 [Ectocarpus siliculosus]
Length = 587
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 41/81 (50%)
Query: 263 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEE 322
Y D+ F L P G +P + RL EA+ G +PV I D V PF IPW + E
Sbjct: 473 YGDLMNTTFALLPAGRSPATYRLAEALSAGALPVFIHQDFVKPFPGKIPWSDFSFSFPPE 532
Query: 323 DVPKLDTILTSIPPEVILRKQ 343
+VP++ L ++P + + Q
Sbjct: 533 EVPRMLKTLRAVPDRELAQMQ 553
>gi|194755878|ref|XP_001960206.1| GF13249 [Drosophila ananassae]
gi|190621504|gb|EDV37028.1| GF13249 [Drosophila ananassae]
Length = 765
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 33/104 (31%), Positives = 56/104 (53%), Gaps = 3/104 (2%)
Query: 243 WENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 301
W ++N + + ++ YE + Q + FCL P G S R +EA+ GCIPV++++
Sbjct: 302 WRELQDNRCDEDNREYDRYDYETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNA 361
Query: 302 IVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVI--LRKQ 343
VLPF I W++ ++ E + ++ + SI E I LR+Q
Sbjct: 362 WVLPFESKIDWKQAAIWADERLLLQVPDTVRSISVERIFALRQQ 405
>gi|354491358|ref|XP_003507822.1| PREDICTED: exostosin-2 [Cricetulus griseus]
Length = 718
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 45/83 (54%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC+ G L + + GC+PV+IAD +LPF++ + W+ + V E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + + +IL +IP I QR
Sbjct: 368 EKMSDMYSILQNIPQRQIEEMQR 390
>gi|56751461|ref|YP_172162.1| hypothetical protein syc1452_c [Synechococcus elongatus PCC 6301]
gi|56686420|dbj|BAD79642.1| hypothetical protein [Synechococcus elongatus PCC 6301]
Length = 788
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/146 (28%), Positives = 65/146 (44%), Gaps = 10/146 (6%)
Query: 197 PPQKMQAHQIPPDTPRSIFVYFRGL--------FYDVNNDPEGGYYARGARAAVWENFKN 248
PP + P RS + FRG ++ PE A + + W
Sbjct: 134 PPMPLLDQHCYPSRDRSTLLSFRGANSHPVREQLQRLHQPPEIA--AELIQQSYWGTLNY 191
Query: 249 NPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
+ + Y + + R+ F + P G +S RL+E + G IPVI+ADD VLPF++
Sbjct: 192 VDEAEGLSAEQQVYTDLIARSRFSVAPRGHDIFSYRLLEVMAGGAIPVILADDWVLPFSE 251
Query: 309 AIPWEEIGMFVAEEDVPKLDTILTSI 334
+ W E + VAE+ +L +L +I
Sbjct: 252 LLDWSEFSLSVAEDRCWELPQLLQAI 277
>gi|344257014|gb|EGW13118.1| Exostosin-2 [Cricetulus griseus]
Length = 701
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 45/83 (54%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC+ G L + + GC+PV+IAD +LPF++ + W+ + V E
Sbjct: 291 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 350
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + + +IL +IP I QR
Sbjct: 351 EKMSDMYSILQNIPQRQIEEMQR 373
>gi|307215037|gb|EFN89864.1| Exostosin-2 [Harpegnathos saltator]
Length = 594
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 44/83 (53%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + + A FCL G L++A+ GCIPVIIAD +++PF D I W + + V E
Sbjct: 187 YPDILHSATFCLIIRGARLAQSVLLDAMAAGCIPVIIADSLIMPFHDVIDWTKAAILVRE 246
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
D+ + +L I + I+ Q
Sbjct: 247 VDILLIIQLLKKISHQRIVEMQE 269
>gi|308799499|ref|XP_003074530.1| Acetylglucosaminyltransferase EXT1/exostosin 1 (ISS) [Ostreococcus
tauri]
gi|116000701|emb|CAL50381.1| Acetylglucosaminyltransferase EXT1/exostosin 1 (ISS), partial
[Ostreococcus tauri]
Length = 439
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 46/83 (55%), Gaps = 3/83 (3%)
Query: 252 FDIST---DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
+D+ST D P Y + ++ +CL G + RL + + FGC+PVI+AD LPF+
Sbjct: 328 WDLSTSGQDKPRDYMMLLSKSRYCLYVYGDRAHTARLYDIITFGCVPVIVADGYDLPFSW 387
Query: 309 AIPWEEIGMFVAEEDVPKLDTIL 331
W + + V E+DV KL IL
Sbjct: 388 LFDWSKFSVRVPEDDVAKLPGIL 410
>gi|81298858|ref|YP_399066.1| TPR repeat-containing protein [Synechococcus elongatus PCC 7942]
gi|81167739|gb|ABB56079.1| TPR repeat [Synechococcus elongatus PCC 7942]
Length = 788
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/146 (28%), Positives = 65/146 (44%), Gaps = 10/146 (6%)
Query: 197 PPQKMQAHQIPPDTPRSIFVYFRGL--------FYDVNNDPEGGYYARGARAAVWENFKN 248
PP + P RS + FRG ++ PE A + + W
Sbjct: 134 PPMPLLDQHCYPSRDRSTLLSFRGANSHPVREQLQRLHQPPEIA--AELIQQSYWGTLNY 191
Query: 249 NPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
+ + Y + + R+ F + P G +S RL+E + G IPVI+ADD VLPF++
Sbjct: 192 VDEAEGLSAEQQVYTDLIARSRFSVAPRGHDIFSYRLLEVMAGGAIPVILADDWVLPFSE 251
Query: 309 AIPWEEIGMFVAEEDVPKLDTILTSI 334
+ W E + VAE+ +L +L +I
Sbjct: 252 LLDWSEFSLSVAEDRCWELPQLLQAI 277
>gi|410912512|ref|XP_003969733.1| PREDICTED: exostosin-2-like [Takifugu rubripes]
Length = 719
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 46/83 (55%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q + FC+ G L + + GC+PVI+AD +LPF++ + W+ +F+ E
Sbjct: 309 YPQVLQDSSFCVVLRGARLGQAALSDVLQAGCVPVILADSYILPFSEVLDWKRASVFIPE 368
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + ++ IL SIP + QR
Sbjct: 369 EKLSEMYGILKSIPHRQVEEMQR 391
>gi|307197832|gb|EFN78943.1| Exostosin-1 [Harpegnathos saltator]
Length = 711
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 58/124 (46%), Gaps = 1/124 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y ++N + + W ++ + ++ T YE + A FCL P G
Sbjct: 254 RNALYHLHNGKDLVFVTTCRHGKAWRELQDEHCQQDNQEYDTYDYEILLMNATFCLVPRG 313
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ GCIPVI+++ LPF + I W + +F E + ++ I+ S+
Sbjct: 314 RRLGSFRFLEALRAGCIPVILSNGWALPFHERIDWTQAVIFSDERLLLQIPDIVRSVSNV 373
Query: 338 VILR 341
IL+
Sbjct: 374 QILK 377
>gi|348538380|ref|XP_003456670.1| PREDICTED: exostosin-1a-like [Oreochromis niloticus]
Length = 776
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 57/236 (24%), Positives = 94/236 (39%), Gaps = 20/236 (8%)
Query: 118 MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTF 177
+RS IQ N P WN +G +H + G+ Y E+ E G L + + V F
Sbjct: 203 IRSRIQ----NLPTWN--DGRNHLIFNLYS-GSWPDYTEDLGFEVGQAMLAKASADVVNF 255
Query: 178 GQRNHVCL------------NEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDV 225
+ + G +T+ P +K Q I R Y +
Sbjct: 256 RSNYDISIPLFSKDHPLKGGGIGYLTLNDAPPSRKYQLVFKGKRYLTGIGSETRNALYHI 315
Query: 226 NNDPEGGYYARGARAAVWENFKNNPLFDISTDHPT-TYYEDMQRAIFCLCPLGWAPWSPR 284
+N + WE K++ + D+ Y E + + FCL P G S R
Sbjct: 316 HNGEDIILLTTCKHGKDWEKHKDSRCDRDNEDYSKFDYQELLHNSTFCLVPRGRRLGSFR 375
Query: 285 LVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVIL 340
+EA+ CIPVI+++ LPF++ I W + + E + ++ +I S+ + IL
Sbjct: 376 FLEALQAACIPVILSNGWELPFSEVIDWRKAAIIGDERLLLQVPSITRSVGRDRIL 431
>gi|195123279|ref|XP_002006135.1| GI20872 [Drosophila mojavensis]
gi|193911203|gb|EDW10070.1| GI20872 [Drosophila mojavensis]
Length = 761
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 68/147 (46%), Gaps = 11/147 (7%)
Query: 200 KMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHP 259
K H I +T S+F ++N + W ++N + + ++
Sbjct: 266 KRYVHGIGSETRNSLF--------HLHNGRDMVMVTTCKHGKSWRELQDNRCDEDNREYD 317
Query: 260 TTYYEDM-QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
Y + Q + FCL P G S R +EA+ GCIPV++++ VLPF I W++ ++
Sbjct: 318 RYDYGTLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIW 377
Query: 319 VAEEDVPKLDTILTSIPPEVI--LRKQ 343
E + ++ I+ SI E I LR+Q
Sbjct: 378 ADERLLLQVPDIVRSISAERIFALRQQ 404
>gi|340719750|ref|XP_003398310.1| PREDICTED: exostosin-1-like isoform 1 [Bombus terrestris]
gi|340719752|ref|XP_003398311.1| PREDICTED: exostosin-1-like isoform 2 [Bombus terrestris]
Length = 711
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 58/124 (46%), Gaps = 1/124 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y ++N + + W ++ + ++ T YE + A FCL P G
Sbjct: 254 RNALYHLHNGKDLVFVTTCRHGKAWRELQDEHCQQDNQEYDTYDYEILLMNATFCLVPRG 313
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ GCIPVI+++ LPF + I W + +F E + ++ I+ S+
Sbjct: 314 RRLGSFRFLEALRAGCIPVILSNGWALPFHERIDWTQAVIFSDERLLLQIPDIVRSVSNV 373
Query: 338 VILR 341
IL+
Sbjct: 374 HILK 377
>gi|242020116|ref|XP_002430502.1| Exostosin-2, putative [Pediculus humanus corporis]
gi|212515659|gb|EEB17764.1| Exostosin-2, putative [Pediculus humanus corporis]
Length = 696
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 42/155 (27%), Positives = 68/155 (43%), Gaps = 3/155 (1%)
Query: 190 ITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN 249
I++P ++P K + + FV + + E G + + + EN +
Sbjct: 219 ISLPLFSPYAK-SIKTLNKSQEKKWFVINSQANLHSDYERELGALSGVVKLELCEN--GD 275
Query: 250 PLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 309
P F Y +Q A FC+ G L+E++ GCIPVI AD +VLPF D
Sbjct: 276 PRFRCHQGIAYNYPSVLQHATFCIIIRGARLAQQALLESLSAGCIPVIAADLMVLPFQDV 335
Query: 310 IPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQR 344
I W+ + + E D+ L L+S+ + L Q+
Sbjct: 336 IDWKRASITILESDLSSLIEKLSSVSDDKKLELQQ 370
>gi|350421075|ref|XP_003492724.1| PREDICTED: exostosin-1-like isoform 1 [Bombus impatiens]
gi|350421078|ref|XP_003492725.1| PREDICTED: exostosin-1-like isoform 2 [Bombus impatiens]
Length = 711
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 58/124 (46%), Gaps = 1/124 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y ++N + + W ++ + ++ T YE + A FCL P G
Sbjct: 254 RNALYHLHNGKDLVFVTTCRHGKAWRELQDEHCQQDNQEYDTYDYEILLMNATFCLVPRG 313
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ GCIPVI+++ LPF + I W + +F E + ++ I+ S+
Sbjct: 314 RRLGSFRFLEALRAGCIPVILSNGWALPFHERIDWTQAVIFSDERLLLQIPDIVRSVSNV 373
Query: 338 VILR 341
IL+
Sbjct: 374 HILK 377
>gi|170054218|ref|XP_001863025.1| exostosin-2 [Culex quinquefasciatus]
gi|167874545|gb|EDS37928.1| exostosin-2 [Culex quinquefasciatus]
Length = 758
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 47/80 (58%), Gaps = 2/80 (2%)
Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
++ FCL G P L++A+ GCIPVI+AD++VLPF + + W+ + + + E ++
Sbjct: 354 LENGQFCLVARGVRLSQPTLMDALASGCIPVIMADNLVLPFGEVLDWDLVSIRIHENNLH 413
Query: 326 KLDTILTSIPPEVI--LRKQ 343
+ + L ++ E + LR Q
Sbjct: 414 SVISTLKAVSKERVQELRAQ 433
>gi|440791537|gb|ELR12775.1| exostosin family protein [Acanthamoeba castellanii str. Neff]
Length = 504
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 40/74 (54%), Gaps = 1/74 (1%)
Query: 249 NPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
P+F + H T Y E + +++FC G P + L + GCIP++ +D LPF D
Sbjct: 299 GPMFGATVPH-TYYVEQITQSVFCAVARGHTPTTRALFNMLAGGCIPILFSDRWHLPFVD 357
Query: 309 AIPWEEIGMFVAEE 322
+PW E+ +F EE
Sbjct: 358 QLPWHEMVIFAPEE 371
>gi|401412800|ref|XP_003885847.1| conserved hypothetical protein [Neospora caninum Liverpool]
gi|325120267|emb|CBZ55821.1| conserved hypothetical protein [Neospora caninum Liverpool]
Length = 1577
Score = 56.6 bits (135), Expect = 2e-05, Method: Composition-based stats.
Identities = 35/84 (41%), Positives = 47/84 (55%), Gaps = 4/84 (4%)
Query: 266 MQRAIFCLCPL-GWAPWSPRLVEAVVFGCIPVIIA--DDIVLPFADAIPWEEIGMFVAEE 322
+ RA FCLCP GW P S + EA+ CIPVII +++ LP W +I +FV
Sbjct: 1422 ISRATFCLCPQEGWLP-SLCIFEALAHACIPVIIGGEEELWLPGGCLFDWIQIAVFVPLS 1480
Query: 323 DVPKLDTILTSIPPEVILRKQRLL 346
P IL+ IP + IL KQ++L
Sbjct: 1481 RAPYTSLILSLIPEKEILEKQQML 1504
>gi|335281986|ref|XP_003122905.2| PREDICTED: exostosin-2-like [Sus scrofa]
Length = 718
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 45/83 (54%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC+ G L + + GC+PV+IAD VLPF++ + W+ + V E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLRAGCVPVVIADSYVLPFSEVLDWKRASVVVPE 367
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + + +IL SIP + QR
Sbjct: 368 EKMSDVYSILQSIPRRQMEEMQR 390
>gi|302854903|ref|XP_002958955.1| hypothetical protein VOLCADRAFT_100271 [Volvox carteri f.
nagariensis]
gi|300255701|gb|EFJ39990.1| hypothetical protein VOLCADRAFT_100271 [Volvox carteri f.
nagariensis]
Length = 705
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 48/168 (28%), Positives = 79/168 (47%), Gaps = 15/168 (8%)
Query: 221 LFYDVNNDPEGGYYARGARAAVWENFK-----NNPLFDISTDHPTT--YYEDMQRAIFCL 273
F+ + P Y+ GAR A+ + K DI T Y R+ FCL
Sbjct: 508 FFFAGSVRPRDTSYSGGARQALSAHLKALMASGGNYSDIQFVEGTVPDYEALYMRSRFCL 567
Query: 274 CPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD--AIPWEEIGMFVAEEDVPKLDTIL 331
P G A + RL A+ CIPVII D + P+ +P+ + + +++ D+P + IL
Sbjct: 568 APHG-AGFGVRLTLAMTHACIPVIIQDQVYQPYESDGLLPYSQFSLRLSKSDIPYIVDIL 626
Query: 332 TSIPPEVILRKQRL-LANPSMKRAMLFPQPAQPGDAFHQILNGLARKL 378
S+ E R++R+ LA A L+ +P+ G A++ + L ++L
Sbjct: 627 RSVSTE---RQKRMRLAMAKYHHAFLW-EPSLGGRAYNYTIRALNQRL 670
>gi|350423159|ref|XP_003493402.1| PREDICTED: exostosin-2-like [Bombus impatiens]
Length = 708
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 41/79 (51%)
Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
+Q A FCL G L+E + G IP+IIAD + +P+ I W +FV E D+
Sbjct: 305 LQTATFCLVIRGARLGQSTLLECMAAGSIPIIIADSLTMPYHGIIDWSRAAIFVREVDIL 364
Query: 326 KLDTILTSIPPEVILRKQR 344
+ ++L I P+ I+ Q
Sbjct: 365 SIISVLKKISPQRIIELQE 383
>gi|340727245|ref|XP_003401958.1| PREDICTED: exostosin-2-like isoform 2 [Bombus terrestris]
Length = 710
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 41/79 (51%)
Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
+Q A FCL G L+E + G IP+IIAD + +P+ I W +FV E D+
Sbjct: 305 LQTATFCLVIRGARLGQSTLLECMAAGSIPIIIADSLTMPYHGIIDWSRAAIFVREVDIL 364
Query: 326 KLDTILTSIPPEVILRKQR 344
+ ++L I P+ I+ Q
Sbjct: 365 SIISVLKKISPQRIIELQE 383
>gi|401886285|gb|EJT50333.1| hypothetical protein A1Q1_00388 [Trichosporon asahii var. asahii
CBS 2479]
Length = 1041
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 50/106 (47%), Gaps = 2/106 (1%)
Query: 261 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
TY + + FC P G A W+ RL + V GCIPV++ D + + W + V
Sbjct: 891 TYLDLLGDTRFCPIPFGTAGWTYRLSDVVYAGCIPVLVGDQSHMTYWSMFDWSLFSVQVF 950
Query: 321 EEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDA 366
E ++ L+ IL+ I E RKQ L ++ A L+P +A
Sbjct: 951 EHELDHLERILSGITEEDAQRKQDALM--LVREAFLYPSEGHVEEA 994
>gi|340727243|ref|XP_003401957.1| PREDICTED: exostosin-2-like isoform 1 [Bombus terrestris]
Length = 708
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 41/79 (51%)
Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
+Q A FCL G L+E + G IP+IIAD + +P+ I W +FV E D+
Sbjct: 305 LQTATFCLVIRGARLGQSTLLECMAAGSIPIIIADSLTMPYHGIIDWSRAAIFVREVDIL 364
Query: 326 KLDTILTSIPPEVILRKQR 344
+ ++L I P+ I+ Q
Sbjct: 365 SIISVLKKISPQRIIELQE 383
>gi|294956185|ref|XP_002788843.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239904455|gb|EER20639.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 87
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 35/53 (66%)
Query: 261 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE 313
TY +M+ +IFC P G PW+ R+ +A++ GCIPV++++ IV PF + W
Sbjct: 12 TYDAEMKDSIFCFIPRGNTPWTRRIFDAIISGCIPVVLSNAIVFPFESLLDWS 64
>gi|328788198|ref|XP_001121972.2| PREDICTED: exostosin-2 [Apis mellifera]
Length = 708
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 29/79 (36%), Positives = 41/79 (51%)
Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
+Q A FCL G L+E + G IPVIIAD + +PF I W +F+ E D+
Sbjct: 305 LQTATFCLVIRGARLGQTTLLECMAAGSIPVIIADSLTMPFHGIIDWTRAVIFIREVDIL 364
Query: 326 KLDTILTSIPPEVILRKQR 344
L ++L I E I+ Q+
Sbjct: 365 SLISVLKKISQERIIELQQ 383
>gi|255070683|ref|XP_002507423.1| glycosyltransferase family 47 protein [Micromonas sp. RCC299]
gi|226522698|gb|ACO68681.1| glycosyltransferase family 47 protein [Micromonas sp. RCC299]
Length = 698
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 34/123 (27%), Positives = 58/123 (47%), Gaps = 6/123 (4%)
Query: 212 RSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIF 271
R I + FRG + G+Y ++ W+ + + P+ Y M+ + F
Sbjct: 486 RPIEMSFRGTLRGGVRERILGHYLSVGKSRNWDLRSDGQV------SPSRYMRLMRDSKF 539
Query: 272 CLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTIL 331
CL G SPRL+E ++FGC+PVI+AD V P + W + + + E + +L +L
Sbjct: 540 CLHVRGTRVQSPRLIEGMLFGCVPVIVADGYVPPLSWLFDWSKFSVRLPEVEHERLPEVL 599
Query: 332 TSI 334
+
Sbjct: 600 QGV 602
>gi|380025736|ref|XP_003696624.1| PREDICTED: LOW QUALITY PROTEIN: exostosin-2-like [Apis florea]
Length = 700
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 30/83 (36%), Positives = 43/83 (51%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FCL G L+E + G IPVIIAD + +PF I W +F+ E
Sbjct: 295 YPDVLQTATFCLVIRGARLGQTTLLECMAAGSIPVIIADSLTMPFHGIIDWTRAVIFIRE 354
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
D+ L ++L I E I+ Q+
Sbjct: 355 VDILSLISVLKKISQERIIELQQ 377
>gi|358335371|dbj|GAA53901.1| exostosin-2 [Clonorchis sinensis]
Length = 1030
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 49/87 (56%), Gaps = 8/87 (9%)
Query: 262 YYEDMQRAIFCLC----PLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGM 317
Y E + R++FCL P+G L +++ GCIPVI D+ +LPF++ + W +I +
Sbjct: 693 YGEALARSLFCLIIQIPPVGQFA----LFDSMNAGCIPVIADDNFILPFSEVLDWSKIAI 748
Query: 318 FVAEEDVPKLDTILTSIPPEVILRKQR 344
V ++ K+ T LTS E I + QR
Sbjct: 749 RVRHSELHKIVTTLTSFTSEEIAQFQR 775
>gi|355566589|gb|EHH22968.1| Exostosin-2 [Macaca mulatta]
Length = 751
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 45/83 (54%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC+ G L + + GC+PV+IAD +LPF++ + W+ + V E
Sbjct: 341 YPQVLQEATFCVVLRGARLGQAMLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 400
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + + +IL SIP I +R
Sbjct: 401 EKMSDVYSILQSIPQRQIEEMKR 423
>gi|307111745|gb|EFN59979.1| hypothetical protein CHLNCDRAFT_56485 [Chlorella variabilis]
Length = 478
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 30/91 (32%), Positives = 45/91 (49%), Gaps = 14/91 (15%)
Query: 76 FLLSSPVRTLNPEEADWFYTPIYPTCDLTPT--------------GLPLPFKSPRMMRSA 121
L+ S RTL+ EEAD+FY P++ +C + P G + M+ A
Sbjct: 320 MLMQSEHRTLDAEEADFFYVPVFTSCFIWPVRDGADSLYDFFYSVGHNRVQGATNMLLEA 379
Query: 122 IQLISSNWPYWNRTEGADHFFVVPHDFGACF 152
I S+ P+W R G DH ++V HD G+C+
Sbjct: 380 FHWIQSHQPWWERRGGRDHIWLVTHDEGSCW 410
>gi|351696937|gb|EHA99855.1| Exostosin-2 [Heterocephalus glaber]
Length = 717
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 44/83 (53%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC+ G L + + GC+PV+IAD +LPF++ + W+ + V E
Sbjct: 307 YPQVLQEATFCVVLRGARLGQAALSDVLRAGCVPVVIADSYILPFSEVLDWKRASVVVPE 366
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + + +L SIP + QR
Sbjct: 367 EKIADMYGVLQSIPRRQMEEMQR 389
>gi|395801788|ref|ZP_10481043.1| hypothetical protein FF52_07909 [Flavobacterium sp. F52]
gi|395435977|gb|EJG01916.1| hypothetical protein FF52_07909 [Flavobacterium sp. F52]
Length = 341
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 28/110 (25%), Positives = 51/110 (46%), Gaps = 4/110 (3%)
Query: 257 DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG 316
D ++ ++Q ++ C G +S R EA++ G IPV++ D+ LPFA+ I W +
Sbjct: 226 DSTLEFFTNIQNNLYVFCLRGNGNFSIRFYEALIMGRIPVLVDTDVRLPFANTINWSKHC 285
Query: 317 MFVAE----EDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQ 362
+ V+E E + + + E + R L + R F Q ++
Sbjct: 286 LLVSEKNLLEKIVSFHNAKSQVELEAMQFGNRKLIKEKLNRIDYFIQFSK 335
>gi|449460844|ref|XP_004148154.1| PREDICTED: xyloglucan galactosyltransferase KATAMARI1 homolog
[Cucumis sativus]
Length = 471
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 44/147 (29%), Positives = 60/147 (40%), Gaps = 38/147 (25%)
Query: 35 DPVGKLKVYVYELPSKYNKKLLLK----DP---RC------------------------- 62
DP +V+VY+LP +NK LL DP RC
Sbjct: 67 DPCNSGRVFVYDLPPFFNKDLLQNCESLDPWTSRCDDVSNGGFGRQATELNGVVPDGLTP 126
Query: 63 ---LTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMR 119
+ + E MH +L+ RTL+PE A FY P Y + L L + + R
Sbjct: 127 AWFWSEQYMLEPIMHNRILNYKCRTLDPESATAFYIPFYAGLSIGRY-LWLNYTTSDRDR 185
Query: 120 SAIQLIS--SNWPYWNRTEGADHFFVV 144
+ +LI N PYWNR+ G DHF +
Sbjct: 186 DSEKLIEWVQNEPYWNRSNGGDHFITL 212
>gi|302794690|ref|XP_002979109.1| hypothetical protein SELMODRAFT_418827 [Selaginella moellendorffii]
gi|300153427|gb|EFJ20066.1| hypothetical protein SELMODRAFT_418827 [Selaginella moellendorffii]
Length = 435
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 42/143 (29%), Positives = 71/143 (49%), Gaps = 16/143 (11%)
Query: 260 TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV 319
T++ + +IFCL P G P S RL +A+V GCIPVI++D++ PF + + ++ +F
Sbjct: 281 TSWSQRHTFSIFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELEPPFEGLVDYRKVALF- 339
Query: 320 AEEDVPKLDTI----LTSIPPEVILRKQRLLANPSMK--RAMLFPQPAQ---PGDAFHQI 370
VP + T L S + R+ +L + ++ R + PAQ P D Q
Sbjct: 340 ----VPSVKTTEKGWLVSYLRAITARQLSMLRSHMLEFSRHFQYSSPAQQLGPEDLTWQA 395
Query: 371 LNGLAR--KLPHDKSTYLKPGGK 391
+ G + +L ++ L GG+
Sbjct: 396 VAGKLQSIRLHIRRAQRLVDGGR 418
Score = 45.8 bits (107), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 42/151 (27%), Positives = 64/151 (42%), Gaps = 33/151 (21%)
Query: 25 SGSAGDV---LEDDPVGK--LKVYVYELPSKYNKKLL----------------------L 57
+GSA D+ +ED L+VYVY++P K+ +LL L
Sbjct: 78 TGSAEDLDRGMEDRARSSQVLRVYVYDMPEKFTLQLLRLFRDTYKETANLTSNGSPVHRL 137
Query: 58 KDPRCLTHMFAAEIFM--HRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSP 115
+ + + A++ + LL S R LNP EAD FY P + T L +
Sbjct: 138 IEQHSIDYWLYADLLAPESQRLLKSVKRVLNPTEADIFYIPFFTTISYF---LMEKQQCK 194
Query: 116 RMMRSAIQLISSNWPYWNRTEGADHFFVVPH 146
++ R A+ + +N W R+ G DH V H
Sbjct: 195 QLYREALSWV-TNQAAWKRSGGRDHVLPVHH 224
>gi|116786286|gb|ABK24052.1| unknown [Picea sitchensis]
Length = 208
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 37/125 (29%), Positives = 60/125 (48%), Gaps = 13/125 (10%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGM---- 317
Y++ ++ A FCL P G + W+ R EA C+PVI++D I LPF + + + + +
Sbjct: 58 YFQHLRNAKFCLAPRGESSWTLRFYEAFFVECVPVILSDQIELPFQNVLDYSQFSIKWPA 117
Query: 318 -FVAEEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPG-DAFHQILNGLA 375
+ E + LD+I + +I R Q++ R + P G A IL L
Sbjct: 118 TRIGVELLEYLDSITDTEIKRMIARGQQV-------RCLWAYAPESVGCSAMTGILWELQ 170
Query: 376 RKLPH 380
RK+ H
Sbjct: 171 RKVRH 175
>gi|443721868|gb|ELU10993.1| hypothetical protein CAPTEDRAFT_157657 [Capitella teleta]
Length = 712
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/83 (30%), Positives = 49/83 (59%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q + FCL L +A+ GC+PVI+AD +LPF++ I W+ + VAE
Sbjct: 303 YPQVLQESEFCLVIRTSRLGQLTLSDAMKAGCVPVIVADSFILPFSEVIDWKRAAIVVAE 362
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
+++ ++ ++ +I + +L+ +R
Sbjct: 363 DNLSTVNEVVRAISRDSLLQMRR 385
>gi|449499718|ref|XP_004160896.1| PREDICTED: xyloglucan galactosyltransferase KATAMARI1 homolog
[Cucumis sativus]
Length = 471
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 44/147 (29%), Positives = 60/147 (40%), Gaps = 38/147 (25%)
Query: 35 DPVGKLKVYVYELPSKYNKKLLLK----DP---RC------------------------- 62
DP +V+VY+LP +NK LL DP RC
Sbjct: 67 DPCNSGRVFVYDLPPFFNKDLLQNCESLDPWTSRCDDVSNGGFGRQATELNGVVPDGLTP 126
Query: 63 ---LTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMR 119
+ + E MH +L+ RTL+PE A FY P Y + L L + + R
Sbjct: 127 AWFWSEQYMLEPIMHNRILNYKCRTLDPESATAFYIPFYAGLSIGRY-LWLNYTTSDRDR 185
Query: 120 SAIQLIS--SNWPYWNRTEGADHFFVV 144
+ +LI N PYWNR+ G DHF +
Sbjct: 186 DSEKLIEWVQNEPYWNRSNGGDHFITL 212
>gi|260826381|ref|XP_002608144.1| hypothetical protein BRAFLDRAFT_91377 [Branchiostoma floridae]
gi|229293494|gb|EEN64154.1| hypothetical protein BRAFLDRAFT_91377 [Branchiostoma floridae]
Length = 595
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 41/76 (53%)
Query: 269 AIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLD 328
A FC+ G L +A++ GCIPVI D V+PF++ + W+ + + EED+P +
Sbjct: 194 ATFCMILRGARMGQSALSDAMMAGCIPVIAIDTYVMPFSEVLDWKRAAVILREEDLPDVH 253
Query: 329 TILTSIPPEVILRKQR 344
+L I E I +R
Sbjct: 254 NVLRRISQERITNMRR 269
>gi|74201817|dbj|BAE28510.1| unnamed protein product [Mus musculus]
Length = 701
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 44/83 (53%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC G L + + GC+PV+IAD +LPF++ + W+ + V E
Sbjct: 291 YPQVLQEATFCTVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 350
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + + +IL +IP I QR
Sbjct: 351 EKMSDVYSILQNIPQRQIEEMQR 373
>gi|110637313|ref|YP_677520.1| hypothetical protein CHU_0899 [Cytophaga hutchinsonii ATCC 33406]
gi|110279994|gb|ABG58180.1| conserved hypothetical protein [Cytophaga hutchinsonii ATCC 33406]
Length = 330
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 30/73 (41%), Positives = 44/73 (60%), Gaps = 3/73 (4%)
Query: 261 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
TY ED+ + F LCP G A +S R++E + G +PVIIAD+ V PF +I + + +A
Sbjct: 180 TYLEDILSSYFVLCPRGIASYSHRIIETMALGSVPVIIADEWV-PF--SIEEDNYYVRIA 236
Query: 321 EEDVPKLDTILTS 333
E DV + IL +
Sbjct: 237 ESDVENIYAILKA 249
>gi|302809300|ref|XP_002986343.1| hypothetical protein SELMODRAFT_425371 [Selaginella moellendorffii]
gi|300145879|gb|EFJ12552.1| hypothetical protein SELMODRAFT_425371 [Selaginella moellendorffii]
Length = 421
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 42/143 (29%), Positives = 70/143 (48%), Gaps = 16/143 (11%)
Query: 260 TTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFV 319
T++ + +IFCL P G P S RL +A+V GCIPVI++D++ PF + + ++ +F
Sbjct: 281 TSWSQRHTFSIFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELEPPFEGLVDYRKVALF- 339
Query: 320 AEEDVPKLDTI----LTSIPPEVILRKQRLLANPSMK--RAMLFPQPAQ---PGDAFHQI 370
VP + T L S + R+ +L ++ R + PAQ P D Q
Sbjct: 340 ----VPSVKTTEKGWLVSYLRAITARQLSMLRGHMLEFSRHFQYSSPAQQLGPEDLTWQT 395
Query: 371 LNGLAR--KLPHDKSTYLKPGGK 391
+ G + +L ++ L GG+
Sbjct: 396 VAGKLQSIRLHIRRAQRLVDGGR 418
Score = 44.7 bits (104), Expect = 0.089, Method: Compositional matrix adjust.
Identities = 42/151 (27%), Positives = 63/151 (41%), Gaps = 33/151 (21%)
Query: 25 SGSAGDV---LEDDPVGK--LKVYVYELPSKYNKKLL----------------------L 57
+GSA D+ +ED L+VYVY++P K+ +LL L
Sbjct: 78 TGSAEDLDRGMEDRARSSQVLRVYVYDMPEKFTLQLLRLFRDTYKETANLTSNGSPVHRL 137
Query: 58 KDPRCLTHMFAAEIFM--HRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSP 115
+ + A++ + LL S R LNP EAD FY P + T L +
Sbjct: 138 IGQHSIDYWLYADLLAPESQRLLKSVKRVLNPTEADIFYIPFFTTISYF---LMEKQQCK 194
Query: 116 RMMRSAIQLISSNWPYWNRTEGADHFFVVPH 146
++ R A+ + +N W R+ G DH V H
Sbjct: 195 QLYREALSWV-TNQAAWKRSGGRDHVLPVHH 224
>gi|417404013|gb|JAA48784.1| Putative glycosyl transferase family 64 domain protein [Desmodus
rotundus]
Length = 701
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 44/83 (53%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC+ G L + + GC+PV+IAD +LPF++ + W+ + V E
Sbjct: 291 YPQVLQEATFCVVLRGARLGQAVLSDILRAGCVPVVIADSYILPFSEVLDWKRASVVVPE 350
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + + IL SIP I QR
Sbjct: 351 EKMSDVYGILQSIPQRQIGEMQR 373
>gi|226442845|ref|NP_034293.2| exostosin-2 [Mus musculus]
gi|341940670|sp|P70428.2|EXT2_MOUSE RecName: Full=Exostosin-2; AltName:
Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
acetylglucosaminyl-proteoglycan
4-alpha-N-acetylglucosaminyltransferase; AltName:
Full=Multiple exostoses protein 2 homolog
gi|74216635|dbj|BAE37749.1| unnamed protein product [Mus musculus]
gi|148695682|gb|EDL27629.1| exostoses (multiple) 2, isoform CRA_a [Mus musculus]
Length = 718
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 44/83 (53%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC G L + + GC+PV+IAD +LPF++ + W+ + V E
Sbjct: 308 YPQVLQEATFCTVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + + +IL +IP I QR
Sbjct: 368 EKMSDVYSILQNIPQRQIEEMQR 390
>gi|13879260|gb|AAH06597.1| Exostoses (multiple) 2 [Mus musculus]
Length = 718
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 44/83 (53%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC G L + + GC+PV+IAD +LPF++ + W+ + V E
Sbjct: 308 YPQVLQEATFCTVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + + +IL +IP I QR
Sbjct: 368 EKMSDVYSILQNIPQRQIEEMQR 390
>gi|255537445|ref|XP_002509789.1| Xyloglucan galactosyltransferase KATAMARI1, putative [Ricinus
communis]
gi|223549688|gb|EEF51176.1| Xyloglucan galactosyltransferase KATAMARI1, putative [Ricinus
communis]
Length = 567
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 83/346 (23%), Positives = 126/346 (36%), Gaps = 87/346 (25%)
Query: 42 VYVYELPSKYNKKLL---------------------------LKDPRCLTHMFAAEIFMH 74
+YVY+LPSK+NK LL L LTH ++ E H
Sbjct: 187 IYVYDLPSKFNKDLLGQCREMIPWTDFCKYFDNEAFGKPIEKLGKGWYLTHQYSLEPIFH 246
Query: 75 RFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKS-PRMMRSAIQLISSNW---- 129
+L P R N EA FY P Y D+ L FK+ ++ + L W
Sbjct: 247 SRILKHPCRVYNENEAKLFYVPYYGGLDI----LRWHFKNVSNDVKDTLALELLKWLESR 302
Query: 130 PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGS 189
W + G DH FV+ F + + + L L Q V+ +R +N+
Sbjct: 303 KTWLQNSGKDHVFVLGK-ISWDFRRKIDSSWGTRFLQLQQMQNPVKLLIERQPWDVNDIG 361
Query: 190 ITIPPYAPPQK---MQAHQIP-----------------PDTPRSIFVYFRGLFYDVNNDP 229
I P + P + A Q+ PD P SI R + +N
Sbjct: 362 IPHPTFFHPHSDDDIVAWQLKIIRTTRKNLLTFAGAARPDQPESI----RSIL--INQCT 415
Query: 230 EGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAV 289
G + + N K+ D P T E + FCL P G +P + +++
Sbjct: 416 SAGDKCK------FLNCKSG-----GCDRPETIIELFAESEFCLQPPGDSPTRKSVFDSL 464
Query: 290 VFGCIPVIIADDIVLPFAD--AIPWE------EIGMFVAEEDVPKL 327
+ GCIPV+ PF PW + +F+ +E+V ++
Sbjct: 465 ISGCIPVLFN-----PFTAYYQYPWHLPEDHSKYSVFIDQEEVRQM 505
>gi|74214457|dbj|BAE31083.1| unnamed protein product [Mus musculus]
Length = 690
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 44/83 (53%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC G L + + GC+PV+IAD +LPF++ + W+ + V E
Sbjct: 412 YPQVLQEATFCTVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 471
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + + +IL +IP I QR
Sbjct: 472 EKMSDVYSILQNIPQRQIEEMQR 494
>gi|198416199|ref|XP_002119284.1| PREDICTED: similar to exostoses (multiple) 2, partial [Ciona
intestinalis]
Length = 659
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 41/134 (30%), Positives = 60/134 (44%), Gaps = 9/134 (6%)
Query: 256 TDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEI 315
D Y E +Q FCL L+E++ GCIPV D +LPF++ + W
Sbjct: 292 NDEQKKYPEILQEGTFCLLLPTSRLGQSALMESMQAGCIPVFACDTYILPFSEVLDWSRA 351
Query: 316 GMFVAEEDVPKLDTILTSIPPE--VILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNG 373
+ + E+ +P + IL IP E V+++KQ S + PA QI+N
Sbjct: 352 SVLIREDSLPDIMNILRRIPHEQVVLMKKQVEFLYTSYFTNI----PAITMTTL-QIIND 406
Query: 374 LARKLPHDKSTYLK 387
R PH +TY K
Sbjct: 407 --RVFPHVATTYEK 418
>gi|188528923|ref|NP_001120887.1| exostosin 2 [Xenopus (Silurana) tropicalis]
gi|183986340|gb|AAI66243.1| ext2 protein [Xenopus (Silurana) tropicalis]
Length = 718
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 32/111 (28%), Positives = 56/111 (50%), Gaps = 9/111 (8%)
Query: 234 YARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGC 293
Y GA A ++N + D+P + +Q + FC+ G L + + GC
Sbjct: 289 YTDGAAAHRKRCYRN-----VVYDYP----QILQESTFCIVLRGARLGQSVLSDVLQAGC 339
Query: 294 IPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQR 344
+PVIIAD VLPF++ + W+ + + EE + ++ +IL ++P + QR
Sbjct: 340 VPVIIADSYVLPFSEVLDWKRASVVIPEEKMFEMYSILQAVPQRQLEEMQR 390
>gi|242018853|ref|XP_002429885.1| Exostosin-1, putative [Pediculus humanus corporis]
gi|212514919|gb|EEB17147.1| Exostosin-1, putative [Pediculus humanus corporis]
Length = 725
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 55/235 (23%), Positives = 99/235 (42%), Gaps = 24/235 (10%)
Query: 110 LPFKSPRMMRSAIQLISSNWP-YWNRTEGADH-FFVVPHDFGACFHYQEEKAIERGILPL 167
LP+ + L S WP Y G D+ F ++ + H++ + +PL
Sbjct: 165 LPYWNKGRNHVIFNLYSGTWPDYAEENLGFDYGFSILAKASMSTVHFRPNFDVS---IPL 221
Query: 168 LQRATLVQTFGQRNHVCLNEGSITIPPY--APPQKMQAHQIPPDTPRSIFVYFRGLFYDV 225
+ + G+ +V N + I Y A K H I +T S+ Y +
Sbjct: 222 FHKNHR-EKGGESGYVSSNNFPV-IKKYVLAFKGKRYVHGIGSETRNSL--------YHL 271
Query: 226 NNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLGWAPWSPR 284
+N+ + W+ K+ + + ++ YE + Q + FCL P G S R
Sbjct: 272 HNEKDMVLVTTCKHGKSWKEMKDERCDEDNQEYDRYDYEVLLQNSTFCLVPRGRRLGSFR 331
Query: 285 LVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVI 339
+EA+ GCIPV++++ VLPF + I W + ++ E +L +I P+++
Sbjct: 332 FLEALQAGCIPVLLSNGWVLPFQEVIDWTKAAVWADER------LLLQAIVPDIV 380
>gi|363742350|ref|XP_003642625.1| PREDICTED: exostosin-1b-like, partial [Gallus gallus]
Length = 456
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 54/103 (52%), Gaps = 1/103 (0%)
Query: 243 WENFKNNPLFDISTDHPT-TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 301
WE K+ + D+ Y E + + FC+ P G S R +EA+ CIPV+++D
Sbjct: 20 WERHKDTRCDQDNVDYEKFDYQELLHNSTFCIVPRGRRLGSFRFLEALQAACIPVLLSDG 79
Query: 302 IVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQR 344
LPF++AI W + + +E + ++ + + I PE +L Q+
Sbjct: 80 WELPFSEAIDWGKAAVVGSERLLLQIPSAVRCIRPERVLAFQQ 122
>gi|126322251|ref|XP_001370057.1| PREDICTED: exostosin-1 [Monodelphis domestica]
Length = 746
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y V+N + W+ K++ +T++ Y +M A FCL P G
Sbjct: 280 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDKDNTEYEKYDYREMLHNATFCLVPRG 339
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ C+PV++++ LPF++ I W + + E + ++ + + SI +
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWNQAAIIGDERLLLQIPSTIRSIHQD 399
Query: 338 VIL 340
IL
Sbjct: 400 KIL 402
>gi|395512319|ref|XP_003760388.1| PREDICTED: exostosin-1 [Sarcophilus harrisii]
Length = 746
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y V+N + W+ K++ +T++ Y +M A FCL P G
Sbjct: 280 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDKDNTEYEKYDYREMLHNATFCLVPRG 339
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ C+PV++++ LPF++ I W + + E + ++ + + SI +
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWNQAAIIGDERLLLQIPSTIRSIHQD 399
Query: 338 VIL 340
IL
Sbjct: 400 KIL 402
>gi|74213823|dbj|BAE29346.1| unnamed protein product [Mus musculus]
Length = 786
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 44/83 (53%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC G L + + GC+PV+IAD +LPF++ + W+ + V E
Sbjct: 376 YPQVLQEATFCTVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 435
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + + +IL +IP I QR
Sbjct: 436 EKMSDVYSILQNIPQRQIEEMQR 458
>gi|355686834|gb|AER98200.1| exostoses 2 [Mustela putorius furo]
Length = 717
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 43/83 (51%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC+ G L + + GC+PVI+AD VLPF++ + W+ + V E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAALSDVLRAGCVPVIVADSYVLPFSEVLDWKRASVVVPE 367
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + + IL IP I QR
Sbjct: 368 EKLSDVYGILQGIPRRQIEEMQR 390
>gi|449500758|ref|XP_004161187.1| PREDICTED: xyloglucan galactosyltransferase KATAMARI1 homolog
[Cucumis sativus]
Length = 618
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 80/318 (25%), Positives = 111/318 (34%), Gaps = 75/318 (23%)
Query: 38 GKLKVYVYELPSKYNKKLL---------------------------LKDPRCLTHMFAAE 70
G +YVYELP+K+NK+L+ L D T+ +A E
Sbjct: 222 GGRGIYVYELPAKFNKELVGQCGEMVPWMNFCKYFNNEGLGEKIPELGDGWYNTNQYALE 281
Query: 71 IFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKS-------PRMMRSAIQ 123
H +L P R N EEA FY P Y D+ L FK+ + IQ
Sbjct: 282 PIFHSRVLKHPCRVYNQEEAKLFYVPYYGGLDI----LRWHFKNNVTYELKDSLGLELIQ 337
Query: 124 LISSNWPYWNRTEGADHFFV---VPHDF-----GACFHYQEEK-AIERGILPLLQRATLV 174
+S+ P W + G DH FV + DF G Y + L L Q +
Sbjct: 338 WLSAQKP-WAKNSGKDHVFVLGKISWDFRRNANGNINGYNSNNPSWGTKFLELHQLQNPI 396
Query: 175 QTFGQRNHVCLNEGSITIPPYAPPQK---MQAHQIPPDTPRSIFVY-------------F 218
+ +R N+ I P + P + A Q R ++
Sbjct: 397 KLLIERQPWHQNDVGIPHPTFFHPHSDDDIFAWQWKAIRSRRKYLVGFAGGARPESSENI 456
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGW 278
R L D EGG R N K D P E + FCL P G
Sbjct: 457 RSLLIDHCTTTEGGRLCRHL------NCKKG-----DCDRPKAVIELFLESEFCLQPPGD 505
Query: 279 APWSPRLVEAVVFGCIPV 296
+P + ++++ GCIPV
Sbjct: 506 SPTRKSVFDSLISGCIPV 523
>gi|321463593|gb|EFX74608.1| hypothetical protein DAPPUDRAFT_307216 [Daphnia pulex]
Length = 724
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 76/321 (23%), Positives = 130/321 (40%), Gaps = 62/321 (19%)
Query: 38 GKLKVYVYELPSKYNKKLLLKDPRCLTHMFAAEIF-MHRFLLSSPVRTLNPEEADWFYTP 96
G++ VYVY + + + KD +T +E + + + + S T +P EA F
Sbjct: 110 GRIGVYVYPVV-----EFVDKDKVPITKKITSEFYNILKAITKSMYFTSDPNEACLFI-- 162
Query: 97 IYPTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADH--FFVVPHDFGACFHY 154
PT DL R+ + S+ PYWN EG +H F ++P G+ Y
Sbjct: 163 --PTIDL--------LNQNRIRPKDVGKALSSLPYWN--EGRNHLIFNILP---GSMPEY 207
Query: 155 QEEKAIERGILPLLQRATLVQTFGQRNHVCLNEG-SITIPPYAPPQKMQAHQIPPDTPRS 213
++ G + T G + + G +++P Y+P Q D R+
Sbjct: 208 HPFLEVDIGYA--------MMTSGGFSSLTYRSGFDVSVPVYSPLAAELRTQSSQD--RN 257
Query: 214 IFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDI--------------STDHP 259
V +++ P Y R + E +P F + D
Sbjct: 258 WLV--------ISSQP---YIHEDFREVISEMAAEHPGFLVLNSCGSSPLDTKLRCRDEE 306
Query: 260 TTYYED-MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMF 318
T + D + A FCL G P L+E++ GCIPV+++D VLP+ + I W+ +
Sbjct: 307 TYNFPDILMNATFCLVVRGARLGQPTLMESLAAGCIPVVVSDSYVLPYEEVIDWKSAVLQ 366
Query: 319 VAEEDVPKLDTILTSIPPEVI 339
+ E+D+ K+ +L + + I
Sbjct: 367 LYEDDLSKMMDLLRGVSSDRI 387
>gi|148695683|gb|EDL27630.1| exostoses (multiple) 2, isoform CRA_b [Mus musculus]
Length = 706
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 44/83 (53%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC G L + + GC+PV+IAD +LPF++ + W+ + V E
Sbjct: 372 YPQVLQEATFCTVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 431
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + + +IL +IP I QR
Sbjct: 432 EKMSDVYSILQNIPQRQIEEMQR 454
>gi|449449896|ref|XP_004142700.1| PREDICTED: xyloglucan galactosyltransferase KATAMARI1 homolog
[Cucumis sativus]
Length = 618
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 80/318 (25%), Positives = 111/318 (34%), Gaps = 75/318 (23%)
Query: 38 GKLKVYVYELPSKYNKKLL---------------------------LKDPRCLTHMFAAE 70
G +YVYELP+K+NK+L+ L D T+ +A E
Sbjct: 222 GGRGIYVYELPAKFNKELVGQCGEMVPWMNFCKYFNNEGLGEKIPELGDGWYNTNQYALE 281
Query: 71 IFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKS-------PRMMRSAIQ 123
H +L P R N EEA FY P Y D+ L FK+ + IQ
Sbjct: 282 PIFHSRVLKHPCRVYNQEEAKLFYVPYYGGLDI----LRWHFKNNVTYELKDSLGLELIQ 337
Query: 124 LISSNWPYWNRTEGADHFFV---VPHDF-----GACFHYQEEK-AIERGILPLLQRATLV 174
+S+ P W + G DH FV + DF G Y + L L Q +
Sbjct: 338 WLSAQKP-WAKNSGKDHVFVLGKISWDFRRNANGNINGYNSNNPSWGTKFLELHQLQNPI 396
Query: 175 QTFGQRNHVCLNEGSITIPPYAPPQK---MQAHQIPPDTPRSIFVY-------------F 218
+ +R N+ I P + P + A Q R ++
Sbjct: 397 KLLIERQPWHQNDVGIPHPTFFHPHSDDDIFAWQWKAIRSRRKYLVGFAGGARPESSENI 456
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGW 278
R L D EGG R N K D P E + FCL P G
Sbjct: 457 RSLLIDHCTTTEGGRLCRHL------NCKKG-----DCDRPKAVIELFLESEFCLQPPGD 505
Query: 279 APWSPRLVEAVVFGCIPV 296
+P + ++++ GCIPV
Sbjct: 506 SPTRKSVFDSLISGCIPV 523
>gi|406700155|gb|EKD03339.1| hypothetical protein A1Q2_02368 [Trichosporon asahii var. asahii
CBS 8904]
Length = 1041
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 50/106 (47%), Gaps = 2/106 (1%)
Query: 261 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
TY + + FC P G A W+ RL + V GCIPV++ D + + W + V
Sbjct: 891 TYLDLLGDTRFCPIPFGTAGWTYRLSDVVYAGCIPVLVGDQSHMTYWSMFDWSLFSVQVF 950
Query: 321 EEDVPKLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDA 366
E ++ L+ IL ++ E RKQ L ++ A L+P +A
Sbjct: 951 EHELDHLERILGAVTEEDAQRKQDALM--LVREAFLYPSEGHVEEA 994
>gi|291238132|ref|XP_002738985.1| PREDICTED: tout-velu-like, partial [Saccoglossus kowalevskii]
Length = 403
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 29/78 (37%), Positives = 45/78 (57%), Gaps = 2/78 (2%)
Query: 268 RAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKL 327
A FC+ G L +A+ GCIPV+++D VLPF+ + W+ + + EED+ ++
Sbjct: 1 EATFCIVLRGARLGQTALSDALKAGCIPVVLSDTYVLPFSQVLDWKRAAIRLNEEDLDQV 60
Query: 328 DTILTSIPPEVI--LRKQ 343
++L SI P I LRKQ
Sbjct: 61 ASVLRSISPTRINSLRKQ 78
>gi|412986477|emb|CCO14903.1| predicted protein [Bathycoccus prasinos]
Length = 574
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 50/187 (26%), Positives = 78/187 (41%), Gaps = 18/187 (9%)
Query: 140 HFFVVPHDFG---ACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYA 196
HF V + G A F Q++ A+ + L R + + +H NE
Sbjct: 328 HFIVSNSEIGVENAPFDAQKDVAVIPSLSFYLPREAVSRGLVDLSHFLRNEEEYASVGLE 387
Query: 197 PPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDIST 256
P K++A R+I + FRG NN G + R + + +
Sbjct: 388 LP-KLEAE-------RNIRLMFRG-----NN--RGPLREKVFRYLIENGSPEDSIETTGV 432
Query: 257 DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG 316
P Y M+ + +CL G SPRL+E ++FGC+PVI+AD LP A + W +
Sbjct: 433 ASPQAYMSLMEHSKYCLHVRGTRVMSPRLIELMLFGCVPVIVADAYELPLAWFLDWTKFS 492
Query: 317 MFVAEED 323
+ V E +
Sbjct: 493 IRVPESE 499
>gi|187469677|gb|AAI66748.1| Ext2 protein [Rattus norvegicus]
Length = 718
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 44/83 (53%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC G L + + GC+PV+IAD +LPF++ + W+ + + E
Sbjct: 308 YPQVLQEATFCTVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVIPE 367
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + + +IL +IP I QR
Sbjct: 368 EKMSDVYSILQNIPQRQIEEMQR 390
>gi|166183792|gb|ABY84155.1| exostosin 1 (predicted) [Callithrix jacchus]
Length = 746
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y V+N + W+ K++ +T++ Y +M A FCL P G
Sbjct: 280 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 339
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ C+PV++++ LPF++ I W + + E + ++ + + SI +
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 399
Query: 338 VIL 340
IL
Sbjct: 400 KIL 402
>gi|157820411|ref|NP_001101221.1| exostosin-2 [Rattus norvegicus]
gi|149022701|gb|EDL79595.1| exostoses (multiple) 2 (predicted) [Rattus norvegicus]
Length = 670
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 44/83 (53%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC G L + + GC+PV+IAD +LPF++ + W+ + + E
Sbjct: 308 YPQVLQEATFCTVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVIPE 367
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + + +IL +IP I QR
Sbjct: 368 EKMSDVYSILQNIPQRQIEEMQR 390
>gi|426235700|ref|XP_004011818.1| PREDICTED: exostosin-1 [Ovis aries]
Length = 746
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y V+N + W+ K++ +T++ Y +M A FCL P G
Sbjct: 280 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 339
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ C+PV++++ LPF++ I W + + E + ++ + + SI +
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 399
Query: 338 VIL 340
IL
Sbjct: 400 KIL 402
>gi|148697308|gb|EDL29255.1| exostoses (multiple) 1, isoform CRA_b [Mus musculus]
Length = 566
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y V+N + W+ K++ +T++ Y +M A FCL P G
Sbjct: 294 RNALYHVHNGEDVLLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 353
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ C+PV++++ LPF++ I W + + E + ++ + + SI +
Sbjct: 354 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 413
Query: 338 VIL 340
IL
Sbjct: 414 KIL 416
>gi|73974379|ref|XP_539145.2| PREDICTED: exostosin-1 isoform 1 [Canis lupus familiaris]
Length = 746
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y V+N + W+ K++ +T++ Y +M A FCL P G
Sbjct: 280 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 339
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ C+PV++++ LPF++ I W + + E + ++ + + SI +
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 399
Query: 338 VIL 340
IL
Sbjct: 400 KIL 402
>gi|197101852|ref|NP_001125538.1| exostosin-1 [Pongo abelii]
gi|75042009|sp|Q5RBC3.1|EXT1_PONAB RecName: Full=Exostosin-1; AltName:
Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
acetylglucosaminyl-proteoglycan
4-alpha-N-acetylglucosaminyltransferase; AltName:
Full=Multiple exostoses protein 1 homolog
gi|55728386|emb|CAH90937.1| hypothetical protein [Pongo abelii]
Length = 746
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y V+N + W+ K++ +T++ Y +M A FCL P G
Sbjct: 280 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 339
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ C+PV++++ LPF++ I W + + E + ++ + + SI +
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 399
Query: 338 VIL 340
IL
Sbjct: 400 KIL 402
>gi|74144073|dbj|BAE22143.1| unnamed protein product [Mus musculus]
Length = 435
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 44/83 (53%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC G L + + GC+PV+IAD +LPF++ + W+ + V E
Sbjct: 25 YPQVLQEATFCTVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 84
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + + +IL +IP I QR
Sbjct: 85 EKMSDVYSILQNIPQRQIEEMQR 107
>gi|46370066|ref|NP_000118.2| exostosin-1 [Homo sapiens]
gi|20141422|sp|Q16394.2|EXT1_HUMAN RecName: Full=Exostosin-1; AltName:
Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
acetylglucosaminyl-proteoglycan
4-alpha-N-acetylglucosaminyltransferase; AltName:
Full=Multiple exostoses protein 1; AltName:
Full=Putative tumor suppressor protein EXT1
gi|12654671|gb|AAH01174.1| Exostoses (multiple) 1 [Homo sapiens]
gi|119612378|gb|EAW91972.1| exostoses (multiple) 1 [Homo sapiens]
gi|123981930|gb|ABM82794.1| exostoses (multiple) 1 [synthetic construct]
gi|123996761|gb|ABM85982.1| exostoses (multiple) 1 [synthetic construct]
gi|189053697|dbj|BAG35949.1| unnamed protein product [Homo sapiens]
gi|261859910|dbj|BAI46477.1| exostoses (multiple) 1 [synthetic construct]
Length = 746
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y V+N + W+ K++ +T++ Y +M A FCL P G
Sbjct: 280 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 339
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ C+PV++++ LPF++ I W + + E + ++ + + SI +
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 399
Query: 338 VIL 340
IL
Sbjct: 400 KIL 402
>gi|147902262|ref|NP_001091564.1| exostosin-1 [Bos taurus]
gi|238686639|sp|A5D7I4.1|EXT1_BOVIN RecName: Full=Exostosin-1; AltName:
Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
acetylglucosaminyl-proteoglycan
4-alpha-N-acetylglucosaminyltransferase; AltName:
Full=Multiple exostoses protein 1 homolog
gi|146186820|gb|AAI40569.1| EXT1 protein [Bos taurus]
gi|296480555|tpg|DAA22670.1| TPA: exostosin-1 [Bos taurus]
Length = 746
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y V+N + W+ K++ +T++ Y +M A FCL P G
Sbjct: 280 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 339
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ C+PV++++ LPF++ I W + + E + ++ + + SI +
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 399
Query: 338 VIL 340
IL
Sbjct: 400 KIL 402
>gi|403283504|ref|XP_003933159.1| PREDICTED: exostosin-1 [Saimiri boliviensis boliviensis]
Length = 746
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y V+N + W+ K++ +T++ Y +M A FCL P G
Sbjct: 280 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 339
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ C+PV++++ LPF++ I W + + E + ++ + + SI +
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 399
Query: 338 VIL 340
IL
Sbjct: 400 KIL 402
>gi|281183009|ref|NP_001162444.1| exostosin-1 [Papio anubis]
gi|384475833|ref|NP_001245062.1| exostosin 1 [Macaca mulatta]
gi|114621433|ref|XP_001141496.1| PREDICTED: exostosin-1 isoform 1 [Pan troglodytes]
gi|397505670|ref|XP_003823375.1| PREDICTED: exostosin-1 [Pan paniscus]
gi|238687365|sp|A9X1C8.1|EXT1_PAPAN RecName: Full=Exostosin-1; AltName:
Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
acetylglucosaminyl-proteoglycan
4-alpha-N-acetylglucosaminyltransferase; AltName:
Full=Multiple exostoses protein 1 homolog
gi|163781066|gb|ABY40823.1| exostoses 1 (predicted) [Papio anubis]
gi|355698182|gb|EHH28730.1| Exostosin-1 [Macaca mulatta]
gi|383410151|gb|AFH28289.1| exostosin-1 [Macaca mulatta]
gi|387541472|gb|AFJ71363.1| exostosin-1 [Macaca mulatta]
gi|410224816|gb|JAA09627.1| exostosin 1 [Pan troglodytes]
gi|410259738|gb|JAA17835.1| exostosin 1 [Pan troglodytes]
gi|410302704|gb|JAA29952.1| exostosin 1 [Pan troglodytes]
gi|410353629|gb|JAA43418.1| exostosin 1 [Pan troglodytes]
Length = 746
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y V+N + W+ K++ +T++ Y +M A FCL P G
Sbjct: 280 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 339
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ C+PV++++ LPF++ I W + + E + ++ + + SI +
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 399
Query: 338 VIL 340
IL
Sbjct: 400 KIL 402
>gi|311253457|ref|XP_001925015.2| PREDICTED: exostosin-1 [Sus scrofa]
Length = 746
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y V+N + W+ K++ +T++ Y +M A FCL P G
Sbjct: 280 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 339
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ C+PV++++ LPF++ I W + + E + ++ + + SI +
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 399
Query: 338 VIL 340
IL
Sbjct: 400 KIL 402
>gi|301774588|ref|XP_002922714.1| PREDICTED: exostosin-1-like [Ailuropoda melanoleuca]
Length = 746
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y V+N + W+ K++ +T++ Y +M A FCL P G
Sbjct: 280 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 339
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ C+PV++++ LPF++ I W + + E + ++ + + SI +
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 399
Query: 338 VIL 340
IL
Sbjct: 400 KIL 402
>gi|395521813|ref|XP_003765009.1| PREDICTED: exostosin-like 1 [Sarcophilus harrisii]
Length = 694
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 32/144 (22%), Positives = 60/144 (41%), Gaps = 8/144 (5%)
Query: 203 AHQIPPDTPRSIFVYFRGLF------YDVNNDPEGGYYARGARAAVWENFKNNPLFDIST 256
H IPP P + F G + + D +++++ P T
Sbjct: 219 VHLIPP--PGEDLLAFEGQLDMWGTGFSIQQDQNSILNPSDHHGETIKSYRDGPCKQDHT 276
Query: 257 DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIG 316
+ + + FC P S RL++A+ GC+PV+++ LPFA+ I W
Sbjct: 277 IEQSEMQNGLHNSTFCFIPSSCHAGSFRLLQALKAGCVPVLLSRGWELPFAEVIDWGTAA 336
Query: 317 MFVAEEDVPKLDTILTSIPPEVIL 340
+ + E + ++ ++L +PP +L
Sbjct: 337 VIIDERHLLQIKSVLQGLPPARVL 360
>gi|1168162|gb|AAB62283.1| putative tumour suppressor/hereditary multiple exostoses candidate
gene [Homo sapiens]
gi|1586817|prf||2204384A EXT1 gene
Length = 746
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y V+N + W+ K++ +T++ Y +M A FCL P G
Sbjct: 280 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 339
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ C+PV++++ LPF++ I W + + E + ++ + + SI +
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 399
Query: 338 VIL 340
IL
Sbjct: 400 KIL 402
>gi|397643551|gb|EJK75938.1| hypothetical protein THAOC_02324 [Thalassiosira oceanica]
Length = 505
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 37/141 (26%), Positives = 60/141 (42%), Gaps = 32/141 (22%)
Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEED 323
+D A FCLCP G + + R +++ GCIPV+++ IVLPF I + +FVA +D
Sbjct: 351 QDTFEATFCLCPAGDSDVARRFFTSILAGCIPVVMSQHIVLPFESLIDYSTFVVFVAFDD 410
Query: 324 VPK---------------------------LDTILTSIPPEVILRKQRLLANPSMKRAML 356
D +L EV+ R++ LL ++ +
Sbjct: 411 TENAEKNILPTVGDKDEGSTVLRVSNFESVYDALLHMTEEEVLTRRRNLLC---VRDHFV 467
Query: 357 FPQP--AQPGDAFHQILNGLA 375
+ + PGDA I+ +A
Sbjct: 468 YRREPGGHPGDAVDTIVAEMA 488
>gi|168013242|ref|XP_001759310.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689623|gb|EDQ75994.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 392
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 66/277 (23%), Positives = 107/277 (38%), Gaps = 23/277 (8%)
Query: 64 THMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQ 123
TH +A E+ H ++ RT +P A+ FY P Y D+ L + R A+
Sbjct: 65 THQYALELVSHARIMKYKCRTEDPNLANLFYIPYYGGLDVIRWHFDL--NATNTNRDALG 122
Query: 124 LISSNW----PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQ 179
W P W R G DH V+ F Q +L + +++ +
Sbjct: 123 WKLVRWLEKQPSWRRRGGLDHLLVLGK-ISWDFRRQLRGNWGSRLLEFPEIQNMMRVMIE 181
Query: 180 RNHVCLNEGSITIPPYAPPQKMQA----HQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYA 235
RN N+ + P Y P+ Q R+ V F G + NDP A
Sbjct: 182 RNPWSKNDIGVPHPTYFHPKSASDIDTWLQHVKSQERTSLVAFVG--KERRNDPTNVRSA 239
Query: 236 --RGARAAVWENFKNNPLFDISTD---HPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVV 290
R R A E + D HP + + FC+ P+G +P + ++++
Sbjct: 240 LVRQCRGASSEAVCR--FVECKKDLCQHPVFVTKTFVTSQFCMQPVGDSPTRRSVFDSLI 297
Query: 291 FGCIPVIIA-DDIVLPFADAIPWEEI--GMFVAEEDV 324
GCIPV+ L +A +P E ++++E++V
Sbjct: 298 AGCIPVLFHPATAYLQYAWHLPRNESSWSVYISEDEV 334
>gi|356523730|ref|XP_003530488.1| PREDICTED: xyloglucan galactosyltransferase KATAMARI1-like [Glycine
max]
Length = 437
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 66/273 (24%), Positives = 108/273 (39%), Gaps = 36/273 (13%)
Query: 46 ELPSKYNKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTP 105
++ K N+K+LLK+ T+ F+ E+ H L T + A Y P Y D+
Sbjct: 73 KVSKKSNEKVLLKESFYATNQFSLEVIFHNTLKHYKCLTNDSSLASAIYVPYYAGLDVVQ 132
Query: 106 ---TGLPLPFK--SPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAI 160
G + + SP+ + + + P W R G DHF VV G+ F + E
Sbjct: 133 YLWGGFNVSIRDASPKELVKWL----AQQPEWKRMWGRDHFMVVGR-IGSDFRRRTENND 187
Query: 161 ERGI-LPLLQRATLVQTFGQRNHVCLNEGSITIPPYAPP----------QKMQAHQIP-- 207
+ G L LL A + + NE SI P Y P +KM+ + P
Sbjct: 188 DWGTKLMLLPEARNMSILSIESGSKENEFSIPYPTYFHPSKDKEVFQWQKKMRKVKRPYL 247
Query: 208 ---PDTPRSIFVYFRGLFY-DVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYY 263
PR + Y + ++ + + + N+ N+P+ H T +
Sbjct: 248 FSFAGAPRPYYNYLSSIIRNEIIKECQSSRSCKLLNCNAGHNYCNDPV------HVTKVF 301
Query: 264 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPV 296
Q ++FCL P G + ++++ GCIPV
Sbjct: 302 ---QSSVFCLQPPGDSFTRRSTFDSILAGCIPV 331
>gi|372266629|ref|ZP_09502677.1| exostosin [Alteromonas sp. S89]
Length = 328
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 47/86 (54%), Gaps = 3/86 (3%)
Query: 251 LFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAI 310
L +I Y E ++ FCLCP G P S RL EA+ FGC+PV+++DD+ L A
Sbjct: 215 LKEIKRREEREYVEVLKETKFCLCPSGSGPNSIRLWEAIKFGCVPVLLSDDLDLRGLSA- 273
Query: 311 PWEEIGMFVAEEDVPKLDTILTSIPP 336
+ + + ++E+V KL L + P
Sbjct: 274 --KALRVTESKEEVEKLPARLANTDP 297
>gi|358256597|dbj|GAA50186.1| glucuronyl/N-acetylglucosaminyl transferase EXT1 [Clonorchis
sinensis]
Length = 802
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 35/94 (37%), Positives = 52/94 (55%), Gaps = 4/94 (4%)
Query: 253 DISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPW 312
D++T Y E M + FCL P G S R +EA+ CIPV++++D LPF++ I W
Sbjct: 342 DMATYDQYDYNELMHNSTFCLVPRGRRLGSYRFLEALEASCIPVMLSNDWELPFSEVIDW 401
Query: 313 EEIGMFVAEEDVP-KLDTILTSIPPEVI--LRKQ 343
+ + A+E +P L +L IP I LR+Q
Sbjct: 402 SK-AVIWADEHLPLTLSLMLRRIPDYRIVQLRQQ 434
>gi|348588241|ref|XP_003479875.1| PREDICTED: exostosin-1 [Cavia porcellus]
Length = 746
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y V+N + W+ K++ +T++ Y +M A FCL P G
Sbjct: 280 RNALYHVHNGEDVLLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 339
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ C+PV++++ LPF++ I W + + E + ++ + + SI +
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 399
Query: 338 VIL 340
IL
Sbjct: 400 KIL 402
>gi|149721624|ref|XP_001496484.1| PREDICTED: exostosin-1 [Equus caballus]
Length = 746
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y V+N + W+ K++ +T++ Y +M A FCL P G
Sbjct: 280 RNALYHVHNGEDVLLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 339
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ C+PV++++ LPF++ I W + + E + ++ + + SI +
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 399
Query: 338 VIL 340
IL
Sbjct: 400 KIL 402
>gi|91076424|ref|XP_976077.1| PREDICTED: similar to exostosin-2 isoform 2 [Tribolium castaneum]
gi|270002589|gb|EEZ99036.1| hypothetical protein TcasGA2_TC004910 [Tribolium castaneum]
Length = 706
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 46/89 (51%), Gaps = 4/89 (4%)
Query: 256 TDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEI 315
D+P +E + FCL G L+EA+ CIPV++ D VLPF++ I W+
Sbjct: 297 VDYPAILHE----STFCLVFRGERIGQFALLEAMAANCIPVVVMDGAVLPFSNVIDWKRA 352
Query: 316 GMFVAEEDVPKLDTILTSIPPEVILRKQR 344
+F+ E + L +L I P+ I + Q+
Sbjct: 353 AVFIMENYLHTLVDVLEKISPQRIKQMQK 381
>gi|112807209|ref|NP_034292.2| exostosin-1 [Mus musculus]
gi|283837898|ref|NP_001124012.1| exostosin 1 [Rattus norvegicus]
gi|3023731|sp|P97464.1|EXT1_MOUSE RecName: Full=Exostosin-1; AltName:
Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
acetylglucosaminyl-proteoglycan
4-alpha-N-acetylglucosaminyltransferase; AltName:
Full=Multiple exostoses protein 1 homolog
gi|1813640|gb|AAB41728.1| Ext1 [Mus musculus]
gi|13435765|gb|AAH04741.1| Exostoses (multiple) 1 [Mus musculus]
gi|74205633|dbj|BAE21106.1| unnamed protein product [Mus musculus]
gi|148697307|gb|EDL29254.1| exostoses (multiple) 1, isoform CRA_a [Mus musculus]
gi|149066396|gb|EDM16269.1| similar to Ext1 [Rattus norvegicus]
Length = 746
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y V+N + W+ K++ +T++ Y +M A FCL P G
Sbjct: 280 RNALYHVHNGEDVLLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 339
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ C+PV++++ LPF++ I W + + E + ++ + + SI +
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 399
Query: 338 VIL 340
IL
Sbjct: 400 KIL 402
>gi|291388448|ref|XP_002710791.1| PREDICTED: exostosin 1 [Oryctolagus cuniculus]
Length = 746
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y V+N + W+ K++ +T++ Y +M A FCL P G
Sbjct: 280 RNALYHVHNGEDVLLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 339
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ C+PV++++ LPF++ I W + + E + ++ + + SI +
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 399
Query: 338 VIL 340
IL
Sbjct: 400 KIL 402
>gi|383847108|ref|XP_003699197.1| PREDICTED: exostosin-1-like [Megachile rotundata]
Length = 711
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 33/124 (26%), Positives = 57/124 (45%), Gaps = 1/124 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y ++N + + W ++ + ++ YE + A FCL P G
Sbjct: 254 RNALYHLHNGKDLVFVTTCRHGKAWRELQDEHCQQDNQEYDMYDYEILLMNATFCLVPRG 313
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ GCIPVI+++ LPF + I W + +F E + ++ I+ S+
Sbjct: 314 RRLGSFRFLEALRAGCIPVILSNGWALPFHERIDWTQAVIFSDERLLLQIPDIVRSVSNV 373
Query: 338 VILR 341
IL+
Sbjct: 374 QILK 377
>gi|358058807|dbj|GAA95770.1| hypothetical protein E5Q_02427 [Mixia osmundae IAM 14324]
Length = 492
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 29/71 (40%), Positives = 39/71 (54%), Gaps = 8/71 (11%)
Query: 40 LKVYVYELPSKYNKKLLLKDPR--------CLTHMFAAEIFMHRFLLSSPVRTLNPEEAD 91
K+YVY+LP K+ L+ C+ F AEI MHR+LL S RT + EEAD
Sbjct: 109 FKIYVYDLPQKHTTDLVESVMSLNYFCTWDCVHAQFTAEIQMHRYLLQSCARTDDIEEAD 168
Query: 92 WFYTPIYPTCD 102
+Y P+Y T +
Sbjct: 169 LYYLPVYVTAE 179
Score = 51.2 bits (121), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 45/158 (28%), Positives = 67/158 (42%), Gaps = 30/158 (18%)
Query: 242 VWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 301
+E F N D DHP M + FC P G P + RL A+ GCIPVI +D+
Sbjct: 338 TYERFGLNLTRD---DHP------MFHSAFCAVPAGDTPTTRRLFNAIFAGCIPVIFSDE 388
Query: 302 IVLPFADA-IPWE-----------EIGMFVAEEDVPKLDTILTSIPPEVILRKQR----- 344
+VLPF + IP+E E G+ +D+ ++ T+ + L R
Sbjct: 389 LVLPFHRSQIPYEDMLLRYPMDQDEAGLHKIYDDLHRMQLDGTADKMQRALNHHRRKLEF 448
Query: 345 ---LLANPSMK-RAMLFPQPAQPGDAFHQILNGLARKL 378
P + + P +Q GDAF + LA ++
Sbjct: 449 ELPFEMTPGVNYTSFRGPDRSQTGDAFDYGMRELAMRV 486
>gi|221506379|gb|EEE32014.1| conserved hypothetical protein [Toxoplasma gondii VEG]
Length = 1692
Score = 54.7 bits (130), Expect = 9e-05, Method: Composition-based stats.
Identities = 35/92 (38%), Positives = 46/92 (50%), Gaps = 4/92 (4%)
Query: 258 HPTTYYEDMQRAIFCLCPL-GWAPWSPRLVEAVVFGCIPVIIA--DDIVLPFADAIPWEE 314
H T + RA FCLCP GW P S + EAV CIPVII +++ LP W +
Sbjct: 1529 HAATKSFFISRATFCLCPQEGWLP-SMCIFEAVAHACIPVIIGGEEELWLPGGCLFDWIQ 1587
Query: 315 IGMFVAEEDVPKLDTILTSIPPEVILRKQRLL 346
+ +FV P IL P +L KQ++L
Sbjct: 1588 MAVFVPLSRAPYTSLILALTPDNQVLEKQQML 1619
>gi|237844077|ref|XP_002371336.1| hypothetical protein TGME49_017690 [Toxoplasma gondii ME49]
gi|211969000|gb|EEB04196.1| hypothetical protein TGME49_017690 [Toxoplasma gondii ME49]
Length = 1692
Score = 54.7 bits (130), Expect = 9e-05, Method: Composition-based stats.
Identities = 35/92 (38%), Positives = 46/92 (50%), Gaps = 4/92 (4%)
Query: 258 HPTTYYEDMQRAIFCLCPL-GWAPWSPRLVEAVVFGCIPVIIA--DDIVLPFADAIPWEE 314
H T + RA FCLCP GW P S + EAV CIPVII +++ LP W +
Sbjct: 1529 HAATKSFFISRATFCLCPQEGWLP-SMCIFEAVAHACIPVIIGGEEELWLPGGCLFDWIQ 1587
Query: 315 IGMFVAEEDVPKLDTILTSIPPEVILRKQRLL 346
+ +FV P IL P +L KQ++L
Sbjct: 1588 MAVFVPLSRAPYTSLILALTPDNQVLEKQQML 1619
>gi|183637125|gb|ACC64545.1| exostosin 1 (predicted) [Rhinolophus ferrumequinum]
Length = 746
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y V+N + W+ K++ +T++ Y +M A FCL P G
Sbjct: 280 RNALYHVHNGEDVLLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 339
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ C+PV++++ LPF++ I W + + E + ++ + + SI +
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 399
Query: 338 VIL 340
IL
Sbjct: 400 KIL 402
>gi|350537983|ref|NP_001233696.1| exostosin-1 [Cricetulus griseus]
gi|20138355|sp|Q9JK82.1|EXT1_CRIGR RecName: Full=Exostosin-1; AltName: Full=Heparan sulfate
copolymerase; AltName: Full=Multiple exostoses protein 1
homolog
gi|7960285|gb|AAF71276.1|AF252858_1 exostosin 1 [Cricetulus griseus]
Length = 746
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y V+N + W+ K++ +T++ Y +M A FCL P G
Sbjct: 280 RNALYHVHNGEDVLLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 339
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ C+PV++++ LPF++ I W + + E + ++ + + SI +
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 399
Query: 338 VIL 340
IL
Sbjct: 400 KIL 402
>gi|168067686|ref|XP_001785740.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162662615|gb|EDQ49446.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 392
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 73/308 (23%), Positives = 122/308 (39%), Gaps = 34/308 (11%)
Query: 64 THMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKS-PRMMRSAI 122
T F EI H L P RT N EEA FY P Y DL F + R+ I
Sbjct: 66 TDQFTIEITFHERLKIHPCRTTNTEEASLFYIPFYHGIDLIKNLYNTDFVARDRLTLLFI 125
Query: 123 QLISSNWPYWNRTEGADHFFVVPH---DFGACFHYQEEKAIERGILPLLQRATLVQTFGQ 179
+ + S P W R +G H V+ DF Y ++K +L + + + +
Sbjct: 126 KWLRSQKP-WQRYQGKRHVLVLGRIVWDF--IRDYSKDKTWGSSLLTHPELTNVTKLLIE 182
Query: 180 RNHVCLNEGSITIP---PYAPPQK--MQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYY 234
R+ + ++ +P + P + ++A Q R++ + R F +
Sbjct: 183 RD--IWKDDTLGVPYPTSFHPSSESDLRAWQ------RTVRTFKRHKFVSLAGATRDNKL 234
Query: 235 ARGARAAVWENFKNNP-LFDISTD------HPTTYYEDMQRAIFCLCPLGWAPWSPRLVE 287
R AV+E N+ I+ + +P + ++FCL P G +P + +
Sbjct: 235 TGLIRDAVFEQCANSSRCHSIACNDGWCKRNPQVIVQMGLESVFCLQPPGDSPTRKGIFD 294
Query: 288 AVVFGCIPVII-ADDIVLPFADAIP--WEEIGMFVAEEDVPKLD----TILTSIPPEVIL 340
++ GCIPV+ L + +P + + V+EEDV + L+ IP +
Sbjct: 295 SLQTGCIPVVFNRQQAALQYLSHLPGNHSDYSVIVSEEDVCDHNYDIMNHLSRIPLAEVA 354
Query: 341 RKQRLLAN 348
R Q + N
Sbjct: 355 RMQANVVN 362
>gi|380027202|ref|XP_003697318.1| PREDICTED: exostosin-1-like [Apis florea]
Length = 711
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 33/124 (26%), Positives = 57/124 (45%), Gaps = 1/124 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y ++N + + W ++ + ++ YE + A FCL P G
Sbjct: 254 RNALYHLHNGKDLVFVTTCRHGKAWRELQDEHCQQDNQEYDMYDYEILLMNATFCLVPRG 313
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ GCIPVI+++ LPF + I W + +F E + ++ I+ S+
Sbjct: 314 RRLGSFRFLEALRAGCIPVILSNGWALPFHERIDWTQAVIFSDERLLLQIPDIVRSVSNV 373
Query: 338 VILR 341
IL+
Sbjct: 374 HILK 377
>gi|1235559|emb|CAA65443.1| ext1 [Mus musculus]
Length = 745
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y V+N + W+ K++ +T++ Y +M A FCL P G
Sbjct: 280 RNALYHVHNGEDVLLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 339
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ C+PV++++ LPF++ I W + + E + ++ + + SI +
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 399
Query: 338 VIL 340
IL
Sbjct: 400 KIL 402
>gi|326918032|ref|XP_003205297.1| PREDICTED: exostosin-1-like, partial [Meleagris gallopavo]
Length = 535
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 32/123 (26%), Positives = 59/123 (47%), Gaps = 1/123 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y V+N + W+ K++ +T++ Y +M A FCL P G
Sbjct: 118 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 177
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ C+PV++++ LPF++ I W++ + E + ++ + + SI +
Sbjct: 178 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWKQAAVIGDERLLLQIPSTIRSIHQD 237
Query: 338 VIL 340
IL
Sbjct: 238 KIL 240
>gi|66517433|ref|XP_391845.2| PREDICTED: exostosin-1 [Apis mellifera]
Length = 711
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 33/124 (26%), Positives = 57/124 (45%), Gaps = 1/124 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y ++N + + W ++ + ++ YE + A FCL P G
Sbjct: 254 RNALYHLHNGKDLVFVTTCRHGKAWRELQDEHCQQDNQEYDMYDYEILLMNATFCLVPRG 313
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ GCIPVI+++ LPF + I W + +F E + ++ I+ S+
Sbjct: 314 RRLGSFRFLEALRAGCIPVILSNGWALPFHERIDWTQAVIFSDERLLLQIPDIVRSVSNV 373
Query: 338 VILR 341
IL+
Sbjct: 374 HILK 377
>gi|355686831|gb|AER98199.1| exostoses 1 [Mustela putorius furo]
Length = 702
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y V+N + W+ K++ +T++ Y +M A FCL P G
Sbjct: 236 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 295
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ C+PV++++ LPF++ I W + + E + ++ + + SI +
Sbjct: 296 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 355
Query: 338 VIL 340
IL
Sbjct: 356 KIL 358
>gi|190402228|gb|ACE77644.1| exostosin 1 (predicted) [Sorex araneus]
Length = 746
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y V+N + W+ K++ +T++ Y +M A FCL P G
Sbjct: 280 RNALYHVHNGEDILLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 339
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ C+PV++++ LPF++ I W + + E + ++ + + SI +
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 399
Query: 338 VIL 340
IL
Sbjct: 400 KIL 402
>gi|226526925|gb|ACO71282.1| exostoses 1 (predicted) [Dasypus novemcinctus]
Length = 744
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y V+N + W+ K++ +T++ Y +M A FCL P G
Sbjct: 278 RNALYHVHNGEDILLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 337
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ C+PV++++ LPF++ I W + + E + ++ + + SI +
Sbjct: 338 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 397
Query: 338 VIL 340
IL
Sbjct: 398 KIL 400
>gi|440797580|gb|ELR18663.1| exostosin, putative [Acanthamoeba castellanii str. Neff]
Length = 423
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 33/121 (27%), Positives = 52/121 (42%), Gaps = 9/121 (7%)
Query: 227 NDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLV 286
+DP+ G + +V E P ++ Y + + F LCP G A +S R
Sbjct: 251 HDPDNGVIIQ---TSVEEQIGGKPSVEVE------YLDTLFHTQFTLCPRGRALYSYRTT 301
Query: 287 EAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILRKQRLL 346
EA+ G IPVI+ D P+ + I W + + E + +L S E I R +R +
Sbjct: 302 EAIAAGAIPVILGDGYAFPYNELIDWRSFAVILPESSWETMMDVLRSFTSEEIARMRRNM 361
Query: 347 A 347
Sbjct: 362 G 362
>gi|260786761|ref|XP_002588425.1| hypothetical protein BRAFLDRAFT_153069 [Branchiostoma floridae]
gi|229273587|gb|EEN44436.1| hypothetical protein BRAFLDRAFT_153069 [Branchiostoma floridae]
Length = 107
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 44/79 (55%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + + FCL P G S R +EA+ CIPVI A+ LPF++ I W+ + E
Sbjct: 3 YNTLLHNSTFCLVPRGRRLGSFRFLEALQAACIPVIEANGWELPFSEVIEWDRATITADE 62
Query: 322 EDVPKLDTILTSIPPEVIL 340
+ +L +IL +IPPE IL
Sbjct: 63 RLLFQLPSILRAIPPEKIL 81
>gi|449495258|ref|XP_004159780.1| PREDICTED: xyloglucan galactosyltransferase KATAMARI1-like [Cucumis
sativus]
Length = 490
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 75/358 (20%), Positives = 126/358 (35%), Gaps = 45/358 (12%)
Query: 52 NKKLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPT----G 107
++ + L + T+ F E+ H + T + A Y P Y D++
Sbjct: 120 SRGVFLNNSWFKTNQFLLEVIFHNKMKQYECLTNDSAMASAVYVPFYAGLDISHYLWNPS 179
Query: 108 LPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGI-LP 166
+ + S R S+I S P W R G DHFFV F Q ++ + G L
Sbjct: 180 ITIRDSSARDFLSSI----SEKPEWKRMFGRDHFFVAGR-ISWDFRRQTDEVSDWGSKLR 234
Query: 167 LLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQKMQA----HQIPPDTPRSIFVYFRGLF 222
L + + N+ +I P Y P K+ + R F G
Sbjct: 235 FLSESHNMTMLSVEASSWKNDFAIPYPTYFHPSKLSEIVEWQSLMRARQRQHLFTFTGAP 294
Query: 223 YDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWS 282
D G R + F + ++ D PTT E Q +IFCL P G +
Sbjct: 295 RPDLTDSIRGMVIEQCRGSSLCKFIDCSSDGVNCDDPTTTMEAFQSSIFCLQPPGDSYTR 354
Query: 283 PRLVEAVVFGCIPVII------------------ADDIVLPFADAIPWEEIGMFVAEEDV 324
+ ++++ GCIPV A + +P + W+E
Sbjct: 355 RSIFDSILAGCIPVFFHPGTAYSQYLWHFPKNQTAYSVFIPVRNVKKWDE---------- 404
Query: 325 PKLDTILTSIPP--EVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLPH 380
++ IL+ I E +R++ + PS+ + DAF + G+ ++ +
Sbjct: 405 -SIEGILSGISKDRESSMREEVIRVIPSIVYGDPRSKIGNLEDAFDLAVKGILERVEN 461
>gi|159477445|ref|XP_001696821.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158275150|gb|EDP00929.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 191
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 47/158 (29%), Positives = 66/158 (41%), Gaps = 23/158 (14%)
Query: 181 NHVCLNEGSITIPPYAPPQKMQAHQ---IPPDTPRSIFVYFRGLFYDVNNDP-EGGYYAR 236
H C G + P P AH P R I +Y RG + P +Y+R
Sbjct: 37 GHPCYTPGKDLVIPLFKPPGHYAHSPLLGAPPLQRDILLYLRG-----DTGPYRAHWYSR 91
Query: 237 GARAAVWENFKNNPLFDISTDH----------PTTYYEDMQRAIFCLCPLGWAPWSPRLV 286
G R + K + D + +H P TY E + R+IFC+ G +S R
Sbjct: 92 GIRQRL---AKLAYMHDWAEEHRIFVGEQFMIPGTYSEHLARSIFCVVAPG-DGYSGRGE 147
Query: 287 EAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDV 324
+AV+ GCIP+II D + F I W + +AE V
Sbjct: 148 DAVLHGCIPLIIMDGVHAVFESIIDWSAFSIRIAESAV 185
>gi|380799435|gb|AFE71593.1| exostosin-1, partial [Macaca mulatta]
Length = 542
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y V+N + W+ K++ +T++ Y +M A FCL P G
Sbjct: 76 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 135
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ C+PV++++ LPF++ I W + + E + ++ + + SI +
Sbjct: 136 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 195
Query: 338 VIL 340
IL
Sbjct: 196 KIL 198
>gi|294881106|ref|XP_002769247.1| hypothetical protein Pmar_PMAR007658 [Perkinsus marinus ATCC 50983]
gi|239872525|gb|EER01965.1| hypothetical protein Pmar_PMAR007658 [Perkinsus marinus ATCC 50983]
Length = 75
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 21/53 (39%), Positives = 34/53 (64%)
Query: 261 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWE 313
TY +M+ + FC P G PW+ R+ +A++ GCIPV++++ IV PF + W
Sbjct: 12 TYDAEMKDSTFCFIPRGNTPWTRRIFDAIISGCIPVVLSNAIVFPFESLLDWS 64
>gi|219129557|ref|XP_002184953.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217403738|gb|EEC43689.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 982
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 64/286 (22%), Positives = 106/286 (37%), Gaps = 45/286 (15%)
Query: 67 FAAEIFMHRFLLS--SPVRTLNPEEADWFYTPIYPTCD------------LTPTGLP-LP 111
+A E+ + +++ S +RT NP+EA FY P P+ + L+P G L
Sbjct: 606 YATELLVREVMINPKSCLRTYNPDEATLFYVPYLPSVEHHKGSKYINDMALSPYGNAILD 665
Query: 112 FKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGIL------ 165
+ YW R GADH V +H ++ + I
Sbjct: 666 ILDKDNYTAWENTFGLTAKYWKRHGGADHILVFSEPMHGLWHPRQRRGNYHFIHSQKQLH 725
Query: 166 -PLLQRATLVQTFGQRNHVCLNEGSITIPPYAPPQ----KMQAHQIPPDTP--RSIFVYF 218
P++ L TF + C + + P + K + + T S+ V
Sbjct: 726 PPIVISVELSTTFVKMYPKCAAKNILMPYPNTDGRWFNGKHHSEAVKASTAWNASLKVSI 785
Query: 219 RGL----------------FYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTY 262
L FY N + + + ++ LF + ++Y
Sbjct: 786 AALPEEQLLGQEPARPIAQFYGAGNHGTCKQLRQAMASDYSQCALSSKLFKQNV-KISSY 844
Query: 263 YEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFAD 308
M A FC CP G +P + R+ +AV+ GCIP+I++ D V PF +
Sbjct: 845 VIGMNLASFCPCPGGDSPSAKRMFDAVLAGCIPIILSQDFVWPFTN 890
>gi|77748248|gb|AAI05840.1| Ext1 protein [Rattus norvegicus]
Length = 557
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 32/123 (26%), Positives = 58/123 (47%), Gaps = 1/123 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y V+N + W+ K++ +T++ Y +M A FCL P G
Sbjct: 91 RNALYHVHNGEDVLLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 150
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ C+PV++++ LPF++ I W + + E + ++ + + SI +
Sbjct: 151 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 210
Query: 338 VIL 340
IL
Sbjct: 211 KIL 213
>gi|118781611|ref|XP_311558.3| AGAP010388-PA [Anopheles gambiae str. PEST]
gi|116130033|gb|EAA07205.3| AGAP010388-PA [Anopheles gambiae str. PEST]
Length = 744
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 52/99 (52%), Gaps = 1/99 (1%)
Query: 243 WENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 301
W + ++ + + ++ YE + Q + FCL P G S R +E + GCIPV++++
Sbjct: 298 WRDLQDARCDEDNREYDRYDYETLLQNSTFCLVPRGRRLGSFRFLEVLQAGCIPVLLSNS 357
Query: 302 IVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVIL 340
VLPF I W++ ++ E + ++ I+ SI IL
Sbjct: 358 WVLPFQSKIDWKQAAIWADERLLLQVPDIVRSISTSRIL 396
>gi|449662559|ref|XP_002154865.2| PREDICTED: exostosin-2-like [Hydra magnipapillata]
Length = 729
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 35/59 (59%)
Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDV 324
+Q FCL G+ S L++A++ GCIPVI+ +D VLPF + I W + V E+ +
Sbjct: 349 LQEGTFCLLLPGYYYGSSLLLDAMMMGCIPVIMMNDYVLPFNEVIDWSRAAIIVREQQI 407
>gi|241997492|ref|XP_002433395.1| exostosin-2, putative [Ixodes scapularis]
gi|215490818|gb|EEC00459.1| exostosin-2, putative [Ixodes scapularis]
Length = 714
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/139 (28%), Positives = 64/139 (46%), Gaps = 14/139 (10%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + + FCL L +A++ GC+PVI+AD+ +LPF++ + W+ + + E
Sbjct: 300 YPDILAEGKFCLVVRAARLGQSVLSDALMAGCVPVIVADEYILPFSEVLDWKRAAIQIRE 359
Query: 322 EDVPKLDTILTSIPPEVI--LRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLARKLP 379
+D+ L T+L + + +R Q LL R+M A+ ++N R P
Sbjct: 360 DDLEDLVTVLKGVSKARLFEMRSQALLLWDRYFRSM-----AKIALTTLDVIND--RVFP 412
Query: 380 HDKSTY-----LKPGGKFL 393
H TY L PG F
Sbjct: 413 HIAKTYEDWNDLPPGVSFF 431
>gi|357481629|ref|XP_003611100.1| Xyloglucan galactosyltransferase KATAMARI1 [Medicago truncatula]
gi|355512435|gb|AES94058.1| Xyloglucan galactosyltransferase KATAMARI1 [Medicago truncatula]
Length = 547
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 69/309 (22%), Positives = 112/309 (36%), Gaps = 44/309 (14%)
Query: 25 SGSAGDVLEDDPVGKLKVYVYELPSKYNKKLLLKDPRCL--------------------- 63
S + + DD G +YVY+LPSK+NK L+ + L
Sbjct: 151 SENKNNATCDDGKG---IYVYDLPSKFNKDLVGQCSDMLPWQDFCRYISNEGFGEPISKL 207
Query: 64 ------THMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKS-PR 116
TH ++ E+ H +L P R N +A FY P Y D+ L F++
Sbjct: 208 GKGWYKTHQYSLELIFHSKVLKHPCRVYNENDAKLFYVPFYGGLDV----LRWHFQNVSN 263
Query: 117 MMRSAIQLISSNW----PYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRAT 172
++ ++ L W W R G DH FV+ F + +L L +
Sbjct: 264 DVKDSLSLELVKWLERQVNWKRNLGKDHVFVLGK-ISWDFRRTSDSPWGTRLLELEKLQN 322
Query: 173 LVQTFGQRNHVCLNEGSITIPPYAPPQK----MQAHQIPPDTPRSIFVYFRGLFYDVNND 228
++ +R +N+ I P Y P+ + + R V F G D +D
Sbjct: 323 PIKLLIERQPWHVNDIGIPHPTYFHPKSDNDIIDWQLKIIRSNRRNLVSFAGAARDHADD 382
Query: 229 PEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEA 288
+ K + P + E + FCL P G +P + ++
Sbjct: 383 HIRSILINQCSSESDGKCKFLNCSSAKCNEPESIIELFVESEFCLQPPGDSPTRKSVFDS 442
Query: 289 VVFGCIPVI 297
++ GCIPV+
Sbjct: 443 LISGCIPVL 451
>gi|405969796|gb|EKC34747.1| Exostosin-2 [Crassostrea gigas]
Length = 698
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 46/84 (54%)
Query: 261 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVA 320
TY + +Q + FC+ L +A+ GCIPVI+AD VLPF++ + W+ + +
Sbjct: 289 TYPDILQDSKFCMVLRSARLGHTALSDALRTGCIPVIVADGYVLPFSEVLDWKRAAVVIR 348
Query: 321 EEDVPKLDTILTSIPPEVILRKQR 344
EE++ + +L S E I + +R
Sbjct: 349 EENLKDVVEVLKSYSMERIYQMRR 372
>gi|390332053|ref|XP_003723408.1| PREDICTED: LOW QUALITY PROTEIN: exostosin-2-like
[Strongylocentrotus purpuratus]
Length = 707
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 45/80 (56%), Gaps = 2/80 (2%)
Query: 266 MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVP 325
+Q A FC+ L +A+ GCIPVII+D +LPF++ I W+ + V E+ +P
Sbjct: 301 LQDATFCIVLRRTRLGQAALSDALQAGCIPVIISDAYILPFSEVIDWKRASLVVREDRIP 360
Query: 326 KLDTILTSIPPEVI--LRKQ 343
L IL ++ E I +R+Q
Sbjct: 361 DLPDILHAVELEHIYEMRQQ 380
>gi|390338044|ref|XP_783281.3| PREDICTED: exostosin-1b-like [Strongylocentrotus purpuratus]
Length = 702
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 79/329 (24%), Positives = 130/329 (39%), Gaps = 69/329 (20%)
Query: 40 LKVYVYELPSKYNK-KLLLKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY 98
KVY+Y P Y K L D LT + A+ F T N +EA F T I
Sbjct: 63 FKVYIY--PVSYQKPSQLYSD--ILTSIRASRYF-----------TTNADEACLFVTSI- 106
Query: 99 PTCDLTPTGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEK 158
T D L + + S+I +S +WN G +H + G Y E+
Sbjct: 107 DTLDRDK----LSSTYVKNIESSISKLS----HWN--NGKNHLIFNLYS-GTWPDYSEDL 155
Query: 159 AIERGILPLLQRATLVQTFGQRNHVCLNEGSITIPPYAP--PQK------MQAHQIPPDT 210
G +L +A+ + + + N I++P +A PQK +Q + P
Sbjct: 156 GFNIG-EAILAKASFSEQYFRPNF------DISLPLFAKTHPQKGGNSGDLQGNNFP--V 206
Query: 211 PRSIFVYFRG--LFYDVNNDPEGGYY-----------ARGARAAVWENFKNNPLFDISTD 257
R + F+G + + +D Y W+ K D+ D
Sbjct: 207 QRKYLLAFKGKRYLFGIGSDTRNALYHLHNGVDIILLTTCKHGKNWQKHK-----DVRCD 261
Query: 258 HPTTYYED------MQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
H ++ + + FCL P G S R +E++ CIP+++++ LPF++ I
Sbjct: 262 HDNAEFDKFDYQVLLHNSTFCLVPRGRRLGSFRFLESLQAACIPMLLSNGWELPFSEVID 321
Query: 312 WEEIGMFVAEEDVPKLDTILTSIPPEVIL 340
W + +F E + ++ +I+ SI E IL
Sbjct: 322 WSKAVVFGDERLLLQVPSIVRSITAEQIL 350
>gi|387015838|gb|AFJ50038.1| Exostosin-1-like [Crotalus adamanteus]
Length = 751
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 32/123 (26%), Positives = 57/123 (46%), Gaps = 1/123 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y V+N + W+ K++ + ++ Y +M A FCL P G
Sbjct: 285 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDRDNAEYEKYDYREMLHNATFCLVPRG 344
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ CIPV++++ LPF++ I W + + E + ++ + + SI +
Sbjct: 345 RRLGSFRFLEALQAACIPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 404
Query: 338 VIL 340
IL
Sbjct: 405 KIL 407
>gi|363731065|ref|XP_003640902.1| PREDICTED: exostosin-1-like [Gallus gallus]
Length = 740
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 31/123 (25%), Positives = 58/123 (47%), Gaps = 1/123 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y V+N + W+ K++ + ++ Y +M A FCL P G
Sbjct: 274 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDRDNAEYEKYDYREMLHNATFCLVPRG 333
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ C+PV++++ LPF++ I W++ + E + ++ + + SI +
Sbjct: 334 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWKQAAVIGDERLLLQIPSTIRSIHQD 393
Query: 338 VIL 340
IL
Sbjct: 394 KIL 396
>gi|255633864|gb|ACU17293.1| unknown [Glycine max]
Length = 57
Score = 53.1 bits (126), Expect = 2e-04, Method: Composition-based stats.
Identities = 23/26 (88%), Positives = 26/26 (100%)
Query: 22 ERISGSAGDVLEDDPVGKLKVYVYEL 47
ERISGSAGDVLE+DPVG+LKV+VYEL
Sbjct: 32 ERISGSAGDVLENDPVGRLKVFVYEL 57
>gi|224046654|ref|XP_002200457.1| PREDICTED: exostosin-1 [Taeniopygia guttata]
Length = 741
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 31/123 (25%), Positives = 57/123 (46%), Gaps = 1/123 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y V+N + W+ K++ + ++ Y +M A FCL P G
Sbjct: 275 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDRDNAEYEKYDYREMLHNATFCLVPRG 334
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ C+PV++++ LPF++ I W + + E + ++ + + SI +
Sbjct: 335 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWNQAAIIGDERLLLQIPSTIRSIHQD 394
Query: 338 VIL 340
IL
Sbjct: 395 KIL 397
>gi|452825321|gb|EME32318.1| glucuronyl/N-acetylglucosaminyl transferase EXT1 [Galdieria
sulphuraria]
Length = 354
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 90/207 (43%), Gaps = 47/207 (22%)
Query: 130 PYWNRTEGADHFFVVPHDF--GACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNE 187
PYWN G +H + DF G C E + G LL+ F R++ +
Sbjct: 144 PYWNL--GINHMVL---DFLDGKC-----EGQHDVGFAALLKTHLSSDIF--RSNF---D 188
Query: 188 GSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFK 247
S+ +PP + H + P+ + F+G Y + + G ++ NF
Sbjct: 189 KSLPLPP-------RQHFRILEHPKKFLITFKGTSYPL--------HTAGGIRSILNNFH 233
Query: 248 N--------NPLFDISTDHPT--TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVI 297
N P F + T +Y E ++++ FCL P G +P + RL+E + GCIPV
Sbjct: 234 NGKDIIFVTKPQFSVPTPEYDLWSYEELVEQSKFCLVPQGRSPSTFRLLEVMSAGCIPVF 293
Query: 298 IADDI----VLPFADAIPWEEIGMFVA 320
+ D ++P +D I WE+I F+A
Sbjct: 294 LFDTTAAKYIMPISDEIAWEQIS-FIA 319
>gi|327280286|ref|XP_003224883.1| PREDICTED: exostosin-1-like [Anolis carolinensis]
Length = 753
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/123 (25%), Positives = 57/123 (46%), Gaps = 1/123 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y V+N + W+ K++ + ++ Y +M A FCL P G
Sbjct: 287 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDRDNAEYEKYDYREMLHNATFCLVPRG 346
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ C+PV++++ LPF++ I W + + E + ++ + + SI +
Sbjct: 347 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWNQAAVIGDERLLLQIPSTIRSIHQD 406
Query: 338 VIL 340
IL
Sbjct: 407 KIL 409
>gi|395818023|ref|XP_003782438.1| PREDICTED: exostosin-1 [Otolemur garnettii]
gi|197215619|gb|ACH53015.1| exostosin 1 (predicted) [Otolemur garnettii]
Length = 746
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/123 (25%), Positives = 57/123 (46%), Gaps = 1/123 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y V+N + W+ K++ + ++ Y +M A FCL P G
Sbjct: 280 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDRDNMEYEKYDYREMLHNATFCLVPRG 339
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ C+PV++++ LPF++ I W + + E + ++ + + SI +
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 399
Query: 338 VIL 340
IL
Sbjct: 400 KIL 402
>gi|357628255|gb|EHJ77645.1| hypothetical protein KGM_04618 [Danaus plexippus]
Length = 655
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 27/99 (27%), Positives = 53/99 (53%), Gaps = 1/99 (1%)
Query: 243 WENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 301
W++ ++ + + ++ YE + + FCL G S R +EA+ GC+PV++++
Sbjct: 203 WKDLRDERCDEDNREYDKFDYEQLLANSTFCLVARGRRLGSYRFLEALAAGCVPVLLSNG 262
Query: 302 IVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVIL 340
LPF + I W ++ E + ++ ++ S+PPE IL
Sbjct: 263 WRLPFDERIDWRRAVIWADERLLLQVPELVRSVPPERIL 301
>gi|298713685|emb|CBJ48876.1| Glycosyltransferase, family GT47 [Ectocarpus siliculosus]
Length = 226
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 42/166 (25%), Positives = 68/166 (40%), Gaps = 17/166 (10%)
Query: 180 RNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGAR 239
RN +G++ + + ++M + + I +D + PE Y R
Sbjct: 39 RNFFLTTKGTLYLSGHGSGERMSLVPLQDEANGVIMSLHCFEIHDEHLLPENVEYCRA-- 96
Query: 240 AAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIA 299
L D DH YE + + F L P G +P + RL E + G IPV +
Sbjct: 97 -----------LSDRYGDHS---YESLMNSTFGLVPAGRSPGTYRLGEVMSAGSIPVFVG 142
Query: 300 DDIVLPFADAIPWEEIGMFVAEEDV-PKLDTILTSIPPEVILRKQR 344
D+VLPF + I W A + V P++ L ++P + + QR
Sbjct: 143 WDLVLPFKELIDWSSFSFAFAPDQVGPQMVRTLRAVPRAELEQMQR 188
>gi|1619954|gb|AAB17006.1| multiple exostosis 2 protein [Mus musculus]
Length = 718
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 44/83 (53%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q A FC L + + GC+PV+IAD +LPF++ + W++ + V E
Sbjct: 308 YPQVLQEATFCTVLRRARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKKASVVVPE 367
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + + +IL +IP I QR
Sbjct: 368 EKMSDVYSILQNIPQRQIEEMQR 390
>gi|302807981|ref|XP_002985685.1| galactosyltransferase-like protein [Selaginella moellendorffii]
gi|300146594|gb|EFJ13263.1| galactosyltransferase-like protein [Selaginella moellendorffii]
Length = 500
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 89/396 (22%), Positives = 151/396 (38%), Gaps = 73/396 (18%)
Query: 35 DPVGKLKVYVYELPSKYNKKLLLK---------------------------------DPR 61
D +V++YELP ++N ++L K DP
Sbjct: 97 DSCEGRRVFMYELPRRFNLEVLEKCDKMVSWLTFCDHFINHGFGKALAGANSSWYATDPY 156
Query: 62 CLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSA 121
L +F + +R L++SP EAD F+ P Y D L + + + R
Sbjct: 157 MLEVIFHERMHRYRCLVNSP------READAFFIPYYAGLD----ALRFLYGADNLNRHE 206
Query: 122 -----IQLISSNWPY-WNRTEGADHFFVVPH---DFGACFHYQEEKAIERGI-LPLLQRA 171
++ + +N+ + W R G DHF V DF + Y+ + G L LL++
Sbjct: 207 QGVDLVEFLEANYSWSWTRNLGHDHFMVTGRTAWDFAS---YRGKSGSSWGTSLRLLKQM 263
Query: 172 TLVQTF-GQRNHVCLNEGSITIP-PYAPPQKMQAH---QIPPDTPRSIFVYFRGLFYDVN 226
V T +R E +I P + P K + + +PR+ F+ F G
Sbjct: 264 ENVTTLVMERRPWDRTEQAIPYPTSFHPATKSELQAWIERVKASPRANFMSFAGAPRPQQ 323
Query: 227 NDP-EGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRL 285
N+ G + + ++ E + L +P E + +IFCL P G
Sbjct: 324 NESIRGILFEQCRKSRSCEAVNCSKL--RCAHNPLPIAEKLLSSIFCLQPQGDTSTRRSS 381
Query: 286 VEAVVFGCIPVII-ADDIVLPFADAIPWEE--IGMFVAEEDVPK-----LDTILTSIPPE 337
+++V GCIPV AD + +P E +F+ EED+ + + + + +
Sbjct: 382 FDSLVCGCIPVFFHADSAYTQYTWHLPRERESYSVFIPEEDIRRDGLEVEEFLRSKFSSQ 441
Query: 338 VILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNG 373
I QR + + R + +P GD L+G
Sbjct: 442 RIGELQRNI-RKIIPRLLYTGKPWSSGDGGRDSLDG 476
>gi|168038843|ref|XP_001771909.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162676860|gb|EDQ63338.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 308
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 63/244 (25%), Positives = 99/244 (40%), Gaps = 37/244 (15%)
Query: 127 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLN 186
S PYWN+ G +H V+ F +H + G ++ T+ V
Sbjct: 67 STLPYWNK--GLNHVLVI---FADKWHQRGPHQASIGNASVMASDMHETTYRPGFDV--- 118
Query: 187 EGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYA----RGAR--- 239
SI +P ++ Q+ + P R FRGL Y + EG + + RG
Sbjct: 119 --SIPLPGRHHVRRFQS--VKP-LERKYLATFRGLRY-LGPKGEGVFRSLDSFRGMHNGN 172
Query: 240 ----AAVWENFKNN------PLFDISTD-----HPTTYYEDMQRAIFCLCPLGWAPWSPR 284
A E+ NN P + D H ++D+ F L P G P S R
Sbjct: 173 DVIVATSCEHMINNMIRKKEPTLGVHCDEDLLIHKNYTFDDLMNTTFGLVPAGVQPASYR 232
Query: 285 LVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPP-EVILRKQ 343
+E + G IPV+IAD+ V PF I W + + ++ ++ L ++ P EV +R++
Sbjct: 233 FIEVLSAGAIPVLIADNYVKPFDTLILWYKCLLQFPTTEMHRIVGTLRAMKPEEVKIRQE 292
Query: 344 RLLA 347
LA
Sbjct: 293 NCLA 296
>gi|55742053|ref|NP_001006730.1| exostosin 1 [Xenopus (Silurana) tropicalis]
gi|49523045|gb|AAH75481.1| exostoses (multiple) 1 [Xenopus (Silurana) tropicalis]
Length = 738
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/123 (25%), Positives = 56/123 (45%), Gaps = 1/123 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y V+N + W+ K+ + ++ Y +M A FCL P G
Sbjct: 272 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDARCDKDNAEYEKYDYREMLHNATFCLVPRG 331
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ C+PV++++ LPF++ I W + + E + ++ + + SI +
Sbjct: 332 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWNQAAVIGDERLLLQIPSTIRSIHQD 391
Query: 338 VIL 340
IL
Sbjct: 392 KIL 394
>gi|298713683|emb|CBJ48874.1| Glycosyltransferase, family GT47 [Ectocarpus siliculosus]
Length = 335
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 44/169 (26%), Positives = 72/169 (42%), Gaps = 23/169 (13%)
Query: 180 RNHVCLNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYFRGLFYDVNND---PEGGYYAR 236
RN +GS+ + + ++M + P + V ++++N+ PE Y R
Sbjct: 148 RNFFLTTKGSLYLSRHGSGERMS---LVPLQDEANGVIMSLHCFELHNEHLLPENVEYCR 204
Query: 237 GARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPV 296
L D DH YE + + F L P G +P + RL E + G IPV
Sbjct: 205 A-------------LSDRYGDHS---YESLMNSTFGLVPAGRSPGTFRLGEVMSAGSIPV 248
Query: 297 IIADDIVLPFADAIPWEEIGMFVAEEDV-PKLDTILTSIPPEVILRKQR 344
+ D+VLPF + I W A + V P++ L ++P + + QR
Sbjct: 249 FVGWDLVLPFKELIDWSSFSFAFAPDQVGPQMVRTLRAVPRAELEQMQR 297
>gi|224058539|ref|XP_002299537.1| predicted protein [Populus trichocarpa]
gi|222846795|gb|EEE84342.1| predicted protein [Populus trichocarpa]
Length = 573
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 77/333 (23%), Positives = 123/333 (36%), Gaps = 60/333 (18%)
Query: 42 VYVYELPSKYNKKLL--------------------LKDPRC-------LTHMFAAEIFMH 74
VYVY+LPSK+NK L+ L +P TH ++ E H
Sbjct: 192 VYVYDLPSKFNKDLIGQCGDMMPWTDFCKYFNNEALGEPIANLGKGWYHTHQYSLEPIFH 251
Query: 75 RFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKS-PRMMRSAIQLISSNWPY-- 131
+LS P R N EA FY P Y D+ L FK+ ++ A+ + W
Sbjct: 252 SRILSHPCRVYNESEAKLFYVPYYGGLDI----LRWHFKNVSDDVKDALAMDLMKWLEHR 307
Query: 132 --WNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLNEGS 189
W + G DH FV+ F + + L L Q ++ +R +N+ +
Sbjct: 308 RPWVQNSGTDHVFVLG-KISWDFRRKNYTSWGTRFLELEQMQNPIKLLIERQPWEVNDIA 366
Query: 190 ITIPPYAPPQK----MQAHQIPPDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWEN 245
I P + P + Q +T R V F G D + +
Sbjct: 367 IPHPTFFHPHSDDDIVAWQQKIIETTRKNLVSFAGA---ARPDQPESIRSTLINQCTSTS 423
Query: 246 FKNNPLFDIST---DHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 302
D + + P + + + FCL P G +P + +++V GCIPV+
Sbjct: 424 SDKCQFLDCKSGGCNQPESVTKLFLESEFCLQPPGDSPTRKSVFDSLVSGCIPVLFD--- 480
Query: 303 VLPFAD--AIPWE------EIGMFVAEEDVPKL 327
PF PW + +F+ +E+V ++
Sbjct: 481 --PFTAYYQYPWHLPEDHGKYSVFIDQEEVRQM 511
>gi|297725119|ref|NP_001174923.1| Os06g0638350 [Oryza sativa Japonica Group]
gi|255677258|dbj|BAH93651.1| Os06g0638350 [Oryza sativa Japonica Group]
Length = 257
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 71/269 (26%), Positives = 104/269 (38%), Gaps = 63/269 (23%)
Query: 58 KDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPI--------YPTCDLTPTGLP 109
+ PR LT +A+E + + + S RT + E+A F+ PI P+ L T
Sbjct: 20 QTPRKLTGKYASEGYFFQNIRESRFRTDDLEKAHLFFVPISPHKMRGKVPSSLLLVTYAW 79
Query: 110 LPFKSPRMMRSAI--------QLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIE 161
L RS + L SS +W GADHFFV HD G +A E
Sbjct: 80 LILHIRSYDRSILFLDLYWWCPLCSSFRGHWG--VGADHFFVTCHDVGV-------RAFE 130
Query: 162 RGILPLLQRATLVQTFGQRNHVC---LNEGSITIPPYAPPQKMQAHQIPPDTPRSIFVYF 218
LP + + ++ VC N G I A PQ +Q +P
Sbjct: 131 G--LPFIIKNSI-------RVVCSPSYNAGYIPHKDVALPQILQPFALPAGGN------- 174
Query: 219 RGLFYDVNNDPEGGYYA-------RGARAAVWEN-----FKNNPLFDISTDHPTTYYEDM 266
D+ N G++A R A +WEN NN + + Y +
Sbjct: 175 -----DIENRTILGFWAGHRNSKIRVILARIWENDTELAISNNRINRAIGN--LVYQKHF 227
Query: 267 QRAIFCLCPLGWAPWSPRLVEAVVFGCIP 295
R FC+CP G S R+ +++ +GC+P
Sbjct: 228 FRTKFCVCPGGSQVNSARISDSIHYGCMP 256
>gi|148234643|ref|NP_001083782.1| exostosin 1 [Xenopus laevis]
gi|62871603|gb|AAH94398.1| XEXT1 protein [Xenopus laevis]
Length = 738
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 31/123 (25%), Positives = 56/123 (45%), Gaps = 1/123 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y V+N + W+ K+ + ++ Y +M A FCL P G
Sbjct: 272 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDARCDKDNAEYEKYDYREMLHNATFCLVPRG 331
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ C+PV++++ LPF++ I W + + E + ++ + + SI +
Sbjct: 332 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWNQAAVIGDERLLLQIPSTIRSIHQD 391
Query: 338 VIL 340
IL
Sbjct: 392 KIL 394
>gi|148229160|ref|NP_001083108.1| uncharacterized protein LOC398750 [Xenopus laevis]
gi|37805422|gb|AAH60367.1| MGC68803 protein [Xenopus laevis]
Length = 718
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 44/83 (53%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y + +Q + FC+ G L + + GC+PVIIAD VLPF++ + W+ + + E
Sbjct: 308 YPQILQESTFCIVLRGARLGQGVLSDVLQAGCVPVIIADSYVLPFSEVLDWKRASVVIPE 367
Query: 322 EDVPKLDTILTSIPPEVILRKQR 344
E + ++ +IL I + QR
Sbjct: 368 EKMFEMYSILQGISQRQVEEMQR 390
>gi|123457129|ref|XP_001316295.1| Exostosin family protein [Trichomonas vaginalis G3]
gi|121898997|gb|EAY04072.1| Exostosin family protein [Trichomonas vaginalis G3]
Length = 350
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 61/271 (22%), Positives = 97/271 (35%), Gaps = 67/271 (24%)
Query: 65 HMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIY--------PTCDLTPTGLPLPFKSPR 116
H F E +HR + R +AD FY PIY CD LPL
Sbjct: 28 HTFYFEPVLHRHFMKGYNRVSYESDADLFYVPIYLGLFNMQREKCDFDRCVLPL------ 81
Query: 117 MMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQ---RATL 173
+ S Y++R DH FV + +P Q + +
Sbjct: 82 --------VRSQGDYYDRFGSVDHVFV-------------QMLFSHNNVPFTQHHQKVIV 120
Query: 174 VQTFGQRNHVCLNEGSITIPPYAPPQKMQAHQIP---------PDTPRSIFVYFRGLFYD 224
QT + + + + + P Q + +P + + I + G
Sbjct: 121 AQT-------TIGDINWNLSIFEPRQMTRFTVMPYNSNFDFYESSSKQCITAFLTGQMTI 173
Query: 225 VNNDPEGGYYARGARAAVWE---NFKNNPLFDISTDH---PTTYYED---MQRAIFCLCP 275
+ D AR R A+ E N KN + + Y++ M+ + FC P
Sbjct: 174 ASFDKR----ARNIRQALKEEMRNTKNTAVIETKRKSHFIAAEYFQIESLMRNSEFCPVP 229
Query: 276 LGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 306
G P S RL +++ GCIP++++D+I PF
Sbjct: 230 HGDGPQSKRLYDSMRTGCIPIVLSDEIRFPF 260
>gi|19909908|dbj|BAB87180.1| XEXT1 [Xenopus laevis]
Length = 735
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 31/123 (25%), Positives = 56/123 (45%), Gaps = 1/123 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y V+N + W+ K+ + ++ Y +M A FCL P G
Sbjct: 269 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDARCDKDNAEYEKYDYREMLHNATFCLVPRG 328
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ C+PV++++ LPF++ I W + + E + ++ + + SI +
Sbjct: 329 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWNQAAVIGDERLLLQIPSTIRSIHQD 388
Query: 338 VIL 340
IL
Sbjct: 389 KIL 391
>gi|156361936|ref|XP_001625539.1| predicted protein [Nematostella vectensis]
gi|156212377|gb|EDO33439.1| predicted protein [Nematostella vectensis]
Length = 881
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 45/175 (25%), Positives = 78/175 (44%), Gaps = 23/175 (13%)
Query: 190 ITIPPYAPPQK----MQAHQIPPDTPRSIFVYFRGLFY-------DVNND------PEGG 232
+ +PP P QA Q P T R F+ F G F+ +V+ D E
Sbjct: 310 VIVPPLIGPTSGEVWAQAGQQLP-TLRKYFLTFEGSFHLSNEGAEEVSKDDLVFLSKESS 368
Query: 233 YYARGARAAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSP--RLVEAVV 290
++ N+ +++ H ++++ F L P+G + + RL+EA+
Sbjct: 369 DLFIAVECVTYDKISNSAGWNLCGSHKNRTVR-LKKSTFSLVPVGNSGFVTHVRLIEALQ 427
Query: 291 FGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVI--LRKQ 343
G IPVI+ +LP A+ I W + + + + +L+TIL ++ E I LR+Q
Sbjct: 428 TGAIPVILGTTNMLPLAEFIDWRSVSITLTPARIMELNTILRTVSSEDIHNLRQQ 482
>gi|50416406|gb|AAH77234.1| XEXT1 protein [Xenopus laevis]
Length = 735
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 31/123 (25%), Positives = 56/123 (45%), Gaps = 1/123 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y V+N + W+ K+ + ++ Y +M A FCL P G
Sbjct: 269 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDARCDKDNAEYEKYDYREMLHNATFCLVPRG 328
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ C+PV++++ LPF++ I W + + E + ++ + + SI +
Sbjct: 329 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWNQAAVIGDERLLLQIPSTIRSIHQD 388
Query: 338 VIL 340
IL
Sbjct: 389 KIL 391
>gi|302847713|ref|XP_002955390.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300259232|gb|EFJ43461.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 267
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 33/104 (31%), Positives = 52/104 (50%), Gaps = 6/104 (5%)
Query: 282 SPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPEVILR 341
PR+ +AV GC+PV+I+DD++ F + W G+ +AE D+P++ + +I PE
Sbjct: 11 GPRM-QAVQAGCVPVVISDDVLEAFEPFLDWNTFGVRLAEADIPRMHEVREAISPEEYAH 69
Query: 342 KQRLLANPSMKRAMLFPQPAQPG-----DAFHQILNGLARKLPH 380
K+ LL + A + G DAF +L L K H
Sbjct: 70 KEVLLRCAAQHMAFSTVTGSYIGESGRYDAFETLLEILRAKAAH 113
>gi|225457534|ref|XP_002270383.1| PREDICTED: xyloglucan galactosyltransferase KATAMARI1 homolog
[Vitis vinifera]
Length = 511
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 41/136 (30%), Positives = 56/136 (41%), Gaps = 39/136 (28%)
Query: 42 VYVYELPSKYNKKLLLKDPRCL----------------------------THMFAAEIFM 73
VYVYELP K+N LL K+ R L TH F AE+
Sbjct: 125 VYVYELPPKFNIGLL-KECRRLNVYTDMCPHVANCGLGQPILEMGSSWFATHQFIAEMIF 183
Query: 74 HRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRS---AIQLIS--SN 128
H + + P RT +PE+AD FY P Y + F+ + A++L+
Sbjct: 184 HARMENHPCRTRDPEKADLFYVPFYGGLHASSK-----FRESNLAARDALAVELVEYIHR 238
Query: 129 WPYWNRTEGADHFFVV 144
+W R GADHF +
Sbjct: 239 QRWWRRNHGADHFLAL 254
>gi|256088012|ref|XP_002580154.1| exostosin-1 [Schistosoma mansoni]
gi|353230126|emb|CCD76297.1| putative exostosin-1 [Schistosoma mansoni]
Length = 766
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 23/60 (38%), Positives = 36/60 (60%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y+E M + FCL P G S R +E + GCIPV++++D+ LPF++ I W ++ E
Sbjct: 322 YWELMYNSTFCLVPRGRRLGSYRFLEVLQAGCIPVMLSNDLELPFSEVIDWNRAVIWADE 381
>gi|224144348|ref|XP_002336132.1| predicted protein [Populus trichocarpa]
gi|222873674|gb|EEF10805.1| predicted protein [Populus trichocarpa]
Length = 421
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 41/142 (28%), Positives = 57/142 (40%), Gaps = 38/142 (26%)
Query: 38 GKLKVYVYELPSKYNKKLLLKDPRCL---------------------------------T 64
G + VY+Y++P+++NK LL KD L T
Sbjct: 24 GGMSVYLYDMPAEFNKGLL-KDCSHLNAYTDMCPHVANRGLGQPLSYMAESAVATTWFAT 82
Query: 65 HMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPT--GLPLPFKSPRMMRSAI 122
H F AE+ H + + P R L+P A FY P Y D + L + +R A
Sbjct: 83 HQFIAEMIFHARMENHPCRVLDPINAKLFYVPFYGGLDASSKFHDANLTARDELAVRLAD 142
Query: 123 QLISSNWPYWNRTEGADHFFVV 144
L S P+W R G DHF V+
Sbjct: 143 YLRSK--PWWERHHGKDHFLVL 162
>gi|224061383|ref|XP_002300452.1| predicted protein [Populus trichocarpa]
gi|222847710|gb|EEE85257.1| predicted protein [Populus trichocarpa]
Length = 400
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 41/142 (28%), Positives = 57/142 (40%), Gaps = 38/142 (26%)
Query: 38 GKLKVYVYELPSKYNKKLLLKDPRCL---------------------------------T 64
G + VY+Y++P+++NK LL KD L T
Sbjct: 3 GGMSVYLYDMPAEFNKGLL-KDCSHLNPYTDMCPHVANRGLGQPLSYMAESAVATTWFAT 61
Query: 65 HMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPT--GLPLPFKSPRMMRSAI 122
H F AE+ H + + P R L+P A FY P Y D + L + +R A
Sbjct: 62 HQFIAEMIFHARMENHPCRVLDPINAKLFYVPFYGGLDASSKFHDANLTARDELAVRLAD 121
Query: 123 QLISSNWPYWNRTEGADHFFVV 144
L S P+W R G DHF V+
Sbjct: 122 YLRSK--PWWERHHGKDHFLVL 141
>gi|345480513|ref|XP_001603840.2| PREDICTED: exostosin-1-like isoform 1 [Nasonia vitripennis]
gi|345480515|ref|XP_003424163.1| PREDICTED: exostosin-1-like isoform 2 [Nasonia vitripennis]
Length = 713
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 41/72 (56%)
Query: 269 AIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLD 328
+ FCL P G S R +EA+ GCIPVI+++ LPF D I W + ++ E + ++
Sbjct: 307 STFCLVPRGRRLGSFRFLEALRAGCIPVILSNGWALPFHDRIDWFQAVIYADERLLFQVP 366
Query: 329 TILTSIPPEVIL 340
IL S+ E I+
Sbjct: 367 DILRSVVEEKIM 378
>gi|296227312|ref|XP_002759355.1| PREDICTED: exostosin-1 [Callithrix jacchus]
Length = 475
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 46/85 (54%)
Query: 256 TDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEI 315
T+H Y E + A FCL P G S R +EA+ C+PV++++ LPF++ I W +
Sbjct: 47 TEHWYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQA 106
Query: 316 GMFVAEEDVPKLDTILTSIPPEVIL 340
+ E + ++ + + SI + IL
Sbjct: 107 AVIGDERLLLQIPSTIRSIHQDKIL 131
>gi|159465527|ref|XP_001690974.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158279660|gb|EDP05420.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 341
Score = 52.0 bits (123), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 36/143 (25%), Positives = 62/143 (43%), Gaps = 14/143 (9%)
Query: 204 HQIPPDTPRSIFVYFRGLFYDVNN--DPEGGY----------YARGARAAVWENFKNNPL 251
H + P PR+ F G ++ DPE + Y++GAR V + P
Sbjct: 72 HPLAPRIPRNGTFLFSGRICGDHSLPDPEKPWPHCATRRSPGYSQGARQLVHFHHSQRPG 131
Query: 252 FDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIP 311
+ + + Y D+ +CL P G R V GC+PV+++D ++ PF +
Sbjct: 132 YKVMVKN-KEYQVDLINYKWCLAPSG-GGHGHRQVLVAAMGCLPVVVSDLVMQPFEPEMD 189
Query: 312 WEEIGMFVAEEDVPKLDTILTSI 334
W + V ++DVP L + ++
Sbjct: 190 WSAFSLRVEQKDVPTLHEAIEAV 212
>gi|310831249|ref|YP_003969892.1| putative exostosin family protein [Cafeteria roenbergensis virus
BV-PW1]
gi|309386433|gb|ADO67293.1| putative exostosin family protein [Cafeteria roenbergensis virus
BV-PW1]
Length = 300
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 56/108 (51%), Gaps = 3/108 (2%)
Query: 231 GGYYARGARAAVWENFKNNPLFDISTDHPTTYYED-MQRAIFCLCPLGWAPWSPRLVEAV 289
G ++ + +N K D S + T Y + + +IF LCP G P S R E++
Sbjct: 172 GNWHFNKIVYSTKQNAKQELNIDNSHINNTKNYNNILLDSIFSLCPSGSGPNSIRFWESL 231
Query: 290 VFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
G IPV+++D + LP I W++I + + E+++ +D IL +I E
Sbjct: 232 AVGSIPVLLSDKLDLP--KNINWQDIIIIIPEKEISNIDIILNNISVE 277
>gi|302785233|ref|XP_002974388.1| galactosyltransferase-like protein [Selaginella moellendorffii]
gi|300157986|gb|EFJ24610.1| galactosyltransferase-like protein [Selaginella moellendorffii]
Length = 500
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 81/382 (21%), Positives = 145/382 (37%), Gaps = 59/382 (15%)
Query: 41 KVYVYELPSKYNKKLLLKDPRCLT----------HMFAA-----------------EIFM 73
+V++YELP K+N ++L K + ++ H F E+
Sbjct: 105 RVFMYELPRKFNLEVLEKCDKMVSWLTFCDHFINHGFGKALAGANSSWYATDPYMLEVIF 164
Query: 74 HRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSA-----IQLISSN 128
H + +P EAD F+ P Y D L + + + R ++ + +N
Sbjct: 165 HERMRRYRCLVNSPREADAFFIPYYAGLD----ALRFLYGADNLNRHEQGVDLVKFLEAN 220
Query: 129 WPY-WNRTEGADHFFVVPH---DFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVC 184
+ + W R G DHF V DF + Y+ + + + L Q + +R
Sbjct: 221 YSWSWRRNLGHDHFMVTGRTAWDFAS---YRGKSSWGTSLRLLKQMENVTTLVMERRPWD 277
Query: 185 LNEGSITIP-PYAPPQKMQAH---QIPPDTPRSIFVYFRGLFYDVNNDP-EGGYYARGAR 239
E +I P + P K + + +PR+ F+ F G N+ G + + +
Sbjct: 278 RTEQAIPYPTSFHPATKSELQAWIERVKASPRTNFMSFAGAPRPQQNESIRGILFEQCRK 337
Query: 240 AAVWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVII- 298
+ E + L +P E + +IFCL P G +++V GCIPV
Sbjct: 338 SRSCEAVNCSKL--RCAHNPLPIAEKLLSSIFCLQPQGDTSTRRSSFDSLVCGCIPVFFH 395
Query: 299 ADDIVLPFADAIPWEE--IGMFVAEEDVPK-----LDTILTSIPPEVILRKQRLLANPSM 351
AD + +P E +F+ EE++ + + + + + I QR + +
Sbjct: 396 ADSAYTQYTWHLPRERESYSVFIPEEEIRRDGLEVEEFLRSKFSSQRIGELQRNI-RKII 454
Query: 352 KRAMLFPQPAQPGDAFHQILNG 373
R + +P GD L+G
Sbjct: 455 PRLLYTGKPWSSGDGGRDSLDG 476
>gi|427796253|gb|JAA63578.1| Putative tout-velu, partial [Rhipicephalus pulchellus]
Length = 835
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 42/78 (53%)
Query: 262 YYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAE 321
Y M+ A FCL P G S R +E++ GC+PV++A+ LPF +++ WE + E
Sbjct: 394 YGSLMENATFCLVPRGRRLGSFRFLESLQAGCVPVLLANGWELPFGESLRWEGAALRADE 453
Query: 322 EDVPKLDTILTSIPPEVI 339
+ ++ L S+P +
Sbjct: 454 RLLLQVPDTLRSMPRRRV 471
>gi|348519387|ref|XP_003447212.1| PREDICTED: exostosin-1b-like [Oreochromis niloticus]
Length = 740
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 49/195 (25%), Positives = 80/195 (41%), Gaps = 19/195 (9%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y V+N + W+ K+ + ++ Y++M + FCL P G
Sbjct: 274 RNALYHVHNAEDVVLLTTCKHGKDWQKHKDARCDKDNAEYDKYDYKEMLHNSTFCLVPRG 333
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S R +EA+ C+PV++++ LPF++ I W + E + ++ T + SI +
Sbjct: 334 RRLGSFRFLEALQAACVPVMLSNGWELPFSEIIDWNTAAVIGDERLLLQIPTTVRSIHQD 393
Query: 338 VILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGLA----RKLPHDKSTYL----KPG 389
IL S+++ F A +L L R L H + L PG
Sbjct: 394 KIL---------SLRQQTQFLWEAYFSSVEKIVLTTLEIIQDRVLEHASRSSLMWNSHPG 444
Query: 390 GKF-LNWTSGPVGDL 403
G F L SG +GD
Sbjct: 445 GLFALPQYSGYLGDF 459
>gi|156362595|ref|XP_001625861.1| predicted protein [Nematostella vectensis]
gi|156212714|gb|EDO33761.1| predicted protein [Nematostella vectensis]
Length = 728
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 37/135 (27%), Positives = 62/135 (45%), Gaps = 12/135 (8%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKN------NPLFDISTDHPTTYYEDMQRAIFC 272
R Y ++N + W K+ N L+D +Y E + A FC
Sbjct: 267 RNALYHIHNGRDIVLLTTCKHGKAWHKHKDERCDGDNALYD-----RYSYDELLLNATFC 321
Query: 273 LCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILT 332
L P G S R +E++ GCIP +++D LPFA+ I W++ + +E + ++ I+
Sbjct: 322 LVPRGRRLGSFRFLESLKVGCIPFLLSDGWELPFAEVIDWKKAVIDGSERLLMQVPGIVR 381
Query: 333 SIP-PEVILRKQRLL 346
S +V+ KQ+ L
Sbjct: 382 SYSRSQVLAMKQQSL 396
>gi|340377881|ref|XP_003387457.1| PREDICTED: exostosin-1b-like [Amphimedon queenslandica]
Length = 729
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 33/128 (25%), Positives = 55/128 (42%), Gaps = 11/128 (8%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENF------KNNPLFDISTDHPTTYYEDMQRAIFC 272
R Y ++N + W + K+N F+ Y + + FC
Sbjct: 262 RNAIYHLDNGDDVIMLTTCKHGKFWSRYRDEKCKKDNIFFE-----KYNYNHLISNSTFC 316
Query: 273 LCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILT 332
L P G S R +E++ GCIPV +++ +VLPF + I W + E + ++ +L
Sbjct: 317 LVPRGRRLGSFRFLESLHAGCIPVSLSNGLVLPFHELIDWSKALFVFDERQLFQVPHMLR 376
Query: 333 SIPPEVIL 340
IP + IL
Sbjct: 377 HIPEDKIL 384
>gi|441647993|ref|XP_004090847.1| PREDICTED: LOW QUALITY PROTEIN: exostosin-1 [Nomascus leucogenys]
Length = 746
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 31/123 (25%), Positives = 57/123 (46%), Gaps = 1/123 (0%)
Query: 219 RGLFYDVNNDPEGGYYARGARAAVWENFKNNPLFDISTDHPTTYYEDM-QRAIFCLCPLG 277
R Y V+N + W+ K++ +T++ Y +M A FCL P G
Sbjct: 280 RNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRG 339
Query: 278 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGMFVAEEDVPKLDTILTSIPPE 337
S +EA+ C+PV++++ LPF++ I W + + E + ++ + + SI +
Sbjct: 340 RRLGSFIFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 399
Query: 338 VIL 340
IL
Sbjct: 400 KIL 402
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.323 0.140 0.457
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,271,486,090
Number of Sequences: 23463169
Number of extensions: 328051390
Number of successful extensions: 619690
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1149
Number of HSP's successfully gapped in prelim test: 160
Number of HSP's that attempted gapping in prelim test: 616856
Number of HSP's gapped (non-prelim): 1973
length of query: 406
length of database: 8,064,228,071
effective HSP length: 145
effective length of query: 261
effective length of database: 8,957,035,862
effective search space: 2337786359982
effective search space used: 2337786359982
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 78 (34.7 bits)