BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 015235
(411 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|359495862|ref|XP_002266299.2| PREDICTED: probable glycosyltransferase At5g03795 [Vitis vinifera]
Length = 594
Score = 666 bits (1719), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 324/418 (77%), Positives = 360/418 (86%), Gaps = 10/418 (2%)
Query: 1 MIGSKAAQPNQAQTTLCSLRSSLLTLALVTLLCFTCLSFNSFRSYPLQNNFNST------ 54
M K+ Q A +TLCS+ SLLTLA +TL+ FT +S S S +FNS+
Sbjct: 179 MTAVKSPQQAVASSTLCSIHGSLLTLATLTLISFTYISLKSLHSPFHDPSFNSSPPPQVI 238
Query: 55 --LSFAINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGK 112
+ + ++D SD++HSP +F LNY EM NFK+YIYPDGDPNTFYQTPRKLTGK
Sbjct: 239 SRVEHLVEEED--SPFSDIYHSPEIFRLNYREMEKNFKVYIYPDGDPNTFYQTPRKLTGK 296
Query: 113 YASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINK 172
YASEGYFFQNIR+SRF T+DPD+AHLFFIPISCHKMRGKGTSYENMTVIV++YV SLI+K
Sbjct: 297 YASEGYFFQNIRDSRFRTNDPDQAHLFFIPISCHKMRGKGTSYENMTVIVQNYVGSLISK 356
Query: 173 YPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVL 232
YPYWNRTLGADHFFVTCHDVGVRATEGVPFL+KN+IRVVCSPSYDV FIPHKDVALPQVL
Sbjct: 357 YPYWNRTLGADHFFVTCHDVGVRATEGVPFLVKNSIRVVCSPSYDVGFIPHKDVALPQVL 416
Query: 233 QPFALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQ 292
QPFALP GG D+ENRT LGFWAGHRNSKIRVILARVWENDTELDI NNRINRA G L+YQ
Sbjct: 417 QPFALPAGGNDIENRTTLGFWAGHRNSKIRVILARVWENDTELDIMNNRINRAAGELLYQ 476
Query: 293 RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERD 352
+RFY+TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKF+V+L E D
Sbjct: 477 KRFYRTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFSVVLKELD 536
Query: 353 VYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIKY 410
VYQLKQILK+I DAEFV+LHNNLV+VQKHFQWNSPP+R+DAFHM+MYELWLR VIKY
Sbjct: 537 VYQLKQILKDIPDAEFVALHNNLVQVQKHFQWNSPPIRYDAFHMIMYELWLRHQVIKY 594
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 59/117 (50%), Positives = 67/117 (57%), Gaps = 21/117 (17%)
Query: 1 MIGSKAAQPNQAQTTLCSLRSSLLTLALVTLLCFTCLSFNSFRSYPLQNNFNSTLSFAIN 60
M K+ Q A +TLCS+ SLLTLA +TL+ FT +S S S +FNS
Sbjct: 1 MTAVKSPQQAVASSTLCSIHGSLLTLATLTLISFTYISLKSLHSPFHDPSFNS------- 53
Query: 61 KQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEG 117
SP LNY EM NFK+YIYPDGDPNTFYQTPRKLTGKYASE
Sbjct: 54 -------------SPPP-QLNYREMEKNFKVYIYPDGDPNTFYQTPRKLTGKYASEA 96
>gi|255563758|ref|XP_002522880.1| catalytic, putative [Ricinus communis]
gi|223537865|gb|EEF39480.1| catalytic, putative [Ricinus communis]
Length = 406
Score = 666 bits (1718), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 315/410 (76%), Positives = 353/410 (86%), Gaps = 4/410 (0%)
Query: 1 MIGSKAAQPNQAQTTLCSLRSSLLTLALVTLLCFTCLSFNSFRSYPLQNNFNSTLSFAIN 60
M K Q A + SL+ SLL LA++TL+ F LS N S NS ++
Sbjct: 1 MTPGKLLQQAAAPQFMFSLKGSLLILAVLTLISFVYLSVNYLHSESPPGVINSI----VH 56
Query: 61 KQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFF 120
D D+ SD++HS +F LNY EM NFK+YIYPDGDPNTFYQTPRKLTGKYASEGYFF
Sbjct: 57 ADDTDDVMSDLYHSARIFKLNYEEMERNFKVYIYPDGDPNTFYQTPRKLTGKYASEGYFF 116
Query: 121 QNIRESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTL 180
QNIRES+F T+DPD+AHLFFIPISCHKMRGKGTSYENMT+IV++YV+SL KYPYWNRTL
Sbjct: 117 QNIRESKFRTNDPDQAHLFFIPISCHKMRGKGTSYENMTIIVQNYVESLAVKYPYWNRTL 176
Query: 181 GADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRG 240
GADHFFVTCHDVGVRATEGVPFL+KNAIRVVCSPSYDV FIPHKDVALPQVLQPFALP G
Sbjct: 177 GADHFFVTCHDVGVRATEGVPFLVKNAIRVVCSPSYDVGFIPHKDVALPQVLQPFALPAG 236
Query: 241 GRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKF 300
G D+ENRT LGFWAGHRNSKIRVILARVWENDTELDISNNRINRA G LVYQ+RFY+TKF
Sbjct: 237 GNDLENRTTLGFWAGHRNSKIRVILARVWENDTELDISNNRINRATGHLVYQKRFYRTKF 296
Query: 301 CICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQIL 360
CICPGGSQVNSARIADSIHYGCVPVILS+YYDLPFNDILDW++F+VIL E+DVY+LKQ+L
Sbjct: 297 CICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDILDWKRFSVILKEKDVYRLKQVL 356
Query: 361 KNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIKY 410
K+ISD EFV+LH NLV+VQKHFQWNSPP+++DAFHMVM++LWLR HVIKY
Sbjct: 357 KDISDDEFVALHENLVEVQKHFQWNSPPIKYDAFHMVMFDLWLRHHVIKY 406
>gi|296090519|emb|CBI40850.3| unnamed protein product [Vitis vinifera]
Length = 416
Score = 665 bits (1716), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 324/418 (77%), Positives = 360/418 (86%), Gaps = 10/418 (2%)
Query: 1 MIGSKAAQPNQAQTTLCSLRSSLLTLALVTLLCFTCLSFNSFRSYPLQNNFNST------ 54
M K+ Q A +TLCS+ SLLTLA +TL+ FT +S S S +FNS+
Sbjct: 1 MTAVKSPQQAVASSTLCSIHGSLLTLATLTLISFTYISLKSLHSPFHDPSFNSSPPPQVI 60
Query: 55 --LSFAINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGK 112
+ + ++D SD++HSP +F LNY EM NFK+YIYPDGDPNTFYQTPRKLTGK
Sbjct: 61 SRVEHLVEEED--SPFSDIYHSPEIFRLNYREMEKNFKVYIYPDGDPNTFYQTPRKLTGK 118
Query: 113 YASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINK 172
YASEGYFFQNIR+SRF T+DPD+AHLFFIPISCHKMRGKGTSYENMTVIV++YV SLI+K
Sbjct: 119 YASEGYFFQNIRDSRFRTNDPDQAHLFFIPISCHKMRGKGTSYENMTVIVQNYVGSLISK 178
Query: 173 YPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVL 232
YPYWNRTLGADHFFVTCHDVGVRATEGVPFL+KN+IRVVCSPSYDV FIPHKDVALPQVL
Sbjct: 179 YPYWNRTLGADHFFVTCHDVGVRATEGVPFLVKNSIRVVCSPSYDVGFIPHKDVALPQVL 238
Query: 233 QPFALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQ 292
QPFALP GG D+ENRT LGFWAGHRNSKIRVILARVWENDTELDI NNRINRA G L+YQ
Sbjct: 239 QPFALPAGGNDIENRTTLGFWAGHRNSKIRVILARVWENDTELDIMNNRINRAAGELLYQ 298
Query: 293 RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERD 352
+RFY+TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKF+V+L E D
Sbjct: 299 KRFYRTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFSVVLKELD 358
Query: 353 VYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIKY 410
VYQLKQILK+I DAEFV+LHNNLV+VQKHFQWNSPP+R+DAFHM+MYELWLR VIKY
Sbjct: 359 VYQLKQILKDIPDAEFVALHNNLVQVQKHFQWNSPPIRYDAFHMIMYELWLRHQVIKY 416
>gi|449431924|ref|XP_004133750.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
sativus]
Length = 412
Score = 657 bits (1694), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 321/396 (81%), Positives = 356/396 (89%), Gaps = 2/396 (0%)
Query: 16 LCSLRSSLLTLALVTLLCFTCLSFNSFRSYPLQNNFNSTLSF-AINKQDYSDLGSDVFHS 74
LCSLR+SLLTLA++TLL FT LSF S S P ++ + A+N +++ SDV+HS
Sbjct: 18 LCSLRASLLTLAVLTLLSFTYLSFTSLHSSPPSSSSQLPVKLGALNDAADAEI-SDVYHS 76
Query: 75 PSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPD 134
P VF LNY +M + FK+YIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRF T DPD
Sbjct: 77 PQVFRLNYADMESKFKVYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFRTEDPD 136
Query: 135 RAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV 194
+AHLFFIPISCHKMRGKGTSYENMTVIV++YV+ LI+KYPYWNRTLGADHFFVTCHDVGV
Sbjct: 137 QAHLFFIPISCHKMRGKGTSYENMTVIVQNYVEGLISKYPYWNRTLGADHFFVTCHDVGV 196
Query: 195 RATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVENRTVLGFWA 254
RA+EG+PFLIKNAIRVVCSPSYDV FIPHKDVALPQVLQPFALP GG D ENRT LGFWA
Sbjct: 197 RASEGLPFLIKNAIRVVCSPSYDVGFIPHKDVALPQVLQPFALPAGGNDTENRTTLGFWA 256
Query: 255 GHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARI 314
GHRNSKIRVILARVWENDTELDISNNRI+RA G L+YQ+RFYKTKFCICPGGSQVNSARI
Sbjct: 257 GHRNSKIRVILARVWENDTELDISNNRISRATGHLLYQKRFYKTKFCICPGGSQVNSARI 316
Query: 315 ADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNN 374
ADSIHYGCVPVILS+YYDLPFNDILDWRKF+VI+ ERDVYQLKQILK+ISD EF+ LH N
Sbjct: 317 ADSIHYGCVPVILSDYYDLPFNDILDWRKFSVIVKERDVYQLKQILKDISDIEFIKLHKN 376
Query: 375 LVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIKY 410
L++VQKHFQWNSPP+++DAFHMVMY+LWLR HVIKY
Sbjct: 377 LMQVQKHFQWNSPPIKYDAFHMVMYDLWLRHHVIKY 412
>gi|356496874|ref|XP_003517290.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 404
Score = 656 bits (1693), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 313/398 (78%), Positives = 343/398 (86%), Gaps = 8/398 (2%)
Query: 18 SLRSSLLTLALVTLLCFTCLSFNSFRSYPLQ-----NNFNSTLSFAINKQDYSDLGSDVF 72
SLR SLL A++TLL FT LS P N N +++++ + D +
Sbjct: 10 SLRGSLLFFAILTLLSFTYLSLKYSTPTPQVAKLSVENLNDA---PVSEKEEKEEVPDTY 66
Query: 73 HSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHD 132
HSP VF LNY EM FK+YIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRF T +
Sbjct: 67 HSPRVFKLNYEEMEKKFKVYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFCTEN 126
Query: 133 PDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDV 192
PD AHLFFIPISCHKMRGKGTSYENMT+IV++YV+SLI+KYPYWNRTLGADHFFVTCHDV
Sbjct: 127 PDEAHLFFIPISCHKMRGKGTSYENMTIIVQNYVESLISKYPYWNRTLGADHFFVTCHDV 186
Query: 193 GVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVENRTVLGF 252
GVRATEG+ FL+KN+IR VCSPSYDV FIPHKDVALPQVLQPFALP GG D+ENRT LGF
Sbjct: 187 GVRATEGLEFLVKNSIRAVCSPSYDVGFIPHKDVALPQVLQPFALPAGGNDIENRTTLGF 246
Query: 253 WAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSA 312
WAGHRNSKIRVILARVWENDTELDISNNRI+RA G LVYQ+RFY++KFCICPGGSQVNSA
Sbjct: 247 WAGHRNSKIRVILARVWENDTELDISNNRISRATGHLVYQKRFYRSKFCICPGGSQVNSA 306
Query: 313 RIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLH 372
RIADSIHYGC+PVILSNYYDLPFNDILDW KFAV+L E DVYQLKQILKNISDAEFV+LH
Sbjct: 307 RIADSIHYGCIPVILSNYYDLPFNDILDWNKFAVVLKESDVYQLKQILKNISDAEFVTLH 366
Query: 373 NNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIKY 410
NNLVKVQKHFQWNSPP+RFDAFH+VMY+LWLR H IKY
Sbjct: 367 NNLVKVQKHFQWNSPPIRFDAFHLVMYDLWLRHHTIKY 404
>gi|224084457|ref|XP_002307304.1| predicted protein [Populus trichocarpa]
gi|222856753|gb|EEE94300.1| predicted protein [Populus trichocarpa]
Length = 406
Score = 655 bits (1690), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 318/406 (78%), Positives = 350/406 (86%), Gaps = 11/406 (2%)
Query: 16 LCSLRSSLLTLALVTLLCFTCLSFNSFRSYPLQNNFN----------STLSFAINKQDYS 65
+CSL+ SL TLA++TL+ FT LSFNS S ++ + +T I +DY
Sbjct: 1 MCSLKGSLRTLAILTLVSFTYLSFNSLHSSYFSSSSSISASSVSLAPATKKTTILVKDYD 60
Query: 66 DLG-SDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR 124
D SD++HSP VF LNY EM NFKI+IYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR
Sbjct: 61 DDEISDLYHSPRVFKLNYEEMERNFKIFIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR 120
Query: 125 ESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADH 184
ESRF T DPD+AHLFFIPISCHKMRGKG SYENMT+IV +YV+SL +KYPYWNRTLGADH
Sbjct: 121 ESRFQTQDPDQAHLFFIPISCHKMRGKGISYENMTIIVDNYVESLKSKYPYWNRTLGADH 180
Query: 185 FFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDV 244
FFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDV FIPHKD+ALPQVLQPFALP GG DV
Sbjct: 181 FFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVGFIPHKDIALPQVLQPFALPAGGNDV 240
Query: 245 ENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICP 304
E RT LGFWAGHRNS+IRVILARVWENDTELDISNNRINRA G LVYQ+RFY +K+CICP
Sbjct: 241 EKRTTLGFWAGHRNSRIRVILARVWENDTELDISNNRINRATGHLVYQKRFYGSKYCICP 300
Query: 305 GGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNIS 364
GGSQVNSARIADSIHYGC+PVILSNYYDLPFNDILDW KF+VIL E+DVY+LKQILK+I
Sbjct: 301 GGSQVNSARIADSIHYGCIPVILSNYYDLPFNDILDWHKFSVILKEQDVYRLKQILKDIP 360
Query: 365 DAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIKY 410
D + VSLH NLVKVQKHFQWNSPPV++DAFHMVMY+LWLR HVIKY
Sbjct: 361 DNKLVSLHKNLVKVQKHFQWNSPPVKYDAFHMVMYDLWLRHHVIKY 406
>gi|15233650|ref|NP_195517.1| Exostosin family protein [Arabidopsis thaliana]
gi|4467110|emb|CAB37544.1| putative protein [Arabidopsis thaliana]
gi|7270787|emb|CAB80469.1| putative protein [Arabidopsis thaliana]
gi|15293111|gb|AAK93666.1| unknown protein [Arabidopsis thaliana]
gi|21280961|gb|AAM45007.1| unknown protein [Arabidopsis thaliana]
gi|332661466|gb|AEE86866.1| Exostosin family protein [Arabidopsis thaliana]
Length = 425
Score = 652 bits (1683), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 314/415 (75%), Positives = 344/415 (82%), Gaps = 20/415 (4%)
Query: 12 AQTTLCSLRSSLLTLALVTLLCFTCLSFNSFRSYP---------------LQNNFNS-TL 55
+ LCSL+SSLLT+A++T + LS NS R+ P N + +
Sbjct: 15 GSSPLCSLKSSLLTVAILTFISLFYLSLNSLRTSPPSPVIVVTPIHVPHTFVNEYKTDNE 74
Query: 56 SFAINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYAS 115
+ + ++ Y SDV+HSP F LNY EM FK+YIYPDGDPNTFYQTPRK+TGKYAS
Sbjct: 75 TPTMEEETY----SDVYHSPEAFRLNYAEMEKRFKVYIYPDGDPNTFYQTPRKVTGKYAS 130
Query: 116 EGYFFQNIRESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPY 175
EGYFFQNIRESRF T DPD A LFFIPISCHKMRGKGTSYENMTVIV++YVD LI KYPY
Sbjct: 131 EGYFFQNIRESRFRTLDPDEADLFFIPISCHKMRGKGTSYENMTVIVQNYVDGLIAKYPY 190
Query: 176 WNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPF 235
WNRTLGADHFFVTCHDVGVRA EG P LIKN IRVVCSPSY+V FIPHKDVALPQVLQPF
Sbjct: 191 WNRTLGADHFFVTCHDVGVRAFEGSPLLIKNTIRVVCSPSYNVGFIPHKDVALPQVLQPF 250
Query: 236 ALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRF 295
ALP GG DVENRT LGFWAGHRNSKIRVILA VWENDTELDISNNRINRA G LVYQ+RF
Sbjct: 251 ALPAGGNDVENRTTLGFWAGHRNSKIRVILAHVWENDTELDISNNRINRATGHLVYQKRF 310
Query: 296 YKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ 355
Y+TKFCICPGGSQVNSARI DSIHYGC+PVILS+YYDLPFNDIL+WRKFAV+L E+DVY
Sbjct: 311 YRTKFCICPGGSQVNSARITDSIHYGCIPVILSDYYDLPFNDILNWRKFAVVLREQDVYN 370
Query: 356 LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIKY 410
LKQILKNI +EFVSLHNNLVKVQKHFQWNSPPV+FDAFHM+MYELWLR HV+KY
Sbjct: 371 LKQILKNIPHSEFVSLHNNLVKVQKHFQWNSPPVKFDAFHMIMYELWLRHHVVKY 425
>gi|297802132|ref|XP_002868950.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314786|gb|EFH45209.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 428
Score = 647 bits (1670), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 313/410 (76%), Positives = 341/410 (83%), Gaps = 15/410 (3%)
Query: 16 LCSLRSSLLTLALVTLLCFTCLSFNSFRSYP-----LQNNFNSTLSFAINKQDYSDLG-- 68
LCSL+ SLLT+A++T LS NS R+ P + + +F + +D G
Sbjct: 19 LCSLKGSLLTVAILTFGSLFYLSLNSMRTSPPSPVIVVTPIHVPQTFVNEDKTDNDDGAA 78
Query: 69 --------SDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFF 120
SDV+HSP F LNY EM FK+YIYPDGDPNTFYQTPRK+TGKYASEGYFF
Sbjct: 79 PTTEAETYSDVYHSPEAFRLNYAEMEKRFKVYIYPDGDPNTFYQTPRKVTGKYASEGYFF 138
Query: 121 QNIRESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTL 180
QNIRESRF T DP+ A LFFIPISCHKMRGKGTSYENMTVIV++YVD LI KYPYWNRTL
Sbjct: 139 QNIRESRFRTLDPEEADLFFIPISCHKMRGKGTSYENMTVIVQNYVDGLIAKYPYWNRTL 198
Query: 181 GADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRG 240
GADHFFVTCHDVGVRA EG LIKN IRVVCSPSY+V FIPHKDVALPQVLQPFALP G
Sbjct: 199 GADHFFVTCHDVGVRAFEGSLLLIKNTIRVVCSPSYNVGFIPHKDVALPQVLQPFALPAG 258
Query: 241 GRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKF 300
G DVENRT LGFWAGHRNSKIRVILARVWENDTELDISNNRINRA G LVYQ+RFY+TKF
Sbjct: 259 GNDVENRTTLGFWAGHRNSKIRVILARVWENDTELDISNNRINRATGHLVYQKRFYRTKF 318
Query: 301 CICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQIL 360
CICPGGSQVNSARI DSIHYGC+PVILS+YYDLPFNDIL+WRKFAV+L E+DVY LKQIL
Sbjct: 319 CICPGGSQVNSARITDSIHYGCIPVILSDYYDLPFNDILNWRKFAVVLREQDVYNLKQIL 378
Query: 361 KNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIKY 410
KNI +EF+SLHNNLVKVQKHFQWNSPPV+FDAFHM+MYELWLR HVIKY
Sbjct: 379 KNIPHSEFISLHNNLVKVQKHFQWNSPPVKFDAFHMIMYELWLRHHVIKY 428
>gi|357485421|ref|XP_003612998.1| Exostosin-like protein [Medicago truncatula]
gi|355514333|gb|AES95956.1| Exostosin-like protein [Medicago truncatula]
Length = 415
Score = 647 bits (1668), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 321/412 (77%), Positives = 350/412 (84%), Gaps = 7/412 (1%)
Query: 6 AAQPNQAQT---TLCSLRSSLLTLALVTLLCFTCLSFN-SFRSYPLQNNFNSTLSFAINK 61
+ + NQ Q+ TL SLR SLLTLA++TLL FT LS S S + N + A
Sbjct: 4 SVKQNQQQSSFHTLFSLRGSLLTLAILTLLSFTYLSLKYSTPSSQGPESVNVKVVDAGKN 63
Query: 62 QDYSDLG---SDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGY 118
++ D G DV+HSP VF LN+ EM FK+YIYPDGD TFYQTPRKLTGKYASEGY
Sbjct: 64 EEEDDGGDEFGDVYHSPRVFKLNFAEMEKKFKVYIYPDGDSKTFYQTPRKLTGKYASEGY 123
Query: 119 FFQNIRESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNR 178
FFQNIRESRF T DPD AHLFFIPISCHKMRGKGTSYENMT+IV++YV+SLI+KYPYWNR
Sbjct: 124 FFQNIRESRFRTLDPDEAHLFFIPISCHKMRGKGTSYENMTIIVQNYVESLISKYPYWNR 183
Query: 179 TLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALP 238
TLGADHFFVTCHDVGVRATEG+P L+KN+IR VCSPSYDV FIPHKDVALPQVLQPFALP
Sbjct: 184 TLGADHFFVTCHDVGVRATEGLPLLVKNSIRAVCSPSYDVGFIPHKDVALPQVLQPFALP 243
Query: 239 RGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKT 298
GG DVENRT LGFWAGHRNSKIRVILARVWENDTELDISNNRI+RA G LVYQ+RFY T
Sbjct: 244 AGGNDVENRTSLGFWAGHRNSKIRVILARVWENDTELDISNNRISRATGHLVYQKRFYST 303
Query: 299 KFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQ 358
KFCICPGGSQVNSARIADSIHYGC+PVILSNYYDLPFNDILDWRKFAV+L E DVYQLKQ
Sbjct: 304 KFCICPGGSQVNSARIADSIHYGCIPVILSNYYDLPFNDILDWRKFAVVLKESDVYQLKQ 363
Query: 359 ILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIKY 410
ILKN S EFV+LHNNLVK+QKHFQWNSPPVR+DAFHMVMY+LWLR H I+Y
Sbjct: 364 ILKNKSQDEFVALHNNLVKIQKHFQWNSPPVRYDAFHMVMYDLWLRHHTIQY 415
>gi|363808264|ref|NP_001242494.1| uncharacterized protein LOC100777368 [Glycine max]
gi|255635742|gb|ACU18220.1| unknown [Glycine max]
Length = 409
Score = 641 bits (1653), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 294/341 (86%), Positives = 317/341 (92%)
Query: 70 DVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFL 129
D FHSP VF LNY EM FK+YIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR+SRF
Sbjct: 69 DTFHSPRVFKLNYEEMEKKFKVYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRDSRFR 128
Query: 130 THDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTC 189
T +PD AHLFFIPISCHKMRGKGTSYENMT+IV++YV+SLI+KYPYWNRTLGADHFFVTC
Sbjct: 129 TENPDEAHLFFIPISCHKMRGKGTSYENMTIIVQNYVESLISKYPYWNRTLGADHFFVTC 188
Query: 190 HDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVENRTV 249
HDVGVRATEG+ FL+KN+IR VCSPSYDV FIPHKDVALPQVLQPFALP GG D+ENRT
Sbjct: 189 HDVGVRATEGLEFLVKNSIRAVCSPSYDVGFIPHKDVALPQVLQPFALPAGGNDIENRTT 248
Query: 250 LGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQV 309
LGFWAGHRNSKIRVILARVWENDTELDISNNRI+RA G LVYQ+RFY++KFCICPGGSQV
Sbjct: 249 LGFWAGHRNSKIRVILARVWENDTELDISNNRISRATGHLVYQKRFYRSKFCICPGGSQV 308
Query: 310 NSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFV 369
NSARIADSIHYGC+PV+LSNYYDLPFNDILDW KFAV+L E DVYQLKQILKNISDAEFV
Sbjct: 309 NSARIADSIHYGCIPVMLSNYYDLPFNDILDWNKFAVVLKESDVYQLKQILKNISDAEFV 368
Query: 370 SLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIKY 410
+LHNNLVKVQKHFQWNSP +RFDAFH+VMY+LWLR H IKY
Sbjct: 369 TLHNNLVKVQKHFQWNSPSIRFDAFHLVMYDLWLRHHTIKY 409
>gi|225464075|ref|XP_002268836.1| PREDICTED: probable glycosyltransferase At5g03795-like [Vitis
vinifera]
Length = 417
Score = 636 bits (1641), Expect = e-180, Method: Compositional matrix adjust.
Identities = 308/406 (75%), Positives = 349/406 (85%), Gaps = 11/406 (2%)
Query: 16 LCSLRSSLLTLALVTLLCFTCLSFNSFRSYPLQNNFNSTLSFAI-----NKQDYSDLG-- 68
CSL+ SLL LAL+TLL FTC+SF RS + N I + + Y G
Sbjct: 12 FCSLQRSLLALALLTLLAFTCVSFTFLRSSSTDLSPNPPKVAVIGDLVESSEIYQSPGKN 71
Query: 69 ----SDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR 124
S+++H P +F LNY EM FK+++YPDGDPNT+YQTPRKLTGKYASEGYFFQNIR
Sbjct: 72 VDEFSEIYHLPDLFRLNYAEMEMKFKVFVYPDGDPNTYYQTPRKLTGKYASEGYFFQNIR 131
Query: 125 ESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADH 184
+S+F T DPD+AHLFFIPISCHKMRGKGTSYENMT+IV++YV SL++KYPYWNRTLGADH
Sbjct: 132 QSQFRTDDPDQAHLFFIPISCHKMRGKGTSYENMTIIVQNYVQSLMSKYPYWNRTLGADH 191
Query: 185 FFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDV 244
FF+TCHDVGVRATEGVP L+KN+IRVVCSPSYDV FIPHKDVALPQVLQPFALP GGRD+
Sbjct: 192 FFLTCHDVGVRATEGVPLLVKNSIRVVCSPSYDVGFIPHKDVALPQVLQPFALPTGGRDI 251
Query: 245 ENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICP 304
+NRT LGFWAGHRNSKIRVILAR+WENDTELDI NNRINRA G LVYQ +FY+TKFCICP
Sbjct: 252 KNRTTLGFWAGHRNSKIRVILARIWENDTELDIKNNRINRATGHLVYQNKFYRTKFCICP 311
Query: 305 GGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNIS 364
GGSQVNSARIADSIHYGCVPVILS+YYDLPFNDILDWRKF+VIL ERDVY+LK ILK+I
Sbjct: 312 GGSQVNSARIADSIHYGCVPVILSDYYDLPFNDILDWRKFSVILKERDVYRLKYILKDIP 371
Query: 365 DAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIKY 410
DAEF++LH+NLVKVQKHFQWN+PP+++DAFHMVMYELWLR HVIKY
Sbjct: 372 DAEFIALHDNLVKVQKHFQWNTPPIKYDAFHMVMYELWLRHHVIKY 417
>gi|224071347|ref|XP_002303415.1| predicted protein [Populus trichocarpa]
gi|222840847|gb|EEE78394.1| predicted protein [Populus trichocarpa]
Length = 326
Score = 619 bits (1596), Expect = e-175, Method: Compositional matrix adjust.
Identities = 292/326 (89%), Positives = 307/326 (94%)
Query: 85 MVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPIS 144
M NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRF T DPD+AHLFFIPIS
Sbjct: 1 MEHNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFRTLDPDQAHLFFIPIS 60
Query: 145 CHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLI 204
CHKMRGKGTSYENMTVIV++YV+SLI KY YWNRTLGADHFFVTCHDVGVRATEGVPFLI
Sbjct: 61 CHKMRGKGTSYENMTVIVENYVESLIAKYSYWNRTLGADHFFVTCHDVGVRATEGVPFLI 120
Query: 205 KNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVENRTVLGFWAGHRNSKIRVI 264
KNAIRVVCSPSYDV FIPHKDVALPQVLQPFALP GG DVENRT LGFWAGHRNS+IRVI
Sbjct: 121 KNAIRVVCSPSYDVGFIPHKDVALPQVLQPFALPAGGNDVENRTTLGFWAGHRNSRIRVI 180
Query: 265 LARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVP 324
LARVWENDTELDIS+NRINRA G LVYQ+RFY TKFCICPGGSQVNSARIADSIHYGCVP
Sbjct: 181 LARVWENDTELDISSNRINRATGHLVYQKRFYGTKFCICPGGSQVNSARIADSIHYGCVP 240
Query: 325 VILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
VILSNYYDLPFNDILDW KF+VIL E+DVYQLKQILK+I D +FVSLH NL+KVQKHFQW
Sbjct: 241 VILSNYYDLPFNDILDWHKFSVILKEQDVYQLKQILKDIPDNQFVSLHKNLIKVQKHFQW 300
Query: 385 NSPPVRFDAFHMVMYELWLRRHVIKY 410
NSPP++ DAFHMVMY+LWLRRHVIKY
Sbjct: 301 NSPPIKNDAFHMVMYDLWLRRHVIKY 326
>gi|242064274|ref|XP_002453426.1| hypothetical protein SORBIDRAFT_04g005890 [Sorghum bicolor]
gi|241933257|gb|EES06402.1| hypothetical protein SORBIDRAFT_04g005890 [Sorghum bicolor]
Length = 403
Score = 619 bits (1596), Expect = e-175, Method: Compositional matrix adjust.
Identities = 286/342 (83%), Positives = 313/342 (91%)
Query: 69 SDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRF 128
S V+HSP F Y EM +FK+YIYPDGDP TFYQTPRKLTGKYASEGYFFQNIRESRF
Sbjct: 62 SSVYHSPEAFAAGYAEMERSFKVYIYPDGDPKTFYQTPRKLTGKYASEGYFFQNIRESRF 121
Query: 129 LTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVT 188
T DPD+AHLFF+PIS HKMRGKGT+YENMTVIVKDYV+ LINKYPYWNRTLGADHFFVT
Sbjct: 122 RTDDPDQAHLFFVPISPHKMRGKGTTYENMTVIVKDYVEGLINKYPYWNRTLGADHFFVT 181
Query: 189 CHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVENRT 248
CHDVGVRA EG+PF++KN+IRVVCSPSY+V FIPHKD+ALPQVLQPFALP GG D+ENRT
Sbjct: 182 CHDVGVRAFEGLPFMVKNSIRVVCSPSYNVDFIPHKDIALPQVLQPFALPEGGNDIENRT 241
Query: 249 VLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQ 308
+LGFWAGHRNSKIRVILARVWENDTEL ISNNRI+RAIG LVYQ++FY+TKFCICPGGSQ
Sbjct: 242 ILGFWAGHRNSKIRVILARVWENDTELAISNNRISRAIGELVYQKQFYRTKFCICPGGSQ 301
Query: 309 VNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEF 368
VNSARI+DSIHYGCVPVILS+YYDLPFND LDWRKFAVIL ERDVYQLK ILK+IS EF
Sbjct: 302 VNSARISDSIHYGCVPVILSDYYDLPFNDALDWRKFAVILRERDVYQLKNILKSISQEEF 361
Query: 369 VSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIKY 410
VSLH +LV+VQKHF W+SPPV +DAFHMVMYELWLR +VIKY
Sbjct: 362 VSLHKSLVQVQKHFVWHSPPVSYDAFHMVMYELWLRHNVIKY 403
>gi|195624236|gb|ACG33948.1| exostosin-like [Zea mays]
Length = 403
Score = 618 bits (1594), Expect = e-174, Method: Compositional matrix adjust.
Identities = 286/342 (83%), Positives = 311/342 (90%)
Query: 69 SDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRF 128
S V+HSP F Y EM FK+YIYPDGDP TFYQTPRKLTGKYASEGYFFQNIRESRF
Sbjct: 62 SSVYHSPEAFTAGYAEMERXFKVYIYPDGDPKTFYQTPRKLTGKYASEGYFFQNIRESRF 121
Query: 129 LTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVT 188
T DPD+AHLFF+PIS HKMRGKGTSYENMTVIVKDYV+ LINKYPYWNRTLGADHFFVT
Sbjct: 122 RTDDPDQAHLFFVPISPHKMRGKGTSYENMTVIVKDYVEGLINKYPYWNRTLGADHFFVT 181
Query: 189 CHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVENRT 248
CHDVGVRA EG+PF++KN+IRVVCSPSY+V FIPHKD+ALPQVLQPFALP GG D+ENRT
Sbjct: 182 CHDVGVRAFEGLPFMVKNSIRVVCSPSYNVDFIPHKDIALPQVLQPFALPEGGNDIENRT 241
Query: 249 VLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQ 308
LGFWAGHRNSKIRVILARVWENDTEL ISNNRI+RAIG LVYQ++FY+TKFCICPGGSQ
Sbjct: 242 TLGFWAGHRNSKIRVILARVWENDTELAISNNRISRAIGELVYQKQFYRTKFCICPGGSQ 301
Query: 309 VNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEF 368
VNSARI+DSIHYGCVPVILS+YYDLPFND LDWRKFAVIL ERDVYQLK ILK+IS EF
Sbjct: 302 VNSARISDSIHYGCVPVILSDYYDLPFNDALDWRKFAVILRERDVYQLKNILKSISQEEF 361
Query: 369 VSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIKY 410
VSLH +LV+VQKHF W+SPPV +DAFHM+MYELWLR +VIKY
Sbjct: 362 VSLHKSLVQVQKHFVWHSPPVSYDAFHMIMYELWLRHNVIKY 403
>gi|212721192|ref|NP_001132213.1| exostosin-like protein [Zea mays]
gi|194693780|gb|ACF80974.1| unknown [Zea mays]
gi|413935957|gb|AFW70508.1| exostosin-like protein [Zea mays]
Length = 403
Score = 617 bits (1592), Expect = e-174, Method: Compositional matrix adjust.
Identities = 285/342 (83%), Positives = 312/342 (91%)
Query: 69 SDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRF 128
S V+HSP F Y +M +FK+YIYPDGDP TFYQTPRKLTGKYASEGYFFQNIRESRF
Sbjct: 62 SSVYHSPEAFTAGYADMERSFKVYIYPDGDPKTFYQTPRKLTGKYASEGYFFQNIRESRF 121
Query: 129 LTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVT 188
T DPD+AHLFF+PIS HKMRGKGTSYENMTVIVKDYV+ LINKYPYWNRTLGADHFFVT
Sbjct: 122 RTDDPDQAHLFFVPISPHKMRGKGTSYENMTVIVKDYVEGLINKYPYWNRTLGADHFFVT 181
Query: 189 CHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVENRT 248
CHDVGVRA EG+PF++KN+IRVVCSPSY+V FIPHKD+ALPQVLQPFALP GG D+ENRT
Sbjct: 182 CHDVGVRAFEGLPFMVKNSIRVVCSPSYNVDFIPHKDIALPQVLQPFALPEGGNDIENRT 241
Query: 249 VLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQ 308
LGFWAGHRNSKIRVILARVWENDTEL ISNNRI+RAIG LVYQ++FY+TKFCICPGGSQ
Sbjct: 242 TLGFWAGHRNSKIRVILARVWENDTELAISNNRISRAIGELVYQKQFYRTKFCICPGGSQ 301
Query: 309 VNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEF 368
VNSARI+DSIHYGCVPVILS+YYDLPFND LDWRKFAVIL ERDVYQLK ILK+IS EF
Sbjct: 302 VNSARISDSIHYGCVPVILSDYYDLPFNDALDWRKFAVILRERDVYQLKNILKSISQEEF 361
Query: 369 VSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIKY 410
VSLH +LV+VQKHF W+SPPV +DAFHM+MYELWLR +VIKY
Sbjct: 362 VSLHKSLVQVQKHFVWHSPPVSYDAFHMIMYELWLRHNVIKY 403
>gi|358343604|ref|XP_003635890.1| Exostosin-like protein [Medicago truncatula]
gi|358344094|ref|XP_003636128.1| Exostosin-like protein [Medicago truncatula]
gi|355501825|gb|AES83028.1| Exostosin-like protein [Medicago truncatula]
gi|355502063|gb|AES83266.1| Exostosin-like protein [Medicago truncatula]
Length = 435
Score = 617 bits (1590), Expect = e-174, Method: Compositional matrix adjust.
Identities = 304/407 (74%), Positives = 333/407 (81%), Gaps = 23/407 (5%)
Query: 10 NQAQT---TLCSLRSSLLTLALVTLLCFTCLSFNSFRSYPLQNNFNSTLSFAINKQDYSD 66
NQ Q+ TL SLR SLLTLA++T L FT LS + + Q + A ++ D
Sbjct: 46 NQQQSSFHTLFSLRGSLLTLAILTHLSFTYLSLK-YSTPSSQVPVTVKVVDAGRNEEEDD 104
Query: 67 LG---SDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNI 123
G DV+HSP VF LN+ EM FK+YIYPDGD TFYQTPRKLTGKYASEGYFFQNI
Sbjct: 105 GGDEFGDVYHSPRVFKLNFAEMEKKFKVYIYPDGDSKTFYQTPRKLTGKYASEGYFFQNI 164
Query: 124 RESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGAD 183
RESRF T DPD+AHLFFIPISCHKMRGKGTSY+NMT+IV++YV+SLI+KYPYWNRTLGAD
Sbjct: 165 RESRFRTLDPDQAHLFFIPISCHKMRGKGTSYDNMTIIVQNYVESLISKYPYWNRTLGAD 224
Query: 184 HFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRD 243
HFFVTCHDVGVRATEG+P L+KN+IR VCSPSYDV FIPHKDVALPQVLQPFALP GG D
Sbjct: 225 HFFVTCHDVGVRATEGLPLLVKNSIRAVCSPSYDVGFIPHKDVALPQVLQPFALPAGGND 284
Query: 244 VENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCIC 303
VENRT LGFWAGHRNSKIRVILARVWENDTELDISNNRI TKFCIC
Sbjct: 285 VENRTSLGFWAGHRNSKIRVILARVWENDTELDISNNRI--------------YTKFCIC 330
Query: 304 PGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNI 363
PGGSQVNSARIADSIHYGC+PVILSNYYDLPFNDILDWRKFAV+ NE DVYQLKQILKN
Sbjct: 331 PGGSQVNSARIADSIHYGCIPVILSNYYDLPFNDILDWRKFAVVHNESDVYQLKQILKNK 390
Query: 364 SDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIKY 410
S EF++LHNNL +QKHFQWNSPPVR+DAFHMVMY+LWLR H I+Y
Sbjct: 391 SQDEFIALHNNL--IQKHFQWNSPPVRYDAFHMVMYDLWLRHHTIQY 435
>gi|357138738|ref|XP_003570946.1| PREDICTED: probable glycosyltransferase At5g11130-like
[Brachypodium distachyon]
Length = 405
Score = 615 bits (1585), Expect = e-173, Method: Compositional matrix adjust.
Identities = 281/340 (82%), Positives = 312/340 (91%)
Query: 71 VFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLT 130
V+HSP F Y EM +FK+YIYPDGDP TFYQTPRKLTGKYASEGYFFQNIRESRF T
Sbjct: 66 VYHSPEAFSAGYAEMERSFKVYIYPDGDPKTFYQTPRKLTGKYASEGYFFQNIRESRFRT 125
Query: 131 HDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCH 190
DPD A LFF+PIS HKMRGKGTSYENMT+IVKDYV+ LINKYPYWNRTLGADHFFVTCH
Sbjct: 126 EDPDSADLFFVPISPHKMRGKGTSYENMTIIVKDYVEGLINKYPYWNRTLGADHFFVTCH 185
Query: 191 DVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVENRTVL 250
DVGVRA EG+PF++KN+IRVVCSPSY+V FIPHKDVALPQVLQPFALP+GG DVENRT L
Sbjct: 186 DVGVRAFEGLPFMVKNSIRVVCSPSYNVDFIPHKDVALPQVLQPFALPKGGNDVENRTNL 245
Query: 251 GFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVN 310
GFWAGHRNSKIRVILARVWENDTEL ISNNRI+RAIG LVYQ++FY+TKFCICPGGSQVN
Sbjct: 246 GFWAGHRNSKIRVILARVWENDTELAISNNRISRAIGELVYQKQFYRTKFCICPGGSQVN 305
Query: 311 SARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVS 370
SARI+DSIHYGCVPVILS+YYDLPFND+LDW+KFA++L ERDVY+LK ILK+IS EFV+
Sbjct: 306 SARISDSIHYGCVPVILSDYYDLPFNDVLDWKKFAIVLKERDVYELKSILKSISQEEFVA 365
Query: 371 LHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIKY 410
LHN+LV++QKHF W+SPP+ +DAFHMVMYELWLR HVIKY
Sbjct: 366 LHNSLVQIQKHFVWHSPPIPYDAFHMVMYELWLRHHVIKY 405
>gi|115444695|ref|NP_001046127.1| Os02g0187200 [Oryza sativa Japonica Group]
gi|46390046|dbj|BAD15422.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
gi|46390077|dbj|BAD15452.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
gi|113535658|dbj|BAF08041.1| Os02g0187200 [Oryza sativa Japonica Group]
gi|215694561|dbj|BAG89554.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 402
Score = 613 bits (1580), Expect = e-173, Method: Compositional matrix adjust.
Identities = 282/340 (82%), Positives = 310/340 (91%)
Query: 71 VFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLT 130
V+HS F Y EM +FK+Y+YPDGDP TFYQTPRKLTGKYASEGYFFQNIRESRF T
Sbjct: 63 VYHSAEAFAAGYAEMERSFKVYMYPDGDPKTFYQTPRKLTGKYASEGYFFQNIRESRFRT 122
Query: 131 HDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCH 190
DPD+AHLFF+PIS HKMRGKGTSYENMT+IVKDYV+ LINKYPYWNRTLGADHFFVTCH
Sbjct: 123 GDPDKAHLFFVPISPHKMRGKGTSYENMTIIVKDYVEGLINKYPYWNRTLGADHFFVTCH 182
Query: 191 DVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVENRTVL 250
DVGVRA EG+PF++KN+IRVVCSPSY+V FIPHKD+ALPQVLQPFALP GG DVENRT+L
Sbjct: 183 DVGVRAFEGLPFMVKNSIRVVCSPSYNVDFIPHKDIALPQVLQPFALPEGGNDVENRTIL 242
Query: 251 GFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVN 310
GFWAGHRNSKIRVILA+VWENDTEL ISNNRI+RAIG LVYQ++FY TKFCICPGGSQVN
Sbjct: 243 GFWAGHRNSKIRVILAQVWENDTELAISNNRISRAIGELVYQKQFYHTKFCICPGGSQVN 302
Query: 311 SARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVS 370
SARI+DSIHYGCVPVILS+YYDLPFNDILDWRKFAV+L ERDVYQLK ILK+IS EFV
Sbjct: 303 SARISDSIHYGCVPVILSDYYDLPFNDILDWRKFAVVLKERDVYQLKSILKSISQEEFVE 362
Query: 371 LHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIKY 410
LH +LV+VQKHF W+SPP+ +DAFHMVMYELWLR HVIKY
Sbjct: 363 LHKSLVQVQKHFVWHSPPLPYDAFHMVMYELWLRHHVIKY 402
>gi|226498050|ref|NP_001147269.1| exostosin-like [Zea mays]
gi|195609318|gb|ACG26489.1| exostosin-like [Zea mays]
gi|413926327|gb|AFW66259.1| exostosin-like protein [Zea mays]
Length = 403
Score = 610 bits (1573), Expect = e-172, Method: Compositional matrix adjust.
Identities = 282/342 (82%), Positives = 311/342 (90%)
Query: 69 SDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRF 128
S V+HSP F + Y EM +FK+YIYPDGDP TFYQTPRKLTGKYASEGYFFQNIRESRF
Sbjct: 62 STVYHSPEAFAVGYAEMERSFKVYIYPDGDPKTFYQTPRKLTGKYASEGYFFQNIRESRF 121
Query: 129 LTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVT 188
T DPD+AHLFF+PIS HKMRGKGTSYENMTVIVKDYV+ LINKYPYWNRTLGADHFFVT
Sbjct: 122 RTDDPDQAHLFFVPISPHKMRGKGTSYENMTVIVKDYVEGLINKYPYWNRTLGADHFFVT 181
Query: 189 CHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVENRT 248
CHDVGVRA EG+ F++KN+IRVVCSPSY+V FIPHKD+ALPQVLQPFAL GG D++NR
Sbjct: 182 CHDVGVRAFEGLKFMVKNSIRVVCSPSYNVDFIPHKDIALPQVLQPFALHEGGNDIDNRV 241
Query: 249 VLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQ 308
+LGFWAGHRNSKIRVILARVWENDTEL ISNNRI+RAIG LVYQ++FY+TKFCICPGGSQ
Sbjct: 242 ILGFWAGHRNSKIRVILARVWENDTELAISNNRISRAIGELVYQKQFYRTKFCICPGGSQ 301
Query: 309 VNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEF 368
VNSARI+DSIHYGCVPVILS+YYDLPFND LDWRKFAVIL ERDVYQLK ILK+IS EF
Sbjct: 302 VNSARISDSIHYGCVPVILSDYYDLPFNDALDWRKFAVILRERDVYQLKNILKSISQEEF 361
Query: 369 VSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIKY 410
+SLH +LV+VQKHF W+SPPV +DAFHMVMYELWLR +VIKY
Sbjct: 362 ISLHKSLVQVQKHFVWHSPPVSYDAFHMVMYELWLRHNVIKY 403
>gi|296088784|emb|CBI38234.3| unnamed protein product [Vitis vinifera]
Length = 326
Score = 609 bits (1570), Expect = e-172, Method: Compositional matrix adjust.
Identities = 277/326 (84%), Positives = 308/326 (94%)
Query: 85 MVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPIS 144
M FK+++YPDGDPNT+YQTPRKLTGKYASEGYFFQNIR+S+F T DPD+AHLFFIPIS
Sbjct: 1 MEMKFKVFVYPDGDPNTYYQTPRKLTGKYASEGYFFQNIRQSQFRTDDPDQAHLFFIPIS 60
Query: 145 CHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLI 204
CHKMRGKGTSYENMT+IV++YV SL++KYPYWNRTLGADHFF+TCHDVGVRATEGVP L+
Sbjct: 61 CHKMRGKGTSYENMTIIVQNYVQSLMSKYPYWNRTLGADHFFLTCHDVGVRATEGVPLLV 120
Query: 205 KNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVENRTVLGFWAGHRNSKIRVI 264
KN+IRVVCSPSYDV FIPHKDVALPQVLQPFALP GGRD++NRT LGFWAGHRNSKIRVI
Sbjct: 121 KNSIRVVCSPSYDVGFIPHKDVALPQVLQPFALPTGGRDIKNRTTLGFWAGHRNSKIRVI 180
Query: 265 LARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVP 324
LAR+WENDTELDI NNRINRA G LVYQ +FY+TKFCICPGGSQVNSARIADSIHYGCVP
Sbjct: 181 LARIWENDTELDIKNNRINRATGHLVYQNKFYRTKFCICPGGSQVNSARIADSIHYGCVP 240
Query: 325 VILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
VILS+YYDLPFNDILDWRKF+VIL ERDVY+LK ILK+I DAEF++LH+NLVKVQKHFQW
Sbjct: 241 VILSDYYDLPFNDILDWRKFSVILKERDVYRLKYILKDIPDAEFIALHDNLVKVQKHFQW 300
Query: 385 NSPPVRFDAFHMVMYELWLRRHVIKY 410
N+PP+++DAFHMVMYELWLR HVIKY
Sbjct: 301 NTPPIKYDAFHMVMYELWLRHHVIKY 326
>gi|125581086|gb|EAZ22017.1| hypothetical protein OsJ_05674 [Oryza sativa Japonica Group]
Length = 408
Score = 606 bits (1562), Expect = e-171, Method: Compositional matrix adjust.
Identities = 282/346 (81%), Positives = 310/346 (89%), Gaps = 6/346 (1%)
Query: 71 VFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLT 130
V+HS F Y EM +FK+Y+YPDGDP TFYQTPRKLTGKYASEGYFFQNIRESRF T
Sbjct: 63 VYHSAEAFAAGYAEMERSFKVYMYPDGDPKTFYQTPRKLTGKYASEGYFFQNIRESRFRT 122
Query: 131 HDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCH 190
DPD+AHLFF+PIS HKMRGKGTSYENMT+IVKDYV+ LINKYPYWNRTLGADHFFVTCH
Sbjct: 123 GDPDKAHLFFVPISPHKMRGKGTSYENMTIIVKDYVEGLINKYPYWNRTLGADHFFVTCH 182
Query: 191 DVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVENRTVL 250
DVGVRA EG+PF++KN+IRVVCSPSY+V FIPHKD+ALPQVLQPFALP GG DVENRT+L
Sbjct: 183 DVGVRAFEGLPFMVKNSIRVVCSPSYNVDFIPHKDIALPQVLQPFALPEGGNDVENRTIL 242
Query: 251 GFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVN 310
GFWAGHRNSKIRVILA+VWENDTEL ISNNRI+RAIG LVYQ++FY TKFCICPGGSQVN
Sbjct: 243 GFWAGHRNSKIRVILAQVWENDTELAISNNRISRAIGELVYQKQFYHTKFCICPGGSQVN 302
Query: 311 SARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVS 370
SARI+DSIHYGCVPVILS+YYDLPFNDILDWRKFAV+L ERDVYQLK ILK+IS EFV
Sbjct: 303 SARISDSIHYGCVPVILSDYYDLPFNDILDWRKFAVVLKERDVYQLKSILKSISQEEFVE 362
Query: 371 LHNNLV------KVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIKY 410
LH +LV +VQKHF W+SPP+ +DAFHMVMYELWLR HVIKY
Sbjct: 363 LHKSLVQNISCPQVQKHFVWHSPPLPYDAFHMVMYELWLRHHVIKY 408
>gi|125538390|gb|EAY84785.1| hypothetical protein OsI_06153 [Oryza sativa Indica Group]
Length = 332
Score = 592 bits (1526), Expect = e-166, Method: Compositional matrix adjust.
Identities = 276/332 (83%), Positives = 304/332 (91%), Gaps = 6/332 (1%)
Query: 85 MVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPIS 144
M +FK+Y+YPDGDP TFYQTPRKLTGKYASEGYFFQNIRESRF T DPD+AHLFF+PIS
Sbjct: 1 MERSFKVYMYPDGDPKTFYQTPRKLTGKYASEGYFFQNIRESRFRTGDPDKAHLFFVPIS 60
Query: 145 CHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLI 204
HKMRGKGTSYENMT+IVKDYV+ LINKYPYWNRTLGADHFFVTCHDVGVRA EG+PF++
Sbjct: 61 PHKMRGKGTSYENMTIIVKDYVEGLINKYPYWNRTLGADHFFVTCHDVGVRAFEGLPFMV 120
Query: 205 KNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVENRTVLGFWAGHRNSKIRVI 264
KN+IRVVCSPSY+V FIPHKD+ALPQVLQPFALP GG DVENRT+LGFWAGHRNSKIRVI
Sbjct: 121 KNSIRVVCSPSYNVDFIPHKDIALPQVLQPFALPEGGNDVENRTILGFWAGHRNSKIRVI 180
Query: 265 LARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVP 324
LA+VWENDTEL ISNNRI+RAIG LVYQ++FY+TKFCICPGGSQVNSARI+DSIHYGCVP
Sbjct: 181 LAQVWENDTELAISNNRISRAIGELVYQKQFYRTKFCICPGGSQVNSARISDSIHYGCVP 240
Query: 325 VILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLV------KV 378
VILS+YYDLPFNDILDWRKFAV+L ERDVYQLK ILK+IS EFV LH +LV +V
Sbjct: 241 VILSDYYDLPFNDILDWRKFAVVLKERDVYQLKSILKSISQEEFVELHKSLVQNISCPQV 300
Query: 379 QKHFQWNSPPVRFDAFHMVMYELWLRRHVIKY 410
QKHF W+SPP+ +DAFHMVMYELWLR HVIKY
Sbjct: 301 QKHFVWHSPPLPYDAFHMVMYELWLRHHVIKY 332
>gi|227202794|dbj|BAH56870.1| AT4G38040 [Arabidopsis thaliana]
Length = 407
Score = 588 bits (1516), Expect = e-165, Method: Compositional matrix adjust.
Identities = 286/385 (74%), Positives = 315/385 (81%), Gaps = 20/385 (5%)
Query: 12 AQTTLCSLRSSLLTLALVTLLCFTCLSFNSFRSYP---------------LQNNFNS-TL 55
+ LCSL+SSLLT+A++T + LS NS R+ P N + +
Sbjct: 15 GSSPLCSLKSSLLTVAILTFISLFYLSLNSLRTSPPSPVIVVTPIHVPHTFVNEYKTDNE 74
Query: 56 SFAINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYAS 115
+ + ++ Y SDV+HSP F LNY EM FK+YIYPDGDPNTFYQTPRK+TGKYAS
Sbjct: 75 TPTMEEETY----SDVYHSPEAFRLNYAEMEKRFKVYIYPDGDPNTFYQTPRKVTGKYAS 130
Query: 116 EGYFFQNIRESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPY 175
EGYFFQNIRESRF T DPD A LFFIPISCHKMRGKGTSYENMTVIV++YVD LI KYPY
Sbjct: 131 EGYFFQNIRESRFRTLDPDEADLFFIPISCHKMRGKGTSYENMTVIVQNYVDGLIAKYPY 190
Query: 176 WNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPF 235
WNRTLGADHFFVTCHDVGVRA EG P LIKN IRVVCSPSY+V FIPHKDVALPQVLQPF
Sbjct: 191 WNRTLGADHFFVTCHDVGVRAFEGSPLLIKNTIRVVCSPSYNVGFIPHKDVALPQVLQPF 250
Query: 236 ALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRF 295
ALP GG DVENRT LGFWAGHRNSKIRVILA VWENDTELDISNNRINRA G LVYQ+RF
Sbjct: 251 ALPAGGNDVENRTTLGFWAGHRNSKIRVILAHVWENDTELDISNNRINRATGHLVYQKRF 310
Query: 296 YKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ 355
Y+TKFCICPGGSQVNSARI DSIHYGC+PVILS+YYDLPFNDIL+WRKFAV+L E+DVY
Sbjct: 311 YRTKFCICPGGSQVNSARITDSIHYGCIPVILSDYYDLPFNDILNWRKFAVVLREQDVYN 370
Query: 356 LKQILKNISDAEFVSLHNNLVKVQK 380
LKQILKNI +EFVSLHNNLVKV++
Sbjct: 371 LKQILKNIPHSEFVSLHNNLVKVKR 395
>gi|302820946|ref|XP_002992138.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300140064|gb|EFJ06793.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 342
Score = 513 bits (1322), Expect = e-143, Method: Compositional matrix adjust.
Identities = 237/340 (69%), Positives = 285/340 (83%), Gaps = 14/340 (4%)
Query: 85 MVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPIS 144
M+ FKI+IYPDGDPNT+YQTPRK+TGKYASEGYFFQN+RES+F+T +P++AHLFFIPIS
Sbjct: 1 MLKRFKIFIYPDGDPNTYYQTPRKITGKYASEGYFFQNLRESKFVTKNPNKAHLFFIPIS 60
Query: 145 CHKMRGK-------------GTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHD 191
CHKMRGK G SYE M IV++YV+ LI KYPYWNRTLGADHFFVTCHD
Sbjct: 61 CHKMRGKVPYYLTSNWNKMQGISYEKMADIVQEYVEGLIVKYPYWNRTLGADHFFVTCHD 120
Query: 192 VGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVENRTVLG 251
VG RAT V L+KN+IRVVCSPSY+ FIPHKD+A+PQVLQPFALPRGG DV NRT+LG
Sbjct: 121 VGARATNKVANLVKNSIRVVCSPSYNGDFIPHKDIAMPQVLQPFALPRGGNDVRNRTILG 180
Query: 252 FWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNS 311
FWAGHRNSKIRV+LA++WE D L ISNNRI+RA G LVYQ++FY++KFCICPGGSQVNS
Sbjct: 181 FWAGHRNSKIRVVLAKLWEEDDVLAISNNRISRATGELVYQKQFYRSKFCICPGGSQVNS 240
Query: 312 ARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSL 371
ARI DSIHYGCVPVILS++YDLPFND+LDW++FA++L ERDV LK L+++S +++SL
Sbjct: 241 ARIVDSIHYGCVPVILSDHYDLPFNDVLDWKRFALLLRERDVGDLKLKLQSVSKEQYLSL 300
Query: 372 HNNLVK-VQKHFQWNSPPVRFDAFHMVMYELWLRRHVIKY 410
H LV+ VQ F+W++PP +DAFHMV+YELWLR ++Y
Sbjct: 301 HRGLVEVVQDRFEWHTPPRPYDAFHMVVYELWLRHFTVQY 340
>gi|51535578|dbj|BAD37522.1| pectin-glucuronyltransferase-like [Oryza sativa Japonica Group]
Length = 342
Score = 512 bits (1319), Expect = e-142, Method: Compositional matrix adjust.
Identities = 242/342 (70%), Positives = 286/342 (83%), Gaps = 16/342 (4%)
Query: 85 MVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPIS 144
M +F++++YPDGDP TFYQTPRKLTGKYASEGYFFQNIRESRF T D ++AHLFF+PIS
Sbjct: 1 MERSFRVFVYPDGDPGTFYQTPRKLTGKYASEGYFFQNIRESRFRTDDLEKAHLFFVPIS 60
Query: 145 CHKMRGKGTS------YENMTVIVKDYVDSLINKYPYW----------NRTLGADHFFVT 188
HKMRGK S Y + + ++ Y S++ YW + +GADHFFVT
Sbjct: 61 PHKMRGKVPSSLLLVTYAWLILHIRSYDRSILFLDLYWWCPLCSSFRGHWGVGADHFFVT 120
Query: 189 CHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVENRT 248
CHDVGVRA EG+PF+IKN+IRVVCSPSY+ +IPHKDVALPQ+LQPFALP GG D+ENRT
Sbjct: 121 CHDVGVRAFEGLPFIIKNSIRVVCSPSYNAGYIPHKDVALPQILQPFALPAGGNDIENRT 180
Query: 249 VLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQ 308
+LGFWAGHRNSKIRVILAR+WENDTEL ISNNRINRAIG LVYQ+ F++TKFC+CPGGSQ
Sbjct: 181 ILGFWAGHRNSKIRVILARIWENDTELAISNNRINRAIGNLVYQKHFFRTKFCVCPGGSQ 240
Query: 309 VNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEF 368
VNSARI+DSIHYGC+PVILS+YYDL F+ IL+WRKFAV+L E DVY+LK ILK++S EF
Sbjct: 241 VNSARISDSIHYGCMPVILSDYYDLRFSGILNWRKFAVVLKESDVYELKSILKSLSQKEF 300
Query: 369 VSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIKY 410
VSLH +LV+VQKHF+W+SPPV +DAFHM+MYELWLR HVIKY
Sbjct: 301 VSLHKSLVQVQKHFEWHSPPVPYDAFHMIMYELWLRHHVIKY 342
>gi|168019297|ref|XP_001762181.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686585|gb|EDQ72973.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 344
Score = 506 bits (1302), Expect = e-141, Method: Compositional matrix adjust.
Identities = 227/343 (66%), Positives = 286/343 (83%), Gaps = 1/343 (0%)
Query: 69 SDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRF 128
V+HS +F ++ EM FK+Y+YPDGDP T+YQTPRKLTGKY+SEGYFFQN+RESRF
Sbjct: 2 EQVYHSTDIFQRSFKEMEEKFKVYVYPDGDPETYYQTPRKLTGKYSSEGYFFQNLRESRF 61
Query: 129 LTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVT 188
+T+D A LFF+P+SCHKMRGKG SYE M IV+ YV+SLI KYP+WNRT+GADHFFVT
Sbjct: 62 VTNDSAAADLFFLPVSCHKMRGKGLSYEKMADIVRAYVESLIIKYPFWNRTVGADHFFVT 121
Query: 189 CHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVENRT 248
CHDVGVRAT V L+KN+IRVVCSPSY+ +FIPHKDVALPQVLQPF LP GG D+ NRT
Sbjct: 122 CHDVGVRATAKVEHLVKNSIRVVCSPSYNGSFIPHKDVALPQVLQPFPLPAGGDDIHNRT 181
Query: 249 VLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQ 308
VLGFWAGHRNSK+RV LA W+ D L ++NNR+NR+ G +YQ +FY++KFCICP GSQ
Sbjct: 182 VLGFWAGHRNSKVRVNLADAWQYDPILFVANNRLNRSTGDYIYQNQFYRSKFCICPAGSQ 241
Query: 309 VNSARIADSIHYGCVP-VILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAE 367
VNSARIA+SIHYGCVP VI++++YDLPFNDILDWRKF++++ ER+ LK+IL+ ++ +
Sbjct: 242 VNSARIAESIHYGCVPEVIMADFYDLPFNDILDWRKFSLVVREREYDNLKKILQAVTVQK 301
Query: 368 FVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIKY 410
+ LH + +V++HF+W+SPP+++DAFHMVMYELWLRR I++
Sbjct: 302 YRMLHAGVRQVRRHFEWHSPPIKYDAFHMVMYELWLRRFTIRF 344
>gi|302790752|ref|XP_002977143.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300155119|gb|EFJ21752.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 342
Score = 494 bits (1272), Expect = e-137, Method: Compositional matrix adjust.
Identities = 232/340 (68%), Positives = 276/340 (81%), Gaps = 14/340 (4%)
Query: 85 MVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPIS 144
M+ FKI+IYPDGDPNT+YQTPRK+TGKYASEGYFFQN+RES+F+T +P++AHLFFIPIS
Sbjct: 1 MLKRFKIFIYPDGDPNTYYQTPRKITGKYASEGYFFQNLRESKFVTKNPNKAHLFFIPIS 60
Query: 145 CHKMRGK-------------GTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHD 191
CHKMRGK G SYE M IV++YV+ LI KYPYWNRTLGADHFFVTCHD
Sbjct: 61 CHKMRGKVPYYLTSNWNKMQGISYEKMADIVQEYVEGLIVKYPYWNRTLGADHFFVTCHD 120
Query: 192 VGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVENRTVLG 251
VG RAT V L+KN+IRVVCSPSY+ FIPHKD+A+PQVLQPFALPRGG DV NRT+LG
Sbjct: 121 VGARATNKVANLVKNSIRVVCSPSYNGDFIPHKDIAMPQVLQPFALPRGGNDVRNRTILG 180
Query: 252 FWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNS 311
FWAGHRNSKIRV+LA++WE D L ISNNRI+RA G LVYQ++FY++KFCICPGGSQVNS
Sbjct: 181 FWAGHRNSKIRVVLAKLWEEDDVLAISNNRISRATGELVYQKQFYRSKFCICPGGSQVNS 240
Query: 312 ARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSL 371
ARI DSIHYGCVPVILS++YDLPFND+LDW++FA++L ERDV LK + L
Sbjct: 241 ARIVDSIHYGCVPVILSDHYDLPFNDVLDWKRFALLLRERDVGDLKLFFFSFFLFSSRVL 300
Query: 372 HNNLVK-VQKHFQWNSPPVRFDAFHMVMYELWLRRHVIKY 410
+L + VQ F+W++PP +DAFHMV+YELWLR ++Y
Sbjct: 301 IASLFRQVQDRFEWHTPPRPYDAFHMVVYELWLRHFTVQY 340
>gi|147784355|emb|CAN72733.1| hypothetical protein VITISV_033460 [Vitis vinifera]
Length = 321
Score = 489 bits (1260), Expect = e-136, Method: Compositional matrix adjust.
Identities = 230/294 (78%), Positives = 260/294 (88%), Gaps = 3/294 (1%)
Query: 119 FFQNIRESRFLTHDPDRAHLFFIPIS--CHKMRGKGTSYENMTVIVKDYVDSLINKYPYW 176
FF +I L + P +FF+ +S C +GTSYENMT+IV++YV SL++KYPYW
Sbjct: 29 FFLDIPPCNNLMNFPSLV-VFFLSLSNLCDATVLQGTSYENMTIIVQNYVQSLMSKYPYW 87
Query: 177 NRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFA 236
NRTLGADHFF+TCHD GVRATEGVP L+KN+IRVVCSPSYDV FIPHKDVALPQVLQPFA
Sbjct: 88 NRTLGADHFFLTCHDXGVRATEGVPLLVKNSIRVVCSPSYDVGFIPHKDVALPQVLQPFA 147
Query: 237 LPRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFY 296
LP GGRD++NRT LGFWAGHRNSKIRVILAR+WENDTELDI NNRINRA G LVYQ +FY
Sbjct: 148 LPTGGRDIKNRTTLGFWAGHRNSKIRVILARIWENDTELDIKNNRINRATGHLVYQNKFY 207
Query: 297 KTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQL 356
+TKFCICPGGSQVNSARIADSIHYGCVPVILS+YYDLPFNDILDWRKF+VIL ERDVY+L
Sbjct: 208 RTKFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDILDWRKFSVILKERDVYRL 267
Query: 357 KQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIKY 410
K ILK+I DAEF++LH+NLVKVQKHFQWN+PP+++DAFHMVMYELWLR HVIKY
Sbjct: 268 KYILKDIPDAEFIALHDNLVKVQKHFQWNTPPIKYDAFHMVMYELWLRHHVIKY 321
>gi|356546741|ref|XP_003541781.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 410
Score = 485 bits (1248), Expect = e-134, Method: Compositional matrix adjust.
Identities = 220/347 (63%), Positives = 279/347 (80%), Gaps = 4/347 (1%)
Query: 67 LGSDVFHSPS-VFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRE 125
S VFHSP F L+Y +M FK+++YPDGDP T++ TPRKLTGKYASEGYFF+NIRE
Sbjct: 65 FSSGVFHSPEEAFRLDYQKMEEEFKVFVYPDGDPETYFHTPRKLTGKYASEGYFFKNIRE 124
Query: 126 SRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHF 185
SRF T DP RAHLFF+PISCHKMRG+G + E M V+ YV+ L +YPYWNRTLGADHF
Sbjct: 125 SRFFTDDPRRAHLFFLPISCHKMRGRGLTNERMIDEVEKYVEHLKFEYPYWNRTLGADHF 184
Query: 186 FVTCHDVGVRATEGVPFLIKNAIRVVCSPSY-DVAFIPHKDVALPQVLQPFALPRGGRDV 244
FVTCHD+GV+AT+GVP ++KN+IRV+CS Y D +IPHKDV LPQV PF P GG D+
Sbjct: 185 FVTCHDIGVKATKGVPHMMKNSIRVICSSRYDDDGYIPHKDVTLPQVQLPFFHPPGGNDI 244
Query: 245 ENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRIN-RAIGPLVYQRRFYKTKFCIC 303
+NR L FWAG +S+++ L +W+NDTE+DI N+R++ RA GP+VY + YK+KFC+C
Sbjct: 245 KNRNTLAFWAGRSDSRLKEDLIAIWDNDTEIDIQNSRVDLRATGPVVYMEKLYKSKFCLC 304
Query: 304 PGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNI 363
P G + S+RIADSIH+GCVPVI+S YYDLPFNDILDW +F+++L E DVYQLK L++I
Sbjct: 305 PHGP-IGSSRIADSIHFGCVPVIMSKYYDLPFNDILDWTQFSIVLKETDVYQLKYTLRSI 363
Query: 364 SDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIKY 410
S+ F++L++N+VK+QKHF+WN+PPVR DAFHMVMYELW RRH+I+Y
Sbjct: 364 SEKHFITLNHNIVKIQKHFKWNTPPVRQDAFHMVMYELWRRRHLIRY 410
>gi|356557595|ref|XP_003547101.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
At5g03795-like [Glycine max]
Length = 412
Score = 477 bits (1228), Expect = e-132, Method: Compositional matrix adjust.
Identities = 232/402 (57%), Positives = 289/402 (71%), Gaps = 15/402 (3%)
Query: 12 AQTTLCSLRSSLLTLALVTLLCFTCLSFNSFRSYPLQNNFNSTLSFAINKQDYSDLGSDV 71
A L R + LV + +T + SY + TL F+ S V
Sbjct: 23 AVAVLSDTRVCIAGYVLVLFIIYT-KQHSDLPSYSDSDAHRRTLPFS----------SGV 71
Query: 72 FHSPS-VFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLT 130
FHSP F +Y +M FKI++YPDGDP T++ TPRKLTGKYASEGYFF+NIRESRF T
Sbjct: 72 FHSPEEAFRFDYEKMEEEFKIFVYPDGDPETYFHTPRKLTGKYASEGYFFKNIRESRFFT 131
Query: 131 HDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCH 190
DP RAHLFF+PISCHKMRG+G + E M V+ YV+ L KYPYWNRTLGADHFFVTCH
Sbjct: 132 DDPRRAHLFFLPISCHKMRGRGLTIERMIDEVEKYVEHLKLKYPYWNRTLGADHFFVTCH 191
Query: 191 DVGVRATEGVPFLIKNAIRVVCSPSY-DVAFIPHKDVALPQVLQPFALPRGGRDVENRTV 249
D+GV+AT+GVP L KN+IRV CS SY D ++PHKDV LPQV PF P G D++NR
Sbjct: 192 DIGVKATKGVPHLTKNSIRVACSSSYDDDDYVPHKDVTLPQVQLPFFHPPGENDIKNRNT 251
Query: 250 LGFWAGHRNSKIRVILARVWENDTELDISNNRIN-RAIGPLVYQRRFYKTKFCICPGGSQ 308
FWAG +S+++ L +W+NDTELDI N R++ RA GP+VY + YK+KFC+CP G
Sbjct: 252 FAFWAGRSDSRLKDDLMAMWDNDTELDIQNXRVDLRATGPVVYMEKLYKSKFCLCPHGPV 311
Query: 309 VNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEF 368
NS IADSIH+GCVPVI+ NYYDLPFNDILDW +F+V+L E ++Y LK IL++IS+ F
Sbjct: 312 GNSL-IADSIHFGCVPVIMPNYYDLPFNDILDWSQFSVVLKETNIYLLKDILRSISEKHF 370
Query: 369 VSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIKY 410
+SL+ N+VK+QKHF+WN+PPVR DAFHMVMYE+WLRRH+I+Y
Sbjct: 371 ISLNRNIVKIQKHFKWNTPPVRQDAFHMVMYEIWLRRHLIRY 412
>gi|413926326|gb|AFW66258.1| hypothetical protein ZEAMMB73_148778 [Zea mays]
Length = 267
Score = 470 bits (1209), Expect = e-130, Method: Compositional matrix adjust.
Identities = 218/266 (81%), Positives = 243/266 (91%)
Query: 145 CHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLI 204
C + +GTSYENMTVIVKDYV+ LINKYPYWNRTLGADHFFVTCHDVGVRA EG+ F++
Sbjct: 2 CVVLTLQGTSYENMTVIVKDYVEGLINKYPYWNRTLGADHFFVTCHDVGVRAFEGLKFMV 61
Query: 205 KNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVENRTVLGFWAGHRNSKIRVI 264
KN+IRVVCSPSY+V FIPHKD+ALPQVLQPFAL GG D++NR +LGFWAGHRNSKIRVI
Sbjct: 62 KNSIRVVCSPSYNVDFIPHKDIALPQVLQPFALHEGGNDIDNRVILGFWAGHRNSKIRVI 121
Query: 265 LARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVP 324
LARVWENDTEL ISNNRI+RAIG LVYQ++FY+TKFCICPGGSQVNSARI+DSIHYGCVP
Sbjct: 122 LARVWENDTELAISNNRISRAIGELVYQKQFYRTKFCICPGGSQVNSARISDSIHYGCVP 181
Query: 325 VILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
VILS+YYDLPFND LDWRKFAVIL ERDVYQLK ILK+IS EF+SLH +LV+VQKHF W
Sbjct: 182 VILSDYYDLPFNDALDWRKFAVILRERDVYQLKNILKSISQEEFISLHKSLVQVQKHFVW 241
Query: 385 NSPPVRFDAFHMVMYELWLRRHVIKY 410
+SPPV +DAFHMVMYELWLR +VIKY
Sbjct: 242 HSPPVSYDAFHMVMYELWLRHNVIKY 267
>gi|302823230|ref|XP_002993269.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300138939|gb|EFJ05690.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 353
Score = 413 bits (1062), Expect = e-113, Method: Compositional matrix adjust.
Identities = 186/350 (53%), Positives = 256/350 (73%), Gaps = 21/350 (6%)
Query: 65 SDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR 124
++LG+++FH VF +Y +M K+++YPD P + KL GKYASEGYFF+N+
Sbjct: 21 AELGTEIFHVADVFQEDYEQMEQQLKVFVYPD--PVVY----TKLAGKYASEGYFFRNLM 74
Query: 125 ESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADH 184
ESRF+T DP++A LFF+PISC ++R +G ++ ++ V +V+S+I K+PYWNRT+GADH
Sbjct: 75 ESRFVTTDPEKAQLFFVPISCARLREEGLDHDEISDNVASFVESVIAKFPYWNRTMGADH 134
Query: 185 FFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDV 244
FFVTCH++G RAT V L+KN+IRVVC+ SY FIPHKDVALPQ+LQPF PRGG D
Sbjct: 135 FFVTCHEIGTRATSKVAHLVKNSIRVVCASSYSGPFIPHKDVALPQILQPFPSPRGGDDT 194
Query: 245 ENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICP 304
E R LGFWAG NSK R++L + W+ D+++ IS + ++FY++KFCICP
Sbjct: 195 EKRETLGFWAGPANSKTRILLTKTWQEDSDMVISTKHVG--------MQQFYRSKFCICP 246
Query: 305 GGSQVNSARIADSIHYGCVP----VILSNYYDLPFNDILDWRKFAVILNERDVYQLKQIL 360
G++V++ARI +SIH+GCVP +ILS++YDLPFND+LDWRKFAVIL E+D LK L
Sbjct: 247 SGTRVSTARIVESIHFGCVPGKSFLILSDHYDLPFNDVLDWRKFAVILPEQDAGTLKDAL 306
Query: 361 KNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIKY 410
+ A + +LH NL++ Q HF+W+SPP+++D FHMVMYELWLR ++Y
Sbjct: 307 EL---APYATLHRNLLQAQAHFEWHSPPIKYDTFHMVMYELWLRHSTVRY 353
>gi|302824287|ref|XP_002993788.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300138384|gb|EFJ05154.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 353
Score = 412 bits (1058), Expect = e-112, Method: Compositional matrix adjust.
Identities = 186/350 (53%), Positives = 255/350 (72%), Gaps = 21/350 (6%)
Query: 65 SDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR 124
++LG+++FH VF +Y +M K+++YPD P + KL GKYASEGYFF+N+
Sbjct: 21 AELGTEIFHVADVFQEDYEQMEQQLKVFVYPD--PVVY----TKLAGKYASEGYFFRNLM 74
Query: 125 ESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADH 184
ESRF+T DP+ A LFF+PISC +++ +G ++ ++ V +V+S+I K+PYWNRT+GADH
Sbjct: 75 ESRFVTTDPEEAQLFFVPISCARLKEEGLDHDEISDNVASFVESVIAKFPYWNRTMGADH 134
Query: 185 FFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDV 244
FFVTCH++G RAT V L+KN+IRVVC+ SY FIPHKDVALPQ+LQPF PRGG D
Sbjct: 135 FFVTCHEIGTRATSKVAHLVKNSIRVVCASSYSGPFIPHKDVALPQILQPFPSPRGGDDT 194
Query: 245 ENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICP 304
E R LGFWAG NSK R++L + W+ D+++ IS + ++FY++KFCICP
Sbjct: 195 EKRETLGFWAGPANSKTRILLTKTWQEDSDMVISTKHVG--------MQQFYRSKFCICP 246
Query: 305 GGSQVNSARIADSIHYGCVP----VILSNYYDLPFNDILDWRKFAVILNERDVYQLKQIL 360
G++V++ARI +SIH+GCVP +ILS++YDLPFND+LDWRKFAVIL E+D LK L
Sbjct: 247 SGTRVSTARIVESIHFGCVPGKSYLILSDHYDLPFNDVLDWRKFAVILPEQDAGTLKDAL 306
Query: 361 KNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIKY 410
+ A + +LH NL++ Q HF+W+SPP+++D FHMVMYELWLR I+Y
Sbjct: 307 EL---APYATLHRNLLQAQAHFEWHSPPIKYDTFHMVMYELWLRHSTIRY 353
>gi|297725119|ref|NP_001174923.1| Os06g0638350 [Oryza sativa Japonica Group]
gi|255677258|dbj|BAH93651.1| Os06g0638350 [Oryza sativa Japonica Group]
Length = 257
Score = 375 bits (963), Expect = e-101, Method: Compositional matrix adjust.
Identities = 179/256 (69%), Positives = 210/256 (82%), Gaps = 16/256 (6%)
Query: 85 MVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPIS 144
M +F++++YPDGDP TFYQTPRKLTGKYASEGYFFQNIRESRF T D ++AHLFF+PIS
Sbjct: 1 MERSFRVFVYPDGDPGTFYQTPRKLTGKYASEGYFFQNIRESRFRTDDLEKAHLFFVPIS 60
Query: 145 CHKMRGKGTS------YENMTVIVKDYVDSLINKYPYW----------NRTLGADHFFVT 188
HKMRGK S Y + + ++ Y S++ YW + +GADHFFVT
Sbjct: 61 PHKMRGKVPSSLLLVTYAWLILHIRSYDRSILFLDLYWWCPLCSSFRGHWGVGADHFFVT 120
Query: 189 CHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVENRT 248
CHDVGVRA EG+PF+IKN+IRVVCSPSY+ +IPHKDVALPQ+LQPFALP GG D+ENRT
Sbjct: 121 CHDVGVRAFEGLPFIIKNSIRVVCSPSYNAGYIPHKDVALPQILQPFALPAGGNDIENRT 180
Query: 249 VLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQ 308
+LGFWAGHRNSKIRVILAR+WENDTEL ISNNRINRAIG LVYQ+ F++TKFC+CPGGSQ
Sbjct: 181 ILGFWAGHRNSKIRVILARIWENDTELAISNNRINRAIGNLVYQKHFFRTKFCVCPGGSQ 240
Query: 309 VNSARIADSIHYGCVP 324
VNSARI+DSIHYGC+P
Sbjct: 241 VNSARISDSIHYGCMP 256
>gi|218198616|gb|EEC81043.1| hypothetical protein OsI_23835 [Oryza sativa Indica Group]
Length = 250
Score = 364 bits (934), Expect = 5e-98, Method: Compositional matrix adjust.
Identities = 180/285 (63%), Positives = 211/285 (74%), Gaps = 50/285 (17%)
Query: 85 MVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPIS 144
M +F++++YPDGDP TFYQTPRKLTGKYASEGYFFQNIRESRF T D ++AHLFF+PIS
Sbjct: 1 MERSFRVFVYPDGDPGTFYQTPRKLTGKYASEGYFFQNIRESRFRTDDLEQAHLFFVPIS 60
Query: 145 CHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLI 204
HKMRGKGTSYENMT+IV++YV+SLINKYPYWNRTLG +H ++G P +
Sbjct: 61 PHKMRGKGTSYENMTIIVQNYVESLINKYPYWNRTLG-EH------------SKGFPLSL 107
Query: 205 KNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVENRTVLGFWAGHRNSKIRVI 264
R + T+LGFWAGH NSKIRVI
Sbjct: 108 -------------------------------------RTLSEWTILGFWAGHCNSKIRVI 130
Query: 265 LARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVP 324
LAR+WENDTEL ISNNRINRAIG LVYQ++F+ TKFC+CPGGSQVNSARI+DSIHYGCVP
Sbjct: 131 LARIWENDTELAISNNRINRAIGNLVYQKQFFWTKFCVCPGGSQVNSARISDSIHYGCVP 190
Query: 325 VILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFV 369
VILS+YYDLPF+ IL+WRKFAV+L E DVY+LK ILK++S EFV
Sbjct: 191 VILSDYYDLPFSGILNWRKFAVVLKESDVYELKSILKSLSQKEFV 235
>gi|224133768|ref|XP_002321656.1| predicted protein [Populus trichocarpa]
gi|222868652|gb|EEF05783.1| predicted protein [Populus trichocarpa]
Length = 407
Score = 354 bits (908), Expect = 5e-95, Method: Compositional matrix adjust.
Identities = 184/406 (45%), Positives = 250/406 (61%), Gaps = 29/406 (7%)
Query: 27 ALVTLLCFTCLSFNSFRSYPLQNNFNSTLSFAINKQDYSDLGSDVFHSPSVFHLNYLEMV 86
ALV L + PL + STL + + + + V HS F LNY M
Sbjct: 9 ALVIFLGIASILLQQRYWLPLSSKGKSTL-YGLQQST-----AGVHHSEEFFLLNYEAME 62
Query: 87 TNFKIYIYPDGDPNTFYQT-PRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISC 145
+ K+++YP G+P T Y + +KL YASE YFF N+R FLT +PD AHLFFIP+SC
Sbjct: 63 KDLKVFVYPGGNPKTCYHSIDKKLKSNYASEHYFFMNLRNGSFLTENPDEAHLFFIPLSC 122
Query: 146 HKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIK 205
M + +++++YV +L KYPYWNRTLGADHFFV+CH +G RAT PFL+K
Sbjct: 123 QPMEDQDALPRYKEMVIQNYVRALTIKYPYWNRTLGADHFFVSCHGIGNRATAAFPFLLK 182
Query: 206 NAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVEN------------------- 246
NAIR+VCSPSYD +IPHKDV+LPQ+L+ + P G + N
Sbjct: 183 NAIRLVCSPSYDSNYIPHKDVSLPQILE-LSFPPEGDGMWNDSTMESLPIQLSPVETHPS 241
Query: 247 RTVLGFWAGHRNSKIRVILARVWENDTELDIS-NNRINRAIGPLVYQRRFYKTKFCICP- 304
RT L FWAG NS++R L ++ E +I + RA+ +Q+ +++KFCICP
Sbjct: 242 RTKLCFWAGSPNSEVRKNLRVHYKGLEEFEIHFVENVKRALVLDTFQKEIHRSKFCICPR 301
Query: 305 GGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNIS 364
G +QV +A+S+ +GCVPVI+S+YYDLPFNDILDW F+VIL E DV + +ILK I
Sbjct: 302 GKTQVGGVCLAESMAFGCVPVIMSDYYDLPFNDILDWNAFSVILKEHDVPIMGEILKGIP 361
Query: 365 DAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIKY 410
+ F + N++KV K+F+W+ PV++D FHMVMYELW RRH+I+Y
Sbjct: 362 EDMFEKMRQNVLKVSKYFKWHFRPVKYDEFHMVMYELWKRRHIIRY 407
>gi|224093262|ref|XP_002309856.1| predicted protein [Populus trichocarpa]
gi|222852759|gb|EEE90306.1| predicted protein [Populus trichocarpa]
Length = 326
Score = 351 bits (900), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 169/328 (51%), Positives = 219/328 (66%), Gaps = 3/328 (0%)
Query: 85 MVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPIS 144
M FK+++Y D + P K +Y SE YFF N++ S FLT D AHLFFIPI
Sbjct: 1 MEREFKVFVYQDRNITKHCDLPSKHNSRYESEEYFFSNLKMSPFLTDDAAEAHLFFIPIF 60
Query: 145 CHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLI 204
KM K + E + V+D+V SLI+KYPYWNRTLGADHFFVTC D+ V AT + L+
Sbjct: 61 SQKMTKKRSEDER-AIAVEDFVKSLISKYPYWNRTLGADHFFVTCADINVTATARIANLM 119
Query: 205 KNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVENRTVLGFWAGHRNSKIRVI 264
KN+I+V+C+PSY+ ++PHKDV+LPQ + P AL G ++ NR L FW G NS IR
Sbjct: 120 KNSIKVMCTPSYNDEYVPHKDVSLPQRVPPLALTPAGNNITNRITLAFWRGLNNSDIRQK 179
Query: 265 LARVWENDTELDISNNRI-NRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCV 323
L WEND EL I R + G LV+ F +K+CICPGG +++ IA +IHYGCV
Sbjct: 180 LLEAWENDLELFIQKGRKPSLEQGDLVHHEAFNNSKYCICPGGPELDRT-IALAIHYGCV 238
Query: 324 PVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQ 383
PVI+S+YYDLPF DILDWRKF++IL E VY L++ LK + + E+ ++ N V V+KHFQ
Sbjct: 239 PVIMSDYYDLPFKDILDWRKFSIILEESQVYYLREHLKEMLEHEYRAMQTNTVMVRKHFQ 298
Query: 384 WNSPPVRFDAFHMVMYELWLRRHVIKYF 411
WN P ++DAFHM MY+LWLR H KY+
Sbjct: 299 WNLVPAKYDAFHMTMYDLWLRNHFTKYY 326
>gi|449531319|ref|XP_004172634.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
sativus]
Length = 307
Score = 348 bits (892), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 178/231 (77%), Positives = 199/231 (86%), Gaps = 2/231 (0%)
Query: 16 LCSLRSSLLTLALVTLLCFTCLSFNSFRSYPLQNNFNSTLSF-AINKQDYSDLGSDVFHS 74
LCSLR+SLLTLA++TLL FT LSF S S P ++ + A+N +++ SDV+HS
Sbjct: 78 LCSLRASLLTLAVLTLLSFTYLSFTSLHSSPPSSSSQLPVKLGALNDAADAEI-SDVYHS 136
Query: 75 PSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPD 134
P VF LNY +M + FK+YIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRF T DPD
Sbjct: 137 PQVFRLNYADMESKFKVYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFRTEDPD 196
Query: 135 RAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV 194
+AHLFFIPISCHKMRGKGTSYENMTVIV++YV+ LI+KYPYWNRTLGADHFFVTCHDVGV
Sbjct: 197 QAHLFFIPISCHKMRGKGTSYENMTVIVQNYVEGLISKYPYWNRTLGADHFFVTCHDVGV 256
Query: 195 RATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVE 245
RA+EG+PFLIKNAIRVVCSPSYDV FIPHKDVALPQVLQPFALP GG D E
Sbjct: 257 RASEGLPFLIKNAIRVVCSPSYDVGFIPHKDVALPQVLQPFALPAGGNDTE 307
>gi|224093268|ref|XP_002309859.1| predicted protein [Populus trichocarpa]
gi|222852762|gb|EEE90309.1| predicted protein [Populus trichocarpa]
Length = 420
Score = 331 bits (848), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 180/423 (42%), Positives = 244/423 (57%), Gaps = 31/423 (7%)
Query: 10 NQAQTTLCSLRSSLLTLALVTL-LCFTCLSFNSFRSYPLQNNFNSTLSFAINKQDYSDLG 68
N + + R ++ + VTL L F + PL +S + + +G
Sbjct: 7 NPRPINIWTARRNIKKIIPVTLGLVFIVTQIIQLQRSPLFREKDSLPDEVFRTRKGNSIG 66
Query: 69 SDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRF 128
S ++H F NY M +FK+++YP DP T Y KL KYASE YF +N+ S F
Sbjct: 67 S-IYHREDFFLPNYATMENDFKVFVYPGRDPTTCYDPRDKLKRKYASEHYFLKNLIPSSF 125
Query: 129 LTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVT 188
T DP AHLF IP+SC K G+ +++YV SLI+ YPYWNRTLGADHF+ +
Sbjct: 126 FTDDPTVAHLFLIPLSCKKTGGREEK------DIENYVKSLISSYPYWNRTLGADHFYFS 179
Query: 189 CHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDV---- 244
CH + E VP L+KN IR+VCSPSYD +IPHKD++LPQ L+ +L G DV
Sbjct: 180 CHGIDSGTIEEVPLLMKNVIRLVCSPSYDSKYIPHKDISLPQTLE-LSL-HDGDDVWSRS 237
Query: 245 ---------------ENRTVLGFWAGHRNSKIRVILARVWENDTELDISN-NRINRAIGP 288
RT LGFWAG NS +R L ++ E + +++ +A
Sbjct: 238 TVMSRPLMIYPEMMLPRRTKLGFWAGSLNSDVRKNLQVFYKGAPEFNFHFFDKMKKAAIL 297
Query: 289 LVYQRRFYKTKFCICP-GGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVI 347
Y+ Y +KFCICP G + V S + +S+ +GCVPVIL +YYD PFND+LDW F+VI
Sbjct: 298 DAYENELYGSKFCICPRGNNHVGSVCLTESMTFGCVPVILHDYYDFPFNDVLDWNNFSVI 357
Query: 348 LNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHV 407
L E V L++ILK I + + +H NL++V+KHFQWNS PV++D F M+MYELWLRRH+
Sbjct: 358 LKEEHVPDLEKILKGIPEENYKKMHQNLLQVRKHFQWNSLPVKYDLFRMIMYELWLRRHI 417
Query: 408 IKY 410
IKY
Sbjct: 418 IKY 420
>gi|449528899|ref|XP_004171439.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
At3g07620-like, partial [Cucumis sativus]
Length = 173
Score = 309 bits (792), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 142/171 (83%), Positives = 158/171 (92%)
Query: 240 GGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTK 299
G ++ RT LGFWAGHRNSKIRVILARVWENDTELDISNNRI+RA G L+YQ+RFYKTK
Sbjct: 3 GDVNIIYRTTLGFWAGHRNSKIRVILARVWENDTELDISNNRISRATGHLLYQKRFYKTK 62
Query: 300 FCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQI 359
FCICPGGSQVNSARIADSIHYGCVPVILS+YYDLPFNDILDWRKF+VI+ ERDVYQLKQI
Sbjct: 63 FCICPGGSQVNSARIADSIHYGCVPVILSDYYDLPFNDILDWRKFSVIVKERDVYQLKQI 122
Query: 360 LKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIKY 410
LK+ISD EF+ LH NL++VQKHFQWNS P+++DAFHMVMY+LWLR HVIKY
Sbjct: 123 LKDISDIEFIKLHKNLMQVQKHFQWNSXPIKYDAFHMVMYDLWLRHHVIKY 173
>gi|357508695|ref|XP_003624636.1| Exostosin-like protein [Medicago truncatula]
gi|87162615|gb|ABD28410.1| Exostosin-like [Medicago truncatula]
gi|116831751|gb|ABK28848.1| exostosin-like protein [Medicago truncatula]
gi|355499651|gb|AES80854.1| Exostosin-like protein [Medicago truncatula]
Length = 486
Score = 301 bits (771), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 158/351 (45%), Positives = 221/351 (62%), Gaps = 21/351 (5%)
Query: 69 SDVFHSPSVFHLNYLEMVTNFKIYIYP--DGDP--NTFYQTPRKLTGKYASEGYFFQNIR 124
++VFH +F +Y EM +FKIY+YP DP N + +G YASE YF + +
Sbjct: 139 NNVFHDRDIFLEDYKEMNRSFKIYVYPHKKDDPFANVLLPVKTEPSGNYASESYFKKALM 198
Query: 125 ESRFLTHDPDRAHLFFIPISCHKMR-GKGTSYENMTVIVKDYVDSLINKYPYWNRTLGAD 183
+S F+T DP +A LFF+P S +R + + ++DYV ++I+KYPYWNRT GAD
Sbjct: 199 KSHFITKDPTKADLFFMPFSIASLRHDRRVGVGGIQDFIRDYVQNMIHKYPYWNRTNGAD 258
Query: 184 HFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVA-FIPHKDVALPQVLQPFALPRGGR 242
HF+V CH +G A + P + NAI+VVCS SY ++ +I HKD LPQ+ PR
Sbjct: 259 HFYVACHSIGRSAMDKAPDVKFNAIQVVCSSSYFLSGYIAHKDACLPQIW-----PRNEN 313
Query: 243 D----VENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKT 298
NR L F+AG NS +R+ L W+NDTE+ + N R+ G +
Sbjct: 314 PPNLVSSNRKKLAFFAGEVNSPVRINLVETWKNDTEIFVHNGRLKTPYG-----DELLGS 368
Query: 299 KFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQ 358
KFC G +VN+ARI DS++YGCVPVI++NYYDLPF D+L+W+ F+V++ D+ LK+
Sbjct: 369 KFCFHVRGYEVNTARIGDSLYYGCVPVIIANYYDLPFADVLNWKSFSVVVTTLDIPLLKK 428
Query: 359 ILKNI-SDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVI 408
ILK I + E++ L N++KV++HFQW+SPP+ FDAF+MVMYELWLRR I
Sbjct: 429 ILKGIVNSGEYLMLQKNVLKVREHFQWHSPPIDFDAFYMVMYELWLRRSSI 479
>gi|302780894|ref|XP_002972221.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300159688|gb|EFJ26307.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 376
Score = 295 bits (756), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 144/345 (41%), Positives = 216/345 (62%), Gaps = 11/345 (3%)
Query: 72 FHSPSVFHLNYLEMVTNFKIYIYP----DGDPNTFYQTPRKLTGKYASEGYFFQNIRESR 127
+HS F +Y +MV K+++YP D + F T +G YASE YF + + ES
Sbjct: 34 YHSRRAFLDDYRDMVNTMKVFVYPCSPRDPFSHIFLPTSSAPSGNYASEAYFKKALAESG 93
Query: 128 FLTHDPDRAHLFFIPISCHKMRGK-GTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFF 186
+T DP +A LFF+P S ++R M V+DYV ++ +++PYWNRT G+DHF+
Sbjct: 94 MVTDDPSQADLFFMPFSITRLRNDPKVGVGRMPAFVRDYVKNISHRWPYWNRTGGSDHFY 153
Query: 187 VTCHDVGVRATEGVPFLIKNAIRVVCSPSYDV-AFIPHKDVALPQVLQPFALPRGGRDVE 245
V CH +G A E + NAI+VVCS +Y V FIPHKDVA+PQ+ R + +E
Sbjct: 154 VACHSIGKVALEKAQHVRLNAIQVVCSSNYYVQGFIPHKDVAMPQIWPRSESFREIKTIE 213
Query: 246 NRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPG 305
R VL F+AG NS +R + R W NDT++ +RI + Y ++KFC+
Sbjct: 214 QRKVLAFFAGGSNSPVRANVVRTWRNDTQIHAYPSRIQGS-----YAEALLRSKFCLHVK 268
Query: 306 GSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISD 365
G +VN+AR+ D+ YGCVPV+++NYYDLPF+ +L+W+ F+V++ ++ +LK IL IS
Sbjct: 269 GYEVNTARLGDAFFYGCVPVVIANYYDLPFSSVLNWKSFSVVVTTANIPKLKAILSGISR 328
Query: 366 AEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIKY 410
++ +H ++ ++HFQW++PP +DAF+MVMY+LWLRRHV++Y
Sbjct: 329 EDYSQMHRLVLDARRHFQWHAPPREYDAFYMVMYQLWLRRHVVRY 373
>gi|302804767|ref|XP_002984135.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300147984|gb|EFJ14645.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 376
Score = 291 bits (744), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 142/345 (41%), Positives = 215/345 (62%), Gaps = 11/345 (3%)
Query: 72 FHSPSVFHLNYLEMVTNFKIYIYP----DGDPNTFYQTPRKLTGKYASEGYFFQNIRESR 127
+HS F +Y +MV K+++YP D + F T +G YASE YF + + S
Sbjct: 34 YHSRRAFLDDYRDMVNTMKVFVYPCSPRDPFSHIFLPTSSAPSGNYASEAYFKKALAGSG 93
Query: 128 FLTHDPDRAHLFFIPISCHKMRGK-GTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFF 186
+T DP +A LFF+P S ++R M V+DYV ++ +++PYWNRT G+DHF+
Sbjct: 94 MVTDDPSQADLFFMPFSITRLRNDPKVGVGRMPAFVRDYVKNISHRWPYWNRTGGSDHFY 153
Query: 187 VTCHDVGVRATEGVPFLIKNAIRVVCSPSYDV-AFIPHKDVALPQVLQPFALPRGGRDVE 245
V CH +G A E + NAI+VVCS +Y V FIPHKDVA+PQ+ R + +E
Sbjct: 154 VACHSIGKVALEKAQHVKLNAIQVVCSSNYYVQGFIPHKDVAIPQIWPRSESFREIKTIE 213
Query: 246 NRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPG 305
R VL F+AG NS +R + R W NDT++ +RI + Y ++KFC+
Sbjct: 214 QRKVLAFFAGGSNSPVRANVVRTWRNDTQIHAYPSRIQGS-----YAEALLRSKFCLHVK 268
Query: 306 GSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISD 365
G +VN+AR+ D+ YGCVPV+++N+YDLPF+ +L+W+ F+V++ ++ +LK IL IS
Sbjct: 269 GYEVNTARLGDAFFYGCVPVVIANHYDLPFSSVLNWKSFSVVVTTANIPKLKAILSGISR 328
Query: 366 AEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIKY 410
++ +H ++ ++HFQW++PP +DAF+MVMY+LWLRRHV++Y
Sbjct: 329 EDYSQMHRLVLDARRHFQWHAPPREYDAFYMVMYQLWLRRHVVRY 373
>gi|222635945|gb|EEE66077.1| hypothetical protein OsJ_22095 [Oryza sativa Japonica Group]
Length = 265
Score = 290 bits (741), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 131/175 (74%), Positives = 153/175 (87%)
Query: 236 ALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRF 295
P R + T+LGFWAGHRNSKIRVILAR+WENDTEL ISNNRINRAIG LVYQ+ F
Sbjct: 91 GFPLSLRTLSEWTILGFWAGHRNSKIRVILARIWENDTELAISNNRINRAIGNLVYQKHF 150
Query: 296 YKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ 355
++TKFC+CPGGSQVNSARI+DSIHYGC+PVILS+YYDL F+ IL+WRKFAV+L E DVY+
Sbjct: 151 FRTKFCVCPGGSQVNSARISDSIHYGCMPVILSDYYDLRFSGILNWRKFAVVLKESDVYE 210
Query: 356 LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIKY 410
LK ILK++S EFVSLH +LV+VQKHF+W+SPPV +DAFHM+MYELWLR HVIKY
Sbjct: 211 LKSILKSLSQKEFVSLHKSLVQVQKHFEWHSPPVPYDAFHMIMYELWLRHHVIKY 265
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 54/76 (71%), Positives = 64/76 (84%)
Query: 85 MVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPIS 144
M +F++++YPDGDP TFYQTPRKLTGKYASEGYFFQNIRESRF T D ++AHLFF+PIS
Sbjct: 1 MERSFRVFVYPDGDPGTFYQTPRKLTGKYASEGYFFQNIRESRFRTDDLEKAHLFFVPIS 60
Query: 145 CHKMRGKGTSYENMTV 160
HKMRGK T +T+
Sbjct: 61 PHKMRGKDTGGWGLTI 76
>gi|449527873|ref|XP_004170933.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
sativus]
Length = 466
Score = 287 bits (735), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 148/350 (42%), Positives = 214/350 (61%), Gaps = 12/350 (3%)
Query: 66 DLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQT--PRKLT--GKYASEGYFFQ 121
+ VFH +F +Y EM +FKIY+YP + F ++ P G YASE YF +
Sbjct: 118 EQAEGVFHDEELFLEDYKEMNKSFKIYVYPHKRSDPFARSLLPENFEPHGNYASESYFKK 177
Query: 122 NIRESRFLTHDPDRAHLFFIPISCHKMRG-KGTSYENMTVIVKDYVDSLINKYPYWNRTL 180
++ +S F+T+DP A FF+P S +R + S + ++DY+ + +KYPYWNRT
Sbjct: 178 SLIKSHFITNDPKEADFFFLPFSITGLRNDRRVSVSGIPNFIRDYIFDVSHKYPYWNRTG 237
Query: 181 GADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDV-AFIPHKDVALPQVLQPFALPR 239
GADHF+V CH VG A + + ++VVCS SY + +I HKD ALPQ+ P
Sbjct: 238 GADHFYVACHSVGRSAMDKSSEAKSSIVQVVCSSSYFLTGYISHKDAALPQIW-PRKEDP 296
Query: 240 GGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTK 299
RT L F+AG NS R L +VW D+E+ + R+ Y ++K
Sbjct: 297 SNLASSKRTRLAFFAGAMNSPTRQALVQVWGKDSEIFAYSGRLKTP-----YADELLRSK 351
Query: 300 FCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQI 359
FC+ G +VN+AR+ DSI YGCVPVI++NYYDLPF DIL+W+ F++++ D+ +LK+I
Sbjct: 352 FCLHVKGFEVNTARVGDSIFYGCVPVIIANYYDLPFGDILNWKSFSIVVTTSDIPRLKEI 411
Query: 360 LKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIK 409
LK I+D E+ L +N++KV+KHF+W+S PV +D FHMVMY+LWLRR ++
Sbjct: 412 LKGINDEEYARLQSNVLKVRKHFKWHSSPVDYDTFHMVMYQLWLRRTSVR 461
>gi|356571729|ref|XP_003554026.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 487
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 153/352 (43%), Positives = 216/352 (61%), Gaps = 21/352 (5%)
Query: 69 SDVFHSPSVFHLNYLEMVTNFKIYIYP--DGDP--NTFYQTPRKLTGKYASEGYFFQNIR 124
+DVFH VF +Y EM + KIY+YP + DP N + G Y SE YF + +
Sbjct: 137 NDVFHDRDVFLEDYKEMNRSLKIYVYPHREDDPFANVLLPVESEPGGNYTSESYFKKVLM 196
Query: 125 ESRFLTHDPDRAHLFFIPISCHKM-RGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGAD 183
+S F+T DP A LFF+P S ++ + + ++DY+ ++ ++YPYWN T GAD
Sbjct: 197 KSHFITKDPPEADLFFLPFSMARLWHDRRVGVGGIQDFIRDYIHNISHRYPYWNNTGGAD 256
Query: 184 HFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDV-AFIPHKDVALPQVLQPFALPRGGR 242
HF+V CH +G A + P NAI+VVCS SY + + HKD LPQ+ PR G
Sbjct: 257 HFYVACHSIGRSAMDKAPDEKFNAIQVVCSSSYFLTGYFAHKDACLPQIW-----PRKGN 311
Query: 243 D----VENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKT 298
R L F+AG NS +RV L W+ND+E+ + + R+ Y +
Sbjct: 312 PPNLVSSKRKRLAFFAGGVNSPVRVKLLETWKNDSEIFVHHGRLKTP-----YADELLGS 366
Query: 299 KFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQ 358
KFC+ G +VN+ARI DS++YGCVPVI++NYYDLPF D+L+W+ F+V++ D+ LK+
Sbjct: 367 KFCLHVKGFEVNTARIGDSLYYGCVPVIIANYYDLPFADVLNWKSFSVVVTTLDIPLLKK 426
Query: 359 ILKN-ISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIK 409
ILK+ IS +++ L +N++KV+KHFQW+SPP FDAF+MVMYELWLRR IK
Sbjct: 427 ILKDIISSNKYLMLQSNVLKVRKHFQWHSPPQDFDAFYMVMYELWLRRSSIK 478
>gi|449446746|ref|XP_004141132.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
sativus]
Length = 466
Score = 284 bits (727), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 147/350 (42%), Positives = 213/350 (60%), Gaps = 12/350 (3%)
Query: 66 DLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQT--PRKLT--GKYASEGYFFQ 121
+ VFH +F +Y EM +FKIY+YP + F ++ P G YASE YF +
Sbjct: 118 EQAEGVFHDEELFLEDYKEMNKSFKIYVYPHKRSDPFARSLLPENFEPHGNYASESYFKK 177
Query: 122 NIRESRFLTHDPDRAHLFFIPISCHKMRG-KGTSYENMTVIVKDYVDSLINKYPYWNRTL 180
++ +S F+T+DP A F +P S +R + S + ++DY+ + +KYPYWNRT
Sbjct: 178 SLIKSHFITNDPKEADFFSLPFSITGLRNDRRVSVSGIPNFIRDYIFDVSHKYPYWNRTG 237
Query: 181 GADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDV-AFIPHKDVALPQVLQPFALPR 239
GADHF+V CH VG A + + ++VVCS SY + +I HKD ALPQ+ P
Sbjct: 238 GADHFYVACHSVGRSAMDKSSEAKSSIVQVVCSSSYFLTGYISHKDAALPQIW-PRKEDP 296
Query: 240 GGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTK 299
RT L F+AG NS R L +VW D+E+ + R+ Y ++K
Sbjct: 297 SNLASSKRTRLAFFAGAMNSPTRQALVQVWGKDSEIFAYSGRLKTP-----YADELLRSK 351
Query: 300 FCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQI 359
FC+ G +VN+AR+ DSI YGCVPVI++NYYDLPF DIL+W+ F++++ D+ +LK+I
Sbjct: 352 FCLHVKGFEVNTARVGDSIFYGCVPVIIANYYDLPFGDILNWKSFSIVVTTSDIPRLKEI 411
Query: 360 LKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIK 409
LK I+D E+ L +N++KV+KHF+W+S PV +D FHMVMY+LWLRR ++
Sbjct: 412 LKGINDEEYARLQSNVLKVRKHFKWHSSPVDYDTFHMVMYQLWLRRTSVR 461
>gi|255569522|ref|XP_002525728.1| catalytic, putative [Ricinus communis]
gi|223535028|gb|EEF36711.1| catalytic, putative [Ricinus communis]
Length = 336
Score = 282 bits (722), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 150/335 (44%), Positives = 209/335 (62%), Gaps = 20/335 (5%)
Query: 85 MVTNFKIYIYP--DGDP--NTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFF 140
M +FKIY+YP DP N + G YASE YF + + +S F+T DP +A LFF
Sbjct: 1 MNRSFKIYVYPHRQNDPFANVLLPVDFEPGGNYASESYFKKVLMKSHFITKDPTKADLFF 60
Query: 141 IPISCHKMR-GKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEG 199
+P S ++R E + ++ YV ++ KYPYWNRT G DHF+V CH +G A E
Sbjct: 61 LPFSIARLRHDPRIGVEGIQDFIRAYVYNISQKYPYWNRTGGTDHFYVACHSIGRTAMEK 120
Query: 200 VPFLIKNAIRVVCSPSYDVA-FIPHKDVALPQVLQPFALPRGGR----DVENRTVLGFWA 254
+ NAI+VVCS SY ++ +I HKD +LPQV PR G R L F+A
Sbjct: 121 AEEVKFNAIQVVCSSSYYLSGYIAHKDASLPQVW-----PRQGDPPNLASSERQKLAFFA 175
Query: 255 GHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARI 314
G NS +R L +VW ND+E+ + R+N + Y +KFC+ G +VN+ARI
Sbjct: 176 GSINSPVRERLLQVWRNDSEIYVHYGRLNTS-----YADELLGSKFCLHVKGFEVNTARI 230
Query: 315 ADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNN 374
ADS++YGCVP+I++N+YDLPF DIL+W F+V++ D+ LK+IL+ +S +V L +N
Sbjct: 231 ADSLYYGCVPIIIANHYDLPFTDILNWESFSVVVATLDILYLKKILQGVSSDRYVMLQSN 290
Query: 375 LVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIK 409
++KV+KHFQW+ PPV +DAFHMVMYELWLRR ++
Sbjct: 291 VLKVRKHFQWHFPPVDYDAFHMVMYELWLRRSSVR 325
>gi|296089301|emb|CBI39073.3| unnamed protein product [Vitis vinifera]
Length = 467
Score = 280 bits (717), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 149/347 (42%), Positives = 214/347 (61%), Gaps = 20/347 (5%)
Query: 73 HSPSVFHLNYLEMVTNFKIYIYPD--GDP--NTFYQTPRKLTGKYASEGYFFQNIRESRF 128
H ++F NY EM +FKIY YP DP N + G YASE YF + + +S F
Sbjct: 108 HDRNLFVENYKEMNRSFKIYCYPHKRDDPFANALLPVDFEPGGNYASESYFKKVLMKSHF 167
Query: 129 LTHDPDRAHLFFIPISCHKMR-GKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFV 187
+T DP +A LFF+P S ++R + ++DY+ ++ YPYWN+T GADHF+V
Sbjct: 168 ITKDPSKADLFFLPFSIARLRHDPRVGVGGIQDFIRDYIFNISQNYPYWNQTGGADHFYV 227
Query: 188 TCHDVGVRATEGVPFLIKNAIRVVCSPSYDVA-FIPHKDVALPQVLQPFALPRGGR---- 242
CH +G A E + NAI+VVCS SY ++ +I HKD +LPQ+ PR G
Sbjct: 228 ACHSIGRSAMEKADEVKLNAIQVVCSSSYFLSGYIAHKDASLPQIW-----PRQGDPPDL 282
Query: 243 DVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCI 302
+ R L F+AG NS +R L +VW ND+E+ + R+ Y +KFC+
Sbjct: 283 ALSERKKLAFFAGSINSPVRERLLQVWRNDSEISVHFGRLTTP-----YADELLGSKFCL 337
Query: 303 CPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKN 362
G ++N+ARIADS++YGCVPVI++N+YDLPF DIL+W+ F++++ D+ LKQ+LK
Sbjct: 338 HVKGFEINTARIADSLYYGCVPVIIANHYDLPFADILNWKSFSIVVATLDIPLLKQVLKG 397
Query: 363 ISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIK 409
IS E++ L +N++KV+ HFQW+ PV +DAF+MVMYELWLRR ++
Sbjct: 398 ISLNEYLMLQSNVLKVRNHFQWHVSPVDYDAFYMVMYELWLRRSSVR 444
>gi|168059172|ref|XP_001781578.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666988|gb|EDQ53629.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 344
Score = 276 bits (705), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 132/345 (38%), Positives = 213/345 (61%), Gaps = 12/345 (3%)
Query: 72 FHSPSVFHLNYLEMVTNFKIYIYPDGDPNTF---YQTPRKL-TGKYASEGYFFQNIRESR 127
+HS +F +Y EM +I++YP + F +++ K+ +G YASE +F Q++ S
Sbjct: 1 YHSRRIFESDYAEMKRKLRIFVYPHDRKDPFHMIFESGNKVPSGNYASEEFFQQSLLTST 60
Query: 128 FLTHDPDRAHLFFIPISCHKMR-GKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFF 186
FLT A FF+P+S K R K + + +Y+ + +++ YWNR+ GADHF+
Sbjct: 61 FLTKTASEADFFFMPVSITKARMDKRINVGGLQSFCANYITDVRSQWSYWNRSNGADHFY 120
Query: 187 VTCHDVGVRATEGVPFLIKNAIRVVCSPSYDV-AFIPHKDVALPQVLQPFAL-PRGGRDV 244
++CH + A + VP + +NAI+++C SY + ++I HKD ++PQ+ P R +
Sbjct: 121 LSCHSIARNAMDRVPDVRQNAIQLLCPASYFLPSYITHKDASVPQIWPRLGKEPEEVRTI 180
Query: 245 ENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICP 304
R L F+AG NS +R L R W ND+++ + R+ P Y TKFC+
Sbjct: 181 TQRKRLAFFAGALNSPVRKDLERTWANDSKILVHKGRV-----PYPYSEALLTTKFCLHA 235
Query: 305 GGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNIS 364
G +VN+AR+ D+++YGCVPV+++NYYDLPF DILDW KF+++++ D+ LK+ L+ ++
Sbjct: 236 KGFEVNTARLGDAMYYGCVPVVIANYYDLPFQDILDWTKFSIVVSSLDIPLLKKTLEAVT 295
Query: 365 DAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIK 409
D ++ LH ++ +KHFQW++PP +DAFH VMYELW RRH+++
Sbjct: 296 DEQYAELHRQVLLARKHFQWHAPPEEYDAFHTVMYELWKRRHIVR 340
>gi|359481095|ref|XP_002265438.2| PREDICTED: probable glycosyltransferase At5g03795-like [Vitis
vinifera]
Length = 336
Score = 271 bits (694), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 144/335 (42%), Positives = 207/335 (61%), Gaps = 20/335 (5%)
Query: 85 MVTNFKIYIYPD--GDP--NTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFF 140
M +FKIY YP DP N + G YASE YF + + +S F+T DP +A LFF
Sbjct: 1 MNRSFKIYCYPHKRDDPFANALLPVDFEPGGNYASESYFKKVLMKSHFITKDPSKADLFF 60
Query: 141 IPISCHKMR-GKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEG 199
+P S ++R + ++DY+ ++ YPYWN+T GADHF+V CH +G A E
Sbjct: 61 LPFSIARLRHDPRVGVGGIQDFIRDYIFNISQNYPYWNQTGGADHFYVACHSIGRSAMEK 120
Query: 200 VPFLIKNAIRVVCSPSYDVA-FIPHKDVALPQVLQPFALPRGGRD----VENRTVLGFWA 254
+ NAI+VVCS SY ++ +I HKD +LPQ+ PR G + R L F+A
Sbjct: 121 ADEVKLNAIQVVCSSSYFLSGYIAHKDASLPQIW-----PRQGDPPDLALSERKKLAFFA 175
Query: 255 GHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARI 314
G NS +R L +VW ND+E+ + R+ Y +KFC+ G ++N+ARI
Sbjct: 176 GSINSPVRERLLQVWRNDSEISVHFGRLTTP-----YADELLGSKFCLHVKGFEINTARI 230
Query: 315 ADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNN 374
ADS++YGCVPVI++N+YDLPF DIL+W+ F++++ D+ LKQ+LK IS E++ L +N
Sbjct: 231 ADSLYYGCVPVIIANHYDLPFADILNWKSFSIVVATLDIPLLKQVLKGISLNEYLMLQSN 290
Query: 375 LVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIK 409
++KV+ HFQW+ PV +DAF+MVMYELWLRR ++
Sbjct: 291 VLKVRNHFQWHVSPVDYDAFYMVMYELWLRRSSVR 325
>gi|302790387|ref|XP_002976961.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300155439|gb|EFJ22071.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 408
Score = 271 bits (694), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 151/377 (40%), Positives = 229/377 (60%), Gaps = 30/377 (7%)
Query: 48 QNNFNSTLSFAINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPN---TFYQ 104
Q +F+S S +QD+S S V+H+ +F L+Y EMV +FK+Y+YP G+ + F
Sbjct: 44 QRHFSSRTS-RPQRQDHS---SKVYHNWELFSLDYEEMVKSFKVYVYPFGNSDYSQVFLP 99
Query: 105 TP----RKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHKMRG--KGTSYENM 158
P RKL G + SE F N+ S F T DP AHLFF+P S + MR + S +
Sbjct: 100 HPDPYDRKL-GNFFSEHMFKINLLNSTFATRDPGEAHLFFMPFSINAMRNHPRIRSEAMI 158
Query: 159 TVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSY-D 217
+ V+ YV+ + +Y +WNRT G DHF+V CH VG A L +NAI+V CS +Y
Sbjct: 159 SSFVESYVEEISQRYKFWNRTEGVDHFYVGCHSVGRNAASNSRALQQNAIQVTCSANYYQ 218
Query: 218 VAFIPHKDVALPQV----LQPFALPRGGRDVENRTVLGFWAGH-RNSKIRVILARVWEND 272
++PHKDVALPQV L F +P E RT L F++G +NS +R L ++W ND
Sbjct: 219 KLYVPHKDVALPQVWPRPLDTFIVP-----PEKRTKLAFFSGRAQNSHLRETLLKLWSND 273
Query: 273 TELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYD 332
+++DI + + Y+ ++KFC+ G +VN+ARI+D++H+GCVPVI+SN YD
Sbjct: 274 SDMDIFAGTMQGS-----YEDALSRSKFCLHVKGYEVNTARISDALHFGCVPVIISNQYD 328
Query: 333 LPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFD 392
LP +++L+WR F+++L+ + LK L++++ E+ L +N +V++HF W+ P +D
Sbjct: 329 LPLSNVLNWRSFSIVLSYTQIPALKAKLQSVTHDEYARLWSNGRRVKRHFGWHHSPREYD 388
Query: 393 AFHMVMYELWLRRHVIK 409
+F M MYELW +RH ++
Sbjct: 389 SFQMTMYELWSKRHFVR 405
>gi|102139904|gb|ABF70053.1| exostosin family protein [Musa acuminata]
Length = 585
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 142/353 (40%), Positives = 214/353 (60%), Gaps = 16/353 (4%)
Query: 66 DLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRE 125
DL + +F + SVF +Y M + K+YIY DG P TP L G YASEG+F + + E
Sbjct: 230 DLYAPLFRNVSVFKKSYKMMESILKVYIYEDG-PKPLCHTPH-LDGIYASEGWFMKLMEE 287
Query: 126 -SRFLTHDPDRAHLFFIPISCHKMR-----GKGTSYENMTVIVKDYVDSLINKYPYWNRT 179
++F+ DP++AHLF++P S ++R S + +++ ++DYV+S+ KYP+WNRT
Sbjct: 288 NTQFVVKDPNKAHLFYLPYSSRQLRTHLYVAGSRSMQPLSIFLRDYVNSIAAKYPFWNRT 347
Query: 180 LGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSY-DVAFIPHKDVALPQ--VLQPFA 236
GADHF V CHD T L KN I+VVC+ + F+ KDV+L + V P +
Sbjct: 348 RGADHFLVACHDWATYTTNLHEDLRKNTIKVVCNADVSEGVFVRGKDVSLAETYVRTPNS 407
Query: 237 LPR---GGRDVENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVYQ 292
PR GGR R++L F+AG + ++R IL R W D ++ I + + Y
Sbjct: 408 -PRKAIGGRPASRRSILAFFAGQMHGRVRPILLRHWRGRDRDMRIYEVLPDEIAAKMSYI 466
Query: 293 RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERD 352
+KFCICP G +VNS RI ++I+Y CVPVI++N + LPF ++LDW F+V++ E+D
Sbjct: 467 EHMKSSKFCICPMGYEVNSPRIVEAIYYDCVPVIIANNFVLPFEEVLDWGAFSVVVAEKD 526
Query: 353 VYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+ +LKQIL IS +V + N+ +++KHF WN PV++D FHM+++ +W R
Sbjct: 527 IPKLKQILLGISGRRYVRMQTNVRRLRKHFLWNDKPVKYDLFHMILHSIWFNR 579
>gi|102140005|gb|ABF70140.1| exostosin family protein [Musa balbisiana]
Length = 533
Score = 264 bits (675), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 142/353 (40%), Positives = 213/353 (60%), Gaps = 16/353 (4%)
Query: 66 DLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRE 125
DL + +F + SVF +Y M K+YIY DG P TP L G YASEG+F + + E
Sbjct: 178 DLYAPLFLNVSVFKRSYTMMERILKVYIYEDG-PKPLCHTPH-LDGIYASEGWFMKLMEE 235
Query: 126 -SRFLTHDPDRAHLFFIPISCHKMR-----GKGTSYENMTVIVKDYVDSLINKYPYWNRT 179
++F+ +DP++AHLF++P S ++R S +++ ++DYV+S+ KYP+WNRT
Sbjct: 236 NTQFVVNDPNKAHLFYLPYSSRQLRTHLYVAGSRSMRPLSIFLRDYVNSISAKYPFWNRT 295
Query: 180 LGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSY-DVAFIPHKDVALPQ--VLQPFA 236
GADHF V CHD T L KN I+VVC+ + F+ KDV+L + V P +
Sbjct: 296 RGADHFLVACHDWATYTTNLHEELRKNTIKVVCNADVSEGVFVRGKDVSLAETYVRTPNS 355
Query: 237 LPR---GGRDVENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVYQ 292
PR GGR R++L F+AG + ++R IL R W D ++ I + + Y
Sbjct: 356 -PRKAIGGRPASRRSILAFFAGQMHGRVRPILLRYWRGRDRDMRIYEVLPDEIAAKMSYI 414
Query: 293 RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERD 352
+KFCICP G +VNS RI ++I+Y CVPVI++N + LPF ++LDW F+V++ E+D
Sbjct: 415 EHMKSSKFCICPMGYEVNSPRIVEAIYYDCVPVIIANNFVLPFEEVLDWGAFSVVVAEKD 474
Query: 353 VYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+ +LKQIL IS +V + N+ +++KHF WN PV++D FHM+++ +W R
Sbjct: 475 IPKLKQILLGISGRRYVRMQRNVRRLRKHFLWNDKPVKYDLFHMILHSIWFNR 527
>gi|357440259|ref|XP_003590407.1| hypothetical protein MTR_1g061760 [Medicago truncatula]
gi|355479455|gb|AES60658.1| hypothetical protein MTR_1g061760 [Medicago truncatula]
Length = 508
Score = 263 bits (673), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 134/364 (36%), Positives = 216/364 (59%), Gaps = 16/364 (4%)
Query: 52 NSTLSFAINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTG 111
++ S DY +G ++ + FH +YLEM FK+++Y +G+P F+ P K
Sbjct: 142 EASYSTQTQDPDYVPIGP-MYWNAKAFHRSYLEMEKQFKVFVYEEGEPPVFHNGPCK--S 198
Query: 112 KYASEGYFFQNIR-ESRFLTHDPDRAHLFFIPISCHKM-----RGKGTSYENMTVIVKDY 165
Y+ EG F I +F T DP +AH++F+P S + + + V DY
Sbjct: 199 IYSMEGNFIHAIELNDQFRTRDPQKAHVYFLPFSVVMLVRFVYLRDSRDFGPIRKTVTDY 258
Query: 166 VDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKD 225
++ + KYPYWNR+LGADHF + CHD G + VP+L KN+IRV+C+ + F P KD
Sbjct: 259 INVIAGKYPYWNRSLGADHFMLACHDWGPETSFSVPYLHKNSIRVLCNANTSERFNPAKD 318
Query: 226 VALPQV-LQPFALPR--GGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNR- 281
V+ P++ LQ ++ GG R +L F+AG + IR IL WEN+ + D+ +
Sbjct: 319 VSFPEINLQTGSINGFLGGLSASKRPILAFFAGGLHGHIRAILLEHWENNKDQDMMIQKY 378
Query: 282 INRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDW 341
+ + + Y K+KFC+CP G +V S RI ++I+ GCVPV++S++Y PF+D+L+W
Sbjct: 379 LPKGVS---YYEMLRKSKFCLCPSGYEVASPRIVEAIYTGCVPVLISDHYVPPFSDVLNW 435
Query: 342 RKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYEL 401
+ F+V ++ D+ +LK IL IS +++ + +V++++HF+ +SPP RFD FHM+++ +
Sbjct: 436 KSFSVEISVEDIPKLKDILMRISPTQYIRMQRRVVQIRRHFEVHSPPKRFDVFHMILHSV 495
Query: 402 WLRR 405
WLRR
Sbjct: 496 WLRR 499
>gi|359496494|ref|XP_002269459.2| PREDICTED: probable glycosyltransferase At5g03795-like [Vitis
vinifera]
Length = 554
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 139/363 (38%), Positives = 214/363 (58%), Gaps = 20/363 (5%)
Query: 63 DYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQN 122
D +L + +FH+ SVF +Y M T K+YIYPDG F+ L G YASEG+F +
Sbjct: 177 DDPELYASLFHNVSVFKRSYELMETILKVYIYPDGARPIFHAP--HLRGIYASEGWFMKL 234
Query: 123 IRESR-FLTHDPDRAHLFFIPISCHKMRGK-----GTSYENMTVIVKDYVDSLINKYPYW 176
+ E+R F+T DP +AHLF++P S ++ + +++ ++D+V+ + KYP+W
Sbjct: 235 MEENRQFVTRDPKKAHLFYLPYSARQLETALYVPNSHNIRPLSIFLRDHVNMIAAKYPFW 294
Query: 177 NRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVA-FIPHKDVALPQVLQPF 235
NRT G+DHF V CHD G L +N I+ +C+ F+ KDV+LP+
Sbjct: 295 NRTHGSDHFLVACHDWGPYTVNEHQELSRNTIKALCNADLSEGIFVAGKDVSLPETT--I 352
Query: 236 ALPR------GGRDVENRTVLGFWAGHRNSKIRVILARVW-ENDTELDISNNRINRAIGP 288
PR GGR V R +L F+AG+ + ++R L + W + D ++ I NR
Sbjct: 353 RNPRRPLRNVGGRRVSQRPILAFFAGNMHGRVRPTLLKYWSDKDEDMRIYGPLPNRISRK 412
Query: 289 LVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVIL 348
+ Y + ++FCICP G +VNS RI ++I+Y CVPVI+++ + P ND+LDW F+VI+
Sbjct: 413 MSYIQHMKSSRFCICPMGYEVNSPRIVEAIYYECVPVIIADNFVPPLNDVLDWTAFSVIV 472
Query: 349 NERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELW--LRRH 406
E+D+ +LK+IL I ++ + N+ VQKHF WN PVR+D FHM+++ +W L+ H
Sbjct: 473 AEKDIPKLKEILLAIPLRRYLVMQTNVKMVQKHFLWNPKPVRYDLFHMILHSIWVVLKHH 532
Query: 407 VIK 409
V+K
Sbjct: 533 VLK 535
>gi|357125316|ref|XP_003564340.1| PREDICTED: probable glycosyltransferase At5g03795-like
[Brachypodium distachyon]
Length = 553
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 135/357 (37%), Positives = 217/357 (60%), Gaps = 18/357 (5%)
Query: 63 DYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQN 122
D DL + +F + S+F +Y M K++IY DG F+ +L G YASEG+F +
Sbjct: 194 DDPDLYAPLFLNVSIFKRSYELMERLLKVFIYHDGAKPIFHSP--ELKGIYASEGWFMKL 251
Query: 123 IRESR-FLTHDPDRAHLFFIPISCHKMR-----GKGTSYENMTVIVKDYVDSLINKYPYW 176
I ++ F+ DP+RAHLF++P S ++ + E +++ VK+Y+D + K+PYW
Sbjct: 252 IEGNQNFVVRDPNRAHLFYLPYSSRQLEHNLYVPGSNTLEPLSIFVKNYIDMISAKFPYW 311
Query: 177 NRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSY-DVAFIPHKDVALPQVLQPF 235
NRT GADHFFV CHD G T+ L +N I+ +C+ + FI +DV+LP+
Sbjct: 312 NRTKGADHFFVACHDWGPYTTKLHDELRRNTIKALCNADLSEGVFIRGRDVSLPETF--V 369
Query: 236 ALPR------GGRDVENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGP 288
PR GG+ R++L F+AG + ++R IL + W DT++ I R
Sbjct: 370 RSPRRPLRDIGGKPATERSILAFFAGQMHGRVRPILLQYWGGKDTDMRIYGRLPRRITRR 429
Query: 289 LVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVIL 348
+ Y + +K+CICP G +VNS RI ++I+Y C+PVI+++ + LPF+D LDW F+V++
Sbjct: 430 MNYVQHMKSSKYCICPMGYEVNSPRIVEAIYYECIPVIIADNFVLPFDDALDWSTFSVVV 489
Query: 349 NERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
E+DV +LK+IL I ++ ++++ +N+ KVQKHF W++ PV++D FHM+++ +W R
Sbjct: 490 PEKDVPRLKEILLRIPESRYITMQSNVKKVQKHFLWHAKPVKYDIFHMILHSVWFSR 546
>gi|356570806|ref|XP_003553575.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 537
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 131/352 (37%), Positives = 214/352 (60%), Gaps = 15/352 (4%)
Query: 63 DYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQN 122
DY +G ++++ + FH +YLEM FK+++Y +G+P F+ P K Y+ EG F
Sbjct: 183 DYVPVGP-MYNNANAFHRSYLEMEKQFKVFVYEEGEPPVFHNGPCKSI--YSMEGNFIHA 239
Query: 123 IR-ESRFLTHDPDRAHLFFIPISCHKMRG-----KGTSYENMTVIVKDYVDSLINKYPYW 176
I +F T DP+ AH+FF+P S + + + V DYV+ + +YPYW
Sbjct: 240 IEMNDQFRTRDPEEAHVFFLPFSVAMLVQFVYVRDSHDFGPIKKTVTDYVNVIGGRYPYW 299
Query: 177 NRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV-LQPF 235
NR+LGADHF++ CHD G + +P L KN+IRV+C+ + F P KDV+ P++ LQ
Sbjct: 300 NRSLGADHFYLACHDWGPETSRSIPNLNKNSIRVLCNANTSEGFKPSKDVSFPEINLQTG 359
Query: 236 ALPR--GGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQR 293
++ GG R +L F+AG + IR +L WEN E + + + + Y
Sbjct: 360 SINGFIGGPSASRRPLLAFFAGGLHGPIRPVLLEHWENKDEDIQVHKYLPKGVS---YYE 416
Query: 294 RFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDV 353
K+KFC+CP G +V S R+ ++I+ GCVPV++S++Y PFND+L+W+ F+V ++ +D+
Sbjct: 417 MLRKSKFCLCPSGYEVASPRVVEAIYTGCVPVLISDHYVPPFNDVLNWKSFSVEVSVKDI 476
Query: 354 YQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+LK+IL +IS +++ + + +V++HF+ +SPP R+D FHM+++ +WLRR
Sbjct: 477 PRLKEILLSISPRQYIRMQRRVGQVRRHFEVHSPPKRYDVFHMILHSVWLRR 528
>gi|296083497|emb|CBI23466.3| unnamed protein product [Vitis vinifera]
Length = 585
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 136/357 (38%), Positives = 209/357 (58%), Gaps = 18/357 (5%)
Query: 63 DYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQN 122
D +L + +FH+ SVF +Y M T K+YIYPDG F+ L G YASEG+F +
Sbjct: 177 DDPELYASLFHNVSVFKRSYELMETILKVYIYPDGARPIFHAP--HLRGIYASEGWFMKL 234
Query: 123 IRESR-FLTHDPDRAHLFFIPISCHKMRGK-----GTSYENMTVIVKDYVDSLINKYPYW 176
+ E+R F+T DP +AHLF++P S ++ + +++ ++D+V+ + KYP+W
Sbjct: 235 MEENRQFVTRDPKKAHLFYLPYSARQLETALYVPNSHNIRPLSIFLRDHVNMIAAKYPFW 294
Query: 177 NRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVA-FIPHKDVALPQVLQPF 235
NRT G+DHF V CHD G L +N I+ +C+ F+ KDV+LP+
Sbjct: 295 NRTHGSDHFLVACHDWGPYTVNEHQELSRNTIKALCNADLSEGIFVAGKDVSLPETT--I 352
Query: 236 ALPR------GGRDVENRTVLGFWAGHRNSKIRVILARVW-ENDTELDISNNRINRAIGP 288
PR GGR V R +L F+AG+ + ++R L + W + D ++ I NR
Sbjct: 353 RNPRRPLRNVGGRRVSQRPILAFFAGNMHGRVRPTLLKYWSDKDEDMRIYGPLPNRISRK 412
Query: 289 LVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVIL 348
+ Y + ++FCICP G +VNS RI ++I+Y CVPVI+++ + P ND+LDW F+VI+
Sbjct: 413 MSYIQHMKSSRFCICPMGYEVNSPRIVEAIYYECVPVIIADNFVPPLNDVLDWTAFSVIV 472
Query: 349 NERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
E+D+ +LK+IL I ++ + N+ VQKHF WN PVR+D FHM+++ +W R
Sbjct: 473 AEKDIPKLKEILLAIPLRRYLVMQTNVKMVQKHFLWNPKPVRYDLFHMILHSIWFSR 529
>gi|79326862|ref|NP_001031828.1| putative glycosyltransferase [Arabidopsis thaliana]
gi|292630778|sp|Q9FFN2.2|GLYT3_ARATH RecName: Full=Probable glycosyltransferase At5g03795
gi|332003272|gb|AED90655.1| putative glycosyltransferase [Arabidopsis thaliana]
Length = 518
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 134/343 (39%), Positives = 215/343 (62%), Gaps = 16/343 (4%)
Query: 77 VFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR-ESRFLTHDPDR 135
VFH +YLEM FKIY+Y +G+P F+ P K Y+ EG F I ++RF T++PD+
Sbjct: 178 VFHRSYLEMEKQFKIYVYKEGEPPLFHDGPCK--SIYSMEGSFIYEIETDTRFRTNNPDK 235
Query: 136 AHLFFIPISCHKM-----RGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCH 190
AH+F++P S KM + + VKDY++ + +KYPYWNR++GADHF ++CH
Sbjct: 236 AHVFYLPFSVVKMVRYVYERNSRDFSPIRNTVKDYINLVGDKYPYWNRSIGADHFILSCH 295
Query: 191 DVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV-LQPFALPR--GGRDVENR 247
D G A+ P L N+IR +C+ + F P KDV++P++ L+ +L GG +R
Sbjct: 296 DWGPEASFSHPHLGHNSIRALCNANTSERFKPRKDVSIPEINLRTGSLTGLVGGPSPSSR 355
Query: 248 TVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGG 306
+L F+AG + +R +L + WEN D ++ + + + R Y +KFCICP G
Sbjct: 356 PILAFFAGGVHGPVRPVLLQHWENKDNDIRV-HKYLPRGTS---YSDMMRNSKFCICPSG 411
Query: 307 SQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDA 366
+V S RI ++++ GCVPV++++ Y PF+D+L+WR F+VI++ D+ LK IL +IS
Sbjct: 412 YEVASPRIVEALYSGCVPVLINSGYVPPFSDVLNWRSFSVIVSVEDIPNLKTILTSISPR 471
Query: 367 EFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIK 409
+++ ++ ++KV++HF+ NSP RFD FHM+++ +W+RR +K
Sbjct: 472 QYLRMYRRVLKVRRHFEVNSPAKRFDVFHMILHSIWVRRLNVK 514
>gi|224144815|ref|XP_002325425.1| predicted protein [Populus trichocarpa]
gi|222862300|gb|EEE99806.1| predicted protein [Populus trichocarpa]
Length = 332
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 140/336 (41%), Positives = 202/336 (60%), Gaps = 26/336 (7%)
Query: 85 MVTNFKIYIYPD--GDP--NTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFF 140
M +FKIY+YP DP N + G YASE YF + + +S F+T DP +A LFF
Sbjct: 1 MNRSFKIYVYPHRRNDPFANVLLPVDFEPGGNYASESYFKKALMKSHFITKDPAKADLFF 60
Query: 141 IPISCHKMR-GKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEG 199
+P S ++R + ++DY+ ++ KYP+WNRT GADHF+ CH +G A E
Sbjct: 61 LPFSITRLRHDPRVGVGGIQDFIRDYILNISRKYPFWNRTGGADHFYAACHSIGRSAMEK 120
Query: 200 VPFLIKNAIRVVCSPSYDVA-FIPHKDVALP-----QVLQPFALPRGGRDVENRTVLGFW 253
+ NAI+VVCS SY ++ +I HKDV+ P QV++ + R L F+
Sbjct: 121 SEEVKFNAIQVVCSSSYFLSGYIAHKDVSFPGCHLSQVVK----------CDYRKKLAFF 170
Query: 254 AGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSAR 313
AG NS +R L W ND+E+ R+ Y +KFC+ G +VN+AR
Sbjct: 171 AGSINSPVRERLLHSWRNDSEIFAHFGRLTTP-----YADELLGSKFCLHVKGFEVNTAR 225
Query: 314 IADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHN 373
I DS++YGCVPVI++N+YDLPF DIL+W+ F+V++ D+ LK+ILK IS +++
Sbjct: 226 IGDSLYYGCVPVIIANHYDLPFADILNWKSFSVVVATLDIPLLKKILKGISSDQYLMFQK 285
Query: 374 NLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIK 409
+++V+KHFQW+ PPV +DAF+MVMYELWLRR ++
Sbjct: 286 KVLEVRKHFQWHCPPVDYDAFYMVMYELWLRRTSVR 321
>gi|9758008|dbj|BAB08605.1| unnamed protein product [Arabidopsis thaliana]
Length = 408
Score = 258 bits (660), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 133/342 (38%), Positives = 211/342 (61%), Gaps = 14/342 (4%)
Query: 77 VFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR-ESRFLTHDPDR 135
VFH +YLEM FKIY+Y +G+P F+ P K Y+ EG F I ++RF T++PD+
Sbjct: 68 VFHRSYLEMEKQFKIYVYKEGEPPLFHDGPCKSI--YSMEGSFIYEIETDTRFRTNNPDK 125
Query: 136 AHLFFIPISCHKM-----RGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCH 190
AH+F++P S KM + + VKDY++ + +KYPYWNR++GADHF ++CH
Sbjct: 126 AHVFYLPFSVVKMVRYVYERNSRDFSPIRNTVKDYINLVGDKYPYWNRSIGADHFILSCH 185
Query: 191 DVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV-LQPFALPR--GGRDVENR 247
D G A+ P L N+IR +C+ + F P KDV++P++ L+ +L GG +R
Sbjct: 186 DWGPEASFSHPHLGHNSIRALCNANTSERFKPRKDVSIPEINLRTGSLTGLVGGPSPSSR 245
Query: 248 TVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGS 307
+L F+AG + +R +L + WEN + + R Y +KFCICP G
Sbjct: 246 PILAFFAGGVHGPVRPVLLQHWENKDNDIRVHKYLPRGTS---YSDMMRNSKFCICPSGY 302
Query: 308 QVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAE 367
+V S RI ++++ GCVPV++++ Y PF+D+L+WR F+VI++ D+ LK IL +IS +
Sbjct: 303 EVASPRIVEALYSGCVPVLINSGYVPPFSDVLNWRSFSVIVSVEDIPNLKTILTSISPRQ 362
Query: 368 FVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIK 409
++ ++ ++KV++HF+ NSP RFD FHM+++ +W+RR +K
Sbjct: 363 YLRMYRRVLKVRRHFEVNSPAKRFDVFHMILHSIWVRRLNVK 404
>gi|242056743|ref|XP_002457517.1| hypothetical protein SORBIDRAFT_03g008610 [Sorghum bicolor]
gi|241929492|gb|EES02637.1| hypothetical protein SORBIDRAFT_03g008610 [Sorghum bicolor]
Length = 521
Score = 258 bits (660), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 137/358 (38%), Positives = 214/358 (59%), Gaps = 18/358 (5%)
Query: 62 QDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQ 121
D DL + +F + S+F +Y M K++IY DG F+ +L G YASEG+F +
Sbjct: 163 SDDPDLYAPLFRNVSIFKRSYELMERLLKVFIYHDGAKPIFHSP--ELKGIYASEGWFMR 220
Query: 122 NIRESR-FLTHDPDRAHLFFIPISCHKMRGK-----GTSYENMTVIVKDYVDSLINKYPY 175
+ ++ F+ DP+RAHLF++P S ++ S E +++ VK+Y+D + KYPY
Sbjct: 221 LMETNQNFVVRDPNRAHLFYLPYSSRQLEHNLYVPGSNSIEPLSIFVKNYIDLISAKYPY 280
Query: 176 WNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVA-FIPHKDVALPQVLQP 234
WNRT GADHFFV CHD G T+ L KN I+ +C+ FI KDV+LP+
Sbjct: 281 WNRTKGADHFFVACHDWGPYTTKLHDELRKNTIKALCNADLSEGIFIRGKDVSLPETF-- 338
Query: 235 FALPR------GGRDVENRTVLGFWAGHRNSKIRVILARVW-ENDTELDISNNRINRAIG 287
PR GGR RT+L F+AG + ++R +L + W + D ++ I + +R
Sbjct: 339 LRSPRRPLRDIGGRPAAQRTILAFFAGQMHGRVRPVLLKYWGDKDADMRIYSRLPHRITR 398
Query: 288 PLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVI 347
Y + +K+CICP G +VNS RI ++I+Y CVPVI+++ + LPF+ L+W F+V+
Sbjct: 399 KRNYVQHMKSSKYCICPMGYEVNSPRIVEAIYYECVPVIIADNFVLPFDAALNWSAFSVV 458
Query: 348 LNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+ E DV +LK+IL I ++ +++L +N+ +VQKHF W+ PV++D FHM+++ +W R
Sbjct: 459 VPESDVPKLKEILLAIPESRYITLQSNVKRVQKHFLWHPNPVKYDIFHMILHSVWFSR 516
>gi|356548353|ref|XP_003542567.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 505
Score = 258 bits (660), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 133/366 (36%), Positives = 216/366 (59%), Gaps = 20/366 (5%)
Query: 49 NNFNSTLSFAINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRK 108
N+N T DY +G ++ +P FH +YLEM FK+++Y +G+ F++ P
Sbjct: 142 ENWNQT-----QDSDYVPVGP-MYWNPKEFHRSYLEMEKQFKVFVYEEGELPVFHEGP-- 193
Query: 109 LTGKYASEGYFFQNIR-ESRFLTHDPDRAHLFFIPISCHKMRG-----KGTSYENMTVIV 162
Y++EG F I F T DP +AH+FF+P S M + + V
Sbjct: 194 CASIYSTEGSFIHAIEMNEHFRTRDPKKAHVFFLPFSVVMMVRYVYIRDSHDFGPIKRTV 253
Query: 163 KDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIP 222
+DY++ + +YPYWNR+LGADHF ++CHD G A++ P+L KN+IRV+C+ + F P
Sbjct: 254 RDYINVIAARYPYWNRSLGADHFMLSCHDWGPEASKFSPYLRKNSIRVLCNANTSEGFDP 313
Query: 223 HKDVALPQV-LQ--PFALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISN 279
KDV+ P++ LQ P GG R++L F+AG + IR IL WE E +
Sbjct: 314 RKDVSFPEINLQRGPIDGLLGGPSASQRSILAFFAGGIHGPIRPILLEHWEKKDEDIQVH 373
Query: 280 NRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDIL 339
+ + + Y K+KFC+CP G +V S R+ ++I+ GCVPV++S++Y PF+D+L
Sbjct: 374 QYLPKGVS---YYGMLRKSKFCLCPSGYEVASPRVVEAIYTGCVPVLISDHYVPPFSDVL 430
Query: 340 DWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMY 399
+W+ F+V ++ +++ LK IL NIS +++ + + ++++HF+ +SPP R+D FHM+++
Sbjct: 431 NWKMFSVEVSMKEIPNLKDILMNISPRKYIRMQKRVRQIRRHFEVHSPPKRYDVFHMILH 490
Query: 400 ELWLRR 405
+WLRR
Sbjct: 491 SVWLRR 496
>gi|356505300|ref|XP_003521429.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 534
Score = 258 bits (660), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 129/353 (36%), Positives = 214/353 (60%), Gaps = 15/353 (4%)
Query: 62 QDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQ 121
+DY +G ++++ + FH +YLEM FK+++Y +G+P F+ P K Y+ EG F
Sbjct: 179 KDYVPVGP-MYNNANAFHRSYLEMEKQFKVFVYEEGEPPVFHNGPCKSI--YSMEGNFIH 235
Query: 122 NIR-ESRFLTHDPDRAHLFFIPISCHKMRG-----KGTSYENMTVIVKDYVDSLINKYPY 175
I +F T DP++AH+FF+P S + + + V DYV+ + +YPY
Sbjct: 236 AIEMNDQFRTRDPEKAHVFFLPFSVAMLVQFVYVRDSHDFGPIKKTVTDYVNVIAGRYPY 295
Query: 176 WNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV-LQP 234
WNR+LGADHF++ CHD G + +P L +N+IRV+C+ + F P KDV+ P++ LQ
Sbjct: 296 WNRSLGADHFYLACHDWGPETSRSIPNLNENSIRVLCNANTSEGFKPSKDVSFPEINLQT 355
Query: 235 FALPR--GGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQ 292
++ GG R +L F+AG + IR +L WEN E + + + + Y
Sbjct: 356 GSINGFIGGPSASGRPLLAFFAGGLHGPIRPVLLEHWENRDEDIQVHKYLPKGVS---YY 412
Query: 293 RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERD 352
K++FC+CP G +V S R+ ++I+ GCVPV++S++Y PFND+L+W+ F+V ++ +D
Sbjct: 413 EMLRKSRFCLCPSGYEVASPRVVEAIYTGCVPVLISDHYVPPFNDVLNWKSFSVEVSVKD 472
Query: 353 VYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+ +LK+IL +IS ++ + + V++HF+ +SPP R+D FHM+++ +WLRR
Sbjct: 473 IPRLKEILLSISPRHYIRMQRRVGLVRRHFEVHSPPKRYDVFHMILHSVWLRR 525
>gi|413947133|gb|AFW79782.1| putative homeodomain-like transcription factor superfamily protein
[Zea mays]
Length = 544
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 137/357 (38%), Positives = 213/357 (59%), Gaps = 18/357 (5%)
Query: 63 DYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQN 122
D DL + +F + S+F +Y M K++IY DG F+ +L G YASEG+F +
Sbjct: 187 DDPDLYAPLFRNVSIFKRSYELMERLLKVFIYHDGAKPIFHSP--ELKGIYASEGWFMKL 244
Query: 123 IRESR-FLTHDPDRAHLFFIPISCHKMR-----GKGTSYENMTVIVKDYVDSLINKYPYW 176
+ ++ F+ DP+ AHLF++P S ++ + E +++ VK+Y+D + KYPYW
Sbjct: 245 METNQNFVVRDPNTAHLFYLPYSSRQLEHNLYVPGSNTIEPLSIFVKNYIDLISAKYPYW 304
Query: 177 NRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAF-IPHKDVALPQVLQPF 235
NRT GADHFFV CHD G T+ L KN I+ +C+ F I KDV+LP+
Sbjct: 305 NRTKGADHFFVACHDWGPYTTKLHDELRKNTIKALCNADLSEGFFIRGKDVSLPETF--L 362
Query: 236 ALPR------GGRDVENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGP 288
PR GGR RT+L F+AG + ++R +L + W N D ++ I + +R
Sbjct: 363 RSPRRPLRDIGGRPAAQRTILAFFAGQMHGRVRPVLLKYWGNKDDDMRIYSRLPHRITRK 422
Query: 289 LVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVIL 348
Y + +K+CICP G +VNS RI ++I+Y CVPVI+++++ LPF+ L+W F+V++
Sbjct: 423 RNYVQHMKSSKYCICPMGYEVNSPRIVEAIYYECVPVIIADHFVLPFDAALNWSTFSVVV 482
Query: 349 NERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
E DV +LKQIL I ++ +++L N+ +VQKHF W+ PV++D FHM+++ +W R
Sbjct: 483 PESDVPKLKQILLAIPESRYITLQANVKRVQKHFMWHPNPVKYDIFHMILHSVWFSR 539
>gi|414881250|tpg|DAA58381.1| TPA: hypothetical protein ZEAMMB73_758303 [Zea mays]
Length = 474
Score = 258 bits (659), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 135/361 (37%), Positives = 220/361 (60%), Gaps = 25/361 (6%)
Query: 63 DYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQN 122
D DL + +F + S F +Y M K+YIY DG F+ P L+G YASEG+F +
Sbjct: 111 DDPDLYAPLFKNVSQFKRSYELMERILKVYIYQDGRRPIFHTPP--LSGIYASEGWFMKL 168
Query: 123 IRESR-FLTHDPDRAHLFFIPISCHKMR-----GKGTSYENMTVIVKDYVDSLINKYPYW 176
++ESR + D +AHLF++P S ++R + + ++++V L +KYP+W
Sbjct: 169 LKESRRHVVADAGKAHLFYLPYSSQQLRLTLYEAGSHNLRPLAAYLRNFVRGLASKYPFW 228
Query: 177 NRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVC-SPSYDVAFIPHKDVALPQVLQPF 235
NRT GADHF V CHD G T L KNAI+ +C + S + F P KDV+LP+
Sbjct: 229 NRTRGADHFLVACHDWGPYTTTAHRDLRKNAIKALCNADSSEGIFTPGKDVSLPETT--I 286
Query: 236 ALPR------GGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDIS-----NNRINR 284
PR GG V R++L F+AG+ + ++R +L R W N + D+ +R++R
Sbjct: 287 RTPRRPLRYVGGLPVSRRSILAFFAGNVHGRVRPVLLRHWGNGQDDDMRVYSLLPSRVSR 346
Query: 285 AIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKF 344
+ Y + ++FC+CP G +VNS RI ++++Y CVPVI+++ + LPF+++LDW F
Sbjct: 347 RMN---YIQHMKNSRFCLCPMGYEVNSPRIVEALYYECVPVIIADNFVLPFSEVLDWSAF 403
Query: 345 AVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLR 404
+V++ E+D+ LK+ILK IS +V++H+++ ++Q+HF W++ P+++D FHM+++ +WL
Sbjct: 404 SVVIAEKDIPDLKKILKGISLRRYVAMHDSVKRLQRHFLWHARPIKYDLFHMILHSIWLS 463
Query: 405 R 405
R
Sbjct: 464 R 464
>gi|296085068|emb|CBI28483.3| unnamed protein product [Vitis vinifera]
Length = 416
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 136/368 (36%), Positives = 214/368 (58%), Gaps = 17/368 (4%)
Query: 52 NSTLSFAINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTG 111
N +L DY G ++ + + FH +YLEM FKIY+Y +G+P F+ P K
Sbjct: 53 NGSLKSTHEDPDYVPQGP-IYRNANAFHRSYLEMEKLFKIYVYEEGEPPMFHNGPCKSI- 110
Query: 112 KYASEGYFFQNIRE-SRFLTHDPDRAHLFFIPISCHKMRG-----KGTSYENMTVIVKDY 165
Y++EG F + + S + T DPD+A L+F+P S M + V DY
Sbjct: 111 -YSTEGRFIHEMEKGSVYRTTDPDQALLYFLPFSVVMMVQYLYVPDSHEIHAIEKTVIDY 169
Query: 166 VDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKD 225
++ + + +P+WNR+LGADHF ++CHD G RA+ VP+L N+IRV+C+ + F P KD
Sbjct: 170 INLISHNHPFWNRSLGADHFMLSCHDWGPRASTSVPYLYNNSIRVLCNANTSEGFNPSKD 229
Query: 226 VALPQV---LQPFALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNR 281
V+ P++ + P GG R +LGF+AG + IR +L W++ D +L + +
Sbjct: 230 VSFPEIHLRTGEMSGPLGGLSPSRRPILGFFAGRLHGHIRYLLLEQWKDKDKDLQVYDQL 289
Query: 282 INRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDW 341
N L Y K++FC+CP G +V S R+ ++I+ CVPV++S+ Y PFND+L+W
Sbjct: 290 PNG----LSYDSMLKKSRFCLCPSGYEVASPRVVEAIYAECVPVLISDNYVPPFNDVLNW 345
Query: 342 RKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYEL 401
+ FAV + RD+ +K+IL IS +++ ++ + +VQ+HF N+ P RFD FHM ++ +
Sbjct: 346 KSFAVQVQVRDIANIKRILMGISQTQYLRMYRRVKQVQRHFMVNAAPQRFDVFHMTIHSI 405
Query: 402 WLRRHVIK 409
WLRR I+
Sbjct: 406 WLRRLNIR 413
>gi|359485862|ref|XP_002264076.2| PREDICTED: probable glycosyltransferase At5g03795-like [Vitis
vinifera]
Length = 484
Score = 258 bits (658), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 136/368 (36%), Positives = 214/368 (58%), Gaps = 17/368 (4%)
Query: 52 NSTLSFAINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTG 111
N +L DY G ++ + + FH +YLEM FKIY+Y +G+P F+ P K
Sbjct: 121 NGSLKSTHEDPDYVPQGP-IYRNANAFHRSYLEMEKLFKIYVYEEGEPPMFHNGPCKSI- 178
Query: 112 KYASEGYFFQNIRE-SRFLTHDPDRAHLFFIPISCHKMRG-----KGTSYENMTVIVKDY 165
Y++EG F + + S + T DPD+A L+F+P S M + V DY
Sbjct: 179 -YSTEGRFIHEMEKGSVYRTTDPDQALLYFLPFSVVMMVQYLYVPDSHEIHAIEKTVIDY 237
Query: 166 VDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKD 225
++ + + +P+WNR+LGADHF ++CHD G RA+ VP+L N+IRV+C+ + F P KD
Sbjct: 238 INLISHNHPFWNRSLGADHFMLSCHDWGPRASTSVPYLYNNSIRVLCNANTSEGFNPSKD 297
Query: 226 VALPQV---LQPFALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNR 281
V+ P++ + P GG R +LGF+AG + IR +L W++ D +L + +
Sbjct: 298 VSFPEIHLRTGEMSGPLGGLSPSRRPILGFFAGRLHGHIRYLLLEQWKDKDKDLQVYDQL 357
Query: 282 INRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDW 341
N L Y K++FC+CP G +V S R+ ++I+ CVPV++S+ Y PFND+L+W
Sbjct: 358 PNG----LSYDSMLKKSRFCLCPSGYEVASPRVVEAIYAECVPVLISDNYVPPFNDVLNW 413
Query: 342 RKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYEL 401
+ FAV + RD+ +K+IL IS +++ ++ + +VQ+HF N+ P RFD FHM ++ +
Sbjct: 414 KSFAVQVQVRDIANIKRILMGISQTQYLRMYRRVKQVQRHFMVNAAPQRFDVFHMTIHSI 473
Query: 402 WLRRHVIK 409
WLRR I+
Sbjct: 474 WLRRLNIR 481
>gi|356537024|ref|XP_003537031.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 472
Score = 258 bits (658), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 131/352 (37%), Positives = 210/352 (59%), Gaps = 15/352 (4%)
Query: 63 DYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQN 122
DY +G ++ + FH +YLEM FK+++Y +G+ F+ P K Y+ EG F
Sbjct: 118 DYVPIGP-MYWNAKTFHRSYLEMEKQFKVFVYEEGETPVFHNGPCKSI--YSMEGNFIHA 174
Query: 123 IR-ESRFLTHDPDRAHLFFIPISCHKM-----RGKGTSYENMTVIVKDYVDSLINKYPYW 176
I F T DP +AH+FF+P S M + + + V DY++ + +Y YW
Sbjct: 175 IEMNDHFRTKDPKKAHVFFLPFSVVMMVRFVYQRDSRDFGPIRKTVIDYINLIAARYSYW 234
Query: 177 NRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV-LQPF 235
NR+LGADHF + CHD G A+ +P+L KN+IRV+C+ + F P KDV+ P++ LQ
Sbjct: 235 NRSLGADHFMLACHDWGPEASLSLPYLHKNSIRVLCNANTSEGFKPAKDVSFPEINLQTG 294
Query: 236 ALPR--GGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQR 293
++ GG R++L F+AG + IR IL WEN E + + + + Y
Sbjct: 295 SINGFIGGPSASKRSILAFFAGGVHGPIRPILLEHWENKDEDIQVHKYLPKGVS---YYD 351
Query: 294 RFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDV 353
+ +KFC+CP G +V S R+ ++I+ GCVPV++S +Y PF+D+L+W+ F+V L+ +D+
Sbjct: 352 KLRNSKFCLCPSGYEVASPRVVEAIYTGCVPVLISEHYVPPFSDVLNWKSFSVELSVKDI 411
Query: 354 YQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
LK IL +IS +++ + ++++Q+HF+ +SPP RFD FHM+++ +WLRR
Sbjct: 412 PNLKDILMSISPRQYIRMQRRVIQIQRHFEVHSPPKRFDVFHMILHSVWLRR 463
>gi|255575457|ref|XP_002528630.1| catalytic, putative [Ricinus communis]
gi|223531919|gb|EEF33733.1| catalytic, putative [Ricinus communis]
Length = 574
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 131/357 (36%), Positives = 215/357 (60%), Gaps = 17/357 (4%)
Query: 63 DYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQN 122
+Y +G ++S VFH +YLEM FK+++Y +G+P F+ P K Y+ EG F
Sbjct: 215 EYVPIGPMYWNS-KVFHRSYLEMEKQFKVFVYEEGEPPVFHNGPCK--SIYSMEGNFIHR 271
Query: 123 IR-ESRFLTHDPDRAHLFFIPISCHKMRG-----KGTSYENMTVIVKDYVDSLINKYPYW 176
+ + +F T DP++AH++F+P S M + + V+DYV+ + KYPYW
Sbjct: 272 MEIDDQFRTKDPEKAHVYFLPFSVAMMVQFVYVRDSHDFGPIKRTVRDYVNLVAGKYPYW 331
Query: 177 NRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFA 236
NR+LGADHF + CHD G + +P L KN+IR +C+ + F P KDV+ P++
Sbjct: 332 NRSLGADHFMLACHDWGPETSFSLPDLAKNSIRALCNANTSERFNPIKDVSFPEINLQTG 391
Query: 237 LPRG---GRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLV-YQ 292
+G G R++L F+AG + IR IL WEN ++ +++R + V Y
Sbjct: 392 TTKGFIGGPSPSKRSILAFFAGGLHGPIRPILLEHWENKD----NDMKVHRYLPKGVSYY 447
Query: 293 RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERD 352
K+KFC+CP G +V S R+ ++++ GCVPV++S++Y PF+D+L+W+ F+V + D
Sbjct: 448 EMMRKSKFCLCPSGYEVASPRVVEALYTGCVPVLISDHYVPPFSDVLNWKSFSVEVPVSD 507
Query: 353 VYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIK 409
+ LK+IL +IS +++ + +++V++HF+ NSPP R+D FHM+++ +WLRR +K
Sbjct: 508 IPNLKRILTSISSRQYIRMQRRVLQVRRHFEVNSPPKRYDVFHMILHSIWLRRLNVK 564
>gi|356565159|ref|XP_003550812.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
At5g20260-like [Glycine max]
Length = 488
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 134/348 (38%), Positives = 208/348 (59%), Gaps = 18/348 (5%)
Query: 71 VFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNI--RESRF 128
V+ + FH +++EM+ FK++ Y +G+ ++ P ++ Y EG+ I R F
Sbjct: 139 VYRNAYAFHQSHIEMLKRFKVWTYKEGELPXAHEGP--MSSIYGIEGHLIAQIDNRTGPF 196
Query: 129 LTHDPDRAHLFFIPISCHKM----RGKGTSY--ENMTVIVKDYVDSLINKYPYWNRTLGA 182
L PD AH+F +PIS ++ T+Y + + I DY + + ++YPYWNRT GA
Sbjct: 197 LARYPDEAHVFMLPISVTQIVRYVYNPLTTYSRDQLMRITVDYTNIIAHRYPYWNRTKGA 256
Query: 183 DHFFVTCHDVG--VRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV-LQPFAL-- 237
DHF +CHD + E L KN IRV+C+ + F P KDV +P++ LQ F L
Sbjct: 257 DHFLASCHDWAPDISREESGRELFKNIIRVLCNANTSEGFKPEKDVPMPEMNLQGFKLSS 316
Query: 238 PRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYK 297
P G D+ NR++L F+AG + +IR IL W++ E + + + + YQ +
Sbjct: 317 PIPGFDLNNRSILAFFAGGAHGRIRKILLEHWKDKDEEVQVHEYLPKGVD---YQGLMGQ 373
Query: 298 TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLK 357
+KFC+CP G +V S RI +SI+ GCVPVI+S+YY LPF+D+LDW KF++ + R + ++K
Sbjct: 374 SKFCLCPSGYEVASPRIVESINIGCVPVIVSDYYQLPFSDVLDWSKFSLHIPSRRIAEIK 433
Query: 358 QILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
ILKN+ A+++ L ++KVQ+HF+ N P FD FHM+++ +WLRR
Sbjct: 434 TILKNVPHAKYLKLQKRVMKVQRHFELNRPAKPFDVFHMILHSIWLRR 481
>gi|449482564|ref|XP_004156325.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
sativus]
Length = 516
Score = 255 bits (651), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 129/335 (38%), Positives = 207/335 (61%), Gaps = 14/335 (4%)
Query: 77 VFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR-ESRFLTHDPDR 135
FH +YLEM KI++Y +G+P F+ P K Y++EG F I +S+F T DP++
Sbjct: 178 AFHRSYLEMEKEMKIFVYEEGEPPLFHNGPCK--SIYSTEGNFIHAIEMDSQFRTKDPNK 235
Query: 136 AHLFFIPISCHKMR-----GKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCH 190
AH+FF+P+S + + + V DY++ + KYP+WNR+LGADHF ++CH
Sbjct: 236 AHVFFLPLSVAMLVRFVYVHDSHDFTPIRHTVVDYINVIGTKYPFWNRSLGADHFMLSCH 295
Query: 191 DVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV-LQPFALPR--GGRDVENR 247
D G A++ VP L KN+IRV+C+ + F P KDV+ P++ LQ L GG +R
Sbjct: 296 DWGPEASKSVPNLYKNSIRVLCNANTSEGFNPSKDVSFPEINLQTGHLTGFLGGPSPSHR 355
Query: 248 TVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGS 307
+L F+AG + IR IL + WEN + + + + + Y K+KFC+CP G
Sbjct: 356 PILAFFAGGLHGPIRPILIQQWENQDQDIQVHQYLPKGVS---YIDMMRKSKFCLCPSGY 412
Query: 308 QVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAE 367
+V S RI ++I+ GCVPV++S++Y PF+D+++W+ F+V ++ D+ LK IL IS +
Sbjct: 413 EVASPRIVEAIYTGCVPVLISDHYVPPFSDVINWKSFSVEVSVDDIPNLKTILTGISTRQ 472
Query: 368 FVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELW 402
++ ++ +VKV++HF+ NSPP R+D +HM+++ +W
Sbjct: 473 YLRMYRRVVKVRRHFEVNSPPKRYDVYHMILHSVW 507
>gi|115434052|ref|NP_001041784.1| Os01g0107700 [Oryza sativa Japonica Group]
gi|52076197|dbj|BAD44851.1| pectin-glucuronyltransferase -like [Oryza sativa Japonica Group]
gi|52076236|dbj|BAD44890.1| pectin-glucuronyltransferase -like [Oryza sativa Japonica Group]
gi|113531315|dbj|BAF03698.1| Os01g0107700 [Oryza sativa Japonica Group]
gi|215697737|dbj|BAG91731.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222617591|gb|EEE53723.1| hypothetical protein OsJ_00064 [Oryza sativa Japonica Group]
Length = 550
Score = 255 bits (651), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 132/354 (37%), Positives = 215/354 (60%), Gaps = 18/354 (5%)
Query: 66 DLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRE 125
DL + +F + SVF +Y M K+++Y DG F+ +L G YASEG+F + +
Sbjct: 194 DLHAPLFRNVSVFRRSYELMERLLKVFVYHDGAKPIFHSP--ELKGIYASEGWFMKLMEG 251
Query: 126 SR-FLTHDPDRAHLFFIPISCHKMR-----GKGTSYENMTVIVKDYVDSLINKYPYWNRT 179
++ F+ DP+RAHLF++P S ++ + E +++ VK Y+D + K+PYWNRT
Sbjct: 252 NQHFVVRDPNRAHLFYLPYSSRQLEHNLYVPGSNTIEPLSIFVKKYIDFISTKFPYWNRT 311
Query: 180 LGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSY-DVAFIPHKDVALPQVLQPFALP 238
GADHFFV CHD G T+ L KN I+ +C+ + FI +DV+LP+ P
Sbjct: 312 KGADHFFVACHDWGPYTTKLHDELRKNTIKALCNADLSEGVFIHGRDVSLPETF--LRSP 369
Query: 239 R------GGRDVENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVY 291
R GG+ R++L F+AG + ++R +L + W D ++ I + +R + Y
Sbjct: 370 RRPLRGIGGKPAAERSILAFFAGQMHGRVRPVLLQYWGGKDADMRIYDRLPHRITRRMNY 429
Query: 292 QRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNER 351
+ +K+CICP G +VNS RI ++I+Y CVPVI+++ + LPF+D L+W F+V++ E+
Sbjct: 430 IQHMKSSKYCICPMGYEVNSPRIVEAIYYECVPVIIADNFVLPFDDALNWSAFSVVIPEK 489
Query: 352 DVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
DV +LKQIL I D +++++ +N+ +VQKHF W+ P+++D FHM+++ +W R
Sbjct: 490 DVPKLKQILLAIPDDQYMAMQSNVQRVQKHFIWHPNPIKYDIFHMILHSIWYSR 543
>gi|359481952|ref|XP_002284018.2| PREDICTED: probable glycosyltransferase At5g03795 [Vitis vinifera]
Length = 546
Score = 254 bits (650), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 131/344 (38%), Positives = 203/344 (59%), Gaps = 24/344 (6%)
Query: 76 SVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR-ESRFLTHDPD 134
+ FH +YLEM K+Y+Y +G+P F+ P K Y+ EG F + +S F T DP+
Sbjct: 201 NAFHRSYLEMEKQLKVYVYDEGEPPLFHNGPCKSI--YSMEGNFIHKMEMDSHFRTKDPE 258
Query: 135 RAHLFFIPISCHKM-------RGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFV 187
+AHLFF+P S + TVI DYV+ + KYPYWNR+LGADHF +
Sbjct: 259 KAHLFFLPFSVAMLVRFVYVRDSHDLGPIKQTVI--DYVNVVSTKYPYWNRSLGADHFML 316
Query: 188 TCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV------LQPFALPRGG 241
CHD G + +P+L KN+IRV+C+ + F P KDV+ P++ F GG
Sbjct: 317 ACHDWGPETSFSIPYLHKNSIRVLCNANTSEGFNPSKDVSFPEINLLTGSTDSFI---GG 373
Query: 242 RDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFC 301
+RT+L F+AG + IR IL WEN E + + + + Y K+K+C
Sbjct: 374 PSPSHRTLLAFFAGGLHGPIRPILLEHWENKDEDVKVHKYLPKGVS---YYEMMRKSKYC 430
Query: 302 ICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILK 361
+CP G +V S R+ ++++ GCVPV++S++Y PF+D+L+W+ F+V + R++ LK+IL
Sbjct: 431 LCPSGYEVASPRVVEALYTGCVPVLISDHYVPPFSDVLNWKSFSVEVPVREIPNLKRILM 490
Query: 362 NISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+IS +++ + ++ ++HF+ NSPP R+D FHM+++ LWLRR
Sbjct: 491 DISPRQYIRMQRRGIQARRHFEVNSPPKRYDVFHMILHSLWLRR 534
>gi|359484716|ref|XP_002263848.2| PREDICTED: probable glycosyltransferase At5g20260-like [Vitis
vinifera]
gi|296084516|emb|CBI25537.3| unnamed protein product [Vitis vinifera]
Length = 477
Score = 254 bits (650), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 132/351 (37%), Positives = 202/351 (57%), Gaps = 21/351 (5%)
Query: 71 VFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQN----IRES 126
++ +P FH +++EMV FK++ Y +G F++ P LT YA EG F + +S
Sbjct: 125 IYRNPYAFHQSHIEMVKRFKVWTYREGAQPIFHEGP--LTNIYAIEGQFIDEMDFIVGKS 182
Query: 127 RFLTHDPDRAHLFFIPISCHKM---------RGKGTSYENMTVIVKDYVDSLINKYPYWN 177
F+ PD AH FF+P+S K+ + S + + IV DYV + +KYPYWN
Sbjct: 183 PFIAKHPDEAHAFFLPLSVVKVVQFLYLPITSPEDYSRKRLQRIVTDYVKVVADKYPYWN 242
Query: 178 RTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQP--- 234
R+ GADHF V+CHD + P L KN IRV+C+ + F P +DV+LP+V P
Sbjct: 243 RSGGADHFMVSCHDWAPSVSYANPELFKNFIRVLCNANSSEGFRPGRDVSLPEVNLPAGE 302
Query: 235 FALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRR 294
P G+ NR VL F+AG + IR IL W++ + + R+++ Y +
Sbjct: 303 LGPPHLGQPSNNRPVLAFFAGRAHGNIRKILFEHWKDQDNEVLVHERLHKGQN---YAKL 359
Query: 295 FYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVY 354
++KFC+CP G +V S R+ ++IH GCVPVI+SN Y LPFND+LDW +F++ + +
Sbjct: 360 MGQSKFCLCPSGYEVASPRVVEAIHAGCVPVIISNNYSLPFNDVLDWSQFSIQIPVAKIP 419
Query: 355 QLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
++K IL IS +++ + +++V++HF N P FD HM+++ LWLRR
Sbjct: 420 EIKTILLGISKNKYLKMQERVLRVRRHFVLNRPARPFDIIHMILHSLWLRR 470
>gi|242053777|ref|XP_002456034.1| hypothetical protein SORBIDRAFT_03g029210 [Sorghum bicolor]
gi|241928009|gb|EES01154.1| hypothetical protein SORBIDRAFT_03g029210 [Sorghum bicolor]
Length = 517
Score = 254 bits (649), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 134/358 (37%), Positives = 216/358 (60%), Gaps = 19/358 (5%)
Query: 63 DYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQN 122
D DL + +F + S F +Y M K+YIY DG F+ P L+G YASEG+F +
Sbjct: 156 DDPDLYAPLFKNVSQFKRSYELMERILKVYIYQDGRRPIFHTPP--LSGIYASEGWFMKL 213
Query: 123 IRESR-FLTHDPDRAHLFFIPISCHKMR-----GKGTSYENMTVIVKDYVDSLINKYPYW 176
++ESR + D +AHLF++P S ++R + + ++++V L NKYP+W
Sbjct: 214 LKESRRHVVADAGKAHLFYLPYSSQQLRLTLYQADSHNLRPLAAYLRNFVRGLANKYPFW 273
Query: 177 NRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVC-SPSYDVAFIPHKDVALPQVLQPF 235
NRT GADHF V CHD G T L KNAI+ +C + S + F P KDV+LP+
Sbjct: 274 NRTRGADHFLVACHDWGPYTTAAHRDLRKNAIKALCNADSSEGIFTPGKDVSLPETT--I 331
Query: 236 ALPR------GGRDVENRTVLGFWAGHRNSKIRVILARVWEN--DTELDISNNRINRAIG 287
PR GG V R++L F+AG+ + ++R +L R W + D E+ + + NR
Sbjct: 332 RNPRRPLRYVGGLPVSRRSILAFFAGNVHGRVRPVLLRHWGDGQDDEMRVYSLLPNRVSR 391
Query: 288 PLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVI 347
+ Y + ++FC+CP G +VNS RI ++ +Y CVPVI+++ + LP +++LDW F+V+
Sbjct: 392 RMNYIQHMKNSRFCLCPMGYEVNSPRIVEAFYYECVPVIIADNFVLPLSEVLDWSAFSVV 451
Query: 348 LNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+ E+D+ LK+IL+ IS +V++H+ + ++Q+HF W++ P+++D FHM+++ +WL R
Sbjct: 452 VAEKDIPDLKKILQGISPRRYVAMHSCVKRLQRHFLWHARPIKYDLFHMILHSIWLSR 509
>gi|356565368|ref|XP_003550913.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 534
Score = 254 bits (649), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 129/355 (36%), Positives = 211/355 (59%), Gaps = 19/355 (5%)
Query: 66 DLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRE 125
D+ + +F + SVF +Y M K+YIY DG F++ P L G YASEG+F + + E
Sbjct: 175 DIYAPIFRNISVFKRSYELMEMILKVYIYRDGSRPIFHKPP--LKGIYASEGWFMKLMEE 232
Query: 126 SR-FLTHDPDRAHLFFIPISCHKMR-----GKGTSYENMTVIVKDYVDSLINKYPYWNRT 179
++ F+T DP++AHLF++P S +M + +++ ++DYV+ + KYP+WNRT
Sbjct: 233 NKQFVTKDPEKAHLFYLPYSARQMGLTLYVPGSHDLKPLSIFLRDYVNKIAAKYPFWNRT 292
Query: 180 LGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSY-DVAFIPHKDVALPQVLQPFALP 238
G+DHF V CHD G G L +N I+ +C+ + F+ +DV+LP+ P
Sbjct: 293 QGSDHFLVACHDWGPYTVTGHEELKRNTIKALCNADLSEGVFVAGRDVSLPETT--IRAP 350
Query: 239 R------GGRDVENRTVLGFWAGHRNSKIRVILARVWEN--DTELDISNNRINRAIGPLV 290
R GG V R +L F+AG + ++R L W D ++ I R +
Sbjct: 351 RRPLRYLGGNRVSLRPILAFFAGSMHGRVRPTLLTYWGGGKDEDMKIYKRLPLRVSQRMT 410
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + +K+C+CP G +VNS RI ++I+Y CVPVI+++ + LPF+++LDW F+V++ E
Sbjct: 411 YIQHMKSSKYCVCPMGFEVNSPRIVEAIYYECVPVIIADNFVLPFSEVLDWSAFSVVVAE 470
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+D+ +LK+IL +I +++++ NN+ VQKHF WN P+R+D FHM+++ +W +
Sbjct: 471 KDIPRLKEILLSIPLRKYLTMQNNVKMVQKHFLWNPRPIRYDLFHMILHSIWFNK 525
>gi|297810461|ref|XP_002873114.1| hypothetical protein ARALYDRAFT_487158 [Arabidopsis lyrata subsp.
lyrata]
gi|297318951|gb|EFH49373.1| hypothetical protein ARALYDRAFT_487158 [Arabidopsis lyrata subsp.
lyrata]
Length = 519
Score = 254 bits (649), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 130/338 (38%), Positives = 206/338 (60%), Gaps = 14/338 (4%)
Query: 77 VFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR-ESRFLTHDPDR 135
VFH +YLEM FKIY+Y +G+P F+ P K Y+ EG F + ++RF T++PD+
Sbjct: 179 VFHRSYLEMEKQFKIYVYKEGEPPLFHDGPCK--SIYSMEGSFIYEMETDTRFRTNNPDK 236
Query: 136 AHLFFIPISCHKM-----RGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCH 190
AH F++P S KM + + V+DY++ + +KYPYWNR++GADHF ++CH
Sbjct: 237 AHAFYLPFSVVKMVRYVYERNSRDFSPIRNTVRDYINLVGDKYPYWNRSIGADHFILSCH 296
Query: 191 DVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV-LQPFALPR--GGRDVENR 247
D G A+ P L N+IR +C+ + F P KDV++P++ L+ +L GG +R
Sbjct: 297 DWGPEASFSHPHLGHNSIRALCNANTSEKFKPRKDVSIPEINLRTGSLTGLVGGPSPSSR 356
Query: 248 TVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGS 307
+L F+AG + +R +L WEN + + R Y +KFCICP G
Sbjct: 357 PILAFFAGGVHGPVRPVLLEHWENKDNDIRVHKYLPRGTS---YSDMMRNSKFCICPSGY 413
Query: 308 QVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAE 367
+V S RI ++++ GCVPV++++ Y PF+D+L+WR F+VI++ D+ LK IL IS +
Sbjct: 414 EVASPRIVEALYSGCVPVLINSGYVPPFSDVLNWRSFSVIVSVEDIPNLKTILTAISPRQ 473
Query: 368 FVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
++ ++ ++KV++HF+ NSP RFD FHM+++ +W+RR
Sbjct: 474 YLRMYRRVLKVRRHFEVNSPAKRFDVFHMILHSIWVRR 511
>gi|449451243|ref|XP_004143371.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
sativus]
Length = 516
Score = 254 bits (649), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 128/335 (38%), Positives = 207/335 (61%), Gaps = 14/335 (4%)
Query: 77 VFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR-ESRFLTHDPDR 135
FH +YLEM KI++Y +G+P F+ P K Y++EG F I +S+F T DP++
Sbjct: 178 AFHRSYLEMEKEMKIFVYEEGEPPLFHNGPCK--SIYSTEGNFIHAIEMDSQFRTKDPNK 235
Query: 136 AHLFFIPISCHKMR-----GKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCH 190
AH+FF+P+S + + + V DY++ + KYP+WNR+LGADHF ++CH
Sbjct: 236 AHVFFLPLSVAMLVRFVYVHDSHDFTPIRHTVVDYINVIGTKYPFWNRSLGADHFMLSCH 295
Query: 191 DVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV-LQPFALPR--GGRDVENR 247
D G A++ VP L KN+IRV+C+ + F P KDV+ P++ LQ L GG +R
Sbjct: 296 DWGPEASKSVPNLYKNSIRVLCNANTSEGFNPSKDVSFPEINLQTGHLTGFLGGPSPSHR 355
Query: 248 TVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGS 307
++ F+AG + IR IL + WEN + + + + + Y K+KFC+CP G
Sbjct: 356 PIMAFFAGGLHGPIRPILIQRWENQDQDIQVHQYLPKGVS---YIDMMRKSKFCLCPSGY 412
Query: 308 QVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAE 367
+V S RI ++I+ GCVPV++S++Y PF+D+++W+ F+V ++ D+ LK IL IS +
Sbjct: 413 EVASPRIVEAIYTGCVPVLISDHYVPPFSDVINWKSFSVEVSVDDIPNLKTILTGISTRQ 472
Query: 368 FVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELW 402
++ ++ +VKV++HF+ NSPP R+D +HM+++ +W
Sbjct: 473 YLRMYRRVVKVRRHFEVNSPPKRYDVYHMILHSVW 507
>gi|224071129|ref|XP_002303362.1| predicted protein [Populus trichocarpa]
gi|222840794|gb|EEE78341.1| predicted protein [Populus trichocarpa]
Length = 386
Score = 254 bits (648), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 133/352 (37%), Positives = 206/352 (58%), Gaps = 15/352 (4%)
Query: 67 LGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRES 126
L + +F + SVF +Y M T K+YIYPDGD F+Q L G YASEG+F + + S
Sbjct: 28 LSAHIFRNISVFKRSYELMETILKVYIYPDGDKPIFHQP--HLYGIYASEGWFMKFMEAS 85
Query: 127 R-FLTHDPDRAHLFFIPISCHKMR-----GKGTSYENMTVIVKDYVDSLINKYPYWNRTL 180
R F++ DP++AHLF++P S ++ + +++ ++DY + + KYPYWNRT
Sbjct: 86 REFVSRDPEKAHLFYLPYSARQLEVAVYVPNSHNLRPLSIFMRDYANMIAAKYPYWNRTH 145
Query: 181 GADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVA-FIPHKDVALPQVL-----QP 234
G DHF V CHD G A L KN ++ +C+ F +DV+LP+ +P
Sbjct: 146 GRDHFLVACHDWGPYALTMHEELTKNTMKALCNADVSEGIFTAGQDVSLPETTIRSPKRP 205
Query: 235 FALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVYQR 293
GG V R +L F+AG+ + ++R L + W N D ++ I + Y +
Sbjct: 206 LRNVGGGIRVSQRPILAFFAGNLHGRVRPTLLKYWHNKDDDMKIYGPLPIGISRKMTYVQ 265
Query: 294 RFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDV 353
+K+CICP G +VNS RI ++I+Y CVPVI+++ + LPFN++LDW F+V++ E+D+
Sbjct: 266 HMKSSKYCICPMGYEVNSPRIVEAIYYECVPVIIADNFVLPFNEVLDWSAFSVVVAEKDI 325
Query: 354 YQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+LK+IL I ++++ NL VQKHF WN P+R+D FHM+++ +W R
Sbjct: 326 PKLKEILLAIPLRRYLTMLANLKTVQKHFLWNPRPLRYDLFHMILHSIWFSR 377
>gi|356546040|ref|XP_003541440.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 491
Score = 254 bits (648), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 137/365 (37%), Positives = 211/365 (57%), Gaps = 20/365 (5%)
Query: 50 NFNSTLSFAINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKL 109
N N TL DY G ++ + FH +YLEM FK+++Y +G+ F+ P K
Sbjct: 129 NRNRTLD-----SDYVPTGP-MYWNAKAFHRSYLEMEKQFKVFVYEEGETPVFHNGPCKS 182
Query: 110 TGKYASEGYFFQNIR-ESRFLTHDPDRAHLFFIPISCHKM-----RGKGTSYENMTVIVK 163
Y+ EG F I F T DP +AH+FF+P S M + + V
Sbjct: 183 I--YSMEGNFIHAIEMNDHFRTKDPKKAHVFFLPFSVVMMVRFVYERDSRDFGPIKKTVI 240
Query: 164 DYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPH 223
DYV+ + +YPYWNR+LGADHF + CHD G A+ +P+L KN+IRV+C+ + F P
Sbjct: 241 DYVNLIATRYPYWNRSLGADHFMLACHDWGPEASFSLPYLHKNSIRVLCNANTSEGFKPA 300
Query: 224 KDVALPQV-LQPFALPR--GGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNN 280
KDV+ P++ LQ ++ GG R++L F+AG + IR IL WEN E +
Sbjct: 301 KDVSFPEINLQTGSINGFVGGPSASKRSILAFFAGGVHGPIRPILLEHWENKDEDIQVHK 360
Query: 281 RINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILD 340
+ + + Y K+KFC+CP G +V S R+ ++I+ GCVPV++S +Y PF+D+L+
Sbjct: 361 YLPKGVS---YYGMLRKSKFCLCPSGYEVASPRVVEAIYTGCVPVLISEHYVPPFSDVLN 417
Query: 341 WRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYE 400
W+ F+V L+ +D+ LK IL +IS + + + + ++++HF+ +SPP RFD FHM+++
Sbjct: 418 WKSFSVELSVKDIPILKDILMSISPRQHIRMQRRVGQIRRHFEVHSPPKRFDVFHMILHS 477
Query: 401 LWLRR 405
+WLRR
Sbjct: 478 VWLRR 482
>gi|30683790|ref|NP_567512.2| Exostosin family protein [Arabidopsis thaliana]
gi|19347795|gb|AAL86348.1| unknown protein [Arabidopsis thaliana]
gi|26983908|gb|AAN86206.1| unknown protein [Arabidopsis thaliana]
gi|332658395|gb|AEE83795.1| Exostosin family protein [Arabidopsis thaliana]
Length = 542
Score = 254 bits (648), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 127/351 (36%), Positives = 213/351 (60%), Gaps = 15/351 (4%)
Query: 65 SDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR 124
+DL + +F + SVF +Y M K+YIYPDGD F++ L G YASEG+F + +
Sbjct: 176 TDLFAPLFRNLSVFKRSYELMELILKVYIYPDGDKPIFHEP--HLNGIYASEGWFMKLME 233
Query: 125 ESR-FLTHDPDRAHLFFIPISCHKMRGK-----GTSYENMTVIVKDYVDSLINKYPYWNR 178
++ F+T +P+RAHLF++P S +++ + + +++ ++DYV+ L KYP+WNR
Sbjct: 234 SNKQFVTKNPERAHLFYMPYSVKQLQKSIFVPGSHNIKPLSIFLRDYVNMLSIKYPFWNR 293
Query: 179 TLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSY-DVAFIPHKDVALPQVL----- 232
T G+DHF V CHD G P L +NAI+ +C+ D F+P KDV+LP+
Sbjct: 294 THGSDHFLVACHDWGPYTVNEHPELKRNAIKALCNADLSDGIFVPGKDVSLPETSIRNAG 353
Query: 233 QPFALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVY 291
+P G V R +L F+AG+ + ++R L + W N D ++ I + + Y
Sbjct: 354 RPLRNIGNGNRVSQRPILAFFAGNLHGRVRPKLLKHWRNKDEDMKIYGPLPHNVARKMTY 413
Query: 292 QRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNER 351
+ +K+C+CP G +VNS RI ++I+Y CVPV++++ + LPF+D+LDW F+V++ E+
Sbjct: 414 VQHMKSSKYCLCPMGYEVNSPRIVEAIYYECVPVVIADNFMLPFSDVLDWSAFSVVVPEK 473
Query: 352 DVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELW 402
++ +LK+IL I ++ + +N+ VQ+HF W+ P ++D FHM+++ +W
Sbjct: 474 EIPRLKEILLEIPMRRYLKMQSNVKMVQRHFLWSPKPRKYDVFHMILHSIW 524
>gi|326519558|dbj|BAK00152.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 564
Score = 254 bits (648), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 135/352 (38%), Positives = 217/352 (61%), Gaps = 15/352 (4%)
Query: 66 DLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR- 124
DL + ++ + SVF +Y M K++IY DG F+ +L G YASEG+F + I
Sbjct: 207 DLYAPLYRNMSVFKRSYELMERLLKVFIYHDGAKPIFHSP--ELKGIYASEGWFMKLIEA 264
Query: 125 ESRFLTHDPDRAHLFFIPISCHKMR-----GKGTSYENMTVIVKDYVDSLINKYPYWNRT 179
+ F+ DP+RAHLF++P S ++ + + ++V VK+Y+D + K+PYWNRT
Sbjct: 265 DQNFVVRDPNRAHLFYLPYSSRQLEHNLYVPGSNTLDPLSVFVKNYIDMISAKFPYWNRT 324
Query: 180 LGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSY-DVAFIPHKDVALPQV-LQPFAL 237
GADHFFV CHD G T+ L KN I+ +C+ + FI +DV+LP+ ++
Sbjct: 325 KGADHFFVACHDWGPYTTKLHDELRKNTIKALCNADVSEGVFIRGRDVSLPETYIKSARR 384
Query: 238 PR---GGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGP-LVYQR 293
P GG+ R++L F+AG + ++R +L DT++ I + RI R I + Y +
Sbjct: 385 PVRDIGGKPAAERSILAFFAGQMHGRVRPVLKYWGGKDTDMRIYS-RIPRQITRRMNYAK 443
Query: 294 RFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDV 353
+K+CICP G +VNS RI ++I+Y CVPVI+++ + LPF+D LDW F+V++ E+DV
Sbjct: 444 HMKSSKYCICPMGYEVNSPRIVEAIYYECVPVIIADNFVLPFDDALDWSAFSVVVAEKDV 503
Query: 354 YQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+LK IL I ++ ++++ +N+ KVQ+HF W++ PV++D FHM+++ +W R
Sbjct: 504 PKLKAILLAIPESRYITMRSNVKKVQRHFLWHAKPVKYDIFHMILHSVWFSR 555
>gi|218187361|gb|EEC69788.1| hypothetical protein OsI_00072 [Oryza sativa Indica Group]
Length = 536
Score = 253 bits (647), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 131/354 (37%), Positives = 214/354 (60%), Gaps = 18/354 (5%)
Query: 66 DLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRE 125
DL + +F + SVF +Y M K+++Y DG F+ +L G YASEG+F + +
Sbjct: 180 DLHAPLFRNVSVFRRSYELMERLLKVFVYHDGAKPIFHSP--ELKGIYASEGWFMKLMEG 237
Query: 126 SR-FLTHDPDRAHLFFIPISCHKMR-----GKGTSYENMTVIVKDYVDSLINKYPYWNRT 179
++ F+ DP+RAHLF++P S ++ + E +++ VK Y+D + K+PYWNRT
Sbjct: 238 NQHFVVRDPNRAHLFYLPYSSRQLEHNLYVPGSNTIEPLSIFVKKYIDFISTKFPYWNRT 297
Query: 180 LGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSY-DVAFIPHKDVALPQVLQPFALP 238
GADHFFV CHD G T+ L KN I+ +C+ + FI +DV+LP+ P
Sbjct: 298 KGADHFFVACHDWGPYTTKLHDELRKNTIKALCNADLSEGVFIHGRDVSLPETF--LRSP 355
Query: 239 R------GGRDVENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVY 291
R GG+ R++L F+AG + ++R +L + W D ++ I + +R + Y
Sbjct: 356 RRPLRGIGGKPAAERSILAFFAGQMHGRVRPVLLQYWGGKDADMRIYDRLPHRITRRMNY 415
Query: 292 QRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNER 351
+ +K+CICP G +VNS RI ++I+Y CVPVI+++ + LPF+D +W F+V++ E+
Sbjct: 416 IQHMKSSKYCICPMGYEVNSPRIVEAIYYECVPVIIADNFVLPFDDAFNWSAFSVVILEK 475
Query: 352 DVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
DV +LKQIL I D +++++ +N+ +VQKHF W+ P+++D FHM+++ +W R
Sbjct: 476 DVPKLKQILLEIPDDQYMAMQSNVQRVQKHFIWHPNPIKYDIFHMILHSIWYSR 529
>gi|302753728|ref|XP_002960288.1| glycosylransferase-like protein [Selaginella moellendorffii]
gi|300171227|gb|EFJ37827.1| glycosylransferase-like protein [Selaginella moellendorffii]
Length = 372
Score = 253 bits (647), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 132/340 (38%), Positives = 204/340 (60%), Gaps = 21/340 (6%)
Query: 81 NYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRE-SRFLTHDPDRAHLF 139
+Y M FK+Y+Y +G+P ++ P LTG Y+SEG F + + SRF+THDP AH+F
Sbjct: 24 SYELMERLFKVYVYKEGEPRLVHKGP--LTGIYSSEGRFIHEMNQNSRFVTHDPQEAHMF 81
Query: 140 FIPISCHKM------RGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVG 193
F+P S M G T + +KDYV+ + +K+P+WN T G+DHFF +CHD G
Sbjct: 82 FLPYSVAHMVLDLYVPGSHTMLP-LATFIKDYVNLIASKHPFWNLTRGSDHFFTSCHDWG 140
Query: 194 VRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV-LQPFALPR--GGRDVENRTVL 250
P L KN+++VVC+ F+P KD +LP+ L LP GG R +L
Sbjct: 141 PATARDHPELRKNSVKVVCNSDLTEEFVPDKDASLPETYLHAVKLPTKLGGPGPSKRPIL 200
Query: 251 GFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFY-----KTKFCICPG 305
F+AG + ++R L + W++ + D+ RI + P V +R Y +KFCIC
Sbjct: 201 AFFAGQMHGRVRPALIKHWKDRGDPDM---RIYEVLPPEVARRTSYVQHMKSSKFCICAM 257
Query: 306 GSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISD 365
G +VNS RI +SI+Y CVPV++++ + LPF+D+L+W F++ ++E+DV +LK++L +S+
Sbjct: 258 GFEVNSPRIVESIYYDCVPVLIADNFVLPFSDVLNWGSFSLTVSEKDVPRLKELLLAVSE 317
Query: 366 AEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+ + + L KV+KHF W+ RFD FHM+++ +W RR
Sbjct: 318 DRYRKMQSRLKKVRKHFLWHDSAERFDMFHMILHSVWTRR 357
>gi|297804498|ref|XP_002870133.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297315969|gb|EFH46392.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 540
Score = 253 bits (647), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 127/351 (36%), Positives = 212/351 (60%), Gaps = 15/351 (4%)
Query: 65 SDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR 124
+DL + +F + SVF +Y M K+YIYPDG+ F+Q L G YASEG+F + +
Sbjct: 174 TDLFAPLFRNLSVFKRSYELMELILKVYIYPDGEKPIFHQP--HLNGIYASEGWFMKLME 231
Query: 125 E-SRFLTHDPDRAHLFFIPISCHKMRGK-----GTSYENMTVIVKDYVDSLINKYPYWNR 178
++F+T +P+RAHLF++P S +++ + + +++ ++DYV+ L KYP+WNR
Sbjct: 232 SNTQFVTKNPERAHLFYMPYSVKQLQTSIFVPGSHNIKPLSIFLRDYVNMLSTKYPFWNR 291
Query: 179 TLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSY-DVAFIPHKDVALPQVL----- 232
T G+DHF V CHD G P L +N I+ +C+ D FIP KDV+LP+
Sbjct: 292 THGSDHFLVACHDWGPYTVNEHPELRRNTIKALCNADLADGIFIPGKDVSLPETSIRNAG 351
Query: 233 QPFALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVY 291
+P G V R +L F+AG+ + ++R L + W N D ++ I + + Y
Sbjct: 352 KPLRNIGNGNRVSQRPILAFFAGNLHGRVRPKLLKHWRNKDDDMKIYGPLPHNVARKMTY 411
Query: 292 QRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNER 351
+ +K+C+CP G +VNS RI ++I+Y CVPV++++ + LPF+D+LDW F+V++ E+
Sbjct: 412 VQHMKSSKYCLCPMGYEVNSPRIVEAIYYECVPVVIADNFMLPFSDVLDWSAFSVVVPEK 471
Query: 352 DVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELW 402
++ +LK+IL I ++ + +N+ VQ+HF W+ P ++D FHM+++ +W
Sbjct: 472 EIPRLKEILLEIPMRRYLKMQSNVKMVQRHFLWSPKPRKYDVFHMILHSIW 522
>gi|302768010|ref|XP_002967425.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300165416|gb|EFJ32024.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 372
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 131/339 (38%), Positives = 203/339 (59%), Gaps = 19/339 (5%)
Query: 81 NYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRE-SRFLTHDPDRAHLF 139
+Y M FK+Y+Y +G+P ++ P LTG Y+SEG F + + SRF+THDP AH+F
Sbjct: 24 SYELMERLFKVYVYKEGEPRLVHKGP--LTGIYSSEGRFIHEMNQNSRFVTHDPQEAHMF 81
Query: 140 FIPISCHKMR-----GKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV 194
F+P S M S + +KDYV+ + +K+P+WN T G+DHFF +CHD G
Sbjct: 82 FLPYSVAHMVLDLYVPGSHSMLPLATFIKDYVNLIASKHPFWNLTRGSDHFFASCHDWGP 141
Query: 195 RATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV-LQPFALPR--GGRDVENRTVLG 251
P L KN+++VVC+ F+P KD +LP+ L LP GG R +L
Sbjct: 142 ATARDHPELRKNSVKVVCNSDLTEEFVPDKDASLPETYLHAVKLPTKLGGPGPSKRPILA 201
Query: 252 FWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFY-----KTKFCICPGG 306
F+AG + ++R L + W++ + D+ RI + P V +R Y +KFCIC G
Sbjct: 202 FFAGQMHGRVRPALIKHWKDRGDPDM---RIYEVLPPDVARRTSYVQHMKSSKFCICAMG 258
Query: 307 SQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDA 366
+VNS RI +SI+Y CVPV++++ + LPF+D+L+W F++ ++E+DV +LK++L +S+
Sbjct: 259 FEVNSPRIVESIYYDCVPVLIADNFVLPFSDVLNWGSFSLTVSEKDVPRLKELLLAVSED 318
Query: 367 EFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+ + + L KV+KHF W+ RFD FHM+++ +W RR
Sbjct: 319 RYRKMQSRLKKVRKHFLWHDSAERFDMFHMILHSVWTRR 357
>gi|218188734|gb|EEC71161.1| hypothetical protein OsI_03019 [Oryza sativa Indica Group]
Length = 499
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 135/364 (37%), Positives = 214/364 (58%), Gaps = 31/364 (8%)
Query: 63 DYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQN 122
D DL + +F + S F +Y M K+YIY DG F+ P L+G YASEG+F +
Sbjct: 138 DDPDLYAPLFKNVSQFKRSYELMERILKVYIYQDGRRPIFHTPP--LSGIYASEGWFMKL 195
Query: 123 IRESR-FLTHDPDRAHLFFIPISCHKMR-----GKGTSYENMTVIVKDYVDSLINKYPYW 176
++ESR F DP +AHLF++P S ++R + + ++D+V L KYP+W
Sbjct: 196 LKESRRFAVTDPAKAHLFYLPYSSQQLRISLYVPDSHNLRPLAAYLRDFVKGLAAKYPFW 255
Query: 177 NRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVC-SPSYDVAFIPHKDVALPQVLQPF 235
NRT GADHF V CHD G T L +N ++ +C + S + F P +DV+LP+
Sbjct: 256 NRTRGADHFLVACHDWGSYTTTAHGDLRRNTVKALCNADSSEGIFTPGRDVSLPETT--I 313
Query: 236 ALPR------GGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPL 289
PR GG V R +L F+AG+ + ++R +L + W + + D+ R GPL
Sbjct: 314 RTPRRPLRYVGGLPVSRRGILAFFAGNVHGRVRPVLLKHWGDGRDDDM------RVYGPL 367
Query: 290 --------VYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDW 341
Y + ++FC+CP G +VNS RI ++++Y CVPVI+++ + LP +D+LDW
Sbjct: 368 PARVSRRMSYIQHMKNSRFCLCPMGYEVNSPRIVEALYYECVPVIIADNFVLPLSDVLDW 427
Query: 342 RKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYEL 401
FAV++ E+DV LK+IL+ I+ ++V++H + ++Q+HF W++ P+R+D FHM+++ +
Sbjct: 428 SAFAVVVAEKDVPDLKKILQGITLRKYVAMHGCVKRLQRHFLWHARPLRYDLFHMILHSI 487
Query: 402 WLRR 405
WL R
Sbjct: 488 WLSR 491
>gi|357476811|ref|XP_003608691.1| hypothetical protein MTR_4g100730 [Medicago truncatula]
gi|355509746|gb|AES90888.1| hypothetical protein MTR_4g100730 [Medicago truncatula]
Length = 535
Score = 252 bits (643), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 130/355 (36%), Positives = 209/355 (58%), Gaps = 19/355 (5%)
Query: 66 DLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRE 125
DL + +F + SVF +Y M T K+YIY DG F+ L G YASEG+F + ++E
Sbjct: 181 DLYAPLFRNVSVFKRSYELMETVLKVYIYRDGSRPIFHNP--SLKGIYASEGWFMKLMQE 238
Query: 126 SR-FLTHDPDRAHLFFIPISCHKMR-----GKGTSYENMTVIVKDYVDSLINKYPYWNRT 179
++ F+T DP+RAHLF++P S +M + +++ ++DYV+ + KYP+WNRT
Sbjct: 239 NKQFVTKDPERAHLFYLPYSARQMEVTLYVPGSHDLKPLSIFLRDYVNKIAAKYPFWNRT 298
Query: 180 LGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSY-DVAFIPHKDVALPQVLQPFALP 238
G+DHF V CHD G L +N ++ +C+ + FI +DV+LP+ P
Sbjct: 299 HGSDHFLVACHDWGPYTVTEHEELARNTLKALCNADLSERIFIEGRDVSLPETT--IRAP 356
Query: 239 R------GGRDVENRTVLGFWAGHRNSKIRVILARVW--ENDTELDISNNRINRAIGPLV 290
R GG R +L F+AG + ++R L + W E ++ I R +
Sbjct: 357 RRPLRYLGGNRASLRPILAFFAGSMHGRVRPTLLKYWGGEKYEDMKIYKRLPLRVSKKMT 416
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + +K+C+CP G +VNS RI ++I+Y CVPVI+++ + LP +++LDW F+V++ E
Sbjct: 417 YIQHMKSSKYCLCPMGFEVNSPRIVEAIYYECVPVIIADNFVLPLSEVLDWSAFSVVVAE 476
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+D+ +LK IL +I ++V++ NN+ VQKHF WN P+R+D FHM+++ +WL +
Sbjct: 477 KDIPRLKDILLSIPMRKYVAMQNNVKMVQKHFLWNPKPIRYDLFHMILHSIWLNK 531
>gi|255579651|ref|XP_002530666.1| catalytic, putative [Ricinus communis]
gi|223529799|gb|EEF31735.1| catalytic, putative [Ricinus communis]
Length = 528
Score = 251 bits (641), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 130/352 (36%), Positives = 209/352 (59%), Gaps = 15/352 (4%)
Query: 67 LGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRES 126
L + +F + SVF +Y M K+YIYPDG F+ L G YASEG+F + + E+
Sbjct: 170 LYAPLFLNVSVFKRSYELMELILKVYIYPDGKRPIFHVP--HLNGIYASEGWFMKFMEEN 227
Query: 127 R-FLTHDPDRAHLFFIPISCHKMR-----GKGTSYENMTVIVKDYVDSLINKYPYWNRTL 180
R F+T DP++AHLF++P S +++ + +++ ++DY + + KYP+WNRT
Sbjct: 228 RQFVTRDPEKAHLFYLPYSARQLQMALYVPNSHNLRPLSIFMRDYANMIATKYPFWNRTH 287
Query: 181 GADHFFVTCHDVGVRATEGVPFLIKNAIRVVC-SPSYDVAFIPHKDVALPQVL-----QP 234
G DHF V CHD G L KN I+ +C + + + F P KDV+LP+ +P
Sbjct: 288 GRDHFLVACHDWGPYTLTMHEELTKNTIKALCNADASEGIFDPTKDVSLPETTIRIPRRP 347
Query: 235 FALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVYQR 293
GG V R +L F+AG+ + ++R L + W+N D +L I R + Y +
Sbjct: 348 LKNVGGGIRVSQRPILAFFAGNMHGRVRPTLLQYWQNKDEDLKIYGPLPARISRKMNYVQ 407
Query: 294 RFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDV 353
+++CICP G +VNS RI ++I+Y CVPVI+++ + LPF+D+LDW F++++ E+D+
Sbjct: 408 HMKSSRYCICPMGHEVNSPRIVEAIYYECVPVIIADNFVLPFSDVLDWSAFSIVVAEKDI 467
Query: 354 YQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+LK+IL I ++++ NL +Q+HF WN P+R+D FHM+++ +W R
Sbjct: 468 PKLKEILLAIPLRRYLTMLTNLKMLQRHFLWNPRPLRYDLFHMILHSIWFSR 519
>gi|356533401|ref|XP_003535253.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 523
Score = 250 bits (639), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 126/352 (35%), Positives = 209/352 (59%), Gaps = 15/352 (4%)
Query: 63 DYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQN 122
DY +G ++ + FH +YLEM FK+++Y +G+ F+ P + Y++EG F
Sbjct: 161 DYVPIGP-MYWNAKEFHRSYLEMEKQFKVFVYEEGELPVFHDGP--CSSIYSTEGSFIHA 217
Query: 123 IR-ESRFLTHDPDRAHLFFIPISCHKMRG-----KGTSYENMTVIVKDYVDSLINKYPYW 176
I F T DP +A++FF+P S M + + V+DYV+ + +YPYW
Sbjct: 218 IEMNEHFRTRDPKKANVFFLPFSIAWMVRYVYIRNSYDFGPIKRTVRDYVNVIATRYPYW 277
Query: 177 NRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV-LQPF 235
NR+LGADHF ++CHD G ++ +P+L KN+IRV+C+ + F P KD + P++ LQP
Sbjct: 278 NRSLGADHFMLSCHDWGPETSKSIPYLRKNSIRVLCNANTSEGFDPIKDASFPEINLQPG 337
Query: 236 ALPR--GGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQR 293
GG R++L F+AG + IR IL WEN E + + + + Y
Sbjct: 338 LKDSFVGGPPASKRSILAFFAGGNHGPIRPILLEHWENKDEDIQVHKYLPKGVS---YYG 394
Query: 294 RFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDV 353
+KFC+CP G +V S R+ ++I+ GCVPV++S +Y PF+D+L+W+ F+V ++ +++
Sbjct: 395 MLRNSKFCLCPSGYEVASPRVVEAIYTGCVPVLISEHYVPPFSDVLNWKMFSVNVSVKEI 454
Query: 354 YQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
LK IL +IS +++ + + ++++HF+ +SPP R+D FHM+++ +WLRR
Sbjct: 455 PNLKDILTSISPRQYIRMQKRVGQIRRHFEVHSPPKRYDVFHMILHSVWLRR 506
>gi|168059393|ref|XP_001781687.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666856|gb|EDQ53500.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 376
Score = 250 bits (639), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 137/360 (38%), Positives = 204/360 (56%), Gaps = 19/360 (5%)
Query: 61 KQDYSDLGSDVFHSPSVFHLNYLEMVTN-FKIYIYPDGDPNTFYQTPRKLTGKYASEGYF 119
K++Y G+ + S V L E++ FK+YIY DG + P+ G YASEG F
Sbjct: 4 KRNYGTRGNILTRSALVNLLRSYELMEKVFKVYIYKDGRKPLVHSGPQ--LGIYASEGQF 61
Query: 120 FQNIRE-SRFLTHDPDRAHLFFIPISCHKMRG-----KGTSYENMTVIVKDYVDSLINKY 173
+ + S FLT DP RAH+FF+P S ++M S + +KDYV++L +Y
Sbjct: 62 IERMEAASEFLTDDPSRAHMFFLPYSVYRMVTHLYVPNSRSMLPLATFIKDYVEALARQY 121
Query: 174 PYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQ 233
PYWNRT GADHFFV+CHD G P L NA++VVC+ F+ KD +LP+V
Sbjct: 122 PYWNRTKGADHFFVSCHDWGPATARDHPTLRSNAVKVVCNADLTEEFVVGKDASLPEVYM 181
Query: 234 -------PFALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRA 285
P L GG + R L F+AG + ++R IL W++ D +L I
Sbjct: 182 HKSKTKAPIKL--GGPGYDERPYLAFFAGQMHGRVRPILLDHWKDKDPDLMIYGVLPKPI 239
Query: 286 IGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFA 345
+ Y + +K+CIC G +VNS RI +SIHY CVPVI+++ + LPF+D+L+W F+
Sbjct: 240 AKQISYVQHMKMSKYCICAAGYEVNSPRIVESIHYDCVPVIIADNFVLPFSDVLNWDAFS 299
Query: 346 VILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
V + E D+ +LK IL +I + + S+ L K+++HF W+ P ++D FHM+++ +W+ R
Sbjct: 300 VTMPESDIPKLKAILNDIPEKTYRSMQIRLRKIRQHFVWHKKPEKYDVFHMILHSVWMSR 359
>gi|356518828|ref|XP_003528079.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 505
Score = 250 bits (639), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 133/366 (36%), Positives = 214/366 (58%), Gaps = 21/366 (5%)
Query: 50 NFNSTLSFAINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKL 109
NF TL DY GS ++ + + FH +YLEM FKI++Y +G+P F+ K
Sbjct: 143 NFTPTL----QDPDYVPQGS-IYRNANAFHRSYLEMEKVFKIFVYEEGEPPLFHNGLSK- 196
Query: 110 TGKYASEGYFFQNIRESRFL-THDPDRAHLFFIPISCHKM------RGKGTSYENMTVIV 162
YA+EG F + + R+ T+DPD A ++++P S + RG + + + ++V
Sbjct: 197 -DIYATEGRFIHEMEKGRYYRTYDPDEAFVYYLPFSVVMLVEYVYDRGSNYNLDPLGLVV 255
Query: 163 KDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIP 222
KDY+ + +K+P+WNR+LG DH ++CHD G + V L NAIRV+C+ + F P
Sbjct: 256 KDYIQIIAHKHPFWNRSLGHDHVMLSCHDWGPLVSSYVDHLYNNAIRVLCNANTSEGFKP 315
Query: 223 HKDVALPQV--LQPFALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISN 279
KDV+ P++ ++ GG RT+L F+AGH + IR +L W+N D ++ I
Sbjct: 316 AKDVSFPEIKLIKGEVKGLGGYPPSQRTILAFFAGHLHGYIRYLLLSTWKNKDQDMQIYE 375
Query: 280 NRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDIL 339
+ I Y + +KFC+CP G +V S R+ ++I CVPV++S+ Y PF+D+L
Sbjct: 376 -ELPEGIS---YYTKLRSSKFCLCPSGYEVASPRVVEAIFAECVPVLISDSYVPPFSDVL 431
Query: 340 DWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMY 399
+W F+V +N +D+ +K+IL IS+ +++ +H + +VQ+HF N PP R+D FHM ++
Sbjct: 432 NWNSFSVQVNVKDIPNIKRILMEISEKQYLRMHKRVKQVQRHFVPNEPPKRYDMFHMTVH 491
Query: 400 ELWLRR 405
+WLRR
Sbjct: 492 SIWLRR 497
>gi|255567220|ref|XP_002524591.1| catalytic, putative [Ricinus communis]
gi|223536144|gb|EEF37799.1| catalytic, putative [Ricinus communis]
Length = 507
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 133/361 (36%), Positives = 210/361 (58%), Gaps = 17/361 (4%)
Query: 55 LSFAINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYA 114
LS + DY G V+ + + FH +YLEM FKIY+Y +G P ++ P K Y+
Sbjct: 144 LSSVHDDPDYVPQGP-VYRNANAFHRSYLEMEKQFKIYVYEEGGPPMYHDGPCK--SIYS 200
Query: 115 SEGYFFQNIRESR-FLTHDPDRAHLFFIPISCHKMRG---KGTSYENMTV--IVKDYVDS 168
SEG F + + + + T DPD A ++F+P S M S+E + + DY+
Sbjct: 201 SEGRFIHELEKGKLYRTLDPDEALVYFLPFSVVMMVEYLYVPDSHETNAIGRAIVDYIHV 260
Query: 169 LINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVAL 228
+ NK+P+WNR+LGADHF ++CHD G RA+ VP L ++IRV+C+ + F P KD +
Sbjct: 261 ISNKHPFWNRSLGADHFMLSCHDWGPRASSYVPHLFNSSIRVLCNANTSEGFNPSKDASF 320
Query: 229 PQV---LQPFALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINR 284
P++ + GG R++L F+AG + IR IL W+N D ++ + + N
Sbjct: 321 PEIHLKTGEISGLLGGVSPSRRSILAFFAGRLHGHIRQILLEQWKNKDEDVQVYDQMPNG 380
Query: 285 AIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKF 344
+ Y+ ++FC+CP G +V S RI ++I+ CVPV++S+ Y PF+D+L+W+ F
Sbjct: 381 ----VSYESMLKTSRFCLCPSGYEVASPRIVEAIYTECVPVLISDNYVPPFSDVLNWKAF 436
Query: 345 AVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLR 404
+V + RD+ ++K+IL IS +++ + L +VQ+HF N PP RFD FHM ++ +WLR
Sbjct: 437 SVQIQVRDIPKIKEILMGISQRQYLRMQRRLKQVQRHFVVNGPPKRFDMFHMTIHSIWLR 496
Query: 405 R 405
R
Sbjct: 497 R 497
>gi|222618931|gb|EEE55063.1| hypothetical protein OsJ_02778 [Oryza sativa Japonica Group]
Length = 482
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 131/356 (36%), Positives = 209/356 (58%), Gaps = 32/356 (8%)
Query: 63 DYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQN 122
D DL + +F + S F +Y M K+YIY DG F+ P L+G YASEG+F +
Sbjct: 138 DDPDLYAPLFKNVSQFKRSYELMERILKVYIYQDGRRPIFHTPP--LSGIYASEGWFMKL 195
Query: 123 IRESR-FLTHDPDRAHLFFIPISCHKMR-----GKGTSYENMTVIVKDYVDSLINKYPYW 176
++ESR F DP +AHLF++P S ++R + + ++D+V L KYP+W
Sbjct: 196 LKESRRFAVTDPAKAHLFYLPYSSQQLRISLYVPDSHNLRPLAAYLRDFVKGLAAKYPFW 255
Query: 177 NRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVC-SPSYDVAFIPHKDVALPQVLQPF 235
NRT GADHF V CHD G T L +N ++ +C + S + F P +DV+LP+
Sbjct: 256 NRTRGADHFLVACHDWGSYTTTAHGDLRRNTVKALCNADSSEGIFTPGRDVSLPETT--I 313
Query: 236 ALPR------GGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPL 289
PR GG V R +L F+AG+ + ++R +L + W + + D+
Sbjct: 314 RTPRRPLRYVGGLPVSRRGILAFFAGNVHGRVRPVLLKHWGDGRDDDM------------ 361
Query: 290 VYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
R ++FC+CP G +VNS RI ++++Y CVPVI+++ + LP +D+LDW FAV++
Sbjct: 362 ---RHMKNSRFCLCPMGYEVNSPRIVEALYYECVPVIIADNFVLPLSDVLDWSAFAVVVA 418
Query: 350 ERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
E+DV LK+IL+ I+ ++V++H + ++Q+HF W++ P+R+D FHM+++ +WL R
Sbjct: 419 EKDVPDLKKILQGITLRKYVAMHGCVKRLQRHFLWHARPLRYDLFHMILHSIWLSR 474
>gi|15239118|ref|NP_196722.1| Exostosin family protein [Arabidopsis thaliana]
gi|7573418|emb|CAB87721.1| putative protein [Arabidopsis thaliana]
gi|332004317|gb|AED91700.1| Exostosin family protein [Arabidopsis thaliana]
Length = 546
Score = 249 bits (636), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 137/359 (38%), Positives = 208/359 (57%), Gaps = 18/359 (5%)
Query: 58 AINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEG 117
A+ K+D L + ++H+ S+F +Y M K+Y+Y +GD F+Q + G YASEG
Sbjct: 190 ALVKKD-DTLYAPLYHNISIFKRSYELMEQTLKVYVYSEGDRPIFHQPEAIMEGIYASEG 248
Query: 118 YFFQNIRES-RFLTHDPDRAHLFFIPISCHKMRGK-----GTSYENMTVIVKDYVDSLIN 171
+F + + S RFLT DP +AHLF+IP S ++ K S N+ + +Y+D + +
Sbjct: 249 WFMKLMESSHRFLTKDPTKAHLFYIPFSSRILQQKLYVHDSHSRNNLVKYLGNYIDLIAS 308
Query: 172 KYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV 231
YP WNRT G+DHFF CHD T G P++ N IR +C+ + F+ KDV+LP+
Sbjct: 309 NYPSWNRTCGSDHFFTACHDWAPTETRG-PYI--NCIRALCNADVGIDFVVGKDVSLPET 365
Query: 232 ----LQPFALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIG 287
LQ GG RT+L F+AG + +R IL W + E D+ +I I
Sbjct: 366 KVSSLQNPNGKIGGSRPSKRTILAFFAGSLHGYVRPILLNQWSSRPEQDM---KIFNRID 422
Query: 288 PLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVI 347
Y R +++FC+C G +VNS R+ +SI YGCVPVI+S+ + PF +IL+W FAV
Sbjct: 423 HKSYIRYMKRSRFCVCAKGYEVNSPRVVESILYGCVPVIISDNFVPPFLEILNWESFAVF 482
Query: 348 LNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW-NSPPVRFDAFHMVMYELWLRR 405
+ E+++ L++IL +I +V + ++KVQKHF W + PVR+D FHM+++ +W R
Sbjct: 483 VPEKEIPNLRKILISIPVRRYVEMQKRVLKVQKHFMWHDGEPVRYDIFHMILHSVWYNR 541
>gi|26450497|dbj|BAC42362.1| unknown protein [Arabidopsis thaliana]
Length = 546
Score = 249 bits (636), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 138/360 (38%), Positives = 209/360 (58%), Gaps = 20/360 (5%)
Query: 58 AINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEG 117
A+ K+D L + ++H+ S+F +Y M K+Y+Y +GD F+Q + G YASEG
Sbjct: 190 ALVKKD-DTLYAPLYHNISIFKRSYELMEQTLKVYVYSEGDRPIFHQPEAIMEGIYASEG 248
Query: 118 YFFQNIRES-RFLTHDPDRAHLFFIPISCHKMRGK-----GTSYENMTVIVKDYVDSLIN 171
+F + + S RFLT DP +AHLF+IP S ++ K S N+ + +Y+D + +
Sbjct: 249 WFMKLMESSHRFLTKDPTKAHLFYIPFSSRILQQKLYVHDSHSRNNLVKYLGNYIDLIAS 308
Query: 172 KYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV 231
YP WNRT G+DHFF CHD T G P++ N IR +C+ + F+ KDV+LP+
Sbjct: 309 NYPSWNRTCGSDHFFTACHDWAPTETRG-PYI--NCIRALCNADVGIDFVVGKDVSLPET 365
Query: 232 ----LQPFALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISN-NRINRAI 286
LQ GG RT+L F+AG + +R IL W + E D+ NRI+
Sbjct: 366 KVSSLQNPNGKIGGSRPSKRTILAFFAGSLHGYVRPILLNQWSSRPEQDMKIFNRIDHK- 424
Query: 287 GPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAV 346
Y R +++FC+C G +VNS R+ +SI YGCVPVI+S+ + PF +IL+W FAV
Sbjct: 425 ---SYIRYMKRSRFCVCAKGYEVNSPRVVESILYGCVPVIISDNFVPPFLEILNWESFAV 481
Query: 347 ILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW-NSPPVRFDAFHMVMYELWLRR 405
+ E+++ L++IL +I +V + ++KVQKHF W + PVR+D FHM+++ +W R
Sbjct: 482 FVPEKEIPNLRKILISIPVRRYVEMQKRVLKVQKHFMWHDGEPVRYDIFHMILHSVWYNR 541
>gi|359473867|ref|XP_002268876.2| PREDICTED: probable glycosyltransferase At5g03795-like [Vitis
vinifera]
gi|296085575|emb|CBI29307.3| unnamed protein product [Vitis vinifera]
Length = 449
Score = 248 bits (634), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 149/385 (38%), Positives = 214/385 (55%), Gaps = 37/385 (9%)
Query: 49 NNFNSTLSFAINKQDYSD-------LGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPN- 100
NN N+ S A + QD +D LGS +H+ +F ++ EM+ KI++YPD N
Sbjct: 71 NNNNAHHSSAESVQDRNDSLSSGNLLGSP-YHNWQLFASDFQEMLHKLKIFVYPDASMNQ 129
Query: 101 -------TFYQTPRKLTGK---YASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHKMRG 150
F P K Y SE F + S LT P AH FF+P S + +R
Sbjct: 130 SSSPFARVFLPNPNPFHPKLANYFSEHMFKVALLRSSLLTPHPQDAHFFFLPFSVNTLRN 189
Query: 151 KGTSYENMTV--IVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAI 208
+ ++ V Y + +Y +WN + G DHF++ CH VG A L NAI
Sbjct: 190 DPRVHSEASISDFVTQYTTRISWEYKFWNASRGTDHFYICCHSVGREAASKHHDLHNNAI 249
Query: 209 RVVCSPSY-DVAFIPHKDVALPQVLQPFALPRGGRDVEN----RTVLGFWAGH-RNSKIR 262
+V CS SY +I HKDV LPQV PR + N R L F+AG +NS IR
Sbjct: 250 QVTCSSSYFQRLYISHKDVGLPQVW-----PRPPEKLLNPPELRHKLVFFAGRVQNSHIR 304
Query: 263 VILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGC 322
L VW NDT++D+ + P Y+ K+K+C+ G +VN+AR+ D+IHYGC
Sbjct: 305 QELMAVWGNDTDIDLFSGS-----PPFPYEEGLRKSKYCLHVKGYEVNTARVCDAIHYGC 359
Query: 323 VPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHF 382
+PVI+SNYYDLPF+++LDW KF+VI++ + + LK+IL +IS +++S++ NL V++HF
Sbjct: 360 IPVIVSNYYDLPFSNVLDWSKFSVIISHKSIATLKKILLSISKQKYLSMYQNLCLVRRHF 419
Query: 383 QWNSPPVRFDAFHMVMYELWLRRHV 407
W++ P +D+FHM Y+LWLRR V
Sbjct: 420 AWHTTPRGYDSFHMTAYQLWLRRGV 444
>gi|115438799|ref|NP_001043679.1| Os01g0640600 [Oryza sativa Japonica Group]
gi|55297176|dbj|BAD68851.1| pectin-glucuronyltransferase-like [Oryza sativa Japonica Group]
gi|113533210|dbj|BAF05593.1| Os01g0640600 [Oryza sativa Japonica Group]
Length = 501
Score = 248 bits (633), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 135/366 (36%), Positives = 214/366 (58%), Gaps = 33/366 (9%)
Query: 63 DYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQN 122
D DL + +F + S F +Y M K+YIY DG F+ P L+G YASEG+F +
Sbjct: 138 DDPDLYAPLFKNVSQFKRSYELMERILKVYIYQDGRRPIFHTPP--LSGIYASEGWFMKL 195
Query: 123 IRESR-FLTHDPDRAHLFFIPISCHKMR-----GKGTSYENMTVIVKDYVDSLINKYPYW 176
++ESR F DP +AHLF++P S ++R + + ++D+V L KYP+W
Sbjct: 196 LKESRRFAVTDPAKAHLFYLPYSSQQLRISLYVPDSHNLRPLAAYLRDFVKGLAAKYPFW 255
Query: 177 NRTLGADHFFVTCHD--VGVRATEGVPFLIKNAIRVVC-SPSYDVAFIPHKDVALPQVLQ 233
NRT GADHF V CHD G T L +N ++ +C + S + F P +DV+LP+
Sbjct: 256 NRTRGADHFLVACHDWLQGSYTTTAHGDLRRNTVKALCNADSSEGIFTPGRDVSLPETT- 314
Query: 234 PFALPR------GGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIG 287
PR GG V R +L F+AG+ + ++R +L + W + + D+ R G
Sbjct: 315 -IRTPRRPLRYVGGLPVSRRGILAFFAGNVHGRVRPVLLKHWGDGRDDDM------RVYG 367
Query: 288 PL--------VYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDIL 339
PL Y + ++FC+CP G +VNS RI ++++Y CVPVI+++ + LP +D+L
Sbjct: 368 PLPARVSRRMSYIQHMKNSRFCLCPMGYEVNSPRIVEALYYECVPVIIADNFVLPLSDVL 427
Query: 340 DWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMY 399
DW FAV++ E+DV LK+IL+ I+ ++V++H + ++Q+HF W++ P+R+D FHM+++
Sbjct: 428 DWSAFAVVVAEKDVPDLKKILQGITLRKYVAMHGCVKRLQRHFLWHARPLRYDLFHMILH 487
Query: 400 ELWLRR 405
+WL R
Sbjct: 488 SIWLSR 493
>gi|326529697|dbj|BAK04795.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 348
Score = 247 bits (630), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 131/341 (38%), Positives = 207/341 (60%), Gaps = 21/341 (6%)
Query: 81 NYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESR-FLTHDPDRAHLF 139
+Y M K+YIY DG F+ P L+G YASEG+F + ++ESR F+ D +AHLF
Sbjct: 5 SYQLMERILKVYIYQDGRRPIFHTPP--LSGIYASEGWFMKLLKESRRFVVADAAKAHLF 62
Query: 140 FIPISCHKMR-----GKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV 194
++P S +R + + V ++D+V L KYP+WNRT GADHF V CHD G
Sbjct: 63 YLPYSSQHLRLSLYVPDSHNLRPLAVYLRDFVKGLAAKYPFWNRTRGADHFLVACHDWGP 122
Query: 195 RATEGVPFLIKNAIRVVC-SPSYDVAFIPHKDVALPQ--VLQPFALPR--GGRDVENRTV 249
T L KN+I+ +C + S + F P KDV+LP+ + P R GG V R +
Sbjct: 123 YTTTAHRDLSKNSIKALCNADSSEGIFTPGKDVSLPETTIRTPKRPLRYVGGLPVSRRRI 182
Query: 250 LGFWAGHRNSKIRVILARVWENDTELDIS-----NNRINRAIGPLVYQRRFYKTKFCICP 304
L F+AG+ + ++R +L + W + D+ R++R + Y + +KFC+CP
Sbjct: 183 LAFFAGNVHGRVRPVLLQHWGKGQDDDMRVYALLPGRVSRTMN---YIQHMKNSKFCLCP 239
Query: 305 GGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNIS 364
G +VNS RI ++++Y CVPVI+++ + LPF+D+LDW F+V++ E+D+ +LK+IL+ IS
Sbjct: 240 MGYEVNSPRIVEALYYECVPVIIADNFVLPFSDVLDWSAFSVVVAEKDIPELKRILQGIS 299
Query: 365 DAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+V++H+ + ++Q+HF W P+R+D FHM+++ +WL R
Sbjct: 300 LRRYVAMHDCVKRLQRHFLWYDRPLRYDLFHMILHSIWLSR 340
>gi|357119191|ref|XP_003561329.1| PREDICTED: probable glycosyltransferase At5g03795-like
[Brachypodium distachyon]
Length = 569
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 132/378 (34%), Positives = 211/378 (55%), Gaps = 16/378 (4%)
Query: 37 LSFNSFRSYPLQNNFNSTLSFAINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPD 96
L RS ++ + N + +DY +G V+ + + FH +YLEM FKIY+Y +
Sbjct: 186 LGLAKARSTIMEASHNKDNRPPLTDKDYVPVGP-VYRNANAFHRSYLEMEKLFKIYVYDE 244
Query: 97 GDPNTFYQTPRKLTGKYASEGYFFQNIR-ESRFLTHDPDRAHLFFIPISCHKMRGK---- 151
G+P ++ P Y++EG F + E+R T DP AH+FF+P S KM
Sbjct: 245 GEPPIYHDGP--CHNIYSTEGRFIHAMEMENRMRTTDPGLAHVFFLPFSIAKMEKTIYVP 302
Query: 152 -GTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRV 210
+ E + V DY+D L K+PYWNR+ GADHF ++CHD G + L N+IRV
Sbjct: 303 GSHTMEPLRRTVFDYIDVLSTKHPYWNRSQGADHFMLSCHDWGPYVSSVDGNLFSNSIRV 362
Query: 211 VCSPSYDVAFIPHKDVALPQV--LQPFALPRGGRDVENRTVLGFWAGHRNSKIRVILARV 268
+C+ + FIP KDV+LP++ L F GG R +L F+AG + +R +L +
Sbjct: 363 LCNANTSEGFIPSKDVSLPEINHLNDFKKDIGGPSASGRPILAFFAGGNHGPVRPLLLKH 422
Query: 269 WEN-DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVIL 327
W+ D ++ +S + Y ++KFC+CP G +V S R+A++I+ CVPV++
Sbjct: 423 WKGKDPDVQVSE----YLPAGVSYVETMRRSKFCLCPSGFEVASPRVAEAIYVECVPVVI 478
Query: 328 SNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSP 387
++ Y LPF+D+L W F++ + RD+ +K+IL +S ++ + + V++HF N
Sbjct: 479 ADDYVLPFSDVLSWPAFSLRVAVRDIPDIKRILSAVSPRRYIRMQRRVRAVRRHFMLNGV 538
Query: 388 PVRFDAFHMVMYELWLRR 405
P R+D FHM+++ +WLRR
Sbjct: 539 PQRYDVFHMILHSIWLRR 556
>gi|297811281|ref|XP_002873524.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297319361|gb|EFH49783.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 546
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 133/351 (37%), Positives = 207/351 (58%), Gaps = 17/351 (4%)
Query: 66 DLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRE 125
+L + ++H+ S+F +Y M K+YIY +GD F+Q + G YASEG+F + +
Sbjct: 197 NLYAPLYHNLSIFKRSYELMEQTLKVYIYSEGDRPIFHQPEAIMEGIYASEGWFMKLMES 256
Query: 126 S-RFLTHDPDRAHLFFIPISCHKMRGK-----GTSYENMTVIVKDYVDSLINKYPYWNRT 179
+ RFLT DP+ AHLF++P S ++ K S N+ +K+Y+D + + YP+WNRT
Sbjct: 257 NHRFLTKDPNIAHLFYLPFSTRILQQKLYVHDSHSRRNLVKYLKNYLDLIASNYPFWNRT 316
Query: 180 LGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV-LQPFALP 238
G+DHFF CHD T G P++ N IR +C+ V F+ KDV+LP+ + P
Sbjct: 317 RGSDHFFTACHDWAPAETRG-PYI--NCIRSLCNADVGVDFVVGKDVSLPETKISSSQNP 373
Query: 239 RG---GRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRF 295
G G RT+L F+AG+ + +R IL W + E D+ +I I Y R
Sbjct: 374 NGNIGGNRPSKRTILAFFAGNLHGYVRPILLNQWSSRPEPDM---KIFNRIDHKSYIRYM 430
Query: 296 YKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ 355
+++FC+C G +VNS R+ +S+ YGCVPVI+S+ + PF +IL+W FAV + E+++
Sbjct: 431 KRSRFCVCAKGYEVNSPRVVESVLYGCVPVIISDNFVPPFLEILNWESFAVFVPEKEIPN 490
Query: 356 LKQILKNISDAEFVSLHNNLVKVQKHFQW-NSPPVRFDAFHMVMYELWLRR 405
L++IL +I +V + ++KVQKHF W + PVR+D FHM+++ +W R
Sbjct: 491 LRKILISIPVRRYVEMQKRVMKVQKHFMWHDGEPVRYDVFHMILHSVWYNR 541
>gi|357130589|ref|XP_003566930.1| PREDICTED: probable glycosyltransferase At5g03795-like
[Brachypodium distachyon]
Length = 510
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 132/360 (36%), Positives = 215/360 (59%), Gaps = 22/360 (6%)
Query: 66 DLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRE 125
DL + +F + S F +Y M K+YIY DG F+ P L+G YASEG+F + ++E
Sbjct: 145 DLYAPLFRNVSQFKRSYELMERILKVYIYQDGRRPIFHTPP--LSGIYASEGWFMKLLKE 202
Query: 126 SR-FLTHDPDRAHLFFIPISCHKMR-----GKGTSYENMTVIVKDYVDSLINKYPYWNRT 179
SR F+ D +AHLF++P S +R + + V ++D+V L KYP+WNR
Sbjct: 203 SRRFVVADGAKAHLFYLPYSSQHLRLSLYVPDSHNLRPLAVYLRDFVQGLAAKYPFWNRN 262
Query: 180 LGADHFFVTCHD-------VGVRATEGVPFLIKNAIRVVC-SPSYDVAFIPHKDVALPQ- 230
GADHF V CHD G T L +N+I+ +C + S + F P KDV+LP+
Sbjct: 263 RGADHFLVACHDWRKFYYLQGPYTTTAHRDLRRNSIKALCNADSSERIFSPGKDVSLPET 322
Query: 231 -VLQPFALPR--GGRDVENRTVLGFWAGHRNSKIRVILARVWEN--DTELDISNNRINRA 285
+ P R GG V R +L F+AG+ + ++R +L + W + D ++ + NR
Sbjct: 323 TIRTPKRPLRYVGGLPVSRRRILAFFAGNVHGRVRPVLLKHWGDGRDDDMRVYGPLPNRV 382
Query: 286 IGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFA 345
+ Y + ++FC+CP G +VNS RI ++++Y CVPV++++ + LPF+D+LDW F+
Sbjct: 383 SRQMSYIQHMKNSRFCLCPMGHEVNSPRIVEALYYECVPVVIADNFVLPFSDVLDWTAFS 442
Query: 346 VILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
V++ E+D+ LK+IL+ IS +V++H+ + ++Q+HF W++ P+R+D FHM+++ +WL R
Sbjct: 443 VVVAEKDIPDLKKILQGISLRRYVAMHDCVKRLQRHFLWHARPLRYDLFHMILHSIWLSR 502
>gi|296085067|emb|CBI28482.3| unnamed protein product [Vitis vinifera]
Length = 634
Score = 246 bits (628), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 136/354 (38%), Positives = 204/354 (57%), Gaps = 19/354 (5%)
Query: 63 DYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQN 122
DY G D++ + + FH +YL M FKI+IY +G+P F+ P K Y+ EG FF
Sbjct: 284 DYVPQG-DIYRNATAFHRSYLLMEKLFKIFIYKEGEPPLFHNGPCKSI--YSIEGVFFSL 340
Query: 123 IR-ESRFLTHDPDRAHLFFIPISC-----HKMRGKGTSYENMTVIVKDYVDSLINKYPYW 176
+ ++ F T DPD AH++F+P S H M +V DYV + KY YW
Sbjct: 341 MEGDTHFRTQDPDEAHVYFLPFSVVMIIHHLFDPIVRDKYVMKHVVSDYVKVISQKYRYW 400
Query: 177 NRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV----L 232
NR+LGADHF ++CHD G RAT VP L N+IR++C+ + F P KD ++P++
Sbjct: 401 NRSLGADHFMLSCHDWGPRATWYVPQLYYNSIRLLCNANTSECFNPRKDASIPEINLIDG 460
Query: 233 QPFALPRGGRDVENRTVLGFWAGHRNSKIRVILARVW-ENDTELDISNNRINRAIGPLVY 291
+ L GG RT+L F+AG + +IR L + W E D ++ + L Y
Sbjct: 461 ETIGL-TGGLPPSKRTILAFFAGGLHGRIRPALLQHWKEKDEQVQVYETLPEG----LSY 515
Query: 292 QRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNER 351
K+K+CICP G +V S RI ++I+ CVPV++S +Y LPF+D+LDW F++ ++
Sbjct: 516 PDLMKKSKYCICPSGHEVASPRIVEAIYAECVPVLISQHYVLPFSDVLDWGSFSIQVSVN 575
Query: 352 DVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
++ LK+IL I ++ + + +VQ+HF N+PP RFD FHM+++ +WLRR
Sbjct: 576 EIPNLKKILLGIPQDRYIRMQERVKQVQQHFVVNNPPKRFDVFHMIIHSIWLRR 629
>gi|15231488|ref|NP_187419.1| putative glycosyltransferase [Arabidopsis thaliana]
gi|75207507|sp|Q9SSE8.1|GLYT1_ARATH RecName: Full=Probable glycosyltransferase At3g07620
gi|6466945|gb|AAF13080.1|AC009176_7 hypothetical protein [Arabidopsis thaliana]
gi|332641055|gb|AEE74576.1| putative glycosyltransferase [Arabidopsis thaliana]
Length = 470
Score = 246 bits (628), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 138/373 (36%), Positives = 213/373 (57%), Gaps = 18/373 (4%)
Query: 48 QNNFNSTLSFAINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPR 107
Q N++ST S + +DY G D++ +P FH +YL M FKIY+Y +GDP F+
Sbjct: 103 QLNYSSTTSSPLGDEDYVPHG-DIYRNPYAFHRSYLLMEKMFKIYVYEEGDPPIFHYGLC 161
Query: 108 KLTGKYASEGYFFQNIRES--RFLTHDPDRAHLFFIPISC-----HKMRGKGTSYENMTV 160
K Y+ EG F + ++ T DPD+AH++F+P S H +
Sbjct: 162 K--DIYSMEGLFLNFMENDVLKYRTRDPDKAHVYFLPFSVVMILHHLFDPVVRDKAVLER 219
Query: 161 IVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAF 220
++ DYV + KYPYWN + G DHF ++CHD G RAT V L N+IRV+C+ + F
Sbjct: 220 VIADYVQIISKKYPYWNTSDGFDHFMLSCHDWGHRATWYVKKLFFNSIRVLCNANISEYF 279
Query: 221 IPHKDVALPQV---LQPFALPRGGRDVENRTVLGFWAGHRNSKIRVILARVW-ENDTELD 276
P KD P++ GG D +RT L F+AG + KIR +L W E D ++
Sbjct: 280 NPEKDAPFPEINLLTGDINNLTGGLDPISRTTLAFFAGKSHGKIRPVLLNHWKEKDKDIL 339
Query: 277 ISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFN 336
+ N + L Y K++FCICP G +V S R+ ++I+ GCVPV++S Y LPF+
Sbjct: 340 VYENLPDG----LDYTEMMRKSRFCICPSGHEVASPRVPEAIYSGCVPVLISENYVLPFS 395
Query: 337 DILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHM 396
D+L+W KF+V ++ +++ +LK+IL +I + ++ L+ + KV++H N PP R+D F+M
Sbjct: 396 DVLNWEKFSVSVSVKEIPELKRILMDIPEERYMRLYEGVKKVKRHILVNDPPKRYDVFNM 455
Query: 397 VMYELWLRRHVIK 409
+++ +WLRR +K
Sbjct: 456 IIHSIWLRRLNVK 468
>gi|356565155|ref|XP_003550810.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
[Glycine max]
Length = 496
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 132/386 (34%), Positives = 228/386 (59%), Gaps = 29/386 (7%)
Query: 41 SFRSYPLQNNFNSTLSFAINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPN 100
S + Y L N+ S ++ + GS ++ +P F +++EMV FK+++Y +G+
Sbjct: 119 SIQEYILSRNYTSQ-----RRESFVPKGS-IYRNPHAFLQSHIEMVKRFKVWVYQEGEQP 172
Query: 101 TFYQTPRKLTGKYASEGYFFQNI----RESRFLTHDPDRAHLFFIPIS-------CHKMR 149
+ P + YA EG F I + S+F P+ AH+FF+P S +K
Sbjct: 173 LVHDGP--VNNIYAIEGQFMDEIDNNDKWSQFRARHPEEAHVFFLPFSIANVVHYVYKPI 230
Query: 150 GKGTSYE--NMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNA 207
K + YE + ++V+DY+ + +KYPYWNR+ GADHF ++CHD + + G P L ++
Sbjct: 231 LKQSDYEPVRLQLLVEDYISVIEDKYPYWNRSKGADHFLLSCHDWAPKVSNGNPELFQSF 290
Query: 208 IRVVCSPSYDVAFIPHKDVALPQVLQP---FALPRGGRDVENRTVLGFWAGHRNSKIRVI 264
IR +C+ + F P++DV++P+V P P G+ +RT+L F+AG + +IR I
Sbjct: 291 IRALCNANTSEGFHPNRDVSIPEVYLPVGKLGPPSLGQHPNSRTILAFFAGGVHGEIRKI 350
Query: 265 LARVWEN-DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCV 323
L + W++ D E+ + + + ++ Y + ++KFC+CP G +V S R+ ++IH GCV
Sbjct: 351 LLKHWKDKDNEVRV-HEYLPKSQN---YTKLMGQSKFCLCPSGHEVASPRVVEAIHAGCV 406
Query: 324 PVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQ 383
PVI+ + Y LPF+D+L W +F+V ++ + + ++K IL++IS +++ LH N+++V++HF
Sbjct: 407 PVIICDNYSLPFSDVLHWSQFSVKVSVQKIPEIKSILQSISRKKYLRLHMNVLRVRRHFM 466
Query: 384 WNSPPVRFDAFHMVMYELWLRRHVIK 409
N P FD HM+++ +WLRR IK
Sbjct: 467 INRPAKPFDMMHMILHSIWLRRLNIK 492
>gi|357466145|ref|XP_003603357.1| hypothetical protein MTR_3g106690 [Medicago truncatula]
gi|355492405|gb|AES73608.1| hypothetical protein MTR_3g106690 [Medicago truncatula]
Length = 393
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 137/347 (39%), Positives = 202/347 (58%), Gaps = 19/347 (5%)
Query: 76 SVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRES--RFLTHDP 133
S ++ +YLEM FK+Y+YPDGD + P K Y++EG F + +F T+DP
Sbjct: 50 SYYYRSYLEMEKIFKVYVYPDGDLPIVHDGPCK--DIYSTEGRFLHEMERGVGKFRTNDP 107
Query: 134 DRAHLFFIPISCHKMRG---KGTSYE--NMTVIVKDYVDSLINKYPYWNRTLGADHFFVT 188
+ AH++F+P S M +SY+ +T V DYV + +YP+WNRT GADHF +
Sbjct: 108 NAAHVYFLPFSVTWMVKYLYTPSSYDITPLTQFVSDYVRVVSMRYPFWNRTHGADHFMLA 167
Query: 189 CHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV------LQPFALPRGGR 242
CHD G A++G PFL +IRV+C+ + F P KDV+LP++ + P L
Sbjct: 168 CHDWGPHASKGNPFLYNTSIRVLCNANTSEGFNPLKDVSLPEIHLYGGEVSPKLLSLPPE 227
Query: 243 DVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCI 302
+ R L F+AG + IR IL + W+N + + N + + I Y +KFC+
Sbjct: 228 NAPRR-YLAFFAGGMHGPIRPILLQHWKNRDKDILVNEYLPKGID---YYSIMLSSKFCL 283
Query: 303 CPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKN 362
CP G +V S RI +SI+ CVPVILSNYY LPF+D+L W F+V ++ D+ +LK+IL
Sbjct: 284 CPSGFEVASPRIVESIYAECVPVILSNYYVLPFSDVLQWEAFSVQVDVSDIPRLKEILSA 343
Query: 363 ISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIK 409
I ++++ L + V++HF N P RFD FHM+++ +WLRR IK
Sbjct: 344 IPESKYKKLKQGVRAVRRHFTLNQPAKRFDVFHMILHSIWLRRLNIK 390
>gi|255543306|ref|XP_002512716.1| catalytic, putative [Ricinus communis]
gi|223548677|gb|EEF50168.1| catalytic, putative [Ricinus communis]
Length = 426
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 139/364 (38%), Positives = 211/364 (57%), Gaps = 32/364 (8%)
Query: 64 YSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPD----GDP--NTF------YQTPRKLTG 111
+S S +H+ +F ++ EM KI++Y D P N F + P+ G
Sbjct: 70 HSKWESGPYHNWKLFAADFEEMKQQLKIFVYSDVSNKSSPFANIFLPIENPFHHPK--LG 127
Query: 112 KYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHKMRG--KGTSYENMTVIVKDYVDSL 169
Y SE F + S +T DP +A FF+P S + +R + S E+++ V Y ++
Sbjct: 128 NYFSEHIFKVALLRSSLVTLDPAKALFFFLPFSINNLRNDPRFHSEESISEFVAHYTTTI 187
Query: 170 INKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSY-DVAFIPHKDVAL 228
++ YWN + GADHF+V CH VG +A P L NAI++ CS SY F+ HKDV L
Sbjct: 188 SQRFSYWNASAGADHFYVCCHSVGRQAASRHPALHNNAIQLTCSSSYFQRFFVSHKDVGL 247
Query: 229 PQVLQPFALPRGGRDVEN----RTVLGFWAGH-RNSKIRVILARVWENDTELDISNNRIN 283
PQV PR + N R L ++AG +NS++R L +W NDTE+DI N +
Sbjct: 248 PQVW-----PRPPQTALNPPHARHRLVYFAGRVQNSQVRRELVNLWGNDTEMDIINGSPS 302
Query: 284 RAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRK 343
Y+ F ++K+C+ G +VN+AR++DSIHYGC+PVI+SNYYDLPF +LDW K
Sbjct: 303 -----FPYEEGFKRSKYCLHVKGYEVNTARVSDSIHYGCIPVIISNYYDLPFATVLDWSK 357
Query: 344 FAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWL 403
F+V++N+ D+ LK L I+ ++++ NL +V++HF+W++ P +D+F+M Y+LWL
Sbjct: 358 FSVVINQADIPFLKTTLLAITRKTYITMFQNLCRVRRHFEWHTTPKGYDSFYMTAYQLWL 417
Query: 404 RRHV 407
RR +
Sbjct: 418 RRSI 421
>gi|359485860|ref|XP_002264111.2| PREDICTED: probable glycosyltransferase At3g07620-like [Vitis
vinifera]
Length = 410
Score = 245 bits (626), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 136/354 (38%), Positives = 204/354 (57%), Gaps = 19/354 (5%)
Query: 63 DYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQN 122
DY G D++ + + FH +YL M FKI+IY +G+P F+ P K Y+ EG FF
Sbjct: 60 DYVPQG-DIYRNATAFHRSYLLMEKLFKIFIYKEGEPPLFHNGPCKSI--YSIEGVFFSL 116
Query: 123 IR-ESRFLTHDPDRAHLFFIPISC-----HKMRGKGTSYENMTVIVKDYVDSLINKYPYW 176
+ ++ F T DPD AH++F+P S H M +V DYV + KY YW
Sbjct: 117 MEGDTHFRTQDPDEAHVYFLPFSVVMIIHHLFDPIVRDKYVMKHVVSDYVKVISQKYRYW 176
Query: 177 NRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV----L 232
NR+LGADHF ++CHD G RAT VP L N+IR++C+ + F P KD ++P++
Sbjct: 177 NRSLGADHFMLSCHDWGPRATWYVPQLYYNSIRLLCNANTSECFNPRKDASIPEINLIDG 236
Query: 233 QPFALPRGGRDVENRTVLGFWAGHRNSKIRVILARVW-ENDTELDISNNRINRAIGPLVY 291
+ L GG RT+L F+AG + +IR L + W E D ++ + L Y
Sbjct: 237 ETIGL-TGGLPPSKRTILAFFAGGLHGRIRPALLQHWKEKDEQVQVYETLPEG----LSY 291
Query: 292 QRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNER 351
K+K+CICP G +V S RI ++I+ CVPV++S +Y LPF+D+LDW F++ ++
Sbjct: 292 PDLMKKSKYCICPSGHEVASPRIVEAIYAECVPVLISQHYVLPFSDVLDWGSFSIQVSVN 351
Query: 352 DVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
++ LK+IL I ++ + + +VQ+HF N+PP RFD FHM+++ +WLRR
Sbjct: 352 EIPNLKKILLGIPQDRYIRMQERVKQVQQHFVVNNPPKRFDVFHMIIHSIWLRR 405
>gi|297833558|ref|XP_002884661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297330501|gb|EFH60920.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 470
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 140/373 (37%), Positives = 209/373 (56%), Gaps = 18/373 (4%)
Query: 48 QNNFNSTLSFAINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPR 107
Q N NST S + +DY G D++ +P FH +YL M FKIY+Y +GDP F+
Sbjct: 103 QLNSNSTASSPLGDEDYVPHG-DIYRNPYAFHRSYLLMERMFKIYVYEEGDPPIFHYGLC 161
Query: 108 KLTGKYASEGYFFQNIRES--RFLTHDPDRAHLFFIPISC-----HKMRGKGTSYENMTV 160
K Y+ EG F + ++ T DPD+AH++F+P S H +
Sbjct: 162 K--DIYSMEGLFLNFMENDVLKYRTRDPDKAHVYFLPFSVVMILHHLFDPVVRDKAVLES 219
Query: 161 IVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAF 220
++ DYV + KYPYWN + G DHF ++CHD G RAT V L N+IRV+C+ + F
Sbjct: 220 VIADYVQIISKKYPYWNTSDGFDHFMLSCHDWGHRATWYVKKLFFNSIRVLCNANISEYF 279
Query: 221 IPHKDVALPQV---LQPFALPRGGRDVENRTVLGFWAGHRNSKIRVILARVW-ENDTELD 276
P KD P++ GG D +R L F+AG + KIR +L W E D ++
Sbjct: 280 NPEKDAPFPEINLLTGEINNLTGGLDPISRKTLAFFAGKSHGKIRPVLLNHWKEKDKDIL 339
Query: 277 ISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFN 336
+ N L Y KT+FCICP G +V S RI ++I+ GCVPV++S Y LPF+
Sbjct: 340 VYENLPED----LNYTEMMRKTRFCICPSGHEVASPRIPEAIYSGCVPVLISENYVLPFS 395
Query: 337 DILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHM 396
D+L+W KF+V ++ +++ +LK+IL +I + ++ L+ + KV+ H N PP R+D F+M
Sbjct: 396 DVLNWEKFSVSVSVKEIPELKRILMDIPEERYMRLYEGVKKVKMHILVNDPPKRYDVFNM 455
Query: 397 VMYELWLRRHVIK 409
+++ +WLRR +K
Sbjct: 456 IIHSIWLRRLNVK 468
>gi|356565157|ref|XP_003550811.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
[Glycine max]
Length = 411
Score = 244 bits (622), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 130/376 (34%), Positives = 219/376 (58%), Gaps = 24/376 (6%)
Query: 47 LQNNFNSTLSFAINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTP 106
+Q + S + N+ ++ GS ++ +P FH ++ EM+ FK+++Y +G+ + P
Sbjct: 32 IQESIRSRNYTSANRVNFVPKGS-IYLNPHAFHQSHEEMLKRFKVWVYEEGEQPLVHDGP 90
Query: 107 RKLTGKYASEGYFFQNI----RESRFLTHDPDRAHLFFIPIS-------CHKMRGKGTSY 155
Y+ EG F I + S F PD+A +FF+P S +K K + Y
Sbjct: 91 --ANDIYSIEGQFIDEIDNDAKWSHFRAEHPDQAQVFFLPFSIANVVHYVYKPIRKHSDY 148
Query: 156 ENMTV--IVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCS 213
E + + +V+DY+ + NKYPYWNR+ GADHF ++CHD G + + G P L KN IRV+C+
Sbjct: 149 EPIRLQRLVEDYIGVIANKYPYWNRSEGADHFLLSCHDWGPKVSYGNPKLFKNFIRVLCN 208
Query: 214 PSYDVAFIPHKDVALPQVLQP---FALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWE 270
+ F+P+KDV++P+V P P G+ +R++L F+AG + IR IL W+
Sbjct: 209 ANTSEGFLPNKDVSIPEVYLPKGKLGPPNLGQRPNDRSILAFFAGREHGDIRKILLNHWK 268
Query: 271 N-DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSN 329
D ++ + + + + Y + ++KFC+CP G +V S R+ ++IH GCVPV++S+
Sbjct: 269 GKDNDIQV-HEYLPKGKN---YTQLMGQSKFCLCPSGYEVASPRVVEAIHAGCVPVLISS 324
Query: 330 YYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPV 389
Y PF D+L+W +F+V + + ++K IL++IS ++ LH N+++V++HF N P
Sbjct: 325 SYSPPFTDVLNWSQFSVEIPVEKISEIKTILQSISRNRYLRLHMNVLRVRRHFMLNRPAK 384
Query: 390 RFDAFHMVMYELWLRR 405
FD HM+++ +WLRR
Sbjct: 385 PFDLMHMILHSIWLRR 400
>gi|297740031|emb|CBI30213.3| unnamed protein product [Vitis vinifera]
Length = 337
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 124/333 (37%), Positives = 195/333 (58%), Gaps = 20/333 (6%)
Query: 85 MVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR-ESRFLTHDPDRAHLFFIPI 143
M K+Y+Y +G+P F+ P K Y+ EG F + +S F T DP++AHLFF+P
Sbjct: 1 MEKQLKVYVYDEGEPPLFHNGPCKSI--YSMEGNFIHKMEMDSHFRTKDPEKAHLFFLPF 58
Query: 144 SCHKMRG-----KGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATE 198
S + + V DYV+ + KYPYWNR+LGADHF + CHD G +
Sbjct: 59 SVAMLVRFVYVRDSHDLGPIKQTVIDYVNVVSTKYPYWNRSLGADHFMLACHDWGPETSF 118
Query: 199 GVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV------LQPFALPRGGRDVENRTVLGF 252
+P+L KN+IRV+C+ + F P KDV+ P++ F GG +RT+L F
Sbjct: 119 SIPYLHKNSIRVLCNANTSEGFNPSKDVSFPEINLLTGSTDSFI---GGPSPSHRTLLAF 175
Query: 253 WAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSA 312
+AG + IR IL WEN E + + + + Y K+K+C+CP G +V S
Sbjct: 176 FAGGLHGPIRPILLEHWENKDEDVKVHKYLPKGVS---YYEMMRKSKYCLCPSGYEVASP 232
Query: 313 RIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLH 372
R+ ++++ GCVPV++S++Y PF+D+L+W+ F+V + R++ LK+IL +IS +++ +
Sbjct: 233 RVVEALYTGCVPVLISDHYVPPFSDVLNWKSFSVEVPVREIPNLKRILMDISPRQYIRMQ 292
Query: 373 NNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
++ ++HF+ NSPP R+D FHM+++ LWLRR
Sbjct: 293 RRGIQARRHFEVNSPPKRYDVFHMILHSLWLRR 325
>gi|449518133|ref|XP_004166098.1| PREDICTED: probable glycosyltransferase At5g20260-like, partial
[Cucumis sativus]
Length = 362
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 132/349 (37%), Positives = 207/349 (59%), Gaps = 21/349 (6%)
Query: 75 PSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR--ESRFLTHD 132
P FH +++EM KI+ Y +G+ + P K Y+ EG+F + +S F H+
Sbjct: 18 PFSFHQSHIEMKKRLKIWTYKEGEQPLVHDGPMKHI--YSIEGHFIDEMDSGKSPFSAHE 75
Query: 133 PDRAHLFFIPIS----CHKMRGKGTSY--ENMTVIVKDYVDSLINKYPYWNRTLGADHFF 186
P+ A +FF+PIS + T+Y + + I DYV + NKYPYWNRT GADHF
Sbjct: 76 PEEAQVFFLPISIVYIVDYIYKPITTYARDRLVRIFTDYVRVVANKYPYWNRTRGADHFM 135
Query: 187 VTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQP----FALPRGGR 242
V+CHD T+ P L K IRV+C+ + F P +D +LP++ P LPR G+
Sbjct: 136 VSCHDWAPEVTKEDPNLFKYFIRVLCNANTSEGFNPMRDASLPEINLPPTFHLNLPRLGQ 195
Query: 243 DVENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLV-YQRRFYKTKF 300
+NR++L F+AG + IR IL + W++ D E+ +++ + P Y ++KF
Sbjct: 196 PPQNRSILAFFAGGAHGFIRHILMQHWKDKDHEI-----QVHEYLPPSQNYTELIDRSKF 250
Query: 301 CICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQIL 360
C+CP G +V S R+ ++IH GCVPV++S+YY LPF+D+LDW KF++ + + ++K IL
Sbjct: 251 CLCPSGYEVASPRLVEAIHGGCVPVVISDYYSLPFDDVLDWSKFSMRIPSERIPEIKTIL 310
Query: 361 KNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIK 409
+ +S +++ L ++KVQ+HF+ + P FD FHMV++ +WLRR +K
Sbjct: 311 RGVSMKKYLKLQRGVMKVQRHFEIHRPAKAFDMFHMVLHSVWLRRLNVK 359
>gi|51477380|gb|AAU04753.1| EXO [Cucumis melo]
Length = 343
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 124/346 (35%), Positives = 200/346 (57%), Gaps = 28/346 (8%)
Query: 76 SVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESR-FLTHDPD 134
SV +YLEM KIY+Y +G+P F+ P K Y++EG F + + + T+DPD
Sbjct: 3 SVMCRSYLEMERLLKIYVYKEGEPPMFHGGPCK--SIYSTEGRFIHEMEKGNLYTTNDPD 60
Query: 135 RAHLFFIPISC------------HKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGA 182
+A L+F+P S H++ G + + DY++ + K+P+W+R+LGA
Sbjct: 61 QALLYFLPFSVVNLVQYLYVPNSHEVNAIGRA-------ITDYINVISKKHPFWDRSLGA 113
Query: 183 DHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV-LQPFALPR-- 239
DHF ++CHD G R T VP L N+IRV+C+ + F+P KD + P++ L+ +
Sbjct: 114 DHFMLSCHDWGPRTTSYVPLLFNNSIRVLCNANVSEGFLPSKDASFPEIHLRTGEIDGLI 173
Query: 240 GGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTK 299
GG R+VL F+AG + IR +L + W+ E + + I Y K++
Sbjct: 174 GGLSPSRRSVLAFFAGRLHGHIRYLLLQEWKEKDEDVLVYEELPSGIS---YNSMLKKSR 230
Query: 300 FCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQI 359
FC+CP G +V S R+ ++I+ CVPV++S Y PF+D+L+W+ F+V + +D+ +K+I
Sbjct: 231 FCLCPSGYEVASPRVVEAIYAECVPVLISESYVPPFSDVLNWKSFSVQIQVKDIPNIKKI 290
Query: 360 LKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
LK IS +++ + + +VQ+HF N P RFDAFHM+++ +WLRR
Sbjct: 291 LKGISQTQYLRMQRRVKQVQRHFALNGTPKRFDAFHMILHSIWLRR 336
>gi|449469432|ref|XP_004152424.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
sativus]
Length = 472
Score = 242 bits (617), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 136/364 (37%), Positives = 214/364 (58%), Gaps = 32/364 (8%)
Query: 63 DYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQN 122
D +DL + +F + S+F +Y M K+YIY DG F+ TP L G YASEG+F +
Sbjct: 111 DDADLYAPLFLNVSIFKRSYELMELILKVYIYRDGSRPIFH-TPH-LRGIYASEGWFMKL 168
Query: 123 IRESR-FLTHDPDRAHLFFIPISCHKMRGK-----GTSYENMTVIVKDYVDSLINKYPYW 176
+ E+R F+T DP++AHLF++ S +++ + + +++ ++D+V+ + KYPYW
Sbjct: 169 MEENRQFVTKDPEKAHLFYLAYSSRQLQTALYVPDSHNMKPLSIYLRDHVNWIAGKYPYW 228
Query: 177 NRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSY-DVAFIPHKDVALPQVLQPF 235
NRT G DHF V CHD G L ++ I+ +C+ + F KDV+LP+
Sbjct: 229 NRTHGYDHFLVACHDWGPYTVNEHRELSQHTIKALCNADLSEGVFKLGKDVSLPETT--I 286
Query: 236 ALPR------GGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPL 289
PR GG+ V R +L F+AG+ + ++R IL + W ND + DI R GPL
Sbjct: 287 RTPRKPLRNVGGKRVSQRPILAFFAGNMHGRVRPILLKHW-NDKDDDI------RVYGPL 339
Query: 290 --------VYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDW 341
Y + +K+CICP G +VNS RI ++I+Y CVPVI+++ + LPF++ LDW
Sbjct: 340 PLRVSRKMTYIQHMKSSKYCICPMGYEVNSPRIIEAIYYECVPVIIADNFVLPFSEFLDW 399
Query: 342 RKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYEL 401
F+V++ E+D+ +LK+IL I ++++ N+ VQKHF WN P+++D FHMV++ +
Sbjct: 400 SAFSVVVAEKDIPKLKEILTAIPLKRYLTMQINVKMVQKHFLWNPKPLKYDLFHMVLHSI 459
Query: 402 WLRR 405
W R
Sbjct: 460 WFSR 463
>gi|356565153|ref|XP_003550809.1| PREDICTED: probable glycosyltransferase At5g20260-like [Glycine
max]
Length = 398
Score = 241 bits (616), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 123/360 (34%), Positives = 203/360 (56%), Gaps = 20/360 (5%)
Query: 61 KQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFF 120
KQ++ ++ +P FH +++EMV FK+++Y +G+ + P + YA EG F
Sbjct: 35 KQEHFIPKGSIYRNPHAFHQSHMEMVKRFKVWVYEEGEQPLVHYGP--VNDIYAIEGQFI 92
Query: 121 QNIRESR---FLTHDPDRAHLFFIPISCHK---------MRGKGTSYENMTVIVKDYVDS 168
I S+ F +PD AH FF+P+S M S + + +V+DY+
Sbjct: 93 DEIDNSKRSPFKARNPDEAHAFFLPLSVVNVVHYVYKPYMSQNDYSRDRLQRLVEDYIGV 152
Query: 169 LINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVAL 228
+ +KYPYWNR+ GADHF ++CHD + P L KN IRV+C+ + F P +DV++
Sbjct: 153 VADKYPYWNRSNGADHFLLSCHDWAPEISHANPDLFKNFIRVLCNANNSEGFQPKRDVSI 212
Query: 229 PQVLQP---FALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRA 285
P+V P P G+ NRT+L F++G + IR +L + W++ + + +
Sbjct: 213 PEVYLPVGKLGPPNLGQHPMNRTILAFFSGGAHGDIRKLLLKHWKDKDNHVQVHEYLPKG 272
Query: 286 IGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFA 345
Y +KFC+CP G +V S R+ ++I+ GCVPVI+S Y LPF+D+L+W +F+
Sbjct: 273 QN---YTELMGLSKFCLCPSGYEVASPRVVEAINAGCVPVIISENYSLPFSDVLNWSQFS 329
Query: 346 VILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+ ++ ++ +K IL+N++ ++ LH N+ +VQ+HF N P FD HM+++ +WLRR
Sbjct: 330 IQISVENISDIKTILQNVTQKKYKKLHRNVRRVQRHFVMNRPAKPFDLMHMILHSIWLRR 389
>gi|224068931|ref|XP_002326234.1| predicted protein [Populus trichocarpa]
gi|222833427|gb|EEE71904.1| predicted protein [Populus trichocarpa]
Length = 328
Score = 241 bits (614), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 119/333 (35%), Positives = 198/333 (59%), Gaps = 20/333 (6%)
Query: 85 MVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR-ESRFLTHDPDRAHLFFIPI 143
M FK+++Y +G+P F+ P + Y+ EG F + + F T DPD+AH++F+P
Sbjct: 1 MEKQFKVFVYGEGEPPVFHNGPCRSI--YSMEGNFIHRMEIDGHFRTKDPDKAHVYFLPF 58
Query: 144 SCHKM-----RGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATE 198
S M + + + V DY++ + KYP+WNR+LGADHF + CHD G A+
Sbjct: 59 SVAMMVRFVYERESRDFGPIRRTVSDYINLISGKYPFWNRSLGADHFMLACHDWGPEASF 118
Query: 199 GVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV------LQPFALPRGGRDVENRTVLGF 252
VP L K +IR +C+ + F P KDV+LP++ ++ F GG R++L F
Sbjct: 119 SVPHLGKISIRALCNANTSEKFNPIKDVSLPEINLRTGSIKGFV---GGLSPSKRSILAF 175
Query: 253 WAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSA 312
+AG + IR ++ WEN + + ++ + + Y +KFC+CP G +V S
Sbjct: 176 FAGRLHGPIRPVVLEHWENKDDDIKVHQQLPKGVS---YYEMMRGSKFCLCPSGYEVASP 232
Query: 313 RIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLH 372
RI ++++ GCVPV++S++Y PF+D+L+W+ F+V + D+ LK+IL +IS +++ +
Sbjct: 233 RIVEALYAGCVPVLISDHYVPPFSDVLNWKSFSVEVPVSDIPSLKKILTSISPRQYIRMQ 292
Query: 373 NNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+++V++HF+ NSPP RFD FHM+++ +WLRR
Sbjct: 293 RRVLQVRRHFEVNSPPKRFDVFHMILHSIWLRR 325
>gi|255550904|ref|XP_002516500.1| catalytic, putative [Ricinus communis]
gi|223544320|gb|EEF45841.1| catalytic, putative [Ricinus communis]
Length = 456
Score = 241 bits (614), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 144/385 (37%), Positives = 213/385 (55%), Gaps = 23/385 (5%)
Query: 38 SFNSFRS-YPLQNN-----FNSTLSFAINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKI 91
SFNS R +PL+ + S + A ++ + L + ++ + S+F +Y M K+
Sbjct: 71 SFNSVRPHWPLEVDQQLMFARSQIENAPGVKNDTVLYAPIYRNVSMFERSYELMENMLKV 130
Query: 92 YIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR-ESRFLTHDPDRAHLFFIPISCHKMR- 149
+IY +G+ F+Q+ L G YASEG+F + + +F+T DP AHLF+IP S +
Sbjct: 131 FIYQEGEKPIFHQS--ILEGIYASEGWFIKLMEANEKFVTKDPKEAHLFYIPFSSRLLEL 188
Query: 150 ----GKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIK 205
K S N+ +K+Y D + KYP+W+RT GADHF CHD T G +
Sbjct: 189 TLYVRKSHSRNNLIEYMKNYTDMIAAKYPFWSRTGGADHFVAACHDWAPAETRG---RML 245
Query: 206 NAIRVVCSPSYDVAFIPHKDVALPQV----LQPFALPRGGRDVENRTVLGFWAGHRNSKI 261
N IR +C+ DV F KDV+LP+ Q G R +L F+AG+ + +
Sbjct: 246 NCIRALCNADIDVGFRIGKDVSLPETYVRSAQNPLKNLDGNPPSQRPILAFFAGNVHGFV 305
Query: 262 RVILARVWEN-DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHY 320
R IL WEN D E+ I + R G Y + +K+CICP G +VNS RI +SI Y
Sbjct: 306 RPILLEYWENKDPEMKIFGP-MPRVKGNTNYIQLMKSSKYCICPRGHEVNSPRIVESIFY 364
Query: 321 GCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQK 380
CVPVI+S+ Y PF ++LDW FAV + E+D+ LK+IL +I + +V +H + KVQ+
Sbjct: 365 ECVPVIISDNYVPPFFEVLDWESFAVFVLEKDIPNLKKILLSIPEETYVEMHKRVKKVQQ 424
Query: 381 HFQWNSPPVRFDAFHMVMYELWLRR 405
HF W+S P + D FHM+++ +W R
Sbjct: 425 HFLWHSEPEKHDLFHMILHSVWYNR 449
>gi|356546126|ref|XP_003541482.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
[Glycine max]
Length = 490
Score = 240 bits (613), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 125/358 (34%), Positives = 212/358 (59%), Gaps = 22/358 (6%)
Query: 69 SDVFHSPSVF-HLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFF----QNI 123
++ +P F H +++EMV FK+++Y +G+ + P + YA EG F N
Sbjct: 134 GSIYRNPHAFLHRSHIEMVKRFKVWVYQEGEQPLVHDGP--VNNIYAIEGQFMDEMDNNG 191
Query: 124 RESRFLTHDPDRAHLFFIPIS-------CHKMRGKGTSYENMTV--IVKDYVDSLINKYP 174
+ S+F P+ AH+FF+PIS +K K + YE + + +V+DY+ + +KYP
Sbjct: 192 KWSQFRARHPEEAHVFFLPISIANVVHYVYKPILKQSDYEPVRLQHLVEDYIGVIQDKYP 251
Query: 175 YWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQP 234
YWNR++GADHF ++CHD G + + G P L + IR +C+ + F P++DV++P+V P
Sbjct: 252 YWNRSIGADHFLLSCHDWGPKVSYGNPELFQTFIRALCNANTSEGFHPNRDVSIPEVYLP 311
Query: 235 ---FALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVY 291
G+ +RT L F+AG + +IR IL + W++ + + + + Y
Sbjct: 312 VGKLGPASLGQHPNSRTTLAFFAGGVHGEIRKILLKHWKDKDNEVLVHEYLPKGQD---Y 368
Query: 292 QRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNER 351
+ ++KFC+CP G +V S R+ ++IH GCVPVI+ + Y LPF+D+L+W +F+V +
Sbjct: 369 TKLMGQSKFCLCPSGHEVASPRVVEAIHAGCVPVIICDNYSLPFSDVLNWSQFSVEIPVE 428
Query: 352 DVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIK 409
+ ++K IL++IS +++ LH N+++V++HF N P FD HM+++ +WLRR IK
Sbjct: 429 KIPEIKSILQSISRNKYLRLHMNVLRVRRHFMINRPTKPFDMMHMILHSIWLRRLNIK 486
>gi|224053619|ref|XP_002297901.1| predicted protein [Populus trichocarpa]
gi|222845159|gb|EEE82706.1| predicted protein [Populus trichocarpa]
Length = 457
Score = 240 bits (612), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 140/370 (37%), Positives = 210/370 (56%), Gaps = 30/370 (8%)
Query: 65 SDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPD----GDP--NTFYQTPRKLT----GKYA 114
S L + +H+ +F ++ EM+ + KI++YPD P N F L G Y
Sbjct: 90 SVLETGPYHNWELFAADFQEMMRHLKIFVYPDTFNRSSPFANIFLPHENPLNNPKLGNYF 149
Query: 115 SEGYFFQNIRESRFLTHDPDRAHLFFIPISCHKMRG--KGTSYENMTVIVKDYVDSLINK 172
SE F ++ S LT P++AH FF+P S + +R + S ++ V Y S+ +
Sbjct: 150 SEHMFKVSLLHSPLLTATPEKAHFFFLPFSINDLRNDPRVHSEAKISQFVAQYTSSISSS 209
Query: 173 YPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAF-IPHKDVALPQV 231
+ +WN + GADHF+V CH VG A L NAI++ C SY F + HKDV LPQV
Sbjct: 210 FRFWNASGGADHFYVCCHSVGREAPSRHHGLRNNAIQLTCCSSYFQRFYLSHKDVGLPQV 269
Query: 232 --------LQPFALPRGGRDVEN---RTVLGFWAGH-RNSKIRVILARVWENDTELDISN 279
L P DV R L ++AG +NS++R L +W NDT+ DI N
Sbjct: 270 WPRTDQTALNPPHASVCYLDVNTYRCRHRLVYFAGRVQNSQVRQQLVNLWGNDTQFDIFN 329
Query: 280 NRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDIL 339
Y+ F ++KFC+ G +VN+AR++D+IHYGC+PVI+SNYYDLPF ++L
Sbjct: 330 GNPT-----FPYEEGFKRSKFCLHVKGYEVNTARVSDAIHYGCIPVIISNYYDLPFANVL 384
Query: 340 DWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMY 399
DW KF+V++N+RD+ LK L +I ++ +++NL KV++HF W++ P +D+F+M Y
Sbjct: 385 DWSKFSVVINQRDIAFLKTKLLSIKREMYLRMYHNLFKVRRHFVWHTTPRGYDSFYMTAY 444
Query: 400 ELWLRRHVIK 409
+LWLRR ++
Sbjct: 445 QLWLRRSTLR 454
>gi|449442160|ref|XP_004138850.1| PREDICTED: probable glycosyltransferase At5g25310-like [Cucumis
sativus]
gi|449530249|ref|XP_004172108.1| PREDICTED: probable glycosyltransferase At5g25310-like [Cucumis
sativus]
Length = 465
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 135/355 (38%), Positives = 201/355 (56%), Gaps = 18/355 (5%)
Query: 68 GSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRE-- 125
S V+H+P F+ +Y+EM FK+Y+YP+G+ + P K Y EG F + +
Sbjct: 114 SSSVYHNPRAFYQSYVEMEKRFKVYVYPEGELPITHAGPCK--NIYTIEGRFIHEMEDGG 171
Query: 126 SRFLTHDPDRAHLFFIPISCHKMRG---KGTSYEN--MTVIVKDYVDSLINKYPYWNRTL 180
+ F T DP RAH+ F+P S M K SY+ + + V DYV+ + KYP+WN+T
Sbjct: 172 NGFRTVDPSRAHVLFMPFSVAWMVKYLYKDGSYDQTPLRMFVSDYVEVVSKKYPFWNKTN 231
Query: 181 GADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV------LQP 234
GADHF + CHD G ATEG FL +IRV+C+ + F P KDV+LP++ + P
Sbjct: 232 GADHFILACHDWGPIATEGNRFLYNTSIRVLCNANSSEGFNPQKDVSLPEIHLYDGEISP 291
Query: 235 FALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRR 294
L +R L F+AG + IR IL W+N T +I + L Y
Sbjct: 292 -KLLSASNSHHHRPHLAFFAGGLHGPIRPILLNHWKNRTHTNI--HVYEYLPKQLDYYDE 348
Query: 295 FYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVY 354
+++FC+CP G +V S RI ++I+ CVPVI+S Y LPF+D+L W F++ ++ ++
Sbjct: 349 MLQSRFCLCPSGYEVASPRIVEAIYAECVPVIISERYVLPFSDVLRWEGFSIEVSVSEIP 408
Query: 355 QLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIK 409
+L++IL +S+ + L L V+KHF N P RFDAFHM+++ +WLRR +K
Sbjct: 409 RLEEILMGVSEERYEKLIQGLRTVRKHFVLNRPAKRFDAFHMILHSVWLRRLNVK 463
>gi|357478275|ref|XP_003609423.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
gi|355510478|gb|AES91620.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
Length = 398
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 128/374 (34%), Positives = 214/374 (57%), Gaps = 21/374 (5%)
Query: 47 LQNNFNSTLSFAINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTP 106
+Q S S KQ + GS ++ +P FH ++ EMV FK+++Y +G+ + P
Sbjct: 18 IQRAIRSKKSTTNMKQSFVPKGS-IYLNPHAFHQSHKEMVKRFKVWVYKEGEQPLVHDGP 76
Query: 107 RKLTGKYASEGYFFQNI---RESRFLTHDPDRAHLFFIPISCHKM-------RGKGTSY- 155
+ KY+ EG F + +S F P+ AH+FF+P S K+ R + Y
Sbjct: 77 --VNNKYSIEGQFIDEMDTSNKSPFKATHPELAHVFFLPFSVSKVIRYVYKPRKSRSDYN 134
Query: 156 -ENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSP 214
+ ++V+DY+ + NKYPYWN + GADHF ++CHD G R + P L K+ IR +C+
Sbjct: 135 PHRLQLLVEDYIKIVANKYPYWNISQGADHFLLSCHDWGPRVSYANPKLFKHFIRALCNA 194
Query: 215 SYDVAFIPHKDVALPQVLQP---FALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWEN 271
+ F P++DV++PQ+ P P + NRT+L F+AG + KIR L + W++
Sbjct: 195 NTSEGFWPNRDVSIPQLNLPVGKLGPPNTDQHPNNRTILTFFAGGAHGKIRKKLLKSWKD 254
Query: 272 DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYY 331
E + + + Y + +KFC+CP G +V S R+ ++I+ GCVPVI+ + Y
Sbjct: 255 KDEEVQVHEYLPKGQD---YTKLMGLSKFCLCPSGHEVASPRVVEAIYAGCVPVIICDNY 311
Query: 332 DLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRF 391
LPF+D+L+W +F++ + + ++K IL+NI++ ++ L++N+ +V+KHF+ N P F
Sbjct: 312 SLPFSDVLNWSQFSMEIAVDRIPEIKTILQNITETKYRVLYSNVRRVRKHFEMNRPAKPF 371
Query: 392 DAFHMVMYELWLRR 405
D HM+++ +WLRR
Sbjct: 372 DLIHMILHSVWLRR 385
>gi|414883870|tpg|DAA59884.1| TPA: hypothetical protein ZEAMMB73_690759, partial [Zea mays]
Length = 577
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 138/371 (37%), Positives = 207/371 (55%), Gaps = 29/371 (7%)
Query: 59 INKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGY 118
+ +DY +G V+ + VFH +YLEM K+Y+Y +G+P F+ P + Y++EG
Sbjct: 205 LPDKDYVPMGP-VYRNAHVFHRSYLEMERQLKVYVYEEGEPPVFHDGPCRSI--YSTEGR 261
Query: 119 FFQNIRES-------RFLTHDPDRAHLFFIPISCHKMRGKGTSYEN-------MTVIVKD 164
F ++ R T DP RAH+FF+P S KM T YE + V D
Sbjct: 262 FIHSMETETEAEEGRRLRTRDPARAHVFFLPFSVVKM--VQTIYEPGSRDMAPLKRTVAD 319
Query: 165 YVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHK 224
YV L +KYPYWNR+LGADHF ++CHD G + L N+IRV+C+ + F P +
Sbjct: 320 YVRVLSSKYPYWNRSLGADHFMLSCHDWGPYVSSANAQLFGNSIRVLCNANTSEGFDPAR 379
Query: 225 DVALPQV-LQPFALPR--GGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDIS--- 278
DV+LPQV L+ A+ R GG R VL F+AG + +R L W
Sbjct: 380 DVSLPQVNLRSDAVERQVGGPSASRRPVLAFFAGGNHGPVRPALLAHWGPGGRRGGDPDV 439
Query: 279 --NNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSN-YYDLPF 335
+ + R G Y +++FC+CPGG +V S R+A++++ GCVPV++ + Y LPF
Sbjct: 440 RVSEYLPRGGGAPSYADMMRRSRFCLCPGGYEVASPRLAEALYLGCVPVVVDDGEYALPF 499
Query: 336 NDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWN-SPPVRFDAF 394
D+LDW FA+ L D+ +LK+IL +S +++ + + V++HF + PP R+DAF
Sbjct: 500 ADVLDWDAFALRLRVADIPRLKEILAAVSPRQYIRMQRRVRMVRRHFMLHGGPPRRYDAF 559
Query: 395 HMVMYELWLRR 405
HM+++ +WLRR
Sbjct: 560 HMILHSVWLRR 570
>gi|356507416|ref|XP_003522463.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
At3g07620-like [Glycine max]
Length = 471
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 126/356 (35%), Positives = 207/356 (58%), Gaps = 19/356 (5%)
Query: 55 LSFAINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYA 114
L+ + DY GS ++ + + F +YLEM FKI++Y +G+P F+ K YA
Sbjct: 122 LTSNLQDPDYVPQGS-IYRNVNAFQRSYLEMEKVFKIFVYEEGEPPLFHNGLSKDI--YA 178
Query: 115 SEGYFFQNIRESRFL-THDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKY 173
+EG F + + R+ T+DPD A ++++P S G + + ++VKDY+ + +K+
Sbjct: 179 TEGRFIHEMEKGRYYRTYDPDEAFVYYLPFS-------GVYVDPLGLVVKDYIQVIAHKH 231
Query: 174 PYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVL- 232
P+WNR+LG DHF ++CHD G + V NAIRV+C+ + F P KDV+ P++
Sbjct: 232 PFWNRSLGYDHFMLSCHDWGPLVSSYVDHFYNNAIRVLCNANVSEGFKPAKDVSFPEIKL 291
Query: 233 --QPFALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPL 289
GG RT+L F+AGH++ IR +L W+N D ++ I + I
Sbjct: 292 IKGEVTNLVGGYPPSQRTILAFFAGHQHGYIRXLLQSTWKNKDQDMQIYE-ELPEGIS-- 348
Query: 290 VYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
Y + +KFC+CP G +V S R+ +I CVPV++S+ Y PF+D+L+W F+V ++
Sbjct: 349 -YYTKLRSSKFCLCPSGYEVASPRVVKAIFAECVPVLISDGYVPPFSDVLNWNSFSVQVD 407
Query: 350 ERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+D+ +K+IL IS+ +++ ++ + +VQ+HF N PP R+D FHM ++ +WLRR
Sbjct: 408 VKDIPNIKKILMGISERQYLRMYKRVKQVQRHFVPNEPPKRYDMFHMTVHSIWLRR 463
>gi|334186614|ref|NP_001190743.1| Exostosin family protein [Arabidopsis thaliana]
gi|332658396|gb|AEE83796.1| Exostosin family protein [Arabidopsis thaliana]
Length = 589
Score = 239 bits (609), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 123/343 (35%), Positives = 204/343 (59%), Gaps = 15/343 (4%)
Query: 65 SDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR 124
+DL + +F + SVF +Y M K+YIYPDGD F++ L G YASEG+F + +
Sbjct: 176 TDLFAPLFRNLSVFKRSYELMELILKVYIYPDGDKPIFHEP--HLNGIYASEGWFMKLME 233
Query: 125 ESR-FLTHDPDRAHLFFIPISCHKMRGK-----GTSYENMTVIVKDYVDSLINKYPYWNR 178
++ F+T +P+RAHLF++P S +++ + + +++ ++DYV+ L KYP+WNR
Sbjct: 234 SNKQFVTKNPERAHLFYMPYSVKQLQKSIFVPGSHNIKPLSIFLRDYVNMLSIKYPFWNR 293
Query: 179 TLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSY-DVAFIPHKDVALPQVL----- 232
T G+DHF V CHD G P L +NAI+ +C+ D F+P KDV+LP+
Sbjct: 294 THGSDHFLVACHDWGPYTVNEHPELKRNAIKALCNADLSDGIFVPGKDVSLPETSIRNAG 353
Query: 233 QPFALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVY 291
+P G V R +L F+AG+ + ++R L + W N D ++ I + + Y
Sbjct: 354 RPLRNIGNGNRVSQRPILAFFAGNLHGRVRPKLLKHWRNKDEDMKIYGPLPHNVARKMTY 413
Query: 292 QRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNER 351
+ +K+C+CP G +VNS RI ++I+Y CVPV++++ + LPF+D+LDW F+V++ E+
Sbjct: 414 VQHMKSSKYCLCPMGYEVNSPRIVEAIYYECVPVVIADNFMLPFSDVLDWSAFSVVVPEK 473
Query: 352 DVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAF 394
++ +LK+IL I ++ + +N+ VQ+HF W+ P + F
Sbjct: 474 EIPRLKEILLEIPMRRYLKMQSNVKMVQRHFLWSPKPRKIKPF 516
>gi|356562965|ref|XP_003549738.1| PREDICTED: probable glycosyltransferase At3g42180-like [Glycine
max]
Length = 473
Score = 239 bits (609), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 129/375 (34%), Positives = 215/375 (57%), Gaps = 23/375 (6%)
Query: 47 LQNNFNSTLSFAINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTP 106
+Q S ++ K + S ++ +P FH +++EM+ K++ Y +G+ + P
Sbjct: 97 IQEAIRSKINTTATKDSFVPKDS-IYWNPHAFHQSHVEMMKRLKVWAYKEGEQPLVHDGP 155
Query: 107 RKLTGKYASEGYFFQNI---RESRFLTHDPDRAHLFFIPISCHKM-------RGKGTSYE 156
+ KY+ EG F + S F P++AHLF +P S K+ R + Y+
Sbjct: 156 --VNNKYSIEGQFIDEMDMASMSPFKATHPEQAHLFLLPYSVSKVIRYVYKPRRSRSDYD 213
Query: 157 --NMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSP 214
+ +V DY++ L N+YPYWNR+ GADHF V+CHD G R ++ P L K IR +C+
Sbjct: 214 PDRLQRLVADYINILANRYPYWNRSKGADHFLVSCHDWGPRISDANPELFKYFIRALCNA 273
Query: 215 SYDVAFIPHKDVALPQVLQP---FALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWEN 271
+ F P++DV++P+V P P G+ NRT+L F+AG + KIR L + W+N
Sbjct: 274 NTSEGFQPNRDVSIPEVYLPSGKLGPPNMGQHPNNRTILAFFAGGAHGKIRKKLLKRWKN 333
Query: 272 -DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNY 330
D E+ + + + + Y + +KFC+CP G +V S R+ ++I+ GCVPVI+ +
Sbjct: 334 KDKEVQV-HEYLPKGQD---YTKLMGLSKFCLCPSGHEVASPRVVEAIYAGCVPVIICDN 389
Query: 331 YDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVR 390
Y LPF D+L+WRKF++ + + ++K IL+++S +++ L++N+ +V++HF N P
Sbjct: 390 YSLPFIDVLNWRKFSMEIAVERMPEIKTILQSVSKDKYLELYSNVRRVRRHFVINRPAKP 449
Query: 391 FDAFHMVMYELWLRR 405
FD HM+++ LWLRR
Sbjct: 450 FDLIHMILHSLWLRR 464
>gi|115470965|ref|NP_001059081.1| Os07g0188700 [Oryza sativa Japonica Group]
gi|34394611|dbj|BAC83913.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
gi|50508944|dbj|BAD31848.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
gi|113610617|dbj|BAF20995.1| Os07g0188700 [Oryza sativa Japonica Group]
gi|125599387|gb|EAZ38963.1| hypothetical protein OsJ_23384 [Oryza sativa Japonica Group]
gi|215707162|dbj|BAG93622.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 606
Score = 238 bits (608), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 127/370 (34%), Positives = 212/370 (57%), Gaps = 18/370 (4%)
Query: 47 LQNNFNSTLSFAINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTP 106
+QN N + +DY +G V+ + FH +YLEM FK+++Y +G+P F+ P
Sbjct: 234 IQNKDNKP---PLTDKDYVPVGP-VYRNAYAFHRSYLEMEKVFKVFVYEEGEPPVFHDGP 289
Query: 107 RKLTGKYASEGYFFQNIR-ESRFLTHDPDRAHLFFIPISCHKM-----RGKGTSYENMTV 160
+ Y++EG F + E+R T DPD+AH+FF+P S KM + +
Sbjct: 290 CRSI--YSTEGRFIYAMEMENRMRTRDPDQAHVFFLPFSVVKMVKMIYEPNSHDMDPLRR 347
Query: 161 IVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAF 220
+ DY++ + KYP+WNR+LGADHF ++CHD G + L N+IRV+C+ + F
Sbjct: 348 TISDYINVVSTKYPHWNRSLGADHFMLSCHDWGPYVSSANGHLFSNSIRVLCNANTSEGF 407
Query: 221 IPHKDVALPQV-LQPFALPR--GGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDI 277
P +DV+LP++ L+ + R GG +R +L F+AG + +R +L + W + DI
Sbjct: 408 DPSRDVSLPEINLRSDVVDRQVGGPSASHRPILAFFAGGDHGPVRPLLLQHWGKGQDADI 467
Query: 278 S-NNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFN 336
+ + R G + Y +++FC+CP G +V S R+ ++I+ CVPV++ + Y LPF
Sbjct: 468 QVSEYLPRRHG-MSYTDMMRRSRFCLCPSGYEVASPRVVEAIYLECVPVVIGDDYTLPFA 526
Query: 337 DILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQ-WNSPPVRFDAFH 395
D+L+W F+V + D+ +LK+IL +S +++ + + V++HF + P RFD FH
Sbjct: 527 DVLNWAAFSVRVAVGDIPRLKEILAAVSPRQYIRMQRRVRAVRRHFMVSDGAPRRFDVFH 586
Query: 396 MVMYELWLRR 405
M+++ +WLRR
Sbjct: 587 MILHSIWLRR 596
>gi|357478279|ref|XP_003609425.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
gi|355510480|gb|AES91622.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
Length = 617
Score = 238 bits (606), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 123/340 (36%), Positives = 193/340 (56%), Gaps = 20/340 (5%)
Query: 81 NYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESR---FLTHDPDRAH 137
+++EMV KI++Y +G+ + P + YA EG F I S+ F P+ AH
Sbjct: 274 SHMEMVKRLKIWVYQEGEQPIVHDGP--VNNIYAIEGQFIDEIDNSKMSPFKAKHPNEAH 331
Query: 138 LFFIPISCHK---------MRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVT 188
+FF+P S M K + + + +V+DYV+ + +KYPYWNR+ GADHF ++
Sbjct: 332 IFFLPFSVANVVQYVYKPIMSKKDFNRDRLHRMVEDYVNVVAHKYPYWNRSNGADHFLLS 391
Query: 189 CHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQP---FALPRGGRDVE 245
CHD ++ P L KN RV+C+ + F P +DV++P+V P P G+
Sbjct: 392 CHDWAPEISDANPNLFKNFTRVLCNANTSEGFQPKRDVSIPEVYLPVGKLGPPNLGQSPL 451
Query: 246 NRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPG 305
NRT+L F++G + IR +L W+N + + + Y +KFC+CP
Sbjct: 452 NRTILAFFSGGAHGDIRKLLLNHWKNKDAQVQVHEYLPKGQN---YTELMGLSKFCLCPS 508
Query: 306 GSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISD 365
G +V S RI ++I+ GCVPVI+S Y LPFND+L+W +F+V + + ++K IL+N++
Sbjct: 509 GYEVASPRIVEAINAGCVPVIISQNYSLPFNDVLNWSEFSVEIPVEKIVEIKNILQNVTK 568
Query: 366 AEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+++ LH N++KVQKHF N P FD HM+++ +WLRR
Sbjct: 569 DKYMKLHMNVMKVQKHFVMNRPAKPFDVMHMILHSIWLRR 608
>gi|356575096|ref|XP_003555678.1| PREDICTED: probable glycosyltransferase At3g07620-like [Glycine
max]
Length = 493
Score = 237 bits (604), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 133/355 (37%), Positives = 202/355 (56%), Gaps = 15/355 (4%)
Query: 60 NKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYF 119
+ DY G D++ + FH +Y M FKI++Y +G+P F+ P K Y+ EG F
Sbjct: 139 DTSDYIPEG-DIYRNAVAFHRSYQLMEKVFKIFVYEEGEPPLFHYGPCK--NIYSMEGIF 195
Query: 120 FQNIR-ESRFLTHDPDRAHLFFIPISC-----HKMRGKGTSYENMTVIVKDYVDSLINKY 173
++ S+F T +PD AH++F+P S H + + DYV + +KY
Sbjct: 196 INSLEINSQFRTQNPDEAHVYFLPFSVVMILEHLFHPVIRDKAVLERTIGDYVHIISHKY 255
Query: 174 PYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQ 233
YWNR+ GADHF ++CHD G RAT V L AIRV+C+ + F P KD + P++
Sbjct: 256 KYWNRSYGADHFMLSCHDWGPRATWYVKELYFIAIRVLCNANISEHFNPKKDASFPEINL 315
Query: 234 PFALPRG---GRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLV 290
RG G NRT+L F+AG + +IR +L + WE + + ++ +
Sbjct: 316 VNGETRGLIGGYPPCNRTILAFFAGQMHGRIRPVLFQHWEGKDKDVLVYEKLPDGVP--- 372
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y K+K+CICP G +V S RI ++I+ CVPVI+S Y LPF+D+L+W F+V +
Sbjct: 373 YHETMKKSKYCICPSGFEVASPRIVEAIYAQCVPVIISQQYVLPFSDVLNWDSFSVQILV 432
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
DV +LK+IL IS+ +++ L + +VQ+HF N+PP R+D FHM+++ +WLRR
Sbjct: 433 SDVPKLKEILLGISEDKYMRLQEGVKQVQRHFVVNNPPKRYDVFHMIIHSIWLRR 487
>gi|296082060|emb|CBI21065.3| unnamed protein product [Vitis vinifera]
Length = 402
Score = 237 bits (604), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 125/350 (35%), Positives = 197/350 (56%), Gaps = 21/350 (6%)
Query: 71 VFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESR--F 128
V+ +P F+ +++EM F+++ Y +G+ F+ P L Y EG F + F
Sbjct: 52 VYRNPYAFYQSHIEMEKRFRVWTYQEGEKPLFHSGPMNLI--YGIEGQFMDELESKNCPF 109
Query: 129 LTHDPDRAHLFFIPISC-------HKMRGKGTSYENMTV--IVKDYVDSLINKYPYWNRT 179
PD A LF+IPIS ++ SY + + +V DY+D + KYP+WNR+
Sbjct: 110 SARHPDEAMLFYIPISVVNIIRYVYQPYTNSASYSRVRLQGLVADYIDVISRKYPFWNRS 169
Query: 180 LGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFAL-- 237
GADHF V+CHD + P ++ IRV+C+ + F P +DV+LP++L P+ +
Sbjct: 170 SGADHFMVSCHDWAPEISAANPKFFRHFIRVLCNANTSEGFKPVRDVSLPEILVPYRMLG 229
Query: 238 -PRGGRDVENRTVLGFWAGHRNSKIRVILARVW-ENDTELDISNNRINRAIGPLVYQRRF 295
P G+ NR++L F+AG + K+R IL W E D ++ + L Y
Sbjct: 230 PPYLGQPPTNRSILAFFAGGAHGKVRSILFHYWKEKDEDIQVHEYLPTT----LNYTELM 285
Query: 296 YKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ 355
++KFC+CP G +V S R+ +SI+ GCVPVI+S+ Y LPF+D+LDW +F+V + + +
Sbjct: 286 GRSKFCLCPSGFEVASPRVVESIYAGCVPVIISDNYSLPFSDVLDWSQFSVHIPIARIPE 345
Query: 356 LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
K IL+ I E+++ +++VQ+HF N P RFD HMV++ +WLRR
Sbjct: 346 TKTILQAIPIQEYLTKQKTVMQVQRHFTLNRPAKRFDVLHMVLHSIWLRR 395
>gi|359476148|ref|XP_002285239.2| PREDICTED: probable glycosyltransferase At5g11130 [Vitis vinifera]
Length = 472
Score = 236 bits (603), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 125/350 (35%), Positives = 197/350 (56%), Gaps = 21/350 (6%)
Query: 71 VFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESR--F 128
V+ +P F+ +++EM F+++ Y +G+ F+ P L Y EG F + F
Sbjct: 122 VYRNPYAFYQSHIEMEKRFRVWTYQEGEKPLFHSGPMNLI--YGIEGQFMDELESKNCPF 179
Query: 129 LTHDPDRAHLFFIPISC-------HKMRGKGTSYENMTV--IVKDYVDSLINKYPYWNRT 179
PD A LF+IPIS ++ SY + + +V DY+D + KYP+WNR+
Sbjct: 180 SARHPDEAMLFYIPISVVNIIRYVYQPYTNSASYSRVRLQGLVADYIDVISRKYPFWNRS 239
Query: 180 LGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFAL-- 237
GADHF V+CHD + P ++ IRV+C+ + F P +DV+LP++L P+ +
Sbjct: 240 SGADHFMVSCHDWAPEISAANPKFFRHFIRVLCNANTSEGFKPVRDVSLPEILVPYRMLG 299
Query: 238 -PRGGRDVENRTVLGFWAGHRNSKIRVILARVW-ENDTELDISNNRINRAIGPLVYQRRF 295
P G+ NR++L F+AG + K+R IL W E D ++ + L Y
Sbjct: 300 PPYLGQPPTNRSILAFFAGGAHGKVRSILFHYWKEKDEDIQVHEYLPTT----LNYTELM 355
Query: 296 YKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ 355
++KFC+CP G +V S R+ +SI+ GCVPVI+S+ Y LPF+D+LDW +F+V + + +
Sbjct: 356 GRSKFCLCPSGFEVASPRVVESIYAGCVPVIISDNYSLPFSDVLDWSQFSVHIPIARIPE 415
Query: 356 LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
K IL+ I E+++ +++VQ+HF N P RFD HMV++ +WLRR
Sbjct: 416 TKTILQAIPIQEYLTKQKTVMQVQRHFTLNRPAKRFDVLHMVLHSIWLRR 465
>gi|356554237|ref|XP_003545455.1| PREDICTED: probable glycosyltransferase At5g25310-like [Glycine
max]
Length = 483
Score = 236 bits (603), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 135/353 (38%), Positives = 198/353 (56%), Gaps = 20/353 (5%)
Query: 71 VFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRES--RF 128
++ + +F+ +YLEM FK+Y+YPDGD + P K Y+ EG F + RF
Sbjct: 135 IYRNARLFYRSYLEMEKIFKVYVYPDGDLPIAHDGPCKDI--YSIEGRFLHEMEHGAGRF 192
Query: 129 LTHDPDRAHLFFIPISCHKM-----RGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGAD 183
T+DP+ AH++F+P S M + + V DYV + ++P+WN T GAD
Sbjct: 193 RTNDPNAAHVYFLPFSVTWMVKYLYTPLSFNVTPLKQFVSDYVRVISTRHPFWNITHGAD 252
Query: 184 HFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV------LQPFAL 237
HF + CHD G A++G PFL +IRV+C+ + F P KDV+LP++ + P L
Sbjct: 253 HFMLACHDWGPHASQGNPFLYNTSIRVLCNANTSEGFNPRKDVSLPEIHLYGGEVSPKLL 312
Query: 238 PRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGP-LVYQRRFY 296
D R L F++G + IR L R W+ND + DI R+ + L Y
Sbjct: 313 -SPPPDTAPRRYLAFFSGGLHGPIRPALLRHWKNDNDDDI---RVYEYLPKDLDYYSFML 368
Query: 297 KTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQL 356
+KFC+CP G +V S RI ++I+ CVPVILS YY LPF+D+L W F+V ++ D+ +L
Sbjct: 369 NSKFCLCPSGHEVASPRIVEAIYAECVPVILSEYYVLPFSDVLQWEAFSVQVDVSDIPRL 428
Query: 357 KQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIK 409
K+IL IS+ ++ L + V+ HF N P RFD FHM+++ +WLRR IK
Sbjct: 429 KEILSAISEDKYRKLKEGVKAVRGHFTLNRPAKRFDVFHMILHSIWLRRLNIK 481
>gi|358348291|ref|XP_003638181.1| hypothetical protein MTR_121s0013 [Medicago truncatula]
gi|355504116|gb|AES85319.1| hypothetical protein MTR_121s0013 [Medicago truncatula]
Length = 550
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 129/348 (37%), Positives = 194/348 (55%), Gaps = 13/348 (3%)
Query: 65 SDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR 124
S L + V+ S F +Y M K+YIY +G+ F+Q K+ G YASEG+F + +
Sbjct: 202 SGLHAPVYREVSKFSRSYELMERKLKVYIYREGEKPIFHQP--KMRGIYASEGWFMKLME 259
Query: 125 -ESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGAD 183
RF+ DP +AHLF++P S +R + + M + YV+ + KY +WNRT GAD
Sbjct: 260 GNKRFIVKDPKKAHLFYLPFSSQMLRANLSDNKKMEQYLDKYVNIIAGKYRFWNRTGGAD 319
Query: 184 HFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALP-----QVLQPFALP 238
HF V CHD R T +KN IR +C+ + F KD LP V+ P
Sbjct: 320 HFLVACHDWASRITRQP---MKNCIRSLCNANVAKGFQIGKDTTLPATYIHSVMNPLR-K 375
Query: 239 RGGRDVENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVYQRRFYK 297
G+ RT+L F+AG + +R IL + WEN + ++ I A G +Y
Sbjct: 376 IAGKHPSERTILAFFAGSMHGYLRPILLKHWENKEPDMKIFGAMARDAEGKRIYMDYMNS 435
Query: 298 TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLK 357
+K+CIC G +V S RI ++I CVPVI+S+ Y PF ++L W F+V + ERDV L+
Sbjct: 436 SKYCICARGYEVYSPRIVEAIFSECVPVIISDNYVPPFFEVLKWEAFSVFVRERDVPNLR 495
Query: 358 QILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
IL +I++ ++++LH + KVQ+HF W+ PV++D FHM+++ +W R
Sbjct: 496 SILLSITEEKYLALHLGVKKVQQHFLWHKVPVKYDLFHMILHSIWNNR 543
>gi|357478261|ref|XP_003609416.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
gi|355510471|gb|AES91613.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
Length = 380
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 128/349 (36%), Positives = 205/349 (58%), Gaps = 21/349 (6%)
Query: 71 VFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNI--RESRF 128
+ HS S ++ EM+ FK++ Y +G+P + P ++ Y EG+F I R S F
Sbjct: 32 ILHS-SRLRRSHKEMLKRFKVWTYKEGEPPLVHDGP--MSSIYGIEGHFMTEIENRLSPF 88
Query: 129 LTHDPDRAHLFFIPISC----HKMRGKGTSY--ENMTVIVKDYVDSLINKYPYWNRTLGA 182
TH+PD AH+F +P+S H + T+Y + + + DY + + +KY YWNR+ GA
Sbjct: 89 STHNPDEAHVFMLPLSVTNMVHYLYNPLTTYSRDQIMHVTIDYTNIIAHKYRYWNRSKGA 148
Query: 183 DHFFVTCHDVG---VRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV-LQPFAL- 237
DH V+CHD R + G L KN I+V+C+ + F P +DV++P++ LQ + L
Sbjct: 149 DHLLVSCHDWAPEISRESSGKE-LFKNLIKVLCNANTSEGFDPKRDVSMPEMNLQGYKLS 207
Query: 238 -PRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFY 296
P ++ NR++L F+AG + IR L W+ + + + + L Y +
Sbjct: 208 SPIPSKESNNRSILAFFAGGEHGMIRKTLLDQWKGKDKEVLVYEYLPKK---LKYFKLMG 264
Query: 297 KTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQL 356
K+KFC+CP G +V S R+ +SI+ GCVPVI+S+ Y LPF+D+LDW KF++ + + + ++
Sbjct: 265 KSKFCLCPSGYEVASPRLVESINTGCVPVIVSDNYQLPFSDVLDWSKFSLHIPSKRISEI 324
Query: 357 KQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
K ILK++ A ++ LH ++KVQ+HF N P FD FHM+++ +WLRR
Sbjct: 325 KTILKSVPHARYLKLHRRVLKVQRHFVLNPPAKPFDVFHMILHSIWLRR 373
>gi|125557509|gb|EAZ03045.1| hypothetical protein OsI_25187 [Oryza sativa Indica Group]
Length = 601
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 124/369 (33%), Positives = 208/369 (56%), Gaps = 16/369 (4%)
Query: 47 LQNNFNSTLSFAINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTP 106
+QN N + +DY +G V+ + FH +YLEM FK+++Y +G+P F+ P
Sbjct: 229 IQNKDNKP---PLTDKDYVPVGP-VYRNAYAFHRSYLEMEKVFKVFVYEEGEPPVFHDGP 284
Query: 107 RKLTGKYASEGYFFQNIR-ESRFLTHDPDRAHLFFIPISCHKM-----RGKGTSYENMTV 160
Y++EG F + E+R T DP++AH+FF+P S KM + +
Sbjct: 285 --CHSIYSTEGRFIYAMEMENRMRTRDPNQAHVFFLPFSVVKMVKMIYEPNSHDMDPLRR 342
Query: 161 IVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAF 220
+ DY++ + KYP+WNR+LGADHF ++CHD G + L N+IRV+C+ + F
Sbjct: 343 TISDYINVVSTKYPHWNRSLGADHFMLSCHDWGPYVSSANGHLFSNSIRVLCNANTSEGF 402
Query: 221 IPHKDVALPQV-LQPFALPR--GGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDI 277
P +DV+LP++ L+ + R GG +R +L F+AG + +R +L + W + DI
Sbjct: 403 DPSRDVSLPEINLRSDVVARQVGGPSASHRPILAFFAGGDHGPVRPLLLQHWGKGQDADI 462
Query: 278 SNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFND 337
+ + Y +++FC+CP G +V S R+ ++I+ CVPV++ + Y LPF D
Sbjct: 463 QVSEYLPRRHSMSYTDMMRRSRFCLCPSGYEVASPRVVEAIYLECVPVVIGDDYALPFAD 522
Query: 338 ILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQ-WNSPPVRFDAFHM 396
+L+W F+V + D+ +LK+IL +S +++ + + V++HF + P RFD FHM
Sbjct: 523 VLNWAAFSVRVAVGDIPRLKEILAAVSPRQYIRMQRRVRAVRRHFMVSDGAPWRFDVFHM 582
Query: 397 VMYELWLRR 405
+++ +WLRR
Sbjct: 583 ILHSIWLRR 591
>gi|168033546|ref|XP_001769276.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162679541|gb|EDQ65988.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 408
Score = 235 bits (599), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 137/391 (35%), Positives = 214/391 (54%), Gaps = 22/391 (5%)
Query: 32 LCFTCLSFNSF--RSYPLQNNFNSTLSFAINKQD--YSD---LGSDVFHSPSVFHLNYLE 84
+ F CL S R+ L+ + T+ K++ YS L ++V+H+ + F Y E
Sbjct: 1 MSFWCLQGTSHPPRNSQLRKAYLRTIRDTRGKEENLYSSEDMLHAEVYHNFTYFKERYAE 60
Query: 85 MVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRES--RFLTHDPDRAHLFFIP 142
M FKIY+Y DG + K G YA+EG F + + +S RF +P +AH+F +P
Sbjct: 61 MNKTFKIYVYRDGFKPLVHGA--KTGGIYATEGLFLKRMEDSNNRFTVSEPSKAHMFLLP 118
Query: 143 ISCHKM-----RGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRAT 197
S +M S + + +YVD+L +KYPYWNRT GADHFFV+CHD +T
Sbjct: 119 YSVRQMVDILQDPYSRSMRPLKTFISNYVDTLASKYPYWNRTHGADHFFVSCHDWAPLST 178
Query: 198 EGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQ---PFALPRGGRDVENRTVLGFWA 254
L N+++VVC+ V F KDV++PQ L+ L G E R L F+A
Sbjct: 179 MLHGELHTNSMKVVCNADLTVNFDIEKDVSIPQTLKGGNQSDLDVGSLGPEERDFLAFYA 238
Query: 255 GHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSAR 313
G + +R +L W+ D + + + + Y + ++++C+CP G +VNS R
Sbjct: 239 GQMHGTVRPVLLDYWKGKDPTMKVYEVLPSDIAVNISYAQHMKRSRYCLCPKGFEVNSPR 298
Query: 314 IADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHN 373
I ++I GCVPVI+++ + LP+ND+LDW KF+V + E D+ LK+IL +IS+ + S+
Sbjct: 299 IVEAILSGCVPVIIADNFVLPYNDVLDWTKFSVTVPEEDIPDLKKILSSISNVTYRSMQR 358
Query: 374 NLVKVQKHFQWNSPP--VRFDAFHMVMYELW 402
L +++HF W P ++D+FHM +Y +W
Sbjct: 359 RLRYIRRHFLWLEDPEDTQYDSFHMTLYSIW 389
>gi|255548866|ref|XP_002515489.1| catalytic, putative [Ricinus communis]
gi|223545433|gb|EEF46938.1| catalytic, putative [Ricinus communis]
Length = 481
Score = 234 bits (598), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 131/348 (37%), Positives = 198/348 (56%), Gaps = 19/348 (5%)
Query: 71 VFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRES--RF 128
++ +P+ F+ +Y+EM FK+Y+Y +GDP + P K Y EG F + R+
Sbjct: 134 IYRNPAAFYQSYMEMERRFKVYVYSEGDPPIVHDGPCK--DIYTIEGRFIHEMEHGARRY 191
Query: 129 LTHDPDRAHLFFIPISCHKMRG---KGTSYEN--MTVIVKDYVDSLINKYPYWNRTLGAD 183
T DP RAH++F+P S M K +Y++ + V DYV L KYP+WNRT GAD
Sbjct: 192 RTRDPKRAHVYFMPFSVTWMVKYLYKPLTYDHSPLRQFVADYVRVLSTKYPFWNRTHGAD 251
Query: 184 HFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQ-----PFALP 238
HF + CHD G A+ G L +IRV+C+ + F P KDV+LP++ P L
Sbjct: 252 HFMLACHDWGPHASRGDHLLYNTSIRVLCNANTSEGFNPRKDVSLPEIHLYGGNVPPQLL 311
Query: 239 RGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGP-LVYQRRFYK 297
R L F+AG + IR +L + W+ D E D+ R+ + L Y +
Sbjct: 312 SPPPANTTRPHLAFFAGGLHGPIRPLLLKHWK-DRESDL---RVFEYLPKHLDYYSFMLR 367
Query: 298 TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLK 357
+KFC+CP G +V S RI +SI+ CVPVILS++Y LPF+D+L W F++ LN ++ +L+
Sbjct: 368 SKFCLCPSGHEVASPRIVESIYAECVPVILSDHYVLPFSDVLRWDAFSIQLNVSEIPRLE 427
Query: 358 QILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
++L+++ + ++ L L V+ HF N P RFD FHM+++ +WLRR
Sbjct: 428 EVLRSVPEEKYERLKEGLRTVRTHFMLNQPAKRFDVFHMILHSIWLRR 475
>gi|356562054|ref|XP_003549290.1| PREDICTED: probable glycosyltransferase At5g25310-like [Glycine
max]
Length = 482
Score = 234 bits (596), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 140/372 (37%), Positives = 205/372 (55%), Gaps = 25/372 (6%)
Query: 53 STLSFAINKQD-----YSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPR 107
S LS A +D Y G+ ++ + +F+ +YLEM FK+Y+YPDGD + P
Sbjct: 111 SNLSVATTARDDSYHRYVPAGA-IYRNARLFYRSYLEMEKIFKVYVYPDGDLPIAHDGPC 169
Query: 108 KLTGKYASEGYFFQNIRES--RFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTV----- 160
K Y+ EG F + RF T+DP+ AH+FF+P S M + + V
Sbjct: 170 K--DIYSIEGRFLHEMEHGAGRFRTNDPNAAHVFFLPFSVTWMVKYLYTPLSFNVTPLKK 227
Query: 161 IVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAF 220
V DYV + ++P+WN T GADHF + CHD G A++G PFL +IRV+C+ + F
Sbjct: 228 FVSDYVRVVSTRHPFWNITHGADHFMLACHDWGPHASQGNPFLYNTSIRVLCNANTSEGF 287
Query: 221 IPHKDVALPQV------LQPFALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTE 274
P KDV+LP++ + P L D R L F++G + IR L W+N E
Sbjct: 288 NPRKDVSLPEIHLYGGEVSP-KLLSPPPDTAPRRYLAFFSGGLHGPIRPALLGHWKNHDE 346
Query: 275 LDISNNRINRAIGP-LVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDL 333
D+ R+ + L Y +KFC+CP G +V S RI ++I+ CVPVILS YY L
Sbjct: 347 NDVI--RVYEYLPKDLDYYSFMLTSKFCLCPSGHEVASPRIVEAIYAECVPVILSEYYVL 404
Query: 334 PFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDA 393
PF+D+L W F+V ++ D+ +LK+IL IS+ ++ L + V++HF N P RFD
Sbjct: 405 PFSDVLQWEAFSVQVDVSDIPRLKEILSAISEDKYRKLKEGVKAVRRHFTLNRPAKRFDV 464
Query: 394 FHMVMYELWLRR 405
FHM+++ +WLRR
Sbjct: 465 FHMILHSIWLRR 476
>gi|297802718|ref|XP_002869243.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297315079|gb|EFH45502.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 583
Score = 233 bits (595), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 125/353 (35%), Positives = 197/353 (55%), Gaps = 25/353 (7%)
Query: 71 VFHSP-----SVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRE 125
+ H+P S+F +Y M K+Y+Y +G ++ L G YASEG+F + ++
Sbjct: 234 LLHTPLYWNLSMFKRSYELMEKKLKVYVYREGKRPVLHKPV--LKGIYASEGWFMKQLKS 291
Query: 126 SR-FLTHDPDRAHLFFIPISCHKMRGK-----GTSYENMTVIVKDYVDSLINKYPYWNRT 179
SR F+T DP +AHLF++P S + S +N+ +K+Y+D + +KY +WN+T
Sbjct: 292 SRTFVTKDPRKAHLFYLPFSSKMLEETLYVPGSHSDQNLIQFLKNYLDMISSKYNFWNKT 351
Query: 180 LGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPR 239
G+DHF V CHD T + IR +C+ F+ KDVALP+ +PR
Sbjct: 352 GGSDHFLVACHDWAPSETRQ---YMAKCIRALCNSDVSEGFVFGKDVALPETT--ILVPR 406
Query: 240 ------GGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISN-NRINRAIGPLVYQ 292
GG+ V R +L F+AG + +R +L R W + + D+ + I ++ G Y
Sbjct: 407 RPLRALGGKPVSQRQILAFFAGGMHGYLRPLLLRNWGGNRDPDMKIFSEIPKSKGKKSYM 466
Query: 293 RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERD 352
+KFCICP G +VNS R+ +++ Y CVPVI+S+ + PF ++L+W FAV + E+D
Sbjct: 467 EYMKSSKFCICPKGHEVNSPRVVEALFYECVPVIISDNFVPPFFEVLNWEAFAVFVLEKD 526
Query: 353 VYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+ LK IL +I++ + + + VQKHF W+S P RFD FHM+++ +W R
Sbjct: 527 IPDLKNILVSITEERYREMQTRVKMVQKHFLWHSKPERFDIFHMILHSIWYNR 579
>gi|449443279|ref|XP_004139407.1| PREDICTED: probable glycosyltransferase At3g42180-like [Cucumis
sativus]
gi|449494024|ref|XP_004159425.1| PREDICTED: probable glycosyltransferase At3g42180-like [Cucumis
sativus]
Length = 447
Score = 233 bits (595), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 128/340 (37%), Positives = 198/340 (58%), Gaps = 21/340 (6%)
Query: 81 NYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNI--RESRFLTHDPDRAHL 138
+++EMV FK++ Y +G+ F+ P L YA EG F + +S F PD AH+
Sbjct: 107 SHIEMVKRFKVWSYREGEQPLFHDGP--LNSIYAIEGQFIDELDCSKSPFRASHPDEAHV 164
Query: 139 FFIPISC----HKMRGKGTS-----YENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTC 189
F +P+S H + TS + M + DY+ + N+YPYWNR+ GADHF V+C
Sbjct: 165 FLLPLSITNIIHFIYRPITSPADYNRDRMHRVTTDYIRVVANRYPYWNRSNGADHFVVSC 224
Query: 190 HDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV-LQPFAL--PRGGRDVEN 246
HD ++ P L KN IRVVC+ + F P+ D+ LP++ + P L P G+ E
Sbjct: 225 HDWAPEISDANPQLFKNFIRVVCNANITEGFRPNIDIPLPEINIHPGTLGPPDLGQPPER 284
Query: 247 RTVLGFWAGHRNSKIRVILARVW-ENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPG 305
R +L F+AG + IR IL + W E D E+ + Y + ++KFC+CP
Sbjct: 285 RPILAFFAGGAHGYIRKILIKHWKEKDNEVQVHEYLPKTQ----NYTKLIGESKFCLCPS 340
Query: 306 GSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISD 365
G +V S R+ ++I+ GCVPVI+S+ Y LPF+D+LDW +F+V + + + ++K ILK IS+
Sbjct: 341 GYEVASPRVVEAIYGGCVPVIISDNYSLPFSDVLDWSRFSVQIPVQRIPEIKTILKAISE 400
Query: 366 AEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+++ L+ ++KV++HF+ N P FD HM+++ LWLRR
Sbjct: 401 EKYLKLYKGVIKVKRHFKINRPAKPFDVIHMLLHSLWLRR 440
>gi|168029668|ref|XP_001767347.1| predicted protein [Physcomitrella patens subsp. patens]
gi|168029742|ref|XP_001767384.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162681411|gb|EDQ67838.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162681448|gb|EDQ67875.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 333
Score = 233 bits (594), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 122/330 (36%), Positives = 195/330 (59%), Gaps = 14/330 (4%)
Query: 85 MVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR-ESRFLTHDPDRAHLFFIPI 143
M FK+Y+Y +G+ + P K YA EG F Q ++ ++ F+THDPD AH++F+P
Sbjct: 1 MERRFKVYVYSEGEEPLVHNGPCKEI--YAVEGRFIQELQGDNPFVTHDPDNAHVYFLPF 58
Query: 144 SCHKM-----RGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATE 198
S M + + + + V DYVD L++KYP+WNR+ GADHF ++CHD G T
Sbjct: 59 SVAMMVAYLYEKESGDMDPLRLFVGDYVDVLMHKYPFWNRSGGADHFMLSCHDWGPLITR 118
Query: 199 GVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV-LQPFALPR--GGRDVENRTVLGFWAG 255
L +IRV+C+ + ++P KDV+LP++ L +P GG ++R L F+AG
Sbjct: 119 ENMNLGTRSIRVLCNANSSEGYVPWKDVSLPEIHLVGGHIPAELGGPPAKDRPHLAFFAG 178
Query: 256 HRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIA 315
+ +R L + WE + I + L Y +++CICPGG +VNS RI
Sbjct: 179 RDHGPVRPQLFKHWEGKDDDVIVYQWLPAH---LKYHELMKTSRYCICPGGYEVNSPRIV 235
Query: 316 DSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNL 375
++I+ CVPVI+++ + LPF+D+L+W F++ + E D+ LK IL+N++ + S+ +
Sbjct: 236 EAIYNECVPVIIADSFILPFSDVLNWESFSLHVKESDIPNLKSILQNVTMETYTSMQERV 295
Query: 376 VKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+VQ+HF + PP R+D FHM+++ +WLRR
Sbjct: 296 SQVQRHFVLHQPPKRYDVFHMILHSVWLRR 325
>gi|334187609|ref|NP_196674.2| putative glycosyltransferase [Arabidopsis thaliana]
gi|292630779|sp|Q9LFP3.2|GLYT4_ARATH RecName: Full=Probable glycosyltransferase At5g11130
gi|332004254|gb|AED91637.1| putative glycosyltransferase [Arabidopsis thaliana]
Length = 480
Score = 233 bits (594), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 129/362 (35%), Positives = 207/362 (57%), Gaps = 22/362 (6%)
Query: 60 NKQDYSDLG----SDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYAS 115
++ + SD+G V+ + FH ++ EM FKI+ Y +G+ F++ P L YA
Sbjct: 116 DRTNNSDVGVVSNGSVYLNAFTFHQSHKEMEKRFKIWTYREGEAPLFHKGP--LNNIYAI 173
Query: 116 EGYFFQNIRE--SRFLTHDPDRAHLFFIPISCHKM----RGKGTSY--ENMTVIVKDYVD 167
EG F I SRF P+ A +F+IP+ + TSY + + IVKDY+
Sbjct: 174 EGQFMDEIENGNSRFKAASPEEATVFYIPVGIVNIIRFVYRPYTSYARDRLQNIVKDYIS 233
Query: 168 SLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVA 227
+ N+YPYWNR+ GADHFF++CHD + P L K+ IR +C+ + F P +DV+
Sbjct: 234 LISNRYPYWNRSRGADHFFLSCHDWAPDVSAVDPELYKHFIRALCNANSSEGFTPMRDVS 293
Query: 228 LPQVLQP---FALPRGGRDVENRTVLGFWAGHRNSKIRVILARVW-ENDTELDISNNRIN 283
LP++ P G +NR +L F+AG + +R IL + W E D ++ + N
Sbjct: 294 LPEINIPHSQLGFVHTGEPPQNRKLLAFFAGGSHGDVRKILFQHWKEKDKDVLVYEN--- 350
Query: 284 RAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRK 343
+ Y + K KFC+CP G +V S RI +S++ GCVPVI+++YY LPF+D+L+W+
Sbjct: 351 -LPKTMNYTKMMDKAKFCLCPSGWEVASPRIVESLYSGCVPVIIADYYVLPFSDVLNWKT 409
Query: 344 FAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWL 403
F+V + + +K+IL+ I++ E++++ +++V+KHF N P +D HM+M+ +WL
Sbjct: 410 FSVHIPISKMPDIKKILEAITEEEYLNMQRRVLEVRKHFVINRPSKPYDMLHMIMHSIWL 469
Query: 404 RR 405
RR
Sbjct: 470 RR 471
>gi|255548864|ref|XP_002515488.1| catalytic, putative [Ricinus communis]
gi|223545432|gb|EEF46937.1| catalytic, putative [Ricinus communis]
Length = 474
Score = 233 bits (594), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 124/369 (33%), Positives = 206/369 (55%), Gaps = 28/369 (7%)
Query: 54 TLSFAINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKY 113
T +++ +KQ+ ++ +P +FH +Y+EM FK++ Y +G+P F+ P K Y
Sbjct: 106 TRNYSSHKQESFIPRGSIYLNPYIFHQSYIEMEKRFKVWTYKEGEPPLFHNGPMKEI--Y 163
Query: 114 ASEGYFFQNIR--ESRFLTHDPDRAHLFFIPISCHKMRG------KGTSYENMTVIVKDY 165
++EG F +S F PD AH FF+P+S + S + +VKDY
Sbjct: 164 STEGQFIDEFESGKSLFSARRPDEAHAFFLPVSIVSIVKYVYRPYSDYSRIRLQNVVKDY 223
Query: 166 VDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKD 225
V + +KYP+WNR+ GADHF +CHD + G P L K RV+C+ + F+P +D
Sbjct: 224 VGVISSKYPFWNRSDGADHFLTSCHDWAPDVSAGHPELYKYFTRVLCNANTSEGFVPERD 283
Query: 226 VALPQVL--------QPFALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWEN-DTELD 276
V+LP++ +P +LP ++R +L F+AG + +R L W+ D ++
Sbjct: 284 VSLPEIRLRDRKLSPEPHSLP-----PKDRRILAFFAGGEHGHVRTKLFEHWKGKDRDVQ 338
Query: 277 ISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFN 336
+ L Y +KFC+CP G +V S R+ ++I+ GCVPVI+S+YY LPF+
Sbjct: 339 V----YEYLPKTLNYTELMSHSKFCLCPSGWEVASPRVPEAIYSGCVPVIISDYYYLPFS 394
Query: 337 DILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHM 396
D+LDW KF+V + + ++K +L+ I +++++ +++VQ+HF+ N P +D HM
Sbjct: 395 DVLDWSKFSVHIPVARIPEIKTVLQKIPMRKYLTMQKRVIQVQRHFKLNRPAKPYDVLHM 454
Query: 397 VMYELWLRR 405
V++ +WLRR
Sbjct: 455 VLHSIWLRR 463
>gi|357142838|ref|XP_003572711.1| PREDICTED: probable glycosyltransferase At5g20260-like
[Brachypodium distachyon]
Length = 484
Score = 233 bits (593), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 127/350 (36%), Positives = 189/350 (54%), Gaps = 17/350 (4%)
Query: 71 VFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRE--SRF 128
++ + H +Y+EM KI+ Y +G+P + P T Y+ EG F + E +RF
Sbjct: 130 IYRNARASHRSYVEMEKRLKIWTYSEGEPPLAHLAPG--TDIYSIEGQFLAEMEEPLNRF 187
Query: 129 LTHDPDRAHLFFIPISCHKM-----RGKGTSY-ENMTVIVKDYVDSLINKYPYWNRTLGA 182
PD A++F +PIS + R T++ + ++ DYV + +K+PYWNR+ GA
Sbjct: 188 AARHPDEANVFLLPISVCNLVHYVYRLNTTAHLAPLRKLLADYVAVISDKHPYWNRSGGA 247
Query: 183 DHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQP---FALPR 239
DH V+CHD +EG P L NAIRV+C+ + F+P KD LP+V LP
Sbjct: 248 DHVLVSCHDWAPLVSEGSPELRDNAIRVLCNANVSEGFVPRKDATLPEVNLADGVLRLPT 307
Query: 240 GGRDVENRTVLGFWAGHRNSKIRVILARVW--ENDTELDISNNRINRAIGPLV--YQRRF 295
G +NRT L F+AG +IR L W D E+D+ GP Y
Sbjct: 308 QGLPRQNRTTLAFFAGGMLGEIRRALLEQWAGREDPEMDVHEYLPPHGGGPGYDDYHALM 367
Query: 296 YKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ 355
+ +FC+CP G +V S R+ +S+ GCVPVI+S Y LPF D+LDW K +V + + +
Sbjct: 368 GRARFCLCPSGFEVASPRVVESVFAGCVPVIISEGYPLPFGDVLDWSKMSVAVPAARIPE 427
Query: 356 LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
LK IL+ +S+ + L +++ Q+HF + P RFD HMV++ +WLRR
Sbjct: 428 LKAILRGVSERRYRVLRARVLQAQRHFVLHRPARRFDMIHMVLHSIWLRR 477
>gi|356510025|ref|XP_003523741.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 619
Score = 232 bits (591), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 127/341 (37%), Positives = 197/341 (57%), Gaps = 17/341 (4%)
Query: 76 SVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESR-FLTHDPD 134
S+F +Y M K+YIY DGD F++ L G YASEG+F + + ++ F+T DP
Sbjct: 278 SMFRRSYELMENMLKVYIYQDGDRPIFHEP--LLDGIYASEGWFMKLMEANKQFVTRDPG 335
Query: 135 RAHLFFIPISCHKMRG-----KGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTC 189
+AHLF+IP S ++ N+ +K+YVD + KYP+WNRT GADHF V C
Sbjct: 336 KAHLFYIPFSSRLLQQTLYVRNSHRRSNLIEYMKNYVDMIAGKYPFWNRTSGADHFVVAC 395
Query: 190 HDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV-LQPFALPR---GGRDVE 245
HD T G + + IR +C+ +V F KDV+LP+ ++ P GG
Sbjct: 396 HDWAPAETRG---RMLSCIRALCNADIEVGFKIGKDVSLPETYIRSSENPVKNIGGDPPS 452
Query: 246 NRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVYQRRFYKTKFCICP 304
R +L F+AG + +R IL + WEN + ++ IS + G + Y + +KFCIC
Sbjct: 453 KRPILAFFAGGLHGYVRPILLKHWENKEPDMKISG-PLPHVRGNVNYIQLMKSSKFCICA 511
Query: 305 GGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNIS 364
G +VNS R+ ++I + C+PVI+S+ + PF +IL+W FAV + E ++ L+ IL +IS
Sbjct: 512 RGHEVNSPRVVEAIFHECIPVIISDNFIPPFFEILNWESFAVFVKEEEIPNLRNILLSIS 571
Query: 365 DAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+ ++ +H KVQ+HF W++ PV++D FHM+++ +W R
Sbjct: 572 EERYLEMHKRAKKVQEHFLWHAEPVKYDLFHMLLHSIWYNR 612
>gi|302761638|ref|XP_002964241.1| xylosyltransferase-like protein [Selaginella moellendorffii]
gi|300167970|gb|EFJ34574.1| xylosyltransferase-like protein [Selaginella moellendorffii]
Length = 332
Score = 231 bits (589), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 120/331 (36%), Positives = 195/331 (58%), Gaps = 16/331 (4%)
Query: 85 MVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR-ESRFLTHDPDRAHLFFIPI 143
M K++ YP+G+ + P K YA EG F Q ++ ++ +LT DP++AHLFF+P
Sbjct: 1 MEKRLKVFAYPEGEEPLVHNGPCKEI--YAIEGRFIQELQGKNSYLTSDPEKAHLFFLPF 58
Query: 144 SCHKM-----RGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATE 198
S M + +DY+D + ++Y WNR+ GADHF V+CHD G +
Sbjct: 59 SVAMMVTYLYTPGSHDMGPLGRFTRDYIDVISHRYSAWNRSRGADHFMVSCHDWGPHISR 118
Query: 199 GVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV-LQPFALPR--GGRDVENRTVLGFWAG 255
P L+ N+IRV+C+ + ++P KD +LP++ L +P GG E R L F+AG
Sbjct: 119 AHPDLMANSIRVLCNANTSEGYVPSKDASLPEIHLVGGQVPSVLGGPPPEERRYLAFFAG 178
Query: 256 HRNSKIRVILARVW-ENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARI 314
+ +R +L + W E D ++ + +R Y +K+C+CPGG +VNS RI
Sbjct: 179 GDHGPVRPVLFKYWKEKDEDVRVFEKLPSRD----AYLDYMSHSKYCLCPGGYEVNSPRI 234
Query: 315 ADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNN 374
++I+ CVPV++++ + LPF+D+LDW F+V + ERD+ +LK IL+ I A ++ +
Sbjct: 235 VEAIYNDCVPVVIADDFVLPFSDVLDWDAFSVKVLERDIPRLKTILQAIPTARYLEMQAR 294
Query: 375 LVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+ KV++HF++N PP R+D F+M+++ +WLRR
Sbjct: 295 VSKVRRHFRFNQPPERYDVFNMILHSVWLRR 325
>gi|449531189|ref|XP_004172570.1| PREDICTED: probable glycosyltransferase At5g03795-like, partial
[Cucumis sativus]
Length = 344
Score = 231 bits (589), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 130/346 (37%), Positives = 203/346 (58%), Gaps = 32/346 (9%)
Query: 81 NYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESR-FLTHDPDRAHLF 139
+Y M K+YIY DG F+ TP L G YASEG+F + + E+R F+T DP++AHLF
Sbjct: 1 SYELMELILKVYIYRDGSRPIFH-TPH-LRGIYASEGWFMKLMEENRQFVTKDPEKAHLF 58
Query: 140 FIPISCHKMRGK-----GTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV 194
++ S +++ + + +++ ++D+V+ + KYPYWNRT G DHF V CHD G
Sbjct: 59 YLAYSSRQLQTALYVPDSHNMKPLSIYLRDHVNWIAGKYPYWNRTHGYDHFLVACHDWGP 118
Query: 195 RATEGVPFLIKNAIRVVCSPSY-DVAFIPHKDVALPQVLQPFALPR------GGRDVENR 247
L ++ I+ +C+ + F KDV+LP+ PR GG+ V R
Sbjct: 119 YTVNEHRELSQHTIKALCNADLSEGVFKLGKDVSLPETT--IRTPRKPLRNVGGKRVSQR 176
Query: 248 TVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPL--------VYQRRFYKTK 299
+L F+AG+ + ++R IL + W ND + DI R GPL Y + +K
Sbjct: 177 PILAFFAGNMHGRVRPILLKHW-NDKDDDI------RVYGPLPLRVSRKMTYIQHMKSSK 229
Query: 300 FCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQI 359
+CICP G +VNS RI ++I+Y CVPVI+++ + LPF++ LDW F+V++ E+D+ +LK+I
Sbjct: 230 YCICPMGYEVNSPRIIEAIYYECVPVIIADNFVLPFSEFLDWSAFSVVVAEKDIPKLKEI 289
Query: 360 LKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
L I ++++ N+ VQKHF WN P+++D FHMV++ +W R
Sbjct: 290 LTAIPLKRYLTMQINVKMVQKHFLWNPKPLKYDLFHMVLHSIWFSR 335
>gi|359484708|ref|XP_002264880.2| PREDICTED: probable glycosyltransferase At5g20260-like [Vitis
vinifera]
Length = 467
Score = 231 bits (589), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 125/358 (34%), Positives = 201/358 (56%), Gaps = 21/358 (5%)
Query: 61 KQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFF 120
K+++ G+ V+ +P FH +++EM FKI+ Y +GD + P+ Y EG F
Sbjct: 111 KENFIPRGA-VYRNPYAFHQSHIEMEKRFKIWTYKEGDQPLVHGGPK--NSIYGIEGQFM 167
Query: 121 QNIR--ESRFLTHDPDRAHLFFIPIS----CHKMRGKGTSYEN--MTVIVKDYVDSLINK 172
+ +S F+ PD AH+F+IPIS H + Y + +V DY+ + NK
Sbjct: 168 DEMESGDSHFMAGHPDVAHVFYIPISVTRIAHYIYSPPVDYSGHMLQRLVTDYIYVVSNK 227
Query: 173 YPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVL 232
YPYWNR+ GADHF V+CHD + P L K+ IRV+C+ + F P +D++LP+V
Sbjct: 228 YPYWNRSNGADHFLVSCHDWAPEISIVTPDLYKHFIRVLCNANTSERFQPIRDISLPEVN 287
Query: 233 QP---FALPRGGRDVENRTVLGFWAGHRNSKIRVILARVW-ENDTELDISNN-RINRAIG 287
P P + R +L F++G + +R +L R W END E+ + + NR
Sbjct: 288 IPKGKLGPPHLDKPPNQRHILAFFSGRESGYMRTLLFRSWKENDDEVQVYEHLPSNRD-- 345
Query: 288 PLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVI 347
Y + +KFC+CP G +V S R+ ++I GCVPVI+ +YY LPF+++LDW KF++
Sbjct: 346 ---YAKSMVDSKFCLCPSGWEVASPRVVEAIAAGCVPVIICDYYVLPFSEVLDWSKFSIN 402
Query: 348 LNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+ + ++K+ILK + + ++ + + +VQ+HF N P +D HM+++ +WLRR
Sbjct: 403 ITSDKIPEIKKILKAVPNERYLRMQKRVKQVQRHFVINRPAQPYDMLHMILHSVWLRR 460
>gi|296084507|emb|CBI25528.3| unnamed protein product [Vitis vinifera]
Length = 862
Score = 231 bits (588), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 121/348 (34%), Positives = 195/348 (56%), Gaps = 17/348 (4%)
Query: 69 SDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR--ES 126
DV+ +P FH +++EM FKI+ Y +GD + P G YA EG F +I +S
Sbjct: 514 GDVYRNPYAFHQSHIEMEKRFKIWAYREGDQPLMHDGPS--NGIYAIEGQFMDDIESGKS 571
Query: 127 RFLTHDPDRAHLFFIPISCHKMRG---KGTSYENMTV--IVKDYVDSLINKYPYWNRTLG 181
FL PD A+ F+IP+S K+ + Y + +V DY++ + +KYPYWNR+ G
Sbjct: 572 HFLARRPDEANAFYIPMSLTKIVHFIYEPPHYYGKWIPRLVTDYINFVADKYPYWNRSKG 631
Query: 182 ADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQP---FALP 238
ADHF V+CHD + P L K+ IR +C+ + F P +D+++P++ P P
Sbjct: 632 ADHFLVSCHDWAPDVSALKPDLYKHFIRALCNANTSERFHPIRDISIPEINIPRGKLGPP 691
Query: 239 RGGRDVENRTVLGFWAGHRNSKIRVILARVW-ENDTELDISNNRINRAIGPLVYQRRFYK 297
+ R +L F+AG + +R +L + W E D E+ + R G Y +
Sbjct: 692 HLDQPPNKRPILAFFAGGAHGYVRSVLFKYWKEKDDEVQV----FERLPGNRNYSKSMGD 747
Query: 298 TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLK 357
+KFC+CP G +V S RI ++I GCVP+I+ ++Y LPF+D+LDW KF++ + + ++K
Sbjct: 748 SKFCLCPSGYEVASPRIVEAIAAGCVPMIICDHYSLPFSDVLDWSKFSIYITSDKIPEIK 807
Query: 358 QILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+ILK + ++ + + +VQ+HF N P +D HM+++ +WLRR
Sbjct: 808 KILKAVPTETYLEMQKRVKQVQRHFAINRPARPYDMLHMILHSVWLRR 855
>gi|302815783|ref|XP_002989572.1| xylosyltransferase-like protein [Selaginella moellendorffii]
gi|300142750|gb|EFJ09448.1| xylosyltransferase-like protein [Selaginella moellendorffii]
Length = 332
Score = 230 bits (587), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 120/331 (36%), Positives = 194/331 (58%), Gaps = 16/331 (4%)
Query: 85 MVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR-ESRFLTHDPDRAHLFFIPI 143
M K++ YP+G+ + P K YA EG F Q ++ ++ +LT DP++AHLFF+P
Sbjct: 1 MEKRLKVFAYPEGEEPLVHNGPCKEI--YAIEGRFIQELQGKNSYLTSDPEKAHLFFLPF 58
Query: 144 SCHKM-----RGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATE 198
S M + +DY+D + ++Y WNR+ GADHF V+CHD G +
Sbjct: 59 SVAMMVTYLYTPGSHDMGPLGRFTRDYIDVISHRYSSWNRSRGADHFMVSCHDWGPHISR 118
Query: 199 GVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV-LQPFALPR--GGRDVENRTVLGFWAG 255
P L+ N+IRV+C+ + ++P KD +LP++ L +P GG E R L F+AG
Sbjct: 119 AHPDLMANSIRVLCNANTSEGYVPSKDASLPEIHLVGGQVPSVLGGPPPEERRYLAFFAG 178
Query: 256 HRNSKIRVILARVW-ENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARI 314
+ +R +L + W E D ++ + +R Y +K+C+CPGG +VNS RI
Sbjct: 179 GDHGPVRPVLFKYWKEKDEDVRVFEKLPSRD----AYLDYMSHSKYCLCPGGYEVNSPRI 234
Query: 315 ADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNN 374
++I+ CVPV++++ + LPF+D+LDW F+V + ERD+ +LK IL+ I A ++ +
Sbjct: 235 VEAIYNDCVPVVIADDFVLPFSDVLDWDAFSVKVLERDIPRLKTILQAIPTARYLEMQAR 294
Query: 375 LVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
KV++HF++N PP R+D F+M+++ +WLRR
Sbjct: 295 ASKVRRHFRFNQPPERYDVFNMILHSVWLRR 325
>gi|359484706|ref|XP_002266646.2| PREDICTED: probable glycosyltransferase At5g20260-like [Vitis
vinifera]
Length = 480
Score = 230 bits (587), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 121/348 (34%), Positives = 195/348 (56%), Gaps = 17/348 (4%)
Query: 69 SDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR--ES 126
DV+ +P FH +++EM FKI+ Y +GD + P G YA EG F +I +S
Sbjct: 132 GDVYRNPYAFHQSHIEMEKRFKIWAYREGDQPLMHDGPS--NGIYAIEGQFMDDIESGKS 189
Query: 127 RFLTHDPDRAHLFFIPISCHKMRG---KGTSYENMTV--IVKDYVDSLINKYPYWNRTLG 181
FL PD A+ F+IP+S K+ + Y + +V DY++ + +KYPYWNR+ G
Sbjct: 190 HFLARRPDEANAFYIPMSLTKIVHFIYEPPHYYGKWIPRLVTDYINFVADKYPYWNRSKG 249
Query: 182 ADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQP---FALP 238
ADHF V+CHD + P L K+ IR +C+ + F P +D+++P++ P P
Sbjct: 250 ADHFLVSCHDWAPDVSALKPDLYKHFIRALCNANTSERFHPIRDISIPEINIPRGKLGPP 309
Query: 239 RGGRDVENRTVLGFWAGHRNSKIRVILARVW-ENDTELDISNNRINRAIGPLVYQRRFYK 297
+ R +L F+AG + +R +L + W E D E+ + R G Y +
Sbjct: 310 HLDQPPNKRPILAFFAGGAHGYVRSVLFKYWKEKDDEVQV----FERLPGNRNYSKSMGD 365
Query: 298 TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLK 357
+KFC+CP G +V S RI ++I GCVP+I+ ++Y LPF+D+LDW KF++ + + ++K
Sbjct: 366 SKFCLCPSGYEVASPRIVEAIAAGCVPMIICDHYSLPFSDVLDWSKFSIYITSDKIPEIK 425
Query: 358 QILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+ILK + ++ + + +VQ+HF N P +D HM+++ +WLRR
Sbjct: 426 KILKAVPTETYLEMQKRVKQVQRHFAINRPARPYDMLHMILHSVWLRR 473
>gi|296084508|emb|CBI25529.3| unnamed protein product [Vitis vinifera]
Length = 388
Score = 230 bits (586), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 125/359 (34%), Positives = 201/359 (55%), Gaps = 21/359 (5%)
Query: 60 NKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYF 119
K+++ G+ V+ +P FH +++EM FKI+ Y +GD + P+ Y EG F
Sbjct: 31 KKENFIPRGA-VYRNPYAFHQSHIEMEKRFKIWTYKEGDQPLVHGGPK--NSIYGIEGQF 87
Query: 120 FQNIR--ESRFLTHDPDRAHLFFIPIS----CHKMRGKGTSYEN--MTVIVKDYVDSLIN 171
+ +S F+ PD AH+F+IPIS H + Y + +V DY+ + N
Sbjct: 88 MDEMESGDSHFMAGHPDVAHVFYIPISVTRIAHYIYSPPVDYSGHMLQRLVTDYIYVVSN 147
Query: 172 KYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV 231
KYPYWNR+ GADHF V+CHD + P L K+ IRV+C+ + F P +D++LP+V
Sbjct: 148 KYPYWNRSNGADHFLVSCHDWAPEISIVTPDLYKHFIRVLCNANTSERFQPIRDISLPEV 207
Query: 232 LQP---FALPRGGRDVENRTVLGFWAGHRNSKIRVILARVW-ENDTELDISNN-RINRAI 286
P P + R +L F++G + +R +L R W END E+ + + NR
Sbjct: 208 NIPKGKLGPPHLDKPPNQRHILAFFSGRESGYMRTLLFRSWKENDDEVQVYEHLPSNRD- 266
Query: 287 GPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAV 346
Y + +KFC+CP G +V S R+ ++I GCVPVI+ +YY LPF+++LDW KF++
Sbjct: 267 ----YAKSMVDSKFCLCPSGWEVASPRVVEAIAAGCVPVIICDYYVLPFSEVLDWSKFSI 322
Query: 347 ILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+ + ++K+ILK + + ++ + + +VQ+HF N P +D HM+++ +WLRR
Sbjct: 323 NITSDKIPEIKKILKAVPNERYLRMQKRVKQVQRHFVINRPAQPYDMLHMILHSVWLRR 381
>gi|225429942|ref|XP_002281263.1| PREDICTED: probable glycosyltransferase At5g03795 [Vitis vinifera]
Length = 675
Score = 229 bits (585), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 124/352 (35%), Positives = 200/352 (56%), Gaps = 20/352 (5%)
Query: 67 LGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRES 126
L + ++ + SVF +Y M K+Y Y +G+ F+Q P K G YASEG+F + ++ +
Sbjct: 318 LHASLYRNVSVFKRSYELMENTLKVYTYREGERPVFHQPPIK--GIYASEGWFMKLMQAN 375
Query: 127 R-FLTHDPDRAHLFFIPISCHKMRG-----KGTSYENMTVIVKDYVDSLINKYPYWNRTL 180
+ F+T + +AHLF++P S + S +N+ +K+Y+D + KYP+WNRT
Sbjct: 376 KKFVTKNGRKAHLFYLPFSSLMLEEALYVPNSHSRKNLEQYLKNYLDMIGAKYPFWNRTG 435
Query: 181 GADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPR- 239
GADHF V CHD T L+ N+IR +C+ F KDV+LP+ +P+
Sbjct: 436 GADHFLVACHDWAPSETLK---LMANSIRALCNSDIREGFKLGKDVSLPETC--VRIPQN 490
Query: 240 -----GGRDVENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVYQR 293
GG+ R +L F+AG + +R IL + WEN D ++ I G + Y +
Sbjct: 491 PLRQLGGKPPSQRRILAFFAGSMHGYVRPILLKYWENKDPDMKIYGRMPKAKKGTMNYIQ 550
Query: 294 RFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDV 353
+K+CIC G +VNS R+ ++I Y CVPVI+S+ + PF +L+W FAV + E+D+
Sbjct: 551 HMKSSKYCICAKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFGVLNWESFAVFILEKDI 610
Query: 354 YQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
LK IL +I + ++ + + +VQ+HF W++ PV++D FHM+++ +W R
Sbjct: 611 PNLKSILLSIPEKSYLEIQMRVKQVQQHFLWHAKPVKYDVFHMILHSVWYNR 662
>gi|15233924|ref|NP_195005.1| Exostosin family protein [Arabidopsis thaliana]
gi|3063691|emb|CAA18582.1| putative protein [Arabidopsis thaliana]
gi|7270226|emb|CAB79996.1| putative protein [Arabidopsis thaliana]
gi|332660719|gb|AEE86119.1| Exostosin family protein [Arabidopsis thaliana]
Length = 593
Score = 229 bits (585), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 123/353 (34%), Positives = 197/353 (55%), Gaps = 25/353 (7%)
Query: 71 VFHSP-----SVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRE 125
+ H+P S+F +Y M K+Y+Y +G ++ L G YASEG+F + ++
Sbjct: 244 LLHTPLYWNLSMFKRSYELMEKKLKVYVYREGKRPVLHKPV--LKGIYASEGWFMKQLKS 301
Query: 126 SR-FLTHDPDRAHLFFIPISCHKMRGK-----GTSYENMTVIVKDYVDSLINKYPYWNRT 179
SR F+T DP +AHLF++P S + S +N+ +K+Y+D + +KY +WN+T
Sbjct: 302 SRTFVTKDPRKAHLFYLPFSSKMLEETLYVPGSHSDKNLIQFLKNYLDMISSKYSFWNKT 361
Query: 180 LGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPR 239
G+DHF V CHD T + IR +C+ F+ KDVALP+ +PR
Sbjct: 362 GGSDHFLVACHDWAPSETRQ---YMAKCIRALCNSDVSEGFVFGKDVALPETT--ILVPR 416
Query: 240 ------GGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISN-NRINRAIGPLVYQ 292
GG+ V R +L F+AG + +R +L + W + + D+ + I ++ G Y
Sbjct: 417 RPLRALGGKPVSQRQILAFFAGGMHGYLRPLLLQNWGGNRDPDMKIFSEIPKSKGKKSYM 476
Query: 293 RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERD 352
+K+CICP G +VNS R+ +++ Y CVPVI+S+ + PF ++L+W FAV + E+D
Sbjct: 477 EYMKSSKYCICPKGHEVNSPRVVEALFYECVPVIISDNFVPPFFEVLNWESFAVFVLEKD 536
Query: 353 VYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+ LK IL +I++ + + + VQKHF W+S P RFD FHM+++ +W R
Sbjct: 537 IPDLKNILVSITEERYREMQMRVKMVQKHFLWHSKPERFDIFHMILHSIWYNR 589
>gi|334187794|ref|NP_197526.5| Exostosin family protein [Arabidopsis thaliana]
gi|332005439|gb|AED92822.1| Exostosin family protein [Arabidopsis thaliana]
Length = 458
Score = 229 bits (584), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 126/366 (34%), Positives = 206/366 (56%), Gaps = 27/366 (7%)
Query: 57 FAINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASE 116
F +K++ V+ + FH +++EM FK+++Y +G+ + P + Y+ E
Sbjct: 97 FVSDKEETFVPRGAVYRNAFAFHQSHIEMEKKFKVWVYREGETPLVHMGP--MNNIYSIE 154
Query: 117 GYFFQNIRE--SRFLTHDPDRAHLFFIPISC----HKMRGKGTSY--ENMTVIVKDYVDS 168
G F I S F ++P+ AH F +P+S H + +Y E + + DYVD
Sbjct: 155 GQFMDEIETGMSPFAANNPEEAHAFLLPVSVANIVHYLYRPLVTYSREQLHKVFLDYVDV 214
Query: 169 LINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVAL 228
+ +KYPYWNR+LGADHF+V+CHD + P L+KN IRV+C+ + F+P +DV++
Sbjct: 215 VAHKYPYWNRSLGADHFYVSCHDWAPDVSGSNPELMKNLIRVLCNANTSEGFMPQRDVSI 274
Query: 229 PQVLQP---FALPRGGRDV-ENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRIN 283
P++ P PR R +R +L F+AG + IR IL + W++ D E+ +
Sbjct: 275 PEINIPGGHLGPPRLSRSSGHDRPILAFFAGGSHGYIRRILLQHWKDKDEEVQVHEY--- 331
Query: 284 RAIGPLVYQRRFYK----TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDIL 339
L + ++K +FC+CP G +V S R+ +I+ GCVPVI+S++Y LPF+D+L
Sbjct: 332 -----LAKNKDYFKLMATARFCLCPSGYEVASPRVVAAINLGCVPVIISDHYALPFSDVL 386
Query: 340 DWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMY 399
DW KF + + + + ++K ILK+IS + L +++VQ+HF N P FD M+++
Sbjct: 387 DWTKFTIHVPSKKIPEIKTILKSISWRRYRVLQRRVLQVQRHFVINRPSQPFDMLRMLLH 446
Query: 400 ELWLRR 405
+WLRR
Sbjct: 447 SVWLRR 452
>gi|224120358|ref|XP_002331028.1| predicted protein [Populus trichocarpa]
gi|222872958|gb|EEF10089.1| predicted protein [Populus trichocarpa]
Length = 334
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 128/333 (38%), Positives = 188/333 (56%), Gaps = 18/333 (5%)
Query: 85 MVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR--ESRFLTHDPDRAHLFFIP 142
MV FKI++Y +G+ + P K Y+ EG F + +S FL + D AH FF+P
Sbjct: 1 MVKRFKIWVYREGETPMVHNGPMKHI--YSIEGQFIDEMESGKSPFLARNHDEAHAFFLP 58
Query: 143 ISCHKMRG----KGTSY--ENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRA 196
IS + T+Y E + I KDYV + NKYPYWNR+ G DHF V+CHD +
Sbjct: 59 ISVAYIVEFVYLPITTYHRERLVRIFKDYVTVVANKYPYWNRSRGGDHFMVSCHDWAPQV 118
Query: 197 TEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV-LQPFALPRGGRDV--ENRTVLGFW 253
+ P L KN IRV+C+ + F P +D LP++ P L R + R + F+
Sbjct: 119 SRDDPELYKNLIRVMCNANTSEGFRPRRDATLPELNCPPLKLTPACRGLAPHERKIFAFF 178
Query: 254 AGHRNSKIRVILARVW-ENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSA 312
AG + IR IL R W E D E+ + Y ++KFC+CP G +V S
Sbjct: 179 AGGAHGDIRKILLRHWKEKDDEIQVHEYLPKDQD----YMELMGQSKFCLCPSGFEVASP 234
Query: 313 RIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLH 372
R+A+SI+ GCVPVI+S++Y+LPF+D+LDW +F+V + + ++K IL+ IS E++ +
Sbjct: 235 RVAESIYSGCVPVIISDHYNLPFSDVLDWSQFSVQIPVEKIPEIKTILRGISYDEYLKMQ 294
Query: 373 NNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
++KVQ+HF N P +D HMV++ +WLRR
Sbjct: 295 KGVMKVQRHFVLNRPAKPYDVLHMVLHSVWLRR 327
>gi|292630785|sp|Q3E9A4.3|GLYT5_ARATH RecName: Full=Probable glycosyltransferase At5g20260
Length = 466
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 126/366 (34%), Positives = 206/366 (56%), Gaps = 27/366 (7%)
Query: 57 FAINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASE 116
F +K++ V+ + FH +++EM FK+++Y +G+ + P + Y+ E
Sbjct: 105 FVSDKEETFVPRGAVYRNAFAFHQSHIEMEKKFKVWVYREGETPLVHMGP--MNNIYSIE 162
Query: 117 GYFFQNIRE--SRFLTHDPDRAHLFFIPISC----HKMRGKGTSY--ENMTVIVKDYVDS 168
G F I S F ++P+ AH F +P+S H + +Y E + + DYVD
Sbjct: 163 GQFMDEIETGMSPFAANNPEEAHAFLLPVSVANIVHYLYRPLVTYSREQLHKVFLDYVDV 222
Query: 169 LINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVAL 228
+ +KYPYWNR+LGADHF+V+CHD + P L+KN IRV+C+ + F+P +DV++
Sbjct: 223 VAHKYPYWNRSLGADHFYVSCHDWAPDVSGSNPELMKNLIRVLCNANTSEGFMPQRDVSI 282
Query: 229 PQVLQP---FALPRGGRDV-ENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRIN 283
P++ P PR R +R +L F+AG + IR IL + W++ D E+ +
Sbjct: 283 PEINIPGGHLGPPRLSRSSGHDRPILAFFAGGSHGYIRRILLQHWKDKDEEVQVHEY--- 339
Query: 284 RAIGPLVYQRRFYK----TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDIL 339
L + ++K +FC+CP G +V S R+ +I+ GCVPVI+S++Y LPF+D+L
Sbjct: 340 -----LAKNKDYFKLMATARFCLCPSGYEVASPRVVAAINLGCVPVIISDHYALPFSDVL 394
Query: 340 DWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMY 399
DW KF + + + + ++K ILK+IS + L +++VQ+HF N P FD M+++
Sbjct: 395 DWTKFTIHVPSKKIPEIKTILKSISWRRYRVLQRRVLQVQRHFVINRPSQPFDMLRMLLH 454
Query: 400 ELWLRR 405
+WLRR
Sbjct: 455 SVWLRR 460
>gi|255567222|ref|XP_002524592.1| catalytic, putative [Ricinus communis]
gi|223536145|gb|EEF37800.1| catalytic, putative [Ricinus communis]
Length = 388
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 129/357 (36%), Positives = 201/357 (56%), Gaps = 17/357 (4%)
Query: 63 DYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQN 122
DY G D++ + FH +YL M FKI++Y +G+P + P K Y+ EG F
Sbjct: 36 DYVPHG-DIYRNSCAFHRSYLLMEKLFKIFVYEEGEPPLLHYGPCK--NIYSMEGLFLSL 92
Query: 123 IR-ESRFLTHDPDRAHLFFIPISC-----HKMRGKGTSYENMTVIVKDYVDSLINKYPYW 176
+ +++F T +PD A +FF+P S H + V DYV + +KY YW
Sbjct: 93 METDTKFRTLNPDEAQVFFLPFSVVMIIEHLFHPIIRDKAVLERTVVDYVRIISHKYTYW 152
Query: 177 NRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV---LQ 233
NR+LGADHF ++CHD G RAT L N+IRV+C+ + F P KD + P++
Sbjct: 153 NRSLGADHFMLSCHDWGPRATWYERQLYFNSIRVLCNANTSEYFNPKKDASFPEINLITG 212
Query: 234 PFALPRGGRDVENRTVLGFWAGHRNSKIRVILARVW-ENDTELDISNNRINRAIGPLVYQ 292
A GG NRT+L F++G + K+R +L + W E D ++ + L YQ
Sbjct: 213 EIADLTGGLPPSNRTILAFFSGKMHGKLRPLLFQHWKEKDKDVLVYETFPEG----LSYQ 268
Query: 293 RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERD 352
K+++CICP G +V S RI ++I+ CVPV++S Y PF+D+L+W F++ ++ D
Sbjct: 269 EMMKKSRYCICPSGHEVASPRIVEAIYAECVPVLISQNYVFPFSDVLNWESFSIQVSVSD 328
Query: 353 VYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIK 409
+ LK IL I + +++ + + +VQ+HF N+PP R+D FHM+++ +WLRR +K
Sbjct: 329 ISNLKNILLGIPEDQYLRMRERVKQVQQHFLINNPPKRYDVFHMIIHSIWLRRLNVK 385
>gi|413937765|gb|AFW72316.1| hypothetical protein ZEAMMB73_504926 [Zea mays]
Length = 472
Score = 229 bits (583), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 129/348 (37%), Positives = 186/348 (53%), Gaps = 19/348 (5%)
Query: 71 VFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESR--F 128
++ +P FH +YLEM FKI+ Y +G+P + P Y+ EG F + I + R F
Sbjct: 124 IYRNPRAFHRSYLEMERKFKIWTYREGEPPLAHLGPS--ADIYSIEGQFLEEIEDPRNPF 181
Query: 129 LTHDPDRAHLFFIPISCHKM-----RGKGTSY-ENMTVIVKDYVDSLINKYPYWNRTLGA 182
DP AH F +P+S + R T+Y +M + DYVD + +KYPYWNR+ GA
Sbjct: 182 AARDPGEAHAFLLPVSVCNLVHYIYRLNTTAYMAHMRRALADYVDVVAHKYPYWNRSRGA 241
Query: 183 DHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFAL---PR 239
DH V+CHD +E L NAIRV+C+ + F P KD LP+V L P
Sbjct: 242 DHVIVSCHDWAPLVSEANRELYANAIRVLCNANTSEGFRPRKDATLPEVNLADGLLRRPT 301
Query: 240 GGRDVENRTVLGFWAGHRNSKIRVILARVW--ENDTELDISNNRINRAIGPLVYQRRFYK 297
G ENRT L F+AG + IR L W D ++DI Y +
Sbjct: 302 LGLPPENRTTLAFFAGGMHGHIRRALLGYWLGRKDPDMDIHEY----LPAGQDYHALMAR 357
Query: 298 TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLK 357
+FC+CP G +V S R+ +S+ GCVPVI+S+ Y PF+D+LDW K +V + + +LK
Sbjct: 358 ARFCLCPSGFEVASPRVVESVFTGCVPVIISDGYPPPFSDVLDWSKMSVTVPPARIPELK 417
Query: 358 QILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+LK +S+ + L +++ Q+HF + P RFD MV++ +WLRR
Sbjct: 418 AVLKGVSERRYRVLRARVLQAQRHFVVHRPARRFDMIRMVLHSIWLRR 465
>gi|224142972|ref|XP_002324801.1| predicted protein [Populus trichocarpa]
gi|222866235|gb|EEF03366.1| predicted protein [Populus trichocarpa]
Length = 398
Score = 229 bits (583), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 129/368 (35%), Positives = 207/368 (56%), Gaps = 17/368 (4%)
Query: 50 NFNSTLSFAINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKL 109
N S + A + S L + ++ + S+F +Y M K+YIY +G+ +Q P L
Sbjct: 29 NAKSQIQKAPLVESDSMLYAPLYRNISMFKKSYELMEDILKVYIYKEGERPILHQAP--L 86
Query: 110 TGKYASEGYFFQNIR-ESRFLTHDPDRAHLFFIPISCHKMR-----GKGTSYENMTVIVK 163
G YASEG+F + + +F+T DP ++HLF++P S + S++N+ +K
Sbjct: 87 KGIYASEGWFMKLLETNKKFVTKDPKKSHLFYLPFSSRNLEVNLYVPNSHSHKNLIQYLK 146
Query: 164 DYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPH 223
+Y+D + KYP+WNRT GADHF V CHD T + N IR +C+ F+
Sbjct: 147 NYLDMISAKYPFWNRTRGADHFLVACHDWAPTETRQ---HMANCIRALCNSDAKGGFVFG 203
Query: 224 KDVALPQ--VLQPFALPR--GGRDVENRTVLGFWAGHRNSKIRVILARVWEN-DTELDIS 278
KD ALP+ V P L R GG+ R++L F+AG + +R IL + W N D ++ +
Sbjct: 204 KDAALPETTVRTPQNLLRDLGGKPASKRSILAFFAGSMHGYLRPILLQHWGNKDPDVKVF 263
Query: 279 NNRIN-RAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFND 337
+ G + Y + +K+CIC G +VNS R+ ++I Y CVPVI+S+ + PF +
Sbjct: 264 GKLPKVKGRGKMNYPQYMKSSKYCICAKGFEVNSPRVVEAIFYECVPVIISDNFVPPFFE 323
Query: 338 ILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMV 397
+L+W FAV + E+D+ LK IL +I + ++ + + KVQ+HF W++ PV++D FHM+
Sbjct: 324 VLNWESFAVFVLEKDIPNLKNILLSIPENKYREMQMRVKKVQQHFLWHARPVKYDIFHMI 383
Query: 398 MYELWLRR 405
++ +W R
Sbjct: 384 LHSVWYNR 391
>gi|224129250|ref|XP_002320538.1| predicted protein [Populus trichocarpa]
gi|222861311|gb|EEE98853.1| predicted protein [Populus trichocarpa]
Length = 332
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 121/338 (35%), Positives = 194/338 (57%), Gaps = 30/338 (8%)
Query: 85 MVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESR-FLTHDPDRAHLFFIPI 143
M FKIYIY +GDP F+ P K Y+SEG F + + + F T DPD A ++F+P
Sbjct: 1 MEKLFKIYIYKEGDPPMFHDGPCKSI--YSSEGRFIHELEKGKSFTTTDPDEALVYFLPF 58
Query: 144 SC------------HKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHD 191
S H++ G + V DY++ + +KYP+WNR+LGADHF ++CHD
Sbjct: 59 SVVMLVQYLYVPGSHEIDAIGNT-------VVDYINVIADKYPFWNRSLGADHFILSCHD 111
Query: 192 VGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV-LQPFALPR--GGRDVENRT 248
G R + VP L N+IRV+C+ + F P KD + P++ L+ + GG R+
Sbjct: 112 WGPRTSSYVPHLFNNSIRVLCNANTSEGFNPKKDASFPEIHLRTGEITGLVGGPSPSRRS 171
Query: 249 VLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGS 307
+L F+AG + IR +L W++ D ++ + + N + Y ++FC+CP G
Sbjct: 172 ILAFFAGRLHGHIRRLLLEQWKDKDQDVQVHDQLRNG----MSYDSMLKNSRFCLCPSGY 227
Query: 308 QVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAE 367
+V S RI ++I+ CVPV++S+ Y PF+D+L+W+ F++ + +D+ ++K IL IS +
Sbjct: 228 EVASPRIVEAIYAECVPVLISDGYVPPFSDVLNWKAFSIQVQVKDIPKIKDILMGISQRQ 287
Query: 368 FVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
++ + + +VQ+HF N P RFD FHM ++ +WLRR
Sbjct: 288 YLRMQRRVKQVQRHFVVNGIPKRFDVFHMTIHSIWLRR 325
>gi|356563757|ref|XP_003550126.1| PREDICTED: probable glycosyltransferase At5g03795 [Glycine max]
Length = 645
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 124/346 (35%), Positives = 200/346 (57%), Gaps = 17/346 (4%)
Query: 71 VFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRES-RFL 129
++H+ S+F +Y M K+Y+Y +G +P TG YASEG+F + + + RFL
Sbjct: 299 IYHNVSMFKRSYELMEQTLKVYVYREG-ARPIMHSPF-FTGLYASEGWFMKQMEANKRFL 356
Query: 130 THDPDRAHLFFIPISCHKMRG-----KGTSYENMTVIVKDYVDSLINKYPYWNRTLGADH 184
T DP++AHLF++P S + +++N+ + +YV+ + KY +WNRT GADH
Sbjct: 357 TRDPNKAHLFYLPFSSRMLEETLYVQNSHNHKNLVQYLHNYVEMIAGKYTFWNRTGGADH 416
Query: 185 FFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV-LQPFALPR---G 240
F V CHD T+ + N IR +C+ F+ KD +LP+ ++ +P
Sbjct: 417 FLVGCHDWAPGETK---VDMANCIRSLCNADVKEGFVFGKDASLPETYVRDAKIPTKDLS 473
Query: 241 GRDVENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVYQRRFYKTK 299
G RT L F+AG + +R IL + WEN D ++ I R+ ++ G Y + +K
Sbjct: 474 GNSASKRTTLAFFAGSMHGYVRPILLQHWENKDPDMKIFG-RLPKSKGNRNYIQYMKSSK 532
Query: 300 FCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQI 359
+CIC G +VNS R+ ++I Y CVPVI+S+ + PF ++L+W FAVI+ E+D+ LK I
Sbjct: 533 YCICAKGYEVNSPRVVEAIFYECVPVIISDNFVPPFLEVLNWESFAVIVLEKDIPNLKNI 592
Query: 360 LKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
L +I + +++ L + KVQ+HF W+ PV++D FHM+++ +W R
Sbjct: 593 LLSIPEKQYLRLQMRVKKVQQHFLWHKNPVKYDIFHMILHSVWYNR 638
>gi|359484704|ref|XP_002264922.2| PREDICTED: probable glycosyltransferase At5g20260-like [Vitis
vinifera]
Length = 467
Score = 228 bits (581), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 133/428 (31%), Positives = 222/428 (51%), Gaps = 33/428 (7%)
Query: 4 SKAAQPNQAQTTLCSLRSSLLTLALVTLLCFTCLSFNSFRSYPLQN------NFNSTLSF 57
S ++ PN +QT + +S + T + + + PL+ + + +
Sbjct: 40 SSSSTPNHSQTKHQTPEASSFLGNVSTHGVPSTSTTTGEKMSPLERIEGGLASARAAIRE 99
Query: 58 AINKQDYSDLGSD-------VFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLT 110
A+ +Y+ + V+ +P FH +++EM FKI+ Y +GD + P+
Sbjct: 100 AVRSSNYTSQKKENFIPRGAVYRNPYAFHQSHIEMEKRFKIWTYKEGDQPLVHGGPK--N 157
Query: 111 GKYASEGYFFQNIR--ESRFLTHDPDRAHLFFIPIS----CHKMRGKGTSYEN--MTVIV 162
Y EG F + +S F+ PD AH+F+IPIS H + Y + +V
Sbjct: 158 SIYGIEGQFMDEMESGDSHFVAGHPDEAHVFYIPISVTRIAHYIYSPPVDYSGHMLQRLV 217
Query: 163 KDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIP 222
DY+ + +KYPYWNR+ GADHF V+CHD + P L K+ IRV+C+ + F P
Sbjct: 218 TDYIYVVSDKYPYWNRSNGADHFLVSCHDWAPEISIVTPDLYKHFIRVLCNANTSERFQP 277
Query: 223 HKDVALPQVLQP---FALPRGGRDVENRTVLGFWAGHRNSKIRVILARVW-ENDTELDIS 278
+D++LP+V P P + R +L F+AG + +R +L R W END E+ +
Sbjct: 278 IRDISLPEVNIPKGKLGPPHLDKPPNQRHILAFFAGRESGYMRTLLFRSWKENDDEVQVY 337
Query: 279 NN-RINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFND 337
+ NR Y + +KFC+CP G +V S R+ ++I GCVPVI+ +YY LPF++
Sbjct: 338 EHLPSNRD-----YAKSMGDSKFCLCPSGWEVASPRVVEAIAAGCVPVIICDYYVLPFSE 392
Query: 338 ILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMV 397
+L W KF++ + + ++K+ILK + + ++ + + +VQ+HF N P +D HM+
Sbjct: 393 VLVWSKFSINITSDKIPEIKKILKAVPNERYLRMQKRVKQVQRHFVINRPAQPYDMLHMI 452
Query: 398 MYELWLRR 405
++ +WLRR
Sbjct: 453 LHSVWLRR 460
>gi|147805375|emb|CAN76361.1| hypothetical protein VITISV_035438 [Vitis vinifera]
Length = 459
Score = 228 bits (580), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 124/334 (37%), Positives = 188/334 (56%), Gaps = 30/334 (8%)
Query: 97 GDPNTFYQTPRKLTGKYASEGYFFQN----IRESRFLTHDPDRAHLFFIPISCHKM---- 148
G F++ P LT YA EG F + +S F+ PD AH FF+P+S K+
Sbjct: 124 GAQPIFHEGP--LTNIYAIEGQFIDEMDFIVGKSPFIAKHPDEAHAFFLPLSVVKVVQFL 181
Query: 149 -----RGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHD-VGVRATEGV-- 200
+ S + + +V DYV + +KYPYWNR+ GADHF V+CHD V + E +
Sbjct: 182 YLPITSPEDYSRKRLQRVVTDYVKVVADKYPYWNRSGGADHFMVSCHDWVSSSSLEPMAP 241
Query: 201 ------PFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQP---FALPRGGRDVENRTVLG 251
P L KN IRV+C+ + F P +DV+LP+V P P G+ NR +L
Sbjct: 242 SVSYANPELFKNFIRVLCNANSSEGFRPGRDVSLPEVNLPAGELGPPHLGQPSNNRPILA 301
Query: 252 FWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNS 311
F+AG + IR IL W++ + + R+++ Y + ++KFC+CP G +V S
Sbjct: 302 FFAGRAHGNIRKILFEHWKDQDNEVLVHERLHKGQN---YAKLMGQSKFCLCPSGYEVAS 358
Query: 312 ARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSL 371
R+ ++IH GCVPVI+SN Y LPFND+LDW +F++ + E + ++K IL IS +++ +
Sbjct: 359 PRVVEAIHAGCVPVIISNXYSLPFNDVLDWSQFSIQIPEAKIPEIKTILLGISKNKYLKM 418
Query: 372 HNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+++V++HF N P FD HM+++ LWLRR
Sbjct: 419 QERVLRVRRHFVLNRPARPFDIIHMILHSLWLRR 452
>gi|115456850|ref|NP_001052025.1| Os04g0109900 [Oryza sativa Japonica Group]
gi|38345576|emb|CAE01775.2| OSJNBa0027H06.12 [Oryza sativa Japonica Group]
gi|113563596|dbj|BAF13939.1| Os04g0109900 [Oryza sativa Japonica Group]
gi|125589112|gb|EAZ29462.1| hypothetical protein OsJ_13536 [Oryza sativa Japonica Group]
Length = 441
Score = 228 bits (580), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 127/369 (34%), Positives = 199/369 (53%), Gaps = 25/369 (6%)
Query: 55 LSFAINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYA 114
L F + DY DL S V+ +P+ F+ +Y+EM FK+Y+Y +G+P ++ P K Y
Sbjct: 71 LRFYGGEADY-DLLSRVYRNPAAFYRSYVEMERRFKVYVYEEGEPPILHEGPCK--NIYT 127
Query: 115 SEGYFFQNIR-------ESRFLTHDPDRAHLFFIPISCHKM-----RGKGTSYENMTVIV 162
EG F + + T DP RAH FF+P S +M R + IV
Sbjct: 128 IEGSFIEQLELMSPSDAGGGVRTWDPTRAHAFFLPFSVSQMVKFVYRPPSQDRAPLRAIV 187
Query: 163 KDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIP 222
DYV + ++P+WNR+ GADHF ++CHD G A+ G P L NAIR +C+ + F P
Sbjct: 188 ADYVRVVAARHPFWNRSAGADHFMLSCHDWGPYASRGQPELYTNAIRALCNANTSEGFRP 247
Query: 223 HKDVALPQV-LQPFALPRG----GRDVENRTVLGFWAGHRNSKIRVILARVWE--NDTEL 275
KDV++P++ L +PR +E+R +L F+AG R+ +R +L R W+ +
Sbjct: 248 GKDVSVPEINLYDGDMPRELLAPAPGLESRPLLAFFAGGRHGHVRDLLLRHWKGRDAATF 307
Query: 276 DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPF 335
+ + A + RR +FC+CP G +V S R+ ++I CVPV++++ Y LPF
Sbjct: 308 PVYEYDLPAAGDYYSFMRR---ARFCLCPSGHEVASPRVVEAIQAECVPVVIADGYALPF 364
Query: 336 NDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFH 395
D+L W F+V + D+ +L++ L+ I AE L + V++H PP R D F+
Sbjct: 365 ADVLRWEAFSVAVAVGDIPRLRERLERIPAAEVERLRRGVRLVKRHLMLQQPPRRLDMFN 424
Query: 396 MVMYELWLR 404
M+++ +WLR
Sbjct: 425 MILHSVWLR 433
>gi|357462311|ref|XP_003601437.1| hypothetical protein MTR_3g080690 [Medicago truncatula]
gi|355490485|gb|AES71688.1| hypothetical protein MTR_3g080690 [Medicago truncatula]
Length = 450
Score = 228 bits (580), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 130/364 (35%), Positives = 203/364 (55%), Gaps = 17/364 (4%)
Query: 52 NSTLSFAINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTG 111
N T+ + DY G ++ + FH +Y M FKI++Y +G+P F+ P K
Sbjct: 88 NDTVVPLEDSHDYVPQGH-IYRNAFAFHRSYQLMEKLFKIFVYEEGEPPLFHYGPCK--N 144
Query: 112 KYASEGYFFQNIRESR--FLTHDPDRAHLFFIPISC-----HKMRGKGTSYENMTVIVKD 164
Y+ EG F N+ E+ F T +P+ AH++F+P S H + + D
Sbjct: 145 IYSMEGIFI-NLLENNTLFRTQNPNEAHVYFLPFSVVMILEHLFHPVIRDKAVLGRTIGD 203
Query: 165 YVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHK 224
YV + +KY YWNR+ GADHF ++CHD G RAT V L AIRV+C+ + F P K
Sbjct: 204 YVHIISHKYAYWNRSYGADHFMLSCHDWGPRATWYVKELYFIAIRVLCNANISEHFNPKK 263
Query: 225 DVALPQ---VLQPFALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNR 281
D + P+ V GG NRT+L F+AG N +IR +L + W+N + + +
Sbjct: 264 DASFPEINLVSGETTGLLGGYPTWNRTILAFFAGQMNGRIRPVLFQHWKNKDKDVLVYEK 323
Query: 282 INRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDW 341
+ I Y +K+CICP G +V S RI ++I+ CVP+++S Y LPF+D+L+W
Sbjct: 324 LPEKIS---YHETMKMSKYCICPSGWEVASPRIVEAIYAECVPILISQQYVLPFSDVLNW 380
Query: 342 RKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYEL 401
F+V + ++ +LK+IL IS+ +++ L + +VQ+HF N+PP ++D FHM+++ +
Sbjct: 381 DSFSVQIEVSEIPKLKEILLGISEEKYMRLQEGVKQVQRHFVVNNPPKKYDVFHMIIHSI 440
Query: 402 WLRR 405
WLRR
Sbjct: 441 WLRR 444
>gi|224092292|ref|XP_002309546.1| predicted protein [Populus trichocarpa]
gi|222855522|gb|EEE93069.1| predicted protein [Populus trichocarpa]
Length = 417
Score = 227 bits (579), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 128/355 (36%), Positives = 199/355 (56%), Gaps = 17/355 (4%)
Query: 62 QDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQ 121
++ + L + ++ + S+F +Y M K+Y+Y DG+ F+Q L G YASEG+F +
Sbjct: 62 KNETRLYAPIYRNVSMFRRSYELMEKMLKVYVYQDGEKPIFHQP--ILDGIYASEGWFMK 119
Query: 122 NIRESR-FLTHDPDRAHLFFIPISCHKMR-----GKGTSYENMTVIVKDYVDSLINKYPY 175
++ + F+T DP +AHLF++P S + S N+ +++Y + KY +
Sbjct: 120 HMEANENFVTKDPGKAHLFYLPFSSRLLELTLYVRHSHSRTNLIEYMRNYAGMIAAKYHF 179
Query: 176 WNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV---- 231
WNRT GADHF CHD T G P L N IR +C+ +V F KDV+LP+
Sbjct: 180 WNRTGGADHFVAACHDWAPAETRG-PLL--NCIRALCNADIEVGFSIGKDVSLPETYVRS 236
Query: 232 LQPFALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLV 290
Q G R +L F+AG+ + +R +L W N D ++ I + G
Sbjct: 237 AQNPLKNLEGNPPSQRPILAFFAGNMHGYVRPVLLDYWGNKDPDMKIFGP-MPHVKGNTN 295
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + +KFCICP G +VNS RI ++I CVPVI+S+ + PF ++LDW FAVI+ E
Sbjct: 296 YIQHMKSSKFCICPRGHEVNSPRIVEAIFLECVPVIISDNFVPPFFEVLDWESFAVIVLE 355
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+D+ LK IL +IS+ +++ +H + KVQ+HF W+S P ++D FHM+++ +W R
Sbjct: 356 KDIPNLKNILVSISEEKYIEMHKRVKKVQQHFLWHSKPEKYDLFHMILHSVWYNR 410
>gi|90265152|emb|CAH67778.1| H0201G08.5 [Oryza sativa Indica Group]
gi|125546968|gb|EAY92790.1| hypothetical protein OsI_14593 [Oryza sativa Indica Group]
Length = 441
Score = 227 bits (579), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 127/369 (34%), Positives = 199/369 (53%), Gaps = 25/369 (6%)
Query: 55 LSFAINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYA 114
L F + DY DL S V+ +P+ F+ +Y+EM FK+Y+Y +G+P ++ P K Y
Sbjct: 71 LRFYGGEADY-DLLSRVYRNPAAFYRSYVEMERRFKVYVYEEGEPPILHEGPCK--NIYT 127
Query: 115 SEGYFFQNIR-------ESRFLTHDPDRAHLFFIPISCHKM-----RGKGTSYENMTVIV 162
EG F + + T DP RAH FF+P S +M R + IV
Sbjct: 128 IEGSFIEQLELMSPSDAGGGVRTWDPTRAHAFFLPFSVSQMVKFVYRPPSQDRPPLRAIV 187
Query: 163 KDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIP 222
DYV + ++P+WNR+ GADHF ++CHD G A+ G P L NAIR +C+ + F P
Sbjct: 188 ADYVRVVAARHPFWNRSAGADHFMLSCHDWGPYASRGQPELYTNAIRALCNANTSEGFRP 247
Query: 223 HKDVALPQV-LQPFALPRG----GRDVENRTVLGFWAGHRNSKIRVILARVWE--NDTEL 275
KDV++P++ L +PR +E+R +L F+AG R+ +R +L R W+ +
Sbjct: 248 GKDVSVPEINLYDGDMPRELLAPAPGLESRPLLAFFAGGRHGHVRDLLLRHWKGRDAATF 307
Query: 276 DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPF 335
+ + A + RR +FC+CP G +V S R+ ++I CVPV++++ Y LPF
Sbjct: 308 PVYEYDLPAAGDYYSFMRR---ARFCLCPSGHEVASPRVVEAIQAECVPVVIADGYALPF 364
Query: 336 NDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFH 395
D+L W F+V + D+ +L++ L+ I AE L + V++H PP R D F+
Sbjct: 365 ADVLRWEAFSVAVAVGDIPRLRERLERIPAAEVERLRRGVRLVKRHLMLQQPPRRLDMFN 424
Query: 396 MVMYELWLR 404
M+++ +WLR
Sbjct: 425 MILHSVWLR 433
>gi|224129246|ref|XP_002320537.1| predicted protein [Populus trichocarpa]
gi|222861310|gb|EEE98852.1| predicted protein [Populus trichocarpa]
Length = 395
Score = 227 bits (579), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 135/379 (35%), Positives = 213/379 (56%), Gaps = 22/379 (5%)
Query: 43 RSYPLQNNFNSTLSFAINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTF 102
R NN S+L + DY G ++ + FH +YL M FKI++Y +G+P F
Sbjct: 26 REAAEDNNCTSSLH---DDLDYIPRGY-IYRNACAFHRSYLLMEKLFKIFVYEEGEPPLF 81
Query: 103 -YQTPRKLTGKYASEGYFFQNIR-ESRFLTHDPDRAHLFFIPISC-----HKMRGKGTSY 155
Y T + + Y+ EG F + ++F T +PD AH++F+P S H
Sbjct: 82 HYGTCKDI---YSMEGVFLSLMETNTKFRTSNPDEAHVYFLPFSVVMIIEHLFHPIIRDK 138
Query: 156 ENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPS 215
+ V DYV + +KY YWNR+LGADHF ++CHD G RAT V L N+IRV+C+ +
Sbjct: 139 AVLERTVSDYVRIISHKYLYWNRSLGADHFMLSCHDWGPRATWYVRQLYYNSIRVLCNAN 198
Query: 216 YDVAFIPHKDVALPQV-LQPFALP--RGGRDVENRTVLGFWAGHRNSKIRVILARVW-EN 271
F P KD + P++ L+ + GG NRTVL F+AG + K+R L + W
Sbjct: 199 TSEYFNPKKDASFPEINLKTGEITGLTGGLPPSNRTVLAFFAGKMHGKLRPALLQHWMGK 258
Query: 272 DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYY 331
D ++ + + + I Y K+K+CICP G +V S RIA++I+ CVPV++S +Y
Sbjct: 259 DKDVQVYET-LPQGIS---YHEMMKKSKYCICPSGHEVASPRIAEAIYAECVPVLISQHY 314
Query: 332 DLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRF 391
PF+D+L+W F + + ++ LK IL+ I + +++ + + +VQ+HF N+PP R+
Sbjct: 315 IFPFSDVLNWDSFTIQVPVTEIPNLKNILEGIPEDQYLRMQERVRQVQRHFVVNNPPRRY 374
Query: 392 DAFHMVMYELWLRRHVIKY 410
D FHM+++ +WLRR +++
Sbjct: 375 DVFHMIIHSIWLRRLNVRF 393
>gi|240256346|ref|NP_197913.4| putative glycosyltransferase [Arabidopsis thaliana]
gi|292630776|sp|Q3E7Q9.2|GLYT6_ARATH RecName: Full=Probable glycosyltransferase At5g25310
gi|332006042|gb|AED93425.1| putative glycosyltransferase [Arabidopsis thaliana]
Length = 480
Score = 227 bits (579), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 128/362 (35%), Positives = 203/362 (56%), Gaps = 26/362 (7%)
Query: 65 SDL-GSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNI 123
SDL S+++ +PS + +YLEM FK+Y+Y +G+P + P K YA EG F +
Sbjct: 126 SDLPNSEIYRNPSALYRSYLEMEKRFKVYVYEEGEPPLVHDGPCK--SVYAVEGRFITEM 183
Query: 124 --RESRFLTHDPDRAHLFFIPISC----HKMRGKGTSYENMTVIVKDYVDSLINKYPYWN 177
R ++F T+DP++A+++F+P S + + + + V DY+ + +P+WN
Sbjct: 184 EKRRTKFRTYDPNQAYVYFLPFSVTWLVRYLYEGNSDAKPLKTFVSDYIRLVSTNHPFWN 243
Query: 178 RTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFAL 237
RT GADHF +TCHD G ++ L +IRV+C+ + F P KDV LP++ L
Sbjct: 244 RTNGADHFMLTCHDWGPLTSQANRDLFNTSIRVMCNANSSEGFNPTKDVTLPEI----KL 299
Query: 238 PRGGRD----------VENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIG 287
G D R LGF+AG + +R IL + W+ + D+
Sbjct: 300 YGGEVDHKLRLSKTLSASPRPYLGFFAGGVHGPVRPILLKHWK---QRDLDMPVYEYLPK 356
Query: 288 PLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVI 347
L Y +KFC CP G +V S R+ ++I+ C+PVILS + LPF D+L W F+V+
Sbjct: 357 HLNYYDFMRSSKFCFCPSGYEVASPRVIEAIYSECIPVILSVNFVLPFTDVLRWETFSVL 416
Query: 348 LNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHV 407
++ ++ +LK+IL +IS+ ++ L +NL V++HF+ N PP RFDAFH+ ++ +WLRR
Sbjct: 417 VDVSEIPRLKEILMSISNEKYEWLKSNLRYVRRHFELNDPPQRFDAFHLTLHSIWLRRLN 476
Query: 408 IK 409
+K
Sbjct: 477 LK 478
>gi|224142559|ref|XP_002324623.1| predicted protein [Populus trichocarpa]
gi|222866057|gb|EEF03188.1| predicted protein [Populus trichocarpa]
Length = 337
Score = 227 bits (578), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 121/336 (36%), Positives = 191/336 (56%), Gaps = 21/336 (6%)
Query: 85 MVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNI--RESRFLTHDPDRAHLFFIP 142
MV FK++ Y +G+ + P L Y+ EG+F + + S F DPD AH+FF+P
Sbjct: 1 MVKRFKVWPYKEGERPLVHDGP--LNNIYSIEGHFIDEVESKGSPFRAQDPDEAHVFFLP 58
Query: 143 ISCHKM---------RGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVG 193
+S + S + + +V DYV + KYPYWNR+ GADHF V+CHD
Sbjct: 59 VSVASIVHFIYLPITAAADYSRDRLRRVVTDYVHIVAKKYPYWNRSNGADHFMVSCHDWA 118
Query: 194 VRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFA---LPRGGRDVENRTVL 250
+ L IRV+C+ + + F P +DV LP++ PF+ G+ NR +L
Sbjct: 119 PDVSIANSELFNKFIRVLCNANISIGFRPPRDVLLPEIYLPFSGLGTTHMGQAPNNRPIL 178
Query: 251 GFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQV 309
F+ G + IR +L + W+N D E+ + + + + Y R ++KFC+CP G +V
Sbjct: 179 AFFEGRAHGYIRQVLFKHWKNKDNEVQV-HELLPKGKN---YTRLMGQSKFCLCPSGFEV 234
Query: 310 NSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFV 369
S R+ ++I+ GCVPVI+SN Y LPF+D+L+W +F+V + + ++K IL+ IS+++++
Sbjct: 235 ASPRVVEAIYQGCVPVIISNNYSLPFSDVLNWSQFSVQIPVEKIPEIKMILQRISNSKYL 294
Query: 370 SLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+H + +VQ+HF N P FD HMV++ LWLRR
Sbjct: 295 RMHERVKRVQRHFVLNRPAKPFDVIHMVLHSLWLRR 330
>gi|356548516|ref|XP_003542647.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
[Glycine max]
Length = 340
Score = 227 bits (578), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 121/346 (34%), Positives = 202/346 (58%), Gaps = 32/346 (9%)
Query: 85 MVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNI----RESRFLTHDPDRAHLFF 140
M+ FK+++Y +G+ + P + Y+ EG F + + S F +P++AH+F
Sbjct: 1 MLKRFKVWVYEEGEQPLVHYGP--VNNIYSIEGQFIDEMDNYHKWSHFRARNPNQAHVFL 58
Query: 141 IPISC---------HKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHD 191
IP S +R G+ +++ ++V+DY+ + +KYPYWNRT GADHF ++CHD
Sbjct: 59 IPFSIVNIVQYVYNRNLRQPGS--QSIQLLVEDYIRVIAHKYPYWNRTEGADHFLLSCHD 116
Query: 192 VGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRG-------GRDV 244
G + P L KN IRV+C+ + F P+KDV++P+V LPRG G+
Sbjct: 117 WGPTISYANPKLFKNFIRVLCNANTSEGFRPNKDVSIPEV---NLLPRGTLGSPNRGQHP 173
Query: 245 ENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVYQRRFYKTKFCIC 303
+RT+L F+AG + IR IL W++ D ++ I + VY + ++KFC+C
Sbjct: 174 NDRTILAFFAGREHGAIRTILLNHWKDKDNDVQIYESLPKGK----VYTKLMGQSKFCLC 229
Query: 304 PGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNI 363
P G +V S R+ ++I+ GCVPV++S+ Y PF D+L+W +F+V + + ++K IL+++
Sbjct: 230 PSGYEVASPRVVEAIYAGCVPVLISSSYSPPFTDVLNWSQFSVEIPVEKIPEIKTILQSV 289
Query: 364 SDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIK 409
S +++ L N+++VQ+HF N P FD HM+++ +WLRR +K
Sbjct: 290 SPKKYLKLQMNVLRVQRHFTINRPAKPFDLMHMILHSIWLRRLNLK 335
>gi|224087074|ref|XP_002308055.1| predicted protein [Populus trichocarpa]
gi|222854031|gb|EEE91578.1| predicted protein [Populus trichocarpa]
Length = 338
Score = 226 bits (577), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 117/335 (34%), Positives = 189/335 (56%), Gaps = 17/335 (5%)
Query: 85 MVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR--ESRFLTHDPDRAHLFFIP 142
M FKI++Y +G+ + P + Y+ EG F I +S F+ PD AH F +P
Sbjct: 1 MEKRFKIWVYKEGELPVLHGGP--VNNIYSVEGQFLDEIERGKSHFIARHPDEAHAFLLP 58
Query: 143 ISC-------HKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVR 195
+S +K R + ++ + +V DYV + +KY YWNRT GADHF ++CHD G
Sbjct: 59 LSVAYIMHYIYKPRVTFSRHQ-LQTLVTDYVRVIADKYTYWNRTNGADHFSISCHDWGPD 117
Query: 196 ATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV---LQPFALPR-GGRDVENRTVLG 251
+ P L K IR +C+ + F P +DV++P++ + LPR G + R +L
Sbjct: 118 ISRTNPELFKYFIRALCNANTSEGFQPQRDVSVPEIFLHVGKLGLPREGAQPPSKRPILA 177
Query: 252 FWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVN 310
F+AG + +IR +L + W++ D E+ + R +Y + ++KFC+CP G +V
Sbjct: 178 FFAGGAHGRIRKVLLKRWKDKDGEIQVHEYVTQRKKNNNLYFKLMGQSKFCLCPSGHEVA 237
Query: 311 SARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVS 370
S R+ +I GCVPVI+S+ Y LPF+D+LDW KF+V + + ++K ILK IS +++
Sbjct: 238 SPRVVTAIQLGCVPVIISDNYSLPFSDVLDWSKFSVNIPSEKIQEIKTILKGISHKRYLT 297
Query: 371 LHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+ +++ Q+HF N P +D HM+++ +WLRR
Sbjct: 298 MQRRVIQAQRHFTLNRPAKPYDMIHMILHSIWLRR 332
>gi|326517848|dbj|BAK03842.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 451
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 126/368 (34%), Positives = 196/368 (53%), Gaps = 37/368 (10%)
Query: 71 VFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR-----E 125
V+ +P+ FH +Y+EM FK+Y+Y +G+P + P K Y EG F + +
Sbjct: 81 VYRNPAAFHRSYVEMERRFKVYVYEEGEPPILHTGPCKDI--YTIEGRFIEQLELLAPPA 138
Query: 126 SRFLTHDPDRAHLFFIPISCHKMRG---KGTSYEN--MTVIVKDYVDSLINKYPYWNRTL 180
T D DRAH FF+P S +M + SY+ + +V DYV + +++P+WNR+
Sbjct: 139 PGVRTRDADRAHAFFLPFSVAQMMQFAYRQLSYDRGPLLSLVGDYVRVVASRHPFWNRSA 198
Query: 181 GADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV-----LQPF 235
GADHF ++CHD G A++G P L N IR +C+ + F P KDV++P++ P
Sbjct: 199 GADHFMLSCHDWGPDASKGDPELYANGIRALCNANTSEGFRPGKDVSIPEINLYDGDTPR 258
Query: 236 ALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDT-------ELDISNNRINRAIGP 288
L + R L F+AG R+ +R +L R W+ E DI + + G
Sbjct: 259 QLLGPSPGLSARPYLAFFAGGRHGHVRDLLLRHWKGRDPATFPVYEYDIPSTTGGNSSG- 317
Query: 289 LVYQRR-----------FYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFND 337
+ RR ++++FC+CP G +V S R+ ++IH CVPV++S Y PF D
Sbjct: 318 -RHNRRGRDRQSDYFAYMHRSRFCLCPSGHEVASPRVVEAIHAECVPVLVSEGYAPPFAD 376
Query: 338 ILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMV 397
+L W F+V + D+ +LK++L+ I AE L + V++HF PP R D FHM+
Sbjct: 377 VLRWESFSVSVPVVDIPRLKEVLEGIPMAEVERLREGVRLVKRHFTLRQPPERLDMFHMI 436
Query: 398 MYELWLRR 405
++ +WLRR
Sbjct: 437 LHSVWLRR 444
>gi|413952956|gb|AFW85605.1| hypothetical protein ZEAMMB73_586394 [Zea mays]
Length = 509
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 131/373 (35%), Positives = 194/373 (52%), Gaps = 36/373 (9%)
Query: 63 DYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQN 122
+Y+ L S V+ +P+ FH +Y EM F++Y+Y +G+P + P K Y EG F +
Sbjct: 136 EYALLAS-VYRNPAAFHRSYAEMEKLFRVYVYEEGEPPILHAGPCK--NIYTIEGRFIEQ 192
Query: 123 IR------------------ESRFLTHDPDRAHLFFIPISCHKM-----RGKGTSYENMT 159
+ S T +P RAH FF+P S +M R +
Sbjct: 193 LELMAPSPASSSRRGTRRRSASDVRTSEPARAHAFFLPFSVSQMVQFAYRPNTYDKTPLR 252
Query: 160 VIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVA 219
IV DYV + +++PYWNR+ GADHF + CHD G A+ G P L N IR +C+ +
Sbjct: 253 AIVADYVRVVASRHPYWNRSAGADHFMLACHDWGPEASTGHPELHANGIRALCNANSSEG 312
Query: 220 FIPHKDVALPQV-LQPFALPRG----GRDVENRTVLGFWAGHRNSKIRVILARVWENDTE 274
F P +DV++P + L +PR V +R L F+AG R+ IR +L R W+
Sbjct: 313 FRPWQDVSVPDINLYDGDMPRQLLAPAPGVTSRPFLAFFAGGRHGHIRDLLLRHWKGRDP 372
Query: 275 --LDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYD 332
+ +R + RR +FC+CP G QV S R+ +SIH CVPVILS+ Y
Sbjct: 373 DFFPVYEHRHEDGFDYYSFMRR---ARFCLCPSGYQVASPRVVESIHAECVPVILSDGYV 429
Query: 333 LPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFD 392
LPF D+L W F+V + D+ +L+++L+ I E L + V++HF N PP R D
Sbjct: 430 LPFADVLRWEAFSVAVPVADIPRLREVLERIPAPEVEKLQQGVRLVKRHFMLNQPPQRLD 489
Query: 393 AFHMVMYELWLRR 405
F+M+++ +WLRR
Sbjct: 490 MFNMILHSVWLRR 502
>gi|359476146|ref|XP_002285237.2| PREDICTED: probable glycosyltransferase At5g25310-like [Vitis
vinifera]
gi|296082059|emb|CBI21064.3| unnamed protein product [Vitis vinifera]
Length = 478
Score = 226 bits (576), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 132/370 (35%), Positives = 205/370 (55%), Gaps = 24/370 (6%)
Query: 52 NSTLSFAINKQDYSDLGS-DVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLT 110
N S I +D +++ + DV+ + F+ +Y+EM K+Y+Y +G+ + P K
Sbjct: 111 NLNRSSEIGGEDNNEVPTGDVYRNAGAFYQSYVEMEKRLKVYVYEEGEVPMIHDGPCK-- 168
Query: 111 GKYASEGYFFQNIR--ESRFLTHDPDRAHLFFIPISCHKMRG---KGTSYENMTVI---V 162
Y EG F I + +F T D +RAH++F+P S M K +Y N+T + V
Sbjct: 169 DIYTIEGRFIHEIEHGDGKFRTRDAERAHVYFMPFSVTWMVKYLYKPLTY-NLTPLRQFV 227
Query: 163 KDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIP 222
DYV + KYP+WN T GADHF + CHD G A+ G P L +IRV+C+ + F P
Sbjct: 228 SDYVSVISTKYPFWNTTQGADHFMLACHDWGPHASRGHPVLYNTSIRVLCNANTSEGFNP 287
Query: 223 HKDVALPQV------LQPFALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWEN-DTEL 275
KDV+LP++ + P L R L F+AG + IR IL + W DT+L
Sbjct: 288 QKDVSLPEIHLYGGNVNP-KLLSPPPPNSPRPFLAFFAGGLHGPIRPILIQHWMGRDTDL 346
Query: 276 DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPF 335
+ + Y ++K+C+CP G +V S RI ++I+ CVPVILS++Y LPF
Sbjct: 347 RV----YEYLPKDMDYYSLMLQSKYCLCPSGHEVASPRIVEAIYSECVPVILSDHYVLPF 402
Query: 336 NDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFH 395
+D+L W F+V + ++ +LK++L+ IS+ ++ L + V++HF+ N P RFD FH
Sbjct: 403 SDVLRWEAFSVKVEASEIPRLKEVLQAISEEKYTRLKEGVRAVRRHFELNQPAKRFDVFH 462
Query: 396 MVMYELWLRR 405
M+++ +WLRR
Sbjct: 463 MILHSVWLRR 472
>gi|356546130|ref|XP_003541484.1| PREDICTED: probable glycosyltransferase At5g20260-like [Glycine
max]
Length = 468
Score = 225 bits (574), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 118/360 (32%), Positives = 201/360 (55%), Gaps = 21/360 (5%)
Query: 61 KQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYF- 119
++D+ GS ++ +P FH +++EMV FK+++Y +GD + P + YA EG F
Sbjct: 106 QEDFIPKGS-IYRNPHAFHQSHIEMVKRFKVWVYEEGDQPLVHYGP--VNDIYAIEGQFI 162
Query: 120 --FQNIRESRFLTHDPDRAHLFFIPISCHK---------MRGKGTSYENMTVIVKDYVDS 168
N + S F +PD AH FF+P S M + + +V+DY+
Sbjct: 163 DEMDNSKRSPFKAKNPDEAHAFFLPFSVVNVVHYAYKPYMSQNDYRRDRLQRLVEDYIVV 222
Query: 169 LINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVAL 228
+ +KYPYWNR+ GADHF ++CHD + P L KN IRV+C+ + F P +DV++
Sbjct: 223 VADKYPYWNRSNGADHFLLSCHDWAPEISHANPDLFKNFIRVLCNANNSEGFQPKRDVSI 282
Query: 229 PQV---LQPFALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRA 285
P+V + P G+ NRT+L F++G + IR +L + W++ + + +
Sbjct: 283 PEVYLSVGKLGPPNLGQHPMNRTILAFFSGGAHGDIRKLLLKHWKDKDNQVQVHEYLPKG 342
Query: 286 IGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFA 345
Y +KFC+CP G +V S R+ ++I+ CVPVI+S Y LP +D+L+W +F+
Sbjct: 343 QN---YTELMGLSKFCLCPSGYEVASPRVVEAINAVCVPVIISENYSLPLSDVLNWSQFS 399
Query: 346 VILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+ ++ ++ +K IL+N++ ++ L+ N+ +V++HF + P FD HM+++ +WLRR
Sbjct: 400 IQISVENIPDIKTILQNVTQKKYKKLYRNVRRVRRHFVMHRPAKPFDLMHMIIHSIWLRR 459
>gi|42568129|ref|NP_198314.2| xylogalacturonan beta-1,3-xylosyltransferase [Arabidopsis thaliana]
gi|292630939|sp|Q94AA9.2|XGD1_ARATH RecName: Full=Xylogalacturonan beta-1,3-xylosyltransferase;
AltName: Full=Protein XYLOGALACTURONAN DEFICIENT 1
gi|332006509|gb|AED93892.1| xylogalacturonan beta-1,3-xylosyltransferase [Arabidopsis thaliana]
Length = 500
Score = 225 bits (573), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 118/359 (32%), Positives = 200/359 (55%), Gaps = 31/359 (8%)
Query: 69 SDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNI----- 123
S ++ +P+ FH ++ EM+ FK++ Y +G+ F+ P + Y EG F +
Sbjct: 144 SSLYKNPAAFHQSHTEMMNRFKVWTYTEGEVPLFHDGP--VNDIYGIEGQFMDEMCVDGP 201
Query: 124 -RESRFLTHDPDRAHLFFIPISCHKM---------RGKGTSYENMTVIVKDYVDSLINKY 173
SRF P+ AH+FFIP S K+ +G S + +++DYVD + K+
Sbjct: 202 KSRSRFRADRPENAHVFFIPFSVAKVIHFVYKPITSVEGFSRARLHRLIEDYVDVVATKH 261
Query: 174 PYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQ 233
PYWNR+ G DHF V+CHD +G P L + IR +C+ + F P+ DV++P++
Sbjct: 262 PYWNRSQGGDHFMVSCHDWAPDVIDGNPKLFEKFIRGLCNANTSEGFRPNVDVSIPEIY- 320
Query: 234 PFALPRG-------GRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAI 286
LP+G G+ R++L F+AG + +IR IL + W+ E+D +R
Sbjct: 321 ---LPKGKLGPSFLGKSPRVRSILAFFAGRSHGEIRKILFQHWK---EMDNEVQVYDRLP 374
Query: 287 GPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAV 346
Y + +KFC+CP G +V S R ++I+ GCVPVI+S+ Y LPF+D+L+W F++
Sbjct: 375 PGKDYTKTMGMSKFCLCPSGWEVASPREVEAIYAGCVPVIISDNYSLPFSDVLNWDSFSI 434
Query: 347 ILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+ + ++K IL+++S ++ ++ +++V++HF N P +D HM+++ +WLRR
Sbjct: 435 QIPVSRIKEIKTILQSVSLVRYLKMYKRVLEVKQHFVLNRPAKPYDVMHMMLHSIWLRR 493
>gi|359484710|ref|XP_002266490.2| PREDICTED: probable glycosyltransferase At5g20260 [Vitis vinifera]
Length = 480
Score = 224 bits (571), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 119/347 (34%), Positives = 195/347 (56%), Gaps = 17/347 (4%)
Query: 70 DVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR--ESR 127
DV+ +P FH +++EM FKI+ Y +GD + P YA EG F I +S+
Sbjct: 133 DVYRNPYAFHQSHIEMEKRFKIWAYREGDQPLMHDGPS--NDIYAIEGQFMDEIESGKSQ 190
Query: 128 FLTHDPDRAHLFFIPISCHKMRG---KGTSYENMTV--IVKDYVDSLINKYPYWNRTLGA 182
FL PD A+ F+IP+S ++ + Y + +V DY++ + +KYPYWNR+ GA
Sbjct: 191 FLARHPDEANAFYIPMSLTRVVHFIYEPPHYHGKWIPRLVTDYINFVADKYPYWNRSKGA 250
Query: 183 DHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQP---FALPR 239
DHF V+CHD + P L K+ IR +C+ + F P +D+++P++ P P
Sbjct: 251 DHFLVSCHDWAPDVSALKPDLYKHFIRALCNANTSERFHPIRDISIPEINIPRGKLGPPH 310
Query: 240 GGRDVENRTVLGFWAGHRNSKIRVILARVW-ENDTELDISNNRINRAIGPLVYQRRFYKT 298
+ R +L F+AG + +R +L + W E D E+ + R+ R Y + +
Sbjct: 311 LDQPPNKRPILAFFAGGAHGYVRSVLFKYWKEKDDEVQVFE-RLPRNRN---YSKSMGDS 366
Query: 299 KFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQ 358
KFC+CP G +V S RI ++I GCVP+I+ ++Y LPF+D+LDW KF++ + + ++K+
Sbjct: 367 KFCLCPSGYEVASPRIVEAIAAGCVPMIICDHYSLPFSDVLDWSKFSIYITSDKIPEIKK 426
Query: 359 ILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
ILK + ++ + + +VQ+HF N P +D HM+++ +WLRR
Sbjct: 427 ILKAVPTETYLEMQKRVKQVQRHFAINRPARPYDMLHMILHSVWLRR 473
>gi|8953375|emb|CAB96648.1| putative protein [Arabidopsis thaliana]
Length = 336
Score = 223 bits (569), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 122/333 (36%), Positives = 192/333 (57%), Gaps = 18/333 (5%)
Query: 85 MVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRE--SRFLTHDPDRAHLFFIP 142
M FKI+ Y +G+ F++ P L YA EG F I SRF P+ A +F+IP
Sbjct: 1 MEKRFKIWTYREGEAPLFHKGP--LNNIYAIEGQFMDEIENGNSRFKAASPEEATVFYIP 58
Query: 143 ISCHKM----RGKGTSY--ENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRA 196
+ + TSY + + IVKDY+ + N+YPYWNR+ GADHFF++CHD
Sbjct: 59 VGIVNIIRFVYRPYTSYARDRLQNIVKDYISLISNRYPYWNRSRGADHFFLSCHDWAPDV 118
Query: 197 TEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQP---FALPRGGRDVENRTVLGFW 253
+ P L K+ IR +C+ + F P +DV+LP++ P G +NR +L F+
Sbjct: 119 SAVDPELYKHFIRALCNANSSEGFTPMRDVSLPEINIPHSQLGFVHTGEPPQNRKLLAFF 178
Query: 254 AGHRNSKIRVILARVW-ENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSA 312
AG + +R IL + W E D ++ + N + Y + K KFC+CP G +V S
Sbjct: 179 AGGSHGDVRKILFQHWKEKDKDVLVYENLPKT----MNYTKMMDKAKFCLCPSGWEVASP 234
Query: 313 RIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLH 372
RI +S++ GCVPVI+++YY LPF+D+L+W+ F+V + + +K+IL+ I++ E++++
Sbjct: 235 RIVESLYSGCVPVIIADYYVLPFSDVLNWKTFSVHIPISKMPDIKKILEAITEEEYLNMQ 294
Query: 373 NNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+++V+KHF N P +D HM+M+ +WLRR
Sbjct: 295 RRVLEVRKHFVINRPSKPYDMLHMIMHSIWLRR 327
>gi|449465629|ref|XP_004150530.1| PREDICTED: probable glycosyltransferase At5g20260-like [Cucumis
sativus]
Length = 299
Score = 223 bits (569), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 117/297 (39%), Positives = 182/297 (61%), Gaps = 17/297 (5%)
Query: 125 ESRFLTHDPDRAHLFFIPIS----CHKMRGKGTSY--ENMTVIVKDYVDSLINKYPYWNR 178
+S F H+P+ A +FF+PIS + T+Y + + I DYV + NKYPYWNR
Sbjct: 5 KSPFSAHEPEEAQVFFLPISIVYIVDYIYKPITTYARDRLVRIFTDYVRVVANKYPYWNR 64
Query: 179 TLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQP---- 234
T GADHF V+CHD T+ P L K IRV+C+ + F P +D +LP++ P
Sbjct: 65 TRGADHFMVSCHDWAPEVTKEDPNLFKYFIRVLCNANTSEGFNPMRDASLPEINLPPTFH 124
Query: 235 FALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLV-YQ 292
LPR G+ +NR++L F+AG + IR IL + W++ D E+ +++ + P Y
Sbjct: 125 LNLPRLGQPPQNRSILAFFAGGAHGFIRHILMQHWKDKDHEI-----QVHEYLPPSQNYT 179
Query: 293 RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERD 352
++KFC+CP G +V S R+ ++IH GCVPV++S+YY LPF+D+LDW KF++ +
Sbjct: 180 ELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVISDYYSLPFDDVLDWSKFSMRIPSER 239
Query: 353 VYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIK 409
+ ++K IL+ +S +++ L ++KVQ+HF+ + P FD FHMV++ +WLRR +K
Sbjct: 240 IPEIKTILRGVSMKKYLKLQRGVMKVQRHFEIHRPAKAFDMFHMVLHSVWLRRLNVK 296
>gi|356518346|ref|XP_003527840.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 633
Score = 223 bits (569), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 129/354 (36%), Positives = 201/354 (56%), Gaps = 17/354 (4%)
Query: 63 DYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQN 122
D +L + +F + S F +Y M K+Y+Y +GD +P L+G YASEG+F ++
Sbjct: 279 DDVNLYAPLFRNVSRFKRSYELMERTLKVYVYREGD-KAIMHSP-ILSGLYASEGWFMKH 336
Query: 123 IRESR-FLTHDPDRAHLFFIPISCHKMRGK----GTSYENMTVIVKDYVDSLINKYPYWN 177
+ S+ F+T DP AHLF++P S ++ +SY N+ + +YVD + K+ +WN
Sbjct: 337 MEASKQFVTTDPKNAHLFYLPFSSQRLVDALWVPKSSYGNLIQYLSNYVDMIAGKHHFWN 396
Query: 178 RTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQ--VLQPF 235
RT GADHF V CHD T+ + +R +C+ F+ KD++LP+ V P
Sbjct: 397 RTGGADHFLVACHDWAPAETKQ---HMAKCLRALCNADVKQGFVFGKDMSLPETVVRSPR 453
Query: 236 ALPR--GGRDVENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVYQ 292
R GG V R L F+AG + +R IL + WEN D ++ I R+ ++ G Y
Sbjct: 454 NPTRSIGGNQVSKRKTLAFFAGQMHGYVRPILLQHWENKDPDMKIFG-RLPKSKGNRNYI 512
Query: 293 RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERD 352
+ +K+CIC G +VNS R+ ++I Y CVPVILS+ + PF ++L+W FAV + E+D
Sbjct: 513 QYMKSSKYCICAKGYEVNSPRVVEAILYECVPVILSDNFVPPFFEMLNWESFAVFVLEKD 572
Query: 353 VYQLKQILKNISDAEFVSLHNNLVKVQKHFQW-NSPPVRFDAFHMVMYELWLRR 405
+ LK IL +I ++ + + KVQ+HF W N PV++D FHM+++ +W R
Sbjct: 573 IPNLKNILLSIPRKRYLQMQMMVRKVQQHFLWHNKSPVKYDIFHMILHSIWYNR 626
>gi|297812207|ref|XP_002873987.1| hypothetical protein ARALYDRAFT_326421 [Arabidopsis lyrata subsp.
lyrata]
gi|297319824|gb|EFH50246.1| hypothetical protein ARALYDRAFT_326421 [Arabidopsis lyrata subsp.
lyrata]
Length = 654
Score = 223 bits (569), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 124/361 (34%), Positives = 200/361 (55%), Gaps = 23/361 (6%)
Query: 57 FAINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASE 116
FA +K++ S V+ + FH +++EM FK+++Y +GD + P + Y+ E
Sbjct: 292 FASDKEETSVPRGAVYRNAFAFHQSHMEMEKKFKVWVYREGDTPLVHMGP--VNNIYSIE 349
Query: 117 GYFFQNIRE--SRFLTHDPDRAHLFFIPIS----CHKMRGKGTSY--ENMTVIVKDYVDS 168
G F I S F P+ AH F +P+S H + +Y E + + DYV+
Sbjct: 350 GQFMDEIETGMSPFAASGPEEAHAFLLPVSIANVVHYLYRPLVTYSREQLHKVFLDYVNV 409
Query: 169 LINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVAL 228
+ +KYPYWNR+LGADHFFV+CHD + P ++KN IRV+C+ + F+P +DV++
Sbjct: 410 VAHKYPYWNRSLGADHFFVSCHDWAPDVSGSNPEMMKNLIRVLCNANTSEGFMPQRDVSI 469
Query: 229 PQV------LQPFALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNR 281
P++ L P L R +R +L F++G + IR IL + W++ D E+ +
Sbjct: 470 PEINIPRGQLGPPQLSRSSG--HDRPILAFFSGGSHGYIRKILLQHWKDKDEEVQVHEYL 527
Query: 282 INRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDW 341
N Y + +FC+CP G +V S R+ +I+ GCVPVI+S++Y LPF+D+LDW
Sbjct: 528 TNNKD----YFKLMATARFCLCPSGYEVASPRVVAAINLGCVPVIISDHYALPFSDVLDW 583
Query: 342 RKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYEL 401
KF + + + ++K ILK+IS + L +++VQ+HF N P FD M+++ +
Sbjct: 584 TKFTIHVPSEKIPEIKTILKSISWRRYKVLQRRVLQVQRHFVINRPSQPFDMLRMLLHSV 643
Query: 402 W 402
W
Sbjct: 644 W 644
>gi|296084509|emb|CBI25530.3| unnamed protein product [Vitis vinifera]
Length = 387
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 119/348 (34%), Positives = 195/348 (56%), Gaps = 17/348 (4%)
Query: 69 SDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR--ES 126
DV+ +P FH +++EM FKI+ Y +GD + P YA EG F I +S
Sbjct: 39 GDVYRNPYAFHQSHIEMEKRFKIWAYREGDQPLMHDGPS--NDIYAIEGQFMDEIESGKS 96
Query: 127 RFLTHDPDRAHLFFIPISCHKMRG---KGTSYENMTV--IVKDYVDSLINKYPYWNRTLG 181
+FL PD A+ F+IP+S ++ + Y + +V DY++ + +KYPYWNR+ G
Sbjct: 97 QFLARHPDEANAFYIPMSLTRVVHFIYEPPHYHGKWIPRLVTDYINFVADKYPYWNRSKG 156
Query: 182 ADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQP---FALP 238
ADHF V+CHD + P L K+ IR +C+ + F P +D+++P++ P P
Sbjct: 157 ADHFLVSCHDWAPDVSALKPDLYKHFIRALCNANTSERFHPIRDISIPEINIPRGKLGPP 216
Query: 239 RGGRDVENRTVLGFWAGHRNSKIRVILARVW-ENDTELDISNNRINRAIGPLVYQRRFYK 297
+ R +L F+AG + +R +L + W E D E+ + R+ R Y +
Sbjct: 217 HLDQPPNKRPILAFFAGGAHGYVRSVLFKYWKEKDDEVQVFE-RLPRNRN---YSKSMGD 272
Query: 298 TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLK 357
+KFC+CP G +V S RI ++I GCVP+I+ ++Y LPF+D+LDW KF++ + + ++K
Sbjct: 273 SKFCLCPSGYEVASPRIVEAIAAGCVPMIICDHYSLPFSDVLDWSKFSIYITSDKIPEIK 332
Query: 358 QILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+ILK + ++ + + +VQ+HF N P +D HM+++ +WLRR
Sbjct: 333 KILKAVPTETYLEMQKRVKQVQRHFAINRPARPYDMLHMILHSVWLRR 380
>gi|449461995|ref|XP_004148727.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
sativus]
gi|449501299|ref|XP_004161331.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
sativus]
Length = 664
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 120/351 (34%), Positives = 195/351 (55%), Gaps = 16/351 (4%)
Query: 66 DLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR- 124
+L + +F + S+F +Y M KIY+Y DG F+Q L G YASEG+F + +
Sbjct: 312 ELYAPLFRNVSMFKRSYELMERTLKIYVYRDGKKPIFHQP--ILKGLYASEGWFMKLMEG 369
Query: 125 ESRFLTHDPDRAHLFFIPISCHKMR-----GKGTSYENMTVIVKDYVDSLINKYPYWNRT 179
RF+ DP +AHLF++P S + + N+ +K+Y +++ KYPYWNRT
Sbjct: 370 NKRFVVKDPRKAHLFYMPFSSRMLEYTLYVRNSHNRTNLRQFLKEYAENIAAKYPYWNRT 429
Query: 180 LGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFAL-- 237
GADHF CHD T +++ I+ +C+ V F +DV+LP+ A
Sbjct: 430 GGADHFLAGCHDWAPYETR---HHMEHCIKALCNADVTVGFKIGRDVSLPETYVRSARNP 486
Query: 238 --PRGGRDVENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVYQRR 294
GG+ R +L F+AG+ + +R IL + W++ + ++ I + Y +
Sbjct: 487 LRDLGGKPASQRHILAFYAGNMHGYVRPILLKYWKDKNPDMKIFGPMPPGVASKMNYIQH 546
Query: 295 FYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVY 354
+K+CICP G +VNS R+ ++I Y CVPVI+S+ + PF ++LDW F+VI+ E+D+
Sbjct: 547 MKSSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEVLDWEAFSVIVAEKDIP 606
Query: 355 QLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
L+ IL +I ++ + + KVQKHF W++ P+++D FHM ++ +W R
Sbjct: 607 NLQDILLSIPKDRYLEMQLRVRKVQKHFLWHAKPLKYDLFHMTLHSIWYNR 657
>gi|125540270|gb|EAY86665.1| hypothetical protein OsI_08049 [Oryza sativa Indica Group]
Length = 482
Score = 222 bits (566), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 132/373 (35%), Positives = 192/373 (51%), Gaps = 24/373 (6%)
Query: 53 STLSFAINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGK 112
ST+ A DY G+ ++ + FH +Y+EM FKI+ Y +G+P + P T
Sbjct: 107 STVVAAAGDDDYVPRGA-IYRNARAFHRSYVEMERRFKIWTYREGEPPVAHIGP--GTDI 163
Query: 113 YASEGYFFQNIRE--SRFLTHDPDRAHLFFIPISC-------HKMRGKGTSYENMTVIVK 163
Y+ EG F + + SRF PD AH F +PIS +++ G + +V
Sbjct: 164 YSIEGQFMYEMDDPRSRFAARRPDDAHAFLLPISVCNLVHYVYRLNATG-DLAPLRGLVA 222
Query: 164 DYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPH 223
DYV + +YPYWNR+ GADH V+CHD T L NAIRV+C+ + F P
Sbjct: 223 DYVRVVAERYPYWNRSRGADHVIVSCHDWAPMVTSAHRQLYGNAIRVLCNANTSEGFRPR 282
Query: 224 KDVALPQV-LQPFAL--PRGGRDVENRTVLGFWAGHRNSKIRVILARVW-------ENDT 273
KD LP+V L L P G ENRT L F+AG R+ IR L R W
Sbjct: 283 KDATLPEVNLADGVLRRPTAGLPPENRTTLAFFAGGRHGHIRESLLRHWLIGNKGGAAAD 342
Query: 274 ELDISNNRINRAI-GPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYD 332
+ R++ + Y + +FC+CP G +V S R+ +S+ GCVPVI+S Y
Sbjct: 343 GDGDGDMRVHEYLPAGEDYHAQMAAARFCLCPSGFEVASPRVVESVFAGCVPVIISEGYP 402
Query: 333 LPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFD 392
PF D+LDW K +V + + +L+ IL+ +S+ + L +++ Q+HF + P RFD
Sbjct: 403 PPFGDVLDWGKMSVAVPAARIPELRAILRRVSERRYRVLRARVLQAQRHFVLHRPARRFD 462
Query: 393 AFHMVMYELWLRR 405
HMV++ +WLRR
Sbjct: 463 MIHMVLHSIWLRR 475
>gi|357118563|ref|XP_003561022.1| PREDICTED: probable glycosyltransferase At5g25310-like
[Brachypodium distachyon]
Length = 479
Score = 221 bits (564), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 122/361 (33%), Positives = 194/361 (53%), Gaps = 30/361 (8%)
Query: 71 VFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR-----E 125
V+ +P+ FH +Y+EM FK+++Y +G+P + P K Y EG F + +
Sbjct: 112 VYRNPAAFHRSYVEMERRFKVHVYAEGEPPILHAGPCK--NIYTIEGRFIEQLELMSPGG 169
Query: 126 SRFLTHDPDRAHLFFIPISCHKMRGKG---TSYEN--MTVIVKDYVDSLINKYPYWNRTL 180
+ T DP+RAH FF+P S +M SY+ + +V DYV + ++ +WNR+
Sbjct: 170 AGVRTWDPERAHAFFLPFSVSQMVQFAYVPLSYDRAPLRALVADYVRVVAARHRFWNRSS 229
Query: 181 GADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV-------LQ 233
GADHF ++CHD G A+ G P L N IR +C+ + F P KDV++P++ +
Sbjct: 230 GADHFMLSCHDWGPEASRGDPELYGNGIRALCNANTSEGFRPGKDVSIPEINLYDGDTPR 289
Query: 234 PFALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWEN---------DTELDISNNRINR 284
LP G + R L F+AG R+ +R +L R W+ + +L + N
Sbjct: 290 QLLLPAPG--LSERPYLAFFAGGRHGHVRDLLLREWKGRDPDNFPVYEYDLPTTTNTTGG 347
Query: 285 AIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKF 344
Y + +FC+CP G +V S R+ ++IH GCVPV++S+ Y PF D+L W F
Sbjct: 348 GDKQHDYYSYMQRARFCLCPSGHEVASPRVVEAIHAGCVPVLVSDGYAPPFADVLRWEGF 407
Query: 345 AVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLR 404
+V + D+ +L+++L++I AE L + V++HF PP R D FHM+++ +WLR
Sbjct: 408 SVSVPVADIPRLREVLESIPAAEVERLRDGGRLVKQHFTLRQPPERLDMFHMILHSVWLR 467
Query: 405 R 405
R
Sbjct: 468 R 468
>gi|357520911|ref|XP_003630744.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
gi|355524766|gb|AET05220.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
Length = 653
Score = 221 bits (564), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 122/364 (33%), Positives = 206/364 (56%), Gaps = 16/364 (4%)
Query: 53 STLSFAINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGK 112
S + A + +L + +F + S+F +Y M K+YIY +G+ F+Q L G
Sbjct: 286 SEIEHAPTVTNDKELYAPLFRNHSMFKRSYELMELTLKVYIYMEGNKPIFHQPI--LKGL 343
Query: 113 YASEGYFFQNIRESR-FLTHDPDRAHLFFIPISCHKMR-----GKGTSYENMTVIVKDYV 166
YASEG+F + + E++ F+ DP +AHLF++P S + + N+ +K+Y
Sbjct: 344 YASEGWFMKLMEENKQFVVKDPAKAHLFYMPFSSRMLEFSVYVRNSHNRTNLRQYLKEYT 403
Query: 167 DSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDV 226
D + KY Y+NRT GADHF V CHD T ++ I+ +C+ F +DV
Sbjct: 404 DKISAKYRYFNRTGGADHFLVACHDWAPYETR---HHMEYCIKALCNSDVTQGFKIGRDV 460
Query: 227 ALPQVL-QPFALPR---GGRDVENRTVLGFWAGHRNSKIRVILARVW-ENDTELDISNNR 281
+LP+ + + P+ GG+ + R++L F+AG+ + +R IL + W E D ++ I
Sbjct: 461 SLPETMVRSVRNPQRDLGGKPPQQRSILAFYAGNMHGYLRSILLKHWKEKDPDMKIFGPM 520
Query: 282 INRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDW 341
+ + Y +K+CICP G +VNS R+ ++I Y CVPVI+S+ + PF ++L+W
Sbjct: 521 PHGVAHKMNYIEHMKSSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEVLNW 580
Query: 342 RKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYEL 401
F++IL E+D+ LKQIL ++ + +++ L + +VQKHF W++ P+++D FHM ++ +
Sbjct: 581 DSFSLILAEKDIPNLKQILLSVPEEKYLKLQLGVRRVQKHFLWHTKPLKYDLFHMTLHSI 640
Query: 402 WLRR 405
W R
Sbjct: 641 WYNR 644
>gi|356511373|ref|XP_003524401.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 643
Score = 221 bits (563), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 123/351 (35%), Positives = 200/351 (56%), Gaps = 16/351 (4%)
Query: 66 DLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRE 125
+L + +F + S+F +Y M K+YIY DG+ F+Q K G YASEG+F + + E
Sbjct: 291 ELYAPLFRNLSMFKRSYELMERTLKVYIYKDGNKPIFHQPIMK--GLYASEGWFMKLMEE 348
Query: 126 SR-FLTHDPDRAHLFFIPISCHKMRG-----KGTSYENMTVIVKDYVDSLINKYPYWNRT 179
++ F+ DP +AHLF++P S + + N+ +KDY D + KY Y+NRT
Sbjct: 349 NKHFVLKDPAKAHLFYMPFSSRMLEHALYVRNSHNRTNLRQFLKDYTDKISAKYRYFNRT 408
Query: 180 LGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV-LQPFALP 238
GADHF V CHD T ++ I+ +C+ F +DV+LP+ ++ P
Sbjct: 409 GGADHFLVACHDWAPYETR---HHMEYCIKALCNADVTQGFKIGRDVSLPEAYVRSVRDP 465
Query: 239 R---GGRDVENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVYQRR 294
+ GG+ R +L F+AG+ + +R IL + W++ D ++ I + A + Y
Sbjct: 466 QRDLGGKPPHQRPILAFYAGNMHGYLRPILLKHWKDKDPDMKIYGPMPHGAASKMNYINH 525
Query: 295 FYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVY 354
+K+CICP G +VNS R+ ++I Y CVPVI+S+ + PF ++L+W F++IL E+D+
Sbjct: 526 MKNSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEVLNWDAFSIILAEKDIP 585
Query: 355 QLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
LKQIL ++S +++ L + K QKHF W+ P+++D FHM ++ +W R
Sbjct: 586 NLKQILLSVSQEKYLKLQLGVRKAQKHFFWHVKPLKYDLFHMTLHSIWYNR 636
>gi|297805042|ref|XP_002870405.1| hypothetical protein ARALYDRAFT_493575 [Arabidopsis lyrata subsp.
lyrata]
gi|297316241|gb|EFH46664.1| hypothetical protein ARALYDRAFT_493575 [Arabidopsis lyrata subsp.
lyrata]
Length = 509
Score = 221 bits (563), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 117/359 (32%), Positives = 199/359 (55%), Gaps = 31/359 (8%)
Query: 69 SDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNI----- 123
S ++ +P+ FH ++ EM+ FK++ Y +G+ F+ P + Y EG F +
Sbjct: 153 SSLYKNPAAFHQSHTEMMNRFKVWTYTEGEVPLFHDGP--VNDIYGIEGQFMDEMCVDGP 210
Query: 124 -RESRFLTHDPDRAHLFFIPISCHKM---------RGKGTSYENMTVIVKDYVDSLINKY 173
SRF P+ AH+FFIP S K+ +G S + +++DYVD + K+
Sbjct: 211 KSRSRFRADHPEDAHVFFIPFSVAKVIHFVYKPITSVEGFSRARLHRLIEDYVDVVATKH 270
Query: 174 PYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQ 233
YWNR+ G DHF V+CHD +G P L + IR +C+ + F P+ DV++P++
Sbjct: 271 RYWNRSKGGDHFMVSCHDWAPDVIDGNPKLFEKFIRALCNANTSEGFRPNVDVSIPEIY- 329
Query: 234 PFALPRG-------GRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAI 286
LP+G G+ R++L F+AG + +IR IL + W+ E+D +R
Sbjct: 330 ---LPKGKLGPSFLGKSPRIRSILAFFAGRSHGEIRKILFKHWK---EMDNEVQVYDRLP 383
Query: 287 GPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAV 346
Y + +KFC+CP G +V S R ++I+ GCVPVI+S+ Y LPF+D+L+W F++
Sbjct: 384 PGKDYTKTMGMSKFCLCPSGWEVASPREVEAIYAGCVPVIISDNYSLPFSDVLNWDSFSI 443
Query: 347 ILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+ + ++K IL+++S ++ ++ +++V++HF N P +D HM+++ +WLRR
Sbjct: 444 QIPVSRIPEIKTILQSVSLVRYLKMYKRVLEVKQHFVLNRPAKPYDVMHMMLHSIWLRR 502
>gi|15239502|ref|NP_197954.1| Exostosin family protein [Arabidopsis thaliana]
gi|110738111|dbj|BAF00988.1| hypothetical protein [Arabidopsis thaliana]
gi|332006108|gb|AED93491.1| Exostosin family protein [Arabidopsis thaliana]
Length = 654
Score = 221 bits (562), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 123/354 (34%), Positives = 197/354 (55%), Gaps = 22/354 (6%)
Query: 67 LGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRES 126
L + ++ + S+F +Y M K+Y Y +G+ + L G YASEG+F NI ES
Sbjct: 302 LYAPLYRNVSMFKRSYELMEKILKVYAYKEGNKPIMHSP--ILRGIYASEGWFM-NIIES 358
Query: 127 ---RFLTHDPDRAHLFFIPISCHKMR-----GKGTSYENMTVIVKDYVDSLINKYPYWNR 178
+F+T DP +AHLF++P S + S+ N+ +KDY+D + KYP+WNR
Sbjct: 359 NNNKFVTKDPAKAHLFYLPFSSRMLEVTLYVQDSHSHRNLIKYLKDYIDFISAKYPFWNR 418
Query: 179 TLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVL-----Q 233
T GADHF CHD T + +IR +C+ F+ KD +LP+ +
Sbjct: 419 TSGADHFLAACHDWAPSETRK---HMAKSIRALCNSDVKEGFVFGKDTSLPETFVRDPKK 475
Query: 234 PFALPRGGRDVENRTVLGFWAGHRNSK-IRVILARVWENDTELDISN-NRINRAIGPLVY 291
P + GG+ R +L F+AG + +R IL W N+ + D+ ++ R G Y
Sbjct: 476 PLSN-MGGKSANQRPILAFFAGKPDHGYLRPILLSYWGNNKDPDLKIFGKLPRTKGNKNY 534
Query: 292 QRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNER 351
+ +K+CIC G +VNS R+ ++I Y CVPVI+S+ + PF ++L+W FA+ + E+
Sbjct: 535 LQFMKTSKYCICAKGFEVNSPRVVEAIFYDCVPVIISDNFVPPFFEVLNWESFAIFIPEK 594
Query: 352 DVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
D+ LK+IL +I ++ + S+ + KVQKHF W++ P ++D FHM+++ +W R
Sbjct: 595 DIPNLKKILMSIPESRYRSMQMRVKKVQKHFLWHAKPEKYDMFHMILHSIWYNR 648
>gi|224143179|ref|XP_002324871.1| predicted protein [Populus trichocarpa]
gi|222866305|gb|EEF03436.1| predicted protein [Populus trichocarpa]
Length = 334
Score = 220 bits (561), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 125/334 (37%), Positives = 185/334 (55%), Gaps = 19/334 (5%)
Query: 85 MVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRES--RFLTHDPDRAHLFFIP 142
M FK+++Y +G+ + P K Y EG F + RF T DP RAH++F+P
Sbjct: 1 MEKRFKVHVYSEGELPIVHDGPCK--DIYTIEGRFIHEMEHGAKRFKTRDPRRAHVYFMP 58
Query: 143 ISCHKMRG---KGTSYEN--MTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRAT 197
S M K +Y++ M V DYV + +KYP+WNRT GADHF ++CHD G A+
Sbjct: 59 FSVTWMVKYLYKPLTYDHTAMKQFVADYVRVVSSKYPFWNRTQGADHFMLSCHDWGPHAS 118
Query: 198 EGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQ-----PFALPRGGRDVENRTVLGF 252
G PFL +IRV+C+ + F P KDV+LP++ P L R L F
Sbjct: 119 HGNPFLYNTSIRVLCNANSSEGFSPRKDVSLPEIHLYGGNVPPKLISPPPATSPRPYLAF 178
Query: 253 WAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNS 311
++G + IR IL W+ D +L + L Y ++KFC+CP G +V S
Sbjct: 179 FSGGLHGPIRPILLDHWKGRDPDLQVYEYLPKD----LDYYSFMLRSKFCLCPSGHEVAS 234
Query: 312 ARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSL 371
RI ++I+ CVPVILS++Y LPF+D+L W FA+ +N ++ +LK++L ++ + + L
Sbjct: 235 PRIVEAIYAECVPVILSDHYVLPFSDVLRWEAFAIQVNVSEIPRLKEVLISVPEERYRRL 294
Query: 372 HNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
L ++KHF N P RFD FHM+++ +WLRR
Sbjct: 295 KEGLRAIRKHFVLNQPAKRFDVFHMILHSIWLRR 328
>gi|224142181|ref|XP_002324437.1| predicted protein [Populus trichocarpa]
gi|222865871|gb|EEF03002.1| predicted protein [Populus trichocarpa]
Length = 334
Score = 220 bits (561), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 117/333 (35%), Positives = 187/333 (56%), Gaps = 18/333 (5%)
Query: 85 MVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR-ESRFLTHDPDRAHLFFIPI 143
M K+Y+Y +G+ F+Q+ K+ G YASEG+F + I +F+ DP +AHLF++P
Sbjct: 1 MERMLKVYVYKEGEKPIFHQS--KMRGIYASEGWFMKLIEGNKKFVVRDPRKAHLFYLPF 58
Query: 144 SCHKMRGK-----GTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATE 198
S H +R + + + +K+YVD + KY +WNRT G DHF V CHD + T
Sbjct: 59 SPHMLRTALFDHNSLNQKELAEFLKNYVDLVAKKYSFWNRTGGTDHFLVGCHDWASQMTR 118
Query: 199 GVPFLIKNAIRVVCSPSYDVAFIPHKDVALP-----QVLQPFALPRGGRDVENRTVLGFW 253
++N IRV+C+ + F KD LP V P GG+ R +L F+
Sbjct: 119 ---HHMRNCIRVLCNSNVAKGFKIGKDTTLPVTYIRSVENPLK-ELGGKSPSERPILAFF 174
Query: 254 AGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSA 312
AG+ + +R IL WEN + ++ I G Y+ ++K+CIC G +V++
Sbjct: 175 AGNMHGYLRPILLEYWENKEPDMKILGPMSRDIAGKRRYREYMKRSKYCICARGYEVHTP 234
Query: 313 RIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLH 372
R+ +SI Y CVPVI+S+ Y P ++L+W F+V + E+D+ L+ IL +I ++V++
Sbjct: 235 RVVESIFYECVPVIISDNYVPPLFEVLNWEAFSVFIQEKDIPNLRNILLSIPQEKYVAMQ 294
Query: 373 NNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+ KVQ+HF W+ PV++D FHM+++ +W R
Sbjct: 295 LGVKKVQQHFLWHKKPVKYDLFHMILHSVWHSR 327
>gi|224092294|ref|XP_002309547.1| predicted protein [Populus trichocarpa]
gi|222855523|gb|EEE93070.1| predicted protein [Populus trichocarpa]
Length = 471
Score = 220 bits (561), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 121/345 (35%), Positives = 193/345 (55%), Gaps = 23/345 (6%)
Query: 76 SVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR-ESRFLTHDPD 134
S+F +Y M K+YIY +G+ F+Q L G YASEG+F + + +F+T D
Sbjct: 128 SMFKKSYELMEDILKVYIYKEGEMPIFHQP--LLNGIYASEGWFMKLLEGNKKFVTKDSK 185
Query: 135 RAHLFFIPISCHKMRGK-----GTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTC 189
+AHLF++P S + + S++N+ +K Y+D + KYP+WNRT GADHF C
Sbjct: 186 KAHLFYLPFSSRYLEIRLYVPNSHSHKNLIEYLKKYLDMISEKYPFWNRTQGADHFLAAC 245
Query: 190 HDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPR-------GGR 242
HD T + N IR +C+ F+ KD +LP+ + L + GG
Sbjct: 246 HDWAPSETRQ---HMANCIRALCNSDAKEDFVYGKDASLPET---YVLTQENPLRDLGGN 299
Query: 243 DVENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRIN-RAIGPLVYQRRFYKTKF 300
R++L F+AG + +R IL + WEN D ++ I + G + Y R +K+
Sbjct: 300 RASKRSILAFFAGSMHGYLRPILLQHWENKDPDMKIFGRLPKVKGRGKMNYARYMKSSKY 359
Query: 301 CICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQIL 360
CIC G +VNS R+ ++I Y CVPVI+S+ + PF ++L+W FAV + E+D+ LK+IL
Sbjct: 360 CICAKGYEVNSPRVVEAIFYECVPVIISDNFVPPFLEVLNWESFAVFVLEKDIPNLKKIL 419
Query: 361 KNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+I ++ + + +VQ+HF W++ PV++D FHM+++ +W R
Sbjct: 420 LSIPAKKYRRMQMRVKRVQQHFLWHARPVKYDVFHMILHSIWYNR 464
>gi|242043198|ref|XP_002459470.1| hypothetical protein SORBIDRAFT_02g005180 [Sorghum bicolor]
gi|241922847|gb|EER95991.1| hypothetical protein SORBIDRAFT_02g005180 [Sorghum bicolor]
Length = 584
Score = 220 bits (560), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 131/392 (33%), Positives = 210/392 (53%), Gaps = 28/392 (7%)
Query: 37 LSFNSFRSYPLQNNFNSTLSFAINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPD 96
L RS ++ N + +DY +G ++ + FH +YLEM K+Y+Y +
Sbjct: 187 LGLAKARSAIMEAIRNKDKRSPLPDKDYVPMGP-IYRNAHAFHRSYLEMEKQLKVYVYEE 245
Query: 97 GDPNTFYQTPRKLTGKYASEGYFFQNIR-ESRFLTHDPDRAHLFFIPISCHKMRGKGTSY 155
G+P F+ P + Y++EG F + +R T DP +AH+FF+P S KM T Y
Sbjct: 246 GEPPVFHDGPCRSI--YSTEGRFIHAMETATRLRTSDPSQAHVFFLPFSVVKM--VKTIY 301
Query: 156 EN-------MTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAI 208
E + V DY+ + +KYP+WNR+ GADHF ++CHD G + L N+I
Sbjct: 302 EPGSHDMAPLKRTVADYLRVISDKYPFWNRSAGADHFMLSCHDWGPYVSSANAELFGNSI 361
Query: 209 RVVCSPSYDVAFIPHKDVALPQV-LQPFALPR--GGRDVENRTVLGFWAGHRNSKIRVIL 265
RV+C+ + F KDV+LP++ L+ A+ R GG R L F+AG + +R L
Sbjct: 362 RVLCNANTSEGFDLAKDVSLPEINLRSDAVERQVGGPSASRRPFLAFFAGGNHGPVRPAL 421
Query: 266 ARVW------ENDTELDISN----NRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIA 315
W E+D ++ +S Y +++FC+CPGG +V S R+A
Sbjct: 422 LAHWGPGSGREDDPDVRVSEYLPTRGGRAGASAAAYTDMMRRSRFCLCPGGYEVASPRLA 481
Query: 316 DSIHYGCVPVILSN-YYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNN 374
++++ CVPV++ + Y LPF D+L+W FAV + DV ++K+IL +S +++ +
Sbjct: 482 EAVYLECVPVVVDDGEYALPFADVLNWDAFAVRVRVADVPRIKEILSAVSPRQYIRMQRR 541
Query: 375 LVKVQKHFQ-WNSPPVRFDAFHMVMYELWLRR 405
+ V++HF PP R+DAFHM+++ +WLRR
Sbjct: 542 VRMVRRHFMVHGGPPRRYDAFHMILHSVWLRR 573
>gi|255582309|ref|XP_002531945.1| catalytic, putative [Ricinus communis]
gi|223528391|gb|EEF30427.1| catalytic, putative [Ricinus communis]
Length = 453
Score = 219 bits (559), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 121/372 (32%), Positives = 197/372 (52%), Gaps = 43/372 (11%)
Query: 58 AINKQDYSDLGSD-------VFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLT 110
A+ ++YS + V+ +P FH +++EM K++++ G P +
Sbjct: 93 AVESRNYSSYKEETYIPRGVVYRNPYAFHQSHIEMEKKLKVWVH--GGP---------VK 141
Query: 111 GKYASEGYFFQNIRESR--FLTHDPDRAHLFFIPISCHKMRG------KGTSYENMTVIV 162
Y EG F + R F+ P+ A +FF+P+S + S + + +V
Sbjct: 142 NIYGVEGQFIDEMERGRSHFIARRPEEAQVFFLPVSVANIINFIYKPIVTYSRDQLQRLV 201
Query: 163 KDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIP 222
DYV ++ NKYPYWNR+ GADHF V+CHD + P L +N IRV+C+ + F P
Sbjct: 202 TDYVGTVANKYPYWNRSSGADHFLVSCHDWAPDISTANPELYRNFIRVLCNANTSERFNP 261
Query: 223 HKDVALPQVLQPFALPRG--------GRDVENRTVLGFWAGHRNSKIRVILARVWEN-DT 273
+DV++P++ +P G NRT+L F+AG + IR +L W+ D+
Sbjct: 262 KRDVSIPEI----NIPSGKLGPPLHQASSPSNRTILAFFAGGSHGYIRKLLLEHWKGKDS 317
Query: 274 ELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDL 333
E+ + + Y + +++FC+CP G +V S R+ +I GCVPV +S+ Y L
Sbjct: 318 EIQVHEYLDKKQ----NYFKLMGQSRFCLCPSGYEVASPRVVTAIQLGCVPVTISDNYTL 373
Query: 334 PFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDA 393
PF+DILDW KF+V + + ++K ILK IS +++ +H ++ V++HF N P FD
Sbjct: 374 PFSDILDWSKFSVHIPSGKIQEIKTILKGISPRQYLKMHKRVMLVRRHFMLNRPAQPFDM 433
Query: 394 FHMVMYELWLRR 405
HM+++ +WLRR
Sbjct: 434 IHMMLHSIWLRR 445
>gi|297807149|ref|XP_002871458.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297317295|gb|EFH47717.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 336
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 117/332 (35%), Positives = 189/332 (56%), Gaps = 16/332 (4%)
Query: 85 MVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRE--SRFLTHDPDRAHLFFIP 142
M FKI+ Y +G+ F++ P L YA EG F I SR+ P+ A +F+IP
Sbjct: 1 MEKRFKIWTYREGEAPLFHKGP--LNNIYAIEGQFMDEIENGNSRYKAASPEEATVFYIP 58
Query: 143 ISCHKM----RGKGTSY--ENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRA 196
+ + TSY + + IVKDY+ + N+YPYWNR+ GADHFF++CHD
Sbjct: 59 VGIVNIIRFVYRPYTSYARDRLQNIVKDYISLISNRYPYWNRSRGADHFFLSCHDWAPDV 118
Query: 197 TEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQP---FALPRGGRDVENRTVLGFW 253
+ P L ++ IR +C+ + F P +DV+LP++ P G +NR +L F+
Sbjct: 119 SAVDPELYRHFIRALCNANASEGFTPMRDVSLPEINIPHSQLGFVHTGEAPQNRKLLAFF 178
Query: 254 AGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSAR 313
AG + ++R IL W+ + + + + + Y + K KFC+CP G +V S R
Sbjct: 179 AGGSHGEVRKILFEQWKEKDKDVLVYEYLPKTMN---YTKMMDKAKFCLCPSGWEVASPR 235
Query: 314 IADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHN 373
I +S++ GCVPVI+++ Y LPF+D+L+W+ F+V + + +K+IL+ IS+ E++ +
Sbjct: 236 IVESLYSGCVPVIIADSYVLPFSDVLNWKTFSVHIPISKMPDIKKILEAISEEEYLEMQR 295
Query: 374 NLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+++V+KHF N P +D HM+M+ +WLRR
Sbjct: 296 RVLEVRKHFVINRPSKPYDMLHMIMHSIWLRR 327
>gi|255546379|ref|XP_002514249.1| catalytic, putative [Ricinus communis]
gi|223546705|gb|EEF48203.1| catalytic, putative [Ricinus communis]
Length = 334
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 118/328 (35%), Positives = 189/328 (57%), Gaps = 16/328 (4%)
Query: 89 FKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR-ESRFLTHDPDRAHLFFIPISCHK 147
K+Y+Y +G+ F+Q+ K+ G YASEG+F ++I +FL DP +AHLFF+P S
Sbjct: 5 LKVYVYKEGEKPIFHQS--KMRGIYASEGWFMKHIEGNKKFLVKDPRKAHLFFLPFSPQM 62
Query: 148 MRG-----KGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPF 202
+R K S +++ +K+YVD + KY +WNRT G DHF V CHD R T
Sbjct: 63 LRTVIFGQKLQSQKDLEEYLKNYVDLVARKYSFWNRTGGTDHFLVGCHDWASRITRK--- 119
Query: 203 LIKNAIRVVCSPSYDVAFIPHKDVALP-QVLQPFALPR---GGRDVENRTVLGFWAGHRN 258
++N IRV+C+ + F KD LP ++ P GG+ R L F+AG +
Sbjct: 120 YMQNCIRVLCNANVAKGFKIGKDTTLPVTYIRSAENPLKDVGGKHPSERYTLAFFAGGMH 179
Query: 259 SKIRVILARVWEN-DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADS 317
+R IL + WEN ++++ I G +Y+ +K+CIC G +V++ RI ++
Sbjct: 180 GYLRPILVQFWENKESDMKIFGPMPRDIEGKRLYREYMKSSKYCICARGYEVHTPRIVEA 239
Query: 318 IHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVK 377
I Y CVPVI+S+ Y PF ++L+W F+V + E+D+ L+ IL +I + +++ + +
Sbjct: 240 ILYECVPVIISDNYVPPFFEVLNWEAFSVFVQEKDIPNLRSILLSIPEEKYLEMQLRVKM 299
Query: 378 VQKHFQWNSPPVRFDAFHMVMYELWLRR 405
VQ+HF W+ PV++D FHM+++ +W R
Sbjct: 300 VQQHFLWHKNPVKYDLFHMILHSVWHNR 327
>gi|224143183|ref|XP_002324873.1| predicted protein [Populus trichocarpa]
gi|222866307|gb|EEF03438.1| predicted protein [Populus trichocarpa]
Length = 335
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 120/338 (35%), Positives = 192/338 (56%), Gaps = 27/338 (7%)
Query: 85 MVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR--ESRFLTHDPDRAHLFFIP 142
M F+++ Y +G+ F++ P + Y+ EG + +S F +PD A FFIP
Sbjct: 1 MEKRFRVWTYREGEQPLFHRGP--MNDIYSIEGQIIDELDSGKSPFSAKNPDEALAFFIP 58
Query: 143 ISC----HKMRGKGTSYENMTV--IVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRA 196
+S H + +Y + I +DY+ + +KYPYWNR+ GADHF ++CHD
Sbjct: 59 VSIASILHFIYRPYVTYSRKQIQDIAEDYIGLISSKYPYWNRSSGADHFMISCHDWAPDV 118
Query: 197 TEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVE--------NRT 248
+ P L +N IRV+C+ + F P +DV+LP+ F LPRG + E NR+
Sbjct: 119 SAANPDLYRNFIRVLCNANSSEGFKPARDVSLPE----FKLPRGKLEPEHILQPCDNNRS 174
Query: 249 VLGFWAGHRNSKIRVILARVW-ENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGS 307
+L F+AG + +R IL + W E D ++ + L Y + K+++C+CP G
Sbjct: 175 ILAFFAGGSHGSVRKILFKHWKEKDNDIQVYKYLPET----LNYTEQMSKSRYCLCPSGW 230
Query: 308 QVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAE 367
+V S R+ ++I+ GCVPVI+S+YY LPF+D+LDW KF+V + + ++K IL++I E
Sbjct: 231 EVASPRVVEAIYSGCVPVIISDYYVLPFSDVLDWIKFSVHIPVSGIPEIKTILQSIPVEE 290
Query: 368 FVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
++ +++VQ+HF+ + P FD HMVM+ +WLRR
Sbjct: 291 YLEKQKRVLQVQQHFKLHRPAKPFDVVHMVMHSVWLRR 328
>gi|359484343|ref|XP_002283936.2| PREDICTED: uncharacterized protein LOC100268163 [Vitis vinifera]
Length = 738
Score = 218 bits (556), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 120/351 (34%), Positives = 194/351 (55%), Gaps = 16/351 (4%)
Query: 66 DLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNI-R 124
+L + +F + S+F +Y M K+Y+Y DG+ F+Q L G YASEG+F + + R
Sbjct: 386 ELHAPLFRNVSMFKRSYELMERILKVYVYKDGEKPIFHQP--ILKGLYASEGWFMKLMER 443
Query: 125 ESRFLTHDPDRAHLFFIPISCHKMRGK-----GTSYENMTVIVKDYVDSLINKYPYWNRT 179
F+ DP +A LF++P S + K + N+ +K Y + + KY +WNRT
Sbjct: 444 NKHFVVKDPRQAQLFYMPFSSRMLEYKLYVRNSHNRTNLRQYLKQYSEKIAAKYRFWNRT 503
Query: 180 LGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFAL-- 237
GADHF V CHD T ++ I+ +C+ F +DV+LP+ A
Sbjct: 504 GGADHFLVACHDWAPYETR---HHMEQCIKALCNADVTAGFKIGRDVSLPETYVRSARNP 560
Query: 238 --PRGGRDVENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVYQRR 294
GG+ R +L F+AG+ + +R IL + W++ D ++ I + Y +
Sbjct: 561 LRDLGGKPPSERHILAFYAGNMHGYLRPILLKYWKDKDPDMKIYGPMPPGVASKMNYIQH 620
Query: 295 FYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVY 354
+KFCICP G +VNS R+ ++I Y CVPVI+S+ + PF D+LDW F++IL E+D+
Sbjct: 621 MKSSKFCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFDVLDWGAFSIILAEKDIP 680
Query: 355 QLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
LK +L +I + +++ + + KVQKHF W++ P+++D FHM ++ +W R
Sbjct: 681 NLKDVLLSIPNDKYLQMQLGVRKVQKHFLWHAKPLKYDLFHMTLHSIWYNR 731
>gi|242094892|ref|XP_002437936.1| hypothetical protein SORBIDRAFT_10g005120 [Sorghum bicolor]
gi|241916159|gb|EER89303.1| hypothetical protein SORBIDRAFT_10g005120 [Sorghum bicolor]
Length = 412
Score = 218 bits (555), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 128/391 (32%), Positives = 198/391 (50%), Gaps = 43/391 (10%)
Query: 53 STLSFAINKQDYSDLGSDVFHSPSVFH-----------LNYLEMVTNFKIYIYPDGDPNT 101
ST ++I+ D + + S +FH +Y EM FK+Y+Y +G+P
Sbjct: 14 STGEYSIDDGDPNPTERNTNDSHLIFHEYVRYGDCEASRSYREMERRFKVYVYEEGEPPI 73
Query: 102 FYQTPRKLTGKYASEGYFFQNIR----------ESRFLTHDPDRAHLFFIPISCHKM--- 148
+ P K Y EG F + + T DP RAH FF+P S KM
Sbjct: 74 LHAGPCKDI--YTIEGRFIEQLELMAPPPPAGGGGGARTSDPARAHAFFLPFSVSKMVQF 131
Query: 149 --RGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKN 206
R + IV DYV + +++PYWNR+ GADHF ++CHD G A+ G P L N
Sbjct: 132 AYRPDTYDKTPLRAIVADYVRVVASRHPYWNRSAGADHFMLSCHDWGPEASRGHPELHAN 191
Query: 207 AIRVVCSPSYDVAFIPHKDVALPQV-LQPFALPR-----GGRDVENRTVLGFWAGHRNSK 260
IR +C+ + F P +DV++P++ L +PR + +R L F+AG R+
Sbjct: 192 GIRALCNANTSEGFRPGQDVSVPEINLYTGDMPRQLLAPPAPPLASRPFLAFFAGGRHGH 251
Query: 261 IRVILARVWEN---------DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNS 311
+R +L R W+ + E + S+ R + PL Y + +FC+CP G +V S
Sbjct: 252 VRDLLLRRWKGHDPDVFPVYEHEHEHSHGRRQQDGAPLDYYWYMRRARFCLCPSGYEVAS 311
Query: 312 ARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSL 371
R+ ++IH CVPVILS+ Y LPF D+L W F+V + D+ +L+++L+ I E L
Sbjct: 312 PRVVEAIHAECVPVILSDGYALPFADVLRWEAFSVAVPVADIPRLREVLERIPAPEVERL 371
Query: 372 HNNLVKVQKHFQWNSPPVRFDAFHMVMYELW 402
+ V++HF + PP R D F+M+++ +W
Sbjct: 372 QRGVRLVKRHFMLHQPPERLDMFNMILHSVW 402
>gi|449466083|ref|XP_004150756.1| PREDICTED: probable glycosyltransferase At3g07620-like [Cucumis
sativus]
Length = 341
Score = 218 bits (554), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 116/339 (34%), Positives = 192/339 (56%), Gaps = 15/339 (4%)
Query: 77 VFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESR-FLTHDPDR 135
+F +Y M K+Y+Y +G+ F+Q LTG YASEG+F + + +++ F+ DP++
Sbjct: 1 MFTRSYELMEKMLKVYVYDEGEKPIFHQP--ILTGIYASEGWFMKLLEDNKKFVVKDPEK 58
Query: 136 AHLFFIPISCHKMRGK-GTSYEN---MTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHD 191
AHLF++P S +R G + N + +K+Y+D + KY +WN+ G+DHF V CHD
Sbjct: 59 AHLFYLPFSSQFLRSAFGNKFRNKRDLQKPLKNYIDVIGKKYRFWNKNGGSDHFLVACHD 118
Query: 192 VGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVL----QPFALPRGGRDVENR 247
+ T+ L+KN IR +C+ + F KD +LP + GG+ R
Sbjct: 119 WAPKLTKR---LVKNCIRALCNANGAGDFEIGKDTSLPVTFVHSTEDLITKIGGKPPSER 175
Query: 248 TVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGG 306
T L F+AG + +R IL WEN + ++ I N G Y + +K+CIC G
Sbjct: 176 TTLAFFAGSMHGYLRPILLHYWENKEPDMMIVGPMPNSIEGKNAYMEQMKSSKYCICARG 235
Query: 307 SQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDA 366
QV+S R+ ++I C+PVI+S+ Y P ++L+W F+V + ER++ L+ IL +I +
Sbjct: 236 YQVHSPRVIEAILNECIPVIISDNYVPPLFEVLNWESFSVFVKEREIPNLRDILLSIPEE 295
Query: 367 EFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+ ++H+ + VQ+HF W+ P ++DAFHM+++ +W R
Sbjct: 296 NYRAMHSRVKMVQQHFLWHEKPAKYDAFHMILHSIWYTR 334
>gi|449462352|ref|XP_004148905.1| PREDICTED: probable glycosyltransferase At5g25310-like [Cucumis
sativus]
gi|449523501|ref|XP_004168762.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
At5g25310-like [Cucumis sativus]
Length = 684
Score = 218 bits (554), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 124/352 (35%), Positives = 194/352 (55%), Gaps = 19/352 (5%)
Query: 66 DLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRE 125
+L + +F + S F +Y M + K+YIY +G F+Q P L YASEG+F + I E
Sbjct: 332 NLYAPLFQNISRFKRSYELMESTLKVYIYREGARPIFHQGP--LQSIYASEGWFMK-ILE 388
Query: 126 S--RFLTHDPDRAHLFFIPISCHKMRG-----KGTSYENMTVIVKDYVDSLINKYPYWNR 178
S +F+T +P +AHLF++P S ++ S++N+ +K+Y+D + KYP+WNR
Sbjct: 389 SNKKFVTKNPRKAHLFYLPFSSRQLEEVLYVRDSHSHKNLIQHLKNYLDFIAAKYPHWNR 448
Query: 179 TLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFAL- 237
T GADHF V CHD T + IR +C+ F+ KDV+LP+ A
Sbjct: 449 TGGADHFLVACHDWAPAETRK---YMAKCIRALCNSDVKEGFVFGKDVSLPETFVRVARN 505
Query: 238 ---PRGGRDVENRTVLGFWAGHRNSKIRVILARVWE-NDTELDISNNRINRAIGPLVYQR 293
GG R +L F+AG + +R L WE D ++ IS + + G Y
Sbjct: 506 PLRDVGGNPSSKRPILAFFAGSMHGYLRSTLLEYWERKDPDMKISG-PMPKVKGSKNYLW 564
Query: 294 RFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDV 353
+K+CIC G +VNS R+ +SI Y CVPVI+S+ + P ++L+W FAV + E+D+
Sbjct: 565 HMKNSKYCICAKGYEVNSPRVVESILYECVPVIISDNFVPPLFEVLNWESFAVFVAEKDI 624
Query: 354 YQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
LK+IL +I + + + + K+Q HF W++ P ++D FHM+++ +W R
Sbjct: 625 PNLKKILLSIPEKRYREMQMRVKKLQPHFLWHAKPQKYDMFHMILHSIWYNR 676
>gi|147799433|emb|CAN76867.1| hypothetical protein VITISV_012309 [Vitis vinifera]
Length = 1908
Score = 218 bits (554), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 119/351 (33%), Positives = 193/351 (54%), Gaps = 16/351 (4%)
Query: 66 DLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNI-R 124
+L + +F + S+F +Y M K+Y+Y DG+ F+Q L G YASEG+F + + R
Sbjct: 1042 ELHAPLFRNVSMFKRSYELMERILKVYVYKDGEKPIFHQP--ILKGLYASEGWFMKLMER 1099
Query: 125 ESRFLTHDPDRAHLFFIPISCHKMRGK-----GTSYENMTVIVKDYVDSLINKYPYWNRT 179
F+ DP +A LF++P S + K + N+ +K Y + + KY +WNRT
Sbjct: 1100 NKXFVVKDPRQAQLFYMPFSSRMLEYKLYVRNSHNRTNLRQYLKQYSEKIAAKYRFWNRT 1159
Query: 180 LGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFAL-- 237
G DHF V CHD T ++ I+ +C+ F +DV+LP+ A
Sbjct: 1160 GGXDHFLVACHDWAPYETR---HHMEQCIKALCNADVTAGFKIGRDVSLPETYVRSARNP 1216
Query: 238 --PRGGRDVENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVYQRR 294
GG+ R +L F+AG+ + +R IL + W++ D ++ I + Y +
Sbjct: 1217 LRDLGGKPPSERHILAFYAGNMHGYLRPILLKYWKDKDPDMKIYGPMPPGVASKMNYIQH 1276
Query: 295 FYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVY 354
+KFCICP G +VNS R+ ++I Y CVPVI+S+ + PF D+LDW F++IL E+D+
Sbjct: 1277 MKSSKFCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFDVLDWGAFSIILAEKDIP 1336
Query: 355 QLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
LK +L +I + +++ + + KVQKHF W++ P+++D FHM ++ +W R
Sbjct: 1337 NLKDVLLSIPNEKYLQMQLGVRKVQKHFLWHAKPLKYDLFHMTLHSIWYNR 1387
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 126/396 (31%), Positives = 196/396 (49%), Gaps = 60/396 (15%)
Query: 67 LGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR-E 125
L + V+ + S+F +Y M KIYIY +G+ F+Q PR L G YASEG+F + I
Sbjct: 314 LYASVYRNVSMFKRSYELMERVLKIYIYREGEKPIFHQ-PR-LRGIYASEGWFMKLIEGN 371
Query: 126 SRFLTHDPDRAHLFFIPISCHKMRG-----KGTSYENMTVIVKDYVDSLINKYPYWNRTL 180
RF+ DP +AHLF++P S +R ++ ++ K+YV + KY +WNRT
Sbjct: 372 KRFVVRDPRKAHLFYVPFSSKMLRTVFYEQNSSTPRDLEKYFKNYVGLIAGKYRFWNRTG 431
Query: 181 GADHFFVTCHD--------------VGVRATEGVPFLIK--------------------- 205
GADH V CHD + +A PF+
Sbjct: 432 GADHLIVACHDWNPIYRTISTNTIRIKSQAITMPPFIFVGGESTYDLVSGTFSNKGFNSQ 491
Query: 206 ----------NAIRVVCSPSYDVAFIPHKDVALP-----QVLQPFALPRGGRDVENRTVL 250
N+IR +C+ + F KD LP + P GG+ R +L
Sbjct: 492 APRITRQCSWNSIRALCNSNIASGFKIGKDTTLPVTYIRKSEDPLKY-LGGKPPSQRPIL 550
Query: 251 GFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQV 309
F+AG + +R IL + WEN + ++ I G Y+ +K+CIC G +V
Sbjct: 551 AFFAGSMHGYLRPILLQYWENKEQDIKIFGPMSRDDGGKSRYRDHMKSSKYCICARGYEV 610
Query: 310 NSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFV 369
++ R+ ++I Y CVPVI+S+ Y PF +IL+W FAV + E+DV L+ IL +I + +++
Sbjct: 611 HTPRVVEAIFYECVPVIISDNYVPPFFEILNWEAFAVFILEKDVPNLRNILLSIPEEKYL 670
Query: 370 SLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+ + VQ+HF W+ PV++D FHM+++ +W R
Sbjct: 671 QMQMRVKMVQQHFLWHKKPVKYDLFHMILHSVWYNR 706
>gi|255546377|ref|XP_002514248.1| catalytic, putative [Ricinus communis]
gi|223546704|gb|EEF48202.1| catalytic, putative [Ricinus communis]
Length = 676
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 122/359 (33%), Positives = 197/359 (54%), Gaps = 19/359 (5%)
Query: 58 AINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEG 117
A+N QD L + +F + S F +Y M K+YIY DG F+ K G YASEG
Sbjct: 319 AVNDQD---LYAPLFRNISKFKRSYELMERTLKVYIYKDGKKPIFHLPIMK--GLYASEG 373
Query: 118 YFFQNIRESR-FLTHDPDRAHLFFIPISCHKMR-----GKGTSYENMTVIVKDYVDSLIN 171
+F + ++ ++ FL DP RAHLF++P S + + N+ +KDY + +
Sbjct: 374 WFMKLMQGNKHFLVKDPRRAHLFYMPFSSRMLEYTLYVRNSHNRTNLRQYLKDYSEKIAA 433
Query: 172 KYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV 231
KYP+WNRT GADHF V CHD T +++ I+ +C+ F +D++LP+
Sbjct: 434 KYPFWNRTDGADHFLVACHDWAPYETR---HHMEHCIKALCNADVTAGFKIGRDISLPET 490
Query: 232 LQPFAL----PRGGRDVENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAI 286
A GG+ R +L F+AG + +R IL + W++ D + I
Sbjct: 491 YVRSARNPLRDLGGKPPSQRHILAFYAGSMHGYLRPILLKYWKDKDPSMKIFGPMPPGVA 550
Query: 287 GPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAV 346
+ Y + +K+CICP G +VNS R+ ++I Y CVPVI+S+ + PF ++ +W F++
Sbjct: 551 SKMNYIQHMKSSKYCICPKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEVFNWGAFSL 610
Query: 347 ILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
IL E+D+ LK+IL +I + +++ + + KVQKHF W+ P+++D F+M ++ +W R
Sbjct: 611 ILAEKDIPNLKEILLSIPEEKYLEMQLGVRKVQKHFLWHPSPMKYDLFYMTLHAIWYNR 669
>gi|224120362|ref|XP_002331029.1| predicted protein [Populus trichocarpa]
gi|222872959|gb|EEF10090.1| predicted protein [Populus trichocarpa]
Length = 336
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 118/337 (35%), Positives = 187/337 (55%), Gaps = 24/337 (7%)
Query: 85 MVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR--ESRFLTHDPDRAHLFFIP 142
M F+I++Y +G+ + P L Y+ EG F + +S F PD AH F +P
Sbjct: 1 MEKRFRIWVYREGELPLVHSAPLNLI--YSIEGQFLDEMESGKSPFAASHPDEAHTFLLP 58
Query: 143 ISC----HKMRGKGTSYENMTV--IVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRA 196
IS H + ++ + + +V+DYV + KYPYWNRT GADHF V+CHD
Sbjct: 59 ISVAYIIHYIYRPLVTFSRVELQRLVQDYVTVVAGKYPYWNRTEGADHFLVSCHDWAPDI 118
Query: 197 TEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFAL---PRGGRDVENRTVLGFW 253
+ P L KN IRV+C+ + F P +DV++P++ PF P G R++ F+
Sbjct: 119 SRANPRLYKNFIRVLCNANTSERFEPRRDVSIPEINIPFGKFGPPGKGLPPSKRSIFAFF 178
Query: 254 AGHRNSKIRVILARVWEN-DTELDI----SNNRINRAIGPLVYQRRFYKTKFCICPGGSQ 308
AG + IR +L W++ D E+ + +N+ N Y + ++KFC+CP G +
Sbjct: 179 AGGAHGYIRKLLLEHWKDKDDEIQVHEYLDHNKKND------YFKLMGQSKFCLCPSGYE 232
Query: 309 VNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEF 368
V S R+ +I GC+PV +S+ Y LPF+D+LDW KF+V + + ++K ILK IS +
Sbjct: 233 VASPRVVTAIQSGCIPVTISDNYTLPFSDVLDWSKFSVNIPSEKIPEIKTILKKISFRRY 292
Query: 369 VSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+ L ++K+++HF+ N P +D HM+++ +WLRR
Sbjct: 293 LILQGRVIKIRRHFKLNRPAQPYDMLHMILHSIWLRR 329
>gi|356518344|ref|XP_003527839.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 637
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 122/353 (34%), Positives = 189/353 (53%), Gaps = 15/353 (4%)
Query: 63 DYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQN 122
+ +L + +F + S F +Y M K+Y+Y +GD + L G YASEG+F +
Sbjct: 283 NVENLYAPLFRNISRFKRSYELMEKTLKVYVYREGDKPIMHSP--YLLGIYASEGWFMRL 340
Query: 123 IRESR-FLTHDPDRAHLFFIPISCHKMRG-----KGTSYENMTVIVKDYVDSLINKYPYW 176
+ S+ F+T DP +AHLF++P S + S N+ +K+YVD + K+ +W
Sbjct: 341 MEASKQFVTKDPKKAHLFYLPFSSRMLEETLYVPNSHSSRNLIQYLKNYVDMIAGKHRFW 400
Query: 177 NRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV----L 232
NRT GADHF V CHD T + +R +C+ F+ KD++LP+
Sbjct: 401 NRTGGADHFLVACHDWAPTETRQ---HMARCLRALCNADVKEGFVLGKDISLPETYVRNA 457
Query: 233 QPFALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQ 292
Q GG V R L F+AG + +R IL + WEN + ++ G Y
Sbjct: 458 QKPTRNIGGNRVSKRKTLAFFAGGMHGYVRPILLQHWENKDPAMKIFGILPKSKGNRNYI 517
Query: 293 RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERD 352
+ +K+CIC G +VNS R+ ++I Y CVPVILS+ + PF ++L+W FAV + E+D
Sbjct: 518 QYMKSSKYCICAKGYEVNSPRVVEAILYECVPVILSDNFVPPFFEMLNWESFAVFVLEKD 577
Query: 353 VYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+ LK IL +I ++ + + KVQ+HF W+ PV++D FHMV++ +W R
Sbjct: 578 IPNLKNILLSIPQKRYLQMQMMVRKVQQHFLWHRSPVKYDIFHMVLHSIWYNR 630
>gi|359484345|ref|XP_002280595.2| PREDICTED: probable glycosyltransferase At5g03795-like [Vitis
vinifera]
gi|297738776|emb|CBI28021.3| unnamed protein product [Vitis vinifera]
Length = 665
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 128/358 (35%), Positives = 195/358 (54%), Gaps = 32/358 (8%)
Query: 67 LGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR-E 125
L + V+ + S+F +Y M KIYIY +G+ F+Q PR L G YASEG+F + I
Sbjct: 314 LYASVYRNVSMFKRSYELMERVLKIYIYREGEKPIFHQ-PR-LRGIYASEGWFMKLIEGN 371
Query: 126 SRFLTHDPDRAHLFFIPISCHKMRG-----KGTSYENMTVIVKDYVDSLINKYPYWNRTL 180
RF+ DP +AHLF++P S +R ++ ++ K+YV + KY +WNRT
Sbjct: 372 KRFVVRDPRKAHLFYVPFSSKMLRTVFYEQNSSTPRDLEKYFKNYVGLIAGKYRFWNRTG 431
Query: 181 GADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALP-----QVLQPF 235
GADH V CHD R T + N+IR +C+ + F KD LP + P
Sbjct: 432 GADHLIVACHDWAPRITRQCSW---NSIRALCNSNIASGFKIGKDTTLPVTYIRKSEDPL 488
Query: 236 ALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLV----- 290
GG+ R +L F+AG + +R IL + WEN E DI + GP+
Sbjct: 489 KY-LGGKPPSQRPILAFFAGSMHGYLRPILLQYWEN-KEQDI------KIFGPMSRDDGG 540
Query: 291 ---YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVI 347
Y+ +K+CIC G +V++ R+ ++I Y CVPVI+S+ Y PF +IL+W FAV
Sbjct: 541 KSRYRDHMKSSKYCICARGYEVHTPRVVEAIFYECVPVIISDNYVPPFFEILNWEAFAVF 600
Query: 348 LNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+ E+DV L+ IL +I + +++ + + VQ+HF W+ PV++D FHM+++ +W R
Sbjct: 601 ILEKDVPNLRNILLSIPEEKYLQMQMRVKMVQQHFLWHKKPVKYDLFHMILHSVWYNR 658
>gi|224142563|ref|XP_002324625.1| predicted protein [Populus trichocarpa]
gi|222866059|gb|EEF03190.1| predicted protein [Populus trichocarpa]
Length = 336
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 116/337 (34%), Positives = 183/337 (54%), Gaps = 17/337 (5%)
Query: 85 MVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR--ESRFLTHDPDRAHLFFIP 142
M FKI++Y +G+ + P L Y EG F + +S F PD AH+F +P
Sbjct: 1 MEKRFKIWVYKEGERPLVHGGP--LNNIYGVEGQFLDEMEHGKSPFAASHPDEAHMFLLP 58
Query: 143 ISCHKMRGK------GTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRA 196
IS + S + + +V+DYV + +KYPYWNR+ GADHF V+CHD
Sbjct: 59 ISVAYIISYVYKPIVTYSRDELQRLVQDYVGVVADKYPYWNRSKGADHFLVSCHDWAPDI 118
Query: 197 TEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQP---FALPRGGRDVENRTVLGFW 253
+ P L KN IRV+C+ + F P +DV++P++ P P G R++ F+
Sbjct: 119 SGANPDLYKNFIRVLCNANTSERFEPRRDVSIPEINIPNGKLGPPHKGLPPSKRSIFAFF 178
Query: 254 AGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSA 312
AG + IR +L W++ D E+ + + Y ++KFC+CP G +V S
Sbjct: 179 AGGAHGYIRKVLLENWKDKDDEIQVHEYLDKKGTD---YFELMGQSKFCLCPSGYEVASP 235
Query: 313 RIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLH 372
R+ +I GCVPV +S+ Y LPF+D+LDW KF+V + + ++K ILK IS ++ +
Sbjct: 236 RVVTAIQLGCVPVTISDNYTLPFSDVLDWSKFSVHIPSEKIPEIKTILKKISPQRYLMMQ 295
Query: 373 NNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIK 409
+++VQ+HF+ N P +D HM+++ +W+RR +K
Sbjct: 296 MRVIQVQRHFELNRPARPYDLLHMLLHSVWVRRLNVK 332
>gi|356495238|ref|XP_003516486.1| PREDICTED: probable glycosyltransferase At3g07620-like [Glycine
max]
Length = 610
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 117/336 (34%), Positives = 186/336 (55%), Gaps = 13/336 (3%)
Query: 77 VFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR-ESRFLTHDPDR 135
+F +Y M K++IY +G F+Q K+ G YASEG+F + + RF+ DP +
Sbjct: 274 LFFRSYELMERKLKVFIYREGAKPIFHQP--KMRGIYASEGWFMKLMEGNKRFIVKDPRK 331
Query: 136 AHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVR 195
AHLF++P S +R ++ + M ++ YV+ + +Y +WNRT GADHF V CHD R
Sbjct: 332 AHLFYLPFSSQMLRVTLSNPKQMEQHLEKYVELIAGRYRFWNRTDGADHFLVACHDWASR 391
Query: 196 ATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALP-----QVLQPFALPRGGRDVENRTVL 250
T +K IR +C+ + F KD LP V+ P G+ R+ L
Sbjct: 392 ITRQP---MKGCIRSLCNSNVAKGFQIGKDTTLPVTYIHSVMDPLK-ECAGKPPSERSAL 447
Query: 251 GFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQV 309
F+AG + +R IL + W N + ++ I G +Y +K+CIC G +V
Sbjct: 448 AFFAGSMHGYLRPILLKHWANKEPDMKIFGPMPRDLEGKKMYMEYMNSSKYCICARGYEV 507
Query: 310 NSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFV 369
++ RI ++I GCVPVI+S+ Y P ++L W F++ + ERDV L+ IL +I + +++
Sbjct: 508 HTPRIIEAIFSGCVPVIISDNYVPPLFEVLKWEAFSLFVRERDVPSLRDILLSIPEEKYL 567
Query: 370 SLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+LH + KVQ+HF W+ PV++D FHM+++ +W R
Sbjct: 568 ALHLGVKKVQQHFLWHKVPVKYDLFHMILHAIWKNR 603
>gi|449511061|ref|XP_004163851.1| PREDICTED: probable glycosyltransferase At3g07620-like [Cucumis
sativus]
Length = 341
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 116/339 (34%), Positives = 191/339 (56%), Gaps = 15/339 (4%)
Query: 77 VFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESR-FLTHDPDR 135
+F +Y M K+Y+Y +G+ F+Q LTG YASEG+F + + +++ F+ DP++
Sbjct: 1 MFTRSYELMEKMLKVYVYDEGEKPIFHQP--ILTGIYASEGWFMKLLEDNKKFVVKDPEK 58
Query: 136 AHLFFIPISCHKMRGK-GTSYEN---MTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHD 191
AHLF++P S +R G + N + +K+Y+D + KY +WN+ G+DHF V CHD
Sbjct: 59 AHLFYLPFSSQFLRSAFGNKFRNKRDLQKPLKNYIDVIGKKYRFWNKNGGSDHFLVACHD 118
Query: 192 VGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVL----QPFALPRGGRDVENR 247
+ T+ L+KN IR +C+ + F KD +LP + GG+ R
Sbjct: 119 WAPKLTKR---LVKNCIRALCNANGAGDFEIGKDTSLPVTFVHSTEDLITKIGGKPPSER 175
Query: 248 TVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGG 306
T L F+AG + +R IL WEN + ++ I N G Y + +K+CIC G
Sbjct: 176 TTLAFFAGSMHGYLRPILLHYWENKEPDMMIVGPMPNSIEGKNAYMEQMKSSKYCICARG 235
Query: 307 SQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDA 366
QV+S R+ ++I C+PVI+S+ Y P ++L+W F+V + ER++ L+ IL +I +
Sbjct: 236 YQVHSPRVIEAILNECIPVIISDNYVPPLFEVLNWESFSVFVKEREIPYLRDILLSIPEE 295
Query: 367 EFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+ +H+ + VQ+HF W+ P ++DAFHM+++ +W R
Sbjct: 296 NYRVMHSRVKMVQQHFLWHEKPAKYDAFHMILHSIWYTR 334
>gi|9758523|dbj|BAB08970.1| unnamed protein product [Arabidopsis thaliana]
Length = 559
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 128/372 (34%), Positives = 204/372 (54%), Gaps = 37/372 (9%)
Query: 62 QDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQ-TPRKLTGKYASEGYFF 120
D+ L V+ + S F +Y M KIY+Y +G F+ PR G YASEG+F
Sbjct: 199 HDFLGLNPLVYRNISKFLRSYDLMERKLKIYVYKEGGKPIFHTPMPR---GIYASEGWFM 255
Query: 121 QNIRESR-FLTHDPDRAHLFFIPISCHKMRGK-GTSYENMTVI---VKDYVDSLINKYPY 175
+ + ++ F+ DP +AHLF+IPIS +R G ++ + +K+YVD + KY +
Sbjct: 256 KLMESNKKFVVKDPRKAHLFYIPISIKALRSSLGLDFQTPKSLADHLKEYVDLIAGKYKF 315
Query: 176 WNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV-LQP 234
WNRT GADHF V CHD G + T +KN++R +C+ + F D ALP ++
Sbjct: 316 WNRTGGADHFLVACHDWGNKLTTKT---MKNSVRSLCNSNVAQGFRIGTDTALPVTYIRS 372
Query: 235 FALPR---GGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVY 291
P GG+ R +L F+AG + +R IL ++WEN E D+ + GP+
Sbjct: 373 SEAPLEYLGGKTSSERKILAFFAGSMHGYLRPILVKLWEN-KEPDM------KIFGPMPR 425
Query: 292 Q-------RRFYKT------KFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDI 338
R + K+ ++CIC G +V++ R+ ++I CVPVI+++ Y PF ++
Sbjct: 426 DPKSKKQYREYMKSSSSHFNRYCICARGYEVHTPRVVEAIINECVPVIIADNYVPPFFEV 485
Query: 339 LDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVM 398
L+W +FAV + E+D+ L+ IL +I + ++ + + VQ+HF W+ PV+FD FHM++
Sbjct: 486 LNWEEFAVFVEEKDIPNLRNILLSIPEDRYIGMQARVKAVQQHFLWHKKPVKFDQFHMIL 545
Query: 399 YELWLRR-HVIK 409
+ +W R H IK
Sbjct: 546 HSIWYSRVHRIK 557
>gi|297805260|ref|XP_002870514.1| hypothetical protein ARALYDRAFT_915839 [Arabidopsis lyrata subsp.
lyrata]
gi|297316350|gb|EFH46773.1| hypothetical protein ARALYDRAFT_915839 [Arabidopsis lyrata subsp.
lyrata]
Length = 536
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 121/358 (33%), Positives = 199/358 (55%), Gaps = 43/358 (12%)
Query: 62 QDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQ-TPRKLTGKYASEGYFF 120
++ L + V+ + S F +Y M K+Y+Y +G F++ PR G YASEG+F
Sbjct: 201 HEFLGLNASVYRNISKFLRSYDLMERKLKVYVYKEGGKPIFHKPLPR---GIYASEGWFM 257
Query: 121 QNIRESR-FLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRT 179
+ + ++ F+ DP +AHLF+IPI+ +K+YVD + KY +WNRT
Sbjct: 258 KLMESNKKFVVRDPRKAHLFYIPINH----------------LKEYVDLIAGKYKFWNRT 301
Query: 180 LGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV-LQPFALP 238
GADHF V CHD G + T+ +KN++R +C+ + F D ALP ++ P
Sbjct: 302 GGADHFIVACHDWGNKLTKKT---MKNSVRALCNSNVAQGFRIGTDTALPVTYIRSAESP 358
Query: 239 R---GGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPL------ 289
GG+ R +L F+AG + +R IL ++WEN E D+ + +GP+
Sbjct: 359 LEYLGGKTPSKRKILAFFAGSMHGYLRPILVKLWENK-EPDM------KIVGPMPRDPES 411
Query: 290 --VYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVI 347
Y+ +K+CIC G +V++ R+ ++I CVPVI+++ Y PF +IL+W +FAV
Sbjct: 412 KTQYREYMKSSKYCICARGYEVHTPRVVEAIINECVPVIIADNYVPPFFEILNWEEFAVF 471
Query: 348 LNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+ E+++ L+ IL +IS+ ++ + + VQ+HF W+ PV+FD FHM+++ +W R
Sbjct: 472 VEEKEIANLRNILLSISEERYIVMQARVKAVQQHFLWHKKPVKFDLFHMILHSIWHSR 529
>gi|224142567|ref|XP_002324627.1| predicted protein [Populus trichocarpa]
gi|222866061|gb|EEF03192.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 214 bits (546), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 116/339 (34%), Positives = 185/339 (54%), Gaps = 16/339 (4%)
Query: 85 MVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR--ESRFLTHDPDRAHLFFIP 142
M FK+++Y +G+ + P + Y+ EG F I +S F+ PD AH FF+P
Sbjct: 1 MEKRFKVWVYKEGELPVVHGAP--VNDIYSIEGQFLDEIESGKSPFIARHPDEAHAFFLP 58
Query: 143 ISC----HKMRGKGTSY--ENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRA 196
IS H + ++ + + +V DYV + +KY YWNRT GADHF V+CHD
Sbjct: 59 ISVAYIIHYVYKPRITFARDQLQRLVTDYVRVVADKYTYWNRTQGADHFSVSCHDWAPDV 118
Query: 197 TEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQP---FALPRG-GRDVENRTVLGF 252
+ P L + +RV+C+ + F P +DV++P++ P PR + R++L F
Sbjct: 119 SRANPELFRYFVRVLCNANISEGFRPQRDVSIPEIFLPVGKLGPPREYTKPPSKRSILAF 178
Query: 253 WAGHRNSKIRVILARVW-ENDTELDISNNRINR-AIGPLVYQRRFYKTKFCICPGGSQVN 310
+AG + IR +L W E D E+ + R +Y ++KFC+CP G +V
Sbjct: 179 FAGGAHGHIRKVLLTHWKEKDDEVQVHEYLTQRNKKNTNLYFELMGQSKFCLCPSGHEVA 238
Query: 311 SARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVS 370
S R+ +I GCVPV +S Y LPF+D+LDW KF+V + + ++K ILK IS +++
Sbjct: 239 SPRVVTAIQLGCVPVTISANYSLPFSDVLDWSKFSVDIPPEKIPEIKTILKGISSRRYLT 298
Query: 371 LHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIK 409
+ ++++Q+HF N P +D HM+++ +WLRR +K
Sbjct: 299 MQRRVMQIQRHFMLNRPAQPYDMLHMILHSVWLRRLNVK 337
>gi|168062371|ref|XP_001783154.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162665352|gb|EDQ52040.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 351
Score = 214 bits (545), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 115/337 (34%), Positives = 188/337 (55%), Gaps = 27/337 (8%)
Query: 85 MVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRE--SRFLTHDPDRAHLFFIP 142
M FK+Y+Y DG + K G YA+EG F + + + +R+ DP +AH+F +P
Sbjct: 1 MQKTFKVYVYKDGYKPLVHAA--KTGGIYATEGLFLKRMDDPGNRYTVSDPTQAHMFLLP 58
Query: 143 ISCHKM-----RGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRAT 197
S ++ S + + +YV+ + +KYPYWNRT GADHFFV+CHD +T
Sbjct: 59 YSVRQLVDFIQDPYSRSMRPLKTFIANYVERITSKYPYWNRTRGADHFFVSCHDWAPLST 118
Query: 198 EGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGR----DVEN-----RT 248
L N+++VVC+ F KDV++PQ + +GG D++N R
Sbjct: 119 ILHDELHNNSMKVVCNADLTANFDIQKDVSIPQAV------KGGNQSELDIDNLPPGKRD 172
Query: 249 VLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGS 307
L F+AG + +R +L + W D+ + + + Y + ++KFC+CP G
Sbjct: 173 YLAFYAGQMHGLVRPVLIQHWRGKDSSMKVYEVLPPEIAKNISYAQHMKRSKFCLCPKGF 232
Query: 308 QVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAE 367
+VNS RI ++I GCVPVI+++ + LPF+++LDW KF++ + E+D+ LK+IL N+ D
Sbjct: 233 EVNSPRIVEAILSGCVPVIIADNFVLPFSNVLDWSKFSITVEEKDIPNLKRILTNVPDGT 292
Query: 368 FVSLHNNLVKVQKHFQW--NSPPVRFDAFHMVMYELW 402
+ S+ + L +++HF W + ++D+FHM MY +W
Sbjct: 293 YRSMQSCLKYIRRHFVWLEDQEDTQYDSFHMTMYSIW 329
>gi|147775379|emb|CAN73458.1| hypothetical protein VITISV_022180 [Vitis vinifera]
Length = 333
Score = 214 bits (545), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 115/332 (34%), Positives = 184/332 (55%), Gaps = 17/332 (5%)
Query: 85 MVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR--ESRFLTHDPDRAHLFFIP 142
M FKI+ Y +GD + P G YA EG F +I +S FL PD A+ F+IP
Sbjct: 1 MEKRFKIWAYREGDQPLMHDGPS--NGIYAIEGQFMDDIESGKSHFLARRPDEANAFYIP 58
Query: 143 ISCHKMRG---KGTSYENMTV--IVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRAT 197
+S K+ + Y + +V DY++ + +KYPYWNR+ GADHF V+CHD +
Sbjct: 59 MSLTKIVHFIYEPPHYYGKWIPRLVTDYINFVADKYPYWNRSKGADHFLVSCHDWAPDVS 118
Query: 198 EGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQP---FALPRGGRDVENRTVLGFWA 254
P L K+ IR +C+ + F P +D+++P++ P P + R +L F+A
Sbjct: 119 ALKPDLYKHFIRALCNANTSERFHPIRDISIPEINIPXGKLGPPHLDQPPNKRPILAFFA 178
Query: 255 GHRNSKIRVILARVW-ENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSAR 313
G + +R +L + W E D E+ + R G Y + +KFC+CP G +V S R
Sbjct: 179 GGAHGYVRSVLFKYWKEKDDEVQV----FERLPGNRNYSKSMGDSKFCLCPSGYEVASPR 234
Query: 314 IADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHN 373
I ++I GCVP+I+ ++Y LPF+D+LDW KF++ + + ++K+ILK + ++ +
Sbjct: 235 IVEAIAAGCVPMIICDHYSLPFSDVLDWSKFSIYITSDKIPEIKKILKAVPTETYLEMQK 294
Query: 374 NLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+ +VQ+HF N P +D HM+++ +WLRR
Sbjct: 295 RVKQVQRHFAINRPARPYDMLHMILHSVWLRR 326
>gi|443732873|gb|ELU17436.1| hypothetical protein CAPTEDRAFT_228334 [Capitella teleta]
Length = 506
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 131/396 (33%), Positives = 199/396 (50%), Gaps = 31/396 (7%)
Query: 36 CLSFNS--FRSYPLQNNFNSTLSFAINKQDYSDLGSDVFHSPSVFHLNY----LEMVTNF 89
C FNS + + + S++ + + L +V+ S + NY + M
Sbjct: 120 CAGFNSNGWLKKRIGQKYKSSVDLYFKES--APLDEEVYLSSQIIEGNYGKDYIRMTKEL 177
Query: 90 KIYIYPDG-DPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHKM 148
KIY+Y D + Y KY E F ++ S ++T DP A FF+P C
Sbjct: 178 KIYMYTTKIDAHINYVN----DWKYGVEELFIHLLKSSPYITQDPSEATFFFLPFRCFAY 233
Query: 149 RGKGTSYENMTVIVKDYVDSLI----NKYPYWNRTLGADHFFVTCHDVGVRATEGV-PFL 203
R + + ++ V ++ + Y +W+RTLGADHF+V HD G G PFL
Sbjct: 234 RKTISDRDRAQRFTEEMVSKILYEIKSNYSFWDRTLGADHFYVCAHDFGPAIVAGSDPFL 293
Query: 204 IKNAIRVVCSPSYD-VAFIPHKDVALP-------QVLQPFALPRGGRDVENRTVLGFWAG 255
KNAI +V + Y+ + ++PHKD++LP L G + +RTVL F+AG
Sbjct: 294 HKNAIAMVNTADYEHIYYVPHKDISLPPHPSHGKNSLANIGKGGHGLNPSDRTVLAFYAG 353
Query: 256 HRN-SKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARI 314
+ + +IR + W D + I + YQ +KFC+ G++ S +
Sbjct: 354 NLDRGRIRPSIKDFWSTDIDFRIFMGHLTDE----RYQHYLKTSKFCLILRGNEAWSPCL 409
Query: 315 ADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNN 374
D+I +GCVPVI+S+YYDLP + +LDW +FAV++ E V LK+IL +S + S+
Sbjct: 410 MDAIWFGCVPVIISDYYDLPLHGMLDWNQFAVVIRESKVKSLKEILLAVSPQKLTSMQEK 469
Query: 375 LVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIKY 410
L +V HF WN PP +DAF VMY+LW RR V++Y
Sbjct: 470 LKQVYGHFVWNDPPRPYDAFQSVMYQLWKRRGVVRY 505
>gi|242065760|ref|XP_002454169.1| hypothetical protein SORBIDRAFT_04g025885 [Sorghum bicolor]
gi|241934000|gb|EES07145.1| hypothetical protein SORBIDRAFT_04g025885 [Sorghum bicolor]
Length = 338
Score = 211 bits (537), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 123/337 (36%), Positives = 177/337 (52%), Gaps = 22/337 (6%)
Query: 85 MVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESR--FLTHDPDRAHLFFIP 142
M FKI+ Y +G+P + P Y+ EG F + I + R F P AH F +P
Sbjct: 1 MERKFKIWTYKEGEPPLTHLGPS--ADIYSIEGQFLEEIEDPRNPFAARHPGEAHAFLLP 58
Query: 143 ISCHKM--------RGKGTSY-ENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVG 193
+S + R T+Y +M + DYVD + +YPYWNR+ GADH V+CHD
Sbjct: 59 VSVCNLVQYIYPFYRRNTTAYMAHMRRALADYVDVVAGRYPYWNRSRGADHVMVSCHDWA 118
Query: 194 VRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFAL---PRGGRDVENRTVL 250
+E L NAIRV+C+ + +F P KD LP+V L P G ENRT L
Sbjct: 119 PLVSEANGELYANAIRVLCNANTSESFRPRKDATLPEVNLGDGLLRRPTFGMPPENRTTL 178
Query: 251 GFWAGHRNSKIRVILARVW--ENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQ 308
F+AG + IR L W D ++DI + + + Y +FC+CP G +
Sbjct: 179 AFFAGGMHGHIRKALLGYWLGRKDPDMDI-HEYLPKGQD---YHALMASARFCLCPSGFE 234
Query: 309 VNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEF 368
V S R+ +S+ GCVPVI+S+ Y PF+D+LDW K +V + + +LK ILK +S+ +
Sbjct: 235 VASPRVVESVFSGCVPVIISDGYPPPFSDVLDWSKMSVTVPPARIPELKDILKGVSERRY 294
Query: 369 VSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
L +++ Q+HF + P RFD MVM+ +WLRR
Sbjct: 295 RVLRARVLQAQRHFVVHRPSQRFDMIRMVMHSIWLRR 331
>gi|356518348|ref|XP_003527841.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
At5g03795-like [Glycine max]
Length = 549
Score = 211 bits (537), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 123/352 (34%), Positives = 197/352 (55%), Gaps = 30/352 (8%)
Query: 71 VFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLT 130
++ + S+F +Y M K+ IY D D F++ L G YASEG+F + + ++ +T
Sbjct: 204 LYRNASMFRRSYELMENMLKVCIYQDEDRPIFHEP--LLDGIYASEGWFMKLMEANKXVT 261
Query: 131 HDPDRAHLFFIPISCHKMRG-----KGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHF 185
DP +AHLF+IP S ++ + N+ +K+YV + KYP+WNRT GADHF
Sbjct: 262 GDPGKAHLFYIPFSSRLLQQTLYVRNSHRHSNLIEYMKNYVKMIAGKYPFWNRTSGADHF 321
Query: 186 FVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV-LQPFALPRG---G 241
V CHD T G + ++IR +C+ +V F KDV+LP+ ++ P G
Sbjct: 322 VVACHDWAPAETRG---RMLSSIRALCNADIEVGFKIGKDVSLPETYIRSSENPVKNIEG 378
Query: 242 RDVENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVYQR------R 294
R +L F+AG + + IL + WEN + ++ IS GPL + R +
Sbjct: 379 DPPSQRPILAFFAGGLHVYVXPILLKHWENKEPDMKIS--------GPLPHVRGNVNYIQ 430
Query: 295 FYKT-KFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDV 353
F K+ KFCI G +VNS R+ ++I + C+PVI+S+ + PF +IL+W FAV + E ++
Sbjct: 431 FMKSSKFCIHARGHEVNSPRVVEAIFHECIPVIISDNFIPPFFEILNWESFAVFVTEEEI 490
Query: 354 YQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
L+ IL +IS+ ++ +H + KVQ+HF W++ PV+ D HM+++ +W R
Sbjct: 491 PNLRNILLSISEERYLEMHKRVKKVQEHFPWHAEPVKDDLSHMLLHSIWYNR 542
>gi|297808029|ref|XP_002871898.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297317735|gb|EFH48157.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 610
Score = 211 bits (536), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 115/346 (33%), Positives = 194/346 (56%), Gaps = 16/346 (4%)
Query: 71 VFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESR-FL 129
+F + S+F +Y M K+Y+Y +G+ F+ TP L G YASEG+F + + ++ +
Sbjct: 263 IFRNVSMFKRSYELMERILKVYVYKEGNRPIFH-TP-ILKGLYASEGWFMKLMEGNKQYT 320
Query: 130 THDPDRAHLFFIPISCHKMR-----GKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADH 184
DP +AHL+++P S + + N+ +K+Y + + +KYP++NRT GADH
Sbjct: 321 VKDPRKAHLYYMPFSARMLEYTLYVRNSHNRTNLRQFLKEYTEHISSKYPFFNRTDGADH 380
Query: 185 FFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFA----LPRG 240
F V CHD T +++ I+ +C+ F +D++LP+ A G
Sbjct: 381 FLVACHDWAPYETR---HHMEHCIKALCNADVTAGFKIGRDISLPETYVRAAKNPLRDLG 437
Query: 241 GRDVENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVYQRRFYKTK 299
G+ R L F+AG + +R IL + W++ D ++ I + Y + +K
Sbjct: 438 GKPPSQRRTLAFYAGSMHGYLRQILLQHWKDKDPDMKIFGRMPFGVASKMNYIEQMKSSK 497
Query: 300 FCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQI 359
+CICP G +VNS R+ +SI Y CVPVI+S+ + PF ++LDW F+VI+ E+D+ +LK I
Sbjct: 498 YCICPKGYEVNSPRVVESIFYECVPVIISDNFVPPFFEVLDWSAFSVIVAEKDIPRLKDI 557
Query: 360 LKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
L +I + ++V + + K Q+HF W++ P ++D FHMV++ +W R
Sbjct: 558 LSSIPEEKYVKMQMAVRKAQRHFLWHAKPEKYDLFHMVLHSIWYNR 603
>gi|356546124|ref|XP_003541481.1| PREDICTED: probable glycosyltransferase At5g20260-like, partial
[Glycine max]
Length = 290
Score = 210 bits (535), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 113/292 (38%), Positives = 176/292 (60%), Gaps = 15/292 (5%)
Query: 132 DPDRAHLFFIPISCHKM----RGKGTSY--ENMTVIVKDYVDSLINKYPYWNRTLGADHF 185
+PD AH+F +PIS ++ T+Y + + I DY + + ++YPYWNRT GADHF
Sbjct: 1 NPDEAHVFMLPISVAQIVRYVYNPLTTYSRDQLMWITIDYTNIIAHRYPYWNRTRGADHF 60
Query: 186 FVTCHDVG---VRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV-LQPFAL--PR 239
+CHD + E L KN IRV+ + + F P KDV +P+V LQ F L P
Sbjct: 61 LASCHDWAPPDISRAESGKELFKNIIRVLYNANKSEGFKPEKDVPMPEVNLQGFKLSSPI 120
Query: 240 GGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTK 299
G D NR++L F+AG + +IR IL + W++ E + + + + Y ++K
Sbjct: 121 LGLDPNNRSILAFFAGGVHGRIREILLQHWKDKDEEVQVHEYLPKGVD---YHGLMGQSK 177
Query: 300 FCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQI 359
FC+CP G +V S RI +SI+ GCVPVI+S+YY LPF+D+LD KF++ + R + ++K +
Sbjct: 178 FCLCPSGYEVASPRIVESINIGCVPVIVSDYYQLPFSDVLDRSKFSLHIPSRRIAEIKTM 237
Query: 360 LKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIKYF 411
LKN+ A+++ L ++KVQ+HF N P F+ FHM+++ +WLR+ I+ +
Sbjct: 238 LKNVPHAKYLKLQKRVMKVQRHFVLNRPAKSFNVFHMILHSIWLRQLNIRLY 289
>gi|15146187|gb|AAK83577.1| AT5g33290/F19N2_10 [Arabidopsis thaliana]
gi|18252279|gb|AAL62020.1| AT5g33290/F19N2_10 [Arabidopsis thaliana]
Length = 341
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 113/343 (32%), Positives = 189/343 (55%), Gaps = 31/343 (9%)
Query: 85 MVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNI------RESRFLTHDPDRAHL 138
M+ FK++ Y +G+ F+ P + Y EG F + SRF P+ AH+
Sbjct: 1 MMNRFKVWTYTEGEVPLFHDGP--VNDIYGIEGQFMDEMCVDGPKSRSRFRADRPENAHV 58
Query: 139 FFIPISCHKM---------RGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTC 189
FFIP S K+ +G S + +++DYVD + K+PYWNR+ G DHF V+C
Sbjct: 59 FFIPFSVAKVIHFVYKPITSVEGFSRARLHRLIEDYVDVVATKHPYWNRSQGGDHFMVSC 118
Query: 190 HDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRG-------GR 242
HD +G P L + IR +C+ + F P+ DV++P++ LP+G G+
Sbjct: 119 HDWAPDEIDGNPKLFEKFIRGLCNANTSEGFRPNVDVSIPEIY----LPKGKLGPSFLGK 174
Query: 243 DVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCI 302
R++L F+AG + +IR IL + W+ E+D +R Y + +KFC+
Sbjct: 175 SPRVRSILAFFAGRSHGEIRKILFQHWK---EMDNEVQVYDRLPPGKDYTKTMGMSKFCL 231
Query: 303 CPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKN 362
CP G +V S R ++I+ GCVPVI+S+ Y LPF+D+L+W F++ + + ++K IL++
Sbjct: 232 CPSGWEVASPREVEAIYAGCVPVIISDNYSLPFSDVLNWDSFSIQIPVSRIKEIKTILQS 291
Query: 363 ISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+S ++ ++ +++V++HF N P +D HM+++ +WLRR
Sbjct: 292 VSLVRYLKMYKRVLEVKQHFVLNRPAKPYDVMHMMLHSIWLRR 334
>gi|334187775|ref|NP_197468.2| Exostosin family protein [Arabidopsis thaliana]
gi|332005353|gb|AED92736.1| Exostosin family protein [Arabidopsis thaliana]
Length = 610
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 116/351 (33%), Positives = 196/351 (55%), Gaps = 16/351 (4%)
Query: 66 DLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRE 125
+L +F + S+F +Y M K+Y+Y +G+ F+ TP L G YASEG+F + +
Sbjct: 258 ELYPPIFRNVSLFKRSYELMERILKVYVYKEGNRPIFH-TP-ILKGLYASEGWFMKLMEG 315
Query: 126 SR-FLTHDPDRAHLFFIPISCHKMR-----GKGTSYENMTVIVKDYVDSLINKYPYWNRT 179
++ + DP +AHL+++P S + + N+ +K+Y + + +KYP++NRT
Sbjct: 316 NKQYTVKDPRKAHLYYMPFSARMLEYTLYVRNSHNRTNLRQFLKEYTEHISSKYPFFNRT 375
Query: 180 LGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFA--- 236
GADHF V CHD T +++ I+ +C+ F +D++LP+ A
Sbjct: 376 DGADHFLVACHDWAPYETR---HHMEHCIKALCNADVTAGFKIGRDISLPETYVRAAKNP 432
Query: 237 -LPRGGRDVENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVYQRR 294
GG+ R L F+AG + +R IL + W++ D ++ I + Y +
Sbjct: 433 LRDLGGKPPSQRRTLAFYAGSMHGYLRQILLQHWKDKDPDMKIFGRMPFGVASKMNYIEQ 492
Query: 295 FYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVY 354
+K+CICP G +VNS R+ +SI Y CVPVI+S+ + PF ++LDW F+VI+ E+D+
Sbjct: 493 MKSSKYCICPKGYEVNSPRVVESIFYECVPVIISDNFVPPFFEVLDWSAFSVIVAEKDIP 552
Query: 355 QLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+LK IL +I + ++V + + K Q+HF W++ P ++D FHMV++ +W R
Sbjct: 553 RLKDILLSIPEDKYVKMQMAVRKAQRHFLWHAKPEKYDLFHMVLHSIWYNR 603
>gi|224120366|ref|XP_002331030.1| predicted protein [Populus trichocarpa]
gi|222872960|gb|EEF10091.1| predicted protein [Populus trichocarpa]
Length = 339
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 115/340 (33%), Positives = 188/340 (55%), Gaps = 18/340 (5%)
Query: 85 MVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR--ESRFLTHDPDRAHLFFIP 142
M FK++IY +G+ + P + Y+ EG F + +S+F+ P+ A F +P
Sbjct: 1 MEKRFKVWIYKEGELPVVHGGP--VNDIYSIEGQFLDEMESGKSQFIARHPEEAQAFLLP 58
Query: 143 ISCHKM-----RGKGT-SYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRA 196
+S + R + T S + + +V DYV + +K+PYWNRT GADHF V+CHD
Sbjct: 59 VSVAYIIHYVYRPRITFSRDQLQRLVTDYVRVIADKHPYWNRTHGADHFSVSCHDWAPDV 118
Query: 197 TEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQP---FALPRG-GRDVENRTVLGF 252
+ P L K IR +C+ + F P +DV++P++ P P+ + R++L F
Sbjct: 119 SRADPGLFKYFIRALCNANTSEGFQPQRDVSIPEIFLPVGKLGPPQEYAQPPSKRSILAF 178
Query: 253 WAGHRNSKIRVILARVW-ENDTELDISN--NRINRAIGPLVYQRRFYKTKFCICPGGSQV 309
+AG + IR IL W E D E+ + R N+ L ++ ++KFC+CP G +V
Sbjct: 179 FAGGAHGHIRKILLERWKEKDDEIQVHEYLTRKNKKNNNLYFEL-MGQSKFCLCPSGHEV 237
Query: 310 NSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFV 369
S R+ +I GCVPV +S+ Y LPF+D+LDW KF+V + + +K ILK IS ++
Sbjct: 238 ASPRVVTAIQLGCVPVTISDNYSLPFSDVLDWSKFSVDIPSEKIPDIKIILKGISVRRYL 297
Query: 370 SLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIK 409
++ ++++++HF N P +D HM+++ +WLRR +K
Sbjct: 298 TMQRRVMQIRRHFTLNRPAQPYDMLHMILHSVWLRRLDVK 337
>gi|297818734|ref|XP_002877250.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323088|gb|EFH53509.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 422
Score = 209 bits (531), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 115/354 (32%), Positives = 192/354 (54%), Gaps = 25/354 (7%)
Query: 71 VFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR------ 124
++ + FH +++EM+ FK++ Y +G+ + P + Y EG F +
Sbjct: 68 IYRNSFAFHQSHIEMMKRFKVWSYREGEQPLVHDGP--VNDIYGIEGQFIDELSNVIGGP 125
Query: 125 ESRFLTHDPDRAHLFFIPISC----HKMRGKGTSYEN-----MTVIVKDYVDSLINKYPY 175
RF P+ AH FF+P S H + TS + + I DYVD + K+P+
Sbjct: 126 SGRFRASRPEEAHAFFLPFSVANIVHYVYQPITSPADFNRARLHRIFNDYVDVVARKHPF 185
Query: 176 WNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQP- 234
WN++ GADHF V+CHD + P K+ +R +C+ + F P+ D+++P++ P
Sbjct: 186 WNQSNGADHFMVSCHDWAPDVADSKPEFFKDFMRGLCNANTTEGFRPNIDISIPEINIPK 245
Query: 235 --FALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVY 291
P G+ ENRT+L F+AG + IR +L W+ D ++ + ++ Y
Sbjct: 246 RKLKPPFMGQTPENRTILAFFAGRAHGYIREVLFTHWKGKDKDVQVYDHLTKGQ----NY 301
Query: 292 QRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNER 351
+KFC+CP G +V S R ++I+ GCVPV++S+ Y LPFND+LDW KF+V +
Sbjct: 302 HELIGHSKFCLCPSGYEVASPREVEAIYSGCVPVVISDNYSLPFNDVLDWSKFSVEIPVD 361
Query: 352 DVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+ +K+IL+ I +++ ++ N++KV++HF N P FD HM+++ +WLRR
Sbjct: 362 KIPDIKKILQEIPHDKYIRMYQNVMKVRRHFVVNRPAQPFDVIHMILHSVWLRR 415
>gi|224087070|ref|XP_002308053.1| predicted protein [Populus trichocarpa]
gi|222854029|gb|EEE91576.1| predicted protein [Populus trichocarpa]
Length = 339
Score = 208 bits (529), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 115/343 (33%), Positives = 187/343 (54%), Gaps = 24/343 (6%)
Query: 85 MVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR--ESRFLTHDPDRAHLFFIP 142
M FK++IY +G+ + P + Y+ EG F + +S+F+ P+ A F +P
Sbjct: 1 MEKRFKVWIYKEGELPVVHGGP--VNDIYSIEGQFLDEMESGKSQFIARHPEEAQAFLLP 58
Query: 143 ISCHKM-----RGKGT-SYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRA 196
+S + R + T S + + +V DYV + +K+PYWNRT GADHF V+CHD
Sbjct: 59 VSVAYIIHYVYRPRITFSRDQLQRLVTDYVRVIADKHPYWNRTHGADHFSVSCHDWAPDV 118
Query: 197 TEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQP---FALPRG-GRDVENRTVLGF 252
+ P L K IR +C+ + F P +DV++P++ P P+ + R++L F
Sbjct: 119 SRADPGLFKYFIRALCNANTSEGFQPQRDVSIPEIFLPVGKLGPPQEYAQPPSKRSILAF 178
Query: 253 WAGHRNSKIRVILARVW-ENDTELDI-----SNNRINRAIGPLVYQRRFYKTKFCICPGG 306
+AG + IR IL W E D E+ + N+ N + Y ++KFC+CP G
Sbjct: 179 FAGGAHGHIRKILLERWKEKDDEIQVHEYLTQKNKKNNNL----YFELMGQSKFCLCPSG 234
Query: 307 SQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDA 366
+V S R+ +I GCVPV +S+ Y LPF+D+LDW KF+V + + +K ILK IS
Sbjct: 235 HEVASPRVVTAIQLGCVPVTISDNYSLPFSDVLDWSKFSVDIPSEKIPDIKIILKGISVR 294
Query: 367 EFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIK 409
++++ ++++++HF N P +D HM+++ +WLRR +K
Sbjct: 295 RYLTMQRRVMQIRRHFTLNRPAQPYDMLHMILHSVWLRRLDVK 337
>gi|255582307|ref|XP_002531944.1| catalytic, putative [Ricinus communis]
gi|223528390|gb|EEF30426.1| catalytic, putative [Ricinus communis]
Length = 363
Score = 207 bits (527), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 121/367 (32%), Positives = 196/367 (53%), Gaps = 56/367 (15%)
Query: 86 VTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR--ESRFLTHDPDRAHLFFIPI 143
+ KI++Y +G+ + P K YA EG F + ES F+ D AH+FF+P+
Sbjct: 1 MKRLKIWVYMEGERPMVHSGPMK--NIYAIEGQFIDEMESGESPFIARHADEAHVFFLPV 58
Query: 144 SCHKMRG----KGTSYE--NMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVG---- 193
S + TSY+ + + KDYV + +KYP+WNR+ G+DHF ++CHD
Sbjct: 59 SVAHIVEYIYLPITSYDRDKLIRVFKDYVKVVADKYPFWNRSSGSDHFMLSCHDWAMVHI 118
Query: 194 -------VRA---------------------TEGVPFLIKNAIRVVCSPSYDVAFIPHKD 225
VRA ++ P L KN IRV+C+ + F+P +D
Sbjct: 119 NSIPVELVRAFALGRNLKFEISFHSHRAPEISQHDPELYKNLIRVICNANTSEGFLPTRD 178
Query: 226 VALPQVLQPFALPRGGRD-------VENRTVLGFWAGHRNSKIRVILARVWENDTELDIS 278
V LP++ +P GG D R +L F+AG + IR IL W+N +
Sbjct: 179 VTLPEL----NIPPGGFDHVHHCLPSHKRRILAFFAGGAHGYIRKILLHHWKNKDDEVQV 234
Query: 279 NNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDI 338
+ +++ Y++ ++KFC+CP G +V S RI +SI+ GC+PVI+S++Y+LPF+D+
Sbjct: 235 HEYLSKDED---YRKLMGQSKFCLCPSGYEVASPRIVESIYAGCIPVIISDHYNLPFSDV 291
Query: 339 LDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVM 398
LDW + +V + + ++K ILK +S+ +++ + + +VQ+HF+ N P FD HMV+
Sbjct: 292 LDWSQISVQIPVEKIPEIKTILKGVSNDKYLRMQKRVRRVQRHFEINRPSKPFDVLHMVL 351
Query: 399 YELWLRR 405
+ +WLRR
Sbjct: 352 HSVWLRR 358
>gi|145339080|ref|NP_189804.4| putative glycosyltransferase [Arabidopsis thaliana]
gi|292630777|sp|Q3EAR7.2|GLYT2_ARATH RecName: Full=Probable glycosyltransferase At3g42180
gi|332644208|gb|AEE77729.1| putative glycosyltransferase [Arabidopsis thaliana]
Length = 470
Score = 207 bits (526), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 115/348 (33%), Positives = 189/348 (54%), Gaps = 25/348 (7%)
Query: 77 VFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR------ESRFLT 130
FH +++EM+ FK++ Y +G+ + P + Y EG F + RF
Sbjct: 122 AFHQSHIEMMKTFKVWSYKEGEQPLVHDGP--VNDIYGIEGQFIDELSYVMGGPSGRFRA 179
Query: 131 HDPDRAHLFFIPISC----HKMRGKGTSYEN-----MTVIVKDYVDSLINKYPYWNRTLG 181
P+ AH FF+P S H + TS + + I DYVD + +K+P+WN++ G
Sbjct: 180 SRPEEAHAFFLPFSVANIVHYVYQPITSPADFNRARLHRIFNDYVDVVAHKHPFWNQSNG 239
Query: 182 ADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQP---FALP 238
ADHF V+CHD + P KN +R +C+ + F + D ++P++ P P
Sbjct: 240 ADHFMVSCHDWAPDVPDSKPEFFKNFMRGLCNANTSEGFRRNIDFSIPEINIPKRKLKPP 299
Query: 239 RGGRDVENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVYQRRFYK 297
G++ ENRT+L F+AG + IR +L W+ D ++ + ++ Y
Sbjct: 300 FMGQNPENRTILAFFAGRAHGYIREVLFSHWKGKDKDVQVYDHLTKGQ----NYHELIGH 355
Query: 298 TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLK 357
+KFC+CP G +V S R ++I+ GCVPV++S+ Y LPFND+LDW KF+V + + +K
Sbjct: 356 SKFCLCPSGYEVASPREVEAIYSGCVPVVISDNYSLPFNDVLDWSKFSVEIPVDKIPDIK 415
Query: 358 QILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+IL+ I +++ ++ N++KV++HF N P FD HM+++ +WLRR
Sbjct: 416 KILQEIPHDKYLRMYRNVMKVRRHFVVNRPAQPFDVIHMILHSVWLRR 463
>gi|2245029|emb|CAB10449.1| limonene cyclase like protein [Arabidopsis thaliana]
gi|7268424|emb|CAB78716.1| limonene cyclase like protein [Arabidopsis thaliana]
Length = 1024
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 115/352 (32%), Positives = 197/352 (55%), Gaps = 23/352 (6%)
Query: 65 SDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR 124
+DL + +F + SVF +Y M K+YIYPDGD F++ L G YASEG+F + +
Sbjct: 664 TDLFAPLFRNLSVFKRSYELMELILKVYIYPDGDKPIFHEP--HLNGIYASEGWFMKLME 721
Query: 125 ESR-FLTHDPDRAHLFFIPISCHKMRGKGTSY------ENMTVIVKDYVDSLINKYPYWN 177
++ F+T +P+RAHLF++P S +++ K TS + T ++ + +
Sbjct: 722 SNKQFVTKNPERAHLFYMPYSVKQLQKKTTSTCSPSNTPSGTALMGQIISLSLA------ 775
Query: 178 RTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSY-DVAFIPHKDVALPQVL---- 232
T+G F G P L +NAI+ +C+ D F+P KDV+LP+
Sbjct: 776 -TIGYRKCFYVKDQWGPYTVNEHPELKRNAIKALCNADLSDGIFVPGKDVSLPETSIRNA 834
Query: 233 -QPFALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLV 290
+P G V R +L F+AG+ + ++R L + W N D ++ I + +
Sbjct: 835 GRPLRNIGNGNRVSQRPILAFFAGNLHGRVRPKLLKHWRNKDEDMKIYGPLPHNVARKMT 894
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + +K+C+CP G +VNS RI ++I+Y CVPV++++ + LPF+D+LDW F+V++ E
Sbjct: 895 YVQHMKSSKYCLCPMGYEVNSPRIVEAIYYECVPVVIADNFMLPFSDVLDWSAFSVVVPE 954
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELW 402
+++ +LK+IL I ++ + +N+ VQ+HF W+ P ++D FHM+++ +W
Sbjct: 955 KEIPRLKEILLEIPMRRYLKMQSNVKMVQRHFLWSPKPRKYDVFHMILHSIW 1006
>gi|109946595|gb|ABG48476.1| At3g42180 [Arabidopsis thaliana]
Length = 425
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 115/356 (32%), Positives = 192/356 (53%), Gaps = 25/356 (7%)
Query: 69 SDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR---- 124
++ + FH +++EM+ FK++ Y +G+ + P + Y EG F +
Sbjct: 69 GQIYRNSFAFHQSHIEMMKTFKVWSYKEGEQPLVHDGP--VNDIYGIEGQFIDELSYVMG 126
Query: 125 --ESRFLTHDPDRAHLFFIPISC----HKMRGKGTSYEN-----MTVIVKDYVDSLINKY 173
RF P+ AH FF+P S H + TS + + I DYVD + +K+
Sbjct: 127 GPSGRFRASRPEEAHAFFLPFSVANIVHYVYQPITSPADFNRARLHRIFNDYVDVVAHKH 186
Query: 174 PYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQ 233
P+WN++ GADHF V+CHD + P KN +R +C+ + F + D ++P++
Sbjct: 187 PFWNQSNGADHFMVSCHDWAPDVPDSKPEFFKNFMRGLCNANTSEGFRRNIDFSIPEINI 246
Query: 234 P---FALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPL 289
P P G++ ENRT+L F+AG + IR +L W+ D ++ + ++
Sbjct: 247 PKRKLKPPFMGQNPENRTILAFFAGRAHGYIREVLFSHWKGKDKDVQVYDHLTKGQ---- 302
Query: 290 VYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
Y +KFC+CP G +V S R ++I+ GCVPV++S+ Y LPFND+LDW KF+V +
Sbjct: 303 NYHELIGHSKFCLCPSGYEVASPREVEAIYSGCVPVVISDNYSLPFNDVLDWSKFSVEIP 362
Query: 350 ERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+ +K+IL+ I +++ ++ N++KV++HF N P FD HM+++ +WLRR
Sbjct: 363 VDKIPDIKKILQEIPHDKYLRMYRNVMKVRRHFVVNRPAQPFDVIHMILHSVWLRR 418
>gi|75214755|gb|ABA18105.1| exostosin family protein [Capsella rubella]
Length = 423
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 116/358 (32%), Positives = 191/358 (53%), Gaps = 25/358 (6%)
Query: 71 VFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR------ 124
++ + FH +++EM+ FK++ Y +G+ + P + Y EG F +
Sbjct: 69 IYRNSFAFHQSHIEMMKRFKVWSYKEGEQPLVHDGP--VNDIYGIEGQFIDELGNMIGGP 126
Query: 125 ESRFLTHDPDRAHLFFIPISC-------HKMRGKGTSYENMTV--IVKDYVDSLINKYPY 175
SRF P+ AH+FF+P S +K + + I DYVD + KYP+
Sbjct: 127 SSRFRAVRPEEAHVFFLPFSVANIVHYVYKPIASPADFNRARLHRIFNDYVDVVARKYPF 186
Query: 176 WNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQP- 234
W ++ GADHF V+CHD + P K+ +R +C+ + F P D ++P++ P
Sbjct: 187 WKQSNGADHFMVSCHDWAPDVPDSKPEFFKDFMRGLCNANTSEGFKPSIDFSIPEINIPK 246
Query: 235 --FALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVY 291
P G++ ENRT+L F+AG + IR +L W+ D ++ + ++ Y
Sbjct: 247 GKLKPPFMGQNPENRTILAFFAGRAHGYIREVLFTHWKGKDKDVQVYDHLTKGQ----NY 302
Query: 292 QRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNER 351
+KFC+CP G +V S R ++I+ GCVPV++S+ Y LPF+D+LDW KF+V +
Sbjct: 303 HELTGHSKFCLCPSGYEVASPREVEAIYSGCVPVVISDNYSLPFSDVLDWSKFSVEIPVD 362
Query: 352 DVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIK 409
+ +K IL+ I +++ ++ N++KV+KHF N P FD HM+++ +WLRR IK
Sbjct: 363 RIPDIKNILQEIPHDKYIRMYQNVLKVRKHFVVNRPAQPFDVIHMILHSVWLRRLNIK 420
>gi|297808545|ref|XP_002872156.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297317993|gb|EFH48415.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 334
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 116/334 (34%), Positives = 181/334 (54%), Gaps = 19/334 (5%)
Query: 85 MVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESR--FLTHDPDRAHLFFIP 142
M FK+Y+Y +G+P + P K YA EG F I + R F T+D ++A+++F+P
Sbjct: 1 MEKRFKVYVYEEGEPPLVHDGPCK--SVYAVEGRFITEIEKMRTKFRTYDANQAYVYFLP 58
Query: 143 ISC----HKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATE 198
S + + + + V DY+ + +P+WNRT GADHF + CHD G ++
Sbjct: 59 FSVTWLVRYLYEGNSDAKPLRTFVSDYIRLVSTNHPFWNRTNGADHFMLACHDWGPLTSQ 118
Query: 199 GVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV------LQPFALPRGGRDVENRTVLGF 252
L +IRV+C+ + F P KDV LP++ + P R LGF
Sbjct: 119 ADNDLFNTSIRVMCNANSSEGFNPSKDVTLPEIKLYGGEVDPKLRLSKTLSASPRPYLGF 178
Query: 253 WAGHRNSKIRVILARVW-ENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNS 311
+AG + +R IL W + D ++ + L Y +KFC CP G +V S
Sbjct: 179 FAGGVHGPVRPILLNHWKQRDPDMPVYEYLPKH----LNYYDFMRSSKFCFCPSGYEVAS 234
Query: 312 ARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSL 371
R+ ++I+ C+PVILS + LPF D+L W F+V+++ ++ +LK+IL +ISD ++ L
Sbjct: 235 PRVIEAIYSECIPVILSVNFVLPFTDVLRWETFSVLVDVSEIPRLKEILMSISDEKYEWL 294
Query: 372 HNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
NL V++HF+ N PP RFDAFH+ ++ +WLRR
Sbjct: 295 KRNLRYVRRHFELNDPPKRFDAFHLTLHSIWLRR 328
>gi|47497631|dbj|BAD19700.1| exostosin-like protein [Oryza sativa Japonica Group]
Length = 345
Score = 204 bits (520), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 122/341 (35%), Positives = 175/341 (51%), Gaps = 23/341 (6%)
Query: 85 MVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRE--SRFLTHDPDRAHLFFIP 142
M FKI+ Y +G+P + P T Y+ EG F + + SRF PD AH F +P
Sbjct: 1 MERRFKIWTYREGEPPVAHIGPG--TDIYSIEGQFMYEMDDPRSRFAARRPDDAHAFLLP 58
Query: 143 ISC-------HKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVR 195
IS +++ G + +V DYV + +YPYWNR+ GADH V+CHD
Sbjct: 59 ISVCNLVHYVYRLNATG-DLAPLRGLVADYVRVVAERYPYWNRSRGADHVIVSCHDWAPM 117
Query: 196 ATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV-LQPFAL--PRGGRDVENRTVLGF 252
T L NAIRV+C+ + F P KD LP+V L L P G ENRT L F
Sbjct: 118 VTSAHRQLYGNAIRVLCNANTSEGFRPRKDATLPEVNLADGVLRRPTAGLPPENRTTLAF 177
Query: 253 WAGHRNSKIRVILARVW-------ENDTELDISNNRINRAI-GPLVYQRRFYKTKFCICP 304
+AG R+ IR L R W + R++ + Y + +FC+CP
Sbjct: 178 FAGGRHGHIRESLLRHWLIGNKGGAAADGDGDGDMRVHEYLPAGEDYHAQMAAARFCLCP 237
Query: 305 GGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNIS 364
G +V S R+ +S+ GCVPVI+S Y PF D+LDW K +V + + +L+ IL+ +S
Sbjct: 238 SGFEVASPRVVESVFAGCVPVIISEGYPPPFGDVLDWGKMSVAVPAARIPELRAILRRVS 297
Query: 365 DAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+ + L +++ Q+HF + P RFD HMV++ +WLRR
Sbjct: 298 ERRYRVLRARVLQAQRHFVLHRPARRFDMIHMVLHSIWLRR 338
>gi|302814814|ref|XP_002989090.1| hypothetical protein SELMODRAFT_447545 [Selaginella moellendorffii]
gi|300143191|gb|EFJ09884.1| hypothetical protein SELMODRAFT_447545 [Selaginella moellendorffii]
Length = 1522
Score = 200 bits (508), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 118/367 (32%), Positives = 183/367 (49%), Gaps = 33/367 (8%)
Query: 54 TLSFAINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKY 113
+L+F ++ + SDL ++HSP+ +Y E + FK+Y+YP + P GK
Sbjct: 21 SLAFLLDFEGASDLDRSLYHSPAFLARDYQEFLDRFKVYVYPMIQNAS---APDLRDGKA 77
Query: 114 ASEG----YFFQNIRESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSL 169
A G F ++ S F+T DP+ A LF +P S + K + + +K Y+ L
Sbjct: 78 ARPGSIDRVFVDSLLASGFVTDDPEAADLFLLPASISAIWKKRPDPKGIAHSLKSYIQQL 137
Query: 170 INKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSP---SYDVAFIPHKDV 226
+ YPYW R+LGADHFFV+CHD+ + V L KNAI++ C P F+ HKD+
Sbjct: 138 RDLYPYWQRSLGADHFFVSCHDITSDWSRNVLELKKNAIQIACFPLARHGAQEFLAHKDI 197
Query: 227 ALPQVLQPFALPRGGR---DVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRIN 283
+P P GG R L + + W++D +++
Sbjct: 198 TMP--------PAGGSIDPPQRRRWNLAVYDSSSQGYAASDVPASWKSDESFVAGAVKMD 249
Query: 284 RAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILS--NYYDLPFNDILDW 341
+ T+FC+ G S + + ++ GC+PVI S DLPF DILDW
Sbjct: 250 LQL--------LVTTRFCLSLGSS--DRHLVIPAVRSGCIPVIFSAGKLSDLPFQDILDW 299
Query: 342 RKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYEL 401
FA++L+ ++Q K IL++I + + L N + KH +W+SPP DAF+MV+Y+L
Sbjct: 300 NSFAIVLSRDQLHQTKAILESIDEEKLSRLQENGARAAKHMEWHSPPQPEDAFYMVLYQL 359
Query: 402 WLRRHVI 408
W RRH++
Sbjct: 360 WRRRHIL 366
>gi|52075645|dbj|BAD44815.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
gi|52075653|dbj|BAD44823.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
Length = 514
Score = 198 bits (504), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 121/391 (30%), Positives = 194/391 (49%), Gaps = 60/391 (15%)
Query: 71 VFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRE----- 125
V+ +P+ F+ +Y+EM FK+Y+Y +G+P ++ P K YA EG F + +
Sbjct: 120 VYRNPAAFYRSYVEMERRFKVYVYEEGEPPIAHEGPCK--NIYAVEGRFIEELELMAPPL 177
Query: 126 SRFLTHDPDRAHLFFIPISCHKM-----RGKGTSYENMTVIVKDYVDSLINKYPYWNRTL 180
T DP RAH F+P+S +M R + IV DYV + +++ +WNR+
Sbjct: 178 GGVRTWDPARAHALFLPLSVSQMVQLAYRPLSYDLSPLRAIVADYVAVVASRHRFWNRSA 237
Query: 181 GADHFFVTCHD------------------------VGVRATEGVPFLIKNAIRVVCSPSY 216
GADHF ++CHD +G A+ G P L NAIR +C+ +
Sbjct: 238 GADHFMLSCHDWAIHTPSVQRDSISGFPTFRVQRLIGPHASRGHPELYANAIRALCNANT 297
Query: 217 DVAFIPHKDVALPQV----LQPFALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWEN- 271
F P KDV++P++ R L F+AG R+ +R +L R W+
Sbjct: 298 SEGFRPDKDVSIPEINLYDGDMPPELLSPAPPPPRPFLAFFAGGRHGHVRDLLLRHWKGR 357
Query: 272 -------------DTELDISNNRINRAIG----PLVYQRRFYKTKFCICPGGSQVNSARI 314
+ +S + A G P + R +++FC+CP G +V S R+
Sbjct: 358 DPAVFPVYEYDLPSIPVSVSGDGDTDAGGEGGNPYYWYMR--RSRFCLCPSGHEVASPRV 415
Query: 315 ADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNN 374
++IH GCVPV++++ Y PF D+L W F+V + DV +L+++L+ I E L +
Sbjct: 416 VEAIHAGCVPVVVADGYAPPFADVLRWEAFSVAVAVADVPRLRELLERIPAPEVERLRDG 475
Query: 375 LVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+ V++HF + PP R D FHM+++ +WLRR
Sbjct: 476 VRLVKRHFMLHQPPERLDMFHMILHSVWLRR 506
>gi|357436207|ref|XP_003588379.1| Exostosin-2 [Medicago truncatula]
gi|355477427|gb|AES58630.1| Exostosin-2 [Medicago truncatula]
Length = 551
Score = 198 bits (504), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 117/351 (33%), Positives = 183/351 (52%), Gaps = 47/351 (13%)
Query: 66 DLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRE 125
+L ++H+ S+F +Y M K+Y+Y +G +P LTG YASEG+F + +
Sbjct: 231 NLYGPIYHNVSMFKRSYELMEERLKVYVYREG-ARPILHSPF-LTGIYASEGWFMKLMEA 288
Query: 126 S-RFLTHDPDRAHLFFIPISCHKMR-----GKGTSYENMTVIVKDYVDSLINKYPYWNRT 179
+ RF+T +P +AHLF++P S + S++N+ + DYVD + ++ +WNRT
Sbjct: 289 NKRFVTKNPKKAHLFYLPFSSRMLEEALYVKNSHSHKNLIQYLHDYVDMIAARHSFWNRT 348
Query: 180 LGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV-LQPFALP 238
GADHF V CHD T+ + N IR +C+ F+ KD +LP+ ++ +P
Sbjct: 349 GGADHFLVGCHDWAPSETK---LRLANCIRSLCNADVKEGFVFGKDASLPETYVRNAQIP 405
Query: 239 R---GGRDVENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVYQRR 294
GG +T L F+AG + +R IL + WEN D ++ I ++ + G Y
Sbjct: 406 TRDLGGNSFSKKTTLAFFAGSMHGYVRPILLKHWENKDPDMKIFG-KLPNSKGNSNYIHY 464
Query: 295 FYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVY 354
+K+CIC G +VNS R+ ++I Y CVPVI+S+ + PF ++LDW F+VI
Sbjct: 465 MKSSKYCICAKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEVLDWESFSVI------- 517
Query: 355 QLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
VQKHF WN PV++D FHM+++ +W R
Sbjct: 518 -----------------------VQKHFLWNKNPVKYDIFHMILHSIWYNR 545
>gi|224119530|ref|XP_002318097.1| predicted protein [Populus trichocarpa]
gi|222858770|gb|EEE96317.1| predicted protein [Populus trichocarpa]
Length = 243
Score = 198 bits (503), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 98/190 (51%), Positives = 132/190 (69%), Gaps = 5/190 (2%)
Query: 62 QDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQ-TPRKLTGKYASEGYFF 120
QD +D D++HS + F LN+ M +FK+++YP G+P T Y T L +ASE YFF
Sbjct: 35 QDSTD---DIYHSRAFFLLNHEAMEKDFKVFVYPGGNPGTCYHSTNNTLKSNHASEHYFF 91
Query: 121 QNIRESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTL 180
N+R+S FLT +P AHLFFI ISC + + ++K YV LI+ YPYWNRTL
Sbjct: 92 MNLRDSPFLTKNPQEAHLFFIFISCLPLSDEEPLPGYRERVIKRYVKGLISTYPYWNRTL 151
Query: 181 GADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRG 240
GADHFFV+CH++G AT+ +PFL+KNAIR+VCSPSYD ++IP KDVALPQ+L+ +LP
Sbjct: 152 GADHFFVSCHNIGSTATKEIPFLLKNAIRLVCSPSYDSSYIPQKDVALPQILE-LSLPPD 210
Query: 241 GRDVENRTVL 250
G D+ NR ++
Sbjct: 211 GDDMWNRNLM 220
>gi|302804107|ref|XP_002983806.1| hypothetical protein SELMODRAFT_445668 [Selaginella moellendorffii]
gi|300148643|gb|EFJ15302.1| hypothetical protein SELMODRAFT_445668 [Selaginella moellendorffii]
Length = 1068
Score = 197 bits (501), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 118/367 (32%), Positives = 182/367 (49%), Gaps = 33/367 (8%)
Query: 54 TLSFAINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKY 113
+L+F ++ + SDL ++HSP+ +Y E + FK+Y+YP + P GK
Sbjct: 21 SLAFLLDFEGASDLDRSLYHSPAFLARDYQEFLDRFKVYVYPMIQNAS---APDLRDGKA 77
Query: 114 ASEG----YFFQNIRESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSL 169
A G F ++ S F+T DP+ A LF +P S + K + + +K Y+ L
Sbjct: 78 ARPGSIDRVFVDSLLASGFVTDDPEAADLFLLPASISAIWKKRPDPKGIAHSLKSYIQQL 137
Query: 170 INKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSP---SYDVAFIPHKDV 226
+ YPYW R+LGADHFFV+CHD+ + V L KNAI++ C P F+ HKD+
Sbjct: 138 RDLYPYWQRSLGADHFFVSCHDITSDWSRNVLELKKNAIQIACFPLARHGAQEFLAHKDI 197
Query: 227 ALPQVLQPFALPRGGR---DVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRIN 283
+P P GG R L + + W++D ++
Sbjct: 198 TMP--------PAGGSIDPPQRRRWNLAVYDSSSQGYAARDVPASWKSDESFVAGAVALD 249
Query: 284 RAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILS--NYYDLPFNDILDW 341
+ T+FC+ G S + + ++ GC+PVI S DLPF DILDW
Sbjct: 250 LQL--------LVTTRFCLSLGSS--DRHLVIPAVRSGCIPVIFSAGKLSDLPFQDILDW 299
Query: 342 RKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYEL 401
FA++L+ ++Q K IL++I + + L N + KH +W+SPP DAF+MV+Y+L
Sbjct: 300 NSFAIVLSRDQLHQTKGILESIDEEKRSRLQENGARAAKHMEWHSPPQPEDAFYMVLYQL 359
Query: 402 WLRRHVI 408
W RRH++
Sbjct: 360 WRRRHIL 366
>gi|75214619|gb|ABA18091.1| exostosin family protein [Olimarabidopsis pumila]
Length = 343
Score = 197 bits (501), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 113/344 (32%), Positives = 181/344 (52%), Gaps = 25/344 (7%)
Query: 85 MVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR------ESRFLTHDPDRAHL 138
M+ FK++ Y +G+ + P + Y EG F + RF P+ AH
Sbjct: 1 MMKRFKVWSYKEGEQPLVHDGP--VNDIYGIEGQFIDELGNVMGGPSGRFRAGRPEEAHA 58
Query: 139 FFIPISC----HKMRGKGTSYEN-----MTVIVKDYVDSLINKYPYWNRTLGADHFFVTC 189
FF+P S H + S + + I DYV+ + K+P+WN++ GADHF V+C
Sbjct: 59 FFLPFSVANIVHYVSQPIASPADFNRARLHRIFNDYVEVVARKHPFWNQSNGADHFMVSC 118
Query: 190 HDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQP---FALPRGGRDVEN 246
HD P K+ IR +C+ + F P D ++P++ P P G+ EN
Sbjct: 119 HDWAPDVAGSKPEFFKDFIRGLCNANTSEGFRPSIDFSIPEINIPKGKLKPPFMGQTPEN 178
Query: 247 RTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVYQRRFYKTKFCICPG 305
RT+L F+AG + IR +L W+ D ++ + ++ Y +KFC+CP
Sbjct: 179 RTILAFFAGRAHGYIREVLFTHWKGKDKDVQVYDHLTKGQ----NYHELIGHSKFCLCPS 234
Query: 306 GSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISD 365
G +V S R ++I+ GCVPV++S+ Y LPFND+LDW KF+V + + +K+IL+ I
Sbjct: 235 GYEVASPREVEAIYSGCVPVVISDNYSLPFNDVLDWSKFSVEIPVDKIPDIKKILQEIPH 294
Query: 366 AEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIK 409
+++ +++N++KV +HF N P FD HM+++ +WLRR IK
Sbjct: 295 EKYIKMYHNVMKVGRHFVVNRPAQPFDVIHMILHSVWLRRLNIK 338
>gi|168000699|ref|XP_001753053.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695752|gb|EDQ82094.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 471
Score = 197 bits (500), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 123/360 (34%), Positives = 188/360 (52%), Gaps = 34/360 (9%)
Query: 71 VFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQ----TPRKLTGKYASEG-YFFQNIRE 125
++HSP VF L+Y EM +I++YP +T Y+ +T + +S FF+ +
Sbjct: 114 LYHSPKVFTLSYEEMREQLQIWVYPTQAGSTKYEHNYDGDEDVTEEISSTADLFFRLLTR 173
Query: 126 SRFLTHDPDRAHLFFIPISCHKM---RGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGA 182
S F+T RA LF +P S + G E + + Y++ + YPYW +LGA
Sbjct: 174 SEFVTEKAKRAQLFLLPFSIDVLWVDLGPTQVAEKL----RRYLEKVRTNYPYWESSLGA 229
Query: 183 DHFFVTCHDVGVRAT-EGVPFLIKNAIRVVCSP-SYDVAFIPHKDVALPQVLQPFALPRG 240
DHF+++CH + + L KN+I+ C+P ++ F PHKDV PQ P G
Sbjct: 230 DHFYLSCHAFEHNSKHRNILELGKNSIQAACAPLRHNQKFYPHKDVVFPQYK-----PVG 284
Query: 241 GRDVE-------NRTVLGFWAGHRNSKIRVILA-RVWENDTELDISNNRINRAIGPLVYQ 292
DV NRT L +++G + ++ A WE D + + N + VY
Sbjct: 285 EEDVRQAILGRRNRTSLAYFSGCPDVTTPLLSAFHTWETDPDFIVEANPSPHRLS--VY- 341
Query: 293 RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILS--NYYDLPFNDILDWRKFAVILNE 350
R +++FC+ S + D++ +GCVPV+LS ++DLPF L+W +FAV+L
Sbjct: 342 RNLARSRFCVSVLPHDTFS--LVDALRFGCVPVLLSKLTFHDLPFQGFLNWGQFAVVLGI 399
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIKY 410
D+ LKQIL N+S + + + KH +WN+PPV +DAFHM + ELW+RRH IKY
Sbjct: 400 EDLPNLKQILANVSSTKHREMQYLGHQAIKHLEWNNPPVAYDAFHMTLLELWVRRHSIKY 459
>gi|255550902|ref|XP_002516499.1| catalytic, putative [Ricinus communis]
gi|223544319|gb|EEF45840.1| catalytic, putative [Ricinus communis]
Length = 601
Score = 196 bits (499), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 110/336 (32%), Positives = 170/336 (50%), Gaps = 59/336 (17%)
Query: 76 SVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESR-FLTHDPD 134
S+F +Y M N K+YIY +G+ +Q L G YASEG+F + + S+ F+T
Sbjct: 312 SMFKRSYEIMEENLKVYIYKEGEKPILHQP--VLKGIYASEGWFMKQLEASKKFVTKKSR 369
Query: 135 RAHLFFIPISCHKMRGK-----GTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTC 189
+AHLF++P S + + S +N+ +K+Y+D ++ KYP+WNRT G DHF V C
Sbjct: 370 KAHLFYLPFSSRNLELQLYVPDSHSRKNLIKYLKNYLDLIVAKYPFWNRTEGVDHFLVAC 429
Query: 190 HDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVENRTV 249
HD TE + F N IR +C+ FI KD +LP+
Sbjct: 430 HDWAASETEQLMF---NCIRALCNADVKEGFIFGKDASLPET------------------ 468
Query: 250 LGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQV 309
N +A G + Y + +++CIC G +V
Sbjct: 469 ------------------------------NSDAKARGKMNYVQHMKSSRYCICARGYEV 498
Query: 310 NSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFV 369
NS RI ++I Y CVPVI+S+ Y PF ++L+W FAV + E+D+ LK IL +I + +
Sbjct: 499 NSPRIVEAILYECVPVIISDNYVPPFFEVLNWESFAVFVLEKDIPNLKNILLSIPEKRYR 558
Query: 370 SLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+ + VQ+HF W++ PV++D FHM+++ +W R
Sbjct: 559 EMQMRVKMVQQHFLWHARPVKYDLFHMILHSVWYNR 594
>gi|6735291|emb|CAB68119.1| putative protein [Arabidopsis thaliana]
Length = 340
Score = 195 bits (496), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 109/337 (32%), Positives = 182/337 (54%), Gaps = 21/337 (6%)
Query: 86 VTNFKIYIYPDGDPNTFYQTPRK----LTGKYASEGYFFQNIRESRFLTHDPDRAHLFFI 141
+ FK++ Y +G+ + P + G++ E + RF P+ AH FF+
Sbjct: 1 MKTFKVWSYKEGEQPLVHDGPVNDIYGIEGQFIDELSYVMGGPSGRFRASRPEEAHAFFL 60
Query: 142 PISC----HKMRGKGTSYEN-----MTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDV 192
P S H + TS + + I DYVD + +K+P+WN++ GADHF V+CHD
Sbjct: 61 PFSVANIVHYVYQPITSPADFNRARLHRIFNDYVDVVAHKHPFWNQSNGADHFMVSCHDW 120
Query: 193 GVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQP---FALPRGGRDVENRTV 249
+ P KN +R +C+ + F + D ++P++ P P G++ ENRT+
Sbjct: 121 APDVPDSKPEFFKNFMRGLCNANTSEGFRRNIDFSIPEINIPKRKLKPPFMGQNPENRTI 180
Query: 250 LGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQ 308
L F+AG + IR +L W+ D ++ + ++ Y +KFC+CP G +
Sbjct: 181 LAFFAGRAHGYIREVLFSHWKGKDKDVQVYDHLTKGQ----NYHELIGHSKFCLCPSGYE 236
Query: 309 VNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEF 368
V S R ++I+ GCVPV++S+ Y LPFND+LDW KF+V + + +K+IL+ I ++
Sbjct: 237 VASPREVEAIYSGCVPVVISDNYSLPFNDVLDWSKFSVEIPVDKIPDIKKILQEIPHDKY 296
Query: 369 VSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+ ++ N++KV++HF N P FD HM+++ +WLRR
Sbjct: 297 LRMYRNVMKVRRHFVVNRPAQPFDVIHMILHSVWLRR 333
>gi|224098485|ref|XP_002311191.1| predicted protein [Populus trichocarpa]
gi|222851011|gb|EEE88558.1| predicted protein [Populus trichocarpa]
Length = 345
Score = 194 bits (492), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 112/348 (32%), Positives = 187/348 (53%), Gaps = 38/348 (10%)
Query: 69 SDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRF 128
+ ++ SP++F NY M+ +FKIYIY P +F +E FF +++ S F
Sbjct: 24 TSLYLSPTIFFPNYQNMLNSFKIYIYTPSKPFSFSS---------PTESLFFTSLQASPF 74
Query: 129 LTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVT 188
+T +P+ AHLFF+P + + T + ++ L ++PYWNRTLGADHF+V+
Sbjct: 75 VTQNPEEAHLFFVPFASNLS----------TRSIARFIRDLRMEFPYWNRTLGADHFYVS 124
Query: 189 CHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVENRT 248
C +G + + L KN++++ C P + F+PHKD++LP + + + R NRT
Sbjct: 125 CAGLGYESDRNLVELKKNSVQISCFPVPEGKFVPHKDISLPPLAR---ITRASHAPGNRT 181
Query: 249 V--LGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGG 306
V L G ++SK LA ND++ + + N + R + FC+ G
Sbjct: 182 VRYLVRHGGVKDSK----LANELRNDSDFLMESEPSNE----MTLVERLGSSMFCLFEDG 233
Query: 307 SQVNSARIADSIHYGCVPVILSN--YYDLPFNDILDWRKFAVILNE-RDVYQLKQIL-KN 362
+ ++ I +++ +GCVPV++++ DLP D+L W+K AV + + ++K++L +
Sbjct: 234 ADISG--IGEALRFGCVPVMVTDRPMQDLPLMDVLSWQKIAVFVGSGGGIKEMKRVLDRT 291
Query: 363 ISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIKY 410
D E V +HF WN P +D+F+MV+Y+LWLRRH I+Y
Sbjct: 292 CKDDECEGTRRLGVAASQHFGWNEIPQPYDSFYMVVYQLWLRRHTIRY 339
>gi|320163884|gb|EFW40783.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
Length = 478
Score = 194 bits (492), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 122/386 (31%), Positives = 191/386 (49%), Gaps = 67/386 (17%)
Query: 81 NYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFF 140
Y +M+ + +IY+Y D + + KY +E F + S F T PD+A++FF
Sbjct: 98 EYQQMLDSLRIYMY---DIALGREMRWLVDDKYGAEQLFINLLATSAFHTTAPDKANMFF 154
Query: 141 IPISCHKMRGKGTSYENMTVIVKD----YVDSLINKYPYWNRTLGADHFFVTCHDVGVRA 196
+P C R ++ K+ Y D ++NKY +WN + G DHF++ HD+G
Sbjct: 155 MPFRCTAYRRSVQERVLGDIVAKNVTAQYFDVVMNKYRWWNVSSGTDHFYICGHDMGTAV 214
Query: 197 TE-GVPFLIKNAIRVVCSPSYDVA-FIPHKDVALP---QVL------------------- 232
T P L+KNAI +V + YD A +IPHKD++LP VL
Sbjct: 215 TALSHPALVKNAIGLVNTADYDDARYIPHKDISLPPNIDVLPSAHVATEEEITADLIRLE 274
Query: 233 ---------------------QPFALPRGGRDV-----------ENRTVLGFWAGHRN-S 259
+P R G+ V E RT L ++AG +
Sbjct: 275 MARDRLYRATRQKVAHPDMNFEPLMERRMGKLVQYGLGGLIHPREKRTKLAYFAGPLHYG 334
Query: 260 KIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIH 319
++R + + NDT++ + R + P++Y +KFC+ G + S R+ D++
Sbjct: 335 RVRPKVRDAFANDTDIVLFEGRHAQ---PILYYNELATSKFCLFLRGYRAWSPRLMDAVF 391
Query: 320 YGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQ 379
GC+PVI+S++YDLP +LDW +F++ + E + +LKQ L +SDA+ + N L +V
Sbjct: 392 MGCIPVIISDHYDLPLGQLLDWSEFSITIPEARIPRLKQTLLAVSDAQLSRMQNRLAEVY 451
Query: 380 KHFQWNSPPVRFDAFHMVMYELWLRR 405
+HF WN PP FDAFHMV+++LW RR
Sbjct: 452 QHFVWNDPPKPFDAFHMVLWQLWRRR 477
>gi|147789666|emb|CAN71921.1| hypothetical protein VITISV_031190 [Vitis vinifera]
Length = 310
Score = 193 bits (491), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 102/302 (33%), Positives = 169/302 (55%), Gaps = 15/302 (4%)
Query: 115 SEGYFFQNIR--ESRFLTHDPDRAHLFFIPISCHKMRG---KGTSYENMTV--IVKDYVD 167
S+G F I +S+FL PD A+ F+IP+S ++ + Y + +V DY++
Sbjct: 6 SKGNFMDEIESGKSQFLARHPDEANAFYIPMSLTRVVHFIYEPPHYXGKWIPRLVXDYIN 65
Query: 168 SLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVA 227
+ +KYPYWNR+ GADHF V+CHD + P L K+ IR +C+ + F P +D++
Sbjct: 66 FVADKYPYWNRSKGADHFLVSCHDWAPDVSALKPDLYKHFIRALCNANTSERFHPIRDIS 125
Query: 228 LPQVLQP---FALPRGGRDVENRTVLGFWAGHRNSKIRVILARVW-ENDTELDISNNRIN 283
+P++ P P + R +L F+AG + +R +L + W E D E+ + R+
Sbjct: 126 IPEINIPRGKLGPPHLDQPPNKRPILAFFAGGAHGYVRSVLFKYWKEKDDEVQVFE-RLP 184
Query: 284 RAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRK 343
R Y + +KFC+CP G +V S RI +I GCVP+I+ ++Y LPF+D LDW K
Sbjct: 185 RNRN---YSKSMGDSKFCLCPSGYEVASPRIVKAIAAGCVPMIICDHYSLPFSDXLDWSK 241
Query: 344 FAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWL 403
F++ + + ++K+ILK + ++ + + +VQ+HF N P +D HM+++ +W
Sbjct: 242 FSIYITSDKIPEIKKILKAVPTETYLEMQKRVKQVQRHFAINRPAXPYDMLHMILHSVWX 301
Query: 404 RR 405
RR
Sbjct: 302 RR 303
>gi|75223627|gb|ABA18110.1| exostosin family protein [Arabidopsis arenosa]
Length = 340
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 110/339 (32%), Positives = 179/339 (52%), Gaps = 25/339 (7%)
Query: 86 VTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR------ESRFLTHDPDRAHLF 139
+ FK++ Y +G+ + P + Y EG F + RF P+ AH F
Sbjct: 1 MKRFKVWSYKEGEQPLVHDGP--VNDIYGIEGQFIDELSNVMGGPSGRFRASRPEEAHAF 58
Query: 140 FIPISC----HKMRGKGTSYEN-----MTVIVKDYVDSLINKYPYWNRTLGADHFFVTCH 190
F+P S H + TS + + I DYVD + K+P+WN++ GADHF V+CH
Sbjct: 59 FLPFSVANIVHYVYQPITSPADFNRARLHRIFNDYVDVVACKHPFWNQSNGADHFMVSCH 118
Query: 191 DVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQP---FALPRGGRDVENR 247
D + P K+ +R +C+ + F P+ D ++P++ P P G+ ENR
Sbjct: 119 DWAPDVADSKPEFFKDFMRGLCNANTTEGFRPNIDFSIPEINIPKRKLKPPFMGQTPENR 178
Query: 248 TVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGG 306
T+L F+AG + IR +L W+ D ++ + ++ Y +KFC+CP G
Sbjct: 179 TILAFFAGRAHGYIREVLFTHWKGKDKDVQVYDHLTKGQ----NYHELIGHSKFCLCPSG 234
Query: 307 SQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDA 366
+V S R ++I+ GCVPV++S+ Y LPF D+LDW KF+V + + +K+IL+ I
Sbjct: 235 YEVASPREVEAIYSGCVPVVISDNYSLPFKDVLDWSKFSVEIPVDKIPDIKKILQEIPHD 294
Query: 367 EFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
++ ++ N++KV++HF N P FD HM+++ +WLRR
Sbjct: 295 KYRRMYQNVMKVRRHFVVNRPAQPFDVIHMILHSVWLRR 333
>gi|15239475|ref|NP_198516.1| Exostosin family protein [Arabidopsis thaliana]
gi|332006752|gb|AED94135.1| Exostosin family protein [Arabidopsis thaliana]
Length = 547
Score = 191 bits (485), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 111/326 (34%), Positives = 179/326 (54%), Gaps = 31/326 (9%)
Query: 81 NYLEMVTNFKIYIYPDGDPNTFYQ-TPRKLTGKYASEGYFFQNIRESR-FLTHDPDRAHL 138
+Y M KIY+Y +G F+ PR G YASEG+F + + ++ F+ DP +AHL
Sbjct: 232 SYDLMERKLKIYVYKEGGKPIFHTPMPR---GIYASEGWFMKLMESNKKFVVKDPRKAHL 288
Query: 139 FFIPISCHKMRGK-GTSYENMTVI---VKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV 194
F+IPIS +R G ++ + +K+YVD + KY +WNRT GADHF V CHD G
Sbjct: 289 FYIPISIKALRSSLGLDFQTPKSLADHLKEYVDLIAGKYKFWNRTGGADHFLVACHDWGN 348
Query: 195 RATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV-LQPFALPR---GGRDVENRTVL 250
+ T +KN++R +C+ + F D ALP ++ P GG+ R +L
Sbjct: 349 KLTTKT---MKNSVRSLCNSNVAQGFRIGTDTALPVTYIRSSEAPLEYLGGKTSSERKIL 405
Query: 251 GFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPL--------VYQRRFYKTKFCI 302
F+AG + +R IL ++WEN E D+ + GP+ Y+ +++CI
Sbjct: 406 AFFAGSMHGYLRPILVKLWEN-KEPDM------KIFGPMPRDPKSKKQYREYMKSSRYCI 458
Query: 303 CPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKN 362
C G +V++ R+ ++I CVPVI+++ Y PF ++L+W +FAV + E+D+ L+ IL +
Sbjct: 459 CARGYEVHTPRVVEAIINECVPVIIADNYVPPFFEVLNWEEFAVFVEEKDIPNLRNILLS 518
Query: 363 ISDAEFVSLHNNLVKVQKHFQWNSPP 388
I + ++ + + VQ+HF W+ P
Sbjct: 519 IPEDRYIGMQARVKAVQQHFLWHKKP 544
>gi|302799042|ref|XP_002981280.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300150820|gb|EFJ17468.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 352
Score = 191 bits (485), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 115/343 (33%), Positives = 182/343 (53%), Gaps = 29/343 (8%)
Query: 81 NYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNI-RESRFLTHDPDRAHLF 139
+Y M F+I++Y DG ++ P+ TG YASEG F + R + F +P A +F
Sbjct: 17 SYEAMQRVFRIFVYKDGYKPLMHEGPK--TGIYASEGLFIATMERGNPFAVTEPKIATMF 74
Query: 140 FIPISCHKM-----RGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV 194
FIP S +M S +N+ + Y+ L +KYPY N T G DHFFV+CHD +
Sbjct: 75 FIPFSLKQMVDYMYDTNSHSMKNIQSYIAGYLRRLASKYPYMNATNGIDHFFVSCHDWAL 134
Query: 195 RATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVL------QPFALPRGGRDVENRT 248
A E +N ++VVC+ F +DV+LP+ P G D R
Sbjct: 135 MALEKQD-CQRNIVKVVCNADSSRGFNTSRDVSLPETRVRQGKHSPIIRDTSGMD---RP 190
Query: 249 VLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVYQRRFYK-----TKFCI 302
L F+AG + K+R +L W++ D E+ +I + P V +R Y +K+CI
Sbjct: 191 YLAFFAGQMHGKLRPVLLAHWKDKDPEM-----KIYEVLPPSVAERISYSEHMRLSKYCI 245
Query: 303 CPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKN 362
C G +VNS R+ ++I CVPVIL++ + LPF+++++W +V + E+DV LK IL
Sbjct: 246 CAAGFEVNSPRLVEAIVNECVPVILADNFVLPFSEVINWDSISVTVAEKDVANLKAILAG 305
Query: 363 ISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
I + + L V++HF+W + P ++D F+M+++ LW ++
Sbjct: 306 IPLRTYKEMQARLKHVKRHFEWKNSPEKYDIFNMIVHSLWTQQ 348
>gi|302772499|ref|XP_002969667.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300162178|gb|EFJ28791.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 352
Score = 188 bits (478), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 115/343 (33%), Positives = 181/343 (52%), Gaps = 29/343 (8%)
Query: 81 NYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNI-RESRFLTHDPDRAHLF 139
+Y M F+I++Y DG ++ P+ TG YASEG F + R + F +P A +F
Sbjct: 17 SYEAMQRVFRIFVYKDGYKPLMHEGPK--TGIYASEGLFIATMERGNPFAVTEPKIATMF 74
Query: 140 FIPISCHKM-----RGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV 194
FIP S +M S +N+ + Y+ L +KYPY N T G DHFFV+CHD +
Sbjct: 75 FIPFSLKQMVDYMYDTNSHSMKNIQSYIAGYLRRLASKYPYMNATNGIDHFFVSCHDWAL 134
Query: 195 RATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVL------QPFALPRGGRDVENRT 248
A E +N ++VVC+ F +DV+LP+ P G D R
Sbjct: 135 MALEKQD-CQRNIVKVVCNADSSRGFNTSRDVSLPETRVRQGKHSPIIRDISGMD---RP 190
Query: 249 VLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVYQRRFYK-----TKFCI 302
L F+AG + K+R +L W++ D E+ +I + P V +R Y +K+CI
Sbjct: 191 YLAFFAGQMHGKLRPVLLAHWKDKDPEM-----KIYEVLPPSVAERISYSEHMRLSKYCI 245
Query: 303 CPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKN 362
C G +VNS R+ ++I CVPVIL++ + LPF+++++W +V + E+DV LK IL
Sbjct: 246 CAAGFEVNSPRLVEAIVNECVPVILADNFVLPFSEVINWDSISVTVAEKDVANLKAILAG 305
Query: 363 ISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
I + + L V++HF W + P ++D F+M+++ LW ++
Sbjct: 306 IPLRRYKEMQARLKHVKRHFVWKNSPEKYDIFNMIVHSLWTQQ 348
>gi|356560377|ref|XP_003548469.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
At5g03795-like [Glycine max]
Length = 334
Score = 187 bits (476), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 113/305 (37%), Positives = 163/305 (53%), Gaps = 30/305 (9%)
Query: 69 SDVFHSPSVFHLNYLEMVTNFKIYIYP----DGDPNTFYQTPRKLTGKYASEGYFFQNIR 124
+DV H +F +Y EM + I++YP D N K G YASE YF +
Sbjct: 47 NDVCHDRDIFLEDYKEMNRSLXIHVYPHREDDSFANVLLPVESKPGGNYASESYFKKVPM 106
Query: 125 ESRFLTHDPDRAHLFFIPISCHKMR-GKGTSYENMTVIVKDYVDSLINKYPYWNRTLGAD 183
+S F+T DP A LFF+P S ++R + ++DY+ ++ +KYPYWNRT GAD
Sbjct: 107 KSHFITKDPTEADLFFLPFSIARLRHNRRVGVGGKQDFIRDYIQNISHKYPYWNRTGGAD 166
Query: 184 HFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDV-AFIPHKDVALPQVLQPFALPRGGR 242
HF+V CH +G A + P + NAI+VVCS SY + I HKD LPQ+ PR G
Sbjct: 167 HFYVACHSIGRSAMDKAPDVKFNAIQVVCSSSYFLTGNIAHKDTCLPQI-----WPRKGN 221
Query: 243 D----VENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKT 298
R L F+AG NS +RV L W+ND+E+ + + R+ Y +
Sbjct: 222 PPILVSSKRKRLAFFAGGVNSPVRVKLLETWKNDSEIFVHHGRLKTP-----YADELLGS 276
Query: 299 KFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQ 358
KF + G +VN+ RI VI++NYYDLPF D+L+W+ F+V++ D+ QL +
Sbjct: 277 KFGLHVKGFEVNTTRIG---------VIIANYYDLPFADVLNWKSFSVVVTTLDI-QLGE 326
Query: 359 ILKNI 363
+K +
Sbjct: 327 DMKEL 331
>gi|26451681|dbj|BAC42936.1| unknown protein [Arabidopsis thaliana]
Length = 270
Score = 184 bits (468), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 93/259 (35%), Positives = 149/259 (57%), Gaps = 12/259 (4%)
Query: 154 SYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCS 213
S +N+ +K+Y+D + +KY +WN+T G+DHF V CHD T + IR +C+
Sbjct: 13 SDKNLIQFLKNYLDMISSKYSFWNKTGGSDHFLVACHDWAPSETRQ---YMAKCIRALCN 69
Query: 214 PSYDVAFIPHKDVALPQVLQPFALPR------GGRDVENRTVLGFWAGHRNSKIRVILAR 267
F+ KDVALP+ +PR GG+ V R +L F+AG + +R +L +
Sbjct: 70 SDVSEGFVFGKDVALPETT--ILVPRRPLRALGGKPVSQRQILAFFAGGMHGYLRPLLLQ 127
Query: 268 VWENDTELDISN-NRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVI 326
W + + D+ + I ++ G Y +K+CICP G +VNS R+ +++ Y CVPVI
Sbjct: 128 NWGGNRDPDMKIFSEIPKSKGKKSYMEYMKSSKYCICPKGHEVNSPRVVEALFYECVPVI 187
Query: 327 LSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
+S+ + PF ++L+W FAV + E+D+ LK IL +I++ + + + VQKHF W+S
Sbjct: 188 ISDNFVPPFFEVLNWESFAVFVLEKDIPDLKNILVSITEERYREMQMRVKMVQKHFLWHS 247
Query: 387 PPVRFDAFHMVMYELWLRR 405
P RFD FHM+++ +W R
Sbjct: 248 KPERFDIFHMILHSIWYNR 266
>gi|115469550|ref|NP_001058374.1| Os06g0680900 [Oryza sativa Japonica Group]
gi|113596414|dbj|BAF20288.1| Os06g0680900 [Oryza sativa Japonica Group]
gi|125556494|gb|EAZ02100.1| hypothetical protein OsI_24187 [Oryza sativa Indica Group]
Length = 477
Score = 181 bits (460), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 121/355 (34%), Positives = 176/355 (49%), Gaps = 24/355 (6%)
Query: 71 VFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRES---- 126
V+ FH +Y+EM FK++ Y +G+P + G EG+ + S
Sbjct: 120 VYRDAYAFHQSYIEMEKRFKVWTYREGEPPVVQKGGAAFAGNDGIEGHLIAELDSSGGGG 179
Query: 127 RFLTHDPDRAHLFFIPISCHKMRGKGTS------YENMTVIVKDYVDSLINKYPYWNRTL 180
R P AH FF+PIS + G ++ +V YVD L YP+WNR+
Sbjct: 180 RHRARHPGEAHAFFLPISVASIAGYVYRRDMIDFWDPQLRLVAGYVDGLAAMYPFWNRSR 239
Query: 181 GADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPF-ALPR 239
GADHF V+CH + L NAIRV+C F P DVALP V+ A P
Sbjct: 240 GADHFLVSCHQWAPILSAAKAELRGNAIRVMCDADMSDGFDPATDVALPPVVASARATPP 299
Query: 240 GGRDVENRTVLGFWAGHRNSKIRV---ILARVWENDTELDISNNRINRAIGPLVYQRRFY 296
GR RTVL F+A V +LAR WE + + R+ + RR
Sbjct: 300 QGRVASERTVLAFFAAGGGGGGAVREALLAR-WEGRDDRVVVYGRLPAGVDHGELMRR-- 356
Query: 297 KTKFCICP----GGSQVNSARIADSIHYGCVPVILSNY-YDLPFNDILDWRKFAVILNER 351
+FC+CP G+ S R+ ++I GCVPV++ + Y PF+D+LDW +F+V +
Sbjct: 357 -ARFCLCPCGGGEGAAAASRRVVEAITAGCVPVLVDDGGYSPPFSDVLDWARFSVAVPAE 415
Query: 352 DVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPV-RFDAFHMVMYELWLRR 405
V ++K IL +SD + L +++V++HF+ N PP RFD +MV++ +WLRR
Sbjct: 416 RVGEIKDILGGVSDRRYGVLRRRVLRVRRHFRLNRPPAKRFDVVNMVIHSIWLRR 470
>gi|168043245|ref|XP_001774096.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674642|gb|EDQ61148.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 351
Score = 181 bits (459), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 118/358 (32%), Positives = 180/358 (50%), Gaps = 27/358 (7%)
Query: 71 VFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQ-----TPRKLTGKYASE-----GYFF 120
++HSP F +NY +M N +IY+YP F Q P ++ + E FF
Sbjct: 3 LYHSPEFFAMNYNDMAKNLRIYLYPASQNYNFTQYEYGMNPSEMVSELGVETSSTTDTFF 62
Query: 121 QNIRES-RFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRT 179
+ ES RF+T D D AHL+F+PIS ++ + ++ Y+ L N Y W+ +
Sbjct: 63 NLLVESKRFVTDDADGAHLYFLPISIDRVWA-AVGPAKVGEHLRHYLQWLRNTYKLWDLS 121
Query: 180 LGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSP-SYDVAFIPHKDVALPQVL-QPFAL 237
LGADHF+ + H L KNAI+V SP + F PHKD++LP Q A
Sbjct: 122 LGADHFYFSSHAYDPINHRNNLELTKNAIQVASSPLRRNQNFFPHKDISLPSYKSQHIAE 181
Query: 238 PRGGRDVENRTVLGFWAGHRNSKIRVILARV---WENDTELDISNNRINRAIGPLVYQRR 294
+ R L F I I+A V W +D++ + A P +
Sbjct: 182 VQNLVGASQRPKLVF-VSSPPEDIDPIVASVIQKWTSDSDF-----HVESADQPSPPFEK 235
Query: 295 FYKTKFCICPGGSQVNSARIADSIHYGCVPVILSN--YYDLPFNDILDWRKFAVILNERD 352
++FC+ + + DS+ GCVPV++++ YDLPF D+L+W++F+V+L ++
Sbjct: 236 LLSSRFCVSVSPQAM--LNVVDSLRLGCVPVLIADSIIYDLPFQDVLNWKEFSVVLGVKE 293
Query: 353 VYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIKY 410
LK +L +IS E+ + + KH +WN PP +DAFHM ++ELW+RRH IKY
Sbjct: 294 SPNLKTLLSSISTDEYRKMQYLGHQASKHMEWNDPPKPWDAFHMTLHELWVRRHSIKY 351
>gi|443705810|gb|ELU02170.1| hypothetical protein CAPTEDRAFT_209273 [Capitella teleta]
Length = 473
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 109/326 (33%), Positives = 175/326 (53%), Gaps = 19/326 (5%)
Query: 64 YSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNI 123
Y D + H + F YLEM+++ KI++Y + + R + KY E F + +
Sbjct: 156 YDDNSEAIAHKDN-FGAPYLEMMSSLKIFMYTSELDD---KVNRGVHWKYGVESLFIKLL 211
Query: 124 RESRFLTHDPDRAHLFFIPISCHK----MRGKGTSYENMTVIVKDYVDSLINKYPYWNRT 179
+S F+T D + AH FF+P C +R + + +V + + + ++Y YW+R+
Sbjct: 212 SKSSFVTKDAEEAHFFFLPFQCATYRNVIRDRAAAQNFTENLVSNILKDISSRYTYWDRS 271
Query: 180 LGADHFFVTCHDVGVRATEGVPF-LIKNAIRVVCSPSY-DVAFIPHKDVALPQ--VLQPF 235
LGADHF+V HD+G + L KNAI +V + Y D ++PHKD+ALP
Sbjct: 272 LGADHFYVCAHDMGASSVAAADANLQKNAIALVNTADYADPFYVPHKDIALPPHPAHGKG 331
Query: 236 ALP---RGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQ 292
+LP RGG R L F+AG+ +S + + W ND+++ I + ++ VY
Sbjct: 332 SLPDIGRGGGKSTERPNLAFYAGNLDSGQLRPVFKDWLNDSDIHIHHGHMSDN----VYI 387
Query: 293 RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERD 352
+ KFC+ P G +V S + D++ GCVPVI+S+YYDLP + ++DW FAV L E++
Sbjct: 388 KNLQSAKFCLVPRGHRVWSPVVMDAVWTGCVPVIISDYYDLPLHGLIDWTHFAVFLKEKE 447
Query: 353 VYQLKQILKNISDAEFVSLHNNLVKV 378
V LK LK+I + + + + + KV
Sbjct: 448 VLSLKSKLKSIPEEKLRRMQSYIKKV 473
>gi|296081850|emb|CBI20855.3| unnamed protein product [Vitis vinifera]
Length = 618
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 107/352 (30%), Positives = 175/352 (49%), Gaps = 57/352 (16%)
Query: 67 LGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRES 126
L + ++ + SVF +Y M K+Y Y +G+ F+Q P + G YASEG+F + ++ +
Sbjct: 290 LHASLYRNVSVFKRSYELMENTLKVYTYREGERPVFHQPP--IKGIYASEGWFMKLMQAN 347
Query: 127 R-FLTHDPDRAHLFFIPISCHKMRG-----KGTSYENMTVIVKDYVDSLINKYPYWNRTL 180
+ F+T + +AHLF++P S + S +N+ +K+Y+D + KYP+WNRT
Sbjct: 348 KKFVTKNGRKAHLFYLPFSSLMLEEALYVPNSHSRKNLEQYLKNYLDMIGAKYPFWNRTG 407
Query: 181 GADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPR- 239
GADHF V CHD T L+ N+IR +C+ F KDV+LP+ +P+
Sbjct: 408 GADHFLVACHDWAPSETLK---LMANSIRALCNSDIREGFKLGKDVSLPETC--VRIPQN 462
Query: 240 -----GGRDVENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVYQR 293
GG+ R +L F+AG + +R IL + WEN D ++ I G + Y +
Sbjct: 463 PLRQLGGKPPSQRRILAFFAGSMHGYVRPILLKYWENKDPDMKIYGRMPKAKKGTMNYIQ 522
Query: 294 RFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDV 353
+K+CIC G +VNS R +D+
Sbjct: 523 HMKSSKYCICAKGYEVNSPR-------------------------------------KDI 545
Query: 354 YQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
LK IL +I + ++ + + +VQ+HF W++ PV++D FHM+++ +W R
Sbjct: 546 PNLKSILLSIPEKSYLEIQMRVKQVQQHFLWHAKPVKYDVFHMILHSVWYNR 597
>gi|224112673|ref|XP_002316258.1| predicted protein [Populus trichocarpa]
gi|222865298|gb|EEF02429.1| predicted protein [Populus trichocarpa]
Length = 339
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 111/348 (31%), Positives = 175/348 (50%), Gaps = 44/348 (12%)
Query: 69 SDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRF 128
+ ++ SP++ +Y M+ +FKIYIY P L+ +E FF ++ S F
Sbjct: 24 TSLYLSPTILFPDYQNMLISFKIYIY---------TPPNALSFSSPTESNFFTCLQNSPF 74
Query: 129 LTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVT 188
+T +P+ AHL+F+P S + T V ++ L ++PYWNRTLGADHF+V+
Sbjct: 75 VTQNPEEAHLYFVPFSSNLS----------TRSVARFIRDLRMEFPYWNRTLGADHFYVS 124
Query: 189 CHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVENRT 248
C +G + + L KN++++ C P+ + F+PHKD+ P Q NRT
Sbjct: 125 CAGLGYESDRNLVELKKNSVQISCFPTTEGRFVPHKDITFPPHAQ-----------GNRT 173
Query: 249 V--LGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGG 306
LGF N L D++ I + N + R + FC+ G
Sbjct: 174 AKYLGFV--RYNEVKESNLVNELRKDSDFLIESEPSNG----MTLVGRLGSSVFCLFEYG 227
Query: 307 SQVNSARIADSIHYGCVPVILSN--YYDLPFNDILDWRKFAVILNER-DVYQLKQIL-KN 362
+ V+ I +++ +GCVPV++ + DLP D++ W+K A+ + R V ++K+ L +
Sbjct: 228 ADVSG--IGEALRFGCVPVMVMDRPMQDLPLMDVIGWQKIAIFVGSRGGVKEVKRELDRT 285
Query: 363 ISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIKY 410
D E V +HF WN P +D+FHMVMY+LWLRRH I+Y
Sbjct: 286 CKDDECAGRRRLGVVASQHFVWNHMPQPYDSFHMVMYQLWLRRHAIRY 333
>gi|356527767|ref|XP_003532479.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
At5g03795-like [Glycine max]
Length = 467
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 105/351 (29%), Positives = 176/351 (50%), Gaps = 37/351 (10%)
Query: 65 SDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR 124
+L + +F S+F +Y M K+YIY DG+ F+Q
Sbjct: 137 KELYAPLFRKVSMFKRSYELMECTLKVYIYKDGNKPIFHQP------------------- 177
Query: 125 ESRFLTHDPDRAHLFFIPISCHKMRGK-----GTSYENMTVIVKDYVDSLINKYPYWNRT 179
+ DP +AHLF++P S + + N+ +KDY D + K PY+NRT
Sbjct: 178 ----IMKDPAKAHLFYMPFSSRMLEHSLYVRNSHNRTNLRQFLKDYTDKISAKIPYFNRT 233
Query: 180 LGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV-LQPFALP 238
GADHF CHD T ++ I+ +C+ F +DV+LP+ ++ P
Sbjct: 234 GGADHFLAACHDWAPYETR---HHMEYCIKALCNADVTQGFKIGRDVSLPEAYVRSVRDP 290
Query: 239 R---GGRDVENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVYQRR 294
+ GG+ +L F AG+ + + IL + W++ D ++ I + + Y
Sbjct: 291 QRDLGGKPPHQXPILAFHAGNMHGYLHPILLKHWKDKDPDMKIYGPMPHGVTSKMNYINH 350
Query: 295 FYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVY 354
+K+CICP G +VNS R+ ++I CVPVI+S+ + F ++L+W F++IL E+D+
Sbjct: 351 MKNSKYCICPKGYEVNSPRMVEAIFXECVPVIISDNFVPHFFEVLNWDVFSIILAEKDIP 410
Query: 355 QLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
LKQIL ++ +++ L + K QKHF W+ P+++D FH+ ++ +W R
Sbjct: 411 NLKQILLSVXH-KYLELQLGVRKAQKHFFWHVKPLKYDLFHITLHLIWYNR 460
>gi|224166016|ref|XP_002338879.1| predicted protein [Populus trichocarpa]
gi|222873817|gb|EEF10948.1| predicted protein [Populus trichocarpa]
Length = 208
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 89/209 (42%), Positives = 124/209 (59%), Gaps = 37/209 (17%)
Query: 152 GTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVV 211
G S + + V+ +V+SLI++YPYWNRTLGADHFF+TC D+ V A+E + L+KN+IRV+
Sbjct: 1 GRSVDERAIAVQHFVNSLISEYPYWNRTLGADHFFITCADIHVIASERIWNLMKNSIRVM 60
Query: 212 CSPSYDVAFIPHKDVALPQVLQPF----------------------ALPRGGRDVENR-- 247
CSPSY+V ++PHKDV+LPQ +QPF LP ++++R
Sbjct: 61 CSPSYNVEYVPHKDVSLPQSVQPFNVSVSQIMPPLYAFIAPTTQPLTLPAAKYNMKSRYR 120
Query: 248 ----------TVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPL--VYQRRF 295
FW G + + IR L WEND+ELDI + + + +Y +F
Sbjct: 121 YLLCPWIILEQEYSFWRGLKENYIRKSLVNAWENDSELDIKEIQTEASTTEIRRLYHEKF 180
Query: 296 YKTKFCICPGGSQVNSARIADSIHYGCVP 324
Y +KFCICPGG Q++ A IA +IHYGCVP
Sbjct: 181 YSSKFCICPGGPQIDGA-IAVAIHYGCVP 208
>gi|218197695|gb|EEC80122.1| hypothetical protein OsI_21883 [Oryza sativa Indica Group]
Length = 545
Score = 172 bits (435), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 120/422 (28%), Positives = 192/422 (45%), Gaps = 91/422 (21%)
Query: 71 VFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRE----- 125
V+ +P+ F+ +Y+EM FK+Y+Y +G+P ++ P K YA EG F + +
Sbjct: 120 VYRNPAAFYRSYVEMERRFKVYVYEEGEPPIAHEGPCK--NIYAVEGRFIEELELMAPPL 177
Query: 126 SRFLTHDPDRAHLFFIPISCHKM-----RGKGTSYENMTVIVKDYVDSLINKYPYWNRTL 180
T DP RAH F+P+S +M R + IV DYV + +++ +WNR+
Sbjct: 178 GGVRTWDPARAHALFLPLSVSQMVQLAYRPLSYDLSPLRAIVADYVAVVASRHRFWNRSA 237
Query: 181 GAD-------------HFF--------VTCHDV--------------------------- 192
GAD HFF + H +
Sbjct: 238 GADHFMLSCHDWASTNHFFPLFFRRKIFSQHQMISIQISKRVQIINSVHTPSVQRDSISG 297
Query: 193 -------GVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV----LQPFALPRGG 241
G A+ G P L NAIR +C+ + F P KDV++P++
Sbjct: 298 FPTFRVNGPHASRGHPELYANAIRALCNANTSEGFRPDKDVSIPEINLYDGDMPPELLSP 357
Query: 242 RDVENRTVLGFWAGHRNSKIRVILARVWEN--------------DTELDISNNRINRAIG 287
R L F+AG R+ +R +L R W+ + +S + A G
Sbjct: 358 APPPPRPFLAFFAGGRHGHVRDLLLRHWKGRDPAVFPVYEYDLPSIPVSVSGDGDTDAGG 417
Query: 288 ----PLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRK 343
P + R +++FC+CP G +V S R+ ++IH GCVPV++++ Y PF D+L W
Sbjct: 418 EGGNPYYWYMR--RSRFCLCPSGHEVASPRVVEAIHAGCVPVVVADGYAPPFADVLRWEA 475
Query: 344 FAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWL 403
F+V + DV +L+++L+ I E L + + V++HF + PP R D FHM+++ +WL
Sbjct: 476 FSVAVAVADVPRLRELLERIPAPEVERLRDGVRLVKRHFMLHQPPERLDMFHMILHSVWL 535
Query: 404 RR 405
RR
Sbjct: 536 RR 537
>gi|255566500|ref|XP_002524235.1| conserved hypothetical protein [Ricinus communis]
gi|223536512|gb|EEF38159.1| conserved hypothetical protein [Ricinus communis]
Length = 337
Score = 164 bits (415), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 108/342 (31%), Positives = 168/342 (49%), Gaps = 48/342 (14%)
Query: 75 PSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPD 134
P++F NY M+ +FKIY Y P +F E FF +++ S F+T +P+
Sbjct: 32 PALFFPNYQRMLQSFKIYTYTPPQPFSFTS---------PVESLFFTSLQNSHFITLNPE 82
Query: 135 RAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV 194
+AHLFFIP S ++ +++D L ++PYWNRTLGADHF+++C +G
Sbjct: 83 QAHLFFIPFP------SDLSPRSLARVIRD----LRTEFPYWNRTLGADHFYISCTGLGY 132
Query: 195 RATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVENRTVLGFWA 254
+ + L KN++++ C PS + F+PHKD+ LP + +P N+
Sbjct: 133 ESDRNLVELKKNSVQISCFPSPNGKFVPHKDITLPPL-----VPSTIHKSSNK------- 180
Query: 255 GHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARI 314
R K V V E +L++ + Q KT+ C N + I
Sbjct: 181 -RRPYKAFVKYDGVEELRGDLEVL----------IESQPSDEKTRSEFCLFDYAANISGI 229
Query: 315 ADSIHYGCVPVILSN--YYDLPFNDILDWRKFAVILNERD--VYQLKQILKNI-SDAEFV 369
+++ GCVP++++ DLP D+L W++ AVI+ D +K++L S +
Sbjct: 230 GEALSSGCVPLVITERPIQDLPLMDVLRWQEIAVIVGSSDDGFKWVKRVLNGTCSRGDTC 289
Query: 370 SLHNNL-VKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIKY 410
L +H WN P +DAFHMVMY+LWLRRH I+Y
Sbjct: 290 ERMRRLGAGASQHLVWNETPEPYDAFHMVMYQLWLRRHTIRY 331
>gi|147775380|emb|CAN73459.1| hypothetical protein VITISV_022181 [Vitis vinifera]
Length = 319
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 106/331 (32%), Positives = 164/331 (49%), Gaps = 29/331 (8%)
Query: 85 MVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR--ESRFLTHDPDRAHLFFIP 142
M FKI+ Y +GD + P+ Y EG F + +S F+ PD AH+F+IP
Sbjct: 1 MEKRFKIWTYKEGDQPLVHGGPK--NSIYGIEGQFMDEMESGDSHFMAGHPDVAHVFYIP 58
Query: 143 IS----CHKMRGKGTSYEN--MTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHD-VGVR 195
IS H + Y + +V DY+ + +KYPYWNR+ GADHF V+CHD V
Sbjct: 59 ISVTRIAHYIYSPPVDYSGHMLQRLVTDYIYVVSDKYPYWNRSNGADHFLVSCHDWVQSF 118
Query: 196 ATEGVPFLIKNAIRVVCSPS-YDVAFIPHKDVALPQVLQPFALPRGGRDVENRTVLGFWA 254
A P + +C S Y A H P L V T +
Sbjct: 119 AMPTPPKGSSPSETSLCQKSTYPKAXSAHLTSTNP--LTSVTSSPSSPAVNPATCEPSCS 176
Query: 255 GHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARI 314
G K + +V+E+ + +NR Y + +KFC+CP G +V S R+
Sbjct: 177 G--PGKKMMTKFQVYEH-----LPSNRD--------YAKSMGDSKFCLCPSGWEVASPRV 221
Query: 315 ADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNN 374
++I GCVPVI+ +YY LPF+++LDW KF++ + + ++K+ILK + + ++ +
Sbjct: 222 VEAIAAGCVPVIICDYYVLPFSEVLDWSKFSINITSDKIPEIKKILKAVPNERYLRMQKR 281
Query: 375 LVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+ +VQ+HF N P +D HM+++ +WLRR
Sbjct: 282 VKQVQRHFVINRPAQPYDMLHMILHSVWLRR 312
>gi|388493832|gb|AFK34982.1| unknown [Lotus japonicus]
Length = 229
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 79/224 (35%), Positives = 143/224 (63%), Gaps = 8/224 (3%)
Query: 186 FVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV-LQPFALPR--GGR 242
++CHD G A++ VP L KN+IRV+C+ + F KD + P++ L+ ++ GG
Sbjct: 1 MLSCHDWGPMASQAVPNLYKNSIRVLCNANTSEGFKHGKDASFPEINLRTSSIDDLVGGP 60
Query: 243 DVENRTVLGFWAGHRNSKIRVILARVWEN-DTELDISNNRINRAIGPLVYQRRFYKTKFC 301
R+VL +AG + IR L WEN D ++ + ++ + + + Y K+KFC
Sbjct: 61 SPSKRSVLASFAGRLHGPIRPPLLEHWENKDGDMQVYSS-LPKGVS---YYDMLRKSKFC 116
Query: 302 ICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILK 361
+CP G +V S R+ ++I+ GCVPV++S +Y PF+D+L+W+ F++ ++ +D+ +LK+IL
Sbjct: 117 LCPSGYEVASPRVVEAIYTGCVPVLISEHYVPPFSDVLNWKSFSLEVSVKDIPRLKEILL 176
Query: 362 NISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+++ +++ + + ++++HF+ +SPP RFD FHMV++ +WLRR
Sbjct: 177 SVNTRQYIRMQRRVGQIRRHFEIHSPPKRFDVFHMVLHSVWLRR 220
>gi|449503277|ref|XP_004161922.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
sativus]
Length = 310
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 104/338 (30%), Positives = 157/338 (46%), Gaps = 76/338 (22%)
Query: 78 FHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAH 137
F NY M N +I+ Y +P +F +E FF+++ S + THDPD+AH
Sbjct: 35 FLKNYNSMSANLRIFTYIPFNPFSFSSQ---------AESLFFKSLLNSPYATHDPDQAH 85
Query: 138 LFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRAT 197
LFFIP S H S ++ +++ +L PYWNRTLGADHFF++ +G +
Sbjct: 86 LFFIPFSPH------ISTRSLARLIR----TLRTDLPYWNRTLGADHFFLSSSGIGYISD 135
Query: 198 EGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVENRTVLGFWAGHR 257
V L KNAI+V P FIPHKDV+LP V + RT
Sbjct: 136 RNVVELKKNAIQVSSFPVSPGKFIPHKDVSLPPV----------STLPPRT--------- 176
Query: 258 NSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCIC--PGGSQVNSARIA 315
P Y + K+ FC+ GG + + I
Sbjct: 177 ------------------------------PSCYGDKLAKSDFCLFEYEGG---DVSGIG 203
Query: 316 DSIHYGCVPVILSNYY--DLPFNDILDWRKFAV-ILNERDVYQLKQILKNISDAEFVSLH 372
+++ +GCVPV++S+ + DLP D++ W + AV + + +K++L+ + +
Sbjct: 204 EALRFGCVPVVISDRWIQDLPLMDVVRWEEMAVFVAGGGGIEGVKKVLRRVDGERLDRMK 263
Query: 373 NNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIKY 410
+HF WNSPP DAF+ V Y+LW+RRH ++Y
Sbjct: 264 KLGAAAAQHFVWNSPPQPLDAFNTVAYQLWVRRHAVRY 301
>gi|449465854|ref|XP_004150642.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
sativus]
Length = 312
Score = 158 bits (400), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 103/339 (30%), Positives = 158/339 (46%), Gaps = 76/339 (22%)
Query: 77 VFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRA 136
+F NY M N +I+ Y +P +F +E F++++ S + THDPD+A
Sbjct: 36 IFLKNYNSMSANLRIFTYIPFNPFSFSSQ---------AESLFYKSLLNSPYTTHDPDQA 86
Query: 137 HLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRA 196
HLFFIP S H S ++ +++ +L PYWNRTLGADHFF++ +G +
Sbjct: 87 HLFFIPFSPH------ISTRSLARLIR----TLRTDLPYWNRTLGADHFFLSSSGIGYIS 136
Query: 197 TEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVENRTVLGFWAGH 256
V L KNAI+V P FIPHKDV+LP V + RT
Sbjct: 137 DRNVVELKKNAIQVSSFPVSPGKFIPHKDVSLPPV----------STLPPRT-------- 178
Query: 257 RNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCIC--PGGSQVNSARI 314
P Y + K+ FC+ GG + + I
Sbjct: 179 -------------------------------PSCYGDKLAKSDFCLFEYEGG---DVSGI 204
Query: 315 ADSIHYGCVPVILSNYY--DLPFNDILDWRKFAV-ILNERDVYQLKQILKNISDAEFVSL 371
+++ +GCVPV++S+ + DLP D++ W + AV + + +K++L+ + +
Sbjct: 205 GEALRFGCVPVVISDRWIQDLPLMDVVRWEEMAVFVAGGGGIEGVKKVLRRVDGERLDRM 264
Query: 372 HNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIKY 410
+HF WNSPP DAF+ V Y+LW+RRH ++Y
Sbjct: 265 KKLGAAAAQHFVWNSPPQPLDAFNTVAYQLWVRRHAVRY 303
>gi|42573339|ref|NP_974766.1| Exostosin family protein [Arabidopsis thaliana]
gi|332004318|gb|AED91701.1| Exostosin family protein [Arabidopsis thaliana]
Length = 453
Score = 155 bits (392), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 93/259 (35%), Positives = 139/259 (53%), Gaps = 17/259 (6%)
Query: 58 AINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEG 117
A+ K+D L + ++H+ S+F +Y M K+Y+Y +GD F+Q + G YASEG
Sbjct: 190 ALVKKD-DTLYAPLYHNISIFKRSYELMEQTLKVYVYSEGDRPIFHQPEAIMEGIYASEG 248
Query: 118 YFFQNIRES-RFLTHDPDRAHLFFIPISCHKMRGK-----GTSYENMTVIVKDYVDSLIN 171
+F + + S RFLT DP +AHLF+IP S ++ K S N+ + +Y+D + +
Sbjct: 249 WFMKLMESSHRFLTKDPTKAHLFYIPFSSRILQQKLYVHDSHSRNNLVKYLGNYIDLIAS 308
Query: 172 KYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV 231
YP WNRT G+DHFF CHD T G P++ N IR +C+ + F+ KDV+LP+
Sbjct: 309 NYPSWNRTCGSDHFFTACHDWAPTETRG-PYI--NCIRALCNADVGIDFVVGKDVSLPET 365
Query: 232 ----LQPFALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIG 287
LQ GG RT+L F+AG + +R IL W + E D+ +I I
Sbjct: 366 KVSSLQNPNGKIGGSRPSKRTILAFFAGSLHGYVRPILLNQWSSRPEQDM---KIFNRID 422
Query: 288 PLVYQRRFYKTKFCICPGG 306
Y R +++FC+C G
Sbjct: 423 HKSYIRYMKRSRFCVCAKG 441
>gi|357478277|ref|XP_003609424.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
gi|355510479|gb|AES91621.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
Length = 229
Score = 151 bits (382), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 75/216 (34%), Positives = 130/216 (60%), Gaps = 6/216 (2%)
Query: 193 GVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQP---FALPRGGRDVENRTV 249
G R + P L K+ IR +C+ + F P++DV++PQ+ P P + NRT+
Sbjct: 4 GPRVSYANPKLFKHFIRALCNANTSEGFWPNRDVSIPQLNLPVGKLGPPNTDQHPNNRTI 63
Query: 250 LGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQV 309
L F+AG + KIR L + W++ E + + + Y + +KFC+CP G +V
Sbjct: 64 LTFFAGGAHGKIRKKLLKSWKDKDEEVQVHEYLPKGQD---YTKLMGLSKFCLCPSGHEV 120
Query: 310 NSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFV 369
S R+ ++I+ GCVPVI+ + Y LPF+D+L+W +F++ + + ++K IL+NI++ ++
Sbjct: 121 ASPRVVEAIYAGCVPVIICDNYSLPFSDVLNWSQFSMEIAVDRIPEIKTILQNITETKYR 180
Query: 370 SLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
L++N+ +V+KHF+ N P FD HM+++ +WLRR
Sbjct: 181 VLYSNVRRVRKHFEMNRPAKPFDLIHMILHSVWLRR 216
>gi|224142185|ref|XP_002324439.1| predicted protein [Populus trichocarpa]
gi|222865873|gb|EEF03004.1| predicted protein [Populus trichocarpa]
Length = 208
Score = 138 bits (347), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 66/207 (31%), Positives = 119/207 (57%), Gaps = 5/207 (2%)
Query: 204 IKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFAL----PRGGRDVENRTVLGFWAGHRNS 259
+++ I+ +C+ F +DV+ P+ A GG+ R +L F+AG+ +
Sbjct: 1 MEHCIKALCNADVTAGFKIGRDVSFPETYVRSARNPLRDLGGKPPSQRNILAFYAGNMHG 60
Query: 260 KIRVILARVWEN-DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSI 318
+R IL + W++ D ++ I + Y ++K+CICP G +VNS R+ ++I
Sbjct: 61 YLRPILLKYWKDKDPDMKIFGPMPPGVASKMNYIHHMQRSKYCICPKGYEVNSPRVVEAI 120
Query: 319 HYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKV 378
Y CVPVI+S+ + PF D+LDW F++IL E+D+ LK+IL +I +++ + + K
Sbjct: 121 FYECVPVIISDNFVPPFFDVLDWGAFSLILAEKDISNLKEILLSIPKEKYLQMQLGVRKA 180
Query: 379 QKHFQWNSPPVRFDAFHMVMYELWLRR 405
Q+HF W++ P+++D F+M ++ +W R
Sbjct: 181 QRHFLWHASPMKYDLFYMTLHSIWYNR 207
>gi|125544214|gb|EAY90353.1| hypothetical protein OsI_11935 [Oryza sativa Indica Group]
Length = 218
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 68/194 (35%), Positives = 119/194 (61%), Gaps = 16/194 (8%)
Query: 220 FIPHKDVALPQVLQPFALPR------GGRDVENRTVLGFWAGHRNSKIRVILARVWEN-- 271
F P +DV+LP+ PR GG V R +L F+AG+ + ++R +L + W +
Sbjct: 25 FTPGRDVSLPETT--IRTPRRPLRYVGGLPVSRRGILAFFAGNVHGRVRPVLLKHWGDGR 82
Query: 272 DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYY 331
D ++ + R + Y + ++FC+CP G +VNS RI ++++Y CVPVI+++
Sbjct: 83 DDDMWVYGPLPARVSRRMSYIQHMKNSRFCLCPMGYEVNSPRIVEALYYECVPVIIAD-- 140
Query: 332 DLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRF 391
N +LDW FAV++ E+DV LK+IL+ I+ ++V++H + ++Q+HF W++ P+R+
Sbjct: 141 ----NFVLDWSAFAVVVAEKDVPDLKKILQGITLRKYVAMHGCVKRLQRHFLWHARPLRY 196
Query: 392 DAFHMVMYELWLRR 405
D FHM+++ +WL R
Sbjct: 197 DLFHMILHSIWLSR 210
>gi|356546128|ref|XP_003541483.1| PREDICTED: LOW QUALITY PROTEIN: xylogalacturonan
beta-1,3-xylosyltransferase-like [Glycine max]
Length = 252
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 90/292 (30%), Positives = 145/292 (49%), Gaps = 54/292 (18%)
Query: 125 ESRFLTHDPDRAHLFFIPISCHKM-------RGKGTSYE--NMTVIVKDYVDSLINKYPY 175
+S F P++AHLF +P S K+ R + Y+ + +V DY++ + N+YP
Sbjct: 4 KSPFKATHPEQAHLFLLPFSVSKVIRYVYKPRRSRSDYDPDRLQRLVLDYINIVANRYPN 63
Query: 176 WNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPF 235
WNR+ GADHF V+ HD + P + K IR +C+ + F P +DV++ +V
Sbjct: 64 WNRSRGADHFLVSFHD----WLDANPEVFKYFIRALCNANTSEGFQPSRDVSITEVY--- 116
Query: 236 ALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRF 295
LP D E + G L Y +
Sbjct: 117 -LPSRKLDKEVQVHEYLPKG---------------------------------LEYTKLM 142
Query: 296 YKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ 355
+ KFC+CP QV S R+ ++I+ GCVPVI+ + Y LPF+D++ KF + ER + +
Sbjct: 143 GQRKFCLCPS-XQVASPRVVEAIYVGCVPVIICDNYSLPFSDVVKRSKFIAV--ER-IPE 198
Query: 356 LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHV 407
K IL+N+S +++ L++N+ +V++HF N P FD HM+++ L +R V
Sbjct: 199 TKTILQNVSKDKYMELYSNVKRVRRHFVINRPAKPFDLIHMILHSLRNKRIV 250
>gi|198429575|ref|XP_002120379.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 482
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 85/242 (35%), Positives = 129/242 (53%), Gaps = 23/242 (9%)
Query: 81 NYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFF 140
+Y EMV N K+YIY + T + R G YA E F + + +S F T P+ A FF
Sbjct: 220 DYREMVNNLKVYIY-ETKIGTDHHPHR--VGGYAVERVFQELLEKSNFRTQHPNLATFFF 276
Query: 141 IPISCHK-MRGKGTSYENMTV---IVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRA 196
IPI C + T +E + + + + + +YPYW+++ GA+HF++ HDVG +
Sbjct: 277 IPIRCSSYILDYPTEHEGLMEAKRVTANILHEIQTQYPYWSQSSGANHFYICSHDVGAKV 336
Query: 197 TEGVPFLIKNAIRVVCSPSYDVA-FIPHKDVALP-----QVLQPFALPRGGR--DVENRT 248
EG L+KNAI +V + YD FIPHKD+++P + + +GG DV R
Sbjct: 337 AEG---LMKNAIGLVSTADYDDPYFIPHKDISIPPTPSSGLSNIHLIGKGGALVDVRGRN 393
Query: 249 VLGFWAGHRNS-KIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGS 307
+L F+AG S +IR + R W +D +++I INR + P Y + K KFC+ G
Sbjct: 394 ILAFFAGDITSGRIRPLAWRTWYSDQDIEI----INRILKPSAYIEKLKKAKFCLIFRGK 449
Query: 308 QV 309
+V
Sbjct: 450 EV 451
>gi|302786830|ref|XP_002975186.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300157345|gb|EFJ23971.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 405
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 102/357 (28%), Positives = 170/357 (47%), Gaps = 41/357 (11%)
Query: 83 LEMVTNFKIYIY--PDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFF 140
L + +N K+YIY P + R + +A+E QN+ S T DPD A FF
Sbjct: 49 LGLSSNIKVYIYDLPSSYNTDWLVDSRCSSHLFAAEVAIHQNLLRSPVRTLDPDEADFFF 108
Query: 141 IPI--SCHKMRGKG-TSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVG---- 193
+P+ SC+ G + + + I++ V + P+W+R G DH FV HD G
Sbjct: 109 MPVYVSCNFTSRSGFPTLFHASDILQAAVGLVSRNMPFWDRHQGRDHVFVATHDFGACFH 168
Query: 194 ----VRATEGVPFLIKNAIRVVC------SPSYDVAFI-------PHKDVALP--QVLQP 234
+ T G+P ++N+I + P +V I P K + P Q +
Sbjct: 169 AMEDLAVTMGIPQFLRNSIILQTFGEKNKHPCQNVDHIQIPPYVVPAKKLPDPRSQRRKI 228
Query: 235 FALPRGGRDVENRTVLGFWAGHRNSK-IRVILARVWENDTELDISNNRINRAIGPLVYQR 293
A RG ++ + V +GH S+ +R + R + +D I R + Y+
Sbjct: 229 LAFFRGKMEIHPKNV----SGHMYSRGVRTTIWRRFSHDRRFFIKRKRSDN------YKA 278
Query: 294 RFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDV 353
++ FC+CP G S RI +S+ GC+PVI+++ LP++ ++DWRK +V + ERDV
Sbjct: 279 EMLRSVFCLCPLGWAPWSPRIVESVIQGCIPVIIADNIQLPYSHVIDWRKISVTVAERDV 338
Query: 354 YQLKQILKNISDAEFVSLHNNLVK--VQKHFQWNSPPVRFDAFHMVMYELWLRRHVI 408
++L +IL ++ + NL + V++ +N P VR DA V+ L R++ +
Sbjct: 339 HKLDRILSRVAATNVSMIQANLWRDEVRQALVYNQPLVRGDATWQVLDLLSKRKNKV 395
>gi|302826405|ref|XP_002994684.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300137154|gb|EFJ04250.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 332
Score = 127 bits (320), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 98/342 (28%), Positives = 164/342 (47%), Gaps = 34/342 (9%)
Query: 85 MVTNFKIYIY--PDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP 142
+ +N K+YIY P + R + +A+E QN+ S T DPD A FF+P
Sbjct: 2 LSSNIKVYIYDLPSSYNTDWLVDSRCSSHLFAAEVAIHQNLLRSPVRTLDPDEADFFFMP 61
Query: 143 I--SCHKMRGKG-TSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVG------ 193
+ SC+ G + + + I++ V + P+W+R G DH FV HD G
Sbjct: 62 VYVSCNFTSRSGFPTLFHASDILQAAVGLVSRNMPFWDRHQGRDHVFVATHDFGACFHAM 121
Query: 194 -VRATEGVPFLIKNAIRVVC------SPSYDVAFIPHKDVALPQVLQPFALPRGGRDVEN 246
+ T G+P ++N+I + P +V I ++L F RG ++
Sbjct: 122 DLAVTMGIPQFLRNSIILQTFGEKNKHPCQNVDHIQIPPYVRRKILAFF---RGKMEIHP 178
Query: 247 RTVLGFWAGHRNSK-IRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPG 305
+ V +GH S+ +R + R + +D I R + Y+ ++ FC+CP
Sbjct: 179 KNV----SGHMYSRGVRTTIWRRFSHDRRFFIKRKRSDN------YKAEMLRSVFCLCPL 228
Query: 306 GSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISD 365
G S RI +S+ GC+PVI+++ LP++ ++DWRK +V + ERDV++L +IL ++
Sbjct: 229 GWAPWSPRIVESVIQGCIPVIIADNIQLPYSHVIDWRKISVTVAERDVHKLDRILSRVAA 288
Query: 366 AEFVSLHNNLVK--VQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+ NL + V++ +N P VR DA V+ L R+
Sbjct: 289 TNVSMIQANLWRDEVRQALVYNQPLVRGDATWQVLDLLSKRK 330
>gi|222635067|gb|EEE65199.1| hypothetical protein OsJ_20321 [Oryza sativa Japonica Group]
Length = 551
Score = 127 bits (319), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 75/236 (31%), Positives = 122/236 (51%), Gaps = 24/236 (10%)
Query: 192 VGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV----LQPFALPRGGRDVENR 247
+G A+ G P L NAIR +C+ + F P KDV++P++ R
Sbjct: 310 IGPHASRGHPELYANAIRALCNANTSEGFRPDKDVSIPEINLYDGDMPPELLSPAPPPPR 369
Query: 248 TVLGFWAGHRNSKIRVILARVWEN--------------DTELDISNNRINRAIG----PL 289
L F+AG R+ +R +L R W+ + +S + A G P
Sbjct: 370 PFLAFFAGGRHGHVRDLLLRHWKGRDPAVFPVYEYDLPSIPVSVSGDGDTDAGGEGGNPY 429
Query: 290 VYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
+ R +++FC+CP G +V S R+ ++IH GCVPV++++ Y PF D+L W F+V +
Sbjct: 430 YWYMR--RSRFCLCPSGHEVASPRVVEAIHAGCVPVVVADGYAPPFADVLRWEAFSVAVA 487
Query: 350 ERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
DV +L+++L+ I E L + + V++HF + PP R D FHM+++ +WLRR
Sbjct: 488 VADVPRLRELLERIPAPEVERLRDGVRLVKRHFMLHQPPERLDMFHMILHSVWLRR 543
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 43/121 (35%), Positives = 65/121 (53%), Gaps = 12/121 (9%)
Query: 81 NYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRE-----SRFLTHDPDR 135
+Y+EM FK+Y+Y +G+P ++ P K YA EG F + + T DP R
Sbjct: 133 SYVEMERRFKVYVYEEGEPPIAHEGPCK--NIYAVEGRFIEELELMAPPLGGVRTWDPAR 190
Query: 136 AHLFFIPISCHKM-----RGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCH 190
AH F+P+S +M R + IV DYV + +++ +WNR+ GADHF ++CH
Sbjct: 191 AHALFLPLSVSQMVQLAYRPLSYDLSPLRAIVADYVAVVASRHRFWNRSAGADHFMLSCH 250
Query: 191 D 191
D
Sbjct: 251 D 251
>gi|297738775|emb|CBI28020.3| unnamed protein product [Vitis vinifera]
Length = 665
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 98/350 (28%), Positives = 160/350 (45%), Gaps = 72/350 (20%)
Query: 66 DLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNI-R 124
+L + +F + S+F +Y M K+Y+Y DG+ F+Q L G YASEG+F + + R
Sbjct: 371 ELHAPLFRNVSMFKRSYELMERILKVYVYKDGEKPIFHQP--ILKGLYASEGWFMKLMER 428
Query: 125 ESRFLTHDPDRAHLFFIPISCHKMRGK-----GTSYENMTVIVKDYVDSLINKYPYWNRT 179
F+ DP +A LF++P S + K + N+ +K Y + + KY +WNRT
Sbjct: 429 NKHFVVKDPRQAQLFYMPFSSRMLEYKLYVRNSHNRTNLRQYLKQYSEKIAAKYRFWNRT 488
Query: 180 LGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFA--- 236
GADHF V CHD T ++ I+ +C+ F +DV+LP+ A
Sbjct: 489 GGADHFLVACHDWAPYETR---HHMEQCIKALCNADVTAGFKIGRDVSLPETYVRSARNP 545
Query: 237 -LPRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRF 295
GG+ R +L F+AG+ + +R IL + W+ D + D+ + GP+
Sbjct: 546 LRDLGGKPPSERHILAFYAGNMHGYLRPILLKYWK-DKDPDM------KIYGPM------ 592
Query: 296 YKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ 355
P G + G P D+P N +DV
Sbjct: 593 --------PPGLR------------GQQPT----EKDIP--------------NLKDV-- 612
Query: 356 LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
L +I + +++ + + KVQKHF W++ P+++D FHM ++ +W R
Sbjct: 613 ----LLSIPNDKYLQMQLGVRKVQKHFLWHAKPLKYDLFHMTLHSIWYNR 658
>gi|302791649|ref|XP_002977591.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300154961|gb|EFJ21595.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 345
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 103/354 (29%), Positives = 166/354 (46%), Gaps = 45/354 (12%)
Query: 85 MVTNFKIYIY--PDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP 142
+ +N K+YIY P + R + +A+E QN+ S T DPD A FF+P
Sbjct: 2 LSSNIKVYIYDLPSSYNTDWLVDSRCSSHLFAAEVAIHQNLLRSPVRTLDPDEADFFFMP 61
Query: 143 I--SCHKMRGKG-TSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV--RAT 197
+ SC+ G + + + I++ V + P+W+R G DH FV HD G A
Sbjct: 62 VYVSCNFTSRSGFPTLFHASDILQAAVGLVSRNMPFWDRHQGRDHVFVATHDFGACFHAM 121
Query: 198 E------GVPFLIKNAIRVVC------SPSYDVAFIPHKDVALPQVLQPFALPRGGRDVE 245
E G+P ++N+I + P +V I +P P PRG R
Sbjct: 122 EDLAVAMGIPQFLRNSIILQTFGEKNKHPCQNVDHIQIPPYVVPAKKLPD--PRGQR--- 176
Query: 246 NRTVLGFW-----------AGHRNSK-IRVILARVWENDTELDISNNRINRAIGPLVYQR 293
R +L F+ +GH S+ +R + R + +D I R + Y+
Sbjct: 177 -RKILAFFRGKMEIHPKNVSGHMYSRGVRTTIWRRFSHDRRFFIKRKRSDN------YKA 229
Query: 294 RFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDV 353
++ FC+CP G S RI +S+ GC+PVI+++ LP++ ++DWRK +V + ERDV
Sbjct: 230 EMLRSVFCLCPLGWAPWSPRIVESVIQGCIPVIIADNIQLPYSHVIDWRKISVTVAERDV 289
Query: 354 YQLKQILKNISDAEFVSLHNNLVK--VQKHFQWNSPPVRFDAFHMVMYELWLRR 405
++L +IL ++ + NL + V++ +N P VR DA V+ L R+
Sbjct: 290 HKLDRILSKVAATNVSMIQANLWRDEVRQALVYNQPLVRGDATWQVLDLLSKRK 343
>gi|412993472|emb|CCO13983.1| predicted protein [Bathycoccus prasinos]
Length = 614
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 105/364 (28%), Positives = 163/364 (44%), Gaps = 59/364 (16%)
Query: 88 NFKIYIY---PDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI- 143
+FKI++Y P+ + + PR T +Y +E F +N+ LT+DP+ A FF+PI
Sbjct: 259 DFKIFVYDLKPEFNADLARDQPRCRTDQYGTEIRFHENLLHHSVLTNDPEEAEFFFVPIY 318
Query: 144 -SCHKMRGKGTSYENMTVIV-----KDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRAT 197
C+ R S N + V +D + ++ +YPYWNRT G DH + G
Sbjct: 319 GECYLFRETQNSGTNNAMKVTNLWYRDALKTIQTEYPYWNRTDGRDHVWSFPGARGPHIF 378
Query: 198 EGVPFLIKNAIRVVCSPSYDVAFIPH----KDVALPQVLQP------------------- 234
LIK +I +P D +F KD+ +P L+P
Sbjct: 379 RDWKKLIKKSI--FLTPEGDRSFGEQFNTWKDIVIPG-LEPDSEFIDGKLRKQSSLKKDI 435
Query: 235 FALPRGGRDVENRTVL---GFWAGHRNSKIRVILARVWEND---TELDISNNRINRAIGP 288
FA RG T+L G A R + ++ A D TE S +R
Sbjct: 436 FAFFRG-------TILNKAGILAYSRGIRPKMEAAFKKHKDVIFTEEIPSCDRD------ 482
Query: 289 LVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVIL 348
Y++ K+ FC+CP G + R ++ GC+PVI+++ +LP+ + LDW K +V +
Sbjct: 483 -CYRKELRKSTFCLCPRGWSPWTLRAYQAMMVGCIPVIIADEIELPYENSLDWTKLSVKI 541
Query: 349 NERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRF---DAFHMVMYELWLRR 405
E D + ILK IS +E + + KV K W S P + DA V++EL ++
Sbjct: 542 AEVDAEKTIDILKQISKSEIRNKQKAIEKVWKSVAWGSNPKKLDPMDAMECVLHELGRKK 601
Query: 406 HVIK 409
+K
Sbjct: 602 RAMK 605
>gi|168043163|ref|XP_001774055.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674601|gb|EDQ61107.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 541
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 95/297 (31%), Positives = 149/297 (50%), Gaps = 36/297 (12%)
Query: 71 VFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGK-------YASEGYFFQNI 123
++HSP VF L+Y EM +I++YP NT Y+ K G+ ++ F + +
Sbjct: 129 LYHSPEVFTLSYEEMWIQLQIWVYPSQAGNTSYE--HKFDGRKDVMEELSSTADLFSRLL 186
Query: 124 RESRFLTHDPDRAHLFFIPISCHKMR-GKGTSYENMTVIVKDYVDSLINKYPYWNRTLGA 182
S+F T P +A LF +P S +R G S ++ ++ YV ++ YPYWN +LGA
Sbjct: 187 LRSKFSTELPQKAQLFLLPFSIDALRVDLGPS--RISDHLRRYVQNVRTSYPYWNLSLGA 244
Query: 183 DHFFVTCHDV-GVRATEGVPFLIKNAIRVVCSP-SYDVAFIPHKDVALPQ----VLQPFA 236
+HF+++ V L KN+I+ C+P + F PHKD P+ F
Sbjct: 245 NHFYLSSQAFENNNKHRNVLELEKNSIQAACAPLRQNQNFYPHKDFIFPRYKPITQTEFY 304
Query: 237 LPRGGRDVENRTVLGFWAGH-RNSKIRVILARVWEND----TELDISNNRINRAIGPLVY 291
GR +RTVL ++ G ++ V + W++D E+D S +RI+ VY
Sbjct: 305 AALEGRT--SRTVLAYFGGTLADTPALVFILDAWKSDPDFEVEVDPSPHRIS------VY 356
Query: 292 QRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSN--YYDLPFNDILDWRKFAV 346
R+ ++KFC+ ++ D+I +GCV V+LS + DLPF LDWR+FA
Sbjct: 357 -RQLARSKFCV--NVPSRDTFDFVDAIRFGCVLVLLSKSVFLDLPFQGFLDWRQFAA 410
>gi|357472125|ref|XP_003606347.1| Secondary cell wall-related glycosyltransferase family [Medicago
truncatula]
gi|355507402|gb|AES88544.1| Secondary cell wall-related glycosyltransferase family [Medicago
truncatula]
Length = 427
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 99/351 (28%), Positives = 165/351 (47%), Gaps = 45/351 (12%)
Query: 86 VTNFKIYIY--PDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI 143
+ N KI++Y P + + PR T +ASE + + S T DP A FF+P+
Sbjct: 66 LKNLKIFVYDLPPKYNKNWLKNPRCKTHLFASEVAIHRALLTSDVRTFDPYEADFFFVPV 125
Query: 144 --SCHKMRGKG-TSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVG------- 193
SC+ G + + ++ V + +YP+WNR+ G+DH FV HD G
Sbjct: 126 YVSCNFSTVNGFPAIGHARSLISSAVKLISTEYPFWNRSTGSDHVFVASHDFGSCFHTLE 185
Query: 194 -VRATEGVPFLIKNAIRV-VCSPSYDVAFIPHKDVALPQVLQPFALPRGGRD-VENRTVL 250
V +GVP ++KN+I + +YD + V +P PF P R+ +EN V
Sbjct: 186 DVAMKDGVPEIMKNSIVLQTFGVTYDHPCQKVEHVVIP----PFVSPESVRNTLENFPVN 241
Query: 251 G---FWAGHR---------------NSKIRVILARVWENDTELDISNNRINRAIGPLVYQ 292
G W R + K+R ++ + + D + R +R G YQ
Sbjct: 242 GRRDIWVFFRGKMEVHPKNVSGRFYSKKVRTVIWKKFNGDRRFYL---RRHRFAG---YQ 295
Query: 293 RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERD 352
++ FC+CP G S R+ +S+ GCVPVI+++ LPF+ ++W + +V + E+D
Sbjct: 296 SEIARSVFCLCPLGWAPWSPRLVESVALGCVPVIIADSIRLPFSSAVNWPEISVTVAEKD 355
Query: 353 VYQLKQILKNISDAEFVSLHNNL--VKVQKHFQWNSPPVRFDAFHMVMYEL 401
V++L +IL+ ++ + NL + +K +NS DA V++ L
Sbjct: 356 VWRLGEILEKVAATNLSIIQRNLWDPRTRKALLFNSRVHEGDATWQVLHSL 406
>gi|449459136|ref|XP_004147302.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase F8H-like
[Cucumis sativus]
Length = 447
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 106/403 (26%), Positives = 178/403 (44%), Gaps = 62/403 (15%)
Query: 10 NQAQTTLCSLRSSLLTLALVTLLCFTCLSFNSFRSYPLQNNFNSTLSFAINKQDYSDL-- 67
N + TL +SS LL TC SF F S+ L +N N ++ +L
Sbjct: 26 NHRKNTLICFKSSKY------LLLSTCFSFYVFTSFLLTHN---------NSPNHPNLIH 70
Query: 68 ---GSDVFHSPSVFH--LNYLEMVTNFKIYIYPDGDP--NTFYQTPRKLTGK-YASEGYF 119
S F S ++ +Y + K+++Y D P N + + + G +ASE
Sbjct: 71 HLSNSKPFPSKALVESTFDYYGNTNDLKVFVY-DLPPEFNANWLSDARCGGHLFASEVAI 129
Query: 120 FQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKG-TSYENMTVIVKDYVDSLINKYPYW 176
+ + S T DP A FF+P+ SC+ G + + ++ VD + ++P+W
Sbjct: 130 HKALLTSHVRTLDPSEADFFFVPVYVSCNFSSFNGFPAIAHAPSLLASAVDVISGQFPFW 189
Query: 177 NRTLGADHFFVTCHDVG--------VRATEGVPFLIKNAIRVVC------SPSYDVAFIP 222
NR+ G DH FV HD G + G+P +KN+I + P DV I
Sbjct: 190 NRSRGFDHVFVASHDYGACFHSLEDMAIANGIPEFLKNSIILQTFGVKYKHPCQDVENIL 249
Query: 223 HKDVALPQVLQP----------FALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWEND 272
P+ ++P FA RG +V + V G + G R +R + + + D
Sbjct: 250 IPPYISPEFMEPAVVDGRRRDIFAFFRGKMEVNPKNVGGRFYGKR---VRTTIWKKFHRD 306
Query: 273 TELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYD 332
+ +R Y+ ++ FC+CP G S R+ +S+ GCVPVI+++
Sbjct: 307 RRFYLRRHRFAG------YRSEIARSVFCLCPLGWAPWSPRLVESVALGCVPVIIADGIR 360
Query: 333 LPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNL 375
LPF +DW ++ + E+DV +L++IL+ ++ ++ NL
Sbjct: 361 LPFPSAVDWPGISLTVAEKDVGKLRKILERVAATNLTAIQKNL 403
>gi|42568020|ref|NP_197685.2| FRA8-like protein [Arabidopsis thaliana]
gi|75127070|sp|Q6NMM8.1|F8H_ARATH RecName: Full=Probable glucuronoxylan glucuronosyltransferase F8H;
AltName: Full=FRA8 homolog; AltName: Full=Protein
FRAGILE FIBER 8 homolog
gi|44681390|gb|AAS47635.1| At5g22940 [Arabidopsis thaliana]
gi|48958521|gb|AAT47813.1| At5g22940 [Arabidopsis thaliana]
gi|332005716|gb|AED93099.1| FRA8-like protein [Arabidopsis thaliana]
Length = 469
Score = 121 bits (304), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 104/409 (25%), Positives = 179/409 (43%), Gaps = 44/409 (10%)
Query: 30 TLLCFTCLSFNSFRSYPLQNNFNSTLSFAI--NKQDYSDLGSDVFHSPSVFHLNYLEMVT 87
T + + CLS F SY + +S S + N + S L S S + + T
Sbjct: 48 TWILWFCLSLYFFTSYFSVEDQSSPSSIRLLSNHKTSSSLPSRALIESSAIKTTSIGLFT 107
Query: 88 NFKIYIY---PDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI- 143
KIY+Y + + + R + +A+E + + S T DPD A FF+P+
Sbjct: 108 GMKIYVYDLPASYNDDWVTASDRCASHLFAAEVAIHRALLSSDVRTLDPDEADYFFVPVY 167
Query: 144 -SCHKMRGKG-TSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVG-------- 193
SC+ G S + ++ VD L + YP+WNR+ G+DH FV HD G
Sbjct: 168 VSCNFSTSNGFPSLSHARSLLSSAVDFLSDHYPFWNRSQGSDHVFVASHDFGACFHAMED 227
Query: 194 VRATEGVP-FLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFAL-------PRGGRDVE 245
+ EG+P F+ ++ I Y + V +P + P ++ P GR
Sbjct: 228 MAIEEGIPKFMKRSIILQTFGVKYKHPCQEVEHVVIPPYIPPESVQKAIEKAPVNGR--- 284
Query: 246 NRTVLGFWAGHRNSKIRVILARVWENDTELDISN----------NRINRAIGPLVYQRRF 295
R + F+ G + I R + I NR +R G Y+
Sbjct: 285 -RDIWAFFRGKMEVNPKNISGRFYSKGVRTAILKKFGGRRRFYLNR-HRFAG---YRSEI 339
Query: 296 YKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ 355
++ FC+CP G S R+ +S GCVPV++++ LPF++ + W + ++ + E+DV
Sbjct: 340 VRSVFCLCPLGWAPWSPRLVESAVLGCVPVVIADGIQLPFSETVQWPEISLTVAEKDVRN 399
Query: 356 LKQILKNISDAEFVSLHNNLVK--VQKHFQWNSPPVRFDAFHMVMYELW 402
L+++L++++ ++ NL + ++ +N P DA ++ LW
Sbjct: 400 LRKVLEHVAATNLSAIQRNLHEPVFKRALLYNVPMKEGDATWHILESLW 448
>gi|167517883|ref|XP_001743282.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163778381|gb|EDQ91996.1| predicted protein [Monosiga brevicollis MX1]
Length = 503
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 88/312 (28%), Positives = 146/312 (46%), Gaps = 25/312 (8%)
Query: 112 KYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHKMR----GKGTSYENMTVIVKDYVD 167
KY E Q + S + P+ A F IP C R + + ++
Sbjct: 197 KYGVERRLPQVLASSPYAVQQPEEATHFLIPFQCTAHRYTVADRAGGQNAAEAGLASWIA 256
Query: 168 SLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIK-NAIRVVCSPSY---------D 217
S+ YPYWNR+ GA+HF+V HD+G A + + N I +V + D
Sbjct: 257 SISAAYPYWNRSAGANHFYVCSHDMGSSAVAQLSRAAQQNLIGLVNTADRRDGFFNVHRD 316
Query: 218 VAFIPHKDVALPQVLQP---FALPRGGRDVENRTVLGFWAGH-RNSKIRVILARVWENDT 273
+A PH LQ ++ R R L F AG+ + +R L + ++ D
Sbjct: 317 LATAPHIGDGCDTCLQGGTRLSVTREAWAGTPRNRLAFMAGNLQRGPVRPRLRQFFDGDP 376
Query: 274 ELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDL 333
+ + + + A Y++ +++FC+ G +V + R+ D++ GC+PVI+++ Y+L
Sbjct: 377 DFLLVDGTLAAAH----YRQALAESEFCLVVRGFRVWTPRLMDAVWSGCIPVIIADGYEL 432
Query: 334 PFNDILDWRKFAVILNERDVYQLKQI-LKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFD 392
PF+ +L W FAV + E DV +LK I L +S A L NL+ ++ ++S V D
Sbjct: 433 PFSSLLHWPSFAVFVPEHDVPRLKDILLAKLSQAPL--LRANLLAASQYLTYHSNWVPLD 490
Query: 393 AFHMVMYELWLR 404
AF ++M +L R
Sbjct: 491 AFDILMLQLAAR 502
>gi|356539378|ref|XP_003538175.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
IRX7-like [Glycine max]
Length = 460
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 101/378 (26%), Positives = 174/378 (46%), Gaps = 57/378 (15%)
Query: 37 LSFNSFRSYPLQNNFN------STLSFAINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFK 90
LS F SY + NN N S +S A+ + +++ +SP E + K
Sbjct: 52 LSLYFFTSYLISNNNNHHTKKPSHVSRALIQSNHTTTPQHALNSP--------ESLKKLK 103
Query: 91 IYIYPDGDP--NTFYQTPRKLTGK-YASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SC 145
+++Y D P NT + T + + +ASE + + S T DP A FF+P+ SC
Sbjct: 104 VFVY-DLPPKYNTDWLTNERCSKHLFASEVAIHRALLTSEVRTFDPYDADFFFVPVYVSC 162
Query: 146 HKMRGKG-TSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVG--------VRA 196
+ G + + ++ V+ + ++YP+WNR+ G+DH FV HD G V
Sbjct: 163 NFSTVNGFPAIGHARSLIASAVNLISSEYPFWNRSRGSDHVFVASHDFGSCFHTLEDVAM 222
Query: 197 TEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRD-VENRTVLG---F 252
+GVP +++N+I + ++ V F V+ P+ P RD +EN V G
Sbjct: 223 ADGVPEIVRNSIVL---QTFGVVFDHPCQKVEHVVIPPYVSPESVRDTMENFPVDGRRDI 279
Query: 253 WAGHR---------------NSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYK 297
WA R + ++R ++ R + D + +R YQ +
Sbjct: 280 WAFFRGKMEVHPKNVSGRFYSKEVRTVIWRKFNGDRRFYLQRHRFAG------YQSEIAR 333
Query: 298 TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLK 357
+ FC+CP G S R+ +S+ GCVPV++++ LPF + W + +V + E+DV +L
Sbjct: 334 SVFCLCPLGWAPWSPRLVESVALGCVPVVIADGIRLPFVSAVKWSEISVTVAEKDVGRLA 393
Query: 358 QILKNISDAEFVSLHNNL 375
+IL+ ++ ++ NL
Sbjct: 394 EILERVAATNLSTIQRNL 411
>gi|297808283|ref|XP_002872025.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297317862|gb|EFH48284.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 465
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 99/386 (25%), Positives = 163/386 (42%), Gaps = 46/386 (11%)
Query: 53 STLSFAINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTF-----YQTPR 107
S++ N + S L S S N + + KIY+Y P +F + R
Sbjct: 74 SSIRLLSNHKTSSSLPSRALIESSAIKTNSFGLFSGMKIYVY--DLPASFNDDWVTASDR 131
Query: 108 KLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKG-TSYENMTVIVKD 164
T +A+E + + S T DP+ A FF+P+ SC+ G S + ++
Sbjct: 132 CATHLFAAEVAIHRALLSSDVRTLDPEEADFFFVPVYVSCNFSTSNGFPSLSHARSLLSS 191
Query: 165 YVDSLINKYPYWNRTLGADHFFVTCHDVG--------VRATEGVP-FLIKNAIRVVCSPS 215
VD L + YP+WNRT G+DH FV HD G + EG+P F+ K+ I
Sbjct: 192 AVDFLSDHYPFWNRTQGSDHVFVASHDFGACFHAMEDMAIEEGIPEFMKKSIILQTFGVK 251
Query: 216 YDVAFIPHKDVALPQVLQPFALPRGGRDVE---NRTVLGFWAGHRNSKIRVILARVWEND 272
Y + V +P + P ++ R R + F+ G + I R +
Sbjct: 252 YKHPCQEVEHVVIPPYIPPESVQRAIEKAPANGRRDIWAFFRGKMEVNPKNISGRFYSKG 311
Query: 273 TELDISNNRINRAIGPLVYQRRFY--------------KTKFCICPGGSQVNSARIADSI 318
I R RRFY ++ FC+CP G S R+ +S
Sbjct: 312 VRTAILKKFGGR--------RRFYLNRHRFAGYRSEIVRSVFCLCPLGWAPWSPRLVESA 363
Query: 319 HYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVK- 377
GCVPV++++ LPF++ + W + ++ + E+DV L++IL++++ + NL
Sbjct: 364 VLGCVPVVIADGIKLPFSETVRWPEISLTVAEKDVRSLRKILEHVAATNLSVIQRNLHGP 423
Query: 378 -VQKHFQWNSPPVRFDAFHMVMYELW 402
++ +N P DA ++ LW
Sbjct: 424 VFKRALLYNVPMKEGDATWHILESLW 449
>gi|356542668|ref|XP_003539788.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
IRX7-like [Glycine max]
Length = 461
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 89/333 (26%), Positives = 150/333 (45%), Gaps = 41/333 (12%)
Query: 75 PSVFHLNYLEMVTNFKIYIY--PDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHD 132
P LN L + N K+++Y P + R +ASE + + S T D
Sbjct: 89 PQQQALNSLGSLKNLKVFVYDLPQKYNTDWLSNERCSKHLFASEVAIHRALLTSEVRTFD 148
Query: 133 PDRAHLFFIPI--SCHKMRGKG-TSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTC 189
P A FF+P+ SC+ G + + ++ V + ++YP+WNR+ G+DH FV
Sbjct: 149 PYDADFFFVPVYVSCNFSTVNGFPAIGHARSLIASAVSLVSSEYPFWNRSRGSDHVFVAS 208
Query: 190 HDVG--------VRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGG 241
HD G V +GVP +++N+I + ++ V + V+ P+ P
Sbjct: 209 HDFGSCFHTLEDVAMADGVPEIMRNSIVL---QTFGVVYDHPCQSVEHVVIPPYVSPESV 265
Query: 242 RD-VENRTVLG---FWAGHR---------------NSKIRVILARVWENDTELDISNNRI 282
RD +EN V G WA R + K+R ++ R + D + R
Sbjct: 266 RDTMENFPVNGRRDIWAFFRGKMELHPKNVSGRFYSKKVRTVIWRKFNGDRRFYLQRQRF 325
Query: 283 NRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWR 342
YQ ++ FC+CP G S R+ +S+ GCVPVI+++ LPF + W
Sbjct: 326 AG------YQSEIARSVFCLCPLGWAPWSPRLVESVALGCVPVIIADGIRLPFISAVKWP 379
Query: 343 KFAVILNERDVYQLKQILKNISDAEFVSLHNNL 375
+ ++ + E+DV +L +IL+ ++ ++ NL
Sbjct: 380 EISITVAEKDVGRLAEILERVAATNLSTIQRNL 412
>gi|115450193|ref|NP_001048697.1| Os03g0107900 [Oryza sativa Japonica Group]
gi|122247627|sp|Q10SX7.1|GT31_ORYSJ RecName: Full=Probable glucuronosyltransferase Os03g0107900
gi|108705764|gb|ABF93559.1| exostosin family protein, putative, expressed [Oryza sativa
Japonica Group]
gi|113547168|dbj|BAF10611.1| Os03g0107900 [Oryza sativa Japonica Group]
gi|125542077|gb|EAY88216.1| hypothetical protein OsI_09667 [Oryza sativa Indica Group]
gi|215766485|dbj|BAG98793.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222624042|gb|EEE58174.1| hypothetical protein OsJ_09104 [Oryza sativa Japonica Group]
Length = 427
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 93/312 (29%), Positives = 146/312 (46%), Gaps = 47/312 (15%)
Query: 133 PDRAHLFFIPI--SCHKMRGKG-TSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTC 189
PD A LFF+P+ SC+ G S + ++ D VD + + PYWNR+ GADH FV
Sbjct: 121 PDDATLFFVPVYVSCNFSTDNGFPSLSHARALLADAVDLVRAQMPYWNRSAGADHVFVAS 180
Query: 190 HDVG--------VRATEGVPFLIKNAIRV---------VCSPSYDVAFIPH--KDVAL-- 228
HD G V +G+P +K +I + VC + V PH +VAL
Sbjct: 181 HDFGACFHPMEDVAIADGIPEFLKRSILLQTFGVQGTHVCQEADHVVIPPHVPPEVALEL 240
Query: 229 --PQVLQP--FALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINR 284
P+ Q FA RG +V + + G + + K+R L + + + + + R
Sbjct: 241 PEPEKAQRDIFAFFRGKMEVHPKNISGRF---YSKKVRTELLQKYGRNRKFYLKRKRYGN 297
Query: 285 AIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKF 344
Y+ ++ FC+CP G S R+ +S+ GC+PVI+++ LPF +L W
Sbjct: 298 ------YRSEMARSLFCLCPLGWAPWSPRLVESVLLGCIPVIIADDIRLPFPSVLQWLDI 351
Query: 345 AVILNERDVYQLKQILKNISDAEFVSLHNNL---VKVQKHFQWNSPPVRFDAFHMVMYEL 401
++ + E+DV L+ +L ++ + NL VK +K +N P DA V+ EL
Sbjct: 352 SLQVAEKDVASLEMVLDHVVATNLTVIQKNLWDPVK-RKALVFNRPMEEGDATWQVLREL 410
Query: 402 WL------RRHV 407
+ RRHV
Sbjct: 411 EILLDRSQRRHV 422
>gi|356541948|ref|XP_003539434.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
IRX7-like [Glycine max]
Length = 459
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 95/374 (25%), Positives = 171/374 (45%), Gaps = 57/374 (15%)
Query: 42 FRSYPLQNNFNSTLSFAINKQDYSDLGSDVFHSPSVFH--LNYLEMVTNFKIYIY--PDG 97
F SY + +N N+T + A + S+ S+V SP++ N L ++ N K+++Y P
Sbjct: 53 FTSYLITSNPNNTPTSA-DTSHVSNSESNVV-SPTLVESTTNTLGVLKNMKVFVYELPPK 110
Query: 98 DPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCH-KMRGKGTS 154
+ R + +ASE + + S T DP A FF+P+ SC+ +
Sbjct: 111 YNTDWLANERCSSHLFASEVAIHRALLTSEVRTFDPYEADFFFVPVYVSCNFSAVNDFPA 170
Query: 155 YENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVG--------VRATEGVPFLIKN 206
+ ++ V+ + +YP+WNR+ G+DH FV HD G V +G+P ++KN
Sbjct: 171 IGHARTLISSAVNLVSTEYPFWNRSRGSDHVFVASHDFGACFHTLEDVAMADGIPIILKN 230
Query: 207 AIRVVC------SPSYDVAFIPHKDVALPQVLQPFAL-------PRGGRDVENRTVLGFW 253
+I + P +V ++V +P + P ++ P GR R + F+
Sbjct: 231 SIVLQTFGVIHQHPCQEV-----ENVVIPPYVSPESVRSTLEKFPVTGR----RDIFAFF 281
Query: 254 AGHR------------NSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFC 301
G + ++R + R + D + +R YQ ++ FC
Sbjct: 282 RGKMEVHPKNVSRRFYSKRVRTEIWRKFNGDRRFYLQRHRFAG------YQLEIARSVFC 335
Query: 302 ICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILK 361
+CP G S R+ +S+ GCVPV++++ LPF+ + W + ++ + ERDV +L +IL+
Sbjct: 336 LCPLGWAPWSPRLVESVALGCVPVVIADGIQLPFSSAVRWSEISLSVAERDVGKLGKILE 395
Query: 362 NISDAEFVSLHNNL 375
++ + NL
Sbjct: 396 RVAATNLSVIQRNL 409
>gi|449469226|ref|XP_004152322.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10-like [Cucumis
sativus]
Length = 388
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 95/335 (28%), Positives = 147/335 (43%), Gaps = 45/335 (13%)
Query: 86 VTNFKIYIY--PDG-DPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP 142
V K+Y+Y P + T + PR LT +A+E Y + + S T +PD A F+ P
Sbjct: 48 VGRLKVYVYDLPSKYNKKTLQKDPRCLTHMFAAEIYMHRFLLNSPVRTLNPDEADWFYTP 107
Query: 143 I--SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV------ 194
I +C +++ + + + +PYWNRT GADHFFV HD G
Sbjct: 108 IYVTCDLTPNGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQE 167
Query: 195 -RATE-GVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGG-RDVENRTVLG 251
+A + G+P+ ++ PS I F RG DV N G
Sbjct: 168 EKAIDRGIPYCPPQKMKTHLIPSETPRSI-------------FVYFRGLFYDVNNDPEGG 214
Query: 252 FWAGHRNSKIRVILARVWEN---DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQ 308
++A R A VWEN + DIS + P Y + FC+CP G
Sbjct: 215 YYA-------RGARAAVWENFKNNPLFDISTDH------PTTYYEDMQRAIFCLCPLGWA 261
Query: 309 VNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEF 368
S R+ +++ +GC+PVI+++ LPF D + W + V ++E+DV L IL +I
Sbjct: 262 PWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVDEKDVSNLDTILTSIPPDVI 321
Query: 369 VSLHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
+ L +++ + P DAFH ++ L
Sbjct: 322 LRKQRLLANPSMKRAMMFPQPAQSGDAFHQILNGL 356
>gi|125582866|gb|EAZ23797.1| hypothetical protein OsJ_07509 [Oryza sativa Japonica Group]
Length = 322
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 79/227 (34%), Positives = 113/227 (49%), Gaps = 19/227 (8%)
Query: 53 STLSFAINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGK 112
ST+ A DY G+ ++ + FH +Y+EM FKI+ Y +G+P + P T
Sbjct: 71 STVVAAAGDDDYVPRGA-IYRNARAFHRSYVEMERRFKIWTYREGEPPVAHIGP--GTDI 127
Query: 113 YASEGYFFQNIRE--SRFLTHDPDRAHLFFIPISC-------HKMRGKGTSYENMTVIVK 163
Y+ EG F + + SRF PD AH F +PIS +++ G + +V
Sbjct: 128 YSIEGQFMYEMDDPRSRFAARRPDDAHAFLLPISVCNLVHYVYRLNATG-DLAPLRGLVA 186
Query: 164 DYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPH 223
DYV + +YPYWNR+ GADH V+CHD T L NAIRV+C+ + F P
Sbjct: 187 DYVRVVAERYPYWNRSRGADHVIVSCHDWAPMVTSAHRQLYGNAIRVLCNANTSEGFRPR 246
Query: 224 KDVALPQVLQPFALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWE 270
KD LP++ A+P R E R +L + R RV+ ARV +
Sbjct: 247 KDATLPEM--SVAVP-AARIPELRAILRRVSERR---YRVLRARVLQ 287
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/63 (31%), Positives = 38/63 (60%)
Query: 343 KFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELW 402
+ +V + + +L+ IL+ +S+ + L +++ Q+HF + P RFD HMV++ +W
Sbjct: 253 EMSVAVPAARIPELRAILRRVSERRYRVLRARVLQAQRHFVLHRPARRFDMIHMVLHSIW 312
Query: 403 LRR 405
LRR
Sbjct: 313 LRR 315
>gi|15223522|ref|NP_174064.1| putative glycosyltransferase [Arabidopsis thaliana]
gi|75272002|sp|Q9FZJ1.1|IRX10_ARATH RecName: Full=Probable beta-1,4-xylosyltransferase IRX10; AltName:
Full=Glucuronoxylan glucuronosyltransferase 1;
Short=AtGUT1; AltName: Full=Glucuronoxylan
glucuronosyltransferase 2; AltName: Full=Protein
IRREGULAR XYLEM 10; AltName: Full=Xylan
xylosyltransferase IRX10
gi|9802541|gb|AAF99743.1|AC004557_22 F17L21.23 [Arabidopsis thaliana]
gi|63003872|gb|AAY25465.1| At1g27440 [Arabidopsis thaliana]
gi|98960979|gb|ABF58973.1| At1g27440 [Arabidopsis thaliana]
gi|332192709|gb|AEE30830.1| putative glycosyltransferase [Arabidopsis thaliana]
Length = 412
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 100/354 (28%), Positives = 154/354 (43%), Gaps = 54/354 (15%)
Query: 86 VTNFKIYIY--PDG-DPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP 142
V K+Y+Y P + + PR LT +A+E + + + S T +PD A F+ P
Sbjct: 43 VGKLKVYVYELPSKYNKKLLQKDPRCLTHMFAAEIFMHRFLLSSPVRTRNPDEADWFYTP 102
Query: 143 I--SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV------ 194
I +C +++ + + + +PYWNRT GADHFFV HD G
Sbjct: 103 IYPTCDLTPTGLPLPFKSPRMMRSSIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQE 162
Query: 195 -RATE-GVPFLIKNAIRV--------VCSPSYDVAFIPHKDVALPQVLQ----PFALPRG 240
+A E G+ L++ A V VC + P A PQ +Q P +PR
Sbjct: 163 EKAIERGILPLLQRATLVQTFGQRNHVCLDEGSITIPP---FAPPQKMQAHFIPPDIPRS 219
Query: 241 --------GRDVENRTVLGFWAGHRNSKIRVILARVWEN---DTELDISNNRINRAIGPL 289
DV N G++A R A VWEN + DIS + P
Sbjct: 220 IFVYFRGLFYDVNNDPEGGYYA-------RGARAAVWENFKNNPLFDISTDH------PT 266
Query: 290 VYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
Y + FC+CP G S R+ +++ +GC+PVI+++ LPF D + W + V +
Sbjct: 267 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVA 326
Query: 350 ERDVYQLKQILKNISDAEFVSLHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
E+DV +L IL +I + L +++ + P DAFH ++ L
Sbjct: 327 EKDVPELDTILTSIPTEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 380
>gi|10177241|dbj|BAB10615.1| unnamed protein product [Arabidopsis thaliana]
Length = 498
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 90/356 (25%), Positives = 156/356 (43%), Gaps = 50/356 (14%)
Query: 85 MVTNFKIYIY---PDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFI 141
+ T KIY+Y + + + R + +A+E + + S T DPD A FF+
Sbjct: 134 LFTGMKIYVYDLPASYNDDWVTASDRCASHLFAAEVAIHRALLSSDVRTLDPDEADYFFV 193
Query: 142 PI--SCHKMRGKG-TSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVG----- 193
P+ SC+ G S + ++ VD L + YP+WNR+ G+DH FV HD G
Sbjct: 194 PVYVSCNFSTSNGFPSLSHARSLLSSAVDFLSDHYPFWNRSQGSDHVFVASHDFGACFHA 253
Query: 194 ---VRATEGVP-FLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFAL-------PRGGR 242
+ EG+P F+ ++ I Y + V +P + P ++ P GR
Sbjct: 254 MEDMAIEEGIPKFMKRSIILQTFGVKYKHPCQEVEHVVIPPYIPPESVQKAIEKAPVNGR 313
Query: 243 DVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFY------ 296
R + F+ G + I R + I R RRFY
Sbjct: 314 ----RDIWAFFRGKMEVNPKNISGRFYSKGVRTAILKKFGGR--------RRFYLNRHRF 361
Query: 297 --------KTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVIL 348
++ FC+CP G S R+ +S GCVPV++++ LPF++ + W + ++ +
Sbjct: 362 AGYRSEIVRSVFCLCPLGWAPWSPRLVESAVLGCVPVVIADGIQLPFSETVQWPEISLTV 421
Query: 349 NERDVYQLKQILKNISDAEFVSLHNNLVK--VQKHFQWNSPPVRFDAFHMVMYELW 402
E+DV L+++L++++ ++ NL + ++ +N P DA ++ LW
Sbjct: 422 AEKDVRNLRKVLEHVAATNLSAIQRNLHEPVFKRALLYNVPMKEGDATWHILESLW 477
>gi|297851174|ref|XP_002893468.1| hypothetical protein ARALYDRAFT_472948 [Arabidopsis lyrata subsp.
lyrata]
gi|297339310|gb|EFH69727.1| hypothetical protein ARALYDRAFT_472948 [Arabidopsis lyrata subsp.
lyrata]
Length = 412
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 95/333 (28%), Positives = 146/333 (43%), Gaps = 51/333 (15%)
Query: 104 QTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTVI 161
+ PR LT +A+E + + + S T +PD A F+ PI +C +
Sbjct: 64 KDPRCLTHMFAAEIFMHRFLLSSPVRTRNPDEADWFYTPIYPTCDLTPTGLPLPFKSPRM 123
Query: 162 VKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRV--- 210
++ + + + +PYWNRT GADHFFV HD G +A E G+ L++ A V
Sbjct: 124 MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTF 183
Query: 211 -----VCSPSYDVAFIPHKDVALPQVLQ----PFALPRG--------GRDVENRTVLGFW 253
VC + P A PQ +Q P +PR DV N G++
Sbjct: 184 GQRNHVCLDEGSITIPP---FAPPQKMQAHFIPPDIPRSIFVYFRGLFYDVNNDPEGGYY 240
Query: 254 AGHRNSKIRVILARVWEN---DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVN 310
A R A VWEN + DIS + P Y + FC+CP G
Sbjct: 241 A-------RGARAAVWENFKNNPLFDISTDH------PTTYYEDMQRAIFCLCPLGWAPW 287
Query: 311 SARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVS 370
S R+ +++ +GC+PVI+++ LPF D + W + V + E+DV +L IL +I +
Sbjct: 288 SPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVAEKDVPELDTILTSIPTEVILR 347
Query: 371 LHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
L +++ + P DAFH ++ L
Sbjct: 348 KQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 380
>gi|302783378|ref|XP_002973462.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300159215|gb|EFJ25836.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 437
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 95/334 (28%), Positives = 148/334 (44%), Gaps = 59/334 (17%)
Query: 104 QTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTVI 161
+ PR LT +A+E + + + S T +P+ A F+ P+ +C R +
Sbjct: 61 KDPRCLTHMFATEIFVHEFLLGSAVRTLNPEEADWFYTPVYTTCDLTRNGLPLPFKSPRM 120
Query: 162 VKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRV--- 210
++ + + N++PYWNRT GADHFFV HD G +A E G+ L++ A V
Sbjct: 121 MRSVIQYISNQWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILHLLQRATLVQTF 180
Query: 211 -----VCSPSYDVAFIPHKDVALPQVLQ----PFALPRG------------GRDVENRTV 249
VC + P+ PQ +Q P ++PR G D E
Sbjct: 181 GQRYHVCLKEGSIVVPPY---CPPQKMQAHLIPPSIPRSIFVYFRGLFYDYGNDPEG--- 234
Query: 250 LGFWAGHRNSKIRVILARVWEN--DTEL-DISNNRINRAIGPLVYQRRFYKTKFCICPGG 306
G++A R A VWEN D L DIS P+ Y + FC+CP G
Sbjct: 235 -GYYA-------RGARAAVWENFKDNPLFDISTEH------PITYYEDMQRAIFCLCPLG 280
Query: 307 SQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDA 366
S R+ + + +GC+PVI+++ LPF D + W + V + E+DV +L IL +I
Sbjct: 281 WAPWSPRLVEGVIFGCIPVIIADDIVLPFADAIPWEEIGVFVAEKDVPKLDTILTSIPPE 340
Query: 367 EFVSLHNNLV--KVQKHFQWNSPPVRFDAFHMVM 398
+ L +++ + P DAFH ++
Sbjct: 341 VILKKQRLLATPAMKQAMLFPQPAQPGDAFHQIL 374
>gi|449484890|ref|XP_004157009.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10-like [Cucumis
sativus]
Length = 417
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 99/354 (27%), Positives = 155/354 (43%), Gaps = 54/354 (15%)
Query: 86 VTNFKIYIY--PDG-DPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP 142
V K+Y+Y P + T + PR LT +A+E Y + + S T +PD A F+ P
Sbjct: 48 VGRLKVYVYDLPSKYNKKTLQKDPRCLTHMFAAEIYMHRFLLNSPVRTLNPDEADWFYTP 107
Query: 143 I--SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV------ 194
I +C +++ + + + +PYWNRT GADHFFV HD G
Sbjct: 108 IYVTCDLTPNGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQE 167
Query: 195 -RATE-GVPFLIKNAIRV--------VCSPSYDVAFIPHKDVALPQVLQPFALP------ 238
+A + G+ L++ A V VC + P+ PQ ++ +P
Sbjct: 168 EKAIDRGILPLLQRATLVQTFGQRNHVCLNEGSITIPPY---CPPQKMKTHLIPSETPRS 224
Query: 239 -----RGG-RDVENRTVLGFWAGHRNSKIRVILARVWEN---DTELDISNNRINRAIGPL 289
RG DV N G++A R A VWEN + DIS + P
Sbjct: 225 IFVYFRGLFYDVNNDPEGGYYA-------RGARAAVWENFKNNPLFDISTDH------PT 271
Query: 290 VYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
Y + FC+CP G S R+ +++ +GC+PVI+++ LPF D + W + V ++
Sbjct: 272 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVD 331
Query: 350 ERDVYQLKQILKNISDAEFVSLHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
E+DV L IL +I + L +++ + P DAFH ++ L
Sbjct: 332 EKDVSNLDTILTSIPPDVILRKQRLLANPSMKRAMMFPQPAQSGDAFHQILNGL 385
>gi|242088705|ref|XP_002440185.1| hypothetical protein SORBIDRAFT_09g027450 [Sorghum bicolor]
gi|241945470|gb|EES18615.1| hypothetical protein SORBIDRAFT_09g027450 [Sorghum bicolor]
Length = 416
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 96/351 (27%), Positives = 150/351 (42%), Gaps = 54/351 (15%)
Query: 86 VTNFKIYIY---PDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP 142
V K+Y+Y P + N + R L+ +A+E + + + S T +PD A F+ P
Sbjct: 47 VGRLKVYVYELPPKYNKNILAKDSRCLSHMFATEIFMHRFLLTSAVRTLNPDEADWFYTP 106
Query: 143 I--SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV------ 194
+ +C +++ + + ++PYWNRT GADHFFVT HD G
Sbjct: 107 VYTTCDLTPWGHPLTTKSPRMMRSAIQYISKRWPYWNRTEGADHFFVTPHDFGACFYFQE 166
Query: 195 --RATEGVPFLIKNAIRV--------VCSPSYDVAFIPHKDVALPQVLQPFALP------ 238
GV +++ A V VC + P+ A P ++ +P
Sbjct: 167 ETAIQRGVLPVLRRATLVQTFGQKHHVCLKEGSITIPPY---APPHKIRTHIVPPETPRS 223
Query: 239 -----RGG-RDVENRTVLGFWAGHRNSKIRVILARVWEN---DTELDISNNRINRAIGPL 289
RG D N G++A R A VWEN + DIS P
Sbjct: 224 IFVYFRGLFYDTANDPEGGYYA-------RGARASVWENFKNNALFDISTEH------PP 270
Query: 290 VYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
Y + FC+CP G S R+ +++ +GC+PVI+++ LPF D + W + AV +
Sbjct: 271 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIAVFVA 330
Query: 350 ERDVYQLKQILKNISDAEFVSLHNNLV--KVQKHFQWNSPPVRFDAFHMVM 398
E DV +L IL +I E + L +++ + P DAFH V+
Sbjct: 331 EDDVLKLDTILTSIPMEEILRKQRLLANPSMKQAMLFPQPAEPRDAFHQVL 381
>gi|356547155|ref|XP_003541982.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
IRX7-like [Glycine max]
Length = 458
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 97/384 (25%), Positives = 167/384 (43%), Gaps = 56/384 (14%)
Query: 29 VTLLCFTCLSFNSFRSYPLQNNFNSTLSFAINKQDYSDLGSDVFHSPSVFHLNYLEMVTN 88
V L F+ F S+ NN N+ S N + + + + S S N L ++ N
Sbjct: 43 VLWLSFSLYFFTSYLITSNPNNTNTPTSHVSNSES-NVVPRTLVESTS----NTLGVLKN 97
Query: 89 FKIYIY--PDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI--S 144
K+++Y P + R +ASE + + S T DP A FF+P+ S
Sbjct: 98 MKVFVYELPPKYNTDWLANERCSNHLFASEVAIHRALLTSEVRTFDPYEADFFFVPVYVS 157
Query: 145 CHKMRGKG-TSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVG--------VR 195
C+ G + + ++ V+ + +YP+WNR+ G+DH FV HD G V
Sbjct: 158 CNFSAVNGFPAIGHARTLISSAVNLVSTEYPFWNRSRGSDHVFVASHDFGACFHTLEDVA 217
Query: 196 ATEGVPFLIKNAIRVVC------SPSYDVAFIPHKDVALPQVLQP--------------- 234
+G+P ++KN+I + P DV ++V +P + P
Sbjct: 218 MADGIPKILKNSIVLQTFGVIHPHPCQDV-----ENVVIPPYVAPESVRSTLEKFPVNGR 272
Query: 235 ---FALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVY 291
+A RG +V + V G + R +R + R + D + R Y
Sbjct: 273 RDIWAFFRGKMEVHPKNVSGQFYSKR---VRTEIWRKFNGDRRFYLQRRRFAG------Y 323
Query: 292 QRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNER 351
Q ++ FC+CP G S R+ +S+ GCVPV++++ LPF+ + W + ++ + ER
Sbjct: 324 QLEIARSVFCLCPLGWAPWSPRLVESVALGCVPVVIADGIRLPFSSAVRWSEISLTVAER 383
Query: 352 DVYQLKQILKNISDAEFVSLHNNL 375
DV +L +IL+ ++ + +L
Sbjct: 384 DVGKLGKILERVAATNLSVIQKSL 407
>gi|242059739|ref|XP_002459015.1| hypothetical protein SORBIDRAFT_03g044510 [Sorghum bicolor]
gi|241930990|gb|EES04135.1| hypothetical protein SORBIDRAFT_03g044510 [Sorghum bicolor]
Length = 415
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 96/330 (29%), Positives = 145/330 (43%), Gaps = 51/330 (15%)
Query: 107 RKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTVIVKD 164
R L +A+E + Q + S T DP+ A F+ P+ +C I++
Sbjct: 70 RCLHHMFAAEIFMHQFLLSSAVRTLDPEEADWFYTPVYTTCDLTPQGFPLPFRAPRIMRS 129
Query: 165 YVDSLINKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRV------ 210
+ + +P+WNRT GADHFF+T HD G RA E G+ L++ A V
Sbjct: 130 AIRYVATTWPFWNRTDGADHFFLTPHDFGACFHYQEERAMERGILPLLRRATLVQTFGQR 189
Query: 211 --VCSPSYDVAFIPHKDVALPQVLQPFALPRGGR------------DVENRTVLGFWAGH 256
VC + P+ D P +Q + G D+ N G++A
Sbjct: 190 NHVCLQDGSITVPPYAD---PHKMQAHLISPGTPRSIFVYFRGLFYDMGNDPEGGYYA-- 244
Query: 257 RNSKIRVILARVWEN--DTEL-DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSAR 313
R A VWEN D L DIS P Y + FC+CP G S R
Sbjct: 245 -----RGARASVWENFKDNPLFDISTEH------PYTYYEDMQRAIFCLCPLGWAPWSPR 293
Query: 314 IADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHN 373
+ +++ +GC+PVI+++ LPF D + W +V + ERDV +L IL +I A+ +
Sbjct: 294 LVEAVVFGCIPVIIADDIVLPFADAIPWEDISVFVAERDVPRLDSILTSIPLADILRRQR 353
Query: 374 NLVK--VQKHFQWNSPPVRFDAFHMVMYEL 401
L + V++ ++ P DAFH V+ L
Sbjct: 354 LLARESVKQALLFHQPARTGDAFHQVLNGL 383
>gi|18424516|ref|NP_568941.1| Exostosin family protein [Arabidopsis thaliana]
gi|75163931|sp|Q940Q8.1|IX10L_ARATH RecName: Full=Probable beta-1,4-xylosyltransferase IRX10L; AltName:
Full=Glucuronoxylan glucuronosyltransferase 1;
Short=AtGUT1; AltName: Full=Glucuronoxylan
glucuronosyltransferase 2; Short=AtGUT2; AltName:
Full=Protein IRREGULAR XYLEM 10-like; AltName:
Full=Xylan xylosyltransferase IRX10L
gi|15809826|gb|AAL06841.1| AT5g61840/mac9_140 [Arabidopsis thaliana]
gi|17978869|gb|AAL47406.1| AT5g61840/mac9_140 [Arabidopsis thaliana]
gi|332010141|gb|AED97524.1| Exostosin family protein [Arabidopsis thaliana]
Length = 415
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 109/414 (26%), Positives = 169/414 (40%), Gaps = 74/414 (17%)
Query: 23 LLTLALVTLLCFTCLSFNSFRSYPLQNNFNSTLSFAINKQDYSDLGSDVFHSPSVFHLNY 82
L + L+ LLC T S ++FR LS + + S DV
Sbjct: 3 LSSCVLIFLLCNTFSSISAFR-----------LSRSQPTERISGSAGDVLED-------- 43
Query: 83 LEMVTNFKIYIY--PDG-DPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLF 139
+ V K+++Y P + + PR L +A+E Y + + S T +P+ A F
Sbjct: 44 -DPVGRLKVFVYELPSKYNKKILQKDPRCLNHMFAAEIYMQRFLLSSPVRTLNPEEADWF 102
Query: 140 FIPI--SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV--- 194
++P+ +C +++ + + + +PYWNRT GADHFFV HD G
Sbjct: 103 YVPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIASNWPYWNRTEGADHFFVVPHDFGACFH 162
Query: 195 -----RATEGVPFLIKNAIRV--------VCSPSYDVAFIPHKDVALPQVLQPFALPRGG 241
G+ L++ A V VC + P+ A PQ +Q +P
Sbjct: 163 YQEEKAIGRGILPLLQRATLVQTFGQRNHVCLKEGSITVPPY---APPQKMQSHLIPEKT 219
Query: 242 R------------DVENRTVLGFWAGHRNSKIRVILARVWEN--DTEL-DISNNRINRAI 286
DV N G++A R A VWEN D L DIS
Sbjct: 220 PRSIFVYFRGLFYDVGNDPEGGYYA-------RGARAAVWENFKDNPLFDISTEH----- 267
Query: 287 GPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAV 346
P Y + FC+CP G S R+ +++ +GC+PVI+++ LPF D + W V
Sbjct: 268 -PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPFADAIPWEDIGV 326
Query: 347 ILNERDVYQLKQILKNISDAEFVSLHNNLV--KVQKHFQWNSPPVRFDAFHMVM 398
++E+DV L IL +I + L +++ + P DAFH V+
Sbjct: 327 FVDEKDVPYLDTILTSIPPEVILRKQRLLANPSMKQAMLFPQPAQPGDAFHQVL 380
>gi|18642697|gb|AAL76189.1|AC092173_1 Unknown protein [Oryza sativa Japonica Group]
gi|23821296|dbj|BAC20930.1| NpGUT1 homolog [Oryza sativa Japonica Group]
Length = 401
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 94/351 (26%), Positives = 149/351 (42%), Gaps = 48/351 (13%)
Query: 86 VTNFKIYIY---PDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP 142
V K+Y+Y P + N + R L+ +A+E + + + S T +PD A F+ P
Sbjct: 32 VGKLKVYVYELPPKYNKNIVAKDSRCLSHMFATEIFMHRFLLSSAIRTSNPDEADWFYTP 91
Query: 143 I--SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV------ 194
+ +C +++ + + +PYWNRT GADHFFV HD
Sbjct: 92 VYTTCDLTPWGHPLTTKSPRMMRSAIKFISKYWPYWNRTEGADHFFVVPHDFAACFYFQE 151
Query: 195 -RATE-GVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV---------LQPFALPRG--- 240
+A E G+ +++ A V + A + + +P L P PR
Sbjct: 152 AKAIERGILPVLRRATLVQTFGQKNHACLKDGSITVPPYTPAHKIRAHLVPPETPRSIFV 211
Query: 241 -----GRDVENRTVLGFWAGHRNSKIRVILARVWEN---DTELDISNNRINRAIGPLVYQ 292
D N G++A R A VWEN + DIS + P Y
Sbjct: 212 YFRGLFYDTSNDPEGGYYA-------RGARASVWENFKNNPMFDISTDH------PQTYY 258
Query: 293 RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERD 352
+ FC+CP G S R+ +++ +GC+PVI+++ DLP +D + W + AV + E D
Sbjct: 259 EDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIDLPLSDAIPWEEIAVFVAEDD 318
Query: 353 VYQLKQILKNISDAEFVSLHNNLVK--VQKHFQWNSPPVRFDAFHMVMYEL 401
V QL IL +I + L + +++ + P D FH VM L
Sbjct: 319 VPQLDTILTSIPTEVILRKQAMLAEPSMKQTMLFPQPAEPGDGFHQVMNAL 369
>gi|114325715|gb|ABI64067.1| glycosyltransferase GT47C [Populus tremula x Populus alba]
Length = 442
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 92/357 (25%), Positives = 157/357 (43%), Gaps = 54/357 (15%)
Query: 85 MVTNFKIYIY--PDGDPNTFYQTPRKLTGKYASEGYFFQNIRES-RFLTHDPDRAHLFFI 141
++ + K+YIY P + + R +ASE + + S T DP A FF+
Sbjct: 84 LLKDLKVYIYELPSKYNTDWLENERCSNHLFASEVAIHKALSSSLDVRTFDPYEADFFFV 143
Query: 142 PI--SCHKMRGKG-TSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV---- 194
P+ SC+ G + + ++ V + + YP+WNR+ G+DH FV HD G
Sbjct: 144 PVYVSCNFSTVNGFPAIGHARSLLSSAVQLISSNYPFWNRSQGSDHVFVASHDYGACFHA 203
Query: 195 ---RATE-GVPFLIKNAIRVVC------SPSYDVAFIPHKDVALPQVLQP---------- 234
RA E G+P +K +I + P DV ++V +P + P
Sbjct: 204 MEERAMEDGIPEFLKRSIILQTFGVKFNHPCQDV-----ENVVIPPYISPGSVRATLEKY 258
Query: 235 --------FALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAI 286
+A RG +V + + G + + K+R ++ R + D + +R
Sbjct: 259 PLTGRRDIWAFFRGKMEVHPKNISG---RYYSKKVRTVILRKYSGDRRFYLQRHRFAG-- 313
Query: 287 GPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAV 346
YQ ++ FC+CP G S R+ +S+ GCVPVI+++ LPF + W + ++
Sbjct: 314 ----YQSEIVRSVFCLCPLGWAPWSPRLVESVALGCVPVIIADGIRLPFPTAVRWSEISL 369
Query: 347 ILNERDVYQLKQILKNISDAEFVSLHNNL--VKVQKHFQWNSPPVRFDAFHMVMYEL 401
+ E+DV L +L ++ ++ NL V++ +N P DA V+Y L
Sbjct: 370 TVAEKDVANLGTLLDQVAATNLSAIQKNLWDPDVRRALLFNDPVQGGDATWQVLYAL 426
>gi|255565439|ref|XP_002523710.1| transferase, putative [Ricinus communis]
gi|223537014|gb|EEF38650.1| transferase, putative [Ricinus communis]
Length = 461
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 106/405 (26%), Positives = 173/405 (42%), Gaps = 46/405 (11%)
Query: 32 LCFTCLSFNSFRSYPLQNNFNSTLSFAINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKI 91
L F F S + PL + S +++ + S FH P + ++ + KI
Sbjct: 52 LYFITSYFISHKPIPLSKAQYFSKSAVVSRALFESTNSTFFHQPK--NNTNQALLKDLKI 109
Query: 92 YIY--PDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFL-THDPDRAHLFFIPI--SCH 146
YIY P + R +ASE + I S + T DP A FF+P+ SC+
Sbjct: 110 YIYELPSKYNRDWLSNKRCSNHLFASEVAIHKAISNSDDIRTFDPYEADFFFVPVYVSCN 169
Query: 147 KMRGKG-TSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV-------RATE 198
G + + ++ V + YP+WNR+ GADH FV HD G RA +
Sbjct: 170 FSTINGFPAIGHARSLLSSAVTFISTNYPFWNRSQGADHVFVASHDFGSCFHTLEERAMQ 229
Query: 199 -GVP-FLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALP------------------ 238
GVP FL K+ I YD ++V +P + P ++
Sbjct: 230 DGVPEFLKKSIILQTFGVKYDHPCQQVENVVIPPYISPVSVRSTLKKAPLTGRRDIWVFF 289
Query: 239 RGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKT 298
RG +V + V G + + K+R + R + D + +R YQ ++
Sbjct: 290 RGKMEVHPKNVSGRFY---SKKVRTEIWRRFNGDRRFYLQRHRFAG------YQSEIARS 340
Query: 299 KFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQ 358
FC+CP G S R+ +S+ GCVPVI+++ LPF + W ++ + E+DV +L +
Sbjct: 341 VFCLCPLGWAPWSPRLVESVALGCVPVIIADGIRLPFPSAVPWPAISLTVAEKDVAKLGR 400
Query: 359 ILKNISDAEFVSLHNNL--VKVQKHFQWNSPPVRFDAFHMVMYEL 401
IL++++ + N+ V++ +N DA V+Y L
Sbjct: 401 ILEDVAATNLTLIQKNIWDPTVRRALLFNDQIEEGDATWQVLYAL 445
>gi|356516468|ref|XP_003526916.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
[Glycine max]
Length = 416
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 98/354 (27%), Positives = 153/354 (43%), Gaps = 54/354 (15%)
Query: 86 VTNFKIYIY--PDG-DPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP 142
V K+++Y P + + PR L +A+E + + + S T +P+ A F+ P
Sbjct: 47 VGRLKVFVYELPSKYNKKILQKDPRCLNHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP 106
Query: 143 I--SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV------ 194
+ +C +++ + + + +PYWNRT GADHFFVT HD G
Sbjct: 107 VYTTCDLTPNGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVTPHDFGACFHYQE 166
Query: 195 -RATE-GVPFLIKNAIRV--------VCSPSYDVAFIPHKDVALPQVLQPFALPRGGR-- 242
+A E G+ L++ A V VC + P+ A PQ + +P
Sbjct: 167 EKAIERGILPLLQRATLVQTFGQRNHVCLKEGSITIPPY---APPQKMHTHLIPEKTPRS 223
Query: 243 ----------DVENRTVLGFWAGHRNSKIRVILARVWEN--DTEL-DISNNRINRAIGPL 289
DV N G++A R A VWEN D L DIS P
Sbjct: 224 IFVYFRGLFYDVGNDPEGGYYA-------RGARAAVWENFKDNPLFDISTEH------PT 270
Query: 290 VYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
Y + FC+CP G S R+ +++ +GC+PVI+++ LPF D + W + V ++
Sbjct: 271 TYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVD 330
Query: 350 ERDVYQLKQILKNISDAEFVSLHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
E+DV QL IL +I + L +++ + P DAFH V+ L
Sbjct: 331 EKDVPQLDTILTSIPPEVILRKQRLLANPSMKQAMLFPQPAQPGDAFHQVLNGL 384
>gi|39725600|dbj|BAD04923.1| pectin-glucuronyltransferase [Nicotiana tabacum]
Length = 317
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 91/321 (28%), Positives = 140/321 (43%), Gaps = 51/321 (15%)
Query: 113 YASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTVIVKDYVDSLI 170
+A+E Y + + S T +P+ A F+ P+ +C +++ + +
Sbjct: 2 FAAEIYMHRFLLSSAVRTFNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIA 61
Query: 171 NKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRV--------VCSP 214
+ +PYWNRT GADHFF+T HD G +A E G+ L++ A V VC
Sbjct: 62 SNWPYWNRTEGADHFFITPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLK 121
Query: 215 SYDVAFIPHKDVALPQVLQPFALP-----------RG-GRDVENRTVLGFWAGHRNSKIR 262
+ P+ A PQ +Q +P RG DV N G++A R
Sbjct: 122 DGSITIPPY---APPQKMQSHLIPPDTPRSIFVYFRGLFYDVGNDPEGGYYA-------R 171
Query: 263 VILARVWEN---DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIH 319
A VWEN + DIS P Y + FC+CP G S R+ +++
Sbjct: 172 GARAAVWENFKDNPPFDISTEH------PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVI 225
Query: 320 YGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVK-- 377
+GC+PVI+++ LPF D + W V + E+DV L IL +I E + L K
Sbjct: 226 FGCIPVIIADDIVLPFADAIPWEDIGVFVAEKDVPTLDTILTSIPPEEILRKQRLLAKPS 285
Query: 378 VQKHFQWNSPPVRFDAFHMVM 398
+++ P DAFH ++
Sbjct: 286 MKQAMLSPQPAQSGDAFHQIL 306
>gi|356508876|ref|XP_003523179.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
[Glycine max]
Length = 416
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 98/354 (27%), Positives = 153/354 (43%), Gaps = 54/354 (15%)
Query: 86 VTNFKIYIY--PDG-DPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP 142
V K+++Y P + + PR L +A+E + + + S T +P+ A F+ P
Sbjct: 47 VGRLKVFVYELPSKYNKKILQKDPRCLNHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP 106
Query: 143 I--SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV------ 194
+ +C +++ + + + +PYWNRT GADHFFVT HD G
Sbjct: 107 VYTTCDLTPNGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVTPHDFGACFHYQE 166
Query: 195 -RATE-GVPFLIKNAIRV--------VCSPSYDVAFIPHKDVALPQVLQPFALPRGGR-- 242
+A E G+ L++ A V VC + P+ A PQ + +P
Sbjct: 167 EKAIERGILPLLQRATLVQTFGQRNHVCLKEGSITIPPY---APPQKMHTHLIPEKTPRS 223
Query: 243 ----------DVENRTVLGFWAGHRNSKIRVILARVWEN--DTEL-DISNNRINRAIGPL 289
DV N G++A R A VWEN D L DIS P
Sbjct: 224 IFVYFRGLFYDVGNDPEGGYYA-------RGARAAVWENFKDNPLFDISTEH------PT 270
Query: 290 VYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
Y + FC+CP G S R+ +++ +GC+PVI+++ LPF D + W + V ++
Sbjct: 271 TYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVD 330
Query: 350 ERDVYQLKQILKNISDAEFVSLHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
E+DV QL IL +I + L +++ + P DAFH V+ L
Sbjct: 331 EKDVPQLDTILTSIPPEVILRKQRLLANPSMKQAMLFPQPAQPGDAFHQVLNGL 384
>gi|296084506|emb|CBI25527.3| unnamed protein product [Vitis vinifera]
Length = 275
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 63/179 (35%), Positives = 98/179 (54%), Gaps = 11/179 (6%)
Query: 60 NKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYF 119
K+++ G+ V+ +P FH +++EM FKI+ Y +GD + P+ Y EG F
Sbjct: 31 KKENFIPRGA-VYRNPYAFHQSHIEMEKRFKIWTYKEGDQPLVHGGPK--NSIYGIEGQF 87
Query: 120 FQNIR--ESRFLTHDPDRAHLFFIPIS----CHKMRGKGTSYEN--MTVIVKDYVDSLIN 171
+ +S F+ PD AH+F+IPIS H + Y + +V DY+ + +
Sbjct: 88 MDEMESGDSHFVAGHPDEAHVFYIPISVTRIAHYIYSPPVDYSGHMLQRLVTDYIYVVSD 147
Query: 172 KYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQ 230
KYPYWNR+ GADHF V+CHD + P L K+ IRV+C+ + F P +D++LP+
Sbjct: 148 KYPYWNRSNGADHFLVSCHDWAPEISIVTPDLYKHFIRVLCNANTSERFQPIRDISLPE 206
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/72 (29%), Positives = 42/72 (58%), Gaps = 1/72 (1%)
Query: 334 PFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDA 393
P DI +F++ + + ++K+ILK + + ++ + + +VQ+HF N P +D
Sbjct: 198 PIRDI-SLPEFSINITSDKIPEIKKILKAVPNERYLRMQKRVKQVQRHFVINRPAQPYDM 256
Query: 394 FHMVMYELWLRR 405
HM+++ +WLRR
Sbjct: 257 LHMILHSVWLRR 268
>gi|224075447|ref|XP_002304637.1| predicted protein [Populus trichocarpa]
gi|222842069|gb|EEE79616.1| predicted protein [Populus trichocarpa]
Length = 413
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 95/333 (28%), Positives = 144/333 (43%), Gaps = 51/333 (15%)
Query: 104 QTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTVI 161
+ PR LT +A+E + + + S T +P+ A F+ PI +C N +
Sbjct: 65 KDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFNSPRM 124
Query: 162 VKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRV--- 210
++ + L + +PYWNRT GADHFFV HD G +A E G+ L++ A V
Sbjct: 125 MRSAIQLLSSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTF 184
Query: 211 -----VCSPSYDVAFIPHKDVALPQVLQ----PFALPRG--------GRDVENRTVLGFW 253
VC + P+ A PQ +Q P PR DV N G++
Sbjct: 185 GQRNHVCLNEGSITIPPY---APPQKMQAHQIPLDTPRSIFVYFRGLFYDVNNDPEGGYY 241
Query: 254 AGHRNSKIRVILARVWEN---DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVN 310
A R A VWEN + DIS + P Y + FC+CP G
Sbjct: 242 A-------RGARAAVWENFKNNPLFDISTDH------PTTYYEDMQRAIFCLCPLGWAPW 288
Query: 311 SARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVS 370
S R+ +++ +GC+PVI+++ LPF D + W + V + E DV L L +I +
Sbjct: 289 SPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVAEEDVPHLDTFLTSIPPEVILR 348
Query: 371 LHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
L +++ + P DAFH ++ L
Sbjct: 349 KQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 381
>gi|212275360|ref|NP_001130059.1| uncharacterized protein LOC100191151 precursor [Zea mays]
gi|194688192|gb|ACF78180.1| unknown [Zea mays]
Length = 418
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 95/354 (26%), Positives = 151/354 (42%), Gaps = 54/354 (15%)
Query: 86 VTNFKIYIY--PDG-DPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP 142
V K+Y+Y P + + PR L+ +A+E + + + S T +P+ A F+ P
Sbjct: 48 VGRLKVYVYDLPSKYNKKLLKKDPRCLSHMFAAEIFMHRFLLSSAVRTFNPEEADWFYTP 107
Query: 143 I--SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV------ 194
+ +C +++ ++ + +PYWNR+ GADHFFVT HD G
Sbjct: 108 VYTTCDLTPSGLPLPFKSPRMMRSAIERVATNWPYWNRSEGADHFFVTPHDFGACFHYQE 167
Query: 195 --RATEGVPFLIKNAIRV--------VCSPSYDVAFIPHKDVALPQVLQPFALP------ 238
G+P L++ A V VC + P A PQ +Q +P
Sbjct: 168 EKAIGRGIPPLLQRATLVQTFGQKNHVCLKDGSITIPP---FAPPQKMQAHLIPADTPRS 224
Query: 239 -----RGG-RDVENRTVLGFWAGHRNSKIRVILARVWEN---DTELDISNNRINRAIGPL 289
RG D N G++A R A VWEN + DIS + P
Sbjct: 225 IFVYFRGLFYDTSNDPEGGYYA-------RGARASVWENFKNNPLFDISTDH------PA 271
Query: 290 VYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
Y ++ FC+CP G S R+ +++ +GC+PVI+++ LPF D + W V +
Sbjct: 272 TYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIVADDIVLPFADAIPWEDIGVFVA 331
Query: 350 ERDVYQLKQILKNISDAEFVSLHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
E DV +L IL +I + L +++ + P DAFH ++ L
Sbjct: 332 EEDVPRLDTILTSIPTDVVLRKQRLLANPSMKQAMLFPQPAQAGDAFHQILNGL 385
>gi|168000965|ref|XP_001753186.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695885|gb|EDQ82227.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 449
Score = 111 bits (277), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 101/355 (28%), Positives = 150/355 (42%), Gaps = 62/355 (17%)
Query: 86 VTNFKIYIYPDGDPNTFYQT------PRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLF 139
V K++IY D + Y T PR LT +A E Y + ES T +P+ A F
Sbjct: 76 VGRLKVFIY---DIPSKYNTDWLKKDPRCLTHMFAVEEYLHDFLTESPVRTLNPEEADWF 132
Query: 140 FIPI--SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV--- 194
+ P+ +C +++ + + + +PYWNRT GADHFFV HD
Sbjct: 133 YTPVYTTCDLTPNGLPLPFKSPRVMRSAISYISSHWPYWNRTDGADHFFVVPHDFAACFH 192
Query: 195 ----RATE-GVPFLIKNAIRV--------VCSPSYDVAFIPHKDVALPQV-LQPFALPRG 240
+A E G+ L+K A + VC + P+ Q L P + PR
Sbjct: 193 YQEEKAIERGILPLLKRATLIQTFGQNHHVCLKEDSIVIPPYAPPERMQTRLNPPSTPRS 252
Query: 241 ------------GRDVENRTVLGFWAGHRNSKIRVILARVWEN--DTEL-DISNNRINRA 285
G D E G++A R A +WEN D L DIS
Sbjct: 253 IFAYFRGLFYDPGNDPEG----GYYA-------RGARAAIWENFKDNPLFDISTEH---- 297
Query: 286 IGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFA 345
P Y + FC+CP G S R+ + + +GC+PVI+++ LPF D + W K
Sbjct: 298 --PATYYEDMQRAIFCLCPLGWAPWSPRLVEGVIFGCIPVIIADDIVLPFADAIPWEKIG 355
Query: 346 VILNERDVYQLKQILKNISDAEFVSLHNNLV--KVQKHFQWNSPPVRFDAFHMVM 398
V + E+DV L +IL I+ E + L +++ + P DAFH ++
Sbjct: 356 VFVEEKDVPILDKILCTINHEEVLEKQRLLANPAMKQAMLFPRPAKPGDAFHQIL 410
>gi|363543513|ref|NP_001241766.1| secondary cell wall-related glycosyltransferase family 47 precursor
[Zea mays]
gi|195634597|gb|ACG36767.1| secondary cell wall-related glycosyltransferase family 47 [Zea
mays]
Length = 417
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 94/351 (26%), Positives = 150/351 (42%), Gaps = 48/351 (13%)
Query: 86 VTNFKIYIY---PDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP 142
V K+Y+Y P + N + R L+ +A+E + + + S T +PD A F+ P
Sbjct: 48 VGRLKVYVYELPPKYNKNVVAKDSRCLSHMFATEIFMHRFLLASAVRTLNPDEADWFYTP 107
Query: 143 I--SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV------ 194
+ +C +++ + + ++PYWNRT GADHFFVT HD G
Sbjct: 108 VYTTCDLTPWGHPLTVKSPRMMRSAIQYVSKRWPYWNRTEGADHFFVTPHDFGACFYFQE 167
Query: 195 -RATE-GVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFAL------PRGGR---- 242
+A + GV +++ A V + + + +P P + P R
Sbjct: 168 EKAIQRGVLPVLRRATLVQTFGQKNHVCLREGSITIPPYAPPHKIRAHIVPPETPRSIFV 227
Query: 243 -------DVENRTVLGFWAGHRNSKIRVILARVWEN---DTELDISNNRINRAIGPLVYQ 292
D N G++A R A VWEN + DIS P Y
Sbjct: 228 YFRGLFYDTANDPEGGYYA-------RGARASVWENFKNNALFDISTEH------PPTYY 274
Query: 293 RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERD 352
+ FC+CP G S R+ +++ +GC+PVI+++ LPF D + W + AV + E D
Sbjct: 275 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIAVFVPEDD 334
Query: 353 VYQLKQILKNISDAEFVSLHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
V +L IL +I E + L +++ + P DAFH V+ L
Sbjct: 335 VLRLDTILTSIPMDEILRKQRLLANPSMKQAMLFPQPAEPRDAFHQVLNGL 385
>gi|255540029|ref|XP_002511079.1| catalytic, putative [Ricinus communis]
gi|223550194|gb|EEF51681.1| catalytic, putative [Ricinus communis]
Length = 417
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 92/330 (27%), Positives = 144/330 (43%), Gaps = 51/330 (15%)
Query: 104 QTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTVI 161
+ PR LT +A+E + + + S T +P+ A F+ P+ +C +
Sbjct: 69 KDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRM 128
Query: 162 VKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRV--- 210
++ + + + +PYWNRT GADHFFV HD G +A E G+ +++ A V
Sbjct: 129 MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPMLQRATLVQTF 188
Query: 211 -----VCSPSYDVAFIPHKDVALPQVLQPFALPRGGR------------DVENRTVLGFW 253
VC + P+ A PQ +Q +P DV N G++
Sbjct: 189 GQRNHVCLKEGSITIPPY---APPQKMQAHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYY 245
Query: 254 AGHRNSKIRVILARVWEN--DTEL-DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVN 310
A R A VWEN D L DIS P Y + FC+CP G
Sbjct: 246 A-------RGARAAVWENFKDNPLFDISTEH------PTTYYEDMQRAVFCLCPLGWAPW 292
Query: 311 SARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVS 370
S R+ +++ +GC+PVI+++ LPF D + W + V ++E+DV L IL +I +
Sbjct: 293 SPRLVEAVIFGCIPVIIADDIVLPFADAIPWEEIGVFVDEKDVPNLDTILTSIPPEVILR 352
Query: 371 LHNNLV--KVQKHFQWNSPPVRFDAFHMVM 398
L +++ + P DAFH V+
Sbjct: 353 KQRLLANPSMKQAMLFPQPAQAGDAFHQVL 382
>gi|226529361|ref|NP_001147714.1| LOC100281324 precursor [Zea mays]
gi|195613258|gb|ACG28459.1| secondary cell wall-related glycosyltransferase family 47 [Zea
mays]
Length = 418
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 94/351 (26%), Positives = 150/351 (42%), Gaps = 48/351 (13%)
Query: 86 VTNFKIYIY---PDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP 142
V K+Y+Y P + N + R L+ +A+E + + + S T +PD A F+ P
Sbjct: 49 VGRLKVYVYELPPKYNKNVVAKDSRCLSHMFATEIFMHRFLLASAVRTLNPDEADWFYTP 108
Query: 143 I--SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV------ 194
+ +C +++ + + ++PYWNRT GADHFFVT HD G
Sbjct: 109 VYTTCDLTPWGHPLTVKSPRMMRSAIQYVSKRWPYWNRTEGADHFFVTPHDFGACFYFQE 168
Query: 195 -RATE-GVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFAL------PRGGR---- 242
+A + GV +++ A V + + + +P P + P R
Sbjct: 169 EKAIQRGVLPVLRRATLVQTFGQKNHVCLREGSITIPPYAPPHKIRAHIVPPETPRSIFV 228
Query: 243 -------DVENRTVLGFWAGHRNSKIRVILARVWEN---DTELDISNNRINRAIGPLVYQ 292
D N G++A R A VWEN + DIS P Y
Sbjct: 229 YFRGLFYDTANDPEGGYYA-------RGARASVWENFKNNALFDISTEH------PPTYY 275
Query: 293 RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERD 352
+ FC+CP G S R+ +++ +GC+PVI+++ LPF D + W + AV + E D
Sbjct: 276 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIAVFVPEDD 335
Query: 353 VYQLKQILKNISDAEFVSLHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
V +L IL +I E + L +++ + P DAFH V+ L
Sbjct: 336 VLRLDTILTSIPMDEILRKQRLLANPSMKQAMLFPQPAEPRDAFHQVLNGL 386
>gi|302823891|ref|XP_002993593.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300138521|gb|EFJ05285.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 437
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 94/334 (28%), Positives = 147/334 (44%), Gaps = 59/334 (17%)
Query: 104 QTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTVI 161
+ PR LT +A+E + + + S T +P+ A F+ P+ +C +
Sbjct: 61 KDPRCLTHMFATEIFVHEFLLGSAVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRM 120
Query: 162 VKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRV--- 210
++ + + N++PYWNRT GADHFFV HD G +A E G+ L++ A V
Sbjct: 121 MRSVIQYISNQWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILHLLQRATLVQTF 180
Query: 211 -----VCSPSYDVAFIPHKDVALPQVLQ----PFALPRG------------GRDVENRTV 249
VC + P+ PQ +Q P ++PR G D E
Sbjct: 181 GQRYHVCLKEGSIVVPPY---CPPQKMQAHLIPPSIPRSIFVYFRGLFYDYGNDPEG--- 234
Query: 250 LGFWAGHRNSKIRVILARVWEN--DTEL-DISNNRINRAIGPLVYQRRFYKTKFCICPGG 306
G++A R A VWEN D L DIS P+ Y + FC+CP G
Sbjct: 235 -GYYA-------RGARAAVWENFKDNPLFDISTEH------PITYYEDMQRAIFCLCPLG 280
Query: 307 SQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDA 366
S R+ + + +GC+PVI+++ LPF D + W + V + E+DV +L IL +I
Sbjct: 281 WAPWSPRLVEGVIFGCIPVIIADDIVLPFADAIPWEEIGVFVAEKDVPKLDTILTSIPPE 340
Query: 367 EFVSLHNNLV--KVQKHFQWNSPPVRFDAFHMVM 398
+ L +++ + P DAFH ++
Sbjct: 341 VILKKQRLLATPAMKQAMLFPQPAQPGDAFHQIL 374
>gi|224119858|ref|XP_002318180.1| predicted protein [Populus trichocarpa]
gi|224122230|ref|XP_002318783.1| predicted protein [Populus trichocarpa]
gi|222858853|gb|EEE96400.1| predicted protein [Populus trichocarpa]
gi|222859456|gb|EEE97003.1| predicted protein [Populus trichocarpa]
Length = 417
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 94/330 (28%), Positives = 143/330 (43%), Gaps = 51/330 (15%)
Query: 104 QTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTVI 161
+ PR LT +A+E + + + S T +PD A F+ P+ +C +
Sbjct: 69 KDPRCLTHMFAAEIFMHRFLLSSPVRTLNPDEADWFYTPVYTTCDLTTNGLPLPFKSPRM 128
Query: 162 VKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRV--- 210
++ + + + +PYWNRT GADHFFV HD G +A E G+ L++ A V
Sbjct: 129 MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTF 188
Query: 211 -----VCSPSYDVAFIPHKDVALPQVLQPFALPRGGR------------DVENRTVLGFW 253
VC + P+ A PQ +Q +P DV N G++
Sbjct: 189 GQRNHVCLKDGSITVPPY---APPQKMQTHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYY 245
Query: 254 AGHRNSKIRVILARVWEN--DTEL-DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVN 310
A R A VWEN D L DIS P Y + FC+CP G
Sbjct: 246 A-------RGARAAVWENFKDNPLFDISTEH------PATYYEDMQRAVFCLCPLGWAPW 292
Query: 311 SARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVS 370
S R+ +++ +GC+PVI+++ LPF D + W + V ++E DV L IL +I +
Sbjct: 293 SPRLVEAVIFGCIPVIIADDIVLPFADAIPWEEIGVYVDEEDVPNLDTILTSIPPEVILR 352
Query: 371 LHNNLV--KVQKHFQWNSPPVRFDAFHMVM 398
L +++ + P DAFH V+
Sbjct: 353 KQRLLANPSMKQAMLFPQPAQPGDAFHQVL 382
>gi|194706890|gb|ACF87529.1| unknown [Zea mays]
Length = 418
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 94/351 (26%), Positives = 150/351 (42%), Gaps = 48/351 (13%)
Query: 86 VTNFKIYIY---PDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP 142
V K+Y+Y P + N + R L+ +A+E + + + S T +PD A F+ P
Sbjct: 49 VGRLKVYVYELPPKYNKNVVAKDSRCLSHMFATEIFMHRFLLASAVRTLNPDEADWFYTP 108
Query: 143 I--SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV------ 194
+ +C +++ + + ++PYWNRT GADHFFVT HD G
Sbjct: 109 VYTTCDLTPWGHPLTVKSPRMMRSAIQYVSKRWPYWNRTEGADHFFVTPHDFGACFYFQE 168
Query: 195 -RATE-GVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFAL------PRGGR---- 242
+A + GV +++ A V + + + +P P + P R
Sbjct: 169 EKAIQRGVLPVLRRATLVQTFGQKNHVCLREGSITIPPYAPPHKIRAHIVPPETPRSIFV 228
Query: 243 -------DVENRTVLGFWAGHRNSKIRVILARVWEN---DTELDISNNRINRAIGPLVYQ 292
D N G++A R A VWEN + DIS P Y
Sbjct: 229 YFRGLFYDTANDPEGGYYA-------RGARASVWENFKNNALFDISTEH------PPTYY 275
Query: 293 RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERD 352
+ FC+CP G S R+ +++ +GC+PVI+++ LPF D + W + AV + E D
Sbjct: 276 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIAVFVPEDD 335
Query: 353 VYQLKQILKNISDAEFVSLHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
V +L IL +I E + L +++ + P DAFH V+ L
Sbjct: 336 VLRLDTILTSIPMDEILRKQRLLANPSMKQAMLFPQPAEPRDAFHQVLNGL 386
>gi|413946367|gb|AFW79016.1| secondary cell wall glycosyltransferase family 47 [Zea mays]
Length = 418
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 94/351 (26%), Positives = 150/351 (42%), Gaps = 48/351 (13%)
Query: 86 VTNFKIYIY---PDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP 142
V K+Y+Y P + N + R L+ +A+E + + + S T +PD A F+ P
Sbjct: 49 VGRLKVYVYELPPKYNKNVVAKDSRCLSHMFATEIFMHRFLLASAVRTLNPDEADWFYTP 108
Query: 143 I--SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV------ 194
+ +C +++ + + ++PYWNRT GADHFFVT HD G
Sbjct: 109 VYTTCDLTPWGHPLTVKSPRMMRSAIQYVSKRWPYWNRTEGADHFFVTPHDFGACFYFQE 168
Query: 195 -RATE-GVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFAL------PRGGR---- 242
+A + GV +++ A V + + + +P P + P R
Sbjct: 169 EKAIQRGVLPVLRRATLVQTFGQKNHVCLREGSITIPPYAPPHKIRAHIVPPETPRSIFV 228
Query: 243 -------DVENRTVLGFWAGHRNSKIRVILARVWEN---DTELDISNNRINRAIGPLVYQ 292
D N G++A R A VWEN + DIS P Y
Sbjct: 229 YFRGLFYDTANDPEGGYYA-------RGARASVWENFKNNALFDISTEH------PPTYY 275
Query: 293 RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERD 352
+ FC+CP G S R+ +++ +GC+PVI+++ LPF D + W + AV + E D
Sbjct: 276 EDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIAVFVPEDD 335
Query: 353 VYQLKQILKNISDAEFVSLHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
V +L IL +I E + L +++ + P DAFH V+ L
Sbjct: 336 VLRLDTILTSIPMDEILRKQRLLANPSMKQAMLFPQPAEPRDAFHQVLNGL 386
>gi|357121010|ref|XP_003562215.1| PREDICTED: probable glucuronosyltransferase Os03g0107900-like
[Brachypodium distachyon]
Length = 441
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 77/272 (28%), Positives = 132/272 (48%), Gaps = 39/272 (14%)
Query: 133 PDRAHLFFIPI--SCHKMRGKG-TSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTC 189
P+ A LFF+P+ SC+ G S + ++ D VD + + PYWNR+ GADH FV
Sbjct: 135 PEDADLFFVPVYVSCNFSTPNGFPSLSHARGLLADAVDLVRREAPYWNRSAGADHVFVAS 194
Query: 190 HDVG--------VRATEGVPFLIKNAIRV---------VCSPSYDVAFIPH--KDVALPQ 230
HD G V +G+P +K +I + VC + V PH +VAL +
Sbjct: 195 HDFGACFHPMEDVAIADGIPDFLKRSILLQTFGVQGPHVCQEAEHVVIPPHVPPEVAL-E 253
Query: 231 VLQP-------FALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRIN 283
+L+ FA RG +V + + G + + K+R L + + +++ + R +
Sbjct: 254 ILELEKTRRDIFAFFRGKMEVHPKNISGRFY---SKKVRTELLQRYGRNSKFYLKRKRYD 310
Query: 284 RAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRK 343
Y+ ++ FC+CP G S R+ +S+ GC+PVI+++ LPF +L W
Sbjct: 311 N------YRSEMARSLFCLCPLGWAPWSPRLVESVLLGCIPVIIADNIRLPFPSVLRWSD 364
Query: 344 FAVILNERDVYQLKQILKNISDAEFVSLHNNL 375
++ + E+DV L+++L ++ + NL
Sbjct: 365 ISLQVAEKDVASLEKVLDHVVATNLTVIQKNL 396
>gi|222636093|gb|EEE66225.1| hypothetical protein OsJ_22377 [Oryza sativa Japonica Group]
Length = 341
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 66/191 (34%), Positives = 87/191 (45%), Gaps = 11/191 (5%)
Query: 71 VFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRES---- 126
V+ FH +Y+EM FK++ Y +G+P + G EG+ + S
Sbjct: 120 VYRDAYAFHQSYIEMEKRFKVWTYREGEPPVVQKGGAAFAGNDGIEGHLIAELDSSGGGG 179
Query: 127 RFLTHDPDRAHLFFIPISCHKMRGKGTS------YENMTVIVKDYVDSLINKYPYWNRTL 180
R P AH FF+PIS + G ++ +V YVD L YP+WNR+
Sbjct: 180 RHRARHPGEAHAFFLPISVASIAGYVYRRDMIDFWDPQLRLVAGYVDGLAAMYPFWNRSR 239
Query: 181 GADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPF-ALPR 239
GADHF V+CH + L NAIRV+C F P DVALP V+ A P
Sbjct: 240 GADHFLVSCHQWAPILSAAKAELRGNAIRVMCDADMSDGFDPATDVALPPVVASARATPP 299
Query: 240 GGRDVENRTVL 250
GR RTVL
Sbjct: 300 QGRVASERTVL 310
>gi|60657602|gb|AAX33322.1| secondary cell wall-related glycosyltransferase family 47 [Populus
tremula x Populus tremuloides]
Length = 442
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 92/357 (25%), Positives = 156/357 (43%), Gaps = 54/357 (15%)
Query: 85 MVTNFKIYIY--PDGDPNTFYQTPRKLTGKYASEGYFFQNIRES-RFLTHDPDRAHLFFI 141
++ + K+YIY P + R +ASE + + S T DP A FF+
Sbjct: 84 LLKDLKVYIYELPSKYNTDWLANERCSNHLFASEVAIHKALSSSLDVRTFDPYEADFFFV 143
Query: 142 PI--SCHKMRGKG-TSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV---- 194
P+ SC+ G + + ++ V + + YP+WNR+ G+DH FV HD G
Sbjct: 144 PVYVSCNFSTVNGFPAIGHARSLLSSAVQLISSNYPFWNRSQGSDHVFVASHDYGACFHA 203
Query: 195 ---RATE-GVPFLIKNAIRVVC------SPSYDVAFIPHKDVALPQVLQP---------- 234
RA E G+P +K +I + P DV ++V +P + P
Sbjct: 204 MEERAMEDGIPEFLKRSIILQTFGVKFNHPCQDV-----ENVVIPPYISPGSVRTTLEKY 258
Query: 235 --------FALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAI 286
+A RG +V + + G + + K+R ++ R + D + +R
Sbjct: 259 PLTGRRDIWAFFRGKMEVHPKNISG---RYYSKKVRTVIWRKYSGDRRFYLQRHRFAG-- 313
Query: 287 GPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAV 346
YQ ++ FC+CP G S R+ +S+ GCVPVI+++ LPF + W + ++
Sbjct: 314 ----YQSEIVRSVFCLCPLGWAPWSPRLVESVALGCVPVIIADGIRLPFPTAVRWSEISL 369
Query: 347 ILNERDVYQLKQILKNISDAEFVSLHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
+ E+DV L +L ++ ++ NL V++ +N P DA V+Y L
Sbjct: 370 TVAEKDVANLGTLLDQVAATNLSAIQKNLWDPDVRRALLFNDPVQGGDATWQVLYAL 426
>gi|115481310|ref|NP_001064248.1| Os10g0180000 [Oryza sativa Japonica Group]
gi|122212636|sp|Q33AH8.2|GT101_ORYSJ RecName: Full=Probable glucuronosyltransferase GUT1; AltName:
Full=Glucuronoxylan glucuronosyltransferase 1;
Short=OsGUT1
gi|110288701|gb|ABB46947.2| exostosin family protein, putative, expressed [Oryza sativa
Japonica Group]
gi|113638857|dbj|BAF26162.1| Os10g0180000 [Oryza sativa Japonica Group]
Length = 417
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 94/351 (26%), Positives = 149/351 (42%), Gaps = 48/351 (13%)
Query: 86 VTNFKIYIY---PDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP 142
V K+Y+Y P + N + R L+ +A+E + + + S T +PD A F+ P
Sbjct: 48 VGKLKVYVYELPPKYNKNIVAKDSRCLSHMFATEIFMHRFLLSSAIRTSNPDEADWFYTP 107
Query: 143 I--SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV------ 194
+ +C +++ + + +PYWNRT GADHFFV HD
Sbjct: 108 VYTTCDLTPWGHPLTTKSPRMMRSAIKFISKYWPYWNRTEGADHFFVVPHDFAACFYFQE 167
Query: 195 -RATE-GVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV---------LQPFALPRG--- 240
+A E G+ +++ A V + A + + +P L P PR
Sbjct: 168 AKAIERGILPVLRRATLVQTFGQKNHACLKDGSITVPPYTPAHKIRAHLVPPETPRSIFV 227
Query: 241 -----GRDVENRTVLGFWAGHRNSKIRVILARVWEN---DTELDISNNRINRAIGPLVYQ 292
D N G++A R A VWEN + DIS + P Y
Sbjct: 228 YFRGLFYDTSNDPEGGYYA-------RGARASVWENFKNNPMFDISTDH------PQTYY 274
Query: 293 RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERD 352
+ FC+CP G S R+ +++ +GC+PVI+++ LPF+D + W + AV + E D
Sbjct: 275 EDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFSDAIPWEEIAVFVAEDD 334
Query: 353 VYQLKQILKNISDAEFVSLHNNLVK--VQKHFQWNSPPVRFDAFHMVMYEL 401
V QL IL +I + L + +++ + P D FH VM L
Sbjct: 335 VPQLDTILTSIPTEVILRKQAMLAEPSMKQTMLFPQPAEPGDGFHQVMNAL 385
>gi|218184216|gb|EEC66643.1| hypothetical protein OsI_32904 [Oryza sativa Indica Group]
gi|222612524|gb|EEE50656.1| hypothetical protein OsJ_30886 [Oryza sativa Japonica Group]
Length = 400
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 94/351 (26%), Positives = 149/351 (42%), Gaps = 48/351 (13%)
Query: 86 VTNFKIYIY---PDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP 142
V K+Y+Y P + N + R L+ +A+E + + + S T +PD A F+ P
Sbjct: 31 VGKLKVYVYELPPKYNKNIVAKDSRCLSHMFATEIFMHRFLLSSAIRTSNPDEADWFYTP 90
Query: 143 I--SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV------ 194
+ +C +++ + + +PYWNRT GADHFFV HD
Sbjct: 91 VYTTCDLTPWGHPLTTKSPRMMRSAIKFISKYWPYWNRTEGADHFFVVPHDFAACFYFQE 150
Query: 195 -RATE-GVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV---------LQPFALPRG--- 240
+A E G+ +++ A V + A + + +P L P PR
Sbjct: 151 AKAIERGILPVLRRATLVQTFGQKNHACLKDGSITVPPYTPAHKIRAHLVPPETPRSIFV 210
Query: 241 -----GRDVENRTVLGFWAGHRNSKIRVILARVWEN---DTELDISNNRINRAIGPLVYQ 292
D N G++A R A VWEN + DIS + P Y
Sbjct: 211 YFRGLFYDTSNDPEGGYYA-------RGARASVWENFKNNPMFDISTDH------PQTYY 257
Query: 293 RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERD 352
+ FC+CP G S R+ +++ +GC+PVI+++ LPF+D + W + AV + E D
Sbjct: 258 EDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFSDAIPWEEIAVFVAEDD 317
Query: 353 VYQLKQILKNISDAEFVSLHNNLVK--VQKHFQWNSPPVRFDAFHMVMYEL 401
V QL IL +I + L + +++ + P D FH VM L
Sbjct: 318 VPQLDTILTSIPTEVILRKQAMLAEPSMKQTMLFPQPAEPGDGFHQVMNAL 368
>gi|225456511|ref|XP_002284685.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L [Vitis
vinifera]
gi|147860849|emb|CAN83159.1| hypothetical protein VITISV_022554 [Vitis vinifera]
gi|297734521|emb|CBI15768.3| unnamed protein product [Vitis vinifera]
Length = 416
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 99/354 (27%), Positives = 153/354 (43%), Gaps = 54/354 (15%)
Query: 86 VTNFKIYIY--PDG-DPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP 142
V K+++Y P + + PR LT +A+E + + + S T +P+ A F+ P
Sbjct: 47 VGRLKVFVYELPSKYNKKILQKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTP 106
Query: 143 I--SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV------ 194
+ +C +++ + + + +PYWNRT GADHFFV HD G
Sbjct: 107 VYTTCDLTPNGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQE 166
Query: 195 -RATE-GVPFLIKNAIRV--------VCSPSYDVAFIPHKDVALPQVLQPFALP------ 238
+A E G+ L++ A V VC + P+ A PQ +Q +P
Sbjct: 167 EKAIERGILHLLQRATLVQTFGQRNHVCLKEGSITIPPY---APPQKMQSHLIPPDTPRS 223
Query: 239 -----RGG-RDVENRTVLGFWAGHRNSKIRVILARVWEN--DTEL-DISNNRINRAIGPL 289
RG DV N G++A R A VWEN D L DIS P+
Sbjct: 224 IFVYFRGLFYDVGNDPEGGYYA-------RGARAAVWENFKDNPLFDISTEH------PM 270
Query: 290 VYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
Y + FC+CP G S R+ + + +GC+PVI+++ LPF D + W + V +
Sbjct: 271 TYYEDMQRAVFCLCPLGWAPWSPRLVEGVIFGCIPVIIADDIVLPFADAIPWEEIGVFVA 330
Query: 350 ERDVYQLKQILKNISDAEFVSLHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
E DV L IL +I + L +++ + P DAFH ++ L
Sbjct: 331 EEDVPNLDTILTSIPPEVILRKQRLLANPSMKQAMLFPQPAQPGDAFHQILNGL 384
>gi|21671946|gb|AAM74308.1|AC083944_26 Unknown protein similar to exostosin-2 [Oryza sativa Japonica
Group]
gi|110288700|gb|ABB46946.2| exostosin family protein, putative, expressed [Oryza sativa
Japonica Group]
Length = 401
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 94/351 (26%), Positives = 149/351 (42%), Gaps = 48/351 (13%)
Query: 86 VTNFKIYIY---PDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP 142
V K+Y+Y P + N + R L+ +A+E + + + S T +PD A F+ P
Sbjct: 32 VGKLKVYVYELPPKYNKNIVAKDSRCLSHMFATEIFMHRFLLSSAIRTSNPDEADWFYTP 91
Query: 143 I--SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV------ 194
+ +C +++ + + +PYWNRT GADHFFV HD
Sbjct: 92 VYTTCDLTPWGHPLTTKSPRMMRSAIKFISKYWPYWNRTEGADHFFVVPHDFAACFYFQE 151
Query: 195 -RATE-GVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQV---------LQPFALPRG--- 240
+A E G+ +++ A V + A + + +P L P PR
Sbjct: 152 AKAIERGILPVLRRATLVQTFGQKNHACLKDGSITVPPYTPAHKIRAHLVPPETPRSIFV 211
Query: 241 -----GRDVENRTVLGFWAGHRNSKIRVILARVWEN---DTELDISNNRINRAIGPLVYQ 292
D N G++A R A VWEN + DIS + P Y
Sbjct: 212 YFRGLFYDTSNDPEGGYYA-------RGARASVWENFKNNPMFDISTDH------PQTYY 258
Query: 293 RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERD 352
+ FC+CP G S R+ +++ +GC+PVI+++ LPF+D + W + AV + E D
Sbjct: 259 EDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFSDAIPWEEIAVFVAEDD 318
Query: 353 VYQLKQILKNISDAEFVSLHNNLVK--VQKHFQWNSPPVRFDAFHMVMYEL 401
V QL IL +I + L + +++ + P D FH VM L
Sbjct: 319 VPQLDTILTSIPTEVILRKQAMLAEPSMKQTMLFPQPAEPGDGFHQVMNAL 369
>gi|242061766|ref|XP_002452172.1| hypothetical protein SORBIDRAFT_04g021180 [Sorghum bicolor]
gi|241932003|gb|EES05148.1| hypothetical protein SORBIDRAFT_04g021180 [Sorghum bicolor]
Length = 432
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 93/337 (27%), Positives = 148/337 (43%), Gaps = 59/337 (17%)
Query: 104 QTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTVI 161
+ PR L +A+E + + + S T +P+ A F+ P+ +C +
Sbjct: 84 KDPRCLHHMFAAEIFMHRFLLSSAVRTLNPEEADWFYTPVYTTCDLTPAGLPLPFKSPRM 143
Query: 162 VKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRV--- 210
++ + + NK+P+WNRT GADHFFV HD G +ATE G+ +++ A V
Sbjct: 144 MRSAIKFISNKWPFWNRTDGADHFFVVPHDFGACFHYQEEKATERGILPMLRRATLVQTF 203
Query: 211 -----VCSPSYDVAFIPHKDVALPQVLQPFALP-----------RG-----GRDVENRTV 249
VC + P+ A PQ +Q +P RG G D E
Sbjct: 204 GQKNHVCLKEGSIIIPPY---APPQKMQAHLVPPDTPRSIFVYFRGLFYDNGNDPEG--- 257
Query: 250 LGFWAGHRNSKIRVILARVWEN---DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGG 306
G++A R A +WEN + DIS + P+ Y + FC+CP G
Sbjct: 258 -GYYA-------RGARASLWENFKNNPLFDISTDH------PVTYYEDMQRAVFCLCPLG 303
Query: 307 SQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDA 366
S R+ +++ +GC+PVI+++ LPF D + W V ++E DV +L IL +I
Sbjct: 304 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEDIGVFVDEEDVPKLDSILTSIPIE 363
Query: 367 EFVSLHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
+ L ++K + P DAFH ++ L
Sbjct: 364 NILRKQRLLANPSMKKAMLFPQPAQPRDAFHQILNGL 400
>gi|449496198|ref|XP_004160070.1| PREDICTED: LOW QUALITY PROTEIN: probable glucuronoxylan
glucuronosyltransferase IRX7-like [Cucumis sativus]
Length = 459
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 83/324 (25%), Positives = 147/324 (45%), Gaps = 41/324 (12%)
Query: 84 EMVTNFKIYIY--PDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFI 141
E++ K+++Y P + R +ASE + + S + T DP A FF+
Sbjct: 101 ELLEEVKVFVYDLPPKYNVEWLSNERCSNHLFASEVAIHRALLNSHYRTFDPLEADFFFV 160
Query: 142 PI--SCHKMRGKG-TSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVG----- 193
P+ SC+ G + + ++ V + + Y +WNRT G+DH FV HD
Sbjct: 161 PVYVSCNFSTVNGFPAIGHARSLISSAVSHISSHYSFWNRTNGSDHVFVASHDFASCFHT 220
Query: 194 ---VRATEGVPFLIKNAIRVVC------SPSYDVA------FIPHKDVALPQVLQP---- 234
V +GVP +KN+I + P DV +IP + + P
Sbjct: 221 MEHVAIADGVPSFLKNSIILQTFGVKYKHPCQDVEHVVIPPYIPPESIENTLERSPVTGR 280
Query: 235 ---FALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVY 291
FA RG ++ + V G + + K+R ++ R + D + +R Y
Sbjct: 281 RDIFAFFRGKMEMNPKNVSGRFY---SKKVRTMIWRKFNGDRRFYLQRHRFPG------Y 331
Query: 292 QRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNER 351
Q ++ FC+CP G S R+ +S+ GCVPVI+++ LPF ++W + ++ + E+
Sbjct: 332 QSEIVRSVFCLCPLGWAPWSPRLVESVALGCVPVIIADGIRLPFPSAVNWPEISITVAEK 391
Query: 352 DVYQLKQILKNISDAEFVSLHNNL 375
D+ +L +IL +++ + ++ NL
Sbjct: 392 DIGKLGRILDHVAGSNLTTIQKNL 415
>gi|23821292|dbj|BAC20928.1| pectin-glucuronyltransferase [Nicotiana plumbaginifolia]
Length = 341
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 93/324 (28%), Positives = 142/324 (43%), Gaps = 51/324 (15%)
Query: 113 YASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTVIVKDYVDSLI 170
+A+E Y + + S T +P+ A F+ P+ +C +++ + +
Sbjct: 2 FAAEIYMHRFLLSSAVRTFNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIA 61
Query: 171 NKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRV--------VCSP 214
+ +PYWNRT GADHFF+T HD G +A E G+ L++ A V VC
Sbjct: 62 SNWPYWNRTEGADHFFITPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLK 121
Query: 215 SYDVAFIPHKDVALPQVLQPFALP-----------RG-GRDVENRTVLGFWAGHRNSKIR 262
+ P+ A PQ +Q +P RG DV N G++A R
Sbjct: 122 DGSITIPPY---APPQKMQSHLIPPDTPRSIFVYFRGLFYDVGNDPEGGYYA-------R 171
Query: 263 VILARVWEN--DTEL-DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIH 319
A VWEN D L DIS P Y + FC+CP G S R+ +++
Sbjct: 172 GARAAVWENFKDNPLFDISTEH------PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVI 225
Query: 320 YGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLV--K 377
+GC+PVI+++ LPF D + W V + E+DV L IL +I E + L
Sbjct: 226 FGCIPVIIADDIVLPFADAIPWEDIGVFVAEKDVPNLDTILTSIPPEEILRKQRLLANPS 285
Query: 378 VQKHFQWNSPPVRFDAFHMVMYEL 401
+++ + P DAFH ++ L
Sbjct: 286 MKQAMLFPQPAQSGDAFHQILNGL 309
>gi|386576430|gb|AFJ12119.1| glycosyltransferase, partial [Nicotiana tabacum]
Length = 317
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 92/321 (28%), Positives = 141/321 (43%), Gaps = 51/321 (15%)
Query: 113 YASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTVIVKDYVDSLI 170
+A+E Y + + S T +P+ A F+ P+ +C +++ + +
Sbjct: 2 FAAEIYMHRFLLSSAVRTFNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIA 61
Query: 171 NKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRV--------VCSP 214
+ +PYWNRT GADHFF+T HD G +A E G+ L++ A V VC
Sbjct: 62 SNWPYWNRTEGADHFFITPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLK 121
Query: 215 SYDVAFIPHKDVALPQVLQPFALP-----------RG-GRDVENRTVLGFWAGHRNSKIR 262
+ P+ A PQ +Q +P RG DV N G++A R
Sbjct: 122 DGSITIPPY---APPQKMQSHLIPPDTPRSIFVYFRGLFYDVGNDPEGGYYA-------R 171
Query: 263 VILARVWEN--DTEL-DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIH 319
A VWEN D L DIS P Y + FC+CP G S R+ +++
Sbjct: 172 GARAAVWENFKDNPLFDISTEH------PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVI 225
Query: 320 YGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLV--K 377
+GC+PVI+++ LPF D + W V + E+DV L IL +I E + L
Sbjct: 226 FGCIPVIIADDIVLPFADAIPWEDIGVFVAEKDVPNLDTILTSIPPEEILRKQRLLANPS 285
Query: 378 VQKHFQWNSPPVRFDAFHMVM 398
+++ + P DAFH ++
Sbjct: 286 MKQAMLFPQPAQSGDAFHQIL 306
>gi|301072486|gb|ADK56173.1| glycosyltransferase 47 [Triticum aestivum]
Length = 422
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 95/354 (26%), Positives = 151/354 (42%), Gaps = 54/354 (15%)
Query: 86 VTNFKIYIY--PDG-DPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP 142
V K++IY P + + PR L +A+E + + + S T +P+ A F+ P
Sbjct: 53 VGRLKVFIYDLPGKYNKKLLKKDPRCLNHMFAAEIFMHRFLLSSAVRTTNPEEADWFYTP 112
Query: 143 I--SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV------ 194
+ +C +++ ++ + K+PYWNR+ GADHFFVT HD G
Sbjct: 113 VYPTCDLTPSGLPLPFKSPRMMRSAIELIATKWPYWNRSEGADHFFVTPHDFGACFHYQE 172
Query: 195 --RATEGVPFLIKNAIRV--------VCSPSYDVAFIPHKDVALPQVLQ----PFALPRG 240
G+ L++ A V VC + P A PQ +Q P PR
Sbjct: 173 EKAIGRGILPLLQRATLVQTFGQKNHVCLKEGSITIPP---FAPPQKMQNHLIPGETPRS 229
Query: 241 --------GRDVENRTVLGFWAGHRNSKIRVILARVWEN---DTELDISNNRINRAIGPL 289
D N G++A R A VWEN + DIS + P
Sbjct: 230 IFVYFRGLFYDTGNDPEGGYYA-------RGARASVWENFKNNPLFDISTDH------PP 276
Query: 290 VYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
Y ++ FC+CP G S R+ +++ +GC+PVI+++ LPF D + W + V +
Sbjct: 277 TYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVP 336
Query: 350 ERDVYQLKQILKNISDAEFVSLHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
E DV +L IL +I + + L +++ + P DAFH ++ L
Sbjct: 337 EEDVPRLDSILTSIPTEDILRKQRLLANPSMKQAMLFPQPAQAGDAFHQILNGL 390
>gi|150246869|emb|CAM91796.1| putative glycosyltransferase family 47 [Triticum aestivum]
Length = 420
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 90/333 (27%), Positives = 143/333 (42%), Gaps = 51/333 (15%)
Query: 104 QTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTVI 161
+ PR L +A+E + + + S T +P+ A F+ P+ +C +
Sbjct: 72 KDPRCLNHMFAAEIFMHRFLLSSAVRTTNPEEADWFYTPVYPTCDLTPSGLPLPFKSPRM 131
Query: 162 VKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV--------RATEGVPFLIKNAIRV--- 210
++ ++ + K+PYWNR+ GADHFFVT HD G G+ L++ A V
Sbjct: 132 MRSAIELIATKWPYWNRSEGADHFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTF 191
Query: 211 -----VCSPSYDVAFIPHKDVALPQVLQ----PFALPRG--------GRDVENRTVLGFW 253
VC + P A PQ +Q P PR D N G++
Sbjct: 192 GQKNHVCLKEGSITIPP---FAPPQKMQNHLIPGETPRSIFVYFRGLFYDTGNDPEGGYY 248
Query: 254 AGHRNSKIRVILARVWEN---DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVN 310
A R A VWEN + DIS + P Y ++ FC+CP G
Sbjct: 249 A-------RGARASVWENFKNNPLFDISTDH------PPTYYEDMQRSVFCLCPLGWAPW 295
Query: 311 SARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVS 370
S R+ +++ +GC+PVI+++ LPF D + W + V + E DV +L IL +I + +
Sbjct: 296 SPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVPEEDVPRLDSILTSIPTEDILR 355
Query: 371 LHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
L +++ + P DAFH ++ L
Sbjct: 356 KQRLLANPSMKQAMLFPQPAQAGDAFHQILNGL 388
>gi|326488903|dbj|BAJ98063.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 421
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 95/354 (26%), Positives = 151/354 (42%), Gaps = 54/354 (15%)
Query: 86 VTNFKIYIY--PDG-DPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP 142
V K++IY P + + PR L +A+E + + + S T +P+ A F+ P
Sbjct: 52 VGRLKVFIYDLPGKYNKKLLKKDPRCLNHMFAAEIFMHRFLLSSAVRTTNPEEADWFYTP 111
Query: 143 I--SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV------ 194
+ +C +++ ++ + K+PYWNR+ GADHFFVT HD G
Sbjct: 112 VYPTCDLTPSGLPLPFKSPRMMRSAIELIATKWPYWNRSEGADHFFVTPHDFGACFHYQE 171
Query: 195 --RATEGVPFLIKNAIRV--------VCSPSYDVAFIPHKDVALPQVLQ----PFALPRG 240
G+ L++ A V VC + P A PQ +Q P PR
Sbjct: 172 EKAIGRGILPLLQRATLVQTFGQKNHVCLKDGSITIPP---FAPPQKMQNHLIPGETPRS 228
Query: 241 --------GRDVENRTVLGFWAGHRNSKIRVILARVWEN---DTELDISNNRINRAIGPL 289
D N G++A R A VWEN + DIS + P
Sbjct: 229 IFVYFRGLFYDTGNDPEGGYYA-------RGARASVWENFKNNPLFDISTDH------PP 275
Query: 290 VYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
Y ++ FC+CP G S R+ +++ +GC+PVI+++ LPF D + W + V +
Sbjct: 276 TYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVP 335
Query: 350 ERDVYQLKQILKNISDAEFVSLHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
E DV +L IL +I + + L +++ + P DAFH ++ L
Sbjct: 336 EEDVPRLDSILTSIPTEDILRKQRLLANPSMKQAMLFPQPAQAGDAFHQILNGL 389
>gi|242088703|ref|XP_002440184.1| hypothetical protein SORBIDRAFT_09g027440 [Sorghum bicolor]
gi|241945469|gb|EES18614.1| hypothetical protein SORBIDRAFT_09g027440 [Sorghum bicolor]
Length = 415
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 96/354 (27%), Positives = 152/354 (42%), Gaps = 54/354 (15%)
Query: 86 VTNFKIYIY--PDG-DPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP 142
V K+Y+Y P + + PR L +A+E + + + S T +P+ A F+ P
Sbjct: 46 VGRLKVYVYDLPSKYNKKLLKKDPRCLNHMFAAEIFMHRFLLSSAVRTFNPEEADWFYTP 105
Query: 143 I-SCHKMRGKGTSYE-NMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV------ 194
+ + + KG +++ + + +PYWNR+ GADHFFVT HD G
Sbjct: 106 VYTTCDLTPKGLPLPFKSPRMMRSAIQLIATNWPYWNRSEGADHFFVTPHDFGACFHYQE 165
Query: 195 --RATEGVPFLIKNAIRV--------VCSPSYDVAFIPHKDVALPQVLQPFALP------ 238
G+ L++ A V VC + P+ A PQ +Q +P
Sbjct: 166 EKAIGRGILPLLQRATLVQTFGQKNHVCLKDGSITIPPY---APPQKMQTHLIPADTPRS 222
Query: 239 -----RGG-RDVENRTVLGFWAGHRNSKIRVILARVWEN---DTELDISNNRINRAIGPL 289
RG D N G++A R A VWEN + DIS + P
Sbjct: 223 IFVYFRGLFYDTGNDPEGGYYA-------RGARASVWENFKNNPLFDISTDH------PP 269
Query: 290 VYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
Y ++ FC+CP G S R+ +++ +GC+PVI+++ LPF D + W + V +
Sbjct: 270 TYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVA 329
Query: 350 ERDVYQLKQILKNISDAEFVSLHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
E DV QL IL +I + L +++ + P DAFH ++ L
Sbjct: 330 EEDVPQLDSILTSIPTDVVLRKQRLLANPSMKQAMLFPQPAQPGDAFHQILNGL 383
>gi|225436482|ref|XP_002275679.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase IRX7
[Vitis vinifera]
gi|297734915|emb|CBI17149.3| unnamed protein product [Vitis vinifera]
Length = 460
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 91/354 (25%), Positives = 160/354 (45%), Gaps = 39/354 (11%)
Query: 81 NYLEMVTNFKIYIY--PDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHL 138
++ +++ + K+YIY P + R +ASE + ++ES T DP A
Sbjct: 91 DHQDLLKDLKVYIYDLPSKYNVDWLSNERCSNHLFASEVALHKALQESDVRTFDPWEADF 150
Query: 139 FFIPI--SCHKMRGKG-TSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVG-- 193
FF+P+ SC+ G + + ++ + + + P+WNR+LGADH FV HD G
Sbjct: 151 FFVPVYVSCNFSTVNGFPAIGHARPLLASAIQHISTQLPFWNRSLGADHVFVASHDYGAC 210
Query: 194 ------VRATEGVPFLIKNAIRVVC------SPSYDVAFIPHKDVALPQVLQPF--ALPR 239
V +G+P +K +I + P DV + P+ +Q + P
Sbjct: 211 FHAMEDVARADGIPEFLKKSIILQTFGVKHQHPCQDVENVLIPPYVSPEKVQSTLDSAPA 270
Query: 240 GGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDI----SNNRI-----NRAIGPLV 290
G+ R + F+ G + I R + I NR +R G
Sbjct: 271 NGQ----RDIWVFFRGKMEVHPKNISGRFYSKAVRTAIWQKYGGNRKFYLKRHRFAG--- 323
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
YQ ++ FC+CP G S R+ +S+ GCVPVI+++ LPF++ + W + ++ + E
Sbjct: 324 YQSEIVRSVFCLCPLGWAPWSPRLVESVVLGCVPVIIADGIRLPFSEAIRWPEISLTVAE 383
Query: 351 RDVYQLKQILKNISDAEFVSLHNNL--VKVQKHFQWNSPPVRFDAFHMVMYELW 402
+DV +L IL++++ ++ NL + ++ +N+ DA V+ LW
Sbjct: 384 KDVGKLGMILEDVAATNLSTIQKNLWDPENKRALLFNNQVQEGDATWQVLNALW 437
>gi|414864293|tpg|DAA42850.1| TPA: hypothetical protein ZEAMMB73_024068 [Zea mays]
Length = 434
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 83/300 (27%), Positives = 140/300 (46%), Gaps = 41/300 (13%)
Query: 133 PDRAHLFFIPI--SCHKMRGKG-TSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTC 189
P+ A LFF+P+ SC+ G S + ++ D VD + + PYWNR+ GADH FV
Sbjct: 125 PEDADLFFVPVYVSCNFSTPNGFPSLSHARGLLADAVDLVRARMPYWNRSAGADHVFVAS 184
Query: 190 HDVG--------VRATEGVPFLIKNAIRV---------VCSPSYDVAFIPH------KDV 226
HD G V +G+P +K +I + VC V PH ++
Sbjct: 185 HDFGACFHPMEDVAIADGIPEFLKRSILLQTFGVQGHHVCQEVEHVVIPPHVPPEVAHEL 244
Query: 227 ALPQVLQP--FALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINR 284
P+ Q FA RG +V + + G + + K+R L + + + + + R +
Sbjct: 245 PEPEKAQRDIFAFFRGKMEVHPKNISGRFY---SKKVRTELLQHYGRNRKFYLKRKRFDN 301
Query: 285 AIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKF 344
Y+ ++ FC+CP G S R+ +S+ GC+PVI+++ LPF +L W++
Sbjct: 302 ------YRSEMARSLFCLCPLGWAPWSPRLVESVLLGCIPVIIADDIRLPFPPVLQWQEI 355
Query: 345 AVILNERDVYQLKQILKNISDAEFVSLHNNL---VKVQKHFQWNSPPVRFDAFHMVMYEL 401
++ + E+D+ L +L ++ + NL VK ++ +N P DA V+ EL
Sbjct: 356 SLQVAEKDIASLGMVLDHVVATNLTVIQKNLWDPVK-RRALVFNRPMEAGDATWQVLREL 414
>gi|21592991|gb|AAM64940.1| unknown [Arabidopsis thaliana]
Length = 415
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 108/414 (26%), Positives = 168/414 (40%), Gaps = 74/414 (17%)
Query: 23 LLTLALVTLLCFTCLSFNSFRSYPLQNNFNSTLSFAINKQDYSDLGSDVFHSPSVFHLNY 82
L + L+ LLC T S ++FR LS + + S DV
Sbjct: 3 LSSCVLIFLLCNTFSSISAFR-----------LSRSQPTERISGSAGDVLED-------- 43
Query: 83 LEMVTNFKIYIY--PDG-DPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLF 139
+ V K+++Y P + + PR L +A+E Y + + S T +P+ A F
Sbjct: 44 -DPVGRLKVFVYELPSKYNKKILQKDPRCLNHMFAAEIYMQRFLLSSPVRTLNPEEADWF 102
Query: 140 FIPI--SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV--- 194
++P+ +C +++ + + + +PYWNRT GADHFFV HD
Sbjct: 103 YVPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIASNWPYWNRTEGADHFFVVPHDFRACFH 162
Query: 195 -----RATEGVPFLIKNAIRV--------VCSPSYDVAFIPHKDVALPQVLQPFALPRGG 241
G+ L++ A V VC + P+ A PQ +Q +P
Sbjct: 163 YQEEKAIGRGILPLLQRATLVQTFGQRNHVCLKEGSITVPPY---APPQKMQSHLIPEKT 219
Query: 242 R------------DVENRTVLGFWAGHRNSKIRVILARVWEN--DTEL-DISNNRINRAI 286
DV N G++A R A VWEN D L DIS
Sbjct: 220 PRSIFVYFRGLFYDVGNDPEGGYYA-------RGARAAVWENFKDNPLFDISTEH----- 267
Query: 287 GPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAV 346
P Y + FC+CP G S R+ +++ +GC+PVI+++ LPF D + W V
Sbjct: 268 -PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPFADAIPWEDIGV 326
Query: 347 ILNERDVYQLKQILKNISDAEFVSLHNNLV--KVQKHFQWNSPPVRFDAFHMVM 398
++E+DV L IL +I + L +++ + P DAFH V+
Sbjct: 327 FVDEKDVPYLDTILTSIPPEVILRKQRLLANPSMKQAMLFPQPAQPGDAFHQVL 380
>gi|60657598|gb|AAX33320.1| secondary cell wall-related glycosyltransferase family 47 [Populus
tremula x Populus tremuloides]
Length = 412
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 99/354 (27%), Positives = 152/354 (42%), Gaps = 54/354 (15%)
Query: 86 VTNFKIYIY--PDG-DPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP 142
V K+Y+Y P + + PR LT +A+E + + + S T +PD A F+ P
Sbjct: 43 VGRLKVYVYELPSKYNKKLLQKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPDEADWFYSP 102
Query: 143 I--SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV------ 194
I +C +++ + + + +PYWNRT GADHFFV HD G
Sbjct: 103 IYPTCDLTPMGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHCQE 162
Query: 195 -RATE-GVPFLIKNAIRV--------VCSPSYDVAFIPHKDVALPQVLQPFALP------ 238
+A E G+ L++ + V VC + P A PQ +Q +P
Sbjct: 163 EKAVERGILPLLQRSTLVQTFGRRNHVCLNEGSITIPP---FAPPQKMQAHQIPPDIPRS 219
Query: 239 -----RGG-RDVENRTVLGFWAGHRNSKIRVILARVWEN---DTELDISNNRINRAIGPL 289
RG DV N G++A R A VWEN + DIS + P
Sbjct: 220 IFVYFRGLFYDVNNDPEGGYYA-------RGARAAVWENFKNNPLFDISTDH------PT 266
Query: 290 VYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
Y + FC+CP G S R+ +++ +GC+PVI+++ LPF D + W + V +
Sbjct: 267 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVA 326
Query: 350 ERDVYQLKQILKNISDAEFVSLHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
E DV L IL +I + L +++ + P DAFH ++ L
Sbjct: 327 EEDVPNLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 380
>gi|42570324|ref|NP_850113.2| exostosin-like protein [Arabidopsis thaliana]
gi|75216857|sp|Q9ZUV3.1|IRX7_ARATH RecName: Full=Probable glucuronoxylan glucuronosyltransferase IRX7;
AltName: Full=Protein FRAGILE FIBER 8; AltName:
Full=Protein IRREGULAR XYLEM 7
gi|4063747|gb|AAC98455.1| hypothetical protein [Arabidopsis thaliana]
gi|77022037|gb|ABA60868.1| putative glucuronyltransferase [Arabidopsis thaliana]
gi|77022039|gb|ABA60869.1| putative glucuronyltransferase [Arabidopsis thaliana]
gi|330252987|gb|AEC08081.1| exostosin-like protein [Arabidopsis thaliana]
Length = 448
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 104/422 (24%), Positives = 186/422 (44%), Gaps = 61/422 (14%)
Query: 31 LLCFTCLSFNSFRSYPLQNNFNSTLSFAINKQDYSDLGSDVFHSPSVF----HLNYL--- 83
+LCF L+ F S+ + ++ + S +I+K ++ +F S ++F H + L
Sbjct: 22 ILCFI-LTLYFFASFFVDHDQDHRSSTSISKHLLTNHKPKLFASRAMFESKIHDHKLGFT 80
Query: 84 --------EMVTNFKIYIY--PDGDPNTFYQTPRKLTGKYASEGYFFQNIR--ESRFLTH 131
++ N KIY+Y P + R +A+E + E T
Sbjct: 81 SQQPNIKTDVFNNLKIYVYDLPSKFNKDWLANDRCTNHLFAAEVALHKAFLSLEGDVRTE 140
Query: 132 DPDRAHLFFIPI--SCHKMRGKG-TSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVT 188
DP A FF+P+ SC+ G + + ++ D + + +YP+WNRT G+DH F
Sbjct: 141 DPYEADFFFVPVYVSCNFSTINGFPAIGHARSLINDAIKLVSTQYPFWNRTSGSDHVFTA 200
Query: 189 CHDVGV-------RA-TEGVPFLIKNAIRVVCSPSYDVAF----IPHKDVALPQVLQPFA 236
HD G RA +GVP ++N+I + ++ V F ++V +P + P +
Sbjct: 201 THDFGSCFHTMEDRAIADGVPIFLRNSIIL---QTFGVTFNHPCQEVENVVIPPYISPES 257
Query: 237 LPRGGRDV---ENRTVLGFWAGHR------------NSKIRVILARVWENDTELDISNNR 281
L + +++ + R + F+ G + ++R + R + D + R
Sbjct: 258 LHKTQKNIPVTKERDIWVFFRGKMELHPKNISGRFYSKRVRTNIWRSYGGDRRFYLQRQR 317
Query: 282 INRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDW 341
YQ ++ FC+CP G S R+ +S+ GCVPVI+++ LPF + W
Sbjct: 318 FAG------YQSEIARSVFCLCPLGWAPWSPRLVESVALGCVPVIIADGIRLPFPSTVRW 371
Query: 342 RKFAVILNERDVYQLKQILKNISDAEFVSLHNNLV--KVQKHFQWNSPPVRFDAFHMVMY 399
++ + ERDV +L IL++++ + NL V++ +N P DA V+
Sbjct: 372 PDISLTVAERDVGKLGDILEHVAATNLSVIQRNLEDPSVRRALMFNVPSREGDATWQVLE 431
Query: 400 EL 401
L
Sbjct: 432 AL 433
>gi|297797147|ref|XP_002866458.1| hypothetical protein ARALYDRAFT_496352 [Arabidopsis lyrata subsp.
lyrata]
gi|297312293|gb|EFH42717.1| hypothetical protein ARALYDRAFT_496352 [Arabidopsis lyrata subsp.
lyrata]
Length = 417
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 95/351 (27%), Positives = 148/351 (42%), Gaps = 54/351 (15%)
Query: 86 VTNFKIYIY--PDG-DPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP 142
V K+++Y P + + PR L +A+E Y + + S T +P+ A F++P
Sbjct: 48 VGRLKVFVYELPSKYNKKILQKDPRCLNHMFAAEIYMQRFLLSSPVRTLNPEEADWFYVP 107
Query: 143 I--SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV------ 194
+ +C +++ + + + +PYWNRT GADHFFV HD G
Sbjct: 108 VYTTCDLTPNGLPLPFKSPRMMRSAIQLIASNWPYWNRTEGADHFFVVPHDFGACFHYQE 167
Query: 195 --RATEGVPFLIKNAIRV--------VCSPSYDVAFIPHKDVALPQVLQPFALPRGGR-- 242
G+ L++ A V VC + P+ A PQ +Q +P
Sbjct: 168 EKAIGRGILPLLQRATLVQTFGQRNHVCLKEGSITVPPY---APPQKMQSHLIPEKTPRS 224
Query: 243 ----------DVENRTVLGFWAGHRNSKIRVILARVWEN--DTEL-DISNNRINRAIGPL 289
DV N G++A R A VWEN D L DIS P
Sbjct: 225 IFVYFRGLFYDVGNDPEGGYYA-------RGARAAVWENFKDNPLFDISTEH------PT 271
Query: 290 VYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
Y + FC+CP G S R+ +++ +GC+PVI+++ LPF D + W V ++
Sbjct: 272 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPFADAIPWEDIGVFVD 331
Query: 350 ERDVYQLKQILKNISDAEFVSLHNNLV--KVQKHFQWNSPPVRFDAFHMVM 398
E+DV L IL +I + L +++ + P DAFH V+
Sbjct: 332 EKDVPYLDTILTSIPPEVILRKQRLLANPSMKQAMLFPQPAQPGDAFHQVL 382
>gi|224053575|ref|XP_002297880.1| predicted protein [Populus trichocarpa]
gi|222845138|gb|EEE82685.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 99/354 (27%), Positives = 152/354 (42%), Gaps = 54/354 (15%)
Query: 86 VTNFKIYIY--PDG-DPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP 142
V K+Y+Y P + + PR LT +A+E + + + S T +PD A F+ P
Sbjct: 43 VGRLKVYVYELPSKYNKKLLQKDPRCLTHMFAAEIFMHRFLLSSPVRTLNPDEADWFYSP 102
Query: 143 I--SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV------ 194
I +C +++ + + + +PYWNRT GADHFFV HD G
Sbjct: 103 IYPTCDLTPMGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQE 162
Query: 195 -RATE-GVPFLIKNAIRV--------VCSPSYDVAFIPHKDVALPQVLQPFALP------ 238
+A E G+ L++ + V VC + P A PQ +Q +P
Sbjct: 163 EKAIERGILPLLQRSTLVQTFGQRNHVCLNEGSITIPP---FAPPQKMQAHQIPPDIPRS 219
Query: 239 -----RGG-RDVENRTVLGFWAGHRNSKIRVILARVWEN---DTELDISNNRINRAIGPL 289
RG DV N G++A R A VWEN + DIS + P
Sbjct: 220 IFVYFRGLFYDVNNDPEGGYYA-------RGARAAVWENFKNNPLFDISTDH------PT 266
Query: 290 VYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
Y + FC+CP G S R+ +++ +GC+PVI+++ LPF D + W + V +
Sbjct: 267 TYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVA 326
Query: 350 ERDVYQLKQILKNISDAEFVSLHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
E DV L IL +I + L +++ + P DAFH ++ L
Sbjct: 327 EEDVPNLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPGDAFHQILNGL 380
>gi|195648072|gb|ACG43504.1| secondary cell wall-related glycosyltransferase family 47 [Zea
mays]
Length = 420
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 95/330 (28%), Positives = 144/330 (43%), Gaps = 51/330 (15%)
Query: 107 RKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP--ISCHKMRGKGTSYENMTVIVKD 164
R L +A+E + Q + S T DP+ A F+ P +C I++
Sbjct: 75 RCLHHMFAAEIFMHQFLLSSAVRTLDPEEADWFYTPAYTTCDLTPQGFPLPFRAPRIMRS 134
Query: 165 YVDSLINKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRV------ 210
+ + +P+WNRT GADHFF+T HD G RA E G+ L++ A V
Sbjct: 135 AIRYVATTWPFWNRTDGADHFFLTPHDFGACFHYQEERAMERGILPLLRRATLVQTFGQR 194
Query: 211 --VCSPSYDVAFIPHKDVALPQVLQPFALPRGGR------------DVENRTVLGFWAGH 256
VC + P+ D P+ +Q + G D+ N G++A
Sbjct: 195 NHVCLQDGSITVPPYAD---PRKMQAHLISPGTPRSIFVYFRGLFYDMGNDPEGGYYA-- 249
Query: 257 RNSKIRVILARVWEN--DTEL-DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSAR 313
R A VWEN D L DIS P Y + FC+CP G S R
Sbjct: 250 -----RGARASVWENFKDNPLFDISTEH------PSTYYEDMQRAIFCLCPLGWAPWSPR 298
Query: 314 IADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHN 373
+ +++ +GC+PVI+++ LPF D + W +V + ERDV +L IL +I + +
Sbjct: 299 LVEAVVFGCIPVIIADDIVLPFADAIPWEDISVFVAERDVPRLDSILTSIPLPDILRRQR 358
Query: 374 NLVK--VQKHFQWNSPPVRFDAFHMVMYEL 401
L + V++ ++ P DAFH V+ L
Sbjct: 359 LLARDSVKQALLFHQPARPGDAFHQVLNGL 388
>gi|212276027|ref|NP_001130448.1| uncharacterized protein LOC100191546 precursor [Zea mays]
gi|194689154|gb|ACF78661.1| unknown [Zea mays]
gi|413946366|gb|AFW79015.1| putative Secondary cell wall glycosyltransferase family 47 [Zea
mays]
Length = 415
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 96/354 (27%), Positives = 152/354 (42%), Gaps = 54/354 (15%)
Query: 86 VTNFKIYIY--PDG-DPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP 142
V K+Y+Y P + + PR L +A+E + + + S T +P+ A F+ P
Sbjct: 46 VGRLKVYVYDLPSKYNKKLLKKDPRCLNHMFAAEIFMHRFLLSSAVRTFNPEEADWFYTP 105
Query: 143 I-SCHKMRGKGTSYE-NMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV------ 194
+ + + KG +++ + + +PYWNR+ GADHFFVT HD G
Sbjct: 106 VYTTCDLTPKGLPLPFKSPRMMRSAIQLIATNWPYWNRSEGADHFFVTPHDFGACFHYQE 165
Query: 195 --RATEGVPFLIKNAIRV--------VCSPSYDVAFIPHKDVALPQVLQPFALP------ 238
G+ L++ A V VC + P+ A PQ +Q +P
Sbjct: 166 EKAIGRGILPLLQRATLVQTFGQKNHVCLKDGSITIPPY---APPQKMQTHLIPPDTPRS 222
Query: 239 -----RGG-RDVENRTVLGFWAGHRNSKIRVILARVWEN---DTELDISNNRINRAIGPL 289
RG D N G++A R A VWEN + DIS + P
Sbjct: 223 IFVYFRGLFYDTGNDPEGGYYA-------RGARASVWENFKNNPLFDISTDH------PP 269
Query: 290 VYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
Y ++ FC+CP G S R+ +++ +GC+PVI+++ LPF D + W + V +
Sbjct: 270 TYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVA 329
Query: 350 ERDVYQLKQILKNISDAEFVSLHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
E DV QL IL +I + L +++ + P DAFH ++ L
Sbjct: 330 EDDVPQLDSILTSIPTDVVLRKQRLLANPSMKQAMLFPQPAQPGDAFHQILNGL 383
>gi|24476038|gb|AAN62780.1| Unknown protein [Oryza sativa Japonica Group]
Length = 449
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 93/334 (27%), Positives = 146/334 (43%), Gaps = 69/334 (20%)
Query: 133 PDRAHLFFIPI--SCHKMRGKG-TSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTC 189
PD A LFF+P+ SC+ G S + ++ D VD + + PYWNR+ GADH FV
Sbjct: 121 PDDATLFFVPVYVSCNFSTDNGFPSLSHARALLADAVDLVRAQMPYWNRSAGADHVFVAS 180
Query: 190 HDVG------------------------------VRATEGVPFLIKNAIRV--------- 210
HD G V +G+P +K +I +
Sbjct: 181 HDFGACFHPMELFVIIHFELGVNAKSNLALGQEDVAIADGIPEFLKRSILLQTFGVQGTH 240
Query: 211 VCSPSYDVAFIPH--KDVAL----PQVLQP--FALPRGGRDVENRTVLGFWAGHRNSKIR 262
VC + V PH +VAL P+ Q FA RG +V + + G + + K+R
Sbjct: 241 VCQEADHVVIPPHVPPEVALELPEPEKAQRDIFAFFRGKMEVHPKNISGRF---YSKKVR 297
Query: 263 VILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGC 322
L + + + + + R Y+ ++ FC+CP G S R+ +S+ GC
Sbjct: 298 TELLQKYGRNRKFYLKRKRYGN------YRSEMARSLFCLCPLGWAPWSPRLVESVLLGC 351
Query: 323 VPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNL---VKVQ 379
+PVI+++ LPF +L W ++ + E+DV L+ +L ++ + NL VK +
Sbjct: 352 IPVIIADDIRLPFPSVLQWLDISLQVAEKDVASLEMVLDHVVATNLTVIQKNLWDPVK-R 410
Query: 380 KHFQWNSPPVRFDAFHMVMYELWL------RRHV 407
K +N P DA V+ EL + RRHV
Sbjct: 411 KALVFNRPMEEGDATWQVLRELEILLDRSQRRHV 444
>gi|357141002|ref|XP_003572039.1| PREDICTED: probable glucuronosyltransferase Os03g0107900-like
[Brachypodium distachyon]
Length = 465
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 83/309 (26%), Positives = 140/309 (45%), Gaps = 47/309 (15%)
Query: 127 RFLTHDPDRAHLFFIPI--SCHKMRGKG-TSYENMTVIVKDYVDSLINKYPYWNRTLGAD 183
R L P+ A LF +P+ SC+ G S ++ ++ + V+ + PYWNR+ G D
Sbjct: 151 RHLRARPEEADLFLVPVYVSCNFSTPTGLPSLKHARGLLAEAVELVRRDMPYWNRSAGTD 210
Query: 184 HFFVTCHDVG--------VRATEGVPFLIKNAI---------RVVCSPSYDVAFIPHKDV 226
H FV HD G V G+P +K +I R C V PH
Sbjct: 211 HVFVASHDFGACFHAMEDVAIAGGIPEFLKRSILLQTFGVQGRHTCQEVEHVVIPPH--- 267
Query: 227 ALPQVLQPFALP-----------RGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTEL 275
LP+V + P RG +V + + G + G K+R L +++ ++ +
Sbjct: 268 VLPEVARELPEPEKSHRDIFAFFRGKMEVHPKNMSGRFYG---KKVRTKLLQLYGHNRKF 324
Query: 276 DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPF 335
+ + + Y+ ++ FC+CP G S R+ +S+ GC+PVI+++ LPF
Sbjct: 325 YLKRKQHDG------YRLEMARSLFCLCPLGWAPWSPRLVESVLLGCIPVIIADNIRLPF 378
Query: 336 NDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNL---VKVQKHFQWNSPPVRFD 392
+L W ++ + ERD+ L+ +L +++ ++ NL VK +K +N P D
Sbjct: 379 PGVLRWPDISLQVAERDIANLEAMLDHVASTNLTTIQGNLWDPVK-RKALVFNQPMEEGD 437
Query: 393 AFHMVMYEL 401
A V+ EL
Sbjct: 438 ATWQVLKEL 446
>gi|75159221|sp|Q8S1X7.1|GT15_ORYSJ RecName: Full=Probable glucuronosyltransferase Os01g0926700
gi|20160728|dbj|BAB89670.1| P0482D04.17 [Oryza sativa Japonica Group]
gi|20805226|dbj|BAB92893.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
gi|125528942|gb|EAY77056.1| hypothetical protein OsI_05014 [Oryza sativa Indica Group]
gi|125573178|gb|EAZ14693.1| hypothetical protein OsJ_04618 [Oryza sativa Japonica Group]
Length = 417
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 94/354 (26%), Positives = 151/354 (42%), Gaps = 54/354 (15%)
Query: 86 VTNFKIYIY--PDG-DPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP 142
V K+Y+Y P + + PR L +A+E + + + S T +P+ A F+ P
Sbjct: 48 VGRLKVYVYDLPSKYNKKLLKKDPRCLNHMFAAEIFMHRFLLSSAVRTFNPEEADWFYTP 107
Query: 143 I--SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV------ 194
+ +C +++ ++ + +PYWNR+ GADHFFVT HD G
Sbjct: 108 VYTTCDLTPSGLPLPFKSPRMMRSAIELIATNWPYWNRSEGADHFFVTPHDFGACFHYQE 167
Query: 195 --RATEGVPFLIKNAIRV--------VCSPSYDVAFIPHKDVALPQVLQPFALP------ 238
G+ L++ A V VC + P+ A PQ +Q +P
Sbjct: 168 EKAIGRGILPLLQRATLVQTFGQKNHVCLKDGSITIPPY---APPQKMQAHLIPPDTPRS 224
Query: 239 -----RGG-RDVENRTVLGFWAGHRNSKIRVILARVWEN---DTELDISNNRINRAIGPL 289
RG D N G++A R A VWEN + DIS + P
Sbjct: 225 IFVYFRGLFYDTSNDPEGGYYA-------RGARASVWENFKNNPLFDISTDH------PP 271
Query: 290 VYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
Y ++ FC+CP G S R+ +++ +GC+PVI+++ LPF D + W + V +
Sbjct: 272 TYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVA 331
Query: 350 ERDVYQLKQILKNISDAEFVSLHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
E DV +L IL +I + L +++ + P DAFH ++ L
Sbjct: 332 EEDVPKLDSILTSIPTDVILRKQRLLANPSMKQAMLFPQPAQAGDAFHQILNGL 385
>gi|414878904|tpg|DAA56035.1| TPA: hypothetical protein ZEAMMB73_301237 [Zea mays]
Length = 360
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 97/327 (29%), Positives = 145/327 (44%), Gaps = 45/327 (13%)
Query: 107 RKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP--ISCHKMRGKGTSYENMTVIVKD 164
R L +A+E + Q + S T DP+ A F+ P +C I++
Sbjct: 15 RCLHHMFAAEIFMHQFLLSSAVRTLDPEEADWFYTPAYTTCDLTPQGFPLPFRAPRIMRS 74
Query: 165 YVDSLINKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRV------ 210
+ + +P+WNRT GADHFF+T HD G RA E G+ L++ A V
Sbjct: 75 AIRYVATTWPFWNRTDGADHFFLTPHDFGACFHYQEERAMERGILPLLRRATLVQTFGQR 134
Query: 211 --VCSPSYDVAFIPHKDVALPQ--VLQP------FALPRG-GRDVENRTVLGFWAGHRNS 259
VC + P+ D Q ++ P F RG D+ N G++A
Sbjct: 135 NHVCLQDGSITVPPYADPGKMQAHLISPGTPRSIFVYFRGLFYDMGNDPEGGYYA----- 189
Query: 260 KIRVILARVWEN--DTEL-DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIAD 316
R A VWEN D L DIS P Y + FC+CP G S R+ +
Sbjct: 190 --RGARASVWENFKDNPLFDISTEH------PSTYYEDMQRAIFCLCPLGWAPWSPRLVE 241
Query: 317 SIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLV 376
++ +GC+PVI+++ LPF D + W +V + ERDV +L IL +I + + L
Sbjct: 242 AVVFGCIPVIIADDIVLPFADAIPWEDISVFVAERDVPRLDSILTSIPLPDILRRQRLLA 301
Query: 377 K--VQKHFQWNSPPVRFDAFHMVMYEL 401
+ V++ ++ P DAFH V+ L
Sbjct: 302 RDSVKQALLFHQPARPGDAFHQVLNGL 328
>gi|414878903|tpg|DAA56034.1| TPA: hypothetical protein ZEAMMB73_301237 [Zea mays]
Length = 393
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 97/327 (29%), Positives = 145/327 (44%), Gaps = 45/327 (13%)
Query: 107 RKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP--ISCHKMRGKGTSYENMTVIVKD 164
R L +A+E + Q + S T DP+ A F+ P +C I++
Sbjct: 48 RCLHHMFAAEIFMHQFLLSSAVRTLDPEEADWFYTPAYTTCDLTPQGFPLPFRAPRIMRS 107
Query: 165 YVDSLINKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRV------ 210
+ + +P+WNRT GADHFF+T HD G RA E G+ L++ A V
Sbjct: 108 AIRYVATTWPFWNRTDGADHFFLTPHDFGACFHYQEERAMERGILPLLRRATLVQTFGQR 167
Query: 211 --VCSPSYDVAFIPHKDVALPQ--VLQP------FALPRG-GRDVENRTVLGFWAGHRNS 259
VC + P+ D Q ++ P F RG D+ N G++A
Sbjct: 168 NHVCLQDGSITVPPYADPGKMQAHLISPGTPRSIFVYFRGLFYDMGNDPEGGYYA----- 222
Query: 260 KIRVILARVWEN--DTEL-DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIAD 316
R A VWEN D L DIS P Y + FC+CP G S R+ +
Sbjct: 223 --RGARASVWENFKDNPLFDISTEH------PSTYYEDMQRAIFCLCPLGWAPWSPRLVE 274
Query: 317 SIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLV 376
++ +GC+PVI+++ LPF D + W +V + ERDV +L IL +I + + L
Sbjct: 275 AVVFGCIPVIIADDIVLPFADAIPWEDISVFVAERDVPRLDSILTSIPLPDILRRQRLLA 334
Query: 377 K--VQKHFQWNSPPVRFDAFHMVMYEL 401
+ V++ ++ P DAFH V+ L
Sbjct: 335 RDSVKQALLFHQPARPGDAFHQVLNGL 361
>gi|195619204|gb|ACG31432.1| secondary cell wall-related glycosyltransferase family 47 [Zea
mays]
Length = 419
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 96/355 (27%), Positives = 153/355 (43%), Gaps = 56/355 (15%)
Query: 86 VTNFKIYIYP---DGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP 142
V K+Y+Y + + R L+ +A+E + + + S T +P+ A F+ P
Sbjct: 50 VGRLKVYVYELPTKYNKKMVAKDSRCLSHMFAAEIFMHRFLLSSAIRTLNPEEADWFYTP 109
Query: 143 I--SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV------ 194
+ +C I++ + + N++PYWNRT GADHFFV HD G
Sbjct: 110 VYTTCDLTPWGHPLPFKSPRIMRSAIQFISNRWPYWNRTEGADHFFVVPHDFGACFHYQE 169
Query: 195 -RATE-GVPFLIKNAIRV--------VCSPSYDVAFIPHKDVALPQVLQPFALPRGGR-- 242
+A E GV L++ A V VC + P+ A PQ ++ +P G
Sbjct: 170 EKAIERGVLPLLRRATLVQTFGQKDHVCLKEGSITIPPY---APPQKMKTHLVPPGTPRS 226
Query: 243 ----------DVENRTVLGFWAGHRNSKIRVILARVWEN---DTELDISNNRINRAIGPL 289
D N G++A R A VWEN + DIS + P
Sbjct: 227 IFVYFRGLFYDTANDPEGGYYA-------RGARASVWENFKNNPLFDISTDH------PP 273
Query: 290 VYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
Y + FC+CP G S R+ +++ +GC+PVI+++ LPF D + W + V +
Sbjct: 274 TYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVA 333
Query: 350 ERDVYQLKQILKNISDAEFVSLHNNLV---KVQKHFQWNSPPVRFDAFHMVMYEL 401
E DV +L IL +I E + L+ +++ + P DAFH ++ L
Sbjct: 334 EDDVPRLDTILTSIP-MEVILRKQRLLANPSMKQAMLFPQPAQAGDAFHQILNGL 387
>gi|307135997|gb|ADN33853.1| glucuronoxylan glucuronosyltransferase [Cucumis melo subsp. melo]
Length = 416
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 97/352 (27%), Positives = 152/352 (43%), Gaps = 56/352 (15%)
Query: 86 VTNFKIYIY--PDG-DPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP 142
V K+++Y P + + PR L +A+E + + + S T +P+ A F+ P
Sbjct: 47 VGRLKVFVYELPSKYNKKILQKDPRCLNHMFAAEIFMHRFLLTSPVRTLNPEEADWFYTP 106
Query: 143 I--SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV------ 194
+ +C +++ + + + +PYWNRT GADHFFV HD G
Sbjct: 107 VYTTCDLTPNGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQE 166
Query: 195 -RATE-GVPFLIKNAIRV--------VCSPSYDVAFIPHKDVALPQVLQPFALPRGGR-- 242
+A E G+ L++ A V VC + P+ A PQ + +P
Sbjct: 167 EKAIERGILPLLQRATLVQTFGQRNHVCLKEGSITIPPY---APPQKMHAHLIPEKTPRS 223
Query: 243 ----------DVENRTVLGFWAGHRNSKIRVILARVWEN--DTEL-DISNNRINRAIGPL 289
DV N G++A R A VWEN D L DIS P
Sbjct: 224 IFVYFRGLFYDVGNDPEGGYYA-------RGARAAVWENFKDNPLFDISTEH------PT 270
Query: 290 VYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
Y + FC+CP G S R+ +++ +GC+PVI+++ LPF D + W + V L+
Sbjct: 271 TYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPFADAIPWEEIGVFLD 330
Query: 350 ERDVYQLKQILKNISDAEFVSLHNNLV---KVQKHFQWNSPPVRFDAFHMVM 398
E+DV L IL +I E + L+ +++ + P DAFH V+
Sbjct: 331 EKDVANLDTILTSIP-LEMILRKQRLLANPSMKQAMLFPQPAQPGDAFHQVL 381
>gi|357126610|ref|XP_003564980.1| PREDICTED: probable glucuronosyltransferase Os01g0926700-like
[Brachypodium distachyon]
Length = 418
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 97/354 (27%), Positives = 150/354 (42%), Gaps = 54/354 (15%)
Query: 86 VTNFKIYIYP-DGDPNT--FYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP 142
V K+YIY G N + PR L +A+E + + + S T +P+ A F+ P
Sbjct: 49 VGRLKVYIYDLPGKYNKKLLKKDPRCLNHMFAAEIFMHRFLLSSAVRTFNPEEADWFYTP 108
Query: 143 I--SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV------ 194
+ +C +V+ + + K+PYWNR+ GADHFFV HD G
Sbjct: 109 VYATCDLTPSGLPLPFKSPRMVRSAIQLIAEKWPYWNRSEGADHFFVAPHDFGACFHYQE 168
Query: 195 --RATEGVPFLIKNAIRV--------VCSPSYDVAFIPHKDVALPQVLQPFALP------ 238
G+ L++ A V VC + P+ A PQ +Q +P
Sbjct: 169 EKAIGRGILPLLQRATLVQTFGQKNHVCLKDGSITIPPY---APPQKMQNHLIPGDTPRS 225
Query: 239 -----RGG-RDVENRTVLGFWAGHRNSKIRVILARVWEN---DTELDISNNRINRAIGPL 289
RG D N G++A R A VWEN + DIS + P
Sbjct: 226 IFVYFRGLFYDTSNDPEGGYYA-------RGARASVWENFKNNPLFDISTDH------PP 272
Query: 290 VYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
Y ++ FC+CP G S R+ +++ +GC+PVI+++ LPF D + W V ++
Sbjct: 273 TYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEDIGVFVS 332
Query: 350 ERDVYQLKQILKNISDAEFVSLHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
E DV +L IL +I + L +++ + P DAFH ++ L
Sbjct: 333 EDDVPKLDSILTSIPTDVILRKQRLLANPSMKQAMLFPQPAEAGDAFHQILNGL 386
>gi|414878901|tpg|DAA56032.1| TPA: secondary cell wall glycosyltransferase family 47 [Zea mays]
Length = 417
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 97/327 (29%), Positives = 145/327 (44%), Gaps = 45/327 (13%)
Query: 107 RKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP--ISCHKMRGKGTSYENMTVIVKD 164
R L +A+E + Q + S T DP+ A F+ P +C I++
Sbjct: 72 RCLHHMFAAEIFMHQFLLSSAVRTLDPEEADWFYTPAYTTCDLTPQGFPLPFRAPRIMRS 131
Query: 165 YVDSLINKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRV------ 210
+ + +P+WNRT GADHFF+T HD G RA E G+ L++ A V
Sbjct: 132 AIRYVATTWPFWNRTDGADHFFLTPHDFGACFHYQEERAMERGILPLLRRATLVQTFGQR 191
Query: 211 --VCSPSYDVAFIPHKDVALPQ--VLQP------FALPRG-GRDVENRTVLGFWAGHRNS 259
VC + P+ D Q ++ P F RG D+ N G++A
Sbjct: 192 NHVCLQDGSITVPPYADPGKMQAHLISPGTPRSIFVYFRGLFYDMGNDPEGGYYA----- 246
Query: 260 KIRVILARVWEN--DTEL-DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIAD 316
R A VWEN D L DIS P Y + FC+CP G S R+ +
Sbjct: 247 --RGARASVWENFKDNPLFDISTEH------PSTYYEDMQRAIFCLCPLGWAPWSPRLVE 298
Query: 317 SIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLV 376
++ +GC+PVI+++ LPF D + W +V + ERDV +L IL +I + + L
Sbjct: 299 AVVFGCIPVIIADDIVLPFADAIPWEDISVFVAERDVPRLDSILTSIPLPDILRRQRLLA 358
Query: 377 K--VQKHFQWNSPPVRFDAFHMVMYEL 401
+ V++ ++ P DAFH V+ L
Sbjct: 359 RDSVKQALLFHQPARPGDAFHQVLNGL 385
>gi|449440484|ref|XP_004138014.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
[Cucumis sativus]
gi|449528307|ref|XP_004171146.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
[Cucumis sativus]
Length = 416
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 97/352 (27%), Positives = 152/352 (43%), Gaps = 56/352 (15%)
Query: 86 VTNFKIYIY--PDG-DPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP 142
V K+++Y P + + PR L +A+E + + + S T +P+ A F+ P
Sbjct: 47 VGRLKVFVYELPSKYNKKILQKDPRCLNHMFAAEIFMHRFLLTSPVRTLNPEEADWFYTP 106
Query: 143 I--SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV------ 194
+ +C +++ + + + +PYWNRT GADHFFV HD G
Sbjct: 107 VYTTCDLTPNGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQE 166
Query: 195 -RATE-GVPFLIKNAIRV--------VCSPSYDVAFIPHKDVALPQVLQPFALPRGGR-- 242
+A E G+ L++ A V VC + P+ A PQ + +P
Sbjct: 167 EKAIERGILPLLQRATLVQTFGQRNHVCLKEGSITIPPY---APPQKMHAHLIPEKTPRS 223
Query: 243 ----------DVENRTVLGFWAGHRNSKIRVILARVWEN--DTEL-DISNNRINRAIGPL 289
DV N G++A R A VWEN D L DIS P
Sbjct: 224 IFVYFRGLFYDVGNDPEGGYYA-------RGARAAVWENFKDNPLFDISTEH------PT 270
Query: 290 VYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
Y + FC+CP G S R+ +++ +GC+PVI+++ LPF D + W + V L+
Sbjct: 271 TYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPFADAIPWEEIGVFLD 330
Query: 350 ERDVYQLKQILKNISDAEFVSLHNNLV---KVQKHFQWNSPPVRFDAFHMVM 398
E+DV L IL +I E + L+ +++ + P DAFH V+
Sbjct: 331 EKDVANLDTILTSIP-LEMILRKQRLLANPSMKQAMLFPQPAQPGDAFHQVL 381
>gi|223974777|gb|ACN31576.1| unknown [Zea mays]
Length = 412
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 95/330 (28%), Positives = 144/330 (43%), Gaps = 51/330 (15%)
Query: 107 RKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTVIVKD 164
R L +A+E + Q + S T DP+ A F+ P+ +C +++
Sbjct: 67 RCLQHMFAAEIFMHQFLLSSAVRTLDPEAADWFYTPVYTTCDLTPQGFPLPFRAPRMMRS 126
Query: 165 YVDSLINKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRV------ 210
+ + +P+WNRT GADHFF+T HD G RA E G+ L++ A V
Sbjct: 127 AIRYVAATWPFWNRTDGADHFFLTPHDFGACFHYQEERAVERGILPLLRRATLVQTFGQR 186
Query: 211 --VCSPSYDVAFIPHKDVALPQVLQPFALPRGGR------------DVENRTVLGFWAGH 256
VC + P+ A P LQ + G D+ N G++A
Sbjct: 187 NHVCLQDGSITVPPY---ASPHRLQAHLVGPGTPRSIFVYFRGLFYDMGNDPEGGYYA-- 241
Query: 257 RNSKIRVILARVWEN--DTEL-DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSAR 313
R A VWEN D L DIS P Y + FC+CP G S R
Sbjct: 242 -----RGARASVWENFKDNPLFDISTEH------PATYYEDMQRAIFCLCPLGWAPWSPR 290
Query: 314 IADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHN 373
+ +++ +GC+PVI+++ LPF D + W +V + ERDV +L IL +I + +
Sbjct: 291 LVEAVVFGCIPVIIADDIVLPFADAIPWEDMSVFVAERDVPRLDSILTSIPLPDILRRQR 350
Query: 374 NLVK--VQKHFQWNSPPVRFDAFHMVMYEL 401
L + V++ ++ P DAFH V+ L
Sbjct: 351 LLARDSVKRALLFHQPARPGDAFHQVLNGL 380
>gi|293336613|ref|NP_001170701.1| uncharacterized protein LOC100384778 precursor [Zea mays]
gi|238007038|gb|ACR34554.1| unknown [Zea mays]
gi|413951433|gb|AFW84082.1| hypothetical protein ZEAMMB73_582447 [Zea mays]
Length = 412
Score = 108 bits (269), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 95/330 (28%), Positives = 144/330 (43%), Gaps = 51/330 (15%)
Query: 107 RKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTVIVKD 164
R L +A+E + Q + S T DP+ A F+ P+ +C +++
Sbjct: 67 RCLQHMFAAEIFMHQFLLSSAVRTLDPEAADWFYTPVYTTCDLTPQGFPLPFRAPRMMRS 126
Query: 165 YVDSLINKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRV------ 210
+ + +P+WNRT GADHFF+T HD G RA E G+ L++ A V
Sbjct: 127 AIRYVAATWPFWNRTDGADHFFLTPHDFGACFHYQEERAVERGILPLLRRATLVQTFGQR 186
Query: 211 --VCSPSYDVAFIPHKDVALPQVLQPFALPRGGR------------DVENRTVLGFWAGH 256
VC + P+ A P LQ + G D+ N G++A
Sbjct: 187 NHVCLQDGSITVPPY---ASPHRLQAHLVGPGTPRSIFVYFRGLFYDMGNDPEGGYYA-- 241
Query: 257 RNSKIRVILARVWEN--DTEL-DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSAR 313
R A VWEN D L DIS P Y + FC+CP G S R
Sbjct: 242 -----RGARASVWENFKDNPLFDISTEH------PATYYEDMQRAIFCLCPLGWAPWSPR 290
Query: 314 IADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHN 373
+ +++ +GC+PVI+++ LPF D + W +V + ERDV +L IL +I + +
Sbjct: 291 LVEAVVFGCIPVIIADDIVLPFADAIPWEDMSVFVAERDVPRLDSILTSIPLPDILRRQR 350
Query: 374 NLVK--VQKHFQWNSPPVRFDAFHMVMYEL 401
L + V++ ++ P DAFH V+ L
Sbjct: 351 LLARDSVKRALLFHQPARPGDAFHQVLNGL 380
>gi|414878902|tpg|DAA56033.1| TPA: hypothetical protein ZEAMMB73_301237 [Zea mays]
Length = 387
Score = 108 bits (269), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 97/327 (29%), Positives = 145/327 (44%), Gaps = 45/327 (13%)
Query: 107 RKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP--ISCHKMRGKGTSYENMTVIVKD 164
R L +A+E + Q + S T DP+ A F+ P +C I++
Sbjct: 42 RCLHHMFAAEIFMHQFLLSSAVRTLDPEEADWFYTPAYTTCDLTPQGFPLPFRAPRIMRS 101
Query: 165 YVDSLINKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRV------ 210
+ + +P+WNRT GADHFF+T HD G RA E G+ L++ A V
Sbjct: 102 AIRYVATTWPFWNRTDGADHFFLTPHDFGACFHYQEERAMERGILPLLRRATLVQTFGQR 161
Query: 211 --VCSPSYDVAFIPHKDVALPQ--VLQP------FALPRG-GRDVENRTVLGFWAGHRNS 259
VC + P+ D Q ++ P F RG D+ N G++A
Sbjct: 162 NHVCLQDGSITVPPYADPGKMQAHLISPGTPRSIFVYFRGLFYDMGNDPEGGYYA----- 216
Query: 260 KIRVILARVWEN--DTEL-DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIAD 316
R A VWEN D L DIS P Y + FC+CP G S R+ +
Sbjct: 217 --RGARASVWENFKDNPLFDISTEH------PSTYYEDMQRAIFCLCPLGWAPWSPRLVE 268
Query: 317 SIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLV 376
++ +GC+PVI+++ LPF D + W +V + ERDV +L IL +I + + L
Sbjct: 269 AVVFGCIPVIIADDIVLPFADAIPWEDISVFVAERDVPRLDSILTSIPLPDILRRQRLLA 328
Query: 377 K--VQKHFQWNSPPVRFDAFHMVMYEL 401
+ V++ ++ P DAFH V+ L
Sbjct: 329 RDSVKQALLFHQPARPGDAFHQVLNGL 355
>gi|242042569|ref|XP_002468679.1| hypothetical protein SORBIDRAFT_01g050110 [Sorghum bicolor]
gi|241922533|gb|EER95677.1| hypothetical protein SORBIDRAFT_01g050110 [Sorghum bicolor]
Length = 429
Score = 108 bits (269), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 83/300 (27%), Positives = 140/300 (46%), Gaps = 41/300 (13%)
Query: 133 PDRAHLFFIPI--SCHKMRGKG-TSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTC 189
P+ A LFF+P+ SC+ G S + ++ + VD + + PYWNR+ GADH FV
Sbjct: 122 PEDADLFFVPVYVSCNFSTPNGFPSLSHARGLLAEAVDLVRVRMPYWNRSAGADHVFVAS 181
Query: 190 HDVG--------VRATEGVPFLIKNAIRV---------VCSPSYDVAFIPH------KDV 226
HD G V +G+P +K +I + VC V PH ++
Sbjct: 182 HDFGACFHPMEDVAIADGIPEFLKRSILLQTFGVQGHHVCQEVEHVVIPPHVPPEVAHEL 241
Query: 227 ALPQVLQP--FALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINR 284
P+ Q FA RG +V + + G + + K+R L + + + + + R +
Sbjct: 242 PEPEKAQRDIFAFFRGKMEVHPKNISGRFY---SKKVRTELLQHYGRNRKFYLKRKRFDN 298
Query: 285 AIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKF 344
Y+ ++ FC+CP G S R+ +S+ GC+PVI+++ LPF +L W +
Sbjct: 299 ------YRSEMARSLFCLCPLGWAPWSPRLVESVLLGCIPVIIADNIRLPFPSVLQWPEI 352
Query: 345 AVILNERDVYQLKQILKNISDAEFVSLHNNL---VKVQKHFQWNSPPVRFDAFHMVMYEL 401
++ + E+D+ L+ +L ++ + NL VK +K +N P DA V+ EL
Sbjct: 353 SLQVAEKDIANLEMVLDHVVATNLTMIQKNLWDPVK-RKALVFNRPMEVGDATWQVLREL 411
>gi|242059741|ref|XP_002459016.1| hypothetical protein SORBIDRAFT_03g044520 [Sorghum bicolor]
gi|241930991|gb|EES04136.1| hypothetical protein SORBIDRAFT_03g044520 [Sorghum bicolor]
Length = 417
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 92/331 (27%), Positives = 145/331 (43%), Gaps = 53/331 (16%)
Query: 107 RKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTVIVKD 164
R L+ +A+E + + + S T +P+ A F+ P+ +C I++
Sbjct: 72 RCLSHMFAAEIFMHRFLLSSAIRTLNPEEADWFYTPVYTTCDLTPWGHPLPFKSPRIMRS 131
Query: 165 YVDSLINKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRV------ 210
+ + N++PYWNRT GADHFFV HD G +A E GV L++ A V
Sbjct: 132 AIQFISNRWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGVLPLLRRATLVQTFGQK 191
Query: 211 --VCSPSYDVAFIPHKDVALPQVLQPFALPRGGR------------DVENRTVLGFWAGH 256
VC + P+ A PQ ++ +P G D N G++A
Sbjct: 192 DHVCLKEGSITIPPY---APPQKMKTHLVPPGTPRSIFVYFRGLFYDTANDPEGGYYA-- 246
Query: 257 RNSKIRVILARVWEN---DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSAR 313
R A VWEN + DIS + P Y + FC+CP G S R
Sbjct: 247 -----RGARASVWENFKNNPLFDISTDH------PPTYYEDMQRAVFCLCPLGWAPWSPR 295
Query: 314 IADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHN 373
+ +++ +GC+PVI+++ LPF D + W + V + E DV +L IL +I E +
Sbjct: 296 LVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVAEDDVPKLDTILTSIP-MEVILRKQ 354
Query: 374 NLV---KVQKHFQWNSPPVRFDAFHMVMYEL 401
L+ +++ + P DAFH ++ L
Sbjct: 355 RLLANPSMKQAMLFPQPAQPGDAFHQILNGL 385
>gi|357149222|ref|XP_003575040.1| PREDICTED: probable glucuronosyltransferase Os02g0520750-like
[Brachypodium distachyon]
Length = 428
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 90/337 (26%), Positives = 146/337 (43%), Gaps = 59/337 (17%)
Query: 104 QTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTVI 161
+ PR L+ +A+E + + + S T +P+ A F+ P+ +C R +
Sbjct: 80 KDPRCLSHMFAAEIFMHRFLFSSAVRTVNPEEADWFYTPVYTTCDLTRAGLPLPFKSPRM 139
Query: 162 VKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV--------RATEGVPFLIKNAIRV--- 210
++ + + NK+P+WNRT G DHFFV HD G+ L+++A V
Sbjct: 140 MRSAIQFISNKWPFWNRTDGGDHFFVVPHDFAACFHYQEENAIARGILPLLRHATLVQTF 199
Query: 211 -----VCSPSYDVAFIPHKDVALPQVLQPFALP-----------RG-----GRDVENRTV 249
VC + P+ A PQ +Q +P RG G D E
Sbjct: 200 GQKNHVCLKDGSITIPPY---APPQKMQAHLIPPDTPRSIFVYFRGLFYDNGNDPEG--- 253
Query: 250 LGFWAGHRNSKIRVILARVWEN---DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGG 306
G++A R A +WEN + DIS P Y ++ FC+CP G
Sbjct: 254 -GYYA-------RGARASLWENFKNNPLFDISTEH------PATYYEDMQRSVFCLCPLG 299
Query: 307 SQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDA 366
S R+ +++ +GC+PVI+++ LPF D + W + V ++E DV +L IL +I
Sbjct: 300 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWDEIGVFVDEEDVPKLDSILTSIPID 359
Query: 367 EFVSLHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
+ + L ++K + P DAFH ++ L
Sbjct: 360 DILRKQRLLANPSMKKAMLFPQPAQPRDAFHQILNGL 396
>gi|116793780|gb|ABK26875.1| unknown [Picea sitchensis]
Length = 420
Score = 107 bits (268), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 100/354 (28%), Positives = 152/354 (42%), Gaps = 54/354 (15%)
Query: 86 VTNFKIYIY--PDG-DPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP 142
V K++IY P + + PR LT +A+E + + + S T +P+ A F+ P
Sbjct: 51 VGRLKVFIYELPSKYNKKILAKDPRCLTHMFATEIFMNRFLLGSPVRTLNPEEADWFYTP 110
Query: 143 I--SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV------ 194
+ +C +++ + + +PYWNRT GADHFFV HD G
Sbjct: 111 VYTTCDLTPNGLPLPFKSPRMMRSAIQYISTNWPYWNRTEGADHFFVVPHDFGACFHYQE 170
Query: 195 -RATE-GVPFLIKNAIRV--------VCSPSYDVAFIPHKDVALPQVLQ----PFALPRG 240
+A E G+ L++ A V VC + P+ A PQ +Q P + PR
Sbjct: 171 EKAIERGILPLLQRATLVQTFGQRYHVCLKKGSITVPPY---APPQKMQAHLIPPSTPRS 227
Query: 241 --------GRDVENRTVLGFWAGHRNSKIRVILARVWEN--DTEL-DISNNRINRAIGPL 289
DV N G++A R A VWEN D L DIS P
Sbjct: 228 IFVYFRGLFYDVGNDPEGGYYA-------RGARASVWENFKDNPLFDISTEH------PA 274
Query: 290 VYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
Y + FC+CP G S R+ + + +GC+PVI+++ LPF D + W + V +
Sbjct: 275 TYYEDMQRAIFCLCPLGWAPWSPRLVEGVIFGCIPVIIADDIVLPFADAIPWEEIGVFVA 334
Query: 350 ERDVYQLKQILKNISDAEFVSLHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
E+DV L IL +I + L +++ + P DAFH ++ L
Sbjct: 335 EKDVPNLDTILTSIPPEVILRKQRLLANPAMKQAMLFPQPAQPGDAFHQILNGL 388
>gi|242059743|ref|XP_002459017.1| hypothetical protein SORBIDRAFT_03g044530 [Sorghum bicolor]
gi|241930992|gb|EES04137.1| hypothetical protein SORBIDRAFT_03g044530 [Sorghum bicolor]
Length = 420
Score = 107 bits (267), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 94/354 (26%), Positives = 150/354 (42%), Gaps = 54/354 (15%)
Query: 86 VTNFKIYIY--PDG-DPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP 142
V K+Y+Y P + + PR L +A+E + + + S T +P+ A F+ P
Sbjct: 51 VGRLKVYVYDLPSKYNKKLLKKDPRCLNHMFAAEIFMHRFLLSSAVRTFNPEEADWFYTP 110
Query: 143 I--SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV------ 194
+ +C +++ ++ + +PYWNR+ GADHFFVT HD G
Sbjct: 111 VYTTCDLTPSGLPLPFKSPRMMRSAIELIATNWPYWNRSEGADHFFVTPHDFGACFHYQE 170
Query: 195 --RATEGVPFLIKNAIRV--------VCSPSYDVAFIPHKDVALPQVLQPFALP------ 238
G+ L++ A V VC + P A PQ +Q +P
Sbjct: 171 EKAIGRGILPLLQRATLVQTFGQKNHVCLKDGSITIPP---FAPPQKMQAHLIPADTPRS 227
Query: 239 -----RGG-RDVENRTVLGFWAGHRNSKIRVILARVWEN---DTELDISNNRINRAIGPL 289
RG D N G++A R A VWEN + DIS + P
Sbjct: 228 IFVYFRGLFYDTSNDPEGGYYA-------RGARASVWENFKNNPLFDISTDH------PP 274
Query: 290 VYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
Y ++ FC+CP G S R+ +++ +GC+PVI+++ LPF D + W + V +
Sbjct: 275 TYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVA 334
Query: 350 ERDVYQLKQILKNISDAEFVSLHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
E DV +L IL +I + L +++ + P DAFH ++ L
Sbjct: 335 EEDVPKLDSILTSIPTDVILRKQRLLANPSMKQAMLFPQPAQAGDAFHQILNGL 388
>gi|449456052|ref|XP_004145764.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
IRX7-like [Cucumis sativus]
Length = 459
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 78/298 (26%), Positives = 136/298 (45%), Gaps = 49/298 (16%)
Query: 113 YASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKG-TSYENMTVIVKDYVDSL 169
+ASE + + S + T DP A FF+P+ SC+ G + + ++ V +
Sbjct: 132 FASEVAIHRALLNSDYRTFDPLEADFFFVPVYVSCNFSTVNGFPAIGHARSLISSAVSHI 191
Query: 170 INKYPYWNRTLGADHFFVTCHDVG--------VRATEGVPFLIKNAIRVVC------SPS 215
+ Y +WNRT G+DH FV HD V +GVP +KN+I + P
Sbjct: 192 SSHYSFWNRTNGSDHVFVASHDFASCFHTMEHVAIADGVPSFLKNSIILQTFGVKYKHPC 251
Query: 216 YDVAFIPHKDVALPQVLQP------------------FALPRGGRDVENRTVLGFWAGHR 257
DV + V +P + P FA RG ++ + V G +
Sbjct: 252 QDV-----EHVVIPPYISPESIENTLERSPVTGRRDIFAFFRGKMEMNPKNVSGRFY--- 303
Query: 258 NSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADS 317
+ K+R ++ R + D + +R YQ ++ FC+CP G S R+ +S
Sbjct: 304 SKKVRTMIWRKFNGDRRFYLQRHRFPG------YQSEIVRSVFCLCPLGWAPWSPRLVES 357
Query: 318 IHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNL 375
+ GCVPVI+++ LPF ++W + ++ + E+D+ +L +IL +++ + ++ NL
Sbjct: 358 VALGCVPVIIADGIRLPFPSAVNWPEISITVAEKDIGKLGRILDHVAASNLTTIQKNL 415
>gi|317106607|dbj|BAJ53114.1| JHL07K02.4 [Jatropha curcas]
Length = 417
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 93/330 (28%), Positives = 143/330 (43%), Gaps = 51/330 (15%)
Query: 107 RKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTVIVKD 164
R LT +A+E + + + S T +P+ A F+ P+ +C +++
Sbjct: 72 RCLTHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRS 131
Query: 165 YVDSLINKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRV------ 210
+ + + +PYWNRT GADHFFV HD G +A E G+ L++ A V
Sbjct: 132 AIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQR 191
Query: 211 --VCSPSYDVAFIPHKDVALPQVLQPFALPRGGR------------DVENRTVLGFWAGH 256
VC + P+ A PQ +Q +P DV N G++A
Sbjct: 192 NHVCLKDGSITIPPY---APPQKMQAHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYA-- 246
Query: 257 RNSKIRVILARVWEN--DTEL-DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSAR 313
R A VWEN D L DIS P Y + FC+CP G S R
Sbjct: 247 -----RGARAAVWENFKDNPLFDISTEH------PSTYYEDMQRAVFCLCPLGWAPWSPR 295
Query: 314 IADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHN 373
+ +++ +GC+PVI+++ LPF D + W + V ++E+DV L IL +I +
Sbjct: 296 LVEAVIFGCIPVIIADDIVLPFADAIPWEQIGVFVDEKDVPNLDAILTSIPPEVILRKQR 355
Query: 374 NLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
L +++ + P DAFH V+ L
Sbjct: 356 LLANPSMKQAMLFPQPVQPGDAFHQVLNGL 385
>gi|413951430|gb|AFW84079.1| secondary cell wall glycosyltransferase family 47 [Zea mays]
Length = 421
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 94/354 (26%), Positives = 149/354 (42%), Gaps = 54/354 (15%)
Query: 86 VTNFKIYIY--PDG-DPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP 142
V K+Y+Y P + + PR L +A+E + + + S T +P+ A F+ P
Sbjct: 52 VGRLKVYVYDLPSKYNKKLVKKDPRCLNHMFAAEIFMHRFLLSSAVRTFNPEEADWFYTP 111
Query: 143 I--SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV------ 194
+ +C +++ ++ + +PYWNR+ GADHFFVT HD G
Sbjct: 112 VYATCDLTPSGLPLPFKSPRMMRSAIELIATNWPYWNRSEGADHFFVTPHDFGACFHYQE 171
Query: 195 --RATEGVPFLIKNAIRV--------VCSPSYDVAFIPHKDVALPQVLQ----PFALPRG 240
G+ L++ A V VC + P A PQ +Q P PR
Sbjct: 172 EKAIGRGILPLLQRATLVQTFGQKNHVCLKGGSITIPP---FAPPQKMQAHLIPLDTPRS 228
Query: 241 --------GRDVENRTVLGFWAGHRNSKIRVILARVWEN---DTELDISNNRINRAIGPL 289
D N G++A R A VWEN + DIS + P
Sbjct: 229 IFVYFRGLFYDTSNDPEGGYYA-------RGARASVWENFKNNPLFDISTDH------PP 275
Query: 290 VYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
Y ++ FC+CP G S R+ +++ +GC+PVI+++ LPF D + W + V +
Sbjct: 276 TYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVA 335
Query: 350 ERDVYQLKQILKNISDAEFVSLHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
E DV +L IL +I + L +++ + P DAFH ++ L
Sbjct: 336 EEDVPKLDSILTSIPTDVILRKQRLLANPAMKQAMLFPQPAQAGDAFHQILNGL 389
>gi|359473830|ref|XP_003631364.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like [Vitis
vinifera]
Length = 412
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 93/333 (27%), Positives = 143/333 (42%), Gaps = 51/333 (15%)
Query: 104 QTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTVI 161
+ PR L +A+E + + + S T +P+ A F+ PI +C +
Sbjct: 64 KDPRCLNHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYTTCDLTPNGLPLPFKSPRM 123
Query: 162 VKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRV--- 210
++ + + + +PYWNRT GADHFFV HD G +A E G+ L++ A V
Sbjct: 124 MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTF 183
Query: 211 -----VCSPSYDVAFIPHKDVALPQVLQPFALPRGGR------------DVENRTVLGFW 253
VC + P+ A PQ +Q +P+ DV N G++
Sbjct: 184 GQRNHVCLNEGSITIPPY---APPQKMQAHLIPQETPRSIFVYFRGLFYDVNNDPEGGYY 240
Query: 254 AGHRNSKIRVILARVWEN--DTEL-DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVN 310
A R A VWEN D L DIS P Y + FC+CP G
Sbjct: 241 A-------RGARAAVWENFKDNPLFDISTEH------PTTYYEDMQRAIFCLCPLGWAPW 287
Query: 311 SARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVS 370
S R+ +++ +GC+PVI+++ LPF D + W + V + E DV L IL +I +
Sbjct: 288 SPRLVEAVIFGCIPVIIADDIVLPFADAIPWEEIGVFVAEEDVPNLDTILTSIPPEVILR 347
Query: 371 LHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
L +++ + P DAFH ++ L
Sbjct: 348 KQRLLANPSMKQAMLFPQPAQSGDAFHQILNGL 380
>gi|125582313|gb|EAZ23244.1| hypothetical protein OsJ_06939 [Oryza sativa Japonica Group]
Length = 434
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 92/337 (27%), Positives = 148/337 (43%), Gaps = 59/337 (17%)
Query: 104 QTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTVI 161
+ PR L +A+E + + + S T +P++A F+ P+ +C +
Sbjct: 85 KDPRCLNHMFAAEIFMHRFLLSSAVRTLNPEQADWFYAPVYTTCDLTHAGLPLPFKSPRM 144
Query: 162 VKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRV--- 210
++ + L K+P+WNRT GADHFFV HD G +A E G+ L++ A V
Sbjct: 145 MRSAIQFLSRKWPFWNRTDGADHFFVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTF 204
Query: 211 -----VCSPSYDVAFIPHKDVALPQVLQPFALP-----------RG-----GRDVENRTV 249
VC + P+ A PQ +Q +P RG G D E
Sbjct: 205 GQKNHVCLKEGSITIPPY---APPQKMQAHLIPPDTPRSIFVYFRGLFYDNGNDPEG--- 258
Query: 250 LGFWAGHRNSKIRVILARVWEN---DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGG 306
G++A R A +WEN + DIS P Y ++ FC+CP G
Sbjct: 259 -GYYA-------RGARASLWENFKNNPLFDISTEH------PATYYEDMQRSVFCLCPLG 304
Query: 307 SQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDA 366
S R+ +++ +GC+PVI+++ LPF D + W + V ++E DV +L IL +I
Sbjct: 305 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWDEIGVFVDEEDVPRLDSILTSIPID 364
Query: 367 EFVSLHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
+ + L +++ + P DAFH ++ L
Sbjct: 365 DILRKQRLLANPSMKQAMLFPQPAQPRDAFHQILNGL 401
>gi|226503833|ref|NP_001149319.1| secondary cell wall-related glycosyltransferase family 47 precursor
[Zea mays]
gi|195626366|gb|ACG35013.1| secondary cell wall-related glycosyltransferase family 47 [Zea
mays]
Length = 421
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 94/354 (26%), Positives = 149/354 (42%), Gaps = 54/354 (15%)
Query: 86 VTNFKIYIY--PDG-DPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP 142
V K+Y+Y P + + PR L +A+E + + + S T +P+ A F+ P
Sbjct: 52 VGRLKVYVYDLPSKYNKKLVKKDPRCLNHMFAAEIFMHRFLLSSAVRTFNPEEADWFYTP 111
Query: 143 I--SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV------ 194
+ +C +++ ++ + +PYWNR+ GADHFFVT HD G
Sbjct: 112 VYATCDLTPSGLPLPFKSPRMMRSAIELIATNWPYWNRSEGADHFFVTPHDFGACFHYQE 171
Query: 195 --RATEGVPFLIKNAIRV--------VCSPSYDVAFIPHKDVALPQVLQ----PFALPRG 240
G+ L++ A V VC + P A PQ +Q P PR
Sbjct: 172 EKAIGRGILPLLQRATLVQTFGQKNHVCLKGGSIXIPP---FAPPQKMQAHLIPLDTPRS 228
Query: 241 --------GRDVENRTVLGFWAGHRNSKIRVILARVWEN---DTELDISNNRINRAIGPL 289
D N G++A R A VWEN + DIS + P
Sbjct: 229 IFVYFRGLFYDTSNDPEGGYYA-------RGARASVWENFKNNPLFDISTDH------PP 275
Query: 290 VYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
Y ++ FC+CP G S R+ +++ +GC+PVI+++ LPF D + W + V +
Sbjct: 276 TYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVA 335
Query: 350 ERDVYQLKQILKNISDAEFVSLHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
E DV +L IL +I + L +++ + P DAFH ++ L
Sbjct: 336 EEDVPKLDSILTSIPTDVILRKQRLLANPAMKQAMLFPQPAQAGDAFHQILNGL 389
>gi|75122905|sp|Q6H4N0.1|GT21_ORYSJ RecName: Full=Probable glucuronosyltransferase Os02g0520750
gi|49389081|dbj|BAD26319.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
gi|215769122|dbj|BAH01351.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 434
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 92/337 (27%), Positives = 148/337 (43%), Gaps = 59/337 (17%)
Query: 104 QTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTVI 161
+ PR L +A+E + + + S T +P++A F+ P+ +C +
Sbjct: 85 KDPRCLNHMFAAEIFMHRFLLSSAVRTLNPEQADWFYAPVYTTCDLTHAGLPLPFKSPRM 144
Query: 162 VKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRV--- 210
++ + L K+P+WNRT GADHFFV HD G +A E G+ L++ A V
Sbjct: 145 MRSAIQFLSRKWPFWNRTDGADHFFVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTF 204
Query: 211 -----VCSPSYDVAFIPHKDVALPQVLQPFALP-----------RG-----GRDVENRTV 249
VC + P+ A PQ +Q +P RG G D E
Sbjct: 205 GQKNHVCLKEGSITIPPY---APPQKMQAHLIPPDTPRSIFVYFRGLFYDNGNDPEG--- 258
Query: 250 LGFWAGHRNSKIRVILARVWEN---DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGG 306
G++A R A +WEN + DIS P Y ++ FC+CP G
Sbjct: 259 -GYYA-------RGARASLWENFKNNPLFDISTEH------PATYYEDMQRSVFCLCPLG 304
Query: 307 SQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDA 366
S R+ +++ +GC+PVI+++ LPF D + W + V ++E DV +L IL +I
Sbjct: 305 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWDEIGVFVDEEDVPRLDSILTSIPID 364
Query: 367 EFVSLHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
+ + L +++ + P DAFH ++ L
Sbjct: 365 DILRKQRLLANPSMKQAMLFPQPAQPRDAFHQILNGL 401
>gi|356512936|ref|XP_003525170.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
[Glycine max]
Length = 416
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 92/333 (27%), Positives = 144/333 (43%), Gaps = 51/333 (15%)
Query: 104 QTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTVI 161
+ PR L +A+E + + + S T +P+ A F+ P+ +C +
Sbjct: 68 KDPRCLNHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRM 127
Query: 162 VKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRV--- 210
++ + + + +PYWNRT GADHFFV HD G +A E G+ L++ A V
Sbjct: 128 MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILTLLRRATLVQTF 187
Query: 211 -----VCSPSYDVAFIPHKDVALPQVLQPFALP-----------RGG-RDVENRTVLGFW 253
VC + P+ A PQ + +P RG DV N G++
Sbjct: 188 GQRNHVCLKEGSITIPPY---APPQKMHTHLIPDKTPRSIFVYFRGLFYDVGNDPEGGYY 244
Query: 254 AGHRNSKIRVILARVWENDTE---LDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVN 310
A R A VWEN + DIS P Y + FC+CP G
Sbjct: 245 A-------RGARAAVWENFKDNLLFDISTEH------PTTYYEDMQRAVFCLCPLGWAPW 291
Query: 311 SARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVS 370
S R+ +++ +GC+PVI+++ LPF D + W + V ++E DV +L IL +I +
Sbjct: 292 SPRLVEAVIFGCIPVIIADDIVLPFADAIPWEEIGVFVDEEDVPKLDTILTSIPPEVILR 351
Query: 371 LHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
L +++ + P DAFH V+ L
Sbjct: 352 KQRLLANPSMKQAMLFPQPAQPGDAFHQVLNGL 384
>gi|226499780|ref|NP_001150738.1| secondary cell wall-related glycosyltransferase family 47 precursor
[Zea mays]
gi|195641374|gb|ACG40155.1| secondary cell wall-related glycosyltransferase family 47 [Zea
mays]
gi|223974207|gb|ACN31291.1| unknown [Zea mays]
Length = 415
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 90/333 (27%), Positives = 144/333 (43%), Gaps = 51/333 (15%)
Query: 104 QTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI-SCHKMRGKGTSYE-NMTVI 161
+ PR L +A+E + + + S T +P+ A F+ P+ + + KG +
Sbjct: 67 KDPRCLNHMFAAEIFMHRFLLSSAVRTFNPEEADWFYTPVYTTCDLTPKGLPLPFKSPRM 126
Query: 162 VKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV--------RATEGVPFLIKNAIRV--- 210
++ + + +PYWNR+ GADHFFVT HD G G+ L++ A V
Sbjct: 127 MRSAIQLIATNWPYWNRSEGADHFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTF 186
Query: 211 -----VCSPSYDVAFIPHKDVALPQVLQPFALP-----------RGG-RDVENRTVLGFW 253
VC + P+ A PQ +Q +P RG D N G++
Sbjct: 187 GQKNHVCLKDGSITIPPY---APPQKMQTHLIPADTPRSIFVYFRGLFYDTGNDPEGGYY 243
Query: 254 AGHRNSKIRVILARVWEN---DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVN 310
A R A VWEN + DIS + P Y ++ FC+CP G
Sbjct: 244 A-------RGARASVWENFKNNPLFDISTDH------PPTYYEDMQRSVFCLCPLGWAPW 290
Query: 311 SARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVS 370
S R+ +++ +GC+PVI+++ LPF D + W + V + E DV +L IL +I +
Sbjct: 291 SPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVAEEDVPRLDSILTSIPTDVVLR 350
Query: 371 LHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
L +++ + P DAFH ++ L
Sbjct: 351 KQRLLANPSMKQAMLFPQPAQAGDAFHQILNGL 383
>gi|414878899|tpg|DAA56030.1| TPA: secondary cell wall glycosyltransferase family 47 isoform 1
[Zea mays]
gi|414878900|tpg|DAA56031.1| TPA: secondary cell wall glycosyltransferase family 47 isoform 2
[Zea mays]
Length = 419
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 96/355 (27%), Positives = 153/355 (43%), Gaps = 56/355 (15%)
Query: 86 VTNFKIYIYP---DGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP 142
V K+Y+Y + + R L+ +A+E + + + S T +P+ A F+ P
Sbjct: 50 VGRLKVYVYELPTKYNKKMVAKDSRCLSHMFAAEIFMHRFLLSSAIRTLNPEVADWFYTP 109
Query: 143 I--SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV------ 194
+ +C I++ + + N++PYWNRT GADHFFV HD G
Sbjct: 110 VYTTCDLTPWGHPLPFKSPRIMRSAIQFISNRWPYWNRTEGADHFFVVPHDFGACFHYQE 169
Query: 195 -RATE-GVPFLIKNAIRV--------VCSPSYDVAFIPHKDVALPQVLQPFALPRGGR-- 242
+A E GV L++ A V VC + P+ A PQ ++ +P G
Sbjct: 170 EKAIERGVLPLLRRATLVQTFGQKDHVCLKEGSITIPPY---APPQKMKTHLVPPGTPRS 226
Query: 243 ----------DVENRTVLGFWAGHRNSKIRVILARVWEN---DTELDISNNRINRAIGPL 289
D N G++A R A VWEN + DIS + P
Sbjct: 227 IFVYFRGLFYDTANDPEGGYYA-------RGARASVWENFKNNPLFDISTDH------PP 273
Query: 290 VYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
Y + FC+CP G S R+ +++ +GC+PVI+++ LPF D + W + V +
Sbjct: 274 TYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVA 333
Query: 350 ERDVYQLKQILKNISDAEFVSLHNNLV---KVQKHFQWNSPPVRFDAFHMVMYEL 401
E DV +L IL +I E + L+ +++ + P DAFH ++ L
Sbjct: 334 EDDVPRLDTILTSIP-MEVILRKQRLLANPSMKQAMLFPQPAQAGDAFHQILNGL 387
>gi|226499284|ref|NP_001142343.1| uncharacterized protein LOC100274514 precursor [Zea mays]
gi|194708308|gb|ACF88238.1| unknown [Zea mays]
Length = 419
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 96/355 (27%), Positives = 153/355 (43%), Gaps = 56/355 (15%)
Query: 86 VTNFKIYIYP---DGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP 142
V K+Y+Y + + R L+ +A+E + + + S T +P+ A F+ P
Sbjct: 50 VGRLKVYVYELPTKYNKKMVAKDSRCLSHMFAAEIFMHRFLLSSAIRTLNPEVADWFYTP 109
Query: 143 I--SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV------ 194
+ +C I++ + + N++PYWNRT GADHFFV HD G
Sbjct: 110 VYTTCDLTPWGHPLPFRSPRIMRSAIQFISNRWPYWNRTEGADHFFVVPHDFGACFHYQE 169
Query: 195 -RATE-GVPFLIKNAIRV--------VCSPSYDVAFIPHKDVALPQVLQPFALPRGGR-- 242
+A E GV L++ A V VC + P+ A PQ ++ +P G
Sbjct: 170 EKAIERGVLPLLRRATLVQTFGQKDHVCLKEGSITIPPY---APPQKMKTHLVPPGTPRS 226
Query: 243 ----------DVENRTVLGFWAGHRNSKIRVILARVWEN---DTELDISNNRINRAIGPL 289
D N G++A R A VWEN + DIS + P
Sbjct: 227 IFVYFRGLFYDTANDPEGGYYA-------RGARASVWENFKNNPLFDISTDH------PP 273
Query: 290 VYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
Y + FC+CP G S R+ +++ +GC+PVI+++ LPF D + W + V +
Sbjct: 274 TYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVA 333
Query: 350 ERDVYQLKQILKNISDAEFVSLHNNLV---KVQKHFQWNSPPVRFDAFHMVMYEL 401
E DV +L IL +I E + L+ +++ + P DAFH ++ L
Sbjct: 334 EDDVPRLDTILTSIP-MEVILRKQRLLANPSMKQAMLFPQPAQAGDAFHQILNGL 387
>gi|168049543|ref|XP_001777222.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162671450|gb|EDQ58002.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 368
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 86/350 (24%), Positives = 153/350 (43%), Gaps = 47/350 (13%)
Query: 80 LNYLEMVTNFKIYIY--PDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAH 137
N L + +IY+Y P + R +ASE + + S T DP A
Sbjct: 11 FNPLNLTNAIRIYVYDLPPKFNEDWLVDERCSNHLFASEVAIHKILLTSPIRTLDPYEAD 70
Query: 138 LFFIPI--SCHKMRGKGTSY-ENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV 194
FF+P+ SC G + + +++ V+ + K +WNR+ G DH FV HD G
Sbjct: 71 FFFMPVYVSCKFSPKTGFPWLGHAPKLMQAAVNHVSTKMEFWNRSWGRDHIFVAAHDYGA 130
Query: 195 --------RATEGVPFLIKNA---------------------IRVVCSPSYDVAFIPHKD 225
+G+P ++N+ I SPS V+++ KD
Sbjct: 131 CFHTLETQAIAQGIPQFMRNSLILQTFGVKGFHPCQAAEHIQIPPYISPSVAVSYV--KD 188
Query: 226 VALPQVLQPFALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRA 285
Q FA RG ++ + V G + IR +L + + + + +R++ +
Sbjct: 189 PLEHQQRDIFAYFRGKMEINPKNVSGLL---YSKGIRTVLYKRFSRNKRFVLKRHRVDNS 245
Query: 286 IGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFA 345
Q+ ++ FC+CP G S RI +++ YGC+PVI+++ LP++ +DW +
Sbjct: 246 ------QQEMLRSTFCLCPLGWAPWSPRIVEAVTYGCIPVIIADNISLPYSHTIDWSSIS 299
Query: 346 VILNERDVYQLKQILKNISDAEFVSLHNNLVKVQ--KHFQWNSPPVRFDA 393
+ + E DV +L +IL ++ ++ NL + + + + P V+ DA
Sbjct: 300 LTVPEHDVPKLDKILIGVAVTNLTAIQRNLWRGENRRALLFTDPLVQGDA 349
>gi|224106838|ref|XP_002314302.1| predicted protein [Populus trichocarpa]
gi|222850710|gb|EEE88257.1| predicted protein [Populus trichocarpa]
Length = 462
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 91/357 (25%), Positives = 156/357 (43%), Gaps = 54/357 (15%)
Query: 85 MVTNFKIYIY--PDGDPNTFYQTPRKLTGKYASEGYFFQNIRES-RFLTHDPDRAHLFFI 141
++ + K+YIY P + R +ASE + + S T DP A FF+
Sbjct: 104 LLKDLKVYIYELPSKYNTDWLANERCSNHLFASEVAIHKALSNSLDIRTFDPYEADFFFV 163
Query: 142 PI--SCHKMRGKG-TSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV---- 194
P+ SC+ G + + ++ V + + YP+WNR+ G+DH FV HD G
Sbjct: 164 PVYVSCNFSTVNGFPAIGHARSLLSSAVQLISSNYPFWNRSQGSDHVFVASHDYGACFHA 223
Query: 195 ---RATE-GVPFLIKNAIRVVC------SPSYDVAFIPHKDVALPQVLQP---------- 234
RA E G+P +K +I + P DV ++V +P + P
Sbjct: 224 MEERAMEDGIPEFLKRSIILQTFGVKFNHPCQDV-----ENVVIPPYISPERVRTTLENY 278
Query: 235 --------FALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAI 286
+A RG +V + + G + + K+R ++ R + D + +R
Sbjct: 279 PLNGRRDIWAFFRGKMEVHPKNISG---RYYSKKVRTVIWRKYSGDRRFYLQRHRFAG-- 333
Query: 287 GPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAV 346
YQ ++ FC+CP G S R+ +S+ GCVPVI+++ LPF + W + ++
Sbjct: 334 ----YQSEIVRSVFCLCPLGWAPWSPRLVESVALGCVPVIIADGIRLPFPTAVRWSEISL 389
Query: 347 ILNERDVYQLKQILKNISDAEFVSLHNNL--VKVQKHFQWNSPPVRFDAFHMVMYEL 401
+ E+DV L +L +++ ++ NL V++ +N DA V+Y L
Sbjct: 390 TVAEKDVANLGTLLDHVAATNLSAIQKNLWDPDVRRALLFNDRVQEGDATWQVLYAL 446
>gi|412991334|emb|CCO16179.1| predicted protein [Bathycoccus prasinos]
Length = 558
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 81/353 (22%), Positives = 150/353 (42%), Gaps = 40/353 (11%)
Query: 88 NFKIYIY---PDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPIS 144
+FKI++Y P + R +T +Y +E NI +S+ T DP A F++P+
Sbjct: 207 SFKIFVYNLPPKFHVEMLKKNKRCVTDQYGTEIRIHANIMQSKMYTLDPLEAEFFYVPVY 266
Query: 145 CHKMRGKGTSYENMTVIVK------------DYVDSLINKYPYWNRTLGADHFFVTCHDV 192
G+ +EN+ + + + + ++YP+WNRT G DH F
Sbjct: 267 -----GECKLFENIATLGAKKGLQETNAWWLEAMKLVTDQYPFWNRTQGRDHVFTFAGAR 321
Query: 193 GVRATEGVPFLIKNAIRVVCSPSYDVA----FIPHKDVALPQVLQPFALPRGG----RDV 244
G + IK +I +P D + F KD+ +P + A G ++V
Sbjct: 322 GPHIFKDWKRHIKKSI--FLTPEGDRSLSEQFNTWKDIVIPGLEPEKAFWSGSLRKQKEV 379
Query: 245 ENRTVLGFWAGHRNSK--------IRVILARVWENDTELDISNNRINRAIGPLVYQRRFY 296
+ ++ G +K IR+ + +++ D+ + + Y+
Sbjct: 380 KRAKTFAYFRGTIANKLGKQYSKGIRIKMKEAFKDIK--DVVFTEQHSSCDKTCYREEMR 437
Query: 297 KTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQL 356
+ FC+CP G + R ++ GC+PVI+++ + P+ + DWR+ ++ + E+ +
Sbjct: 438 ASTFCLCPRGWSPWTLRAYQALMVGCIPVIIADEIEFPYENSFDWRQVSIKIPEKRHLET 497
Query: 357 KQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIK 409
IL+++ D + K W P DAFH+VM EL ++ K
Sbjct: 498 IDILRSVPDDVVERKRKAMAKFWPSVAWKKPAADDDAFHLVMKELERKKRGFK 550
>gi|363543255|ref|NP_001241842.1| uncharacterized protein LOC100857042 precursor [Zea mays]
gi|194704652|gb|ACF86410.1| unknown [Zea mays]
gi|224034207|gb|ACN36179.1| unknown [Zea mays]
gi|414878896|tpg|DAA56027.1| TPA: hypothetical protein ZEAMMB73_615997 [Zea mays]
Length = 418
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 94/354 (26%), Positives = 150/354 (42%), Gaps = 54/354 (15%)
Query: 86 VTNFKIYIY--PDG-DPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP 142
V K+Y+Y P + + PR L+ +A+E + + + S T +P+ A F+ P
Sbjct: 48 VGRLKVYVYDLPSKYNKKLLKKDPRCLSHMFAAEIFMHRFLLSSAVRTFNPEEADWFYTP 107
Query: 143 I--SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV------ 194
+ +C +++ ++ + +PYWNR+ GADHFFVT HD G
Sbjct: 108 VYTTCDLTPSGLPLPFKSPRMMRSAIELVATNWPYWNRSEGADHFFVTPHDFGACFHYQE 167
Query: 195 --RATEGVPFLIKNAIRV--------VCSPSYDVAFIPHKDVALPQVLQPFALP------ 238
G+ L++ A V VC + P A PQ +Q +P
Sbjct: 168 EKAIGRGILPLLQRATLVQTFGQKNHVCLKDGSITIPP---FAPPQKMQAHLIPADTPRS 224
Query: 239 -----RGG-RDVENRTVLGFWAGHRNSKIRVILARVWEN---DTELDISNNRINRAIGPL 289
RG D N G++A R A VWEN + DIS + P
Sbjct: 225 IFVYFRGLFYDTSNDPEGGYYA-------RGARASVWENFKNNPLFDISTDH------PA 271
Query: 290 VYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
Y ++ FC+CP G S R+ +++ +GC+PVI+++ LPF D + W V +
Sbjct: 272 TYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIVADDIVLPFADAIPWEDIGVFVA 331
Query: 350 ERDVYQLKQILKNISDAEFVSLHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
E DV +L IL +I + L +++ + P DAFH ++ L
Sbjct: 332 EEDVPRLDTILTSIPTDVVLRKQRLLANPSMKQAMLFPQPAQAGDAFHQILNGL 385
>gi|21592312|gb|AAM64263.1| unknown [Arabidopsis thaliana]
Length = 273
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 53/133 (39%), Positives = 85/133 (63%), Gaps = 8/133 (6%)
Query: 65 SDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR 124
+DL + +F + SVF +Y M K+YIYPDGD F++ L G YASEG+F + +
Sbjct: 135 TDLFAPLFRNLSVFKRSYELMELILKVYIYPDGDKPIFHEP--HLNGIYASEGWFMKLME 192
Query: 125 E-SRFLTHDPDRAHLFFIPISCHKMRGK-----GTSYENMTVIVKDYVDSLINKYPYWNR 178
++F+T +P++AHLF++P S +++ + + +++ ++DYV+ L KYP+WNR
Sbjct: 193 SNTQFVTKNPEKAHLFYMPYSVKQLQKSIFVPGSHNIKPLSIFLRDYVNMLSIKYPFWNR 252
Query: 179 TLGADHFFVTCHD 191
T G+DHF V CHD
Sbjct: 253 THGSDHFLVACHD 265
>gi|194697530|gb|ACF82849.1| unknown [Zea mays]
Length = 418
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 94/354 (26%), Positives = 149/354 (42%), Gaps = 54/354 (15%)
Query: 86 VTNFKIYIY--PDG-DPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP 142
V K+Y+Y P + + PR L +A+E + + + S T +P+ A F+ P
Sbjct: 48 VGRLKVYVYDLPSKYNKKLLKKDPRCLNHMFAAEIFMHRFLLSSAVRTFNPEEADWFYTP 107
Query: 143 I--SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV------ 194
+ +C +++ ++ + +PYWNR+ GADHFFVT HD G
Sbjct: 108 VYTTCDLTPSGLPLPFKSPRMMRSAIELVATNWPYWNRSEGADHFFVTPHDFGACFHYQE 167
Query: 195 --RATEGVPFLIKNAIRV--------VCSPSYDVAFIPHKDVALPQVLQPFALP------ 238
G+ L++ A V VC + P A PQ +Q +P
Sbjct: 168 EKAIGRGILPLLQRATLVQTFGQKNHVCLKDGSITIPP---FAPPQKMQAHLIPADTPRS 224
Query: 239 -----RGG-RDVENRTVLGFWAGHRNSKIRVILARVWEN---DTELDISNNRINRAIGPL 289
RG D N G++A R A VWEN + DIS + P
Sbjct: 225 IFVYFRGLFYDTSNDPEGGYYA-------RGARASVWENFKNNPLFDISTDH------PA 271
Query: 290 VYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
Y ++ FC+CP G S R+ +++ +GC+PVI+++ LPF D + W V +
Sbjct: 272 TYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIVADDIVLPFADAIPWEDIGVFVA 331
Query: 350 ERDVYQLKQILKNISDAEFVSLHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
E DV +L IL +I + L +++ + P DAFH ++ L
Sbjct: 332 EEDVPRLDTILTSIPTDVVLRKQRLLANPSMKQAMLFPQPAQAGDAFHQILNGL 385
>gi|293334733|ref|NP_001169191.1| uncharacterized protein LOC100383044 [Zea mays]
gi|223975431|gb|ACN31903.1| unknown [Zea mays]
gi|413957212|gb|AFW89861.1| hypothetical protein ZEAMMB73_311893 [Zea mays]
Length = 428
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 83/300 (27%), Positives = 138/300 (46%), Gaps = 41/300 (13%)
Query: 133 PDRAHLFFIPI--SCHKMRGKG-TSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTC 189
P+ A LFF+P+ SC+ G S + ++ D VD + PYWNR+ GADH FV
Sbjct: 121 PEDADLFFVPVYVSCNFSTPNGFPSLSHARGMLADAVDLVQAGMPYWNRSAGADHVFVAS 180
Query: 190 HDVG--------VRATEGVPFLIKNAIRV---------VCSPSYDVAFIPH------KDV 226
HD G V +G+P +K +I + C V PH ++
Sbjct: 181 HDFGACFHPMEDVAIADGIPEFLKRSILLQTFGVQGHHTCQEVEHVVIPPHVPPEVEHEL 240
Query: 227 ALPQVLQP--FALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINR 284
P+ Q FA RG +V + + G + + K+R L + + + + + R +
Sbjct: 241 PEPEKAQRDIFAFFRGKMEVHPKNISGRFY---SKKVRTELLQHYGRNRKFYLKRKRFDN 297
Query: 285 AIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKF 344
Y+ ++ FC+CP G S R+ +S+ GC+PVI+++ +PF +L W +
Sbjct: 298 ------YRSEMARSLFCLCPLGWAPWSPRLVESVLLGCIPVIIADNIRMPFPSVLQWPEI 351
Query: 345 AVILNERDVYQLKQILKNISDAEFVSLHNNL---VKVQKHFQWNSPPVRFDAFHMVMYEL 401
++ + E+DV L+ +L ++ + NL VK +K +N P DA V+ EL
Sbjct: 352 SLQVAEKDVANLEVVLDHVVATNLSVIQKNLWDPVK-RKALVFNRPMEVGDATWQVLREL 410
>gi|357163140|ref|XP_003579636.1| PREDICTED: probable glucuronosyltransferase Os04g0398600-like
[Brachypodium distachyon]
Length = 429
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 89/333 (26%), Positives = 144/333 (43%), Gaps = 51/333 (15%)
Query: 104 QTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTVI 161
+ PR L +A+E + + + S T +P A F+ P+ +C +
Sbjct: 81 KDPRCLNHMFAAEIFMHRFLLSSAVRTLNPKEADWFYAPVYTTCDLTPAGLPLPFKSPRV 140
Query: 162 VKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRV--- 210
++ + + NK+P+WN+T GADHFFV HD G +A E G+ L++ A V
Sbjct: 141 MRSAIQYISNKWPFWNKTDGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTF 200
Query: 211 -----VCSPSYDVAFIPHKDVALPQVLQPFALP-----------RGG-RDVENRTVLGFW 253
VC + P A PQ +Q +P RG D N G++
Sbjct: 201 GQENHVCLKEGSIIIPP---FAPPQKMQAHLIPPDTPRSIFVYFRGLFYDTGNDPEGGYY 257
Query: 254 AGHRNSKIRVILARVWEN---DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVN 310
A R A +WEN + DIS P Y + FC+CP G
Sbjct: 258 A-------RGARASLWENFKNNPLFDISTEH------PATYYEDMQRAVFCLCPLGWAPW 304
Query: 311 SARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVS 370
S R+ +++ +GC+PVI+++ LPF D + W + V + E+DV +L IL ++ + +
Sbjct: 305 SPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFIEEKDVPKLDTILTSMPIEDILR 364
Query: 371 LHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
L +++ + P DAFH ++ L
Sbjct: 365 KQRLLANPSMKQAMLFPQPAQARDAFHQILNGL 397
>gi|357492457|ref|XP_003616517.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
gi|355517852|gb|AES99475.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
Length = 259
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 53/152 (34%), Positives = 84/152 (55%), Gaps = 14/152 (9%)
Query: 85 MVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPIS 144
M NFK+++Y PNT + + K E F+ +++ S +LT P++AHLFF+P S
Sbjct: 1 MAQNFKVFMY---QPNTNTNIT-QFSFKTEQESLFYSSLQNSSYLTQHPEQAHLFFLPFS 56
Query: 145 CHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLI 204
+ T + ++ + N +PYWNR+LGADHF+++C + + L
Sbjct: 57 S----------DTSTRSLARFISRIRNDFPYWNRSLGADHFYLSCDGISHVNDRNIVELK 106
Query: 205 KNAIRVVCSPSYDVAFIPHKDVALPQVLQPFA 236
KNAI++ C P+ +FIPHKD+ LP + P A
Sbjct: 107 KNAIQIACFPTRHRSFIPHKDITLPPITNPHA 138
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 38/123 (30%), Positives = 63/123 (51%), Gaps = 16/123 (13%)
Query: 294 RFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYY--DLPFNDILDWRKFAVILNER 351
R +FC+ + + + +++ GCVPV+++ D+PF ++L WR+ AV +
Sbjct: 141 RLSNEEFCVVEYQND-DVLWLGEALRLGCVPVVVTEEAVNDMPFMEVLRWREMAVFVKSG 199
Query: 352 DVYQLKQILKNISDAEFVSLHNNLVKV----QKHFQWNSPPVRFDAFHMVMYELWLRRHV 407
NI + N+ ++ KH +WN P FDAF+ +MY+LWLRRH
Sbjct: 200 ---------VNIETDTWRERKGNMRRLGVVGSKHLRWNRPAQPFDAFNTIMYQLWLRRHT 250
Query: 408 IKY 410
I+Y
Sbjct: 251 IRY 253
>gi|326514612|dbj|BAJ96293.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 428
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 82/301 (27%), Positives = 139/301 (46%), Gaps = 42/301 (13%)
Query: 133 PDRAHLFFIPI--SCHKMRGKG-TSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTC 189
P+ A LF +P+ C+ G S + ++ D V + + P+WNR+ GADH FV
Sbjct: 121 PEEADLFLVPVYACCNFSTPTGLPSLAHARGLLADAVGLVRAQMPFWNRSAGADHVFVAS 180
Query: 190 HDVG--------VRATEGVPFLIKNAI---------RVVCSPSYDVAFIPH--KDVALPQ 230
HD G V G+P +K +I R C V P+ ++A +
Sbjct: 181 HDFGACFHPMEDVAMAAGIPEFLKGSILLQTFGVQGRHPCQDVEHVVIPPYVPPELAPRE 240
Query: 231 VLQP-------FALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRIN 283
+ +P FA RG +V + + G + + K+R L R++ + + + R +
Sbjct: 241 LPEPEKAHRDIFAFFRGKMEVHPKNISGHFY---SRKVRTELLRLYGRNRKFYLKRKRND 297
Query: 284 RAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRK 343
Y+ ++ FCICP G S R+ +S+ GC+PV++++ LPF +L W
Sbjct: 298 ------GYRSEMARSLFCICPLGWAPWSPRLVESVLLGCIPVVIADDIRLPFPGVLRWPD 351
Query: 344 FAVILNERDVYQLKQILKNISDAEFVSLHNNL---VKVQKHFQWNSPPVRFDAFHMVMYE 400
++ + ERDV L+ +L +++ ++ NL VK +K +N P DA V+ E
Sbjct: 352 ISLQVAERDVAGLEAVLDHVAATNLTTIQGNLWDPVK-RKALVFNRPMEEGDATWQVLKE 410
Query: 401 L 401
L
Sbjct: 411 L 411
>gi|10176877|dbj|BAB10084.1| unnamed protein product [Arabidopsis thaliana]
gi|23821294|dbj|BAC20929.1| NpGUT1 homolog [Arabidopsis thaliana]
Length = 341
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 88/321 (27%), Positives = 136/321 (42%), Gaps = 51/321 (15%)
Query: 113 YASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTVIVKDYVDSLI 170
+A+E Y + + S T +P+ A F++P+ +C +++ + +
Sbjct: 2 FAAEIYMQRFLLSSPVRTLNPEEADWFYVPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIA 61
Query: 171 NKYPYWNRTLGADHFFVTCHDVGV--------RATEGVPFLIKNAIRV--------VCSP 214
+ +PYWNRT GADHFFV HD G G+ L++ A V VC
Sbjct: 62 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQRNHVCLK 121
Query: 215 SYDVAFIPHKDVALPQVLQPFALPRGGR------------DVENRTVLGFWAGHRNSKIR 262
+ P+ A PQ +Q +P DV N G++A R
Sbjct: 122 EGSITVPPY---APPQKMQSHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYA-------R 171
Query: 263 VILARVWEN--DTEL-DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIH 319
A VWEN D L DIS P Y + FC+CP G S R+ +++
Sbjct: 172 GARAAVWENFKDNPLFDISTEH------PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVI 225
Query: 320 YGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLV--K 377
+GC+PVI+++ LPF D + W V ++E+DV L IL +I + L
Sbjct: 226 FGCIPVIIADDIVLPFADAIPWEDIGVFVDEKDVPYLDTILTSIPPEVILRKQRLLANPS 285
Query: 378 VQKHFQWNSPPVRFDAFHMVM 398
+++ + P DAFH V+
Sbjct: 286 MKQAMLFPQPAQPGDAFHQVL 306
>gi|115458146|ref|NP_001052673.1| Os04g0398600 [Oryza sativa Japonica Group]
gi|75143148|sp|Q7XLG3.2|GT42_ORYSJ RecName: Full=Probable glucuronosyltransferase Os04g0398600
gi|38347167|emb|CAE05157.2| OSJNBa0039C07.13 [Oryza sativa Japonica Group]
gi|113564244|dbj|BAF14587.1| Os04g0398600 [Oryza sativa Japonica Group]
gi|116309465|emb|CAH66536.1| H0209H04.3 [Oryza sativa Indica Group]
gi|125548129|gb|EAY93951.1| hypothetical protein OsI_15724 [Oryza sativa Indica Group]
gi|125590243|gb|EAZ30593.1| hypothetical protein OsJ_14642 [Oryza sativa Japonica Group]
gi|215706942|dbj|BAG93402.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215741563|dbj|BAG98058.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 420
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 89/333 (26%), Positives = 146/333 (43%), Gaps = 51/333 (15%)
Query: 104 QTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTVI 161
+ PR L +A+E + + + S T +P A F+ P+ +C +
Sbjct: 72 KDPRCLNHMFAAEIFMHRFLLSSAVRTLNPKEADWFYTPVYTTCDLTPAGLPLPFKSPRV 131
Query: 162 VKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRV--- 210
++ + + +K+P+WNRT GADHFFV HD G +A E G+ L++ A V
Sbjct: 132 MRSAIQYISHKWPFWNRTDGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTF 191
Query: 211 -----VCSPSYDVAFIPHKDVALPQVLQPFALP-----------RGG-RDVENRTVLGFW 253
VC + P+ A PQ +Q +P RG D N G++
Sbjct: 192 GQENHVCLKEGSITIPPY---APPQKMQAHLIPPDTPRSIFVYFRGLFYDTGNDPEGGYY 248
Query: 254 AGHRNSKIRVILARVWEN---DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVN 310
A R A +WEN + DIS + P Y + FC+CP G
Sbjct: 249 A-------RGARASLWENFKNNPLFDISTDH------PPTYYEDMQRAVFCLCPLGWAPW 295
Query: 311 SARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVS 370
S R+ +++ +GC+PVI+++ LPF D + W + V + E+DV +L IL ++ + +
Sbjct: 296 SPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVEEKDVPKLDTILTSMPIDDILR 355
Query: 371 LHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
L +++ + P DAFH ++ L
Sbjct: 356 KQRLLANPSMKQAMLFPQPAQPRDAFHQILNGL 388
>gi|242090769|ref|XP_002441217.1| hypothetical protein SORBIDRAFT_09g022500 [Sorghum bicolor]
gi|241946502|gb|EES19647.1| hypothetical protein SORBIDRAFT_09g022500 [Sorghum bicolor]
Length = 422
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 96/361 (26%), Positives = 153/361 (42%), Gaps = 66/361 (18%)
Query: 86 VTNFKIYIYPDGDPNTFYQT------PRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLF 139
V K+++Y D Y T PR LT +A+E + +++ S T DP+ A F
Sbjct: 51 VGKLKVFVY---DLPAKYNTKPVEKDPRCLTHMFATEIFVHRSLLSSAVRTLDPEEADWF 107
Query: 140 FIPI--SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV--- 194
+ P+ +C + +++ + + +++PYWNR+ GADHFFVT HD G
Sbjct: 108 YTPVYTTCDLTASGHPMPFDSPRMMRSAIRLIADRWPYWNRSEGADHFFVTPHDFGACFH 167
Query: 195 -----RATEGVPFLIKNAIRV--------VC--SPSYDVAFIPHKDV-ALPQVLQPFALP 238
G+ +++ A V VC + P+ + L P A P
Sbjct: 168 FQEEKAMARGILPVLRRATLVQTFGQRNHVCLKDGGGSITIPPYAPPWKMEAQLLPPATP 227
Query: 239 R------------GGRDVENRTVLGFWAGHRNSKIRVILARVWEN---DTELDISNNRIN 283
R G D E G++A R A VWEN + DIS
Sbjct: 228 RSIFVYFRGLFYDAGNDPEG----GYYA-------RGARASVWENFKSNPLFDISTAH-- 274
Query: 284 RAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRK 343
P Y + + FC+CP G S R+ +++ +GC+PV++++ LPF D + W
Sbjct: 275 ----PTTYYQDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVVIADDIVLPFADAIPWAD 330
Query: 344 FAVILNERDVYQLKQILKNISDAEFVSLHNNLV---KVQKHFQWNSPPVRFDAFHMVMYE 400
V + E DV +L IL +I E V L+ +++ + P DAFH ++
Sbjct: 331 IGVFVAEDDVPRLDTILTSIP-VEVVLRKQRLLASPAMKRAVLFPQPAQPGDAFHQILNG 389
Query: 401 L 401
L
Sbjct: 390 L 390
>gi|297822463|ref|XP_002879114.1| hypothetical protein ARALYDRAFT_481698 [Arabidopsis lyrata subsp.
lyrata]
gi|297324953|gb|EFH55373.1| hypothetical protein ARALYDRAFT_481698 [Arabidopsis lyrata subsp.
lyrata]
Length = 452
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 88/354 (24%), Positives = 157/354 (44%), Gaps = 45/354 (12%)
Query: 84 EMVTNFKIYIY--PDGDPNTFYQTPRKLTGKYASEGYFFQNIR--ESRFLTHDPDRAHLF 139
++ N KIY+Y P + R +A+E + E T DP A F
Sbjct: 93 DVFNNLKIYVYDLPSKFNKDWLANDRCSNHLFAAEVALHKAFLSLEGDIRTEDPYEADFF 152
Query: 140 FIPI--SCHKMRGKG-TSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV-- 194
F+P+ SC+ G + + ++ D + + +YP+WNR G+DH F HD G
Sbjct: 153 FVPVYVSCNFSTINGFPAIGHARTLINDAIKFVSTQYPFWNRNNGSDHVFTATHDFGSCF 212
Query: 195 -----RA-TEGVPFLIKNAIRVVCSPSYDVAF----IPHKDVALPQVLQPFALPRGGRDV 244
RA +GVP +++++I + ++ V F ++V +P + P +L + +++
Sbjct: 213 HTMEDRAIADGVPKILRSSIVL---QTFGVTFNHPCQEVENVVIPPYISPESLHKTLKNI 269
Query: 245 ---ENRTVLGFWAGHR------------NSKIRVILARVWENDTELDISNNRINRAIGPL 289
+ R + F+ G + ++R + R + D + R +
Sbjct: 270 PVNKERDIWAFFRGKMELHPKNISGRFYSKRVRTKIWRSYGGDRRFYLQRQRFSG----- 324
Query: 290 VYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
YQ ++ FC+CP G S R+ +S+ GCVPVI+++ LPF + W ++ +
Sbjct: 325 -YQLEIARSVFCLCPLGWAPWSPRLVESVALGCVPVIIADGIRLPFPSAVRWPDISLTVA 383
Query: 350 ERDVYQLKQILKNISDAEFVSLHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
ERDV +L IL+++ + NL V++ +N P DA V+ L
Sbjct: 384 ERDVGKLGDILEHVVATNLSVIQRNLEDPSVRRALMFNVPSREGDATWQVLEAL 437
>gi|386576428|gb|AFJ12118.1| glycosyltransferase, partial [Nicotiana tabacum]
Length = 294
Score = 104 bits (259), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 86/289 (29%), Positives = 128/289 (44%), Gaps = 49/289 (16%)
Query: 114 ASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTVIVKDYVDSLIN 171
A+E Y + + S T +P+ A F+ P+ +C +++ + + +
Sbjct: 1 AAEIYMHRFLLSSAVRTFNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIAS 60
Query: 172 KYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRV--------VCSPS 215
+PYWNRT GADHFF+T HD G +A E G+ L++ A V VC
Sbjct: 61 NWPYWNRTEGADHFFITPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLKD 120
Query: 216 YDVAFIPHKDVALPQVLQPFALP-----------RG-GRDVENRTVLGFWAGHRNSKIRV 263
+ P+ A PQ +Q +P RG DV N G++A R
Sbjct: 121 GSITIPPY---APPQKMQSHLIPPDTPRSIFVYFRGLFYDVGNDPEGGYYA-------RG 170
Query: 264 ILARVWEN--DTEL-DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHY 320
A VWEN D L DIS P Y + FC+CP G S R+ +++ +
Sbjct: 171 ARAAVWENFKDNPLFDISTEH------PTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVIF 224
Query: 321 GCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFV 369
GC+PVI+++ LPF D + W V + E+DV L IL +I E +
Sbjct: 225 GCIPVIIADDIVLPFADAIPWEDIGVFVAEKDVPNLDTILTSIPPEEIL 273
>gi|34394613|dbj|BAC83915.1| limonene cyclase like protein [Oryza sativa Japonica Group]
gi|50508945|dbj|BAD31849.1| limonene cyclase like protein [Oryza sativa Japonica Group]
Length = 417
Score = 104 bits (259), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 52/139 (37%), Positives = 82/139 (58%), Gaps = 9/139 (6%)
Query: 59 INKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGY 118
+ +DY +G V+ + FH +YLEM FK+++Y +G+P F+ P + Y++EG
Sbjct: 243 LTDKDYVPVGP-VYRNAYAFHRSYLEMEKVFKVFVYEEGEPPVFHDGPCR--SIYSTEGR 299
Query: 119 FFQNIR-ESRFLTHDPDRAHLFFIPISCHKM-----RGKGTSYENMTVIVKDYVDSLINK 172
F + E+R T DPD+AH+FF+P S KM + + + DY++ + K
Sbjct: 300 FIYAMEMENRMRTRDPDQAHVFFLPFSVVKMVKMIYEPNSHDMDPLRRTISDYINVVSTK 359
Query: 173 YPYWNRTLGADHFFVTCHD 191
YP+WNR+LGADHF ++CHD
Sbjct: 360 YPHWNRSLGADHFMLSCHD 378
>gi|326490197|dbj|BAJ94172.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326526653|dbj|BAK00715.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 422
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 92/330 (27%), Positives = 147/330 (44%), Gaps = 51/330 (15%)
Query: 107 RKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTVIVKD 164
R L +A+E + Q + S T +P+ A F+ P+ +C + I++
Sbjct: 77 RCLYHMFAAEIFMHQFLLASAVRTKNPEEADWFYTPVYVTCDLTQQGFPLPFRAPRIMRS 136
Query: 165 YVDSLINKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRV------ 210
+ + +PYWNRT GADHFF+ HD G RA E G+ L++ A V
Sbjct: 137 AIQYIAATWPYWNRTEGADHFFLAPHDFGACFHYQEERAIERGILPLLRRATLVQTFGQR 196
Query: 211 --VCSPSYDVAFIPHKDVALPQVLQP-----------FALPRG-GRDVENRTVLGFWAGH 256
VC + P+ + PQ +Q FA RG D+ N G++A
Sbjct: 197 NHVCMQEGSITIPPYAN---PQKMQAHLISPGTPRSIFAYFRGLFYDMGNDPEGGYYA-- 251
Query: 257 RNSKIRVILARVWEN--DTEL-DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSAR 313
R A VWEN D L D+S+ P Y + FC+CP G S R
Sbjct: 252 -----RGARASVWENFKDNPLFDMSSEH------PSTYYEDMQRAIFCLCPLGWAPWSPR 300
Query: 314 IADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHN 373
+ +++ +GC+PVI+++ LPF D + W + +V + E DV +L IL +++ + +
Sbjct: 301 LVEAVVFGCIPVIIADDIVLPFADAIPWEQISVFVAEADVPRLDSILASVAPEDVLRKQR 360
Query: 374 NLVK--VQKHFQWNSPPVRFDAFHMVMYEL 401
L +++ ++ P DAF V+ L
Sbjct: 361 LLASPAMKQAVLFHQPARPGDAFDQVLNGL 390
>gi|125539672|gb|EAY86067.1| hypothetical protein OsI_07437 [Oryza sativa Indica Group]
Length = 342
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 89/328 (27%), Positives = 144/328 (43%), Gaps = 59/328 (17%)
Query: 113 YASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTVIVKDYVDSLI 170
+A+E + + + S T +P++A F+ P+ +C +++ + L
Sbjct: 2 FAAEIFMHRFLLSSAVRTLNPEQADWFYAPVYTTCDLTHAGLPLPFKSPRMMRSAIQFLS 61
Query: 171 NKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRV--------VCSP 214
K+P+WNRT GADHFFV HD G +A E G+ L++ A V VC
Sbjct: 62 RKWPFWNRTDGADHFFVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQKNHVCLK 121
Query: 215 SYDVAFIPHKDVALPQVLQPFALP-----------RG-----GRDVENRTVLGFWAGHRN 258
+ P+ A PQ +Q +P RG G D E G++A
Sbjct: 122 EGSITIPPY---APPQKMQAHLIPPDTPRSIFVYFRGLFYDNGNDPEG----GYYA---- 170
Query: 259 SKIRVILARVWEN---DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIA 315
R A +WEN + DIS P Y ++ FC+CP G S R+
Sbjct: 171 ---RGARASLWENFKNNPLFDISTEH------PATYYEDMQRSVFCLCPLGWAPWSPRLV 221
Query: 316 DSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNL 375
+++ +GC+PVI+++ LPF D + W + V ++E DV +L IL +I + + L
Sbjct: 222 EAVVFGCIPVIIADDIVLPFADAIPWDEIGVFVDEEDVPRLDSILTSIPIDDILRKQRLL 281
Query: 376 V--KVQKHFQWNSPPVRFDAFHMVMYEL 401
+++ + P DAFH ++ L
Sbjct: 282 ANPSMKQAMLFPQPAQPRDAFHQILNGL 309
>gi|428181360|gb|EKX50224.1| hypothetical protein GUITHDRAFT_104038 [Guillardia theta CCMP2712]
Length = 723
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 98/345 (28%), Positives = 154/345 (44%), Gaps = 59/345 (17%)
Query: 113 YASEGYFFQNIRESRFLTHDPDRAHLFFIPI------SCHKMRGKGTSYENMTVIVKDY- 165
YA+E Y + + S LT DP A LFFIP+ S H +R +N I K Y
Sbjct: 372 YAAEVYIHEQLLLSDSLTLDPGAADLFFIPLYAACFLSSHFVRPGPGWPDNDVDIGKTYQ 431
Query: 166 -----VDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFL-IKNAIRVVCS------ 213
++ + YP+++R+ GADH V D G + +G PFL + N+I +V S
Sbjct: 432 AVQLVLEHVRQTYPFFDRSAGADHVLVLSSDWG--SCQG-PFLELHNSILLVTSGDRTLV 488
Query: 214 -PSYDVA--------------------FIPHKDVALPQVLQPFALPRG--GRDVENRTVL 250
P++ A F KDV +P ++ AL G R +L
Sbjct: 489 RPAWYAARAADHMGSSEEFAVRSRLPCFQLFKDVVIPPLVPHPALTASYMGERTRGRDIL 548
Query: 251 GFWAGHRNSKIRVILARVWENDTELDISN---NRINRAIGPLV--------YQRRFYKTK 299
++ G ++ +L + D L I R +R G +V Y ++
Sbjct: 549 VYFRGTAAGSVKALL---YNKDYSLGIRQLLLRRYSRVRGWVVSDRINSSSYHDELLRSV 605
Query: 300 FCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQI 359
FC+ P G ++ S R ++I GC+PV+L++ LPF LD+ +F V + +R + +L+ I
Sbjct: 606 FCLAPAGWELWSVRFFEAILLGCIPVLLTDDVQLPFQQRLDYSRFTVKVEQRRILELESI 665
Query: 360 LKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLR 404
L +I++ L +V K + PP DAF +M EL R
Sbjct: 666 LSSINETVIRRKQEGLKEVWKRMTYQRPPEDGDAFTGIMDELARR 710
>gi|242072854|ref|XP_002446363.1| hypothetical protein SORBIDRAFT_06g014730 [Sorghum bicolor]
gi|241937546|gb|EES10691.1| hypothetical protein SORBIDRAFT_06g014730 [Sorghum bicolor]
Length = 430
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 88/333 (26%), Positives = 144/333 (43%), Gaps = 51/333 (15%)
Query: 104 QTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTVI 161
+ PR L+ +A+E + + + S T +P A F+ P+ +C +
Sbjct: 82 KDPRCLSHMFAAEIFMHRFLLSSAVRTLNPKEADWFYTPVYTTCDLTNAGLPLPFKSPRV 141
Query: 162 VKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRV--- 210
++ + + NK+P+WNRT GADHFFV HD +A E G+ L++ A V
Sbjct: 142 MRSAIQYISNKWPFWNRTDGADHFFVVPHDFAACFHYQEEKAIERGILPLLRRATLVQTF 201
Query: 211 -----VCSPSYDVAFIPHKDVALPQVLQPFAL-PRGGR-----------DVENRTVLGFW 253
VC + P A PQ +Q + P R D N G++
Sbjct: 202 GQENHVCLKEGSIIIPP---FAPPQKMQAHLISPDTPRSIFVYFRGLFYDTGNDPEGGYY 258
Query: 254 AGHRNSKIRVILARVWEN---DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVN 310
A R A +WEN + DIS + P Y + FC+CP G
Sbjct: 259 A-------RGARASLWENFKSNPLFDISTDH------PATYYEDMQRAVFCLCPLGWAPW 305
Query: 311 SARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVS 370
S R+ +++ +GC+PVI+++ LPF D + W + V + E+DV +L IL ++ + +
Sbjct: 306 SPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVEEKDVPKLDTILTSMPIDDILR 365
Query: 371 LHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
L +++ + P DAFH ++ L
Sbjct: 366 KQRLLANPSMKQAMLFPQPAQPRDAFHQILNGL 398
>gi|110288702|gb|ABB46945.2| exostosin family protein, putative, expressed [Oryza sativa
Japonica Group]
gi|215766610|dbj|BAG98714.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 341
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 86/321 (26%), Positives = 136/321 (42%), Gaps = 45/321 (14%)
Query: 113 YASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTVIVKDYVDSLI 170
+A+E + + + S T +PD A F+ P+ +C +++ + +
Sbjct: 2 FATEIFMHRFLLSSAIRTSNPDEADWFYTPVYTTCDLTPWGHPLTTKSPRMMRSAIKFIS 61
Query: 171 NKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRVVCSPSYDVAFIP 222
+PYWNRT GADHFFV HD +A E G+ +++ A V + A +
Sbjct: 62 KYWPYWNRTEGADHFFVVPHDFAACFYFQEAKAIERGILPVLRRATLVQTFGQKNHACLK 121
Query: 223 HKDVALPQV---------LQPFALPRG--------GRDVENRTVLGFWAGHRNSKIRVIL 265
+ +P L P PR D N G++A R
Sbjct: 122 DGSITVPPYTPAHKIRAHLVPPETPRSIFVYFRGLFYDTSNDPEGGYYA-------RGAR 174
Query: 266 ARVWEN---DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGC 322
A VWEN + DIS + P Y + FC+CP G S R+ +++ +GC
Sbjct: 175 ASVWENFKNNPMFDISTDH------PQTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGC 228
Query: 323 VPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVK--VQK 380
+PVI+++ LPF+D + W + AV + E DV QL IL +I + L + +++
Sbjct: 229 IPVIIADDIVLPFSDAIPWEEIAVFVAEDDVPQLDTILTSIPTEVILRKQAMLAEPSMKQ 288
Query: 381 HFQWNSPPVRFDAFHMVMYEL 401
+ P D FH VM L
Sbjct: 289 TMLFPQPAEPGDGFHQVMNAL 309
>gi|413932967|gb|AFW67518.1| hypothetical protein ZEAMMB73_420245 [Zea mays]
Length = 386
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 89/334 (26%), Positives = 147/334 (44%), Gaps = 54/334 (16%)
Query: 104 QTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCH-KMRGKGTSYENMTV 160
+ PR L +A+E + + + S T +P+ A F+ P+ +C G +++ +
Sbjct: 39 KDPRCLNHMFAAEIFMHRLLLSSAVRTFNPEEADWFYTPVYATCDLTPSGLPLPFKSPRM 98
Query: 161 IVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV--------RATEGVPFLIKNAIRV-- 210
++ ++ + +PYWNR+ GADHFFVT HD G G+ L+++A V
Sbjct: 99 MLS-AIELIATNWPYWNRSEGADHFFVTPHDFGACFHYQDEKAIGRGILPLLQHATLVQT 157
Query: 211 ------VCSPSYDVAFIPHKDVALPQVLQPFALP-----------RG-GRDVENRTVLGF 252
VC + P A PQ +Q +P RG D N G+
Sbjct: 158 FGQKNHVCLKGGSITIPP---FAPPQKMQAHLIPADTPRSIFVYFRGLFYDTSNDPEGGY 214
Query: 253 WAGHRNSKIRVILARVWE---NDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQV 309
+A R A VWE N+ DIS + P Y ++ FC+CP G
Sbjct: 215 YA-------RGARASVWENFKNNPLFDISTDH------PSTYYEDMERSVFCLCPLGWAP 261
Query: 310 NSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFV 369
S R+ +++ +GC+P+I+++ LPF D + W + V + E DV +L IL +I +
Sbjct: 262 WSPRLVEAVVFGCIPLIIADIV-LPFADAIPWEEIGVFVAEEDVPKLDSILTSIPTDVIL 320
Query: 370 SLHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
L +++ + P DAFH ++ L
Sbjct: 321 RKQRLLANPSMKQAMLFPQPAQAGDAFHQILNGL 354
>gi|326509565|dbj|BAJ86998.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326509967|dbj|BAJ87200.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 415
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 92/331 (27%), Positives = 145/331 (43%), Gaps = 53/331 (16%)
Query: 107 RKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTVIVKD 164
R L+ +A+E + + + S T +P+ A F+ P+ +C I++
Sbjct: 70 RCLSHMFAAEIFMHRFLLSSAIRTMNPEEADWFYTPVYTTCDLTPWGHPLPFKSPRIMRS 129
Query: 165 YVDSLINKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRV------ 210
V + + +PYWNRT GADHFFV HD G +A E G+ L++ A V
Sbjct: 130 AVQFISSHWPYWNRTAGADHFFVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQK 189
Query: 211 --VCSPSYDVAFIPHKDVALPQVLQPFALP-----------RGG-RDVENRTVLGFWAGH 256
VC + P+ A PQ ++ +P RG D N G++A
Sbjct: 190 DHVCLKEGSINIPPY---APPQKMKTHLVPPETPRSIFVYFRGLFYDTANDPEGGYYA-- 244
Query: 257 RNSKIRVILARVWEN---DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSAR 313
R A VWEN + DIS + P Y + FC+CP G S R
Sbjct: 245 -----RGARASVWENFKNNPLFDISTDH------PPTYYEDMQRAIFCLCPLGWAPWSPR 293
Query: 314 IADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHN 373
+ +++ +GC+PVI+++ LPF D + W + V + E DV +L IL +I E +
Sbjct: 294 LVEAVVFGCIPVIIADDIVLPFADAIPWDEIGVFVAEDDVPKLDTILTSIP-MEVILRKQ 352
Query: 374 NLV---KVQKHFQWNSPPVRFDAFHMVMYEL 401
L+ +++ + P DAFH ++ L
Sbjct: 353 RLLANPSMKQAMLFPQPAQAGDAFHQILNGL 383
>gi|301072484|gb|ADK56172.1| glycosyltransferase 47 [Triticum aestivum]
Length = 415
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 92/331 (27%), Positives = 145/331 (43%), Gaps = 53/331 (16%)
Query: 107 RKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTVIVKD 164
R L+ +A+E + + + S T +P+ A F+ P+ +C I++
Sbjct: 70 RCLSHMFAAEIFMHRFLLSSAIRTMNPEEADWFYTPVYTTCDLTPWGHPLPFKSPRIMRS 129
Query: 165 YVDSLINKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRV------ 210
V + + +PYWNRT GADHFFV HD G +A E G+ L++ A V
Sbjct: 130 AVQFISSHWPYWNRTAGADHFFVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQK 189
Query: 211 --VCSPSYDVAFIPHKDVALPQVLQPFALP-----------RGG-RDVENRTVLGFWAGH 256
VC + P+ A PQ ++ +P RG D N G++A
Sbjct: 190 DHVCLKEGSINIPPY---APPQKMKTHLVPPETPRSIFVYFRGLFYDTANDPEGGYYA-- 244
Query: 257 RNSKIRVILARVWEN---DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSAR 313
R A VWEN + DIS + P Y + FC+CP G S R
Sbjct: 245 -----RGARASVWENFKNNPLFDISTDH------PPTYYEDMQRAIFCLCPLGWAPWSPR 293
Query: 314 IADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHN 373
+ +++ +GC+PVI+++ LPF D + W + V + E DV +L IL +I E +
Sbjct: 294 LVEAVVFGCIPVIIADDIVLPFADAIPWDEIGVFVAEDDVPKLDTILTSIP-MEVILRKQ 352
Query: 374 NLV---KVQKHFQWNSPPVRFDAFHMVMYEL 401
L+ +++ + P DAFH ++ L
Sbjct: 353 RLLANPSMKQAMLFPQPAQPGDAFHQILNGL 383
>gi|357126596|ref|XP_003564973.1| PREDICTED: probable glucuronosyltransferase Os01g0926600-like
[Brachypodium distachyon]
Length = 411
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 91/331 (27%), Positives = 145/331 (43%), Gaps = 53/331 (16%)
Query: 107 RKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTVIVKD 164
R L+ +A+E + + + S T +P+ A F+ P+ +C I++
Sbjct: 66 RCLSHMFAAEIFMHRFLLSSAIRTMNPEEADWFYTPVYTTCDLTPWGHPLPFKSPRIMRS 125
Query: 165 YVDSLINKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRV------ 210
+ + + +PYWNRT GADHFFV HD G +A E G+ L++ A V
Sbjct: 126 AIQFISSHWPYWNRTAGADHFFVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQK 185
Query: 211 --VCSPSYDVAFIPHKDVALPQVLQPFALP-----------RGG-RDVENRTVLGFWAGH 256
VC + P+ A PQ ++ +P RG D N G++A
Sbjct: 186 DHVCLKEGSINIPPY---APPQKMKTHLVPPETPRSIFVYFRGLFYDTANDPEGGYYA-- 240
Query: 257 RNSKIRVILARVWEN---DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSAR 313
R A VWEN + DIS + P Y + FC+CP G S R
Sbjct: 241 -----RGARASVWENFKNNPLFDISTDH------PPTYYEDMQRAIFCLCPLGWAPWSPR 289
Query: 314 IADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHN 373
+ +++ +GC+PVI+++ LPF D + W + V + E DV +L IL +I E +
Sbjct: 290 LVEAVVFGCIPVIIADDIVLPFADAIPWDEIGVFVAEDDVPKLDTILTSIP-MEVILRKQ 348
Query: 374 NLV---KVQKHFQWNSPPVRFDAFHMVMYEL 401
L+ +++ + P DAFH ++ L
Sbjct: 349 RLLANPSMKQAMLFPQPAQPGDAFHQILNGL 379
>gi|194700342|gb|ACF84255.1| unknown [Zea mays]
gi|414587472|tpg|DAA38043.1| TPA: secondary cell wall glycosyltransferase family 47 [Zea mays]
Length = 427
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 87/330 (26%), Positives = 143/330 (43%), Gaps = 51/330 (15%)
Query: 107 RKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTVIVKD 164
R L+ +A+E + + + S T +P A F+ P+ +C +++
Sbjct: 82 RCLSHMFAAEIFMHRFLLSSAVRTLNPKEADWFYTPVYTTCDLTNAGLPLPFKSPRVMRS 141
Query: 165 YVDSLINKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRV------ 210
+ + NK+P+WNRT GADHFFV HD +A E G+ L++ A V
Sbjct: 142 AIQYISNKWPFWNRTDGADHFFVVPHDFAACFHYQEEKAIERGILPLLRRATLVQTFGQE 201
Query: 211 --VCSPSYDVAFIPHKDVALPQVLQPFAL-PRGGR-----------DVENRTVLGFWAGH 256
VC + P+ A PQ +Q + P R D N G++A
Sbjct: 202 NHVCLKEGSIIIPPY---APPQKMQAHLISPDTPRSIFVYFRGLFYDTGNDPEGGYYA-- 256
Query: 257 RNSKIRVILARVWEN---DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSAR 313
R A +WEN + DIS + P Y + FC+CP G S R
Sbjct: 257 -----RGARASLWENFKSNPLFDISTDH------PATYYEDMQRAVFCLCPLGWAPWSPR 305
Query: 314 IADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHN 373
+ +++ +GC+PVI+++ LPF D + W + V + E+DV +L IL ++ + +
Sbjct: 306 LVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVEEKDVPKLDTILTSMPIDDILRKQR 365
Query: 374 NLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
L +++ + P DAFH ++ L
Sbjct: 366 LLANPSMKQAMLFPQPAQPRDAFHQILNGL 395
>gi|307111456|gb|EFN59690.1| hypothetical protein CHLNCDRAFT_133230 [Chlorella variabilis]
Length = 569
Score = 101 bits (252), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 85/324 (26%), Positives = 142/324 (43%), Gaps = 47/324 (14%)
Query: 87 TNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCH 146
NFKIY+Y + NT + + F + ES T DP A LF+IP +
Sbjct: 242 ANFKIYMY---ELNTQWAYDNDHHIAWTGHDPFMEEFLESPVRTEDPSEASLFYIPAFLY 298
Query: 147 KMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPF--LI 204
G + T ++ D++ + +PYW+R G DHF D G G F LI
Sbjct: 299 SYSGNMAGGDEHTQLLLDHIRA---TWPYWDRHGGRDHFLFVPADRGT-CPWGSRFSDLI 354
Query: 205 K------NAIRVVCSPSYD-------VAFIPHKDVALPQVLQPFALPRGGRDVENRTVLG 251
+ ++ R +P + + P +D+ P +LP G L
Sbjct: 355 RIVHFGMHSTRTNHNPHFGHQGHPEFGCYNPLRDIVAAGTGAPLSLPWAG-------WLF 407
Query: 252 FWAGH-------RNSKIRVILARV---WENDTELDISNNRINRAIGPLVYQRRFYKTKFC 301
F+AG + + R+IL+ + W ND E S +N Y F + KFC
Sbjct: 408 FFAGSIRTDDNVYSGRTRLILSELVAQW-NDPEFSFSGGYVNN------YPAGFREAKFC 460
Query: 302 ICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILK 361
+ P G R+ SI GCVPV++ + P+ ++L + F++ L+ D+ QL++ L+
Sbjct: 461 LAPWGYGFG-MRLHQSILGGCVPVVIQEHVFQPYEEVLPYETFSLRLSNEDLPQLRETLR 519
Query: 362 NISDAEFVSLHNNLVKVQKHFQWN 385
+++D ++ L +V+ ++ F W
Sbjct: 520 SVTDEQYRELLEGVVRYKEAFSWE 543
>gi|357126594|ref|XP_003564972.1| PREDICTED: probable glucuronosyltransferase Os01g0926400-like
[Brachypodium distachyon]
Length = 423
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 92/327 (28%), Positives = 144/327 (44%), Gaps = 51/327 (15%)
Query: 107 RKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTVIVKD 164
R L +A+E + Q + S T DP+ A F+ P+ +C + +++
Sbjct: 78 RCLYHMFAAEIFMHQFLLASAVRTMDPEEADWFYTPVYTTCDLTQQGFPLPFRAPRMMRS 137
Query: 165 YVDSLINKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRV------ 210
V + +PYWNRT GADHFF+ HD G RA E G+ +++ A V
Sbjct: 138 AVQYIAATWPYWNRTEGADHFFLAPHDFGACFHYQEERAIERGILPVLRRATLVQTFGQR 197
Query: 211 --VCSPSYDVAFIPHKDVALPQVLQP-----------FALPRG-GRDVENRTVLGFWAGH 256
VC + P+ A PQ +Q FA RG D+ N G++A
Sbjct: 198 NHVCMQEGSITIPPY---ANPQKMQAHLISPGTPRSIFAYFRGLFYDMGNDPEGGYYA-- 252
Query: 257 RNSKIRVILARVWEN--DTEL-DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSAR 313
R A VWEN D L D+S P Y + FC+CP G S R
Sbjct: 253 -----RGARASVWENFKDNPLFDMSTEH------PSTYYEDMQRAIFCLCPLGWAPWSPR 301
Query: 314 IADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHN 373
+ +++ +GC+PVI+++ LPF D + W + +V + E DV +L IL ++ + +
Sbjct: 302 LVEAVVFGCIPVIIADDIVLPFADAIPWDQISVFVPEADVPRLDSILASVPPEDVLRKQR 361
Query: 374 NLVK--VQKHFQWNSPPVRFDAFHMVM 398
L +++ ++ P DAF V+
Sbjct: 362 LLASPAMKQAVLFHQPAQPRDAFDQVL 388
>gi|226499262|ref|NP_001149267.1| secondary cell wall-related glycosyltransferase family 47 precursor
[Zea mays]
gi|195625922|gb|ACG34791.1| secondary cell wall-related glycosyltransferase family 47 [Zea
mays]
Length = 427
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 87/330 (26%), Positives = 143/330 (43%), Gaps = 51/330 (15%)
Query: 107 RKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTVIVKD 164
R L+ +A+E + + + S T +P A F+ P+ +C +++
Sbjct: 82 RCLSHMFAAEIFMHRFLLSSAVRTLNPKEADWFYTPVYTTCDLTNAGLPLPFKSPRVMRS 141
Query: 165 YVDSLINKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRV------ 210
+ + NK+P+WNRT GADHFFV HD +A E G+ L++ A V
Sbjct: 142 AIQYISNKWPFWNRTDGADHFFVVPHDFAACFHYQEEKAIERGILPLLRRATLVQTFGQE 201
Query: 211 --VCSPSYDVAFIPHKDVALPQVLQPFAL-PRGGR-----------DVENRTVLGFWAGH 256
VC + P+ A PQ +Q + P R D N G++A
Sbjct: 202 NHVCLKEGSIIIPPY---APPQKMQAHLISPDTPRSIFVYFRGLFYDTGNDPEGGYYA-- 256
Query: 257 RNSKIRVILARVWEN---DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSAR 313
R A +WEN + DIS + P Y + FC+CP G S R
Sbjct: 257 -----RGARASLWENFKSNPLFDISTDH------PATYYEDMQRAVFCLCPLGWAPWSPR 305
Query: 314 IADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHN 373
+ +++ +GC+PVI+++ LPF D + W + V + E+DV +L IL ++ + +
Sbjct: 306 LVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVEEKDVPKLDTILTSMPIDDILRKQR 365
Query: 374 NLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
L +++ + P DAFH ++ L
Sbjct: 366 LLANPSMKQAMLFPQPAQPRDAFHQILNGL 395
>gi|326511078|dbj|BAJ91886.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 415
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 91/331 (27%), Positives = 145/331 (43%), Gaps = 53/331 (16%)
Query: 107 RKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTVIVKD 164
R L+ +A+E + + + S T +P+ A F+ P+ +C I++
Sbjct: 70 RCLSHMFAAEIFMHRFLLSSAIRTMNPEEADWFYTPVYTTCDLTPWGHPLPFKSPRIMRS 129
Query: 165 YVDSLINKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRV------ 210
V + + +PYWNRT GADHFFV HD G +A E G+ L++ A V
Sbjct: 130 AVQFISSHWPYWNRTAGADHFFVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQK 189
Query: 211 --VCSPSYDVAFIPHKDVALPQVLQPFALP-----------RGG-RDVENRTVLGFWAGH 256
VC + P+ A PQ ++ +P RG D N G++A
Sbjct: 190 DHVCLKEGSINIPPY---APPQKMKTHLVPPETPRSIFVYFRGLFYDTANDPEGGYYA-- 244
Query: 257 RNSKIRVILARVWEN---DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSAR 313
R A VWEN + DIS + P Y + FC+CP G S R
Sbjct: 245 -----RGARASVWENFKNNPLFDISTDH------PPTYYEDMQRAIFCLCPLGWAPWSPR 293
Query: 314 IADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHN 373
+ +++ +GC+PVI+++ LPF D + W + + + E DV +L IL +I E +
Sbjct: 294 LVEAVVFGCIPVIIADDIVLPFADAIPWDEIGMFVAEDDVPKLDTILTSIP-MEVILRKQ 352
Query: 374 NLV---KVQKHFQWNSPPVRFDAFHMVMYEL 401
L+ +++ + P DAFH ++ L
Sbjct: 353 RLLANPSMKQAMLFPQPAQAGDAFHQILNGL 383
>gi|125528940|gb|EAY77054.1| hypothetical protein OsI_05011 [Oryza sativa Indica Group]
Length = 422
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 93/337 (27%), Positives = 144/337 (42%), Gaps = 51/337 (15%)
Query: 100 NTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP--ISCHKMRGKGTSYEN 157
N + R L +A+E + Q + S T DP+ A F+ P +C
Sbjct: 70 NLLAKDSRCLQHMFAAEIFMHQFLLSSPVRTLDPEEADWFYTPAYTTCDLTPQGFPLPFR 129
Query: 158 MTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIR 209
I++ V + +PYWNRT GADHFF+ HD G RA E G+ +++ A
Sbjct: 130 APRIMRSAVRYVAATWPYWNRTDGADHFFLAPHDFGACFHYQEERAIERGILPVLRRATL 189
Query: 210 V--------VCSPSYDVAFIPHKDVALPQVLQPF----ALPRG--------GRDVENRTV 249
V C + P+ D P+ ++ A PR D+ N
Sbjct: 190 VQTFGQRHHPCLQPGSITVPPYAD---PRKMEAHRISPATPRSIFVYFRGLFYDMGNDPE 246
Query: 250 LGFWAGHRNSKIRVILARVWEN--DTEL-DISNNRINRAIGPLVYQRRFYKTKFCICPGG 306
G++A R A VWEN D L DIS P Y + FC+CP G
Sbjct: 247 GGYYA-------RGARASVWENFKDNPLFDISTEH------PATYYEDMQRAIFCLCPLG 293
Query: 307 SQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDA 366
S R+ +++ +GC+PVI+++ LPF D + W + +V + E DV +L IL ++
Sbjct: 294 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWGEISVFVAEEDVPRLDTILASVPLD 353
Query: 367 EFVSLHNNLVK--VQKHFQWNSPPVRFDAFHMVMYEL 401
E + L +++ ++ P DAFH ++ L
Sbjct: 354 EVIRKQRLLASPAMKQAVLFHQPARPGDAFHQILNGL 390
>gi|168003473|ref|XP_001754437.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694539|gb|EDQ80887.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 353
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 84/345 (24%), Positives = 147/345 (42%), Gaps = 47/345 (13%)
Query: 85 MVTNFKIYIY--PDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP 142
+ +IY+Y P + R +A+E + + S T DP A FFIP
Sbjct: 1 LTNGIRIYVYDLPAKFNEDWLADERCSNHLFAAEVAIHKVLMTSPIRTLDPCEADFFFIP 60
Query: 143 I--SCHKMRGKGTSYENMT-VIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV----- 194
+ SC G + ++ V+ + + +WNR+ G DH FV HD G
Sbjct: 61 VYVSCKFTPKTGFPWLGQARKFMEAAVNHVSTRMEFWNRSGGRDHIFVASHDYGACFHTL 120
Query: 195 ---RATEGVPFLIKNA---------------------IRVVCSPSYDVAFIPHKDVALPQ 230
G+P ++ + I SPS ++I KD Q
Sbjct: 121 ETEAIAHGIPEFMRKSLILQTFGVQDFHPCQAAEHIQIPPYVSPSVAASYI--KDPPERQ 178
Query: 231 VLQPFALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLV 290
FA RG ++ + V G + +R +L + + ++ + +R +
Sbjct: 179 KRNIFAFFRGKMEINPKNVSGLVY---SRGVRTVLYKKFSHNRRFLLKRHRTDN------ 229
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
YQ ++ FC+CP G S RI +++ +GCVPVI+++ LP++ +DW ++ + E
Sbjct: 230 YQLEMLRSTFCLCPVGWAPWSPRIVEAVVHGCVPVIIADNISLPYSHAIDWTGISLSVRE 289
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQ--KHFQWNSPPVRFDA 393
DV +L +IL N++ ++ +NL K + + + P V+ DA
Sbjct: 290 HDVPKLDKILLNVAATNLSTIQHNLWKEENRRALLFTDPLVKGDA 334
>gi|115441967|ref|NP_001045263.1| Os01g0926600 [Oryza sativa Japonica Group]
gi|75159222|sp|Q8S1X8.1|GT14_ORYSJ RecName: Full=Probable glucuronosyltransferase Os01g0926600
gi|20160727|dbj|BAB89669.1| P0482D04.16 [Oryza sativa Japonica Group]
gi|20805225|dbj|BAB92892.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
gi|113534794|dbj|BAF07177.1| Os01g0926600 [Oryza sativa Japonica Group]
gi|125528941|gb|EAY77055.1| hypothetical protein OsI_05013 [Oryza sativa Indica Group]
gi|125573177|gb|EAZ14692.1| hypothetical protein OsJ_04617 [Oryza sativa Japonica Group]
gi|215687152|dbj|BAG90922.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 415
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 90/330 (27%), Positives = 144/330 (43%), Gaps = 51/330 (15%)
Query: 107 RKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTVIVKD 164
R L+ +A+E + + + S T +P+ A F+ P+ +C I++
Sbjct: 70 RCLSHMFAAEIFMHRFLLSSAIRTLNPEEADWFYTPVYTTCDLTPWGHPLPFKSPRIMRS 129
Query: 165 YVDSLINKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRV------ 210
+ + + +PYWNRT GADHFFV HD G +A E G+ L++ A V
Sbjct: 130 AIQFISSHWPYWNRTDGADHFFVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQK 189
Query: 211 --VCSPSYDVAFIPHKDVALPQVLQPFALP-----------RGG-RDVENRTVLGFWAGH 256
VC + P+ A PQ ++ +P RG D N G++A
Sbjct: 190 DHVCLKEGSITIPPY---APPQKMKTHLVPPETPRSIFVYFRGLFYDTANDPEGGYYA-- 244
Query: 257 RNSKIRVILARVWEN---DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSAR 313
R A VWEN + DIS + P Y ++ FC+CP G S R
Sbjct: 245 -----RGARASVWENFKNNPLFDISTDH------PPTYYEDMQRSIFCLCPLGWAPWSPR 293
Query: 314 IADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHN 373
+ +++ +GC+PVI+++ LPF D + W + V + E DV +L IL +I +
Sbjct: 294 LVEAVVFGCIPVIIADDIVLPFADAIPWDEIGVFVAEDDVPKLDTILTSIPMDVILRKQR 353
Query: 374 NLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
L +++ + P DAFH ++ L
Sbjct: 354 LLANPSMKQAMLFPQPAQPGDAFHQILNGL 383
>gi|115441965|ref|NP_001045262.1| Os01g0926400 [Oryza sativa Japonica Group]
gi|75159223|sp|Q8S1X9.1|GT13_ORYSJ RecName: Full=Probable glucuronosyltransferase Os01g0926400;
AltName: Full=OsGT47D
gi|20160726|dbj|BAB89668.1| P0482D04.15 [Oryza sativa Japonica Group]
gi|20805224|dbj|BAB92891.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
gi|113534793|dbj|BAF07176.1| Os01g0926400 [Oryza sativa Japonica Group]
gi|215686656|dbj|BAG88909.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215707279|dbj|BAG93739.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 422
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 93/337 (27%), Positives = 144/337 (42%), Gaps = 51/337 (15%)
Query: 100 NTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP--ISCHKMRGKGTSYEN 157
N + R L +A+E + Q + S T DP+ A F+ P +C
Sbjct: 70 NLLAKDSRCLQHMFAAEIFMHQFLLSSPVRTLDPEEADWFYTPAYTTCDLTPQGFPLPFR 129
Query: 158 MTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIR 209
I++ V + +PYWNRT GADHFF+ HD G RA E G+ +++ A
Sbjct: 130 APRIMRSAVRYVAATWPYWNRTDGADHFFLAPHDFGACFHYQEERAIERGILPVLRRATL 189
Query: 210 V--------VCSPSYDVAFIPHKDVALPQVLQPF----ALPRG--------GRDVENRTV 249
V C + P+ D P+ ++ A PR D+ N
Sbjct: 190 VQTFGQRHHPCLQPGSITVPPYAD---PRKMEAHRISPATPRSIFVYFRGLFYDMGNDPE 246
Query: 250 LGFWAGHRNSKIRVILARVWEN--DTEL-DISNNRINRAIGPLVYQRRFYKTKFCICPGG 306
G++A R A VWEN D L DIS P Y + FC+CP G
Sbjct: 247 GGYYA-------RGARASVWENFKDNPLFDISTEH------PATYYEDMQRAIFCLCPLG 293
Query: 307 SQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDA 366
S R+ +++ +GC+PVI+++ LPF D + W + +V + E DV +L IL ++
Sbjct: 294 WAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWGEISVFVAEEDVPRLDTILASVPLD 353
Query: 367 EFVSLHNNLVK--VQKHFQWNSPPVRFDAFHMVMYEL 401
E + L +++ ++ P DAFH ++ L
Sbjct: 354 EVIRKQRLLASPAMKQAVLFHQPARPGDAFHQILNGL 390
>gi|359472749|ref|XP_002276440.2| PREDICTED: probable glycosyltransferase At5g11130-like [Vitis
vinifera]
gi|147815974|emb|CAN68074.1| hypothetical protein VITISV_007510 [Vitis vinifera]
gi|297737986|emb|CBI27187.3| unnamed protein product [Vitis vinifera]
Length = 461
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 82/303 (27%), Positives = 144/303 (47%), Gaps = 39/303 (12%)
Query: 112 KYASEGYFFQN-IRESR-----FLTHDPDRAHLFFI-----------PISCHKMRGKGTS 154
+++ E Y F + IRE R DP+ A LF++ PI G GT
Sbjct: 115 QHSGEWYLFSDLIREDRGRRYVVRVSDPEEADLFYVSFFSSLSLVVNPIRPANGEGAGTG 174
Query: 155 YENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCS- 213
Y + + ++ + + + YW R G DH F+ C D A + +KN + +V
Sbjct: 175 YSDEEM--QESLMEWLEQQEYWKRNNGRDHVFI-CQDPN--ALHLIVDRVKNGVLLVSDF 229
Query: 214 --PSYDVAFIPHKDVALPQV--LQPFALPRGGRDVENRTVLGFWAGHR----NSKIRVIL 265
D A + KDV LP ++ ++ G VENR L F+ G+R KIR +L
Sbjct: 230 GRLRSDTASLV-KDVILPYAHRIKSYS---GEIGVENRKSLLFFMGNRYRKEGGKIRDLL 285
Query: 266 ARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPV 325
++ E + ++ I + +R + Q + +KFC+ P G ++ R+ D+I CVPV
Sbjct: 286 FQILEQEEDVIIKHGAQSRESRRMASQG-MHSSKFCLHPAGDTPSACRLFDAIVSLCVPV 344
Query: 326 ILSNYYDLPFNDILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHF 382
I+S+ +LPF D++D+RK A+ ++ + L + L+ I+ + + +V ++F
Sbjct: 345 IVSDQIELPFEDVIDYRKIAIFVDSTSAVKPGFLVKNLRKITRERILEYQREMQEVTRYF 404
Query: 383 QWN 385
++
Sbjct: 405 EYE 407
>gi|168059925|ref|XP_001781950.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666596|gb|EDQ53246.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 356
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 76/294 (25%), Positives = 138/294 (46%), Gaps = 33/294 (11%)
Query: 113 YASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKG-TSYENMTVIVKDYVDSL 169
+ASE + + S T +P A FF+P+ SC G + +++D V+ +
Sbjct: 31 FASEVAIHKILLSSPIKTLNPYEADFFFMPVYFSCKFSSKTGFPRLGHAPKLMEDAVNHV 90
Query: 170 INKYPYWNRTLGADHFFVTCHDVGV--------RATEGVPFLIKNAIRVVCSPSYDV-AF 220
+ +WNR+ G DH FV HD G G+P ++++++ + ++ V F
Sbjct: 91 SSMMEFWNRSGGKDHVFVAAHDFGACFHSLESEAIAHGIPEIVQSSLIL---QTFGVHGF 147
Query: 221 IP---HKDVALPQVLQP---FALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTE 274
P +++ +P + P F+ + + + R + F+ G + + V+
Sbjct: 148 HPCQAAENIQIPPYISPSTVFSYVKKPPEEQRRNIFAFFRGKMEINPKNVSGLVYSRGVR 207
Query: 275 LDI----SNNRI-----NRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPV 325
I S NR +RA YQ ++ FC+CP G S RI +++ YGCVPV
Sbjct: 208 TYIYKKFSRNRRFFLKRHRADN---YQLDLLRSTFCLCPLGWAPWSPRIVEAVAYGCVPV 264
Query: 326 ILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQ 379
I+++ LP++ +DW ++ + E DV++L +IL N++ S+ NL K +
Sbjct: 265 IIADNIRLPYSHAIDWSNMSLNIREHDVHKLYKILLNVAAKNLSSIQKNLWKEE 318
>gi|116787500|gb|ABK24530.1| unknown [Picea sitchensis]
Length = 446
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 78/296 (26%), Positives = 141/296 (47%), Gaps = 34/296 (11%)
Query: 132 DPDRAHLFFIPI--------SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGAD 183
DPD+A +F++P MR T ++ I + VD ++ + W R+ G D
Sbjct: 139 DPDQADVFYVPFFASLSFNTHGQNMRDPETEFDKKLQI--EVVD-MLKRSKSWQRSGGRD 195
Query: 184 HFFVTCHDVGVR----ATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALP--QVLQPFAL 237
H V H R F++ + R S V+F+ KDV P V+ +
Sbjct: 196 HVIVIHHPNAFRFLRDEVNASIFVVADFGRYPRS----VSFL-RKDVVAPYVHVVDTYVN 250
Query: 238 PRGGRDVENRTVLGFWAGHRNSK----IRVILARVWENDTELDISNNRINRAIGPLVYQR 293
E+RT+L ++ G K +R+ LA++ N + ++ + G V ++
Sbjct: 251 DDSSDPFESRTMLLYFRGRTKRKDEGFVRLKLAKILGNHKRVHFEDS-LATTEGFEVAKQ 309
Query: 294 RFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDV 353
++FC+ P G +S R+ D+I CVPVI+S+ +LPF D +D+++F++ + ++
Sbjct: 310 GMRSSRFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDRIELPFEDEIDYQEFSLFFSVKEA 369
Query: 354 YQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRH 406
+ L Q L+ +++ + N L +V HF++ PP++ DA +M LW + H
Sbjct: 370 LRPGYLMQKLETFPKEKWLKMWNKLKQVAHHFEYQYPPIKDDAVNM----LWRQIH 421
>gi|15237870|ref|NP_197191.1| Exostosin family protein [Arabidopsis thaliana]
gi|9755690|emb|CAC01702.1| putative protein [Arabidopsis thaliana]
gi|15810401|gb|AAL07088.1| unknown protein [Arabidopsis thaliana]
gi|23296585|gb|AAN13125.1| unknown protein [Arabidopsis thaliana]
gi|332004972|gb|AED92355.1| Exostosin family protein [Arabidopsis thaliana]
Length = 511
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 71/241 (29%), Positives = 111/241 (46%), Gaps = 35/241 (14%)
Query: 170 INKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAI----------------RVVCS 213
+ P W R+ G DH F H ++ V +KNAI +V
Sbjct: 224 VTDQPAWKRSEGRDHIFPIHHPWSFKS---VRKFVKNAIWLLPDMDSTGNWYKPGQVSLE 280
Query: 214 PSYDVAFIPHKDVALPQVLQPFALPRGGRDVENRTVLGFWAGH--RNS--KIRVILARVW 269
+ ++P+ D+ + L A RT L F+ G RN+ KIR L
Sbjct: 281 KDLILPYVPNVDICDTKCLSESA--------PMRTTLLFFRGRLKRNAGGKIRAKLGAEL 332
Query: 270 ENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSN 329
++ IS G L QR ++ FC+CP G +SAR+ D+I GC+PVI+S+
Sbjct: 333 SGIKDIIISEGTAGEG-GKLAAQRGMRRSLFCLCPAGDTPSSARLFDAIVSGCIPVIVSD 391
Query: 330 YYDLPFNDILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
+ PF ILD++K AV+++ D Q L L++++ + L NNL + +HF ++S
Sbjct: 392 ELEFPFEGILDYKKVAVLVSSSDAIQPGWLVNHLRSLTPFQVKGLQNNLAQYSRHFLYSS 451
Query: 387 P 387
P
Sbjct: 452 P 452
>gi|27817890|dbj|BAC55656.1| exostosin family protein-like protein [Oryza sativa Japonica Group]
Length = 453
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 80/307 (26%), Positives = 140/307 (45%), Gaps = 38/307 (12%)
Query: 112 KYASEGYFFQNIRESRFLTH------DPDRAHLFFIP------ISCHKMRGKGTSYENMT 159
++++E + F+++R L DP A LF++P + + +R +
Sbjct: 103 QHSAEWWLFKDLRRRGPLDRPVARVDDPADADLFYVPFFSSLSLVVNPIRPSAAAANASD 162
Query: 160 VIVKDYVDS--------LINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVV 211
Y D + + PYW R G DH F+ C D A V I NA+ ++
Sbjct: 163 AAEPAYSDESTQEELLVWLERQPYWRRHQGRDHVFI-CQDPN--ALYRVVDRISNAVLLI 219
Query: 212 CS----PSYDVAFIPHKDVALPQVLQPFALPRGGRDVENRTVLGFWAGHR----NSKIRV 263
S + + KDV LP + + +G VE+R L F+ G+R K+R
Sbjct: 220 SDFGRLRSEQASLV--KDVILPYAHRINSF-QGDVGVESRPSLLFFMGNRYRKEGGKVRD 276
Query: 264 ILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCV 323
L +V EN+ ++ I + +R + R + +KFC+ P G ++ R+ D++ CV
Sbjct: 277 TLFQVLENEADVIIKHGAQSRESRRMA-TRGMHSSKFCLHPAGDTPSACRLFDALVSLCV 335
Query: 324 PVILSNYYDLPFNDILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQK 380
PVI+S+Y +LPF D++D+R ++ + Q L L+ IS + + KV+
Sbjct: 336 PVIVSDYIELPFEDVIDYRNISIFVETSKAVQPGFLTSTLRGISSQRILEYQREIKKVKH 395
Query: 381 HFQWNSP 387
+F++ P
Sbjct: 396 YFEYEDP 402
>gi|348538380|ref|XP_003456670.1| PREDICTED: exostosin-1a-like [Oreochromis niloticus]
Length = 776
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 84/330 (25%), Positives = 141/330 (42%), Gaps = 57/330 (17%)
Query: 87 TNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCH 146
+ FK+YIYP + +T +K+ +I ESRF T DP RA LF + +
Sbjct: 141 SGFKVYIYPQQRGSEISETYKKI----------LTSIEESRFHTTDPLRACLFILAVDT- 189
Query: 147 KMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVG-VRATEGVPFLIK 205
+ ++V + S I P WN G +H + TE + F +
Sbjct: 190 ------LDRDQLSVQYVQNIRSRIQNLPTWND--GRNHLIFNLYSGSWPDYTEDLGFEVG 241
Query: 206 NAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGG------------RDVE-----NRT 248
A+ S V F + D+++P + L GG R + R
Sbjct: 242 QAMLAKASADV-VNFRSNYDISIPLFSKDHPLKGGGIGYLTLNDAPPSRKYQLVFKGKRY 300
Query: 249 VLGFWAGHRNSKIRV------IL------ARVWENDTELDISNNRINRAIGPLVYQRRFY 296
+ G + RN+ + IL + WE D +R N YQ +
Sbjct: 301 LTGIGSETRNALYHIHNGEDIILLTTCKHGKDWEKHK--DSRCDRDNEDYSKFDYQELLH 358
Query: 297 KTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQL 356
+ FC+ P G ++ S R +++ C+PVILSN ++LPF++++DWRK A+I +ER + Q+
Sbjct: 359 NSTFCLVPRGRRLGSFRFLEALQAACIPVILSNGWELPFSEVIDWRKAAIIGDERLLLQV 418
Query: 357 KQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
I +++ ++L + Q F W++
Sbjct: 419 PSITRSVGRDRILAL-----RQQTQFLWDA 443
>gi|222637299|gb|EEE67431.1| hypothetical protein OsJ_24780 [Oryza sativa Japonica Group]
Length = 500
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 80/307 (26%), Positives = 140/307 (45%), Gaps = 38/307 (12%)
Query: 112 KYASEGYFFQNIRESRFLTH------DPDRAHLFFIP------ISCHKMRGKGTSYENMT 159
++++E + F+++R L DP A LF++P + + +R +
Sbjct: 150 QHSAEWWLFKDLRRRGPLDRPVARVDDPADADLFYVPFFSSLSLVVNPIRPSAAAANASD 209
Query: 160 VIVKDYVDS--------LINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVV 211
Y D + + PYW R G DH F+ C D A V I NA+ ++
Sbjct: 210 AAEPAYSDESTQEELLVWLERQPYWRRHQGRDHVFI-CQDPN--ALYRVVDRISNAVLLI 266
Query: 212 CS----PSYDVAFIPHKDVALPQVLQPFALPRGGRDVENRTVLGFWAGHR----NSKIRV 263
S + + KDV LP + + +G VE+R L F+ G+R K+R
Sbjct: 267 SDFGRLRSEQASLV--KDVILPYAHRINSF-QGDVGVESRPSLLFFMGNRYRKEGGKVRD 323
Query: 264 ILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCV 323
L +V EN+ ++ I + +R + R + +KFC+ P G ++ R+ D++ CV
Sbjct: 324 TLFQVLENEADVIIKHGAQSRESRRMA-TRGMHSSKFCLHPAGDTPSACRLFDALVSLCV 382
Query: 324 PVILSNYYDLPFNDILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQK 380
PVI+S+Y +LPF D++D+R ++ + Q L L+ IS + + KV+
Sbjct: 383 PVIVSDYIELPFEDVIDYRNISIFVETSKAVQPGFLTSTLRGISSQRILEYQREIKKVKH 442
Query: 381 HFQWNSP 387
+F++ P
Sbjct: 443 YFEYEDP 449
>gi|115472841|ref|NP_001060019.1| Os07g0567000 [Oryza sativa Japonica Group]
gi|113611555|dbj|BAF21933.1| Os07g0567000 [Oryza sativa Japonica Group]
Length = 500
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 80/307 (26%), Positives = 140/307 (45%), Gaps = 38/307 (12%)
Query: 112 KYASEGYFFQNIRESRFLTH------DPDRAHLFFIP------ISCHKMRGKGTSYENMT 159
++++E + F+++R L DP A LF++P + + +R +
Sbjct: 150 QHSAEWWLFKDLRRRGPLDRPVARVDDPADADLFYVPFFSSLSLVVNPIRPSAAAANASD 209
Query: 160 VIVKDYVDS--------LINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVV 211
Y D + + PYW R G DH F+ C D A V I NA+ ++
Sbjct: 210 AAEPAYSDESTQEELLVWLERQPYWRRHQGRDHVFI-CQDPN--ALYRVVDRISNAVLLI 266
Query: 212 CS----PSYDVAFIPHKDVALPQVLQPFALPRGGRDVENRTVLGFWAGHR----NSKIRV 263
S + + KDV LP + + +G VE+R L F+ G+R K+R
Sbjct: 267 SDFGRLRSEQASLV--KDVILPYAHRINSF-QGDVGVESRPSLLFFMGNRYRKEGGKVRD 323
Query: 264 ILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCV 323
L +V EN+ ++ I + +R + R + +KFC+ P G ++ R+ D++ CV
Sbjct: 324 TLFQVLENEADVIIKHGAQSRESRRMA-TRGMHSSKFCLHPAGDTPSACRLFDALVSLCV 382
Query: 324 PVILSNYYDLPFNDILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQK 380
PVI+S+Y +LPF D++D+R ++ + Q L L+ IS + + KV+
Sbjct: 383 PVIVSDYIELPFEDVIDYRNISIFVETSKAVQPGFLTSTLRGISSQRILEYQREIKKVKH 442
Query: 381 HFQWNSP 387
+F++ P
Sbjct: 443 YFEYEDP 449
>gi|218199855|gb|EEC82282.1| hypothetical protein OsI_26516 [Oryza sativa Indica Group]
Length = 393
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 79/307 (25%), Positives = 139/307 (45%), Gaps = 38/307 (12%)
Query: 112 KYASEGYFFQNIRESR------FLTHDPDRAHLFFIP------ISCHKMRGKGTSYENMT 159
++++E + F+++R DP A LF++P + + +R +
Sbjct: 43 QHSAEWWLFKDLRRRGPRDRPVARVDDPADADLFYVPFFSSLSLVVNPIRPSAAAANASD 102
Query: 160 VIVKDYVDS--------LINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVV 211
Y D + + PYW R G DH F+ C D A V I NA+ ++
Sbjct: 103 AAEPAYSDESTQEELLVWLERQPYWRRHQGRDHVFI-CQDPN--ALYRVVDRISNAVLLI 159
Query: 212 CS----PSYDVAFIPHKDVALPQVLQPFALPRGGRDVENRTVLGFWAGHR----NSKIRV 263
S + + KDV LP + + +G VE+R L F+ G+R K+R
Sbjct: 160 SDFGRLRSEQASLV--KDVILPYAHRINSF-QGDVGVESRPSLLFFMGNRYRKEGGKVRD 216
Query: 264 ILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCV 323
L +V EN+ ++ I + +R + R + +KFC+ P G ++ R+ D++ CV
Sbjct: 217 TLFQVLENEADVIIKHGAQSRESRRMA-TRGMHSSKFCLHPAGDTPSACRLFDALVSLCV 275
Query: 324 PVILSNYYDLPFNDILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQK 380
PVI+S+Y +LPF D++D+R ++ + Q L L+ IS + + KV+
Sbjct: 276 PVIVSDYIELPFEDVIDYRNISIFVETSKAVQPGFLTSTLRGISSQRILEYQREIKKVKH 335
Query: 381 HFQWNSP 387
+F++ P
Sbjct: 336 YFEYEDP 342
>gi|356519776|ref|XP_003528545.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
[Glycine max]
Length = 452
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 81/328 (24%), Positives = 147/328 (44%), Gaps = 45/328 (13%)
Query: 93 IYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESR----------FLTHDPDRAHLFFIP 142
+ PD D P+ ++ +E Y F ++ + L DP+ A LFF+P
Sbjct: 89 VTPDDDA----AAPKYPGHQHMAEWYLFADLSRAESERAGSGSPVVLVADPEEADLFFVP 144
Query: 143 ------ISCHKMRGKGTSYENMTVIVKDYVD-----SLINKYPYWNRTLGADHFFVTCHD 191
+ + +R G++ + + D + + K YW R G DH V
Sbjct: 145 FFSSLSLIVNPVRPPGSNSGSEKPVYSDEENQEALVEWLEKQEYWKRNSGRDHVIVASDP 204
Query: 192 VGVRATEGVPFLIKNAIRVVCSPSYDVAFI-PHKDVALPQVLQPFA----LPRGGRDVEN 246
A V ++NA+ +V D + P + + V+ P++ +G VE+
Sbjct: 205 ---NAMYRVIDRVRNAVLLVS----DFGRLRPDQGSLVKDVVVPYSHRIRTYQGDAGVED 257
Query: 247 RTVLGFWAGHR----NSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCI 302
R L F+ G+R KIR IL ++ EN+ ++ I + +R Q + +KFC+
Sbjct: 258 RNTLLFFMGNRYRKEGGKIRDILFKILENEKDVIIKHGAQSRESRRAASQG-MHTSKFCL 316
Query: 303 CPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ---LKQI 359
P G ++ R+ D+I C+PVI+S+ +LPF D +D+RK AV + + L
Sbjct: 317 HPAGDTPSACRLFDAIVSLCIPVIVSDNIELPFEDTIDYRKLAVFIETSSAIKPGYLVSK 376
Query: 360 LKNISDAEFVSLHNNLVKVQKHFQWNSP 387
L+ ++ ++ L +V+++F++ P
Sbjct: 377 LRALTPDRVLAYQKELKEVKRYFEYEEP 404
>gi|242046000|ref|XP_002460871.1| hypothetical protein SORBIDRAFT_02g036640 [Sorghum bicolor]
gi|241924248|gb|EER97392.1| hypothetical protein SORBIDRAFT_02g036640 [Sorghum bicolor]
Length = 500
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 67/229 (29%), Positives = 114/229 (49%), Gaps = 18/229 (7%)
Query: 170 INKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCS----PSYDVAFIPHKD 225
+ + PYW R +G DH F+ C D A V I NA+ +V S + + KD
Sbjct: 226 LERQPYWRRHMGRDHVFI-CQDPN--ALYRVIDRISNAVLLVSDFGRLRSDQASLV--KD 280
Query: 226 VALPQVLQPFALPRGGRDVENRTVLGFWAGHR----NSKIRVILARVWENDTELDISNNR 281
V LP + + +G V+ R +L F+ G+R K+R L ++ EN+ ++ I +
Sbjct: 281 VILPYSHRINSF-KGEVGVDGRPLLLFFMGNRYRKEGGKVRDALFQILENEDDVTIKHGT 339
Query: 282 INRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDW 341
+R Q + +KFC+ P G ++ R+ D++ CVPVI+S+Y +LPF DI+D+
Sbjct: 340 QSRESRRAARQG-MHSSKFCLHPAGDTPSACRLFDALVSLCVPVIVSDYIELPFEDIIDY 398
Query: 342 RKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSP 387
K ++ + Q L +L+ IS + KV+++F++ P
Sbjct: 399 NKISIFVGTSKAVQPGYLTSMLRRISSERILEYQRETKKVKRYFEYEDP 447
>gi|255558009|ref|XP_002520033.1| catalytic, putative [Ricinus communis]
gi|223540797|gb|EEF42357.1| catalytic, putative [Ricinus communis]
Length = 507
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 74/235 (31%), Positives = 109/235 (46%), Gaps = 23/235 (9%)
Query: 170 INKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVA---FIP---- 222
+ P W R+ G DH H ++ V +KNAI ++ P D + P
Sbjct: 225 VTDQPAWKRSGGRDHILPVHHPWSFKS---VRRYVKNAIWLL--PDMDSTGNWYKPGQVF 279
Query: 223 -HKDVALPQV--LQPFALPRGGRDVENRTVLGFWAGH--RNS--KIRVILARVWENDTEL 275
KD+ LP V + + RT L F+ G RN+ KIR L +
Sbjct: 280 LEKDLILPYVPNVDLCDAKCASENESKRTTLLFFRGRLKRNAGGKIRAKLVAELSGAEGV 339
Query: 276 DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPF 335
+ G Q K+ FC+ P G +SAR+ D+I GC+PVI+S+ +LPF
Sbjct: 340 VVEEGTAGEG-GKAAAQTGMRKSIFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELELPF 398
Query: 336 NDILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSP 387
ILD+RK AV ++ D Q L + LK++S A+ + NLVK +HF ++SP
Sbjct: 399 EGILDYRKIAVFVSSSDAIQPGWLIKFLKDVSPAQTREMQRNLVKYSRHFLYSSP 453
>gi|390338044|ref|XP_783281.3| PREDICTED: exostosin-1b-like [Strongylocentrotus purpuratus]
Length = 702
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 85/330 (25%), Positives = 145/330 (43%), Gaps = 60/330 (18%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
FK+YIYP YQ P +L +IR SR+ T + D A LF I
Sbjct: 62 GFKVYIYP-----VSYQKPSQLYSD------ILTSIRASRYFTTNADEACLFVTSIDTLD 110
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHD-VGVRATEGVPFLIKN 206
++Y VK+ ++S I+K +WN G +H + +E + F I
Sbjct: 111 RDKLSSTY------VKN-IESSISKLSHWNN--GKNHLIFNLYSGTWPDYSEDLGFNIGE 161
Query: 207 AIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRD---------VENRTVLGFWAG-- 255
AI S S + F P+ D++LP + +GG V+ + +L F
Sbjct: 162 AILAKASFS-EQYFRPNFDISLPLFAKTHP-QKGGNSGDLQGNNFPVQRKYLLAFKGKRY 219
Query: 256 -------------HRNSKIRVIL------ARVWENDTELDISNNRINRAIGPLVYQRRFY 296
H ++ + +IL + W+ D+ + N YQ +
Sbjct: 220 LFGIGSDTRNALYHLHNGVDIILLTTCKHGKNWQKHK--DVRCDHDNAEFDKFDYQVLLH 277
Query: 297 KTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQL 356
+ FC+ P G ++ S R +S+ C+P++LSN ++LPF++++DW K V +ER + Q+
Sbjct: 278 NSTFCLVPRGRRLGSFRFLESLQAACIPMLLSNGWELPFSEVIDWSKAVVFGDERLLLQV 337
Query: 357 KQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
I+++I+ + + L++ Q F WNS
Sbjct: 338 PSIVRSITAEQIL-----LLRQQTQFLWNS 362
>gi|168016669|ref|XP_001760871.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687880|gb|EDQ74260.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 378
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 82/284 (28%), Positives = 134/284 (47%), Gaps = 31/284 (10%)
Query: 139 FFIPISCHKMRGKGTSYENMTVIVKDYVDSL-------INKYPYWNRTLGADHFFVTCHD 191
FF +S ++ GK + + +T ++KD D L + + P W + G DH FV H
Sbjct: 82 FFASLSYNRF-GKASEEKRLTDLIKDQNDVLQLKLVKFLEEQPAWKASGGRDHVFVIHHP 140
Query: 192 VGVRATEGVPFLIKNAIRVVCS-PSYD--VAFIPHKDVALP--QVLQPFALPRGGRDVEN 246
++AT ++N++ +V YD VA I KDV P V+ F
Sbjct: 141 NSMQATRN---RLRNSLFIVSDFGRYDSEVANI-QKDVVAPYKHVIPTFDFDDSS--FHT 194
Query: 247 RTVLGFWAG----HRNSKIRVILARVWEND--TELDISNNRINRAIGPLVYQRRFYKTKF 300
R +L F+ G KIR L R+ ++ N ++ + R +KF
Sbjct: 195 RKILLFFQGAIVRKEGGKIRHELYRLLKDKPGVRFTTGNTALDGFQSATIGMR---SSKF 251
Query: 301 CICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ---LK 357
C+ G +S R+ DSI CVPVI+S+ +LPF D LD+ F + +N + +
Sbjct: 252 CLNMAGDTPSSNRLFDSIVSHCVPVIISDDIELPFEDTLDYSNFCIFINSSLALKPGYVI 311
Query: 358 QILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYEL 401
+L+N+S+ E+ L N L+ V+ HF++ P + DA +MV ++
Sbjct: 312 NMLRNVSEEEWTQLWNQLLLVEHHFEYQHPTRKNDAVNMVWKDI 355
>gi|225461772|ref|XP_002285599.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
IRX7-like [Vitis vinifera]
Length = 513
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 75/235 (31%), Positives = 108/235 (45%), Gaps = 23/235 (9%)
Query: 170 INKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVA---FIP---- 222
+ P W R+ G DH H + +KNAI ++ P D + P
Sbjct: 234 VTDQPAWKRSEGRDHILPVHHPWSFKTVRKS---MKNAIWLL--PDMDSTGNWYKPGQVS 288
Query: 223 -HKDVALPQVLQ-PFALPRGGRDVEN-RTVLGFWAGH--RNS--KIRVILARVWENDTEL 275
KD+ LP V + + E+ R L F+ G RN+ KIR L D +
Sbjct: 289 LEKDLILPYVPNVDLCDAKCSSESESKRKTLLFFRGRLKRNAGGKIRAKLMAELSGDDGV 348
Query: 276 DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPF 335
I G QR K+ FC+ P G +SAR+ D+I GC+PVI+S+ +LPF
Sbjct: 349 VIQEGTAGEG-GKEAAQRGMRKSIFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELELPF 407
Query: 336 NDILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSP 387
ILD+RK A+ ++ D Q L LK+IS A+ + NL K +HF ++SP
Sbjct: 408 EGILDYRKIALFVSSSDAMQPGWLLTFLKSISPAQIKEMQRNLAKYSRHFVYSSP 462
>gi|302142837|emb|CBI20132.3| unnamed protein product [Vitis vinifera]
Length = 379
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 75/235 (31%), Positives = 108/235 (45%), Gaps = 23/235 (9%)
Query: 170 INKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVA---FIP---- 222
+ P W R+ G DH H + +KNAI ++ P D + P
Sbjct: 100 VTDQPAWKRSEGRDHILPVHHPWSFKTVRKS---MKNAIWLL--PDMDSTGNWYKPGQVS 154
Query: 223 -HKDVALPQVLQ-PFALPRGGRDVEN-RTVLGFWAGH--RNS--KIRVILARVWENDTEL 275
KD+ LP V + + E+ R L F+ G RN+ KIR L D +
Sbjct: 155 LEKDLILPYVPNVDLCDAKCSSESESKRKTLLFFRGRLKRNAGGKIRAKLMAELSGDDGV 214
Query: 276 DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPF 335
I G QR K+ FC+ P G +SAR+ D+I GC+PVI+S+ +LPF
Sbjct: 215 VIQEGTAGEG-GKEAAQRGMRKSIFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELELPF 273
Query: 336 NDILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSP 387
ILD+RK A+ ++ D Q L LK+IS A+ + NL K +HF ++SP
Sbjct: 274 EGILDYRKIALFVSSSDAMQPGWLLTFLKSISPAQIKEMQRNLAKYSRHFVYSSP 328
>gi|289166878|gb|ADC84489.1| glycosyltransferase family 47C [Salix sachalinensis]
Length = 252
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 66/246 (26%), Positives = 114/246 (46%), Gaps = 44/246 (17%)
Query: 161 IVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRVVC 212
++ V+ + + +P+WNR+ G+DH FV HD G RA E G+P +K R +
Sbjct: 5 LLSSAVNLISSNHPFWNRSRGSDHVFVASHDYGACFHAMEERAAEDGIPEFLK---RSII 61
Query: 213 SPSYDVAFI-PHKDVALPQVLQPFALP----------------------RGGRDVENRTV 249
++ V F P +DV V+ PF P RG +V + +
Sbjct: 62 LQTFGVKFDHPCQDVE-NVVIPPFITPESVQTTLEKYPLTGRRDIWVFFRGKMEVHPKNI 120
Query: 250 LGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQV 309
G + + K+R ++ R + D + +R YQ ++ FC+CP G
Sbjct: 121 SG---RYYSKKVRTVIWRKYSGDPRFYLRRHRFAG------YQSEIARSVFCLCPLGWAP 171
Query: 310 NSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFV 369
S R+ +SI GCVPVI+++ LPF + W ++ + E+DV L+ +L +++ +
Sbjct: 172 WSPRLVESIALGCVPVIIADGIRLPFPAAVRWSDISLTVAEKDVADLRTLLDHVAASNLS 231
Query: 370 SLHNNL 375
++ NL
Sbjct: 232 AIQKNL 237
>gi|269785149|ref|NP_001161530.1| exostosin 1-like protein [Saccoglossus kowalevskii]
gi|268054045|gb|ACY92509.1| exostosin 1-like protein [Saccoglossus kowalevskii]
Length = 737
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 92/353 (26%), Positives = 149/353 (42%), Gaps = 66/353 (18%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGY--FFQNIRESRFLTHDPDRAHLFFIPISC 145
+FK+Y+YP + G SE Y IRESRF T DP +A LF I
Sbjct: 92 DFKVYVYPTQE------------GNKVSEAYDKILSAIRESRFYTSDPKKACLFIPSIDT 139
Query: 146 HKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHF-FVTCHDVGVRATE-GVPFL 203
Y VK+ S I P WN G +H FV ++ + F
Sbjct: 140 LDRDHLSPDY------VKN-AQSKIQSLPLWNN--GQNHLIFVLYSGTWPEYSDLDLGFE 190
Query: 204 IKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGR-DVEN------RTVLGFWAGH 256
+ A+ S + + F P D+++P + A G R D++ R L + G
Sbjct: 191 LGQAMLAKASTT-SINFRPGFDISIPLFSKDHAQKGGSRGDLQTNNFPVARKYLLVFKGK 249
Query: 257 R-----NSKIR-----------VIL------ARVWEN--DTELDISNNRINRAIGPLVYQ 292
R S+ R +IL + WE D+ + N +R +
Sbjct: 250 RYLSGIGSETRNALYHIHNGQDIILLTTCKHGKSWEKNADSRCEQDNAEFDR----YDFH 305
Query: 293 RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERD 352
+ + FC+ P G ++ S R +S+ C+PV+LSN ++LPF++++DW + ++I +ER
Sbjct: 306 ILLHNSTFCLVPRGRRLGSFRFLESLQAACIPVLLSNGWELPFSEVIDWNRASIIGDERL 365
Query: 353 VYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+ Q+ I++ +S+ E +SL + Q F W + D M E+ R
Sbjct: 366 LLQIPSIVRTVSNDEILSL-----RQQTQFLWETYFSSVDKIVMTTLEIIQER 413
>gi|326499682|dbj|BAJ86152.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 304
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 79/265 (29%), Positives = 117/265 (44%), Gaps = 51/265 (19%)
Query: 171 NKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRV--------VCSP 214
+ +PYWNRT GADHFFV HD G +A E G+ L++ A V VC
Sbjct: 25 SHWPYWNRTAGADHFFVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQKDHVCLK 84
Query: 215 SYDVAFIPHKDVALPQVLQ----PFALPRG--------GRDVENRTVLGFWAGHRNSKIR 262
+ P+ A PQ ++ P PR D N G++A R
Sbjct: 85 EGSINIPPY---APPQKMKTHLVPPETPRSIFVYFRGLFYDTANDPEGGYYA-------R 134
Query: 263 VILARVWEN---DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIH 319
A VWEN + DIS + P Y + FC+CP G S R+ +++
Sbjct: 135 GARASVWENFKNNPLFDISTDH------PPTYYEDMQRAIFCLCPLGWAPWSPRLVEAVV 188
Query: 320 YGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLV--- 376
+GC+PVI+++ LPF D + W + V + E DV +L IL +I E + L+
Sbjct: 189 FGCIPVIIADDIVLPFADAIPWDEIGVFVAEDDVPKLDTILTSIP-MEVILRKQRLLANP 247
Query: 377 KVQKHFQWNSPPVRFDAFHMVMYEL 401
+++ + P DAFH ++ L
Sbjct: 248 SMKQAMLFPQPAQAGDAFHQILNGL 272
>gi|297807715|ref|XP_002871741.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297317578|gb|EFH48000.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 510
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 70/241 (29%), Positives = 109/241 (45%), Gaps = 35/241 (14%)
Query: 170 INKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAI----------------RVVCS 213
+ P W R+ G DH F H ++ V +KNAI +V
Sbjct: 223 VTDQPAWKRSEGRDHIFPIHHPWSFKS---VRKFVKNAIWLLPDMDSTGNWYKPGQVSLE 279
Query: 214 PSYDVAFIPHKDVALPQVLQPFALPRGGRDVENRTVLGFWAGH--RNS--KIRVILARVW 269
+ ++P+ D+ + L A RT L F+ G RN+ KIR L
Sbjct: 280 KDLILPYVPNVDICDAKCLSESA--------PMRTTLLFFRGRLKRNAGGKIRAKLGAEL 331
Query: 270 ENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSN 329
+ IS G L Q ++ FC+CP G +SAR+ D+I GC+PVI+S+
Sbjct: 332 SGVKGVIISEGTAGEG-GKLAAQGGMRRSLFCLCPAGDTPSSARLFDAIVSGCIPVIVSD 390
Query: 330 YYDLPFNDILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
+ PF ILD++K AV+++ DV Q L L++++ + L NL + +HF ++S
Sbjct: 391 ELEFPFEGILDYKKVAVLVSSNDVVQPGWLVNHLRSLTPFQIKELQKNLAQYSRHFLYSS 450
Query: 387 P 387
P
Sbjct: 451 P 451
>gi|357122399|ref|XP_003562903.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
[Brachypodium distachyon]
Length = 498
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 79/308 (25%), Positives = 144/308 (46%), Gaps = 40/308 (12%)
Query: 112 KYASEGYFFQNIRE----SRFLTH--DPDRAHLFFIP------ISCHKMRGKGTSYENMT 159
++++E + F+++R R + DP A LF++P + + +R + ++
Sbjct: 147 QHSAEWWLFKDLRRRGPRERPVARVDDPAEADLFYVPFFSSLSLVVNPIRPLAAANASVA 206
Query: 160 VIVKDYVDSL--------INKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVV 211
Y D + + PYW R G DH F+ C D A V I NA+ +V
Sbjct: 207 AAEPAYSDEAMQDELVEWLERQPYWRRHRGRDHVFI-CQDPN--ALYRVVDRISNAVLLV 263
Query: 212 CSPSY---DVAFIPHKDVALP--QVLQPFALPRGGRDVENRTVLGFWAGHR----NSKIR 262
D A + KDV LP + PF +G +E R L F+ G+R K+R
Sbjct: 264 SDFGRLRGDQASL-VKDVILPYSHRINPF---QGDVSIEARPALLFFMGNRYRKEGGKVR 319
Query: 263 VILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGC 322
L +V EN+ ++ I + +R + Q + +KFC+ P G ++ R+ D++ C
Sbjct: 320 DTLFQVLENEGDVIIKHGTQSRVSRRMATQG-MHSSKFCLHPAGDTPSACRLFDALVSLC 378
Query: 323 VPVILSNYYDLPFNDILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQ 379
VPVI+S++ +LPF D++D+ ++ ++ Q L +L+ +S + + +V+
Sbjct: 379 VPVIISDHIELPFEDVIDYSNISIFVDTSKAVQPGFLTSMLRRVSSERILEYQREIKRVK 438
Query: 380 KHFQWNSP 387
+F++ P
Sbjct: 439 HYFEYEDP 446
>gi|289166880|gb|ADC84490.1| glycosyltransferase family 47C [Salix miyabeana]
Length = 252
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 66/246 (26%), Positives = 113/246 (45%), Gaps = 44/246 (17%)
Query: 161 IVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRVVC 212
++ V+ + + +P+WNR+ G+DH FV HD G RA E G+P +K R +
Sbjct: 5 LLSSAVNLISSNHPFWNRSRGSDHVFVASHDYGACFHAMEERAAEDGIPEFLK---RSII 61
Query: 213 SPSYDVAFI-PHKDVALPQVLQPFALP----------------------RGGRDVENRTV 249
++ V F P +DV V+ PF P RG +V + +
Sbjct: 62 LQTFGVKFDHPCQDVE-NVVIPPFITPESVQTTLEKYPLTGRRDIWVFFRGKMEVHPKNI 120
Query: 250 LGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQV 309
G + + K+R ++ R + D + +R YQ ++ FC+CP G
Sbjct: 121 SG---RYYSKKVRTVIWRKYSGDPRFYLRRHRFAG------YQSEIARSVFCLCPLGWAP 171
Query: 310 NSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFV 369
S R+ +SI GCVPVI+++ LPF + W ++ + E+DV L +L +++ +
Sbjct: 172 WSPRLVESIALGCVPVIIADGIRLPFPAAVRWSDISLTVAEKDVADLGTLLDHVAASNLS 231
Query: 370 SLHNNL 375
++ NL
Sbjct: 232 AIQKNL 237
>gi|156362595|ref|XP_001625861.1| predicted protein [Nematostella vectensis]
gi|156212714|gb|EDO33761.1| predicted protein [Nematostella vectensis]
Length = 728
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 106/427 (24%), Positives = 174/427 (40%), Gaps = 91/427 (21%)
Query: 18 SLRSSLLTLALVTLLCFTCLSFNSFRSYPLQNNFNSTLSFAINKQDYSDLG-----SDVF 72
SL + + +A L+ F L+ N F+ + +N +++K + DL DVF
Sbjct: 8 SLLAVVALVACGLLVAFKILTDNEFKPPNIDDNTRILSYLSVSKAGFLDLNDELELQDVF 67
Query: 73 HSP--SVFHLNY---------------LEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYAS 115
+P S+ N + +++F+IY+YP KL+ Y +
Sbjct: 68 KTPKDSISSKNVPPQPNKCTMETCFDKTKCLSDFRIYVYP-------VHPGAKLSTTYTN 120
Query: 116 EGYFFQNIRESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINK--- 172
+ I+ESR+ T P+ A LF I + + DYV ++ NK
Sbjct: 121 ---IIKVIKESRYYTEFPEEACLFITAIDT----------LDRDKLSADYVHNIYNKIRQ 167
Query: 173 YPYWNRTLGADHF-FVTCHDVGVRATEGVPFLIKNAIRVVCS-------PSYDVAFI--- 221
PYW G +H F +E V F AI V S P +DV+
Sbjct: 168 LPYWKN--GENHIIFNLFAGTWPDYSEDVGFDFGKAILVKASLSSDLIRPGFDVSLPLFP 225
Query: 222 ---PHKDVA-LPQVLQPFALPR------------GGRDVENRTVLGFWAGHRNSKIRVIL 265
PHKD+ LP F L R G E+R L + H I ++
Sbjct: 226 KTHPHKDLGNLPHSCSAFPLERKYKLAFKGKRYLNGIGSESRNAL--YHIHNGRDIVLLT 283
Query: 266 ----ARVWEN--DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIH 319
+ W D D N +R Y FC+ P G ++ S R +S+
Sbjct: 284 TCKHGKAWHKHKDERCDGDNALYDR----YSYDELLLNATFCLVPRGRRLGSFRFLESLK 339
Query: 320 YGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQ 379
GC+P +LS+ ++LPF +++DW+K + +ER + Q+ I+++ S ++ +++ K Q
Sbjct: 340 VGCIPFLLSDGWELPFAEVIDWKKAVIDGSERLLMQVPGIVRSYSRSQVLAM-----KQQ 394
Query: 380 KHFQWNS 386
F WN+
Sbjct: 395 SLFLWNA 401
>gi|405975819|gb|EKC40363.1| Exostosin-1b [Crassostrea gigas]
Length = 714
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 87/341 (25%), Positives = 138/341 (40%), Gaps = 83/341 (24%)
Query: 86 VTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGY--FFQNIRESRFLTHDPDRAHLFFIPI 143
V FK+Y+YP + SE Y I+ES++ T +PD A +F + I
Sbjct: 84 VRGFKVYVYP--------------VLERVSESYSKIISTIQESKYYTTNPDEACIFILSI 129
Query: 144 SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFL 203
N K+ + + + WN D+ V D G
Sbjct: 130 DTLDRDI------NSVNSAKNIMPKIAHLGALWNNGTWPDYSEVLDFDTG---------- 173
Query: 204 IKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGR------DVEN----------- 246
AI S S + F P+ D++LP V P+GG + N
Sbjct: 174 --EAIMARASIS-ESRFRPNFDISLPLVGSTHP-PKGGERGYMYTSINNIPPLRHYLLGF 229
Query: 247 ---RTVLGFWAGHRNSKIR-------VIL-----ARVWE------NDTELDISNNRINRA 285
R + G + RNS V+L + W+ NDT DI N +R
Sbjct: 230 KGKRYLTGVGSETRNSLYHMHNGDDIVLLTTCRHGKFWQKKAKELNDTRCDIDNREFDR- 288
Query: 286 IGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFA 345
Y++ Y FC+ P G ++ S R +++ GC+PV+LSN ++LPF +++DW+K A
Sbjct: 289 ---YDYKKLLYNATFCLVPRGRRLGSFRFLETLQAGCIPVLLSNGWELPFGEVIDWKKAA 345
Query: 346 VILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
V +ER ++Q+ I+ +S E ++ Q F W +
Sbjct: 346 VWADERLLFQVPSIVHGLSQPEIFAMRQ-----QTQFLWEA 381
>gi|357475227|ref|XP_003607899.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
gi|355508954|gb|AES90096.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
Length = 481
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 74/279 (26%), Positives = 129/279 (46%), Gaps = 31/279 (11%)
Query: 132 DPDRAHLFFIP--------ISCHKMRGKGTSYENMTVIVKDYVDSL---INKYPYWNRTL 180
DP+ A LFF+P ++ + G G+ E ++ ++L + +W R+
Sbjct: 163 DPEEADLFFVPFFSSLSLIVNPVRPAGSGSVPEKTAYSDEENQEALMEWLEMQEFWKRSK 222
Query: 181 GADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFI-PHKDVALPQVLQPFA--- 236
G DH V A V +KN + +V D + P + + V+ P++
Sbjct: 223 GRDHVIVASDP---NAMYRVVDRVKNCVLLVS----DFGRLRPDQGSLVKDVIVPYSHRI 275
Query: 237 -LPRGGRDVENRTVLGFWAGHR----NSKIRVILARVWENDTELDISNNRINRAIGPLVY 291
GG V+ R L F+ G+R KIR L ++ E + ++ I + +R
Sbjct: 276 RTYDGGIGVDKRNTLLFFMGNRYRKEGGKIRDTLFQILEKEDDVIIKHGAQSRESRRAAS 335
Query: 292 QRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNER 351
Q + +KFC+ P G ++ R+ D+I CVPVI+S+ +LPF D +D+RK AV +
Sbjct: 336 QG-MHTSKFCLHPAGDTPSACRLFDAIVSLCVPVIVSDSIELPFEDTIDYRKIAVFVETA 394
Query: 352 DVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSP 387
Q L IL+ ++ V L +V+++F+++ P
Sbjct: 395 AAIQPGYLVSILRGMAPDRIVEYQKELKEVKRYFKYDEP 433
>gi|255565307|ref|XP_002523645.1| catalytic, putative [Ricinus communis]
gi|223537097|gb|EEF38731.1| catalytic, putative [Ricinus communis]
Length = 452
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 74/284 (26%), Positives = 136/284 (47%), Gaps = 44/284 (15%)
Query: 132 DPDRAHLFFIPI------SCHKMRGKGT--------SYENMTVIVKDYVDSLINKYPYWN 177
DPD A LF++P+ + +R GT S E M + ++++ + YW
Sbjct: 131 DPDEADLFYVPVFSSLSLIVNPVRPAGTEPGLVQHYSDEEMQEQLVEWLE----QQEYWK 186
Query: 178 RTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFI-PHKDVALPQVLQPFA 236
R G DH + A V +KNAI ++ D + P + + ++ P++
Sbjct: 187 RNNGRDHVIIAGDP---NALYRVLDRVKNAILLLS----DFGRVRPDQGSLVKDIIVPYS 239
Query: 237 ----LPRGGRDVENRTVLGFWAGHR----NSKIRVILARVWENDTELDI---SNNRINRA 285
+ G V +R L F+ G+R KIR +L ++ E++ ++ I + +R NR
Sbjct: 240 HRINVYNGDIGVRDRNTLLFFMGNRYRKDGGKIRDLLFQMLESEEDVVIKHGTQSRENRR 299
Query: 286 IGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFA 345
R + +KFC+ P G ++ R+ DSI CVPVI+S+ +LPF D++D+ K A
Sbjct: 300 AA----SRGMHTSKFCLNPAGDTPSACRLFDSIVSLCVPVIVSDSIELPFEDVIDYTKIA 355
Query: 346 VILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
+ + D + L ++L+ ++ + L KV ++F++++
Sbjct: 356 IFVETTDSLKPGYLVKLLREVTSERILEYQKELKKVTRYFEYDN 399
>gi|195611602|gb|ACG27631.1| exostosin-like [Zea mays]
Length = 456
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 80/307 (26%), Positives = 138/307 (44%), Gaps = 40/307 (13%)
Query: 112 KYASEGYFFQNIRESR------FLTHDPDRAHLFFIP------ISCHKMR--------GK 151
++++E + F+++ R DP A LF++P + + +R G
Sbjct: 106 QHSAEWWLFKDLLRRRPQDRPVARVDDPSDADLFYVPFFSSLSLVVNPIRSPLAANASGA 165
Query: 152 GTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVV 211
GT+Y + + + Y W R G DH F+ C D A V I NA+ +V
Sbjct: 166 GTAYSDEALQEELLEWLERQLY--WQRHRGRDHVFI-CQDPN--ALYRVVDRISNAVLLV 220
Query: 212 CS----PSYDVAFIPHKDVALPQVLQPFALPRGGRDVENRTVLGFWAGHR----NSKIRV 263
S + + KDV LP + + +G V+ R L F+ G+R K+R
Sbjct: 221 SDFGRLRSDQASLV--KDVILPYSHRINSF-KGEVGVDGRPSLLFFMGNRYRKEGGKVRD 277
Query: 264 ILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCV 323
L ++ EN+ ++ I + +R Q + +KFC+ P G ++ R+ D++ CV
Sbjct: 278 ALFQILENEDDVTIKHGTQSRESRRAATQG-MHSSKFCLHPAGDTPSACRLFDALVSLCV 336
Query: 324 PVILSNYYDLPFNDILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQK 380
PVI S+Y +LPF DI+D+ K ++ + Q L L+ IS + + KV+
Sbjct: 337 PVIASDYIELPFEDIIDYNKISIFVGTSKAVQPGYLTSTLRRISSERILEYQREIKKVRH 396
Query: 381 HFQWNSP 387
+F++ P
Sbjct: 397 YFEYEDP 403
>gi|297826983|ref|XP_002881374.1| hypothetical protein ARALYDRAFT_482474 [Arabidopsis lyrata subsp.
lyrata]
gi|297327213|gb|EFH57633.1| hypothetical protein ARALYDRAFT_482474 [Arabidopsis lyrata subsp.
lyrata]
Length = 446
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 76/282 (26%), Positives = 133/282 (47%), Gaps = 42/282 (14%)
Query: 132 DPDRAHLFFIPI------SCHKMR----GKGTSYENMTVIVKDYVDSLINKYPYWNRTLG 181
DPD A LF++P+ + R G G S E M + ++++ +W R G
Sbjct: 129 DPDDADLFYVPVFSSLSLIVNAGRPVEPGSGYSDEKMQEGLMEWLEG----QEWWRRNGG 184
Query: 182 ADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPH------KDVALPQVLQPF 235
DH A + +KN++ +V D + H KDV +P +
Sbjct: 185 RDHVIPAGDP---NALYRILDRVKNSVLLVA----DFGRLRHDQGSFVKDVVIPYSHR-V 236
Query: 236 ALPRGGRDVENRTVLGFWAGHR----NSKIRVILARVWENDTELDI---SNNRINRAIGP 288
L G V++R L F+ G+R K+R +L +V E + ++ I + +R NR
Sbjct: 237 NLFNGEIGVQDRNTLLFFMGNRYRKDGGKVRDLLFQVLEKEDDVTIKHGTQSRENRRAA- 295
Query: 289 LVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVIL 348
+ + +KFC+ P G ++ R+ DSI CVPVI+S+ +LPF D++D+RKF++ +
Sbjct: 296 ---TKGMHTSKFCLNPAGDTPSACRLFDSIVSLCVPVIVSDSIELPFEDVIDYRKFSIFV 352
Query: 349 NERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSP 387
Q L Q+L+ I + + + V+++F +++P
Sbjct: 353 EANAALQPGFLVQMLRKIKTKKILEYQREMQPVRRYFDYDNP 394
>gi|449520748|ref|XP_004167395.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
IRX7-like [Cucumis sativus]
Length = 517
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 71/235 (30%), Positives = 107/235 (45%), Gaps = 23/235 (9%)
Query: 170 INKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVA---FIP---- 222
+ P W R+ G DH H + +KNAI ++ P D + P
Sbjct: 234 VTDQPAWKRSEGRDHILPVHHPWSFKTVRK---FMKNAIWLL--PDMDSTGNWYKPGQVF 288
Query: 223 -HKDVALPQVLQPFALPRGGRDVEN--RTVLGFWAGH--RNS--KIRVILARVWENDTEL 275
KD+ LP V R + R++L F+ G RN+ KIR L ++
Sbjct: 289 LEKDLILPYVPNVELCDRKCLSYQQSKRSILLFFRGRLKRNAGGKIRAKLGGELSGADDV 348
Query: 276 DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPF 335
I G Q K+ FC+ P G +SAR+ D+I GC+PVI+S+ +LPF
Sbjct: 349 LIEEGTAGEG-GKAAAQTGMRKSIFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELELPF 407
Query: 336 NDILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSP 387
ILD+RK A+ ++ D + L L++ S A+ L NL K+ +HF ++SP
Sbjct: 408 EGILDYRKIALFVSSSDALKSGWLLTYLRSFSAADIRRLQQNLAKLSRHFIYSSP 462
>gi|47224077|emb|CAG12906.1| unnamed protein product [Tetraodon nigroviridis]
Length = 743
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 97/381 (25%), Positives = 159/381 (41%), Gaps = 73/381 (19%)
Query: 46 PLQNNFNSTLSFAINKQDYSDLGSDVFHS-----PSVFHLNYLEMVTNFKIYIYPDGDPN 100
PL+ N L I+ + D+ S ++ S F + L FK+Y+YP
Sbjct: 64 PLEEGQNVDLGEHISPKQKRDVHSGIYTGKRCRMESCFDFS-LCRRNGFKVYVYP----- 117
Query: 101 TFYQTPRKLTGKYASEGY--FFQNIRESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENM 158
+ G+ SE Y +I SRF T DP++A LF + + +
Sbjct: 118 -------QQKGEKMSESYQNILSSIEGSRFYTSDPEQACLFVLSLDTL----------DR 160
Query: 159 TVIVKDYVDSL---INKYPYWNRTLGADHFFVTCHD-VGVRATEGVPFLIKNAIRVVCSP 214
+ YV +L I P WN G +H + TE + F I A+ S
Sbjct: 161 DQLSPQYVHNLKGRIQSLPLWNG--GKNHIIFNLYSGTWPDYTEDLGFDIGFAMLAKASI 218
Query: 215 SYDVAFIPHKDVALPQVLQPFALPRGGRD-VENRTVLGF------WAGHR---------- 257
S + +F P+ DV++P + G R + + T+ F + G R
Sbjct: 219 STE-SFRPNFDVSIPLFSKDHPRTGGERGYLRHNTIPPFRKYILVFKGKRYLTGIGSDTR 277
Query: 258 -------NSKIRVIL-----ARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPG 305
NS+ V+L + W+ D +R N Y+ Y + FC+ P
Sbjct: 278 NALYHVHNSEDVVLLTTCKHGKDWQKHK--DARCDRDNMEYDKYDYKEMLYNSTFCLVPR 335
Query: 306 GSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISD 365
G ++ S R +++ CVPV+LSN ++LPF++I+DW AVI +ER + Q+ +++I
Sbjct: 336 GRRLGSFRFLEALQAACVPVMLSNGWELPFSEIIDWNTAAVIGDERLLLQIPSTVRSIHQ 395
Query: 366 AEFVSLHNNLVKVQKHFQWNS 386
+ +SL + Q F W +
Sbjct: 396 DQILSL-----RQQTQFLWEA 411
>gi|356544337|ref|XP_003540609.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase F8H-like
[Glycine max]
Length = 494
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 75/235 (31%), Positives = 105/235 (44%), Gaps = 23/235 (9%)
Query: 170 INKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVA---FIP---- 222
I P W R+ G DH H ++ V +KNAI ++ P D + P
Sbjct: 210 ITDQPAWKRSGGRDHILPVHHPWSFKS---VRRYVKNAIWLL--PDMDSTGNWYKPGQVY 264
Query: 223 -HKDVALPQVLQPFALPRGGRDVEN--RTVLGFWAGH--RNS--KIRVILARVWENDTEL 275
KD+ LP V N R+ L F+ G RN+ KIR L +
Sbjct: 265 LEKDLILPYVPNVDLCDAKCLSETNPKRSTLLFFRGRLKRNAGGKIRSKLGAELSGADGV 324
Query: 276 DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPF 335
I G QR K+ FC+ P G +SAR+ D+I GC+PVI+S+ +LPF
Sbjct: 325 VIEEGTAGEG-GKEAAQRGMRKSLFCLSPAGDTPSSARLFDAIVSGCIPVIISDELELPF 383
Query: 336 NDILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSP 387
ILD+RK AV ++ D + L + LK I A + NL K +HF ++SP
Sbjct: 384 EGILDYRKIAVFISSNDAVKPGWLLKYLKGIRPAHIKEMQQNLAKYSRHFLYSSP 438
>gi|326490259|dbj|BAJ84793.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 498
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 81/314 (25%), Positives = 146/314 (46%), Gaps = 50/314 (15%)
Query: 112 KYASEGYFFQNIRESR------FLTHDPDRAHLFFIP------ISCHKMR---------- 149
++++E + F+++R+ DP A LF++P + + +R
Sbjct: 145 QHSAEWWLFKDLRQRGPRDRPVARVDDPVEADLFYVPFFSSLSLVVNPIRPPAAANASEA 204
Query: 150 ----GKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIK 205
G S E M +D + + + YW R G DH F+ C D A V I
Sbjct: 205 AATVGPWYSDEAM----QDELVEWLERQSYWRRYRGRDHVFI-CQDPN--ALYRVVDRIS 257
Query: 206 NAIRVVCSPSY---DVAFIPHKDVALP--QVLQPFALPRGGRDVENRTVLGFWAGHR--- 257
NA+ +V D A + KDV LP + PF +G +V++R L F+ G+R
Sbjct: 258 NAVLLVSDFGRLRGDQASL-VKDVILPYSHRINPF---KGDVNVDSRPALLFFMGNRYRK 313
Query: 258 -NSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIAD 316
KIR L +V EN+ ++ I + +R + Q + +KFC+ P G ++ R+ D
Sbjct: 314 EGGKIRDTLFQVLENEGDVIIKHGAQSRVSRRMATQG-MHSSKFCLHPAGDTPSACRLFD 372
Query: 317 SIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHN 373
++ CVPVI+S++ +LPF D++D+ ++ ++ Q L +L+ +S +
Sbjct: 373 ALVSLCVPVIVSDHIELPFEDVIDYSNISIFVDTSKAIQPGFLTSMLRKVSSERILEYQR 432
Query: 374 NLVKVQKHFQWNSP 387
+ +V+ +F++ P
Sbjct: 433 EIQRVKHYFEYEDP 446
>gi|223975191|gb|ACN31783.1| unknown [Zea mays]
gi|414887161|tpg|DAA63175.1| TPA: exostosin-like protein [Zea mays]
Length = 500
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 80/307 (26%), Positives = 138/307 (44%), Gaps = 40/307 (13%)
Query: 112 KYASEGYFFQNIRESR------FLTHDPDRAHLFFIP------ISCHKMR--------GK 151
++++E + F+++ R DP A LF++P + + +R G
Sbjct: 150 QHSAEWWLFKDLLRRRPQDRPVARVDDPSDADLFYVPFFSSLSLVVNPIRSPLAANASGA 209
Query: 152 GTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVV 211
GT+Y + + + Y W R G DH F+ C D A V I NA+ +V
Sbjct: 210 GTAYSDEALQEELLEWLERQLY--WQRHRGRDHVFI-CQDPN--ALYRVVDRISNAVLLV 264
Query: 212 CS----PSYDVAFIPHKDVALPQVLQPFALPRGGRDVENRTVLGFWAGHR----NSKIRV 263
S + + KDV LP + + +G V+ R L F+ G+R K+R
Sbjct: 265 SDFGRLRSDQASLV--KDVILPYSHRINSF-KGEVGVDGRPSLLFFMGNRYRKEGGKVRD 321
Query: 264 ILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCV 323
L ++ EN+ ++ I + +R Q + +KFC+ P G ++ R+ D++ CV
Sbjct: 322 ALFQILENEDDVTIKHGTQSRESRREATQG-MHSSKFCLHPAGDTPSACRLFDALVSLCV 380
Query: 324 PVILSNYYDLPFNDILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQK 380
PVI S+Y +LPF DI+D+ K ++ + Q L L+ IS + + KV+
Sbjct: 381 PVIASDYIELPFEDIIDYNKISIFVGTSKAVQPGYLTSTLRRISSERILEYQREIKKVRH 440
Query: 381 HFQWNSP 387
+F++ P
Sbjct: 441 YFEYEDP 447
>gi|356538648|ref|XP_003537813.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase F8H-like
[Glycine max]
Length = 502
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 76/235 (32%), Positives = 107/235 (45%), Gaps = 23/235 (9%)
Query: 170 INKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVA---FIP---- 222
I P W R+ G DH H ++ V +KNAI ++ P D + P
Sbjct: 218 ITDQPAWKRSGGRDHILPVHHPWSFKS---VRRYVKNAIWLL--PDMDSTGNWYKPGQVY 272
Query: 223 -HKDVALPQVLQPFALPRGGRDVEN--RTVLGFWAGH--RNS--KIRVILARVWENDTEL 275
KD+ LP V N R+ L F+ G RN+ KIR L +
Sbjct: 273 LEKDLILPYVPNVDLCDAKCLSETNPKRSTLLFFRGRLKRNAGGKIRSKLGAELSGVDGV 332
Query: 276 DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPF 335
I G QR K+ FC+ P G +SAR+ D+I GC+PVI+S+ +LPF
Sbjct: 333 VIEEGTAGDG-GKEAAQRGMRKSLFCLSPAGDTPSSARLFDAIVSGCIPVIISDELELPF 391
Query: 336 NDILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSP 387
ILD+RK AV ++ D + L + LK I A ++ NLVK +HF ++SP
Sbjct: 392 EGILDYRKIAVFISSIDAVKPGWLLKYLKGIRPAHIKAMQQNLVKYSRHFLYSSP 446
>gi|356525152|ref|XP_003531191.1| PREDICTED: probable glycosyltransferase At3g07620-like isoform 1
[Glycine max]
Length = 427
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 75/289 (25%), Positives = 125/289 (43%), Gaps = 34/289 (11%)
Query: 132 DPDRAHLFFIPI--------SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGAD 183
DP+ A FF+P H M+ T + + D ++ L+ K YW R+ G D
Sbjct: 120 DPELAQAFFVPFFSSLSFNTHGHTMKDPATQIDRQLQV--DLME-LLKKSKYWQRSGGRD 176
Query: 184 HFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIP------HKDVALP--QVLQPF 235
H F H R G + +I+VV D P +KDV P V+ F
Sbjct: 177 HVFPMTHPNAFRFLRG---QLNESIQVVV----DFGRYPRGMSNLNKDVVSPYVHVVDSF 229
Query: 236 ALPRGGRDVENRTVLGFWAGHRNSK----IRVILARVWENDTELDISNNRINRAIGPLVY 291
E+R+ L F+ G K +RV LA++ ++ + +
Sbjct: 230 TDDEPQDPYESRSTLLFFRGRTYRKDEGIVRVKLAKILAGYDDVHYERS-VATEENIKAS 288
Query: 292 QRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNER 351
+ +KFC+ P G +S R+ D+I CVPVI+S+ +LPF D +D+ +F+V + +
Sbjct: 289 SKGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPFEDDIDYSQFSVFFSFK 348
Query: 352 DVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMV 397
+ Q + L+ ++ + L + H+++ PP R DA M+
Sbjct: 349 EALQPGYMIDQLRKFPKEKWTEMWRQLKSISHHYEFEYPPKREDAVDML 397
>gi|440797580|gb|ELR18663.1| exostosin, putative [Acanthamoeba castellanii str. Neff]
Length = 423
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 74/270 (27%), Positives = 116/270 (42%), Gaps = 27/270 (10%)
Query: 123 IRESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGA 182
+R S++ DP++A +P H + S E T +V + + P W R G
Sbjct: 102 LRASKYFVEDPEKA-CVLVPNFEHTL----ASNEMGTSLV---IAQTLRSLPTWERYGGP 153
Query: 183 --DHFFVTCHD-VGVRATEGVPFLIKNAIRVV-CSPSYDVAFIPH--------KDVALPQ 230
+H HD VGV + K P++D++ P KD A
Sbjct: 154 GFNHLLFNKHDDVGVEYDPAYAMVAKVGWSTGHYRPAFDISLNPPCGKGRPGLKDAAGHV 213
Query: 231 VLQPFALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIG--P 288
V +A P R EN+ L F RN +R +A + + I + IG P
Sbjct: 214 VPTWWANPD--RKRENKYFLTFLGTMRNYPLRRAIAERFHDPDNGVIIQTSVEEQIGGKP 271
Query: 289 LV---YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFA 345
V Y + T+F +CP G + S R ++I G +PVIL + Y P+N+++DWR FA
Sbjct: 272 SVEVEYLDTLFHTQFTLCPRGRALYSYRTTEAIAAGAIPVILGDGYAFPYNELIDWRSFA 331
Query: 346 VILNERDVYQLKQILKNISDAEFVSLHNNL 375
VIL E + +L++ + E + N+
Sbjct: 332 VILPESSWETMMDVLRSFTSEEIARMRRNM 361
>gi|356577103|ref|XP_003556667.1| PREDICTED: probable glucuronosyltransferase Os02g0520750-like
[Glycine max]
Length = 459
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 73/279 (26%), Positives = 126/279 (45%), Gaps = 31/279 (11%)
Query: 132 DPDRAHLFFIP--------ISCHKMRGKGTSYENMTVIVKDYVDSLIN---KYPYWNRTL 180
DP+ A LFF+P ++ + G + E ++ ++L+ K YW R
Sbjct: 140 DPEEADLFFVPFFSSLSLIVNPVRPPGSNSGLEKPVYSDEENQEALVEWLEKQEYWKRNN 199
Query: 181 GADHFFVTCHDVGVRATEGVPFLIKNAIRVVCS-----PSYDVAFIPHKDVALPQVLQPF 235
G DH V A V ++NA+ +V P KDV +P +
Sbjct: 200 GRDHVIVASDP---NAMYRVIDRVRNAVLLVSDFGRLRPDQGSLV---KDVVVPYSHRIR 253
Query: 236 ALPRGGRDVENRTVLGFWAGHR----NSKIRVILARVWENDTELDISNNRINRAIGPLVY 291
P G VE+R L F+ G+R KIR +L ++ EN+ ++ I + +R
Sbjct: 254 TYP-GDVGVEDRKTLLFFMGNRYRKEGGKIRDLLFQILENEKDVIIKHGAQSRE-SRRAA 311
Query: 292 QRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNER 351
+ +KFC+ P G ++ R+ D+I C+PVI+S+ +LPF D +D+RK AV +
Sbjct: 312 SHGMHTSKFCLHPAGDTPSACRLFDAIVSLCIPVIVSDNIELPFEDTIDYRKIAVFVETS 371
Query: 352 DVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSP 387
+ L L+ ++ + L +V+++F++ P
Sbjct: 372 SAIKPGHLLSKLRAVTPDRVLEYQKKLKEVKRYFEYEEP 410
>gi|357125540|ref|XP_003564451.1| PREDICTED: probable glucuronosyltransferase GUT1-like [Brachypodium
distachyon]
Length = 495
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 71/234 (30%), Positives = 107/234 (45%), Gaps = 21/234 (8%)
Query: 170 INKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVA---FIP---- 222
+ P W R+ G DH H ++ V +K AI ++ P D + P
Sbjct: 215 VTDQPAWQRSEGRDHIIPVHHPWSFKS---VRRFVKKAIWLL--PDMDSTGNWYKPGQVY 269
Query: 223 -HKDVALPQV--LQPFALPRGGRDVENRTVLGFWAGH--RNSKIRVILARVWE-NDTELD 276
KDV LP V + R++L F+ G RN+ +V V E D E
Sbjct: 270 LEKDVILPYVPNVDLCDYKCASETQSKRSMLLFFRGRLKRNAGGKVRSKLVTELKDAEDV 329
Query: 277 ISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFN 336
+ A G + Q K+ FC+ P G +SAR+ D+I GC+PVI+S+ +LPF
Sbjct: 330 VIEEGTAGAEGKVAAQNGMRKSLFCLNPAGDTPSSARLFDAIVSGCIPVIISDELELPFE 389
Query: 337 DILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSP 387
ILD+RK A+ ++ D Q L + L+ I + +NLVK +HF ++ P
Sbjct: 390 GILDYRKIALFVSSSDALQPGWLVKYLRGIDAKRVREMQSNLVKYSRHFIYSKP 443
>gi|30686300|ref|NP_850241.1| Exostosin family protein [Arabidopsis thaliana]
gi|50253448|gb|AAT71926.1| At2g35100 [Arabidopsis thaliana]
gi|53828615|gb|AAU94417.1| At2g35100 [Arabidopsis thaliana]
gi|110737872|dbj|BAF00874.1| hypothetical protein [Arabidopsis thaliana]
gi|330253970|gb|AEC09064.1| Exostosin family protein [Arabidopsis thaliana]
Length = 447
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 73/280 (26%), Positives = 131/280 (46%), Gaps = 38/280 (13%)
Query: 132 DPDRAHLFFIPI-SCHKM---------RGKGTSYENMTVIVKDYVDSLINKYPYWNRTLG 181
DP A LF++P+ S + G G S E M + ++++ +W R G
Sbjct: 130 DPADADLFYVPVFSSLSLIVNAGRPVEAGSGYSDEKMQEGLVEWLEG----QEWWRRNAG 185
Query: 182 ADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFA----L 237
DH A + +KNA+ +V P + + V+ P++ L
Sbjct: 186 RDHVIPAGDP---NALYRILDRVKNAVLLVSDFGR---LRPDQGSFVKDVVIPYSHRVNL 239
Query: 238 PRGGRDVENRTVLGFWAGHR----NSKIRVILARVWENDTELDI---SNNRINRAIGPLV 290
G VE+R L F+ G+R K+R +L +V E + ++ I + +R NR
Sbjct: 240 FNGEIGVEDRNTLLFFMGNRYRKDGGKVRDLLFQVLEKEDDVTIKHGTQSRENRRAA--- 296
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
+ + +KFC+ P G ++ R+ DSI CVP+I+S+ +LPF D++D+RKF++ +
Sbjct: 297 -TKGMHTSKFCLNPAGDTPSACRLFDSIVSLCVPLIVSDSIELPFEDVIDYRKFSIFVEA 355
Query: 351 RDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSP 387
Q L Q+L+ I + + + V+++F +++P
Sbjct: 356 NAALQPGFLVQMLRKIKTKKILEYQREMKSVRRYFDYDNP 395
>gi|125547211|gb|EAY93033.1| hypothetical protein OsI_14833 [Oryza sativa Indica Group]
Length = 429
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 80/343 (23%), Positives = 145/343 (42%), Gaps = 35/343 (10%)
Query: 88 NFKIYIYPDGDPNTFYQTPRK----------LTGKYASEGYFFQNIRESRFLTHDPDRAH 137
+ +IY+Y + + + R + G++ ++ Q + SRF T D D A
Sbjct: 54 DLRIYVYAEDEVDGLRALLRGRNDDVTAATCIKGQWGTQVKIHQLLLSSRFRTFDKDEAD 113
Query: 138 LFFIP--ISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVR 195
LFF+P + C +M GK E I + YV ++++ PY+ R+ G DH FV G
Sbjct: 114 LFFVPTYVKCVRMTGKLNDKE----INQTYVKVVLSQMPYFRRSGGRDHIFVFPSGAGAH 169
Query: 196 ATEGVPFLIKNAIRVVCSPSYD-------VAFIPHKDVALP-----QVLQPFALPRGGRD 243
+ +I + +P D AF KD+ +P +++ L
Sbjct: 170 LFRSWATFLNRSI--ILTPEGDRTDKRGISAFNTWKDIIIPGNVDDSMVKSDRLAVKPIP 227
Query: 244 VENRTVLGFWAGHRNSKIR----VILARVWENDTELDISNNRINRAIGPLVYQRRFYKTK 299
+ R L + G K+ V LA+ + + E +G + Y + K
Sbjct: 228 LTKRKYLANFLGRAQGKVGRLQLVKLAKQYPDKLESPELKLSGPDKLGRIDYFKHLRNAK 287
Query: 300 FCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVY-QLKQ 358
FC+ P G + R +S CVPVILS+ +LPF +++D+ + ++ + L +
Sbjct: 288 FCLAPRGESSWTLRFYESFFVECVPVILSDEVELPFQNVIDYTEVSIKWPASKIGPGLLE 347
Query: 359 ILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYEL 401
L++I D + +++ + + + R A +M+EL
Sbjct: 348 YLESIPDGRVEEMIGRGREIRCLWVYAADTERCSAMSAIMWEL 390
>gi|68052297|sp|Q5IGR6.1|EXT1C_DANRE RecName: Full=Exostosin-1c; AltName:
Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
acetylglucosaminyl-proteoglycan
4-alpha-N-acetylglucosaminyltransferase 1c; AltName:
Full=Multiple exostoses protein 1 homolog c
gi|56785795|gb|AAW29035.1| EXT1c [Danio rerio]
Length = 737
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 93/363 (25%), Positives = 143/363 (39%), Gaps = 61/363 (16%)
Query: 47 LQNNFNSTLSFAINKQDYSDLGSDV-FHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQT 105
LQN S KQ +S + D + F + + F++YIYP
Sbjct: 63 LQNGGGPGDSPRQRKQAWSSIYKDSRCRMDTCFDFGRCQTQSGFRVYIYP---------- 112
Query: 106 PRKLTGKYASEGY--FFQNIRESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVK 163
P K G+ SEGY ++ ESR+ T DP A LF + I + ++
Sbjct: 113 PEK--GERVSEGYRKILTSVSESRYYTSDPREACLFVLGIDT-------LDRDQLSQQFV 163
Query: 164 DYVDSLINKYPYWNRTLGADHFFVTCHD-VGVRATEGVPFLIKNAIRVVCSPSYDVAFIP 222
VD I YP WN G +H + TE + F + AI S + + F P
Sbjct: 164 PNVDERIRGYPLWND--GRNHVIFNLYSGTWPNYTEDLGFNVGQAILAKASLNTE-HFRP 220
Query: 223 HKDVALPQVLQPFALPRGGRD--VEN-----RTVLGFWAGHR-----------------N 258
D+++P + G R V N R L + G R N
Sbjct: 221 GFDISIPLFSKEHPQKGGKRGWLVRNSVPPRRKYLLMFKGKRYLTGIGSDTRNALHHIHN 280
Query: 259 SKIRVIL-----ARVWEN--DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNS 311
K V L + WE D D N R YQ + + FC+ P G ++ S
Sbjct: 281 GKDIVSLTTCRHGKDWEKHKDARCDHDNQEYER----FDYQELLHNSTFCLVPRGRRLGS 336
Query: 312 ARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSL 371
R +S+ C+PV+LSN ++LPF+D++ W + V +ER + Q+ ++ + ++L
Sbjct: 337 FRFLESLQAACIPVLLSNGWELPFSDVIQWNQAVVEGDERLLLQVPSTVRAVGIDRVLAL 396
Query: 372 HNN 374
Sbjct: 397 RQQ 399
>gi|449453962|ref|XP_004144725.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
IRX7-like [Cucumis sativus]
Length = 518
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 70/235 (29%), Positives = 106/235 (45%), Gaps = 23/235 (9%)
Query: 170 INKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVA---FIP---- 222
+ P W R+ G DH H + +KNAI ++ P D + P
Sbjct: 234 VTDQPAWKRSEGRDHILPVHHPWSFKTVRK---FMKNAIWLL--PDMDSTGNWYKPGQVF 288
Query: 223 -HKDVALPQVLQPFALPRGGRDVEN--RTVLGFWAGH--RNS--KIRVILARVWENDTEL 275
KD+ LP V + R++L F+ G RN+ KIR L ++
Sbjct: 289 LEKDLILPYVPNVELCDSKCLSYQQSKRSILLFFRGRLKRNAGGKIRAKLGGELSGADDV 348
Query: 276 DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPF 335
I G Q K+ FC+ P G +SAR+ D+I GC+PVI+S+ +LPF
Sbjct: 349 LIEEGTAGEG-GKAAAQTGMRKSIFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELELPF 407
Query: 336 NDILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSP 387
ILD+RK A+ ++ D + L L++ S A+ L NL K+ +HF ++SP
Sbjct: 408 EGILDYRKIALFVSSSDALKSGWLLTYLRSFSAADIRRLQQNLAKLSRHFIYSSP 462
>gi|22330483|ref|NP_176908.2| exostosin-like protein [Arabidopsis thaliana]
gi|115311405|gb|ABI93883.1| At1g67410 [Arabidopsis thaliana]
gi|332196520|gb|AEE34641.1| exostosin-like protein [Arabidopsis thaliana]
Length = 430
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 82/304 (26%), Positives = 138/304 (45%), Gaps = 38/304 (12%)
Query: 132 DPDRAHLFFIPI------SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHF 185
DPD A +F++P + H E ++ + ++ L N YWNR+ G DH
Sbjct: 122 DPDLADVFYVPFFSSLSFNTHGKNMTDPDTEFDRLLQVELMEFLENS-KYWNRSGGKDHV 180
Query: 186 FVTCHDVGVRATEGVPFL---IKNAIRVVCS---PSYDVAFIPHKDVALPQVLQPFALPR 239
H R FL + +I +V S D+A + KDV P V +L
Sbjct: 181 IPMTHPNAFR------FLRQQVNASILIVVDFGRYSKDMARL-SKDVVSPYVHVVESLNE 233
Query: 240 GGRD-----VENRTVLGFWAGHR----NSKIRVILARVWENDTELDISNNRINRAIGPLV 290
G D E RT L ++ G+ KIR+ L ++ ++++ + + V
Sbjct: 234 EGDDGMGDPFEARTTLLYFRGNTVRKDEGKIRLRLEKLLAGNSDVHFEKS-VATTQNIKV 292
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
+KFC+ P G +S R+ D+I C+PVI+S+ +LPF D +D+ +F++ +
Sbjct: 293 STEGMRSSKFCLHPAGDTPSSCRLFDAIVSHCIPVIISDKIELPFEDEIDYSEFSLFFSI 352
Query: 351 RDVYQLKQILKNISD---AEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLR-RH 406
++ + IL N+ +++ + L V HF++ PP R DA +M LW + +H
Sbjct: 353 KESLEPGYILNNLRQFPKEKWLEMWKRLKNVSHHFEFQYPPKREDAVNM----LWRQVKH 408
Query: 407 VIKY 410
I Y
Sbjct: 409 KIPY 412
>gi|302773760|ref|XP_002970297.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
gi|300161813|gb|EFJ28427.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
Length = 437
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 81/304 (26%), Positives = 135/304 (44%), Gaps = 50/304 (16%)
Query: 132 DPDRAHLFFIP--------ISCHKMRGKGTSYEN-MTVIVKDYVDSLINKYPYWNRTLGA 182
DP A +FF+P + MR T ++ + V + +Y ++K P++ R+ G
Sbjct: 128 DPGEADVFFVPYFASLSFNVFGVSMRDPETEHDKKLQVGMIEY----LSKSPWYQRSGGR 183
Query: 183 DHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIP------HKDVALP--QVLQP 234
DH V H R +K+ + + D P HKDV P ++
Sbjct: 184 DHVLVLHHPNAFR-------FLKDRLNLSLLVVADFGRFPKGVAALHKDVVAPYSHMVPT 236
Query: 235 FALPRGGRDVENRTVLGFWAGHRNSK----IRVILARVWENDT----ELDISNN-RINRA 285
+ G E RT L F+ G K +R LA + EN E I+ N + +A
Sbjct: 237 YNGDDGTDPFEERTTLLFFQGRVKRKDDGVVRTQLAAILENQPRVHFEEGIATNFTVEQA 296
Query: 286 IGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFA 345
+ + ++FC+ P G +S R+ D+I CVPVI+S+ +LPF D LD+ +F+
Sbjct: 297 MQGM------RSSRFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDKIELPFEDELDYSEFS 350
Query: 346 VILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELW 402
+ + + + L L+ S ++ + L +V +HF++ P R DA +M LW
Sbjct: 351 LFFSVDEAVRPGFLLGALEKFSKRRWMKMWRRLKQVTRHFEYQHPSQRDDAVNM----LW 406
Query: 403 LRRH 406
+ H
Sbjct: 407 SQIH 410
>gi|168051833|ref|XP_001778357.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162670236|gb|EDQ56808.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 406
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 78/311 (25%), Positives = 147/311 (47%), Gaps = 33/311 (10%)
Query: 113 YASEGYFFQNIRESRFLTHDPDRAHLFFIPISC--------HKMRGKGTSYE-NMTVIVK 163
Y +G+ ++ R + DP +A +FF+P + M G G + N+ V
Sbjct: 85 YLLDGWDRKDGRRAAIRVRDPYQADVFFVPFFASLSFNNYGYGMEGPGAELDKNLQECV- 143
Query: 164 DYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSP---SYDVAF 220
V+ L+N +W + G DH V H R L+ +++ +V S DVA
Sbjct: 144 --VNILLNS-KWWKASQGRDHVIVLHHPNAFRHYR---HLLNSSMLIVADFGRFSTDVAC 197
Query: 221 IPHKDVALP--QVLQPFALPRGGRDVENRTVLGFWAG--HRNSK--IRVILARVWENDTE 274
+ KD+ P V+Q + + R +L ++ G HR + +R LA+ N+ +
Sbjct: 198 L-QKDIVAPYEHVVQSY-VDDHSNSFSQRHILLYFQGRIHRKADGIVRAKLAKALMNEKD 255
Query: 275 LD-ISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDL 333
+ + + + A+ R ++FC+ P G +S R+ D+I CVPVI+S+ +L
Sbjct: 256 VHYMDSEASSEALAEATSGMR--SSRFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDRIEL 313
Query: 334 PFNDILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVR 390
PF D +D+ +F++ + + + L +IL+ I++ ++ + L V HF++ P +
Sbjct: 314 PFEDDIDYNEFSLFFSSEEAVRPQYLLRILRGINETKWTQMWTKLKAVSHHFEFQHPAKK 373
Query: 391 FDAFHMVMYEL 401
DA +M+ ++
Sbjct: 374 DDAVNMIFKQV 384
>gi|443712570|gb|ELU05824.1| hypothetical protein CAPTEDRAFT_92231 [Capitella teleta]
Length = 668
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 84/336 (25%), Positives = 147/336 (43%), Gaps = 71/336 (21%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
FK+Y+YP+ T K++ KY+ +R+SR+ T +P+ A LF I
Sbjct: 38 GFKVYVYPN--------TQDKMSSKYSE---ILTALRQSRYHTTNPEEACLFVPAIDTLD 86
Query: 148 MRGKGTSY-ENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCH----------DVGVRA 196
Y +NM +S I PYWN G +H + D+G
Sbjct: 87 RDKLSAEYIQNM--------ESKIQSLPYWND--GLNHIIFNLYPGTWPHYDETDLGFNT 136
Query: 197 TEGVPFLIKNAIRVVC-SPSYDVAF-IPHKDVA------------LPQVLQPFALPRGGR 242
G L K ++ + P++D++F + HK+ L L+ + L G+
Sbjct: 137 --GKAMLAKASVSDMWFRPNFDISFPLFHKEHKFKGGEPGFLTENLVPPLRSYTLSFKGK 194
Query: 243 DVENRTVLGFWAGHRNSKIR-------VILARVWENDTELDISNNRINRAIGPLVYQRRF 295
R + G + RNS V+L + D+ ++R R Y++
Sbjct: 195 ----RYLTGIGSETRNSLYHIHNDDDIVMLTTCKHGKSWKDMKDDRCERDNAE--YEKYD 248
Query: 296 YK-----TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
YK + FC+ P G ++ S R +++ C+PV LSN + LPF++++DW + A+ +E
Sbjct: 249 YKILLHNSTFCLVPRGRRLGSYRFLEALQAACIPVFLSNNWVLPFSEVIDWNQAAIWGDE 308
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
R + Q+ I+++I A+ ++L + Q F W +
Sbjct: 309 RLLLQIPSIVRSIRHADLLAL-----RQQTQFLWET 339
>gi|242059273|ref|XP_002458782.1| hypothetical protein SORBIDRAFT_03g040220 [Sorghum bicolor]
gi|241930757|gb|EES03902.1| hypothetical protein SORBIDRAFT_03g040220 [Sorghum bicolor]
Length = 426
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 77/308 (25%), Positives = 136/308 (44%), Gaps = 36/308 (11%)
Query: 87 TNFKIYIYPDGDPNTFYQTPRK----------LTGKYASEGYFFQNIRESRFLTHDPDRA 136
+ +IY+Y + + R L G++ ++ Q + +SRF T + D+A
Sbjct: 51 ADLRIYVYAEDEIQGLRALLRGRDGTITAATCLKGQWGTQVKIHQFLLKSRFRTFNKDQA 110
Query: 137 HLFFIP--ISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV 194
+LFF+P + C +M G + E I + YV ++++ PY+ R+ G DH FV G
Sbjct: 111 NLFFVPSYVKCVRMTGALSDKE----INQTYV-KVLSQMPYFRRSGGRDHIFVFPSGAGA 165
Query: 195 RATEGVPFLIKNAIRVVCSPSYD-------VAFIPHKDVALPQVLQPFALPRGGRDVE-- 245
+ +I + +P D AF KD+ +P + + R V+
Sbjct: 166 HLFRSWATFLNRSI--ILTPEGDRTDKRGTSAFNTWKDIIIPGNVDDSMVKSDARAVQPI 223
Query: 246 ---NRTVLGFWAGHRNSKIR----VILARVWENDTELDISNNRINRAIGPLVYQRRFYKT 298
R L + G K V LA+ + + E + +G + Y +
Sbjct: 224 PLTKRKYLANFLGRAQGKAGRLQLVELAKQYPDKLESPVLKLSGPNKLGRIEYFKHLRNA 283
Query: 299 KFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVIL-NERDVYQLK 357
KFC+ P G + R +S CVPVILS+ +LPF +++D+ + ++ + R +L
Sbjct: 284 KFCLAPRGESSWTLRFYESFFVECVPVILSDEVELPFQNVIDYGEISIKWPSSRIGPELL 343
Query: 358 QILKNISD 365
+ L++ISD
Sbjct: 344 EYLESISD 351
>gi|302793388|ref|XP_002978459.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
gi|300153808|gb|EFJ20445.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
Length = 437
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 81/304 (26%), Positives = 134/304 (44%), Gaps = 50/304 (16%)
Query: 132 DPDRAHLFFIP--------ISCHKMRGKGTSYEN-MTVIVKDYVDSLINKYPYWNRTLGA 182
DP A +FF+P + MR T ++ + V + +Y ++K P++ R+ G
Sbjct: 128 DPGEADVFFVPYFASLSFNVFGVSMRDPETEHDKKLQVGMIEY----LSKSPWYQRSGGR 183
Query: 183 DHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIP------HKDVALP--QVLQP 234
DH V H R +K+ + D P HKDV P ++
Sbjct: 184 DHVLVLHHPNAFR-------FLKDRLNSSLLVVADFGRFPKGVAALHKDVVAPYSHMVPT 236
Query: 235 FALPRGGRDVENRTVLGFWAGHRNSK----IRVILARVWENDT----ELDISNN-RINRA 285
+ G E RT L F+ G K +R LA + EN E I+ N + +A
Sbjct: 237 YNGDDGSDPFEERTTLLFFQGRVKRKDDGVVRTQLAAILENQPRVHFEEGIATNFTVEQA 296
Query: 286 IGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFA 345
+ + ++FC+ P G +S R+ D+I CVPVI+S+ +LPF D LD+ +F+
Sbjct: 297 MQGM------RSSRFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDKIELPFEDELDYSEFS 350
Query: 346 VILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELW 402
+ + + + L L+ S ++ + L +V +HF++ P R DA +M LW
Sbjct: 351 LFFSVDEAVRPGFLLGALEKFSKRRWMKMWRRLKQVTRHFEYQHPSQRDDAVNM----LW 406
Query: 403 LRRH 406
+ H
Sbjct: 407 SQIH 410
>gi|410905063|ref|XP_003966011.1| PREDICTED: exostosin-1b-like [Takifugu rubripes]
Length = 743
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 91/369 (24%), Positives = 149/369 (40%), Gaps = 68/369 (18%)
Query: 46 PLQNNFNSTLSFAINKQDYSDLGSDVFHS-----PSVFHLNYLEMVTNFKIYIYPDGDPN 100
PL+ + N I+ + D+ S ++ S F E FK+++YP
Sbjct: 64 PLEEDQNVDSDEHISLKQKRDVSSGIYTGKRCRMESCFDFTLCER-NGFKVFVYP----- 117
Query: 101 TFYQTPRKLTGKYASEGY--FFQNIRESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENM 158
+ G+ SE Y +I SRF T DP++A LF + + +
Sbjct: 118 -------QQKGEKMSESYQNILSSIEGSRFYTSDPEQACLFVLSLDTL----------DR 160
Query: 159 TVIVKDYVDSL---INKYPYWNRTLGADHFFVTCHD-VGVRATEGVPFLIKNAIRVVCSP 214
+ YV +L I P WN G +H + TE + F I A+ S
Sbjct: 161 DQLSPQYVHNLKGRIQSLPLWNE--GKNHIIFNLYSGTWPNYTEDLGFDIGYAMLAKASI 218
Query: 215 SYDVAFIPHKDVALPQVLQPFALPRGGRDV-----------------ENRTVLGFWAGHR 257
S + F P+ DV++P + G R R + G + R
Sbjct: 219 STE-NFRPNFDVSIPLFSKDHPRTGGERGYLRHNSIPPFRKYMLVFKGKRYLTGIGSDTR 277
Query: 258 N-------SKIRVIL-----ARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPG 305
N S+ V+L + W+ D +R N Y+ Y + FC+ P
Sbjct: 278 NALYHVHNSEDVVLLTTCKHGKDWQKHK--DARCDRDNMEYDKYDYREMLYNSTFCLVPR 335
Query: 306 GSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISD 365
G ++ S R +++ CVPV+LSN ++LPF++I+DW AVI +ER + Q+ +++I
Sbjct: 336 GRRLGSFRFLEALQAACVPVMLSNGWELPFSEIIDWNTAAVIGDERLLLQIPSTVRSIHQ 395
Query: 366 AEFVSLHNN 374
+ +SL
Sbjct: 396 DQILSLRQQ 404
>gi|34394612|dbj|BAC83914.1| limonene cyclase like protein [Oryza sativa Japonica Group]
gi|50508943|dbj|BAD31847.1| limonene cyclase like protein [Oryza sativa Japonica Group]
Length = 332
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 41/115 (35%), Positives = 68/115 (59%), Gaps = 2/115 (1%)
Query: 292 QRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNER 351
QRR K + G A I ++I+ CVPV++ + Y LPF D+L+W F+V +
Sbjct: 209 QRR-RDVKLELLELGLAKARATIREAIYLECVPVVIGDDYTLPFADVLNWAAFSVRVAVG 267
Query: 352 DVYQLKQILKNISDAEFVSLHNNLVKVQKHFQ-WNSPPVRFDAFHMVMYELWLRR 405
D+ +LK+IL +S +++ + + V++HF + P RFD FHM+++ +WLRR
Sbjct: 268 DIPRLKEILAAVSPRQYIRMQRRVRAVRRHFMVSDGAPRRFDVFHMILHSIWLRR 322
>gi|168060641|ref|XP_001782303.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666233|gb|EDQ52893.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 426
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 68/247 (27%), Positives = 116/247 (46%), Gaps = 26/247 (10%)
Query: 168 SLINKYPYWNRTLGADHFFVTCHDVGVRATE----GVPFLIKNAIRVVCSPSYDVAFIPH 223
S ++ P W + G++H V H + T V F++ + R +VA +
Sbjct: 156 SFLSSQPAWRASEGSNHVVVIHHPNAMLHTREKFRSVMFVVADFGRY----GAEVANMA- 210
Query: 224 KDVALPQVLQPFALPRGGRDVE------NRTVLGFWAGHRNSK----IRVILARVWENDT 273
KDV P +P DV+ +RT L F+ G K IR L + +
Sbjct: 211 KDVVAPY---KHVIPNFDEDVDAALSFKSRTTLLFFQGAIARKEGGIIRQQLYELLGEEP 267
Query: 274 ELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDL 333
+ SN + A G ++KFC+ G +S R+ D++ CVP+I+SN +L
Sbjct: 268 NIIFSNGTTSNA-GIRSATAGMRQSKFCLHLAGDTPSSNRLFDAVASHCVPLIISNEIEL 326
Query: 334 PFNDILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVR 390
PF D+L++ +F++ +N D + + +L N+ + E+ +H+ L +V++HFQ+ P
Sbjct: 327 PFEDVLNYSEFSLFVNSSDALRKGFVTDLLSNVGEKEWTRMHDRLRQVERHFQYQLPAQI 386
Query: 391 FDAFHMV 397
DA HM
Sbjct: 387 GDAVHMT 393
>gi|224144703|ref|XP_002325382.1| predicted protein [Populus trichocarpa]
gi|222862257|gb|EEE99763.1| predicted protein [Populus trichocarpa]
Length = 505
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 71/235 (30%), Positives = 106/235 (45%), Gaps = 23/235 (9%)
Query: 170 INKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVA---FIP---- 222
+ P W R+ G +H F H ++ V +KNAI ++ P D + P
Sbjct: 224 VTDQPAWKRSEGRNHIFPIHHPWSFKS---VRRYVKNAIWLL--PDMDSTGNWYKPGQVF 278
Query: 223 -HKDVALPQVLQPFALPRG--GRDVENRTVLGFWAGH--RNS--KIRVILARVWENDTEL 275
KD+ LP V R+ L ++ G RN+ KIR L +
Sbjct: 279 LEKDLILPYVPNVNLCDTKCISESESKRSTLLYFRGRLKRNAGGKIRAKLVAELSGAEGV 338
Query: 276 DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPF 335
I G Q K+ FC+ P G +SAR+ D+I GC+PV++S+ +LPF
Sbjct: 339 FIEEGTAGEG-GKAAAQIGMRKSIFCLSPAGDTPSSARLFDAIVSGCIPVVVSDELELPF 397
Query: 336 NDILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSP 387
ILD+RK A+ ++ D Q L + LK IS A+ + NL K +HF ++SP
Sbjct: 398 EGILDYRKIALFVSSSDAVQPGWLLKFLKGISLAQIRGMQRNLAKYSRHFIYSSP 452
>gi|307106650|gb|EFN54895.1| hypothetical protein CHLNCDRAFT_135016 [Chlorella variabilis]
Length = 584
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 82/331 (24%), Positives = 139/331 (41%), Gaps = 62/331 (18%)
Query: 107 RKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHKMRG---------KGTSYEN 157
+K Y + F + S T DP A+LF+IP+ + G + S N
Sbjct: 239 QKFDENYIAYQQFLEQFLLSPVRTEDPSEANLFYIPMLLYGYSGTPGGPSRAPQVDSLCN 298
Query: 158 M---TVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVV--- 211
M + +D + +K+PYWNRT G DHF+ D G +G L + AI+V
Sbjct: 299 MMPGQAHIDLVLDQIAHKWPYWNRTRGRDHFYWAPADRGACYHKG---LAEQAIKVSHFG 355
Query: 212 ------------------CSPSYDVAFIPHKDVALPQVLQPFA-----LPRGGRDV---- 244
SP + + P +DV P + A R G D
Sbjct: 356 LHATNNSIDLGDLYSHNQMSPDHGC-YHPLRDVVAPPFEKLAASWLNTTLRLGLDGNIKG 414
Query: 245 ENRT------VLG---FWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRF 295
+N T V G ++G K++ ++ + W+ D E R+ Y++R
Sbjct: 415 KNATFYFSGNVQGINLMYSGGTRQKLQALI-KQWD-DPEFGFVEGRLQEG----AYEQRI 468
Query: 296 YKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ 355
+++FC+ P G R+ I G +PVI+ + P D+L + F++ L D+ Q
Sbjct: 469 RESRFCLAPYGHGY-GMRLGQCIFAGSIPVIVQEHVFQPLEDVLPYEAFSIRLTNDDLPQ 527
Query: 356 LKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
L++IL+ I++A++ L L++ W++
Sbjct: 528 LREILRGITEAQYRELMTGLLRYSLALSWDT 558
>gi|21537341|gb|AAM61682.1| unknown [Arabidopsis thaliana]
Length = 462
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 84/343 (24%), Positives = 142/343 (41%), Gaps = 36/343 (10%)
Query: 88 NFKIYIYP----DGDPNTFY------QTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAH 137
+ KIY+Y DG Y +T L G++ S+ + + ES+F T D A
Sbjct: 88 SLKIYVYDENEIDGLKELLYGRDGSVKTTACLKGQWGSQVKIHKLLLESKFRTIKKDEAD 147
Query: 138 LFFIP--ISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVR 195
LFF+P + C +M G E VK ++++ PY+ R+ G DH FV G
Sbjct: 148 LFFVPAYVKCVRMLGGLNDKEINQTYVK-----VLSQMPYFRRSGGRDHIFVFPSGAGAH 202
Query: 196 ATEGVPFLIKNAIRVVCSPSYD-------VAFIPHKDVALPQVLQPFALPRGGRDVE--- 245
I +I + +P D AF KD+ +P + G DV+
Sbjct: 203 LFRSWSTFINRSI--ILTPEADRTDKKDTTAFNSWKDIIIPGNVDDAMTKNGQPDVQPLP 260
Query: 246 --NRTVLGFWAGHRNSK---IRVI-LARVWENDTELDISNNRINRAIGPLVYQRRFYKTK 299
R L + G K +++I L++ + + E G Y K
Sbjct: 261 LSKRKYLANYLGRAQGKAGRLKLIDLSKQFPDKLECPDLKFSGTEKFGRTTYFEHLRNAK 320
Query: 300 FCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVIL-NERDVYQLKQ 358
FC+ P G + R +S CVPV+LS++ +LPF +++D+ + ++ + R +
Sbjct: 321 FCLAPRGESSWTLRFYESFFVECVPVLLSDHAELPFQNVIDYAQVSIKWPSTRIGSEFLD 380
Query: 359 ILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYEL 401
L +ISD + + K++ F + A +++EL
Sbjct: 381 YLASISDRDIEGMIARGRKIRCLFVYGPDSAPCSAVKGILWEL 423
>gi|18394994|ref|NP_564141.1| exostosin-like protein [Arabidopsis thaliana]
gi|332191985|gb|AEE30106.1| exostosin-like protein [Arabidopsis thaliana]
Length = 462
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 84/343 (24%), Positives = 142/343 (41%), Gaps = 36/343 (10%)
Query: 88 NFKIYIYP----DGDPNTFY------QTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAH 137
+ KIY+Y DG Y +T L G++ S+ + + ES+F T D A
Sbjct: 88 SLKIYVYDENEIDGLKELLYGRDGSVKTTACLKGQWGSQVKIHKLLLESKFRTIKKDEAD 147
Query: 138 LFFIP--ISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVR 195
LFF+P + C +M G E VK ++++ PY+ R+ G DH FV G
Sbjct: 148 LFFVPAYVKCVRMLGGLNDKEINQTYVK-----VLSQMPYFRRSGGRDHIFVFPSGAGAH 202
Query: 196 ATEGVPFLIKNAIRVVCSPSYD-------VAFIPHKDVALPQVLQPFALPRGGRDVE--- 245
I +I + +P D AF KD+ +P + G DV+
Sbjct: 203 LFRSWSTFINRSI--ILTPEADRTDKKDTTAFNSWKDIIIPGNVDDAMTKNGQPDVQPLP 260
Query: 246 --NRTVLGFWAGHRNSK---IRVI-LARVWENDTELDISNNRINRAIGPLVYQRRFYKTK 299
R L + G K +++I L++ + + E G Y K
Sbjct: 261 LSKRKYLANYLGRAQGKAGRLKLIDLSKQFPDKLECPDLKFSGTEKFGRTTYFEHLRNAK 320
Query: 300 FCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVIL-NERDVYQLKQ 358
FC+ P G + R +S CVPV+LS++ +LPF +++D+ + ++ + R +
Sbjct: 321 FCLAPRGESSWTLRFYESFFVECVPVLLSDHAELPFQNVIDYAQVSIKWPSTRIGSEFLD 380
Query: 359 ILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYEL 401
L +ISD + + K++ F + A +++EL
Sbjct: 381 YLASISDRDIEGMIARGRKIRCLFVYGPDSAPCSAVKGILWEL 423
>gi|449681334|ref|XP_002163001.2| PREDICTED: exostosin-1a-like [Hydra magnipapillata]
Length = 469
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 36/92 (39%), Positives = 60/92 (65%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + FC+ P G ++ S R ++I YGC+PVI+SN +DLPFND++DW KF+++L+E
Sbjct: 184 YTELLANSTFCLIPRGRRLASFRFLEAIQYGCIPVIMSNGWDLPFNDVIDWVKFSIVLDE 243
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHF 382
+ QL IL+ IS + +++ + V K++
Sbjct: 244 SLLLQLPSILRGISFDQVLAMKQQTIFVWKNY 275
>gi|219363227|ref|NP_001136572.1| hypothetical protein precursor [Zea mays]
gi|194696226|gb|ACF82197.1| unknown [Zea mays]
gi|414866559|tpg|DAA45116.1| TPA: hypothetical protein ZEAMMB73_757616 [Zea mays]
Length = 453
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 73/243 (30%), Positives = 116/243 (47%), Gaps = 33/243 (13%)
Query: 174 PYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCS-----PSYDVAFIPHKDVAL 228
P W RT G DH + H G+ + + V+C PS VA + KD+
Sbjct: 181 PEWRRTGGRDHVVLAHHPNGMLDAR---YRFWPCVFVLCDFGRYPPS--VANL-DKDIIA 234
Query: 229 P--QVLQPFALPRGGRDVENRTVLGFWAG----HRNSKIRVILARVWENDTEL-----DI 277
P ++ FA G D +R L ++ G IR L + +++ ++ +
Sbjct: 235 PYRHLVANFANDTAGYD--DRPTLLYFQGAIYRKDGGSIRQELYYLLKDEKDVHFSFGSV 292
Query: 278 SNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFND 337
+ N I +A + +KFC+ G +S R+ DSI CVPVI+S+ +LPF D
Sbjct: 293 AGNGIEQAT------QGMRSSKFCLNIAGDTPSSNRLFDSIVSHCVPVIISDEIELPFED 346
Query: 338 ILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAF 394
+LD+ KF+VI+ D + LK ++K IS E+ + N L +V+KHF++ P DA
Sbjct: 347 VLDYSKFSVIVRGADAVKKGFLKSLIKGISQEEWTRMWNKLKEVEKHFEYQYPSQTDDAV 406
Query: 395 HMV 397
M+
Sbjct: 407 QMI 409
>gi|18650643|gb|AAL75891.1| At1g21480/F24J8_23 [Arabidopsis thaliana]
Length = 462
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 84/343 (24%), Positives = 142/343 (41%), Gaps = 36/343 (10%)
Query: 88 NFKIYIYP----DGDPNTFY------QTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAH 137
+ KIY+Y DG Y +T L G++ S+ + + ES+F T D A
Sbjct: 88 SLKIYVYDENEIDGLKELLYGRDGSVKTTACLKGQWGSQVKIHKLLLESKFRTIKKDEAD 147
Query: 138 LFFIP--ISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVR 195
LFF+P + C +M G E VK ++++ PY+ R+ G DH FV G
Sbjct: 148 LFFVPAYVKCVRMLGGLNDKEINQTYVK-----VLSQMPYFRRSGGRDHIFVFPSGAGAH 202
Query: 196 ATEGVPFLIKNAIRVVCSPSYD-------VAFIPHKDVALPQVLQPFALPRGGRDVE--- 245
I +I + +P D AF KD+ +P + G DV+
Sbjct: 203 LFRSWSTFINRSI--ILTPEADRTDKKDTTAFNSWKDIIIPGNVDDAMTKNGQPDVQPLP 260
Query: 246 --NRTVLGFWAGHRNSK---IRVI-LARVWENDTELDISNNRINRAIGPLVYQRRFYKTK 299
R L + G K +++I L++ + + E G Y K
Sbjct: 261 LSKRKYLANYLGRAQGKAGRLKLIDLSKQFPDKLECPDLKFSGTEKFGRTTYFEHLRNAK 320
Query: 300 FCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVIL-NERDVYQLKQ 358
FC+ P G + R +S CVPV+LS++ +LPF +++D+ + ++ + R +
Sbjct: 321 FCLAPRGESSWTLRFYESFFVECVPVLLSDHAELPFQNVIDYAQVSIKWPSTRIGSEFLD 380
Query: 359 ILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYEL 401
L +ISD + + K++ F + A +++EL
Sbjct: 381 YLASISDRDIEGMIARGRKIRCLFVYGPDSAPCSAVKGILWEL 423
>gi|55297487|dbj|BAD68203.1| exostosin family protein-like [Oryza sativa Japonica Group]
gi|55297674|dbj|BAD68245.1| exostosin family protein-like [Oryza sativa Japonica Group]
Length = 512
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 71/234 (30%), Positives = 106/234 (45%), Gaps = 21/234 (8%)
Query: 170 INKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVA---FIP---- 222
+ P W R+ G DH H ++ V +K AI ++ P D + P
Sbjct: 219 VTDQPAWQRSEGRDHVIPVHHPWSFKS---VRRFVKKAIWLL--PDMDSTGNWYKPGQVY 273
Query: 223 -HKDVALPQVLQPFALPRG--GRDVENRTVLGFWAGH--RNSKIRVILARVWE-NDTELD 276
KDV LP V R+ L F+ G RN+ ++ V E D E
Sbjct: 274 LEKDVILPYVPNVDLCDSKCVSETQSRRSTLLFFRGRLRRNAGGKIRSKLVTELKDAEGI 333
Query: 277 ISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFN 336
I A G Q K+ FC+ P G +SAR+ D+I GC+PVI+S+ +LPF
Sbjct: 334 IIEEGTAGADGKAAAQNGMRKSLFCLNPAGDTPSSARLFDAIVSGCIPVIVSDELELPFE 393
Query: 337 DILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSP 387
ILD+RK A+ ++ D Q L + L++I + +NL+K +HF ++SP
Sbjct: 394 GILDYRKIALFVSSNDAVQPGWLVKYLRSIDAKRIRQMQSNLLKYSRHFLYSSP 447
>gi|226496701|ref|NP_001141657.1| uncharacterized protein LOC100273782 [Zea mays]
gi|194705444|gb|ACF86806.1| unknown [Zea mays]
Length = 497
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 71/235 (30%), Positives = 109/235 (46%), Gaps = 23/235 (9%)
Query: 170 INKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVA---FIP---- 222
+ P W R+ G DH H ++ V +K AI ++ P D + P
Sbjct: 217 VTDQPAWQRSEGRDHVIPVHHPWSFKS---VRRSVKKAIWLL--PDMDSTGNWYKPGQVY 271
Query: 223 -HKDVALPQVLQPFALPRGG--RDVENRTVLGFWAGH--RNS--KIRVILARVWENDTEL 275
KDV LP V R++L F+ G RN+ KIR L ++ ++
Sbjct: 272 LEKDVILPYVPNVDLCDHKCVLETQSKRSILLFFRGRLKRNAGGKIRSKLVEELKSAKDI 331
Query: 276 DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPF 335
I A G Q K+ FC+ P G +SAR+ D+I GC+PVI+S+ +LPF
Sbjct: 332 VIEEGSTG-AQGKAAAQDGMRKSFFCLSPAGDTPSSARLFDAIVTGCIPVIISDELELPF 390
Query: 336 NDILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSP 387
ILD+R+ A+ ++ D Q L + L+ I+ + +NLVK +HF ++SP
Sbjct: 391 EGILDYREIALFVSASDAVQPGWLLKYLRGINAKRIREIQSNLVKYSRHFLYSSP 445
>gi|297597828|ref|NP_001044591.2| Os01g0811400 [Oryza sativa Japonica Group]
gi|255673802|dbj|BAF06505.2| Os01g0811400 [Oryza sativa Japonica Group]
Length = 497
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 71/234 (30%), Positives = 106/234 (45%), Gaps = 21/234 (8%)
Query: 170 INKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVA---FIP---- 222
+ P W R+ G DH H ++ V +K AI ++ P D + P
Sbjct: 219 VTDQPAWQRSEGRDHVIPVHHPWSFKS---VRRFVKKAIWLL--PDMDSTGNWYKPGQVY 273
Query: 223 -HKDVALPQVLQPFALPRG--GRDVENRTVLGFWAGH--RNSKIRVILARVWE-NDTELD 276
KDV LP V R+ L F+ G RN+ ++ V E D E
Sbjct: 274 LEKDVILPYVPNVDLCDSKCVSETQSRRSTLLFFRGRLRRNAGGKIRSKLVTELKDAEGI 333
Query: 277 ISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFN 336
I A G Q K+ FC+ P G +SAR+ D+I GC+PVI+S+ +LPF
Sbjct: 334 IIEEGTAGADGKAAAQNGMRKSLFCLNPAGDTPSSARLFDAIVSGCIPVIVSDELELPFE 393
Query: 337 DILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSP 387
ILD+RK A+ ++ D Q L + L++I + +NL+K +HF ++SP
Sbjct: 394 GILDYRKIALFVSSNDAVQPGWLVKYLRSIDAKRIRQMQSNLLKYSRHFLYSSP 447
>gi|414880038|tpg|DAA57169.1| TPA: hypothetical protein ZEAMMB73_490377 [Zea mays]
Length = 497
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 71/235 (30%), Positives = 109/235 (46%), Gaps = 23/235 (9%)
Query: 170 INKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVA---FIP---- 222
+ P W R+ G DH H ++ V +K AI ++ P D + P
Sbjct: 217 VTDQPAWQRSEGRDHVIPVHHPWSFKS---VRRSVKKAIWLL--PDMDSTGNWYKPGQVY 271
Query: 223 -HKDVALPQVLQPFALPRGG--RDVENRTVLGFWAGH--RNS--KIRVILARVWENDTEL 275
KDV LP V R++L F+ G RN+ KIR L ++ ++
Sbjct: 272 LEKDVILPYVPNVDLCDHKCVLETQSKRSILLFFRGRLKRNAGGKIRSKLVEELKSAKDI 331
Query: 276 DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPF 335
I A G Q K+ FC+ P G +SAR+ D+I GC+PVI+S+ +LPF
Sbjct: 332 VIEEGSTG-AQGKAAAQDGMRKSFFCLSPAGDTPSSARLFDAIVSGCIPVIISDELELPF 390
Query: 336 NDILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSP 387
ILD+R+ A+ ++ D Q L + L+ I+ + +NLVK +HF ++SP
Sbjct: 391 EGILDYREIALFVSASDAVQPGWLLKYLRGINAKRIREIQSNLVKYSRHFLYSSP 445
>gi|242018853|ref|XP_002429885.1| Exostosin-1, putative [Pediculus humanus corporis]
gi|212514919|gb|EEB17147.1| Exostosin-1, putative [Pediculus humanus corporis]
Length = 725
Score = 84.7 bits (208), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 110/409 (26%), Positives = 169/409 (41%), Gaps = 90/409 (22%)
Query: 28 LVTLLCFTCLSFNSFRSYPLQN---NFNSTLSFA-INKQDY-SDLGSDVFHSPSVFH--- 79
L+ LL L++ F Y L++ N S SF IN Q Y DL +V+ ++
Sbjct: 8 LLVLLSCAFLAYCYFGGYRLKSKSENLESLKSFTDINDQVYDKDLSGNVYQRRKLWPNDI 67
Query: 80 --------------LNYLEMVTN-FKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIR 124
+Y + N FKIY+YP D + P L AS I
Sbjct: 68 DGKTIKEKCRMETCFDYSKCEKNGFKIYVYPQTDESGI---PLPLVPS-ASYQKIINVII 123
Query: 125 ESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSL---INKYPYWNRTLG 181
ESR+ T DP +A LF + I + + DYV +L I K PYWN+ G
Sbjct: 124 ESRYYTSDPSQACLFVLAIDTL----------DRDSLSNDYVRNLPLRIQKLPYWNK--G 171
Query: 182 ADHFFVTCHDVGV--RATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALP---------- 229
+H + A E + F +I S S V F P+ DV++P
Sbjct: 172 RNHVIFNLYSGTWPDYAEENLGFDYGFSILAKASMS-TVHFRPNFDVSIPLFHKNHREKG 230
Query: 230 -----------QVLQPFALPRGGRDVENRTVLGFWAGHRNS-------KIRVIL-----A 266
V++ + L G+ R V G + RNS K V++
Sbjct: 231 GESGYVSSNNFPVIKKYVLAFKGK----RYVHGIGSETRNSLYHLHNEKDMVLVTTCKHG 286
Query: 267 RVWEN--DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVP 324
+ W+ D D N +R Y+ + FC+ P G ++ S R +++ GC+P
Sbjct: 287 KSWKEMKDERCDEDNQEYDR----YDYEVLLQNSTFCLVPRGRRLGSFRFLEALQAGCIP 342
Query: 325 VILSNYYDLPFNDILDWRKFAVILNERDVYQ--LKQILKNISDAEFVSL 371
V+LSN + LPF +++DW K AV +ER + Q + I+++IS + +
Sbjct: 343 VLLSNGWVLPFQEVIDWTKAAVWADERLLLQAIVPDIVRSISATKIFEM 391
>gi|297850568|ref|XP_002893165.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297339007|gb|EFH69424.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 462
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 83/343 (24%), Positives = 142/343 (41%), Gaps = 36/343 (10%)
Query: 88 NFKIYIYP----DGDPNTFY------QTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAH 137
+ KIY+Y DG Y +T L G++ S+ + + ES+F T D A
Sbjct: 88 SLKIYVYDENEIDGLKELLYGRDGSVKTTACLKGQWGSQVKIHKLLLESKFRTIKKDEAD 147
Query: 138 LFFIP--ISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVR 195
LFF+P + C +M G E VK ++++ PY+ R+ G DH FV G
Sbjct: 148 LFFVPAYVKCVRMLGGLNDKEINQTYVK-----VLSQMPYFRRSGGRDHIFVFPSGAGAH 202
Query: 196 ATEGVPFLIKNAIRVVCSPSYD-------VAFIPHKDVALPQVLQPFALPRGGRDVE--- 245
I +I + +P D AF KD+ +P + G DV+
Sbjct: 203 LFRSWSTFINRSI--ILTPEADRTDKKDTTAFNTWKDIIIPGNVDDAMTKNGQPDVQPLP 260
Query: 246 --NRTVLGFWAGHRNSK---IRVI-LARVWENDTELDISNNRINRAIGPLVYQRRFYKTK 299
R L + G K +++I L++ + + E G Y K
Sbjct: 261 LSKRKYLANYLGRAQGKAGRLKLIDLSKQYPDKLECPDLKFSGTEKFGRTTYFEHLRNAK 320
Query: 300 FCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVIL-NERDVYQLKQ 358
FC+ P G + R +S CVPV+LS++ +LPF +++D+ + ++ + R +
Sbjct: 321 FCLAPRGESSWTLRFYESFFVECVPVLLSDHAELPFQNVIDYAQVSIKWPSTRIGAEFLD 380
Query: 359 ILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYEL 401
L +ISD + + +++ F + A +++EL
Sbjct: 381 YLASISDKDIEGMIARGREIRCLFVYGPDSAPCSAVKGILWEL 423
>gi|125528116|gb|EAY76230.1| hypothetical protein OsI_04166 [Oryza sativa Indica Group]
Length = 513
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 71/234 (30%), Positives = 106/234 (45%), Gaps = 21/234 (8%)
Query: 170 INKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVA---FIP---- 222
+ P W R+ G DH H ++ V +K AI ++ P D + P
Sbjct: 220 VTDQPAWQRSEGRDHVIPVHHPWSFKS---VRRFVKKAIWLL--PDMDSTGNWYKPGQVY 274
Query: 223 -HKDVALPQVLQPFALPRG--GRDVENRTVLGFWAGH--RNSKIRVILARVWE-NDTELD 276
KDV LP V R+ L F+ G RN+ ++ V E D E
Sbjct: 275 LEKDVILPYVPNVDLCDSKCVSETQSRRSTLLFFRGRLRRNAGGKIRSKLVTELKDAEGI 334
Query: 277 ISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFN 336
I A G Q K+ FC+ P G +SAR+ D+I GC+PVI+S+ +LPF
Sbjct: 335 IIEEGTAGADGKAAAQNGMRKSLFCLNPAGDTPSSARLFDAIVSGCIPVIVSDELELPFE 394
Query: 337 DILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSP 387
ILD+RK A+ ++ D Q L + L++I + +NL+K +HF ++SP
Sbjct: 395 GILDYRKIALFVSSNDAVQPGWLVKYLRSIDAKRIRQMQSNLLKYSRHFLYSSP 448
>gi|94536924|ref|NP_001035420.1| exostosin-1c [Danio rerio]
gi|92098257|gb|AAI15217.1| Exostoses (multiple) 1c [Danio rerio]
Length = 737
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 92/363 (25%), Positives = 142/363 (39%), Gaps = 61/363 (16%)
Query: 47 LQNNFNSTLSFAINKQDYSDLGSDV-FHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQT 105
LQN S KQ +S + D + F + + F++YIYP
Sbjct: 63 LQNGGGPGDSPRQRKQAWSSIYKDSRCRMDTCFDFGRCQTQSGFRVYIYP---------- 112
Query: 106 PRKLTGKYASEGY--FFQNIRESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVK 163
P K G+ SE Y ++ ESR+ T DP A LF + I + ++
Sbjct: 113 PEK--GERVSESYRKILTSVSESRYYTSDPREACLFVLGIDT-------LDRDQLSQQFV 163
Query: 164 DYVDSLINKYPYWNRTLGADHFFVTCHD-VGVRATEGVPFLIKNAIRVVCSPSYDVAFIP 222
VD I YP WN G +H + TE + F + AI S + + F P
Sbjct: 164 PNVDERIRGYPLWND--GRNHVIFNLYSGTWPNYTEDLGFNVGQAILAKASLNTE-HFRP 220
Query: 223 HKDVALPQVLQPFALPRGGRD--VEN-----RTVLGFWAGHR-----------------N 258
D+++P + G R V N R L + G R N
Sbjct: 221 GFDISIPLFSKEHPQKGGKRGWLVRNSVPPRRKYLLMFKGKRYLTGIGSDTRNALHHIHN 280
Query: 259 SKIRVIL-----ARVWEN--DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNS 311
K V L + WE D D N R YQ + + FC+ P G ++ S
Sbjct: 281 GKDIVSLTTCRHGKDWEKHKDARCDHDNQEYER----FDYQELLHNSTFCLVPRGRRLGS 336
Query: 312 ARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSL 371
R +S+ C+PV+LSN ++LPF+D++ W + V +ER + Q+ ++ + ++L
Sbjct: 337 FRFLESLQAACIPVLLSNGWELPFSDVIQWNQAVVEGDERLLLQVPSTVRAVGMDRVLAL 396
Query: 372 HNN 374
Sbjct: 397 RQQ 399
>gi|242055419|ref|XP_002456855.1| hypothetical protein SORBIDRAFT_03g044100 [Sorghum bicolor]
gi|241928830|gb|EES01975.1| hypothetical protein SORBIDRAFT_03g044100 [Sorghum bicolor]
Length = 432
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 79/296 (26%), Positives = 131/296 (44%), Gaps = 29/296 (9%)
Query: 132 DPDRAHLFFIPI------SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHF 185
DPD A FF+P + H E ++ + VD ++ K YW R+ G DH
Sbjct: 127 DPDAADAFFVPFFSSLSFNVHGRNMTDPDTEADRLLQVEIVD-ILWKSKYWQRSAGRDHV 185
Query: 186 FVTCHDVGVRATEGVPFLIKNAIRVVCS---PSYDVAFIPHKDVALP--QVLQPFALPRG 240
H R ++ +I +V + ++A + KDV P V+ F
Sbjct: 186 IPMHHPNAFRFLRA---MVNASILIVSDFGRYTKELASL-RKDVVAPYVHVVDSFLDDDP 241
Query: 241 GRDVENRTVLGFWAGHR----NSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFY 296
E R L F+ G KIR L +V + + ++ I G +
Sbjct: 242 PDPFEARHTLLFFRGRTVRKDEGKIRAKLGKVLKGKEGVRFEDS-IATGDGIKISTEGMR 300
Query: 297 KTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDV--- 353
+KFC+ P G +S R+ D+I CVPVI+S+ +LPF D +D+ +F++ + +
Sbjct: 301 SSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSSRIELPFEDEIDYSEFSLFFSVEEALRP 360
Query: 354 -YQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVI 408
Y L Q L+ I ++V + + L V H+++ PP + DA +M+ ++ RH I
Sbjct: 361 DYLLNQ-LRQIPKKKWVDMWSKLKNVSHHYEFQYPPRKGDAVNMIWRQV---RHKI 412
>gi|223943427|gb|ACN25797.1| unknown [Zea mays]
Length = 380
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 71/235 (30%), Positives = 109/235 (46%), Gaps = 23/235 (9%)
Query: 170 INKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVA---FIP---- 222
+ P W R+ G DH H ++ V +K AI ++ P D + P
Sbjct: 100 VTDQPAWQRSEGRDHVIPVHHPWSFKS---VRRSVKKAIWLL--PDMDSTGNWYKPGQVY 154
Query: 223 -HKDVALPQVLQPFALPRGG--RDVENRTVLGFWAGH--RNS--KIRVILARVWENDTEL 275
KDV LP V R++L F+ G RN+ KIR L ++ ++
Sbjct: 155 LEKDVILPYVPNVDLCDHKCVLETQSKRSILLFFRGRLKRNAGGKIRSKLVEELKSAKDI 214
Query: 276 DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPF 335
I A G Q K+ FC+ P G +SAR+ D+I GC+PVI+S+ +LPF
Sbjct: 215 VIEEGSTG-AQGKAAAQDGMRKSFFCLSPAGDTPSSARLFDAIVSGCIPVIISDELELPF 273
Query: 336 NDILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSP 387
ILD+R+ A+ ++ D Q L + L+ I+ + +NLVK +HF ++SP
Sbjct: 274 EGILDYREIALFVSASDAVQPGWLLKYLRGINAKRIREIQSNLVKYSRHFLYSSP 328
>gi|168059257|ref|XP_001781620.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666934|gb|EDQ53576.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 460
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 77/324 (23%), Positives = 147/324 (45%), Gaps = 41/324 (12%)
Query: 113 YASEGYFFQNIRESRFLTHDPDRAHLFFIPI--------SCHKMRGKGTSYENMTVI--- 161
Y +G+ ++ + + DP++A +FF+P H M + + + ++
Sbjct: 120 YLLDGWDRKDGKTAAVRVIDPEQADVFFVPFFSALSFNSHGHGMSEGAAADKRLQIVLLT 179
Query: 162 -----------VKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRV 210
V+ + +++K +W + G DH V H +R ++ +I +
Sbjct: 180 FGRHVNASCHLVQAGLVDILSKSKWWQASQGRDHILVAHHPNALRHYRD---MLNQSIFI 236
Query: 211 VCS-PSYD--VAFIPHKDVALP--QVLQPFALPRGGRDVENRTVLGFWAG--HRNSK--I 261
V YD VA + KDV P VL + R L F+ G HR +
Sbjct: 237 VADFGRYDKTVARLS-KDVVAPYVHVLPSYDQDNPADPFSLRKTLLFFQGRIHRKGDGIV 295
Query: 262 RVILARVWENDTELDISNNRIN-RAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHY 320
R LA + N++++ ++ + AI R ++FC+ P G +S R+ D+I
Sbjct: 296 RTKLAELLANNSDVHYVDSLASAEAIATSTAGMR--TSRFCLHPAGDTPSSCRLFDAIVS 353
Query: 321 GCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVK 377
CVPVI+S+ +LPF D L+++ F++ + + + L + L++I+ ++ + N L
Sbjct: 354 HCVPVIISDRIELPFEDDLNYKDFSIFFSSEESVKPGHLLRTLRSITRERWLRMWNALKT 413
Query: 378 VQKHFQWNSPPVRFDAFHMVMYEL 401
V HF++ PP + DA +M+ ++
Sbjct: 414 VSHHFEYQHPPKKDDAVNMIFKQV 437
>gi|161612233|gb|AAI55813.1| Exostoses (multiple) 1c [Danio rerio]
Length = 737
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 92/363 (25%), Positives = 142/363 (39%), Gaps = 61/363 (16%)
Query: 47 LQNNFNSTLSFAINKQDYSDLGSDV-FHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQT 105
LQN S KQ +S + D + F + + F++YIYP
Sbjct: 63 LQNGGGPGDSPRQRKQAWSSIYKDSRCRMDTCFDFGRCQTQSGFRVYIYP---------- 112
Query: 106 PRKLTGKYASEGY--FFQNIRESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVK 163
P K G+ SE Y ++ ESR+ T DP A LF + I + ++
Sbjct: 113 PEK--GERVSESYRKILTSVSESRYYTSDPREACLFVLGIDT-------LDRDQLSQQFV 163
Query: 164 DYVDSLINKYPYWNRTLGADHFFVTCHD-VGVRATEGVPFLIKNAIRVVCSPSYDVAFIP 222
VD I YP WN G +H + TE + F + AI S + + F P
Sbjct: 164 PNVDERIRGYPLWND--GRNHVIFNLYSGTWPNYTEDLGFNVGQAILAKASLNTE-HFRP 220
Query: 223 HKDVALPQVLQPFALPRGGRD--VEN-----RTVLGFWAGHR-----------------N 258
D+++P + G R V N R L + G R N
Sbjct: 221 GFDISIPLFSKEHPQKGGKRGWLVRNSVPPRRKYLLMFKGKRYLTGIGSDTRNALHHIHN 280
Query: 259 SKIRVIL-----ARVWEN--DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNS 311
K V L + WE D D N R YQ + + FC+ P G ++ S
Sbjct: 281 GKDIVSLTTCRHGKDWEKHKDARCDHDNQEYER----FDYQELLHNSTFCLVPRGRRLGS 336
Query: 312 ARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSL 371
R +S+ C+PV+LSN ++LPF+D++ W + V +ER + Q+ ++ + ++L
Sbjct: 337 FRFLESLQAACIPVLLSNGWELPFSDVIQWNQAVVEGDERLLLQVPSTVRAVGIDRVLAL 396
Query: 372 HNN 374
Sbjct: 397 RQQ 399
>gi|224120296|ref|XP_002318294.1| predicted protein [Populus trichocarpa]
gi|222858967|gb|EEE96514.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 72/284 (25%), Positives = 135/284 (47%), Gaps = 41/284 (14%)
Query: 131 HDPDRAHLFFIPI-----------SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRT 179
+DP+ A LF++P+ K+ Y + + +D + + K YW R
Sbjct: 61 NDPEEADLFYVPVFSSLSLTVNPVQVGKVPVSDPVYSDEKM--QDELVEWLEKQEYWRRN 118
Query: 180 LGADH-FFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVAL-PQVLQPFA- 236
G DH F + R + V KNA+ ++ D + +L V+ P+A
Sbjct: 119 NGRDHVLFAGDPNALYRVLDRV----KNAVLLLS----DFGRVRSDQGSLVKDVIVPYAH 170
Query: 237 ---LPRGGRDVENRTVLGFWAGHR----NSKIRVILARVWENDTELDISN---NRINRAI 286
+ G V+ R L F+ G+R KIR +L ++ E + ++ IS+ +R +R
Sbjct: 171 RINVYNGDIGVDERKTLLFFMGNRYRKDGGKIRDMLFQLLEKEEDVLISHGTQSRESRRT 230
Query: 287 GPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAV 346
L + +KFC+ P G ++ R+ DSI CVP+I+S+ +LPF D++D+RK A+
Sbjct: 231 ATL----GMHTSKFCLNPAGDTPSACRLFDSIVSLCVPLIVSDSIELPFEDVIDYRKIAI 286
Query: 347 ILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSP 387
++ + L ++L+ +S + + +V+++F+++ P
Sbjct: 287 FVDTESSLKPGYLVKLLRAVSTERILEYQKEMREVKRYFEYSDP 330
>gi|224136520|ref|XP_002322350.1| predicted protein [Populus trichocarpa]
gi|222869346|gb|EEF06477.1| predicted protein [Populus trichocarpa]
Length = 343
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 71/234 (30%), Positives = 104/234 (44%), Gaps = 47/234 (20%)
Query: 161 IVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRV-- 210
+++ + + +PYWNRT GADHFFV HD G +A E G+ L+++A V
Sbjct: 55 MIRSATQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQHATLVQT 114
Query: 211 ------VCSPSYDVAFIPHKDVALPQVLQPFALPRGGR------------DVENRTVLGF 252
VC + +P A PQ +Q +P DV N G+
Sbjct: 115 FGQRNHVCLKDGSIT-VP--SYAPPQKMQTHLIPEKTPRSIFVYFRGLFYDVGNDPEGGY 171
Query: 253 WAGHRNSKIRVILARVWEN--DTEL-DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQV 309
+A R A VWEN D L DIS P Y + FC+CP
Sbjct: 172 YA-------RGARAAVWENFKDNPLFDISTEH------PTTYYEDMQQAVFCLCPLSWAP 218
Query: 310 NSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNI 363
S R+ +++ +G +PVI+ + LPF D + W + V ++E+DV L IL +I
Sbjct: 219 WSPRLVEALIFGYIPVIIVDDIVLPFADAIPWEEIGVFVDEKDVPNLDTILTSI 272
>gi|356512371|ref|XP_003524893.1| PREDICTED: probable glycosyltransferase At3g07620-like [Glycine
max]
Length = 427
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 73/289 (25%), Positives = 124/289 (42%), Gaps = 34/289 (11%)
Query: 132 DPDRAHLFFIPI--------SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGAD 183
DP+ A FF+P H M+ T + + D ++ L+ K YW R+ G D
Sbjct: 120 DPELAQAFFVPFFSSLSFNTHGHTMKDPATQIDRQLQV--DLME-LLKKSNYWQRSGGRD 176
Query: 184 HFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIP------HKDVALP--QVLQPF 235
H F H R + +I+VV D P +KDV P V+ F
Sbjct: 177 HVFPMTHPNAFRFLRD---QLNESIQVVV----DFGRYPRGMSNLNKDVVSPYVHVVDSF 229
Query: 236 ALPRGGRDVENRTVLGFWAGHRNSK----IRVILARVWENDTELDISNNRINRAIGPLVY 291
E+R+ L F+ G K +RV LA++ ++ + +
Sbjct: 230 TDDEPQDPYESRSTLLFFRGRTYRKDEGIVRVKLAKILAGYDDVHYERS-VATEENIKAS 288
Query: 292 QRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNER 351
+ +KFC+ P G +S R+ D+I C+PVI+S+ +LPF D +D+ +F+V + +
Sbjct: 289 SKGMRSSKFCLHPAGDTPSSCRLFDAIVSHCIPVIVSDQIELPFEDEIDYSQFSVFFSFK 348
Query: 352 DVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMV 397
+ Q + L+ ++ + L + H+++ PP R DA M+
Sbjct: 349 EALQPGYMIDQLRKFPKEKWTEMWRQLKSISHHYEFRYPPKREDAVDML 397
>gi|326530125|dbj|BAK08342.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 80/288 (27%), Positives = 128/288 (44%), Gaps = 34/288 (11%)
Query: 132 DPDRAHLFFIPISCH-------KMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADH 184
DP RA + F+P K+ + E+ T+ + + + P W R+ G DH
Sbjct: 142 DPARADVVFVPFFASLSFNRHSKVVPPARTSEDRTL--QRRLIEFLAARPEWRRSGGRDH 199
Query: 185 FFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPH------KDVALP--QVLQPFA 236
+ H G+ + + + V+C D PH KDV P V+ F
Sbjct: 200 VVLAHHPNGMLDAR---YKLWPCVFVLC----DFGRYPHSVANIDKDVIAPYQHVVDDFL 252
Query: 237 LPRGGRDVENRTVLGFWAGHRNSK----IRVILARVWENDTELDISNNRINRAIGPLVYQ 292
G D +R L ++ G K IR L + +++ ++ S + G
Sbjct: 253 NDSTGYD--DRPTLLYFQGAIYRKDGGFIRQELYYLLKDEKDVHFSFGSVA-GNGIEEST 309
Query: 293 RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERD 352
R +KFC+ G +S R+ DSI CVPVI+S+ +LPF D+LD+ KF +I+ D
Sbjct: 310 RGMRASKFCLNIAGDTPSSNRLFDSIVSHCVPVIISDEIELPFEDMLDYSKFCIIVRGAD 369
Query: 353 VYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMV 397
+ L ++K IS E+ S+ N L +V+ HF++ P DA M+
Sbjct: 370 AVKKGFLINLIKGISPEEWTSMWNKLREVEGHFEYQYPSQPEDAVQMI 417
>gi|125572389|gb|EAZ13904.1| hypothetical protein OsJ_03829 [Oryza sativa Japonica Group]
Length = 504
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 71/234 (30%), Positives = 106/234 (45%), Gaps = 21/234 (8%)
Query: 170 INKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVA---FIP---- 222
+ P W R+ G DH H ++ V +K AI ++ P D + P
Sbjct: 211 VTDQPAWQRSEGRDHVIPVHHPWSFKS---VRRFVKKAIWLL--PDMDSTGNWYKPGQVY 265
Query: 223 -HKDVALPQVLQPFALPRG--GRDVENRTVLGFWAGH--RNSKIRVILARVWE-NDTELD 276
KDV LP V R+ L F+ G RN+ ++ V E D E
Sbjct: 266 LEKDVILPYVPNVDLCDSKCVSETQSRRSTLLFFRGRLRRNAGGKIRSKLVTELKDAEGI 325
Query: 277 ISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFN 336
I A G Q K+ FC+ P G +SAR+ D+I GC+PVI+S+ +LPF
Sbjct: 326 IIEEGTAGADGKAAAQNGMRKSLFCLNPAGDTPSSARLFDAIVSGCIPVIVSDELELPFE 385
Query: 337 DILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSP 387
ILD+RK A+ ++ D Q L + L++I + +NL+K +HF ++SP
Sbjct: 386 GILDYRKIALFVSSNDAVQPGWLVKYLRSIDAKRIRQMQSNLLKYSRHFLYSSP 439
>gi|118781611|ref|XP_311558.3| AGAP010388-PA [Anopheles gambiae str. PEST]
gi|116130033|gb|EAA07205.3| AGAP010388-PA [Anopheles gambiae str. PEST]
Length = 744
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 82/345 (23%), Positives = 146/345 (42%), Gaps = 47/345 (13%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
NF +Y+YP +P P ++ Y +I+ESR+ T DP++A LF + I
Sbjct: 100 NFYVYVYPP-EPLNSLGAPPPISQNYQK---IISSIQESRYYTTDPEQACLFVLGIDT-- 153
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRA---TEGVPFLI 204
++++ V S + + P+WN G +H + G G+ F
Sbjct: 154 -----LDRDSLSEDFVRNVPSRLQRLPHWNN--GRNHIIFNLYS-GTWPDYNENGLGFDP 205
Query: 205 KNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRD--------VENRTVLGFWAGH 256
AI S S + P DV++P + F L RGG N+ L + G
Sbjct: 206 GQAILAKASMSIQ-SLRPGFDVSIPLFHKQFPL-RGGNTGFVISNNFPANKKYLLAFKGK 263
Query: 257 R-----------------NSKIRVILARVWENDTELDISNNRI---NRAIGPLVYQRRFY 296
R N++ V++ + D+ + R NR Y+
Sbjct: 264 RYVHGIGSETRNSLFHLHNARDFVLVTTCKHGKSWRDLQDARCDEDNREYDRYDYETLLQ 323
Query: 297 KTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQL 356
+ FC+ P G ++ S R + + GC+PV+LSN + LPF +DW++ A+ +ER + Q+
Sbjct: 324 NSTFCLVPRGRRLGSFRFLEVLQAGCIPVLLSNSWVLPFQSKIDWKQAAIWADERLLLQV 383
Query: 357 KQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYEL 401
I+++IS + ++L + + + + + F F ++ L
Sbjct: 384 PDIVRSISTSRILALRQQTQVLWERYFSSIEKIIFTTFEIIRERL 428
>gi|194707922|gb|ACF88045.1| unknown [Zea mays]
Length = 240
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 58/176 (32%), Positives = 92/176 (52%), Gaps = 17/176 (9%)
Query: 246 NRTVLGFWAGH--RNS--KIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFC 301
R++L F+ G RN+ KIR L ++ ++ I A G Q K+ FC
Sbjct: 41 KRSILLFFRGRLKRNAGGKIRSKLVEELKSAKDIVIEEGSTG-AQGKAAAQDGMRKSFFC 99
Query: 302 ICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ---LKQ 358
+ P G +SAR+ D+I GC+PVI+S+ +LPF ILD+R+ A+ ++ D Q L +
Sbjct: 100 LSPAGDTPSSARLFDAIVSGCIPVIISDELELPFEGILDYREIALFVSASDAVQPGWLLK 159
Query: 359 ILKNISDAEFVSLHNNLVKVQKHFQWNSP--PV-------RFDAFHMVMYELWLRR 405
L+ I+ + +NLVK +HF ++SP P+ R A +V +L +RR
Sbjct: 160 YLRGINAKRIREIQSNLVKYSRHFLYSSPAQPLGPEDLTWRMIAGKLVNIKLQIRR 215
>gi|145352087|ref|XP_001420390.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580624|gb|ABO98683.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 517
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 50/123 (40%), Positives = 72/123 (58%), Gaps = 6/123 (4%)
Query: 288 PLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVI 347
P Y K+K+C+ G + ++AR+ D I +GCVPVI+++ YDLPF+ + DW KF+V
Sbjct: 398 PRDYMTMLSKSKYCLYVYGDRAHTARLYDIITFGCVPVIVADGYDLPFSWLFDWSKFSVR 457
Query: 348 LNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW-NSPPVRFDAFHMVMYELWLRRH 406
+ E DV L IL A++ SL LVKV FQ+ N + DAF + M L +RR
Sbjct: 458 VLEDDVATLPSILDR---ADYDSLRRELVKVHSFFQYHNRGSIFGDAFWITM--LGVRRQ 512
Query: 407 VIK 409
+ K
Sbjct: 513 LAK 515
>gi|255542540|ref|XP_002512333.1| catalytic, putative [Ricinus communis]
gi|223548294|gb|EEF49785.1| catalytic, putative [Ricinus communis]
Length = 434
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 72/295 (24%), Positives = 122/295 (41%), Gaps = 38/295 (12%)
Query: 132 DPDRAHLFFIPI--------SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGAD 183
DP++A FF+P H M T + + D +D ++ K YW ++ G D
Sbjct: 127 DPEKADAFFVPFFSSLSFNTHGHTMTDPETEIDRQLQV--DVID-MLYKSKYWQKSGGRD 183
Query: 184 HFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPH------KDVALP--QVLQPF 235
H H R ++ + D P KDV P V+ F
Sbjct: 184 HVIPMTHPNAFR-------FLRQQLNASILIVADFGRYPKSMSTLSKDVVAPYVHVVDSF 236
Query: 236 ALPRGGRDVENRTVLGFWAGHR----NSKIRVILARVWE--NDTELDISNNRINRAIGPL 289
E+RT L F+ G+ K+R LA++ +D + S+
Sbjct: 237 TDDEVSNPFESRTTLLFFRGNTIRKDEGKVRAKLAKILTGYDDIHFERSSATAETIKAST 296
Query: 290 VYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
R +KFC+ P G +S R+ D+I CVPVI+S+ +LP+ D +D+ +F+V +
Sbjct: 297 EGMR---SSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPYEDEIDYSQFSVFFS 353
Query: 350 ERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYEL 401
+ Q + L+ + ++ + L + HF++ PP + DA M+ E+
Sbjct: 354 VNEAIQPGYMVDQLRQLPKERWLEMWRKLKSISHHFEFQYPPEKEDAVDMLWREV 408
>gi|195431447|ref|XP_002063753.1| GK15839 [Drosophila willistoni]
gi|194159838|gb|EDW74739.1| GK15839 [Drosophila willistoni]
Length = 776
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 85/352 (24%), Positives = 148/352 (42%), Gaps = 47/352 (13%)
Query: 81 NYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFF 140
++ + F +Y+YP N+ P A+ I+ESR+ T DP A LF
Sbjct: 103 DFTKCFEQFLVYVYPPEPLNSLGAAP----PTSANYQKILTAIQESRYYTTDPSIACLFV 158
Query: 141 IPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTC----------H 190
+ I Y V++ V S + + P+WN G +H H
Sbjct: 159 LAIDTLDRDSLSEDY------VRN-VPSRLARLPHWNN--GRNHIIFNLYSGTWPDYAEH 209
Query: 191 DVGVRATEGVPFLIKNAIRVV-CSPSYDVAFIP--HKDVAL----PQVLQPFALPRGGRD 243
+G A E + L K ++ V+ P +DV+ IP HK L +Q P +
Sbjct: 210 SLGFDAGEAI--LAKASMSVLQLRPGFDVS-IPLFHKQFPLRAGATGSVQSNNFPANKKY 266
Query: 244 V----ENRTVLGFWAGHRNS-------KIRVILARVWENDTELDISNNRI---NRAIGPL 289
+ R V G + RNS + V++ + ++ +NR NR
Sbjct: 267 LLAFKGKRYVHGIGSETRNSLFHLHNGRDMVLVTTCRHGKSWRELQDNRCDEDNREYDRY 326
Query: 290 VYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
Y+ + FC+ P G ++ S R +++ GC+PV+LSN + LPF +DW++ A+ +
Sbjct: 327 DYETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIWAD 386
Query: 350 ERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYEL 401
ER + Q+ I+++IS +L + + + + + F F ++ L
Sbjct: 387 ERLLLQVPDIVRSISAERIFALRQQTQVLWERYFGSIEKIVFTTFEIIRERL 438
>gi|302783725|ref|XP_002973635.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
gi|300158673|gb|EFJ25295.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
Length = 452
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 68/264 (25%), Positives = 120/264 (45%), Gaps = 30/264 (11%)
Query: 146 HKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV----RATEGVP 201
H+ + S +N + V L+ W R+ G DH FV + +
Sbjct: 147 HRGEFRKKSSKNSDFDRQRRVVELVTSSLEWRRSNGVDHVFVLADPMAMWHVREQISTAM 206
Query: 202 FLI-----------KNAIRVVCSPSYDVAFIPHKDVALPQ--VLQPFALPRGGRDVENRT 248
FL+ KN + + P KDV +P +L P + D ++RT
Sbjct: 207 FLVVDFGGWYLEDAKNKLNSSTIIQHS-QVSPIKDVIIPHTHLLPPLKIA----DDQHRT 261
Query: 249 VLGFWAG----HRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICP 304
VL ++ G HR+ +R L ++ +N+ E+ + + A G R ++FC+ P
Sbjct: 262 VLLYFRGARHRHRSGLVREKLWKILDNEPEVLLEKGLPDDA-GLAEATRGMRSSEFCLTP 320
Query: 305 GGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ---LKQILK 361
G +S R+ D+I C+PVI+S+ LPF +++ +F V ++ RD Q L Q L+
Sbjct: 321 AGDTPSSCRLYDAIASLCIPVIVSDDIQLPFEGFVNYEEFCVFVSTRDATQPGWLVQKLR 380
Query: 362 NISDAEFVSLHNNLVKVQKHFQWN 385
+I E ++ L +VQ++F+++
Sbjct: 381 SIGSEERSTMRQTLSRVQRYFEYD 404
>gi|302787839|ref|XP_002975689.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
gi|300156690|gb|EFJ23318.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
Length = 452
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 68/264 (25%), Positives = 120/264 (45%), Gaps = 30/264 (11%)
Query: 146 HKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV----RATEGVP 201
H+ + S +N + V L+ W R+ G DH FV + +
Sbjct: 147 HRGEFRKRSSKNSDFDRQRRVVELVTSSLEWRRSNGVDHVFVLADPMAMWHVREQISTAV 206
Query: 202 FLI-----------KNAIRVVCSPSYDVAFIPHKDVALPQ--VLQPFALPRGGRDVENRT 248
FL+ KN + + P KDV +P +L P + D ++RT
Sbjct: 207 FLVVDFGGWYLEDAKNKLNSSTIIQHS-QVSPIKDVIIPHTHLLPPLKIA----DDQHRT 261
Query: 249 VLGFWAG----HRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICP 304
VL ++ G HR+ +R L ++ +N+ E+ + + A G R ++FC+ P
Sbjct: 262 VLLYFRGARHRHRSGLVREKLWKILDNEPEVLLEEGLPDDA-GLAEATRGMRSSEFCLTP 320
Query: 305 GGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ---LKQILK 361
G +S R+ D+I C+PVI+S+ LPF +++ +F V ++ RD Q L Q L+
Sbjct: 321 AGDTPSSCRLYDAIASLCIPVIVSDDIQLPFEGFVNYEEFCVFVSARDATQPGWLVQKLR 380
Query: 362 NISDAEFVSLHNNLVKVQKHFQWN 385
+I E ++ L +VQ++F+++
Sbjct: 381 SIGSEERSTMRQTLSRVQRYFEYD 404
>gi|302845214|ref|XP_002954146.1| hypothetical protein VOLCADRAFT_118607 [Volvox carteri f.
nagariensis]
gi|300260645|gb|EFJ44863.1| hypothetical protein VOLCADRAFT_118607 [Volvox carteri f.
nagariensis]
Length = 401
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 79/320 (24%), Positives = 128/320 (40%), Gaps = 43/320 (13%)
Query: 120 FQNIRESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRT 179
Q + S T D D A FFIP+ MR KG S ++T +V Y+ +P+W R
Sbjct: 1 MQRLLSSGVRTADGDAADFFFIPLV---MRTKGQSANHLTAVVS-YIQ---QHWPWWGRY 53
Query: 180 LGAD-HFFVTCHDVGVRA--------TEGVPFLI---------KNAIRVVCSPSYDVAFI 221
G H V D+G R E V FL + A P D+
Sbjct: 54 GGGHRHLLVVPADLGRRMLPEELLKLVENVTFLTHWGSHTNHSEGAWVESHRPGKDIVVP 113
Query: 222 PHKDVALPQVLQPFALPRGGRDVENRTVLGFWAGH----RNSKIRVILARVWENDTELDI 277
P + P V P R RT F++G + R + + +
Sbjct: 114 PLHNADEPIVFSPLHTLHSKRR-RQRTSGLFFSGRICSDGSEPHRGRCRTNSQGNVRHKV 172
Query: 278 SNNRINRAIGPLVYQRRFYKTK-----FCICPGGSQVNSARIADSIHYGCVPVILSNYYD 332
+ NR L + + Y + FC+ PGG + ++ GCVPV++ +
Sbjct: 173 LKHHWNRTTWTLTTRAKAYASALSSHTFCLSPGGGGYGRRSVQAAV-MGCVPVLIGDGLH 231
Query: 333 LPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS------ 386
PF LDW +F++ + E+D+ L IL++++ + ++ L +H +++
Sbjct: 232 QPFEPELDWSQFSMSVPEQDIPHLHTILESMNSSTIAAMQEQLRCAAQHLYYSTTFGEVM 291
Query: 387 -PPVRFDAFHMVMYELWLRR 405
R+DAF +M L +RR
Sbjct: 292 GEDGRYDAFETLMEVLRMRR 311
>gi|410911648|ref|XP_003969302.1| PREDICTED: exostosin-1b-like [Takifugu rubripes]
Length = 740
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 93/379 (24%), Positives = 150/379 (39%), Gaps = 69/379 (18%)
Query: 46 PLQNNFNSTLSFAINKQDYSDLGSDVFHSP-----SVFHLNYLEMVTNFKIYIYPDGDPN 100
P + + I+ + D+ + V+ S S F L+ L FK+Y+YP
Sbjct: 61 PWDQSDTEEYNLHISPRQKRDINTSVYKSKRCRMHSCFDLS-LCRKDGFKVYVYP----- 114
Query: 101 TFYQTPRKLTGKYASEGY--FFQNIRESRFLTHDPDRAHLFFIPISCHKMRGKGTSY-EN 157
+ G+ SE Y I SRF T DP +A LF + + Y N
Sbjct: 115 -------QQKGEKISESYQNILSTIEGSRFYTSDPGQACLFVLSLDTLDRDQLSPQYVHN 167
Query: 158 MTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHD-VGVRATEGVPFLIKNAIRVVCSPSY 216
M V++ P WN G +H + TE + F I A+ S S
Sbjct: 168 MKTKVQNL--------PLWND--GRNHLIFNLYSGTWPDYTEDLGFDIGQAMLAKASIST 217
Query: 217 DVAFIPHKDVALPQVLQPFALPRGGRDV-----------------ENRTVLGFWAGHRNS 259
D F P+ DV++P + G R R + G + RN+
Sbjct: 218 D-NFRPNFDVSIPLFSKEHPRTGGERGYLKYNSIPPFRKYMLVFKGKRYLTGIGSDTRNA 276
Query: 260 ------KIRVIL------ARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGS 307
V+L + W+ D ++ N Y+ + + FC+ P G
Sbjct: 277 LHHVHNAEDVVLLTTCKHGKDWQKHK--DARCDKDNAEYDKYDYKEMLHNSTFCLVPRGR 334
Query: 308 QVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAE 367
++ S R +++ CVPV+LSN ++LPF++I+DW AVI +ER + Q+ + +I +
Sbjct: 335 RLGSFRFLEALQAACVPVMLSNGWELPFSEIIDWNTAAVIGDERLLLQIPTTVHSIHQDK 394
Query: 368 FVSLHNNLVKVQKHFQWNS 386
+SL + Q F W +
Sbjct: 395 ILSL-----RQQTQFLWEA 408
>gi|42571577|ref|NP_973879.1| exostosin-like protein [Arabidopsis thaliana]
gi|332191986|gb|AEE30107.1| exostosin-like protein [Arabidopsis thaliana]
Length = 410
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 74/287 (25%), Positives = 121/287 (42%), Gaps = 35/287 (12%)
Query: 88 NFKIYIYP----DGDPNTFY------QTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAH 137
+ KIY+Y DG Y +T L G++ S+ + + ES+F T D A
Sbjct: 88 SLKIYVYDENEIDGLKELLYGRDGSVKTTACLKGQWGSQVKIHKLLLESKFRTIKKDEAD 147
Query: 138 LFFIP--ISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVR 195
LFF+P + C +M G E VK ++++ PY+ R+ G DH FV G
Sbjct: 148 LFFVPAYVKCVRMLGGLNDKEINQTYVK-----VLSQMPYFRRSGGRDHIFVFPSGAGAH 202
Query: 196 ATEGVPFLIKNAIRVVCSPSYD-------VAFIPHKDVALPQVLQPFALPRGGRDVE--- 245
I +I + +P D AF KD+ +P + G DV+
Sbjct: 203 LFRSWSTFINRSI--ILTPEADRTDKKDTTAFNSWKDIIIPGNVDDAMTKNGQPDVQPLP 260
Query: 246 --NRTVLGFWAGHRNSK---IRVI-LARVWENDTELDISNNRINRAIGPLVYQRRFYKTK 299
R L + G K +++I L++ + + E G Y K
Sbjct: 261 LSKRKYLANYLGRAQGKAGRLKLIDLSKQFPDKLECPDLKFSGTEKFGRTTYFEHLRNAK 320
Query: 300 FCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAV 346
FC+ P G + R +S CVPV+LS++ +LPF +++D+ + ++
Sbjct: 321 FCLAPRGESSWTLRFYESFFVECVPVLLSDHAELPFQNVIDYAQVSI 367
>gi|432883098|ref|XP_004074204.1| PREDICTED: exostosin-1c-like [Oryzias latipes]
Length = 740
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 81/316 (25%), Positives = 128/316 (40%), Gaps = 56/316 (17%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
F++YIYP + ++ RK+ +I ESR+ T DP A LF + I
Sbjct: 108 GFRVYIYPSEKNDRVSESYRKI----------LASIAESRYYTSDPREACLFVLGIDT-- 155
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHD-VGVRATEGVPFLIKN 206
+ ++ VD I YP WN G +H + TE + F I
Sbjct: 156 -----LDRDQLSGQFVSNVDDRIRGYPLWNE--GRNHLIFNLYSGTWPNYTEDLGFNIGQ 208
Query: 207 AIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRD--VEN-----RTVLGFWAGHR-- 257
AI S + + F P DV++P + G R V N R L + G R
Sbjct: 209 AILAKASLNTE-HFRPGFDVSIPLFSKDHPQKGGERGWLVRNSTPPRRKYLLMFKGKRYL 267
Query: 258 ---------------NSKIRVIL-----ARVWEN--DTELDISNNRINRAIGPLVYQRRF 295
N K V L + WE D D N R YQ
Sbjct: 268 TGIGSDTRNALHHIHNGKDIVSLTTCRHGKDWEKHKDARCDHDNLEYER----FDYQELL 323
Query: 296 YKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ 355
+ + FC+ P G ++ S R +S+ C+PV+LSN ++LPF+D++ W + + +ER + Q
Sbjct: 324 HNSTFCLVPRGRRLGSFRFLESLQAACIPVLLSNGWELPFSDVIQWNQAVIEGDERLLLQ 383
Query: 356 LKQILKNISDAEFVSL 371
+ ++ + + ++L
Sbjct: 384 VPSTVRAVGNERVLAL 399
>gi|148230947|ref|NP_001082080.1| exostosin [Xenopus laevis]
gi|13183631|gb|AAK15278.1|AF319538_1 exostosin [Xenopus laevis]
gi|37921194|gb|AAO84329.1| exostosin 1 [Xenopus laevis]
Length = 738
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 80/331 (24%), Positives = 135/331 (40%), Gaps = 61/331 (18%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
FK+YIYP Q KL+ Y + I SRF T DP +A +F + +
Sbjct: 105 GFKVYIYPQ-------QKGDKLSDSYQ---HILAAIETSRFYTSDPSQACVFVLSLDT-- 152
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHD-VGVRATEGVPFLIKN 206
+ ++ + S + WN G +H + TE V F I
Sbjct: 153 -----LDRDQLSPHYVHNLKSKVQNLHLWNN--GRNHLIFNLYSGTWPDYTEDVGFDIGQ 205
Query: 207 AIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVE-------------------NR 247
A+ S S + F P+ DV++P + PR G D R
Sbjct: 206 AMLAKASISTE-NFRPNFDVSIPLFSKDH--PRTGGDKGFLRFNNIPPMRKYMLVFKGKR 262
Query: 248 TVLGFWAGHRNSKIRVILA------------RVWENDTELDISNNRINRAIGPLVYQRRF 295
+ G + RN+ VI+ + W+ D ++ N Y+
Sbjct: 263 YLTGIGSDTRNALYHVIMGEDVVLLTTCKHGKKWQKHK--DARCDKDNAEYEKYDYREML 320
Query: 296 YKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ 355
+ FC+ P G ++ S R +++ CVPV+LSN ++LPF++++DW + AVI +ER + Q
Sbjct: 321 HNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWNQAAVIGDERLLLQ 380
Query: 356 LKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
+ +++I + ++L + Q F W +
Sbjct: 381 IPSTIRSIHQDKILAL-----RQQTQFLWEA 406
>gi|129270192|ref|NP_001012368.2| exostosin-1a [Danio rerio]
gi|126632203|gb|AAI33118.1| Exostoses (multiple) 1a [Danio rerio]
Length = 730
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 86/341 (25%), Positives = 139/341 (40%), Gaps = 81/341 (23%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGY--FFQNIRESRFLTHDPDRAHLFFIPISC 145
FK+Y+YP + G+ SE Y +I SRF T DP +A LF +
Sbjct: 97 GFKVYVYP------------QQKGEKISESYQNILSSIEGSRFYTSDPGQACLFVL---- 140
Query: 146 HKMRGKGTSYENMTVIVKD-----YVDSL---INKYPYWNRTLGADHFFVTCHD-VGVRA 196
N+ + +D YV +L I WN G +H +
Sbjct: 141 -----------NLDTLDRDQLSPQYVHNLKTKIQNLNLWNN--GRNHLIFNLYSGTWPDY 187
Query: 197 TEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDV------------ 244
TE + F I A+ S S + +F P+ D+++P + G R
Sbjct: 188 TEDLGFDIGQAMLAKASISTE-SFRPNFDISIPLFSKDHPRTGGERGFLKYNTIPPFRKY 246
Query: 245 -----ENRTVLGFWAGHRNSKIR------VIL------ARVWEN--DTELDISNNRINRA 285
R + G + RN+ V+L + W+ D D N +R
Sbjct: 247 MLVFKGKRYLTGIGSDTRNALYHIHNAEDVVLLTTCKHGKDWQKHKDARCDKDNAEYDR- 305
Query: 286 IGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFA 345
Y+ + + FC+ P G ++ S R +++ CVPV+LSN ++LPF++I+DWR A
Sbjct: 306 ---YDYKEMLHNSTFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEIIDWRTAA 362
Query: 346 VILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
VI +ER + Q+ +++I +SL + Q F W +
Sbjct: 363 VIGDERLLLQIPSTVRSIHQDRLLSL-----RQQTQFLWEA 398
>gi|348512501|ref|XP_003443781.1| PREDICTED: exostosin-1a-like [Oreochromis niloticus]
Length = 743
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 83/322 (25%), Positives = 134/322 (41%), Gaps = 62/322 (19%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
FK+Y+YP Q KL+ Y + +I SRF T DP +A LF + +
Sbjct: 110 GFKVYVYPQ-------QKGEKLSESYQN---ILSSIEGSRFYTSDPGQACLFVLSLDTL- 158
Query: 148 MRGKGTSYENMTVIVKDYVDSL---INKYPYWNRTLGADHFFVTCHD-VGVRATEGVPFL 203
+ + YV +L I P WN G +H + TE + F
Sbjct: 159 ---------DRDQLSPQYVHNLKAKIQNLPLWNG--GKNHIIFNLYSGTWPDYTEDLGFD 207
Query: 204 IKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVE------------------ 245
I A+ S S + F P DV++P + PR G +
Sbjct: 208 IGLAMLAKASISTE-NFRPDFDVSIPLFSKDH--PRTGGEKGYLKYNTIPPYRKYMLVFK 264
Query: 246 -NRTVLGFWAGHRN-------SKIRVIL-----ARVWENDTELDISNNRINRAIGPLVYQ 292
R + G + RN S+ V+L + W+ D ++ N Y+
Sbjct: 265 GKRYLTGIGSDTRNALYHIHNSEDVVLLTTCKHGKDWQKHK--DARCDKDNAEYDKYDYR 322
Query: 293 RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERD 352
Y + FC+ P G ++ S R +++ CVPV+LSN ++LPF++I+DW + AVI +ER
Sbjct: 323 EMLYNSTFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEIIDWNRAAVIGDERL 382
Query: 353 VYQLKQILKNISDAEFVSLHNN 374
+ Q+ +++I + +SL
Sbjct: 383 LLQIPSTVRSIHQDKILSLRQQ 404
>gi|432908332|ref|XP_004077815.1| PREDICTED: exostosin-1b-like [Oryzias latipes]
Length = 741
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 84/332 (25%), Positives = 138/332 (41%), Gaps = 63/332 (18%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
FK+Y+YP Q KL+ Y + +I SRF T DP +A LF + +
Sbjct: 108 GFKVYVYPQ-------QKGEKLSESYQN---ILSSIEGSRFHTPDPAKACLFVLSLDTL- 156
Query: 148 MRGKGTSYENMTVIVKDYVDSL---INKYPYWNRTLGADHFFVTCHD-VGVRATEGVPFL 203
+ + YV +L I P WN G +H + TE + F
Sbjct: 157 ---------DRDQLSPQYVHNLKAKIQSLPLWNE--GRNHIIFNLYSGTWPDYTEDLGFD 205
Query: 204 IKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDV-----------------EN 246
I A+ S S + F P+ DV++P + G R
Sbjct: 206 IGFAMLAKASISTE-NFRPNFDVSVPLFSKDHPRTGGERGFLKHNTIPPYRKYMLVFKGK 264
Query: 247 RTVLGFWAGHRN-------SKIRVIL-----ARVWENDTELDISNNRINRAIGPLVYQRR 294
R + G + RN S+ V+L + W+ D+ ++ N Y+
Sbjct: 265 RYLTGIGSDTRNALYHVHNSEDVVLLTTCKHGKDWQKHK--DVRCDKDNVEYDKYDYREM 322
Query: 295 FYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVY 354
Y + FC+ P G ++ S R +++ CVPV+LSN ++LPF++I++W AVI +ER +
Sbjct: 323 LYNSTFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEIINWNTAAVIGDERLLL 382
Query: 355 QLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
Q+ +++I + +SL + Q F W +
Sbjct: 383 QIPSTVRSIHQDKILSL-----RQQTQFLWEA 409
>gi|68052299|sp|Q5IGR8.1|EXT1A_DANRE RecName: Full=Exostosin-1a; AltName:
Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
acetylglucosaminyl-proteoglycan
4-alpha-N-acetylglucosaminyltransferase 1a; AltName:
Full=Multiple exostoses protein 1 homolog a
gi|56785791|gb|AAW29033.1| EXT1a [Danio rerio]
Length = 730
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 86/341 (25%), Positives = 139/341 (40%), Gaps = 81/341 (23%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGY--FFQNIRESRFLTHDPDRAHLFFIPISC 145
FK+Y+YP + G+ SE Y +I SRF T DP +A LF +
Sbjct: 97 GFKVYVYP------------QQKGEKISESYQNILSSIEGSRFYTSDPGQACLFVL---- 140
Query: 146 HKMRGKGTSYENMTVIVKD-----YVDSL---INKYPYWNRTLGADHFFVTCHD-VGVRA 196
N+ + +D YV +L I WN G +H +
Sbjct: 141 -----------NLDTLDRDQLSPQYVHNLKTKIQNLNLWNN--GRNHLIFNLYSGTWPDY 187
Query: 197 TEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDV------------ 244
TE + F I A+ S S + +F P+ D+++P + G R
Sbjct: 188 TEDLGFDIGQAMLAKASISTE-SFRPNFDISIPLFSKDHPRTGGERGFLKYNTIPPFRKY 246
Query: 245 -----ENRTVLGFWAGHRNSKIR------VIL------ARVWEN--DTELDISNNRINRA 285
R + G + RN+ V+L + W+ D D N +R
Sbjct: 247 MLVFKGKRYLTGIGSDTRNALYHIHNAEDVVLLTTCKHGKDWQKHKDARCDKDNAEYDR- 305
Query: 286 IGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFA 345
Y+ + + FC+ P G ++ S R +++ CVPV+LSN ++LPF++I+DWR A
Sbjct: 306 ---YDYKEMLHNSTFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEIIDWRTAA 362
Query: 346 VILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
VI +ER + Q+ +++I +SL + Q F W +
Sbjct: 363 VIGDERLLLQIPSTVRSIHQDRILSL-----RQQTQFLWEA 398
>gi|168010648|ref|XP_001758016.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162690893|gb|EDQ77258.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 481
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 78/294 (26%), Positives = 127/294 (43%), Gaps = 32/294 (10%)
Query: 128 FLTHDPDRAHLFFIPI-----------SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYW 176
F +D A L F+P S HK+ G+ + +++ + + + P W
Sbjct: 169 FRVNDWRAADLMFVPFFASLAYNRYTKSEHKVGGELDLVGDKNQKLQEKLLKFLEQQPAW 228
Query: 177 NRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCS----PSYDVAFIPHKDVALP--Q 230
+ G+DH V H A AI +V PS +VA + KDV P
Sbjct: 229 QASGGSDHIVVIHHPNSFHAMRN---FFSKAIFIVADFGRYPS-EVANL-RKDVVAPYKH 283
Query: 231 VLQPFALPRGGRDVENRTVLGFWAG----HRNSKIRVILARVWENDTELDISNNRINRAI 286
V+ F E R +L F+ G + IR L + +N+ + A
Sbjct: 284 VIPSFV--DDSTPFEEREILLFFQGTIVRKQGGVIRQQLYEMLKNEKGVHFEEGSAGSA- 340
Query: 287 GPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAV 346
G ++K C+ G +S R+ D+I CVPVI+S+ +LPF D LD+ F++
Sbjct: 341 GIHSATTGMRRSKCCLNIAGDTPSSNRLFDAIASHCVPVIISDEIELPFEDELDYSGFSI 400
Query: 347 ILNERDVYQLK---QILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMV 397
+N D Q K +++++S E++ L L +V HF++ P +DA +MV
Sbjct: 401 FINSTDAVQEKFVINLIRSVSRKEWMRLWKRLKEVSLHFEYQHPTKPYDAVNMV 454
>gi|307103001|gb|EFN51266.1| hypothetical protein CHLNCDRAFT_141180 [Chlorella variabilis]
Length = 404
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 81/323 (25%), Positives = 133/323 (41%), Gaps = 49/323 (15%)
Query: 119 FFQNIRESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNR 178
F + + S T DP A+LFF+P G + ++ ++ D + + YPYWNR
Sbjct: 90 FIEQLLMSPVRTEDPSEANLFFVPALSWSYGGNALNAVHLDLVA----DHIASHYPYWNR 145
Query: 179 TLGADHFFVTCHDVGVRATEG----------------------VPFLIKNAIRVVCSPSY 216
+ G DH F +D G A G P N +P
Sbjct: 146 SQGRDHIFWLTNDRGACALTGRTEAAIKLTHFGLNTINISVGWGPGAATNPENACYNPLR 205
Query: 217 DVAFIPHKDVA--LPQVLQPFALPRGGRDVENRTVLGF----------WAGHRNSKIRVI 264
DV P D+A L +V + ++ + +T L F ++G+ +R +
Sbjct: 206 DVVAPPFDDMARELMEVSRKLSVE---DIIAAKTSLFFFSGAVSNDSEYSGNTRQLLREL 262
Query: 265 LARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVP 324
+ R W ND E+ I + +G Y +R +KFC G R+ + G VP
Sbjct: 263 VKR-W-NDPEI-IFETEGDTGLGD--YVKRLRASKFCPAVFGYGFG-MRLLTCVFSGSVP 316
Query: 325 VILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+++ P D+L + F++ LN + L +IL++I+D ++ L LV+ + F W
Sbjct: 317 LVIQERVAQPLEDLLPYETFSLRLNNGHLPDLPRILRSITDQQYQRLVQGLVRYRDAFHW 376
Query: 385 NSPPVRFDAFHMVMYELWLRRHV 407
P AF + L RRH+
Sbjct: 377 E-PAAGGKAFEYTIASL-RRRHL 397
>gi|196014406|ref|XP_002117062.1| hypothetical protein TRIADDRAFT_1026 [Trichoplax adhaerens]
gi|190580284|gb|EDV20368.1| hypothetical protein TRIADDRAFT_1026 [Trichoplax adhaerens]
Length = 657
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 80/332 (24%), Positives = 148/332 (44%), Gaps = 64/332 (19%)
Query: 89 FKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHKM 148
FK+Y+Y DG+ K++ YA + I+ S + T+ PD+A LF + +
Sbjct: 17 FKVYVY-DGN--------NKMSDSYAR---VVRTIQNSPYYTNQPDQACLFVLAVDTLD- 63
Query: 149 RGKGTSYENMTVIVKDYVD--SLINKYPYWNRTLGADHF-FVTCHDVGVRATEGVPFLIK 205
R K + +DYV+ S I+ + WNR G +H F E + ++
Sbjct: 64 RDKSS---------EDYVNRVSKISSHKLWNR--GYNHIIFNLFAGTWPDYAEDLSLSLE 112
Query: 206 NAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVEN------------------- 246
NAI + S S D + D++ P + + L D
Sbjct: 113 NAILIKASFS-DSTYRLGFDISWPLFGKDYPLHNLQNDGRQPGSLSSIFPIHRKYKAAFK 171
Query: 247 --RTVLGFWAGHRNSKIR-------VILARVWENDTELDISNNRI---NRAIGPLVYQRR 294
R VLG + RN+ +++ +T + ++R G YQ
Sbjct: 172 GKRYVLGIGSETRNALHHLHDDLNYIMVTTCKHGNTWREHQDSRCVVDELTYGEYDYQDL 231
Query: 295 FYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVY 354
+ FC+ P G ++ S R +++ +GC+P++LSN + LPF++++DW+K V ++ER ++
Sbjct: 232 LINSTFCLVPRGRRLGSFRFLEALQFGCIPIVLSNGWVLPFSEVIDWKKACVQIDERQLF 291
Query: 355 QLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
+ +++++ISD + +++ K Q F W +
Sbjct: 292 DVPELIESISDEKILAM-----KQQSIFLWQT 318
>gi|348519387|ref|XP_003447212.1| PREDICTED: exostosin-1b-like [Oreochromis niloticus]
Length = 740
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 82/332 (24%), Positives = 133/332 (40%), Gaps = 63/332 (18%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGY--FFQNIRESRFLTHDPDRAHLFFIPISC 145
FK+Y+YP + G+ SE Y I SRF T DP +A LF + +
Sbjct: 107 GFKVYVYP------------QQKGEKISESYQNILSTIEGSRFYTSDPGQACLFVLSLDT 154
Query: 146 HKMRGKGTSY-ENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHD-VGVRATEGVPFL 203
Y N+ V++ P WN G +H + TE + F
Sbjct: 155 LDRDQLSPQYVHNLKTKVQNL--------PLWND--GRNHLIFNLYSGTWPDYTEDLGFD 204
Query: 204 IKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDV-----------------EN 246
I A+ S S + F P+ DV++P + G R
Sbjct: 205 IGQAMLAKASISTE-NFRPNFDVSIPLFSKEHPRTGGERGYLKYNSIPPFRKYMLVFKGK 263
Query: 247 RTVLGFWAGHRNSKIRVILA------------RVWENDTELDISNNRINRAIGPLVYQRR 294
R + G + RN+ V A + W+ D ++ N Y+
Sbjct: 264 RYLTGIGSDTRNALYHVHNAEDVVLLTTCKHGKDWQKHK--DARCDKDNAEYDKYDYKEM 321
Query: 295 FYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVY 354
+ + FC+ P G ++ S R +++ CVPV+LSN ++LPF++I+DW AVI +ER +
Sbjct: 322 LHNSTFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEIIDWNTAAVIGDERLLL 381
Query: 355 QLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
Q+ +++I + +SL + Q F W +
Sbjct: 382 QIPTTVRSIHQDKILSL-----RQQTQFLWEA 408
>gi|357474287|ref|XP_003607428.1| Exostosin family protein-like protein [Medicago truncatula]
gi|355508483|gb|AES89625.1| Exostosin family protein-like protein [Medicago truncatula]
Length = 502
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 72/235 (30%), Positives = 104/235 (44%), Gaps = 23/235 (9%)
Query: 170 INKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVA---FIP---- 222
I P W R+ G DH H + V +K AI ++ P D + P
Sbjct: 218 ITDQPAWKRSGGRDHILPVHHPWSFKT---VRRYVKKAIWLL--PDMDSTGNWYKPGQVY 272
Query: 223 -HKDVALPQVLQ-PFALPRGGRDVE-NRTVLGFWAGH--RNS--KIRVILARVWENDTEL 275
KD+ LP V F ++ R L F+ G RN+ KIR L +
Sbjct: 273 LEKDLILPYVANVDFCDATCLSEINPKRNTLLFFRGRLKRNAGGKIRSKLVDQLRGADGV 332
Query: 276 DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPF 335
I G Q K+ FC+ P G +SAR+ D+I GC+PVI+S+ +LPF
Sbjct: 333 VIEEGTSGEG-GKEAAQNGMRKSLFCLNPAGDTPSSARLFDAIVSGCIPVIVSDELELPF 391
Query: 336 NDILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSP 387
ILD+RK A+ ++ D + L + LK+I A + NL K +HF ++SP
Sbjct: 392 EGILDYRKIALFVSSNDALKPSWLLKYLKDIRSAHIKEMQQNLAKYSRHFLYSSP 446
>gi|357474305|ref|XP_003607437.1| Cation proton exchanger [Medicago truncatula]
gi|355508492|gb|AES89634.1| Cation proton exchanger [Medicago truncatula]
Length = 1198
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 72/235 (30%), Positives = 104/235 (44%), Gaps = 23/235 (9%)
Query: 170 INKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVA---FIP---- 222
I P W R+ G DH H + V +K AI ++ P D + P
Sbjct: 218 ITDQPAWKRSGGRDHILPVHHPWSFKT---VRRYVKKAIWLL--PDMDSTGNWYKPGQVY 272
Query: 223 -HKDVALPQVLQ-PFALPRGGRDVE-NRTVLGFWAGH--RNS--KIRVILARVWENDTEL 275
KD+ LP V F ++ R L F+ G RN+ KIR L +
Sbjct: 273 LEKDLILPYVANVDFCDATCLSEINPKRNTLLFFRGRLKRNAGGKIRSKLVDQLRGADGV 332
Query: 276 DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPF 335
I G Q K+ FC+ P G +SAR+ D+I GC+PVI+S+ +LPF
Sbjct: 333 VIEEGTSGEG-GKEAAQNGMRKSLFCLNPAGDTPSSARLFDAIVSGCIPVIVSDELELPF 391
Query: 336 NDILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSP 387
ILD+RK A+ ++ D + L + LK+I A + NL K +HF ++SP
Sbjct: 392 EGILDYRKIALFVSSNDALKPSWLLKYLKDIRSAHIKEMQQNLAKYSRHFLYSSP 446
>gi|449462180|ref|XP_004148819.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase F8H-like
[Cucumis sativus]
gi|449524512|ref|XP_004169266.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase F8H-like
[Cucumis sativus]
Length = 458
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 89/365 (24%), Positives = 154/365 (42%), Gaps = 40/365 (10%)
Query: 23 LLTLALVTLLC-FTCLSFNSFRS--YPLQNNFNSTLSFAINKQDYSDLGSDVFHSPSVFH 79
LT A LL F+ SF+ FR Y LQ+N + ++S + D G S ++
Sbjct: 37 FLTRAFDRLLVPFSPNSFSGFRQSHYALQSNHDGSIS-------WPDRGYGSHLSLKIYV 89
Query: 80 LNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLF 139
+ E + K +Y T L G++ ++ + + +SRF T + + A F
Sbjct: 90 YDETE-IQGLKALMYGRDGKITAAAC---LKGQWGTQVKIHRLLLQSRFRTRNKEEADFF 145
Query: 140 FIP--ISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRAT 197
F+P + C +M G E I + Y+ ++ + PY+ + G DH FV G
Sbjct: 146 FVPAYVKCVRMLGGLNDKE----INEAYIQ-VLGQMPYFRLSGGRDHIFVFPSGAGAHLF 200
Query: 198 EGVPFLIKNAIRVVCSPSYD-------VAFIPHKDVALPQVLQPFALPRGGRDVE----- 245
+ I +I + +P D AF KD+ +P + G + V+
Sbjct: 201 KSWATYINRSI--ILTPEGDRTDKKDFSAFNTWKDIIIPGNVDDGMTSPGAKIVQPLPLS 258
Query: 246 NRTVLGFWAGHRNSKIR----VILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFC 301
R L + G K+ + LA+ + E + +G L Y KFC
Sbjct: 259 KRKHLANYLGRDQGKVGRLKLIELAKQFPEKLESPVLKFSGPDKLGKLEYFEHLRNAKFC 318
Query: 302 ICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVY-QLKQIL 360
+ P G + R +S CVPV+LS+ +LPF +++D+ + ++ ++ QL + L
Sbjct: 319 LAPRGESSWTLRFYESFFVECVPVVLSDQVELPFQNVIDYSQISIKWPSSEIGPQLLEYL 378
Query: 361 KNISD 365
++I D
Sbjct: 379 ESIPD 383
>gi|212722910|ref|NP_001131485.1| uncharacterized protein LOC100192822 [Zea mays]
gi|194691662|gb|ACF79915.1| unknown [Zea mays]
Length = 426
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 73/283 (25%), Positives = 123/283 (43%), Gaps = 40/283 (14%)
Query: 109 LTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP--ISCHKMRGKGTSYENMTVIVKDYV 166
L G++ ++ Q + +SRF T + D A+LFF+P + C +M G + E VK
Sbjct: 83 LKGQWGTQVKVHQFLLKSRFRTFNKDHANLFFVPSYVKCVRMTGALSDKEINQTYVK--- 139
Query: 167 DSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYD-------VA 219
++++ PY+ R+ G DH FV G + +I + +P D A
Sbjct: 140 --VLSQMPYFRRSGGRDHIFVFPSGAGAHLFRSWAIFLNRSI--ILTPEGDRTDKRGTSA 195
Query: 220 FIPHKDVALPQ------------VLQPFALPRGGRDVENRTVLGFWAGHRNSKIR----V 263
F KD+ +P +QP L + R L + G K V
Sbjct: 196 FNTWKDIIIPGNVDDSMVKSDAPAVQPIPLTK-------RKYLANFLGRAQGKAGRLQLV 248
Query: 264 ILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCV 323
LA+ + + E +G + Y + KFC+ P G + R +S CV
Sbjct: 249 ELAKQYPDKLESPELKLSGPNKLGRIEYFKHLRNAKFCLAPRGESSWTLRFYESFFVECV 308
Query: 324 PVILSNYYDLPFNDILDWRKFAVIL-NERDVYQLKQILKNISD 365
PVILS+ +LPF +++D+ + ++ + R +L + L++ISD
Sbjct: 309 PVILSDEVELPFQNVIDYSEISIKWPSSRIGPELLEYLESISD 351
>gi|326522384|dbj|BAK07654.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 496
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 69/235 (29%), Positives = 107/235 (45%), Gaps = 23/235 (9%)
Query: 170 INKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVA---FIP---- 222
+ P W R+ G DH H ++ V +K AI ++ P D + P
Sbjct: 216 VTDQPAWQRSEGRDHVIPVHHPWSFKS---VRRFVKKAIWLL--PDMDSTGNWYKPGQVY 270
Query: 223 -HKDVALPQV--LQPFALPRGGRDVENRTVLGFWAGH--RNS--KIRVILARVWENDTEL 275
KDV LP V + R+ L F+ G RN+ KIR L +N ++
Sbjct: 271 LEKDVILPYVPNVDLCDYKCVSETQSKRSTLLFFRGRLKRNAGGKIRSKLVTELQNIEDI 330
Query: 276 DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPF 335
I A G + K+ FC+ P G +SAR+ D+I GC+PVI+S+ +LPF
Sbjct: 331 IIEEGSAG-AKGKVAALTGMRKSLFCLNPAGDTPSSARLFDAIVSGCIPVIISDELELPF 389
Query: 336 NDILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSP 387
ILD+ K A+ ++ D Q L + L+ + + +NL+K +HF ++SP
Sbjct: 390 EGILDYSKIALFVSSTDAVQPGWLVKYLRGVDGKRVREMQSNLLKYSRHFLYSSP 444
>gi|348519397|ref|XP_003447217.1| PREDICTED: exostosin-1c-like [Oreochromis niloticus]
Length = 740
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 81/318 (25%), Positives = 128/318 (40%), Gaps = 56/318 (17%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
F++YIYP + ++ RK+ +I ESR+ T DP A LF + I
Sbjct: 108 GFRVYIYPSEKNDRVSESYRKI----------LASIAESRYYTSDPREACLFVLGIDT-- 155
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHD-VGVRATEGVPFLIKN 206
+ ++ VD I YP WN G +H + TE + F I
Sbjct: 156 -----LDRDQLSGQFVPNVDERIRGYPLWND--GRNHLIFNLYSGTWPNYTEDLGFNIGQ 208
Query: 207 AIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRD--VEN-----RTVLGFWAGHR-- 257
AI S + + F P DV++P + G R V N R L + G R
Sbjct: 209 AILAKASLNTE-HFRPGFDVSIPLFSKDHPQKGGERGWLVRNTVPPRRKYLLMFKGKRYL 267
Query: 258 ---------------NSKIRVIL-----ARVWEN--DTELDISNNRINRAIGPLVYQRRF 295
N K V L + WE D D N R YQ
Sbjct: 268 TGIGSDTRNALHHIHNGKDIVSLTTCRHGKDWEKHKDARCDHDNLEYER----FDYQELL 323
Query: 296 YKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ 355
+ + FC+ P G ++ S R +S+ C+PV+LSN ++LPF+D++ W + + +ER + Q
Sbjct: 324 HNSTFCLVPRGRRLGSFRFLESLQAACIPVLLSNGWELPFSDVIQWNQAVIEGDERLLLQ 383
Query: 356 LKQILKNISDAEFVSLHN 373
+ ++ + + ++L
Sbjct: 384 VPSTVRAVGNERVLALRQ 401
>gi|357112427|ref|XP_003558010.1| PREDICTED: probable glycosyltransferase At3g07620-like
[Brachypodium distachyon]
Length = 464
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 77/288 (26%), Positives = 127/288 (44%), Gaps = 34/288 (11%)
Query: 132 DPDRAHLFFIPISCH-------KMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADH 184
DP RA + F+P K+ + E+ + + + + P W R+ G DH
Sbjct: 145 DPARADVVFVPFFASLSFNRHSKVVPPARTSEDRAL--QRRLIEFLAARPEWRRSGGRDH 202
Query: 185 FFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPH------KDVALP--QVLQPFA 236
+ H G+ + + + V+C D PH KDV P V+ F
Sbjct: 203 VVLAHHPNGMLDAR---YKLWPCVFVLC----DFGRYPHSVANIDKDVIAPYLHVVGNFF 255
Query: 237 LPRGGRDVENRTVLGFWAGHRNSK----IRVILARVWENDTELDISNNRINRAIGPLVYQ 292
G D R L ++ G K IR L + +++ ++ S + G
Sbjct: 256 NDSAGYDA--RPTLLYFQGAIYRKDGGFIRQELYYLLKDEKDVHFSFGSVA-GNGIEQST 312
Query: 293 RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERD 352
+ +KFC+ G +S R+ DSI CVP+I+S+ +LPF D+LD+ KF +I+ D
Sbjct: 313 QGMRASKFCLNIAGDTPSSNRLFDSIVSHCVPIIISDEIELPFEDVLDYSKFCIIVRGVD 372
Query: 353 VYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMV 397
+ L ++K IS E+ S+ N L +V++HF++ P DA M+
Sbjct: 373 AVKKGFLINLIKGISRQEWTSMWNKLKEVERHFEYQYPSQHDDAVQMI 420
>gi|449455387|ref|XP_004145434.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
[Cucumis sativus]
Length = 464
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 70/278 (25%), Positives = 126/278 (45%), Gaps = 32/278 (11%)
Query: 132 DPDRAHLFFIP------ISCHKMRGKGTSYENMTVIV------KDYVDSLINKYPYWNRT 179
DP+ A LF++P + + +R S + +V +D + K YW R+
Sbjct: 141 DPEVADLFYVPFFSSLSLIVNPIRPATGSDQQQRKLVYSDEETQDAFMEWLEKQEYWKRS 200
Query: 180 LGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQ-VLQPFA-- 236
G DH + A + +KN+I +V D + +L + V+ P++
Sbjct: 201 NGRDHVIIAQDP---NALYRLIDRVKNSILLVS----DFGRLRADQASLVKDVIVPYSHR 253
Query: 237 --LPRGGRDVENRTVLGFWAGHR----NSKIRVILARVWENDTELDISNNRINRAIGPLV 290
G VENR L F+ G+R KIR +L + E + ++ I + +R
Sbjct: 254 INTYTGDIGVENRKTLLFFMGNRYRKEGGKIRDMLFNILEQEQDVIIKHGTQSRE-SRRA 312
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
+ +KFC+ P G ++ R+ DS+ CVPVI+S+ +LPF D++D+ K AV +
Sbjct: 313 ATHGMHTSKFCLNPAGDTPSACRLFDSVVSLCVPVIVSDSIELPFEDVIDYSKIAVFFDS 372
Query: 351 RDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWN 385
+ L L+ IS+ + + K++++F++
Sbjct: 373 VSAVKPEFLISKLRRISEERILDYQREMKKIKRYFEYT 410
>gi|3435314|gb|AAC32397.1| putative tumor suppressor homolog [Drosophila melanogaster]
Length = 760
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 81/346 (23%), Positives = 138/346 (39%), Gaps = 51/346 (14%)
Query: 89 FKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHKM 148
F +YIYP N+ P A+ I+ESR+ T DP A LF +
Sbjct: 104 FLVYIYPPEPLNSLGAAP----PTSANYQKILTAIQESRYYTSDPTAACLFVL------- 152
Query: 149 RGKGTSYENMTVIVKDYV---DSLINKYPYWNRTLGADHFFVTCHDVGVR--ATEGVPFL 203
G + + +DYV S + + PYWN G +H + A + F
Sbjct: 153 ---GIDTLDRDSLSEDYVRNVPSRLARLPYWNN--GRNHIIFNLYSGTWPDYAENSLGFD 207
Query: 204 IKNAIRVVCSPSYDVAFIPHK-DVALPQVLQPFALPRGGRDV-------ENRTVLGFWAG 255
AI + S V + H DV++P + F L G N+ L + G
Sbjct: 208 AGEAI--LAKASMGVLQLRHGFDVSIPLFHKQFPLRAGATGTVQSNNFPANKKYLLAFKG 265
Query: 256 HR-----------------NSKIRVILARVWENDTELDISNNRI---NRAIGPLVYQRRF 295
R N + V++ + ++ +NR NR Y+
Sbjct: 266 KRYVHGIGSETRNSLFHLHNGRDMVLVTTCRHGKSWRELQDNRCDEDNREYDRYDYETLL 325
Query: 296 YKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ 355
+ FC+ P G ++ S R +++ GC+PV+LSN + LPF +DW++ A+ +ER + Q
Sbjct: 326 QNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIWADERLLLQ 385
Query: 356 LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYEL 401
+ I+++I +L + + + + + F F ++ L
Sbjct: 386 VPDIVRSIPAERIFALRQQTQVLWERYFGSIEKIVFTTFEIIRERL 431
>gi|326437066|gb|EGD82636.1| hypothetical protein PTSG_03293 [Salpingoeca sp. ATCC 50818]
Length = 366
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 55/177 (31%), Positives = 84/177 (47%), Gaps = 15/177 (8%)
Query: 65 SDLGSDVFHSP----SVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFF 120
SD GSD +P V + + ++Y+YP P + Q P KYA+E F
Sbjct: 78 SDTGSDDTKAPLQTDVVADVLGALFSRHLRVYMYPL--PESL-QLPPTRDYKYAAEATFT 134
Query: 121 QNIRESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLI----NKYPYW 176
+ +R S F T P+ A LFF+ +SC + R E + +++ +YPYW
Sbjct: 135 RMLRASTFSTDSPEEAQLFFVRVSCAEARFTQRDREAGQRAADAHATAVLAHVQQRYPYW 194
Query: 177 NRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVC-SPSYDVA---FIPHKDVALP 229
NRT G DHFFV HD+G + +A ++ + DV ++ HKD++LP
Sbjct: 195 NRTQGRDHFFVCGHDMGAAPRTAAARMFPSARNMIALVNTADVTEPDYVVHKDISLP 251
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 49/106 (46%), Gaps = 7/106 (6%)
Query: 306 GSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISD 365
G + R+ D+I GCVPV ++++YD P +DW AV + E D +K L+ +
Sbjct: 255 GDGCPTPRLMDAIWAGCVPVFIADHYDPPLAKYVDWALLAVFIAEADAAHIKAHLEMDAR 314
Query: 366 AEFVSLHNNLVKVQKHFQWNSPPVR-------FDAFHMVMYELWLR 404
+ + +V+ W P R AF +VM EL LR
Sbjct: 315 TMYAHRSAYIARVRDRLTWWDPAQRQHTMGRSTSAFDLVMLELALR 360
>gi|195334376|ref|XP_002033859.1| GM21554 [Drosophila sechellia]
gi|194125829|gb|EDW47872.1| GM21554 [Drosophila sechellia]
Length = 760
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 81/346 (23%), Positives = 138/346 (39%), Gaps = 51/346 (14%)
Query: 89 FKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHKM 148
F +YIYP N+ P A+ I+ESR+ T DP A LF +
Sbjct: 104 FLVYIYPPEPLNSLGAAP----PTSANYQKILTAIQESRYYTSDPTAACLFVL------- 152
Query: 149 RGKGTSYENMTVIVKDYV---DSLINKYPYWNRTLGADHFFVTCHDVGVR--ATEGVPFL 203
G + + +DYV S + + PYWN G +H + A + F
Sbjct: 153 ---GIDTLDRDSLSEDYVRNVPSRLARLPYWNN--GRNHIIFNLYSGTWPDYAENSLGFD 207
Query: 204 IKNAIRVVCSPSYDVAFIPHK-DVALPQVLQPFALPRGGRDV-------ENRTVLGFWAG 255
AI + S V + H DV++P + F L G N+ L + G
Sbjct: 208 AGEAI--LAKASMGVLQLRHGFDVSIPLFHKQFPLRAGATGTVQSNNFPANKKYLLAFKG 265
Query: 256 HR-----------------NSKIRVILARVWENDTELDISNNRI---NRAIGPLVYQRRF 295
R N + V++ + ++ +NR NR Y+
Sbjct: 266 KRYVHGIGSETRNSLFHLHNGRDMVLVTTCRHGKSWRELQDNRCDEDNREYDRYDYETLL 325
Query: 296 YKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ 355
+ FC+ P G ++ S R +++ GC+PV+LSN + LPF +DW++ A+ +ER + Q
Sbjct: 326 QNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIWADERLLLQ 385
Query: 356 LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYEL 401
+ I+++I +L + + + + + F F ++ L
Sbjct: 386 VPDIVRSIPAERIFALRQQTQVLWERYFGSIEKIVFTTFEIIRERL 431
>gi|195583406|ref|XP_002081513.1| GD11060 [Drosophila simulans]
gi|194193522|gb|EDX07098.1| GD11060 [Drosophila simulans]
Length = 760
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 81/346 (23%), Positives = 138/346 (39%), Gaps = 51/346 (14%)
Query: 89 FKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHKM 148
F +YIYP N+ P A+ I+ESR+ T DP A LF +
Sbjct: 104 FLVYIYPPEPLNSLGAAP----PTSANYQKILTAIQESRYYTSDPTAACLFVL------- 152
Query: 149 RGKGTSYENMTVIVKDYV---DSLINKYPYWNRTLGADHFFVTCHDVGVR--ATEGVPFL 203
G + + +DYV S + + PYWN G +H + A + F
Sbjct: 153 ---GIDTLDRDSLSEDYVRNVPSRLARLPYWNN--GRNHIIFNLYSGTWPDYAENSLGFD 207
Query: 204 IKNAIRVVCSPSYDVAFIPHK-DVALPQVLQPFALPRGGRDV-------ENRTVLGFWAG 255
AI + S V + H DV++P + F L G N+ L + G
Sbjct: 208 AGEAI--LAKASMGVLQLRHGFDVSIPLFHKQFPLRAGATGTVQSNNFPANKKYLLAFKG 265
Query: 256 HR-----------------NSKIRVILARVWENDTELDISNNRI---NRAIGPLVYQRRF 295
R N + V++ + ++ +NR NR Y+
Sbjct: 266 KRYVHGIGSETRNSLFHLHNGRDMVLVTTCRHGKSWRELQDNRCDEDNREYDRYDYETLL 325
Query: 296 YKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ 355
+ FC+ P G ++ S R +++ GC+PV+LSN + LPF +DW++ A+ +ER + Q
Sbjct: 326 QNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIWADERLLLQ 385
Query: 356 LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYEL 401
+ I+++I +L + + + + + F F ++ L
Sbjct: 386 VPDIVRSIPAERIFALRQQTQVLWERYFGSIEKIVFTTFEIIRERL 431
>gi|17137330|ref|NP_477231.1| tout-velu, isoform A [Drosophila melanogaster]
gi|442623684|ref|NP_001260971.1| tout-velu, isoform B [Drosophila melanogaster]
gi|61212925|sp|Q9V730.1|EXT1_DROME RecName: Full=Exostosin-1; AltName: Full=Protein tout-velu
gi|7303172|gb|AAF58236.1| tout-velu, isoform A [Drosophila melanogaster]
gi|60678089|gb|AAX33551.1| LD10920p [Drosophila melanogaster]
gi|85701308|dbj|BAE78509.1| Tout-velu [Drosophila melanogaster]
gi|220960240|gb|ACL92656.1| ttv-PA [synthetic construct]
gi|440214386|gb|AGB93503.1| tout-velu, isoform B [Drosophila melanogaster]
Length = 760
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 81/346 (23%), Positives = 138/346 (39%), Gaps = 51/346 (14%)
Query: 89 FKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHKM 148
F +YIYP N+ P A+ I+ESR+ T DP A LF +
Sbjct: 104 FLVYIYPPEPLNSLGAAP----PTSANYQKILTAIQESRYYTSDPTAACLFVL------- 152
Query: 149 RGKGTSYENMTVIVKDYV---DSLINKYPYWNRTLGADHFFVTCHDVGVR--ATEGVPFL 203
G + + +DYV S + + PYWN G +H + A + F
Sbjct: 153 ---GIDTLDRDSLSEDYVRNVPSRLARLPYWNN--GRNHIIFNLYSGTWPDYAENSLGFD 207
Query: 204 IKNAIRVVCSPSYDVAFIPHK-DVALPQVLQPFALPRGGRDV-------ENRTVLGFWAG 255
AI + S V + H DV++P + F L G N+ L + G
Sbjct: 208 AGEAI--LAKASMGVLQLRHGFDVSIPLFHKQFPLRAGATGTVQSNNFPANKKYLLAFKG 265
Query: 256 HR-----------------NSKIRVILARVWENDTELDISNNRI---NRAIGPLVYQRRF 295
R N + V++ + ++ +NR NR Y+
Sbjct: 266 KRYVHGIGSETRNSLFHLHNGRDMVLVTTCRHGKSWRELQDNRCDEDNREYDRYDYETLL 325
Query: 296 YKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ 355
+ FC+ P G ++ S R +++ GC+PV+LSN + LPF +DW++ A+ +ER + Q
Sbjct: 326 QNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIWADERLLLQ 385
Query: 356 LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYEL 401
+ I+++I +L + + + + + F F ++ L
Sbjct: 386 VPDIVRSIPAERIFALRQQTQVLWERYFGSIEKIVFTTFEIIRERL 431
>gi|212722274|ref|NP_001131151.1| uncharacterized protein LOC100192459 [Zea mays]
gi|194690716|gb|ACF79442.1| unknown [Zea mays]
gi|413955856|gb|AFW88505.1| hypothetical protein ZEAMMB73_716681 [Zea mays]
Length = 391
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 73/243 (30%), Positives = 115/243 (47%), Gaps = 33/243 (13%)
Query: 174 PYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCS-----PSYDVAFIPHKDVAL 228
P W RT G DH + H G+ + + V+C PS VA + KDV
Sbjct: 118 PEWRRTGGRDHVVLAHHPNGMLDAR---YRFWPCVFVLCDFGRYPPS--VANL-DKDVIA 171
Query: 229 P--QVLQPFALPRGGRDVENRTVLGFWAGHRNSK----IRVILARVWENDTEL-----DI 277
P ++ FA G D +R L ++ G K IR L + +++ ++ +
Sbjct: 172 PYRHLVANFANDTAGYD--DRPTLLYFQGAIYRKDGGFIRQELYYLLKDEKDVHFSFGSV 229
Query: 278 SNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFND 337
+ N I +A + +KFC+ G +S R+ DSI CVPV +S+ +LPF D
Sbjct: 230 AGNGIEQAT------QGMRSSKFCLNIAGDTPSSNRLFDSIVSHCVPVTISDEIELPFED 283
Query: 338 ILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAF 394
+LD+ KF+VI+ D + L ++K IS E+ + N L +V+KHF++ P DA
Sbjct: 284 VLDYSKFSVIVRGADAVKKGFLMNLIKGISREEWTRMWNRLKEVEKHFEYQYPSQTDDAV 343
Query: 395 HMV 397
M+
Sbjct: 344 QMI 346
>gi|449487654|ref|XP_004157734.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
[Cucumis sativus]
Length = 464
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 70/278 (25%), Positives = 126/278 (45%), Gaps = 32/278 (11%)
Query: 132 DPDRAHLFFIP------ISCHKMRGKGTSYENMTVIV------KDYVDSLINKYPYWNRT 179
DP+ A LF++P + + +R S + +V +D + K YW R+
Sbjct: 141 DPEEADLFYVPFFSSLSLIVNPIRPATGSDQQQRKLVYSDEETQDAFMEWLEKQEYWKRS 200
Query: 180 LGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQ-VLQPFA-- 236
G DH + A + +KN+I +V D + +L + V+ P++
Sbjct: 201 NGRDHVIIAQDP---NALYRLIDRVKNSILLVS----DFGRLRADQASLVKDVIVPYSHR 253
Query: 237 --LPRGGRDVENRTVLGFWAGHR----NSKIRVILARVWENDTELDISNNRINRAIGPLV 290
G VENR L F+ G+R KIR +L + E + ++ I + +R
Sbjct: 254 INTYTGDIGVENRKTLLFFMGNRYRKEGGKIRDMLFNILELEQDVIIKHGTQSRE-SRRA 312
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
+ +KFC+ P G ++ R+ DS+ CVPVI+S+ +LPF D++D+ K AV +
Sbjct: 313 ATHGMHTSKFCLNPAGDTPSACRLFDSVVSLCVPVIVSDSIELPFEDVIDYSKIAVFFDS 372
Query: 351 RDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWN 385
+ L L+ IS+ + + K++++F++
Sbjct: 373 VSAVKPEFLISKLRRISEERILDYQREMKKIKRYFEYT 410
>gi|224085069|ref|XP_002307477.1| predicted protein [Populus trichocarpa]
gi|222856926|gb|EEE94473.1| predicted protein [Populus trichocarpa]
Length = 460
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 76/309 (24%), Positives = 131/309 (42%), Gaps = 36/309 (11%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASE----GYFFQNIR------ESRFLTHDPDRAH 137
+ KIY+Y + + + + R GK +++ G + ++ +SRF T A+
Sbjct: 86 SLKIYVYEEDEIDGLKELLRGREGKISADACVKGQWGTQVKIHRLLLQSRFRTRKKGEAN 145
Query: 138 LFFIPI--SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVR 195
LFF+P C +M G E I YV +L ++ PY+ R+ G DH FV G
Sbjct: 146 LFFVPAYAKCVRMMGGLNDKE----INHTYVKAL-SQMPYFRRSGGRDHIFVFPSGAGAH 200
Query: 196 ATEGVPFLIKNAIRVVCSPSYD-------VAFIPHKDVALPQVLQPFALPRGGR-----D 243
I +I + SP D +F KD+ +P ++ RG
Sbjct: 201 LFRSWATYINRSI--ILSPEGDRTDKKDTSSFNTWKDIIIPGNVEDGMTKRGAAMAQPLP 258
Query: 244 VENRTVLGFWAGHRNSKIR----VILARVWENDTELDISNNRINRAIGPLVYQRRFYKTK 299
+ R L + G K+ + LA+ + + E G + Y + K
Sbjct: 259 LSKRKYLANYLGRAQGKVGRLKLIELAKQYPDKLECPELKFSGPEKFGRMEYFQHLRNAK 318
Query: 300 FCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVIL-NERDVYQLKQ 358
FC+ P G + R +S CVPVILS+ + PF +++D+ + ++ + R +L +
Sbjct: 319 FCVAPRGESSWTLRFYESFFVECVPVILSDQAEFPFQNVIDYTQISIKWPSTRIGLELLE 378
Query: 359 ILKNISDAE 367
L++I D +
Sbjct: 379 YLESIPDED 387
>gi|326932982|ref|XP_003212589.1| PREDICTED: exostosin-1c-like [Meleagris gallopavo]
Length = 734
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 86/343 (25%), Positives = 139/343 (40%), Gaps = 85/343 (24%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGY--FFQNIRESRFLTHDPDRAHLFFI---- 141
FK++ YP + G+ SE Y +I SR+ T P+ A LF +
Sbjct: 103 GFKVFTYP------------REHGEPMSESYSKILASIESSRYYTPHPEEACLFVLSIDT 150
Query: 142 ----PISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCH-DVGVRA 196
P+S H +R V+ I +P WN G +H +
Sbjct: 151 LDRDPLSAHYIRS---------------VEQRIRSFPLWNS--GRNHLIFNLYAGTWPNY 193
Query: 197 TEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALP-RGGRD------------ 243
TE + F I +A+ S + F P DV++P L P P RGG+
Sbjct: 194 TEELGFDIGHAMLAKAS-FHSKNFRPGFDVSIP--LFPREHPQRGGQSGWLHHNSVPPKK 250
Query: 244 ------VENRTVLGFWAGHRNS-----KIRVILA-------RVWE--NDTELDISNNRIN 283
R + G +G RN+ R I++ + WE DT D N
Sbjct: 251 KYLLVFKGKRYLTGIGSGTRNALHHIHNGRDIVSLTTCRHGKDWERHKDTRCDQDNVDYE 310
Query: 284 RAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRK 343
+ YQ + + FCI P G ++ S R +++ C+PV+LS+ ++LPF++ +DW K
Sbjct: 311 K----FDYQELLHNSTFCIVPRGRRLGSFRFLEALQAACIPVLLSDGWELPFSEAIDWGK 366
Query: 344 FAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
AV+ +ER + Q+ ++ I ++ Q F W++
Sbjct: 367 AAVVGSERLLLQIPSAVRCIRPERVLAFQQ-----QTQFLWDA 404
>gi|324508547|gb|ADY43609.1| Exostosin-1b, partial [Ascaris suum]
Length = 628
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 81/318 (25%), Positives = 147/318 (46%), Gaps = 52/318 (16%)
Query: 76 SVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDR 135
S F+++ +F++Y+YPD + +T ++ YA+ + +RES + T DP +
Sbjct: 69 SCFNMSMCLSRNHFRVYVYPDNNEST-------VSIVYAN---ILKVLRESVYYTDDPSK 118
Query: 136 AHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTC------ 189
A LF + I K EN V + +++L + WN G +H
Sbjct: 119 ACLFVLSIDTIDRDRKS---ENYVKHVDEQIEALHSDL--WNG--GRNHIIFNLYHGTYP 171
Query: 190 ----HDVGVRATEGVPFLIKNAIRV-VCSPSYDVAF-IPHKDVALPQVLQ---PFALP-- 238
HD+G G + + + + P++D++F + HK+ +L V++ P L
Sbjct: 172 DYSDHDLGFDV--GYAMVARASANAQIFRPNFDLSFPLFHKEHSLRTVVESVWPLKLKDE 229
Query: 239 -----RGGR---DVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLV 290
+G R + + T + H N+ V++ N+ + R + +
Sbjct: 230 YLVSFKGKRYVYGIGSETRDSLYHLH-NAHSVVMVTTCKHNNDWKKYEDERCDE--DNIE 286
Query: 291 YQRRFYKTK-----FCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFA 345
Y+R Y+T FC+ P G ++ S R +S+ GC+PVILS+ ++LPF++I+DW + A
Sbjct: 287 YERWDYETTMSNSTFCLTPRGRRLGSFRFLESLRLGCIPVILSDDWELPFSEIIDWSQAA 346
Query: 346 VILNERDVYQLKQILKNI 363
VI +E V + +L I
Sbjct: 347 VIAHEDTVLTISDVLNAI 364
>gi|194701034|gb|ACF84601.1| unknown [Zea mays]
gi|413955857|gb|AFW88506.1| hypothetical protein ZEAMMB73_716681 [Zea mays]
Length = 462
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 73/243 (30%), Positives = 115/243 (47%), Gaps = 33/243 (13%)
Query: 174 PYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCS-----PSYDVAFIPHKDVAL 228
P W RT G DH + H G+ + + V+C PS VA + KDV
Sbjct: 189 PEWRRTGGRDHVVLAHHPNGMLDAR---YRFWPCVFVLCDFGRYPPS--VANL-DKDVIA 242
Query: 229 P--QVLQPFALPRGGRDVENRTVLGFWAGHRNSK----IRVILARVWENDTEL-----DI 277
P ++ FA G D +R L ++ G K IR L + +++ ++ +
Sbjct: 243 PYRHLVANFANDTAGYD--DRPTLLYFQGAIYRKDGGFIRQELYYLLKDEKDVHFSFGSV 300
Query: 278 SNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFND 337
+ N I +A + +KFC+ G +S R+ DSI CVPV +S+ +LPF D
Sbjct: 301 AGNGIEQAT------QGMRSSKFCLNIAGDTPSSNRLFDSIVSHCVPVTISDEIELPFED 354
Query: 338 ILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAF 394
+LD+ KF+VI+ D + L ++K IS E+ + N L +V+KHF++ P DA
Sbjct: 355 VLDYSKFSVIVRGADAVKKGFLMNLIKGISREEWTRMWNRLKEVEKHFEYQYPSQTDDAV 414
Query: 395 HMV 397
M+
Sbjct: 415 QMI 417
>gi|259155096|ref|NP_001158790.1| exostosin-1c [Salmo salar]
gi|223647436|gb|ACN10476.1| Exostosin-1c [Salmo salar]
Length = 759
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 84/319 (26%), Positives = 128/319 (40%), Gaps = 62/319 (19%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGY--FFQNIRESRFLTHDPDRAHLFFIPI-S 144
F++Y+YP G+ SE Y +I ESR+ T DP A LF + I +
Sbjct: 108 GFRVYVYPSEK------------GERGSESYRKILTSIGESRYYTTDPREACLFVLGIDT 155
Query: 145 CHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHD-VGVRATEGVPFL 203
+ + G N+ D I YP WN G +H + TE + F
Sbjct: 156 LDRDQLSGQFVPNL--------DDRIKGYPLWNE--GQNHLIFNLYSGTWPNYTEDLGFN 205
Query: 204 IKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRD--VEN-----RTVLGFWAGH 256
I AI S + + F P DV++P + G R V N R L + G
Sbjct: 206 IGQAILAKASLNTE-HFRPGFDVSIPLFSKDHPQKGGERGWLVRNTVPPRRKYLLMFKGK 264
Query: 257 R-----------------NSKIRVIL-----ARVWEN--DTELDISNNRINRAIGPLVYQ 292
R N K V L + WE D D N R YQ
Sbjct: 265 RYLTGIGSDTRNALHHIHNGKDIVSLTTCRHGKDWEKHKDARCDHDNLEYER----FDYQ 320
Query: 293 RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERD 352
+ + FC+ P G ++ S R +S+ CVPV+LSN ++LPF+D++ W + + +ER
Sbjct: 321 ELLHNSTFCLVPRGRRLGSFRFLESLQAACVPVLLSNGWELPFSDVIQWNQAVIEGDERL 380
Query: 353 VYQLKQILKNISDAEFVSL 371
+ Q+ + + + ++L
Sbjct: 381 LLQVPSTVHAVGNERVLAL 399
>gi|115476598|ref|NP_001061895.1| Os08g0438600 [Oryza sativa Japonica Group]
gi|42408650|dbj|BAD09870.1| Exostosin family-like protein [Oryza sativa Japonica Group]
gi|42408898|dbj|BAD10156.1| Exostosin family-like protein [Oryza sativa Japonica Group]
gi|113623864|dbj|BAF23809.1| Os08g0438600 [Oryza sativa Japonica Group]
gi|215715329|dbj|BAG95080.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215740942|dbj|BAG97437.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 566
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 71/257 (27%), Positives = 116/257 (45%), Gaps = 40/257 (15%)
Query: 176 WNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSY----------DVAFIPHKD 225
W R GADH V H P + +A R + + + DVA + KD
Sbjct: 311 WRRWGGADHLVVPHH----------PNSMMDARRRLSAAMFVLSDFGRYPPDVANL-RKD 359
Query: 226 VALPQVLQPFALPRGGR-DVENRTVLGFWAG--HR--NSKIRVILARVWENDTEL----- 275
V P +L G E R VL ++ G HR ++R L ++ +++ ++
Sbjct: 360 VIAPYKHVVPSLGDGDSPGFEQRPVLAYFQGAIHRKNGGRVRQRLYQLIKDEKDVHFTYG 419
Query: 276 DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPF 335
+ N I RA + +KFC+ G +S R+ D+I CVPVI+S+ +LPF
Sbjct: 420 SVRQNGIRRATKGMA------SSKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDDIELPF 473
Query: 336 NDILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFD 392
D+LD+ F V + D + L +L+ IS E+ ++ L +V HF++ P D
Sbjct: 474 EDVLDYSAFCVFVRASDAVKRGFLLHLLRGISQEEWTAMWRRLKEVAHHFEYQYPSQPGD 533
Query: 393 AFHMVMYELWLRRHVIK 409
A M+ + + H++K
Sbjct: 534 AVQMIWGAVARKMHLVK 550
>gi|218201205|gb|EEC83632.1| hypothetical protein OsI_29363 [Oryza sativa Indica Group]
Length = 566
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 71/257 (27%), Positives = 116/257 (45%), Gaps = 40/257 (15%)
Query: 176 WNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSY----------DVAFIPHKD 225
W R GADH V H P + +A R + + + DVA + KD
Sbjct: 311 WRRWGGADHLVVPHH----------PNSMMDARRRLSAAMFVLSDFGRYPPDVANL-RKD 359
Query: 226 VALPQVLQPFALPRGGR-DVENRTVLGFWAG--HR--NSKIRVILARVWENDTEL----- 275
V P +L G E R VL ++ G HR ++R L ++ +++ ++
Sbjct: 360 VIAPYKHVVPSLGDGDSPGFEQRPVLAYFQGAIHRKNGGRVRQRLYQLIKDEKDVHFTYG 419
Query: 276 DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPF 335
+ N I RA + +KFC+ G +S R+ D+I CVPVI+S+ +LPF
Sbjct: 420 SVRQNGIRRATKGMA------SSKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDDIELPF 473
Query: 336 NDILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFD 392
D+LD+ F V + D + L +L+ IS E+ ++ L +V HF++ P D
Sbjct: 474 EDVLDYSDFCVFVRASDAVKRGFLLHLLRGISQEEWTAMWRRLKEVAHHFEYQYPSQPGD 533
Query: 393 AFHMVMYELWLRRHVIK 409
A M+ + + H++K
Sbjct: 534 AVQMIWGAVARKMHLVK 550
>gi|195381971|ref|XP_002049706.1| GJ20607 [Drosophila virilis]
gi|194144503|gb|EDW60899.1| GJ20607 [Drosophila virilis]
Length = 757
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 80/363 (22%), Positives = 142/363 (39%), Gaps = 67/363 (18%)
Query: 80 LNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLF 139
++ + F +Y+YP N+ P A+ I+ESR+ T DP A LF
Sbjct: 91 FDFTKCYDQFLVYVYPPEPLNSLGAAP----PSSANYQKILTAIQESRYHTSDPAAACLF 146
Query: 140 FIPISCHKMRGKGTSYENMTVIVKDYV---DSLINKYPYWNRTLGADHFFVTC------- 189
+ G + + +DYV S + + P+WN G +H
Sbjct: 147 VL----------GIDTLDRDSLSEDYVRNVPSRLARLPHWNN--GRNHIIFNLYSGTWPD 194
Query: 190 ---HDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHK-DVALPQVLQPFALPRGGRDV- 244
H +G A E + + S V + H DV++P + F L G
Sbjct: 195 YAEHSLGFDAGEAI----------LAKASMSVLQMRHGFDVSIPLFHKQFPLRAGATGSV 244
Query: 245 ------ENRTVLGFWAGHR-----------------NSKIRVILARVWENDTELDISNNR 281
N+ L + G R N + V++ + ++ +NR
Sbjct: 245 QSNNFPANKKYLLAFKGKRYVHGIGSETRNSLFHLHNGRDMVMVTTCRHGKSWRELQDNR 304
Query: 282 I---NRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDI 338
NR Y+ + FC+ P G ++ S R +++ GC+PV+LSN + LPF
Sbjct: 305 CDEDNREYDRYDYETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESK 364
Query: 339 LDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVM 398
+DW++ A+ +ER + Q+ I+++IS +L + + + + + F F ++
Sbjct: 365 IDWKQAAIWADERLLLQVPDIVRSISAERIFALRQQTQVLWERYFGSIEKIVFTTFEIIR 424
Query: 399 YEL 401
L
Sbjct: 425 ERL 427
>gi|3176669|gb|AAC18793.1| End is cut off [Arabidopsis thaliana]
Length = 440
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 82/321 (25%), Positives = 137/321 (42%), Gaps = 65/321 (20%)
Query: 132 DPDRAHLFFIPI------SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHF 185
DPD A +F++P + H E ++ + ++ L N YWNR+ G DH
Sbjct: 122 DPDLADVFYVPFFSSLSFNTHGKNMTDPDTEFDRLLQVELMEFLENS-KYWNRSGGKDHV 180
Query: 186 FVTCHDVGVRATEGVPFL---IKNAIRVVCS---PSYDVAFIPHKDVALPQVLQPFALPR 239
H R FL + +I +V S D+A + KDV P V +L
Sbjct: 181 IPMTHPNAFR------FLRQQVNASILIVVDFGRYSKDMARL-SKDVVSPYVHVVESLNE 233
Query: 240 GGRD-----VENRTVLGFWAGHR----NSKIRVILARVWENDTEL--------------- 275
G D E RT L ++ G+ KIR+ L ++ ++++
Sbjct: 234 EGDDGMGDPFEARTTLLYFRGNTVRKDEGKIRLRLEKLLAGNSDVHFEKSVATTQNIKVS 293
Query: 276 DISNNRINRAIGPLVYQ----------------RRFYKTKFCICPGGSQVNSARIADSIH 319
D+ NR + L YQ +KFC+ P G +S R+ D+I
Sbjct: 294 DLEQNRGRYLM--LTYQNDSNCSTCVLFCYQSTEGMRSSKFCLHPAGDTPSSCRLFDAIV 351
Query: 320 YGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISD---AEFVSLHNNLV 376
C+PVI+S+ +LPF D +D+ +F++ + ++ + IL N+ +++ + L
Sbjct: 352 SHCIPVIISDKIELPFEDEIDYSEFSLFFSIKESLEPGYILNNLRQFPKEKWLEMWKRLK 411
Query: 377 KVQKHFQWNSPPVRFDAFHMV 397
V HF++ PP R DA +M+
Sbjct: 412 NVSHHFEFQYPPKREDAVNML 432
>gi|194883052|ref|XP_001975618.1| GG20465 [Drosophila erecta]
gi|190658805|gb|EDV56018.1| GG20465 [Drosophila erecta]
Length = 760
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 81/346 (23%), Positives = 138/346 (39%), Gaps = 51/346 (14%)
Query: 89 FKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHKM 148
F +YIYP N+ P A+ I+ESR+ T DP A LF +
Sbjct: 104 FLVYIYPPEPLNSLGAAP----PTSANYQKILTAIQESRYYTSDPTAACLFVL------- 152
Query: 149 RGKGTSYENMTVIVKDYV---DSLINKYPYWNRTLGADHFFVTCHDVGVR--ATEGVPFL 203
G + + +DYV S + + PYWN G +H + A + F
Sbjct: 153 ---GIDTLDRDSLSEDYVRNVPSRLARLPYWNN--GRNHIIFNLYSGTWPDYAENSLGFD 207
Query: 204 IKNAIRVVCSPSYDVAFIPHK-DVALPQVLQPFALPRGGRDV-------ENRTVLGFWAG 255
AI + S V + H DV++P + F L G N+ L + G
Sbjct: 208 AGEAI--LAKASMGVLQLRHGFDVSIPLFHKQFPLRAGATGSVQSNNFPANKKYLLAFKG 265
Query: 256 HR-----------------NSKIRVILARVWENDTELDISNNRI---NRAIGPLVYQRRF 295
R N + V++ + ++ +NR NR Y+
Sbjct: 266 KRYVHGIGSETRNSLFHLHNGRDMVLVTTCRHGKSWRELQDNRCDEDNREYDRYDYETLL 325
Query: 296 YKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ 355
+ FC+ P G ++ S R +++ GC+PV+LSN + LPF +DW++ A+ +ER + Q
Sbjct: 326 QNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIWADERLLLQ 385
Query: 356 LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYEL 401
+ I+++I +L + + + + + F F ++ L
Sbjct: 386 VPDIVRSIPAERIFALRQQTQVLWERYFGSIEKIVFTTFEIIRERL 431
>gi|296085534|emb|CBI29266.3| unnamed protein product [Vitis vinifera]
Length = 390
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 88/333 (26%), Positives = 130/333 (39%), Gaps = 73/333 (21%)
Query: 104 QTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTVI 161
+ PR L +A+E + + + S T +P+ A F+ PI +C +
Sbjct: 64 KDPRCLNHMFAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYTTCDLTPNGLPLPFKSPRM 123
Query: 162 VKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIRV--- 210
++ + + + +PYWNRT GADHFFV HD G +A E G+ L++ A V
Sbjct: 124 MRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTF 183
Query: 211 -----VCSPSYDVAFIPHKDVALPQVLQPFALPRGGR------------DVENRTVLGFW 253
VC + P+ A PQ +Q +P+ DV N G++
Sbjct: 184 GQRNHVCLNEGSITIPPY---APPQKMQAHLIPQETPRSIFVYFRGLFYDVNNDPEGGYY 240
Query: 254 AGHRNSKIRVILARVWEN--DTEL-DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVN 310
A R A VWEN D L DIS P Y + FC+CP
Sbjct: 241 A-------RGARAAVWENFKDNPLFDISTEH------PTTYYEDMQRAIFCLCP------ 281
Query: 311 SARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVS 370
+AD I LPF D + W + V + E DV L IL +I +
Sbjct: 282 ---LADDI-------------VLPFADAIPWEEIGVFVAEEDVPNLDTILTSIPPEVILR 325
Query: 371 LHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
L +++ + P DAFH ++ L
Sbjct: 326 KQRLLANPSMKQAMLFPQPAQSGDAFHQILNGL 358
>gi|224136688|ref|XP_002322391.1| predicted protein [Populus trichocarpa]
gi|222869387|gb|EEF06518.1| predicted protein [Populus trichocarpa]
Length = 457
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 69/280 (24%), Positives = 129/280 (46%), Gaps = 37/280 (13%)
Query: 131 HDPDRAHLFFIPI-----------SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRT 179
+DP+ A LF++P+ + G Y + + ++ + + + YW R
Sbjct: 136 NDPEEADLFYVPVFSSLSLIVNPARAGTVPGSDPVYSDEKM--QEELVEWLEEQEYWRRN 193
Query: 180 LGADHF-FVTCHDVGVRA---TEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPF 235
G DH F + R + V L+ + RV + KDV +P
Sbjct: 194 NGRDHVVFAGDPNALYRVLDRVKNVVLLLSDFGRVRSDQGSLI-----KDVIVPYS-HRI 247
Query: 236 ALPRGGRDVENRTVLGFWAGHR----NSKIRVILARVWENDTELDI---SNNRINRAIGP 288
+ G VE R L F+ G+R KIR +L ++ E + ++ I + +R NR
Sbjct: 248 NVYNGDIGVEERKTLLFFMGNRYRKDGGKIRDLLFQMLEKEEDVVIRHGTQSRENRRTA- 306
Query: 289 LVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVIL 348
R + +KFC+ P G ++ R+ DSI CVP+I+S+ +LPF D++D+RK A+ +
Sbjct: 307 ---TRGMHTSKFCLNPAGDTPSACRLFDSIVSLCVPLIVSDSIELPFEDVIDYRKIAIFV 363
Query: 349 NERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWN 385
+ + L ++L+ +S + + + +V+++F ++
Sbjct: 364 DTESSLKPGYLVRMLRAVSTEKILEYQKQMREVKRYFVYS 403
>gi|449452903|ref|XP_004144198.1| PREDICTED: probable glycosyltransferase At3g07620-like [Cucumis
sativus]
Length = 429
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 67/287 (23%), Positives = 121/287 (42%), Gaps = 30/287 (10%)
Query: 132 DPDRAHLFFIPI-SCHKMRGKGTSYENMTVIVKDYVD----SLINKYPYWNRTLGADHFF 186
DP+ A FF+P S G + + V + +++ YW R+ G DH
Sbjct: 122 DPENADAFFVPFFSSLSFNSHGRNMTDPATEVDHQLQIDLMKFLSESKYWQRSKGRDHVI 181
Query: 187 VTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPH------KDVALP--QVLQPFALP 238
H R ++N + D P KDV P V+ F
Sbjct: 182 PMTHPNAFR-------FLRNQVNASIQIVVDFGRYPKTMSNLGKDVVAPYVHVVSSFIDD 234
Query: 239 RGGRDVENRTVLGFWAGHRNSK----IRVILARVWENDTELDISNNR-INRAIGPLVYQR 293
E+R L F+ G K IRV LA++ + ++ + ++I +
Sbjct: 235 NPPDPFESRPTLLFFQGKTFRKDDGIIRVKLAKILDGYDDVHYERSAATEKSIK--TSSQ 292
Query: 294 RFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDV 353
+KFC+ P G +S R+ D+I CVPVI+S+ +LP+ D +D+ +F + + +
Sbjct: 293 GMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPYEDEIDYSQFTLFFSFEEA 352
Query: 354 YQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMV 397
Q + + L+ ++ + L ++ +H+++ PP + DA +M+
Sbjct: 353 LQPGYMVEKLREFPKERWIEMWKQLKEISRHYEFQYPPKKEDAVNML 399
>gi|195486003|ref|XP_002091324.1| GE13596 [Drosophila yakuba]
gi|194177425|gb|EDW91036.1| GE13596 [Drosophila yakuba]
Length = 760
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 81/346 (23%), Positives = 138/346 (39%), Gaps = 51/346 (14%)
Query: 89 FKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHKM 148
F +YIYP N+ P A+ I+ESR+ T DP A LF +
Sbjct: 104 FLVYIYPPEPLNSLGAAP----PTSANYQKILTAIQESRYYTSDPTAACLFVL------- 152
Query: 149 RGKGTSYENMTVIVKDYV---DSLINKYPYWNRTLGADHFFVTCHDVGVR--ATEGVPFL 203
G + + +DYV S + + PYWN G +H + A + F
Sbjct: 153 ---GIDTLDRDSLSEDYVRNVPSRLARLPYWNN--GRNHIIFNLYSGTWPDYAENSLGFD 207
Query: 204 IKNAIRVVCSPSYDVAFIPHK-DVALPQVLQPFALPRGGRDV-------ENRTVLGFWAG 255
AI + S V + H DV++P + F L G N+ L + G
Sbjct: 208 AGEAI--LAKASMGVLQLRHGFDVSIPLFHKQFPLRAGATGSVQSNNFPANKKYLLAFKG 265
Query: 256 HR-----------------NSKIRVILARVWENDTELDISNNRI---NRAIGPLVYQRRF 295
R N + V++ + ++ +NR NR Y+
Sbjct: 266 KRYVHGIGSETRNSLFHLHNGRDMVLVTTCRHGKSWRELQDNRCDEDNREYDRYDYETLL 325
Query: 296 YKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ 355
+ FC+ P G ++ S R +++ GC+PV+LSN + LPF +DW++ A+ +ER + Q
Sbjct: 326 QNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIWADERLLLQ 385
Query: 356 LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYEL 401
+ I+++I +L + + + + + F F ++ L
Sbjct: 386 VPDIVRSIPAERIFALRQQTQVLWERYFGSIEKIVFTTFEIIRERL 431
>gi|195025057|ref|XP_001985992.1| GH20787 [Drosophila grimshawi]
gi|193901992|gb|EDW00859.1| GH20787 [Drosophila grimshawi]
Length = 754
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 91/403 (22%), Positives = 159/403 (39%), Gaps = 73/403 (18%)
Query: 38 SFNSF-RSYPLQNNFNSTLSFAINKQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPD 96
SF SF + PL N S+ + A +K++ + S ++ + F +Y+YP
Sbjct: 57 SFISFEQRKPLANE--SSAAIAASKKNACRMES---------CFDFTKCYDQFLVYVYPP 105
Query: 97 GDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHKMRGKGTSYE 156
N+ P A+ I+ESR+ T DP A LF + I Y
Sbjct: 106 EPLNSLGAAP----PSSANYQKILTAIQESRYHTSDPAAACLFVLGIDTLDRDSLSDDY- 160
Query: 157 NMTVIVKDYVDSLINKYPYWNRTLGADHFFVTC----------HDVGVRATEGVPFLIKN 206
V++ V S + + P+WN G +H H +G A E +
Sbjct: 161 -----VRN-VPSRLARLPHWNN--GRNHIIFNLYSGTWPDYAEHSLGFDAGEAI------ 206
Query: 207 AIRVVCSPSYDVAFIPHK-DVALPQVLQPFALPRGGRDV-------ENRTVLGFWAGHR- 257
+ S V + H DV++P + F L G N+ L + G R
Sbjct: 207 ----LAKASMSVLQMRHGFDVSIPLFHKQFPLRAGATGSVQSNNFPANKKYLLAFKGKRY 262
Query: 258 ----------------NSKIRVILARVWENDTELDISNNRI---NRAIGPLVYQRRFYKT 298
N + V++ + ++ +NR NR Y+ +
Sbjct: 263 VHGIGSETRNSLFHLHNGRDMVMVTTCRHGKSWRELQDNRCDEDNREYDRYDYETLLQNS 322
Query: 299 KFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQ 358
FC+ P G ++ S R +++ GC+PV+LSN + LPF +DW++ A+ +ER + Q+
Sbjct: 323 TFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIWADERLLLQVPD 382
Query: 359 ILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYEL 401
I+++IS +L + + + + + F F ++ L
Sbjct: 383 IVRSISAERIFALRQQTQVLWERYFGSIEKIVFTTFEIIRERL 425
>gi|255570432|ref|XP_002526175.1| catalytic, putative [Ricinus communis]
gi|223534552|gb|EEF36251.1| catalytic, putative [Ricinus communis]
Length = 453
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 85/399 (21%), Positives = 164/399 (41%), Gaps = 61/399 (15%)
Query: 42 FRSYPLQNNFNSTLSFAINKQDYSDLGSDVFHSPSVFHLNYLEMVT--NFKIYIYPDGDP 99
F + + +F++ S+ + Q S H+ ++ HL T + KIY+Y + +
Sbjct: 38 FLTRLVDQSFSTCTSYDVASQSQS-------HAQTLVHLINGGYGTHPSMKIYVYEEKEI 90
Query: 100 NTFYQTPRK----------LTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP--ISCHK 147
+ + R + G++ ++ + ++ SR+ T + A LFF+P + C +
Sbjct: 91 DGLKELLRGRDGKISADTCVKGQWGTQVKIHRLLQNSRYRTRKKEEADLFFVPAYVKCVR 150
Query: 148 MRGKGTSYE-NMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKN 206
M G E N+T + ++++ PY+ R+ G DH FV G I
Sbjct: 151 MLGGLNDKEINLTYV------KVLSQMPYFRRSGGRDHIFVFPSGAGAHLFRSWATYINR 204
Query: 207 AIRVVCSPSYD-------VAFIPHKDVALP------------QVLQPFALPRGGRDVENR 247
+ V+ +P D AF KD+ +P +++P L + R
Sbjct: 205 S--VILTPEGDRTDKKDTSAFNTWKDIIIPGNVDDGMTKIGTTIVKPLPLSK-------R 255
Query: 248 TVLGFWAGHRNSKIR----VILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCIC 303
L + G K+ + LA+ + + E G + Y KFC+
Sbjct: 256 KFLANYLGRAQGKVGRLKLIELAKQYPDKLECPELKFSGPEKFGKMEYFEHLRNAKFCLA 315
Query: 304 PGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVY-QLKQILKN 362
P G + R +S CVPV+LS+ +LPF +++D+ ++ + +L + L++
Sbjct: 316 PRGESSWTLRFYESFFVECVPVLLSDQAELPFQNVIDYTHVSIKWPSTKIGPELLEYLES 375
Query: 363 ISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYEL 401
I D + + N +V+ + + + A +M+EL
Sbjct: 376 IPDEDIERMIANGRQVRCLWVYAPESEQCSAMQGIMWEL 414
>gi|115441893|ref|NP_001045226.1| Os01g0921300 [Oryza sativa Japonica Group]
gi|19386797|dbj|BAB86176.1| OJ1485_B09.5 [Oryza sativa Japonica Group]
gi|57899432|dbj|BAD88370.1| exostosin-like [Oryza sativa Japonica Group]
gi|113534757|dbj|BAF07140.1| Os01g0921300 [Oryza sativa Japonica Group]
gi|125573139|gb|EAZ14654.1| hypothetical protein OsJ_04578 [Oryza sativa Japonica Group]
gi|215741014|dbj|BAG97509.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767487|dbj|BAG99715.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 437
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 72/289 (24%), Positives = 130/289 (44%), Gaps = 26/289 (8%)
Query: 132 DPDRAHLFFIPI------SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHF 185
DPD A FF+P + H E ++ + ++ ++ K YW R+ G DH
Sbjct: 132 DPDAAEAFFVPFFSSLSFNVHGRNMTDPETEADRLLQVELME-ILWKSKYWQRSAGRDHV 190
Query: 186 FVTCHDVGVRATEGVPFLIKNAIRVVCS---PSYDVAFIPHKDVALP--QVLQPFALPRG 240
H R ++ +I +V + ++A + KDV P V+ F
Sbjct: 191 IPMHHPNAFRFLRD---MVNASILIVADFGRYTKELASL-RKDVVAPYVHVVDSFLNDDP 246
Query: 241 GRDVENRTVLGFWAGHR----NSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFY 296
++R L F+ G KIR LA++ + + ++ + G
Sbjct: 247 PDPFDDRPTLLFFRGRTVRKDEGKIRAKLAKILKGKDGVRFEDS-LATGEGIKTSTEGMR 305
Query: 297 KTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDV--- 353
+KFC+ P G +S R+ D+I CVPVI+S+ +LPF D +D+ +F++ + +
Sbjct: 306 SSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSSRIELPFEDEIDYSEFSLFFSVEEALRP 365
Query: 354 -YQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYEL 401
Y L Q L+ I ++V + + L V H+++ +PP + DA +M+ ++
Sbjct: 366 DYLLNQ-LRQIQKTKWVEIWSKLKNVSHHYEFQNPPRKGDAVNMIWRQV 413
>gi|168012366|ref|XP_001758873.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162690010|gb|EDQ76379.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 426
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 60/233 (25%), Positives = 106/233 (45%), Gaps = 18/233 (7%)
Query: 170 INKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALP 229
I + W R+ G DH H +++ +K+AI ++ + +V+L
Sbjct: 144 ITRQAAWQRSGGRDHVLAVHHPWSMKSHR---RFLKSAIWLLSDLDSSGNWYKEGEVSLE 200
Query: 230 Q-VLQPFALPRGGRD-------VENRTVLGFWAGH--RNS--KIRVILARVWENDTELDI 277
+ V+ P+ D +R L F+ G R S K+R LA V ++ E +
Sbjct: 201 KDVIMPYVANVDACDDNCLATSKPSRKTLLFFQGRIVRGSAGKVRSRLAAVLRDEKERIV 260
Query: 278 SNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFND 337
+ A G Q + FC+ P G +SAR+ D+I GC+PV++S+ +LPF
Sbjct: 261 FQEGFSGAEGKATAQHGMRSSVFCLSPAGDTPSSARLFDAIVSGCIPVVVSDELELPFEG 320
Query: 338 ILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSP 387
ILD+R+ A+ + Q L L+N + + ++ L + +HF++ +P
Sbjct: 321 ILDYRQVALFVPAARAAQKGWLVAHLRNKTPQDVAAMQQRLAQYGRHFRYGTP 373
>gi|168032656|ref|XP_001768834.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162679946|gb|EDQ66387.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 368
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 76/281 (27%), Positives = 129/281 (45%), Gaps = 33/281 (11%)
Query: 139 FFIPISCHK-----MRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCH-DV 192
FF +S +K +RG+ N + V + S + P W + GADH + H +
Sbjct: 71 FFASLSYNKYSRPAVRGRKKMDRNQELQVN--LLSFLRSQPAWRASNGADHVLIIHHPNA 128
Query: 193 GVRATEGVPFLIKNAIRVVCS-PSYD--VAFIPHKDVALPQVLQPFALPRGGRDVEN--- 246
V E ++A+ VV YD VA I KDV P +P D+++
Sbjct: 129 MVYKREQ----FRSAMFVVADFGRYDAEVANIA-KDVVAPY---KHIIPNFDDDIDSVSS 180
Query: 247 ---RTVLGFWAG----HRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTK 299
RT L F+ G IR L + +++++ N A G ++K
Sbjct: 181 FNTRTTLLFFQGAIVRKEGGIIRQKLYELLRDESDVVFVNGTTTSA-GIRSATSGMRQSK 239
Query: 300 FCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ---L 356
FC+ G +S R+ D++ CVP+I+S+ +LPF D++++ +F + +N D + L
Sbjct: 240 FCLHMEGDTPSSNRLFDAVASHCVPLIVSDDIELPFEDVINYTEFCLFVNSSDALRKGFL 299
Query: 357 KQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMV 397
+L+N + E+ +H+ + +VQKHF++ P DA M
Sbjct: 300 TNLLRNFGEKEWTRMHDRMREVQKHFEYQLPSEIGDAVQMT 340
>gi|3668093|gb|AAC61825.1| unknown protein [Arabidopsis thaliana]
Length = 460
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 71/268 (26%), Positives = 124/268 (46%), Gaps = 38/268 (14%)
Query: 132 DPDRAHLFFIPI-SCHKM---------RGKGTSYENMTVIVKDYVDSLINKYPYWNRTLG 181
DP A LF++P+ S + G G S E M + ++++ +W R G
Sbjct: 130 DPADADLFYVPVFSSLSLIVNAGRPVEAGSGYSDEKMQEGLVEWLEG----QEWWRRNAG 185
Query: 182 ADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFA----L 237
DH + D A + +KNA+ +V P + + V+ P++ L
Sbjct: 186 RDHV-IPAGDPN--ALYRILDRVKNAVLLVSDFG---RLRPDQGSFVKDVVIPYSHRVNL 239
Query: 238 PRGGRDVENRTVLGFWAGHR----NSKIRVILARVWENDTELDI---SNNRINRAIGPLV 290
G VE+R L F+ G+R K+R +L +V E + ++ I + +R NR
Sbjct: 240 FNGEIGVEDRNTLLFFMGNRYRKDGGKVRDLLFQVLEKEDDVTIKHGTQSRENRRAA--- 296
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
+ + +KFC+ P G ++ R+ DSI CVP+I+S+ +LPF D++D+RKF++ +
Sbjct: 297 -TKGMHTSKFCLNPAGDTPSACRLFDSIVSLCVPLIVSDSIELPFEDVIDYRKFSIFVEA 355
Query: 351 RDVYQ---LKQILKNISDAEFVSLHNNL 375
Q L Q+L+ I + + +
Sbjct: 356 NAALQPGFLVQMLRKIKTKKILEYQREM 383
>gi|168014691|ref|XP_001759885.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689015|gb|EDQ75389.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 383
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 66/240 (27%), Positives = 112/240 (46%), Gaps = 19/240 (7%)
Query: 170 INKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCS---PSYDVAFIPHKDV 226
+ K+P W + G +H V H ++A ++NA+ VV + A I KDV
Sbjct: 130 LEKHPAWKASNGKNHVMVIHHPNSMQAVRD---RLRNALYVVSDFGRYENETANI-RKDV 185
Query: 227 ALP--QVLQPFALPRGGRDVENRTVLGFWAG----HRNSKIRVILARVWENDTELDISNN 280
P VL F R+ + ++ G KIR L + +++ ++ +
Sbjct: 186 VAPYKHVLPTFT--DDSSSFHTRSTVVYFQGSIVRKEGGKIRHELYDLLKDEPDVHFTTG 243
Query: 281 RINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILD 340
I + G R ++FC+ G +S R+ DSI CVPVI+S+ +LPF D L+
Sbjct: 244 -ITASEGFHSATRGMRSSRFCLNLAGDTPSSNRLFDSIASHCVPVIISDDLELPFEDDLN 302
Query: 341 WRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMV 397
+ F + +N Q + +L+N+S E+ + L+ V++HF++ P V DA +MV
Sbjct: 303 YSSFCIFINSTRALQPGYVINLLRNVSSEEWTLMWERLLVVERHFEYQFPSVANDAVNMV 362
>gi|224075401|ref|XP_002304618.1| predicted protein [Populus trichocarpa]
gi|222842050|gb|EEE79597.1| predicted protein [Populus trichocarpa]
Length = 105
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 31/55 (56%), Positives = 46/55 (83%)
Query: 306 GSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQIL 360
G +VN++R++D+IHYGC+PV++SN DLPF D+LDW KF+V++N+RD+ LK L
Sbjct: 23 GYEVNTSRVSDAIHYGCIPVVISNNRDLPFADVLDWSKFSVVINQRDIAFLKTKL 77
>gi|190402228|gb|ACE77644.1| exostosin 1 (predicted) [Sorex araneus]
Length = 746
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 79/329 (24%), Positives = 132/329 (40%), Gaps = 57/329 (17%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
FK+Y+YP Q K+ Y + I SRF T DP +A LF + +
Sbjct: 113 GFKVYVYPQ-------QKGEKIAESYQN---ILAAIEGSRFYTSDPSQACLFVLSLDT-- 160
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHD-VGVRATEGVPFLIKN 206
+ ++ + S + WN G +H + TE V F I
Sbjct: 161 -----LDRDQLSPQYVHNLRSKVQSLHLWNN--GRNHLIFNLYSGTWPDYTEDVGFDIGQ 213
Query: 207 AIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDV-----------------ENRTV 249
A+ S S + F PH DV++P + G R R +
Sbjct: 214 AMLAKASISTE-NFRPHFDVSIPLFSKEHPRTGGERGFLKFNTIPPLRKYMLVFKGKRYL 272
Query: 250 LGFWAGHRNSKIRV-----IL-------ARVWENDTELDISNNRINRAIGPLVYQRRFYK 297
G + RN+ V IL + W+ D +R N Y+ +
Sbjct: 273 TGIGSDTRNALYHVHNGEDILLLTTCKHGKDWQKHK--DSRCDRDNTEYEKYDYREMLHN 330
Query: 298 TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLK 357
FC+ P G ++ S R +++ CVPV+LSN ++LPF+++++W + AVI +ER + Q+
Sbjct: 331 ATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIP 390
Query: 358 QILKNISDAEFVSLHNNLVKVQKHFQWNS 386
+++I + ++L + Q F W +
Sbjct: 391 STIRSIHQDKILAL-----RQQTQFLWEA 414
>gi|198457239|ref|XP_001360599.2| GA10084 [Drosophila pseudoobscura pseudoobscura]
gi|198135910|gb|EAL25174.2| GA10084 [Drosophila pseudoobscura pseudoobscura]
Length = 765
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 80/347 (23%), Positives = 140/347 (40%), Gaps = 51/347 (14%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
F +Y+YP N+ P + Y I+ESR+ T DP A LF +
Sbjct: 107 QFLVYVYPPEPLNSLGAAP-PTSANYQK---ILTAIQESRYHTSDPSAACLFVL------ 156
Query: 148 MRGKGTSYENMTVIVKDYV---DSLINKYPYWNRTLGADHFFVTCHDVGVR--ATEGVPF 202
G + + +DYV S + + P+WN G +H + A + F
Sbjct: 157 ----GMDTLDRDSLSEDYVRNVPSRLARLPHWNN--GRNHIIFNLYSGTWPDYAENSLGF 210
Query: 203 LIKNAIRVVCSPSYDVAFIPHK-DVALPQVLQPFALPRGGRDV-------ENRTVLGFWA 254
+AI + S V + H DV++P + F L G N+ L +
Sbjct: 211 DAGHAI--LAKASMSVLQVRHGFDVSIPLFHKQFPLRAGATGSVQSNNFPANKKYLLAFK 268
Query: 255 GHR-----------------NSKIRVILARVWENDTELDISNNRI---NRAIGPLVYQRR 294
G R N + V++ + ++ +NR NR Y+
Sbjct: 269 GKRYVHGIGSETRNSLFHLHNGRDMVLVTTCRHGKSWRELQDNRCDEDNREYDRYDYETL 328
Query: 295 FYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVY 354
+ FC+ P G ++ S R +++ GC+PV+LSN + LPF +DW++ A+ +ER +
Sbjct: 329 LQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIWADERLLL 388
Query: 355 QLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYEL 401
Q+ I+++IS +L + + + + + F F ++ L
Sbjct: 389 QVPDIVRSISAERIFALRQQTQVLWERYFGSIEKIVFTTFEIIRERL 435
>gi|226508962|ref|NP_001152546.1| exostosin-like [Zea mays]
gi|195657373|gb|ACG48154.1| exostosin-like [Zea mays]
gi|414878975|tpg|DAA56106.1| TPA: exostosin-like protein [Zea mays]
Length = 427
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 75/295 (25%), Positives = 129/295 (43%), Gaps = 27/295 (9%)
Query: 132 DPDRAHLFFIPI------SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHF 185
DPD A FF+P + H E ++ + VD ++ K YW R+ G DH
Sbjct: 122 DPDAADAFFVPFFSSLSFNVHGRNMTDPDTEADRLLQVELVD-ILWKSKYWQRSAGRDHV 180
Query: 186 FVTCHDVGVRATEGVPFLIKNAIRVVCS---PSYDVAFIPHKDVALP--QVLQPFALPRG 240
H R ++ +I +V + ++A + KDV P V+ F
Sbjct: 181 IPMHHPNAFRFLRA---MVNASILIVSDFGRYTKELASL-RKDVVAPYVHVVGSFLDDDP 236
Query: 241 GRDVENRTVLGFWAGHR----NSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFY 296
E R L F+ G KIR L ++ + + ++ I G +
Sbjct: 237 PDPFEARHTLLFFRGRTVRKDEGKIRSKLEKILKGKEGVRFEDS-IATGDGINISTEGMR 295
Query: 297 KTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ- 355
+KFC+ P G +S R+ D+I CVPVI+S+ +LPF D +D+ +F++ + + +
Sbjct: 296 SSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSSRIELPFEDEIDYSEFSLFFSVEEALRP 355
Query: 356 --LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVI 408
L L+ + ++V + L V H+++ PP + DA +M+ ++ RH I
Sbjct: 356 DYLLNELRQVPKRKWVDMWLKLKNVSHHYEFQYPPRKGDAVNMIWRQV---RHKI 407
>gi|50416406|gb|AAH77234.1| XEXT1 protein [Xenopus laevis]
Length = 735
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 80/331 (24%), Positives = 135/331 (40%), Gaps = 61/331 (18%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
FK+YIYP Q KL+ Y + I SRF T DP +A +F + +
Sbjct: 102 GFKVYIYPQ-------QKGDKLSDSYQ---HILAAIETSRFYTSDPSQACVFVLSLDT-- 149
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHD-VGVRATEGVPFLIKN 206
+ ++ + S + WN G +H + TE V F I
Sbjct: 150 -----LDRDQLSPHYVHNLKSKVQNLHLWNN--GRNHLIFNLYSGTWPDYTEDVGFDIGQ 202
Query: 207 AIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVE-------------------NR 247
A+ S S + F P+ DV++P + PR G D R
Sbjct: 203 AMLAKASISTE-NFRPNFDVSIPLFSKDH--PRTGGDKGFLRFNNIPPMRKYMLVFKGKR 259
Query: 248 TVLGFWAGHRNSKIRV------IL------ARVWENDTELDISNNRINRAIGPLVYQRRF 295
+ G + RN+ V +L + W+ D ++ N Y+
Sbjct: 260 YLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHK--DARCDKDNAEYEKYDYREML 317
Query: 296 YKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ 355
+ FC+ P G ++ S R +++ CVPV+LSN ++LPF++++DW + AVI +ER + Q
Sbjct: 318 HNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWNQAAVIGDERLLLQ 377
Query: 356 LKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
+ +++I + ++L + Q F W +
Sbjct: 378 IPSTIRSIHQDKILAL-----RQQTQFLWEA 403
>gi|339247053|ref|XP_003375160.1| exostosin-1 [Trichinella spiralis]
gi|316971539|gb|EFV55297.1| exostosin-1 [Trichinella spiralis]
Length = 1019
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 75/306 (24%), Positives = 124/306 (40%), Gaps = 44/306 (14%)
Query: 89 FKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHKM 148
FK+YIYPD + + K + Y Q +R+S++ T DPD+A LF +
Sbjct: 381 FKVYIYPDI---SNFDEESKTSASYRK---ILQILRQSKYFTDDPDQACLFVLSYDTLSR 434
Query: 149 RGKGTSY-ENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNA 207
Y ENM +K +L N + G +H + + +
Sbjct: 435 DSLSAEYVENMNAKIKSLPTNLWNN----GKVSGMNHLIFNLYSGTWPDYDLTELGFEPG 490
Query: 208 IRVVCSPSYDVA-FIPHKDVALPQVLQPFALPRG--GRDVEN------------------ 246
++ S+ + H D++L L RG DVEN
Sbjct: 491 QAILAKASFSTRHYRSHFDISLALFHDILPL-RGLNATDVENVDLNWPRSNWSYTLVFKG 549
Query: 247 -RTVLGFWAGHRNS-------KIRVILARVWENDTELDISNNRI---NRAIGPLVYQRRF 295
R V G + RN+ K ++L + + R N Y+
Sbjct: 550 KRYVFGIGSETRNALYHLHNAKDIIMLTTCKHGKDWMKNQDERCSIDNDLYDNWNYEELM 609
Query: 296 YKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ 355
+KFC+ P G ++ S R +++ GC+PVILSN + LPF++++DW + V +ER ++Q
Sbjct: 610 ANSKFCLVPRGRRLGSFRFLEALEKGCIPVILSNDWVLPFSEVIDWDQAVVRGDERTLFQ 669
Query: 356 LKQILK 361
L +L+
Sbjct: 670 LPSLLR 675
>gi|198437606|ref|XP_002128024.1| PREDICTED: similar to Exostosin-1
(Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
acetylglucosaminyl-proteoglycan
4-alpha-N-acetylglucosaminyltransferase) (Putative tumor
suppressor protein EXT1) (Multiple exostoses protein 1)
[Ciona intestinalis]
Length = 766
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 82/337 (24%), Positives = 141/337 (41%), Gaps = 74/337 (21%)
Query: 87 TNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNI----RESRFLTHDPDRAHLFFIP 142
T FK+Y+YPD G+ S FQNI R S++ T DP++A LF +P
Sbjct: 125 TGFKVYVYPD-------------VGEKTSTN--FQNILASLRASQYYTSDPEKACLF-VP 168
Query: 143 ISCHKMRGKGTSYENMTV--IVKDYVDSL---INKYPYWNRTLGADHFFVTCHD-VGVRA 196
+Y+ + + DY+ +L I++ YWN G +H +
Sbjct: 169 -----------AYDTLDRDHLSADYIHNLGAKISRLKYWNN--GKNHIIFNLYSGTWPEY 215
Query: 197 TEGVPFLIKNAIRVVCS-------PSYDVAFI------PH--------KDVALP---QVL 232
E V F + AI S +D++F PH K P + L
Sbjct: 216 LEDVGFNLGEAILAKASFGDNYYRHGFDISFPLIGKTHPHMQGTQGFLKANYFPPRRKYL 275
Query: 233 QPFALPRGGRDVENRTVLGFWAGHRNSKIRVIL----ARVWENDTELDISNNRINRAIGP 288
F R + + T + H I ++ + W++ ++ + N
Sbjct: 276 LSFKGKRYTYGIGSSTRNALYHIHNGDDIIILTTCKHGKNWQSFSDQKCETD--NEEYEK 333
Query: 289 LVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVIL 348
YQ + + FC+ P G ++ S R +S+ C+PV+L+N + LPF++++DW K ++
Sbjct: 334 WDYQSLLHNSTFCMVPRGRRLGSFRFLESLQAACIPVVLANGWKLPFDEVIDWSKASLAW 393
Query: 349 NERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWN 385
ER + Q+ IL+ + D + L++ Q F W+
Sbjct: 394 EERLLLQVPGILREVQDNRIM-----LLRQQSQFLWD 425
>gi|357628255|gb|EHJ77645.1| hypothetical protein KGM_04618 [Danaus plexippus]
Length = 655
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 77/319 (24%), Positives = 137/319 (42%), Gaps = 55/319 (17%)
Query: 87 TNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCH 146
++ K+Y+YP P + T RK+ +RESR+ THDP A LF IP +
Sbjct: 12 SDPKVYVYPTDGPVS--ATYRKV----------LSVVRESRYATHDPAEACLF-IP-AVD 57
Query: 147 KMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV---RATEGVPFL 203
+ S E++ + S +++ PYW G +H + G A + F
Sbjct: 58 TLDADPLSPEHIPDVA-----SRLSRLPYWKN--GRNHLLFNLY-AGTWPDYAEGALGFD 109
Query: 204 IKNAIRVVCSPSYDVAFIPHKDVALP---------------QVLQPFALPR-------GG 241
+AI S S + F D++LP PF PR G
Sbjct: 110 PGDAILARASAS-ETIFRDGFDISLPLFHKEHPERGGVPPSATGNPFPAPRKHLLAFKGK 168
Query: 242 R---DVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRI---NRAIGPLVYQRRF 295
R + + T W H + + +++ + D+ + R NR Y++
Sbjct: 169 RYVHGIGSETRNSLWHLHDGNNL-ILVTTCRHGKSWKDLRDERCDEDNREYDKFDYEQLL 227
Query: 296 YKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ 355
+ FC+ G ++ S R +++ GCVPV+LSN + LPF++ +DWR+ + +ER + Q
Sbjct: 228 ANSTFCLVARGRRLGSYRFLEALAAGCVPVLLSNGWRLPFDERIDWRRAVIWADERLLLQ 287
Query: 356 LKQILKNISDAEFVSLHNN 374
+ ++++++ ++L
Sbjct: 288 VPELVRSVPPERILALRQQ 306
>gi|303271929|ref|XP_003055326.1| glycosyltransferase family 47 protein [Micromonas pusilla CCMP1545]
gi|226463300|gb|EEH60578.1| glycosyltransferase family 47 protein [Micromonas pusilla CCMP1545]
Length = 595
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 43/135 (31%), Positives = 75/135 (55%), Gaps = 3/135 (2%)
Query: 260 KIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIH 319
+ RVI A N T+ D+ ++ A P Y + ++KFC+ G++V + R+ +++
Sbjct: 439 RQRVIPALRSLNRTDWDLDSDG---ATTPSGYMKLLARSKFCLHVRGTRVYAPRLVEAML 495
Query: 320 YGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQ 379
+GCVPVI+++ YDLP + LDW F+V + ER+ + + + A++ H L +V
Sbjct: 496 FGCVPVIIADGYDLPLSWFLDWDAFSVRMTEREGVNATRAAEIVDAADWREKHEALRRVV 555
Query: 380 KHFQWNSPPVRFDAF 394
F ++ PPV DA
Sbjct: 556 GFFMYHDPPVFGDAL 570
>gi|148234643|ref|NP_001083782.1| exostosin 1 [Xenopus laevis]
gi|62871603|gb|AAH94398.1| XEXT1 protein [Xenopus laevis]
Length = 738
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 80/331 (24%), Positives = 135/331 (40%), Gaps = 61/331 (18%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
FK+YIYP Q KL+ Y + I SRF T DP +A +F + +
Sbjct: 105 GFKVYIYPQ-------QKGDKLSDSYQ---HILAAIETSRFYTSDPSQACVFVLSLDT-- 152
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHD-VGVRATEGVPFLIKN 206
+ ++ + S + WN G +H + TE V F I
Sbjct: 153 -----LDRDQLSPHYVHNLKSKVQNLHLWNN--GRNHLIFNLYSGTWPDYTEDVGFDIGQ 205
Query: 207 AIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVE-------------------NR 247
A+ S S + F P+ DV++P + PR G D R
Sbjct: 206 AMLAKASISTE-NFRPNFDVSIPLFSKDH--PRTGGDKGFLRFNNIPPMRKYMLVFKGKR 262
Query: 248 TVLGFWAGHRNSKIRV------IL------ARVWENDTELDISNNRINRAIGPLVYQRRF 295
+ G + RN+ V +L + W+ D ++ N Y+
Sbjct: 263 YLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHK--DARCDKDNAEYEKYDYREML 320
Query: 296 YKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ 355
+ FC+ P G ++ S R +++ CVPV+LSN ++LPF++++DW + AVI +ER + Q
Sbjct: 321 HNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWNQAAVIGDERLLLQ 380
Query: 356 LKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
+ +++I + ++L + Q F W +
Sbjct: 381 IPSTIRSIHQDKILAL-----RQQTQFLWEA 406
>gi|449489243|ref|XP_004158257.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
At3g07620-like [Cucumis sativus]
Length = 429
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 67/287 (23%), Positives = 120/287 (41%), Gaps = 30/287 (10%)
Query: 132 DPDRAHLFFIPI-SCHKMRGKGTSYENMTVIVKDYVD----SLINKYPYWNRTLGADHFF 186
DP+ A FF+P S G + + V + +++ YW R+ G DH
Sbjct: 122 DPENADAFFVPFFSSLSFNSHGRNMTDPATEVDHQLQIDLMKFLSESKYWQRSKGRDHVI 181
Query: 187 VTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPH------KDVALP--QVLQPFALP 238
H R ++N + D P KDV P V+ F
Sbjct: 182 PMTHPNAFR-------FLRNQVNASIQIVVDFGRYPKTMSNLGKDVVAPYVHVVSSFIDD 234
Query: 239 RGGRDVENRTVLGFWAGHRNSK----IRVILARVWENDTELDISNNR-INRAIGPLVYQR 293
E+R L F+ G K IRV LA++ + ++ + ++I +
Sbjct: 235 NPPDPFESRPTLLFFQGKTFRKDDGIIRVKLAKILDGYDDVHYERSAATEKSIK--TSSQ 292
Query: 294 RFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDV 353
+KFC+ P G +S R+ D+I CVPVI+S+ +LP+ D +D+ +F + +
Sbjct: 293 GMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPYEDEIDYSQFTLFFXFEEA 352
Query: 354 YQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMV 397
Q + + L+ ++ + L ++ +H+++ PP + DA +M+
Sbjct: 353 LQPGYMVEKLREFPKERWIEMWKQLKEISRHYEFQYPPKKEDAVNML 399
>gi|19909908|dbj|BAB87180.1| XEXT1 [Xenopus laevis]
Length = 735
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 80/331 (24%), Positives = 135/331 (40%), Gaps = 61/331 (18%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
FK+YIYP Q KL+ Y + I SRF T DP +A +F + +
Sbjct: 102 GFKVYIYPQ-------QKGDKLSDSYQ---HILAAIETSRFYTSDPSQACVFVLSLDT-- 149
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHD-VGVRATEGVPFLIKN 206
+ ++ + S + WN G +H + TE V F I
Sbjct: 150 -----LDRDQLSPHYVHNLKSKVQNLHLWNN--GRNHLIFNLYSGTWPDYTEDVGFDIGQ 202
Query: 207 AIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVE-------------------NR 247
A+ S S + F P+ DV++P + PR G D R
Sbjct: 203 AMLAKASISTE-NFRPNFDVSIPLFSKDH--PRTGGDKGFLRFNNIPPMRKYMLVFKGKR 259
Query: 248 TVLGFWAGHRNSKIRV------IL------ARVWENDTELDISNNRINRAIGPLVYQRRF 295
+ G + RN+ V +L + W+ D ++ N Y+
Sbjct: 260 YLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHK--DARCDKDNAEYEKYDYREML 317
Query: 296 YKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ 355
+ FC+ P G ++ S R +++ CVPV+LSN ++LPF++++DW + AVI +ER + Q
Sbjct: 318 HNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWNQAAVIGDERLLLQ 377
Query: 356 LKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
+ +++I + ++L + Q F W +
Sbjct: 378 IPSTIRSIHQDKILAL-----RQQTQFLWEA 403
>gi|168017876|ref|XP_001761473.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687479|gb|EDQ73862.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 66/275 (24%), Positives = 123/275 (44%), Gaps = 21/275 (7%)
Query: 109 LTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDS 168
+ G++ ++ + + +SR+ T D RA+ FF+P+ +R G E V ++
Sbjct: 46 VRGQWGTQVKIHKLLMKSRYRTLDKSRANFFFVPVYVKCVRIFGGLNEKE---VNEHFLK 102
Query: 169 LINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYD-------VAFI 221
++ + PY++R+ G DH FV G +G P + +I +P D +F
Sbjct: 103 ILRQMPYFHRSGGRDHIFVFPSGAGAHLVKGWPNFLNRSI--FLTPEGDRTDKKAFSSFN 160
Query: 222 PHKDVALPQVLQPFALPRGGRDV-----ENRTVLGFWAGHRNSKIRVILARVWEN-DTEL 275
KD+ +P + P + + V + + K R+ L + + EL
Sbjct: 161 TWKDIIIPGNVDIINHPSNSATSPLPLSKRKYVANYLGRAQGKKGRLQLIELAKQFPAEL 220
Query: 276 DISNNRINRA--IGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDL 333
D + +G + Y R KFC+ P G + R ++ CVPVILS+ +L
Sbjct: 221 DAPELAFQGSAKLGRIEYYNRLRNAKFCLAPRGESSWTLRFYEAFFVECVPVILSDEIEL 280
Query: 334 PFNDILDWRKFAVIL-NERDVYQLKQILKNISDAE 367
P+ ++LD+ F++ + R +L + L++I + E
Sbjct: 281 PYQNVLDYSGFSIKWPSSRTNEELLRYLRSIPEFE 315
>gi|321468156|gb|EFX79142.1| hypothetical protein DAPPUDRAFT_304945 [Daphnia pulex]
Length = 729
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 85/331 (25%), Positives = 139/331 (41%), Gaps = 65/331 (19%)
Query: 81 NYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFF 140
++ + V+ FK+YI+P + T RK+ I ESR+ T D +A LF
Sbjct: 84 DFSKCVSGFKVYIHPVEETVAMSSTYRKI----------LNVITESRYHTTDASQACLFI 133
Query: 141 IPISCHKMRGKGTSYENMTVIVKDYV---DSLINKYPYWNRTLGADHFFVTCHD-VGVRA 196
+ G + + DYV S +N P+WN G +H +
Sbjct: 134 L----------GLDTLDRDSLSSDYVRGMQSKLNSLPHWNG--GQNHIIFNFYSGTWPDY 181
Query: 197 TEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDV----EN------ 246
TE + I AI S S + P D++LP V + L RGG + EN
Sbjct: 182 TEDLGMDIGRAILAKASISVQ-NYRPSFDISLPLVHKEH-LERGGDILPIYAENIPAASK 239
Query: 247 ---------RTVLGFWAGHRNSKIR------VIL------ARVWEN--DTELDISNNRIN 283
R V G + RNS VI+ + W+ D + N +
Sbjct: 240 SYLLAFKGKRYVYGIGSETRNSLYHLHNSRDVIMVTTCKHGKSWKELKDERCEEDNAEYD 299
Query: 284 RAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRK 343
R Y+ + + FC+ P G ++ S R + + GC+PV+LSN + +PF++I+DW+
Sbjct: 300 R----YDYEILLHNSTFCLVPRGRRLGSFRFIEVLQAGCIPVLLSNNWVIPFSEIIDWKT 355
Query: 344 FAVILNERDVYQLKQILKNISDAEFVSLHNN 374
A+ +ER + Q+ I+++I ++L
Sbjct: 356 SAIWADERLLLQVPDIVRSIEAERVMALRQQ 386
>gi|308799499|ref|XP_003074530.1| Acetylglucosaminyltransferase EXT1/exostosin 1 (ISS) [Ostreococcus
tauri]
gi|116000701|emb|CAL50381.1| Acetylglucosaminyltransferase EXT1/exostosin 1 (ISS), partial
[Ostreococcus tauri]
Length = 439
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 40/99 (40%), Positives = 62/99 (62%), Gaps = 3/99 (3%)
Query: 288 PLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVI 347
P Y K+++C+ G + ++AR+ D I +GCVPVI+++ YDLPF+ + DW KF+V
Sbjct: 338 PRDYMMLLSKSRYCLYVYGDRAHTARLYDIITFGCVPVIVADGYDLPFSWLFDWSKFSVR 397
Query: 348 LNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
+ E DV +L IL A++ SL LVKV FQ+++
Sbjct: 398 VPEDDVAKLPGILDQ---ADYDSLRGELVKVHSFFQYHA 433
>gi|224097754|ref|XP_002311068.1| predicted protein [Populus trichocarpa]
gi|222850888|gb|EEE88435.1| predicted protein [Populus trichocarpa]
Length = 380
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 77/302 (25%), Positives = 125/302 (41%), Gaps = 52/302 (17%)
Query: 132 DPDRAHLFFIPI------SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHF 185
DP+ A +F+P + H E + D +D + K YW R+ G DH
Sbjct: 76 DPEIAEAYFVPFFSSLSFNTHGRNMTDPETEKDRQLQVDLID-FLQKSKYWQRSGGRDHV 134
Query: 186 FVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPH------KDVALPQV-------- 231
H R L+ +I +V D P KDV P V
Sbjct: 135 IPMTHPNAFRFLRQ---LVNASILIVA----DFGRYPKSLSTLSKDVVSPYVHNVDSFKD 187
Query: 232 ---LQPFALPRGGRDVENRTVLGFWAGHR----NSKIRVILARVWE--NDTELDISNNRI 282
L PF E+R L F+ G+ K+R L ++ +D + S+
Sbjct: 188 DDLLDPF---------ESRKTLLFFRGNTVRKDKGKVRAKLEKILAGYDDVRYERSSP-T 237
Query: 283 NRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWR 342
AI R +KFC+ P G +S R+ D+I CVPVI+S+ +LP+ D +D+
Sbjct: 238 AEAIQASTQGMR--SSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDLIELPYEDEIDYS 295
Query: 343 KFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMY 399
+F++ + + Q L L+ ++ + L K+ HF++ PPV+ DA +M+
Sbjct: 296 QFSIFFSINEAIQPDYLVNQLRKFPKDRWIEMWRQLKKISHHFEFQYPPVKEDAVNMLWR 355
Query: 400 EL 401
++
Sbjct: 356 QV 357
>gi|242059031|ref|XP_002458661.1| hypothetical protein SORBIDRAFT_03g037670 [Sorghum bicolor]
gi|241930636|gb|EES03781.1| hypothetical protein SORBIDRAFT_03g037670 [Sorghum bicolor]
Length = 499
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 70/236 (29%), Positives = 108/236 (45%), Gaps = 25/236 (10%)
Query: 170 INKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVA---FIP---- 222
+ P W R+ G DH H ++ V +K AI ++ P D + P
Sbjct: 219 VTDQPAWQRSEGRDHVIPVHHPWSFKS---VRRFVKKAIWLL--PDMDSTGNWYKPGQVY 273
Query: 223 -HKDVALPQVLQPFALPRGGRDVE---NRTVLGFWAGH--RNS--KIRVILARVWENDTE 274
KDV LP V L +E R++L F+ G RN+ KIR L ++ +
Sbjct: 274 LEKDVILPYVPN-VDLCDHKCVLETQFKRSILLFFRGRLKRNAGGKIRSKLVEELKSAED 332
Query: 275 LDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLP 334
+ I K+ FC+ P G +SAR+ D+I GC+PVI+S+ +LP
Sbjct: 333 IVIEEGSAGAQGKAAAQDG-MRKSLFCLSPAGDTPSSARLFDAIVSGCIPVIISDELELP 391
Query: 335 FNDILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSP 387
F ILD+R+ A+ ++ D Q L + L+ I + +NLVK +HF ++SP
Sbjct: 392 FEGILDYREIALFVSSSDAVQPGWLVKYLRGIDAKRIREIQSNLVKYSRHFLYSSP 447
>gi|357463031|ref|XP_003601797.1| hypothetical protein MTR_3g085480 [Medicago truncatula]
gi|355490845|gb|AES72048.1| hypothetical protein MTR_3g085480 [Medicago truncatula]
Length = 425
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 73/294 (24%), Positives = 130/294 (44%), Gaps = 36/294 (12%)
Query: 132 DPDRAHLFFIPI--------SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGAD 183
DP+ ++F+P H MR T ++ I + L+ + YW R+ G D
Sbjct: 119 DPEVVDVYFVPFFSSLSFNTHGHHMRDPETEIDHQLQID---LMGLLGQSKYWQRSGGRD 175
Query: 184 HFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIP------HKDVALPQV--LQPF 235
H F H R + +I+VV D P +KDV P V + +
Sbjct: 176 HIFPMTHPNAFRFLRD---QLNESIQVVV----DFGRYPKGVSNLNKDVVSPYVHFVDSY 228
Query: 236 ALPRGGRDVENRTVLGFWAG--HRNSKIRVILARVWENDTELDISNNRINRAIGPLVY-- 291
E+RT L F+ G HR K ++ A+ + D + + A G +
Sbjct: 229 VDDEPHDPFESRTTLLFFRGGTHRKDK-GIVRAKFTKILAGFDDVHYERSSATGENIKLS 287
Query: 292 QRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNER 351
+ +KFC+ P G +S R+ D+I CVPVI+S+ +LPF + +D+ +F++ + +
Sbjct: 288 SKGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDKIELPFENEIDYSQFSLFFSFK 347
Query: 352 DV----YQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYEL 401
+ Y + Q L++ + + L + H++++ PP R DA +M+ ++
Sbjct: 348 EALEPGYMINQ-LRSFPKQNWTEMWRQLKNISHHYEFHYPPEREDAVNMLWRQI 400
>gi|384251887|gb|EIE25364.1| exostosin-like glycosyltransferase [Coccomyxa subellipsoidea C-169]
Length = 705
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 87/343 (25%), Positives = 141/343 (41%), Gaps = 75/343 (21%)
Query: 113 YASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SC--HKMRGKGTS-------------Y 155
Y E + + +S T DP+ A F++P+ SC H + G +
Sbjct: 317 YQIEPALHEMLLQSPHRTLDPEAADFFYVPVYTSCFIHPVYGWADTPWFHNPGSPRVMHA 376
Query: 156 ENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVG-------VRAT----------- 197
M + K ++++ + PYWNRT G DH ++ HD G +R++
Sbjct: 377 ATMMLEAKRWLET---ELPYWNRTGGRDHIWLISHDEGSCWAPSEIRSSIILSHWGRKAL 433
Query: 198 -----EGVPF--LIKNAIR-----------VVCSPSYDVAFIPHKDVALPQVLQPFAL-- 237
PF NA+ + P YD P KD+ +P + P +
Sbjct: 434 DHESYSAYPFDNYSDNAVHPEWRPHGWRHIIEGHPCYD----PDKDLIIPAFVPPARIVP 489
Query: 238 -PRGGRDVENRTVLGFWAG-----HRNSKIRVILARVWENDTELDISNN-RI---NRAIG 287
P G + R +L F+ G R R I R++ E RI +
Sbjct: 490 SPLTGAREDPRPLLLFFRGDVGLNRRPHYSRGIRQRIYALSKEQRWREKYRIWIGTKEDT 549
Query: 288 PLVYQRRFYKTKFC-ICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAV 346
P Y +KFC + PG S R D++ +GCVPV++++ D F +LDW +FAV
Sbjct: 550 PGGYSELLSSSKFCLVVPGDGW--SPRAEDAMLHGCVPVVVNDGVDQVFETLLDWEEFAV 607
Query: 347 ILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPV 389
+ ER++ L +IL +IS + L + +V F + + P+
Sbjct: 608 RIPEREMEFLPEILLSISPSRLQQLQKGVRRVWHRFMYRALPL 650
>gi|242041079|ref|XP_002467934.1| hypothetical protein SORBIDRAFT_01g036720 [Sorghum bicolor]
gi|241921788|gb|EER94932.1| hypothetical protein SORBIDRAFT_01g036720 [Sorghum bicolor]
Length = 462
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 72/239 (30%), Positives = 111/239 (46%), Gaps = 25/239 (10%)
Query: 174 PYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCS-----PSYDVAFIPHKDVAL 228
P W RT G DH + H G+ + + V+C PS VA + KDV
Sbjct: 189 PEWRRTGGRDHVVLAHHPNGMLDAR---YRFWPCVFVLCDFGRYPPS--VANL-DKDVIA 242
Query: 229 P--QVLQPFALPRGGRDVENRTVLGFWAG----HRNSKIRVILARVWENDTELDISNNRI 282
P ++ FA G D +R L ++ G IR L + +++ ++ S +
Sbjct: 243 PYRHLVANFANDTAGYD--DRPTLLYFQGAIYRKDGGSIRQELYYLLKDEKDVHFSFGSV 300
Query: 283 -NRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDW 341
I + R +KFC+ G +S R+ DSI CVPVI+S+ +LPF D+LD+
Sbjct: 301 AGNGIEQSTHGMR--SSKFCLNIAGDTPSSNRLFDSIVSHCVPVIISDEIELPFEDVLDY 358
Query: 342 RKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMV 397
KF+VI+ D + L ++ IS E+ + N L +V+KHF + P DA M+
Sbjct: 359 SKFSVIVRGADAVKKGFLMSLITGISQEEWAHMWNKLKEVEKHFVYQYPSQTDDAVQMI 417
>gi|412994079|emb|CCO14590.1| predicted protein [Bathycoccus prasinos]
Length = 553
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 80/352 (22%), Positives = 151/352 (42%), Gaps = 61/352 (17%)
Query: 99 PNTF-YQTPRKLTG------------KYASEGYFFQNIRESR--FLTHDPDRAHLFFIPI 143
P F Y P++LT +Y +E +F +N R+ + T +P+ A LFF+P
Sbjct: 198 PKVFVYDVPKQLTSELAKRYGRCERDQYGTEIWFHRNFRDDKNGVRTMNPEEADLFFVPQ 257
Query: 144 --SCH-------KMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV 194
C + +G + E + + + K PY+NRT G DH FV G
Sbjct: 258 YGECFLWSREMLRHENQGQAMEETNEYFLEVLSHVKGKLPYFNRTDGRDHIFVFA---GA 314
Query: 195 RATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQ--VLQPFALPRGGRDVE------- 245
R P + ++ + + + + P D LPQ + +P D
Sbjct: 315 RG----PTIFRDWQKEI---PHSIYLTPEGDRTLPQFDTWKDIVIPGLEYDKRMYLEEHR 367
Query: 246 --------NRTVLGFW-------AGHRNSK-IRVILARVWENDTELDISNNRINRAIGPL 289
R +L + AG SK +R L ++++N T++ I + +I +
Sbjct: 368 NELVTNPPKRKILAMFRGTIDHPAGFAYSKGLRPKLKKIFQNATDV-IYDTKI-KDCDRD 425
Query: 290 VYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
Y R ++ FC+ P G + R ++ C+P+I+++ + PF +++ +FA+ +
Sbjct: 426 CYVREMTESVFCLNPLGWTPWTLRFYQAVMTRCIPIIIADNIEFPFESEINYSEFALKIP 485
Query: 350 ERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYEL 401
E+DV + + ++++ + E + K+ K F + P DA++ + EL
Sbjct: 486 EKDVSDILETMRHMPEEERERRRRYMDKIWKQFTYQRPAEIGDAYYSTVKEL 537
>gi|363731065|ref|XP_003640902.1| PREDICTED: exostosin-1-like [Gallus gallus]
Length = 740
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 82/334 (24%), Positives = 136/334 (40%), Gaps = 67/334 (20%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
FK+Y+YP Q K+ Y + I SRF T DP +A LF + +
Sbjct: 107 GFKVYVYPQ-------QKGEKIAESYQN---VLAAIEGSRFYTSDPSQACLFVLSLDTL- 155
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYP---YWNRTLGADHFFVTCHD-VGVRATEGVPFL 203
+ + YV +L +K WN G +H + TE V F
Sbjct: 156 ---------DRDQLSPQYVHNLRSKVQSLHLWNN--GRNHLIFNLYSGTWPDYTEDVGFD 204
Query: 204 IKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVE------------------ 245
I A+ S S + F P+ DV++P + PR G +
Sbjct: 205 IGQAMLAKASISTE-NFRPNFDVSIPLFSKDH--PRTGGEKGFLRFNTIPPLRKYMLVFK 261
Query: 246 -NRTVLGFWAGHRNSKIRV------IL------ARVWENDTELDISNNRINRAIGPLVYQ 292
R + G + RN+ V +L + W+ D +R N Y+
Sbjct: 262 GKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHK--DSRCDRDNAEYEKYDYR 319
Query: 293 RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERD 352
+ FC+ P G ++ S R +++ CVPV+LSN ++LPF++++DW++ AVI +ER
Sbjct: 320 EMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWKQAAVIGDERL 379
Query: 353 VYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
+ Q+ +++I + ++L Q F W +
Sbjct: 380 LLQIPSTIRSIHQDKILALRQ-----QTQFLWEA 408
>gi|302837496|ref|XP_002950307.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300264312|gb|EFJ48508.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 368
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 75/291 (25%), Positives = 126/291 (43%), Gaps = 43/291 (14%)
Query: 130 THDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTC 189
T +P A+LF++P + T+ N + ++ + K+P++NRT G DHF
Sbjct: 57 TQNPYEANLFYVPALTYFY---ATNVRNGQWQAEAVIEYIRTKWPFYNRTGGRDHFVFFT 113
Query: 190 HDVGVRATEGVPFLIKNA-IRVV--------------------CSPSYDVAFIPHKDVAL 228
D RA+ I+++ I+VV C + +P + V L
Sbjct: 114 GD---RASCHFQRWIQDSVIKVVHFGMQHRNLTWNEISNRDYACIQNKRDLVVPPRTVNL 170
Query: 229 PQVLQPFALP-------RGGRDVENRTVLGFWAG-----HRNSKIRVILARVWENDTEL- 275
+L F+ P G D NRT+L F+AG + +R+ + ++ + T L
Sbjct: 171 GPLLPSFSTPYYKWLVSNQGYD-GNRTLLFFFAGGVADGEYSGGVRLAIKQMLSSITHLP 229
Query: 276 -DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLP 334
D+ G Y +KFCI P G + R+ ++H GCVPVI+ +Y
Sbjct: 230 ADVKFVEGRVGGGEDEYFAMIRASKFCIAPYGHGWGN-RLVQAVHLGCVPVIIQDYVYQA 288
Query: 335 FNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWN 385
F D L + F+V + DV + +L++ S+A+ L L + + F W
Sbjct: 289 FEDFLPYEDFSVRMRLADVPHMIDLLRSYSEADLARLRLGLARYYRAFIWE 339
>gi|297841429|ref|XP_002888596.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297334437|gb|EFH64855.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 429
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 75/291 (25%), Positives = 128/291 (43%), Gaps = 35/291 (12%)
Query: 132 DPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSL-------INKYPYWNRTLGADH 184
DPD A F++P T +NMT ++ L + YWNR+ G DH
Sbjct: 121 DPDLADAFYVPF--FSSLSFNTHGKNMTDPDTEFDRQLQVELMEFLEGSEYWNRSGGKDH 178
Query: 185 FFVTCHDVGVRATEGVPFL---IKNAIRVVCS---PSYDVAFIPHKDVALPQVLQPFALP 238
H R FL + +I +V + D+A + KDV P V +L
Sbjct: 179 VIPMTHPNAFR------FLRQQVNASILIVVDFGRYAKDMARL-SKDVVSPYVHVVESLN 231
Query: 239 RGGRD-----VENRTVLGFWAGHR----NSKIRVILARVWENDTELDISNNRINRAIGPL 289
D E RT L ++ G+ KIR+ L ++ ++++ + +
Sbjct: 232 EEDDDGLTDPFEARTTLLYFRGNTVRKDEGKIRLRLEKLLAGNSDVHFEKS-VATTQNIK 290
Query: 290 VYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
V +KFC+ P G +S R+ D+I C+PVI+S+ +LPF D +D+ +F++ +
Sbjct: 291 VSTEGMRSSKFCLHPAGDTPSSCRLFDAIVSHCIPVIISDKIELPFEDEIDYSEFSLFFS 350
Query: 350 ERDVYQLKQI---LKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMV 397
++ + I L+ +++ + L V HF++ PP R DA +M+
Sbjct: 351 IKESLEPGYILNKLRQFPKEKWLEMWKRLKNVSHHFEFQYPPKREDAVNML 401
>gi|326432404|gb|EGD77974.1| hypothetical protein PTSG_09607 [Salpingoeca sp. ATCC 50818]
Length = 377
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 81/301 (26%), Positives = 126/301 (41%), Gaps = 41/301 (13%)
Query: 112 KYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLIN 171
K+ASE Q IR S T D + A L I G E M I L+
Sbjct: 51 KFASELLLPQAIRNSALHTDDAEAADLILIQYCVMIW---GMQGEKMMDI-----RPLLE 102
Query: 172 KYPYWNR-----------TLGADHFFVTCHD-----VGVRATEG-VPFLIKNAIRVVCSP 214
K P R L +DH CH+ G R + + ++NA ++
Sbjct: 103 KRPVLARRFQQNPARFLVVLSSDHG--PCHNFNEHLTGRRLVKRWIDRSLENATLLMNDG 160
Query: 215 SY-DVAFIPHKDVALPQVL----QPFALPRGGRDVENRTVLGFWAGHRNSKIRVILARVW 269
S + + P KDV +P FA R + +R F+AG +S IR +
Sbjct: 161 SLVNKCYRPGKDVVIPPSTWIGNATFAC---SRPITDRKHFAFFAGAASSLIREYIINEL 217
Query: 270 ENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSN 329
N+ L I ++ + Y FC+ P G S R+ +++ GC+PVI+++
Sbjct: 218 GNEDWLFIPHDLQHEE-----YMCEMGNAVFCLAPRGRAAWSPRLVEALEAGCIPVIIAD 272
Query: 330 YYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPV 389
PF+D+LD+ F V ++E + L + L +IS + LH N + + HF++ PP
Sbjct: 273 MNHEPFHDVLDYSTFTVQVHEDKLETLGEQLHSISSGQVARLHANGQRARAHFRY-PPPA 331
Query: 390 R 390
R
Sbjct: 332 R 332
>gi|222619798|gb|EEE55930.1| hypothetical protein OsJ_04615 [Oryza sativa Japonica Group]
Length = 401
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 85/337 (25%), Positives = 135/337 (40%), Gaps = 72/337 (21%)
Query: 100 NTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP--ISCHKMRGKGTSYEN 157
N + R L +A+E + Q + S T DP+ A F+ P +C
Sbjct: 70 NLLAKDSRCLQHMFAAEIFMHQFLLSSPVRTLDPEEADWFYTPAYTTCDLTPQGFPLPFR 129
Query: 158 MTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV-------RATE-GVPFLIKNAIR 209
I++ V + +PYWNRT GADHFF+ HD G RA E G+ +++ A
Sbjct: 130 APRIMRSAVRYVAATWPYWNRTDGADHFFLAPHDFGACFHYQEERAIERGILPVLRRATL 189
Query: 210 V--------VCSPSYDVAFIPHKDVALPQVLQPF----ALPRG--------GRDVENRTV 249
V C + P+ D P+ ++ A PR D+ N
Sbjct: 190 VQTFGQRHHPCLQPGSITVPPYAD---PRKMEAHRISPATPRSIFVYFRGLFYDMGNDPE 246
Query: 250 LGFWAGHRNSKIRVILARVWEN--DTEL-DISNNRINRAIGPLVYQRRFYKTKFCICPGG 306
G++A R A VWEN D L DIS
Sbjct: 247 GGYYA-------RGARASVWENFKDNPLFDISTE-------------------------- 273
Query: 307 SQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDA 366
+ R+ +++ +GC+PVI+++ LPF D + W + +V + E DV +L IL ++
Sbjct: 274 -HPATPRLVEAVVFGCIPVIIADDIVLPFADAIPWGEISVFVAEEDVPRLDTILASVPLD 332
Query: 367 EFVSLHNNLVK--VQKHFQWNSPPVRFDAFHMVMYEL 401
E + L +++ ++ P DAFH ++ L
Sbjct: 333 EVIRKQRLLASPAMKQAVLFHQPARPGDAFHQILNGL 369
>gi|112382285|ref|NP_004446.2| exostosin-like 1 [Homo sapiens]
gi|93141259|sp|Q92935.2|EXTL1_HUMAN RecName: Full=Exostosin-like 1; AltName: Full=Exostosin-L; AltName:
Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan
4-alpha-N-acetylglucosaminyltransferase; AltName:
Full=Multiple exostosis-like protein
gi|41388846|gb|AAH65528.1| Exostoses (multiple)-like 1 [Homo sapiens]
gi|119628256|gb|EAX07851.1| exostoses (multiple)-like 1 [Homo sapiens]
Length = 676
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 60/221 (27%), Positives = 98/221 (44%), Gaps = 30/221 (13%)
Query: 176 WNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPF 235
WNR G +H + H T F + A+ SP+ D +F P DVALP + +
Sbjct: 153 WNR--GRNHLVLRLHPAPCPRT----FQLGQAMVAEASPTVD-SFRPGFDVALPFLPEAH 205
Query: 236 ALPRGGRDVENRTVLGFWAGHRNSKIRVILA--------RVWENDTELDISNNRINRAIG 287
L RGG + R H +LA R + + + R + G
Sbjct: 206 PL-RGGAPGQLRQ-------HSPQPGVALLALEEERGGWRTADTGSSACPWDGRCEQDPG 257
Query: 288 PLVYQRR--FYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFA 345
P QR+ FC+ G ++R ++ GC+PV+LS ++LPF++++DW K A
Sbjct: 258 PGQTQRQETLPNATFCLISGHRPEAASRFLQALQAGCIPVLLSPRWELPFSEVIDWTKAA 317
Query: 346 VILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
++ +ER Q+ L+ +S A ++L Q F W++
Sbjct: 318 IVADERLPLQVLAALQEMSPARVLALRQ-----QTQFLWDA 353
>gi|68052298|sp|Q5IGR7.1|EXT1B_DANRE RecName: Full=Exostosin-1b; AltName:
Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
acetylglucosaminyl-proteoglycan
4-alpha-N-acetylglucosaminyltransferase 1b; AltName:
Full=Multiple exostoses protein 1 homolog b
gi|56785793|gb|AAW29034.1| EXT1b [Danio rerio]
Length = 741
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 81/333 (24%), Positives = 135/333 (40%), Gaps = 65/333 (19%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGY--FFQNIRESRFLTHDPDRAHLFFIPISC 145
FK+Y+YP + G+ SE Y I SRF T DP +A +F + +
Sbjct: 108 GFKVYVYP------------QQKGEKISESYQNILSTIEGSRFYTSDPGQACVFVLSLDT 155
Query: 146 HKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHD-VGVRATEGVPFLI 204
Y + +K V SL WN G +H + TE + F I
Sbjct: 156 LDRDQLSPQYVHN---LKTKVQSLA----LWNN--GRNHLIFNLYSGTWPDYTEDLGFDI 206
Query: 205 KNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRD-------------------VE 245
A+ S S + F P+ D+++P + PR G D
Sbjct: 207 GQAMLAKASISTE-NFRPNFDISIPLFSKEH--PRTGGDRGYLKYNTIPPFRKYMLVFKG 263
Query: 246 NRTVLGFWAGHRNSKIRVILA------------RVWENDTELDISNNRINRAIGPLVYQR 293
R + G + RN+ V A + W+ D ++ N Y+
Sbjct: 264 KRYLTGIGSDTRNALYHVHNAEDVVLLTTCKHGKDWQKHK--DARCDKDNAEYDKYDYRE 321
Query: 294 RFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDV 353
+ + FC+ P G ++ S R +++ CVPV+LSN ++LPF++++DW AVI +ER +
Sbjct: 322 MLHNSTFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWNTAAVIGDERLL 381
Query: 354 YQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
Q+ +++I + ++L + Q F W +
Sbjct: 382 LQIPSTVRSIHQDKILAL-----RQQTQFLWEA 409
>gi|168004467|ref|XP_001754933.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694037|gb|EDQ80387.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 378
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 69/289 (23%), Positives = 134/289 (46%), Gaps = 26/289 (8%)
Query: 132 DPDRAHLFFIP----ISCHKMRGKGTSYENMTV--IVKDYVDSLINKYPYWNRTLGADHF 185
DP++A +FF+P +S + G+ + N +++ V +++ +W ++ G DH
Sbjct: 70 DPEQAEVFFVPFFASLSFNSF-GRNMAAPNAAKDRELQEGVVEMLSNSKWWQKSQGRDHI 128
Query: 186 FVTCHDVGVRATEGVPFLIKNAIRVVCS---PSYDVAFIPHKDVALP--QVLQPFALPRG 240
V H R ++ ++ +V + VA + KD+ P V+ +
Sbjct: 129 IVIHHPNAFRYYRD---MMNQSMFIVADFGRYNQTVARL-KKDIVAPYAHVVPSYNEDNP 184
Query: 241 GRDVENRTVLGFWAGHRNSK----IRVILARVWENDTELDISNNRI-NRAIGPLVYQRRF 295
R L F+ G K IR L ++ N T++ ++ AI RF
Sbjct: 185 SDPFSARKTLLFFQGRVRRKADGVIRAKLGKLLMNQTDVYYEDSLARTEAIAMSTQGMRF 244
Query: 296 YKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVY- 354
++FC+ P G +S R+ D+I CVPVI+S+ +LPF D LD+ +F++ + ++
Sbjct: 245 --SRFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDRIELPFEDDLDYSEFSIFFSAKEAII 302
Query: 355 --QLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYEL 401
L L++I+ ++ + N L + HF++ +P DA +++ ++
Sbjct: 303 PGHLLGTLRSITRERWLQMWNKLKAISHHFEYQNPSKEDDAVNLIFKQV 351
>gi|117606177|ref|NP_001012369.2| exostosin-1b [Danio rerio]
gi|116487511|gb|AAI25901.1| Exostoses (multiple) 1b [Danio rerio]
gi|182891996|gb|AAI65657.1| Ext1b protein [Danio rerio]
Length = 741
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 82/333 (24%), Positives = 135/333 (40%), Gaps = 65/333 (19%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGY--FFQNIRESRFLTHDPDRAHLFFIPISC 145
FK+Y+YP + G+ SE Y I SRF T DP +A +F + +
Sbjct: 108 GFKVYVYP------------QQKGEKISESYQNILSTIEGSRFYTSDPGQACVFVLSLDT 155
Query: 146 HKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHD-VGVRATEGVPFLI 204
Y + +K V SL WN G +H + TE + F I
Sbjct: 156 LDRDQLSPQYVHN---LKTKVQSLA----LWNN--GRNHLIFNLYSGTWPDYTEDLGFDI 206
Query: 205 KNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRD-------------------VE 245
A+ S S + F P+ D+++P L PR G D
Sbjct: 207 GQAMLAKASISTE-NFRPNFDISIP--LFSKEHPRTGGDRGYLKYNTIPPFRKYMLVFKG 263
Query: 246 NRTVLGFWAGHRNSKIRVILA------------RVWENDTELDISNNRINRAIGPLVYQR 293
R + G + RN+ V A + W+ D ++ N Y+
Sbjct: 264 KRYLTGIGSDTRNALYHVHNAEDVVLLTTCKHGKDWQKHK--DARCDKDNAEYDKYDYRE 321
Query: 294 RFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDV 353
+ + FC+ P G ++ S R +++ CVPV+LSN ++LPF++++DW AVI +ER +
Sbjct: 322 MLHNSTFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWNTAAVIGDERLL 381
Query: 354 YQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
Q+ +++I + ++L + Q F W +
Sbjct: 382 LQIPSTVRSIHQDKILAL-----RQQTQFLWEA 409
>gi|1524413|gb|AAC51141.1| multiple exostosis-like protein [Homo sapiens]
gi|4106426|gb|AAD02840.1| multiple exostoses-like 1 [Homo sapiens]
gi|8132002|gb|AAF73172.1| exostoses-like protein 1 [Homo sapiens]
Length = 676
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 60/221 (27%), Positives = 98/221 (44%), Gaps = 30/221 (13%)
Query: 176 WNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPF 235
WNR G +H + H T F + A+ SP+ D +F P DVALP + +
Sbjct: 153 WNR--GRNHLVLRLHPAPCPRT----FQLGQAMVAEASPTVD-SFRPGFDVALPFLPEAH 205
Query: 236 ALPRGGRDVENRTVLGFWAGHRNSKIRVILA--------RVWENDTELDISNNRINRAIG 287
L RGG + R H +LA R + + + R + G
Sbjct: 206 PL-RGGAPGQLRQ-------HSPQPGVALLALEEERGGWRTADTGSSACPWDGRCEQDPG 257
Query: 288 PLVYQRR--FYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFA 345
P QR+ FC+ G ++R ++ GC+PV+LS ++LPF++++DW K A
Sbjct: 258 PGQTQRQETLPNATFCLISGHRPEAASRFLQALQAGCIPVLLSPRWELPFSEVIDWTKAA 317
Query: 346 VILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
++ +ER Q+ L+ +S A ++L Q F W++
Sbjct: 318 IVADERLPLQVLAALQEMSPARVLALRQ-----QTQFLWDA 353
>gi|115487106|ref|NP_001066040.1| Os12g0124400 [Oryza sativa Japonica Group]
gi|77552918|gb|ABA95714.1| secondary cell wall-related glycosyltransferase family 47,
putative, expressed [Oryza sativa Japonica Group]
gi|113648547|dbj|BAF29059.1| Os12g0124400 [Oryza sativa Japonica Group]
gi|215678612|dbj|BAG92267.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218186350|gb|EEC68777.1| hypothetical protein OsI_37312 [Oryza sativa Indica Group]
Length = 475
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 62/237 (26%), Positives = 109/237 (45%), Gaps = 13/237 (5%)
Query: 174 PYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALP--QV 231
P W R+ GADH V H + V F + + VA + KDV P +
Sbjct: 219 PEWKRSGGADHVIVAHHPNSLLHARSVLFPVVFVLSDFGRYHPRVASL-EKDVIAPYKHM 277
Query: 232 LQPFALPRGGRDVENRTVLGFWAG----HRNSKIRVILARVWENDTELDISNNRINRAIG 287
+ F G D +R L ++ G IR L + +++ ++ + + G
Sbjct: 278 AKTFVNDSAGFD--DRPTLLYFRGAIFRKEGGNIRQELYYMLKDEKDVYFAFGSVQDH-G 334
Query: 288 PLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVI 347
+ + +KFC+ G +S R+ D+I CVPVI+S+ +LP+ D LD+ KF++
Sbjct: 335 ASKASKGMHASKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDDIELPYEDALDYSKFSIF 394
Query: 348 LNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYEL 401
+ D + L ++++ +S ++ + N L +V KHF++ P + DA M+ L
Sbjct: 395 VRSSDAVKKGYLMRLIRGVSKHQWTRMWNRLKEVDKHFEYQYPSQKDDAVQMIWQAL 451
>gi|351697844|gb|EHB00763.1| Exostosin-like 1 [Heterocephalus glaber]
Length = 674
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 58/218 (26%), Positives = 99/218 (45%), Gaps = 33/218 (15%)
Query: 181 GADHFFVTCH-DVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALP- 238
G +H + H D RA++ + A+ SP D F P DVALP L P A P
Sbjct: 155 GRNHLVINLHPDSCPRASQ-----LGQAMVAEASPRVDT-FRPGFDVALP--LLPEAHPF 206
Query: 239 RGGRDVENRT----------VLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGP 288
RGG + R LG G ++ W+ E D+ + +
Sbjct: 207 RGGAPGQLRQHSPHPSAALLALGEEGGRWHTASTHSATCPWDGRCEQDLGPKQTHPG--- 263
Query: 289 LVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVIL 348
+ FC+ PG + ++R ++ GC+PV+LS +++LPF++++DW K A++
Sbjct: 264 ----EKLPNATFCLIPG-RRAEASRFLQALQAGCIPVLLSPHWELPFSEVIDWTKAAIVA 318
Query: 349 NERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
+ER Q+ L+ ++ A ++L + Q F W++
Sbjct: 319 DERLPLQVLAALQEMAPARVLAL-----RQQTQFLWDA 351
>gi|195123279|ref|XP_002006135.1| GI20872 [Drosophila mojavensis]
gi|193911203|gb|EDW10070.1| GI20872 [Drosophila mojavensis]
Length = 761
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 81/365 (22%), Positives = 142/365 (38%), Gaps = 71/365 (19%)
Query: 80 LNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGY--FFQNIRESRFLTHDPDRAH 137
++ + F +Y+YP N+ P +S Y I+ESR+ T DP A
Sbjct: 95 FDFTKCYDQFLVYVYPPEPLNSLGAAPP------SSPNYQKILTAIQESRYHTSDPAAAC 148
Query: 138 LFFIPISCHKMRGKGTSYENMTVIVKDYV---DSLINKYPYWNRTLGADHFFVTC----- 189
LF + G + + +DYV S + + P+WN G +H
Sbjct: 149 LFVL----------GIDTLDRDSLSEDYVRNVPSRLARLPHWNN--GRNHIIFNLYSGTW 196
Query: 190 -----HDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHK-DVALPQVLQPFALPRGGRD 243
H +G A E + + S V + H DV++P + F L G
Sbjct: 197 PDYAEHSLGFDAGEAI----------LAKASMSVLQMRHGFDVSIPLFHKQFPLRAGATG 246
Query: 244 V-------ENRTVLGFWAGHR-----------------NSKIRVILARVWENDTELDISN 279
N+ L + G R N + V++ + ++ +
Sbjct: 247 SVQSNNFPANKKYLLAFKGKRYVHGIGSETRNSLFHLHNGRDMVMVTTCKHGKSWRELQD 306
Query: 280 NRI---NRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFN 336
NR NR Y + FC+ P G ++ S R +++ GC+PV+LSN + LPF
Sbjct: 307 NRCDEDNREYDRYDYGTLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFE 366
Query: 337 DILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHM 396
+DW++ A+ +ER + Q+ I+++IS +L + + + + + F F +
Sbjct: 367 SKIDWKQAAIWADERLLLQVPDIVRSISAERIFALRQQTQVLWERYFGSIEKIVFTTFEI 426
Query: 397 VMYEL 401
+ L
Sbjct: 427 IRERL 431
>gi|302782213|ref|XP_002972880.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
gi|300159481|gb|EFJ26101.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
Length = 459
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 81/290 (27%), Positives = 123/290 (42%), Gaps = 34/290 (11%)
Query: 132 DPDRAHLFFIPI-----------SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTL 180
DP A +FF+P + H+ +G EN + K V+ L + W R
Sbjct: 156 DPATADIFFVPFFSSLSYNRYCRTGHRFQGGRGCVEN-DRLEKRLVEFLRGQ-ELWRRNG 213
Query: 181 GADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSP---SYDVAFIPHKDVALP--QVLQPF 235
GADH V H + L+K A+ VV S VA + KD+ P V+ F
Sbjct: 214 GADHVIVMHHPNSLMVARS---LLKEAMFVVADFGRFSRAVANM-RKDIVAPYKHVIPSF 269
Query: 236 ALPRGGRDVENRTVLGFWAG----HRNSKIRVILARVWENDTELD-ISNNRINRAIGPLV 290
A R E+R L F+ G IR L + ++ + ++ N I
Sbjct: 270 A--RDATTFESRETLLFFQGAIVRKEGGIIRQKLYEILKDSPGVHFVTGNTQKDGIRSAT 327
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
R KFC+ G +S R+ D+I CVPVI+S+ +LPF D LD+ +F V +
Sbjct: 328 AGMR--NAKFCLHLAGDTPSSNRLFDAIASHCVPVIISDEIELPFEDELDYSQFCVFVES 385
Query: 351 RDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMV 397
+ + + L+ I E+ L V++HF++ P + DA HM
Sbjct: 386 DKALRKGFVVRALERIGRDEWTRKWAMLKSVERHFEYQHPSLPEDAVHMT 435
>gi|194755878|ref|XP_001960206.1| GF13249 [Drosophila ananassae]
gi|190621504|gb|EDV37028.1| GF13249 [Drosophila ananassae]
Length = 765
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 79/346 (22%), Positives = 139/346 (40%), Gaps = 51/346 (14%)
Query: 89 FKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHKM 148
F +Y+YP N+ P A+ I+ESR+ T+DP A LF +
Sbjct: 105 FLVYVYPPEPLNSLGAAP----PSSANYQKILTAIQESRYHTNDPSAACLFVL------- 153
Query: 149 RGKGTSYENMTVIVKDY---VDSLINKYPYWNRTLGADHFFVTCHDVGVR--ATEGVPFL 203
G + + +DY V S + + P+WN G +H + A + F
Sbjct: 154 ---GIDTLDRDSLSEDYVRNVPSRLARLPHWNN--GRNHIIFNLYSGTWPDYAENSLGFD 208
Query: 204 IKNAIRVVCSPSYDVAFIPHK-DVALPQVLQPFALPRGGRDV-------ENRTVLGFWAG 255
AI + S V + H DV++P + F L G N+ L + G
Sbjct: 209 AGEAI--LAKASMSVLQVRHGFDVSIPLFHKQFPLRAGATGSVQSNNFPANKKYLLAFKG 266
Query: 256 HR-----------------NSKIRVILARVWENDTELDISNNRI---NRAIGPLVYQRRF 295
R N + V++ + ++ +NR NR Y+
Sbjct: 267 KRYVHGIGSETRNSLFHLHNGRDMVLVTTCRHGKSWRELQDNRCDEDNREYDRYDYETLL 326
Query: 296 YKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ 355
+ FC+ P G ++ S R +++ GC+PV+LSN + LPF +DW++ A+ +ER + Q
Sbjct: 327 QNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIWADERLLLQ 386
Query: 356 LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYEL 401
+ +++IS +L + + + + + F F ++ L
Sbjct: 387 VPDTVRSISVERIFALRQQTQVLWERYFGSIEKIVFTTFEIIRERL 432
>gi|222616547|gb|EEE52679.1| hypothetical protein OsJ_35064 [Oryza sativa Japonica Group]
Length = 528
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 63/245 (25%), Positives = 112/245 (45%), Gaps = 13/245 (5%)
Query: 174 PYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALP--QV 231
P W R+ GADH V H + V F + + VA + KDV P +
Sbjct: 272 PEWKRSGGADHVIVAHHPNSLLHARSVLFPVVFVLSDFGRYHPRVASL-EKDVIAPYKHM 330
Query: 232 LQPFALPRGGRDVENRTVLGFWAG----HRNSKIRVILARVWENDTELDISNNRINRAIG 287
+ F G D +R L ++ G IR L + +++ ++ + + G
Sbjct: 331 AKTFVNDSAGFD--DRPTLLYFRGAIFRKEGGNIRQELYYMLKDEKDVYFAFGSVQDH-G 387
Query: 288 PLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVI 347
+ + +KFC+ G +S R+ D+I CVPVI+S+ +LP+ D LD+ KF++
Sbjct: 388 ASKASKGMHASKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDDIELPYEDALDYSKFSIF 447
Query: 348 LNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLR 404
+ D + L ++++ +S ++ + N L +V KHF++ P + DA M+ L +
Sbjct: 448 VRSSDAVKKGYLMRLIRGVSKHQWTRMWNRLKEVDKHFEYQYPSQKDDAVQMIWQALARK 507
Query: 405 RHVIK 409
I+
Sbjct: 508 VPAIR 512
>gi|357126548|ref|XP_003564949.1| PREDICTED: probable glycosyltransferase At3g42180-like
[Brachypodium distachyon]
Length = 432
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 73/296 (24%), Positives = 130/296 (43%), Gaps = 29/296 (9%)
Query: 132 DPDRAHLFFIPI------SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHF 185
DP+ A FF+P + H E ++ + +D L K YW R+ G DH
Sbjct: 128 DPEAAEAFFVPFFSSLSFNVHGRNMTDPDTEADRLLQVELIDVLW-KSKYWQRSAGRDHV 186
Query: 186 FVTCHDVGVRATEGVPFLIKNAIRVVCS---PSYDVAFIPHKDVALP--QVLQPFALPRG 240
H R ++ ++ +V + ++A + KDV P V+ F
Sbjct: 187 IPMHHPNAFRFLRD---MVNASVLIVADFGRYTQELASL-RKDVVAPYVHVVDSFINDDP 242
Query: 241 GRDVENRTVLGFWAGHR----NSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFY 296
E R L F+ G KIR LA++ ++ + ++ + G
Sbjct: 243 PDPFEARPTLLFFRGRTVRKAEGKIRAKLAKILKDKDGVRFEDS-LATGEGINTSTEGMR 301
Query: 297 KTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDV--- 353
+KFC+ P G +S R+ D+I C+PVI+S+ +LPF D +D+ +F++ + +
Sbjct: 302 SSKFCLHPAGDTPSSCRLFDAIVSHCIPVIVSSRIELPFEDEIDYSEFSLFFSVEEALKP 361
Query: 354 -YQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVI 408
Y L Q L+ + ++V + + L V H+++ P + DA +M+ ++ RH I
Sbjct: 362 DYLLDQ-LRQMPKEKWVEMWSKLKNVSSHYEFQYPTRKGDAVNMIWRQV---RHKI 413
>gi|55742053|ref|NP_001006730.1| exostosin 1 [Xenopus (Silurana) tropicalis]
gi|49523045|gb|AAH75481.1| exostoses (multiple) 1 [Xenopus (Silurana) tropicalis]
Length = 738
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 80/331 (24%), Positives = 133/331 (40%), Gaps = 61/331 (18%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
FK+YIYP Q KL+ Y + I SR T DP +A LF + +
Sbjct: 105 GFKVYIYPQ-------QKGDKLSESYQ---HILAAIESSRLYTSDPSQACLFVLSLDTLD 154
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHD-VGVRATEGVPFLIKN 206
Y + + S + WN G +H + TE V F I
Sbjct: 155 RDQLSPQYVHN-------LKSKVQNLHLWNN--GRNHLIFNLYSGTWPDYTEDVGFDIGQ 205
Query: 207 AIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVE-------------------NR 247
A+ S S + F P+ DV++P + PR G + R
Sbjct: 206 AMLAKASISTE-NFRPNFDVSIPLFSKDH--PRTGGEKGFLRFNNIPPMRKYMLVFKGKR 262
Query: 248 TVLGFWAGHRNSKIRV------IL------ARVWENDTELDISNNRINRAIGPLVYQRRF 295
+ G + RN+ V +L + W+ D ++ N Y+
Sbjct: 263 YLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHK--DARCDKDNAEYEKYDYREML 320
Query: 296 YKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ 355
+ FC+ P G ++ S R +++ CVPV+LSN ++LPF++++DW + AVI +ER + Q
Sbjct: 321 HNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWNQAAVIGDERLLLQ 380
Query: 356 LKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
+ +++I + ++L + Q F W +
Sbjct: 381 IPSTIRSIHQDKILAL-----RQQTQFLWEA 406
>gi|395512319|ref|XP_003760388.1| PREDICTED: exostosin-1 [Sarcophilus harrisii]
Length = 746
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 82/333 (24%), Positives = 136/333 (40%), Gaps = 65/333 (19%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
FK+Y+YP Q K+ Y + I SRF T DP +A LF + +
Sbjct: 113 GFKVYVYPQ-------QKGEKIAESYQN---ILAAIEGSRFYTSDPSQACLFVLSLDTLD 162
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHD-VGVRATEGVPFLIKN 206
Y + +K V SL WN G +H + TE V F I
Sbjct: 163 RDQLSPQYVHN---LKSKVQSL----HLWNN--GRNHLIFNLYSGTWPDYTEDVGFDIGQ 213
Query: 207 AIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVE-------------------NR 247
A+ S S + F P+ DV++P + PR G + R
Sbjct: 214 AMLAKASISTE-NFRPNFDVSIPLFSKDH--PRTGGEKGFLRFNTIPPLRKYMLVFKGKR 270
Query: 248 TVLGFWAGHRNSKIRV------IL------ARVWEN--DTELDISNNRINRAIGPLVYQR 293
+ G + RN+ V +L + W+ D+ D N + Y+
Sbjct: 271 YLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDKDNTEYEK----YDYRE 326
Query: 294 RFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDV 353
+ FC+ P G ++ S R +++ CVPV+LSN ++LPF++++DW + A+I +ER +
Sbjct: 327 MLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWNQAAIIGDERLL 386
Query: 354 YQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
Q+ +++I + ++L + Q F W +
Sbjct: 387 LQIPSTIRSIHQDKILAL-----RQQTQFLWEA 414
>gi|126322251|ref|XP_001370057.1| PREDICTED: exostosin-1 [Monodelphis domestica]
Length = 746
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 82/333 (24%), Positives = 136/333 (40%), Gaps = 65/333 (19%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
FK+Y+YP Q K+ Y + I SRF T DP +A LF + +
Sbjct: 113 GFKVYVYPQ-------QKGEKIAESYQN---ILAAIEGSRFYTSDPSQACLFVLSLDTLD 162
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHD-VGVRATEGVPFLIKN 206
Y + +K V SL WN G +H + TE V F I
Sbjct: 163 RDQLSPQYVHN---LKSKVQSL----HLWNN--GRNHLIFNLYSGTWPDYTEDVGFDIGQ 213
Query: 207 AIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVE-------------------NR 247
A+ S S + F P+ DV++P + PR G + R
Sbjct: 214 AMLAKASISTE-NFRPNFDVSIPLFSKDH--PRTGGEKGFLRFNTIPPLRKYMLVFKGKR 270
Query: 248 TVLGFWAGHRNSKIRV------IL------ARVWEN--DTELDISNNRINRAIGPLVYQR 293
+ G + RN+ V +L + W+ D+ D N + Y+
Sbjct: 271 YLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSRCDKDNTEYEK----YDYRE 326
Query: 294 RFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDV 353
+ FC+ P G ++ S R +++ CVPV+LSN ++LPF++++DW + A+I +ER +
Sbjct: 327 MLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWNQAAIIGDERLL 386
Query: 354 YQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
Q+ +++I + ++L + Q F W +
Sbjct: 387 LQIPSTIRSIHQDKILAL-----RQQTQFLWEA 414
>gi|356516800|ref|XP_003527081.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
[Glycine max]
Length = 493
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 68/301 (22%), Positives = 132/301 (43%), Gaps = 31/301 (10%)
Query: 131 HDPDRAHLFFIP----ISCHK-MRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHF 185
D +A + F+P +S ++ + G ++ +++D + + W R+ G DH
Sbjct: 185 QDSSQADVIFVPFFSSLSYNRHSKLNGEEKVSLNKMLQDRLVQFLMGQKEWKRSGGKDHL 244
Query: 186 FVTCHD---VGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRG-G 241
V H + R G L+ + ++ KD+ P +P+
Sbjct: 245 IVAHHPNSLLDARRKLGAAMLVLADFGRYPTELANIK----KDIIAPYRHLVSTIPKAKS 300
Query: 242 RDVENRTVLGFWAG----HRNSKIRVILARVWENDTEL-----DISNNRINRAIGPLVYQ 292
E RT L ++ G IR L + +++ ++ I N IN+A +
Sbjct: 301 ASFEKRTTLVYFQGAIYRKDGGAIRQELYYLLKDEKDVHFTFGSIGGNGINQASQGMAM- 359
Query: 293 RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERD 352
+KFC+ G +S R+ D+I CVPVI+S+ +LPF D+LD+ F++ + D
Sbjct: 360 -----SKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDEIELPFEDVLDYSDFSIFVRASD 414
Query: 353 VYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIK 409
+ L +L++I+ E+ + L ++ HF++ P DA +M+ ++ + I+
Sbjct: 415 SMKKGYLLNLLRSITQKEWSKMWERLKQITHHFEYQYPSQPGDAVNMIWQQVERKISSIR 474
Query: 410 Y 410
+
Sbjct: 475 F 475
>gi|125586098|gb|EAZ26762.1| hypothetical protein OsJ_10674 [Oryza sativa Japonica Group]
Length = 364
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 69/240 (28%), Positives = 114/240 (47%), Gaps = 27/240 (11%)
Query: 174 PYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCS-----PSYDVAFIPHKDVAL 228
P W R+ G DH + H G+ + + + V+C PS VA + KDV
Sbjct: 92 PEWRRSGGRDHVVLAHHPNGMLDAR---YKLWPCVFVLCDFGRYPPS--VAGL-DKDVIA 145
Query: 229 P--QVLQPFALPRGGRDVENRTVLGFWAGHRNSK----IRVILARVWENDTELDISNNRI 282
P V+ FA G D +R L ++ G K IR L + +++ ++ S
Sbjct: 146 PYRHVVPNFANDSAGYD--DRPTLLYFQGAIYRKDGGFIRQELYYLLKDEKDVHFS---F 200
Query: 283 NRAIGPLVYQ--RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILD 340
+G + Q + +KFC+ G +S R+ DSI CVP+I+S+ +LPF D+LD
Sbjct: 201 GSVVGNGIEQATQGMRASKFCLNIAGDTPSSNRLFDSIVSHCVPIIISDEIELPFEDVLD 260
Query: 341 WRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMV 397
+ KF +I+ D + L ++ IS ++ + N L +V++HF++ P DA M+
Sbjct: 261 YSKFCIIVRGADAVKKGFLMNLINGISREDWTRMWNRLKEVERHFEYQYPSQNDDAVQMI 320
>gi|356553158|ref|XP_003544925.1| PREDICTED: probable glucuronosyltransferase Os01g0926700-like
[Glycine max]
Length = 440
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 84/366 (22%), Positives = 151/366 (41%), Gaps = 37/366 (10%)
Query: 65 SDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRK----------LTGKYA 114
S L + V H S + + + KIY+Y + + + + R L G++
Sbjct: 44 STLSTSVVHV-SQPRFQWGQSQLSLKIYVYQEDEIDGLKELLRGRDAKITDEACLKGQWG 102
Query: 115 SEGYFFQNIRESRFLTHDPDRAHLFFIP--ISCHKMRGKGTSYENMTVIVKDYVDSLINK 172
S+ + + +S+ T + A LFF+P + C +M G E + VK +I++
Sbjct: 103 SQVKIHKLLLQSKQRTWKKEEADLFFVPSYVKCARMMGGLNDKEINSTYVK-----VISQ 157
Query: 173 YPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYD-------VAFIPHKD 225
PY+ + G +H FV G + I +I + +P D AF KD
Sbjct: 158 MPYFRLSGGRNHIFVFPSGAGAHLFKSWATYINRSI--ILTPEGDRTDKRDTSAFNTWKD 215
Query: 226 VALPQVLQPFALPRGGRDVE-----NRTVLGFWAGHRNSK---IRVI-LARVWENDTELD 276
+ +P + G V+ R L + G K +++I L++ + E
Sbjct: 216 IIIPGNIDDGMTKTGDTTVQPLPLSKRKYLANYLGRAQGKAGRLKLIELSKQFPEKLECP 275
Query: 277 ISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFN 336
+G Y +KFC+ P G + R +S CVPVILS+ +LPF
Sbjct: 276 DLKFSGPDKLGRKEYFEHLRNSKFCLAPRGESSWTLRFYESFFVECVPVILSDQIELPFQ 335
Query: 337 DILDWRKFAVILNERDVY-QLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFH 395
+++D+ + ++ + +L Q L++I D E + +V+ + + S A
Sbjct: 336 NVIDYSQISIKWPSSQIGPELLQYLESIPDEEIEKIIARGRQVRCWWVYASDSESCSAMR 395
Query: 396 MVMYEL 401
+M+EL
Sbjct: 396 GIMWEL 401
>gi|115452759|ref|NP_001049980.1| Os03g0324700 [Oryza sativa Japonica Group]
gi|108707908|gb|ABF95703.1| secondary cell wall-related glycosyltransferase family 47,
putative, expressed [Oryza sativa Japonica Group]
gi|113548451|dbj|BAF11894.1| Os03g0324700 [Oryza sativa Japonica Group]
gi|125543687|gb|EAY89826.1| hypothetical protein OsI_11372 [Oryza sativa Indica Group]
gi|215768347|dbj|BAH00576.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 468
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 69/240 (28%), Positives = 115/240 (47%), Gaps = 27/240 (11%)
Query: 174 PYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCS-----PSYDVAFIPHKDVAL 228
P W R+ G DH + H G+ + + + V+C PS VA + KDV
Sbjct: 196 PEWRRSGGRDHVVLAHHPNGMLDAR---YKLWPCVFVLCDFGRYPPS--VAGL-DKDVIA 249
Query: 229 P--QVLQPFALPRGGRDVENRTVLGFWAGHRNSK----IRVILARVWENDTELDISNNRI 282
P V+ FA G D +R L ++ G K IR L + +++ ++ S +
Sbjct: 250 PYRHVVPNFANDSAGYD--DRPTLLYFQGAIYRKDGGFIRQELYYLLKDEKDVHFSFGSV 307
Query: 283 NRAIGPLVYQ--RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILD 340
+G + Q + +KFC+ G +S R+ DSI CVP+I+S+ +LPF D+LD
Sbjct: 308 ---VGNGIEQATQGMRASKFCLNIAGDTPSSNRLFDSIVSHCVPIIISDEIELPFEDVLD 364
Query: 341 WRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMV 397
+ KF +I+ D + L ++ IS ++ + N L +V++HF++ P DA M+
Sbjct: 365 YSKFCIIVRGADAVKKGFLMNLINGISREDWTRMWNRLKEVERHFEYQYPSQNDDAVQMI 424
>gi|42566010|ref|NP_191322.3| exostosin family protein [Arabidopsis thaliana]
gi|44917463|gb|AAS49056.1| At3g57630 [Arabidopsis thaliana]
gi|46931284|gb|AAT06446.1| At3g57630 [Arabidopsis thaliana]
gi|332646159|gb|AEE79680.1| exostosin family protein [Arabidopsis thaliana]
Length = 793
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 86/333 (25%), Positives = 139/333 (41%), Gaps = 68/333 (20%)
Query: 113 YASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMT----------- 159
Y S+ F++NI + T + + A FF+P+ SC R + NM
Sbjct: 391 YGSQMAFYENILATAHRTMNGEEADFFFVPVLDSCIINRADDAPHINMQNHTGLRSSLTL 450
Query: 160 VIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVV-------- 211
K + ++ KYPYWNR+ G DH + D G P I N++ +V
Sbjct: 451 EFYKRAYEHIVEKYPYWNRSAGRDHIWFFSWDEGACY---APKEIWNSMMLVHWGNTNSK 507
Query: 212 ---CSPSY------DVA---------FIPHKDVALP--QVLQPFALPRG--GRDVENRTV 249
+ +Y D++ F P KD+ +P +V P+++ + R E R
Sbjct: 508 HNHSTTAYFGDNWDDISDERRGDHPCFDPRKDLVIPAWKVPDPYSMRKNYWERPREKRKT 567
Query: 250 LGFWAGHRNSK-------------IRVILARVWEN--DTELDISNNRINRAI-GPLV--- 290
L ++ G+ IR LA + + + E + I PL
Sbjct: 568 LFYFNGNLGPAYEKGRPEDSYSMGIRQKLAEEFGSSPNKEGKLGKQHAEDVIVTPLRSDN 627
Query: 291 YQRRFYKTKFCIC-PGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
Y + + FC PG S R+ DSI GCVPVI+ + LP+ ++L++ FAV +N
Sbjct: 628 YHKDIANSIFCGAFPGDGW--SGRMEDSILQGCVPVIIQDGIYLPYENMLNYESFAVRVN 685
Query: 350 ERDVYQLKQILKNISDAEFVSLHNNLVKVQKHF 382
E D+ L L+ S+AE N+ ++ + F
Sbjct: 686 EDDIPNLINTLRGFSEAEIQFRLGNVKELWQRF 718
>gi|444706299|gb|ELW47642.1| Exostosin-like 1, partial [Tupaia chinensis]
Length = 595
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 59/217 (27%), Positives = 90/217 (41%), Gaps = 31/217 (14%)
Query: 181 GADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALP-R 239
G +H ++ H T F + A+ SP+ D F P DVALP L P A P R
Sbjct: 75 GRNHLVISLHAAPCPRT----FQLGQAMVAEASPTVDT-FRPGFDVALP--LLPEAHPLR 127
Query: 240 GGR-------DVENRTVLGFWAGHRNSKIRVIL---ARVWENDTELDISNNRINRAIGPL 289
GG + R L WAG A W E D + +
Sbjct: 128 GGAPGRLRQYSPQPREALLAWAGESGGGPPAGTDSSACPWHGRCEQDAGAEQTHPG---- 183
Query: 290 VYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
FC+ PG + + R ++ GC+PV+LS ++LPF++++DW K A++ +
Sbjct: 184 ---DTLPSATFCLIPG-RRPGAGRFLQALQAGCIPVLLSPRWELPFSEVIDWTKAAIVAD 239
Query: 350 ERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
ER Q+ L+ + A ++L Q F W +
Sbjct: 240 ERLPLQVLAALQEMPPARVLALRQ-----QTQFLWGA 271
>gi|302800594|ref|XP_002982054.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
gi|300150070|gb|EFJ16722.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
Length = 453
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 73/293 (24%), Positives = 124/293 (42%), Gaps = 48/293 (16%)
Query: 132 DPDRAHLFFIPI------SCHKMRGKG--------TSYENMTVIVKDYVDSLINKYPYWN 177
DPD+A + F+P GKG YE +++ ++ W
Sbjct: 116 DPDQADVVFVPFFAALSAEAQLRNGKGHFRHRKDNEDYERQKAVME-----IVTSSSRWQ 170
Query: 178 RTLGADHFFVTCHDVGV---RATEGVPFLI-----------KNAIRVVCSPS--YDVAFI 221
R+ G DH FV + + RA L+ + R + SP Y
Sbjct: 171 RSGGRDHVFVLTDPMAMYHFRAEIANSILLVVDFGGWYMEDAKSSRNLSSPQPIYHTQVS 230
Query: 222 PHKDVALP--QVLQPFALPRGGRDVENRTVLGFWAG----HRNSKIRVILARVWENDTEL 275
KDV +P +L AL +D RT L ++ G HR +R L V + + +
Sbjct: 231 LIKDVIVPYTHLLPTLAL---SQDNAVRTTLLYFKGARYRHRTGLVRDQLWSVLDGEPGV 287
Query: 276 DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPF 335
+ NR G + + + FC+ P G +S R+ D++ C+PVI+S+ +LPF
Sbjct: 288 LLEEGFPNRT-GQVQAVQGMRNSHFCLHPAGDTPSSCRLFDAVASLCIPVIVSDSIELPF 346
Query: 336 NDILDWRKFAVILNERDVYQLKQILKNISDAEFV---SLHNNLVKVQKHFQWN 385
+LD+ +FA+ ++ D K +++++S + +NL VQ HF++
Sbjct: 347 EGMLDYTQFAIFVSVHDALLPKWLVRHLSSLSSKVRNQMRHNLASVQHHFEYE 399
>gi|42572713|ref|NP_974452.1| exostosin family protein [Arabidopsis thaliana]
gi|110740929|dbj|BAE98560.1| hypothetical protein [Arabidopsis thaliana]
gi|332646160|gb|AEE79681.1| exostosin family protein [Arabidopsis thaliana]
Length = 791
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 86/333 (25%), Positives = 139/333 (41%), Gaps = 68/333 (20%)
Query: 113 YASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMT----------- 159
Y S+ F++NI + T + + A FF+P+ SC R + NM
Sbjct: 389 YGSQMAFYENILATAHRTMNGEEADFFFVPVLDSCIINRADDAPHINMQNHTGLRSSLTL 448
Query: 160 VIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVV-------- 211
K + ++ KYPYWNR+ G DH + D G P I N++ +V
Sbjct: 449 EFYKRAYEHIVEKYPYWNRSAGRDHIWFFSWDEGACY---APKEIWNSMMLVHWGNTNSK 505
Query: 212 ---CSPSY------DVA---------FIPHKDVALP--QVLQPFALPRG--GRDVENRTV 249
+ +Y D++ F P KD+ +P +V P+++ + R E R
Sbjct: 506 HNHSTTAYFGDNWDDISDERRGDHPCFDPRKDLVIPAWKVPDPYSMRKNYWERPREKRKT 565
Query: 250 LGFWAGHRNSK-------------IRVILARVWEN--DTELDISNNRINRAI-GPLV--- 290
L ++ G+ IR LA + + + E + I PL
Sbjct: 566 LFYFNGNLGPAYEKGRPEDSYSMGIRQKLAEEFGSSPNKEGKLGKQHAEDVIVTPLRSDN 625
Query: 291 YQRRFYKTKFCIC-PGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
Y + + FC PG S R+ DSI GCVPVI+ + LP+ ++L++ FAV +N
Sbjct: 626 YHKDIANSIFCGAFPGDGW--SGRMEDSILQGCVPVIIQDGIYLPYENMLNYESFAVRVN 683
Query: 350 ERDVYQLKQILKNISDAEFVSLHNNLVKVQKHF 382
E D+ L L+ S+AE N+ ++ + F
Sbjct: 684 EDDIPNLINTLRGFSEAEIQFRLGNVKELWQRF 716
>gi|311253457|ref|XP_001925015.2| PREDICTED: exostosin-1 [Sus scrofa]
Length = 746
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 81/329 (24%), Positives = 133/329 (40%), Gaps = 57/329 (17%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
FK+Y+YP Q K+ Y + I SRF T DP +A LF + +
Sbjct: 113 GFKVYVYPQ-------QKGEKIAESYQN---ILAAIEGSRFYTSDPSQACLFVLSLDTLD 162
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHD-VGVRATEGVPFLIKN 206
Y + +K V SL WN G +H + TE V F I
Sbjct: 163 RDQLSPQYVHN---LKSKVQSL----HLWNN--GRNHLIFNLYSGTWPDYTEDVGFDIGQ 213
Query: 207 AIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDV-----------------ENRTV 249
A+ S S + F P+ DV++P + G R R +
Sbjct: 214 AMLAKASISTE-NFRPNFDVSIPLFSKDHPRTGGERGFLKFNTIPPLRKYMLVFKGKRYL 272
Query: 250 LGFWAGHRNSKIRV------IL------ARVWENDTELDISNNRINRAIGPLVYQRRFYK 297
G + RN+ V +L + W+ D +R N Y+ +
Sbjct: 273 TGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHK--DSRCDRDNTEYEKYDYREMLHN 330
Query: 298 TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLK 357
FC+ P G ++ S R +++ CVPV+LSN ++LPF+++++W + AVI +ER + Q+
Sbjct: 331 ATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIP 390
Query: 358 QILKNISDAEFVSLHNNLVKVQKHFQWNS 386
+++I + ++L + Q F W +
Sbjct: 391 STIRSIHQDKILAL-----RQQTQFLWEA 414
>gi|426328448|ref|XP_004025264.1| PREDICTED: exostosin-like 1 [Gorilla gorilla gorilla]
Length = 676
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 60/221 (27%), Positives = 97/221 (43%), Gaps = 30/221 (13%)
Query: 176 WNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPF 235
WNR G +H + H T F + A+ SP+ D +F P DVALP + +
Sbjct: 153 WNR--GRNHLVLRLHPAPCPRT----FQLGQAMVAEASPTVD-SFRPGFDVALPFLPEAH 205
Query: 236 ALPRGGRDVENRTVLGFWAGHRNSKIRVILA--------RVWENDTELDISNNRINRAIG 287
L RGG + R H +LA R + + + R + G
Sbjct: 206 PL-RGGAPGQLRQ-------HSPQPGVALLALEEERGGWRTADTGSSACPWDGRCEQDPG 257
Query: 288 PLVYQRR--FYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFA 345
P QR+ FC+ G + R ++ GC+PV+LS ++LPF++++DW K A
Sbjct: 258 PGQTQRQETLPNATFCLISGHRPEAALRFLQALQAGCIPVLLSPRWELPFSEVIDWTKAA 317
Query: 346 VILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
++ +ER Q+ L+ +S A ++L Q F W++
Sbjct: 318 IVADERLPLQVLAALQEMSPARVLALRQ-----QTQFLWDA 353
>gi|125528898|gb|EAY77012.1| hypothetical protein OsI_04969 [Oryza sativa Indica Group]
Length = 416
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 63/247 (25%), Positives = 113/247 (45%), Gaps = 19/247 (7%)
Query: 168 SLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCS---PSYDVAFIPHK 224
++ K YW R+ G DH H R ++ +I +V + ++A + K
Sbjct: 152 EILWKSKYWQRSAGRDHVIPMHHPNAFRFLRD---MVNASILIVADFGRYTKELASL-RK 207
Query: 225 DVALP--QVLQPFALPRGGRDVENRTVLGFWAGHR----NSKIRVILARVWENDTELDIS 278
DV P V+ F + R L F+ G KIR LA++ + +
Sbjct: 208 DVVAPYVHVVDSFLNDDPPDPFDARPTLLFFRGRTVRKDEGKIRAKLAKILKGKDGVRFE 267
Query: 279 NNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDI 338
++ + G +KFC+ P G +S R+ D+I CVPVI+S+ +LPF D
Sbjct: 268 DS-LATGEGIKTSTEGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSSRIELPFEDE 326
Query: 339 LDWRKFAVILNERDV----YQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAF 394
+D+ +F++ + + Y L Q L+ I ++V + + L V H+++ +PP + DA
Sbjct: 327 IDYSEFSLFFSVEEALRPDYLLNQ-LRQIQKTKWVEIWSKLKNVSHHYEFQNPPRKGDAV 385
Query: 395 HMVMYEL 401
+M+ ++
Sbjct: 386 NMIWRQV 392
>gi|302837063|ref|XP_002950091.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300264564|gb|EFJ48759.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 593
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 71/305 (23%), Positives = 122/305 (40%), Gaps = 46/305 (15%)
Query: 141 IPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVG------- 193
I ++C +G + M +V+D +D + + +PYW R G DH ++ HD G
Sbjct: 283 ITVTCRLPVNRGPRVKQMVNMVRDTLDWINSTFPYWRRRGGRDHIWLFTHDEGACWAPTA 342
Query: 194 --------------VRATEGVPFLIKNAIRVVCSPSYDVAFI----------PHKDVALP 229
+ T FL P F+ P KD+ +P
Sbjct: 343 INASIWLTHWGRTELNHTSNTAFLADKYDSDFAGPLQPEGFVKYIKGHPCFNPEKDLVIP 402
Query: 230 QVLQP---FALPRGGRDVENRTVLGFWAGHRNSK-----IRVILARVWENDTELDISNN- 280
P A P G R +L F+ G R + ++++ E +
Sbjct: 403 AFKAPSHYHASPLQGNPARERDLLFFFRGDVGKNRLPNYSRGVRQQIYKMAKEGGWAEKY 462
Query: 281 RINRAIGPLV---YQRRFYKTKFC-ICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFN 336
R G V Y + FC + PG SAR+ D++ +GC+PV++++ + F
Sbjct: 463 RFYIGDGSDVEGDYSEMLSRAIFCLVAPGDGW--SARMEDAVLHGCIPVVIADGVEAVFE 520
Query: 337 DILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHM 396
++L+ FA+ L + V +L +L+ + S +L +V + ++W S P DAF
Sbjct: 521 NVLELDAFALRLPQEAVPRLLDVLRAVPQRAIRSKQAHLGRVWQRYRWASLPKLDDAFAT 580
Query: 397 VMYEL 401
+M L
Sbjct: 581 IMQWL 585
>gi|359473738|ref|XP_002272591.2| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
At5g25310-like [Vitis vinifera]
Length = 437
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 72/292 (24%), Positives = 124/292 (42%), Gaps = 37/292 (12%)
Query: 132 DPDRAHLFFIPI--------SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGAD 183
DP+ A FF+P H M T ++ I + ++ + YW R+ G D
Sbjct: 126 DPEMADAFFVPFFSSLSFNTHGHNMTDPDTEFDRQLQID---ILKILRESKYWQRSGGRD 182
Query: 184 HFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIP------HKDVALP--QVLQPF 235
H H R + +I +V D P KDV P V+ F
Sbjct: 183 HVIPMHHPNAFRFFRE---QVNTSILIVA----DFGRYPKEISNLRKDVVAPYVHVVDSF 235
Query: 236 ALPRGGRDVENRTVLGFWAGHRNSK----IRVILARVWENDTELDISNNRINRAIGPLVY 291
E+RT L F+ G K +R L ++ + + + LV
Sbjct: 236 TDDNSPDPYESRTTLLFFRGRTIRKDEGIVRDKLVKLLAGXDDYLQLHFHHRSYLSFLVX 295
Query: 292 Q--RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
Q + +KFC+ P G +S R+ D+I CVPVI+S+ +LP+ D +D+ +F++ +
Sbjct: 296 QSTQGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPYEDEIDYTQFSIFFS 355
Query: 350 ERDV----YQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMV 397
+++ Y ++Q L+ I +V + +L + H+++ PP + DA M+
Sbjct: 356 DKEALEPGYMIEQ-LRQIPKERWVEMWRHLKYISHHYEFQYPPKKGDAIDML 406
>gi|302852375|ref|XP_002957708.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300257002|gb|EFJ41257.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 600
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 72/313 (23%), Positives = 135/313 (43%), Gaps = 53/313 (16%)
Query: 113 YASEGYFFQN-IRESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLIN 171
YA+ YF +++ T +P A+LF+IP+ + + + DYV +
Sbjct: 260 YAAYEYFMTYFLQDHAVRTENPYEANLFYIPMLAYFYIANVRNPVPQVTLALDYVRT--- 316
Query: 172 KYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVV----CSPSYDVAFIPHKDVA 227
K+P++NRT G DHF+ D G +T +L + I++V +P+++
Sbjct: 317 KWPFYNRTGGRDHFYFLTGDRGACSTP--RWLQDSCIKLVHFGLQGEELPGTGVPNREYG 374
Query: 228 LPQVLQPFALP----------------------RGGRDVENRTVLGFWAGHRN------- 258
QV + +P + G D NR +L F+AG
Sbjct: 375 CVQVKRDLVIPPINLFTDLVPSETQAYYKWLVSKKGYD-SNRKLLFFFAGGVGQVPEYSG 433
Query: 259 ---SKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIA 315
I+ +L+ + +++ R++ Y+ +KFCI P G R+
Sbjct: 434 GVRQAIKGLLSSLTPKPEDVEFFEGRVHN------YKELLQSSKFCIAPYGFGWG-LRLI 486
Query: 316 DSIHYGCVPVILSNYYDLPFN---DILDWRKFAVILNERDVYQLKQILKNISDAEFVSLH 372
+I YGC+P+I+ ++ PF D L + +F+V + D+ + ++L++ ++A+ L
Sbjct: 487 QAIEYGCIPLIIQDHVYQPFERPKDFLPYEEFSVRMGLVDIPYMIELLRSYTEAQLAQLR 546
Query: 373 NNLVKVQKHFQWN 385
+ K + F WN
Sbjct: 547 LGMAKYYQAFIWN 559
>gi|327280286|ref|XP_003224883.1| PREDICTED: exostosin-1-like [Anolis carolinensis]
Length = 753
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 81/334 (24%), Positives = 137/334 (41%), Gaps = 67/334 (20%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
FK+Y+YP Q K+ Y + + SRF T DP +A LF + +
Sbjct: 120 GFKVYVYPQ-------QKGEKIAESYQN---ILAAVEGSRFYTSDPSQACLFILSLDTL- 168
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYP---YWNRTLGADHFFVTCHD-VGVRATEGVPFL 203
+ + YV +L +K WN G +H + TE V F
Sbjct: 169 ---------DRDQLSPQYVHNLRSKVQGLHLWNN--GRNHLVFNLYSGTWPDYTEDVGFD 217
Query: 204 IKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVE------------------ 245
I +A+ S S + F P+ DV++P + PR G +
Sbjct: 218 IGHAMLAKASISTE-NFRPNFDVSIPLFSKDH--PRTGGEKGFLRFNTIPPLRKYMLVFK 274
Query: 246 -NRTVLGFWAGHRNSKIRV------IL------ARVWENDTELDISNNRINRAIGPLVYQ 292
R + G + RN+ V +L + W+ D +R N Y+
Sbjct: 275 GKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHK--DSRCDRDNAEYEKYDYR 332
Query: 293 RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERD 352
+ FC+ P G ++ S R +++ CVPV+LSN ++LPF++++DW + AVI +ER
Sbjct: 333 EMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWNQAAVIGDERL 392
Query: 353 VYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
+ Q+ +++I + ++L + Q F W +
Sbjct: 393 LLQIPSTIRSIHQDKILAL-----RQQTQFLWEA 421
>gi|159490314|ref|XP_001703124.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158270754|gb|EDO96589.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 490
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 79/327 (24%), Positives = 124/327 (37%), Gaps = 61/327 (18%)
Query: 118 YFFQNIRESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTV---IVKDYVDSLINKYP 174
+F+Q + S D D+A ++IPI + MT + + V + YP
Sbjct: 130 HFYQRLLGSGARIADGDKADWYYIPIR-----------QRMTADSRFLSEAVAYISATYP 178
Query: 175 YWNRTLGADHFFVTCHDVGVRATE-GVPFLIKNAIRVV--------CSPSYDVAFIPHKD 225
+WNRT G+ HF + D+G T+ G N + + A P KD
Sbjct: 179 WWNRTGGSRHFVIHTGDLGADETQLGARLQAPNITWLTHWGLTMDKVFSGWKKAHRPDKD 238
Query: 226 VALPQVLQP-----FALPRGG----RDVENRTVLGFWAGHRNSKIRVILARVWENDTELD 276
V +P L P F L R D + RT F+AG + W N
Sbjct: 239 VVIPVFLTPGHFKHFGLERTPLHPLMDKQERTTTFFFAGRICGDRKPPKTGSWPN---CG 295
Query: 277 ISNNRINRAIGPLV------------------YQRRFYKTKFCICPGGSQVNSARIADSI 318
+ + + LV Y +KFC+ P G +I S
Sbjct: 296 PRSPGYSAGVRQLVHHHHWDPPGFKVVLHEPNYGAALGSSKFCLAPLGGGHGQRQIIVSF 355
Query: 319 HYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKV 378
GC+PV +++ PF +W +F V E D+ +L IL+++S E+ + L
Sbjct: 356 -MGCLPVCIADDVYEPFEPQYNWTQFGVRPAESDIPELHTILESVSAKEYAAKQRALRCA 414
Query: 379 QKHFQWNS-------PPVRFDAFHMVM 398
+HF ++S R+DAF +
Sbjct: 415 AQHFVYSSIVGGLFGEDGRYDAFETTL 441
>gi|348588241|ref|XP_003479875.1| PREDICTED: exostosin-1 [Cavia porcellus]
Length = 746
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 87/380 (22%), Positives = 150/380 (39%), Gaps = 69/380 (18%)
Query: 45 YPLQNNFNSTLSFAINKQDYSDLGSDVFHS-----PSVFHLNYLEMVTNFKIYIYPDGDP 99
+P + + N S ++ + D S ++ S F + FK+Y+YP
Sbjct: 66 FPWEQSENEDSSVHMSPRQKRDANSSIYKGKKCRMESCFDFTLCKK-NGFKVYVYPQ--- 121
Query: 100 NTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMT 159
Q K+ Y + I SRF T DP +A LF + + +
Sbjct: 122 ----QKGEKIAESYQN---ILAAIEGSRFYTSDPSQACLFVLSLDTL----------DRD 164
Query: 160 VIVKDYVDSLINKYP---YWNRTLGADHFFVTCHD-VGVRATEGVPFLIKNAIRVVCSPS 215
+ YV +L +K WN G +H + TE V F I A+ S S
Sbjct: 165 QLSPQYVHNLRSKVQSLHLWNN--GRNHLIFNLYSGTWPDYTEDVGFDIGQAMLAKASIS 222
Query: 216 YDVAFIPHKDVALPQVLQPFALPRGGRDV-----------------ENRTVLGFWAGHRN 258
+ F P+ DV++P + G R R + G + RN
Sbjct: 223 TE-NFRPNFDVSIPLFSKDHPRTGGERGFLKFNTIPPLRKYMLVFKGKRYLTGIGSDTRN 281
Query: 259 SKIRV-----IL-------ARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGG 306
+ V +L + W+ D +R N Y+ + FC+ P G
Sbjct: 282 ALYHVHNGEDVLLLTTCKHGKDWQKHK--DSRCDRDNTEYEKYDYREMLHNATFCLVPRG 339
Query: 307 SQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDA 366
++ S R +++ CVPV+LSN ++LPF+++++W + AVI +ER + Q+ +++I
Sbjct: 340 RRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD 399
Query: 367 EFVSLHNNLVKVQKHFQWNS 386
+ ++L + Q F W +
Sbjct: 400 KILAL-----RQQTQFLWEA 414
>gi|226526925|gb|ACO71282.1| exostoses 1 (predicted) [Dasypus novemcinctus]
Length = 744
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 81/332 (24%), Positives = 134/332 (40%), Gaps = 63/332 (18%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
FK+Y+YP Q K+ Y + I SRF T DP +A LF + +
Sbjct: 111 GFKVYVYPQ-------QKGEKIAESYQN---ILAAIEGSRFYTSDPSQACLFVLSLDTL- 159
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYP---YWNRTLGADHFFVTCHD-VGVRATEGVPFL 203
+ + YV +L +K WN G +H + TE V F
Sbjct: 160 ---------DRDQLSPQYVHNLRSKVQSLHLWNN--GRNHLIFNLYSGTWPDYTEDVGFD 208
Query: 204 IKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDV-----------------EN 246
I A+ S S + F P+ DV++P + G R
Sbjct: 209 IGQAMLAKASISTE-NFRPNFDVSIPLFSKDHPRTGGERGFLKFNTIPPLRKYMLVFKGK 267
Query: 247 RTVLGFWAGHRNSKIRV-----IL-------ARVWENDTELDISNNRINRAIGPLVYQRR 294
R + G + RN+ V IL + W+ D +R N Y+
Sbjct: 268 RYLTGIGSDTRNALYHVHNGEDILLLTTCKHGKDWQKHK--DSRCDRDNTEYEKYDYREM 325
Query: 295 FYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVY 354
+ FC+ P G ++ S R +++ CVPV+LSN ++LPF+++++W + AVI +ER +
Sbjct: 326 LHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLL 385
Query: 355 QLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
Q+ +++I + ++L + Q F W +
Sbjct: 386 QIPSTIRSIHQDKILAL-----RQQTQFLWEA 412
>gi|307107123|gb|EFN55367.1| hypothetical protein CHLNCDRAFT_52572 [Chlorella variabilis]
Length = 815
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 81/346 (23%), Positives = 138/346 (39%), Gaps = 79/346 (22%)
Query: 110 TGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SC-----------------HKM-R 149
+G Y +E + + +S+ T DP+ A F+IP+ SC HK+ R
Sbjct: 414 SGGYLAETGLHEALLQSKHRTLDPEEADYFYIPVYSSCYMYPIHGFADTPFFHAFHKIPR 473
Query: 150 GKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIR 209
T+ NM + V + L +PYW+R+ G DH + HD G + +P +++ A
Sbjct: 474 VHATT--NMLIEVYHW---LRAHHPYWDRSGGRDHIILQSHDEG---SCWLPAVLRPATM 525
Query: 210 VVCSPSYDVA-----------------------------------FIPHKDVALPQVLQP 234
+ D+ + P KD+ +P + P
Sbjct: 526 LTHWGRMDLGHTSSTGYIDDVYSRPARHPIYMPEGTEGKLGDFPCYDPAKDLVVPPMTSP 585
Query: 235 FAL---PRGGRDVENRTVLGFWAG--HRNSK-----IRVILARVWENDTELDISNNRINR 284
P G NRT L F+ G +N+K IR L + + I
Sbjct: 586 LKYELSPLVGAFTRNRTTLAFFKGRTQQNNKPYSRGIRQTLENLCRDKDWWGKFKIWIGE 645
Query: 285 AIGP---LVYQRRFYKTKFC-ICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILD 340
P Y + + FC + PG S R D++ +GC+PVI+ + L F I+D
Sbjct: 646 GNPPDMDRTYSQLLASSTFCFVLPGDG--FSPRFEDAVQHGCLPVIIQDEVHLAFESIID 703
Query: 341 WRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
+RKF V + ++D+ ++ +IL I + ++ L V + + +
Sbjct: 704 YRKFVVRIQQKDMERVPEILGAIPPEKVQTMQKALATVWRKWSYTG 749
>gi|159470363|ref|XP_001693329.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158277587|gb|EDP03355.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 626
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 77/292 (26%), Positives = 129/292 (44%), Gaps = 44/292 (15%)
Query: 130 THDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTC 189
T +P A+LF++P+ + G T ++V S ++P+W+R+ G DHF+
Sbjct: 289 TENPWEANLFYVPMLLYFYIGNVRDAVPQTAWAINHVRS---RWPFWDRSGGRDHFYFM- 344
Query: 190 HDVGVRATEGVPFLIKN-AIRVV----CSPSYDVAFIPHKDVALPQVLQPFALP------ 238
G R T +P +++ AI+VV D + +KD A Q+ + +P
Sbjct: 345 --TGDRGTCHLPRELQDQAIKVVHWGMQVAGTDWIGLDNKDYACIQLKRDLVVPPINMFA 402
Query: 239 ----------------RGGRDVENRTVLGFWAGHRNSKIRV---ILARVWENDTELDISN 279
GG+D RT+L F+AG + + E T + I+N
Sbjct: 403 EILPTDTVKHYQTVVANGGQDF-GRTLLFFFAGGIAQSMEYSGGTRQAIKELLTSVHIAN 461
Query: 280 NRINRAIGPLV------YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDL 333
A V Y++ +KFCI P G R+ +I +GC+PVI+ ++
Sbjct: 462 GNSTPADVVFVEGRTQEYKKLLLTSKFCIAPYGFGWG-LRLVQAIEFGCIPVIIQDHVYQ 520
Query: 334 PFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWN 385
F D L + +F+V L RDV +L IL++ S + +L + K + F W+
Sbjct: 521 AFEDFLPYEEFSVRLPLRDVPRLLDILRSYSPEQQAALRLGMAKYYRAFVWH 572
>gi|302852377|ref|XP_002957709.1| hypothetical protein VOLCADRAFT_119761 [Volvox carteri f.
nagariensis]
gi|300257003|gb|EFJ41258.1| hypothetical protein VOLCADRAFT_119761 [Volvox carteri f.
nagariensis]
Length = 1481
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 80/332 (24%), Positives = 133/332 (40%), Gaps = 57/332 (17%)
Query: 113 YASEGYFFQNIRESRFLTHDPDRAHLFFIP--ISCHK------------MRGKGTSYENM 158
YA++ + + S T DP+ A F++P SC + G M
Sbjct: 350 YAADTLLHELLLISEHRTFDPEEADFFYVPHSASCLPFPMGSWADYPWFLGPGGPRIRQM 409
Query: 159 TVIVKDYVDSLINKYPYWNRTLGADHFFVTCHD---------------------VGVRAT 197
++++ VD + YP+W R G DH ++ HD +G+
Sbjct: 410 VNMLREVVDWIDKTYPFWRRRGGRDHIWLFTHDEGACWAPKVLENSTWLTHWGRMGLEHR 469
Query: 198 EGVPFLIKNAIRVVCSPSYDVAFIPH----------KDVALPQVLQPF---ALPRGGRDV 244
G FL SP F+ H KD+ +P QP + P G
Sbjct: 470 SGTAFLADKYDIDFVSPHQPEGFLTHIKGHPCYDSTKDLVVPAFKQPRHYRSSPLLGSAT 529
Query: 245 ENRTVLGFWAGHRNSKIRVILAR-VWENDTELDISNN--RINRAIGPLVYQRRFY----- 296
+ R + F+ G +R V + +L + NN N IG R Y
Sbjct: 530 KQRDIFLFFRGDVGKHRMAHYSRGVRQKLYKLSVENNWKSKNVLIGGTHEVRGEYSDLLS 589
Query: 297 KTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQL 356
+++FC+ G SAR+ D++ +GC+PVI+ + + F IL+ FAV ++E+ + Q+
Sbjct: 590 RSQFCLVAAGDGW-SARLEDAVLHGCIPVIVIDEVHVVFESILNVDSFAVRIDEQQLPQI 648
Query: 357 KQILKNISDAEFVSLHNNLVKVQKHFQWNSPP 388
IL I + + + +L V F++ S P
Sbjct: 649 LDILAAIPERKIRAKQAHLGHVWHRFRYGSLP 680
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 68/294 (23%), Positives = 117/294 (39%), Gaps = 31/294 (10%)
Query: 113 YASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHKM--------------RGKGTSYENM 158
Y +E + + S T DP+ A F++P M G +
Sbjct: 1043 YGAESALHEYLLLSEHRTFDPEEADFFYVPYYGTCMIWPVLHWADFPYFHTTGGPRILQV 1102
Query: 159 TVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDV 218
++ D VD + YP+W R G DH F+ HD G P ++ NA + D+
Sbjct: 1103 INMLIDTVDWINKMYPFWGRRGGRDHIFLFPHDEGACW---APNVLVNATWLTHWGRTDM 1159
Query: 219 AFIPHKDVALPQVLQPFALPRGGRD-VENRTVLGFWAGHRNSKI--RVILARVWENDTEL 275
+ + F RD V R GF R V + R+ + +
Sbjct: 1160 ---------IHESKTSFDADNYTRDYVGWRQPGGFVNLIRGHPCYDPVKIYRLAKENNWQ 1210
Query: 276 DISNNRINRAIG-PLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLP 334
D N I A P Y ++ FC+ G SAR D++ +GC+PVI+ + +
Sbjct: 1211 DKHNILIGDAADVPGDYSDLLSRSLFCLVATGDGW-SARTEDAVLHGCIPVIIIDGVHIK 1269
Query: 335 FNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPP 388
F + +F++ + E + ++ +ILK I + S+ +L +V +++ + P
Sbjct: 1270 FETVFSVDEFSIRIPEANASRILEILKEIPKTKIRSIQAHLGRVWHRYRYANLP 1323
>gi|350537983|ref|NP_001233696.1| exostosin-1 [Cricetulus griseus]
gi|20138355|sp|Q9JK82.1|EXT1_CRIGR RecName: Full=Exostosin-1; AltName: Full=Heparan sulfate
copolymerase; AltName: Full=Multiple exostoses protein 1
homolog
gi|7960285|gb|AAF71276.1|AF252858_1 exostosin 1 [Cricetulus griseus]
Length = 746
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 80/332 (24%), Positives = 134/332 (40%), Gaps = 63/332 (18%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
FK+Y+YP Q K+ Y + I SRF T DP +A LF + +
Sbjct: 113 GFKVYVYPQ-------QKGEKIAESYQN---ILAAIEGSRFYTSDPSQACLFVLSLDTL- 161
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYP---YWNRTLGADHFFVTCHD-VGVRATEGVPFL 203
+ + YV +L +K WN G +H + TE V F
Sbjct: 162 ---------DRDQLSPQYVHNLRSKVQSLHLWNN--GRNHLIFNLYSGTWPDYTEDVGFD 210
Query: 204 IKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDV-----------------EN 246
I A+ S S + F P+ DV++P + G R
Sbjct: 211 IGQAMLAKASISTE-NFRPNFDVSIPLFSKDHPRTGGERGFLKFNTIPPLRKYMLVFKGK 269
Query: 247 RTVLGFWAGHRNSKIRV-----IL-------ARVWENDTELDISNNRINRAIGPLVYQRR 294
R + G + RN+ V +L + W+ D +R N Y+
Sbjct: 270 RYLTGIGSDTRNALYHVHNGEDVLLLTTCKHGKDWQKHK--DSRCDRDNTEYEKYDYREM 327
Query: 295 FYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVY 354
+ FC+ P G ++ S R +++ CVPV+LSN ++LPF+++++W + AVI +ER +
Sbjct: 328 LHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLL 387
Query: 355 QLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
Q+ +++I + ++L + Q F W +
Sbjct: 388 QIPSTIRSIHQDKILAL-----RQQTQFLWEA 414
>gi|357141481|ref|XP_003572240.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
[Brachypodium distachyon]
Length = 543
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 78/305 (25%), Positives = 137/305 (44%), Gaps = 49/305 (16%)
Query: 135 RAHLFFIPISCH-------KMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFV 187
+A +FF+P K++GK N +++ + + + W R G DH V
Sbjct: 242 QADVFFVPFFASLSYNRHSKLQGKEKMSRNR--LLQAELVKYLARQEEWRRWGGKDHLVV 299
Query: 188 TCHDVGVRATEGVPFLIKNAIRVVCSPSY---DVAFIPHKDVA--LPQVLQPFA-LPRGG 241
H P + A + + + Y D P DVA V+ P+ + R
Sbjct: 300 PHH----------PNSMMQARKKLSAAMYVLSDFGRYP-PDVANLKKDVVAPYKHVVRSL 348
Query: 242 RDVEN-----RTVLGFWAG--HR--NSKIRVILARVWENDTEL-----DISNNRINRAIG 287
RD E+ R VL ++ G HR K+R L ++ +++ ++ + N I RA
Sbjct: 349 RDDESPTFDQRPVLAYFQGAIHRKDGGKVRQKLYQLLKDEKDVHFTYGSVRQNGIRRAT- 407
Query: 288 PLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVI 347
+ +KFC+ G +S R+ D+I CVPV++S+ +LPF D+LD+ +F V
Sbjct: 408 -----KGMASSKFCLNIAGDTPSSNRLFDAIVSHCVPVMISDDIELPFEDVLDYSEFCVF 462
Query: 348 LNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLR 404
+ D + L ++L+ I+ E+ ++ L +V HF++ P DA M+ + +
Sbjct: 463 VRASDAVRKGFLLRLLRGITRDEWNTMWERLKEVAHHFEYQYPSKPDDAVQMIWGAVARK 522
Query: 405 RHVIK 409
H +K
Sbjct: 523 MHSLK 527
>gi|426235700|ref|XP_004011818.1| PREDICTED: exostosin-1 [Ovis aries]
Length = 746
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 80/332 (24%), Positives = 134/332 (40%), Gaps = 63/332 (18%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
FK+Y+YP Q K+ Y + I SRF T DP +A LF + +
Sbjct: 113 GFKVYVYPQ-------QKGEKIAESYQN---ILAAIEGSRFYTSDPSQACLFVLSLDTL- 161
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYP---YWNRTLGADHFFVTCHD-VGVRATEGVPFL 203
+ + YV +L +K WN G +H + TE V F
Sbjct: 162 ---------DRDQLSPQYVHNLRSKVQSLHLWNN--GRNHLIFNLYSGTWPDYTEDVGFD 210
Query: 204 IKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDV-----------------EN 246
I A+ S S + F P+ DV++P + G R
Sbjct: 211 IGQAMLAKASISTE-NFRPNFDVSIPLFSKDHPRTGGERGFLKFNTIPPLRKYMLVFKGK 269
Query: 247 RTVLGFWAGHRNSKIRV------IL------ARVWENDTELDISNNRINRAIGPLVYQRR 294
R + G + RN+ V +L + W+ D +R N Y+
Sbjct: 270 RYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHK--DSRCDRDNTEYEKYDYREM 327
Query: 295 FYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVY 354
+ FC+ P G ++ S R +++ CVPV+LSN ++LPF+++++W + AVI +ER +
Sbjct: 328 LHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLL 387
Query: 355 QLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
Q+ +++I + ++L + Q F W +
Sbjct: 388 QIPSTIRSIHQDKILAL-----RQQTQFLWEA 414
>gi|147902262|ref|NP_001091564.1| exostosin-1 [Bos taurus]
gi|238686639|sp|A5D7I4.1|EXT1_BOVIN RecName: Full=Exostosin-1; AltName:
Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
acetylglucosaminyl-proteoglycan
4-alpha-N-acetylglucosaminyltransferase; AltName:
Full=Multiple exostoses protein 1 homolog
gi|146186820|gb|AAI40569.1| EXT1 protein [Bos taurus]
gi|296480555|tpg|DAA22670.1| TPA: exostosin-1 [Bos taurus]
Length = 746
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 80/332 (24%), Positives = 134/332 (40%), Gaps = 63/332 (18%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
FK+Y+YP Q K+ Y + I SRF T DP +A LF + +
Sbjct: 113 GFKVYVYPQ-------QKGEKIAESYQN---ILAAIEGSRFYTSDPSQACLFVLSLDTL- 161
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYP---YWNRTLGADHFFVTCHD-VGVRATEGVPFL 203
+ + YV +L +K WN G +H + TE V F
Sbjct: 162 ---------DRDQLSPQYVHNLRSKVQSLHLWNN--GRNHLIFNLYSGTWPDYTEDVGFD 210
Query: 204 IKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDV-----------------EN 246
I A+ S S + F P+ DV++P + G R
Sbjct: 211 IGQAMLAKASISTE-NFRPNFDVSIPLFSKDHPRTGGERGFLKFNTIPPLRKYMLVFKGK 269
Query: 247 RTVLGFWAGHRNSKIRV------IL------ARVWENDTELDISNNRINRAIGPLVYQRR 294
R + G + RN+ V +L + W+ D +R N Y+
Sbjct: 270 RYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHK--DSRCDRDNTEYEKYDYREM 327
Query: 295 FYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVY 354
+ FC+ P G ++ S R +++ CVPV+LSN ++LPF+++++W + AVI +ER +
Sbjct: 328 LHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLL 387
Query: 355 QLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
Q+ +++I + ++L + Q F W +
Sbjct: 388 QIPSTIRSIHQDKILAL-----RQQTQFLWEA 414
>gi|395818023|ref|XP_003782438.1| PREDICTED: exostosin-1 [Otolemur garnettii]
gi|197215619|gb|ACH53015.1| exostosin 1 (predicted) [Otolemur garnettii]
Length = 746
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 80/332 (24%), Positives = 134/332 (40%), Gaps = 63/332 (18%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
FK+Y+YP Q K+ Y + I SRF T DP +A LF + +
Sbjct: 113 GFKVYVYPQ-------QKGEKIAESYQN---ILAAIEGSRFYTSDPSQACLFVLSLDTL- 161
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYP---YWNRTLGADHFFVTCHD-VGVRATEGVPFL 203
+ + YV +L +K WN G +H + TE V F
Sbjct: 162 ---------DRDQLSPQYVHNLRSKVQSLHLWNN--GRNHLIFNLYSGTWPDYTEDVGFD 210
Query: 204 IKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDV-----------------EN 246
I A+ S S + F P+ DV++P + G R
Sbjct: 211 IGQAMLAKASISTE-NFRPNFDVSIPLFSKDHPRTGGERGFLRFNTIPPLRKYMLVFKGK 269
Query: 247 RTVLGFWAGHRNSKIRV------IL------ARVWENDTELDISNNRINRAIGPLVYQRR 294
R + G + RN+ V +L + W+ D +R N Y+
Sbjct: 270 RYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHK--DSRCDRDNMEYEKYDYREM 327
Query: 295 FYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVY 354
+ FC+ P G ++ S R +++ CVPV+LSN ++LPF+++++W + AVI +ER +
Sbjct: 328 LHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLL 387
Query: 355 QLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
Q+ +++I + ++L + Q F W +
Sbjct: 388 QIPSTIRSIHQDKILAL-----RQQTQFLWEA 414
>gi|148697308|gb|EDL29255.1| exostoses (multiple) 1, isoform CRA_b [Mus musculus]
Length = 566
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 77/329 (23%), Positives = 132/329 (40%), Gaps = 57/329 (17%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
FK+Y+YP Q K+ Y + I SRF T DP +A LF + +
Sbjct: 127 GFKVYVYPQ-------QKGEKIAESYQN---ILAAIEGSRFYTSDPSQACLFVLSLDT-- 174
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHD-VGVRATEGVPFLIKN 206
+ ++ + S + WN G +H + TE V F I
Sbjct: 175 -----LDRDQLSPQYVHNLRSKVQSLHLWNN--GRNHLIFNLYSGTWPDYTEDVGFDIGQ 227
Query: 207 AIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDV-----------------ENRTV 249
A+ S S + F P+ DV++P + G R R +
Sbjct: 228 AMLAKASISTE-NFRPNFDVSIPLFSKDHPRTGGERGFLKFNTIPPLRKYMLVFKGKRYL 286
Query: 250 LGFWAGHRNSKIRV-----IL-------ARVWENDTELDISNNRINRAIGPLVYQRRFYK 297
G + RN+ V +L + W+ D +R N Y+ +
Sbjct: 287 TGIGSDTRNALYHVHNGEDVLLLTTCKHGKDWQKHK--DSRCDRDNTEYEKYDYREMLHN 344
Query: 298 TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLK 357
FC+ P G ++ S R +++ CVPV+LSN ++LPF+++++W + AVI +ER + Q+
Sbjct: 345 ATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIP 404
Query: 358 QILKNISDAEFVSLHNNLVKVQKHFQWNS 386
+++I + ++L + Q F W +
Sbjct: 405 STIRSIHQDKILAL-----RQQTQFLWEA 428
>gi|73974379|ref|XP_539145.2| PREDICTED: exostosin-1 isoform 1 [Canis lupus familiaris]
Length = 746
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 80/332 (24%), Positives = 134/332 (40%), Gaps = 63/332 (18%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
FK+Y+YP Q K+ Y + I SRF T DP +A LF + +
Sbjct: 113 GFKVYVYPQ-------QKGEKIAESYQN---ILAAIEGSRFYTSDPSQACLFVLSLDTL- 161
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYP---YWNRTLGADHFFVTCHD-VGVRATEGVPFL 203
+ + YV +L +K WN G +H + TE V F
Sbjct: 162 ---------DRDQLSPQYVHNLRSKVQSLHLWNN--GRNHLIFNLYSGTWPDYTEDVGFD 210
Query: 204 IKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDV-----------------EN 246
I A+ S S + F P+ DV++P + G R
Sbjct: 211 IGQAMLAKASISTE-NFRPNFDVSIPLFSKDHPRTGGERGFLKFNTIPPLRKYMLVFKGK 269
Query: 247 RTVLGFWAGHRNSKIRV------IL------ARVWENDTELDISNNRINRAIGPLVYQRR 294
R + G + RN+ V +L + W+ D +R N Y+
Sbjct: 270 RYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHK--DSRCDRDNTEYEKYDYREM 327
Query: 295 FYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVY 354
+ FC+ P G ++ S R +++ CVPV+LSN ++LPF+++++W + AVI +ER +
Sbjct: 328 LHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLL 387
Query: 355 QLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
Q+ +++I + ++L + Q F W +
Sbjct: 388 QIPSTIRSIHQDKILAL-----RQQTQFLWEA 414
>gi|166183792|gb|ABY84155.1| exostosin 1 (predicted) [Callithrix jacchus]
Length = 746
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 80/332 (24%), Positives = 134/332 (40%), Gaps = 63/332 (18%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
FK+Y+YP Q K+ Y + I SRF T DP +A LF + +
Sbjct: 113 GFKVYVYPQ-------QKGEKIAESYQN---ILAAIEGSRFYTSDPSQACLFVLSLDTL- 161
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYP---YWNRTLGADHFFVTCHD-VGVRATEGVPFL 203
+ + YV +L +K WN G +H + TE V F
Sbjct: 162 ---------DRDQLSPQYVHNLRSKVQSLHLWNN--GRNHLIFNLYSGTWPDYTEDVGFD 210
Query: 204 IKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDV-----------------EN 246
I A+ S S + F P+ DV++P + G R
Sbjct: 211 IGQAMLAKASISTE-NFRPNFDVSIPLFSKDHPRTGGERGFLKFNTIPPLRKYMLVFKGK 269
Query: 247 RTVLGFWAGHRNSKIRV------IL------ARVWENDTELDISNNRINRAIGPLVYQRR 294
R + G + RN+ V +L + W+ D +R N Y+
Sbjct: 270 RYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHK--DSRCDRDNTEYEKYDYREM 327
Query: 295 FYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVY 354
+ FC+ P G ++ S R +++ CVPV+LSN ++LPF+++++W + AVI +ER +
Sbjct: 328 LHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLL 387
Query: 355 QLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
Q+ +++I + ++L + Q F W +
Sbjct: 388 QIPSTIRSIHQDKILAL-----RQQTQFLWEA 414
>gi|281183009|ref|NP_001162444.1| exostosin-1 [Papio anubis]
gi|384475833|ref|NP_001245062.1| exostosin 1 [Macaca mulatta]
gi|114621433|ref|XP_001141496.1| PREDICTED: exostosin-1 isoform 1 [Pan troglodytes]
gi|397505670|ref|XP_003823375.1| PREDICTED: exostosin-1 [Pan paniscus]
gi|238687365|sp|A9X1C8.1|EXT1_PAPAN RecName: Full=Exostosin-1; AltName:
Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
acetylglucosaminyl-proteoglycan
4-alpha-N-acetylglucosaminyltransferase; AltName:
Full=Multiple exostoses protein 1 homolog
gi|163781066|gb|ABY40823.1| exostoses 1 (predicted) [Papio anubis]
gi|355698182|gb|EHH28730.1| Exostosin-1 [Macaca mulatta]
gi|383410151|gb|AFH28289.1| exostosin-1 [Macaca mulatta]
gi|387541472|gb|AFJ71363.1| exostosin-1 [Macaca mulatta]
gi|410224816|gb|JAA09627.1| exostosin 1 [Pan troglodytes]
gi|410259738|gb|JAA17835.1| exostosin 1 [Pan troglodytes]
gi|410302704|gb|JAA29952.1| exostosin 1 [Pan troglodytes]
gi|410353629|gb|JAA43418.1| exostosin 1 [Pan troglodytes]
Length = 746
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 80/332 (24%), Positives = 134/332 (40%), Gaps = 63/332 (18%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
FK+Y+YP Q K+ Y + I SRF T DP +A LF + +
Sbjct: 113 GFKVYVYPQ-------QKGEKIAESYQN---ILAAIEGSRFYTSDPSQACLFVLSLDTL- 161
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYP---YWNRTLGADHFFVTCHD-VGVRATEGVPFL 203
+ + YV +L +K WN G +H + TE V F
Sbjct: 162 ---------DRDQLSPQYVHNLRSKVQSLHLWNN--GRNHLIFNLYSGTWPDYTEDVGFD 210
Query: 204 IKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDV-----------------EN 246
I A+ S S + F P+ DV++P + G R
Sbjct: 211 IGQAMLAKASISTE-NFRPNFDVSIPLFSKDHPRTGGERGFLKFNTIPPLRKYMLVFKGK 269
Query: 247 RTVLGFWAGHRNSKIRV------IL------ARVWENDTELDISNNRINRAIGPLVYQRR 294
R + G + RN+ V +L + W+ D +R N Y+
Sbjct: 270 RYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHK--DSRCDRDNTEYEKYDYREM 327
Query: 295 FYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVY 354
+ FC+ P G ++ S R +++ CVPV+LSN ++LPF+++++W + AVI +ER +
Sbjct: 328 LHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLL 387
Query: 355 QLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
Q+ +++I + ++L + Q F W +
Sbjct: 388 QIPSTIRSIHQDKILAL-----RQQTQFLWEA 414
>gi|403283504|ref|XP_003933159.1| PREDICTED: exostosin-1 [Saimiri boliviensis boliviensis]
Length = 746
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 80/332 (24%), Positives = 134/332 (40%), Gaps = 63/332 (18%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
FK+Y+YP Q K+ Y + I SRF T DP +A LF + +
Sbjct: 113 GFKVYVYPQ-------QKGEKIAESYQN---ILAAIEGSRFYTSDPSQACLFVLSLDTL- 161
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYP---YWNRTLGADHFFVTCHD-VGVRATEGVPFL 203
+ + YV +L +K WN G +H + TE V F
Sbjct: 162 ---------DRDQLSPQYVHNLRSKVQSLHLWNN--GRNHLIFNLYSGTWPDYTEDVGFD 210
Query: 204 IKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDV-----------------EN 246
I A+ S S + F P+ DV++P + G R
Sbjct: 211 IGQAMLAKASISTE-NFRPNFDVSIPLFSKDHPRTGGERGFLKFNTIPPLRKYMLVFKGK 269
Query: 247 RTVLGFWAGHRNSKIRV------IL------ARVWENDTELDISNNRINRAIGPLVYQRR 294
R + G + RN+ V +L + W+ D +R N Y+
Sbjct: 270 RYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHK--DSRCDRDNTEYEKYDYREM 327
Query: 295 FYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVY 354
+ FC+ P G ++ S R +++ CVPV+LSN ++LPF+++++W + AVI +ER +
Sbjct: 328 LHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLL 387
Query: 355 QLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
Q+ +++I + ++L + Q F W +
Sbjct: 388 QIPSTIRSIHQDKILAL-----RQQTQFLWEA 414
>gi|149721624|ref|XP_001496484.1| PREDICTED: exostosin-1 [Equus caballus]
Length = 746
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 77/329 (23%), Positives = 132/329 (40%), Gaps = 57/329 (17%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
FK+Y+YP Q K+ Y + I SRF T DP +A LF + +
Sbjct: 113 GFKVYVYPQ-------QKGEKIAESYQN---ILAAIEGSRFYTSDPSQACLFVLSLDT-- 160
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHD-VGVRATEGVPFLIKN 206
+ ++ + S + WN G +H + TE V F I
Sbjct: 161 -----LDRDQLSPQYVHNLRSKVQSLHLWNN--GRNHLIFNLYSGTWPDYTEDVGFDIGQ 213
Query: 207 AIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDV-----------------ENRTV 249
A+ S S + F P+ DV++P + G R R +
Sbjct: 214 AMLAKASISTE-NFRPNFDVSIPLFSKDHPRTGGERGFLKFNTIPPLRKYMLVFKGKRYL 272
Query: 250 LGFWAGHRNSKIRV-----IL-------ARVWENDTELDISNNRINRAIGPLVYQRRFYK 297
G + RN+ V +L + W+ D +R N Y+ +
Sbjct: 273 TGIGSDTRNALYHVHNGEDVLLLTTCKHGKDWQKHK--DSRCDRDNTEYEKYDYREMLHN 330
Query: 298 TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLK 357
FC+ P G ++ S R +++ CVPV+LSN ++LPF+++++W + AVI +ER + Q+
Sbjct: 331 ATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIP 390
Query: 358 QILKNISDAEFVSLHNNLVKVQKHFQWNS 386
+++I + ++L + Q F W +
Sbjct: 391 STIRSIHQDKILAL-----RQQTQFLWEA 414
>gi|326525254|dbj|BAK07897.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 440
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 71/286 (24%), Positives = 125/286 (43%), Gaps = 28/286 (9%)
Query: 132 DPDRAHLFFIPI------SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHF 185
DP A FF+P + H E ++ + +D ++ K YW R+ G DH
Sbjct: 136 DPAAAEAFFVPFFSSLSFNVHGRNMTDPDTEADRLLQVELMD-ILGKSEYWQRSAGRDHV 194
Query: 186 FVTCHDVGVRATEGVPFLIKNAIRVVCSP----SYDVAFIPHKDVALP--QVLQPFALPR 239
H R + NA ++ S + ++A + KDV P V+ F
Sbjct: 195 IPMHHPNAFRFMRD----MVNASVLIVSDFGRYTKELASL-RKDVVAPYVHVVDSFLDDN 249
Query: 240 GGRDVENRTVLGFWAGH----RNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRF 295
E L F+ G KIR LA++ ++ + ++ + G +
Sbjct: 250 ASDPFEADPTLLFFRGRPVRKAEGKIRGKLAKILKDRDGVRFEDS-LAIGDGIKISTDGM 308
Query: 296 YKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDV-- 353
+KFC+ P G +S R+ D+I C+PVI+S+ +LPF D +D+ +F+ + +
Sbjct: 309 RSSKFCLHPAGDTPSSCRLFDAIVSHCIPVIISSRIELPFEDEIDYSEFSPFFSVEEALE 368
Query: 354 --YQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMV 397
Y L Q L+ + ++V + + L V H+++ PP + DA +M+
Sbjct: 369 PDYLLNQ-LRQMPKEKWVEMWSKLKNVSSHYEFQYPPRKDDAVNMI 413
>gi|46370066|ref|NP_000118.2| exostosin-1 [Homo sapiens]
gi|20141422|sp|Q16394.2|EXT1_HUMAN RecName: Full=Exostosin-1; AltName:
Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
acetylglucosaminyl-proteoglycan
4-alpha-N-acetylglucosaminyltransferase; AltName:
Full=Multiple exostoses protein 1; AltName:
Full=Putative tumor suppressor protein EXT1
gi|12654671|gb|AAH01174.1| Exostoses (multiple) 1 [Homo sapiens]
gi|119612378|gb|EAW91972.1| exostoses (multiple) 1 [Homo sapiens]
gi|123981930|gb|ABM82794.1| exostoses (multiple) 1 [synthetic construct]
gi|123996761|gb|ABM85982.1| exostoses (multiple) 1 [synthetic construct]
gi|189053697|dbj|BAG35949.1| unnamed protein product [Homo sapiens]
gi|261859910|dbj|BAI46477.1| exostoses (multiple) 1 [synthetic construct]
Length = 746
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 80/332 (24%), Positives = 134/332 (40%), Gaps = 63/332 (18%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
FK+Y+YP Q K+ Y + I SRF T DP +A LF + +
Sbjct: 113 GFKVYVYPQ-------QKGEKIAESYQN---ILAAIEGSRFYTSDPSQACLFVLSLDTL- 161
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYP---YWNRTLGADHFFVTCHD-VGVRATEGVPFL 203
+ + YV +L +K WN G +H + TE V F
Sbjct: 162 ---------DRDQLSPQYVHNLRSKVQSLHLWNN--GRNHLIFNLYSGTWPDYTEDVGFD 210
Query: 204 IKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDV-----------------EN 246
I A+ S S + F P+ DV++P + G R
Sbjct: 211 IGQAMLAKASISTE-NFRPNFDVSIPLFSKDHPRTGGERGFLKFNTIPPLRKYMLVFKGK 269
Query: 247 RTVLGFWAGHRNSKIRV------IL------ARVWENDTELDISNNRINRAIGPLVYQRR 294
R + G + RN+ V +L + W+ D +R N Y+
Sbjct: 270 RYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHK--DSRCDRDNTEYEKYDYREM 327
Query: 295 FYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVY 354
+ FC+ P G ++ S R +++ CVPV+LSN ++LPF+++++W + AVI +ER +
Sbjct: 328 LHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLL 387
Query: 355 QLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
Q+ +++I + ++L + Q F W +
Sbjct: 388 QIPSTIRSIHQDKILAL-----RQQTQFLWEA 414
>gi|326491429|dbj|BAJ94192.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 400
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 73/306 (23%), Positives = 130/306 (42%), Gaps = 41/306 (13%)
Query: 109 LTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP--ISCHKMRGKGTSYENMTVIVKDYV 166
L G++ ++ Q + SR+ T D D A+LFF+P + C +M G T E VK
Sbjct: 84 LKGQWGTQVKIHQLLLRSRYRTLDKDEANLFFVPSYVKCVRMTGGLTDKEINQTYVK--- 140
Query: 167 DSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYD-------VA 219
++++ PY+ R+ G DH FV G + +I + +P D A
Sbjct: 141 --VLSQMPYFRRSGGRDHIFVFPSGAGAHLFRSWATFLNRSI--ILTPEGDRTDKRGISA 196
Query: 220 FIPHKDVALPQVLQPFALPRGGRDVENRTVLGFWAGHRNSKIRVI-LARVWENDTELDIS 278
F KD+ +P G D V +++++ LA+ + + E
Sbjct: 197 FNTWKDIIIP----------GNVDDSMGKV---------GRLKLVELAKQYPDKLESPEL 237
Query: 279 NNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDI 338
+G + Y + KFC+ P G + R +S CVPV+LS+ +LPF ++
Sbjct: 238 KLSGPDKLGRIDYFKHLRNAKFCLAPRGESSWTLRFYESFFVECVPVLLSDEVELPFQNV 297
Query: 339 LDWRKFAVILNERDVY-QLKQILKNISDAEFVSL--HNNLVKVQKHFQWNSPPVRFDAFH 395
+D+ K ++ + +L Q L++I + + V+ + ++ P A
Sbjct: 298 IDYTKISIKWPASKIGPELFQYLESIPEERIEEMIARGREVRCMWVYALDTEPC--SAMT 355
Query: 396 MVMYEL 401
+M+EL
Sbjct: 356 AIMWEL 361
>gi|1168162|gb|AAB62283.1| putative tumour suppressor/hereditary multiple exostoses candidate
gene [Homo sapiens]
gi|1586817|prf||2204384A EXT1 gene
Length = 746
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 77/329 (23%), Positives = 132/329 (40%), Gaps = 57/329 (17%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
FK+Y+YP Q K+ Y + I SRF T DP +A LF + +
Sbjct: 113 GFKVYVYPQ-------QKGEKIAESYQN---ILAAIEGSRFYTSDPSQACLFVLSLDT-- 160
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHD-VGVRATEGVPFLIKN 206
+ ++ + S + WN G +H + TE V F I
Sbjct: 161 -----LDRDQLSPQYVHNLRSKVQSLHLWNN--GRNHLIFNLYSGTWPDYTEDVGFDIGQ 213
Query: 207 AIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDV-----------------ENRTV 249
A+ S S + F P+ DV++P + G R R +
Sbjct: 214 AMLAKASISTE-NFRPNFDVSIPLFSKDHPRTGGERGFLKFNTIPPLRKYMLVFKGKRYL 272
Query: 250 LGFWAGHRNSKIRV------IL------ARVWENDTELDISNNRINRAIGPLVYQRRFYK 297
G + RN+ V +L + W+ D +R N Y+ +
Sbjct: 273 TGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHK--DSRCDRDNTEYEKYDYREMLHN 330
Query: 298 TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLK 357
FC+ P G ++ S R +++ CVPV+LSN ++LPF+++++W + AVI +ER + Q+
Sbjct: 331 ATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIP 390
Query: 358 QILKNISDAEFVSLHNNLVKVQKHFQWNS 386
+++I + ++L + Q F W +
Sbjct: 391 STIRSIHQDKILAL-----RQQTQFLWEA 414
>gi|225457795|ref|XP_002265632.1| PREDICTED: probable glucuronosyltransferase Os03g0107900 [Vitis
vinifera]
gi|302142755|emb|CBI19958.3| unnamed protein product [Vitis vinifera]
Length = 459
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 84/352 (23%), Positives = 149/352 (42%), Gaps = 38/352 (10%)
Query: 88 NFKIYIYP----DGDPNTFYQ------TPRKLTGKYASEGYFFQNIRESRFLTHDPDRAH 137
+ KIY+Y DG + Y T +TG++ ++ + + +SRF T + A
Sbjct: 85 SLKIYVYEEDEIDGLKSLLYGRDGSIPTEVCVTGQWGTQVKIHRLLLKSRFRTRRKEEAD 144
Query: 138 LFFIP--ISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVR 195
LFF+P I C +M+G E + VK ++++ PY+ + G +H FV G
Sbjct: 145 LFFVPTYIKCVRMKGGLNDKEIDQMYVK-----VLSQMPYFRLSGGRNHIFVFPSGAGPH 199
Query: 196 ATEGVPFLIKNAIRVVCSPSYD-------VAFIPHKDVALPQVLQPFALPRGGRDVE--- 245
+ + +I + +P D AF KD+ +P + G V+
Sbjct: 200 LFKSWATYLNRSI--ILTPEGDRTDKKDTSAFNTWKDIIIPGNVADEMTTNGATFVQPLP 257
Query: 246 --NRTVLGFWAGHRNSKIR----VILARVWENDTELDISNNRINRAIGPLVYQRRFYKTK 299
R L + G K+ + LA+ + + E +G + Y K
Sbjct: 258 LSKRKFLANFLGRAQRKLGRLQLIELAKQYPDKLESPELQFSGPDKLGRIEYFHHLRNAK 317
Query: 300 FCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVY-QLKQ 358
FC P G + R +S CVPVILS+ +LPF +++D+ + ++ + QL +
Sbjct: 318 FCFAPRGESSWTLRFYESFFVECVPVILSDQVELPFQNVIDYTQVSIKWPSSQIGPQLLE 377
Query: 359 ILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIKY 410
L++I D + + +V+ + + S A +M+E L+R V K+
Sbjct: 378 YLESIPDKVIEEMISRGREVRCWWVYASESEPCSAMRGIMWE--LQRKVRKF 427
>gi|112807209|ref|NP_034292.2| exostosin-1 [Mus musculus]
gi|283837898|ref|NP_001124012.1| exostosin 1 [Rattus norvegicus]
gi|3023731|sp|P97464.1|EXT1_MOUSE RecName: Full=Exostosin-1; AltName:
Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
acetylglucosaminyl-proteoglycan
4-alpha-N-acetylglucosaminyltransferase; AltName:
Full=Multiple exostoses protein 1 homolog
gi|1813640|gb|AAB41728.1| Ext1 [Mus musculus]
gi|13435765|gb|AAH04741.1| Exostoses (multiple) 1 [Mus musculus]
gi|74205633|dbj|BAE21106.1| unnamed protein product [Mus musculus]
gi|148697307|gb|EDL29254.1| exostoses (multiple) 1, isoform CRA_a [Mus musculus]
gi|149066396|gb|EDM16269.1| similar to Ext1 [Rattus norvegicus]
Length = 746
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 80/332 (24%), Positives = 134/332 (40%), Gaps = 63/332 (18%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
FK+Y+YP Q K+ Y + I SRF T DP +A LF + +
Sbjct: 113 GFKVYVYPQ-------QKGEKIAESYQN---ILAAIEGSRFYTSDPSQACLFVLSLDTL- 161
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYP---YWNRTLGADHFFVTCHD-VGVRATEGVPFL 203
+ + YV +L +K WN G +H + TE V F
Sbjct: 162 ---------DRDQLSPQYVHNLRSKVQSLHLWNN--GRNHLIFNLYSGTWPDYTEDVGFD 210
Query: 204 IKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDV-----------------EN 246
I A+ S S + F P+ DV++P + G R
Sbjct: 211 IGQAMLAKASISTE-NFRPNFDVSIPLFSKDHPRTGGERGFLKFNTIPPLRKYMLVFKGK 269
Query: 247 RTVLGFWAGHRNSKIRV-----IL-------ARVWENDTELDISNNRINRAIGPLVYQRR 294
R + G + RN+ V +L + W+ D +R N Y+
Sbjct: 270 RYLTGIGSDTRNALYHVHNGEDVLLLTTCKHGKDWQKHK--DSRCDRDNTEYEKYDYREM 327
Query: 295 FYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVY 354
+ FC+ P G ++ S R +++ CVPV+LSN ++LPF+++++W + AVI +ER +
Sbjct: 328 LHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLL 387
Query: 355 QLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
Q+ +++I + ++L + Q F W +
Sbjct: 388 QIPSTIRSIHQDKILAL-----RQQTQFLWEA 414
>gi|302847711|ref|XP_002955389.1| hypothetical protein VOLCADRAFT_96311 [Volvox carteri f.
nagariensis]
gi|300259231|gb|EFJ43460.1| hypothetical protein VOLCADRAFT_96311 [Volvox carteri f.
nagariensis]
Length = 1222
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 67/292 (22%), Positives = 115/292 (39%), Gaps = 61/292 (20%)
Query: 119 FFQNIRESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNR 178
F + + + D D A +FIP++ +R +Y + V + +P+WNR
Sbjct: 126 FQERVTATGVRVGDGDTADWYFIPVT---LRHTADAY-----TLSAAVSYIREMHPWWNR 177
Query: 179 TLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALP 238
T G HF + D+G +E S +V F+ H + P++
Sbjct: 178 THGHRHFVIAIGDMGRLESE------------RGRQSTNVTFVTHWGLHAPKLFS----- 220
Query: 239 RGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRA-----IGPLVYQR 293
G+ A HRN+ D L + + NR +G Y +
Sbjct: 221 ------------GWKASHRNA-----------TDIVLPVHFHHWNRTGYFIQLGDRHYAK 257
Query: 294 RFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDV 353
+K+C P G + R ++ GCVPV++S+ F LDW F V L E D+
Sbjct: 258 HLLTSKYCFGPTGGG-HGQRQMQAVQAGCVPVVISDDVLEAFEPFLDWNTFGVRLAEADI 316
Query: 354 YQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS-------PPVRFDAFHMVM 398
++ ++L+ IS E+ L +H +++ R+DAF ++
Sbjct: 317 PRMHEVLEAISPEEYAHKEVLLRCAAQHMAFSTVTGSYIGESGRYDAFETLL 368
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 78/329 (23%), Positives = 122/329 (37%), Gaps = 60/329 (18%)
Query: 119 FFQNIRESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNR 178
F + I + D D A +FIP+ T + V + +P+WNR
Sbjct: 603 FQERITATGVRVGDGDTADWYFIPVVLRHFCDATT--------LSAAVSYIREMHPWWNR 654
Query: 179 TLGADHFFVTCHDVGVRATE------GVPFLIKNAIRV-VCSPSYDVAFIPHKDVALPQV 231
T G HF + D+G +E V F+ + S + + D+ LP
Sbjct: 655 TQGHRHFVIATGDMGRSESERGHLTANVTFVSYWGLHAPKLSSGWRASHRNATDIVLPVF 714
Query: 232 LQPFALPRGG----------------RDVENRTVLGFWAGH---RNSKIRVILARVWEN- 271
L L R G E + F+AG +SK +V VW N
Sbjct: 715 LGSPKLSRMGIFTSRLHPKFATKAPHELRERNGPIFFFAGRICGDHSKPQV--DGVWPNC 772
Query: 272 ----------DTELDISNNRINRA-----IGPLVYQRRFYKTKFCICPGGSQVNSARIAD 316
T I + NR +G Y + +KFC P G + R
Sbjct: 773 KSPHNMGYSGGTRQKIHFHHWNRTGYFIQLGDRHYAKHLLTSKFCFGPTGGG-HGQRQMQ 831
Query: 317 SIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLV 376
++ GCVPV++S+ F LDW F V L E D+ ++ ++L+ IS E+ L
Sbjct: 832 AVQAGCVPVVISDDVLEAFEPFLDWNTFGVRLAEADIPRMHEVLEAISPEEYARKEVLLR 891
Query: 377 KVQKHFQWNS-------PPVRFDAFHMVM 398
+H +++ R+DAF ++
Sbjct: 892 CAAQHMAFSTVTGSYIGESGRYDAFETLL 920
>gi|224062946|ref|XP_002300942.1| predicted protein [Populus trichocarpa]
gi|222842668|gb|EEE80215.1| predicted protein [Populus trichocarpa]
Length = 460
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 74/312 (23%), Positives = 128/312 (41%), Gaps = 46/312 (14%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASE----GYFFQNIR------ESRFLTHDPDRAH 137
+ KIY+Y + + + + R GK +++ G + ++ ESRF T + A
Sbjct: 86 SLKIYVYEEDEIDGLKELLRGRDGKISADACLKGQWGTQVKIHGLLLESRFRTRKKEEAD 145
Query: 138 LFFIP--ISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVR 195
LFF+P + C +M G E VK ++++ PY+ R+ G DH FV G
Sbjct: 146 LFFVPAYVKCVRMMGGLNDKEINHTYVK-----VLSQMPYFRRSGGRDHIFVFPSGAGAH 200
Query: 196 ATEGVPFLIKNAIRVVCSPSYD-----VAFIPHKDVALP------------QVLQPFALP 238
I +I + AF KD+ +P ++QP L
Sbjct: 201 LFRSWATYINRSIILTTEADRTDKKDTSAFNTWKDIIIPGNVEDGMTKRRIAMVQPLPLS 260
Query: 239 RGGRDVENRTVLGFWAGHRNSKIR----VILARVWENDTELDISNNRINRAIGPLVYQRR 294
+ R L + G K+ + LA+ + + E G + Y +
Sbjct: 261 K-------RKYLANYLGRAQGKVGRLKLIELAKQYPDKLESPELKFSGPGKFGRMEYFQH 313
Query: 295 FYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVIL-NERDV 353
KFC+ P G + R +S CVPVILS+ + PF +++D+ + ++ + R
Sbjct: 314 LRNAKFCLAPRGESSWTLRFYESFFVECVPVILSDQAEFPFQNVIDYTQISIKWPSTRIG 373
Query: 354 YQLKQILKNISD 365
+L + L++I D
Sbjct: 374 LELLEYLESIPD 385
>gi|183637125|gb|ACC64545.1| exostosin 1 (predicted) [Rhinolophus ferrumequinum]
Length = 746
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 78/329 (23%), Positives = 131/329 (39%), Gaps = 57/329 (17%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
FK+Y+YP Q K+ Y + I SRF T DP +A LF + +
Sbjct: 113 GFKVYVYPQ-------QKGEKIAESYQN---ILAAIEGSRFYTSDPSQACLFVLSLDTLD 162
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHD-VGVRATEGVPFLIKN 206
Y + + S + WN G +H + TE V F I
Sbjct: 163 RDQLSPQYVHN-------LRSKVQSLHLWNN--GRNHLIFNLYSGTWPDYTEDVGFDIGQ 213
Query: 207 AIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDV-----------------ENRTV 249
A+ S S + F P+ DV++P + G R R +
Sbjct: 214 AMLAKASISTE-NFRPNFDVSIPLFSKDHPRTGGERGFLKFNTIPPLRKYMLVFKGKRYL 272
Query: 250 LGFWAGHRNSKIRV-----IL-------ARVWENDTELDISNNRINRAIGPLVYQRRFYK 297
G + RN+ V +L + W+ D +R N Y+ +
Sbjct: 273 TGIGSDTRNALYHVHNGEDVLLLTTCKHGKDWQKHK--DSRCDRDNTEYEKYDYREMLHN 330
Query: 298 TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLK 357
FC+ P G ++ S R +++ CVPV+LSN ++LPF+++++W + AVI +ER + Q+
Sbjct: 331 ATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIP 390
Query: 358 QILKNISDAEFVSLHNNLVKVQKHFQWNS 386
+++I + ++L + Q F W +
Sbjct: 391 STIRSIHQDKILAL-----RQQTQFLWEA 414
>gi|224046654|ref|XP_002200457.1| PREDICTED: exostosin-1 [Taeniopygia guttata]
Length = 741
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 81/334 (24%), Positives = 136/334 (40%), Gaps = 67/334 (20%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
FK+Y+YP Q K+ Y + I SRF T DP +A LF + +
Sbjct: 108 GFKVYVYPQ-------QKGEKIAESYQN---VLAAIEGSRFYTSDPGQACLFVLSLDTL- 156
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYP---YWNRTLGADHFFVTCHD-VGVRATEGVPFL 203
+ + YV +L +K WN G +H + TE V F
Sbjct: 157 ---------DRDQLSPQYVHNLRSKVQSLHLWNN--GRNHLIFNLYSGTWPDYTEDVGFD 205
Query: 204 IKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVE------------------ 245
I A+ S S + F P+ DV++P + PR G +
Sbjct: 206 IGQAMLAKASISTE-NFRPNFDVSIPLFSKDH--PRTGGEKGFLRFNTIPPLRKYMLVFK 262
Query: 246 -NRTVLGFWAGHRNSKIRV------IL------ARVWENDTELDISNNRINRAIGPLVYQ 292
R + G + RN+ V +L + W+ D +R N Y+
Sbjct: 263 GKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHK--DSRCDRDNAEYEKYDYR 320
Query: 293 RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERD 352
+ FC+ P G ++ S R +++ CVPV+LSN ++LPF++++DW + A+I +ER
Sbjct: 321 EMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWNQAAIIGDERL 380
Query: 353 VYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
+ Q+ +++I + ++L + Q F W +
Sbjct: 381 LLQIPSTIRSIHQDKILAL-----RQQTQFLWEA 409
>gi|302812737|ref|XP_002988055.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
gi|300144161|gb|EFJ10847.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
Length = 459
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 80/290 (27%), Positives = 122/290 (42%), Gaps = 34/290 (11%)
Query: 132 DPDRAHLFFIPI-----------SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTL 180
DP A +FF+P + H+ +G EN + K V+ L + W R
Sbjct: 156 DPATADIFFVPFFSSLSYNRYCRTGHRFQGGRGCVEN-DRLEKRLVEFLRGQ-ELWRRNG 213
Query: 181 GADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSP---SYDVAFIPHKDVALP--QVLQPF 235
G DH V H + L+K A+ VV S VA + KD+ P V+ F
Sbjct: 214 GVDHVIVMHHPNSLMVARS---LLKEAMFVVADFGRFSRAVANM-RKDIVAPYKHVIPSF 269
Query: 236 ALPRGGRDVENRTVLGFWAG----HRNSKIRVILARVWENDTELD-ISNNRINRAIGPLV 290
A R E+R L F+ G IR L + ++ + ++ N I
Sbjct: 270 A--RDATTFESRETLLFFQGAIVRKEGGIIRQKLYEILKDSPGVHFVTGNTQKDGIRSAT 327
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
R KFC+ G +S R+ D+I CVPVI+S+ +LPF D LD+ +F V +
Sbjct: 328 AGMR--NAKFCLHLAGDTPSSNRLFDAIASHCVPVIISDEIELPFEDELDYSQFCVFVES 385
Query: 351 RDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMV 397
+ + + L+ I E+ L V++HF++ P + DA HM
Sbjct: 386 DKALRKGFVVRALERIGRDEWTRKWAMLKSVERHFEYQHPSLPEDAVHMT 435
>gi|413915952|gb|AFW55884.1| secondary cell wall glycosyltransferase family 47 [Zea mays]
Length = 484
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 66/251 (26%), Positives = 112/251 (44%), Gaps = 25/251 (9%)
Query: 174 PYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIP------HKDVA 227
P W R GADH V H + V + A+ V+ D P KDV
Sbjct: 228 PEWRRYGGADHVIVAHHPNSLLHARAV---LHPAVFVLS----DFGRYPPRVASLEKDVI 280
Query: 228 LP--QVLQPFALPRGGRDVENRTVLGFWAG----HRNSKIRVILARVWENDTELDISNNR 281
P + + +A G D +R L ++ G IR L + + + ++ S
Sbjct: 281 APYKHMAKTYANDSAGFD--DRPTLLYFRGAIYRKEGGSIRQELYYMLKEEKDVYFSFGS 338
Query: 282 INRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDW 341
+ G + + +KFC+ G +S R+ D+I CVPVI+S+ +LP+ D+LD+
Sbjct: 339 VQDH-GASKASQGMHSSKFCLNIAGDTPSSNRLFDAIVTHCVPVIISDDIELPYEDVLDY 397
Query: 342 RKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVM 398
KF++ + D + L ++L +S ++ + + L +V KHF++ P + DA M+
Sbjct: 398 SKFSIFVRSSDAVKKGYLMRLLSGVSKQQWTKMWDRLKEVDKHFEYQYPSQKDDAVQMIW 457
Query: 399 YELWLRRHVIK 409
L + IK
Sbjct: 458 QALSRKVPSIK 468
>gi|301774588|ref|XP_002922714.1| PREDICTED: exostosin-1-like [Ailuropoda melanoleuca]
Length = 746
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 78/329 (23%), Positives = 131/329 (39%), Gaps = 57/329 (17%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
FK+Y+YP Q K+ Y + I SRF T DP +A LF + +
Sbjct: 113 GFKVYVYPQ-------QKGEKIAESYQN---ILAAIEGSRFYTSDPSQACLFVLSLDTLD 162
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHD-VGVRATEGVPFLIKN 206
Y + + S + WN G +H + TE V F I
Sbjct: 163 RDQLSPQYVHN-------LRSKVQSLHLWNN--GRNHLIFNLYSGTWPDYTEDVGFDIGQ 213
Query: 207 AIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDV-----------------ENRTV 249
A+ S S + F P+ DV++P + G R R +
Sbjct: 214 AMLAKASISTE-NFRPNFDVSIPLFSKDHPRTGGERGFLKFNTIPPLRKYMLVFKGKRYL 272
Query: 250 LGFWAGHRNSKIRV------IL------ARVWENDTELDISNNRINRAIGPLVYQRRFYK 297
G + RN+ V +L + W+ D +R N Y+ +
Sbjct: 273 TGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHK--DSRCDRDNTEYEKYDYREMLHN 330
Query: 298 TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLK 357
FC+ P G ++ S R +++ CVPV+LSN ++LPF+++++W + AVI +ER + Q+
Sbjct: 331 ATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIP 390
Query: 358 QILKNISDAEFVSLHNNLVKVQKHFQWNS 386
+++I + ++L + Q F W +
Sbjct: 391 STIRSIHQDKILAL-----RQQTQFLWEA 414
>gi|291388448|ref|XP_002710791.1| PREDICTED: exostosin 1 [Oryctolagus cuniculus]
Length = 746
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 80/332 (24%), Positives = 134/332 (40%), Gaps = 63/332 (18%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
FK+Y+YP Q K+ Y + I SRF T DP +A LF + +
Sbjct: 113 GFKVYVYPQ-------QKGEKIAESYQN---ILAAIEGSRFYTSDPSQACLFVLSLDTL- 161
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYP---YWNRTLGADHFFVTCHD-VGVRATEGVPFL 203
+ + YV +L +K WN G +H + TE V F
Sbjct: 162 ---------DRDQLSPQYVHNLRSKVQSLHLWNN--GRNHLIFNLYSGTWPDYTEDVGFD 210
Query: 204 IKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDV-----------------EN 246
I A+ S S + F P+ DV++P + G R
Sbjct: 211 IGQAMLAKASISTE-NFRPNFDVSIPLFSKDHPRTGGERGFLKFNTIPPLRKYMLVFKGK 269
Query: 247 RTVLGFWAGHRNSKIRV-----IL-------ARVWENDTELDISNNRINRAIGPLVYQRR 294
R + G + RN+ V +L + W+ D +R N Y+
Sbjct: 270 RYLTGIGSDTRNALYHVHNGEDVLLLTTCKHGKDWQKHK--DSRCDRDNTEYEKYDYREM 327
Query: 295 FYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVY 354
+ FC+ P G ++ S R +++ CVPV+LSN ++LPF+++++W + AVI +ER +
Sbjct: 328 LHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLL 387
Query: 355 QLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
Q+ +++I + ++L + Q F W +
Sbjct: 388 QIPSTIRSIHQDKILAL-----RQQTQFLWEA 414
>gi|168017794|ref|XP_001761432.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687438|gb|EDQ73821.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 377
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 65/250 (26%), Positives = 108/250 (43%), Gaps = 15/250 (6%)
Query: 170 INKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSY---DVAFIPHKDV 226
+ + P W + G DH V H + A +N + V+ DVA + KDV
Sbjct: 124 LKQQPAWQASDGCDHILVMHHPNSMHAMRDS---FRNVLFVLADFGRYPPDVANV-EKDV 179
Query: 227 ALPQVLQPFALPRGGRDVENRTVLGFWAG----HRNSKIRVILARVWENDTELDISNNRI 282
P + E+R L F+ G + IR L + +++ +
Sbjct: 180 VAPYKHIIPSFDNDSSSFEDRETLLFFQGTIVRKQGGVIRQQLYEMLKDEEGVHFEEGS- 238
Query: 283 NRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWR 342
+ + G +KFC+ G +S R+ DSI CVPVI+S+ +LPF D LD+
Sbjct: 239 SGSEGVHSATSGMRGSKFCLNIAGDTPSSNRLFDSIASHCVPVIISDDIELPFEDELDYS 298
Query: 343 KFAVILNERDVYQLK---QILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMY 399
+F V + D + K +L++I+ ++ L L V +HF++ P +DA +MV
Sbjct: 299 EFCVFIKSEDALKEKYVINLLRSITRVQWTFLWKRLKAVARHFEYQHPTKPYDAVNMVWR 358
Query: 400 ELWLRRHVIK 409
+ R +K
Sbjct: 359 AIARRAPSVK 368
>gi|355745047|gb|EHH49672.1| hypothetical protein EGM_00373 [Macaca fascicularis]
Length = 675
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 58/222 (26%), Positives = 99/222 (44%), Gaps = 32/222 (14%)
Query: 176 WNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPF 235
WNR G +H + H T F + A+ SP+ D +F P DVALP + +
Sbjct: 152 WNR--GRNHLVLRLHPAPCPKT----FQLGQAMVAEASPTVD-SFRPGFDVALPFLPEAH 204
Query: 236 ALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNN---------RINRAI 286
L RGG + + + + R+ L + E ++ R +
Sbjct: 205 PL-RGGAPGQLQ--------QHSPQPRIALLALEEERGGWRTADTGSSACPWDARCEQDP 255
Query: 287 GPLVYQR--RFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKF 344
GP QR FC+ G ++R ++ GC+PV+LS ++LPF++++DW K
Sbjct: 256 GPGQTQRGETLPNATFCLISGHRPEAASRFLQALQAGCIPVLLSPRWELPFSEVIDWTKA 315
Query: 345 AVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
A++ +ER Q+ L+ +S A ++L + Q F W++
Sbjct: 316 AIVADERLPLQVLAALQEMSPARVLAL-----RQQTQFLWDA 352
>gi|357168395|ref|XP_003581626.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10-like
[Brachypodium distachyon]
Length = 348
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 63/236 (26%), Positives = 107/236 (45%), Gaps = 36/236 (15%)
Query: 121 QNIRESRFLTHDPDRAHLFFIP--ISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNR 178
Q + +SRF T D D AHLFF+P + C +M G T E I + YV ++++ PY+ R
Sbjct: 44 QLLLKSRFRTLDKDEAHLFFVPSYVKCVRMTGALTDKE----INQTYV-KVLSQMPYFRR 98
Query: 179 TLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYD-------VAFIPHKDVALPQV 231
+ G DH FV G + +I + +P D AF KD+ +P
Sbjct: 99 SGGRDHIFVFPSGAGAHLFRSWATFLNRSI--ILTPEGDRTDKRGISAFNTWKDIIIPGN 156
Query: 232 LQPFALPRGGRDVENRTVLGFWAGHRNSKIRVI-LARVWENDTELDISNNRINRAIGPLV 290
+ ++ + GR ++++ LA+ + + E +G +
Sbjct: 157 VDD-SMGKAGR------------------LKLVELAKQYPDKLESPELKLSGPDKLGRID 197
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAV 346
Y + KFC+ P G + R +S CVPVILS+ +LPF +++D+ + ++
Sbjct: 198 YFKHLRNAKFCLAPRGESSWTLRFYESFFVECVPVILSDEVELPFQNMIDYTEISI 253
>gi|355557695|gb|EHH14475.1| hypothetical protein EGK_00405 [Macaca mulatta]
Length = 675
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 60/221 (27%), Positives = 97/221 (43%), Gaps = 30/221 (13%)
Query: 176 WNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPF 235
WNR G +H + H T F + A+ SP+ D +F P DVALP + +
Sbjct: 152 WNR--GRNHLVLRLHPAPCPKT----FQLGQAMVAEASPTVD-SFRPGFDVALPFLPEAH 204
Query: 236 ALPRGGRDVENRTVLGFWAGHRNSKIRVILA--------RVWENDTELDISNNRINRAIG 287
L RGG G H +LA R + + + R + G
Sbjct: 205 PL-RGG-------APGQLQQHSPQPGIALLALEEERGGWRTADTGSSACPWDARCEQDPG 256
Query: 288 PLVYQR--RFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFA 345
P QR FC+ G ++R ++ GC+PV+LS ++LPF++++DW K A
Sbjct: 257 PGQTQRGETLPNATFCLISGHRPEAASRFLQALQAGCIPVLLSPRWELPFSEVIDWTKAA 316
Query: 346 VILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
++ +ER Q+ L+ +S A ++L + Q F W++
Sbjct: 317 IVADERLPLQVLAALQEMSPARVLAL-----RQQTQFLWDA 352
>gi|380810494|gb|AFE77122.1| exostosin-like 1 [Macaca mulatta]
gi|380810496|gb|AFE77123.1| exostosin-like 1 [Macaca mulatta]
Length = 675
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 60/221 (27%), Positives = 97/221 (43%), Gaps = 30/221 (13%)
Query: 176 WNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPF 235
WNR G +H + H T F + A+ SP+ D +F P DVALP + +
Sbjct: 152 WNR--GRNHLVLRLHPAPCPKT----FQLGQAMVAEASPTVD-SFRPGFDVALPFLPEAH 204
Query: 236 ALPRGGRDVENRTVLGFWAGHRNSKIRVILA--------RVWENDTELDISNNRINRAIG 287
L RGG G H +LA R + + + R + G
Sbjct: 205 PL-RGG-------APGQLQQHSPQPGIALLALEEERGGWRTADTGSSACPWDARCEQDPG 256
Query: 288 PLVYQR--RFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFA 345
P QR FC+ G ++R ++ GC+PV+LS ++LPF++++DW K A
Sbjct: 257 PGQTQRGETLPNATFCLISGHRPEAASRFLQALQAGCIPVLLSPRWELPFSEVIDWTKAA 316
Query: 346 VILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
++ +ER Q+ L+ +S A ++L + Q F W++
Sbjct: 317 IVADERLPLQVLAALQEMSPARVLAL-----RQQTQFLWDA 352
>gi|297282579|ref|XP_001108035.2| PREDICTED: exostoses (multiple)-like 1 isoform 1 [Macaca mulatta]
Length = 675
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 60/221 (27%), Positives = 97/221 (43%), Gaps = 30/221 (13%)
Query: 176 WNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPF 235
WNR G +H + H T F + A+ SP+ D +F P DVALP + +
Sbjct: 152 WNR--GRNHLVLRLHPAPCPKT----FQLGQAMVAEASPTVD-SFRPGFDVALPFLPEAH 204
Query: 236 ALPRGGRDVENRTVLGFWAGHRNSKIRVILA--------RVWENDTELDISNNRINRAIG 287
L RGG G H +LA R + + + R + G
Sbjct: 205 PL-RGG-------APGQLQQHSPQPGIALLALEEERGGWRTADTGSSACPWDARCEQDPG 256
Query: 288 PLVYQR--RFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFA 345
P QR FC+ G ++R ++ GC+PV+LS ++LPF++++DW K A
Sbjct: 257 PGQTQRGETLPNATFCLISGHRPEAASRFLQALQAGCIPVLLSPRWELPFSEVIDWTKAA 316
Query: 346 VILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
++ +ER Q+ L+ +S A ++L + Q F W++
Sbjct: 317 IVADERLPLQVLAALQEMSPARVLAL-----RQQTQFLWDA 352
>gi|226529660|ref|NP_001151894.1| secondary cell wall-related glycosyltransferase family 47 [Zea
mays]
gi|195650683|gb|ACG44809.1| secondary cell wall-related glycosyltransferase family 47 [Zea
mays]
Length = 488
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 66/251 (26%), Positives = 112/251 (44%), Gaps = 25/251 (9%)
Query: 174 PYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIP------HKDVA 227
P W R GADH V H + V + A+ V+ D P KDV
Sbjct: 232 PEWRRFGGADHVIVAHHPNSLLHARAV---LHPAVFVLS----DFGRYPPRVASLEKDVI 284
Query: 228 LP--QVLQPFALPRGGRDVENRTVLGFWAG----HRNSKIRVILARVWENDTELDISNNR 281
P + + +A G D +R L ++ G IR L + + + ++ S
Sbjct: 285 APYKHMAKTYANDSAGFD--DRPTLLYFRGAIYRKEGGSIRQELYYMLKEEKDVYFSFGS 342
Query: 282 INRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDW 341
+ G + + +KFC+ G +S R+ D+I CVPVI+S+ +LP+ D+LD+
Sbjct: 343 VQDH-GASKASQGMHSSKFCLNIAGDTPSSNRLFDAIVTHCVPVIISDDIELPYEDVLDY 401
Query: 342 RKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVM 398
KF++ + D + L ++L +S ++ + + L +V KHF++ P + DA M+
Sbjct: 402 SKFSIFVRSSDAVKKGYLMRLLSGVSKQQWTKMWDRLKEVDKHFEYQYPSQKDDAVQMIW 461
Query: 399 YELWLRRHVIK 409
L + IK
Sbjct: 462 QALSRKVPSIK 472
>gi|355686831|gb|AER98199.1| exostoses 1 [Mustela putorius furo]
Length = 702
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 78/329 (23%), Positives = 131/329 (39%), Gaps = 57/329 (17%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
FK+Y+YP Q K+ Y + I SRF T DP +A LF + +
Sbjct: 69 GFKVYVYPQ-------QKGEKIAESYQN---ILAAIEGSRFYTSDPSQACLFVLSLDTLD 118
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHD-VGVRATEGVPFLIKN 206
Y + + S + WN G +H + TE V F I
Sbjct: 119 RDQLSPQYVHN-------LRSKVQSLHLWNN--GRNHLIFNLYSGTWPDYTEDVGFDIGQ 169
Query: 207 AIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDV-----------------ENRTV 249
A+ S S + F P+ DV++P + G R R +
Sbjct: 170 AMLAKASISTE-NFRPNFDVSIPLFSKDHPRTGGERGFLKFNTIPPLRKYMLVFKGKRYL 228
Query: 250 LGFWAGHRNSKIRV------IL------ARVWENDTELDISNNRINRAIGPLVYQRRFYK 297
G + RN+ V +L + W+ D +R N Y+ +
Sbjct: 229 TGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHK--DSRCDRDNTEYEKYDYREMLHN 286
Query: 298 TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLK 357
FC+ P G ++ S R +++ CVPV+LSN ++LPF+++++W + AVI +ER + Q+
Sbjct: 287 ATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIP 346
Query: 358 QILKNISDAEFVSLHNNLVKVQKHFQWNS 386
+++I + ++L + Q F W +
Sbjct: 347 STIRSIHQDKILAL-----RQQTQFLWEA 370
>gi|397476221|ref|XP_003809508.1| PREDICTED: exostosin-like 1 [Pan paniscus]
Length = 676
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 59/221 (26%), Positives = 97/221 (43%), Gaps = 30/221 (13%)
Query: 176 WNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPF 235
WNR G +H + T F + A+ SP+ D +F P DVALP + +
Sbjct: 153 WNR--GRNHLVLRLRPAPCPRT----FQLGQAMVAEASPTVD-SFRPGFDVALPFLPEAH 205
Query: 236 ALPRGGRDVENRTVLGFWAGHRNSKIRVILA--------RVWENDTELDISNNRINRAIG 287
L RGG + R H +LA R + + + R + G
Sbjct: 206 PL-RGGAPGQLRQ-------HSPQPGVALLALEEERGGWRTADTGSSACPWDGRCEQDPG 257
Query: 288 PLVYQRR--FYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFA 345
P QR+ FC+ G ++R ++ GC+PV+LS ++LPF++++DW K A
Sbjct: 258 PGQTQRQETLPNATFCLISGHRPEAASRFLQALQAGCIPVLLSPRWELPFSEVIDWTKAA 317
Query: 346 VILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
++ +ER Q+ L+ +S A ++L Q F W++
Sbjct: 318 IVADERLPLQVLAALQEMSPARVLALRQ-----QTQFLWDA 353
>gi|402853480|ref|XP_003891421.1| PREDICTED: exostosin-like 1 [Papio anubis]
Length = 675
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 59/221 (26%), Positives = 98/221 (44%), Gaps = 30/221 (13%)
Query: 176 WNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPF 235
WNR G +H + H T F + A+ SP+ D +F P DVALP + +
Sbjct: 152 WNR--GRNHLVLRLHPAPCPKT----FQLGQAMVAEASPTVD-SFRPGFDVALPFLPEAH 204
Query: 236 ALPRGGRDVENRTVLGFWAGHRNSKIRVILA--------RVWENDTELDISNNRINRAIG 287
L RGG + + H +LA R + + + R + G
Sbjct: 205 PL-RGGAPGQLQQ-------HSPQPGIALLALEEERGGWRTADTGSSACPWDARCEQDPG 256
Query: 288 PLVYQR--RFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFA 345
P QR FC+ G ++R ++ GC+PV+LS ++LPF++++DW K A
Sbjct: 257 PGQTQRGETLPNATFCLISGHRPEAASRFLQALQAGCIPVLLSPRWELPFSEVIDWTKAA 316
Query: 346 VILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
++ +ER Q+ L+ +S A ++L + Q F W++
Sbjct: 317 IVADERLPLQVLAALQEMSPARVLAL-----RQQTQFLWDA 352
>gi|332808073|ref|XP_524623.3| PREDICTED: exostoses (multiple)-like 1 isoform 2 [Pan troglodytes]
Length = 676
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 59/221 (26%), Positives = 97/221 (43%), Gaps = 30/221 (13%)
Query: 176 WNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPF 235
WNR G +H + T F + A+ SP+ D +F P DVALP + +
Sbjct: 153 WNR--GRNHLVLRLRPAPCPRT----FQLGQAMVAEASPTVD-SFRPGFDVALPFLPEAH 205
Query: 236 ALPRGGRDVENRTVLGFWAGHRNSKIRVILA--------RVWENDTELDISNNRINRAIG 287
L RGG + R H +LA R + + + R + G
Sbjct: 206 PL-RGGAPGQLRQ-------HSPQPGVALLALEEERGGWRTADTGSSACPWDGRCEQDPG 257
Query: 288 PLVYQRR--FYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFA 345
P QR+ FC+ G ++R ++ GC+PV+LS ++LPF++++DW K A
Sbjct: 258 PGQTQRQETLPNATFCLISGHRPEAASRFLQALQAGCIPVLLSPRWELPFSEVIDWTKAA 317
Query: 346 VILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
++ +ER Q+ L+ +S A ++L Q F W++
Sbjct: 318 IVADERLPLQVLAALQEMSPARVLALRQ-----QTQFLWDA 353
>gi|15241619|ref|NP_199306.1| Exostosin family protein [Arabidopsis thaliana]
gi|30694651|ref|NP_851132.1| Exostosin family protein [Arabidopsis thaliana]
gi|10177484|dbj|BAB10875.1| unnamed protein product [Arabidopsis thaliana]
gi|15081733|gb|AAK82521.1| AT5g44930/K21C13_11 [Arabidopsis thaliana]
gi|27363262|gb|AAO11550.1| At5g44930/K21C13_11 [Arabidopsis thaliana]
gi|332007793|gb|AED95176.1| Exostosin family protein [Arabidopsis thaliana]
gi|332007794|gb|AED95177.1| Exostosin family protein [Arabidopsis thaliana]
Length = 443
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 71/271 (26%), Positives = 124/271 (45%), Gaps = 29/271 (10%)
Query: 132 DPDRAHLFFIP-------ISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADH 184
DP A LF++ I G G S E M ++ + S + +W R G DH
Sbjct: 129 DPAEADLFYVSAFSSLSLIVDSGRPGFGYSDEEM----QESLVSWLESQEWWRRNNGRDH 184
Query: 185 FFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPH----KDVALPQVLQPFALPRG 240
V A + V +KNA+ +V +D KDV +P + A G
Sbjct: 185 VIVAGDP---NALKRVMDRVKNAVLLVTD--FDRLRADQGSLVKDVIIPYSHRIDAY-EG 238
Query: 241 GRDVENRTVLGFWAGHR----NSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFY 296
V+ RT L F+ G+R K+R +L ++ E + ++ I +R V Q +
Sbjct: 239 ELGVKQRTNLLFFMGNRYRKDGGKVRDLLFKLLEKEEDVVIKRGTQSRENMRAVKQG-MH 297
Query: 297 KTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ- 355
+KFC+ G ++ R+ D+I CVPVI+S+ +LPF D++D+RKF++ L +
Sbjct: 298 TSKFCLHLAGDTSSACRLFDAIASLCVPVIVSDGIELPFEDVIDYRKFSIFLRRDAALKP 357
Query: 356 --LKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + L+ + + + + +V+++F +
Sbjct: 358 GFVVKKLRKVKPGKILKYQKVMKEVRRYFDY 388
>gi|1235559|emb|CAA65443.1| ext1 [Mus musculus]
Length = 745
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 79/332 (23%), Positives = 134/332 (40%), Gaps = 63/332 (18%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
FK+Y+YP Q K+ Y + I SRF T DP +A LF + +
Sbjct: 113 GFKVYVYPQ-------QKGEKIAESYQN---ILAAIEGSRFYTSDPSQACLFVLSLDTL- 161
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYP---YWNRTLGADHFFVTCHD-VGVRATEGVPFL 203
+ + YV +L +K WN G +H + TE V F
Sbjct: 162 ---------DRDQLSPQYVHNLRSKVQSLHLWNN--GRNHLIFNLYSGTWPDYTEDVGFD 210
Query: 204 IKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDV-----------------EN 246
I A+ + S + F P+ DV++P + G R
Sbjct: 211 IGQAMLAKATISTE-NFRPNFDVSIPLFSKDHPRTGGERGFLKFNTIPPLRKYMLVFKGK 269
Query: 247 RTVLGFWAGHRNSKIRV-----IL-------ARVWENDTELDISNNRINRAIGPLVYQRR 294
R + G + RN+ V +L + W+ D +R N Y+
Sbjct: 270 RYLTGIGSDTRNALYHVHNGEDVLLLTTCKHGKDWQKHK--DSRCDRDNTEYEKYDYREM 327
Query: 295 FYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVY 354
+ FC+ P G ++ S R +++ CVPV+LSN ++LPF+++++W + AVI +ER +
Sbjct: 328 LHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLL 387
Query: 355 QLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
Q+ +++I + ++L + Q F W +
Sbjct: 388 QIPSTIRSIHQDKILAL-----RQQTQFLWEA 414
>gi|159480086|ref|XP_001698117.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158273916|gb|EDO99702.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 821
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 87/349 (24%), Positives = 136/349 (38%), Gaps = 71/349 (20%)
Query: 108 KLTG-KYASEGYFFQNIRESRFLTHDPDRAHLFFIP--ISCHKMRGKGTSYENMTVIVKD 164
+LTG Y+ E F + + +S T DP+ A F++P ++C+ G + +
Sbjct: 385 ELTGWTYSVETMFHELLLQSEHRTFDPEEADFFYVPHYVTCYFWPIMG--WADAPWWHAP 442
Query: 165 YVDS---------------LINKYPYWNRTLGADHFFVTCHDVGV--------------- 194
YVDS L PYW+R G DH ++ D G
Sbjct: 443 YVDSRPMHGANMLTELHGWLRTNLPYWDRRGGRDHIWLMAADEGACWMPTAIYNTSIVLT 502
Query: 195 -------RATEGVPFLIKNAIRVVCS----PSYDV--------AFIPHKDVALPQVLQPF 235
T G +L R V P D F P KD+ +P PF
Sbjct: 503 HWGRLEANHTSGTAYLQDVYDRPVYGFQRWPGVDYHHDIEGHPCFDPKKDLVIPAFKPPF 562
Query: 236 ALPRG---GRDVENRTVLGFWAGHRNS-----KIRVILARVWENDTELDISNN-RINRAI 286
R G R +L ++ G + R I R+ + D N +I +
Sbjct: 563 HFARSPLLGAPPLQRDILLYFRGDSGAFRLPQYSRGIRQRITDLSNRQDWFNRYKIVISH 622
Query: 287 GPLV---YQRRFYKTKFC-ICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWR 342
G +V Y ++KFC + PG S R D+I +GC+PV++ + F ILDW
Sbjct: 623 GGMVGGDYSEHLARSKFCLVAPGDGW--SPRAEDAILHGCIPVVVMDGVQAVFESILDWD 680
Query: 343 KFAVILNERD--VYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPV 389
F++ + E D + L Q+L +IS + +L +V F + + P+
Sbjct: 681 SFSLRIREDDAALEALPQLLASISPERLAHMQRHLARVWHRFAYTTGPL 729
>gi|387015838|gb|AFJ50038.1| Exostosin-1-like [Crotalus adamanteus]
Length = 751
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 77/329 (23%), Positives = 130/329 (39%), Gaps = 57/329 (17%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
FK+Y+YP Q K+ Y + I SRF T DP +A LF + +
Sbjct: 118 GFKVYVYPQ-------QKGEKIAESYQN---ILAAIEGSRFYTSDPSQACLFVLSLDTLD 167
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHD-VGVRATEGVPFLIKN 206
Y + + S + WN G +H + TE V F I
Sbjct: 168 RDQLSPQYVHN-------LRSKVQSLHLWNN--GRNHLVFNLYSGTWPDYTEDVGFDIGQ 218
Query: 207 AIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDV-----------------ENRTV 249
A+ S S + F P DV++P + G R R +
Sbjct: 219 AMLAKASISTE-NFRPRFDVSIPLFSKDHPRTGGERGFLRFNTIPPLRKYMLVFKGKRYL 277
Query: 250 LGFWAGHRNSKIRV------IL------ARVWENDTELDISNNRINRAIGPLVYQRRFYK 297
G + RN+ V +L + W+ D +R N Y+ +
Sbjct: 278 TGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHK--DSRCDRDNAEYEKYDYREMLHN 335
Query: 298 TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLK 357
FC+ P G ++ S R +++ C+PV+LSN ++LPF+++++W + AVI +ER + Q+
Sbjct: 336 ATFCLVPRGRRLGSFRFLEALQAACIPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIP 395
Query: 358 QILKNISDAEFVSLHNNLVKVQKHFQWNS 386
+++I + ++L + Q F W +
Sbjct: 396 STIRSIHQDKILAL-----RQQTQFLWEA 419
>gi|197101852|ref|NP_001125538.1| exostosin-1 [Pongo abelii]
gi|75042009|sp|Q5RBC3.1|EXT1_PONAB RecName: Full=Exostosin-1; AltName:
Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
acetylglucosaminyl-proteoglycan
4-alpha-N-acetylglucosaminyltransferase; AltName:
Full=Multiple exostoses protein 1 homolog
gi|55728386|emb|CAH90937.1| hypothetical protein [Pongo abelii]
Length = 746
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 80/332 (24%), Positives = 133/332 (40%), Gaps = 63/332 (18%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
FK+Y YP Q K+ Y + I SRF T DP +A LF + +
Sbjct: 113 GFKVYAYPQ-------QKGEKIAESYQN---ILAAIEGSRFYTSDPSQACLFVLSLDTL- 161
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYP---YWNRTLGADHFFVTCHD-VGVRATEGVPFL 203
+ + YV +L +K WN G +H + TE V F
Sbjct: 162 ---------DRDQLSPQYVHNLRSKVQSLHLWNN--GRNHLIFNLYSGTWPDYTEDVGFD 210
Query: 204 IKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDV-----------------EN 246
I A+ S S + F P+ DV++P + G R
Sbjct: 211 IGQAMLAKASISTE-NFRPNFDVSIPLFSKDHPRTGGERGFLKFNTIPPLRKYMLVFKGK 269
Query: 247 RTVLGFWAGHRNSKIRV------IL------ARVWENDTELDISNNRINRAIGPLVYQRR 294
R + G + RN+ V +L + W+ D +R N Y+
Sbjct: 270 RYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHK--DSRCDRDNTEYEKYDYREM 327
Query: 295 FYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVY 354
+ FC+ P G ++ S R +++ CVPV+LSN ++LPF+++++W + AVI +ER +
Sbjct: 328 LHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLL 387
Query: 355 QLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
Q+ +++I + ++L + Q F W +
Sbjct: 388 QIPSTIRSIHQDKILAL-----RQQTQFLWEA 414
>gi|388512347|gb|AFK44235.1| unknown [Lotus japonicus]
Length = 267
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 63/246 (25%), Positives = 110/246 (44%), Gaps = 18/246 (7%)
Query: 168 SLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCS-PSYDVAFIPHKDV 226
L+ K +W R+ G DH F H R + +I+VV Y +KDV
Sbjct: 2 ELLKKSVHWQRSRGRDHVFPMTHPNAFRFLRN---QLNESIQVVVDFGRYPKGSNLNKDV 58
Query: 227 ALP--QVLQPFALPRGGRDVENRTVLGFWAGHRNSK----IRVILARVWENDTELDISNN 280
P V+ F E+R L F+ G K +R LA++ T D +
Sbjct: 59 VSPYVHVVDSFTDDEPQDPYESRPTLLFFRGRTFRKDEGIVRAKLAKIL---TGFDDVHY 115
Query: 281 RINRAIGPLVY--QRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDI 338
+ A G + + +KFC+ P G +S R+ D+I CVPVI+S+ +LPF D
Sbjct: 116 ERSFATGENIKLSSQGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPFEDE 175
Query: 339 LDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFH 395
+D+ +F++ + ++ Q + L+ ++ + L + H+++ PP + DA +
Sbjct: 176 IDYSQFSLFFSFKEALQPGYMIDQLRKFPKDKWSEMWRQLKNISHHYEFQYPPKKEDAVN 235
Query: 396 MVMYEL 401
M+ ++
Sbjct: 236 MLWRQV 241
>gi|91076108|ref|XP_968944.1| PREDICTED: similar to tout-velu CG10117-PA [Tribolium castaneum]
gi|270014706|gb|EFA11154.1| hypothetical protein TcasGA2_TC004758 [Tribolium castaneum]
Length = 719
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 76/317 (23%), Positives = 128/317 (40%), Gaps = 51/317 (16%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI-SCH 146
F++Y+YP D +T + +KL + ESR+ T DP A LF + I +
Sbjct: 81 GFRVYVYPPDDNSTPSPSYQKL----------LNVLMESRYYTADPRLACLFVLSIDTLD 130
Query: 147 KMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKN 206
+ R NM S + P+WN L F + + F
Sbjct: 131 RDRLSADYVRNM--------QSRLQHLPHWNNGLNHVIFNLYSGTWPNYTENNLDFDYGM 182
Query: 207 AIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVEN-------------------R 247
AI S S D P D+++P + P G +V + R
Sbjct: 183 AILAKASMS-DSHMRPGFDISIPLFHK--VHPEKGGEVGSVLANSLPLQKNYLLAFKGKR 239
Query: 248 TVLGFWAGHRNS-------KIRVILARVWENDTELDISNNRI---NRAIGPLVYQRRFYK 297
V G + RNS K +++ + D+ + R N+ Y+
Sbjct: 240 YVHGIGSDTRNSLYHLHNRKDMIMVTTCRHGKSWKDMKDERCDQDNKEYDKYDYEVLLQN 299
Query: 298 TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLK 357
+ FC+ P G ++ S R +++ GC+PV+LSN + LPF +DW K A+ +ER + Q+
Sbjct: 300 STFCLVPRGRRLGSFRFLEALQAGCIPVLLSNGWALPFAQKIDWSKAAIWADERLLLQVP 359
Query: 358 QILKNISDAEFVSLHNN 374
I+++++ A+ + L
Sbjct: 360 YIVRSLAPAKILQLRQQ 376
>gi|224145217|ref|XP_002325567.1| predicted protein [Populus trichocarpa]
gi|222862442|gb|EEE99948.1| predicted protein [Populus trichocarpa]
Length = 476
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 102/419 (24%), Positives = 170/419 (40%), Gaps = 56/419 (13%)
Query: 17 CSLRSSLLTLALVTLLCFTCL-SFNSFRSYP-----LQNNFNS----------TLSFAIN 60
CS+ + L ++ LCF+ +N S+P LQ + NS +L++ +
Sbjct: 13 CSIPTLFLAFTTLSFLCFSLFFLYNKNPSFPNPQTTLQTSQNSIKVYVADLPRSLNYGLL 72
Query: 61 KQDYSDLGSDVFHSPSVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYAS-EGYF 119
Q +S D S H + N K YP+ Y +TG + E
Sbjct: 73 DQYWSSSIPDTRISSDPDHQIRPKPTKNQKFLDYPENPLIKQYSAEYWITGDLMTPEKLK 132
Query: 120 FQNIRESRFLTHDPDRAHL-FFIPISCHKMRGKGTSYENMTVIVKDY-----VDSLINKY 173
F++ + F ++ D + FF +S KG +DY V ++
Sbjct: 133 FRSFAKRVFDCNEADVVFVPFFATLSAEMELAKGKGSFRRKEGNEDYRRQKQVVDIVRNS 192
Query: 174 PYWNRTLGADHFFVTCHDVGV---RATEGVPFLIKNAIRVVCS------------PSYDV 218
W R+ G DH FV V + RA I AI +V
Sbjct: 193 DAWKRSGGKDHVFVLTDPVAMWHLRAE------IAPAILLVVDFGGWYRLDSKSSNGSSS 246
Query: 219 AFIPHKDVAL-PQVLQPFA--LPRGG-RDVENRTVLGFWAG----HRNSKIRVILARVWE 270
I H V+L V+ P+ LPR + + R+ L ++ G HR +R L +
Sbjct: 247 DMIQHTQVSLLKDVIVPYTHLLPRLQLSENKKRSTLLYFKGAKHRHRGGIVREKLWDLLV 306
Query: 271 NDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNY 330
N+ + I N A G R ++FC+ P G +S R+ D+I C+PV++S+
Sbjct: 307 NEPGVIIEEGFPN-ATGREQSIRGMRSSEFCLHPAGDTPSSCRLFDAIQSLCIPVVVSDN 365
Query: 331 YDLPFNDILDWRKFAVILNERDVYQLKQI---LKNISDAEFVSLHNNLVKVQKHFQWNS 386
+LPF ++D+ +FAV + D + + + L++IS + N+ KVQ Q+++
Sbjct: 366 IELPFEGMVDYTEFAVFVAVDDALKPRWLVDRLRSISVKQRNEFRRNMAKVQPILQYDN 424
>gi|302766093|ref|XP_002966467.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
gi|300165887|gb|EFJ32494.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
Length = 453
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 73/295 (24%), Positives = 126/295 (42%), Gaps = 52/295 (17%)
Query: 132 DPDRAHLFFIPI------SCHKMRGKG--------TSYENMTVIVKDYVDSLINKYPYWN 177
DPD+A + F+P GKG YE +++ ++ W
Sbjct: 116 DPDQADVVFVPFFAALSAEAQLRNGKGHFRHRKDNEDYERQRAVME-----IVTSSSRWQ 170
Query: 178 RTLGADHFFVTCHDVGV---RATEGVPFLI-----------KNAIRVVCSPS--YDVAFI 221
R+ G DH FV + + RA L+ + R + SP Y
Sbjct: 171 RSGGRDHVFVLTDPMAMYHFRAEIANSILLVVDFGGWYMEDAKSSRNLSSPQPIYHTQVS 230
Query: 222 PHKDVALP--QVLQPFALPRGGRDVENRTVLGFWAG----HRNSKIRVILARVWENDTEL 275
KDV +P +L AL +D R+ L ++ G HR +R L V + + +
Sbjct: 231 LIKDVIVPYTHLLPTLAL---SQDNAVRSTLLYFKGARYRHRTGLVRDQLWSVLDGEPGV 287
Query: 276 DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPF 335
+ NR G + + + FC+ P G +S R+ D++ C+PVI+S+ +LPF
Sbjct: 288 LLEEGFPNRT-GQVQAVQGMRNSHFCLHPAGDTPSSCRLFDAVASLCIPVIVSDSIELPF 346
Query: 336 NDILDWRKFAVILNERDVYQLKQILKNISDAEFVS-----LHNNLVKVQKHFQWN 385
+LD+ +FA+ ++ D K +++++S F S + +NL +Q HF++
Sbjct: 347 EGMLDYTQFAIFVSVHDALLPKWLVRHLS--SFSSKVRNQMRHNLASLQHHFEYE 399
>gi|260813939|ref|XP_002601673.1| hypothetical protein BRAFLDRAFT_94550 [Branchiostoma floridae]
gi|229286975|gb|EEN57685.1| hypothetical protein BRAFLDRAFT_94550 [Branchiostoma floridae]
Length = 786
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 88/349 (25%), Positives = 146/349 (41%), Gaps = 63/349 (18%)
Query: 110 TGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDS- 168
T ++ E Q + S F T DP++A +F+IP + E ++ I +D
Sbjct: 420 THMFSLEVILHQKLLSSTFRTLDPEKADVFYIPY----YPALAAACEPVSTIDSPALDRE 475
Query: 169 ----LINKYPYWNR------TLGA---DHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPS 215
+ + YPY+ + LG +H VT + R + V F+ AI P
Sbjct: 476 LWQFITSNYPYFQQGKPHMMALGRIEREHADVTGGILKTRESRSVTFV---AIEHESDPK 532
Query: 216 YDVAFIPHKDVALPQVLQPFALPR--------GGRDVENRT-------VLGFWAGHR--N 258
+ FI + LP V+ P+ GG RT VL +AG R +
Sbjct: 533 -TLKFI--RRSGLPMVVAPYPSCGHLLSDNKFGGESKSERTQLDIPRDVLVLFAGSRRMS 589
Query: 259 SKIRVILARVWENDTELDISNNRINRAIGPLVYQRR------------FYKTKFCICPGG 306
IR IL++ +E + + +N+ + Q + + FC+ P G
Sbjct: 590 HDIRRILSQQLRPTSEKYDATSSLNKQNVWFITQECRDRSWQENLVEWMHHSVFCLQPPG 649
Query: 307 SQVNSARIADSIHYGCVPVILSNYYD--LPFNDILDWRKFAVILNERDVYQLKQ----IL 360
D++ GC+PVI ++ PF+D+LD+ KF V + + D +Q K+ IL
Sbjct: 650 DSPTRKSFFDAVQCGCIPVIFKLDHEPVYPFDDVLDYSKFTVKVTDGDFFQEKRSIVDIL 709
Query: 361 KNISDAEFVSLHNNLVKVQKHFQWNSPPVRF----DAFHMVMYELWLRR 405
++I +A + L +V Q++ PP+ DAF M+M E+ R
Sbjct: 710 QDIPEAVIAAKRAELRQVTPLLQYSYPPLPETHVQDAFDMIMQEIGRTR 758
>gi|242079279|ref|XP_002444408.1| hypothetical protein SORBIDRAFT_07g021480 [Sorghum bicolor]
gi|241940758|gb|EES13903.1| hypothetical protein SORBIDRAFT_07g021480 [Sorghum bicolor]
Length = 539
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/161 (27%), Positives = 82/161 (50%), Gaps = 14/161 (8%)
Query: 257 RNSKIRVILARVWENDTEL-----DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNS 311
+ K+R L ++ +++ ++ + N I RA + +KFC+ G +S
Sbjct: 369 QGGKVRQKLYQLLKDERDVHFTYGSVRQNGIRRATAGM------STSKFCLNIAGDTPSS 422
Query: 312 ARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ---LKQILKNISDAEF 368
R+ D+I CVPVI+S+ +LPF D+LD+ +F V + D + L ++L+ IS E+
Sbjct: 423 NRLFDAIVSHCVPVIISDDIELPFEDVLDYSEFCVFVRSADAAKRGFLLRLLRGISRDEW 482
Query: 369 VSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVIK 409
+ L KV +HF++ P DA M+ + + H ++
Sbjct: 483 TKMWMRLKKVTRHFEYQYPSRSGDAVQMIWSAVARKMHSVQ 523
>gi|344287424|ref|XP_003415453.1| PREDICTED: exostosin-like 1 [Loxodonta africana]
Length = 675
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 76/310 (24%), Positives = 124/310 (40%), Gaps = 63/310 (20%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
+ K+++YP +P + R++ +I SR+ T +P A L
Sbjct: 94 SLKVFVYPAAEPTSEIH--RRI----------LASIEGSRYHTSNPGEACLLLF------ 135
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNA 207
+ + E V P WN G +H + H T F + A
Sbjct: 136 LSLDAPAAEASPVP------------PQWNG--GKNHLVFSLHPAPCPKT----FQLGQA 177
Query: 208 IRVVCSPSYDVAFIPHKDVALPQVLQPFALP-RGGRDVENRTVLGFWAGHRNSKIRVILA 266
+ SP+ D F DVALP L P A P RGG G H +LA
Sbjct: 178 MVAEASPTVDT-FRRGFDVALP--LLPEAHPFRGGAP-------GKLQQHSPHPGETLLA 227
Query: 267 --------RVWENDTELDISNNRINRAIGPLVYQRR--FYKTKFCICPGGSQVNSARIAD 316
R D+ + + R + GP R FC+ PG + ++ R
Sbjct: 228 LAEERGGWRTAGTDSSTCLWDGRYEQDPGPTQTHPRGTLPNATFCLIPG-RRSDALRFLQ 286
Query: 317 SIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLV 376
++ GC+PV+LS ++LPF++++DW K A+I ++R Q+ L+ + A ++L
Sbjct: 287 ALQAGCIPVLLSPRWELPFSEVIDWTKAAIIADKRLPLQVLAALQEMPPARVLAL----- 341
Query: 377 KVQKHFQWNS 386
+ Q F W++
Sbjct: 342 RQQTQFLWDA 351
>gi|356507133|ref|XP_003522325.1| PREDICTED: probable glycosyltransferase At3g42180-like [Glycine
max]
Length = 484
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 64/238 (26%), Positives = 107/238 (44%), Gaps = 26/238 (10%)
Query: 176 WNRTLGADHFFVTCHD---VGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVL 232
W R+ G DH V H + R G L+ + ++A I KD+ P
Sbjct: 233 WKRSGGKDHLIVAHHPNSLLDARRRLGAAMLV---LADFGRYPVELANI-KKDIIAPYRH 288
Query: 233 QPFALPRG-GRDVENRTVLGFWAG----HRNSKIRVILARVWENDTEL-----DISNNRI 282
+PR E RT L ++ G IR L + +++ ++ I N I
Sbjct: 289 LVGTIPRAESASFEKRTTLVYFQGAIYRKDGGAIRQELYYLLKDENDVHFTFGSIGGNGI 348
Query: 283 NRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWR 342
N+A + +KFC+ G +S R+ D+I CVPVI+S+ +LPF D LD+
Sbjct: 349 NQASQGMAL------SKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDEIELPFEDDLDYS 402
Query: 343 KFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMV 397
F++I++ D + L +L++I E+ + L ++ HF++ P DA +M+
Sbjct: 403 DFSIIVHASDAMKKGYLLNLLRSIKRDEWNKMWERLKQITHHFEYQYPSQPGDAVNMI 460
>gi|222640628|gb|EEE68760.1| hypothetical protein OsJ_27458 [Oryza sativa Japonica Group]
Length = 518
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 42/136 (30%), Positives = 69/136 (50%), Gaps = 9/136 (6%)
Query: 277 ISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFN 336
+ N I RA + +KFC+ G +S R+ D+I CVPVI+S+ +LPF
Sbjct: 373 VRQNGIRRAT------KGMASSKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDDIELPFE 426
Query: 337 DILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDA 393
D+LD+ F V + D + L +L+ IS E+ ++ L +V HF++ P DA
Sbjct: 427 DVLDYSAFCVFVRASDAVKRGFLLHLLRGISQEEWTAMWRRLKEVAHHFEYQYPSQPGDA 486
Query: 394 FHMVMYELWLRRHVIK 409
M+ + + H++K
Sbjct: 487 VQMIWGAVARKMHLVK 502
>gi|291238132|ref|XP_002738985.1| PREDICTED: tout-velu-like, partial [Saccoglossus kowalevskii]
Length = 403
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 33/90 (36%), Positives = 55/90 (61%), Gaps = 5/90 (5%)
Query: 297 KTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQL 356
+ FCI G+++ ++D++ GC+PV+LS+ Y LPF+ +LDW++ A+ LNE D+ Q+
Sbjct: 1 EATFCIVLRGARLGQTALSDALKAGCIPVVLSDTYVLPFSQVLDWKRAAIRLNEEDLDQV 60
Query: 357 KQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
+L++IS SL Q F WN+
Sbjct: 61 ASVLRSISPTRINSLRK-----QVTFFWNT 85
>gi|363731063|ref|XP_418396.3| PREDICTED: exostosin-1 [Gallus gallus]
Length = 581
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 57/96 (59%), Gaps = 5/96 (5%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y+ + FC+ P G ++ S R +++ CVPV+LSN ++LPF++++DW++ AVI +E
Sbjct: 159 YREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWKQAAVIGDE 218
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
R + Q+ +++I + ++L Q F W +
Sbjct: 219 RLLLQIPSTIRSIHQDKILALRQ-----QTQFLWEA 249
>gi|412986347|emb|CCO14773.1| predicted protein [Bathycoccus prasinos]
Length = 445
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 91/357 (25%), Positives = 140/357 (39%), Gaps = 55/357 (15%)
Query: 90 KIYIYPDGD------PNTFYQTPR------KLTGKYASEGYFFQNIRESRFLTHDPDRAH 137
KIY+Y D D P Q + K+ G F +E R T +P A
Sbjct: 86 KIYVYNDADVKKLLFPGKETQAYKSGVCGMKMYGSQVHIADFLLKSKELR--TENPSDAD 143
Query: 138 LFFIPISCHKM-----RGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDV 192
FF+P M G G + + + ++ +I K PY ++ G DH FV
Sbjct: 144 FFFLPGWPKCMLDAPPNGAGLTDDELA----KRLNGVIEKLPYIKKSGGRDHVFVWPSGR 199
Query: 193 GVRATEGVPFLIKNAIRVV-----CSPSYDVA--FIPHKDVALPQVL--QPFALPRGGRD 243
G + I N+I + P +A F P KDV LP + + + +
Sbjct: 200 GPTLYKNWRCKIPNSIFLTPEGFYTDPYRTLAPYFDPWKDVVLPGFMDGRKDSYLETNKR 259
Query: 244 VENRTVLGFWAG-------------HRNSKIRVILARVWEN--DTELDISNNRINRAIGP 288
RT L +AG H + R L ++ + D L IS
Sbjct: 260 TSKRTKLASFAGTVPDGQALKGDEKHVKAHPRERLLKLSKKYPDDLLAISGRTPK----- 314
Query: 289 LVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVIL 348
Y +KFCI P G + R ++ GCVPVI+S+ LPF + LDW ++
Sbjct: 315 --YAEILGDSKFCIVPRGLSPWTLRTYETFFAGCVPVIISDSVRLPFQEFLDWSLISIKW 372
Query: 349 NERDVYQ-LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLR 404
E + + L LK+I D E + +V+ F + + + +AF +M+ L L+
Sbjct: 373 PEAKIDESLLTYLKSIPDEEIEKIVRRGEQVRCVFAYQADATKCNAFSAIMWALSLK 429
>gi|414882087|tpg|DAA59218.1| TPA: hypothetical protein ZEAMMB73_484283 [Zea mays]
Length = 479
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 75/305 (24%), Positives = 128/305 (41%), Gaps = 28/305 (9%)
Query: 124 RESRFLTHDPDRAHLFFIPISC-----HKMRGKGTSYENMTVIVKDYVDSLINKYPYWNR 178
R S DP A L F+P R +++ + + P W R
Sbjct: 168 RHSAVRVTDPRDADLVFVPFFASLSYNRHSRPLPPEKVGRDKALQEKLVGYLTARPEWRR 227
Query: 179 TLGADHFFVTCHDVGV----RATEGVPFLIKNAIRV---VCSPSYDVAFIPHKDVALPQV 231
GADH V H + A F++ + R V S DV P+K +A
Sbjct: 228 FGGADHVIVAHHPNSLLHARAALSPAVFVLSDFGRYPPRVASLEKDV-IAPYKHMA---- 282
Query: 232 LQPFALPRGGRDVENRTVLGFWAG----HRNSKIRVILARVWENDTELDISNNRINRAIG 287
+ F G D +R L ++ G IR L + +++ ++ S + G
Sbjct: 283 -KTFVNDSAGFD--DRPTLLYFRGAIYRKEGGTIRQELYYMLKDEKDVYFSFGSVQDH-G 338
Query: 288 PLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVI 347
+ + +KFC+ G +S R+ D+I CVPVI+S+ +LP+ D+LD+ KF++
Sbjct: 339 ASKASQGMHSSKFCLNIAGDTPSSNRMFDAIVSHCVPVIISDDIELPYEDVLDYSKFSIF 398
Query: 348 LNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLR 404
+ D + L ++L +S + + + L +V +HF++ P + DA M+ L +
Sbjct: 399 VRSSDAVEKGHLMRLLSGVSKQRWTEMWSRLREVDRHFEYQYPSQKDDAVQMIWRSLSRK 458
Query: 405 RHVIK 409
IK
Sbjct: 459 VPSIK 463
>gi|384252594|gb|EIE26070.1| exostosin-domain-containing protein [Coccomyxa subellipsoidea
C-169]
Length = 898
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 87/355 (24%), Positives = 135/355 (38%), Gaps = 79/355 (22%)
Query: 113 YASEGYFFQNIRE---SRFLTHDPDRAHLFFIPI--SC--HKMRG--KGTSYENMTVIVK 163
Y E F + + + S T DP+ A FF+P+ SC H + G Y T+ +
Sbjct: 489 YGIESAFHERLLQALSSEHRTLDPEEADFFFMPVYTSCFLHPVWGYVDHPWYYGPTIDCR 548
Query: 164 DYVDSLINK-----------------------YPYWNRTLGADHFFVTCHDVG------- 193
D I + +P+W R G DH ++ HD G
Sbjct: 549 RDGDMQICQTGANRVMQAMFMLLEAQKWVEVNHPWWRRKGGRDHIWLITHDEGSCWAPKE 608
Query: 194 --------------VRATEGVPFL-IKNAIRVVCSPSY-----------DVAFIPHKDVA 227
V T F N + V P + + P KD+
Sbjct: 609 IRLSIILSHWGRKDVNHTSNSAFKPWDNYTQEVIHPEWWPEGYTHHIKGHACYDPIKDLI 668
Query: 228 LPQVLQPFAL----PRGGRDVENRTVLGFWAG----HRNSKI-RVILARVWENDTELDIS 278
+P + P P G R +L + G HR R I R++ E D +
Sbjct: 669 IPNLKHPAEFANFSPLVGHPQPPRDILFLFRGDVGKHRLPHYSRGIRQRLFALAQEHDWA 728
Query: 279 NNRI----NRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLP 334
+R Y ++KFC+ G S R D+I +GCVPV++ + D
Sbjct: 729 GRHAILIGDRDDVAGDYSELLTRSKFCLVAPGDGF-SPRAEDAILHGCVPVVVMDEVDPV 787
Query: 335 FNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPV 389
F+ ILDW F++ + E D+ QL QIL + +A ++ +L V + F+W+S P+
Sbjct: 788 FSSILDWSAFSLRIAEADIEQLPQILLAVPEARLQAMQRSLRNVWQRFKWSSLPI 842
>gi|108863947|gb|ABA91286.2| secondary cell wall-related glycosyltransferase family 47,
putative, expressed [Oryza sativa Japonica Group]
gi|215769393|dbj|BAH01622.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 476
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 61/237 (25%), Positives = 107/237 (45%), Gaps = 13/237 (5%)
Query: 174 PYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALP--QV 231
P W R+ GADH V H + V F + VA + KDV P +
Sbjct: 219 PEWKRSGGADHVIVAHHPNSLLHARSVLFPAVFVLSDFGRYHPRVASL-EKDVIAPYKHM 277
Query: 232 LQPFALPRGGRDVENRTVLGFWAG----HRNSKIRVILARVWENDTELDISNNRINRAIG 287
+ F G D +R L ++ G IR L + +++ ++ + + G
Sbjct: 278 AKTFVNDSAGFD--DRPTLLYFRGAIFRKEGGNIRQELHYMLKDEKDVYFAFGSVQDH-G 334
Query: 288 PLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVI 347
+ + +KFC+ G +S R+ D+I CVPVI+S+ +LP+ D LD+ KF++
Sbjct: 335 ASKASQGMHASKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDDIELPYEDALDYSKFSIF 394
Query: 348 LNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYEL 401
+ D + L ++++ +S ++ + L +V KHF++ P + DA M+ L
Sbjct: 395 VRSSDAVKKGYLMRLIRGVSKHQWTMMWRRLKEVDKHFEYQYPSQKDDAVQMIWQTL 451
>gi|297820564|ref|XP_002878165.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297324003|gb|EFH54424.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 792
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 86/339 (25%), Positives = 137/339 (40%), Gaps = 80/339 (23%)
Query: 113 YASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMT----------- 159
Y S+ F++NI + T + + A FF+P+ SC R + NM
Sbjct: 390 YGSQMAFYENILATAHRTLNGEEADFFFVPVLDSCIINRADDAPHINMQNHTGLRSSFTL 449
Query: 160 VIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVV-------- 211
K + ++ KYPYWNR+ G DH + D G P I N++ +V
Sbjct: 450 EFYKRAYEHIVEKYPYWNRSAGRDHIWFFSWDEGACY---APKEIWNSMMLVHWGNTNSK 506
Query: 212 ---CSPSY------DVA---------FIPHKDVALP--QVLQPFALPRG--GRDVENRTV 249
+ +Y D++ F P KD+ +P +V P+++ R E R
Sbjct: 507 HNHSTTAYWGDNWDDISDERRGDHPCFDPRKDLVIPAWKVPDPYSMRANYWARPREKRKT 566
Query: 250 LGFWAGHRNSK-------------IRVILARVWENDTELDISNNRINR---------AIG 287
L ++ G+ IR LA E S N+ + +
Sbjct: 567 LFYFNGNLGPAYEKGRPEDSYSMGIRQKLAE------EFGSSPNKEGKLGKQHAEDVIVT 620
Query: 288 PLV---YQRRFYKTKFCIC-PGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRK 343
PL Y + + FC PG S R+ DSI GCVPVI+ + LP+ ++L++
Sbjct: 621 PLRSDNYHKDIANSIFCGAFPGDGW--SGRMEDSILQGCVPVIIQDGIYLPYENMLNYES 678
Query: 344 FAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHF 382
FAV ++E D+ L L+ S+ E N+ K+ + F
Sbjct: 679 FAVRVSEDDIPNLINTLRGFSETEIQFRLANVKKLWQRF 717
>gi|302845050|ref|XP_002954064.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300260563|gb|EFJ44781.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 1122
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 72/282 (25%), Positives = 115/282 (40%), Gaps = 48/282 (17%)
Query: 119 FFQNIRESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNR 178
F+Q + + T D D A +FIP++ T E +V+ + + YP+W++
Sbjct: 637 FWQRLMSAGIRTVDGDEADYYFIPVN--------TRTELAPGMVEWVLSYIRRTYPWWSK 688
Query: 179 TLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCS----------------PSYDVAFIP 222
G H + D+G+ +P +++ ++ S + A P
Sbjct: 689 DNGNRHLIIHTGDMGI---ADLPADMRSRLKSAFSNITWLTHWGIYQYHPVAKWYPAHRP 745
Query: 223 HKDVALPQVL--QPFALPRGGRDVENRTVLGFWAGHRNSKI----------RVILARVWE 270
KDV LP ++ Q F L VE R A RN + R+ R
Sbjct: 746 GKDVVLPVMVTTQGFHLSPLNPRVEAR------ARRRNQTMARSGTFFFAGRICGDRKPP 799
Query: 271 NDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNY 330
+ D S R + + G V Q KFC+ P G ++ S+ GCVPV++ N
Sbjct: 800 DPATGDCSRTRPDYSGG--VRQLDISSHKFCLAPLGGGHGKRQVLVSL-MGCVPVLIGNG 856
Query: 331 YDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLH 372
PF +DW +F+V + E D+ L +IL NISD +
Sbjct: 857 VLQPFEPEIDWSRFSVSVPEADIPDLPRILANISDQRVADMQ 898
>gi|357508935|ref|XP_003624756.1| Cysteine synthase [Medicago truncatula]
gi|355499771|gb|AES80974.1| Cysteine synthase [Medicago truncatula]
Length = 407
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 60/103 (58%), Gaps = 3/103 (2%)
Query: 298 TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLK 357
+KFC+ G +S R+ D+I CVPVI+S+ +LP+ D+LD+ KF V + RD + K
Sbjct: 48 SKFCLNIAGDTPSSNRLFDAIASHCVPVIISDEIELPYEDVLDYSKFCVFVRTRDAVKKK 107
Query: 358 QI---LKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMV 397
+ +++I E+ + N L +V+K F++ P DA M+
Sbjct: 108 YLINFIRSIGKDEWTRMWNRLKEVEKFFEFQFPSKEGDAVEMI 150
>gi|297815694|ref|XP_002875730.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297321568|gb|EFH51989.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 479
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 70/124 (56%), Gaps = 9/124 (7%)
Query: 277 ISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFN 336
+ N IN+A + + +KFC+ G +S R+ D+I CVPVI+S+ +LPF
Sbjct: 319 VRNGGINKA------SQGMHNSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDDIELPFE 372
Query: 337 DILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDA 393
D++D+ +FAV + D + L +++ IS E+ + N L +V+K+++++ P DA
Sbjct: 373 DVIDYSEFAVFVRTSDALKENFLVNLIRGISKEEWTRMWNRLKEVEKYYEFHFPSKVDDA 432
Query: 394 FHMV 397
M+
Sbjct: 433 VQMI 436
>gi|225439168|ref|XP_002268382.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase [Vitis
vinifera]
Length = 488
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 71/301 (23%), Positives = 136/301 (45%), Gaps = 39/301 (12%)
Query: 121 QNIRESRFLTHDPDRAHLFFIPISC-----HKMRGKGTSYENMTVIVKDYVDSLINKYPY 175
QN R + H+ A + F+P H + KG +N +++D + +
Sbjct: 161 QNARAA-IRVHNSSEADVIFVPFFSSLSYNHFGKFKGRQKKNENNLLQDKLVKFLTAQEE 219
Query: 176 WNRTLGADHFFVTCH-----DVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALP- 229
W R+ G DH + H D ++ + F++ + R ++A + KDV P
Sbjct: 220 WIRSEGRDHIIMAHHPNSMLDARMKLWPAI-FILSDFGRYPP----NIANV-GKDVIAPY 273
Query: 230 -QVLQPFALPRGGRDVENRTVLGFWAGHRNSK----IRVILARVWENDTEL-----DISN 279
V++ F D ++R L ++ G K IR L + +++ ++ +
Sbjct: 274 KHVIKSFI--NDTSDFDSRPTLLYFQGAIYRKDGGFIRQELFYLLKDEKDVHFAFGNTQG 331
Query: 280 NRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDIL 339
N IN+A + + +KFC+ G +S R+ D+I CVPVI+S+ +LP+ D+L
Sbjct: 332 NGINKA------SQGMHSSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDEIELPYEDVL 385
Query: 340 DWRKFAVILNERDVYQLK---QILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHM 396
D+ +F + + D + K +++++I E+ + L +V+ F++ P DA M
Sbjct: 386 DYSQFCIFVRTSDALKDKFLIKLIRSIKKDEWTRMWRRLKEVENFFEFQYPSKEGDAVQM 445
Query: 397 V 397
+
Sbjct: 446 I 446
>gi|356565097|ref|XP_003550781.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
[Glycine max]
Length = 472
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 63/266 (23%), Positives = 117/266 (43%), Gaps = 26/266 (9%)
Query: 152 GTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV---RATEGVPFLIKNAI 208
G ++ +++ + L+ + W R+ G DH V H + R G L+
Sbjct: 197 GKEKVSVNRMLQQRLVQLLMEREEWKRSGGRDHVIVAHHPNSILRARRKLGSAMLVLADF 256
Query: 209 RVVCSPSYDVAFIPHKDVALPQVLQPFALPRG-GRDVENRTVLGFWAG----HRNSKIRV 263
S ++ KD+ P +PR E R+ L ++ G IR
Sbjct: 257 GRYPSQLANIK----KDIIAPYRHLVSTVPRAESASYEERSTLLYFQGAIYRKDGGAIRQ 312
Query: 264 ILARVWENDTEL-----DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSI 318
L + +++ ++ I N IN+A + +KFC+ G +S R+ D+I
Sbjct: 313 KLYYLLKDEKDVHFAFGSIRKNGINQASQGMAL------SKFCLNVAGDTPSSNRLFDAI 366
Query: 319 HYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNL 375
CVPVI+S+ +LPF D+LD+ +F + ++ D + L +L++I ++ + L
Sbjct: 367 VSHCVPVIISDEIELPFEDVLDYSEFGLFVHASDAVRKGYLLNLLRSIKPEKWTQMWERL 426
Query: 376 VKVQKHFQWNSPPVRFDAFHMVMYEL 401
+ +HF++ P DA +M+ E+
Sbjct: 427 KDITQHFEYQYPSQPGDAVNMIWEEV 452
>gi|224090294|ref|XP_002308967.1| predicted protein [Populus trichocarpa]
gi|222854943|gb|EEE92490.1| predicted protein [Populus trichocarpa]
Length = 793
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 79/331 (23%), Positives = 134/331 (40%), Gaps = 71/331 (21%)
Query: 113 YASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMT----------- 159
Y ++ +++I S + T + + A FF+P+ SC R + +M
Sbjct: 392 YGAQMALYESILASPYRTLNGEEADFFFVPVLDSCIITRADDAPHLSMEQHLGLRSSLTL 451
Query: 160 VIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVV-------- 211
+ D ++ YP+WNR+ G DH + D G P I N++ VV
Sbjct: 452 EFYRKAYDHIVEHYPFWNRSSGRDHLWSFSWDEGACY---APKEIWNSMMVVHWGNTNSK 508
Query: 212 --------CSPSYDV----------AFIPHKDVALPQVLQP----FALPRGGRDVENRTV 249
+ ++D F P KD+ LP +P + R +E R
Sbjct: 509 HNHSTTAYWADNWDKISSDRRGKHPCFDPDKDLVLPAWKRPDVNALSTKLWARPLEKRKT 568
Query: 250 LGFWAGHRNSK-------------IRVILARVWENDTELDIS---NNRINRAIGPL---V 290
L ++ G+ IR LA + + D + + N + PL
Sbjct: 569 LFYFNGNLGPAYLNGRPEALYSMGIRQKLAEEFGSTPNKDGNLGKQHAENVIVSPLRSES 628
Query: 291 YQRRFYKTKFC-ICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
Y + FC + PG S R+ DSI GC+PV++ + LP+ ++L++ FAV +
Sbjct: 629 YHEDLASSVFCGVMPGDGW--SGRMEDSILQGCIPVVIQDGIYLPYENVLNYESFAVRIL 686
Query: 350 ERDVYQLKQILKNISDAEFVSLHNNLVKVQK 380
E ++ L +IL+ ++ E + N L VQK
Sbjct: 687 EDEIPNLIKILQGFNETE---IENKLTSVQK 714
>gi|125585107|gb|EAZ25771.1| hypothetical protein OsJ_09611 [Oryza sativa Japonica Group]
Length = 779
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 81/330 (24%), Positives = 140/330 (42%), Gaps = 62/330 (18%)
Query: 113 YASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENM--TVIVKDY--- 165
Y ++ +++I S T + D A F++P SC R + M + ++ Y
Sbjct: 379 YGAQIALYESILASPHRTLNGDEADYFYVPALDSCLITRSDDAPHLQMPRDLRLRSYHTL 438
Query: 166 ------VDSLINKYPYWNRTLGADHFFVTCHDVGV----RATEGVPFLI---------KN 206
D + +YPYWNRT G DH + D G + L+ KN
Sbjct: 439 EYYRMTYDHIAQRYPYWNRTSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWGNTNTKHKN 498
Query: 207 AIRVVCSPSYDV----------AFIPHKDVALPQVLQP----FALPRGGRDVENRTVLGF 252
+ + +++ F P KD+ LP QP L R NRT L +
Sbjct: 499 STTAYWADNWNYIPIDRRGNHPCFDPRKDLVLPAWKQPNPAAIWLKLWARTRNNRTTLFY 558
Query: 253 WAG----------HRNS---KIRVILARVWENDTELDISNNRINRAIGPLVYQR--RFYK 297
+ G H ++ IR LA + + + R + A + Y R ++Y+
Sbjct: 559 FNGNLGPAYKDGRHEDTYSMGIRQKLAAEFGSTPDKQGKLGRQHTANVTVTYLRTEKYYE 618
Query: 298 ----TKFC-ICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERD 352
+ FC + PG S R+ DS+ GC+PVI+ + LP+ ++L++ FAV + E D
Sbjct: 619 ELASSIFCGVLPGDGW--SGRMEDSMLQGCIPVIIQDGILLPYENMLNYNSFAVRIQEDD 676
Query: 353 VYQLKQILKNISDAEFVSLHNNLVKVQKHF 382
+ L +IL+ I++ + + N+ ++ + F
Sbjct: 677 IPNLIRILRGINETQVEFMLRNVRQIWQRF 706
>gi|218192182|gb|EEC74609.1| hypothetical protein OsI_10218 [Oryza sativa Indica Group]
Length = 737
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 81/330 (24%), Positives = 140/330 (42%), Gaps = 62/330 (18%)
Query: 113 YASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENM--TVIVKDY--- 165
Y ++ +++I S T + D A F++P SC R + M + ++ Y
Sbjct: 337 YGAQIALYESILASPHRTLNGDEADYFYVPALDSCLITRSDDAPHLQMPRDLRLRSYHTL 396
Query: 166 ------VDSLINKYPYWNRTLGADHFFVTCHDVGV----RATEGVPFLI---------KN 206
D + +YPYWNRT G DH + D G + L+ KN
Sbjct: 397 EYYRMTYDHIAQRYPYWNRTSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWGNTNTKHKN 456
Query: 207 AIRVVCSPSYDV----------AFIPHKDVALPQVLQP----FALPRGGRDVENRTVLGF 252
+ + +++ F P KD+ LP QP L R NRT L +
Sbjct: 457 STTAYWADNWNYIPIDRRGNHPCFDPRKDLVLPAWKQPNPAAIWLKLWARTRNNRTTLFY 516
Query: 253 WAG----------HRNS---KIRVILARVWENDTELDISNNRINRAIGPLVYQR--RFYK 297
+ G H ++ IR LA + + + R + A + Y R ++Y+
Sbjct: 517 FNGNLGPAYKDGRHEDTYSMGIRQKLAAEFGSTPDKQGKLGRQHTANVTVTYLRTEKYYE 576
Query: 298 ----TKFC-ICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERD 352
+ FC + PG S R+ DS+ GC+PVI+ + LP+ ++L++ FAV + E D
Sbjct: 577 ELASSIFCGVLPGDGW--SGRMEDSMLQGCIPVIIQDGILLPYENMLNYNSFAVRIQEDD 634
Query: 353 VYQLKQILKNISDAEFVSLHNNLVKVQKHF 382
+ L +IL+ I++ + + N+ ++ + F
Sbjct: 635 IPNLIRILRGINETQVEFMLRNVRQIWQRF 664
>gi|414866628|tpg|DAA45185.1| TPA: hypothetical protein ZEAMMB73_313698 [Zea mays]
Length = 588
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 47/164 (28%), Positives = 75/164 (45%), Gaps = 18/164 (10%)
Query: 243 DVENRTVLGFWAGHRNSKIRVILARVWEN---DTELDISNNRINRAIGPLVYQRRFYKTK 299
D N G++A R A VWEN + DIS + P Y ++
Sbjct: 406 DTSNDPEGGYYA-------RGACASVWENFKNNPLFDISTDH------PPTYYEDMQRSV 452
Query: 300 FCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQI 359
FC+CP G S R+ +++ +GC+PVI+++ LPF D + W + V + E DV +L I
Sbjct: 453 FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVAEEDVPKLDSI 512
Query: 360 LKNISDAEFVSLHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
L +I + L +++ + P DAFH ++ L
Sbjct: 513 LMSIPTDVILRKQRLLANPSMKQAMLFPQPAQAGDAFHQILNGL 556
>gi|357508931|ref|XP_003624754.1| Secondary cell wall-related glycosyltransferase family [Medicago
truncatula]
gi|355499769|gb|AES80972.1| Secondary cell wall-related glycosyltransferase family [Medicago
truncatula]
Length = 510
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 60/103 (58%), Gaps = 3/103 (2%)
Query: 298 TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLK 357
+KFC+ G +S R+ D+I CVPVI+S+ +LP+ D+LD+ KF V + RD + K
Sbjct: 364 SKFCLNIAGDTPSSNRLFDAIASHCVPVIISDEIELPYEDVLDYSKFCVFVRTRDAVKKK 423
Query: 358 QI---LKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMV 397
+ +++I E+ + N L +V+K F++ P DA M+
Sbjct: 424 YLINFIRSIGKDEWTRMWNRLKEVEKFFEFQFPSKEGDAVEMI 466
>gi|395521813|ref|XP_003765009.1| PREDICTED: exostosin-like 1 [Sarcophilus harrisii]
Length = 694
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 31/95 (32%), Positives = 54/95 (56%), Gaps = 5/95 (5%)
Query: 292 QRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNER 351
Q + + FC P S R+ ++ GCVPV+LS ++LPF +++DW AVI++ER
Sbjct: 283 QNGLHNSTFCFIPSSCHAGSFRLLQALKAGCVPVLLSRGWELPFAEVIDWGTAAVIIDER 342
Query: 352 DVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
+ Q+K +L+ + A ++L + Q F W++
Sbjct: 343 HLLQIKSVLQGLPPARVLAL-----RQQTQFLWDA 372
>gi|441647993|ref|XP_004090847.1| PREDICTED: LOW QUALITY PROTEIN: exostosin-1 [Nomascus leucogenys]
Length = 746
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 77/329 (23%), Positives = 130/329 (39%), Gaps = 57/329 (17%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
FK+Y+YP Q K+ Y + I SRF T DP +A LF + +
Sbjct: 113 GFKVYVYPQ-------QKGEKIAESYQN---ILAAIEGSRFYTSDPSQACLFXLSLDTLD 162
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHD-VGVRATEGVPFLIKN 206
Y + + S + WN G +H + TE V F I
Sbjct: 163 RDQLSPQYVHN-------LRSKVQSLHLWNN--GRNHLIFNLYSGTWPDYTEDVGFDIGQ 213
Query: 207 AIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDV-----------------ENRTV 249
A+ S S + F P+ DV++P + G R R +
Sbjct: 214 AMLAKASISTE-NFRPNFDVSIPLFSKDHPRTGGERGFLKFNTIPPLRKYMLVFKGKRYL 272
Query: 250 LGFWAGHRNSKIRV------IL------ARVWENDTELDISNNRINRAIGPLVYQRRFYK 297
G + RN+ V +L + W+ D +R N Y+ +
Sbjct: 273 TGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHK--DSRCDRDNTEYEKYDYREMLHN 330
Query: 298 TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLK 357
FC+ P G ++ S +++ CVPV+LSN ++LPF+++++W + AVI +ER + Q+
Sbjct: 331 ATFCLVPRGRRLGSFIFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIP 390
Query: 358 QILKNISDAEFVSLHNNLVKVQKHFQWNS 386
+++I + ++L + Q F W +
Sbjct: 391 STIRSIHQDKILAL-----RQQTQFLWEA 414
>gi|147818414|emb|CAN68950.1| hypothetical protein VITISV_039537 [Vitis vinifera]
Length = 488
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 70/301 (23%), Positives = 136/301 (45%), Gaps = 39/301 (12%)
Query: 121 QNIRESRFLTHDPDRAHLFFIPISC-----HKMRGKGTSYENMTVIVKDYVDSLINKYPY 175
QN R + H+ A + F+P H + KG +N +++D + +
Sbjct: 161 QNARXA-IRVHNSSEADVIFVPFFSSLSYNHFGKFKGRQKKNENNLLQDKLVKFLTAQEE 219
Query: 176 WNRTLGADHFFVTCH-----DVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALP- 229
W R+ G DH + H D ++ + F++ + R ++A + KD+ P
Sbjct: 220 WIRSEGRDHIIMAHHPNSMLDARMKLWPAI-FILSDFGRYPP----NIANV-GKDLIAPY 273
Query: 230 -QVLQPFALPRGGRDVENRTVLGFWAGHRNSK----IRVILARVWENDTEL-----DISN 279
V++ F D ++R L ++ G K IR L + +++ ++ +
Sbjct: 274 KHVIKSFI--NDTSDFDSRPTLLYFQGAIYRKDGGFIRQELFYLLKDEKDVHFAFGNTQG 331
Query: 280 NRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDIL 339
N IN+A + + +KFC+ G +S R+ D+I CVPVI+S+ +LP+ D+L
Sbjct: 332 NGINKA------SQGMHSSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDEIELPYEDVL 385
Query: 340 DWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHM 396
D+ +F + + D + L +++++I E+ + L +V+ F++ P DA M
Sbjct: 386 DYSQFCIFVRTSDALKDKFLXKLIRSIKKDEWTRMWRRLKEVENFFEFQYPSKEGDAVQM 445
Query: 397 V 397
+
Sbjct: 446 I 446
>gi|384490457|gb|EIE81679.1| hypothetical protein RO3G_06384 [Rhizopus delemar RA 99-880]
Length = 793
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 68/310 (21%), Positives = 130/310 (41%), Gaps = 51/310 (16%)
Query: 128 FLTHDPDRAHLFFIPI--SCHKMR---GKGTSYENMTVIVKDYVDSL----INKYPYWNR 178
++T +P+ A FFIP SC+ +++ + YVD L I +YPYWN+
Sbjct: 483 YVTENPEEADFFFIPFFGSCYLYNCWYENKWNWDERCEVDAKYVDPLMDMVIQEYPYWNK 542
Query: 179 TLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIP----HKDVALPQV--- 231
T G +H + D + P ++AI + ++ H+D+ +P
Sbjct: 543 TGGRNHIMIHPMDKTFTYYQSNPRF-QSAIFLKTVGDKRNKWMSRHRYHRDIVIPSATRM 601
Query: 232 -----------LQPFALPRGGRDVENRTVLGFWAG---------HRNSKIRVILARVWEN 271
L P+ G+ R + + G ++ IR + + +
Sbjct: 602 IHHLRANPLDYLNAQGQPKSGK----RDIFALFQGCCPDVQPTDEYSNGIRSLFFNHFAH 657
Query: 272 DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYY 331
I +++ Y + + K+ + P G +++ RI + + +G VPV++++
Sbjct: 658 -----YPGYEIGQSVADEEYLEKLSRAKYGLSPMGWTLDTTRIWEFMAFGVVPVVIADGI 712
Query: 332 DLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRF 391
PF +DW KF V + +V++L +ILK+I D + NL + + +
Sbjct: 713 IEPFEFDVDWDKFIVRIRRDEVHRLDEILKSIDDKTYEYKQKNLWEFGRRVG-----LEM 767
Query: 392 DAFHMVMYEL 401
DA+H ++ EL
Sbjct: 768 DAWHFIVREL 777
>gi|296085881|emb|CBI31205.3| unnamed protein product [Vitis vinifera]
Length = 455
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 68/292 (23%), Positives = 132/292 (45%), Gaps = 38/292 (13%)
Query: 130 THDPDRAHLFFIPISC-----HKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADH 184
H+ A + F+P H + KG +N +++D + + W R+ G DH
Sbjct: 136 VHNSSEADVIFVPFFSSLSYNHFGKFKGRQKKNENNLLQDKLVKFLTAQEEWIRSEGRDH 195
Query: 185 FFVTCH-----DVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALP--QVLQPFAL 237
+ H D ++ + F++ + R ++A + KDV P V++ F
Sbjct: 196 IIMAHHPNSMLDARMKLWPAI-FILSDFGRY----PPNIANV-GKDVIAPYKHVIKSFI- 248
Query: 238 PRGGRDVENRTVLGFWAGHRNSK----IRVILARVWENDTEL-----DISNNRINRAIGP 288
D ++R L ++ G K IR L + +++ ++ + N IN+A
Sbjct: 249 -NDTSDFDSRPTLLYFQGAIYRKDGGFIRQELFYLLKDEKDVHFAFGNTQGNGINKA--- 304
Query: 289 LVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVIL 348
+ + +KFC+ G +S R+ D+I CVPVI+S+ +LP+ D+LD+ +F + +
Sbjct: 305 ---SQGMHSSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDEIELPYEDVLDYSQFCIFV 361
Query: 349 NERDVYQLK---QILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMV 397
D + K +++++I E+ + L +V+ F++ P DA M+
Sbjct: 362 RTSDALKDKFLIKLIRSIKKDEWTRMWRRLKEVENFFEFQYPSKEGDAVQMI 413
>gi|255070683|ref|XP_002507423.1| glycosyltransferase family 47 protein [Micromonas sp. RCC299]
gi|226522698|gb|ACO68681.1| glycosyltransferase family 47 protein [Micromonas sp. RCC299]
Length = 698
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 36/114 (31%), Positives = 65/114 (57%), Gaps = 3/114 (2%)
Query: 281 RINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILD 340
R + + P Y R +KFC+ G++V S R+ + + +GCVPVI+++ Y P + + D
Sbjct: 520 RSDGQVSPSRYMRLMRDSKFCLHVRGTRVQSPRLIEGMLFGCVPVIVADGYVPPLSWLFD 579
Query: 341 WRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAF 394
W KF+V L E + +L ++L+ + ++ +L NL +V F ++ P+ DA
Sbjct: 580 WSKFSVRLPEVEHERLPEVLQGV---DWATLQANLRRVAPFFVYHRTPIPGDAL 630
>gi|326918032|ref|XP_003205297.1| PREDICTED: exostosin-1-like, partial [Meleagris gallopavo]
Length = 535
Score = 71.2 bits (173), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 58/96 (60%), Gaps = 5/96 (5%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y+ + FC+ P G ++ S R +++ CVPV+LSN ++LPF++++DW++ AVI +E
Sbjct: 162 YREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWKQAAVIGDE 221
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
R + Q+ +++I + ++L + Q F W +
Sbjct: 222 RLLLQIPSTIRSIHQDKILAL-----RQQTQFLWEA 252
>gi|145355370|ref|XP_001421936.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582175|gb|ABP00230.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 490
Score = 71.2 bits (173), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 65/104 (62%), Gaps = 1/104 (0%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + ++FC+ G++V+S R+ +S+ +GCVPVIL++ Y+LP + ++DW F+V++ E
Sbjct: 350 YTAKMKNSRFCLYMRGTRVHSPRLIESMLFGCVPVILADDYELPLSWLVDWSAFSVMIPE 409
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAF 394
RD + L+ +++++ ++H L V F + P+ DAF
Sbjct: 410 RDFQTIPDALER-ANSDWDAMHMRLQMVLPLFLYRRRPLVGDAF 452
>gi|391340944|ref|XP_003744793.1| PREDICTED: exostosin-1a-like [Metaseiulus occidentalis]
Length = 1047
Score = 71.2 bits (173), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 78/319 (24%), Positives = 133/319 (41%), Gaps = 77/319 (24%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
F++YIYP D T KL IR SRF T +P+ A LF +
Sbjct: 358 GFRVYIYPLEDGQAVSATYMKL----------LSAIRRSRFYTSNPEEACLFVPNVDTL- 406
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKY---PYWNRTLGADHFFVT---------CHDVGVR 195
+ ++ +YV S + P+WN G +H D+G
Sbjct: 407 ---------DRDILSDEYVRSAQARLWNLPHWNG--GRNHLIFNLFSGSWPDYSQDLGF- 454
Query: 196 ATEGVPFLIKN-AIRVVCSPSYDVAFIP-----HKDV--------ALPQVLQP----FAL 237
G+ L K+ A + P YD++ IP H ++ + +++ P F L
Sbjct: 455 -DPGLAMLAKSSAPETIFRPGYDIS-IPLFPRTHPEIGGEPGFSSSESKIITPLRKRFLL 512
Query: 238 PRGGRDVENRTVLGFWAGHRNSKIR------VIL------ARVWE--NDTELDISNNRIN 283
G+ R + G + RNS V+L + W+ D D N +
Sbjct: 513 TFKGK----RYLYGIGSEIRNSLFHLNNVNDVLLLTTCKHGKQWKLKKDERCDSDNADYD 568
Query: 284 RAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRK 343
+ ++ Q + FC+ P G ++ S R +S+ GC+P++LSN + LPF++++DW+
Sbjct: 569 KQDYTVLMQ----NSTFCLVPRGRRLGSFRFLESLQAGCIPIVLSNDWRLPFDEVIDWKS 624
Query: 344 FAVILNERDVYQLKQILKN 362
+ +ER ++QL L++
Sbjct: 625 ATIRWDERLLFQLPHFLRS 643
>gi|322795454|gb|EFZ18199.1| hypothetical protein SINV_05251 [Solenopsis invicta]
Length = 711
Score = 71.2 bits (173), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 81/328 (24%), Positives = 132/328 (40%), Gaps = 58/328 (17%)
Query: 88 NFKIYIYPDGDP-NTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCH 146
F +Y+YP D + YQ I ESR+ T DP RA +F + +
Sbjct: 88 GFTVYVYPVEDVISPLYQK-------------ILNVITESRYYTSDPARACIFVLALDTL 134
Query: 147 KMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV--RATEGVPFLI 204
T + V + L++ PYWN G +H + A E + F +
Sbjct: 135 DRDPLSTEF------VHNLPSKLLH-LPYWNN--GRNHLIFNLYSGTWPDYAEESLAFDV 185
Query: 205 KNAIRVVCSPSYDVAFIPHKDVALPQVLQ--PFALPRGGRDVEN---------------R 247
A+ S S P DV++P + P G+ +EN R
Sbjct: 186 GYAMLAKASMSI-FRHRPDFDVSIPLFGKQHPERGGESGQALENNFPNNKKYVAAFKGKR 244
Query: 248 TVLGFWAGHRNSK---------IRVILARVWENDTEL-DISNNRINRAIGPLVYQRRFYK 297
V G + RN+ + V R + EL D + N+ Y+
Sbjct: 245 YVHGIGSETRNALYHLHNGKDLVFVTTCRHGKAWRELQDEHCQQDNQEYDTYDYEILLMN 304
Query: 298 TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLK 357
FC+ P G ++ S R +++ GC+PVILSN + LPF++ +DW + + +ER + Q+
Sbjct: 305 ATFCLVPRGRRLGSFRFLEALRSGCIPVILSNGWALPFHERIDWTQAVIFSDERLLLQIP 364
Query: 358 QILKNISDAEFVSLHNNLVKVQKHFQWN 385
IL+++S+ + + V+ Q F W
Sbjct: 365 DILRSVSNVQILK-----VRQQTQFLWE 387
>gi|47207620|emb|CAG13862.1| unnamed protein product [Tetraodon nigroviridis]
Length = 426
Score = 71.2 bits (173), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 55/96 (57%), Gaps = 5/96 (5%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y+ + + FC+ P G ++ S R +++ CVPV+LSN ++LPF++I+DW AVI +E
Sbjct: 4 YKEMLHNSTFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEIIDWNTAAVIGDE 63
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
R + Q+ +++I +SL Q F W +
Sbjct: 64 RLLLQIPTTVRSIHQDRILSLRQ-----QTQFLWEA 94
>gi|393907069|gb|EFO19790.2| exostosin-1 [Loa loa]
Length = 677
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 74/304 (24%), Positives = 132/304 (43%), Gaps = 50/304 (16%)
Query: 89 FKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHKM 148
F++Y+YP + + + + IRES + T DP +A LF + I
Sbjct: 81 FRVYVYPSDNSSAMSIVYNNI----------LKVIRESMYYTDDPQKACLFVLGIDTVDR 130
Query: 149 RGKGTSYENMTVIVKDYVDSLINKYP--YWNRTLGADHFFVTC----------HDVGVRA 196
+ +Y VK YV+ LI P WNR G +H HD+G
Sbjct: 131 DRRSENY------VK-YVNELIGNLPTEIWNR--GRNHVIFNLYHGTYPDYSDHDLGFDT 181
Query: 197 TEGVPFLIKNAIRV-VCSPSYDVAF-IPHKDVALPQVLQP-FALPRGGRDV----ENRTV 249
G + + + V ++D++F + HK+ L ++ ++L + + R V
Sbjct: 182 --GYALIARASANTRVFRENFDLSFPLFHKEHPLRTTIKAEWSLKIKDKYLVSFKGKRYV 239
Query: 250 LGFWAGHRNS------KIRVILARVWENDTEL----DISNNRINRAIGPLVYQRRFYKTK 299
G + R+S VI+ +++T+ D N Y+ +
Sbjct: 240 YGIGSETRDSLYHLHNGQSVIMVTTCKHNTDWKKYEDGRCEEDNVEYDHWDYEMTMANST 299
Query: 300 FCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQI 359
FC+ P G ++ S R +++ GC+PV+LS+ ++LPF++++DWR+ VI +E V + +
Sbjct: 300 FCLTPRGRRLGSFRFLEALRLGCIPVVLSDDWELPFSEVIDWRQAVVIGHEDTVLTISDV 359
Query: 360 LKNI 363
L I
Sbjct: 360 LSAI 363
>gi|312084451|ref|XP_003144281.1| exostosin-1 [Loa loa]
Length = 694
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 74/304 (24%), Positives = 132/304 (43%), Gaps = 50/304 (16%)
Query: 89 FKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHKM 148
F++Y+YP + + + + IRES + T DP +A LF + I
Sbjct: 81 FRVYVYPSDNSSAMSIVYNNI----------LKVIRESMYYTDDPQKACLFVLGIDTVDR 130
Query: 149 RGKGTSYENMTVIVKDYVDSLINKYP--YWNRTLGADHFFVTC----------HDVGVRA 196
+ +Y VK YV+ LI P WNR G +H HD+G
Sbjct: 131 DRRSENY------VK-YVNELIGNLPTEIWNR--GRNHVIFNLYHGTYPDYSDHDLGFDT 181
Query: 197 TEGVPFLIKNAIRV-VCSPSYDVAF-IPHKDVALPQVLQP-FALPRGGRDV----ENRTV 249
G + + + V ++D++F + HK+ L ++ ++L + + R V
Sbjct: 182 --GYALIARASANTRVFRENFDLSFPLFHKEHPLRTTIKAEWSLKIKDKYLVSFKGKRYV 239
Query: 250 LGFWAGHRNS------KIRVILARVWENDTEL----DISNNRINRAIGPLVYQRRFYKTK 299
G + R+S VI+ +++T+ D N Y+ +
Sbjct: 240 YGIGSETRDSLYHLHNGQSVIMVTTCKHNTDWKKYEDGRCEEDNVEYDHWDYEMTMANST 299
Query: 300 FCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQI 359
FC+ P G ++ S R +++ GC+PV+LS+ ++LPF++++DWR+ VI +E V + +
Sbjct: 300 FCLTPRGRRLGSFRFLEALRLGCIPVVLSDDWELPFSEVIDWRQAVVIGHEDTVLTISDV 359
Query: 360 LKNI 363
L I
Sbjct: 360 LSAI 363
>gi|154152143|ref|NP_001093807.1| exostosin-like 1 [Bos taurus]
gi|151556262|gb|AAI50062.1| EXTL1 protein [Bos taurus]
gi|296489981|tpg|DAA32094.1| TPA: exostoses-like 1 [Bos taurus]
Length = 675
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 79/316 (25%), Positives = 122/316 (38%), Gaps = 74/316 (23%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
K+++YP P +TPRK+ +I SR+ T A L + +S
Sbjct: 93 GLKVFVYPAVGP--ISETPRKV----------LTSIEGSRYHTASAAEACLLILVLSPDT 140
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNA 207
G+ + P W+ G +H ++ H F + A
Sbjct: 141 PAGE-----------------CHPELPQWDG--GKNHLVLSLHPAPCLRI----FQLGQA 177
Query: 208 IRVVCSPSYDVAFIPHKDVALPQVLQPFALP-RGGRDVENRTVLGFWAGHRNSKIRVILA 266
+ SP+ D F P DVALP L P A P RGG LG H ++A
Sbjct: 178 MVAEASPTMDT-FRPGFDVALP--LLPEAHPLRGG-------ALGQLQQHSPLPGVALIA 227
Query: 267 RVWENDTELDISNN--------RINRAIGPLVYQRR--FYKTKFCICPGGSQVNSARIAD 316
E + N R + GP R FC+ PG R D
Sbjct: 228 MAEERGGWRTVGTNFSACPWDGRCEQDHGPKQTHPRATLPNATFCLIPG-------RNPD 280
Query: 317 SIHY------GCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVS 370
++H+ GC+PV+LS ++LPF++++DW K AV+ +ER Q+ L+ + ++
Sbjct: 281 ALHFLQALQAGCIPVLLSPRWELPFSEVIDWTKAAVVADERLPLQVLAALQEMPLTRVLA 340
Query: 371 LHNNLVKVQKHFQWNS 386
L Q F W++
Sbjct: 341 LRQ-----QAQFLWDA 351
>gi|307197832|gb|EFN78943.1| Exostosin-1 [Harpegnathos saltator]
Length = 711
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 81/328 (24%), Positives = 133/328 (40%), Gaps = 58/328 (17%)
Query: 88 NFKIYIYPDGDP-NTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCH 146
F +Y+YP D + YQ I ESR+ T DP RA +F + +
Sbjct: 88 GFTVYVYPVEDVISPLYQK-------------ILNVITESRYYTSDPARACIFVLALDTL 134
Query: 147 KMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV--RATEGVPFLI 204
T + V + L++ PYWN G +H + A E + F +
Sbjct: 135 DRDPLSTEF------VHNLPSKLLH-LPYWNN--GRNHLIFNLYSGTWPDYAEESLVFDV 185
Query: 205 KNAIRVVCSPSYDVAFIPHKDVALPQVLQ--PFALPRGGRDVEN---------------R 247
AI S S + P DV++P + P G+ +EN R
Sbjct: 186 GYAILAKASMSI-FRYRPDFDVSIPLFGKQHPERGGEPGQALENNFPSNKKYVAAFKGKR 244
Query: 248 TVLGFWAGHRNSK---------IRVILARVWENDTEL-DISNNRINRAIGPLVYQRRFYK 297
V G + RN+ + V R + EL D + N+ Y+
Sbjct: 245 YVHGIGSETRNALYHLHNGKDLVFVTTCRHGKAWRELQDEHCQQDNQEYDTYDYEILLMN 304
Query: 298 TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLK 357
FC+ P G ++ S R +++ GC+PVILSN + LPF++ +DW + + +ER + Q+
Sbjct: 305 ATFCLVPRGRRLGSFRFLEALRAGCIPVILSNGWALPFHERIDWTQAVIFSDERLLLQIP 364
Query: 358 QILKNISDAEFVSLHNNLVKVQKHFQWN 385
I++++S+ + + L + Q F W
Sbjct: 365 DIVRSVSNVQILKL-----RQQTQFLWE 387
>gi|255571564|ref|XP_002526728.1| catalytic, putative [Ricinus communis]
gi|223533917|gb|EEF35642.1| catalytic, putative [Ricinus communis]
Length = 728
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 80/333 (24%), Positives = 136/333 (40%), Gaps = 68/333 (20%)
Query: 113 YASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENM-------TVIVK 163
Y ++ ++++ S + T + + A FF+PI SC R + +M + +
Sbjct: 328 YGAQMALYESLLASPYRTLNGEEADFFFVPILDSCIITRADDAPHLSMQDHMGLRSSLTL 387
Query: 164 DYV----DSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVV-------- 211
+Y D ++ YPYWNRT G DH + D G P I N++ +V
Sbjct: 388 EYYRKAYDHIVEHYPYWNRTSGRDHIWFFSWDEGACY---APKEIWNSMMLVHWGNTNSK 444
Query: 212 --------CSPSYDV----------AFIPHKDVALPQVLQP----FALPRGGRDVENRTV 249
+ ++D F P KD+ LP +P + R +E R
Sbjct: 445 HNHSTTAYWADNWDKISSDRRGRHPCFDPDKDLVLPAWKRPDVSALSTKLWARPLERRKT 504
Query: 250 LGFWAGHRNSK-------------IRVILARVWENDTELDISNNRINRA---IGPLV--- 290
L F+ G+ IR LA + + D + + + PL
Sbjct: 505 LFFFNGNLGPAYPNGRPELSYSMGIRQKLAEEFGSSPNKDGKLGKQHAEDVIVTPLRSEN 564
Query: 291 YQRRFYKTKFC-ICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
Y + FC + PG S R+ DSI GC+PVI+ + LP+ ++L++ FAV +
Sbjct: 565 YHEDLASSIFCGVLPGDGW--SGRMEDSILQGCIPVIIQDGIFLPYENVLNYESFAVRIR 622
Query: 350 ERDVYQLKQILKNISDAEFVSLHNNLVKVQKHF 382
E ++ L +IL+ ++ E N+ K+ + F
Sbjct: 623 EDEISNLLKILRGFNETEKEFKLANVRKIWQRF 655
>gi|357157638|ref|XP_003577864.1| PREDICTED: probable glycosyltransferase At3g07620-like
[Brachypodium distachyon]
Length = 485
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 58/242 (23%), Positives = 110/242 (45%), Gaps = 23/242 (9%)
Query: 174 PYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALP--QV 231
P W R+ GADH V H + F + VA + KD+ P +
Sbjct: 228 PEWKRSGGADHVIVAHHPNSLLHARSALFPAVFVLSDFGRYHPRVASL-EKDLVAPYRHM 286
Query: 232 LQPFALPRGGRDVENRTVLGFWAG----HRNSKIRVILARVWENDTEL-----DISNNRI 282
+ F G D +R L ++ G IR L + +++ ++ + ++ +
Sbjct: 287 AKTFVNDTAGFD--DRPTLLYFRGAIYRKEGGNIRQELYNMLKDEKDVFFSFGSVQDHGV 344
Query: 283 NRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWR 342
++A + + +KFC+ G +S R+ D+I CVPVI+S+ +LP+ D+LD+
Sbjct: 345 SKA------SQGMHSSKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDDIELPYEDVLDYS 398
Query: 343 KFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMY 399
KF++ + D + L ++++ ++ + + L +V KHF++ P + DA M+
Sbjct: 399 KFSIFVRSSDAVKRGYLMKLIRGVTKHRWTRMWKRLKEVDKHFEYQFPSRKDDAVQMIWQ 458
Query: 400 EL 401
L
Sbjct: 459 AL 460
>gi|449278615|gb|EMC86416.1| Exostosin-1 [Columba livia]
Length = 1015
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 56/96 (58%), Gaps = 5/96 (5%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y+ + FC+ P G ++ S R +++ CVPV+LSN ++LPF++++DW + AVI +E
Sbjct: 38 YREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWNQAAVIGDE 97
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
R + Q+ +++I + ++L Q F W +
Sbjct: 98 RLLLQIPSTIRSIHQDKILALRQ-----QTQFLWEA 128
>gi|432882349|ref|XP_004073986.1| PREDICTED: exostosin-1b-like [Oryzias latipes]
Length = 429
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 31/96 (32%), Positives = 56/96 (58%), Gaps = 5/96 (5%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y+ + + FC+ P G ++ S R +++ CVPV+LSN ++LPF++I+DW + VI +E
Sbjct: 7 YKEMLHNSTFCLVPRGRRLGSFRFLEALQAACVPVVLSNGWELPFSEIIDWNRATVIGDE 66
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
R + Q+ +++I + +SL Q F W +
Sbjct: 67 RLLLQIPTTVRSIHPDKILSLRQ-----QTQFLWEA 97
>gi|356506196|ref|XP_003521873.1| PREDICTED: probable glucuronosyltransferase GUT1-like [Glycine max]
Length = 505
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 63/273 (23%), Positives = 123/273 (45%), Gaps = 21/273 (7%)
Query: 139 FFIPISCHKMRGKGTSYE--NMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCH-----D 191
FF + ++ K +E +M ++++ + + + W R+ G DH V H D
Sbjct: 208 FFSSLCYNRFSSKTNPHEKRSMNKVLQEKLVKYVTEQEEWKRSGGKDHVIVAHHPNSMLD 267
Query: 192 VGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVENRTVLG 251
++ G F++ + R P+ ++A + KDV P + ++R L
Sbjct: 268 ARMKLWPGT-FILSDFGRY---PT-NIANV-EKDVIAPYKHVVGSYDNDQSSFDSRPTLL 321
Query: 252 FWAG----HRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGS 307
++ G +R L + +N+ ++ S + + G +KFC+ G
Sbjct: 322 YFQGAIYRKDGGHVRHELYYLVKNEKDVHFSFGNVEKG-GVRNAAEGMRSSKFCLNIAGD 380
Query: 308 QVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ---LKQILKNIS 364
+S R+ D+I CVPVI+S+ +LP+ D++D+ +F V + RD + L +++I
Sbjct: 381 TPSSNRLFDAIASHCVPVIISDEIELPYEDVIDYSQFCVFVRTRDALKKRYLINFIRSIG 440
Query: 365 DAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMV 397
E+ + N L +V+ F++ P DA M+
Sbjct: 441 KEEWTRMWNRLKEVESFFEFQFPSKEGDAVQMI 473
>gi|395854733|ref|XP_003799834.1| PREDICTED: exostosin-like 1 [Otolemur garnettii]
Length = 678
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 60/222 (27%), Positives = 98/222 (44%), Gaps = 32/222 (14%)
Query: 176 WNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPF 235
WN G +H ++ H T + A+ SP+ D +F P DVALP L P
Sbjct: 155 WNG--GKNHLVLSLHPALCPRT----LQLGQAMVAKASPTVD-SFRPGFDVALP--LLPE 205
Query: 236 ALP-RGGRDVENRTVLGFWAGHRNSKIRVILA--------RVWENDTELDISNNRINRAI 286
A P +GG + R H +LA R ++ + + +
Sbjct: 206 AHPLQGGVSGQLRQ-------HSPLPGVALLALEEERGGWRTAGTNSSACLWDGHCKQDP 258
Query: 287 GPLVYQRR--FYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKF 344
GP + R FC+ G S R ++ GC+PV+LS ++LPF++++DW K
Sbjct: 259 GPELTHSRETLPNATFCLISGKRPDASLRFLQALQAGCIPVLLSPPWELPFSEVIDWTKA 318
Query: 345 AVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
A+I +ER + Q+ L+ +S A ++L + Q F W++
Sbjct: 319 AIIADERLLLQVPAALQEMSPARVLAL-----RQQTQFLWDA 355
>gi|414866629|tpg|DAA45186.1| TPA: hypothetical protein ZEAMMB73_313698 [Zea mays]
Length = 206
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 47/164 (28%), Positives = 75/164 (45%), Gaps = 18/164 (10%)
Query: 243 DVENRTVLGFWAGHRNSKIRVILARVWEN---DTELDISNNRINRAIGPLVYQRRFYKTK 299
D N G++A R A VWEN + DIS + P Y ++
Sbjct: 24 DTSNDPEGGYYA-------RGACASVWENFKNNPLFDISTDH------PPTYYEDMQRSV 70
Query: 300 FCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQI 359
FC+CP G S R+ +++ +GC+PVI+++ LPF D + W + V + E DV +L I
Sbjct: 71 FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVAEEDVPKLDSI 130
Query: 360 LKNISDAEFVSLHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
L +I + L +++ + P DAFH ++ L
Sbjct: 131 LMSIPTDVILRKQRLLANPSMKQAMLFPQPAQAGDAFHQILNGL 174
>gi|124301269|gb|ABN04853.1| Exostosin-like [Medicago truncatula]
Length = 196
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 40/134 (29%), Positives = 71/134 (52%), Gaps = 4/134 (2%)
Query: 257 RNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIAD 316
+ KIR L ++ E + ++ I + +R Q + +KFC+ P G ++ R+ D
Sbjct: 16 QGGKIRDTLFQILEKEDDVIIKHGAQSRESRRAASQG-MHTSKFCLHPAGDTPSACRLFD 74
Query: 317 SIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHN 373
+I CVPVI+S+ +LPF D +D+RK AV + Q L IL+ ++ V
Sbjct: 75 AIVSLCVPVIVSDSIELPFEDTIDYRKIAVFVETAAAIQPGYLVSILRGMAPDRIVEYQK 134
Query: 374 NLVKVQKHFQWNSP 387
L +V+++F+++ P
Sbjct: 135 ELKEVKRYFKYDEP 148
>gi|242077606|ref|XP_002448739.1| hypothetical protein SORBIDRAFT_06g032380 [Sorghum bicolor]
gi|241939922|gb|EES13067.1| hypothetical protein SORBIDRAFT_06g032380 [Sorghum bicolor]
Length = 456
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 62/244 (25%), Positives = 109/244 (44%), Gaps = 35/244 (14%)
Query: 170 INKYPYWNRTLGADHFFVTCHDVG---VRATEGVPFLI----------------KNAIRV 210
+ +P W R+ G DH FV V VRA L+ +N+ R+
Sbjct: 169 VTSHPAWRRSGGRDHVFVMTDPVAMWHVRAEIAPAILLVVDFGGWYKVDSKSANRNSSRM 228
Query: 211 VCSPSYDVAFIPHKDVALPQV-LQPFALPRGGRDVENRTVLGFWAG----HRNSKIRVIL 265
+ + KDV +P L P L +D R L ++ G HR +R L
Sbjct: 229 IQHTQVSLL----KDVIVPYTHLLPTLLLSENKD---RPTLLYFKGAKHRHRGGLVREKL 281
Query: 266 ARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPV 325
+ N+ ++ + N A G + ++FC+ P G S R+ D+I C+PV
Sbjct: 282 WDLLGNEPDVIMEEGFPN-ATGREQSIKGMQTSEFCLHPAGDTPTSCRLFDAIASLCIPV 340
Query: 326 ILSNYYDLPFNDILDWRKFAVILNERDVYQLKQI---LKNISDAEFVSLHNNLVKVQKHF 382
I+S+ +LP+ ++D+ +F++ ++ R+ + K + L+NI + NL +VQ F
Sbjct: 341 IVSDEVELPYEGMIDYTEFSIFVSVRNAMRPKWLTSYLRNIPKQQKDEFRKNLARVQPIF 400
Query: 383 QWNS 386
++N+
Sbjct: 401 EYNT 404
>gi|340377881|ref|XP_003387457.1| PREDICTED: exostosin-1b-like [Amphimedon queenslandica]
Length = 729
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 61/115 (53%), Gaps = 5/115 (4%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + FC+ P G ++ S R +S+H GC+PV LSN LPF++++DW K + +E
Sbjct: 306 YNHLISNSTFCLVPRGRRLGSFRFLESLHAGCIPVSLSNGLVLPFHELIDWSKALFVFDE 365
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
R ++Q+ +L++I + + +S+ ++ F W DA E+ +R
Sbjct: 366 RQLFQVPHMLRHIPEDKILSM-----RLHTQFYWEQYFSSVDAIMHTTLEIVKQR 415
>gi|255644422|gb|ACU22716.1| unknown [Glycine max]
Length = 189
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 44/159 (27%), Positives = 81/159 (50%), Gaps = 14/159 (8%)
Query: 251 GFWAGHRNSKIRVILARVWENDTEL-----DISNNRINRAIGPLVYQRRFYKTKFCICPG 305
G WA + IR L + +++ ++ I N IN+A + +KFC+
Sbjct: 10 GAWAFLKGGAIRQELYYLLKDEKDVHFTFGSIGGNGINQASQGMAM------SKFCLNIA 63
Query: 306 GSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ---LKQILKN 362
G +S R+ D+I CVPVI+S+ +LPF D+LD+ F++ + D + L +L++
Sbjct: 64 GDTPSSNRLFDAIVSHCVPVIISDEIELPFEDVLDYSDFSIFVRASDSMKKGYLLNLLRS 123
Query: 363 ISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYEL 401
I+ E+ + L ++ HF++ P DA +M+ ++
Sbjct: 124 ITQKEWSKMWERLKQITHHFEYQYPSQPGDAVNMIWQQV 162
>gi|348571154|ref|XP_003471361.1| PREDICTED: exostosin-like 1-like [Cavia porcellus]
Length = 674
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 64/246 (26%), Positives = 108/246 (43%), Gaps = 48/246 (19%)
Query: 160 VIVKDYVDSLINKYP-YWNRTLGADHFFVTCH-DVGVRATEGVPFLIKNAIRVVCSPSYD 217
V+ + D P +WN G +H + H D +A++ + A+ SP D
Sbjct: 135 VLSPEDPDGQCGPAPQHWNG--GRNHLVIRLHPDPCPQASQ-----LGQAMVAQASPRVD 187
Query: 218 VAFIPHKDVALPQVLQPFALP-RGGRDVE-------NRTVL-------GFW--AGHRNSK 260
F P DVALP L P A P RGG + R L G W AG +S
Sbjct: 188 T-FRPGFDVALP--LLPEAHPFRGGAPGQLQQHSPHPRAALLALAEEGGRWRTAGTYSST 244
Query: 261 IRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHY 320
W+ E D+ + + + FC+ PG + ++R ++
Sbjct: 245 C------PWDGHCEQDLGPKQTRP-------EEKLPNATFCLIPG-QRAEASRFLQALQA 290
Query: 321 GCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQK 380
GC+PV+LS +++LPF++++DW K A++ ++R Q+ L+ ++ A ++L + Q
Sbjct: 291 GCIPVLLSPHWELPFSEVIDWTKAAIVADKRLPLQVLAALQEMAPARVLAL-----RQQT 345
Query: 381 HFQWNS 386
F W +
Sbjct: 346 QFLWAA 351
>gi|218185155|gb|EEC67582.1| hypothetical protein OsI_34941 [Oryza sativa Indica Group]
Length = 483
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 61/244 (25%), Positives = 108/244 (44%), Gaps = 20/244 (8%)
Query: 174 PYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALP--QV 231
P W R+ GADH V H + V F + VA + KDV P +
Sbjct: 219 PEWKRSGGADHVIVAHHPNSLLHARSVLFPAVFVLSDFGRYHPRVASL-EKDVIAPYKHM 277
Query: 232 LQPFALPRGGRDVENRTVLGFWAGH-----------RNSKIRVILARVWENDTELDISNN 280
+ F G D +R L ++ G + IR L + +++ ++ +
Sbjct: 278 AKTFVNDSAGFD--DRPTLLYFRGAIFRKEVKIDSWKGGNIRQELHYMLKDEKDVYFAFG 335
Query: 281 RINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILD 340
+ G + + +KFC+ G +S R+ D+I CVPVI+S+ +LP+ D LD
Sbjct: 336 SVQDH-GASKASQGMHASKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDDIELPYEDALD 394
Query: 341 WRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMV 397
+ KF++ + D + L ++++ +S ++ + L +V KHF++ P + DA M+
Sbjct: 395 YSKFSIFVRSSDAVKKGYLMRLIRGVSKHQWTMMWRRLKEVDKHFEYQYPSQKDDAVQMI 454
Query: 398 MYEL 401
L
Sbjct: 455 WQTL 458
>gi|255543228|ref|XP_002512677.1| catalytic, putative [Ricinus communis]
gi|223548638|gb|EEF50129.1| catalytic, putative [Ricinus communis]
Length = 253
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 48/164 (29%), Positives = 73/164 (44%), Gaps = 18/164 (10%)
Query: 243 DVENRTVLGFWAGHRNSKIRVILARVWEN---DTELDISNNRINRAIGPLVYQRRFYKTK 299
DV N G++A R A VWEN + DIS P Y +
Sbjct: 71 DVNNDPEGGYYA-------RGARAAVWENFKNNPLFDISTEH------PTTYYEDMQRAI 117
Query: 300 FCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQI 359
FC+CP G S R+ +++ +GC+PVI+++ LPF D + W + V + E DV L I
Sbjct: 118 FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVAEEDVPNLDTI 177
Query: 360 LKNISDAEFVSLHNNLV--KVQKHFQWNSPPVRFDAFHMVMYEL 401
L +I + L +++ + P DAFH ++ L
Sbjct: 178 LTSIPTQVVLRKQRLLANPSMKRAMLFPQPAQSGDAFHQILNGL 221
>gi|440905948|gb|ELR56264.1| Exostosin-like 1 [Bos grunniens mutus]
Length = 675
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 81/316 (25%), Positives = 123/316 (38%), Gaps = 74/316 (23%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
K+++YP P +TPRK+ +I SR+ T A L + +S
Sbjct: 93 GLKVFVYPAVGP--ISETPRKV----------LTSIEGSRYHTASAAEACLLILVLSPDT 140
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNA 207
G E V+ P W+ G +H ++ H F + A
Sbjct: 141 PAG-----ECHPVL------------PQWDG--GKNHLVLSLHPAPCLRI----FQLGQA 177
Query: 208 IRVVCSPSYDVAFIPHKDVALPQVLQPFALP-RGGRDVENRTVLGFWAGHRNSKIRVILA 266
+ SP+ D F P DVALP L P A P RGG LG H ++A
Sbjct: 178 MVAEASPTMDT-FRPGFDVALP--LLPEAHPLRGG-------ALGQLQQHSPLPGVALIA 227
Query: 267 RVWENDTELDISNN--------RINRAIGPLVYQRR--FYKTKFCICPGGSQVNSARIAD 316
E + N R + GP R FC+ PG R D
Sbjct: 228 VAEERGGWRTVGTNFSACPWDGRCEQDHGPKQTHPRATLPNATFCLIPG-------RNPD 280
Query: 317 SIHY------GCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVS 370
++H+ GC+PV+LS ++LPF++++DW K AV+ +ER Q+ L+ + ++
Sbjct: 281 ALHFLQALQAGCIPVLLSPRWELPFSEVIDWTKAAVVADERLPLQVLAALQEMPLTRVLA 340
Query: 371 LHNNLVKVQKHFQWNS 386
L Q F W++
Sbjct: 341 LRQ-----QAQFLWDA 351
>gi|168042843|ref|XP_001773896.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674740|gb|EDQ61244.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 373
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 66/268 (24%), Positives = 118/268 (44%), Gaps = 22/268 (8%)
Query: 156 ENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCH-DVGVRATEGVPFLIKNAIRVVCS- 213
E+ +++ + + K P W + G DH V H + G + ++ A+ VV
Sbjct: 101 EDKNQELQEKLMQFLEKQPAWQASGGVDHVIVIHHPNSGYFMRDH----LRKAMFVVADF 156
Query: 214 --PSYDVAFIPHKDVALP--QVLQPFALPRGGRDVENRTVLGFWAG----HRNSKIRVIL 265
+ DVA I KD+ P V+ F E R L F+ G IR+ L
Sbjct: 157 GRYASDVANIG-KDIVAPYKHVVNDFE-AEATISYEKRKTLLFFQGAIMRKEGGIIRLQL 214
Query: 266 ARVWENDTELDISN-NRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVP 324
++ + ++ N N AI +KFC+ G +S R+ D+I CVP
Sbjct: 215 YKLLNGEPDVHFEGGNTTNSAIRSA--SEGMQNSKFCLNLAGDTPSSNRLFDAIASHCVP 272
Query: 325 VILSNYYDLPFNDILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKH 381
VI+S+ ++PF D L++ F++ + D + + +L+ +S ++ + L +V+ H
Sbjct: 273 VIISDDIEVPFEDTLNYSTFSIFIKSSDALKSNFIIDLLRGVSREKWTKMWATLKQVEHH 332
Query: 382 FQWNSPPVRFDAFHMVMYELWLRRHVIK 409
F++ P DA HM + + H ++
Sbjct: 333 FKYQYPTQPDDAVHMTWKAIARKIHKVR 360
>gi|149410585|ref|XP_001509292.1| PREDICTED: exostosin-1 [Ornithorhynchus anatinus]
Length = 443
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 29/96 (30%), Positives = 56/96 (58%), Gaps = 5/96 (5%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y+ + FC+ P G ++ S R +++ CVPV+LSN ++LPF++++DW + A+I +E
Sbjct: 21 YREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWNQAAIIGDE 80
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
R + Q+ +++I + ++L Q F W +
Sbjct: 81 RLLLQIPSTIRSIHQDKILALRQ-----QTQFLWEA 111
>gi|224093354|ref|XP_002334839.1| predicted protein [Populus trichocarpa]
gi|222875141|gb|EEF12272.1| predicted protein [Populus trichocarpa]
Length = 175
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/138 (31%), Positives = 66/138 (47%), Gaps = 11/138 (7%)
Query: 266 ARVWEN--DTEL-DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGC 322
A VWEN D L DIS P Y + FC+CP G S R+ +++ +GC
Sbjct: 9 AAVWENFKDNPLFDISTEH------PATYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGC 62
Query: 323 VPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLV--KVQK 380
+PVI+++ LPF D + W + V ++E DV L IL +I + L +++
Sbjct: 63 IPVIIADDIVLPFADAIPWEEIGVYVDEEDVPNLDTILTSIPPEVILRKQRLLANPSMKQ 122
Query: 381 HFQWNSPPVRFDAFHMVM 398
+ P DAFH V+
Sbjct: 123 AMLFPQPAQPGDAFHQVL 140
>gi|380027202|ref|XP_003697318.1| PREDICTED: exostosin-1-like [Apis florea]
Length = 711
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 75/329 (22%), Positives = 130/329 (39%), Gaps = 60/329 (18%)
Query: 88 NFKIYIYPDGDP-NTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCH 146
F +Y+YP D + YQ I ESR+ T DP RA +F + +
Sbjct: 88 GFTVYVYPIEDVISPLYQK-------------ILNVITESRYYTSDPTRACIFVLALDTL 134
Query: 147 KMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV--RATEGVPFLI 204
T + + + S + + PYWN G +H + A E + F +
Sbjct: 135 DRDPLSTEFVHN-------LPSKLMRLPYWNN--GRNHLIFNLYSGTWPDYAEESLAFDL 185
Query: 205 KNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVE--------NRTVLGFWAGH 256
A+ S S P D+++P + RGG + N+ + + G
Sbjct: 186 GYAMLAKASMSI-FRHRPEFDISIPLFGKQHP-ERGGESGQALENNFPNNKKYIAAFKGK 243
Query: 257 R-----NSKIRVILARVWENDTELDISNNRINRAIGPLV---------------YQRRFY 296
R S+ R L + + ++ R +A L Y+
Sbjct: 244 RYVHGIGSETRNALYHLHNGKDLVFVTTCRHGKAWRELQDEHCQQDNQEYDMYDYEILLM 303
Query: 297 KTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQL 356
FC+ P G ++ S R +++ GC+PVILSN + LPF++ +DW + + +ER + Q+
Sbjct: 304 NATFCLVPRGRRLGSFRFLEALRAGCIPVILSNGWALPFHERIDWTQAVIFSDERLLLQI 363
Query: 357 KQILKNISDAEFVSLHNNLVKVQKHFQWN 385
I++++S+ + L + Q F W
Sbjct: 364 PDIVRSVSNVHILKL-----RQQTQFLWE 387
>gi|302825130|ref|XP_002994198.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300137940|gb|EFJ04733.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 762
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 78/322 (24%), Positives = 127/322 (39%), Gaps = 72/322 (22%)
Query: 114 ASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMT-----------V 160
+E F + + S T + D A FF P+ +C R + +M
Sbjct: 364 GAELAFLEGLLASPHRTMNGDEADYFFAPVLGACAITRADDAPHFSMEKHMGLRGYFSGE 423
Query: 161 IVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVV--------- 211
+ K+ + +YP+WNR+ G DH ++ D G + P I N +V
Sbjct: 424 LYKNAYMHIKEQYPFWNRSSGRDHIWLFPWDEGACS---APKEIWNGTMLVHWGNTNSKH 480
Query: 212 -------CSPSYD----------VAFIPHKDVALPQVLQPFALPRG------GRDVENRT 248
+ S+D + P KD+ LP P PR R E R
Sbjct: 481 KKSTTGYFADSWDDIPKEWRGDHPCYDPLKDIVLPAWKNP--DPRSVAERFWSRPREERK 538
Query: 249 VLGFWAGH----------RNSKIRVILARVWENDTELDISNNRINRAIGPLV-------- 290
L ++ G+ + I RV E ++ ++ R P V
Sbjct: 539 TLFYFNGNLGKGYDFGRPEDRYSMGIRQRVAEEFGSTPNNHGKLGRQAAPDVVVTPQRSD 598
Query: 291 -YQRRFYKTKFC-ICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVIL 348
Y + ++FC + PG S R+ D++ +GC+PVI+ + LP+ +LD+ F V +
Sbjct: 599 DYAKELSSSRFCGVFPGDGW--SGRMEDAVLHGCIPVIIQDGIHLPYESLLDYESFTVRV 656
Query: 349 NERDVYQLKQILKNISDAEFVS 370
E + +L IL+NIS+AE S
Sbjct: 657 AEDKIPELITILRNISNAEVES 678
>gi|356570361|ref|XP_003553358.1| PREDICTED: probable glycosyltransferase At3g07620-like [Glycine
max]
Length = 520
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 63/280 (22%), Positives = 122/280 (43%), Gaps = 24/280 (8%)
Query: 135 RAHLFFIPI---SCHKMRGKGTSYENMT--VIVKDYVDSLINKYPYWNRTLGADHFFVTC 189
A + F+P C+ K +E + ++++ + + W R+ G DH +
Sbjct: 206 EADVIFVPFFSSLCYNRLSKTGPHEKRSRNKLLQEKLVKYVTAQEEWKRSGGKDHVILAH 265
Query: 190 H-----DVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDV 244
H D ++ G F++ + R ++A + KDV P +
Sbjct: 266 HPNSMLDARMKLWPGT-FILSDFGRYPT----NIANV-EKDVIAPYKHVVGSYDNDQSSF 319
Query: 245 ENRTVLGFWAG----HRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKF 300
++RT L ++ G +R L + +N+ ++ S + + G +KF
Sbjct: 320 DSRTTLLYFQGAIYRKDGGHVRHELYYLLKNEKDVHFSFGSVQKG-GVRKATEGMRSSKF 378
Query: 301 CICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ---LK 357
C+ G +S R+ D+I CVPVI+S+ +LP+ D+LD+ +F + + RD + L
Sbjct: 379 CLNIAGDTPSSNRLFDAIASHCVPVIISDDIELPYEDVLDYSQFCIFVRTRDALKKRYLI 438
Query: 358 QILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMV 397
+++I E+ + N L +V+ F++ P DA M+
Sbjct: 439 NFIRSIGKEEWTRMWNRLKEVESFFEFQFPSKEGDAVQMI 478
>gi|392575274|gb|EIW68408.1| hypothetical protein TREMEDRAFT_32575 [Tremella mesenterica DSM 1558]
Length = 1176
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 52/200 (26%), Positives = 91/200 (45%), Gaps = 26/200 (13%)
Query: 202 FLIKNAIRVVCSPSYDV-AFIPHKDVALP-QVLQPFALPRGGRDVEN------RTVLGFW 253
++++NA+ Y+ + PH+D+ +P + + L ++ N RT L W
Sbjct: 931 WILRNALVWSVMGDYNSPCYRPHQDIVVPPRTCKSIDLREHFPEITNVTPMRQRTKLVTW 990
Query: 254 AG-----HRNSKIRVILARVWENDTELDISNNRINRAIGPLV------YQRRFYKTKFCI 302
+G +N ++R+ R EL GP+ Y + +FC
Sbjct: 991 SGTYWGTGKNMRLRLTCERGGAGKEELVPGG-------GPMSSWYNWEYMKEISGARFCP 1043
Query: 303 CPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKN 362
P G S RI D+I+ GC+PV+ + PF D LDW KF++ + ++ QL++IL
Sbjct: 1044 QPTGIAGWSPRINDAIYAGCIPVLTAEGTHYPFADFLDWSKFSIRIKPTELDQLERILSA 1103
Query: 363 ISDAEFVSLHNNLVKVQKHF 382
I + + NL+ V++ F
Sbjct: 1104 IPLEQLEEMQANLMLVREAF 1123
>gi|15230692|ref|NP_190126.1| exostosin family protein [Arabidopsis thaliana]
gi|6967106|emb|CAB72489.1| putative protein [Arabidopsis thaliana]
gi|332644505|gb|AEE78026.1| exostosin family protein [Arabidopsis thaliana]
Length = 475
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 70/124 (56%), Gaps = 9/124 (7%)
Query: 277 ISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFN 336
+ N IN+A + + +KFC+ G +S R+ D+I CVPVI+S+ +LPF
Sbjct: 315 VRNGGINKA------SQGMHNSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDDIELPFE 368
Query: 337 DILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDA 393
D++D+ +F+V + D + L +++ I+ E+ + N L +V+K+++++ P DA
Sbjct: 369 DVIDYSEFSVFVRTSDALKENFLVNLIRGITKEEWTRMWNRLKEVEKYYEFHFPSKVDDA 428
Query: 394 FHMV 397
M+
Sbjct: 429 VQMI 432
>gi|255547405|ref|XP_002514760.1| catalytic, putative [Ricinus communis]
gi|223546364|gb|EEF47866.1| catalytic, putative [Ricinus communis]
Length = 490
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 65/263 (24%), Positives = 118/263 (44%), Gaps = 28/263 (10%)
Query: 152 GTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHD---VGVRATEGVP-FLIKNA 207
G +M ++++ + + W R+ G DH V H + R G F++ +
Sbjct: 212 GKEKVSMNKMLQNRLVEFLMGQDEWKRSGGRDHLIVAHHPNSMLDARKMLGAAMFVLADF 271
Query: 208 IRVVCSPSYDVAFIPHKDVALPQVLQPFALPRG-GRDVENRTVLGFWAGHRNSK----IR 262
R ++A + KDV P +P G E R +L F+ G K IR
Sbjct: 272 GRYPV----EIANL-KKDVIAPYKHVVRTIPSGESAQFEERPILVFFQGAIYRKDGGIIR 326
Query: 263 VILARVWENDTEL-----DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADS 317
L + +++ ++ + N +N+A + +KFC+ G +S R+ D+
Sbjct: 327 QELYYLLKDEKDVHFTFGTVRKNGVNKA------GQGMASSKFCLNIAGDTPSSNRLFDA 380
Query: 318 IHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNN 374
I CVPVI+S+ +LPF D+LD+ +F+V + D + L +L++I ++ +
Sbjct: 381 IVSHCVPVIISDDIELPFEDVLDYSEFSVFVRASDAVKEGYLLNLLQSIDRDKWTMMWER 440
Query: 375 LVKVQKHFQWNSPPVRFDAFHMV 397
L ++ HF++ P DA M+
Sbjct: 441 LKEIAPHFEYQYPSQSGDAVDMI 463
>gi|118481039|gb|ABK92473.1| unknown [Populus trichocarpa]
Length = 239
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/169 (26%), Positives = 81/169 (47%), Gaps = 11/169 (6%)
Query: 235 FALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRR 294
+A RG +V + + G + + K+R ++ R + D + +R YQ
Sbjct: 64 WAFFRGKMEVHPKNISG---RYYSKKVRTVIWRKYSGDRRFYLQRHRFAG------YQSE 114
Query: 295 FYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVY 354
++ FC+CP G S R+ +S+ GCVPVI+++ LPF + W + ++ + E+DV
Sbjct: 115 IVRSVFCLCPLGWAPWSPRLVESVALGCVPVIIADGIRLPFPTAVRWSEISLTVAEKDVA 174
Query: 355 QLKQILKNISDAEFVSLHNNL--VKVQKHFQWNSPPVRFDAFHMVMYEL 401
L +L +++ ++ NL V++ +N DA V+Y L
Sbjct: 175 NLGTLLDHVAATNLSAIQKNLWDPDVRRALLFNDRVQEGDATWQVLYAL 223
>gi|159479930|ref|XP_001698039.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158273838|gb|EDO99624.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 810
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 82/346 (23%), Positives = 136/346 (39%), Gaps = 69/346 (19%)
Query: 109 LTG-KYASEGYFFQNIRESRFLTHDPDRAHLFFIP--ISCH--KMRG----------KGT 153
+TG Y E + + +S T DP+ A F++P I+C+ + G G
Sbjct: 409 ITGYTYGIETLMHELMLQSEHRTFDPEEADFFYVPMYITCYFWPILGWADGPWWHAPNGL 468
Query: 154 SYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV----RATEGVPFLIKNAIR 209
+ ++ + D L K PYW+R G DH ++ D G +A ++ + R
Sbjct: 469 RVMHGANMITELHDWLRTKLPYWDRRGGRDHIWLMAADEGACWMPKAVYDTSIVLTHWGR 528
Query: 210 VVCSPSYDVAFI-------------------------------PHKDVALPQVLQPFALP 238
+ + A++ P KD+ +P P P
Sbjct: 529 LDPEHKSNTAYLQDNYTAKPESAFDAWRGVDFGDRIKGHPCFDPRKDLVVPAFKSPNHFP 588
Query: 239 R----GGRDVENRTVLGFWAGHRNSKI----RVILARVWENDTELDISNNRINRAIGPLV 290
R G +E +L F S++ R I R++ + D N R AIG
Sbjct: 589 RSPLIGAPPLERDLLLFFRGDVGASRLPHYSRGIRQRLFHLAHKHDWYN-RFKIAIGSGD 647
Query: 291 -----YQRRFYKTKFC-ICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKF 344
Y + ++KFC + PG S R D+I +GC+PV++ + F ILDW F
Sbjct: 648 SLKGDYSEQLARSKFCLVAPGDGW--SPRAEDAILHGCIPVVVMDGVQAVFESILDWDSF 705
Query: 345 AVILNERD--VYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPP 388
++ + E D + L Q+L +IS + +L +V F + P
Sbjct: 706 SLRIREDDAALEALPQLLASISPERLAHMQRHLARVWHRFAYTQTP 751
>gi|383847108|ref|XP_003699197.1| PREDICTED: exostosin-1-like [Megachile rotundata]
Length = 711
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 79/328 (24%), Positives = 133/328 (40%), Gaps = 58/328 (17%)
Query: 88 NFKIYIYPDGDP-NTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCH 146
F +Y+YP D + YQ I ESR+ T DP RA +F + +
Sbjct: 88 GFTVYVYPVEDVISPLYQK-------------ILNVITESRYYTSDPARACIFVLALDTL 134
Query: 147 KMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV--RATEGVPFLI 204
T + + + + + + PYWN G +H + A E + F I
Sbjct: 135 DRDPLSTEFIHN-------LPAKLMRLPYWNN--GRNHLIFNLYSGTWPDYAEESLAFDI 185
Query: 205 KNAIRVVCSPSYDVAFIPHKDVALPQVLQ--PFALPRGGRDVEN---------------R 247
A+ S S P+ DV++P + P G+ +EN R
Sbjct: 186 GYAMLAKASMSI-FRHRPNFDVSIPLFGKQHPERGGEPGQALENNFPNSKKYVAAFKGKR 244
Query: 248 TVLGFWAGHRNSK---------IRVILARVWENDTEL-DISNNRINRAIGPLVYQRRFYK 297
V G + RN+ + V R + EL D + N+ Y+
Sbjct: 245 YVHGIGSETRNALYHLHNGKDLVFVTTCRHGKAWRELQDEHCQQDNQEYDMYDYEILLMN 304
Query: 298 TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLK 357
FC+ P G ++ S R +++ GC+PVILSN + LPF++ +DW + + +ER + Q+
Sbjct: 305 ATFCLVPRGRRLGSFRFLEALRAGCIPVILSNGWALPFHERIDWTQAVIFSDERLLLQIP 364
Query: 358 QILKNISDAEFVSLHNNLVKVQKHFQWN 385
I++++S+ + + L + Q F W
Sbjct: 365 DIVRSVSNVQILKL-----RQQTQFLWE 387
>gi|449501885|ref|XP_004161484.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
LOC101226446 [Cucumis sativus]
Length = 859
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 81/342 (23%), Positives = 138/342 (40%), Gaps = 86/342 (25%)
Query: 113 YASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTVIVKDYV---- 166
Y +E F+++I S T + + A FF+P+ SC R + + ++DY+
Sbjct: 391 YGAEMAFYESILASPHRTLNGEEADFFFVPVLDSCIITRADDAPH----LSLRDYMGLRS 446
Query: 167 -----------DSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVV---- 211
D ++ +YPYWNR+ G DH + D G P I N++ +V
Sbjct: 447 FLTLDFYKKAHDHIVEQYPYWNRSSGRDHIWFFSWDEGACY---APKEIWNSMMLVHWGN 503
Query: 212 ------------CSPSYD----------VAFIPHKDVALPQVLQP---------FALPRG 240
++D F P KD+ +P +P +A PR
Sbjct: 504 TNSKHNHSTTAYWGDNWDNIPSSKRGNHPCFDPEKDLVVPAWKRPDGSRLSKKLWARPR- 562
Query: 241 GRDVENRTVLGFWAGHRNSK-------------IRVILARVWENDTELDISNNRINRA-- 285
E R F+ G+ IR +A + + + + + A
Sbjct: 563 ----EERKTFFFFNGNLGPAYERGRPESTYSMGIRQKVAEEFGSSPNKEGKLGKQHAADV 618
Query: 286 -IGPLV---YQRRFYKTKFC-ICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILD 340
+ PL Y + FC + PG S R+ DSI GC+PVI+ + LP+ ++L+
Sbjct: 619 IVTPLRSENYHEDLASSVFCGVMPGDGW--SGRMEDSILQGCIPVIIQDGIFLPYENVLN 676
Query: 341 WRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHF 382
+ FAV + E D+ L IL+ +++E +N+ K+ + F
Sbjct: 677 YDSFAVRIGEDDIPNLINILRGFNESEIEFKLSNVRKIWQRF 718
>gi|17541994|ref|NP_502180.1| Protein RIB-1 [Caenorhabditis elegans]
gi|6434267|emb|CAB61014.1| Protein RIB-1 [Caenorhabditis elegans]
gi|130381603|dbj|BAF48989.1| heparan sulfate polymerase [Caenorhabditis elegans]
Length = 382
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 76/314 (24%), Positives = 131/314 (41%), Gaps = 57/314 (18%)
Query: 87 TNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCH 146
T+ K+YI+P F ++P+ + Y+ ++ ES T+DP+ A +F + I
Sbjct: 43 TSKKVYIHPM--EKRFEESPQSVI--YSK---ILKHFLESNHYTNDPNEACIFLLGID-- 93
Query: 147 KMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHF--------FVTCHDVGVRATE 198
+ +N V DY++SL WN G +H F D +
Sbjct: 94 -TTDRDVRSQNYVKNVNDYIESL--DPSVWNN--GRNHLIFNFYHGTFPDYDDHNLNFDT 148
Query: 199 GVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQ--PFAL-----PRGGRDVEN----- 246
G + + + S + FI DV+LP + P+ + R +EN
Sbjct: 149 GEAMIAR-------ASSSENNFIKVFDVSLPLFHENHPYEIKESKSERNDDRIENQRKYL 201
Query: 247 ------RTVLGFWAGHRN--------SKIRVILARVWENDTEL--DISNNRINRAIGPLV 290
R V G +G RN I ++ ND ++ D R N
Sbjct: 202 VSFKGKRYVYGIGSGTRNLVHHLHNGDDIVMVTTCKHNNDWQVYQDDRCQRDNDEYDRWE 261
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + FC+ P G ++ S R +++ GCVPV++S+ + LPF++ +DW A+++ E
Sbjct: 262 YDELLANSTFCLVPRGRRLGSFRFLETLRSGCVPVVISDSWILPFSETIDWNSAAIVVAE 321
Query: 351 RDVYQLKQILKNIS 364
RD + ++L + S
Sbjct: 322 RDALSIPELLMSTS 335
>gi|297721259|ref|NP_001172992.1| Os02g0520750 [Oryza sativa Japonica Group]
gi|255670949|dbj|BAH91721.1| Os02g0520750 [Oryza sativa Japonica Group]
Length = 213
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 41/141 (29%), Positives = 69/141 (48%), Gaps = 11/141 (7%)
Query: 266 ARVWEN---DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGC 322
A +WEN + DIS P Y ++ FC+CP G S R+ +++ +GC
Sbjct: 46 ASLWENFKNNPLFDISTEH------PATYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGC 99
Query: 323 VPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLV--KVQK 380
+PVI+++ LPF D + W + V ++E DV +L IL +I + + L +++
Sbjct: 100 IPVIIADDIVLPFADAIPWDEIGVFVDEEDVPRLDSILTSIPIDDILRKQRLLANPSMKQ 159
Query: 381 HFQWNSPPVRFDAFHMVMYEL 401
+ P DAFH ++ L
Sbjct: 160 AMLFPQPAQPRDAFHQILNGL 180
>gi|225441752|ref|XP_002277596.1| PREDICTED: uncharacterized protein LOC100267584 [Vitis vinifera]
Length = 794
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 76/333 (22%), Positives = 133/333 (39%), Gaps = 68/333 (20%)
Query: 113 YASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTV---------- 160
Y ++ +++I S T D + A FF+P+ SC +R + NM
Sbjct: 393 YGAQMAIYESILASPHRTLDGEEADFFFVPVLDSCIIVRADDAPHLNMHAHGGLRSSLTL 452
Query: 161 -IVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVV-------- 211
K D ++ +YP+WNR+ G DH + D G P I +++ +V
Sbjct: 453 EFYKTAYDHIVEQYPFWNRSSGRDHIWFFSWDEGACY---APKEIWDSMMLVHWGNTNSK 509
Query: 212 --------CSPSYDV----------AFIPHKDVALPQVLQP----FALPRGGRDVENRTV 249
+ ++D F P+KD+ LP +P + R E R
Sbjct: 510 HNHSTTAYWADNWDSVSSDRRGNHPCFDPYKDLVLPAWKRPDVVSLSSKLWSRPREQRKT 569
Query: 250 LGFWAGH----------RNSKIRVILARVWENDTELDISNNRINRA------IGPLV--- 290
L ++ G+ + I +V E ++ + + PL
Sbjct: 570 LFYFNGNLGPAYEGGRPETTYSMGIRQKVAEEFGSSPNKEGKLGKQHAEDVIVTPLRSGN 629
Query: 291 YQRRFYKTKFC-ICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
Y + FC + PG S R DSI GC+PV++ + LPF ++L++ FAV +
Sbjct: 630 YHESLASSVFCGVMPGDGW--SGRFEDSILQGCIPVVIQDGIFLPFENMLNYESFAVRIR 687
Query: 350 ERDVYQLKQILKNISDAEFVSLHNNLVKVQKHF 382
E ++ L +IL+ +++ E N+ K+ + F
Sbjct: 688 EDEIPNLIKILRGMNETEIEFKLENVRKIWQRF 720
>gi|449437619|ref|XP_004136589.1| PREDICTED: uncharacterized protein LOC101206674 [Cucumis sativus]
Length = 791
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 81/342 (23%), Positives = 138/342 (40%), Gaps = 86/342 (25%)
Query: 113 YASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTVIVKDYV---- 166
Y +E F+++I S T + + A FF+P+ SC R + + ++DY+
Sbjct: 391 YGAEMAFYESILASPHRTLNGEEADFFFVPVLDSCIITRADDAPH----LSLRDYMGLRS 446
Query: 167 -----------DSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVV---- 211
D ++ +YPYWNR+ G DH + D G P I N++ +V
Sbjct: 447 FLTLDFYKKAHDHIVEQYPYWNRSSGRDHIWFFSWDEGACY---APKEIWNSMMLVHWGN 503
Query: 212 ------------CSPSYD----------VAFIPHKDVALPQVLQP---------FALPRG 240
++D F P KD+ +P +P +A PR
Sbjct: 504 TNSKHNHSTTAYWGDNWDNIPSSKRGNHPCFDPEKDLVVPAWKRPDGSRLSKKLWARPR- 562
Query: 241 GRDVENRTVLGFWAGHRNSK-------------IRVILARVWENDTELDISNNRINRA-- 285
E R F+ G+ IR +A + + + + + A
Sbjct: 563 ----EERKTFFFFNGNLGPAYERGRPESTYSMGIRQKVAEEFGSSPNKEGKLGKQHAADV 618
Query: 286 -IGPLV---YQRRFYKTKFC-ICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILD 340
+ PL Y + FC + PG S R+ DSI GC+PVI+ + LP+ ++L+
Sbjct: 619 IVTPLRSENYHEDLASSVFCGVMPGDGW--SGRMEDSILQGCIPVIIQDGIFLPYENVLN 676
Query: 341 WRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHF 382
+ FAV + E D+ L IL+ +++E +N+ K+ + F
Sbjct: 677 YDSFAVRIGEDDIPNLINILRGFNESEIEFKLSNVRKIWQRF 718
>gi|125576050|gb|EAZ17272.1| hypothetical protein OsJ_32791 [Oryza sativa Japonica Group]
Length = 506
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 62/110 (56%), Gaps = 3/110 (2%)
Query: 295 FYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVY 354
+ +KFC+ G +S R+ D+I CVPVI+S+ +LP+ D LD+ KF++ + D
Sbjct: 372 MHASKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDDIELPYEDALDYSKFSIFVRSSDAV 431
Query: 355 Q---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYEL 401
+ L ++++ +S ++ + L +V KHF++ P + DA M+ L
Sbjct: 432 KKGYLMRLIRGVSKHQWTMMWRRLKEVDKHFEYQYPSQKDDAVQMIWQTL 481
>gi|432851754|ref|XP_004067068.1| PREDICTED: exostosin-2-like [Oryzias latipes]
Length = 719
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 58/94 (61%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + ++ FC+ G+++ A ++D + GCVPVIL++ Y LPF+++LDW++ +V++ E
Sbjct: 309 YPQILQESSFCVVLRGARLGQATLSDVLQAGCVPVILADSYILPFSEVLDWKRASVVIPE 368
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ ++ ILK+I H + ++Q+ +W
Sbjct: 369 EKLSEMYTILKSIP-------HRQVEEMQRQARW 395
>gi|66517433|ref|XP_391845.2| PREDICTED: exostosin-1 [Apis mellifera]
Length = 711
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 75/329 (22%), Positives = 130/329 (39%), Gaps = 60/329 (18%)
Query: 88 NFKIYIYPDGDP-NTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCH 146
F +Y+YP D + YQ I ESR+ T DP RA +F + +
Sbjct: 88 GFTVYVYPIEDVISPLYQK-------------ILNVITESRYYTSDPTRACIFVLALDTL 134
Query: 147 KMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV--RATEGVPFLI 204
T + + + S + + PYWN G +H + A E + F +
Sbjct: 135 DRDPLSTEFVHN-------LPSKLMRLPYWNN--GRNHLIFNLYSGTWPDYAEESLAFDL 185
Query: 205 KNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVE--------NRTVLGFWAGH 256
A+ S S P D+++P + RGG + N+ + + G
Sbjct: 186 GYAMLAKASMSI-FRHRPEFDISIPLFGKQHP-ERGGEPGQALENNFPNNKKYIAAFKGK 243
Query: 257 R-----NSKIRVILARVWENDTELDISNNRINRAIGPLV---------------YQRRFY 296
R S+ R L + + ++ R +A L Y+
Sbjct: 244 RYVHGIGSETRNALYHLHNGKDLVFVTTCRHGKAWRELQDEHCQQDNQEYDMYDYEILLM 303
Query: 297 KTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQL 356
FC+ P G ++ S R +++ GC+PVILSN + LPF++ +DW + + +ER + Q+
Sbjct: 304 NATFCLVPRGRRLGSFRFLEALRAGCIPVILSNGWALPFHERIDWTQAVIFSDERLLLQI 363
Query: 357 KQILKNISDAEFVSLHNNLVKVQKHFQWN 385
I++++S+ + L + Q F W
Sbjct: 364 PDIVRSVSNVHILKL-----RQQTQFLWE 387
>gi|321261385|ref|XP_003195412.1| hypothetical protein CGB_G5240C [Cryptococcus gattii WM276]
gi|317461885|gb|ADV23625.1| Conserved hypothetical protein [Cryptococcus gattii WM276]
Length = 1130
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 50/193 (25%), Positives = 87/193 (45%), Gaps = 14/193 (7%)
Query: 203 LIKNAIRVVCSPSYDV-AFIPHKDVALPQ-------VLQPFALPRGGRDVENRTVLGFWA 254
++ NA+ YD + PH+DV +P + + F + + R+ L W+
Sbjct: 900 ILNNALVWSVMGDYDSPCYRPHQDVVIPARTCRSNTLRETFPNVEAIKPMRERSNLLMWS 959
Query: 255 G-----HRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQV 309
G ++ +IR+ R D EL I Y + +FC P G
Sbjct: 960 GTYSGTGKSERIRLTCNRGGAGDREL-IKGGGKQSNFASSDYMKDLNNARFCAQPRGIAG 1018
Query: 310 NSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFV 369
S + +D+I+ GC+PV +S PF D LDW K +V + ++ +++++L I ++
Sbjct: 1019 WSPQTSDAIYAGCIPVFISEGTHYPFADFLDWSKLSVRVAPTELDKIEKVLAAIPLSKVE 1078
Query: 370 SLHNNLVKVQKHF 382
L NLV V++ F
Sbjct: 1079 ELQANLVCVREAF 1091
>gi|3023724|sp|O01704.1|EXT1_CAEEL RecName: Full=Multiple exostoses homolog 1; AltName: Full=Related
to mammalian RIB protein 1
gi|2058697|gb|AAC47509.1| multiple exostoses homolog 1 [Caenorhabditis elegans]
Length = 378
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 76/314 (24%), Positives = 131/314 (41%), Gaps = 57/314 (18%)
Query: 87 TNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCH 146
T+ K+YI+P F ++P+ + Y+ ++ ES T+DP+ A +F + I
Sbjct: 39 TSKKVYIHPM--EKRFEESPQSVI--YSK---ILKHFLESNHYTNDPNEACIFLLGID-- 89
Query: 147 KMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHF--------FVTCHDVGVRATE 198
+ +N V DY++SL WN G +H F D +
Sbjct: 90 -TTDRDVRSQNYVKNVNDYIESL--DPSVWNN--GRNHLIFNFYHGTFPDYDDHNLNFDT 144
Query: 199 GVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQ--PFAL-----PRGGRDVEN----- 246
G + + + S + FI DV+LP + P+ + R +EN
Sbjct: 145 GEAMIAR-------ASSSENNFIKVFDVSLPLFHENHPYEIKESKSERNDDRIENQRKYL 197
Query: 247 ------RTVLGFWAGHRN--------SKIRVILARVWENDTEL--DISNNRINRAIGPLV 290
R V G +G RN I ++ ND ++ D R N
Sbjct: 198 VSFKGKRYVYGIGSGTRNLVHHLHNGDDIVMVTTCKHNNDWQVYQDDRCQRDNDEYDRWE 257
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + FC+ P G ++ S R +++ GCVPV++S+ + LPF++ +DW A+++ E
Sbjct: 258 YDELLANSTFCLVPRGRRLGSFRFLETLRSGCVPVVISDSWILPFSETIDWNSAAIVVAE 317
Query: 351 RDVYQLKQILKNIS 364
RD + ++L + S
Sbjct: 318 RDALSIPELLMSTS 331
>gi|348509623|ref|XP_003442347.1| PREDICTED: exostosin-2-like [Oreochromis niloticus]
Length = 719
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 58/94 (61%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + ++ FC+ G+++ A ++D + GCVPVIL++ Y LPF+++LDW++ +V++ E
Sbjct: 309 YPQILQESSFCVVLRGARLGQAALSDVLQAGCVPVILADSYILPFSEVLDWKRASVVIPE 368
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ ++ ILK+I H + ++Q+ +W
Sbjct: 369 EKLSEMYTILKSIP-------HRQVEEMQRQARW 395
>gi|307104488|gb|EFN52741.1| hypothetical protein CHLNCDRAFT_138314 [Chlorella variabilis]
Length = 647
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 84/351 (23%), Positives = 137/351 (39%), Gaps = 79/351 (22%)
Query: 110 TGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTS-------YENMTV 160
T Y+ E + + +S T DP+ A F++P+ SC + T+ Y
Sbjct: 231 TWTYSLEFGLLEMMLQSEHRTLDPEEADFFYVPVFPSCFIWPVRSTADSLRDFYYGWAQS 290
Query: 161 IVKDYVDSLINKY-------PYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVC- 212
V+ + L+ Y PYW+R G DH ++ HD A+ VP IK+A ++
Sbjct: 291 RVQGAANLLLEAYHWLRAHYPYWDRRGGRDHIWLVTHD---EASCYVPAAIKSASIILSH 347
Query: 213 ----------------------------SPSYDVAFI--------PHKDVALPQVLQP-- 234
P +A + P KD+ LP + P
Sbjct: 348 WGRKDPNHTSGTGFPGNVYHLNVSHPHWEPEGSMAKLDLSQPCHDPVKDLVLPLMKTPDH 407
Query: 235 -FALPRGGRDVENRTVLGFWAGHRNSK-----IRVILARVWENDTE---LD-----ISNN 280
P G NRT L F G ++ R + R+W E LD + N
Sbjct: 408 YHQSPLVGAPTRNRTWLAFHRGRQHKTDAPEYSRGVRQRLWSASQEHGWLDKYGILLGEN 467
Query: 281 RINRAIGPLV----YQRRFYKTKFC-ICPGGSQVNSARIADSIHYGCVPVILSNYYDLPF 335
+ + Y + + FC + PG SAR+ D+ +GC+PVI+ + D+ F
Sbjct: 468 PSSPGAEEVKLAGDYSQLLASSIFCLVLPGDGW--SARMDDATLHGCIPVIVMDEVDVSF 525
Query: 336 NDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
++D ++F V + + DV +L +IL IS + L +V ++S
Sbjct: 526 ESVIDLQQFTVRVAQADVERLPEILLEISQERRQEMQRALGRVWHKLTYSS 576
>gi|297739695|emb|CBI29877.3| unnamed protein product [Vitis vinifera]
Length = 822
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 79/336 (23%), Positives = 135/336 (40%), Gaps = 74/336 (22%)
Query: 113 YASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTV---------- 160
Y ++ +++I S T D + A FF+P+ SC +R + NM
Sbjct: 421 YGAQMAIYESILASPHRTLDGEEADFFFVPVLDSCIIVRADDAPHLNMHAHGGLRSSLTL 480
Query: 161 -IVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVV-------- 211
K D ++ +YP+WNR+ G DH + D G P I +++ +V
Sbjct: 481 EFYKTAYDHIVEQYPFWNRSSGRDHIWFFSWDEGACY---APKEIWDSMMLVHWGNTNSK 537
Query: 212 --------CSPSYDV----------AFIPHKDVALPQVLQP----FALPRGGRDVENRTV 249
+ ++D F P+KD+ LP +P + R E R
Sbjct: 538 HNHSTTAYWADNWDSVSSDRRGNHPCFDPYKDLVLPAWKRPDVVSLSSKLWSRPREQRKT 597
Query: 250 LGFWAGH----------RNSKIRVILARVWENDTELDISNNRINR---------AIGPLV 290
L ++ G+ + I +V E E S N+ + + PL
Sbjct: 598 LFYFNGNLGPAYEGGRPETTYSMGIRQKVAE---EFGSSPNKEGKLGKQHAEDVIVTPLR 654
Query: 291 ---YQRRFYKTKFC-ICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAV 346
Y + FC + PG S R DSI GC+PV++ + LPF ++L++ FAV
Sbjct: 655 SGNYHESLASSVFCGVMPGDGW--SGRFEDSILQGCIPVVIQDGIFLPFENMLNYESFAV 712
Query: 347 ILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHF 382
+ E ++ L +IL+ +++ E N+ K+ + F
Sbjct: 713 RIREDEIPNLIKILRGMNETEIEFKLENVRKIWQRF 748
>gi|358256597|dbj|GAA50186.1| glucuronyl/N-acetylglucosaminyl transferase EXT1 [Clonorchis
sinensis]
Length = 802
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 80/384 (20%), Positives = 142/384 (36%), Gaps = 77/384 (20%)
Query: 81 NYLEMVTNFKIYIY--PDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHL 138
+Y +FK+Y+Y P G P+ ++ G +R R LT +P A +
Sbjct: 100 DYARCANDFKVYVYSAPPGQPHV-----------SSTYGKILAALRRYRLLTTNPYEACI 148
Query: 139 FFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWN---RTLGADHFFVTCHDVGVR 195
F + + T D +N+ PYWN R A+ + D +
Sbjct: 149 FIPSLDTLDRDPLSPGFGVQT-------DQQLNQLPYWNELPRLKTAEQIEIEADDPQRK 201
Query: 196 A-TEGVPFLIKN------------------AIRVVCSPSYDVAFIPHK-DVALP----QV 231
T G LI N ++ S+ I H D++LP Q
Sbjct: 202 TLTPGRNHLIFNLYAGTWPGYHEDEYRLSLGQAILAKASFSTTKIRHTFDISLPLIHPQH 261
Query: 232 LQPFALPRGGRDVE--------------------NRTVLGFWAGHRNSKIR------VIL 265
+ +L G D R V G + RN+ VI+
Sbjct: 262 PEKISLEASGHDASYHRVHQRTKVRRPILLSFKGKRYVSGIGSASRNTLFHLHNGDDVIM 321
Query: 266 ARVWENDTE-LDISNNRINRAIGPLV---YQRRFYKTKFCICPGGSQVNSARIADSIHYG 321
+ T+ + ++ R + + Y + + FC+ P G ++ S R +++
Sbjct: 322 VTTCRHGTDWIRYADKRCSVDMATYDQYDYNELMHNSTFCLVPRGRRLGSYRFLEALEAS 381
Query: 322 CVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKH 381
C+PV+LSN ++LPF++++DW K + +E L +L+ I D V L + +
Sbjct: 382 CIPVMLSNDWELPFSEVIDWSKAVIWADEHLPLTLSLMLRRIPDYRIVQLRQQITFLYTT 441
Query: 382 FQWNSPPVRFDAFHMVMYELWLRR 405
+ + + F ++ L +RR
Sbjct: 442 YFQSVESIVFTTLEIIRDRLAMRR 465
>gi|291190070|ref|NP_001167078.1| Exostosin-2 [Salmo salar]
gi|223647990|gb|ACN10753.1| Exostosin-2 [Salmo salar]
Length = 724
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 58/94 (61%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + ++ FC+ G+++ A ++D + GCVPVIL++ Y LPF+++LDW++ +V++ E
Sbjct: 314 YPQILQESSFCVVLRGARLGQATLSDVLQAGCVPVILADSYILPFSEVLDWKRASVVIPE 373
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ ++ ILK+I H + ++Q+ +W
Sbjct: 374 EKLPEMYTILKSIP-------HRQVEEMQRQARW 400
>gi|224135879|ref|XP_002327326.1| predicted protein [Populus trichocarpa]
gi|222835696|gb|EEE74131.1| predicted protein [Populus trichocarpa]
Length = 449
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 75/279 (26%), Positives = 116/279 (41%), Gaps = 38/279 (13%)
Query: 139 FFIPISCHKMRGKGTSYENMTVIVKDY-----VDSLINKYPYWNRTLGADHFFVTCHDVG 193
FF +S KG +DY V + W R+ G DH FV V
Sbjct: 126 FFATLSAEMELAKGKGSFRRKEGNEDYQRQKEVVDFVRNSEAWKRSGGKDHVFVLTDPVA 185
Query: 194 ---VRATEGVPFLIKNAIRVVCS------------PSYDVAFIPHKDVAL-PQVLQPFA- 236
VRA I AI +V I H V+L V+ P+
Sbjct: 186 MWHVRAE------IAPAILLVVDFGGWYRLDSKSSNGSSSDMIRHTQVSLLKDVIVPYTH 239
Query: 237 -LPRGG-RDVENRTVLGFWAG----HRNSKIRVILARVWENDTELDISNNRINRAIGPLV 290
LPR + + R L ++ G HR +R L + N+ + + N A G +
Sbjct: 240 LLPRFQFSENKKRNTLLYFKGAKHRHRGGIVRENLWDLLVNEPGVIMEEGFPN-ATGREL 298
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
R ++FC+ P G S R+ D+I C+PVI+S+ +LPF ILD+ +F+V +
Sbjct: 299 SIRGMRTSEFCLHPAGDTPTSCRLFDAIQSLCIPVIVSDNIELPFEGILDYTEFSVFVAG 358
Query: 351 RDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
D + L L++IS+ + L N+ K+Q +Q+ +
Sbjct: 359 DDALKPTWLMDHLRSISEKQKEELRRNMAKIQLIYQYEN 397
>gi|343171904|gb|AEL98656.1| exostosin-like protein, partial [Silene latifolia]
Length = 250
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 46/187 (24%), Positives = 91/187 (48%), Gaps = 10/187 (5%)
Query: 224 KDVALP--QVLQPFALPRGGRDVENRTVLGFWAGHRNSK----IRVILARVWENDTELDI 277
KDV P V+ + + RT L F+ G K +R LA++ + ++
Sbjct: 42 KDVVAPYVHVVDSYTDDDAPDPFKARTTLLFFRGRTVRKDEGVVRARLAKILVDYEDIHY 101
Query: 278 SNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFND 337
+ + L Q +KFC+ P G +S R+ D+I CVPVI+S+ +LPF D
Sbjct: 102 ERSVASEESIKLSTQG-MRSSKFCLDPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPFED 160
Query: 338 ILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAF 394
LD+ F++ + + + + + L+ + + ++++++ L + H+++ PP R D
Sbjct: 161 ELDYSNFSIFFSTEEALKPGYMVEELRKVPEEKWLNMYKKLKNISHHYEFQYPPKREDGV 220
Query: 395 HMVMYEL 401
+M+ ++
Sbjct: 221 NMLWRQI 227
>gi|340719750|ref|XP_003398310.1| PREDICTED: exostosin-1-like isoform 1 [Bombus terrestris]
gi|340719752|ref|XP_003398311.1| PREDICTED: exostosin-1-like isoform 2 [Bombus terrestris]
Length = 711
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 80/328 (24%), Positives = 131/328 (39%), Gaps = 58/328 (17%)
Query: 88 NFKIYIYPDGDP-NTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCH 146
F +Y+YP D + YQ I ESR+ T DP RA +F + +
Sbjct: 88 GFTVYVYPVEDVISPLYQK-------------ILNVITESRYYTSDPTRACIFVLALDTL 134
Query: 147 KMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV--RATEGVPFLI 204
T + V + LI + PYWN G +H + A E + F +
Sbjct: 135 DRDPLSTEF------VHNLPSKLI-RLPYWNN--GRNHLIFNLYSGTWPDYAEESLAFDL 185
Query: 205 KNAIRVVCSPSYDVAFIPHKDVALPQVLQ--PFALPRGGRDVEN---------------R 247
A+ S S P D+++P + P G+ +EN R
Sbjct: 186 GYAMLAKASMSI-FRHRPDFDISIPLFGKQHPERGGEPGQALENNFPNNKKYVAAFKGKR 244
Query: 248 TVLGFWAGHRNSK---------IRVILARVWENDTEL-DISNNRINRAIGPLVYQRRFYK 297
V G + RN+ + V R + EL D + N+ Y+
Sbjct: 245 YVHGIGSETRNALYHLHNGKDLVFVTTCRHGKAWRELQDEHCQQDNQEYDTYDYEILLMN 304
Query: 298 TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLK 357
FC+ P G ++ S R +++ GC+PVILSN + LPF++ +DW + + +ER + Q+
Sbjct: 305 ATFCLVPRGRRLGSFRFLEALRAGCIPVILSNGWALPFHERIDWTQAVIFSDERLLLQIP 364
Query: 358 QILKNISDAEFVSLHNNLVKVQKHFQWN 385
I++++S+ + L + Q F W
Sbjct: 365 DIVRSVSNVHILKL-----RQQTQFLWE 387
>gi|307177266|gb|EFN66444.1| Exostosin-1 [Camponotus floridanus]
Length = 711
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 72/297 (24%), Positives = 124/297 (41%), Gaps = 54/297 (18%)
Query: 123 IRESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGA 182
I ESR+ T DP RA +F + + T + V + L++ PYWN G
Sbjct: 111 ITESRYYTSDPARACIFVLALDTLDRDPLSTEF------VHNLPSKLLH-LPYWNN--GR 161
Query: 183 DHFFVTCHDVGV--RATEGVPFLIKNAIRVVCSPSYDVAFIPHK---DVALPQVLQ--PF 235
+H + + E + F + AI S ++ H+ DV++P + P
Sbjct: 162 NHLIFNLYSGTWPDYSEESLAFDMGYAILAKAS----MSIFRHRSDFDVSIPLFGKQHPE 217
Query: 236 ALPRGGRDVEN---------------RTVLGFWAGHRNS-------KIRVIL-----ARV 268
G+ +EN R V G + RN+ K V + +
Sbjct: 218 RGGEPGQALENNFPNNKKYVAAFKGKRYVHGIGSETRNALYHLHNGKDLVFVTTCRHGKA 277
Query: 269 WENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILS 328
W + N N+ Y+ FC+ P G ++ S R +++ GC+PVILS
Sbjct: 278 WREFQDEHCQQN--NQEYDTYDYEILLMNATFCLVPRGRRLGSFRFLEALRAGCIPVILS 335
Query: 329 NYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWN 385
N + LPF++ +DW + A+ +ER + Q+ I++++S+ + + L + Q F W
Sbjct: 336 NGWALPFHERIDWTQAAIFSDERLLLQIPDIVRSVSNVQILKL-----RQQTQFLWE 387
>gi|342320455|gb|EGU12395.1| Proteophosphoglycan ppg4 [Rhodotorula glutinis ATCC 204091]
Length = 1322
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 45/167 (26%), Positives = 78/167 (46%), Gaps = 8/167 (4%)
Query: 224 KDVALPQV-------LQPFALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELD 276
+DV +P V + F P V +R L F+AG + ++ T D
Sbjct: 1119 RDVVVPAVTKHTKALFETFKTPADVAPVNSRKHLAFFAGGVRGFGAIARTKIGCGRTGQD 1178
Query: 277 ISNNRINRAIGP-LVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPF 335
++ + + P Y +KFC+ P G R ++I+ GC+P + + PF
Sbjct: 1179 PNSAILYQQFSPGQRYLGTLNASKFCLLPRGIPAWMTRTFEAIYAGCIPAFIVDRNLFPF 1238
Query: 336 NDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHF 382
DILD+ +F+V + E D +++++IL + + L NLVKV++ F
Sbjct: 1239 QDILDYSRFSVTIPEADAHRIEEILSAYTPEQLSELQANLVKVREAF 1285
>gi|350421075|ref|XP_003492724.1| PREDICTED: exostosin-1-like isoform 1 [Bombus impatiens]
gi|350421078|ref|XP_003492725.1| PREDICTED: exostosin-1-like isoform 2 [Bombus impatiens]
Length = 711
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 80/328 (24%), Positives = 131/328 (39%), Gaps = 58/328 (17%)
Query: 88 NFKIYIYPDGDP-NTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCH 146
F +Y+YP D + YQ I ESR+ T DP RA +F + +
Sbjct: 88 GFTVYVYPVEDVISPLYQK-------------ILNVITESRYYTSDPTRACIFVLALDTL 134
Query: 147 KMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV--RATEGVPFLI 204
T + V + LI + PYWN G +H + A E + F +
Sbjct: 135 DRDPLSTEF------VHNLPSKLI-RLPYWNN--GRNHLIFNLYSGTWPDYAEESLAFDL 185
Query: 205 KNAIRVVCSPSYDVAFIPHKDVALPQVLQ--PFALPRGGRDVEN---------------R 247
A+ S S P D+++P + P G+ +EN R
Sbjct: 186 GYAMLAKASMSI-FRHRPDFDISIPLFGKQHPERGGEPGQALENNFPNNKKYVAAFKGKR 244
Query: 248 TVLGFWAGHRNSK---------IRVILARVWENDTEL-DISNNRINRAIGPLVYQRRFYK 297
V G + RN+ + V R + EL D + N+ Y+
Sbjct: 245 YVHGIGSETRNALYHLHNGKDLVFVTTCRHGKAWRELQDEHCQQDNQEYDTYDYEILLMN 304
Query: 298 TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLK 357
FC+ P G ++ S R +++ GC+PVILSN + LPF++ +DW + + +ER + Q+
Sbjct: 305 ATFCLVPRGRRLGSFRFLEALRAGCIPVILSNGWALPFHERIDWTQAVIFSDERLLLQIP 364
Query: 358 QILKNISDAEFVSLHNNLVKVQKHFQWN 385
I++++S+ + L + Q F W
Sbjct: 365 DIVRSVSNVHILKL-----RQQTQFLWE 387
>gi|332017043|gb|EGI57842.1| Exostosin-1 [Acromyrmex echinatior]
Length = 911
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 80/329 (24%), Positives = 131/329 (39%), Gaps = 60/329 (18%)
Query: 88 NFKIYIYPDGDP-NTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCH 146
F +Y+YP D + YQ I ESR+ T DP RA +F + +
Sbjct: 88 GFTVYVYPVEDVISPLYQK-------------ILNVITESRYYTSDPARACIFVLALDTL 134
Query: 147 KMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV---RATEGVPFL 203
T + V + L++ PYWN G +H + G A E + F
Sbjct: 135 DRDPLSTEF------VHNLPLKLLH-LPYWNN--GRNHLIFNLYS-GTWPDYAEESLAFD 184
Query: 204 IKNAIRVVCSPSYDVAFIPHKDVALPQVLQ--PFALPRGGRDVEN--------------- 246
+ A+ S S P DV++P + P G+ +EN
Sbjct: 185 VGYAMLAKASMSI-FKHRPDFDVSIPLFGKQHPERGGEPGQALENNFPNNKKYVAAFKGK 243
Query: 247 RTVLGFWAGHRNS-------KIRVILARVWENDTELDISNNRI---NRAIGPLVYQRRFY 296
R V G + RN+ K V + T + + N+ Y+
Sbjct: 244 RYVHGIGSETRNALYHLHNGKDLVFVTTCRHGKTWREFQDEHCQQDNQEYDTYDYEILLM 303
Query: 297 KTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQL 356
FC+ P G ++ S R +++ GC+PVILSN + LPF++ +DW + + +ER + Q+
Sbjct: 304 NATFCLVPRGRRLGSFRFLEALRSGCIPVILSNSWALPFHERIDWNQAVIFSDERLLLQI 363
Query: 357 KQILKNISDAEFVSLHNNLVKVQKHFQWN 385
IL+++S+ + + L + Q F W
Sbjct: 364 PDILRSVSNVQILKL-----RQQTQFLWE 387
>gi|343171902|gb|AEL98655.1| exostosin-like protein, partial [Silene latifolia]
Length = 250
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 30/107 (28%), Positives = 63/107 (58%), Gaps = 3/107 (2%)
Query: 298 TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ-- 355
+KFC+ P G +S R+ D+I CVPVI+S+ +LPF D LD+ F++ + + +
Sbjct: 121 SKFCLDPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPFEDELDYSNFSIFFSTEEALKPG 180
Query: 356 -LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYEL 401
+ + L+ + + ++++++ L + H+++ PP R D +M+ ++
Sbjct: 181 YMVEELRKVPEEKWLNMYKKLKNISHHYEFQYPPKREDGVNMLWRQI 227
>gi|297611147|ref|NP_001065633.2| Os11g0128000 [Oryza sativa Japonica Group]
gi|255679743|dbj|BAF27478.2| Os11g0128000, partial [Oryza sativa Japonica Group]
Length = 199
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 61/106 (57%), Gaps = 3/106 (2%)
Query: 295 FYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVY 354
+ +KFC+ G +S R+ D+I CVPVI+S+ +LP+ D LD+ KF++ + D
Sbjct: 65 MHASKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDDIELPYEDALDYSKFSIFVRSSDAV 124
Query: 355 Q---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMV 397
+ L ++++ +S ++ + L +V KHF++ P + DA M+
Sbjct: 125 KKGYLMRLIRGVSKHQWTMMWRRLKEVDKHFEYQYPSQKDDAVQMI 170
>gi|307111406|gb|EFN59640.1| hypothetical protein CHLNCDRAFT_133110 [Chlorella variabilis]
Length = 767
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 77/344 (22%), Positives = 134/344 (38%), Gaps = 74/344 (21%)
Query: 113 YASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SC--HKMRGKGTSYEN-MTVIVKDYVD 167
Y E F + + +S T DP+ A F++P+ SC + +R +S ++ V++ V
Sbjct: 372 YGLESAFLEMLMQSEHRTLDPEEADFFYVPVFTSCFIYPVRSGASSLQDWFYAPVQNRVQ 431
Query: 168 SLIN-----------KYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSY 216
N +P+W R G DH ++ HD G + VP I+ +I +
Sbjct: 432 GAANMLLEAFHWIQSHHPWWERRGGRDHIWLVTHDEG---SCWVPAAIRPSIILSHWGRM 488
Query: 217 DV--------------------------------------------AFIPHKDVALPQVL 232
D+ + P KD+ +P +
Sbjct: 489 DLNHSSTTGYWEDDYRQANARKLQHAEQQQHLFEPDGFQQKIAGHACYDPVKDLVVPLIK 548
Query: 233 QPFA---LPRGGRDVENRTVLGFWAGHRNSKI----RVILARVWENDTE---LDISNNRI 282
P P G NRT L F G N + R + RV E L+ ++
Sbjct: 549 TPNRNKHSPLFGAPTRNRTWLAFHRGRVNHEFPRYSRGVRQRVDNASREHQWLENYGSKF 608
Query: 283 NRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWR 342
Y + FC+ G SAR+ D++ +GC+PV++ + + F +LD
Sbjct: 609 GDESLQGDYSELLASSIFCLVLQGDGW-SARMDDAMSHGCIPVVIIDDVHVSFESVLDLS 667
Query: 343 KFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
+F++ + DV +L +IL+ +S L NL +V + + ++S
Sbjct: 668 QFSLRVKSADVERLPEILQAVSQERREELQRNLARVWQRYSYSS 711
>gi|301755012|ref|XP_002913342.1| PREDICTED: exostosin-like 1-like [Ailuropoda melanoleuca]
gi|281351586|gb|EFB27170.1| hypothetical protein PANDA_001136 [Ailuropoda melanoleuca]
Length = 668
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 58/221 (26%), Positives = 97/221 (43%), Gaps = 27/221 (12%)
Query: 174 PYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQ 233
P WN G +H ++ H T F + A+ P+ D F P DVALP L
Sbjct: 147 PQWNG--GRNHLVLSLHPAFCPQT----FQLGQAMVAKAGPTVDT-FRPGFDVALP--LL 197
Query: 234 PFALP-RGGRDVENRTVLGFWAGHRNSKIRVILA-----RVWENDTELDISNNRINRAIG 287
P A P RGG + LG + H + + R D+ + R + G
Sbjct: 198 PEAHPLRGGAPGQ----LGQHSPHPGVPLLALAEERGGWRTAGADSSACPWDGRCEQDRG 253
Query: 288 PLVYQ--RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFA 345
P FC+ PG ++ ++ GC+PV+LS ++LPF++++DW K A
Sbjct: 254 PEQTHPGATLPNATFCLIPGHGP-DALHFLQALQAGCIPVLLSPRWELPFSEVIDWTKAA 312
Query: 346 VILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
++ +ER ++Q+ L+ + ++L + Q F W++
Sbjct: 313 IVADERLLFQVLTALQEMPPTRVLAL-----RQQTQFLWDA 348
>gi|320165867|gb|EFW42766.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 508
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 83/385 (21%), Positives = 145/385 (37%), Gaps = 52/385 (13%)
Query: 74 SPSVFHLNYLEMVTNFKIYIYPDG----DPNTFYQTPRKLTGKYASEGYFFQNIRESRFL 129
SP+ + + KIY+Y + +P+ QT +T + E + +S
Sbjct: 118 SPTTSQFKWEAPIGTLKIYVYKEMMKVMEPHKCLQT---VTPMWLEEVELPSWVVKSIHY 174
Query: 130 THDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTC 189
T++PD AH+FFIP + + ++T + VD L K+ Y+ R G DHF +
Sbjct: 175 TNNPDEAHMFFIPAMVRCILDFNRTQFHLTSEFTEMVDVLHTKHDYYRRNHGHDHFIINP 234
Query: 190 HDVGVR------ATEGVPFLI----KNAIRVVCSPSYDVAFIPHKDVALPQ----VLQPF 235
+ A E P NA +++ + A+ D +P + F
Sbjct: 235 GGGSMNVISSLLAGELHPVAANDWYSNATKLLSEAARSRAYFSGLDFVIPGSADYIFGKF 294
Query: 236 ALPRGGRDVENRTVLGFWAGHRNSKIRVILARVW---ENDTELD-------ISNNRINRA 285
+ E + + G R L R+ + D+E + N+I+
Sbjct: 295 MDVSQKIEEERPMLFLYLGGTSLGDQRQALGRLRKLVQGDSEQAAFFRDKVLIANKIDDP 354
Query: 286 IGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFA 345
+ P +Y R FC P G+ + R DS+ GC+PV + F D +DW
Sbjct: 355 V-PELYSLRIQNFTFCAAPHGTSPWTQRFYDSLISGCIPVQFDRRFRFGFYDHIDWDSIV 413
Query: 346 V--ILNERDVYQLKQILKNIS-DAEFVSLHNNLVKVQKHFQW-----------------N 385
V ++ D + + L +S D +F+ + H +
Sbjct: 414 VRYPTSQLDSFSFLEYLYKLSLDEDFIRERRRQIAAVAHLFYYGESSKALQTTAAALSDP 473
Query: 386 SPPVRFDAFHMVMYELWLRRHVIKY 410
P VR +A+H+ Y + +R +Y
Sbjct: 474 DPVVRREAYHLNAYSMTIRELADRY 498
>gi|431901712|gb|ELK08589.1| Exostosin-1 [Pteropus alecto]
Length = 452
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 29/96 (30%), Positives = 56/96 (58%), Gaps = 5/96 (5%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y+ + FC+ P G ++ S R +++ CVPV+LSN ++LPF+++++W + AVI +E
Sbjct: 30 YREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDE 89
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
R + Q+ +++I + ++L Q F W +
Sbjct: 90 RLLLQIPSTIRSIHQDKILALRQ-----QTQFLWEA 120
>gi|296227312|ref|XP_002759355.1| PREDICTED: exostosin-1 [Callithrix jacchus]
Length = 475
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 29/96 (30%), Positives = 56/96 (58%), Gaps = 5/96 (5%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y+ + FC+ P G ++ S R +++ CVPV+LSN ++LPF+++++W + AVI +E
Sbjct: 53 YREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDE 112
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
R + Q+ +++I + ++L Q F W +
Sbjct: 113 RLLLQIPSTIRSIHQDKILALRQ-----QTQFLWEA 143
>gi|432094719|gb|ELK26199.1| Exostosin-1 [Myotis davidii]
Length = 544
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 29/96 (30%), Positives = 56/96 (58%), Gaps = 5/96 (5%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y+ + FC+ P G ++ S R +++ CVPV+LSN ++LPF+++++W + AVI +E
Sbjct: 122 YREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDE 181
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
R + Q+ +++I + ++L Q F W +
Sbjct: 182 RLLLQIPSTIRSIHQDKILALRQ-----QTQFLWEA 212
>gi|357113818|ref|XP_003558698.1| PREDICTED: uncharacterized protein LOC100844507 [Brachypodium
distachyon]
Length = 781
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 85/323 (26%), Positives = 137/323 (42%), Gaps = 58/323 (17%)
Query: 107 RKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYV 166
R L G A YF+ + +S +T D HL P H +R SY + K Y
Sbjct: 398 RTLNGDEAD--YFYVPVLDSCLITRSDDAPHLL-TPEDLH-LR----SYHALEYYRKAY- 448
Query: 167 DSLINKYPYWNRTLGADHFFVTCHDVGV----RATEGVPFLI---------KNAIRVVCS 213
D + +Y YWNRT G DH + D G + L+ +N+ +
Sbjct: 449 DHISQRYAYWNRTSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWGNTNTKHENSTTAYWA 508
Query: 214 PSYD----------VAFIPHKDVALP--QVLQPFA--LPRGGRDVENRTVLGFWAGHRNS 259
++D F P KD+ LP +V +P A L R NRT L ++ G+
Sbjct: 509 DNWDDIPLDRRGNHPCFDPRKDLVLPAWKVPEPGAIWLKLWARPRINRTTLFYFNGNLGP 568
Query: 260 K-------------IRVILARVWENDTELDISNNRINRAIGPLVYQR--RFYK----TKF 300
IR LA + + + R + A + Y R ++Y+ + F
Sbjct: 569 AYEQGRPEDTYSMGIRQKLAAEFGSTPSKEGKLGRQHTANVTVTYLRSEKYYEELASSVF 628
Query: 301 C-ICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQI 359
C PG S R+ DS+ GC+PVI+ + LP+ ++L++ FAV + E D+ L +I
Sbjct: 629 CGALPGDGW--SGRMEDSMLQGCIPVIIQDGIFLPYENVLNYNSFAVRIQEHDIPNLIRI 686
Query: 360 LKNISDAEFVSLHNNLVKVQKHF 382
L I++ + + N+ ++ + F
Sbjct: 687 LGGINETQIEFMLGNVRQIWQRF 709
>gi|302835860|ref|XP_002949491.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300265318|gb|EFJ49510.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 554
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 71/322 (22%), Positives = 123/322 (38%), Gaps = 80/322 (24%)
Query: 119 FFQNIRESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNR 178
F+Q + S D D A ++IPI ++R S +K ++ + YP+WNR
Sbjct: 147 FYQRLLGSGARVADGDLADWYYIPI---RLRTATDS-----AFLKYAIEYIREAYPWWNR 198
Query: 179 TLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALP 238
T GA HF + D+G + V + ++ ++ H + +
Sbjct: 199 TGGARHFVIHTGDLGA----------DEVMDDVYGMAANMTWLTHWGLTV---------- 238
Query: 239 RGGRDVENRTVLGFWAGHRNSKIRVILARVWEND----TELDISNNRINRAIGP------ 288
++ G+W HR K R W T + ++ R + GP
Sbjct: 239 -------DKNTSGWWKAHRPDKARA--GARWGTRGGYYTRVSVNRRRGSHMWGPPSPAPH 289
Query: 289 --------------------LVYQRRFYK----TKFCICPGGSQVNSARIADSIHYGCVP 324
+ ++R + K +KFC+ P G +I S + GC+P
Sbjct: 290 RAGVRQKVHFHHWNRTGFRIVTFERNYGKALVSSKFCLAPLGGGHGQRQIIVS-YMGCIP 348
Query: 325 VILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDA-EFVSLHNNLVKVQKHFQ 383
V +++ PF DW +FAV E D+ +L +IL+ IS + + L +H
Sbjct: 349 VCIADGVYEPFEPQTDWTEFAVRPAEADIPRLHEILEGISAGNKLAEMQVALRCAAQHLL 408
Query: 384 WNS-------PPVRFDAFHMVM 398
++S R+DAF +
Sbjct: 409 YSSMVGGLFGEDGRYDAFETTL 430
>gi|302830910|ref|XP_002947021.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300268065|gb|EFJ52247.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 834
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 81/354 (22%), Positives = 132/354 (37%), Gaps = 79/354 (22%)
Query: 113 YASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SC--------------HKMRGKGTSY- 155
Y E F + + +S T DP+ A F++P+ SC H G +
Sbjct: 432 YGLEVLFHEMLLQSEHRTFDPEAADYFYVPVYGSCFIFPLHCYADGPWWHVPSGPRVMHV 491
Query: 156 ENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV--------------------- 194
NM + V+D++ +PYW+R G DH ++ HD G
Sbjct: 492 TNMMLEVRDWIR---KHFPYWDRRGGRDHIWLMTHDEGACYAPTEIYNSSIFLTHWGRID 548
Query: 195 -RATEGVPFLIKNAIRVVCSPSYDVAFI----------PHKDVALPQVLQP--------- 234
F N + P ++ P KD+ +P + P
Sbjct: 549 KHHASNTAFTPDNYTQEYVHPEQPGGWLHLIDGHPCYTPGKDLVVPALKLPHHFRQSPLL 608
Query: 235 FALPRGGRDVENRTVLGFWAG----HR----NSKIRVILARVWENDTELDISNNRINRAI 286
F PR R +L + G HR + IR L R+W + L N I
Sbjct: 609 FHPPR------QRDILLYLRGDVGKHRLPNYSRGIRQRLYRLWRDQQWLQGYNVMIGDGS 662
Query: 287 G-PLVYQRRFYKTKFC-ICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKF 344
P Y ++KFC + PG S R+ D++ +GCVPVI+ + + D L+ +F
Sbjct: 663 DVPGDYSEHLSRSKFCLVVPGDGW--SPRLEDAVLHGCVPVIIMDGVHGVWEDQLELERF 720
Query: 345 AVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVM 398
++ + E ++ L Q L + + L KV + + S P+ + V+
Sbjct: 721 SIRVGEDELEGLPQQLAVVPQRVLEDMQRKLRKVWHRYAYVSHPLLSEEMKAVL 774
>gi|355757951|gb|EHH61385.1| Exostosin-1, partial [Macaca fascicularis]
Length = 431
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 29/96 (30%), Positives = 56/96 (58%), Gaps = 5/96 (5%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y+ + FC+ P G ++ S R +++ CVPV+LSN ++LPF+++++W + AVI +E
Sbjct: 9 YREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDE 68
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
R + Q+ +++I + ++L Q F W +
Sbjct: 69 RLLLQIPSTIRSIHQDKILALRQ-----QTQFLWEA 99
>gi|440904844|gb|ELR55304.1| Exostosin-1, partial [Bos grunniens mutus]
Length = 426
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 29/96 (30%), Positives = 56/96 (58%), Gaps = 5/96 (5%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y+ + FC+ P G ++ S R +++ CVPV+LSN ++LPF+++++W + AVI +E
Sbjct: 4 YREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDE 63
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
R + Q+ +++I + ++L Q F W +
Sbjct: 64 RLLLQIPSTIRSIHQDKILALRQ-----QTQFLWEA 94
>gi|281354411|gb|EFB29995.1| hypothetical protein PANDA_011713 [Ailuropoda melanoleuca]
Length = 426
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 29/96 (30%), Positives = 56/96 (58%), Gaps = 5/96 (5%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y+ + FC+ P G ++ S R +++ CVPV+LSN ++LPF+++++W + AVI +E
Sbjct: 4 YREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDE 63
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
R + Q+ +++I + ++L Q F W +
Sbjct: 64 RLLLQIPSTIRSIHQDKILALRQ-----QTQFLWEA 94
>gi|403287279|ref|XP_003934878.1| PREDICTED: exostosin-like 1 [Saimiri boliviensis boliviensis]
Length = 675
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 59/221 (26%), Positives = 99/221 (44%), Gaps = 30/221 (13%)
Query: 176 WNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPF 235
WNR G +H + H T F + A+ SP+ D +F DVALP + +
Sbjct: 152 WNR--GRNHLVLRLHPAPCPRT----FQLGQAMVAEASPTVD-SFRAGFDVALPFLPEAH 204
Query: 236 ALPRGGRDVENRTVLGFWAGHRNSKIRVILA--------RVWENDTELDISNNRINRAIG 287
L RGG + R H +LA R ++ + R + G
Sbjct: 205 PL-RGGAPGQLRQ-------HSPQPGVALLALEEERGGWRTAGTNSSACPWDGRCEQDPG 256
Query: 288 PLVYQR--RFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFA 345
P QR FC+ G ++R ++ GC+PV+LS ++LPF++++DW K A
Sbjct: 257 PGQTQRWEMLPNATFCLISGHHPQAASRFLQALQAGCIPVLLSPRWELPFSEVIDWTKAA 316
Query: 346 VILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
++ +ER + Q+ L+ +S A+ ++L + Q F W++
Sbjct: 317 IVADERLLLQVLAALQEMSPAQVLAL-----RQQTQFLWDA 352
>gi|56606002|ref|NP_001008400.1| exostosin-2 [Danio rerio]
gi|55247892|gb|AAV48783.1| exostosin-2 [Danio rerio]
Length = 719
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 28/94 (29%), Positives = 58/94 (61%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + ++ FC+ G+++ A ++D + GCVPVI+++ Y LPF+++LDW++ +V++ E
Sbjct: 309 YPQILQESSFCVVLRGARLGQATLSDVLQAGCVPVIMADSYILPFSEVLDWKRASVVIPE 368
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ ++ ILK+I H + ++Q+ +W
Sbjct: 369 EKLPEMYTILKSIP-------HRQVEEMQRQARW 395
>gi|326511980|dbj|BAJ95971.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 784
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 79/334 (23%), Positives = 132/334 (39%), Gaps = 70/334 (20%)
Query: 113 YASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGT------------SYENM 158
Y ++ +++I S T + D A F++P+ SC R SY +
Sbjct: 382 YGAQMALYESILASPHRTLNGDEADYFYVPVLDSCLITRSDDAPHLRMPEDLRLRSYHTL 441
Query: 159 TVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVV------- 211
K Y D + +YPYWNRT G DH + D G P I N++ +V
Sbjct: 442 EYYRKAY-DHIAQRYPYWNRTSGRDHIWFFSWDEGACY---APKEIWNSMMLVHWGNTNT 497
Query: 212 ---------CSPSYD----------VAFIPHKDVALPQVLQP----FALPRGGRDVENRT 248
+ ++D F P KD+ LP +P L R NRT
Sbjct: 498 KHEKSTTAYWADNWDDIPFDRRGNHPCFDPRKDLVLPAWKEPNPGAIWLKLWARPKINRT 557
Query: 249 VLGFWAGHRNSK-------------IRVILARVWENDTELDISNNRINRA------IGPL 289
L ++ G+ IR LA + + R A +
Sbjct: 558 TLFYFNGNLGPAYEEGRPEDTYSMGIRQKLAAEFGSTPNKQGKLGRQQTANVTVTYLKSE 617
Query: 290 VYQRRFYKTKFC-ICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVIL 348
+Y + FC + PG S R+ DS+ GC+PVI+ + LP+ ++L++ F+V +
Sbjct: 618 MYYEELASSIFCGVLPGDGW--SGRMEDSMLQGCIPVIIQDGIFLPYENVLNYNSFSVRI 675
Query: 349 NERDVYQLKQILKNISDAEFVSLHNNLVKVQKHF 382
E D+ L ++L+ ++ + + N+ +V + F
Sbjct: 676 QEDDIPNLIKVLQGLNGTQIDFMLGNVRQVWQRF 709
>gi|159470401|ref|XP_001693348.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158277606|gb|EDP03374.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 356
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 77/308 (25%), Positives = 130/308 (42%), Gaps = 47/308 (15%)
Query: 113 YASEGYFFQN-IRESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLIN 171
Y + YF + I ++ T +P AHLF++P G E V D+V +
Sbjct: 25 YTAYEYFLKYFITDNIVRTENPYEAHLFYVPALNFFYSGNLRPPEYHLEAVMDHVKT--- 81
Query: 172 KYPYWNRTLGADHF-FVT-----CH---DVGVRATEGVPFLIK----NAIRVVCSPSY-- 216
+P++NR+ G DHF F+T CH D+ + V F ++ N + + Y
Sbjct: 82 AWPFYNRSGGRDHFIFLTGDRGACHMPRDMQDSMIKVVHFGMQKQGLNWTSMEHNKEYGC 141
Query: 217 -----DVAFIPHKD-------VALPQVLQPFALPRGGRDVENRTVL-------GFWAGHR 257
D+ PH + V Q A G N T L G ++G
Sbjct: 142 IRMRQDLVVPPHPNDHKPLWPVGAAAYFQRIAAAGGHDAGRNITFLFAGGVGEGEYSGGT 201
Query: 258 NSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADS 317
+R +L + D + R + + L ++++FC+ G R+ S
Sbjct: 202 RQAVRALLLNI--TDPAIMFVEGRRDDYVDLL------WRSQFCLAAYGHGW-GIRVMQS 252
Query: 318 IHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVK 377
I +GC+PVI+ ++ F D L + +F+V L RDV +L ++L++ S + +L + K
Sbjct: 253 IQFGCIPVIIQDHVYQAFEDFLPYEEFSVRLPLRDVPRLLELLRSYSPEQLAALRLGMAK 312
Query: 378 VQKHFQWN 385
+ F WN
Sbjct: 313 YFRAFIWN 320
>gi|349603640|gb|AEP99427.1| Exostosin-1-like protein, partial [Equus caballus]
Length = 435
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 29/96 (30%), Positives = 56/96 (58%), Gaps = 5/96 (5%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y+ + FC+ P G ++ S R +++ CVPV+LSN ++LPF+++++W + AVI +E
Sbjct: 13 YREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDE 72
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
R + Q+ +++I + ++L Q F W +
Sbjct: 73 RLLLQIPSTIRSIHQDKILALRQ-----QTQFLWEA 103
>gi|195150443|ref|XP_002016164.1| GL11446 [Drosophila persimilis]
gi|194110011|gb|EDW32054.1| GL11446 [Drosophila persimilis]
Length = 676
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 36/147 (24%), Positives = 72/147 (48%), Gaps = 3/147 (2%)
Query: 258 NSKIRVILARVWENDTELDISNNRI---NRAIGPLVYQRRFYKTKFCICPGGSQVNSARI 314
N + V++ + ++ +NR NR Y+ + FC+ P G ++ S R
Sbjct: 289 NGRDMVLVTTCRHGKSWRELQDNRCDEDNREYDRYDYETLLQNSTFCLVPRGRRLGSFRF 348
Query: 315 ADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNN 374
+++ GC+PV+LSN + LPF +DW++ A+ +ER + Q+ I+++IS +L
Sbjct: 349 LEALQAGCIPVLLSNAWVLPFESKIDWKQAAIWADERLLLQVPDIVRSISAERIFALRQQ 408
Query: 375 LVKVQKHFQWNSPPVRFDAFHMVMYEL 401
+ + + + + F F ++ L
Sbjct: 409 TQVLWERYFGSIEKIVFTTFEIIRERL 435
>gi|402589373|gb|EJW83305.1| hypothetical protein WUBG_05784 [Wuchereria bancrofti]
Length = 689
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 72/303 (23%), Positives = 133/303 (43%), Gaps = 48/303 (15%)
Query: 89 FKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHKM 148
F++Y+YP + + ++ Y++ + IRES + T DP +A LF + I
Sbjct: 82 FRVYVYPSDNSSA-------MSVVYSN---ILKVIRESIYYTDDPQKACLFVLGIDTVDR 131
Query: 149 RGKGTSYENMTVIVKDYVDSLINKYP--YWNRTLGADHFFVTC----------HDVGVRA 196
+ +Y VK YV+ LI P WN G +H HD+G
Sbjct: 132 DRRSENY------VK-YVNELIENLPSEIWNH--GRNHIIFNLYHGTYPDYSDHDLGFDT 182
Query: 197 TEGVPFLIKNAIRVVCSPSYDVAF-IPHKDVALPQVLQP-FALPRGGRDV----ENRTVL 250
V R+ ++D++F + HK+ L ++ ++L + + R V
Sbjct: 183 GYAVIARASANTRIF-RENFDLSFPLFHKEHPLRTTVKAKWSLKIKDKYLVTFKGKRYVY 241
Query: 251 GFWAGHRNS------KIRVILARVWENDTEL----DISNNRINRAIGPLVYQRRFYKTKF 300
G + R+S V++ +++T+ D N Y+ + F
Sbjct: 242 GIGSETRDSLYHLHNGQSVVMVTTCKHNTDWKKYEDERCEEDNVEYDHWDYEMIMSNSTF 301
Query: 301 CICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQIL 360
C+ P G ++ S R +++ GC+PV+LS+ ++LPF++++DWR+ +I +E V + +L
Sbjct: 302 CLTPRGRRLGSFRFLEALRLGCIPVVLSDDWELPFSEVIDWRQAVIIGHEDTVLTISDVL 361
Query: 361 KNI 363
I
Sbjct: 362 SAI 364
>gi|410987732|ref|XP_004000149.1| PREDICTED: exostosin-1 [Felis catus]
Length = 488
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 29/96 (30%), Positives = 56/96 (58%), Gaps = 5/96 (5%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y+ + FC+ P G ++ S R +++ CVPV+LSN ++LPF+++++W + AVI +E
Sbjct: 66 YREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDE 125
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
R + Q+ +++I + ++L Q F W +
Sbjct: 126 RLLLQIPSTIRSIHQDKILALRQ-----QTQFLWEA 156
>gi|354495337|ref|XP_003509787.1| PREDICTED: exostosin-like 1 [Cricetulus griseus]
Length = 668
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 56/225 (24%), Positives = 96/225 (42%), Gaps = 38/225 (16%)
Query: 174 PYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQ 233
PYWN G +H ++ + + A+ SPS D+ F P DVALP + +
Sbjct: 147 PYWNG--GRNHLVLSLYPAPCTR-------LGQAMVAEASPSLDI-FRPGFDVALPYLPE 196
Query: 234 PFALPRGGRDVE--------NRTVLGFWAGHRNSKIRVILARV----WENDTELDISNNR 281
L +GG + T+L + R+I W E D ++
Sbjct: 197 AHPL-QGGAPGQLQQHSPQPGATLLA--VAEEKGRWRIIDTHASTCPWGKHCEQDPGLHQ 253
Query: 282 INRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDW 341
+ FC+ PG S+ + ++ GC+PV+LS ++LPF++++DW
Sbjct: 254 THPG-------ETLPNATFCLIPGHRSAASS-LLQALQAGCIPVLLSPRWELPFSEVIDW 305
Query: 342 RKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
K A+I +ER Q+ L+++ ++ +SL Q F W +
Sbjct: 306 TKAAIIADERLPLQVLTALRDMLPSKILSLRQ-----QTQFLWTA 345
>gi|351697330|gb|EHB00249.1| Exostosin-1, partial [Heterocephalus glaber]
Length = 426
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 29/96 (30%), Positives = 56/96 (58%), Gaps = 5/96 (5%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y+ + FC+ P G ++ S R +++ CVPV+LSN ++LPF+++++W + AVI +E
Sbjct: 4 YREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDE 63
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
R + Q+ +++I + ++L Q F W +
Sbjct: 64 RLLLQIPSTIRSIHQDKILALRQ-----QTQFLWEA 94
>gi|77748248|gb|AAI05840.1| Ext1 protein [Rattus norvegicus]
Length = 557
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 29/96 (30%), Positives = 56/96 (58%), Gaps = 5/96 (5%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y+ + FC+ P G ++ S R +++ CVPV+LSN ++LPF+++++W + AVI +E
Sbjct: 135 YREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDE 194
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
R + Q+ +++I + ++L Q F W +
Sbjct: 195 RLLLQIPSTIRSIHQDKILALRQ-----QTQFLWEA 225
>gi|380799435|gb|AFE71593.1| exostosin-1, partial [Macaca mulatta]
Length = 542
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 29/96 (30%), Positives = 56/96 (58%), Gaps = 5/96 (5%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y+ + FC+ P G ++ S R +++ CVPV+LSN ++LPF+++++W + AVI +E
Sbjct: 120 YREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDE 179
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
R + Q+ +++I + ++L Q F W +
Sbjct: 180 RLLLQIPSTIRSIHQDKILALRQ-----QTQFLWEA 210
>gi|293336592|ref|NP_001167834.1| uncharacterized protein LOC100381534 [Zea mays]
gi|223944319|gb|ACN26243.1| unknown [Zea mays]
Length = 241
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 57/232 (24%), Positives = 105/232 (45%), Gaps = 32/232 (13%)
Query: 190 HDVGVRATEGVPFLIKNAIRV---------VCSPSYDVAFIPH------KDVALPQVLQP 234
DV + +G+P +K +I + VC V PH ++ P+ Q
Sbjct: 2 EDVAI--ADGIPEFLKRSILLQTFGVQGHHVCQEVEHVVIPPHVPPEVAHELPEPEKAQR 59
Query: 235 --FALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQ 292
FA RG +V + + G + + K+R L + + + + + R + Y+
Sbjct: 60 DIFAFFRGKMEVHPKNISGRFY---SKKVRTELLQHYGRNRKFYLKRKRFDN------YR 110
Query: 293 RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERD 352
++ FC+CP G S R+ +S+ GC+PVI+++ LPF +L W++ ++ + E+D
Sbjct: 111 SEMARSLFCLCPLGWAPWSPRLVESVLLGCIPVIIADDIRLPFPPVLQWQEISLQVAEKD 170
Query: 353 VYQLKQILKNISDAEFVSLHNNL---VKVQKHFQWNSPPVRFDAFHMVMYEL 401
+ L +L ++ + NL VK ++ +N P DA V+ EL
Sbjct: 171 IASLGMVLDHVVATNLTVIQKNLWDPVK-RRALVFNRPMEAGDATWQVLREL 221
>gi|302795426|ref|XP_002979476.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
gi|300152724|gb|EFJ19365.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
Length = 434
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 68/283 (24%), Positives = 128/283 (45%), Gaps = 24/283 (8%)
Query: 121 QNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTVI----VKDYVDSLINKYP 174
+ + +S +P A +F++P S + G G S ++ ++ + + + +
Sbjct: 117 RRLSDSSVRVMNPQEADVFYVPFFSSLSLVVGNGKSEDDEDPYSDEDTQEELMAWLEEQE 176
Query: 175 YWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQP 234
W + G DH V C D A + + +KN + ++ F P + + V+ P
Sbjct: 177 SWKKNKGRDHV-VICQDPN--ALKRLRDRLKNTVLLLSDFE---RFKPDQASLVKDVVLP 230
Query: 235 FALPRGGRDVEN----RTVLGFWAGHR----NSKIRVILARVWENDTELDISNNRINRAI 286
+ EN R L F+ G+R KIR L +V + + ++ + + +R
Sbjct: 231 YTHRIDSYSNENVTLDRDTLLFFMGNRYRKEGGKIRDQLFQVLDVEPDMVMKHGTQSRE- 289
Query: 287 GPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAV 346
G + + +KFC+ P G ++ R+ D+I CVPVI+S+ +LPF D LD+ +FA+
Sbjct: 290 GRRLAKVGMQTSKFCLHPAGDTPSACRLFDAIVSVCVPVIVSDDIELPFEDELDYSEFAI 349
Query: 347 I---LNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
+N + L L++IS L +V+K+F++
Sbjct: 350 FVPSINALEPGYLGSYLRSISPDLLKQKQQRLREVRKYFEYEE 392
>gi|47230478|emb|CAF99671.1| unnamed protein product [Tetraodon nigroviridis]
Length = 722
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 57/94 (60%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + ++ FC+ G+++ A ++D + GCVPVIL++ Y LPF+++LDW++ +V + E
Sbjct: 309 YPQILQESSFCVVLRGARLGQAVLSDVLQAGCVPVILADSYILPFSEVLDWKRASVFIPE 368
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ ++ ILK+I H + ++Q+ +W
Sbjct: 369 EKLSEMYSILKSIP-------HRQVEEMQRQARW 395
>gi|302792172|ref|XP_002977852.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
gi|300154555|gb|EFJ21190.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
Length = 434
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 75/315 (23%), Positives = 144/315 (45%), Gaps = 35/315 (11%)
Query: 112 KYASEGYFFQNIR--ESRFLTH------DPDRAHLFFIPI--SCHKMRGKGTSYENMTVI 161
++++E + F ++ E R L+ +P A +F++P S + G G S ++
Sbjct: 100 QHSAEWWLFYDLEQGEDRRLSDASVRVMNPQEADVFYVPFFSSLSLVVGNGKSEDDEDPY 159
Query: 162 ----VKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYD 217
+++ + + + + W + G DH V C D A + + +KN + ++
Sbjct: 160 SDEDIQEELMAWLEEQESWKKNKGRDHV-VICQDPN--ALKRLRDRLKNTVLLLSDFE-- 214
Query: 218 VAFIPHKDVALPQVLQPFALPRGGRDVEN----RTVLGFWAGHR----NSKIRVILARVW 269
F P + + V+ P+ EN R L F+ G+R KIR L +V
Sbjct: 215 -RFKPDQASLVKDVVLPYTHRIDSYFNENVTLDRDTLLFFMGNRYRKEGGKIRDQLFQVL 273
Query: 270 ENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSN 329
+ + ++ + + +R G + + +KFC+ P G ++ R+ D+I CVPVI+S+
Sbjct: 274 DVEPDMVMKHGTQSRE-GRRLAKVGMQTSKFCLHPAGDTPSACRLFDAIVSVCVPVIVSD 332
Query: 330 YYDLPFNDILDWRKFAVI---LNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
+LPF D LD+ +FA+ +N + L L++IS L +V+K+F++
Sbjct: 333 DIELPFEDELDYSEFAIFVPSINALEPGYLGSYLRSISPDLLKQKQQRLREVRKYFEYEE 392
Query: 387 PPVRFDAFHMVMYEL 401
+ A M+ E+
Sbjct: 393 ---KGGAVEMIWLEV 404
>gi|291399546|ref|XP_002716175.1| PREDICTED: exostoses-like 1 [Oryctolagus cuniculus]
Length = 671
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 62/224 (27%), Positives = 95/224 (42%), Gaps = 33/224 (14%)
Query: 174 PYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQ 233
P WN G +H +T H T F + A+ SP+ D F P DVALP L
Sbjct: 147 PQWNG--GRNHLVLTLHPAPRPRT----FQLAQAMVAEASPTVDT-FRPGCDVALP--LL 197
Query: 234 PFALP-RGGRDVENRTVLGFWAGHRNSKIRVILA--------RVWENDTELDISNNRINR 284
P A P R G + R H +LA R + ++ + R +
Sbjct: 198 PEAHPLRAGAPGQLRQ-------HSPHPRAALLALASERGRWRTADANSSACSWDGRCEQ 250
Query: 285 AIGPLVYQ--RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWR 342
G + Q FC+ PG + ++ R ++ GC+PV+LS ++LPF++ +DW
Sbjct: 251 DPGGELSQPTETLPNATFCLIPG-HRSDAWRFLQALQAGCIPVLLSPLWELPFSERIDWT 309
Query: 343 KFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
K +I +ER Q+ L+ + A +SL Q F W +
Sbjct: 310 KATIIADERQPRQVLAALQEMPLARVLSLRQ-----QTQFLWAA 348
>gi|148229160|ref|NP_001083108.1| uncharacterized protein LOC398750 [Xenopus laevis]
gi|37805422|gb|AAH60367.1| MGC68803 protein [Xenopus laevis]
Length = 718
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 34/127 (26%), Positives = 68/127 (53%), Gaps = 19/127 (14%)
Query: 270 ENDTELDISNNRINRAIGPLVYQRRFYK------------TKFCICPGGSQVNSARIADS 317
EN + I N + G ++ ++R YK + FCI G+++ ++D
Sbjct: 275 ENGESVLILEKCSNYSDGAMLLRKRCYKNMMYDYPQILQESTFCIVLRGARLGQGVLSDV 334
Query: 318 IHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVK 377
+ GCVPVI+++ Y LPF+++LDW++ +V++ E ++++ IL+ IS + +
Sbjct: 335 LQAGCVPVIIADSYVLPFSEVLDWKRASVVIPEEKMFEMYSILQGIS-------QRQVEE 387
Query: 378 VQKHFQW 384
+Q+ +W
Sbjct: 388 MQRQARW 394
>gi|297738432|emb|CBI27633.3| unnamed protein product [Vitis vinifera]
Length = 441
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 64/108 (59%), Gaps = 5/108 (4%)
Query: 298 TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDV---- 353
+KFC+ P G +S R+ D+I CVPVI+S+ +LP+ D +D+ +F++ ++++
Sbjct: 308 SKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPYEDEIDYTQFSIFFSDKEALEPG 367
Query: 354 YQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYEL 401
Y ++Q L+ I +V + +L + H+++ PP + DA M+ ++
Sbjct: 368 YMIEQ-LRQIPKERWVEMWRHLKYISHHYEFQYPPKKGDAIDMLWRQV 414
>gi|296207111|ref|XP_002750527.1| PREDICTED: exostosin-like 1 [Callithrix jacchus]
Length = 675
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 78/310 (25%), Positives = 124/310 (40%), Gaps = 62/310 (20%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
K+++YP P +T RK+ +I SRF T PD A C
Sbjct: 94 GLKVFVYPAAGP--ISETHRKI----------LASIEGSRFHTFSPDGA--------C-- 131
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYPY-WNRTLGADHFFVTCHDVGVRATEGVPFLIKN 206
+ ++ D + P WNR G +H + H T F +
Sbjct: 132 ---------LLLLLSLDAQIGECSSMPLQWNR--GRNHLVLRLHPARCPRT----FQLGQ 176
Query: 207 AIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVENRTVLGFWAGHRNSKIRVILA 266
A+ SP+ D +F DVALP + + L RGG + R H +LA
Sbjct: 177 AMVAEASPTVD-SFRVGFDVALPFLPEAHPL-RGGAPGQLRQ-------HSLQPGVALLA 227
Query: 267 RVWENDTELDISNN--------RINRAIGPLVYQR--RFYKTKFCICPGGSQVNSARIAD 316
E N R + GP QR FC+ G + R
Sbjct: 228 LEEERGGWRTAGTNFSACPWDGRCKQDPGPGQTQRWEMLPNATFCLISGHRSQAALRFLQ 287
Query: 317 SIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLV 376
++ GC+PV+LS ++LPF++++DW K A++ +ER + Q+ L+ +S A+ ++L
Sbjct: 288 ALQAGCIPVLLSPRWELPFSEVIDWTKAAIVADERLLLQVLAALQEMSPAQVLAL----- 342
Query: 377 KVQKHFQWNS 386
+ Q F W++
Sbjct: 343 RQQTQFLWDA 352
>gi|412986477|emb|CCO14903.1| predicted protein [Bathycoccus prasinos]
Length = 574
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 60/236 (25%), Positives = 106/236 (44%), Gaps = 27/236 (11%)
Query: 180 LGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVA-----LPQVLQ- 233
+ HF V+ ++GV E PF + + V+ S S+ ++P + V+ L L+
Sbjct: 324 INTTHFIVSNSEIGV---ENAPFDAQKDVAVIPSLSF---YLPREAVSRGLVDLSHFLRN 377
Query: 234 -------PFALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAI 286
LP+ + E L F +R + + EN + D +
Sbjct: 378 EEEYASVGLELPK--LEAERNIRLMFRGNNRGPLREKVFRYLIENGSPED--SIETTGVA 433
Query: 287 GPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAV 346
P Y +K+C+ G++V S R+ + + +GCVPVI+++ Y+LP LDW KF++
Sbjct: 434 SPQAYMSLMEHSKYCLHVRGTRVMSPRLIELMLFGCVPVIVADAYELPLAWFLDWTKFSI 493
Query: 347 ILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS-PPVRFDAFHMVMYEL 401
+ E + + I + A + LH+NL +V F ++ P+ DAF+ L
Sbjct: 494 RVPESE---YENIHAYVEKANWRELHSNLGRVISFFVYHKDKPIIGDAFYATSLAL 546
>gi|410912512|ref|XP_003969733.1| PREDICTED: exostosin-2-like [Takifugu rubripes]
Length = 719
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 76/144 (52%), Gaps = 22/144 (15%)
Query: 254 AGHRNSKIRVILARVWENDTE-LDISNNRINRAIGPLVYQRRFYK------------TKF 300
A HR + R L R+ E + E L + + N + G ++R YK + F
Sbjct: 261 AIHR--EYRAELERLKEENGEALLLLDKCSNLSQGAASARKRCYKGQVYDYPQVLQDSSF 318
Query: 301 CICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQIL 360
C+ G+++ A ++D + GCVPVIL++ Y LPF+++LDW++ +V + E + ++ IL
Sbjct: 319 CVVLRGARLGQAALSDVLQAGCVPVILADSYILPFSEVLDWKRASVFIPEEKLSEMYGIL 378
Query: 361 KNISDAEFVSLHNNLVKVQKHFQW 384
K+I H + ++Q+ +W
Sbjct: 379 KSIP-------HRQVEEMQRQARW 395
>gi|255633862|gb|ACU17292.1| unknown [Glycine max]
Length = 141
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 37/109 (33%), Positives = 57/109 (52%), Gaps = 6/109 (5%)
Query: 297 KTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQL 356
+ FC+CP G S R+ +++ +GC+PVI+++ LPF D + W + V ++E+DV QL
Sbjct: 3 RAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVDEKDVPQL 62
Query: 357 KQILKNISDAEFVS----LHNNLVKVQKHFQWNSPPVRFDAFHMVMYEL 401
IL +I + L N +K F P DAFH V+ L
Sbjct: 63 DTILTSIPPEVILRKQRLLANPFMKQAMLF--PQPAQPGDAFHQVLNGL 109
>gi|219362519|ref|NP_001137073.1| uncharacterized protein LOC100217246 [Zea mays]
gi|194691480|gb|ACF79824.1| unknown [Zea mays]
gi|413919898|gb|AFW59830.1| hypothetical protein ZEAMMB73_270023 [Zea mays]
Length = 458
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 66/246 (26%), Positives = 109/246 (44%), Gaps = 26/246 (10%)
Query: 163 KDYVDSLINKYPYWNRTLGADHFFVTCHDVG---VRATEGVPFLIK------NAIRVVCS 213
++ VD + +P W R+ G DH FV V VRA L+ + S
Sbjct: 165 REVVDR-VTSHPAWRRSSGRDHIFVLTDPVAMWHVRAEIAPAILLVVDFGGWYKVDSKSS 223
Query: 214 PSYDVAFIPH------KDVALPQV-LQPFALPRGGRDVENRTVLGFWAG---HRNSKIRV 263
I H KDV +P L P L +D RT+L F HR +R
Sbjct: 224 SKNSSRVIQHTQVSLLKDVIVPYTHLLPTLLLSENKD--RRTLLYFKGAKHRHRGGLVRE 281
Query: 264 ILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCV 323
L + N+ ++ + N A G + ++FC+ P G S R+ D+I C+
Sbjct: 282 KLWDLLGNEPDVIMEEGFPN-ATGREQSIKGLRTSEFCLHPAGDTPTSCRLFDAIASLCI 340
Query: 324 PVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQI---LKNISDAEFVSLHNNLVKVQK 380
PVI+S+ +LPF I+D+ + ++ ++ + + K + L+NIS + NL +VQ
Sbjct: 341 PVIVSDEVELPFEGIIDYTEISIFVSVSNAMRPKWLTSYLRNISKQQKDEFRRNLARVQP 400
Query: 381 HFQWNS 386
F++++
Sbjct: 401 IFEYDT 406
>gi|289166872|gb|ADC84486.1| glycosyltransferase family 47A [Salix miyabeana]
Length = 215
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 43/124 (34%), Positives = 60/124 (48%), Gaps = 16/124 (12%)
Query: 243 DVENRTVLGFWAGHRNSKIRVILARVWEN--DTEL-DISNNRINRAIGPLVYQRRFYKTK 299
DV N G++A R A VWEN D L DIS P Y +
Sbjct: 83 DVGNDPEGGYYA-------RGARAAVWENFKDNPLFDISTEH------PATYYEDMQRAV 129
Query: 300 FCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQI 359
FC+CP G S R+ +++ +GC+PVI+++ LPF D + W V ++E DV L I
Sbjct: 130 FCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPFADAIPWEDIGVYVDEEDVPNLDTI 189
Query: 360 LKNI 363
L +I
Sbjct: 190 LTSI 193
>gi|307215037|gb|EFN89864.1| Exostosin-2 [Harpegnathos saltator]
Length = 594
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 63/108 (58%), Gaps = 10/108 (9%)
Query: 279 NNRINRAIGPLVYQRR--FYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFN 336
NN I R IG +Y+ + FC+ G+++ + + D++ GC+PVI+++ +PF+
Sbjct: 174 NNTI-RCIGEDLYKYPDILHSATFCLIIRGARLAQSVLLDAMAAGCIPVIIADSLIMPFH 232
Query: 337 DILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
D++DW K A+++ E D+ + Q+LK IS H +V++Q+ W
Sbjct: 233 DVIDWTKAAILVREVDILLIIQLLKKIS-------HQRIVEMQEQNAW 273
>gi|363742350|ref|XP_003642625.1| PREDICTED: exostosin-1b-like, partial [Gallus gallus]
Length = 456
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 35/120 (29%), Positives = 61/120 (50%), Gaps = 11/120 (9%)
Query: 269 WE--NDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVI 326
WE DT D N + YQ + + FCI P G ++ S R +++ C+PV+
Sbjct: 20 WERHKDTRCDQDNVDYEK----FDYQELLHNSTFCIVPRGRRLGSFRFLEALQAACIPVL 75
Query: 327 LSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
LS+ ++LPF++ +DW K AV+ +ER + Q+ ++ I ++ Q F W++
Sbjct: 76 LSDGWELPFSEAIDWGKAAVVGSERLLLQIPSAVRCIRPERVLAFQQ-----QTQFLWDA 130
>gi|307111637|gb|EFN59871.1| hypothetical protein CHLNCDRAFT_133704 [Chlorella variabilis]
Length = 833
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 74/322 (22%), Positives = 132/322 (40%), Gaps = 64/322 (19%)
Query: 113 YASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCH------------------KMRGKG 152
Y E + + +S T DP+ A F++P+ SC R G
Sbjct: 413 YQPETGIHEMLLQSEHRTLDPEEADYFYLPVYASCAIWPVLYSNDFPYFHGGPAAQRTHG 472
Query: 153 TSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAI---- 208
+ NM + V+ +V S +PYW+R G DH +T HD G + +P +++ AI
Sbjct: 473 AT--NMFMEVQSWVRS---HFPYWDRNGGRDHIVLTVHDEG---SCWLPAVLRPAIVMSH 524
Query: 209 --RVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVENRTVLGFWAGHRNSKIRVILA 266
R +P + D +V P P G + + LG + + SK+ IL
Sbjct: 525 WGRTDVNPPAGTGY--DADTYSNEVRHPVWQPEG-----HLSKLGEFPCYDPSKVTYILG 577
Query: 267 -RVWENDTELDISNNR----INRAIG-----------------PLVYQRRFYKTKFCICP 304
R+ + + I+ A G P Y ++ FC+
Sbjct: 578 GRIQPENARYSRGTRQFLANISEAEGWWDKYRIHVGAGSPPGGPGDYSECMARSVFCLAL 637
Query: 305 GGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNIS 364
G +S R D++ +GC+PVI+ + +L ++ +LD +++ + + D+ ++ QIL+ +
Sbjct: 638 MGDGYSS-RFDDAVLHGCIPVIVQDGIELTWHSLLDIPAYSLRVPQADMARIPQILQAVP 696
Query: 365 DAEFVSLHNNLVKVQKHFQWNS 386
+ + NL KV + W
Sbjct: 697 QEDIARMQANLAKVWRRHIWTG 718
>gi|302843170|ref|XP_002953127.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300261514|gb|EFJ45726.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 616
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 39/121 (32%), Positives = 57/121 (47%), Gaps = 8/121 (6%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y R +KFC+ P G ++ YGC+PV +++Y PF LDW F+V + E
Sbjct: 413 YVRDMSTSKFCLAPTGGGHGKRQVLVG-RYGCIPVPITDYVLQPFEPELDWPAFSVTVKE 471
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS-------PPVRFDAFHMVMYELWL 403
DV L IL I+D + + L KH ++S R+DAF +M L +
Sbjct: 472 EDVPNLHTILAAINDTKLAEMQRALACAAKHLWYSSMWGAIFGEDSRYDAFATLMEILRV 531
Query: 404 R 404
R
Sbjct: 532 R 532
>gi|431891237|gb|ELK02114.1| Exostosin-like 1 [Pteropus alecto]
Length = 674
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 73/321 (22%), Positives = 119/321 (37%), Gaps = 83/321 (25%)
Query: 87 TNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTH---DPDRAHLFFIPI 143
FK++ YP P +T R++ EG + +R + H PD P+
Sbjct: 92 AGFKVFAYPAAGP--ISETQRRILASI--EGSHYHTLRPAEACLHLLLSPDTPAGECSPV 147
Query: 144 SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFL 203
+ P WN G +H + H F
Sbjct: 148 A-----------------------------PQWNG--GKNHLVLRLHPASCSRN----FQ 172
Query: 204 IKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALP-RGGRD-----------------VE 245
+ A+ SP D F P DVALP L P A P RGG E
Sbjct: 173 LGQAMVAEASPPVDT-FRPGFDVALP--LLPEAHPFRGGAPGQLQQHSPHPGVALLALAE 229
Query: 246 NRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPG 305
R G+ NS A +W+ E D + + FC+ PG
Sbjct: 230 ERG--GWRTADTNSS-----ACLWDGHCEQDRGPEQSHPGA-------TLPNATFCLIPG 275
Query: 306 GSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISD 365
+ ++ R ++ GC+PV+LS ++LPF++++DW K A++ +ER +Q+ L+ +
Sbjct: 276 -RRPDALRFLQTLQAGCIPVLLSPRWELPFSEVIDWTKAAIVADERLPFQVLAALQEMPP 334
Query: 366 AEFVSLHNNLVKVQKHFQWNS 386
+ +++ Q F W++
Sbjct: 335 TRVL-----VLRQQAQFLWDT 350
>gi|194882629|ref|XP_001975413.1| GG22301 [Drosophila erecta]
gi|190658600|gb|EDV55813.1| GG22301 [Drosophila erecta]
Length = 717
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 72/317 (22%), Positives = 124/317 (39%), Gaps = 51/317 (16%)
Query: 89 FKIYIYPDGD--PNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCH 146
K+YIYP + Y+T L+ +Y + + +SR+ T +P+ A LF
Sbjct: 108 LKVYIYPLQEFVDEQSYKTATTLSSEYFQ---ILEAVLKSRYYTSNPNEACLFL------ 158
Query: 147 KMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKN 206
N V K + + +W+R GA+H G + V +
Sbjct: 159 ----PSLDLLNQNVFDKHLAGAALASLDFWDR--GANHIIFNMLPGGAPSYNPV-LDVNT 211
Query: 207 AIRVVCSPSYDV-AFIPHKDVALPQVLQPFALPRGGRDVENRTVLGFWAGHRNSKIRVIL 265
++ +D ++ P DVA+P V P + + R L A IL
Sbjct: 212 DNAIIFGGGFDSWSYRPGFDVAIP-VWSPRLVRQHAHATAQRKFLLVVAQLN------IL 264
Query: 266 ARVWENDTELDISNNRINRAIGP------------------LVYQRRFYKTKFCICPGGS 307
R E+ ++N+ +G L Y R + KFC+
Sbjct: 265 PRFLRTLREISLANSEQLLLLGACESLDLKMRCPLSQHHKSLEYPRLLSRGKFCLLGRSL 324
Query: 308 QVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAE 367
++ + + + C+PVI + Y LPF D++DW +V + E +++ + Q LK IS +
Sbjct: 325 RMGQPDLVEIMSQHCIPVIAVDNYVLPFEDVIDWSLASVRIRENELHSVMQKLKAISSVK 384
Query: 368 FVSLHNNLVKVQKHFQW 384
+V++QK QW
Sbjct: 385 -------IVEMQKQVQW 394
>gi|388496634|gb|AFK36383.1| unknown [Medicago truncatula]
Length = 316
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 58/208 (27%), Positives = 93/208 (44%), Gaps = 25/208 (12%)
Query: 86 VTNFKIYIY--PDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI 143
+ N KI++Y P + + PR T +ASE + + S T DP A FF+P+
Sbjct: 104 LKNLKIFVYDLPPKYNKNWLKNPRCKTHLFASEVAIHRALLTSDVRTFDPYEADFFFVPV 163
Query: 144 --SCHKMRGKG-TSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVG------- 193
SC+ G + + ++ V + +YP+WNR+ G+DH FV HD G
Sbjct: 164 YVSCNFSTVNGFPAIGHARSLISSAVKLISTEYPFWNRSTGSDHVFVASHDFGSCFHTLE 223
Query: 194 -VRATEGVPFLIKNAIRV-VCSPSYDVAFIPHKDVALPQVLQPFALPRGGRD-VEN---- 246
V +GVP + KN+I + +YD + V +P PF P R+ +EN
Sbjct: 224 DVAMKDGVPEITKNSIVLQTFGVTYDHPCQKVEHVVIP----PFVSPESVRNTLENFPVN 279
Query: 247 --RTVLGFWAGHRNSKIRVILARVWEND 272
R + F+ G + + R + N+
Sbjct: 280 GRRDIWVFFRGKMEVHPKNVSGRFYSNE 307
>gi|159483641|ref|XP_001699869.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158281811|gb|EDP07565.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 427
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 67/308 (21%), Positives = 116/308 (37%), Gaps = 54/308 (17%)
Query: 119 FFQNIRESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNR 178
F + I TH+P+ A F+IP S ++ +++ + + +P+WN
Sbjct: 125 FMERILSGGHRTHNPEEADFFYIPGSSRDLK--------KAFLLQPLLAYISTTWPFWNA 176
Query: 179 TLGADHFFVTCHDVGVRATEGVPFLIK----NAIRVVCSPSYDV-------------AFI 221
T GA H DVG T +P ++ N + YD +
Sbjct: 177 TGGARHIMPAEGDVG---TCELPLKVRLFTANVTWLQFWGMYDFHPHWTQIFHNRIPCMV 233
Query: 222 PHKDVALPQ---------VLQPFALPRGGRDVENRTVLGFWAG-------------HRNS 259
P +D+ +P V++ PR + NRT F+AG H
Sbjct: 234 PGRDIVVPFMAMSSHDRFVIETPLHPRNQK--RNRTNTFFFAGGVCGSGNKRALPPHCTY 291
Query: 260 KIRVILARVWENDTELDISNNRINRAI-GPLVYQRRFYKTKFCICPGGSQVNSARIADSI 318
+V + L N R + G Y R + ++FC+ G I ++
Sbjct: 292 YKQVRYSGGVRQAVYLHFHNRTGWRVVPGTDDYARDYASSRFCLAAAGGGWGKRGIVAAM 351
Query: 319 HYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKV 378
YGC+PV ++ F +DW +F V + + ++ QL L+ S+AE +
Sbjct: 352 -YGCIPVAATDMLYEAFEPEMDWGRFGVRITQAEIPQLADKLEAYSEAEVARMQERTACA 410
Query: 379 QKHFQWNS 386
+H W++
Sbjct: 411 AQHLHWST 418
>gi|195334813|ref|XP_002034071.1| GM20090 [Drosophila sechellia]
gi|194126041|gb|EDW48084.1| GM20090 [Drosophila sechellia]
Length = 717
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 74/317 (23%), Positives = 122/317 (38%), Gaps = 51/317 (16%)
Query: 89 FKIYIYPDGDPNTFYQTPRKLTGKYASEGYF--FQNIRESRFLTHDPDRAHLFFIPISCH 146
K+YIYP + F T S YF + + +SR+ T +P+ A LF
Sbjct: 108 LKVYIYPLQE---FVDEQSDKTATTLSSEYFQILEAVLKSRYYTSNPNEACLFL------ 158
Query: 147 KMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKN 206
N V K S + +W+R GA+H G + V +
Sbjct: 159 ----PSLDLLNQNVFDKHLAGSALASLDFWDR--GANHLIFNMLPGGAPSYNTV-LDVNT 211
Query: 207 AIRVVCSPSYDV-AFIPHKDVALPQVLQPFALPRGGRDVENRTVLGFWAGHRNSKIRVIL 265
++ +D ++ P DVA+P V P + + R L A IL
Sbjct: 212 DNAIIFGGGFDSWSYRPGFDVAIP-VWSPRLVRQHAHATAQRKFLLVVAQLN------IL 264
Query: 266 ARVWENDTELDISNNRINRAIGP------------------LVYQRRFYKTKFCICPGGS 307
R EL ++++ +G L Y R + KFC+
Sbjct: 265 PRFVRTLRELSLAHSEQLLLLGACENLDLTMRCPLSQHHKSLEYPRLLSRGKFCLLGRSL 324
Query: 308 QVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAE 367
++ + + + C+PVI + Y LPF D++DW +V + E +++ + Q LK IS +
Sbjct: 325 RMGQPDLVEIMSQHCIPVIAVDNYVLPFEDVIDWSLASVRVRENELHSVMQKLKAISSVK 384
Query: 368 FVSLHNNLVKVQKHFQW 384
+V++QK QW
Sbjct: 385 -------IVEMQKQVQW 394
>gi|195488335|ref|XP_002092270.1| GE14097 [Drosophila yakuba]
gi|194178371|gb|EDW91982.1| GE14097 [Drosophila yakuba]
Length = 717
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 72/317 (22%), Positives = 122/317 (38%), Gaps = 51/317 (16%)
Query: 89 FKIYIYPDGDPNTFYQTPRKLTGKYASEGYF--FQNIRESRFLTHDPDRAHLFFIPISCH 146
K+YIYP + F T S YF + + +SR+ T +P+ A LF
Sbjct: 108 LKVYIYPLQE---FVDEQSDKTATTLSSEYFQILEAVLKSRYYTSNPNEACLFL------ 158
Query: 147 KMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKN 206
N V K + + +W+R GA+H G + V +
Sbjct: 159 ----PSLDLLNQNVFDKHLAGAALASLDFWDR--GANHIIFNMLPGGAPSYNTV-LDVNT 211
Query: 207 AIRVVCSPSYDV-AFIPHKDVALPQVLQPFALPRGGRDVENRTVLGFWAGHRNSKIRVIL 265
++ +D ++ P DVA+P V P + + R L A IL
Sbjct: 212 DNAIIFGGGFDSWSYRPGFDVAIP-VWSPRLVRQHAHATAQRKFLLVVAQLN------IL 264
Query: 266 ARVWENDTELDISNNRINRAIGP------------------LVYQRRFYKTKFCICPGGS 307
R E+ ++N+ +G L Y R + KFC+
Sbjct: 265 PRFLRTLKEVSLANSEQLLLLGACESLDLTMRCPLSQHHKSLEYPRLLSRGKFCLLGRSL 324
Query: 308 QVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAE 367
++ + + + C+PVI + Y LPF D++DW ++ + E +++ + Q LK IS +
Sbjct: 325 RMGQPDLVEIMSQHCIPVIAVDNYVLPFEDVIDWSLASIRIRENELHSVMQKLKAISSVK 384
Query: 368 FVSLHNNLVKVQKHFQW 384
+V++QK QW
Sbjct: 385 -------IVEMQKQVQW 394
>gi|302842849|ref|XP_002952967.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300261678|gb|EFJ45889.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 728
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 87/334 (26%), Positives = 129/334 (38%), Gaps = 59/334 (17%)
Query: 119 FFQNIRESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVI-VKDYVDSLINKYPYWN 177
+Q + + D A F +PI ++R +Y++ VI Y+ S+ +PYWN
Sbjct: 259 IWQRLLSAGLRVADAASADFFLVPI---RVR---MAYDSDRVIQAVSYIRSV---WPYWN 309
Query: 178 RTLGAD-HFFVTCHDVGV-RATEGVPFLIKNAIRVV-CSPSYDVAFI-------PHKDVA 227
T G H FV D G +E L +NA + + D F P KDV
Sbjct: 310 ATRGGGRHIFVHTGDWGRDELSEDAQLLTRNATWLTHWGLARDHEFAGWRQSHRPGKDVV 369
Query: 228 LP-----QVLQPFALPRG------GRDVENRTVLGFWAGHRNSKIRVILARVWEN----- 271
LP +L + LPR G E T L F S+ L + N
Sbjct: 370 LPLMLAASLLSTYQLPRASPLHPAGPRPERTTTLFFAGRICGSRATPSLNGTYPNCPNVL 429
Query: 272 ---DTELDISNNRI-----NRAIGPLVYQRR-----FYKTKFCICP-GGSQVNSARIADS 317
D + R RA LV R KFC+ P GG Q + +A
Sbjct: 430 GSEDAYSAATRQRAYFYHHGRANWKLVTASRAPAAEMATAKFCLAPSGGGQGKRSVLAPL 489
Query: 318 IHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVK 377
+ GCVPV +++ PF L W +FAV + ERD+ + ++L + + L
Sbjct: 490 M--GCVPVPVTDGLMQPFEPELRWERFAVGVRERDLPVMHELLDRLMPEQVAGFQAELTC 547
Query: 378 VQKHFQWNS-------PPVRFDAFHMVMYELWLR 404
+H W+S +DAF ++ L +R
Sbjct: 548 AAQHLFWSSLYGSVFGEDGAYDAFETLVQVLRMR 581
>gi|56751461|ref|YP_172162.1| hypothetical protein syc1452_c [Synechococcus elongatus PCC 6301]
gi|56686420|dbj|BAD79642.1| hypothetical protein [Synechococcus elongatus PCC 6301]
Length = 788
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 28/93 (30%), Positives = 58/93 (62%)
Query: 290 VYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
VY +++F + P G + S R+ + + G +PVIL++ + LPF+++LDW +F++ +
Sbjct: 204 VYTDLIARSRFSVAPRGHDIFSYRLLEVMAGGAIPVILADDWVLPFSELLDWSEFSLSVA 263
Query: 350 ERDVYQLKQILKNISDAEFVSLHNNLVKVQKHF 382
E ++L Q+L+ IS ++ + +L +V +H+
Sbjct: 264 EDRCWELPQLLQAISTDQWQVMQQHLQQVYQHY 296
>gi|307176975|gb|EFN66281.1| Exostosin-2 [Camponotus floridanus]
Length = 563
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 40/143 (27%), Positives = 73/143 (51%), Gaps = 15/143 (10%)
Query: 250 LGFWAGHRNSKI-RVILARVWENDTELDISNNRINRA---IGPLVYQRRF----YKTKFC 301
L FW N I ++ V + +T +D+ R A + L Y+ F FC
Sbjct: 107 LPFWKKGENHLIFNMVPGSVPDYNTVIDVPVGRAMIAGAGMSSLTYRPGFDISLPSATFC 166
Query: 302 ICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILK 361
+ G+++ + + D++ GC+PVI+++ +PF+D++DW K AV + E D+ + Q+LK
Sbjct: 167 LIIRGARLAQSSLLDAMAAGCIPVIIADSLTMPFHDVIDWTKAAVFVREVDILLIIQLLK 226
Query: 362 NISDAEFVSLHNNLVKVQKHFQW 384
IS H ++++Q+ W
Sbjct: 227 KIS-------HQRIMEMQEQNAW 242
>gi|81298858|ref|YP_399066.1| TPR repeat-containing protein [Synechococcus elongatus PCC 7942]
gi|81167739|gb|ABB56079.1| TPR repeat [Synechococcus elongatus PCC 7942]
Length = 788
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 28/93 (30%), Positives = 58/93 (62%)
Query: 290 VYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
VY +++F + P G + S R+ + + G +PVIL++ + LPF+++LDW +F++ +
Sbjct: 204 VYTDLIARSRFSVAPRGHDIFSYRLLEVMAGGAIPVILADDWVLPFSELLDWSEFSLSVA 263
Query: 350 ERDVYQLKQILKNISDAEFVSLHNNLVKVQKHF 382
E ++L Q+L+ IS ++ + +L +V +H+
Sbjct: 264 EDRCWELPQLLQAISTDQWQVMQQHLQQVYQHY 296
>gi|159485716|ref|XP_001700890.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158281389|gb|EDP07144.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 704
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 79/342 (23%), Positives = 143/342 (41%), Gaps = 67/342 (19%)
Query: 113 YASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SC--HKMRG----------KGTSYENM 158
Y E + + S T DP+ A F++P +C + + G G +
Sbjct: 310 YGLEAALHEYLLISEHRTFDPEEADYFYVPFYGACMIYPVAGWADYPWFWTPGGPRVMQV 369
Query: 159 TVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAI------RVVC 212
++++ V+ + +YP+W R G DH ++ HD G P +IK+++ R+
Sbjct: 370 INMIREIVEWIDKQYPFWKRRGGRDHIWLFTHDEGACW---APSVIKDSVWLTHWGRLDP 426
Query: 213 SPSYDVAFI----------------------------PHKDVALPQVLQP---FALPRGG 241
+ + AF+ P KD+ +P P P
Sbjct: 427 EHTSNTAFVGDNYTHDMVNWRQPEGYIKYIKGHPCYDPQKDLVVPNFKSPPHYVRSPLQS 486
Query: 242 RDVENRTVLGFWAG----HRNSKI-RVILARVWENDTELDISNNRINR-AIGPLV---YQ 292
+ R + F+ G HR S R I ++++ E D +N + + G V Y
Sbjct: 487 TPSKPRDIFFFFKGDVGKHRLSHYSRGIRQKIYKMAMEQDWANTQKSLIGDGGNVHGDYS 546
Query: 293 RRFYKTKFC-ICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNER 351
++ FC + PG S R+ D++ +GC+PVI+++ F +LD FAV + E
Sbjct: 547 DLLSRSLFCLVAPGDGW--SPRLEDAVLHGCIPVIIADRVHAVFESVLDIDSFAVRVAEA 604
Query: 352 DVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS-PPVRFD 392
DV ++ IL+ +SD + + L +V +++ + P +R D
Sbjct: 605 DVPRVMDILRAVSDIKIRLKQSRLGQVWHRYRYGALPGLRSD 646
>gi|302829362|ref|XP_002946248.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300269063|gb|EFJ53243.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 656
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 83/360 (23%), Positives = 152/360 (42%), Gaps = 52/360 (14%)
Query: 88 NFKIYIY--PDG--DPNTFYQTPRKLTGKYAS----EGYFFQNIRESRFLTHDPDRAHLF 139
+ KIY+Y P P+ F + +Y S +G F ++ S T +P A+LF
Sbjct: 290 SLKIYMYDIPPNIVGPHQFEDGNGGIHPQYESFLRFQGLFLNDV--SGIRTENPHEANLF 347
Query: 140 FIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEG 199
+IP + ++ + T V+ + +P++NRT G DHF + D G +
Sbjct: 348 YIPAFTYY---SSSNLGDPTGAAVRAVNWVAATFPFFNRTGGRDHFVLLSGDRGACYLKT 404
Query: 200 VPFLIKNAIRV----------------VCSPSYDVAFIPHKDVALPQ-----------VL 232
+P +N IRV V + Y F +DV +P V
Sbjct: 405 LP-QTENLIRVTHFGYERPNITDMGPLVTNTEYG-CFKAGRDVVMPPYVKSNVAGIQGVR 462
Query: 233 QPFALPRGGRDV-ENRTVLGFWAG-------HRNSKIRVILARVWENDTELDISNNRINR 284
P G + + L F++G + +R LA + N + D+
Sbjct: 463 AKLEEPGGAEALLAGKDTLLFFSGDIRHNEPEYSGGVRQALALLLANTSYPDVVFKGGYM 522
Query: 285 AIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKF 344
+G Y+ ++KFC+ P G R+ +I + C+PVI+ + PF DIL + F
Sbjct: 523 MMGMGEYESLLRRSKFCLAPYGHGWG-IRLIHAITHACIPVIIQDKVRQPFEDILHYPDF 581
Query: 345 AVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLR 404
+V +++ ++ +L +IL+ + + + + + +V + F W P + A+++ + L R
Sbjct: 582 SVRVSKAELPRLVEILRAVPEPDLLRMIKENSRVYRAFLWQ-PELGGLAYNITIASLRRR 640
>gi|302845052|ref|XP_002954065.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300260564|gb|EFJ44782.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 600
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 88/389 (22%), Positives = 143/389 (36%), Gaps = 90/389 (23%)
Query: 81 NYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYA-------SEGYFFQNIRESRFLTHDP 133
NY+E T K+YIY + P +T Y F+Q + S T D
Sbjct: 155 NYVERKTGVKVYIY---------ELPSNMTSWYPFMRMDRPVHLMFWQRLMSSGMRTLDG 205
Query: 134 DRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINK-YPYWNRTLGADHFFVTCHDV 192
++A F+IPI+ R + E + ++ I K YP+W++ G H + D+
Sbjct: 206 NKADYFYIPIN---TRTGSLAREEL-----EWTLPYIKKTYPWWSKDNGNRHLIIHTGDM 257
Query: 193 GVRATEGVPFLIKNAIRVVCS----------------PSYDVAFIPHKDVALPQVL--QP 234
G+ P + + S + A P KD+ +P ++ Q
Sbjct: 258 GI---NDFPLATRRELNESLSNITWLTHWGLHEYHPIAKWYPAHRPGKDIVIPVMIMTQG 314
Query: 235 FAL----------------PR-------------GGRDVENRTVLGFWAGHRNSKIRVIL 265
F L PR G RD+ + GH + V
Sbjct: 315 FHLSPMNPRMEAEIKAQGAPRLRNGTLFFAGRICGDRDLPDPKTGKCGPGHEDYSFGVRQ 374
Query: 266 ARVWENDTELDISNNRINRAIG-PLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVP 324
A L N + R + Y KFC+ P G I + GC+P
Sbjct: 375 A------VYLQHRNVKGFRIVAWTSTYLEDISSHKFCLAPVGGGHGKRNILVAF-MGCLP 427
Query: 325 VILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
V++ ++ PF +DW +F++ + E D+ L +IL N+ +E S L +H +
Sbjct: 428 VLIGDHVLQPFEPEIDWSRFSISVPEADIPDLPRILANVPASEVASKQKRLRCAAQHMFY 487
Query: 385 NS-------PPVRFDAFHMVMYELWLRRH 406
+S R+DAF +M L +R+
Sbjct: 488 SSTLGAILGEDGRYDAFETLMEILRVRKE 516
>gi|413956860|gb|AFW89509.1| hypothetical protein ZEAMMB73_873038 [Zea mays]
Length = 783
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 78/330 (23%), Positives = 133/330 (40%), Gaps = 62/330 (18%)
Query: 113 YASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTVIVKDYV---- 166
Y ++ ++I S T + D A F++P+ SC R + + ++
Sbjct: 383 YGAQIALHESILASPHRTLNGDEADYFYVPVLDSCLITRSDDAPHLLLPRDLRRRSYHAL 442
Query: 167 -------DSLINKYPYWNRTLGADHFFVTCHDVGV----RATEGVPFLI---------KN 206
+ +YPYWNRT G DH + D G + L+ KN
Sbjct: 443 EYYRMAHGHIAQQYPYWNRTSGRDHIWFFSWDEGACYAPKEIWKSMMLVHWGNTNTKHKN 502
Query: 207 AIRVVCSPSYD----------VAFIPHKDVALPQVLQP----FALPRGGRDVENRTVLGF 252
+ + ++D F P KD+ LP +P L R NRT L +
Sbjct: 503 STTAYWADNWDDIPLDKRGNHPCFDPRKDLVLPAWKEPNPGAIWLKLWARPRNNRTTLFY 562
Query: 253 WAGHRNSK-------------IRVILARVWENDTELDISNNRINRAIGPLVYQR--RFYK 297
+ G+ S IR LA + + R + A + Y R ++Y+
Sbjct: 563 FNGNLGSAYEGGRPEDTYSMGIRQKLAAEFGSTPNKQGRLGRQHAADVTVTYLRTEKYYE 622
Query: 298 ----TKFC-ICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERD 352
+ FC + PG S R+ DS+ GC+PVI+ + LP+ ++L++ FAV + E D
Sbjct: 623 ELASSVFCGVLPGDGW--SGRMEDSMLQGCIPVIIQDGIFLPYENVLNYNSFAVRIQEDD 680
Query: 353 VYQLKQILKNISDAEFVSLHNNLVKVQKHF 382
+ L L+ I+D + + N+ ++ + F
Sbjct: 681 IPGLISTLRGINDTQVEFMLGNVRQMWQRF 710
>gi|302848153|ref|XP_002955609.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300259018|gb|EFJ43249.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 572
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 73/335 (21%), Positives = 129/335 (38%), Gaps = 63/335 (18%)
Query: 119 FFQNIRESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNR 178
+ I T DP++A F+IP S ++ +++ ++ +I +P WN+
Sbjct: 168 LMERILSGGHRTADPEKADFFYIPASARDLK--------RAFLLEPLLNYIIEAWPIWNQ 219
Query: 179 TLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVC----------SPSYDVAF-------I 221
T GA H DVG T +P I+N V P + F +
Sbjct: 220 TGGARHIMPAEGDVG---TCELPMKIRNMTANVTWLQFWGMYDFHPHWTQIFHNRVPCMV 276
Query: 222 PHKDVALPQ---------VLQPFALPRGGRDVENRTVLGFWAGH-RNSKIRVILARVWEN 271
P +D+ +P V++ PR + NRT F+AG S + L
Sbjct: 277 PGRDIVVPFMAMSSHDRFVIETPLHPRNQK--RNRTNTFFFAGGICGSGNKRALPPHCTY 334
Query: 272 DTELDISNNRINRAI--------------GPLVYQRRFYKTKFCICPGGSQVNSARIADS 317
++ S + +A+ G Y R + + FC+ G I +
Sbjct: 335 YKQVRYSGG-VRQAVYYHYHKRPGWRVVPGTDDYARDYASSIFCLAAAGGGWGKRGIVAT 393
Query: 318 IHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVK 377
+ YGC+PV ++ F +DW +F V +++ + QL +L+ + + +
Sbjct: 394 M-YGCIPVAATDMLYEAFEPEMDWNRFGVRVSQAQIPQLGDMLEAFTPEQIRQMQIRTAC 452
Query: 378 VQKHFQWNS-------PPVRFDAFHMVMYELWLRR 405
+H W++ FDAF+ +M L +RR
Sbjct: 453 AAQHLHWSTNLGGIMGETGEFDAFNTIMAILRMRR 487
>gi|345480513|ref|XP_001603840.2| PREDICTED: exostosin-1-like isoform 1 [Nasonia vitripennis]
gi|345480515|ref|XP_003424163.1| PREDICTED: exostosin-1-like isoform 2 [Nasonia vitripennis]
Length = 713
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 71/334 (21%), Positives = 132/334 (39%), Gaps = 71/334 (21%)
Query: 80 LNYLEMVTNFKIYIYPDGDP-NTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHL 138
++ + F +Y+YP D + YQ I ESR+ T DP RA L
Sbjct: 82 FDFTKCKEGFTVYVYPIEDTISPLYQK-------------ILNVITESRYYTSDPSRACL 128
Query: 139 FFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATE 198
F + + T + + + S + + PYWN G +H + +
Sbjct: 129 FVLALDTLDRDPLSTEFVHN-------LPSKVARLPYWNN--GKNHLIFNLY------SG 173
Query: 199 GVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPF--ALP--------RGGRD----- 243
P I++A+ Y A + +++ ++ + F ++P RGG
Sbjct: 174 TWPDYIEDAMAF----DYGYAMLAKASMSIMKLREDFDVSIPLFTKQHPERGGEPGLAIH 229
Query: 244 ---VENRTVLGFWAGHR-----NSKIRVILARVWENDTELDISNNRINRAIGPLV----- 290
+ L + G R S+ R L + + ++ R ++ L
Sbjct: 230 NHFPNKKKYLAAFKGKRYVHGIGSETRNALHHLHNGKDLIFVTTCRHGKSWRELQDEHCP 289
Query: 291 ----------YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILD 340
Y + FC+ P G ++ S R +++ GC+PVILSN + LPF+D +D
Sbjct: 290 QDIREYDMYDYDVLLLNSTFCLVPRGRRLGSFRFLEALRAGCIPVILSNGWALPFHDRID 349
Query: 341 WRKFAVILNERDVYQLKQILKNISDAEFVSLHNN 374
W + + +ER ++Q+ IL+++ + + + L
Sbjct: 350 WFQAVIYADERLLFQVPDILRSVVEEKIMVLRQT 383
>gi|308491504|ref|XP_003107943.1| hypothetical protein CRE_12808 [Caenorhabditis remanei]
gi|308249890|gb|EFO93842.1| hypothetical protein CRE_12808 [Caenorhabditis remanei]
Length = 847
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 70/312 (22%), Positives = 128/312 (41%), Gaps = 60/312 (19%)
Query: 90 KIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHKMR 149
K+YI+P F ++P+ + Y+ ++ +ES T+DP+ A LF + I
Sbjct: 15 KVYIHPM--EKRFEESPQSVI--YSK---ILKHFQESEHYTNDPNEACLFLLGIDTTDRD 67
Query: 150 GKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVT----------CHDVGVRATEG 199
+ +Y V +Y++SL WN G +H H++ E
Sbjct: 68 IRSLNY---VKNVNEYIESL--DQAVWNN--GRNHLIYNFYHGTFPDYDDHNLNFNTGEA 120
Query: 200 VPFLIKNAIRVVCSPSYDVAFIPHKDVALP--------QVLQPFALPRGGRDVE------ 245
+ + + S + F D++LP Q+ AL ++ +
Sbjct: 121 I---------IARASSSENNFFKDFDISLPLFHENHPYQIESQRALHNEPKEEKRRYLAS 171
Query: 246 ---NRTVLGFWAGHRN--------SKIRVILARVWENDTEL--DISNNRINRAIGPLVYQ 292
R V G +G RN I ++ ND ++ D R N Y
Sbjct: 172 FKGKRYVYGIGSGTRNLVHHLHNGDDIVMVTTCKHNNDWQVYQDDRCQRDNEEYDRWEYD 231
Query: 293 RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERD 352
+ FC+ P G ++ S R +++ GC+PV++S+ + LPF++ +DW A+++ ERD
Sbjct: 232 DLLSNSTFCLVPRGRRLGSFRFLETLRSGCIPVVISDSWILPFSETIDWHSAAIVVAERD 291
Query: 353 VYQLKQILKNIS 364
+ ++L + S
Sbjct: 292 ALSIPELLMSTS 303
>gi|115451051|ref|NP_001049126.1| Os03g0174300 [Oryza sativa Japonica Group]
gi|108706455|gb|ABF94250.1| exostosin family protein, putative, expressed [Oryza sativa
Japonica Group]
gi|113547597|dbj|BAF11040.1| Os03g0174300 [Oryza sativa Japonica Group]
gi|215695509|dbj|BAG90700.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 576
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 78/309 (25%), Positives = 129/309 (41%), Gaps = 62/309 (20%)
Query: 113 YASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENM--TVIVKDY--- 165
Y ++ +++I S T + D A F++P SC R + M + ++ Y
Sbjct: 258 YGAQIALYESILASPHRTLNGDEADYFYVPALDSCLITRSDDAPHLQMPRDLRLRSYHTL 317
Query: 166 ------VDSLINKYPYWNRTLGADHFFVTCHDVGV----RATEGVPFLI---------KN 206
D + +YPYWNRT G DH + D G + L+ KN
Sbjct: 318 EYYRMTYDHIAQRYPYWNRTSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWGNTNTKHKN 377
Query: 207 AIRVVCSPSYDV----------AFIPHKDVALPQVLQP----FALPRGGRDVENRTVLGF 252
+ + +++ F P KD+ LP QP L R NRT L +
Sbjct: 378 STTAYWADNWNYIPIDRRGNHPCFDPRKDLVLPAWKQPNPAAIWLKLWARTRNNRTTLFY 437
Query: 253 WAG----------HRNS---KIRVILARVWENDTELDISNNRINRAIGPLVYQR--RFYK 297
+ G H ++ IR LA + + + R + A + Y R ++Y+
Sbjct: 438 FNGNLGPAYKDGRHEDTYSMGIRQKLAAEFGSTPDKQGKLGRQHTANVTVTYLRTEKYYE 497
Query: 298 ----TKFC-ICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERD 352
+ FC + PG S R+ DS+ GC+PVI+ + LP+ ++L++ FAV + E D
Sbjct: 498 ELASSIFCGVLPGDGW--SGRMEDSMLQGCIPVIIQDGILLPYENMLNYNSFAVRIQEDD 555
Query: 353 VYQLKQILK 361
+ L +IL+
Sbjct: 556 IPNLIRILR 564
>gi|156378657|ref|XP_001631258.1| predicted protein [Nematostella vectensis]
gi|156218295|gb|EDO39195.1| predicted protein [Nematostella vectensis]
Length = 563
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/96 (28%), Positives = 57/96 (59%), Gaps = 5/96 (5%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + FC+ G+++ + DS+ GC+P+++S+ Y LPF+++LDW++ AV+++E
Sbjct: 230 YPHIMQRATFCLVIRGARLGQTALLDSLMMGCIPIVVSDDYILPFSEVLDWKRAAVVVSE 289
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
++ ++ ILK+ S + + ++Q F W +
Sbjct: 290 NEIDRIPLILKDYSQNQIKDM-----RLQGKFMWEN 320
>gi|24654051|ref|NP_725536.1| Ext2, isoform B [Drosophila melanogaster]
gi|78709115|ref|NP_725537.2| Ext2, isoform A [Drosophila melanogaster]
gi|61212929|sp|Q9Y169.1|EXT2_DROME RecName: Full=Exostosin-2; AltName: Full=Protein sister of
tout-velu
gi|5052486|gb|AAD38573.1|AF145598_1 BcDNA.GH02288 [Drosophila melanogaster]
gi|21645351|gb|AAF58055.2| Ext2, isoform A [Drosophila melanogaster]
gi|28317058|gb|AAO39548.1| RE05051p [Drosophila melanogaster]
gi|28380805|gb|AAO41379.1| Ext2, isoform B [Drosophila melanogaster]
gi|220943622|gb|ACL84354.1| Ext2-PA [synthetic construct]
Length = 717
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 73/317 (23%), Positives = 122/317 (38%), Gaps = 51/317 (16%)
Query: 89 FKIYIYPDGDPNTFYQTPRKLTGKYASEGYF--FQNIRESRFLTHDPDRAHLFFIPISCH 146
K+YIYP + F T S YF + + +SR+ T +P+ A LF
Sbjct: 108 LKVYIYPLQE---FVDEQSDKTATTLSSEYFQILEAVLKSRYYTSNPNEACLFL------ 158
Query: 147 KMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKN 206
N V K + + +W+R GA+H G + V +
Sbjct: 159 ----PSLDLLNQNVFDKHLAGAALASLDFWDR--GANHIIFNMLPGGAPSYNTV-LDVNT 211
Query: 207 AIRVVCSPSYDV-AFIPHKDVALPQVLQPFALPRGGRDVENRTVLGFWAGHRNSKIRVIL 265
++ +D ++ P DVA+P V P + + R L A IL
Sbjct: 212 DNAIIFGGGFDSWSYRPGFDVAIP-VWSPRLVRQHAHATAQRKFLLVVAQLN------IL 264
Query: 266 ARVWENDTELDISNNRINRAIGP------------------LVYQRRFYKTKFCICPGGS 307
R EL ++++ +G L Y R + KFC+
Sbjct: 265 PRFVRTLRELSLAHSEQLLLLGACENLDLTMRCPLSQHHKSLEYPRLLSRGKFCLLGRSL 324
Query: 308 QVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAE 367
++ + + + C+PVI + Y LPF D++DW +V + E +++ + Q LK IS +
Sbjct: 325 RMGQPDLVEIMSQHCIPVIAVDNYVLPFEDVIDWSLASVRIRENELHSVMQKLKAISSVK 384
Query: 368 FVSLHNNLVKVQKHFQW 384
+V++QK QW
Sbjct: 385 -------IVEMQKQVQW 394
>gi|255580236|ref|XP_002530948.1| catalytic, putative [Ricinus communis]
gi|223529463|gb|EEF31420.1| catalytic, putative [Ricinus communis]
Length = 512
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 65/124 (52%), Gaps = 9/124 (7%)
Query: 277 ISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFN 336
+ + IN+A + + +KFC+ G +S R+ D+I CVPVI+S+ +LP+
Sbjct: 341 VQKDGINKA------SQGMHTSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDDIELPYE 394
Query: 337 DILDWRKFAVILNERDVYQLK---QILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDA 393
D+LD+ +F + + D + K +++ I E+ + L +V++ F++ P DA
Sbjct: 395 DVLDYSQFCIFVRTSDAIKEKFLINLIRGIGKDEWTQMWQKLKEVERFFEFQYPSKEGDA 454
Query: 394 FHMV 397
M+
Sbjct: 455 VQMI 458
>gi|168033234|ref|XP_001769121.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162679650|gb|EDQ66095.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 765
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 77/325 (23%), Positives = 131/325 (40%), Gaps = 73/325 (22%)
Query: 113 YASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVK--------- 163
Y + ++++ S T + D A F++P + +G + ++T K
Sbjct: 367 YGAGIALYESLLASEHRTTNGDEADFFYVPFLQACIVEQGDAAPHLTFQGKYMGLRQYFA 426
Query: 164 -DYVDSLI----NKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRV-------- 210
DY + YPYWNR+ G DH + D G + P I N++ +
Sbjct: 427 GDYSKQIYFHIQQNYPYWNRSAGRDHIWFFPWDEGACS---APKEIWNSMMLSHWGNTNA 483
Query: 211 ---VCSPSYDV---------------AFIPHKDVALPQVLQPFALP----RGGRDVENRT 248
+ +Y + P KD+ LP P P R ++R
Sbjct: 484 KHKASTTAYRADNWDLIPPEWRGDHPCYDPAKDLVLPAWKFPDPYPIVQNLSSRHRQDRP 543
Query: 249 VLGFWAGHRNSK-------------IRVILARVWENDTELDISNNRINRAIGPLV----- 290
L ++ G+ S IR LA E ++ + +A+ +V
Sbjct: 544 TLFYFNGNLGSAYDNGRPEPGYSMGIRQKLAA--EFGSQPNKKGLLGRQAVDDVVVQAQR 601
Query: 291 ---YQRRFYKTKFC-ICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAV 346
Y+ K++FC + PG S R+ DSI GC+PVI+ + LPF ++LD+ F V
Sbjct: 602 SPQYKLELSKSRFCGVLPGDGW--SGRMEDSILSGCIPVIIQDGIHLPFENVLDYESFTV 659
Query: 347 ILNERDVYQLKQILKNISDAEFVSL 371
+ E +++ L ILK I++A+ S+
Sbjct: 660 RVAEDNIHNLITILKAINEAQVDSM 684
>gi|224129654|ref|XP_002320639.1| predicted protein [Populus trichocarpa]
gi|222861412|gb|EEE98954.1| predicted protein [Populus trichocarpa]
Length = 375
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 57/103 (55%), Gaps = 3/103 (2%)
Query: 298 TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ-- 355
+KFC+ G +S R+ D+I CVPVI+S+ +LPF D+LD+ +F + + D +
Sbjct: 249 SKFCLNIAGDTPSSNRLFDAIASHCVPVIISDDIELPFEDVLDYSEFCLFVRASDAVKKG 308
Query: 356 -LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMV 397
L +L+ I ++ L L ++ HF+++ P DA MV
Sbjct: 309 YLLDLLRGIEKDQWTKLWERLKEIAPHFEYSYPSQPGDAVDMV 351
>gi|195583832|ref|XP_002081720.1| GD25567 [Drosophila simulans]
gi|194193729|gb|EDX07305.1| GD25567 [Drosophila simulans]
Length = 717
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 73/317 (23%), Positives = 122/317 (38%), Gaps = 51/317 (16%)
Query: 89 FKIYIYPDGDPNTFYQTPRKLTGKYASEGYF--FQNIRESRFLTHDPDRAHLFFIPISCH 146
K+YIYP + F T S YF + + +SR+ T +P+ A LF
Sbjct: 108 LKVYIYPLQE---FVDEQSDKTATTLSSEYFQILEAVLKSRYYTSNPNEACLFL------ 158
Query: 147 KMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKN 206
N V K + + +W+R GA+H G + V +
Sbjct: 159 ----PSLDLLNQNVFDKHLAGAALASLDFWDR--GANHIIFNMLPGGAPSYNTV-LDVNT 211
Query: 207 AIRVVCSPSYDV-AFIPHKDVALPQVLQPFALPRGGRDVENRTVLGFWAGHRNSKIRVIL 265
++ +D ++ P DVA+P V P + + R L A IL
Sbjct: 212 DNAIIFGGGFDSWSYRPGFDVAIP-VWSPRLVRQHAHATAQRKFLLVVAQLN------IL 264
Query: 266 ARVWENDTELDISNNRINRAIGP------------------LVYQRRFYKTKFCICPGGS 307
R EL ++++ +G L Y R + KFC+
Sbjct: 265 PRFVRTLRELSLAHSEQLLLLGACENLDLTMRCPLSQHHKSLEYPRLLSRGKFCLLGRSL 324
Query: 308 QVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAE 367
++ + + + C+PVI + Y LPF D++DW +V + E +++ + Q LK IS +
Sbjct: 325 RMGQPDLVEIMSQHCIPVIAVDNYVLPFEDVIDWSLASVRIRENELHSVMQKLKAISSVK 384
Query: 368 FVSLHNNLVKVQKHFQW 384
+V++QK QW
Sbjct: 385 -------IVEMQKQVQW 394
>gi|159470791|ref|XP_001693540.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158283043|gb|EDP08794.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 510
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 86/376 (22%), Positives = 142/376 (37%), Gaps = 62/376 (16%)
Query: 81 NYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFF 140
Y ++YIY T ++ +L ++ + +F + + + DP A F+
Sbjct: 112 GYTPRAAGPRVYIYDLPPELTTWRNDDRLD-RWTTR-HFLEMLTATGARVGDPAAADWFY 169
Query: 141 IPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGV 200
+P+ ++R + +++ ++ + P++N T G DHF + D+G +E
Sbjct: 170 LPV---RLRSSSDGH-----VLRRALEYVQAAQPWFNATGGKDHFVLAVGDMGRLESERG 221
Query: 201 PF-----------LIKNAIRVVCSPSYDVAFIPHKDVALP-----QVLQPFALPRGGR-- 242
P L ++ + SP + + D+ LP + LQ F + G R
Sbjct: 222 PLSANVTFVSHWGLYRSKAEQLQSPHWRASHRNATDIVLPVYLTLRKLQKFGI-LGSRHH 280
Query: 243 ---------DVENRTVLGFW-AGHRNSKIRVILARVWEN-DTELDISN-----------N 280
DV R FW AG VW N + S N
Sbjct: 281 PKFATVAPPDVRERNGPLFWFAGRVCQDSSPPRTDVWPNCPKAMGYSAMTRQAVYFHHWN 340
Query: 281 RINRAI--GPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDI 338
R A+ G Y + KFC P G + R + GCVPV++ + +
Sbjct: 341 RTGFAVLRGDKQYAKHMLTAKFCFGPMGGG-HGQRQFQAALAGCVPVVIGDGVLEAWEPY 399
Query: 339 LDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS-------PPVRF 391
LDW F V + E D+ +L IL I E+ +L +H ++S RF
Sbjct: 400 LDWNDFGVRVAEADIPRLHTILGAIGPEEYARKVRSLRCAAQHMAFSSVTGAYMGESGRF 459
Query: 392 DAFHMVMYELWLR-RH 406
DAF ++ L R RH
Sbjct: 460 DAFETLLAVLAARARH 475
>gi|302809300|ref|XP_002986343.1| hypothetical protein SELMODRAFT_425371 [Selaginella moellendorffii]
gi|300145879|gb|EFJ12552.1| hypothetical protein SELMODRAFT_425371 [Selaginella moellendorffii]
Length = 421
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 31/100 (31%), Positives = 58/100 (58%), Gaps = 3/100 (3%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
+ +R + FC+ P G +SAR+ D+I GC+PVI+S+ + PF ++D+RK A+ +
Sbjct: 283 WSQRHTFSIFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELEPPFEGLVDYRKVALFVPS 342
Query: 351 RDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSP 387
+ L L+ I+ + L ++++ +HFQ++SP
Sbjct: 343 VKTTEKGWLVSYLRAITARQLSMLRGHMLEFSRHFQYSSP 382
>gi|302794690|ref|XP_002979109.1| hypothetical protein SELMODRAFT_418827 [Selaginella moellendorffii]
gi|300153427|gb|EFJ20066.1| hypothetical protein SELMODRAFT_418827 [Selaginella moellendorffii]
Length = 435
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 31/100 (31%), Positives = 59/100 (59%), Gaps = 3/100 (3%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
+ +R + FC+ P G +SAR+ D+I GC+PVI+S+ + PF ++D+RK A+ +
Sbjct: 283 WSQRHTFSIFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELEPPFEGLVDYRKVALFVPS 342
Query: 351 RDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSP 387
+ L L+ I+ + L +++++ +HFQ++SP
Sbjct: 343 VKTTEKGWLVSYLRAITARQLSMLRSHMLEFSRHFQYSSP 382
>gi|260786761|ref|XP_002588425.1| hypothetical protein BRAFLDRAFT_153069 [Branchiostoma floridae]
gi|229273587|gb|EEN44436.1| hypothetical protein BRAFLDRAFT_153069 [Branchiostoma floridae]
Length = 107
Score = 65.1 bits (157), Expect = 6e-08, Method: Composition-based stats.
Identities = 27/96 (28%), Positives = 55/96 (57%), Gaps = 5/96 (5%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ P G ++ S R +++ C+PVI +N ++LPF+++++W + + +E
Sbjct: 3 YNTLLHNSTFCLVPRGRRLGSFRFLEALQAACIPVIEANGWELPFSEVIEWDRATITADE 62
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
R ++QL IL+ I + ++L + Q F W +
Sbjct: 63 RLLFQLPSILRAIPPEKILAL-----RQQTQFLWET 93
>gi|395543711|ref|XP_003773757.1| PREDICTED: exostosin-2 [Sarcophilus harrisii]
Length = 917
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 53/94 (56%)
Query: 297 KTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQL 356
+ FCI G+++ A ++D + GCVPVI+++ Y LPF+++LDW++ +V++ E + +
Sbjct: 293 EASFCIVLRGARLGQAALSDVLQAGCVPVIIADSYILPFSEVLDWKRASVVIPEEKMRDM 352
Query: 357 KQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVR 390
+L++I + + + PP+R
Sbjct: 353 YSVLRSIPQRQIEEMQRQVRAGPSVGSLPEPPLR 386
>gi|241997492|ref|XP_002433395.1| exostosin-2, putative [Ixodes scapularis]
gi|215490818|gb|EEC00459.1| exostosin-2, putative [Ixodes scapularis]
Length = 714
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 50/78 (64%)
Query: 297 KTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQL 356
+ KFC+ +++ + ++D++ GCVPVI+++ Y LPF+++LDW++ A+ + E D+ L
Sbjct: 306 EGKFCLVVRAARLGQSVLSDALMAGCVPVIVADEYILPFSEVLDWKRAAIQIREDDLEDL 365
Query: 357 KQILKNISDAEFVSLHNN 374
+LK +S A + +
Sbjct: 366 VTVLKGVSKARLFEMRSQ 383
>gi|444732432|gb|ELW72727.1| Exostosin-1 [Tupaia chinensis]
Length = 436
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y+ + FC+ P G ++ S R +++ CVPV+LSN ++LPF+++++W + AVI +E
Sbjct: 54 YREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDE 113
Query: 351 RDVYQLKQ--ILKNIS 364
R + Q+ Q I K+IS
Sbjct: 114 RLLLQIIQDRIFKHIS 129
>gi|188528923|ref|NP_001120887.1| exostosin 2 [Xenopus (Silurana) tropicalis]
gi|183986340|gb|AAI66243.1| ext2 protein [Xenopus (Silurana) tropicalis]
Length = 718
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 57/94 (60%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + ++ FCI G+++ + ++D + GCVPVI+++ Y LPF+++LDW++ +V++ E
Sbjct: 308 YPQILQESTFCIVLRGARLGQSVLSDVLQAGCVPVIIADSYVLPFSEVLDWKRASVVIPE 367
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
++++ IL+ + L ++Q+ +W
Sbjct: 368 EKMFEMYSILQAVP-------QRQLEEMQRQARW 394
>gi|32488405|emb|CAE02830.1| OSJNBa0043A12.35 [Oryza sativa Japonica Group]
gi|90265244|emb|CAH67697.1| H0624F09.5 [Oryza sativa Indica Group]
Length = 464
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 65/247 (26%), Positives = 115/247 (46%), Gaps = 28/247 (11%)
Query: 163 KDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVC---------- 212
++ VD + +P W R+ G DH FV V A V I AI +V
Sbjct: 171 REVVDR-VTAHPAWRRSGGRDHVFVLTDPV---AMWHVRKEIAPAILLVVDFGGWYKLDS 226
Query: 213 -SPSYDVA-FIPHKDVAL-PQVLQPFA--LPRGG-RDVENRTVLGFWAG----HRNSKIR 262
S S +V+ I H V+L V+ P+ LP + ++R L ++ G HR +R
Sbjct: 227 NSASSNVSHMIQHTQVSLLKDVIVPYTHLLPTMHLSENKDRPTLLYFKGAKHRHRGGLVR 286
Query: 263 VILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGC 322
L + N+ ++ + N A G + ++FC+ P G S R+ D++ C
Sbjct: 287 EKLWDLMVNEPDVVMEEGYPN-ATGREQSIKGMRTSEFCLHPAGDTPTSCRLFDAVASLC 345
Query: 323 VPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQI---LKNISDAEFVSLHNNLVKVQ 379
+PVI+S+ +LPF ++D+ +FA+ ++ + + K + L+N+ + N+ VQ
Sbjct: 346 IPVIVSDEIELPFEGMIDYTEFAIFVSVNNSMRPKWLTNYLRNVPRQQKDEFRRNMAHVQ 405
Query: 380 KHFQWNS 386
F+++S
Sbjct: 406 PIFEYDS 412
>gi|260826381|ref|XP_002608144.1| hypothetical protein BRAFLDRAFT_91377 [Branchiostoma floridae]
gi|229293494|gb|EEN64154.1| hypothetical protein BRAFLDRAFT_91377 [Branchiostoma floridae]
Length = 595
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 67/278 (24%), Positives = 115/278 (41%), Gaps = 59/278 (21%)
Query: 120 FQNIRESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWN-- 177
I +S + T+DP +A LF I N ++ ++ + P WN
Sbjct: 48 LTTIADSEYNTNDPAKACLFVPSIDLL----------NQNILRLRETGQVLGRLPRWNHG 97
Query: 178 ----RTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDV-AFIPHKDVALPQVL 232
+ GA+H G+ + ++ + + P DV++P V
Sbjct: 98 HHDFQLQGANHLLFNMLP-GMEPDYNTALEVPRGKAILAGGGFSSWTYRPGYDVSIP-VF 155
Query: 233 QPFA----LPRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGP 288
PF LP G D GF G RN K ND +L S +R+ +
Sbjct: 156 NPFTADMELP-GKPD-------GF--GCRNCK----------NDYQLS-STSRLLIS--- 191
Query: 289 LVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVIL 348
+ FC+ G+++ + ++D++ GC+PVI + Y +PF+++LDW++ AVIL
Sbjct: 192 -------WTATFCMILRGARMGQSALSDAMMAGCIPVIAIDTYVMPFSEVLDWKRAAVIL 244
Query: 349 NERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
E D+ + +L+ IS ++ + Q F W S
Sbjct: 245 REEDLPDVHNVLRRISQERITNM-----RRQVEFFWRS 277
>gi|218195798|gb|EEC78225.1| hypothetical protein OsI_17865 [Oryza sativa Indica Group]
gi|222629749|gb|EEE61881.1| hypothetical protein OsJ_16573 [Oryza sativa Japonica Group]
Length = 452
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 65/247 (26%), Positives = 115/247 (46%), Gaps = 28/247 (11%)
Query: 163 KDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVC---------- 212
++ VD + +P W R+ G DH FV V A V I AI +V
Sbjct: 159 REVVDR-VTAHPAWRRSGGRDHVFVLTDPV---AMWHVRKEIAPAILLVVDFGGWYKLDS 214
Query: 213 -SPSYDVA-FIPHKDVAL-PQVLQPFA--LPRGG-RDVENRTVLGFWAG----HRNSKIR 262
S S +V+ I H V+L V+ P+ LP + ++R L ++ G HR +R
Sbjct: 215 NSASSNVSHMIQHTQVSLLKDVIVPYTHLLPTMHLSENKDRPTLLYFKGAKHRHRGGLVR 274
Query: 263 VILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGC 322
L + N+ ++ + N A G + ++FC+ P G S R+ D++ C
Sbjct: 275 EKLWDLMVNEPDVVMEEGYPN-ATGREQSIKGMRTSEFCLHPAGDTPTSCRLFDAVASLC 333
Query: 323 VPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQI---LKNISDAEFVSLHNNLVKVQ 379
+PVI+S+ +LPF ++D+ +FA+ ++ + + K + L+N+ + N+ VQ
Sbjct: 334 IPVIVSDEIELPFEGMIDYTEFAIFVSVNNSMRPKWLTNYLRNVPRQQKDEFRRNMAHVQ 393
Query: 380 KHFQWNS 386
F+++S
Sbjct: 394 PIFEYDS 400
>gi|148235505|ref|NP_001080448.1| exostosin 2 [Xenopus laevis]
gi|27881752|gb|AAH44703.1| Ext2-prov protein [Xenopus laevis]
Length = 718
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 56/94 (59%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + ++ FCI G+++ ++D + GCVPVI+++ Y LPF+++LDW++ +V++ E
Sbjct: 308 YPQILQESTFCIVLRGARLGQGLLSDVLQAGCVPVIIADSYVLPFSEVLDWKRASVVIPE 367
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
++++ IL+ I + ++Q+ +W
Sbjct: 368 EKMFEMYSILQGIP-------QRQVEEMQRQARW 394
>gi|387015840|gb|AFJ50039.1| Exostosin-2-like [Crotalus adamanteus]
Length = 718
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 57/94 (60%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FCI G+++ A ++D++ GC+PV++++ Y LPF+++LDW++ +V++ E
Sbjct: 308 YPQVLQEATFCIVLRGARLGQAVLSDALQAGCIPVVIADSYVLPFSEVLDWKRASVVIPE 367
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ +L IL++I + ++Q+ +W
Sbjct: 368 EKMPELYNILQSIP-------QRQIEEMQRQARW 394
>gi|260825668|ref|XP_002607788.1| hypothetical protein BRAFLDRAFT_64151 [Branchiostoma floridae]
gi|229293137|gb|EEN63798.1| hypothetical protein BRAFLDRAFT_64151 [Branchiostoma floridae]
Length = 733
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 71/293 (24%), Positives = 120/293 (40%), Gaps = 46/293 (15%)
Query: 120 FQNIRESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWN-- 177
I +S + T+DP +A LF I N ++ ++ + P WN
Sbjct: 143 LTTIADSEYNTNDPAKACLFVPSIDLL----------NQNILRLRETGQVLGRLPRWNHG 192
Query: 178 ----RTLGADHFFVTC-------HDVGVRATEGVPFLIKNAIRV-VCSPSYDVA---FIP 222
+ GA+H ++ + G L P YDV+ F P
Sbjct: 193 HHDFQLQGANHLLFNMLPGMEPDYNTALEVPRGKAILAGGGFSSWTYRPGYDVSIPVFNP 252
Query: 223 -HKDVALPQVLQPFALPRGGRDVENRTVL------GFWAGHRNSKIRVILARVWENDTEL 275
D+ LP +P R V +TV+ A R++ ++L R ND
Sbjct: 253 FTADMELPG--KPDGYSRPWLIVSAQTVIHMDFRAELDAVARDNDDVLVLDRC--NDLPE 308
Query: 276 DISNNRINRAIGPLVYQRR--FYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDL 333
I +R R VY + FC+ G+++ + ++D++ GC+PVI + Y +
Sbjct: 309 GIPAHR-RRCKEDRVYNYPDILQEATFCMILRGARMGQSALSDAMMAGCIPVIAIDTYVM 367
Query: 334 PFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
PF+++LDW++ AVIL E D+ + +L+ IS ++ + Q F W S
Sbjct: 368 PFSEVLDWKRAAVILREEDLPDVHNVLRRISQERITNM-----RRQVEFFWRS 415
>gi|359491711|ref|XP_002284930.2| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase isoform 1
[Vitis vinifera]
Length = 498
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 56/103 (54%), Gaps = 3/103 (2%)
Query: 298 TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ-- 355
+KFC+ G +S R+ D+I CVPVI+S+ +LPF D+LD+ +F + + D +
Sbjct: 365 SKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDEIELPFEDVLDYSEFCIFVRASDAVKNG 424
Query: 356 -LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMV 397
L +L+ I ++ + L ++ HF++ P DA M+
Sbjct: 425 FLLNLLRGIKREKWTKMWERLKEIAHHFEYQYPSQAGDAVDMI 467
>gi|357162592|ref|XP_003579459.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
[Brachypodium distachyon]
Length = 474
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 66/249 (26%), Positives = 112/249 (44%), Gaps = 34/249 (13%)
Query: 163 KDYVDSLINKYPYWNRTLGADHFFVTCHDVG---VRATEGVPFLIKNAIRVV-------- 211
++ VD + +P W R+ G DH FV + VRA I AI +V
Sbjct: 181 REVVDR-VTAHPAWRRSGGRDHVFVLTDPMAMWHVRAE------IAPAILLVVDFGGWYK 233
Query: 212 ----CSPSYDVAFIPHKDVAL-PQVLQPFA--LP--RGGRDVENRTVLGFWAG---HRNS 259
+ S I H V+L V+ P+ LP + +++ T+L F HR
Sbjct: 234 LDSKSAGSNSSHMIQHTQVSLLKDVIIPYTHLLPTLQLSENMDRPTLLYFKGAKHRHRGG 293
Query: 260 KIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIH 319
+R L V N+ + + N A G + ++FC+ P G +S R+ D++
Sbjct: 294 LVREKLWDVMINEPGVVMEEGFPN-ATGREQSIKGMRTSEFCLHPAGDTPSSCRLFDAVA 352
Query: 320 YGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQI---LKNISDAEFVSLHNNLV 376
C+PVI+S+ +LPF ++D+ +F++ ++ + + K + LK IS + NL
Sbjct: 353 SLCIPVIVSDDIELPFEGMIDYTEFSIFVSVGNAMRPKWLASYLKTISKQQKDEFRRNLA 412
Query: 377 KVQKHFQWN 385
KVQ F++
Sbjct: 413 KVQHIFEYE 421
>gi|443721868|gb|ELU10993.1| hypothetical protein CAPTEDRAFT_157657 [Capitella teleta]
Length = 712
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 82/343 (23%), Positives = 141/343 (41%), Gaps = 72/343 (20%)
Query: 69 SDVFHSP-SVFHLNYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASE-GYFFQNIRES 126
S FHS V+H Y + + IYIYP ++ LT ++ E Q + +S
Sbjct: 81 SCTFHSCFDVYHCGYNDD-SRISIYIYPVT--KYLDESGVPLTLGFSQEFSELLQTVADS 137
Query: 127 RFLTHDPDRAHLFFIPISC---HKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGAD 183
+ T +PDRA LF P+ + +R K ++ ++ WN G +
Sbjct: 138 VYYTSEPDRACLFLPPLDLLNQNNLRLKESA-------------QILTSLQSWNH--GTN 182
Query: 184 HF-----------FVTCHDV--GVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQ 230
H F T DV G G F + SY + DVA+P
Sbjct: 183 HLLFNMLPGNMPDFNTALDVHRGKAVLAGGGF---------STWSYRRTY----DVAIP- 228
Query: 231 VLQPFA----LPRGGRDVENRTVLGFWA--------------GHRNSKIRVILARVWEND 272
V P A LP G E RT L A +N +IL +
Sbjct: 229 VFSPLAAATDLPEGDSQ-EERTWLLMSAQVGLHVEYIEDLMKARQNQPRFIILQQCAGIY 287
Query: 273 TELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYD 332
D N A L Y + +++FC+ S++ ++D++ GCVPVI+++ +
Sbjct: 288 NMTDRCTQEGNTA---LEYPQVLQESEFCLVIRTSRLGQLTLSDAMKAGCVPVIVADSFI 344
Query: 333 LPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNL 375
LPF++++DW++ A+++ E ++ + ++++ IS + + +
Sbjct: 345 LPFSEVIDWKRAAIVVAEDNLSTVNEVVRAISRDSLLQMRRQV 387
>gi|356497977|ref|XP_003517832.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
[Glycine max]
Length = 509
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 58/103 (56%), Gaps = 3/103 (2%)
Query: 298 TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLK 357
+KFC+ G +S R+ D+I CVPVI+S+ +LP+ D++D+ +F + + D + K
Sbjct: 365 SKFCLNIAGDTPSSNRLFDAIASHCVPVIISDKIELPYEDVIDYSEFCIFVRTSDAIKEK 424
Query: 358 ---QILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMV 397
++ I+ E+ + N L +V+ F+++ P DA M+
Sbjct: 425 FLINFIRGIAKEEWTRMWNKLKEVEHFFEFHFPSKENDAVQMI 467
>gi|297733976|emb|CBI15223.3| unnamed protein product [Vitis vinifera]
Length = 455
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 56/103 (54%), Gaps = 3/103 (2%)
Query: 298 TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ-- 355
+KFC+ G +S R+ D+I CVPVI+S+ +LPF D+LD+ +F + + D +
Sbjct: 322 SKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDEIELPFEDVLDYSEFCIFVRASDAVKNG 381
Query: 356 -LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMV 397
L +L+ I ++ + L ++ HF++ P DA M+
Sbjct: 382 FLLNLLRGIKREKWTKMWERLKEIAHHFEYQYPSQAGDAVDMI 424
>gi|307110230|gb|EFN58466.1| hypothetical protein CHLNCDRAFT_140476 [Chlorella variabilis]
Length = 632
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 78/344 (22%), Positives = 134/344 (38%), Gaps = 80/344 (23%)
Query: 113 YASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SC--HKMRGKGTSYE------------ 156
YA E F + + +S T DP+ A F++P+ SC + +R S
Sbjct: 236 YALEAGFLEMLLQSEHRTLDPEEADFFYVPVFTSCFINPVRDGADSLRDFFYGVSHNRVQ 295
Query: 157 ---NMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRV--- 210
NM + +V ++ +PYW R G DH ++ HD A+ VP I++ +
Sbjct: 296 GAANMLLEAYHWVQAM---FPYWERRGGRDHIWLVTHD---EASCWVPAAIRSTSIILSH 349
Query: 211 ----------------------VCSPSYDV------------AFIPHKDVALPQVLQP-- 234
V P ++ + P KD+ +P + P
Sbjct: 350 WGRMDAHHTSGTGYSADVYSNDVTHPQFEPDGFLGKLNLTQPCYDPVKDLVVPLMKTPEH 409
Query: 235 FAL-PRGGRDVENRTVLGFWAGHRNSK-------IRVILARVWENDTELD---ISNNRIN 283
+ L P G RT L F G + IR LA+ L+ I+ +
Sbjct: 410 YRLSPLVGAPPRQRTWLAFHRGRVQADNPPYSRGIRQRLAKAAAEGGWLEKHKIAVGEYD 469
Query: 284 RAIGPLVYQRRFYKTKFC-ICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWR 342
G Y + FC + PG SAR+ D++ +GC+PV++ + + F ++D
Sbjct: 470 TLQGD--YSELLASSVFCPVIPGDGW--SARMDDAMLHGCIPVLIMDEVQVSFESVVDLS 525
Query: 343 KFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
F + + E D +L IL+ ++ + L +V + F ++S
Sbjct: 526 TFTIRIPEADAEKLPDILQAVTQERREEMQRALARVWQRFTYSS 569
>gi|47222207|emb|CAG11086.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1040
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 24/83 (28%), Positives = 49/83 (59%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
YQ + + FC+ P G ++ S R +S+ C+PV+LSN ++LPF+D++ W + + +E
Sbjct: 568 YQELLHNSTFCLVPRGRRLGSFRFLESLQAACIPVLLSNGWELPFSDVIQWNQAVIEGDE 627
Query: 351 RDVYQLKQILKNISDAEFVSLHN 373
R + Q+ ++ + + ++L
Sbjct: 628 RLLLQVPSTVRAVGNERVLALRQ 650
>gi|47203855|emb|CAG13860.1| unnamed protein product [Tetraodon nigroviridis]
Length = 72
Score = 64.3 bits (155), Expect = 1e-07, Method: Composition-based stats.
Identities = 26/69 (37%), Positives = 45/69 (65%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y+ + + FC+ P G ++ S R +++ CVPV+LSN ++LPF++I+DW AVI +E
Sbjct: 4 YKEMLHNSTFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEIIDWNTAAVIGDE 63
Query: 351 RDVYQLKQI 359
R + Q+ +
Sbjct: 64 RLLLQVNSL 72
>gi|341881773|gb|EGT37708.1| CBN-RIB-1 protein [Caenorhabditis brenneri]
Length = 383
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 73/326 (22%), Positives = 131/326 (40%), Gaps = 60/326 (18%)
Query: 90 KIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHKMR 149
K+YI+P F P+ + Y+ +FQ ES T++P+ A +F + I
Sbjct: 49 KVYIHPM--EKIFEVAPQ--SAIYSKILKYFQ---ESEHYTNNPNEACIFLLGIDT---T 98
Query: 150 GKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTC----------HDVGVRATEG 199
+ +N V +Y+DSL WN G +H H++G E
Sbjct: 99 DRDVRSQNYVKNVNEYIDSL--DPAVWNN--GRNHLIFNFYHGTFPDYDDHNLGFDTGEA 154
Query: 200 VPFLIKNAIRVVCSPSYDVAFIPHKDVALP-------QVLQPFALPRGGRDVENRT---- 248
+ + + S + F DV+LP ++P R+ R
Sbjct: 155 M---------IARASSSENNFFKGFDVSLPLFHENHPDDVEPKVKIDDQRNESRRKYLVS 205
Query: 249 ------VLGFWAGHRN--------SKIRVILARVWENDTEL--DISNNRINRAIGPLVYQ 292
V G +G RN + ++ ND ++ D R N Y+
Sbjct: 206 FKGKRYVYGIGSGTRNLVHHLHNGDDMVMVTTCKHNNDWQVYQDDRCQRDNDEYDQWDYE 265
Query: 293 RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERD 352
+ FC+ P G ++ S R +++ GC+PV++S+ + LPF + +DW A+++ ERD
Sbjct: 266 DLLTNSTFCLVPRGRRLGSFRFLETLRSGCIPVVISDSWILPFTETIDWSSAAIVVAERD 325
Query: 353 VYQLKQILKNISDAEFVSLHNNLVKV 378
+ ++L ++S + L ++ V
Sbjct: 326 ALSIPELLMSMSRRKVEKLRDSARDV 351
>gi|359359217|gb|AEV41121.1| putative exostosin family domain-containing protein [Oryza
officinalis]
Length = 460
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 63/247 (25%), Positives = 115/247 (46%), Gaps = 28/247 (11%)
Query: 163 KDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVC---------- 212
++ VD + +P W R+ G DH FV V A V I +I +V
Sbjct: 167 REVVDR-VTAHPAWRRSGGRDHVFVLTDPV---AMWHVRKEIAPSILLVVDFGGWYKLDS 222
Query: 213 -SPSYDVA-FIPHKDVAL-PQVLQPFA--LPRGG-RDVENRTVLGFWAG----HRNSKIR 262
S S +V+ I H V+L V+ P+ LP + ++R L ++ G HR +R
Sbjct: 223 NSASSNVSHMIQHTQVSLLKDVIVPYTHLLPTMQLSENKDRLTLLYFKGAKHRHRGGLVR 282
Query: 263 VILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGC 322
L + N+ ++ + N A G + ++FC+ P G S R+ D++ C
Sbjct: 283 EKLWDLMVNEPDVVMEEGYPN-ATGREQSIKGMRTSEFCLHPAGDTPTSCRLFDAVASLC 341
Query: 323 VPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQI---LKNISDAEFVSLHNNLVKVQ 379
+PVI+S+ +LPF ++D+ +F + ++ + + K + L+N+ + N+ +VQ
Sbjct: 342 IPVIVSDEIELPFEGMIDYTEFTIFVSVSNAMRPKWLTNYLRNVPRQQKDEFRRNMARVQ 401
Query: 380 KHFQWNS 386
F+++S
Sbjct: 402 PIFEYDS 408
>gi|289166870|gb|ADC84485.1| glycosyltransferase family 47A [Salix sachalinensis]
Length = 215
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 40/124 (32%), Positives = 58/124 (46%), Gaps = 16/124 (12%)
Query: 243 DVENRTVLGFWAGHRNSKIRVILARVWEN---DTELDISNNRINRAIGPLVYQRRFYKTK 299
DV N G++A R A VWEN + DIS + P Y +
Sbjct: 83 DVNNDPEGGYYA-------RGARAAVWENFKNNPLFDISTDH------PTTYYEDMQRAI 129
Query: 300 FCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQI 359
FC+CP G S R+ +++ +GC+ VI+++ LPF D + W + V + E DV L I
Sbjct: 130 FCLCPLGWAPWSPRLVEAVVFGCISVIIADDIVLPFADAIPWEEIGVFVAEEDVPNLDTI 189
Query: 360 LKNI 363
L I
Sbjct: 190 LTTI 193
>gi|5882750|gb|AAD55303.1|AC008263_34 F25A4.34 [Arabidopsis thaliana]
gi|12324821|gb|AAG52383.1|AC011765_35 unknown protein; 115857-117304 [Arabidopsis thaliana]
Length = 458
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 65/272 (23%), Positives = 119/272 (43%), Gaps = 19/272 (6%)
Query: 139 FFIPISCH---KMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVR 195
FF +S + K+RG TS ++ ++++ + + W R G DH V H +
Sbjct: 160 FFASLSYNRKSKLRGNETSSDDR--LLQERLVEFLKSQDEWKRFDGKDHLIVAHHPNSLL 217
Query: 196 ATEGVPFLIKNAIRVVCS-PSYDVAFIP-HKDVALPQVLQPFALPRG-GRDVENRTVLGF 252
+ +A+ V+ Y A KD+ P V + E R VL +
Sbjct: 218 YARN---FLGSAMFVLSDFGRYSSAIANLEKDIIAPYVHVVKTISNNESASFEKRPVLAY 274
Query: 253 WAG----HRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQ 308
+ G IR L + +++ ++ + + R G + +KFC+ G
Sbjct: 275 FQGAIYRKDGGTIRQELYNLLKDEKDVHFAFGTV-RGNGTKQTGKGMASSKFCLNIAGDT 333
Query: 309 VNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ---LKQILKNISD 365
+S R+ D+I CVPVI+S+ +LPF D LD+ F+V ++ + + L IL+ I++
Sbjct: 334 PSSNRLFDAIVSHCVPVIISDQIELPFEDTLDYSGFSVFVHASEAVKKEFLVNILRGITE 393
Query: 366 AEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMV 397
++ L +V F++ P D+ +M+
Sbjct: 394 DQWKKKWGRLKEVAGCFEYRFPSQVGDSVNMI 425
>gi|443682352|gb|ELT86989.1| hypothetical protein CAPTEDRAFT_143941, partial [Capitella teleta]
Length = 186
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 51/86 (59%), Gaps = 5/86 (5%)
Query: 301 CICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQIL 360
C+ P G ++ S R +++ C+PV LSN + LPF++++DW + A+ +ER + Q+ I+
Sbjct: 1 CLVPRGRRLGSYRFLEALQAACIPVFLSNNWVLPFSEVIDWNQAAIWGDERLLLQIPSIV 60
Query: 361 KNISDAEFVSLHNNLVKVQKHFQWNS 386
++I A+ ++L Q F W +
Sbjct: 61 RSIRHADLLALRQ-----QTQFLWET 81
>gi|354491358|ref|XP_003507822.1| PREDICTED: exostosin-2 [Cricetulus griseus]
Length = 718
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 55/94 (58%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL+NI + ++Q+ +W
Sbjct: 368 EKMSDMYSILQNIP-------QRQIEEMQRQARW 394
>gi|344257014|gb|EGW13118.1| Exostosin-2 [Cricetulus griseus]
Length = 701
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 55/94 (58%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 291 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 350
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL+NI + ++Q+ +W
Sbjct: 351 EKMSDMYSILQNIP-------QRQIEEMQRQARW 377
>gi|308491052|ref|XP_003107717.1| CRE-RIB-1 protein [Caenorhabditis remanei]
gi|308249664|gb|EFO93616.1| CRE-RIB-1 protein [Caenorhabditis remanei]
Length = 347
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 67/311 (21%), Positives = 125/311 (40%), Gaps = 59/311 (18%)
Query: 90 KIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHKMR 149
K+YI+P F ++P+ + Y+ ++ +E T+DP+ A LF + I
Sbjct: 15 KVYIHPM--EKRFEESPQSVI--YSK---ILKHFQEREHYTNDPNEACLFLLGIDTTDRD 67
Query: 150 GKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTC----------HDVGVRATEG 199
+ +Y V +Y++SL WN G +H H++ + E
Sbjct: 68 VRSLNY---VKNVNEYIESL--DQAVWNN--GRNHLIYNFYHGTFPDYDDHNLNFKTGEA 120
Query: 200 VPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVEN------------- 246
+ + + S + F D++LP + + + N
Sbjct: 121 I---------IARASSSENNFFKDFDISLPLFHENHPYQIESQRLHNEPKEEKRRYLASF 171
Query: 247 ---RTVLGFWAGHRN--------SKIRVILARVWENDTEL--DISNNRINRAIGPLVYQR 293
R V G +G RN I ++ ND ++ D R N Y
Sbjct: 172 KGKRYVYGIGSGTRNLVHHLHNGDDIVMVTTCKHNNDWQVYQDDRCQRDNEEYDRWEYDD 231
Query: 294 RFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDV 353
+ FC+ P G ++ S R +++ GC+PV++S+ + LPF++ +DW A+++ ERD
Sbjct: 232 LLSNSTFCLVPRGRRLGSFRFLETLRSGCIPVVISDSWILPFSETIDWHSAAIVVAERDA 291
Query: 354 YQLKQILKNIS 364
+ ++L + S
Sbjct: 292 LSIPELLMSTS 302
>gi|224128538|ref|XP_002320357.1| predicted protein [Populus trichocarpa]
gi|222861130|gb|EEE98672.1| predicted protein [Populus trichocarpa]
Length = 137
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 60/111 (54%), Gaps = 4/111 (3%)
Query: 298 TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ-- 355
+KFC+ G +S R+ D+I CVPVI+S+ +LPF D+LD+ +F V + D +
Sbjct: 4 SKFCLNIAGDTPSSNRLFDAIASQCVPVIISDGIELPFEDVLDYSEFGVFVRASDAVKKG 63
Query: 356 -LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
L +L+ I ++ L L ++ F++ P DA MV +E LR+
Sbjct: 64 YLLYLLRGIKKDQWTILWERLKEIAPQFEYRYPSQPGDAVDMV-WEAVLRK 113
>gi|224051022|ref|XP_002199808.1| PREDICTED: exostosin-2 [Taeniopygia guttata]
Length = 718
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 57/94 (60%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + ++ FC+ G+++ A ++D + GCVPVI+++ Y LPF+++LDW++ +V++ E
Sbjct: 308 YPQVLQESTFCVVLRGARLGQAVLSDVLQAGCVPVIIADSYILPFSEVLDWKRASVVIPE 367
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ ++ IL++I + ++Q+ +W
Sbjct: 368 EKMPEMYSILQSIP-------QRQIEEMQRQARW 394
>gi|388503508|gb|AFK39820.1| unknown [Lotus japonicus]
Length = 141
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 57/108 (52%), Gaps = 4/108 (3%)
Query: 297 KTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQL 356
+ FC+CP G S R+ +++ +GC+PVI+++ LPF D + W V ++E DV +L
Sbjct: 3 RAVFCLCPLGWAPWSPRLVEAVVFGCIPVIVADDIVLPFADAIPWEDIGVFVDEEDVPKL 62
Query: 357 KQILKNISDAEFVSLHNNLV---KVQKHFQWNSPPVRFDAFHMVMYEL 401
IL +I E + L+ +++ + P DAFH V+ L
Sbjct: 63 DTILTSIP-PEIILRKQRLLANPSMKQAMLFPQPAQPGDAFHQVLNGL 109
>gi|149695085|ref|XP_001504162.1| PREDICTED: exostosin-like 1 [Equus caballus]
Length = 675
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 69/310 (22%), Positives = 123/310 (39%), Gaps = 62/310 (20%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
K+++Y P + +T R++ +I SR+ T + A L +P++
Sbjct: 93 GLKVFVYSAAGPVS--ETHRRI----------LASIEGSRYHTSNAAEACLLLLPLTWDS 140
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNA 207
G+ + P WN G +H ++ H T F + A
Sbjct: 141 SAGECSPVP-----------------PQWNG--GRNHLVLSLHPAPCPRT----FQLGQA 177
Query: 208 IRVVCSPSYDVAFIPHKDVALPQVLQPFALP-RGGRDVENRT----------VLGFWAGH 256
+ SP+ D F DVALP L P A P RGG + R L G
Sbjct: 178 MVAEASPTVDT-FRHGFDVALP--LLPEAHPLRGGAPGQLRQHSPHPRVALLALAEERGG 234
Query: 257 RNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIAD 316
++ A W+ E D + + FC+ PG + +++
Sbjct: 235 WHTAGTNFSACPWDGRCEQDHGPEQTHPGA-------TLPNATFCLIPG-HRPDASHFLQ 286
Query: 317 SIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLV 376
++ GC+PV+LS ++LPF++++DW K A++ ++R Q+ L+ + ++L
Sbjct: 287 ALQAGCIPVLLSPRWELPFSEVIDWTKAAIVTDKRLPLQVLAALQQMPLTRVLAL----- 341
Query: 377 KVQKHFQWNS 386
+ Q F W++
Sbjct: 342 RQQTQFLWDA 351
>gi|345305619|ref|XP_001510258.2| PREDICTED: exostosin-2 [Ornithorhynchus anatinus]
Length = 615
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 56/94 (59%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ + ++D + GCVPVIL++ Y LPF+++LDW++ +V++ E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQSVLSDVLQAGCVPVILADSYILPFSEVLDWKRASVVIPE 367
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ ++ IL++I + ++Q+ +W
Sbjct: 368 EKMPEMYSILQSIP-------QRQIEEMQRQARW 394
>gi|18410670|ref|NP_565089.1| exostosin-like protein [Arabidopsis thaliana]
gi|16209709|gb|AAL14411.1| At1g74680/F1M20_36 [Arabidopsis thaliana]
gi|27363220|gb|AAO11529.1| At1g74680/F1M20_36 [Arabidopsis thaliana]
gi|332197500|gb|AEE35621.1| exostosin-like protein [Arabidopsis thaliana]
Length = 461
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 65/272 (23%), Positives = 119/272 (43%), Gaps = 19/272 (6%)
Query: 139 FFIPISCH---KMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVR 195
FF +S + K+RG TS ++ ++++ + + W R G DH V H +
Sbjct: 163 FFASLSYNRKSKLRGNETSSDDR--LLQERLVEFLKSQDEWKRFDGKDHLIVAHHPNSLL 220
Query: 196 ATEGVPFLIKNAIRVVCS-PSYDVAFIP-HKDVALPQVLQPFALPRG-GRDVENRTVLGF 252
+ +A+ V+ Y A KD+ P V + E R VL +
Sbjct: 221 YARN---FLGSAMFVLSDFGRYSSAIANLEKDIIAPYVHVVKTISNNESASFEKRPVLAY 277
Query: 253 WAG----HRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQ 308
+ G IR L + +++ ++ + + R G + +KFC+ G
Sbjct: 278 FQGAIYRKDGGTIRQELYNLLKDEKDVHFAFGTV-RGNGTKQTGKGMASSKFCLNIAGDT 336
Query: 309 VNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ---LKQILKNISD 365
+S R+ D+I CVPVI+S+ +LPF D LD+ F+V ++ + + L IL+ I++
Sbjct: 337 PSSNRLFDAIVSHCVPVIISDQIELPFEDTLDYSGFSVFVHASEAVKKEFLVNILRGITE 396
Query: 366 AEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMV 397
++ L +V F++ P D+ +M+
Sbjct: 397 DQWKKKWGRLKEVAGCFEYRFPSQVGDSVNMI 428
>gi|356503460|ref|XP_003520526.1| PREDICTED: uncharacterized protein LOC100775594 [Glycine max]
Length = 761
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 80/337 (23%), Positives = 139/337 (41%), Gaps = 77/337 (22%)
Query: 113 YASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SC------HKMRGKGTSYENMTVIV-- 162
Y ++ +++I S T + D A FF+P+ SC H ++E + +
Sbjct: 360 YGAQIALYESILASPHRTLNGDEADFFFVPVLDSCLIDRADHAPHLSTQNHEGLRSFLTL 419
Query: 163 ---KDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVV-------- 211
K+ + ++ +YPYWN + G DH + D G P I +++ +V
Sbjct: 420 DFYKNAYNHIVEQYPYWNCSSGRDHIWFFSWDEGACY---APKEIWSSMMLVHWGNTNTK 476
Query: 212 --------CSPSYDV----------AFIPHKDVALP-------QVLQP--FALPRGGRDV 244
C ++D F P KD+ +P VL +A P +
Sbjct: 477 HYHSTTAYCPDNWDGIPSDRRGFHPCFDPEKDLVIPAWKVTHVHVLSSKLWAWP-----L 531
Query: 245 ENRTVLGFWAGH--------RNS----KIRVILARVWEN--DTELDISNNRINRAIGPLV 290
E R L ++ G+ RN IR LA + + + E + R +
Sbjct: 532 EKRKTLFYFNGNLGPAYPYGRNEWYSMGIRQKLAEEFGSKPNKEGKLGKQRAKDVVVTAE 591
Query: 291 ----YQRRFYKTKFC-ICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFA 345
Y+ + FC + PG S R+ DS+ GC+PVI+ + LP+ ++L++ FA
Sbjct: 592 RSENYEVELASSVFCGVLPGDGW--SGRMEDSVLQGCIPVIIQDGIFLPYENVLNYDSFA 649
Query: 346 VILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHF 382
V + E ++ L +IL+ I+D E N+ K+ + F
Sbjct: 650 VRIPEDEIPNLIKILRGINDTEIKFKLANVQKIWQRF 686
>gi|359359170|gb|AEV41075.1| putative exostosin family domain-containing protein [Oryza minuta]
Length = 317
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 63/247 (25%), Positives = 115/247 (46%), Gaps = 28/247 (11%)
Query: 163 KDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVC---------- 212
++ VD + +P W R+ G DH FV V A V I +I +V
Sbjct: 24 REVVDR-VTAHPAWRRSGGRDHVFVLTDPV---AMWHVRKEIAPSILLVVDFGGWYKLDS 79
Query: 213 -SPSYDVA-FIPHKDVAL-PQVLQPFA--LPRGG-RDVENRTVLGFWAG----HRNSKIR 262
S S +V+ I H V+L V+ P+ LP + ++R L ++ G HR +R
Sbjct: 80 NSASSNVSHMIQHTQVSLLKDVIVPYTHLLPTMQLSENKDRLTLLYFKGAKHRHRGGLVR 139
Query: 263 VILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGC 322
L + N+ ++ + N A G + ++FC+ P G S R+ D++ C
Sbjct: 140 EKLWDLMVNEPDVVMEEGYPN-ATGREQSIKGMRTSEFCLHPAGDTPTSCRLFDAVASLC 198
Query: 323 VPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQI---LKNISDAEFVSLHNNLVKVQ 379
+PVI+S+ +LPF ++D+ +F + ++ + + K + L+N+ + N+ +VQ
Sbjct: 199 IPVIVSDEIELPFEGMIDYTEFTIFVSVSNAMRPKWLTNYLRNVPRQQKDEFRRNMARVQ 258
Query: 380 KHFQWNS 386
F+++S
Sbjct: 259 PIFEYDS 265
>gi|348558689|ref|XP_003465149.1| PREDICTED: exostosin-2-like isoform 2 [Cavia porcellus]
Length = 669
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 55/94 (58%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPVI+++ Y LPF+++LDW++ +V++ E
Sbjct: 307 YPQVLQEATFCVVLRGARLGQAALSDVLQAGCVPVIIADSYILPFSEVLDWKRASVVVPE 366
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL++I + ++Q+ +W
Sbjct: 367 EKIADVYSILQSIP-------RRQMEEMQRQARW 393
>gi|410931012|ref|XP_003978890.1| PREDICTED: exostosin-1c-like, partial [Takifugu rubripes]
Length = 424
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 24/83 (28%), Positives = 49/83 (59%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
YQ + + FC+ P G ++ S R +S+ C+PV+LSN ++LPF+D++ W + + +E
Sbjct: 3 YQELLHNSTFCLVPRGRRLGSFRFLESLQAACIPVLLSNGWELPFSDVIQWNQAVIEGDE 62
Query: 351 RDVYQLKQILKNISDAEFVSLHN 373
R + Q+ ++ + + ++L
Sbjct: 63 RLLLQVPSTVRAVGNDRVLALRQ 85
>gi|223975397|gb|ACN31886.1| unknown [Zea mays]
Length = 134
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 64/114 (56%), Gaps = 6/114 (5%)
Query: 298 TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ-- 355
+KFC+ P G +S R+ D+I CVPVI+S+ +LPF D +D+ +F++ + + +
Sbjct: 4 SKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSSRIELPFEDEIDYSEFSLFFSVEEALRPD 63
Query: 356 -LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRHVI 408
L L+ + ++V + L V H+++ PP + DA +M+ ++ RH I
Sbjct: 64 YLLNELRQVPKRKWVDMWLKLKNVSHHYEFQYPPRKGDAVNMIWRQV---RHKI 114
>gi|426221913|ref|XP_004005150.1| PREDICTED: LOW QUALITY PROTEIN: exostosin-like 1 [Ovis aries]
Length = 674
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 74/310 (23%), Positives = 124/310 (40%), Gaps = 64/310 (20%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
+ K+++Y P +TPRK+ +I SR+ T A L + +S
Sbjct: 90 SLKVFVYSAVGP--ISETPRKV----------LTSIEGSRYHTASAAEACLLVLVLSPDT 137
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNA 207
G+ P W+ G +H ++ H F + A
Sbjct: 138 PAGE-----------------CHPALPQWDG--GKNHLVLSLHPTPCLRI----FQLGQA 174
Query: 208 IRVVCSPSYDVAFIPHKDVALPQVLQPFALPRG--GRDVENRTVLGFW--------AGHR 257
+ SP+ D F P DVALP L P A P G G+ ++ + G G R
Sbjct: 175 MVAEASPTVDT-FRPGFDVALP--LLPEAHPGGALGQLQQHSPLPGVALIAVAEERGGWR 231
Query: 258 NSKIRVILARVWENDTELDISNNRIN-RAIGPLVYQRRFYKTKFCICPGGSQVNSARIAD 316
++ A W+ E D + + RA P FC+ PG S ++
Sbjct: 232 LARTN-FSACPWDGRCEQDHGPKQTHPRATLP--------NATFCLIPGHSP-DALHFLQ 281
Query: 317 SIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLV 376
++ GC+PV+LS ++LPF++++DW K A++ +ER Q+ L+ + ++L
Sbjct: 282 ALQAGCIPVLLSPRWELPFSEVIDWTKAAIVADERLPLQVLAALQEMPLTRVLALRQ--- 338
Query: 377 KVQKHFQWNS 386
Q F W++
Sbjct: 339 --QAQFLWDA 346
>gi|405121937|gb|AFR96705.1| hypothetical protein CNAG_03480 [Cryptococcus neoformans var. grubii
H99]
Length = 1125
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 49/176 (27%), Positives = 79/176 (44%), Gaps = 15/176 (8%)
Query: 220 FIPHKDVALPQVLQPFALPRGG-------RDVENRTVLGFWAG-----HRNSKIRVILAR 267
+ PH+DV +P RG + + R+ L WAG ++ +IR+ R
Sbjct: 913 YRPHQDVVVPARTCGTNTVRGTFPNVGSIKPMSERSNLLMWAGTHWVTGKSERIRLTCDR 972
Query: 268 VWENDTEL-DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVI 326
D EL + N A G Y +FC P G S + D+I+ GC+PV
Sbjct: 973 GGAGDRELIKGGGKQSNFANGD--YINDLNNARFCPQPRGITGWSPQTNDAIYAGCIPVF 1030
Query: 327 LSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHF 382
++ PF LDW K +V + ++ ++++IL I ++ L NLV V++ F
Sbjct: 1031 IAEGTHYPFAGFLDWSKLSVRVAPTELDKIEKILAAIPLSKVEELQANLVSVREAF 1086
>gi|147846684|emb|CAN80640.1| hypothetical protein VITISV_016911 [Vitis vinifera]
Length = 1363
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 73/316 (23%), Positives = 124/316 (39%), Gaps = 68/316 (21%)
Query: 113 YASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTV---------- 160
Y ++ +++I S T D + A FF+P+ SC +R + NM
Sbjct: 409 YGAQMAIYESILASPHRTLDGEEADFFFVPVLDSCIIVRADDAPHLNMHAHGGLRSSLTL 468
Query: 161 -IVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVV-------- 211
K D ++ +YP+WNR+ G DH + D G P I +++ +V
Sbjct: 469 EFYKTAYDHIVEQYPFWNRSSGRDHIWFFSWDEGACY---APKEIWDSMMLVHWGNTNSK 525
Query: 212 --------CSPSYDV----------AFIPHKDVALPQVLQPFALPRGG----RDVENRTV 249
+ ++D F P+KD+ LP +P + R E R
Sbjct: 526 HNHSTTAYWADNWDSVSSDRRGNHPCFDPYKDLVLPAWKRPDVVSLSSKLWSRPREQRKT 585
Query: 250 LGFWAGH----------RNSKIRVILARVWENDTELDISNNRINRA------IGPLV--- 290
L ++ G+ + I +V E ++ + + PL
Sbjct: 586 LFYFNGNLGPAYEGGRPETTYSMGIRQKVAEEFGSSPNKEGKLGKQHAEDVIVTPLRSGN 645
Query: 291 YQRRFYKTKFC-ICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
Y + FC + PG S R DSI GC+PV++ + LPF ++L++ FAV +
Sbjct: 646 YHESLASSVFCGVMPGDGW--SGRFEDSILQGCIPVVIQDGIFLPFENMLNYESFAVRIR 703
Query: 350 ERDVYQLKQILKNISD 365
E ++ L +IL+ D
Sbjct: 704 EDEIPNLIKILRLSGD 719
>gi|348558687|ref|XP_003465148.1| PREDICTED: exostosin-2-like isoform 1 [Cavia porcellus]
Length = 717
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 55/94 (58%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPVI+++ Y LPF+++LDW++ +V++ E
Sbjct: 307 YPQVLQEATFCVVLRGARLGQAALSDVLQAGCVPVIIADSYILPFSEVLDWKRASVVVPE 366
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL++I + ++Q+ +W
Sbjct: 367 EKIADVYSILQSIP-------RRQMEEMQRQARW 393
>gi|322790909|gb|EFZ15575.1| hypothetical protein SINV_11283 [Solenopsis invicta]
Length = 676
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 59/108 (54%), Gaps = 10/108 (9%)
Query: 279 NNRINRAIGPLVYQ--RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFN 336
NN I R IG +Y+ FC+ G+++ + + D++ GC+P I+++ +PF+
Sbjct: 256 NNTI-RCIGKDIYKYPDSLQTATFCLIIRGARLAQSVLLDAMAAGCIPAIIADSLMMPFH 314
Query: 337 DILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
D++DW K AV + E D+ Q+LK IS H ++++Q+ W
Sbjct: 315 DVIDWTKAAVFIREVDILLTIQLLKKIS-------HQRIMEMQEQNAW 355
>gi|389744786|gb|EIM85968.1| hypothetical protein STEHIDRAFT_168993 [Stereum hirsutum FP-91666
SS1]
Length = 1111
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 50/184 (27%), Positives = 82/184 (44%), Gaps = 28/184 (15%)
Query: 201 PFLIKNAIRVVCSPSYDVAFIPHKDVALP-------QVLQPFALPRGGRDVENRTVLG-- 251
PF+ V + + PH+DV +P + + F R +R VL
Sbjct: 862 PFVRSTTAWSVMGDLHSACYRPHQDVIIPPRTCLSPSLFKSFPTVADVRPARDRRVLVAF 921
Query: 252 ---FWAGHRNSKIRVILARV-WENDTELDISNNRINRAIGP----LV-------YQRRFY 296
W ++ R++ R W++D D ++ R++ A GP LV Y
Sbjct: 922 NGVLWGTGALNRNRLVCPRSHWDSD---DNASRRLH-ASGPNLKSLVGTNGDYEYMSLLN 977
Query: 297 KTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQL 356
T FC P G+ + R+ DS++ GC+PV++ PF D+LDW K ++ + D+ QL
Sbjct: 978 DTVFCPQPAGTTGWATRLVDSMYAGCIPVLIGQASHFPFYDMLDWGKISIRVEPSDLAQL 1037
Query: 357 KQIL 360
+ IL
Sbjct: 1038 EDIL 1041
>gi|255585230|ref|XP_002533317.1| catalytic, putative [Ricinus communis]
gi|223526861|gb|EEF29074.1| catalytic, putative [Ricinus communis]
Length = 478
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 80/321 (24%), Positives = 134/321 (41%), Gaps = 50/321 (15%)
Query: 109 LTGKYASEGYFF------QNIRESRFLTH--DPDRAHLFFIPI------SCHKMRGKGT- 153
L +Y++E + +N+R F D ++A + F+P RG+GT
Sbjct: 113 LIKQYSAEYWIMGDLMTPENLRSQSFAKRVFDFNQADVVFVPFFATLSAEMELARGEGTF 172
Query: 154 --SYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVG---VRATEGVPFLIKNAI 208
N + V + W R+ G DH FV V VRA I A+
Sbjct: 173 RKKEGNEDYKRQKEVIEFVKSSDAWKRSGGKDHVFVLTDPVAMWHVRAE------IAPAV 226
Query: 209 RVVC---------SPSYD---VAFIPHKDVAL-PQVLQPFA--LPRGG-RDVENRTVLGF 252
+V S S D I H V+L V+ P+ LP+ + + R L +
Sbjct: 227 LLVVDFGGWYRLDSKSSDGNSSNIIRHTQVSLLKDVIVPYTHLLPQLPLSENKKRQTLLY 286
Query: 253 WAG----HRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQ 308
+ G HR +R L + N+ + + N A G + ++FC+ P G
Sbjct: 287 FKGAKYRHRGGMVREKLWDLLVNEPGVIMEEGFPN-ATGREQSIKGMRTSEFCLHPAGDT 345
Query: 309 VNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ---LKQILKNISD 365
S R+ D+I C+P+I+S+ +LPF I+D+ +F+V + D + L LK+IS
Sbjct: 346 PTSCRLFDAIQSLCIPIIVSDNIELPFEGIVDYLEFSVFMAVDDALKPNWLVDHLKSISK 405
Query: 366 AEFVSLHNNLVKVQKHFQWNS 386
+ + +VQ F++++
Sbjct: 406 KQRDEFRQKMAEVQSIFEYDN 426
>gi|58269252|ref|XP_571782.1| hypothetical protein [Cryptococcus neoformans var. neoformans JEC21]
gi|57228018|gb|AAW44475.1| conserved hypothetical protein [Cryptococcus neoformans var.
neoformans JEC21]
Length = 1132
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 48/176 (27%), Positives = 80/176 (45%), Gaps = 15/176 (8%)
Query: 220 FIPHKDVALPQ-------VLQPFALPRGGRDVENRTVLGFWAG-----HRNSKIRVILAR 267
+ PH+DV +P + F+ + + R L W+G ++ +IR+ R
Sbjct: 920 YRPHQDVVVPARTCHTNTLRATFSNVGSIKPMRERLNLLTWSGTYEVAGKSERIRLTCGR 979
Query: 268 VWENDTEL-DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVI 326
D EL + N A G Y +FC P G S + D+I+ GC+PV
Sbjct: 980 GGAGDRELIKGGGKQSNFANGD--YMNDLNNARFCPQPRGITGWSPQTNDAIYAGCIPVF 1037
Query: 327 LSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHF 382
+S PF D LDW K +V + ++ ++++IL I ++ L NLV +++ F
Sbjct: 1038 ISEGTHYPFADFLDWSKLSVRVAPTELDKIEKILAAIPLSKVEELQANLVSMREAF 1093
>gi|359359070|gb|AEV40977.1| putative exostosin family domain-containing protein [Oryza
punctata]
Length = 459
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 64/247 (25%), Positives = 113/247 (45%), Gaps = 28/247 (11%)
Query: 163 KDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVC---------- 212
++ VD + +P W R+ G DH FV V A V I AI +V
Sbjct: 166 REVVDR-VTAHPAWRRSGGRDHVFVLTDPV---AMWHVRKEIAPAILLVVDFGGWYKLDS 221
Query: 213 -SPSYDVA-FIPHKDVA-LPQVLQPFA--LPRGG-RDVENRTVLGFWAG----HRNSKIR 262
S S + + I H V+ L V+ P+ LP + + RT L ++ G HR +R
Sbjct: 222 NSASSNFSHMIQHTQVSLLKDVIVPYTHLLPTMQLSENKERTTLLYFKGAKHRHRGGLVR 281
Query: 263 VILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGC 322
L + N+ ++ + N A G + ++FC+ P G S R+ D++ C
Sbjct: 282 EKLWDLMVNEPDVVMEEGYPN-ATGREQSIKGMRTSEFCLHPAGDTPTSCRLFDAVASLC 340
Query: 323 VPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQI---LKNISDAEFVSLHNNLVKVQ 379
+PVI+S+ +LPF ++D+ +F + ++ + + K + L+N+ + N+ VQ
Sbjct: 341 IPVIVSDEIELPFEGMIDYTEFVIFVSVSNAMRPKWLTNYLRNVPRQKKDEFRRNMAHVQ 400
Query: 380 KHFQWNS 386
F+++S
Sbjct: 401 PIFEYDS 407
>gi|449270126|gb|EMC80844.1| Exostosin-2, partial [Columba livia]
Length = 714
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/94 (26%), Positives = 57/94 (60%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + ++ FC+ G+++ A ++D + GCVPVI+++ Y LPF+++LDW++ +V++ E
Sbjct: 304 YPQVLQESTFCVVLRGARLGQAVLSDVLQAGCVPVIIADSYILPFSEVLDWKRASVVIPE 363
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ ++ IL+++ + ++Q+ +W
Sbjct: 364 EKMPEMYSILQSVP-------QRQIEEMQRQARW 390
>gi|359359122|gb|AEV41028.1| putative exostosin family domain-containing protein [Oryza minuta]
Length = 459
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 64/247 (25%), Positives = 113/247 (45%), Gaps = 28/247 (11%)
Query: 163 KDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVC---------- 212
++ VD + +P W R+ G DH FV V A V I AI +V
Sbjct: 166 REVVDR-VTAHPAWRRSGGRDHVFVLTDPV---AMWHVRKEIAPAILLVVDFGGWYKLDS 221
Query: 213 -SPSYDVA-FIPHKDVA-LPQVLQPFA--LPRGG-RDVENRTVLGFWAG----HRNSKIR 262
S S + + I H V+ L V+ P+ LP + + RT L ++ G HR +R
Sbjct: 222 NSASSNFSHMIQHTQVSLLKDVIVPYTHLLPTMQLSENKERTTLLYFKGAKHRHRGGLVR 281
Query: 263 VILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGC 322
L + N+ ++ + N A G + ++FC+ P G S R+ D++ C
Sbjct: 282 EKLWDLMVNEPDVVMEEGYPN-ATGREQSIKGMRTSEFCLHPAGDTPTSCRLFDAVASLC 340
Query: 323 VPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQI---LKNISDAEFVSLHNNLVKVQ 379
+PVI+S+ +LPF ++D+ +F + ++ + + K + L+N+ + N+ VQ
Sbjct: 341 IPVIVSDEIELPFEGMIDYTEFVIFVSVSNAMRPKWLTNYLRNVPRQKKDEFRRNMAHVQ 400
Query: 380 KHFQWNS 386
F+++S
Sbjct: 401 PIFEYDS 407
>gi|268536482|ref|XP_002633376.1| C. briggsae CBR-RIB-1 protein [Caenorhabditis briggsae]
Length = 349
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 71/316 (22%), Positives = 125/316 (39%), Gaps = 68/316 (21%)
Query: 90 KIYIYPDGDPNTFYQTPRKLTGKYASEGY--FFQNIRESRFLTHDPDRAHLFFIPISCHK 147
K+YI+P ++ S+ Y ++ +ES T+DP+ A +F + I
Sbjct: 15 KLYIHP---------MEKRFEDSPQSQIYTKILKHYQESDHYTNDPNEACIFLLGID--- 62
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHF--------FVTCHDVGVRATEG 199
+ +N V DY+DSL WN G +H F D + G
Sbjct: 63 TTDRDIRSQNYVKNVPDYIDSL--DPAVWNN--GRNHLIFNFYHGTFPDYDDHNLNFDTG 118
Query: 200 VPFLIKNAIRVVCSPSYDVAFIPHKDVALP------------QVLQPFALPRGGRDV--- 244
+ + + S D F+ D++LP Q+ + A +G R
Sbjct: 119 EAMIAR-------ASSSDKNFLKGFDISLPLFHENHPFQIESQLDEGHAKNKGSRKYLVS 171
Query: 245 --ENRTVLGFWAGHRN------SKIRVILARV------WE--NDTELDISNNRINRAIGP 288
R V G +G RN + +++ W+ D NN +R
Sbjct: 172 FKGKRYVYGIGSGTRNLVHHLHNGEDIVMVTTCKHNSDWQAYQDDRCQGDNNEYDR---- 227
Query: 289 LVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVIL 348
Y + FC+ P G ++ S R +++ GC+PV++S+ + LPF++ DW +++
Sbjct: 228 WEYDDLLANSTFCLVPRGRRLGSFRFLETLRSGCIPVVISDSWVLPFSETTDWNSAVIVV 287
Query: 349 NERDVYQLKQILKNIS 364
ERD + ++L + S
Sbjct: 288 AERDALSIPELLMSTS 303
>gi|332836218|ref|XP_003313041.1| PREDICTED: exostosin-2 isoform 1 [Pan troglodytes]
Length = 728
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 29/117 (24%), Positives = 63/117 (53%), Gaps = 7/117 (5%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 351 RDVYQLKQILKNISDAEFVSLHNNL-VKVQKHFQWNSPPVRFDAFHMVMYELWLRRH 406
+ + IL++I + + L ++ + W++ A H + +W R
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQRQLFIEPARRENWSA------ANHQMNSLIWPREQ 418
>gi|60657600|gb|AAX33321.1| secondary cell wall-related glycosyltransferase family 47 [Populus
tremula x Populus tremuloides]
Length = 509
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 58/106 (54%), Gaps = 3/106 (2%)
Query: 295 FYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVY 354
+ +KFC+ G +S R+ D+I CVPVI+S+ +LP+ D+LD+ +F + + D
Sbjct: 362 MHSSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDDIELPYEDVLDYSQFCIFVRTSDAV 421
Query: 355 QLK---QILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMV 397
+ K ++++I E+ + L +V+ F++ P DA M+
Sbjct: 422 REKFLINLVRSIKKDEWTRMWQRLKEVENFFEFQYPSKEGDAVQMI 467
>gi|410966412|ref|XP_003989727.1| PREDICTED: exostosin-like 1 [Felis catus]
Length = 670
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 74/323 (22%), Positives = 122/323 (37%), Gaps = 66/323 (20%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
K++++P P + A+ +I SR+ T P+ A L +S
Sbjct: 93 GLKVFVHPVAGPIS------------AAHRKILASIESSRYHTASPEEACLLLF-LSLEV 139
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNA 207
G+ + WN G +H + H T F + A
Sbjct: 140 PAGECSPVPQQ-----------------WNG--GRNHLVLDLHPAPCPQT----FQLGRA 176
Query: 208 IRVVCSPSYDVAFIPHKDVALPQVLQPFALP-RGGRDVENRTVLGFWAGHRNSKIRVILA 266
+ SP+ D F P DVALP L P A P RGG + R H +LA
Sbjct: 177 MVAKASPTADT-FRPGFDVALP--LLPEAHPLRGGAPGQLRQ-------HSPHPGVPLLA 226
Query: 267 --------RVWENDTELDISNNRINRAIGPLVYQR--RFYKTKFCICPGGSQVNSARIAD 316
R D + R + GP FC+ P + ++
Sbjct: 227 LAEERGGWRTAGTDPSACPWDGRCEQDRGPEQTHPGGTLPNATFCLIPS-HRPDALHFLQ 285
Query: 317 SIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLV 376
++ GC+PV+LS ++LPF++++DW K A++ +ER +Q+ L+ + ++L
Sbjct: 286 ALQAGCIPVLLSPRWELPFSEVIDWTKAAIVADERLPFQVLAALQEMPLTRVLALRQ--- 342
Query: 377 KVQKHFQWNSPPVRFDAFHMVMY 399
Q F W++ F + VM+
Sbjct: 343 --QTQFLWDA---YFSSVEKVMH 360
>gi|134114275|ref|XP_774385.1| hypothetical protein CNBG3660 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50257020|gb|EAL19738.1| hypothetical protein CNBG3660 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 1132
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 48/176 (27%), Positives = 80/176 (45%), Gaps = 15/176 (8%)
Query: 220 FIPHKDVALPQ-------VLQPFALPRGGRDVENRTVLGFWAG-----HRNSKIRVILAR 267
+ PH+DV +P + F+ + + R L W+G ++ +IR+ R
Sbjct: 920 YRPHQDVVVPARTCHTNTLRATFSNVGSIKPMGERLNLLTWSGTYEVAGKSERIRLTCGR 979
Query: 268 VWENDTEL-DISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVI 326
D EL + N A G Y +FC P G S + D+I+ GC+PV
Sbjct: 980 GGAGDRELIKGGGKQSNFANGD--YMNDLNNARFCPQPRGITGWSPQTNDAIYAGCIPVF 1037
Query: 327 LSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHF 382
+S PF D LDW K +V + ++ ++++IL I ++ L NLV +++ F
Sbjct: 1038 ISEGTHYPFADFLDWSKLSVRVAPTELDKIEKILAAIPLSKVEELQANLVSMREAF 1093
>gi|326920358|ref|XP_003206441.1| PREDICTED: exostosin-2-like [Meleagris gallopavo]
Length = 718
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 25/94 (26%), Positives = 56/94 (59%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPVI+++ Y LPF+++LDW++ +V++ E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVIIADSYILPFSEVLDWKRASVVIPE 367
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ ++ IL+++ + ++Q+ +W
Sbjct: 368 EKMPEMYSILQSVP-------QRQIEEMQRQARW 394
>gi|224123938|ref|XP_002319201.1| predicted protein [Populus trichocarpa]
gi|222857577|gb|EEE95124.1| predicted protein [Populus trichocarpa]
Length = 450
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 58/105 (55%), Gaps = 3/105 (2%)
Query: 296 YKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ 355
+ +KFC+ G +S R+ D+I CVPVI+S+ +LP+ D+LD+ +F + + D +
Sbjct: 324 HSSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDDIELPYEDVLDYSQFCIFVRTSDAVR 383
Query: 356 LK---QILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMV 397
K ++++I E+ + L +V+ F++ P DA M+
Sbjct: 384 EKFLINLVRSIKKDEWTRMWQRLKEVENFFEFQYPSKEGDAVQMI 428
>gi|449444196|ref|XP_004139861.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
[Cucumis sativus]
Length = 478
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 63/273 (23%), Positives = 116/273 (42%), Gaps = 26/273 (9%)
Query: 139 FFIPISCHKMRGKGTSYENMTVIVKDY-----VDSLINKYPYWNRTLGADHFFVTCHDVG 193
FF +S G V +DY V + W ++ G DH FV V
Sbjct: 155 FFATMSAEMQLGMAKGAFRKKVGNEDYERQRNVMDFLKSTDAWKKSGGRDHVFVLTDPVA 214
Query: 194 V--RATEGVPFLIKNA-------IRVVCSPSYDVAFIPHKDVA-LPQVLQPFA--LPRG- 240
+ TE P ++ + S I H V+ L V+ P+ LPR
Sbjct: 215 MWHVKTEIAPAVLLVVDFGGWFRLDTKSSNGSSPDMIQHTQVSVLKDVIVPYTHLLPRLH 274
Query: 241 -GRDVENRTVLGFWAGHRNSKIRVILARVWE---NDTELDISNNRINRAIGPLVYQRRFY 296
+ + +T+L F R + ++ ++W+ N+ ++ + N A G +
Sbjct: 275 LSANKKRQTLLYFKGAKRRHRGGLVREKLWDLLVNEPDVIMEEGFPN-ATGKEQSIKGMR 333
Query: 297 KTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ- 355
++FC+ P G S R+ D+I C+PV++S+ +LPF D++D+ +F+V + D +
Sbjct: 334 SSEFCLHPAGDTPTSCRLFDAIQSLCIPVVVSDNIELPFEDMVDYSEFSVFVAVNDALKP 393
Query: 356 --LKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
L + L+ I + + + +VQ F++ +
Sbjct: 394 NWLVKHLRTIPEEQRNGFRLYMARVQSVFEYEN 426
>gi|431915722|gb|ELK16055.1| Exostosin-2 [Pteropus alecto]
Length = 1849
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 55/94 (58%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 1439 YPQVLQEATFCVVLRGARLGQAVLSDVLRAGCVPVVIADSYILPFSEVLDWKRASVVVPE 1498
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL++I + ++QK +W
Sbjct: 1499 EKISDVYSILQSIP-------QRQIEEMQKQARW 1525
>gi|332030617|gb|EGI70305.1| Exostosin-2 [Acromyrmex echinatior]
Length = 594
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 55/96 (57%), Gaps = 3/96 (3%)
Query: 279 NNRINRAIGPLVYQ--RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFN 336
NN I R +G +Y+ + FC+ G+++ + + D++ GC+PVI+++ +PF+
Sbjct: 174 NNTI-RCVGEDIYKYPDSLQTSTFCLIIRGARLAQSALLDAMAAGCIPVIIADSLMMPFH 232
Query: 337 DILDWRKFAVILNERDVYQLKQILKNISDAEFVSLH 372
D++DW K AV + E D+ Q+LK IS + +
Sbjct: 233 DVIDWTKAAVFIREVDILLTIQLLKKISPQRIMDMQ 268
>gi|74214457|dbj|BAE31083.1| unnamed protein product [Mus musculus]
Length = 690
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 54/94 (57%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 412 YPQVLQEATFCTVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 471
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL+NI + ++Q+ +W
Sbjct: 472 EKMSDVYSILQNIP-------QRQIEEMQRQARW 498
>gi|148695683|gb|EDL27630.1| exostoses (multiple) 2, isoform CRA_b [Mus musculus]
Length = 706
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 54/94 (57%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 372 YPQVLQEATFCTVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 431
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL+NI + ++Q+ +W
Sbjct: 432 EKMSDVYSILQNIP-------QRQIEEMQRQARW 458
>gi|157820411|ref|NP_001101221.1| exostosin-2 [Rattus norvegicus]
gi|149022701|gb|EDL79595.1| exostoses (multiple) 2 (predicted) [Rattus norvegicus]
Length = 670
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 54/94 (57%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 308 YPQVLQEATFCTVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVIPE 367
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL+NI + ++Q+ +W
Sbjct: 368 EKMSDVYSILQNIP-------QRQIEEMQRQARW 394
>gi|332210817|ref|XP_003254509.1| PREDICTED: exostosin-2 isoform 2 [Nomascus leucogenys]
Length = 728
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 29/116 (25%), Positives = 63/116 (54%), Gaps = 7/116 (6%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 351 RDVYQLKQILKNISDAEFVSLHNNL-VKVQKHFQWNSPPVRFDAFHMVMYELWLRR 405
+ + IL++I + + L ++ + W++ A H + +W R
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQRQLFMEPARRENWSA------ANHQINSLIWPRE 417
>gi|71897059|ref|NP_001026520.1| exostosin-2 [Gallus gallus]
gi|60098757|emb|CAH65209.1| hypothetical protein RCJMB04_7p22 [Gallus gallus]
Length = 567
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 25/94 (26%), Positives = 56/94 (59%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPVI+++ Y LPF+++LDW++ +V++ E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVIIADSYILPFSEVLDWKRASVVIPE 367
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ ++ IL+++ + ++Q+ +W
Sbjct: 368 DKMPEMYSILQSVP-------QRQIEEMQRQARW 394
>gi|74196713|dbj|BAE43097.1| unnamed protein product [Mus musculus]
Length = 669
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 49/195 (25%), Positives = 83/195 (42%), Gaps = 29/195 (14%)
Query: 204 IKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVE--------NRTVLGFWAG 255
+ A+ SPS D+ F P D+ALP + + L RGG + T+L
Sbjct: 169 LGQAMVAEASPSSDI-FRPGFDIALPYLPEAHPL-RGGAPGKLQQHSPQPGATLLA--VA 224
Query: 256 HRNSKIRVIL----ARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNS 311
+ R+ A +W+ E D + FC+ PG S
Sbjct: 225 EEKGRWRITSTHASACLWDRHCEQDPGPQQTYPG-------ETLPNATFCLIPGHRSATS 277
Query: 312 ARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSL 371
+ ++ GC+PV+LS ++LPF++++DW K A+I +ER Q+ L+ + + ++L
Sbjct: 278 CFL-QALQAGCIPVLLSPRWELPFSEVIDWTKAAIIADERLPLQVLAALREMLPSRVLAL 336
Query: 372 HNNLVKVQKHFQWNS 386
Q F W +
Sbjct: 337 RQ-----QTQFLWTA 346
>gi|1524415|gb|AAC53143.1| multiple exostosis protein [Mus musculus]
Length = 718
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 26/82 (31%), Positives = 49/82 (59%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC +++ A ++D + GCVPV++++ Y LPF++ILDW+K +V++ E
Sbjct: 308 YPQVLQEATFCTVLRRARLGQAVLSDVLQAGCVPVVIADSYILPFSEILDWKKASVVVPE 367
Query: 351 RDVYQLKQILKNISDAEFVSLH 372
+ + IL+NI + +H
Sbjct: 368 EKMSDVYSILQNIPQRQIEEMH 389
>gi|291384884|ref|XP_002708900.1| PREDICTED: exostosin 2 [Oryctolagus cuniculus]
Length = 765
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 25/94 (26%), Positives = 56/94 (59%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 355 YPQVLQEATFCVVLRGARLGQAALSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 414
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + +IL++I + ++Q+ +W
Sbjct: 415 EKMADVYKILQSIP-------QRQIQEMQRQARW 441
>gi|7407104|gb|AAF61913.1|AF224461_1 multiple exostoses-like 1 protein [Mus musculus]
gi|111305034|gb|AAI20892.1| Exostoses (multiple)-like 1 [Mus musculus]
gi|111307622|gb|AAI20891.1| Exostoses (multiple)-like 1 [Mus musculus]
gi|148698066|gb|EDL30013.1| exostoses (multiple)-like 1 [Mus musculus]
Length = 669
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 49/195 (25%), Positives = 83/195 (42%), Gaps = 29/195 (14%)
Query: 204 IKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVE--------NRTVLGFWAG 255
+ A+ SPS D+ F P D+ALP + + L RGG + T+L
Sbjct: 169 LGQAMVAEASPSSDI-FRPGFDIALPYLPEAHPL-RGGAPGKLQQHSPQPGATLLA--VA 224
Query: 256 HRNSKIRVIL----ARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNS 311
+ R+ A +W+ E D + FC+ PG S
Sbjct: 225 EEKGRWRITSTHASACLWDRHCEQDPGPQQTYPG-------ETLPNATFCLIPGHRSATS 277
Query: 312 ARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSL 371
+ ++ GC+PV+LS ++LPF++++DW K A+I +ER Q+ L+ + + ++L
Sbjct: 278 CFL-QALQAGCIPVLLSPRWELPFSEVIDWTKAAIIADERLPLQVLAALREMLPSRVLAL 336
Query: 372 HNNLVKVQKHFQWNS 386
Q F W +
Sbjct: 337 RQ-----QTQFLWTA 346
>gi|356576131|ref|XP_003556187.1| PREDICTED: probable glucuronosyltransferase Os01g0926400-like
[Glycine max]
Length = 481
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 111/443 (25%), Positives = 178/443 (40%), Gaps = 80/443 (18%)
Query: 6 AAQPNQAQTT-------LCSLRSSLLTLALVTLLCFTCL--------SFNSFRSYPLQ-- 48
AA+PN+ + LC+L S+L+LAL L F S + F S +
Sbjct: 5 AAKPNKNPSLYVPKLFLLCTL-FSILSLALFLFLIFPTATTTTTLHPSLSPFSSSTINVY 63
Query: 49 -NNFNSTLSFAINKQDYSD-----LGSDVFHS--PSVFHLNYLEMVTNFKIYIYPDGDPN 100
+ +L++A+ +S L +D H+ PS H KI YP+
Sbjct: 64 VADLPRSLNYALLHTYWSSFSDSRLPTDADHTTPPSSLH-------QTAKIPPYPENPLI 116
Query: 101 TFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP----ISCHKMRGKGTSYE 156
Y +TG + R L DP A + F+P +S G
Sbjct: 117 KQYSAEYWITGDLMTPPQHRATSFAKRVL--DPLLADVVFVPFFATLSAEMQLGANKGAF 174
Query: 157 NMTVIVKDY------VDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRV 210
DY +D++ N + WNR+ G DH FV V + + I A+ +
Sbjct: 175 RKKHDNDDYKRQRQVMDAVKNTH-AWNRSGGRDHVFVLTDPVAMWHVKDE---IAPAVLL 230
Query: 211 V----------------CSPSYDVAFIPHKDVA-LPQVLQPFA--LPRGG-RDVENRTVL 250
V CS S DV IPH V+ + V+ P+ LPR D + R L
Sbjct: 231 VVDFGGWYRLDSRGGSNCSES-DV--IPHTQVSVIKDVIVPYTHLLPRLDLSDNKERHQL 287
Query: 251 GFWAG----HRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGG 306
++ G HR IR L + ++ + + N A G + ++FC+ P G
Sbjct: 288 LYFKGAKHRHRGGIIREKLWDLLVSEPGVIMEEGFPN-ATGREQSIKGMQTSEFCLHPAG 346
Query: 307 SQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ---LKQILKNI 363
S R+ D+I C+PVI+S+ +LPF ++D+ +F+V D + L L++
Sbjct: 347 DTPTSCRLFDAIQSLCIPVIVSDNIELPFEGMVDYAEFSVFAAVSDALKPSWLVSHLQSF 406
Query: 364 SDAEFVSLHNNLVKVQKHFQWNS 386
S + N+ +VQ F +++
Sbjct: 407 SKEQKDRFRQNMARVQPIFVYDN 429
>gi|57044145|ref|XP_544489.1| PREDICTED: exostoses (multiple)-like 1 [Canis lupus familiaris]
Length = 672
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 53/196 (27%), Positives = 85/196 (43%), Gaps = 27/196 (13%)
Query: 202 FLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALP-RGGRDVENRTVLGFWAGHRNSK 260
F A+ SP+ D F P DVALP L P A P RGG + R H
Sbjct: 173 FPPGQAMVAKASPAVDT-FRPGFDVALP--LLPEAHPLRGGAPGQLRQ-------HSPHP 222
Query: 261 IRVILA--------RVWENDTELDISNNRINRAIGP--LVYQRRFYKTKFCICPGGSQVN 310
+LA RV D+ + R + GP FC+ PG +
Sbjct: 223 GVPLLALAEERGGWRVAGADSPACPWDGRCEQNRGPEQTPLGATLPNATFCVIPG-HRPE 281
Query: 311 SARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVS 370
+ ++ GC+PV+LS ++LPF++++DW K A++ ++R +Q+ L+ + ++
Sbjct: 282 ALHFLQALQAGCIPVLLSPRWELPFSEVIDWTKAAIVADKRLPFQVLAALQEMPPTRVLA 341
Query: 371 LHNNLVKVQKHFQWNS 386
L Q F W++
Sbjct: 342 LRQ-----QTQFLWDA 352
>gi|351696937|gb|EHA99855.1| Exostosin-2 [Heterocephalus glaber]
Length = 717
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 24/94 (25%), Positives = 55/94 (58%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 307 YPQVLQEATFCVVLRGARLGQAALSDVLRAGCVPVVIADSYILPFSEVLDWKRASVVVPE 366
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + +L++I + ++Q+ +W
Sbjct: 367 EKIADMYGVLQSIP-------RRQMEEMQRQVRW 393
>gi|18399194|ref|NP_564443.1| exostosin-like protein [Arabidopsis thaliana]
gi|5091619|gb|AAD39607.1|AC007454_6 F23M19.7 [Arabidopsis thaliana]
gi|15450928|gb|AAK96735.1| Unknown protein [Arabidopsis thaliana]
gi|20148711|gb|AAM10246.1| unknown protein [Arabidopsis thaliana]
gi|332193570|gb|AEE31691.1| exostosin-like protein [Arabidopsis thaliana]
Length = 477
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 72/273 (26%), Positives = 111/273 (40%), Gaps = 27/273 (9%)
Query: 139 FFIPISCHKMRGKGTSYENMTVIVKDY-----VDSLINKYPYWNRTLGADHFFVTCHDVG 193
FF +S G G +DY V + W R+ G DH FV V
Sbjct: 153 FFATLSAEMELGNGKGSFRKKSGNEDYQRQRQVLDFVKNTKAWKRSNGRDHVFVLTDPVA 212
Query: 194 ---VRATEGVPFLI----KNAIRVVCSPSYDVAF---IPHKDVA-LPQVLQPFA--LPRG 240
VR + L+ R S + I H V+ + V+ P+ LPR
Sbjct: 213 MWHVREEIALSILLVVDFGGWFRQDSKSSNGTSLPERIQHTQVSVIKDVIVPYTHLLPRL 272
Query: 241 G-RDVENRTVLGFWAG----HRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRF 295
+ R L ++ G HR IR L + N+ + + N A G R
Sbjct: 273 DLSQNQRRHSLLYFKGAKHRHRGGLIREKLWDLLVNEPGVVMEEGFPN-ATGREQSIRGM 331
Query: 296 YKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ 355
++FC+ P G S R+ D+I C+PVI+S+ +LPF I+D+ +F+V + D
Sbjct: 332 RNSEFCLHPAGDTPTSCRLFDAIQSLCIPVIVSDTIELPFEGIIDYSEFSVFASVSDALT 391
Query: 356 LKQI---LKNISDAEFVSLHNNLVKVQKHFQWN 385
K + L S+ E +L + + KVQ F ++
Sbjct: 392 PKWLANHLGRFSEREKETLRSRIAKVQSVFVYD 424
>gi|187469677|gb|AAI66748.1| Ext2 protein [Rattus norvegicus]
Length = 718
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 54/94 (57%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 308 YPQVLQEATFCTVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVIPE 367
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL+NI + ++Q+ +W
Sbjct: 368 EKMSDVYSILQNIP-------QRQIEEMQRQARW 394
>gi|327259699|ref|XP_003214673.1| PREDICTED: exostosin-2-like [Anolis carolinensis]
Length = 718
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/94 (26%), Positives = 55/94 (58%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FCI G+++ A ++D + GC+PV++++ Y LPF+++LDW++ +V++ E
Sbjct: 308 YPQVLQEATFCIVLRGARLGQAVLSDVLQAGCIPVVIADSYILPFSEVLDWKRASVVIPE 367
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ ++ IL +I + ++Q+ +W
Sbjct: 368 EKMSEMYSILHSIP-------QRQIEEMQRQARW 394
>gi|296010875|ref|NP_001171554.1| exostosin-2 isoform 3 [Homo sapiens]
gi|119588476|gb|EAW68070.1| exostoses (multiple) 2, isoform CRA_c [Homo sapiens]
Length = 728
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 24/85 (28%), Positives = 51/85 (60%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 351 RDVYQLKQILKNISDAEFVSLHNNL 375
+ + IL++I + + L
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQRQL 392
>gi|297688867|ref|XP_002821894.1| PREDICTED: exostosin-2 isoform 2 [Pongo abelii]
Length = 728
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 29/117 (24%), Positives = 63/117 (53%), Gaps = 7/117 (5%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 351 RDVYQLKQILKNISDAEFVSLHNNL-VKVQKHFQWNSPPVRFDAFHMVMYELWLRRH 406
+ + IL++I + + L ++ + W++ A H + +W R
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQRQLFMEPARRENWSA------ANHQMNSLIWPREQ 418
>gi|2251238|gb|AAB62718.1| multiple exostoses type II protein EXT2.I [Homo sapiens]
Length = 728
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 24/85 (28%), Positives = 51/85 (60%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 351 RDVYQLKQILKNISDAEFVSLHNNL 375
+ + IL++I + + L
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQRQL 392
>gi|126332604|ref|XP_001362716.1| PREDICTED: exostosin-2 [Monodelphis domestica]
Length = 718
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/94 (26%), Positives = 55/94 (58%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPVI+++ Y LPF+++LDW++ +V++ E
Sbjct: 308 YPQVLQEASFCMVLRGARLGQAVLSDVLQAGCVPVIIADSYILPFSEVLDWKRASVVIPE 367
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + +L++I + ++Q+ +W
Sbjct: 368 EKMRDMYSVLRSIP-------QRQIEEMQRQARW 394
>gi|426368047|ref|XP_004051026.1| PREDICTED: exostosin-2 isoform 2 [Gorilla gorilla gorilla]
Length = 728
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 24/85 (28%), Positives = 51/85 (60%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 351 RDVYQLKQILKNISDAEFVSLHNNL 375
+ + IL++I + + L
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQRQL 392
>gi|170583869|ref|XP_001896766.1| exostosin-1 [Brugia malayi]
gi|158595933|gb|EDP34387.1| exostosin-1, putative [Brugia malayi]
Length = 411
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 46/73 (63%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y+ + FC+ P G ++ S R +++ GC+PV+LS+ ++LPF++++DWR+ +I +E
Sbjct: 152 YEMIMSNSTFCLTPRGRRLGSFRFLEALRLGCIPVVLSDDWELPFSEVIDWRQAVIIGHE 211
Query: 351 RDVYQLKQILKNI 363
V + +L I
Sbjct: 212 DTVLTISDVLNAI 224
>gi|74213823|dbj|BAE29346.1| unnamed protein product [Mus musculus]
Length = 786
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 54/94 (57%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 376 YPQVLQEATFCTVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 435
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL+NI + ++Q+ +W
Sbjct: 436 EKMSDVYSILQNIP-------QRQIEEMQRQARW 462
>gi|427796253|gb|JAA63578.1| Putative tout-velu, partial [Rhipicephalus pulchellus]
Length = 835
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 76/317 (23%), Positives = 124/317 (39%), Gaps = 77/317 (24%)
Query: 88 NFKIYIYP---DGDPNT-FYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI 143
+FK+Y+YP D P + YQ + IR+S + T D A +F +
Sbjct: 186 DFKVYVYPSESDAPPASPVYQR-------------ILRVIRQSSYATADASEACVFVPAV 232
Query: 144 SCHKMRGKGTSYENMTVIVKDYVDSL-INKYPYWNRTLGADHFFVT---------CHDVG 193
+ + DY + + P WN G +H ++G
Sbjct: 233 DT----------VDRDPLSPDYARTARLTDSPLWNG--GQNHLIFNLFSGTWPDYSEELG 280
Query: 194 VRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALP-RGGRDV-------- 244
+ G+ L K++I + AF P D+ALP L P A P RGG+
Sbjct: 281 LDI--GLAMLAKSSI-------PESAFRPGFDIALP--LFPKAHPERGGKPAIQSAGPVD 329
Query: 245 --------ENRTVLGFWAGHRNS-------KIRVILARVWENDTELDISNNRI---NRAI 286
R V G + RN+ + ++L ++ + R NR
Sbjct: 330 KGYLLVFKGKRYVYGIGSDTRNALHHLNNGRDVLLLTTCRHGKQWMERRDERCEADNRLY 389
Query: 287 GPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAV 346
Y FC+ P G ++ S R +S+ GCVPV+L+N ++LPF + L W A+
Sbjct: 390 DRYDYGSLMENATFCLVPRGRRLGSFRFLESLQAGCVPVLLANGWELPFGESLRWEGAAL 449
Query: 347 ILNERDVYQLKQILKNI 363
+ER + Q+ L+++
Sbjct: 450 RADERLLLQVPDTLRSM 466
>gi|344281096|ref|XP_003412316.1| PREDICTED: LOW QUALITY PROTEIN: exostosin-2-like [Loxodonta
africana]
Length = 728
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 33/140 (23%), Positives = 66/140 (47%), Gaps = 20/140 (14%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 351 RDVYQLKQILKNISDAEFVSL------------------HNNLVKVQKHFQWNSPPV--R 390
+ + IL++I + + H + QWNS + R
Sbjct: 368 EKMSDMYSILQSIPQRQIEEMQRQVSMEPAVRENWSAANHXRSSLIXPKGQWNSQIINDR 427
Query: 391 FDAFHMVMYELWLRRHVIKY 410
+ + YE W +K+
Sbjct: 428 IYPYAAISYEEWNDPPTVKW 447
>gi|13879260|gb|AAH06597.1| Exostoses (multiple) 2 [Mus musculus]
Length = 718
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 54/94 (57%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 308 YPQVLQEATFCTVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL+NI + ++Q+ +W
Sbjct: 368 EKMSDVYSILQNIP-------QRQIEEMQRQARW 394
>gi|194698238|gb|ACF83203.1| unknown [Zea mays]
gi|413919897|gb|AFW59829.1| hypothetical protein ZEAMMB73_270023 [Zea mays]
Length = 264
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 48/170 (28%), Positives = 84/170 (49%), Gaps = 10/170 (5%)
Query: 224 KDVALPQV-LQPFALPRGGRDVENRTVLGFWAG---HRNSKIRVILARVWENDTELDISN 279
KDV +P L P L +D RT+L F HR +R L + N+ ++ +
Sbjct: 46 KDVIVPYTHLLPTLLLSENKD--RRTLLYFKGAKHRHRGGLVREKLWDLLGNEPDVIMEE 103
Query: 280 NRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDIL 339
N A G + ++FC+ P G S R+ D+I C+PVI+S+ +LPF I+
Sbjct: 104 GFPN-ATGREQSIKGLRTSEFCLHPAGDTPTSCRLFDAIASLCIPVIVSDEVELPFEGII 162
Query: 340 DWRKFAVILNERDVYQLKQI---LKNISDAEFVSLHNNLVKVQKHFQWNS 386
D+ + ++ ++ + + K + L+NIS + NL +VQ F++++
Sbjct: 163 DYTEISIFVSVSNAMRPKWLTSYLRNISKQQKDEFRRNLARVQPIFEYDT 212
>gi|226442845|ref|NP_034293.2| exostosin-2 [Mus musculus]
gi|341940670|sp|P70428.2|EXT2_MOUSE RecName: Full=Exostosin-2; AltName:
Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
acetylglucosaminyl-proteoglycan
4-alpha-N-acetylglucosaminyltransferase; AltName:
Full=Multiple exostoses protein 2 homolog
gi|74216635|dbj|BAE37749.1| unnamed protein product [Mus musculus]
gi|148695682|gb|EDL27629.1| exostoses (multiple) 2, isoform CRA_a [Mus musculus]
Length = 718
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 54/94 (57%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 308 YPQVLQEATFCTVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL+NI + ++Q+ +W
Sbjct: 368 EKMSDVYSILQNIP-------QRQIEEMQRQARW 394
>gi|74201817|dbj|BAE28510.1| unnamed protein product [Mus musculus]
Length = 701
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 54/94 (57%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 291 YPQVLQEATFCTVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 350
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL+NI + ++Q+ +W
Sbjct: 351 EKMSDVYSILQNIP-------QRQIEEMQRQARW 377
>gi|172072650|ref|NP_062524.2| exostosin-like 1 [Mus musculus]
gi|341940671|sp|Q9JKV7.2|EXTL1_MOUSE RecName: Full=Exostosin-like 1; AltName: Full=Exostosin-L; AltName:
Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan
4-alpha-N-acetylglucosaminyltransferase; AltName:
Full=Multiple exostosis-like protein
Length = 669
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 49/195 (25%), Positives = 83/195 (42%), Gaps = 29/195 (14%)
Query: 204 IKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVE--------NRTVLGFWAG 255
+ A+ SPS D+ F P D+ALP + + L RGG + T+L
Sbjct: 169 LGQAMVAEASPSSDI-FRPGFDLALPYLPEAHPL-RGGAPGKLQQHSPQPGATLLA--VA 224
Query: 256 HRNSKIRVIL----ARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNS 311
+ R+ A +W+ E D + FC+ PG S
Sbjct: 225 EEKGRWRITSTHASACLWDRHCEQDPGPQQTYPG-------ETLPNATFCLIPGHRSATS 277
Query: 312 ARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSL 371
+ ++ GC+PV+LS ++LPF++++DW K A+I +ER Q+ L+ + + ++L
Sbjct: 278 CFL-QALQAGCIPVLLSPRWELPFSEVIDWTKAAIIADERLPLQVLAALREMLPSRVLAL 336
Query: 372 HNNLVKVQKHFQWNS 386
Q F W +
Sbjct: 337 RQ-----QTQFLWTA 346
>gi|221041480|dbj|BAH12417.1| unnamed protein product [Homo sapiens]
Length = 337
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 25/94 (26%), Positives = 55/94 (58%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 189 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 248
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL++I + ++Q+ +W
Sbjct: 249 EKMSDVYSILQSIP-------QRQIEEMQRQARW 275
>gi|338711993|ref|XP_001489915.2| PREDICTED: LOW QUALITY PROTEIN: exostosin-2 [Equus caballus]
Length = 728
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 24/85 (28%), Positives = 51/85 (60%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 308 YPQVLQEATFCMVLRGARLGQAVLSDVLRAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 351 RDVYQLKQILKNISDAEFVSLHNNL 375
+ + IL++I + + L
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQRQL 392
>gi|355752199|gb|EHH56319.1| Exostosin-2, partial [Macaca fascicularis]
Length = 733
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 25/94 (26%), Positives = 55/94 (58%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 323 YPQVLQEATFCVVLRGARLGQAMLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 382
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL++I + ++Q+ +W
Sbjct: 383 EKMSDVYSILQSIP-------QRQIEEMQRQARW 409
>gi|194755024|ref|XP_001959792.1| GF13049 [Drosophila ananassae]
gi|190621090|gb|EDV36614.1| GF13049 [Drosophila ananassae]
Length = 717
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 68/317 (21%), Positives = 121/317 (38%), Gaps = 51/317 (16%)
Query: 89 FKIYIYPDGDPNTFYQTPRKLTGKYASEGYF--FQNIRESRFLTHDPDRAHLFFIPISCH 146
K+YIYP D F S YF + + +SR+ T +P+ A LF
Sbjct: 108 LKVYIYPLQD---FVDERSDNVASTLSSEYFQILEAVLKSRYYTSNPNEACLFL------ 158
Query: 147 KMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKN 206
N K + + +W+R GA+H G + +
Sbjct: 159 ----PSLDLLNQNNFDKHLAGAALASLDFWDR--GANHIIFNMLP-GAAPSYNTVLDVNT 211
Query: 207 AIRVVCSPSYDV-AFIPHKDVALPQVLQPFALPRGGRDVENRTVLGFWAGHRNSKIRVIL 265
++ +D ++ P DV++P V P +P+ R L A IL
Sbjct: 212 DNAIIFGGGFDSWSYRPGFDVSIP-VWSPRLVPQHAHATAQRKFLLLVAQLN------IL 264
Query: 266 ARVWENDTELDISNNRINRAIGP------------------LVYQRRFYKTKFCICPGGS 307
R E+ +++N +G L Y R + KFC+
Sbjct: 265 PRFVRTMREVALAHNDQMLLLGACENMDLTSRCPVSQHHKSLEYPRLLSRGKFCLITKSL 324
Query: 308 QVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAE 367
++ + + + C+PV+ + Y +PF D++DW +V + E +++ + Q L+ IS+ +
Sbjct: 325 RLGQPDLVEIMSQHCIPVVAVDNYIMPFEDVIDWSLASVRIRESELHSVMQKLQAISNIK 384
Query: 368 FVSLHNNLVKVQKHFQW 384
+V++QK QW
Sbjct: 385 -------IVEMQKQVQW 394
>gi|74183570|dbj|BAE36631.1| unnamed protein product [Mus musculus]
Length = 636
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 49/195 (25%), Positives = 84/195 (43%), Gaps = 29/195 (14%)
Query: 204 IKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVE--------NRTVLGFWAG 255
+ A+ SPS D+ F P D+ALP + + L RGG + T+L
Sbjct: 136 LGQAMVAEASPSSDI-FRPGFDLALPYLPEAHPL-RGGAPGKLQQHSPQPGATLLAV--A 191
Query: 256 HRNSKIRVIL----ARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNS 311
+ R+ A +W+ E D + FC+ PG S
Sbjct: 192 EEKGRWRITSTHASACLWDRHCEQDPGPQQTYPG-------ETLPNATFCLIPGHRSATS 244
Query: 312 ARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSL 371
+ ++ GC+PV+LS ++LPF++++DW K A+I +ER Q+ L+ + + ++L
Sbjct: 245 CFL-QALQAGCIPVLLSPRWELPFSEVIDWTKAAIIADERLPLQVLAALREMLPSRVLAL 303
Query: 372 HNNLVKVQKHFQWNS 386
+ Q F W +
Sbjct: 304 -----RQQTQFLWTA 313
>gi|74144073|dbj|BAE22143.1| unnamed protein product [Mus musculus]
Length = 435
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 54/94 (57%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 25 YPQVLQEATFCTVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 84
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL+NI + ++Q+ +W
Sbjct: 85 EKMSDVYSILQNIP-------QRQIEEMQRQARW 111
>gi|374922021|gb|AFA26188.1| hypothetical protein, partial [Lolium perenne]
Length = 282
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 60/263 (22%), Positives = 109/263 (41%), Gaps = 49/263 (18%)
Query: 167 DSLINKYPYWNRTLGADHFFVTCHDVGV----RATEGVPFLI---------KNAIRVVCS 213
D + +YPYWNRT G DH + D G + L+ +N+ +
Sbjct: 6 DHIAQRYPYWNRTSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWGNTNTKHENSTTAYWA 65
Query: 214 PSYD----------VAFIPHKDVALPQVLQP----FALPRGGRDVENRTVLGFWAGHRNS 259
++D F P KD+ LP P L R NR L ++ G+ S
Sbjct: 66 DNWDNIPLDRRGDHPCFDPTKDLVLPAWKDPDPAAIWLKLWARPRSNRRTLFYFNGNLGS 125
Query: 260 K-------------IRVILARVWEN--DTELDISNNRINRAIGPLVYQRRFYK----TKF 300
IR LA + + + + + + + ++Y+ + F
Sbjct: 126 AYEQGRPEDTYSMGIRQKLAAEFGSTPNKQGKLGRQHVANVTVTHLRSEKYYEELASSIF 185
Query: 301 C-ICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQI 359
C + PG S R+ DS+ GC+PVI+ + LP+ ++L++ FAV + E D+ L +
Sbjct: 186 CGVLPGDGW--SGRMEDSMLQGCIPVIIQDGIFLPYENVLNYNSFAVRIQEDDIPNLITV 243
Query: 360 LKNISDAEFVSLHNNLVKVQKHF 382
L+ +++ + + N+ ++ + F
Sbjct: 244 LRGMNETQIEFMLGNVRQIWQRF 266
>gi|410973605|ref|XP_003993238.1| PREDICTED: exostosin-2 [Felis catus]
Length = 718
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 24/94 (25%), Positives = 56/94 (59%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLRAGCVPVVIADSYVLPFSEVLDWKRASVVVPE 367
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ ++ IL+++ + ++Q+ +W
Sbjct: 368 EKMSEVYSILQSVP-------QRQIEEMQRQARW 394
>gi|401888173|gb|EJT52138.1| hypothetical protein A1Q1_06676 [Trichosporon asahii var. asahii CBS
2479]
Length = 1189
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 43/176 (24%), Positives = 77/176 (43%), Gaps = 6/176 (3%)
Query: 214 PSYDVAFIPHKDVALPQVLQPFALPRGGRDVENRTVLGFWAG-----HRNSKIRVILARV 268
P+ DV +P + P++L F + V R L WAG ++ ++R+ R
Sbjct: 982 PAQDV-VVPPRSCTSPRLLASFPTVSHIKPVSERPRLISWAGTYWGSGKSERLRLACPRG 1040
Query: 269 WENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILS 328
EL N Y +FC P G S R+ D+I GC+PV+ S
Sbjct: 1041 GAGMRELLPGAGPQNHIDKYDDYLEELNTARFCPQPRGIAGWSPRVNDAIFAGCIPVLTS 1100
Query: 329 NYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
PF ++DW + +V ++ ++ ++++L +I A + N+V ++ F +
Sbjct: 1101 EDTHYPFAGLIDWSQISVRVHPTELDHVEELLASIPLARLEQIQANIVAIRDAFMY 1156
>gi|426245345|ref|XP_004016473.1| PREDICTED: exostosin-2 [Ovis aries]
Length = 718
Score = 61.6 bits (148), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 55/94 (58%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPVI+++ Y LPF+++LDW++ +V++ E
Sbjct: 308 YPQVLQEATFCMVLRGARLGQAVLSDVLRAGCVPVIIADSYVLPFSEVLDWKRASVVVPE 367
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL++I + ++Q+ +W
Sbjct: 368 EKMLDVYSILQSIP-------RRQIEEMQRQARW 394
>gi|359493064|ref|XP_002270238.2| PREDICTED: probable glucuronosyltransferase Os01g0926700-like
[Vitis vinifera]
Length = 483
Score = 61.6 bits (148), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 74/278 (26%), Positives = 112/278 (40%), Gaps = 36/278 (12%)
Query: 139 FFIPISCHKMRGKGTSYENMTVIVKDY-----VDSLINKYPYWNRTLGADHFFVTCHDVG 193
FF IS G G +DY V + W R+ G DH FV V
Sbjct: 160 FFATISAEIQLGGGKGVFRKKEGNEDYERQRQVMEFVRGTEAWKRSGGRDHVFVLTDPV- 218
Query: 194 VRATEGVPFLIKNAIRVV------------CSPSYDVAFIPHKDVAL-PQVLQPFA--LP 238
A V I AI +V S + I H V+L V+ P+ LP
Sbjct: 219 --AMWHVKAEIAPAILLVVDFGGWYKLDSKASNNSLSEMIQHTQVSLLKDVIVPYTHLLP 276
Query: 239 RGG-RDVENRTVLGFWAG----HRNSKIRVIL--ARVWENDTELDISNNRINRAIGPLVY 291
R + + R L ++ G HR +R L V+E I A G
Sbjct: 277 RLHLSENQIRQTLLYFKGAKHRHRGGLVREKLWDLLVYEQGV---IMEEGFPNATGREQS 333
Query: 292 QRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNER 351
+ ++FC+ P G S R+ D+I C+PVI+S+ +LPF ++D+ +F+V + R
Sbjct: 334 IKGMRTSEFCLHPAGDTPTSCRLFDAIQSLCIPVIVSDNIELPFEGMVDYSEFSVFVAVR 393
Query: 352 DVY---QLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
D L L++ S + N+ +VQ FQ+++
Sbjct: 394 DSLLPNWLVSHLRSFSKGQRDRFRQNMARVQPIFQYDN 431
>gi|168277694|dbj|BAG10825.1| exostosin-2 [synthetic construct]
Length = 746
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 25/94 (26%), Positives = 55/94 (58%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 336 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 395
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL++I + ++Q+ +W
Sbjct: 396 EKMSDVYSILQSIP-------QRQIEEMQRQARW 422
>gi|402893738|ref|XP_003910046.1| PREDICTED: exostosin-2-like isoform 2 [Papio anubis]
Length = 515
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 25/94 (26%), Positives = 55/94 (58%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 321 YPQVLQEATFCVVLRGARLGQAILSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 380
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL++I + ++Q+ +W
Sbjct: 381 EKMSDVYSILQSIP-------QRQIEEMQRQARW 407
>gi|297688865|ref|XP_002821893.1| PREDICTED: exostosin-2 isoform 1 [Pongo abelii]
Length = 731
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 25/94 (26%), Positives = 55/94 (58%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 321 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 380
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL++I + ++Q+ +W
Sbjct: 381 EKMSDVYSILQSIP-------QRQIEEMQRQARW 407
>gi|119588473|gb|EAW68067.1| exostoses (multiple) 2, isoform CRA_a [Homo sapiens]
gi|119588475|gb|EAW68069.1| exostoses (multiple) 2, isoform CRA_a [Homo sapiens]
Length = 731
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 25/94 (26%), Positives = 55/94 (58%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 321 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 380
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL++I + ++Q+ +W
Sbjct: 381 EKMSDVYSILQSIP-------QRQIEEMQRQARW 407
>gi|402893736|ref|XP_003910045.1| PREDICTED: exostosin-2-like isoform 1 [Papio anubis]
Length = 535
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 25/94 (26%), Positives = 55/94 (58%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 341 YPQVLQEATFCVVLRGARLGQAILSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 400
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL++I + ++Q+ +W
Sbjct: 401 EKMSDVYSILQSIP-------QRQIEEMQRQARW 427
>gi|390470425|ref|XP_002755247.2| PREDICTED: exostosin-2 isoform 1 [Callithrix jacchus]
Length = 718
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 25/94 (26%), Positives = 55/94 (58%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL++I + ++Q+ +W
Sbjct: 368 EKMSDVYSILQSIP-------QRQIEEMQRQARW 394
>gi|335281986|ref|XP_003122905.2| PREDICTED: exostosin-2-like [Sus scrofa]
Length = 718
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 25/94 (26%), Positives = 55/94 (58%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLRAGCVPVVIADSYVLPFSEVLDWKRASVVVPE 367
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL++I + ++Q+ +W
Sbjct: 368 EKMSDVYSILQSIP-------RRQMEEMQRQARW 394
>gi|432090368|gb|ELK23794.1| Exostosin-2 [Myotis davidii]
Length = 718
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 25/94 (26%), Positives = 55/94 (58%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLRAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL++I + ++Q+ +W
Sbjct: 368 EKMADVYSILQSIP-------QRQIEEMQRQARW 394
>gi|403254631|ref|XP_003920065.1| PREDICTED: exostosin-2 [Saimiri boliviensis boliviensis]
Length = 718
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 25/94 (26%), Positives = 55/94 (58%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL++I + ++Q+ +W
Sbjct: 368 EKMSDVYSILQSIP-------QRQIEEMQRQARW 394
>gi|395742863|ref|XP_003777829.1| PREDICTED: exostosin-2 [Pongo abelii]
Length = 751
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 25/94 (26%), Positives = 55/94 (58%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 341 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 400
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL++I + ++Q+ +W
Sbjct: 401 EKMSDVYSILQSIP-------QRQIEEMQRQARW 427
>gi|224144904|ref|XP_002325457.1| predicted protein [Populus trichocarpa]
gi|222862332|gb|EEE99838.1| predicted protein [Populus trichocarpa]
Length = 463
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 58/105 (55%), Gaps = 3/105 (2%)
Query: 296 YKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ 355
+ +KFC+ G +S R+ D+I CVPVI+S+ +LP+ ++LD+ +F + + D +
Sbjct: 317 HSSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDDIELPYENVLDYSQFCIFVRTSDAVR 376
Query: 356 LK---QILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMV 397
K ++++I E+ + L +V+ F++ P DA M+
Sbjct: 377 EKFLVNLIRSIKKDEWTRMWKRLKEVENFFEFQYPSREGDAVQMI 421
>gi|296010873|ref|NP_000392.3| exostosin-2 isoform 1 [Homo sapiens]
gi|119588477|gb|EAW68071.1| exostoses (multiple) 2, isoform CRA_d [Homo sapiens]
Length = 751
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 25/94 (26%), Positives = 55/94 (58%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 341 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 400
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL++I + ++Q+ +W
Sbjct: 401 EKMSDVYSILQSIP-------QRQIEEMQRQARW 427
>gi|46370069|ref|NP_997005.1| exostosin-2 isoform 2 [Homo sapiens]
gi|3023739|sp|Q93063.1|EXT2_HUMAN RecName: Full=Exostosin-2; AltName:
Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
acetylglucosaminyl-proteoglycan
4-alpha-N-acetylglucosaminyltransferase; AltName:
Full=Multiple exostoses protein 2; AltName:
Full=Putative tumor suppressor protein EXT2
gi|1518042|gb|AAB07008.1| EXT2 [Homo sapiens]
gi|1519605|gb|AAC51219.1| multiple exostosis 2 [Homo sapiens]
gi|1621113|gb|AAC50764.1| hereditary multiple exostoses gene 2 protein [Homo sapiens]
gi|14603196|gb|AAH10058.1| Exostoses (multiple) 2 [Homo sapiens]
gi|119588474|gb|EAW68068.1| exostoses (multiple) 2, isoform CRA_b [Homo sapiens]
gi|189065454|dbj|BAG35293.1| unnamed protein product [Homo sapiens]
gi|325463523|gb|ADZ15532.1| exostoses (multiple) 2 [synthetic construct]
Length = 718
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 25/94 (26%), Positives = 55/94 (58%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL++I + ++Q+ +W
Sbjct: 368 EKMSDVYSILQSIP-------QRQIEEMQRQARW 394
>gi|397473541|ref|XP_003808267.1| PREDICTED: exostosin-2 [Pan paniscus]
gi|410305316|gb|JAA31258.1| exostosin 2 [Pan troglodytes]
Length = 751
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 25/94 (26%), Positives = 55/94 (58%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 341 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 400
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL++I + ++Q+ +W
Sbjct: 401 EKMSDVYSILQSIP-------QRQIEEMQRQARW 427
>gi|332210815|ref|XP_003254508.1| PREDICTED: exostosin-2 isoform 1 [Nomascus leucogenys]
Length = 718
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 25/94 (26%), Positives = 55/94 (58%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL++I + ++Q+ +W
Sbjct: 368 EKMSDVYSILQSIP-------QRQIEEMQRQARW 394
>gi|449439621|ref|XP_004137584.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
IRX7-like [Cucumis sativus]
Length = 494
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 62/124 (50%), Gaps = 9/124 (7%)
Query: 277 ISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFN 336
+ N IN+A + +KFC+ G +S R+ DSI CVPVI+S+ +LP+
Sbjct: 346 VKGNGINKA------GQGMASSKFCLNIAGDTPSSNRLFDSIASHCVPVIISDDIELPYE 399
Query: 337 DILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDA 393
DILD+ +F V + D + L +L+ I + + + + ++ F++ P DA
Sbjct: 400 DILDYSEFCVFVRAADSIRKGYLLNLLRGIGRERWTKMWDRIKEIVHEFEYQYPSQSGDA 459
Query: 394 FHMV 397
M+
Sbjct: 460 VDMI 463
>gi|332836220|ref|XP_003313042.1| PREDICTED: exostosin-2 isoform 2 [Pan troglodytes]
Length = 751
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 25/94 (26%), Positives = 55/94 (58%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 341 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 400
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL++I + ++Q+ +W
Sbjct: 401 EKMSDVYSILQSIP-------QRQIEEMQRQARW 427
>gi|332210819|ref|XP_003254510.1| PREDICTED: exostosin-2 isoform 3 [Nomascus leucogenys]
Length = 751
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 25/94 (26%), Positives = 55/94 (58%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 341 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 400
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL++I + ++Q+ +W
Sbjct: 401 EKMSDVYSILQSIP-------QRQIEEMQRQARW 427
>gi|355566589|gb|EHH22968.1| Exostosin-2 [Macaca mulatta]
Length = 751
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 26/96 (27%), Positives = 55/96 (57%), Gaps = 5/96 (5%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 341 YPQVLQEATFCVVLRGARLGQAMLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 400
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
+ + IL++I + + K Q + W +
Sbjct: 401 EKMSDVYSILQSIPQRQIEEM-----KRQARWFWEA 431
>gi|332836216|ref|XP_508383.3| PREDICTED: exostosin-2 isoform 3 [Pan troglodytes]
Length = 718
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 25/94 (26%), Positives = 55/94 (58%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL++I + ++Q+ +W
Sbjct: 368 EKMSDVYSILQSIP-------QRQIEEMQRQARW 394
>gi|123482097|ref|XP_001323700.1| Exostosin family protein [Trichomonas vaginalis G3]
gi|121906570|gb|EAY11477.1| Exostosin family protein [Trichomonas vaginalis G3]
Length = 346
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 64/290 (22%), Positives = 129/290 (44%), Gaps = 29/290 (10%)
Query: 111 GKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMTVIVKDYVDS 168
Y E QN+ +SR +P A LF++PI S + + K SY++ ++ Y+
Sbjct: 25 SNYIFEYIIHQNLVKSRTAVENPQDADLFYVPIYLSAYNLYKKKASYQS---VITPYL-- 79
Query: 169 LINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKDVAL 228
L N Y ++ + G DH F +++ E +P +I S + + + +++
Sbjct: 80 LDNSY-WYEKHGGVDHIFTQIYNLNSNLQE-LPSMISTG-----DISNEYSTMSPRELWR 132
Query: 229 PQVLQPFALPRGGRDVENRTVL-GFWAGHRNSKIRVILARVWENDTELDISNNRINRAIG 287
++ P++ + + R +L F+ H + +A+ + ++S R + I
Sbjct: 133 LTIV-PYSSSYPDNENQTRRILSAFFESHTSIYSTNQIAKSIRTNLIAELSQMRDSLTIA 191
Query: 288 PLVYQRRFYK----------TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFND 337
V + R + FC P G NS R D+I C+PV+LS+ LPF++
Sbjct: 192 KKVSKERATTNFDVVYLMSISDFCPSPHGDTPNSKRFFDAIKRRCIPVVLSDDVHLPFDE 251
Query: 338 IL-DWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQK--HFQW 384
+ D+ + + RD+ + I+ I ++E + + + ++ + +F W
Sbjct: 252 LFADYSGSLIQVPMRDIRSVPAIVGMIPESEKQRIRHRIDEISELLNFSW 301
>gi|46250443|gb|AAH68545.1| Exostoses (multiple) 2 [Homo sapiens]
Length = 718
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 25/94 (26%), Positives = 55/94 (58%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL++I + ++Q+ +W
Sbjct: 368 EKMSDVYSILQSIP-------QRQIEEMQRQARW 394
>gi|297688871|ref|XP_002821896.1| PREDICTED: exostosin-2 isoform 4 [Pongo abelii]
Length = 718
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 25/94 (26%), Positives = 55/94 (58%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL++I + ++Q+ +W
Sbjct: 368 EKMSDVYSILQSIP-------QRQIEEMQRQARW 394
>gi|426368045|ref|XP_004051025.1| PREDICTED: exostosin-2 isoform 1 [Gorilla gorilla gorilla]
Length = 718
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 25/94 (26%), Positives = 55/94 (58%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL++I + ++Q+ +W
Sbjct: 368 EKMSDVYSILQSIP-------QRQIEEMQRQARW 394
>gi|157818843|ref|NP_001101455.1| exostosin-like 1 [Rattus norvegicus]
gi|149024222|gb|EDL80719.1| exostoses (multiple)-like 1 (predicted) [Rattus norvegicus]
Length = 670
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 47/192 (24%), Positives = 80/192 (41%), Gaps = 23/192 (11%)
Query: 204 IKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVENR-------TVLGFWAGH 256
+ A+ SPS D+ F P DVALP + + L G R T++
Sbjct: 170 LGQAMVAEASPSLDI-FRPGFDVALPYLPEAHPLKGGAPGSLQRHGPQPGATLVAVAEER 228
Query: 257 RNSKIRVILARV--WENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARI 314
+I A W+ + E D + FC+ PG S +
Sbjct: 229 GRWRITTTHASACPWDRNCEQDPGPQQTYPG-------EILPNVTFCLIPGHRSATSCFL 281
Query: 315 ADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNN 374
++ GC+PV+LS ++LPF++++DW K A++ +ER Q+ L+ + + ++L
Sbjct: 282 -QALQAGCIPVLLSPRWELPFSEVIDWTKAAIVADERLPLQVLAALREMLPSRVLALRQ- 339
Query: 375 LVKVQKHFQWNS 386
Q F W +
Sbjct: 340 ----QTQFLWTA 347
>gi|20138161|sp|O77783.1|EXT2_BOVIN RecName: Full=Exostosin-2; AltName:
Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
acetylglucosaminyl-proteoglycan
4-alpha-N-acetylglucosaminyltransferase; AltName:
Full=HS-polymerase; Short=HS-POL; AltName: Full=Multiple
exostoses protein 2 homolog
gi|3599568|gb|AAC35386.1| glucuronyl/N-acetylglucosaminyl transferase [Bos taurus]
Length = 718
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 55/94 (58%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPVI+++ Y LPF+++LDW++ +V++ E
Sbjct: 308 YPQVLQEATFCMVLRGARLGQAVLSDVLRAGCVPVIIADSYVLPFSEVLDWKRASVVVPE 367
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL++I + ++Q+ +W
Sbjct: 368 EKMSDVYSILQSIP-------RRQIEEMQRQARW 394
>gi|440897880|gb|ELR49485.1| Exostosin-2, partial [Bos grunniens mutus]
Length = 731
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 55/94 (58%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPVI+++ Y LPF+++LDW++ +V++ E
Sbjct: 321 YPQVLQEATFCMVLRGARLGQAVLSDVLRAGCVPVIIADSYVLPFSEVLDWKRASVVVPE 380
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL++I + ++Q+ +W
Sbjct: 381 EKMSDVYSILQSIP-------RRQIEEMQRQARW 407
>gi|73982548|ref|XP_533196.2| PREDICTED: exostosin-2 isoform 2 [Canis lupus familiaris]
Length = 718
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 25/94 (26%), Positives = 55/94 (58%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLRAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL++I + ++Q+ +W
Sbjct: 368 EKMSDVYSILQSIP-------QRQIEEMQRQARW 394
>gi|297268065|ref|XP_001106468.2| PREDICTED: exostosin-2 [Macaca mulatta]
Length = 718
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 25/94 (26%), Positives = 55/94 (58%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAILSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL++I + ++Q+ +W
Sbjct: 368 EKMSDVYSILQSIP-------QRQIEEMQRQARW 394
>gi|117935049|ref|NP_803462.2| exostosin-2 [Bos taurus]
gi|117306659|gb|AAI26571.1| Exostoses (multiple) 2 [Bos taurus]
gi|296479641|tpg|DAA21756.1| TPA: exostosin-2 [Bos taurus]
Length = 718
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 55/94 (58%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPVI+++ Y LPF+++LDW++ +V++ E
Sbjct: 308 YPQVLQEATFCMVLRGARLGQAVLSDVLRAGCVPVIIADSYVLPFSEVLDWKRASVVVPE 367
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL++I + ++Q+ +W
Sbjct: 368 EKMSDVYSILQSIP-------RRQIEEMQRQARW 394
>gi|405969796|gb|EKC34747.1| Exostosin-2 [Crassostrea gigas]
Length = 698
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 22/67 (32%), Positives = 48/67 (71%)
Query: 298 TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLK 357
+KFC+ +++ ++D++ GC+PVI+++ Y LPF+++LDW++ AV++ E ++ +
Sbjct: 297 SKFCMVLRSARLGHTALSDALRTGCIPVIVADGYVLPFSEVLDWKRAAVVIREENLKDVV 356
Query: 358 QILKNIS 364
++LK+ S
Sbjct: 357 EVLKSYS 363
>gi|449533617|ref|XP_004173769.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
IRX7-like, partial [Cucumis sativus]
Length = 330
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 62/124 (50%), Gaps = 9/124 (7%)
Query: 277 ISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFN 336
+ N IN+A + +KFC+ G +S R+ DSI CVPVI+S+ +LP+
Sbjct: 182 VKGNGINKAGQGMA------SSKFCLNIAGDTPSSNRLFDSIASHCVPVIISDDIELPYE 235
Query: 337 DILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDA 393
DILD+ +F V + D + L +L+ I + + + + ++ F++ P DA
Sbjct: 236 DILDYSEFCVFVRAADSIRKGYLLNLLRGIGRERWTKMWDRIKEIVHEFEYQYPSQSGDA 295
Query: 394 FHMV 397
M+
Sbjct: 296 VDMI 299
>gi|355686834|gb|AER98200.1| exostoses 2 [Mustela putorius furo]
Length = 717
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 54/94 (57%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPVI+++ Y LPF+++LDW++ +V++ E
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAALSDVLRAGCVPVIVADSYVLPFSEVLDWKRASVVVPE 367
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL+ I + ++Q+ +W
Sbjct: 368 EKLSDVYGILQGIP-------RRQIEEMQRQARW 394
>gi|196004286|ref|XP_002112010.1| hypothetical protein TRIADDRAFT_24279 [Trichoplax adhaerens]
gi|190585909|gb|EDV25977.1| hypothetical protein TRIADDRAFT_24279 [Trichoplax adhaerens]
Length = 668
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 53/89 (59%), Gaps = 5/89 (5%)
Query: 298 TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLK 357
+ FC+ G ++ + D++ +GC+PV+LS+ Y LPF+++LDW++ A++ E + L
Sbjct: 258 STFCLMLRGYRLIQSNFLDALKFGCIPVVLSDEYILPFSEVLDWKRAALVFREDQLLSLP 317
Query: 358 QILKNISDAEFVSLHNNLVKVQKHFQWNS 386
+L +IS + HN ++ Q F W S
Sbjct: 318 AVLSSISTK---TRHN--LRKQGMFFWQS 341
>gi|384251773|gb|EIE25250.1| hypothetical protein COCSUDRAFT_40557 [Coccomyxa subellipsoidea
C-169]
Length = 280
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 29/102 (28%), Positives = 56/102 (54%), Gaps = 3/102 (2%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y F ++ FC+ G+ R+ ++ +GC+PVI+++ +PF D+L ++ FAV + E
Sbjct: 147 YMADFGRSTFCLAATGAGWG-VRLKLALMHGCIPVIIADNVQMPFEDVLPYQDFAVHVRE 205
Query: 351 RDVYQLKQILKNISDAEFV--SLHNNLVKVQKHFQWNSPPVR 390
+Y+L ++L I E + + N+ + ++F W P R
Sbjct: 206 HALYRLPEVLDAILSTEGLVKRMQINVSCIWRYFTWRDPQAR 247
>gi|335290723|ref|XP_003356260.1| PREDICTED: exostoses (multiple)-like 1 [Sus scrofa]
Length = 674
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 56/209 (26%), Positives = 88/209 (42%), Gaps = 51/209 (24%)
Query: 201 PFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQPFALP-RGGRDVENR------------ 247
PF + A+ P+ D F DVALP L P A P RGG + R
Sbjct: 170 PFQLGQAMVAQARPTVDT-FRAGFDVALP--LLPEAHPLRGGAPGQLRQHSPHPGMALLA 226
Query: 248 --TVLGFW--AGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCIC 303
G W AG +S A W+ E D + + FC+
Sbjct: 227 LAEERGGWHTAGTNSS------ACPWDGHCEQDHGPKQTHPGA-------TLPNATFCLI 273
Query: 304 PGGSQVNSARIADSIHY------GCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLK 357
PG R +D++H+ GC+PV+LS ++LPF++++DW K A++ +ER Q+
Sbjct: 274 PG-------RSSDALHFLQALQAGCIPVLLSPRWELPFSEVIDWTKAAIVADERLSLQVL 326
Query: 358 QILKNISDAEFVSLHNNLVKVQKHFQWNS 386
L+ + ++L + Q F W++
Sbjct: 327 TALQEMPLTRVLAL-----RQQTQFLWDA 350
>gi|417404013|gb|JAA48784.1| Putative glycosyl transferase family 64 domain protein [Desmodus
rotundus]
Length = 701
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 25/94 (26%), Positives = 55/94 (58%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 291 YPQVLQEATFCVVLRGARLGQAVLSDILRAGCVPVVIADSYILPFSEVLDWKRASVVVPE 350
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL++I + ++Q+ +W
Sbjct: 351 EKMSDVYGILQSIP-------QRQIGEMQRQARW 377
>gi|224284379|gb|ACN39924.1| unknown [Picea sitchensis]
Length = 787
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 74/333 (22%), Positives = 139/333 (41%), Gaps = 68/333 (20%)
Query: 113 YASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENMT--VIVKDYV-- 166
Y S+ ++++ S + T + + A F++P+ +C R + +M + ++ Y+
Sbjct: 381 YGSQMALYESLLASPYRTLNGEEADYFYVPVLDACLITRADDAPHLSMKNHMGLRSYLTL 440
Query: 167 -------DSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVV-------- 211
D ++ Y YWNR+ G DH + D G P I N++ +V
Sbjct: 441 DFYKKAYDHIMEHYTYWNRSSGHDHIWFFAWDEGACY---APKEIWNSMMLVHWGNTNSK 497
Query: 212 --------CSPSYD----------VAFIPHKDVALPQVLQP----FALPRGGRDVENRTV 249
+ ++D F P KD+ LP +P R R
Sbjct: 498 HNHSTTAYLADNWDHIPIERRGRHPCFDPEKDLVLPAWKRPDPYNVKARFWARSRRERFT 557
Query: 250 LGFWAGH-----RNSK--------IRVILARVWENDTELDISNNRINRAIGPLVYQR--- 293
L ++ G+ +N++ IR LA + ++ + R + +V Q+
Sbjct: 558 LFYFNGNLGASFKNNRPEPTYSLGIRQKLAAEFASEPNKEGKFGRQSTKDVIVVSQKSPN 617
Query: 294 ---RFYKTKFC-ICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
+ FC + PG S R+ DS+ GC+PVI+ + + + ++L++ FAV +
Sbjct: 618 YYSELGSSLFCGVFPGDGW--SGRMEDSVLQGCIPVIIQDGIQVAYENVLNYDSFAVRIA 675
Query: 350 ERDVYQLKQILKNISDAEFVSLHNNLVKVQKHF 382
E D+ L QIL+ I++ E N+ K+++ F
Sbjct: 676 EDDIPHLVQILRGINETELEFKLANVQKLRQRF 708
>gi|159466264|ref|XP_001691329.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158279301|gb|EDP05062.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 815
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 77/347 (22%), Positives = 133/347 (38%), Gaps = 81/347 (23%)
Query: 113 YASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKM------------RGKGTSYENM 158
Y E + + +S T DP+ A F++P+ SC G ++
Sbjct: 411 YGLEVLMHEMLLQSEHRTFDPEAADFFYVPVYGSCFIFPLHCYADGPWWYAPSGPRVMHV 470
Query: 159 TVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV-------------------RATE- 198
T ++ + D + +PYW+R G DH ++ HD G R ++
Sbjct: 471 TNMMLEVRDLIRKHFPYWDRRGGRDHIWLMTHDEGACYAPSDIYGSSIFLTHWGRRDSDH 530
Query: 199 --GVPFLIKNAIRVVCSPSYDVAFI----------PHKDVALPQVLQP---------FAL 237
F N + P D ++ P KD+ +P + P F
Sbjct: 531 KSNTAFTPDNYTQEYVHPMQDKGWLHLIEGHPCYTPGKDLIVPALKLPHHFSGSPLLFHP 590
Query: 238 PRGGRDVENRTVLGFWAG----HR----NSKIRVILARVWENDTELDISNNRINRAIG-- 287
PR R +L + G HR + IR L ++W++ N+ N IG
Sbjct: 591 PR------PRDILLYLRGDVGKHRLPNYSRGIRQRLYKLWKDHD----WQNKYNAMIGDG 640
Query: 288 ---PLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKF 344
P Y +KFC+ G SAR+ D++ +GCVPVI+ + F + LD+ F
Sbjct: 641 SDVPGGYSEHLASSKFCVVAPGDGW-SARLEDAVLHGCVPVIVMDNVSAVFEEQLDFNSF 699
Query: 345 AVILN--ERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPV 389
++ + E ++ L + LK++ + L + + + S P+
Sbjct: 700 SIRVGEAEAELAYLPERLKSVPPRILEGMQKKLRTIWHRYAYVSHPL 746
>gi|1619954|gb|AAB17006.1| multiple exostosis 2 protein [Mus musculus]
Length = 718
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 53/94 (56%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC +++ A ++D + GCVPV++++ Y LPF+++LDW+K +V++ E
Sbjct: 308 YPQVLQEATFCTVLRRARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKKASVVVPE 367
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL+NI + ++Q+ +W
Sbjct: 368 EKMSDVYSILQNIP-------QRQIEEMQRQARW 394
>gi|159470093|ref|XP_001693194.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158277452|gb|EDP03220.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 673
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 71/284 (25%), Positives = 113/284 (39%), Gaps = 53/284 (18%)
Query: 113 YASEGYFFQNIRESRFLTHDPDRAHLFFIP--ISCHKM-----------RGKGT----SY 155
YA++ + + S T DP+ A F++P SC +G G
Sbjct: 297 YAADTLLHELLLVSPHRTFDPEEADFFYVPHQASCLPFPIGNWADWPWFKGPGGPRIRQM 356
Query: 156 ENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGV---PFLIKNAIRVVC 212
NM + +D++D YP+W R G DH + HD G V + + R+
Sbjct: 357 LNMIMETRDWID---QHYPFWKRRGGRDHIWTFTHDEGACWAPNVLNTSIWLTHWGRMDP 413
Query: 213 SPSYDVAFIPHK-DVALPQVLQPFALP---------RGGRDVENRTVLGFWAGHRNSKIR 262
+ + AF+P + D QP R G+D+ + F R R
Sbjct: 414 DHTSNTAFVPDRYDRDFKSAYQPEGYRVHMQGHPCYRPGQDL---VIPAF---KRPDHYR 467
Query: 263 VI-LARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYG 321
LA EL P Y ++ FC+ G SAR+ D++ +G
Sbjct: 468 ASPLAAATSKPREL------------PGDYSDMLSRSLFCLVAAGDGW-SARLEDAVLHG 514
Query: 322 CVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISD 365
C+PVI+ + + F ILD F+V + E DV ++ +IL+ I +
Sbjct: 515 CIPVIIIDNVHVVFESILDIDSFSVRIAEADVDRILEILQAIPE 558
>gi|444707556|gb|ELW48821.1| Exostosin-2 [Tupaia chinensis]
Length = 740
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 25/94 (26%), Positives = 54/94 (57%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + + FC+ G+++ A ++D + GCVPVI+++ Y LPF+++LDW++ +V + E
Sbjct: 330 YPQVLQEATFCLVLRGARLGQAVLSDVLQAGCVPVIVADSYILPFSEVLDWKRASVAVPE 389
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL+++ + ++Q+ +W
Sbjct: 390 EKLSDVYSILQSVP-------QRQIEEMQRQARW 416
>gi|313235568|emb|CBY11023.1| unnamed protein product [Oikopleura dioica]
Length = 663
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 22/74 (29%), Positives = 48/74 (64%)
Query: 298 TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLK 357
+ FC+ P G ++ S R +++ CVPV+L++ + LPF++++DW + + E+ + +L
Sbjct: 239 STFCLVPRGRRLGSFRFIETLQQACVPVLLADDWVLPFSEVIDWERSTISWEEKLLLELG 298
Query: 358 QILKNISDAEFVSL 371
Q L+++S A+ + +
Sbjct: 299 QHLEDVSPADVLRM 312
>gi|395815565|ref|XP_003781296.1| PREDICTED: exostosin-2 [Otolemur garnettii]
Length = 718
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 24/94 (25%), Positives = 54/94 (57%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + FC+ G+++ A +++ + GCVPV++++ Y LPF+++LDW++ +V++ E
Sbjct: 308 YPQVLQDATFCVVLRGARLGQAVLSEVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+ + IL++I + ++Q+ +W
Sbjct: 368 EKMSDVYSILQSIP-------QRQIEEMQRQARW 394
>gi|159473595|ref|XP_001694919.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158276298|gb|EDP02071.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 674
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 36/128 (28%), Positives = 64/128 (50%), Gaps = 8/128 (6%)
Query: 258 NSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADS 317
+ +R L ++ N T D+S IN+ GP + R ++FC P G R+ +
Sbjct: 528 SGGVRQGLLALFGNTTRPDVS---INKGGGPSLMLR----SRFCFTPMGFGWG-VRLTQA 579
Query: 318 IHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVK 377
GCVPV++ ++ D+L + KF++ ++ ++Y+L +IL +I+ E SL L
Sbjct: 580 AMTGCVPVMVQDHVWPTLWDVLPYEKFSIRVSRHNLYRLFEILDSITAEELASLQAGLAH 639
Query: 378 VQKHFQWN 385
+ F W
Sbjct: 640 WHRAFVWQ 647
>gi|255076909|ref|XP_002502118.1| glycosyltransferase family 47 protein [Micromonas sp. RCC299]
gi|226517383|gb|ACO63376.1| glycosyltransferase family 47 protein [Micromonas sp. RCC299]
Length = 717
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 31/118 (26%), Positives = 59/118 (50%), Gaps = 8/118 (6%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y++ + + FCI P GS R+ + GC+PVI+ + P++D+L + +F+V + +
Sbjct: 493 YEKNYASSTFCIAPTGSGWGR-RMNLATQSGCIPVIVQDNIAAPYDDVLPYDEFSVRVAK 551
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS-------PPVRFDAFHMVMYEL 401
D+ ++ I+K I+ + + L + QW+S DAF ++M L
Sbjct: 552 ADIPKIPDIVKAITPEKLDRMRQQLACAARALQWSSILGSDFGEGGENDAFALLMLTL 609
>gi|402218829|gb|EJT98904.1| hypothetical protein DACRYDRAFT_110241 [Dacryopinax sp. DJM-731 SS1]
Length = 1153
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 24/85 (28%), Positives = 47/85 (55%)
Query: 298 TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLK 357
T +C P G + R+ D ++ GC+PV + + PF D+LDW K ++ + +D+ +++
Sbjct: 1020 TIWCPLPEGVTGWATRLEDVVYGGCIPVFVGHASQYPFYDMLDWSKLSIAIERKDLQRIE 1079
Query: 358 QILKNISDAEFVSLHNNLVKVQKHF 382
++L + + E NL+ V+ F
Sbjct: 1080 EVLMSYTMEEIERFQTNLMLVRDAF 1104
>gi|9454585|gb|AAF87908.1|AC015447_18 Hypothetical protein [Arabidopsis thaliana]
Length = 414
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 75/337 (22%), Positives = 125/337 (37%), Gaps = 72/337 (21%)
Query: 88 NFKIYIYP----DGDPNTFY------QTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAH 137
+ KIY+Y DG Y +T L G++ S+ + + ES+F T D A
Sbjct: 88 SLKIYVYDENEIDGLKELLYGRDGSVKTTACLKGQWGSQVKIHKLLLESKFRTIKKDEAD 147
Query: 138 LFFIP--ISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVR 195
LFF+P + C +M G E VK ++++ PY+ R+ G DH FV G
Sbjct: 148 LFFVPAYVKCVRMLGGLNDKEINQTYVK-----VLSQMPYFRRSGGRDHIFVFPSGAGAH 202
Query: 196 ATEGVPFLIKNAIRVVCSPSYD-------VAFIPHKDVALPQVLQPFALPRGGRDVENRT 248
I +I + +P D AF KD+ +P + G DV+
Sbjct: 203 LFRSWSTFINRSI--ILTPEADRTDKKDTTAFNSWKDIIIPGNVDDAMTKNGQPDVQ--- 257
Query: 249 VLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQ 308
PL +R Y + G
Sbjct: 258 ---------------------------------------PLPLSKRKYLANYLGRAQGKA 278
Query: 309 VNSARIADSIHY---GCVPVILSNYYDLPFNDILDWRKFAVIL-NERDVYQLKQILKNIS 364
I S + CVPV+LS++ +LPF +++D+ + ++ + R + L +IS
Sbjct: 279 GRLKLIDLSKQFPDKECVPVLLSDHAELPFQNVIDYAQVSIKWPSTRIGSEFLDYLASIS 338
Query: 365 DAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYEL 401
D + + K++ F + A +++EL
Sbjct: 339 DRDIEGMIARGRKIRCLFVYGPDSAPCSAVKGILWEL 375
>gi|168023962|ref|XP_001764506.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162684370|gb|EDQ70773.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 438
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 60/275 (21%), Positives = 118/275 (42%), Gaps = 46/275 (16%)
Query: 118 YFFQNIRESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWN 177
YF + S + T DP++A L+F+ + + + SL PYWN
Sbjct: 123 YFKDALERSPYYTADPEKACLYFVTVDRRAEK----------------IPSL-PTLPYWN 165
Query: 178 RTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPSYDVAFIPHKD--VALPQVLQPF 235
G +H V+ D V+ + P + S ++ ++ D VALPQ + +
Sbjct: 166 H--GLNHVIVSISDFWVQ-RKATPVETIEMASTMTSIAHHASYRAGFDISVALPQK-RSY 221
Query: 236 ALPRGGRDVENRTVLGF----WAGHRNSKIR-----------VILARVWEND--TELDIS 278
+ +G + E + L F + G S+I VI+A + D +E +S
Sbjct: 222 SDVQGLKAFERKYFLTFTGMQFLGSSGSRINPVLRSMHNGEDVIIAITCKQDPNSEALLS 281
Query: 279 NNRI------NRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYD 332
+ ++++ R + F + G +S+R+ + + G +PV++S+ +
Sbjct: 282 RPELRAECVQDQSVYEQYTTRELMDSTFGLVQAGRGSSSSRLLEVLSAGSIPVVISDNFV 341
Query: 333 LPFNDILDWRKFAVILNERDVYQLKQILKNISDAE 367
LPF +LDWR+ ++ + ++ + L+++S E
Sbjct: 342 LPFESLLDWRRCLLVFPSSQMQRIVRTLRSLSKGE 376
>gi|356534007|ref|XP_003535549.1| PREDICTED: LOW QUALITY PROTEIN: xylogalacturonan
beta-1,3-xylosyltransferase-like [Glycine max]
Length = 373
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 71/277 (25%), Positives = 117/277 (42%), Gaps = 28/277 (10%)
Query: 132 DPDRAHLFFIPISCHKMRGKGT---SYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVT 188
DP A + F+P KG + N + V + WNR+ G DH FV
Sbjct: 51 DPLLADVVFVPFFATLSANKGAFRKKHGNDDYKRQRQVVDAVKSTQVWNRSGGRDHVFVL 110
Query: 189 ----CHDVGVRATEGVPFLIKNAIRVV--CSPSYDVAFIPHKDVA-LPQVLQPFA--LPR 239
C + G F + C S DV +PH V+ + V+ P+ LPR
Sbjct: 111 TGAFCKNPSFSFVPGGDFGGWSRGGGGSNCGES-DV--VPHTQVSVIKDVIVPYMHLLPR 167
Query: 240 GGRDVENRTV---LGFWAG----HRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQ 292
D+ V L ++ G HR IR L + ++ + + N A G
Sbjct: 168 --LDLSENKVRHQLLYFKGAKHRHRGGIIREKLWDLLVSEPGVIMEEGFPN-ATGREQSI 224
Query: 293 RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAV---ILN 349
+ ++FC+ P G S R+ D+I C+PVI+S+ +LPF ++D+ +F+V + +
Sbjct: 225 KGMRTSEFCLHPAGDTPTSCRLFDAIQSLCIPVIVSDIIELPFEGMVDYAEFSVFPAVND 284
Query: 350 ERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
R L L++ S + N+ +VQ F +++
Sbjct: 285 ARKPSWLGNHLQSFSKEQKDRFRQNMAQVQPIFVYDN 321
>gi|159478058|ref|XP_001697121.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158274595|gb|EDP00376.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 802
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 60/124 (48%), Gaps = 9/124 (7%)
Query: 269 WENDTELDISNNRINRAIGPLVYQRRFYKTKFC-ICPGGSQVNSARIADSIHYGCVPVIL 327
W ++ I N + R GP Y +++FC + PG S R D++ +GC+PV++
Sbjct: 621 WAAKHKIYIGNGEMVR--GP--YSEHLLRSRFCLVLPGDGW--SPRAEDAVLHGCIPVVI 674
Query: 328 SNYYDLPFNDILDWRKFAVILNERD--VYQLKQILKNISDAEFVSLHNNLVKVQKHFQWN 385
+ F ILDW F++ + E D + L Q+L+ + + NL +V F +
Sbjct: 675 MDNVHAVFESILDWESFSIRIREDDAALEALPQLLEAVPPERVAKMQRNLARVWHRFAYA 734
Query: 386 SPPV 389
+ PV
Sbjct: 735 TGPV 738
Score = 39.3 bits (90), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 24/96 (25%), Positives = 44/96 (45%), Gaps = 15/96 (15%)
Query: 114 ASEGYFFQNIRESRFLTHDPDRAHLFFIP--ISCHKMRGKGTS-----YENMTVIVKDYV 166
A E Y + + +S T DPD A F++P ++C+ G + Y + +V
Sbjct: 400 AVEAYLHETLLQSEHRTFDPDEADFFYVPVYVTCYMWPILGWADGPWWYAPLAHTRTMHV 459
Query: 167 DSLINK--------YPYWNRTLGADHFFVTCHDVGV 194
+++++ +P+WNR G DH ++ D G
Sbjct: 460 SNMLSEVHAHISSTFPWWNRRGGRDHIWLMAADEGA 495
>gi|334328334|ref|XP_001369230.2| PREDICTED: LOW QUALITY PROTEIN: exostosin-like 1-like [Monodelphis
domestica]
Length = 674
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 66/299 (22%), Positives = 118/299 (39%), Gaps = 51/299 (17%)
Query: 88 NFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPISCHK 147
FK+++YP G + K+ GY T +P+ A LF +P
Sbjct: 99 GFKVFVYPVGLTQPISEIHHKVLASIEGSGYH----------TANPEEACLFVLP----- 143
Query: 148 MRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNA 207
G S + + P WN G +H H G G F + A
Sbjct: 144 --GDALSPQLL---------------PLWNG--GQNHLIFHLHP-GSWPDLG--FDLGQA 181
Query: 208 IRVVCSPSYDVAFIPHKDVALPQVLQPFALPRGGRDVENRTVLGFWAGHRNSKIRVILAR 267
+ SP + + DVALP Q + RGG +R L + ++
Sbjct: 182 MLARASPKPETLRL-GFDVALPLFPQSYPQ-RGG----SRGQLAHLIPPPGEDLLAFESQ 235
Query: 268 VWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVIL 327
+ D + S+N ++++ ++ + G N S GC+PV+L
Sbjct: 236 L---DARVTGSSNHLDQSTILIISDHHVMSEESGPRALGFSDNILLSCFSFQAGCIPVLL 292
Query: 328 SNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
S+ ++LPF++++DW ++++ER + Q+K L+ + A ++L + Q F W++
Sbjct: 293 SSGWELPFSEVIDWGTATIMIDERHLLQIKPALQELPPARILAL-----RQQTQFLWDA 346
>gi|159478515|ref|XP_001697348.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158274506|gb|EDP00288.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 620
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 70/293 (23%), Positives = 113/293 (38%), Gaps = 51/293 (17%)
Query: 130 THDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTC 189
T +P+ A FF+P C RG + I++ Y YWNR G DH
Sbjct: 194 TVNPEEADYFFLP-GCG--RGCNKWDQKFKYIMEHYAK-------YWNRRNGRDHLMTHA 243
Query: 190 HDVG----VRATEGVPFLIK----NAIRVVCSPS-------YDVAFIPHKDVALP----- 229
D G + PF+ N + S ++ IP +D+ +P
Sbjct: 244 GDWGRCEKAWGPDSAPFIANLTMLNHWGITVDRSQETEHDLFNACHIPDQDIQVPVLCGD 303
Query: 230 ---QVLQPFALPRGGRDVENRTVLGFWAGH---RNSKIRV----------ILARVWENDT 273
Q P+ N+TVL AG NS + A +W T
Sbjct: 304 LYPQFEHNVWHPKRRAHPVNKTVLASVAGSICGWNSAEEPPCKNKYYSFGVRAALWT--T 361
Query: 274 ELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDL 333
D I + + P++ Q +++FC P G+ + R S+ GC+PVI+S++
Sbjct: 362 LRDKPGFHIAKRV-PVLGQS-MAESEFCFAPTGAG-HGKRQVVSVTLGCMPVIISDHVAQ 418
Query: 334 PFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
PF LDW F V + E D+ ++ IL+ + + + L +H + +
Sbjct: 419 PFEPFLDWNDFGVWIAEADLPDVEAILRGFTPQQKAAKMKKLYCAARHVAYTT 471
>gi|390332053|ref|XP_003723408.1| PREDICTED: LOW QUALITY PROTEIN: exostosin-2-like
[Strongylocentrotus purpuratus]
Length = 707
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 25/82 (30%), Positives = 48/82 (58%), Gaps = 3/82 (3%)
Query: 283 NRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWR 342
N+A G Y FCI +++ A ++D++ GC+PVI+S+ Y LPF++++DW+
Sbjct: 292 NKAFG---YPHILQDATFCIVLRRTRLGQAALSDALQAGCIPVIISDAYILPFSEVIDWK 348
Query: 343 KFAVILNERDVYQLKQILKNIS 364
+ ++++ E + L IL +
Sbjct: 349 RASLVVREDRIPDLPDILHAVE 370
>gi|449502905|ref|XP_004161776.1| PREDICTED: probable glucuronosyltransferase Os01g0926700-like
[Cucumis sativus]
Length = 482
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 70/311 (22%), Positives = 127/311 (40%), Gaps = 41/311 (13%)
Query: 109 LTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP----ISCHKMRGKGTSYENMTVIVKD 164
+T + +G F + + E+ + A + F+P +S G V +D
Sbjct: 128 MTPQEQRDGSFAKRVFEA-------EEADVIFVPFFATMSAEMQLGMAKGAFRKKVGNED 180
Query: 165 Y-----VDSLINKYPYWNRTLGADHFFVTCHDV-GVRATEGVPFLIKNAIRVVC------ 212
Y V + W ++ G DH + H + A V I A+ +V
Sbjct: 181 YERQRNVMDFLKSTDAWKKSGGRDHVLFSLHSLTDPVAMWHVKAEIAPAVLLVVDFGGWF 240
Query: 213 ------SPSYDVAFIPHKDVA-LPQVLQPFA--LPRGGRDV-ENRTVLGFWAG----HRN 258
S I H V+ L V+ P+ LPR + R L ++ G HR
Sbjct: 241 RLDTKSSNGSSPDMIQHTQVSVLKDVIVPYTHLLPRLHLSANKKRQTLLYFKGAKHRHRG 300
Query: 259 SKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSI 318
+R L + N+ ++ + N A G + ++FC+ P G S R+ D+I
Sbjct: 301 GLVREKLWDLLVNEPDVIMEEGFPN-ATGKEQSIKGMRSSEFCLHPAGDTPTSCRLFDAI 359
Query: 319 HYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ---LKQILKNISDAEFVSLHNNL 375
C+PV++S+ +LPF D++D+ +F+V + D + L + L+ I + + +
Sbjct: 360 QSLCIPVVVSDNIELPFEDMVDYSEFSVFVAVNDALKPNWLVKHLRTIPEEQRNRFRLYM 419
Query: 376 VKVQKHFQWNS 386
+VQ F++ +
Sbjct: 420 ARVQSVFEYEN 430
>gi|242085256|ref|XP_002443053.1| hypothetical protein SORBIDRAFT_08g007160 [Sorghum bicolor]
gi|241943746|gb|EES16891.1| hypothetical protein SORBIDRAFT_08g007160 [Sorghum bicolor]
Length = 523
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 82/376 (21%), Positives = 134/376 (35%), Gaps = 74/376 (19%)
Query: 96 DGDPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP----ISCHK-MRG 150
DGD + T T +Y+ E F +R LT DP A ++P + H+ + G
Sbjct: 142 DGDTDIIPSTGWYSTDQYSLEVIFHNWMRRYECLTDDPAAATAVYVPYYPALELHQHLCG 201
Query: 151 KGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVP--------- 201
T+ + + ++ P W G DHF V + E
Sbjct: 202 FNTTVRDGP---SEAFLRWLSSQPTWAALGGRDHFMVAAKTTWMFRREPGAGGGDEGCGN 258
Query: 202 -FLIKNAIRVVCSPSYDVAFIPHKDVALP-----------QVLQPFALPRGGRDVENRTV 249
FL + R + +Y+ +D+A+P +V A RG R
Sbjct: 259 NFLGQPESRNMTVLTYESNIWAPRDIAVPYPSYFHPSSAGEVAAWQARARGA----PRPF 314
Query: 250 LGFWAGHRNSKIRVILARVWENDTELDISNNRINRAI-----------------GPLVYQ 292
L +AG R K ++ + D D+ + R P
Sbjct: 315 LFAFAGARRMKGQLAI-----RDRVFDVCDAAARRGRCGMLDCSHGLEGSITCRSPRKLV 369
Query: 293 RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDL----------PFNDILDWR 342
F +FC+ P G DS+ GC+PV L P D R
Sbjct: 370 SLFTSARFCLQPRGDSFARRSSIDSVMAGCIPVFFHRPSTLKAQYRWHEPEPGRSDGDGR 429
Query: 343 KFAVILNERDVYQ----LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRF-----DA 393
++ V++N +DV + +++ L +D E ++ ++K+ F + P VRF DA
Sbjct: 430 RYYVLINSKDVLEGRVDIEEELSRYTDEEVAAMREEVIKMMPRFLYKDPRVRFEGEMRDA 489
Query: 394 FHMVMYELWLRRHVIK 409
F + + E+ R IK
Sbjct: 490 FDITIDEMIARMRRIK 505
>gi|323456573|gb|EGB12440.1| hypothetical protein AURANDRAFT_70660 [Aureococcus anophagefferens]
Length = 1731
Score = 57.8 bits (138), Expect = 9e-06, Method: Composition-based stats.
Identities = 29/94 (30%), Positives = 51/94 (54%), Gaps = 3/94 (3%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + K+C+ G S R+++++ GCVP LS P+ +LDW F+V + E
Sbjct: 1514 YTESMRRAKYCVVTEGFSPWSPRLSEAVALGCVPCFLSPSLAPPYATVLDWSAFSVEIAE 1573
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
DV +L ++L + ++ LH NL++V+ F +
Sbjct: 1574 ADVGRLPEVL---AAYDWAYLHANLLRVRPLFAF 1604
>gi|115461230|ref|NP_001054215.1| Os04g0670600 [Oryza sativa Japonica Group]
gi|113565786|dbj|BAF16129.1| Os04g0670600, partial [Oryza sativa Japonica Group]
Length = 275
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 37/149 (24%), Positives = 75/149 (50%), Gaps = 8/149 (5%)
Query: 245 ENRTVLGFWAG----HRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKF 300
++R L ++ G HR +R L + N+ ++ + N A G + ++F
Sbjct: 76 KDRPTLLYFKGAKHRHRGGLVREKLWDLMVNEPDVVMEEGYPN-ATGREQSIKGMRTSEF 134
Query: 301 CICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQI- 359
C+ P G S R+ D++ C+PVI+S+ +LPF ++D+ +FA+ ++ + + K +
Sbjct: 135 CLHPAGDTPTSCRLFDAVASLCIPVIVSDEIELPFEGMIDYTEFAIFVSVNNSMRPKWLT 194
Query: 360 --LKNISDAEFVSLHNNLVKVQKHFQWNS 386
L+N+ + N+ VQ F+++S
Sbjct: 195 NYLRNVPRQQKDEFRRNMAHVQPIFEYDS 223
>gi|296088686|emb|CBI38136.3| unnamed protein product [Vitis vinifera]
Length = 291
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 65/236 (27%), Positives = 101/236 (42%), Gaps = 31/236 (13%)
Query: 176 WNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVV------------CSPSYDVAFIPH 223
W R+ G DH FV V A V I AI +V S + I H
Sbjct: 10 WKRSGGRDHVFVLTDPV---AMWHVKAEIAPAILLVVDFGGWYKLDSKASNNSLSEMIQH 66
Query: 224 KDVAL-PQVLQPFA--LPRGG-RDVENRTVLGFWAG----HRNSKIRVIL--ARVWENDT 273
V+L V+ P+ LPR + + R L ++ G HR +R L V+E
Sbjct: 67 TQVSLLKDVIVPYTHLLPRLHLSENQIRQTLLYFKGAKHRHRGGLVREKLWDLLVYEQGV 126
Query: 274 ELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDL 333
I A G + ++FC+ P G S R+ D+I C+PVI+S+ +L
Sbjct: 127 ---IMEEGFPNATGREQSIKGMRTSEFCLHPAGDTPTSCRLFDAIQSLCIPVIVSDNIEL 183
Query: 334 PFNDILDWRKFAVILNERDVY---QLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
PF ++D+ +F+V + RD L L++ S + N+ +VQ FQ+++
Sbjct: 184 PFEGMVDYSEFSVFVAVRDSLLPNWLVSHLRSFSKGQRDRFRQNMARVQPIFQYDN 239
>gi|412988024|emb|CCO19420.1| exostosin-like glycosyltransferase [Bathycoccus prasinos]
Length = 909
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 96/409 (23%), Positives = 156/409 (38%), Gaps = 106/409 (25%)
Query: 71 VFHSPSVF------HLNYLEMVTNFKIYIYPD---------GDPNTFYQTPRKLTGKYAS 115
V+ PS F ++N E V + Y P+ P F + P K Y+
Sbjct: 448 VYEMPSKFTSHWTKYVNRGEFVCGDRFYEKPNDIDNDNDDDASPPPFPERPSKWF--YSL 505
Query: 116 EGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTS------------------- 154
E + + S T +P+ A +FFIP +C+++ + S
Sbjct: 506 ENTLHEFLLRSAHRTINPENADVFFIPQYGTCYRLAYQTPSPQVSLSLIKTKPGDRSHAA 565
Query: 155 ---YENMTVIVKDYVDSLINK-----YPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKN 206
E +T V++ ++IN Y++R G DH + +D G P I N
Sbjct: 566 NLFLERVTEYVRNIPFNVINNEKGEIQSYFDRNEGRDHAVIAAYDEGAVH---FPDSIAN 622
Query: 207 AIRVV-------CSPSYDVAFIP---------------------HKDVALPQVLQPFALP 238
AI + S A+ P +KD+ P QP
Sbjct: 623 AIFITHWGNTGYPRNSSHTAYSPDKWDELVKQGVVTGAWRAYNRNKDIVAPPWSQPKTNE 682
Query: 239 -RGGRDVEN------RTVLGF----------WAGHRNSKIRVILARVWENDTELDISNNR 281
R DV + RT F W + +R +AR W+N DI ++
Sbjct: 683 VREPADVNSWTPATQRTTFCFFSGNLGLEKPWGEDYSRGLRQKVARRWQNVYGFDILSHT 742
Query: 282 INRAIGPLVYQRRFYKTKFCIC-PGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDI-L 339
+ Y R +KFC+ PG S ++ I GC+PVI+ + D+P+ L
Sbjct: 743 DD-------YLGRIRSSKFCLALPGDGW--SGGLSVYIRNGCIPVIVQDGVDMPWEGTFL 793
Query: 340 DWRKFAVILNERDVY-QLKQILKNISDAEFVSLHNNLVKVQKHFQWNSP 387
D+ KF++ + E DV +L+ +L+ ++ E +L N L V F ++ P
Sbjct: 794 DYSKFSIRVREGDVENRLQSVLETVTPEELQNLQNGLKNVWHFFSYDVP 842
>gi|215686890|dbj|BAG89740.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215693850|dbj|BAG89049.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 264
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 37/149 (24%), Positives = 75/149 (50%), Gaps = 8/149 (5%)
Query: 245 ENRTVLGFWAG----HRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKF 300
++R L ++ G HR +R L + N+ ++ + N A G + ++F
Sbjct: 65 KDRPTLLYFKGAKHRHRGGLVREKLWDLMVNEPDVVMEEGYPN-ATGREQSIKGMRTSEF 123
Query: 301 CICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQI- 359
C+ P G S R+ D++ C+PVI+S+ +LPF ++D+ +FA+ ++ + + K +
Sbjct: 124 CLHPAGDTPTSCRLFDAVASLCIPVIVSDEIELPFEGMIDYTEFAIFVSVNNSMRPKWLT 183
Query: 360 --LKNISDAEFVSLHNNLVKVQKHFQWNS 386
L+N+ + N+ VQ F+++S
Sbjct: 184 NYLRNVPRQQKDEFRRNMAHVQPIFEYDS 212
>gi|391348729|ref|XP_003748596.1| PREDICTED: exostosin-like 3-like [Metaseiulus occidentalis]
Length = 868
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 43/68 (63%)
Query: 305 GGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNIS 364
S V + R+ + +G +PVIL +Y LPF+D++ W K AVIL + +L+ +LK+ S
Sbjct: 394 ASSDVFTRRLMKIMRHGAIPVILGDYIGLPFDDLVHWEKAAVILAKAHASELQFVLKSYS 453
Query: 365 DAEFVSLH 372
DA+ V+L
Sbjct: 454 DADIVALR 461
>gi|347963135|ref|XP_311070.5| AGAP000081-PA [Anopheles gambiae str. PEST]
gi|333467342|gb|EAA06302.6| AGAP000081-PA [Anopheles gambiae str. PEST]
Length = 761
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 23/83 (27%), Positives = 46/83 (55%)
Query: 300 FCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQI 359
FC+ G ++ + +++ GC+PV++++ Y LPF D+LDW AV L E +++ + +
Sbjct: 362 FCLVARGVRLGQPALLEAMAAGCIPVVMADNYVLPFADLLDWELLAVRLPEANLHTIVPV 421
Query: 360 LKNISDAEFVSLHNNLVKVQKHF 382
L+ IS + + V + +
Sbjct: 422 LRAISAERVAEMQAQIRSVYRRY 444
>gi|159489064|ref|XP_001702517.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158280539|gb|EDP06296.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 489
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 70/316 (22%), Positives = 126/316 (39%), Gaps = 55/316 (17%)
Query: 119 FFQNIRESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNR 178
Q + S T D D A +FIP+ MR + + ++ +V YV +P+W+R
Sbjct: 147 LLQRMLSSGVRTTDGDSADYYFIPL---LMRTRTHTVNHLAAVVH-YVR---KHWPWWDR 199
Query: 179 TLGAD-HFFVTCHDVGVR--------ATEGVPFLIKNAI-RVVCSPSYDVAFIPHKDVAL 228
T G H V D+G R TE FL + R ++ + P KD+ +
Sbjct: 200 TGGGHRHLLVAPGDIGRRILTPELLHMTENCTFLTHWGLHRNHSGGNWLASHRPGKDIVV 259
Query: 229 PQVLQP-----FALPRGGRDVENRTVLG--FWAG------HRNSKIRVILARV-WENDTE 274
P + P ++ ++ LG F+AG + + + R + T
Sbjct: 260 PPLTPPDEPIVYSPLHATLKKNRKSRLGELFFAGRICGDNQKPTDGKCSEKRQDYSAGTR 319
Query: 275 LDISNNRINRAIGPLVYQRRFYKTK-----FCICPGGS-QVNSARI-------------- 314
++++ NR + Y FC+ P G+ +++ AR+
Sbjct: 320 QQVAHHHWNRPNWTITTHTPAYAEALSTHIFCLSPTGTARLSHARLDLSTLQRPGGGYGR 379
Query: 315 --ADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNI--SDAEFVS 370
S+ GC+PV ++++ PF +DW +F+V + E D+ QL +L + S
Sbjct: 380 RSVQSLLMGCIPVTVTDHVHQPFEPEVDWARFSVPVREDDIAQLHHVLTGLRASPHTLAQ 439
Query: 371 LHNNLVKVQKHFQWNS 386
+ L +H ++S
Sbjct: 440 MQVRLRCAAQHMYYSS 455
>gi|256088012|ref|XP_002580154.1| exostosin-1 [Schistosoma mansoni]
gi|353230126|emb|CCD76297.1| putative exostosin-1 [Schistosoma mansoni]
Length = 766
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 64/321 (19%), Positives = 115/321 (35%), Gaps = 77/321 (23%)
Query: 81 NYLEMVTNFKIYIYPDGDPNTFYQTPRKLTGKYASEGYFFQNI----RESRFLTHDPDRA 136
+Y + FK+Y+YP L+ K A+ +Q I S+ LT DP+ A
Sbjct: 87 DYTPCQSEFKVYVYP-------------LSSKQANLSLTYQKILTALHNSKLLTSDPNEA 133
Query: 137 HLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWN--------RTLGADHFFVT 188
+F + + ++ Y+ + P+WN + G +H
Sbjct: 134 CIFIPSLDT-------LDRDRLSPHFGQYIAHELVNLPFWNSLPRRDLDKYAGRNHLIFN 186
Query: 189 CHDVGVRATEGVPFLIKNAIRV------VCSPSYDVA-FIPHKDVALPQVLQPFALPRGG 241
H P+ ++ R+ + S+ F P D++LP + L G
Sbjct: 187 LH------AGTWPYYYEDEYRLWLGQAMLAKASFSTKHFRPKFDISLPLIHSQHPLQSGS 240
Query: 242 RDVEN----------------------RTVLGFWAGHR-------NSKIRVILARVWEND 272
+ R V G + R N K ++L
Sbjct: 241 SQLNQLVSSEHLRGRLDLPYLLSFKGKRYVSGIGSASRDILFHLHNGKDIIMLTTCRHGT 300
Query: 273 TELDISNNRINRAIG---PLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSN 329
++ R + Y Y + FC+ P G ++ S R + + GC+PV+LSN
Sbjct: 301 DWTRYADKRCATDMALYDAYDYWELMYNSTFCLVPRGRRLGSYRFLEVLQAGCIPVMLSN 360
Query: 330 YYDLPFNDILDWRKFAVILNE 350
+LPF++++DW + + +E
Sbjct: 361 DLELPFSEVIDWNRAVIWADE 381
>gi|123504861|ref|XP_001328850.1| Exostosin family protein [Trichomonas vaginalis G3]
gi|121911798|gb|EAY16627.1| Exostosin family protein [Trichomonas vaginalis G3]
Length = 325
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/125 (23%), Positives = 64/125 (51%), Gaps = 3/125 (2%)
Query: 290 VYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDI-LDWRKFAVIL 348
V + + FCI P G S R+ D+I + C+P+I+++Y LPF+ +++ + + +
Sbjct: 184 VLPTKMGSSDFCIIPPGDAPTSKRLYDAISHLCIPIIVADYMTLPFDGTSINYTECVIQI 243
Query: 349 NERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWN--SPPVRFDAFHMVMYELWLRRH 406
+D+ ++ ++ N + + L V++ F W+ +PP AF + L+ +
Sbjct: 244 PSKDIEKIPDLVNNFDKNKIKEMRKKLEIVREMFIWDYKNPPNAGQAFWNFAWNLYYKSE 303
Query: 407 VIKYF 411
++K +
Sbjct: 304 MMKPY 308
>gi|297851878|ref|XP_002893820.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297339662|gb|EFH70079.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 478
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 51/92 (55%), Gaps = 3/92 (3%)
Query: 298 TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLK 357
++FC+ P G S R+ D+I C+PVI+S+ +LPF I+D+ +F+V + D K
Sbjct: 335 SEFCLHPAGDTPTSCRLFDAIQSLCIPVIVSDTIELPFEGIIDYSEFSVFVPVSDALTPK 394
Query: 358 QI---LKNISDAEFVSLHNNLVKVQKHFQWNS 386
+ L+ S+ E + + KVQ F +++
Sbjct: 395 WLANHLRRFSEREKETFRGRMAKVQTVFVYDN 426
>gi|302851968|ref|XP_002957506.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300257148|gb|EFJ41400.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 706
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 50/196 (25%), Positives = 88/196 (44%), Gaps = 29/196 (14%)
Query: 199 GVPFLIKNAIRVVCSPSYDVAFIPHKDVALPQVLQ---------PFALPRGGRDV---EN 246
G P A+R+ P F P +DVA+P L+ +A R G V +
Sbjct: 433 GFPPYTLEALRMEREP----CFRPEQDVAVPNYLERGWIGRLQEAYAYDREGNAVHRDKQ 488
Query: 247 RTVLGFWAGHR------NSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKF 300
R L ++ G+ + +R L ++ N T D++ IN Y +++F
Sbjct: 489 RPYLFYFNGYSKPDMAYSGGVRQGLLSMYHNLTRGDVA---INPGCCTAEY---MLQSRF 542
Query: 301 CICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQIL 360
C+CP G R+ ++ GCVPVI+ ++ F D+L + KF+V +N ++++L +L
Sbjct: 543 CLCPLGYGW-GIRLTQAMQSGCVPVIVQDHTYSAFWDLLPYEKFSVRINRHNLHRLFDLL 601
Query: 361 KNISDAEFVSLHNNLV 376
++ + L L
Sbjct: 602 DAVTPEQLKDLQKGLA 617
>gi|297841569|ref|XP_002888666.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297334507|gb|EFH64925.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 457
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 81/405 (20%), Positives = 160/405 (39%), Gaps = 60/405 (14%)
Query: 18 SLRSSLLTLALVTLLCFTCLSFNSFRSYPLQNNFNSTLSFAINKQDYSDLGSDVFHSPSV 77
+++ +LTL L +CF L + +P F ++ I ++ Y + V+ P
Sbjct: 30 TIKLIVLTLLLSFSICFLFLILH----FPFTTEFTAS----IPRKCYHNFTVYVYDLPKE 81
Query: 78 FHLNYLEMVTNFKIY------IYPDGDPNTFYQTPRK---LTGKYASEGYFFQNIRESRF 128
F++ L+ + IY + +G Y++ R T ++ +E F ++
Sbjct: 82 FNIGILQNCRHLNIYTNMCPHVANNGLGQPLYRSGRTSWFATHQFIAEMIFHARVKNHPC 141
Query: 129 LTHDPDRAHLFFIPI-----SCHKMRGKG-TSYENMTVIVKDYVDSLINKYPYWNRTLGA 182
T +P+ A +F++P + R + T+ + + V + DY I+ +W R+ G
Sbjct: 142 RTCEPNNADIFYVPFYGGLYASSVFREQNLTNRDELAVRLVDY----ISGQRWWKRSNGR 197
Query: 183 DHFFVTCHDVG--VRATEGVPF---LIKNAIRVVCSPSYDVAFIP-----HKDVALPQVL 232
DHF +R+++ F ++ RV V P H + P
Sbjct: 198 DHFLAIGRTAWDFMRSSDTDDFGANMLMQMPRVKNMSVLTVERQPWKGDNHFGIPYPSYF 257
Query: 233 QPFALPR------GGRDVENRTVLGFWAGHRNS------KIRVILARVWENDTELDISNN 280
P+ R V+ + F G R + ++I + EL N
Sbjct: 258 HPYTSAEMVTWQDKMRRVDRPNLFSFVGGPRKGLEKAAIRDKLIKQCAESSHCELLKCEN 317
Query: 281 RINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNY-------YDL 333
+R P+ +++FC+ G D++ GC+PV S + + L
Sbjct: 318 GGSRCHDPMTVLGVMARSRFCLQAPGDSYTRRSTFDAMLAGCIPVFFSPHTMYTQYLWYL 377
Query: 334 PFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKV 378
P D R ++V ++E++ ++Q L IS++E V + ++ +
Sbjct: 378 P----DDKRSYSVFMDEKNNTHIEQELLRISESEVVQMRETVIDL 418
>gi|242020116|ref|XP_002430502.1| Exostosin-2, putative [Pediculus humanus corporis]
gi|212515659|gb|EEB17764.1| Exostosin-2, putative [Pediculus humanus corporis]
Length = 696
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 45/75 (60%)
Query: 300 FCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQI 359
FCI G+++ + +S+ GC+PVI ++ LPF D++DW++ ++ + E D+ L +
Sbjct: 297 FCIIIRGARLAQQALLESLSAGCIPVIAADLMVLPFQDVIDWKRASITILESDLSSLIEK 356
Query: 360 LKNISDAEFVSLHNN 374
L ++SD + + L
Sbjct: 357 LSSVSDDKKLELQQQ 371
>gi|297839341|ref|XP_002887552.1| F25A4.34 [Arabidopsis lyrata subsp. lyrata]
gi|297333393|gb|EFH63811.1| F25A4.34 [Arabidopsis lyrata subsp. lyrata]
Length = 458
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/160 (25%), Positives = 77/160 (48%), Gaps = 8/160 (5%)
Query: 245 ENRTVLGFWAG----HRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKF 300
E R VL ++ G IR L + ++ ++ + + R G + +KF
Sbjct: 267 EKRPVLAYFQGAIYRKDGGTIRQELYNLLRDEKDVHFAFGTVRRN-GTKQTGKGMASSKF 325
Query: 301 CICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ---LK 357
C+ G +S R+ D+I CVPVI+S+ +LPF D LD+ F+V ++ + + L
Sbjct: 326 CLNIAGDTPSSNRLFDAIVSHCVPVIISDQIELPFEDSLDYSGFSVFVHASEAVKKGFLV 385
Query: 358 QILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMV 397
+L+ I++ ++ L +V F++ P D+ +M+
Sbjct: 386 NLLRGITEDQWKKKWGRLKEVAGCFEYRFPSHPGDSVNMI 425
>gi|449662559|ref|XP_002154865.2| PREDICTED: exostosin-2-like [Hydra magnipapillata]
Length = 729
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/66 (34%), Positives = 39/66 (59%)
Query: 288 PLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVI 347
P Y + FC+ G S+ + D++ GC+PVI+ N Y LPFN+++DW + A+I
Sbjct: 342 PFYYPAVLQEGTFCLLLPGYYYGSSLLLDAMMMGCIPVIMMNDYVLPFNEVIDWSRAAII 401
Query: 348 LNERDV 353
+ E+ +
Sbjct: 402 VREQQI 407
>gi|390340500|ref|XP_003725255.1| PREDICTED: xyloglucan galactosyltransferase KATAMARI1 homolog
[Strongylocentrotus purpuratus]
Length = 495
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 36/112 (32%), Positives = 54/112 (48%), Gaps = 10/112 (8%)
Query: 300 FCICPGGSQVNSARIADSIHYGCVPVILSN---YYDLPFNDILDWRKFAVILNERDVYQL 356
FC+ P G DSI GC+PV + + PF LD+R+F V + +V
Sbjct: 379 FCLQPPGYSNIRKSFYDSIMSGCIPVTFRSKRSHVIYPFERTLDYRRFTVNIPIDEVLSG 438
Query: 357 K----QILKNISDAEFVSLHNNLVKVQKHFQWNSPPVR---FDAFHMVMYEL 401
K ILK I+ + L L +V FQ++ PP+R +DAF ++ E+
Sbjct: 439 KTNVTNILKGITKWKIAELQTELAEVAPKFQYSYPPIRGPDYDAFAAIIEEM 490
>gi|260782087|ref|XP_002586123.1| hypothetical protein BRAFLDRAFT_105910 [Branchiostoma floridae]
gi|229271215|gb|EEN42134.1| hypothetical protein BRAFLDRAFT_105910 [Branchiostoma floridae]
Length = 551
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 37/118 (31%), Positives = 57/118 (48%), Gaps = 8/118 (6%)
Query: 300 FCICPGGSQVNSARIADSIHYGCVPV--ILSNYYDLPFNDILDWRKFAVILNERDVYQ-- 355
FC+ P G D++ GCVPV L + PF+ +L++ F+VI++ +DV
Sbjct: 419 FCLQPPGDSPTRKSFYDAVACGCVPVTFTLEHPVRYPFDQVLNYSDFSVIIDGKDVTDRN 478
Query: 356 --LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPV--RFDAFHMVMYELWLRRHVIK 409
+ IL+ I L +NL KV Q++ P DAF MV+ E+ R V +
Sbjct: 479 ITILNILRKIPSERIKMLQDNLKKVAPLLQYSYPSTVPSQDAFTMVLEEMAQRVDVAR 536
>gi|384245403|gb|EIE18897.1| hypothetical protein COCSUDRAFT_45043 [Coccomyxa subellipsoidea
C-169]
Length = 420
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 84/403 (20%), Positives = 149/403 (36%), Gaps = 108/403 (26%)
Query: 90 KIYIYPDGDPNTFYQTPRKLTGKYASEGY-----FFQNIRESRFLTHDPDRAHLFFIPI- 143
K ++Y P F + ++L + E Y +++ S T DP A +F+IP+
Sbjct: 31 KEFVYVYDLPEKFNKDIKELPTIWHPEQYDIDQVLHKHLTHSEVNTKDPSVAKVFYIPVY 90
Query: 144 -----------------------SCHKMRG------KGTSYENMTV-IVKDYVDSLINKY 173
CH + K EN T +V+ + + Y
Sbjct: 91 LGRYFNAQWQRFSDPSDAWLINKECHGLDSVDCWAEKWKVAENATSDLVRSAIAHVKENY 150
Query: 174 PYWNRTLGADHFFVTCHDVG------------------VRATEGVPFLIKNAIRVV---- 211
PYWN + GADHF V +D G ++A + + ++ V
Sbjct: 151 PYWNASNGADHFMVFSYDHGKCEMAKALRFEEFGEMFSIQAYGSLVYRSNAKVQAVDRGD 210
Query: 212 ----CSPSYDVAFIPHKDVALP--------QVLQPFALPRGGRDV--------ENRTVLG 251
PS + P DV +P +L PFA+ R + + ++++
Sbjct: 211 SYSWSGPSTWACYRPDADVLVPVFSPYGHNTILSPFAVERNISLLMRFDYPLNDGKSLV- 269
Query: 252 FWAGHRNSKIRVILARVWE----NDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGS 307
H ++R L W+ N ++L + + + P+ R +
Sbjct: 270 ---AHHGHRLRKELIDYWQEQPLNGSDLGLRSTKAR----PIPAAR-------------N 309
Query: 308 QVNSARIADSIHYGCVPVILSNYYDLPFNDILD--WRKFAVILNERDVYQLKQILKNISD 365
+SAR+ +I +GC+PV +DLPF + + F + + D QL ++ + D
Sbjct: 310 TQDSARVWRAITFGCIPVTFFRAFDLPFQRHMGMPYADFVLNIQPDDYRQLNARIQALLD 369
Query: 366 --AEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELWLRRH 406
++ + L + Q+ F W+ + MV EL LR H
Sbjct: 370 NPSQLRRMQEALERHQRFFVWDEAG-DGGIYTMVEKELTLRAH 411
>gi|321463593|gb|EFX74608.1| hypothetical protein DAPPUDRAFT_307216 [Daphnia pulex]
Length = 724
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 62/287 (21%), Positives = 121/287 (42%), Gaps = 30/287 (10%)
Query: 91 IYIYPDGDPNTFYQTPRKLTGKYASEGY-FFQNIRESRFLTHDPDRAHLFFIPISCHKMR 149
+Y+YP + + P +T K SE Y + I +S + T DP+ A LF I
Sbjct: 114 VYVYPVVEFVDKDKVP--ITKKITSEFYNILKAITKSMYFTSDPNEACLFIPTIDLL--- 168
Query: 150 GKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFL-IKNAI 208
N I V ++ PYWN G +H + E PFL +
Sbjct: 169 -------NQNRIRPKDVGKALSSLPYWNE--GRNHLIFNI--LPGSMPEYHPFLEVDIGY 217
Query: 209 RVVCSPSYD-VAFIPHKDVALPQVLQPFALPRGGRDVENRTVLGFWAG-HRNSKIRVILA 266
++ S + + + DV++P V P A + ++R L + + + R +++
Sbjct: 218 AMMTSGGFSSLTYRSGFDVSVP-VYSPLAAELRTQSSQDRNWLVISSQPYIHEDFREVIS 276
Query: 267 RVWENDTELDISNNRINRAIGPLVYQRR---------FYKTKFCICPGGSQVNSARIADS 317
+ + N+ + + + R FC+ G+++ + +S
Sbjct: 277 EMAAEHPGFLVLNSCGSSPLDTKLRCRDEETYNFPDILMNATFCLVVRGARLGQPTLMES 336
Query: 318 IHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNIS 364
+ GC+PV++S+ Y LP+ +++DW+ + L E D+ ++ +L+ +S
Sbjct: 337 LAAGCIPVVVSDSYVLPYEEVIDWKSAVLQLYEDDLSKMMDLLRGVS 383
>gi|302829555|ref|XP_002946344.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300268090|gb|EFJ52271.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 785
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 31/102 (30%), Positives = 52/102 (50%), Gaps = 5/102 (4%)
Query: 291 YQRRFYKTKFC-ICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILN 349
Y ++KFC + PG SAR D+I +GCVP+++ + F ILDW F++ +
Sbjct: 624 YSEHLARSKFCLVAPGDGW--SARAEDAILHGCVPLVVMDGVHAVFESILDWDSFSIRIR 681
Query: 350 E--RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPV 389
E + + + ++L IS + NL +V F + + PV
Sbjct: 682 EDNQALQAIPELLTAISPERLAKMQRNLARVWHRFAYATGPV 723
Score = 38.5 bits (88), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 25/101 (24%), Positives = 45/101 (44%), Gaps = 15/101 (14%)
Query: 109 LTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP--ISCHKMRGKG-------------T 153
L+ Y+ E Y + + +S T DP+ A F++P I+C G T
Sbjct: 379 LSMTYSVEVYLHEMMLQSEHRTFDPEEADFFYVPMYITCFMWPVMGWADFPWWHAPLAHT 438
Query: 154 SYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV 194
+++ ++ + + L +P+WNR G DH ++ D G
Sbjct: 439 RPMHVSNMILEAYEWLSTTFPWWNRRGGRDHIWLMAPDEGA 479
>gi|326429052|gb|EGD74622.1| hypothetical protein PTSG_05987 [Salpingoeca sp. ATCC 50818]
Length = 452
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 41/169 (24%), Positives = 84/169 (49%), Gaps = 17/169 (10%)
Query: 247 RTVLGFWAGH-RNSKIRVILARVWE-------NDTELDISNNRINRAIGPLVYQR----- 293
R F+AG N+++R +L + E +D ++ + ++ R+ L+ ++
Sbjct: 275 RDTFLFFAGTLSNNRVRTVLKKAVEGHPHCVIHDAQVKMMQDKNPRSSHSLLVRQVVRDT 334
Query: 294 --RFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDI-LDWRKFAVILNE 350
++FC+CP G + RI +++ GC+PVI+S+ Y PF + + +V + E
Sbjct: 335 LAEMATSEFCLCPRGLTAGTRRIFEAVLVGCIPVIVSDGYTWPFPHLAAELDAASVRVPE 394
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNSP-PVRFDAFHMVM 398
+D ++ IL ++S E V+ L + + ++ P P DAF+ ++
Sbjct: 395 KDAARVLDILGHVSRRERVAKRVRLAHLAHNVTYHLPAPQPGDAFYNII 443
>gi|195382432|ref|XP_002049934.1| GJ20474 [Drosophila virilis]
gi|194144731|gb|EDW61127.1| GJ20474 [Drosophila virilis]
Length = 719
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 70/320 (21%), Positives = 122/320 (38%), Gaps = 52/320 (16%)
Query: 89 FKIYIYPDGDPNTFYQTPRKLTGKYASEGYF--FQNIRESRFLTHDPDRAHLFFIPISCH 146
K+YIYP + F S YF + + +SR+ T +P+ A LF
Sbjct: 105 LKVYIYPLQE---FLDEKSDKVANTLSSEYFQILEIVLKSRYYTSNPNEACLFL------ 155
Query: 147 KMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFL-IK 205
N + K+ + +W+R G +H A + + I
Sbjct: 156 ----PSLDLLNQDMFDKNLAGPALASLNFWDR--GENHLIFNMLPGSAPAYNTILDVNID 209
Query: 206 NAIRVVCSPSYDV-AFIPHKDVALPQVLQPFALPRGGRDVEN------------------ 246
NAI V +D + P DV++P V P + N
Sbjct: 210 NAI--VLGGGFDSWTYRPGFDVSIP-VWSPLVHQTNSIPIHNGAHRKYLLVAAQLNLLPQ 266
Query: 247 --RTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICP 304
RT+ + N++ ++L N + S + + Y R + +FC
Sbjct: 267 YARTLTELMSTANNAEQMLLLGPCDNNQLK---SRCTLTQHQKRWEYPRVLGRGRFCFIG 323
Query: 305 GGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNIS 364
+V + + + GC+PVI + Y LPF D++DW AV + E +++ + L++IS
Sbjct: 324 RSLRVGQPDLIEIMSQGCIPVIAIDNYVLPFEDVIDWSLAAVRVRESELHSFIRKLESIS 383
Query: 365 DAEFVSLHNNLVKVQKHFQW 384
+ + +V++QK QW
Sbjct: 384 NVK-------VVEMQKQVQW 396
>gi|297814922|ref|XP_002875344.1| hypothetical protein ARALYDRAFT_904882 [Arabidopsis lyrata subsp.
lyrata]
gi|297321182|gb|EFH51603.1| hypothetical protein ARALYDRAFT_904882 [Arabidopsis lyrata subsp.
lyrata]
Length = 64
Score = 55.8 bits (133), Expect = 4e-05, Method: Composition-based stats.
Identities = 22/63 (34%), Positives = 42/63 (66%)
Query: 297 KTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQL 356
++ FC+CP G + S R+ +S GCVPV+++N LPF++I+ W + +++ ++D L
Sbjct: 2 RSIFCLCPFGWGIWSPRLVESAVSGCVPVVIANGIQLPFSEIVRWPEILLMMAKKDDMNL 61
Query: 357 KQI 359
++I
Sbjct: 62 QKI 64
>gi|327285402|ref|XP_003227422.1| PREDICTED: exostosin-1c-like [Anolis carolinensis]
Length = 746
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 76/345 (22%), Positives = 131/345 (37%), Gaps = 84/345 (24%)
Query: 88 NFKIYIYP--DGDPNTFYQTPRKLTGKYASEGYF--FQNIRESRFLTHDPDRAHLFFIPI 143
FK+++YP GDP SE Y ++ ESR+ T +P+ A LF + I
Sbjct: 105 GFKVFVYPLERGDP--------------VSESYLKIIASLEESRYFTSNPEEACLFVLNI 150
Query: 144 SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHD-VGVRATEGVPF 202
+++++ +D I +P WN G +H + TE + F
Sbjct: 151 DT-------LDRDHLSLRFVRNIDEKIQGFPLWND--GRNHLIFNLYSGTWPNYTEDLGF 201
Query: 203 LIKNAIRVVCS-------PSYDVAFIP----------------HKDVALPQVLQPFALPR 239
I AI S P +D++ IP HK ++P + + L
Sbjct: 202 DIGQAILAKASFYMEHFRPGFDIS-IPLFSKDHPRKGGERGWLHK-ASVPPPKKKYLLAF 259
Query: 240 GGRDVENRTVLGFWAGHRN------------SKIRVILARVWEN--DTELDISNNRINRA 285
G+ R + G + RN S + WE DT D N
Sbjct: 260 KGK----RYLTGIGSDARNALHHIHNGKDIISLTTCKHGKDWEKHKDTRCDKDNADYEN- 314
Query: 286 IGPLVYQRRFYKTKFCICPGGS----QVNSARIADSIHYGCVPVILSNYYDLPFNDILDW 341
++ ++ C S + S + C+PV+LSN ++LPF++++DW
Sbjct: 315 ---IITEQTILLLGCCELSALSGHVPEAFSRFLLSLDQAACIPVLLSNGWELPFSEVIDW 371
Query: 342 RKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
K A++ +ER + Q+ ++ + E ++ Q F W +
Sbjct: 372 SKSAIVGDERLLLQIPSTVRCLHSDEILAFQQ-----QTQFLWEA 411
>gi|159470095|ref|XP_001693195.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158277453|gb|EDP03221.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 659
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 62/287 (21%), Positives = 109/287 (37%), Gaps = 37/287 (12%)
Query: 113 YASEGYFFQNIRESRFLTHDPDRAHLFFIP--ISC--------------HKMRGKGTSYE 156
Y E + + S T DP+ A F++P ISC + G +
Sbjct: 338 YGVESGLHEYLLLSEHRTFDPEEADFFYVPVYISCLIWPVLNYADFPVFYSNGGTRVMHA 397
Query: 157 -NMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCSPS 215
NM +D++D+ YP+W R G DH + HD G P I ++I +
Sbjct: 398 VNMLSEARDWIDA---NYPFWKRRGGRDHIWTFPHDEGACW---APNSIVSSIWLTHWGR 451
Query: 216 YDVAFIPHKDVALPQVLQPFALPRGGRDVENRTVLGFWAGHRNSKIRVI----LARVWEN 271
D + F PR + + GH + I +A W
Sbjct: 452 MDPDHTSKSSFDADNYTRDFVSPR-----QPKGYTHLIQGHGCYDPKKIYNMSIANNWRQ 506
Query: 272 DTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYY 331
+ + + + + Y ++ FC+ G SAR D++ +GC+PV++ +
Sbjct: 507 KYNVLVGDGQDVQGD----YSDLLSRSLFCLVATGDGW-SARTEDAVLHGCIPVVVIDGV 561
Query: 332 DLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVKV 378
+ F + D F++ + E DV + ILK + + ++ NL +
Sbjct: 562 HMKFETLFDVDSFSIRIPEADVANILTILKALPEERVRAMQANLGQA 608
>gi|372266629|ref|ZP_09502677.1| exostosin [Alteromonas sp. S89]
Length = 328
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 29/79 (36%), Positives = 44/79 (55%), Gaps = 3/79 (3%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y +TKFC+CP GS NS R+ ++I +GCVPV+LS+ DL L + V ++
Sbjct: 226 YVEVLKETKFCLCPSGSGPNSIRLWEAIKFGCVPVLLSDDLDL---RGLSAKALRVTESK 282
Query: 351 RDVYQLKQILKNISDAEFV 369
+V +L L N E++
Sbjct: 283 EEVEKLPARLANTDPGEYL 301
>gi|4263719|gb|AAD15405.1| hypothetical protein [Arabidopsis thaliana]
Length = 487
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 53/241 (21%), Positives = 105/241 (43%), Gaps = 33/241 (13%)
Query: 170 INKYPYWNRTLGADHFFVTCH-------DVGVRATEGVPFLIKNAIRVVCSPSYDVAFIP 222
+ K P W G +HF VT + G R+ G F++ + + S + +
Sbjct: 203 LKKQPQWKDMSGKNHFLVTGRISRDFRRNSGSRSAWGTNFMLLSESLNLTFLSIERSLTS 262
Query: 223 HKDVALP--QVLQPFALP-----RGGRDVENRTVLGFWAG------HRNSKIRVILARVW 269
H + A+P P + P + + NRTVL +AG ++N +R + +
Sbjct: 263 HNEFAIPYPTYFHPTSTPEILQWQEKIRLTNRTVLFSFAGAQRPSRNQNGVVRTEVIKQC 322
Query: 270 ENDTE----LDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPV 325
++ ++ LD N N P+ + F + FC+ P G + + DSI GC+PV
Sbjct: 323 KSSSKTCRFLDCDVN-ANSCDDPISLMKLFESSTFCLQPPGDSLTRKSVFDSILAGCIPV 381
Query: 326 IL---SNYYDLPFNDILDWRKFAVILNERDVY-----QLKQILKNISDAEFVSLHNNLVK 377
S Y ++ + K++V + +++ ++++IL+ I + V + N+++
Sbjct: 382 FFNQGSAYKQYLWHIPKNSSKYSVYITVKELRTGGKNKIEEILRGIPNERVVGMRENVIR 441
Query: 378 V 378
+
Sbjct: 442 L 442
>gi|42569541|ref|NP_180759.2| exostosin-like protein [Arabidopsis thaliana]
gi|53828531|gb|AAU94375.1| At2g31990 [Arabidopsis thaliana]
gi|59958346|gb|AAX12883.1| At2g31990 [Arabidopsis thaliana]
gi|330253523|gb|AEC08617.1| exostosin-like protein [Arabidopsis thaliana]
Length = 479
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 53/241 (21%), Positives = 105/241 (43%), Gaps = 33/241 (13%)
Query: 170 INKYPYWNRTLGADHFFVTCH-------DVGVRATEGVPFLIKNAIRVVCSPSYDVAFIP 222
+ K P W G +HF VT + G R+ G F++ + + S + +
Sbjct: 195 LKKQPQWKDMSGKNHFLVTGRISRDFRRNSGSRSAWGTNFMLLSESLNLTFLSIERSLTS 254
Query: 223 HKDVALP--QVLQPFALP-----RGGRDVENRTVLGFWAG------HRNSKIRVILARVW 269
H + A+P P + P + + NRTVL +AG ++N +R + +
Sbjct: 255 HNEFAIPYPTYFHPTSTPEILQWQEKIRLTNRTVLFSFAGAQRPSRNQNGVVRTEVIKQC 314
Query: 270 ENDTE----LDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPV 325
++ ++ LD N N P+ + F + FC+ P G + + DSI GC+PV
Sbjct: 315 KSSSKTCRFLDCDVN-ANSCDDPISLMKLFESSTFCLQPPGDSLTRKSVFDSILAGCIPV 373
Query: 326 IL---SNYYDLPFNDILDWRKFAVILNERDVY-----QLKQILKNISDAEFVSLHNNLVK 377
S Y ++ + K++V + +++ ++++IL+ I + V + N+++
Sbjct: 374 FFNQGSAYKQYLWHIPKNSSKYSVYITVKELRTGGKNKIEEILRGIPNERVVGMRENVIR 433
Query: 378 V 378
+
Sbjct: 434 L 434
>gi|195122819|ref|XP_002005908.1| GI20737 [Drosophila mojavensis]
gi|193910976|gb|EDW09843.1| GI20737 [Drosophila mojavensis]
Length = 720
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 27/94 (28%), Positives = 50/94 (53%), Gaps = 7/94 (7%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y R + KFC ++ + + + GC+PVI + Y LPF D++DW +V + E
Sbjct: 311 YPRVLGRGKFCFLGRSLRIGQPDLIEIMSQGCIPVIAIDNYVLPFEDVIDWSLTSVRVRE 370
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
+++ + + L+ IS+ + +V++QK QW
Sbjct: 371 SELHSVMRKLEAISNVK-------VVEMQKQVQW 397
>gi|195150235|ref|XP_002016060.1| GL11396 [Drosophila persimilis]
gi|194109907|gb|EDW31950.1| GL11396 [Drosophila persimilis]
Length = 717
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 49/96 (51%), Gaps = 7/96 (7%)
Query: 289 LVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVIL 348
+ Y R KFC ++ + + + C+PVI + Y LPF D++DW +V +
Sbjct: 306 MEYPRLLSHGKFCFVGRSLRLGQPDLVEIMSESCIPVIAIDNYVLPFEDVIDWSLASVRI 365
Query: 349 NERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
E +++ + Q LK IS+ + +V++QK QW
Sbjct: 366 REFELHSIMQKLKAISNVK-------IVEMQKQVQW 394
>gi|125807793|ref|XP_001360522.1| GA17244 [Drosophila pseudoobscura pseudoobscura]
gi|54635694|gb|EAL25097.1| GA17244 [Drosophila pseudoobscura pseudoobscura]
Length = 717
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 49/96 (51%), Gaps = 7/96 (7%)
Query: 289 LVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVIL 348
+ Y R KFC ++ + + + C+PVI + Y LPF D++DW +V +
Sbjct: 306 MEYPRLLSHGKFCFVGRSLRLGQPDLVEIMSESCIPVIAIDNYVLPFEDVIDWSLASVRI 365
Query: 349 NERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
E +++ + Q LK IS+ + +V++QK QW
Sbjct: 366 REFELHSIMQKLKAISNVK-------IVEMQKQVQW 394
>gi|159471277|ref|XP_001693783.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158283286|gb|EDP09037.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 611
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 66/296 (22%), Positives = 113/296 (38%), Gaps = 54/296 (18%)
Query: 118 YFFQNIRESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWN 177
+F + + S D +A +FIP+ ++R +Y +++ + L + +P++N
Sbjct: 183 HFLEMLTASGARVADGAKADWYFIPV---RLRSSSDAY-----VLQRAIHHLRHAHPWFN 234
Query: 178 RTLGADHFFVTCHDVGVRATEGVPFLI-------------KNAIRVVCSPSYDVAFIPHK 224
T G HF + D+G +E P K A R+ SP + +
Sbjct: 235 ATGGGRHFVIAVGDMGRLESERGPLSANVTFVSHWGLASSKKAERLNSSP-WRASHRNAT 293
Query: 225 DVALP-----QVLQPFALPRGGR----------DVENRTVLGFWAGHRNSKIRVILAR-- 267
D+ LP + L+ + + R ++ R FW R + R A
Sbjct: 294 DIVLPVYISLRKLEKYGITRSRHHPKFATMAPPEIRERNGPLFWFAGRICQDRSKPATDG 353
Query: 268 VWEND---------TELDISNNRINR-----AIGPLVYQRRFYKTKFCICPGGSQVNSAR 313
VW N T + + +R A G Y R +KFC G +
Sbjct: 354 VWPNCPGAMGYSAMTRQAVHFHHWDRPGYFVAPGDKNYSRHLLTSKFCFGAMGGGHGQRQ 413
Query: 314 IADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFV 369
+ ++ GCVPV++ + + LDW F V + E D+ +L IL I E+
Sbjct: 414 LQAAL-AGCVPVVIGDGVLEAWEPYLDWNDFGVRVAEADIPRLHTILGAIGPEEYA 468
>gi|384247592|gb|EIE21078.1| hypothetical protein COCSUDRAFT_48268 [Coccomyxa subellipsoidea
C-169]
Length = 334
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 50/89 (56%), Gaps = 4/89 (4%)
Query: 298 TKFCICPGGSQVNSARIADSIHYGCVPV-ILSNYYDLPFNDILDWRKFAVILNERDVYQL 356
+ F ICP G S R+A++I G +P+ + LP++++LDW FA++++ ++L
Sbjct: 219 SNFSICPRGFGSTSFRVAETIQLGTLPIYVWEQEAWLPYSNLLDWNDFAIVVSS---HKL 275
Query: 357 KQILKNISDAEFVSLHNNLVKVQKHFQWN 385
++ + I A+ + L KVQ F +N
Sbjct: 276 AELPEKIRQADVGKMQEALKKVQHMFTYN 304
>gi|302838103|ref|XP_002950610.1| hypothetical protein VOLCADRAFT_104742 [Volvox carteri f.
nagariensis]
gi|300264159|gb|EFJ48356.1| hypothetical protein VOLCADRAFT_104742 [Volvox carteri f.
nagariensis]
Length = 1119
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 47/203 (23%), Positives = 86/203 (42%), Gaps = 35/203 (17%)
Query: 215 SYDVAFIPHKDVALPQVL-----QPFA---------------LPRGGRDVENRTVLGFWA 254
+++ P KDV P VL +P A + R D+E RT F+
Sbjct: 479 EHEICLRPEKDVVTPNVLHQGWVEPGAYRQVWDVKFVDGERVVTRQQDDLEKRTYTLFFG 538
Query: 255 GHRNS------KIRVILARVWENDTELDISNNRINRAIGPLVYQRR------FYKTKFCI 302
G+ S +R L ++ + D N N LV R ++KFC+
Sbjct: 539 GYTKSIMAYSQGVRQALRSMFGPGGKYD--PNGPNARSDFLVTDPRHDAIDLMARSKFCL 596
Query: 303 CPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKN 362
P G+ R+A+++ GCVPV++ ++ P D++ + +F++ + RDV L L +
Sbjct: 597 APMGAGWG-IRLAEAMVRGCVPVVIQDHVYQPLWDVVPFEEFSLRFSRRDVADLVDHLDD 655
Query: 363 ISDAEFVSLHNNLVKVQKHFQWN 385
++ + L + + + +W
Sbjct: 656 VTSEQLARLQGGVERYHRWEEWG 678
>gi|195431325|ref|XP_002063693.1| GK15814 [Drosophila willistoni]
gi|194159778|gb|EDW74679.1| GK15814 [Drosophila willistoni]
Length = 728
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/96 (28%), Positives = 52/96 (54%), Gaps = 7/96 (7%)
Query: 289 LVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVIL 348
L Y K+KFC+ ++ + + + C+PVI + Y LPF D++DW +V +
Sbjct: 317 LDYPHLLAKSKFCLVARSLRLGQPDLLEIMSQNCIPVIAIDNYILPFEDVVDWSLASVRI 376
Query: 349 NERDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQW 384
E +++ + + L++IS+ + +V++QK QW
Sbjct: 377 RESELHSVLRKLESISNVK-------IVEMQKQVQW 405
>gi|356533217|ref|XP_003535163.1| PREDICTED: uncharacterized protein LOC100807663 [Glycine max]
Length = 795
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 74/334 (22%), Positives = 133/334 (39%), Gaps = 70/334 (20%)
Query: 113 YASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCHKMRGKGTSYENM--------TVIV 162
Y ++ ++++ S T + + A FF+P+ SC R + +M ++ +
Sbjct: 395 YGAQIALYESLLASPHRTLNGEEADFFFVPVLDSCIITRADDAPHLSMQEHMGLRSSLTL 454
Query: 163 KDYVDS---LINKYPYWNRTLGADHFFVTCHDVGVRATEGVPFLIKNAIRVV-------- 211
+ Y ++ ++ +YPYW+ + G DH + D G P I N++ +V
Sbjct: 455 EYYKNTYTHIVEQYPYWSHSSGRDHIWSFSWDEGACY---APKEIWNSMMLVHWGNTNTK 511
Query: 212 --------CSPSYDV----------AFIPHKDVALPQVLQPFAL----PRGGRDVENRTV 249
+ ++D F P KD+ LP P A R E R
Sbjct: 512 HNHSTTAYWADNWDKISSDRRGIHPCFDPDKDLVLPAWKVPDAYVLTSKLWARSHEKRKT 571
Query: 250 LGFWAGHRNSK-------------IRVILARVWENDTELDISNNRINRAIGPLV------ 290
L ++ G+ IR LA + + D + A +V
Sbjct: 572 LFYFNGNLGPAYPHGRPEDTYSMGIRQKLAEEFGSSPNKDGKLGK-QHAKDVIVTPERSE 630
Query: 291 -YQRRFYKTKFC-ICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVIL 348
Y + FC + PG S R+ DSI GC+PV++ + LP+ ++L++ FAV +
Sbjct: 631 DYHMDLASSVFCGVFPGDGW--SGRMEDSILQGCIPVVIQDGIFLPYENVLNYDSFAVRI 688
Query: 349 NERDVYQLKQILKNISDAEFVSLHNNLVKVQKHF 382
E ++ L + L+ +D E N+ K+ + F
Sbjct: 689 PEAEIPNLIKTLRGFNDTEIEFKLANVQKIWQRF 722
>gi|397643551|gb|EJK75938.1| hypothetical protein THAOC_02324 [Thalassiosira oceanica]
Length = 505
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 32/110 (29%), Positives = 51/110 (46%), Gaps = 3/110 (2%)
Query: 259 SKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADSI 318
+ V L V T D N + A+G + + ++ FC+CP G + R SI
Sbjct: 319 TGTEVFLPAVDAVHTAKDEHGNAMIGAVG--ITMQDTFEATFCLCPAGDSDVARRFFTSI 376
Query: 319 HYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQL-KQILKNISDAE 367
GC+PV++S + LPF ++D+ F V + D K IL + D +
Sbjct: 377 LAGCIPVVMSQHIVLPFESLIDYSTFVVFVAFDDTENAEKNILPTVGDKD 426
>gi|320164161|gb|EFW41060.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 409
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 71/318 (22%), Positives = 122/318 (38%), Gaps = 66/318 (20%)
Query: 106 PRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIPI--SCH---KMRGKGTSYENMTV 160
P T Y+SE F Q + +S T +P A +++P+ +CH +M + + +
Sbjct: 54 PVCTTSFYSSEWAFHQLLLDSPLRTLNPRDADYYYVPVYGTCHGFNRMAVQPNASAELFS 113
Query: 161 IVKDYV--------DSLINKY----PYWNR-----------------TLGADHFFVTCHD 191
D++ D+L +Y P WN DH ++
Sbjct: 114 AALDWITSHGSIPRDTLPWRYDPYSPDWNSLGTIEQVATRGEYPPFPAFAQDHLWLFSQG 173
Query: 192 VGVRATEGVPFLIKNAIRVVCSPSYDVA-FIPHKDVALPQVLQ-----PFALPRGGRDVE 245
G + G IKNA+ + + A F KDV +P L P + ++E
Sbjct: 174 HGAKLF-GDYSRIKNAVFLTANGQLSAAEFTLAKDVTIPPRLTHYVPTPIYANKSVDELE 232
Query: 246 -----NRTVLGFWAGHR-------------NSKIRVILARVWENDTELDISNNRINRAIG 287
R L + G + + +R L + + I R +
Sbjct: 233 VILTGQRPTLACFGGTKLPCFVNDARGSCHSRGVRPYLKETFSKHPDFRILGIRSSG--- 289
Query: 288 PLVYQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAV- 346
Y++ + FC+CP G + R+ ++I GC+PV++S+ LPF ++D+ F V
Sbjct: 290 ---YEKALRSSTFCLCPEGWHAWTPRVFEAILSGCIPVLISDDLALPFESLIDYDAFIVR 346
Query: 347 ILNERDVYQLKQILKNIS 364
I R L L++IS
Sbjct: 347 IPPARVAADLLSTLQSIS 364
>gi|227206252|dbj|BAH57181.1| AT3G03650 [Arabidopsis thaliana]
Length = 244
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/108 (26%), Positives = 59/108 (54%), Gaps = 4/108 (3%)
Query: 298 TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ-- 355
+KFC+ G +S R+ D+I C+PVI+S+ +LP+ D+L++ +F + + D +
Sbjct: 114 SKFCLNIAGDTPSSNRLFDAIASHCIPVIISDDIELPYEDVLNYNEFCLFVRSSDALKKG 173
Query: 356 -LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELW 402
L ++++I E+ + L +V+++F PV+ D + +W
Sbjct: 174 FLMGLVRSIGREEYNKMWLRLKEVERYFDLRF-PVKDDEGDYAVQMIW 220
>gi|159483755|ref|XP_001699926.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158281868|gb|EDP07622.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 785
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 34/144 (23%), Positives = 74/144 (51%), Gaps = 9/144 (6%)
Query: 258 NSKIRVILARVWENDTELDISNNRINRAIGPLVYQRRFYKTKFCICPGGSQVNSARIADS 317
++ +R L ++ N T D+S IN+ G +R +++FC P G R++ +
Sbjct: 636 SAGVRQGLLALFGNSTRADLS---INKGGG----SQRMLRSRFCFTPMGFGWG-IRLSQA 687
Query: 318 IHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNLVK 377
+ GCVP+++ ++ D+L + +F++ ++ ++Y+L L++I+ + L + + +
Sbjct: 688 MLTGCVPIMVHDHVWPTLWDVLPYEQFSIRVSRHNMYRLLDYLESITPQQLARLQDGVAQ 747
Query: 378 VQKHFQWNSPPVRFDAFHMVMYEL 401
K F W P V A++ + L
Sbjct: 748 WHKAFVWQ-PEVGGLAYNYTLTSL 770
>gi|15228598|ref|NP_187015.1| exostosin-like protein [Arabidopsis thaliana]
gi|6091757|gb|AAF03467.1|AC009327_6 hypothetical protein [Arabidopsis thaliana]
gi|30102722|gb|AAP21279.1| At3g03650 [Arabidopsis thaliana]
gi|110736553|dbj|BAF00242.1| hypothetical protein [Arabidopsis thaliana]
gi|332640446|gb|AEE73967.1| exostosin-like protein [Arabidopsis thaliana]
Length = 499
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/108 (26%), Positives = 59/108 (54%), Gaps = 4/108 (3%)
Query: 298 TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ-- 355
+KFC+ G +S R+ D+I C+PVI+S+ +LP+ D+L++ +F + + D +
Sbjct: 369 SKFCLNIAGDTPSSNRLFDAIASHCIPVIISDDIELPYEDVLNYNEFCLFVRSSDALKKG 428
Query: 356 -LKQILKNISDAEFVSLHNNLVKVQKHFQWNSPPVRFDAFHMVMYELW 402
L ++++I E+ + L +V+++F PV+ D + +W
Sbjct: 429 FLMGLVRSIGREEYNKMWLRLKEVERYFDLRF-PVKDDEGDYAVQMIW 475
>gi|198416199|ref|XP_002119284.1| PREDICTED: similar to exostoses (multiple) 2, partial [Ciona
intestinalis]
Length = 659
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/96 (28%), Positives = 50/96 (52%), Gaps = 5/96 (5%)
Query: 291 YQRRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNE 350
Y + FC+ S++ + + +S+ GC+PV + Y LPF+++LDW + +V++ E
Sbjct: 298 YPEILQEGTFCLLLPTSRLGQSALMESMQAGCIPVFACDTYILPFSEVLDWSRASVLIRE 357
Query: 351 RDVYQLKQILKNISDAEFVSLHNNLVKVQKHFQWNS 386
+ + IL+ I + V L+K Q F + S
Sbjct: 358 DSLPDIMNILRRIPHEQVV-----LMKKQVEFLYTS 388
>gi|388510202|gb|AFK43167.1| unknown [Medicago truncatula]
Length = 216
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/114 (28%), Positives = 55/114 (48%), Gaps = 5/114 (4%)
Query: 86 VTNFKIYIY--PDG-DPNTFYQTPRKLTGKYASEGYFFQNIRESRFLTHDPDRAHLFFIP 142
V K+++Y P + + PR LT +A+E + + + S T +P+ A F+ P
Sbjct: 47 VGRLKVFVYELPSKYNKKILQKDPRCLTHMFAAEIFMHRFLLSSAVRTLNPEEADWFYTP 106
Query: 143 I--SCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFVTCHDVGV 194
+ +C +++ + + + +PYWNRT GADHFFV HD G
Sbjct: 107 VYTTCDLTPNGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGA 160
>gi|159489402|ref|XP_001702686.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158280708|gb|EDP06465.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 615
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 69/337 (20%), Positives = 126/337 (37%), Gaps = 63/337 (18%)
Query: 119 FFQNIRESRFLTHDPDRAHLFFIPISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNR 178
F+Q + + T + D A FFIP++ + + Y+ N +PYW+R
Sbjct: 207 FWQRLMSAGLRTVNGDEADYFFIPLNTRTLMAP-----EQAAWILPYIR---NTWPYWDR 258
Query: 179 TLGADHFFVTCHDVGVRATEGVPFLIKNAIRVVCS----------------PSYDVAFIP 222
G H + D+G+ +P ++ + S ++ A P
Sbjct: 259 DNGHRHLIIHTGDMGLHE---LPLGLRRKMNETLSNITWLTHWGLHTYHPIGTWFPAHRP 315
Query: 223 HKDVALPQV-------LQPF-------ALPRGGRDVENRTVL--GFWAGHRNSKIRVILA 266
KD+ +P + L P A RG +T G G R +
Sbjct: 316 GKDIVIPVMITTPGFQLSPLNPAVAEKAAKRGRPYTREQTFFFAGRICGDRKPP-DPLTH 374
Query: 267 RVWENDTELDIS---------NNRINRAI--GPLVYQRRFYKTKFCICPGGSQVNSARIA 315
T+ S +NR + G Y + KFC+ P G ++
Sbjct: 375 ECAPKRTDYSASVRQRVYFHHHNRTGFKVLTGTSKYMQEITSHKFCLAPTGGGHGKRQVL 434
Query: 316 DSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLHNNL 375
++ GC+PV +++ PF L W F+V + E D+ +L ++L+ + + + + L
Sbjct: 435 VAL-MGCIPVTITDGVYQPFEPELPWADFSVPVAEDDIPRLHEVLEALPPEQVEQMQSRL 493
Query: 376 VKVQKHFQWNS-------PPVRFDAFHMVMYELWLRR 405
+H ++S R+DAF ++ L +R+
Sbjct: 494 HCAAQHMFYSSSLGAIIGEDGRYDAFETMIEILRVRK 530
>gi|328788198|ref|XP_001121972.2| PREDICTED: exostosin-2 [Apis mellifera]
Length = 708
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/97 (26%), Positives = 47/97 (48%), Gaps = 2/97 (2%)
Query: 278 SNNRINRAIGPLVYQ--RRFYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPF 335
S N R G VY+ FC+ G+++ + + + G +PVI+++ +PF
Sbjct: 286 SMNNTIRCAGEDVYKYPNVLQTATFCLVIRGARLGQTTLLECMAAGSIPVIIADSLTMPF 345
Query: 336 NDILDWRKFAVILNERDVYQLKQILKNISDAEFVSLH 372
+ I+DW + + + E D+ L +LK IS + L
Sbjct: 346 HGIIDWTRAVIFIREVDILSLISVLKKISQERIIELQ 382
>gi|310831249|ref|YP_003969892.1| putative exostosin family protein [Cafeteria roenbergensis virus
BV-PW1]
gi|309386433|gb|ADO67293.1| putative exostosin family protein [Cafeteria roenbergensis virus
BV-PW1]
Length = 300
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 42/130 (32%), Positives = 65/130 (50%), Gaps = 12/130 (9%)
Query: 235 FALPRGGRDVENRTVLGFWAGHRNSKIRVILARVWENDTELDISNNRINRAIGPLVYQRR 294
F LP + N T G W H N +++ + EL+I N+ IN Y
Sbjct: 158 FNLPIKPYNYINNT--GNW--HFN---KIVYSTKQNAKQELNIDNSHINNTKN---YNNI 207
Query: 295 FYKTKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVY 354
+ F +CP GS NS R +S+ G +PV+LS+ DLP N ++W+ +I+ E+++
Sbjct: 208 LLDSIFSLCPSGSGPNSIRFWESLAVGSIPVLLSDKLDLPKN--INWQDIIIIIPEKEIS 265
Query: 355 QLKQILKNIS 364
+ IL NIS
Sbjct: 266 NIDIILNNIS 275
>gi|222629835|gb|EEE61967.1| hypothetical protein OsJ_16742 [Oryza sativa Japonica Group]
Length = 196
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/112 (29%), Positives = 55/112 (49%), Gaps = 16/112 (14%)
Query: 88 NFKIYIYPDGDPNTFYQTPRK----------LTGKYASEGYFFQNIRESRFLTHDPDRAH 137
+ +IY+Y + + + R + G++ ++ Q + SRF T D D A
Sbjct: 54 DLRIYVYAEDEVDGLRALLRGRNDDVTAATCIKGQWGTQVKIHQLLLSSRFRTFDKDEAD 113
Query: 138 LFFIP--ISCHKMRGKGTSYENMTVIVKDYVDSLINKYPYWNRTLGADHFFV 187
LFF+P + C +M GK E I + YV ++++ PY+ R+ G DH FV
Sbjct: 114 LFFVPTYVKCVRMTGKLNDKE----INQTYVKVVLSQMPYFRRSGGRDHIFV 161
>gi|297833038|ref|XP_002884401.1| hypothetical protein ARALYDRAFT_477609 [Arabidopsis lyrata subsp.
lyrata]
gi|297330241|gb|EFH60660.1| hypothetical protein ARALYDRAFT_477609 [Arabidopsis lyrata subsp.
lyrata]
Length = 495
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/93 (29%), Positives = 53/93 (56%), Gaps = 3/93 (3%)
Query: 298 TKFCICPGGSQVNSARIADSIHYGCVPVILSNYYDLPFNDILDWRKFAVILNERDVYQ-- 355
+KFC+ G +S R+ D+I C+PVI+S+ +LP+ D+L++ +F + + D +
Sbjct: 365 SKFCLNIAGDTPSSNRLFDAIASHCIPVIISDDIELPYEDVLNYNEFCLFVRSSDALKKG 424
Query: 356 -LKQILKNISDAEFVSLHNNLVKVQKHFQWNSP 387
L ++K+I E+ + L +V+++F P
Sbjct: 425 FLMGLVKSIGRDEYNKMWLRLKEVERYFDLRFP 457
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.326 0.140 0.441
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,683,894,479
Number of Sequences: 23463169
Number of extensions: 286855082
Number of successful extensions: 725320
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1131
Number of HSP's successfully gapped in prelim test: 193
Number of HSP's that attempted gapping in prelim test: 722145
Number of HSP's gapped (non-prelim): 1733
length of query: 411
length of database: 8,064,228,071
effective HSP length: 145
effective length of query: 266
effective length of database: 8,957,035,862
effective search space: 2382571539292
effective search space used: 2382571539292
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 78 (34.7 bits)