BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 019788
(336 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|359481095|ref|XP_002265438.2| PREDICTED: probable glycosyltransferase At5g03795-like [Vitis
vinifera]
Length = 336
Score = 591 bits (1524), Expect = e-166, Method: Compositional matrix adjust.
Identities = 280/333 (84%), Positives = 301/333 (90%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
MNRSF++Y YPH+R+DPFAN LLPVDFEP GNYASESYFKK MKSHF+TKDPSKADLFF
Sbjct: 1 MNRSFKIYCYPHKRDDPFANALLPVDFEPGGNYASESYFKKVLMKSHFITKDPSKADLFF 60
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
LPFSIAR+RHD R+G GI DFI YIFNISQ YPYWN+TGGADHFYVACHSIGRSAM+K
Sbjct: 61 LPFSIARLRHDPRVGVGGIQDFIRDYIFNISQNYPYWNQTGGADHFYVACHSIGRSAMEK 120
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAVNS 180
A EVKLNAIQVVCSSSYF+SG+IAHKD SLPQIWPRQ DPP L S+R KLAFFAG++NS
Sbjct: 121 ADEVKLNAIQVVCSSSYFLSGYIAHKDASLPQIWPRQGDPPDLALSERKKLAFFAGSINS 180
Query: 181 PVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVP 240
PVRE+LLQVWRNDSEI H GRL TPYADELLGSKFCLHVKGFE+NTARIADSLYYGCVP
Sbjct: 181 PVRERLLQVWRNDSEISVHFGRLTTPYADELLGSKFCLHVKGFEINTARIADSLYYGCVP 240
Query: 241 VIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
VIIANHYDLPFAD+LNWKSFSIVVATLDIPLLK++LKGIS EYL+LQS VLKVR HFQW
Sbjct: 241 VIIANHYDLPFADILNWKSFSIVVATLDIPLLKQVLKGISLNEYLMLQSNVLKVRNHFQW 300
Query: 301 HVFPSDYDAFYMVMYELWLRRSSVRVQWSTSLD 333
HV P DYDAFYMVMYELWLRRSSVRV ST +D
Sbjct: 301 HVSPVDYDAFYMVMYELWLRRSSVRVPLSTPMD 333
>gi|296089301|emb|CBI39073.3| unnamed protein product [Vitis vinifera]
Length = 467
Score = 587 bits (1513), Expect = e-165, Method: Compositional matrix adjust.
Identities = 277/326 (84%), Positives = 297/326 (91%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
MNRSF++Y YPH+R+DPFAN LLPVDFEP GNYASESYFKK MKSHF+TKDPSKADLFF
Sbjct: 120 MNRSFKIYCYPHKRDDPFANALLPVDFEPGGNYASESYFKKVLMKSHFITKDPSKADLFF 179
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
LPFSIAR+RHD R+G GI DFI YIFNISQ YPYWN+TGGADHFYVACHSIGRSAM+K
Sbjct: 180 LPFSIARLRHDPRVGVGGIQDFIRDYIFNISQNYPYWNQTGGADHFYVACHSIGRSAMEK 239
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAVNS 180
A EVKLNAIQVVCSSSYF+SG+IAHKD SLPQIWPRQ DPP L S+R KLAFFAG++NS
Sbjct: 240 ADEVKLNAIQVVCSSSYFLSGYIAHKDASLPQIWPRQGDPPDLALSERKKLAFFAGSINS 299
Query: 181 PVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVP 240
PVRE+LLQVWRNDSEI H GRL TPYADELLGSKFCLHVKGFE+NTARIADSLYYGCVP
Sbjct: 300 PVRERLLQVWRNDSEISVHFGRLTTPYADELLGSKFCLHVKGFEINTARIADSLYYGCVP 359
Query: 241 VIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
VIIANHYDLPFAD+LNWKSFSIVVATLDIPLLK++LKGIS EYL+LQS VLKVR HFQW
Sbjct: 360 VIIANHYDLPFADILNWKSFSIVVATLDIPLLKQVLKGISLNEYLMLQSNVLKVRNHFQW 419
Query: 301 HVFPSDYDAFYMVMYELWLRRSSVRV 326
HV P DYDAFYMVMYELWLRRSSVRV
Sbjct: 420 HVSPVDYDAFYMVMYELWLRRSSVRV 445
>gi|255569522|ref|XP_002525728.1| catalytic, putative [Ricinus communis]
gi|223535028|gb|EEF36711.1| catalytic, putative [Ricinus communis]
Length = 336
Score = 584 bits (1506), Expect = e-164, Method: Compositional matrix adjust.
Identities = 272/331 (82%), Positives = 300/331 (90%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
MNRSF++YVYPHR+NDPFANVLLPVDFEP GNYASESYFKK MKSHF+TKDP+KADLFF
Sbjct: 1 MNRSFKIYVYPHRQNDPFANVLLPVDFEPGGNYASESYFKKVLMKSHFITKDPTKADLFF 60
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
LPFSIAR+RHD RIG EGI DFI Y++NISQKYPYWNRTGG DHFYVACHSIGR+AM+K
Sbjct: 61 LPFSIARLRHDPRIGVEGIQDFIRAYVYNISQKYPYWNRTGGTDHFYVACHSIGRTAMEK 120
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAVNS 180
A EVK NAIQVVCSSSY++SG+IAHKD SLPQ+WPRQ DPP L SS+R KLAFFAG++NS
Sbjct: 121 AEEVKFNAIQVVCSSSYYLSGYIAHKDASLPQVWPRQGDPPNLASSERQKLAFFAGSINS 180
Query: 181 PVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVP 240
PVRE+LLQVWRNDSEIY H GRL T YADELLGSKFCLHVKGFEVNTARIADSLYYGCVP
Sbjct: 181 PVRERLLQVWRNDSEIYVHYGRLNTSYADELLGSKFCLHVKGFEVNTARIADSLYYGCVP 240
Query: 241 VIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
+IIANHYDLPF D+LNW+SFS+VVATLDI LKKIL+G+SS+ Y++LQS VLKVRKHFQW
Sbjct: 241 IIIANHYDLPFTDILNWESFSVVVATLDILYLKKILQGVSSDRYVMLQSNVLKVRKHFQW 300
Query: 301 HVFPSDYDAFYMVMYELWLRRSSVRVQWSTS 331
H P DYDAF+MVMYELWLRRSSVRV W S
Sbjct: 301 HFPPVDYDAFHMVMYELWLRRSSVRVLWHAS 331
>gi|224144815|ref|XP_002325425.1| predicted protein [Populus trichocarpa]
gi|222862300|gb|EEE99806.1| predicted protein [Populus trichocarpa]
Length = 332
Score = 550 bits (1417), Expect = e-154, Method: Compositional matrix adjust.
Identities = 264/335 (78%), Positives = 289/335 (86%), Gaps = 4/335 (1%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
MNRSF++YVYPHRRNDPFANVLLPVDFEP GNYASESYFKKA MKSHF+TKDP+KADLFF
Sbjct: 1 MNRSFKIYVYPHRRNDPFANVLLPVDFEPGGNYASESYFKKALMKSHFITKDPAKADLFF 60
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
LPFSI R+RHD R+G GI DFI YI NIS+KYP+WNRTGGADHFY ACHSIGRSAM+K
Sbjct: 61 LPFSITRLRHDPRVGVGGIQDFIRDYILNISRKYPFWNRTGGADHFYAACHSIGRSAMEK 120
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAVNS 180
+ EVK NAIQVVCSSSYF+SG+IAHKDVS P Q + R KLAFFAG++NS
Sbjct: 121 SEEVKFNAIQVVCSSSYFLSGYIAHKDVSFPGCHLSQV----VKCDYRKKLAFFAGSINS 176
Query: 181 PVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVP 240
PVRE+LL WRNDSEI+AH GRL TPYADELLGSKFCLHVKGFEVNTARI DSLYYGCVP
Sbjct: 177 PVRERLLHSWRNDSEIFAHFGRLTTPYADELLGSKFCLHVKGFEVNTARIGDSLYYGCVP 236
Query: 241 VIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
VIIANHYDLPFAD+LNWKSFS+VVATLDIPLLKKILKGISS++YL+ Q VL+VRKHFQW
Sbjct: 237 VIIANHYDLPFADILNWKSFSVVVATLDIPLLKKILKGISSDQYLMFQKKVLEVRKHFQW 296
Query: 301 HVFPSDYDAFYMVMYELWLRRSSVRVQWSTSLDSN 335
H P DYDAFYMVMYELWLRR+SVRV S + N
Sbjct: 297 HCPPVDYDAFYMVMYELWLRRTSVRVSLPVSKNPN 331
>gi|357508695|ref|XP_003624636.1| Exostosin-like protein [Medicago truncatula]
gi|87162615|gb|ABD28410.1| Exostosin-like [Medicago truncatula]
gi|116831751|gb|ABK28848.1| exostosin-like protein [Medicago truncatula]
gi|355499651|gb|AES80854.1| Exostosin-like protein [Medicago truncatula]
Length = 486
Score = 545 bits (1405), Expect = e-153, Method: Compositional matrix adjust.
Identities = 254/327 (77%), Positives = 286/327 (87%), Gaps = 1/327 (0%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
MNRSF++YVYPH+++DPFANVLLPV EP GNYASESYFKKA MKSHF+TKDP+KADLFF
Sbjct: 155 MNRSFKIYVYPHKKDDPFANVLLPVKTEPSGNYASESYFKKALMKSHFITKDPTKADLFF 214
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
+PFSIA +RHDRR+G GI DFI Y+ N+ KYPYWNRT GADHFYVACHSIGRSAM K
Sbjct: 215 MPFSIASLRHDRRVGVGGIQDFIRDYVQNMIHKYPYWNRTNGADHFYVACHSIGRSAMDK 274
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAVNS 180
A +VK NAIQVVCSSSYF+SG+IAHKD LPQIWPR E+PP L SS R KLAFFAG VNS
Sbjct: 275 APDVKFNAIQVVCSSSYFLSGYIAHKDACLPQIWPRNENPPNLVSSNRKKLAFFAGEVNS 334
Query: 181 PVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVP 240
PVR L++ W+ND+EI+ H+GRLKTPY DELLGSKFC HV+G+EVNTARI DSLYYGCVP
Sbjct: 335 PVRINLVETWKNDTEIFVHNGRLKTPYGDELLGSKFCFHVRGYEVNTARIGDSLYYGCVP 394
Query: 241 VIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGI-SSEEYLLLQSYVLKVRKHFQ 299
VIIAN+YDLPFADVLNWKSFS+VV TLDIPLLKKILKGI +S EYL+LQ VLKVR+HFQ
Sbjct: 395 VIIANYYDLPFADVLNWKSFSVVVTTLDIPLLKKILKGIVNSGEYLMLQKNVLKVREHFQ 454
Query: 300 WHVFPSDYDAFYMVMYELWLRRSSVRV 326
WH P D+DAFYMVMYELWLRRSS+ +
Sbjct: 455 WHSPPIDFDAFYMVMYELWLRRSSIPI 481
>gi|356571729|ref|XP_003554026.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 487
Score = 544 bits (1402), Expect = e-152, Method: Compositional matrix adjust.
Identities = 259/333 (77%), Positives = 286/333 (85%), Gaps = 1/333 (0%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
MNRS ++YVYPHR +DPFANVLLPV+ EP GNY SESYFKK MKSHF+TKDP +ADLFF
Sbjct: 153 MNRSLKIYVYPHREDDPFANVLLPVESEPGGNYTSESYFKKVLMKSHFITKDPPEADLFF 212
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
LPFS+AR+ HDRR+G GI DFI YI NIS +YPYWN TGGADHFYVACHSIGRSAM K
Sbjct: 213 LPFSMARLWHDRRVGVGGIQDFIRDYIHNISHRYPYWNNTGGADHFYVACHSIGRSAMDK 272
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAVNS 180
A + K NAIQVVCSSSYF++G+ AHKD LPQIWPR+ +PP L SSKR +LAFFAG VNS
Sbjct: 273 APDEKFNAIQVVCSSSYFLTGYFAHKDACLPQIWPRKGNPPNLVSSKRKRLAFFAGGVNS 332
Query: 181 PVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVP 240
PVR KLL+ W+NDSEI+ H GRLKTPYADELLGSKFCLHVKGFEVNTARI DSLYYGCVP
Sbjct: 333 PVRVKLLETWKNDSEIFVHHGRLKTPYADELLGSKFCLHVKGFEVNTARIGDSLYYGCVP 392
Query: 241 VIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKG-ISSEEYLLLQSYVLKVRKHFQ 299
VIIAN+YDLPFADVLNWKSFS+VV TLDIPLLKKILK ISS +YL+LQS VLKVRKHFQ
Sbjct: 393 VIIANYYDLPFADVLNWKSFSVVVTTLDIPLLKKILKDIISSNKYLMLQSNVLKVRKHFQ 452
Query: 300 WHVFPSDYDAFYMVMYELWLRRSSVRVQWSTSL 332
WH P D+DAFYMVMYELWLRRSS++ W S
Sbjct: 453 WHSPPQDFDAFYMVMYELWLRRSSIKNTWVDSF 485
>gi|449527873|ref|XP_004170933.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
sativus]
Length = 466
Score = 521 bits (1343), Expect = e-145, Method: Compositional matrix adjust.
Identities = 238/326 (73%), Positives = 282/326 (86%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
MN+SF++YVYPH+R+DPFA LLP +FEP GNYASESYFKK+ +KSHF+T DP +AD FF
Sbjct: 137 MNKSFKIYVYPHKRSDPFARSLLPENFEPHGNYASESYFKKSLIKSHFITNDPKEADFFF 196
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
LPFSI +R+DRR+ GIP+FI YIF++S KYPYWNRTGGADHFYVACHS+GRSAM K
Sbjct: 197 LPFSITGLRNDRRVSVSGIPNFIRDYIFDVSHKYPYWNRTGGADHFYVACHSVGRSAMDK 256
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAVNS 180
+ E K + +QVVCSSSYF++G+I+HKD +LPQIWPR+EDP L SSKR +LAFFAGA+NS
Sbjct: 257 SSEAKSSIVQVVCSSSYFLTGYISHKDAALPQIWPRKEDPSNLASSKRTRLAFFAGAMNS 316
Query: 181 PVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVP 240
P R+ L+QVW DSEI+A+SGRLKTPYADELL SKFCLHVKGFEVNTAR+ DS++YGCVP
Sbjct: 317 PTRQALVQVWGKDSEIFAYSGRLKTPYADELLRSKFCLHVKGFEVNTARVGDSIFYGCVP 376
Query: 241 VIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
VIIAN+YDLPF D+LNWKSFSIVV T DIP LK+ILKGI+ EEY LQS VLKVRKHF+W
Sbjct: 377 VIIANYYDLPFGDILNWKSFSIVVTTSDIPRLKEILKGINDEEYARLQSNVLKVRKHFKW 436
Query: 301 HVFPSDYDAFYMVMYELWLRRSSVRV 326
H P DYD F+MVMY+LWLRR+SVR+
Sbjct: 437 HSSPVDYDTFHMVMYQLWLRRTSVRL 462
>gi|449446746|ref|XP_004141132.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
sativus]
Length = 466
Score = 518 bits (1334), Expect = e-144, Method: Compositional matrix adjust.
Identities = 237/326 (72%), Positives = 281/326 (86%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
MN+SF++YVYPH+R+DPFA LLP +FEP GNYASESYFKK+ +KSHF+T DP +AD F
Sbjct: 137 MNKSFKIYVYPHKRSDPFARSLLPENFEPHGNYASESYFKKSLIKSHFITNDPKEADFFS 196
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
LPFSI +R+DRR+ GIP+FI YIF++S KYPYWNRTGGADHFYVACHS+GRSAM K
Sbjct: 197 LPFSITGLRNDRRVSVSGIPNFIRDYIFDVSHKYPYWNRTGGADHFYVACHSVGRSAMDK 256
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAVNS 180
+ E K + +QVVCSSSYF++G+I+HKD +LPQIWPR+EDP L SSKR +LAFFAGA+NS
Sbjct: 257 SSEAKSSIVQVVCSSSYFLTGYISHKDAALPQIWPRKEDPSNLASSKRTRLAFFAGAMNS 316
Query: 181 PVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVP 240
P R+ L+QVW DSEI+A+SGRLKTPYADELL SKFCLHVKGFEVNTAR+ DS++YGCVP
Sbjct: 317 PTRQALVQVWGKDSEIFAYSGRLKTPYADELLRSKFCLHVKGFEVNTARVGDSIFYGCVP 376
Query: 241 VIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
VIIAN+YDLPF D+LNWKSFSIVV T DIP LK+ILKGI+ EEY LQS VLKVRKHF+W
Sbjct: 377 VIIANYYDLPFGDILNWKSFSIVVTTSDIPRLKEILKGINDEEYARLQSNVLKVRKHFKW 436
Query: 301 HVFPSDYDAFYMVMYELWLRRSSVRV 326
H P DYD F+MVMY+LWLRR+SVR+
Sbjct: 437 HSSPVDYDTFHMVMYQLWLRRTSVRL 462
>gi|302804767|ref|XP_002984135.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300147984|gb|EFJ14645.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 376
Score = 427 bits (1097), Expect = e-117, Method: Compositional matrix adjust.
Identities = 196/326 (60%), Positives = 250/326 (76%), Gaps = 1/326 (0%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M + +V+VYP DPF+++ LP P GNYASE+YFKKA S VT DPS+ADLFF
Sbjct: 47 MVNTMKVFVYPCSPRDPFSHIFLPTSSAPSGNYASEAYFKKALAGSGMVTDDPSQADLFF 106
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
+PFSI R+R+D ++G +P F+ Y+ NIS ++PYWNRTGG+DHFYVACHSIG+ A++K
Sbjct: 107 MPFSITRLRNDPKVGVGRMPAFVRDYVKNISHRWPYWNRTGGSDHFYVACHSIGKVALEK 166
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS-SKRNKLAFFAGAVN 179
A VKLNAIQVVCSS+Y++ G I HKDV++PQIWPR E ++ + +R LAFFAG N
Sbjct: 167 AQHVKLNAIQVVCSSNYYVQGFIPHKDVAIPQIWPRSESFREIKTIEQRKVLAFFAGGSN 226
Query: 180 SPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCV 239
SPVR +++ WRND++I+A+ R++ YA+ LL SKFCLHVKG+EVNTAR+ D+ +YGCV
Sbjct: 227 SPVRANVVRTWRNDTQIHAYPSRIQGSYAEALLRSKFCLHVKGYEVNTARLGDAFFYGCV 286
Query: 240 PVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQ 299
PV+IANHYDLPF+ VLNWKSFS+VV T +IP LK IL GIS E+Y + VL R+HFQ
Sbjct: 287 PVVIANHYDLPFSSVLNWKSFSVVVTTANIPKLKAILSGISREDYSQMHRLVLDARRHFQ 346
Query: 300 WHVFPSDYDAFYMVMYELWLRRSSVR 325
WH P +YDAFYMVMY+LWLRR VR
Sbjct: 347 WHAPPREYDAFYMVMYQLWLRRHVVR 372
>gi|302780894|ref|XP_002972221.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300159688|gb|EFJ26307.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 376
Score = 425 bits (1092), Expect = e-116, Method: Compositional matrix adjust.
Identities = 194/326 (59%), Positives = 251/326 (76%), Gaps = 1/326 (0%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M + +V+VYP DPF+++ LP P GNYASE+YFKKA +S VT DPS+ADLFF
Sbjct: 47 MVNTMKVFVYPCSPRDPFSHIFLPTSSAPSGNYASEAYFKKALAESGMVTDDPSQADLFF 106
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
+PFSI R+R+D ++G +P F+ Y+ NIS ++PYWNRTGG+DHFYVACHSIG+ A++K
Sbjct: 107 MPFSITRLRNDPKVGVGRMPAFVRDYVKNISHRWPYWNRTGGSDHFYVACHSIGKVALEK 166
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS-SKRNKLAFFAGAVN 179
A V+LNAIQVVCSS+Y++ G I HKDV++PQIWPR E ++ + +R LAFFAG N
Sbjct: 167 AQHVRLNAIQVVCSSNYYVQGFIPHKDVAMPQIWPRSESFREIKTIEQRKVLAFFAGGSN 226
Query: 180 SPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCV 239
SPVR +++ WRND++I+A+ R++ YA+ LL SKFCLHVKG+EVNTAR+ D+ +YGCV
Sbjct: 227 SPVRANVVRTWRNDTQIHAYPSRIQGSYAEALLRSKFCLHVKGYEVNTARLGDAFFYGCV 286
Query: 240 PVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQ 299
PV+IAN+YDLPF+ VLNWKSFS+VV T +IP LK IL GIS E+Y + VL R+HFQ
Sbjct: 287 PVVIANYYDLPFSSVLNWKSFSVVVTTANIPKLKAILSGISREDYSQMHRLVLDARRHFQ 346
Query: 300 WHVFPSDYDAFYMVMYELWLRRSSVR 325
WH P +YDAFYMVMY+LWLRR VR
Sbjct: 347 WHAPPREYDAFYMVMYQLWLRRHVVR 372
>gi|356560377|ref|XP_003548469.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
At5g03795-like [Glycine max]
Length = 334
Score = 421 bits (1082), Expect = e-115, Method: Compositional matrix adjust.
Identities = 204/271 (75%), Positives = 227/271 (83%), Gaps = 9/271 (3%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
MNRS ++VYPHR +D FANVLLPV+ +P GNYASESYFKK MKSHF+TKDP++ADLFF
Sbjct: 63 MNRSLXIHVYPHREDDSFANVLLPVESKPGGNYASESYFKKVPMKSHFITKDPTEADLFF 122
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
LPFSIAR+RH+RR+G G DFI YI NIS KYPYWNRTGGADHFYVACHSIGRSAM K
Sbjct: 123 LPFSIARLRHNRRVGVGGKQDFIRDYIQNISHKYPYWNRTGGADHFYVACHSIGRSAMDK 182
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAVNS 180
A +VK NAIQVVCSSSYF++G+IAHKD LPQIWPR+ +PP L SSKR +LAFFAG VNS
Sbjct: 183 APDVKFNAIQVVCSSSYFLTGNIAHKDTCLPQIWPRKGNPPILVSSKRKRLAFFAGGVNS 242
Query: 181 PVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVP 240
PVR KLL+ W+NDSEI+ H GRLKTPYADELLGSKF LHVKGFEVNT RI
Sbjct: 243 PVRVKLLETWKNDSEIFVHHGRLKTPYADELLGSKFGLHVKGFEVNTTRIG--------- 293
Query: 241 VIIANHYDLPFADVLNWKSFSIVVATLDIPL 271
VIIAN+YDLPFADVLNWKSFS+VV TLDI L
Sbjct: 294 VIIANYYDLPFADVLNWKSFSVVVTTLDIQL 324
>gi|168059172|ref|XP_001781578.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666988|gb|EDQ53629.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 344
Score = 393 bits (1010), Expect = e-107, Method: Compositional matrix adjust.
Identities = 177/327 (54%), Positives = 242/327 (74%), Gaps = 2/327 (0%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M R R++VYPH R DPF + + P GNYASE +F+++ + S F+TK S+AD FF
Sbjct: 14 MKRKLRIFVYPHDRKDPFHMIFESGNKVPSGNYASEEFFQQSLLTSTFLTKTASEADFFF 73
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
+P SI + R D+RI G+ F ++YI ++ ++ YWNR+ GADHFY++CHSI R+AM +
Sbjct: 74 MPVSITKARMDKRINVGGLQSFCANYITDVRSQWSYWNRSNGADHFYLSCHSIARNAMDR 133
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPR-QEDPPKLGS-SKRNKLAFFAGAV 178
+V+ NAIQ++C +SYF+ +I HKD S+PQIWPR ++P ++ + ++R +LAFFAGA+
Sbjct: 134 VPDVRQNAIQLLCPASYFLPSYITHKDASVPQIWPRLGKEPEEVRTITQRKRLAFFAGAL 193
Query: 179 NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGC 238
NSPVR+ L + W NDS+I H GR+ PY++ LL +KFCLH KGFEVNTAR+ D++YYGC
Sbjct: 194 NSPVRKDLERTWANDSKILVHKGRVPYPYSEALLTTKFCLHAKGFEVNTARLGDAMYYGC 253
Query: 239 VPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
VPV+IAN+YDLPF D+L+W FSIVV++LDIPLLKK L+ ++ E+Y L VL RKHF
Sbjct: 254 VPVVIANYYDLPFQDILDWTKFSIVVSSLDIPLLKKTLEAVTDEQYAELHRQVLLARKHF 313
Query: 299 QWHVFPSDYDAFYMVMYELWLRRSSVR 325
QWH P +YDAF+ VMYELW RR VR
Sbjct: 314 QWHAPPEEYDAFHTVMYELWKRRHIVR 340
>gi|302790387|ref|XP_002976961.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300155439|gb|EFJ22071.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 408
Score = 333 bits (853), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 169/331 (51%), Positives = 228/331 (68%), Gaps = 6/331 (1%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLP-VDFEPR--GNYASESYFKKAFMKSHFVTKDPSKAD 57
M +SF+VYVYP +D ++ V LP D R GN+ SE FK + S F T+DP +A
Sbjct: 77 MVKSFKVYVYPFGNSD-YSQVFLPHPDPYDRKLGNFFSEHMFKINLLNSTFATRDPGEAH 135
Query: 58 LFFLPFSIARMRHDRRIGTEG-IPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRS 116
LFF+PFSI MR+ RI +E I F+ Y+ ISQ+Y +WNRT G DHFYV CHS+GR+
Sbjct: 136 LFFMPFSINAMRNHPRIRSEAMISSFVESYVEEISQRYKFWNRTEGVDHFYVGCHSVGRN 195
Query: 117 AMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAG 176
A + ++ NAIQV CS++Y+ ++ HKDV+LPQ+WPR D + KR KLAFF+G
Sbjct: 196 AASNSRALQQNAIQVTCSANYYQKLYVPHKDVALPQVWPRPLDTFIVPPEKRTKLAFFSG 255
Query: 177 -AVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLY 235
A NS +RE LL++W NDS++ +G ++ Y D L SKFCLHVKG+EVNTARI+D+L+
Sbjct: 256 RAQNSHLRETLLKLWSNDSDMDIFAGTMQGSYEDALSRSKFCLHVKGYEVNTARISDALH 315
Query: 236 YGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVR 295
+GCVPVII+N YDLP ++VLNW+SFSIV++ IP LK L+ ++ +EY L S +V+
Sbjct: 316 FGCVPVIISNQYDLPLSNVLNWRSFSIVLSYTQIPALKAKLQSVTHDEYARLWSNGRRVK 375
Query: 296 KHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
+HF WH P +YD+F M MYELW +R VR
Sbjct: 376 RHFGWHHSPREYDSFQMTMYELWSKRHFVRA 406
>gi|255543306|ref|XP_002512716.1| catalytic, putative [Ricinus communis]
gi|223548677|gb|EEF50168.1| catalytic, putative [Ricinus communis]
Length = 426
Score = 331 bits (848), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 162/336 (48%), Positives = 223/336 (66%), Gaps = 8/336 (2%)
Query: 1 MNRSFRVYVYPHRRN--DPFANVLLPVD---FEPR-GNYASESYFKKAFMKSHFVTKDPS 54
M + +++VY N PFAN+ LP++ P+ GNY SE FK A ++S VT DP+
Sbjct: 91 MKQQLKIFVYSDVSNKSSPFANIFLPIENPFHHPKLGNYFSEHIFKVALLRSSLVTLDPA 150
Query: 55 KADLFFLPFSIARMRHDRRIGTE-GIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
KA FFLPFSI +R+D R +E I +F++HY ISQ++ YWN + GADHFYV CHS+
Sbjct: 151 KALFFFLPFSINNLRNDPRFHSEESISEFVAHYTTTISQRFSYWNASAGADHFYVCCHSV 210
Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAF 173
GR A + + NAIQ+ CSSSYF ++HKDV LPQ+WPR R++L +
Sbjct: 211 GRQAASRHPALHNNAIQLTCSSSYFQRFFVSHKDVGLPQVWPRPPQTALNPPHARHRLVY 270
Query: 174 FAGAV-NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIAD 232
FAG V NS VR +L+ +W ND+E+ +G PY + SK+CLHVKG+EVNTAR++D
Sbjct: 271 FAGRVQNSQVRRELVNLWGNDTEMDIINGSPSFPYEEGFKRSKYCLHVKGYEVNTARVSD 330
Query: 233 SLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVL 292
S++YGC+PVII+N+YDLPFA VL+W FS+V+ DIP LK L I+ + Y+ + +
Sbjct: 331 SIHYGCIPVIISNYYDLPFATVLDWSKFSVVINQADIPFLKTTLLAITRKTYITMFQNLC 390
Query: 293 KVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQW 328
+VR+HF+WH P YD+FYM Y+LWLRRS R+ +
Sbjct: 391 RVRRHFEWHTTPKGYDSFYMTAYQLWLRRSIHRLSY 426
>gi|224053619|ref|XP_002297901.1| predicted protein [Populus trichocarpa]
gi|222845159|gb|EEE82706.1| predicted protein [Populus trichocarpa]
Length = 457
Score = 320 bits (820), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 160/348 (45%), Positives = 224/348 (64%), Gaps = 20/348 (5%)
Query: 1 MNRSFRVYVYPH--RRNDPFANVLLP----VDFEPRGNYASESYFKKAFMKSHFVTKDPS 54
M R +++VYP R+ PFAN+ LP ++ GNY SE FK + + S +T P
Sbjct: 110 MMRHLKIFVYPDTFNRSSPFANIFLPHENPLNNPKLGNYFSEHMFKVSLLHSPLLTATPE 169
Query: 55 KADLFFLPFSIARMRHDRRIGTEG-IPDFISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
KA FFLPFSI +R+D R+ +E I F++ Y +IS + +WN +GGADHFYV CHS+
Sbjct: 170 KAHFFFLPFSINDLRNDPRVHSEAKISQFVAQYTSSISSSFRFWNASGGADHFYVCCHSV 229
Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQE----DPPKLGSS--- 166
GR A + ++ NAIQ+ C SSYF +++HKDV LPQ+WPR + +PP
Sbjct: 230 GREAPSRHHGLRNNAIQLTCCSSYFQRFYLSHKDVGLPQVWPRTDQTALNPPHASVCYLD 289
Query: 167 -----KRNKLAFFAGAV-NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHV 220
R++L +FAG V NS VR++L+ +W ND++ +G PY + SKFCLHV
Sbjct: 290 VNTYRCRHRLVYFAGRVQNSQVRQQLVNLWGNDTQFDIFNGNPTFPYEEGFKRSKFCLHV 349
Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGIS 280
KG+EVNTAR++D+++YGC+PVII+N+YDLPFA+VL+W FS+V+ DI LK L I
Sbjct: 350 KGYEVNTARVSDAIHYGCIPVIISNYYDLPFANVLDWSKFSVVINQRDIAFLKTKLLSIK 409
Query: 281 SEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQW 328
E YL + + KVR+HF WH P YD+FYM Y+LWLRRS++R+ +
Sbjct: 410 REMYLRMYHNLFKVRRHFVWHTTPRGYDSFYMTAYQLWLRRSTLRLSY 457
>gi|359473867|ref|XP_002268876.2| PREDICTED: probable glycosyltransferase At5g03795-like [Vitis
vinifera]
gi|296085575|emb|CBI29307.3| unnamed protein product [Vitis vinifera]
Length = 449
Score = 319 bits (818), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 161/341 (47%), Positives = 225/341 (65%), Gaps = 17/341 (4%)
Query: 1 MNRSFRVYVYP----HRRNDPFANVLLPVD--FEPR-GNYASESYFKKAFMKSHFVTKDP 53
M +++VYP ++ + PFA V LP F P+ NY SE FK A ++S +T P
Sbjct: 113 MLHKLKIFVYPDASMNQSSSPFARVFLPNPNPFHPKLANYFSEHMFKVALLRSSLLTPHP 172
Query: 54 SKADLFFLPFSIARMRHDRRIGTEG-IPDFISHYIFNISQKYPYWNRTGGADHFYVACHS 112
A FFLPFS+ +R+D R+ +E I DF++ Y IS +Y +WN + G DHFY+ CHS
Sbjct: 173 QDAHFFFLPFSVNTLRNDPRVHSEASISDFVTQYTTRISWEYKFWNASRGTDHFYICCHS 232
Query: 113 IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQED----PPKLGSSKR 168
+GR A K ++ NAIQV CSSSYF +I+HKDV LPQ+WPR + PP+L R
Sbjct: 233 VGREAASKHHDLHNNAIQVTCSSSYFQRLYISHKDVGLPQVWPRPPEKLLNPPEL----R 288
Query: 169 NKLAFFAGAV-NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNT 227
+KL FFAG V NS +R++L+ VW ND++I SG PY + L SK+CLHVKG+EVNT
Sbjct: 289 HKLVFFAGRVQNSHIRQELMAVWGNDTDIDLFSGSPPFPYEEGLRKSKYCLHVKGYEVNT 348
Query: 228 ARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLL 287
AR+ D+++YGC+PVI++N+YDLPF++VL+W FS++++ I LKKIL IS ++YL +
Sbjct: 349 ARVCDAIHYGCIPVIVSNYYDLPFSNVLDWSKFSVIISHKSIATLKKILLSISKQKYLSM 408
Query: 288 QSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQW 328
+ VR+HF WH P YD+F+M Y+LWLRR R+ +
Sbjct: 409 YQNLCLVRRHFAWHTTPRGYDSFHMTAYQLWLRRGVHRLSY 449
>gi|357138738|ref|XP_003570946.1| PREDICTED: probable glycosyltransferase At5g11130-like
[Brachypodium distachyon]
Length = 405
Score = 271 bits (692), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 148/332 (44%), Positives = 207/332 (62%), Gaps = 14/332 (4%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M RSF+VY+YP DP P G YASE YF + +S F T+DP ADLFF
Sbjct: 80 MERSFKVYIYPD--GDPKTFYQTPRKLT--GKYASEGYFFQNIRESRFRTEDPDSADLFF 135
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
+P S +MR + E + + Y+ + KYPYWNRT GADHF+V CH +G A +
Sbjct: 136 VPISPHKMR-GKGTSYENMTIIVKDYVEGLINKYPYWNRTLGADHFFVTCHDVGVRAFEG 194
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNK--LAFFAGAV 178
+ N+I+VVCS SY + I HKDV+LPQ+ + PK G+ N+ L F+AG
Sbjct: 195 LPFMVKNSIRVVCSPSYNVD-FIPHKDVALPQVL-QPFALPKGGNDVENRTNLGFWAGHR 252
Query: 179 NSPVREKLLQVWRNDSEIYAHSGRL-----KTPYADELLGSKFCLHVKGFEVNTARIADS 233
NS +R L +VW ND+E+ + R+ + Y + +KFC+ G +VN+ARI+DS
Sbjct: 253 NSKIRVILARVWENDTELAISNNRISRAIGELVYQKQFYRTKFCICPGGSQVNSARISDS 312
Query: 234 LYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK 293
++YGCVPVI++++YDLPF DVL+WK F+IV+ D+ LK ILK IS EE++ L + +++
Sbjct: 313 IHYGCVPVILSDYYDLPFNDVLDWKKFAIVLKERDVYELKSILKSISQEEFVALHNSLVQ 372
Query: 294 VRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
++KHF WH P YDAF+MVMYELWLR ++
Sbjct: 373 IQKHFVWHSPPIPYDAFHMVMYELWLRHHVIK 404
>gi|224084457|ref|XP_002307304.1| predicted protein [Populus trichocarpa]
gi|222856753|gb|EEE94300.1| predicted protein [Populus trichocarpa]
Length = 406
Score = 270 bits (690), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 142/331 (42%), Positives = 204/331 (61%), Gaps = 12/331 (3%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M R+F++++YP DP N + G YASE YF + +S F T+DP +A LFF
Sbjct: 81 MERNFKIFIYPD--GDP--NTFYQTPRKLTGKYASEGYFFQNIRESRFQTQDPDQAHLFF 136
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
+P S +MR + I E + + +Y+ ++ KYPYWNRT GADHF+V CH +G A +
Sbjct: 137 IPISCHKMR-GKGISYENMTIIVDNYVESLKSKYPYWNRTLGADHFFVTCHDVGVRATEG 195
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS-SKRNKLAFFAGAVN 179
+ NAI+VVCS SY + G I HKD++LPQ+ P KR L F+AG N
Sbjct: 196 VPFLIKNAIRVVCSPSYDV-GFIPHKDIALPQVLQPFALPAGGNDVEKRTTLGFWAGHRN 254
Query: 180 SPVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCLHVKGFEVNTARIADSL 234
S +R L +VW ND+E+ + R+ Y GSK+C+ G +VN+ARIADS+
Sbjct: 255 SRIRVILARVWENDTELDISNNRINRATGHLVYQKRFYGSKYCICPGGSQVNSARIADSI 314
Query: 235 YYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKV 294
+YGC+PVI++N+YDLPF D+L+W FS+++ D+ LK+ILK I + + L ++KV
Sbjct: 315 HYGCIPVILSNYYDLPFNDILDWHKFSVILKEQDVYRLKQILKDIPDNKLVSLHKNLVKV 374
Query: 295 RKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
+KHFQW+ P YDAF+MVMY+LWLR ++
Sbjct: 375 QKHFQWNSPPVKYDAFHMVMYDLWLRHHVIK 405
>gi|359495862|ref|XP_002266299.2| PREDICTED: probable glycosyltransferase At5g03795 [Vitis vinifera]
Length = 594
Score = 270 bits (690), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 146/332 (43%), Positives = 206/332 (62%), Gaps = 14/332 (4%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M ++F+VY+YP DP N + G YASE YF + S F T DP +A LFF
Sbjct: 269 MEKNFKVYIYPD--GDP--NTFYQTPRKLTGKYASEGYFFQNIRDSRFRTNDPDQAHLFF 324
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
+P S +MR + E + + +Y+ ++ KYPYWNRT GADHF+V CH +G A +
Sbjct: 325 IPISCHKMR-GKGTSYENMTVIVQNYVGSLISKYPYWNRTLGADHFFVTCHDVGVRATEG 383
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS-SKRNKLAFFAGAVN 179
+ N+I+VVCS SY + G I HKDV+LPQ+ P R L F+AG N
Sbjct: 384 VPFLVKNSIRVVCSPSYDV-GFIPHKDVALPQVLQPFALPAGGNDIENRTTLGFWAGHRN 442
Query: 180 SPVREKLLQVWRNDSEIYAHSGRLKTPYADELL------GSKFCLHVKGFEVNTARIADS 233
S +R L +VW ND+E+ + R+ A ELL +KFC+ G +VN+ARIADS
Sbjct: 443 SKIRVILARVWENDTELDIMNNRINRA-AGELLYQKRFYRTKFCICPGGSQVNSARIADS 501
Query: 234 LYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK 293
++YGCVPVI++N+YDLPF D+L+W+ FS+V+ LD+ LK+ILK I E++ L + +++
Sbjct: 502 IHYGCVPVILSNYYDLPFNDILDWRKFSVVLKELDVYQLKQILKDIPDAEFVALHNNLVQ 561
Query: 294 VRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
V+KHFQW+ P YDAF+M+MYELWLR ++
Sbjct: 562 VQKHFQWNSPPIRYDAFHMIMYELWLRHQVIK 593
>gi|296090519|emb|CBI40850.3| unnamed protein product [Vitis vinifera]
Length = 416
Score = 270 bits (690), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 146/332 (43%), Positives = 206/332 (62%), Gaps = 14/332 (4%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M ++F+VY+YP DP N + G YASE YF + S F T DP +A LFF
Sbjct: 91 MEKNFKVYIYPD--GDP--NTFYQTPRKLTGKYASEGYFFQNIRDSRFRTNDPDQAHLFF 146
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
+P S +MR + E + + +Y+ ++ KYPYWNRT GADHF+V CH +G A +
Sbjct: 147 IPISCHKMR-GKGTSYENMTVIVQNYVGSLISKYPYWNRTLGADHFFVTCHDVGVRATEG 205
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS-SKRNKLAFFAGAVN 179
+ N+I+VVCS SY + G I HKDV+LPQ+ P R L F+AG N
Sbjct: 206 VPFLVKNSIRVVCSPSYDV-GFIPHKDVALPQVLQPFALPAGGNDIENRTTLGFWAGHRN 264
Query: 180 SPVREKLLQVWRNDSEIYAHSGRLKTPYADELL------GSKFCLHVKGFEVNTARIADS 233
S +R L +VW ND+E+ + R+ A ELL +KFC+ G +VN+ARIADS
Sbjct: 265 SKIRVILARVWENDTELDIMNNRINRA-AGELLYQKRFYRTKFCICPGGSQVNSARIADS 323
Query: 234 LYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK 293
++YGCVPVI++N+YDLPF D+L+W+ FS+V+ LD+ LK+ILK I E++ L + +++
Sbjct: 324 IHYGCVPVILSNYYDLPFNDILDWRKFSVVLKELDVYQLKQILKDIPDAEFVALHNNLVQ 383
Query: 294 VRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
V+KHFQW+ P YDAF+M+MYELWLR ++
Sbjct: 384 VQKHFQWNSPPIRYDAFHMIMYELWLRHQVIK 415
>gi|255563758|ref|XP_002522880.1| catalytic, putative [Ricinus communis]
gi|223537865|gb|EEF39480.1| catalytic, putative [Ricinus communis]
Length = 406
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 142/331 (42%), Positives = 205/331 (61%), Gaps = 12/331 (3%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M R+F+VY+YP DP N + G YASE YF + +S F T DP +A LFF
Sbjct: 81 MERNFKVYIYPD--GDP--NTFYQTPRKLTGKYASEGYFFQNIRESKFRTNDPDQAHLFF 136
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
+P S +MR + E + + +Y+ +++ KYPYWNRT GADHF+V CH +G A +
Sbjct: 137 IPISCHKMR-GKGTSYENMTIIVQNYVESLAVKYPYWNRTLGADHFFVTCHDVGVRATEG 195
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS-SKRNKLAFFAGAVN 179
+ NAI+VVCS SY + G I HKDV+LPQ+ P R L F+AG N
Sbjct: 196 VPFLVKNAIRVVCSPSYDV-GFIPHKDVALPQVLQPFALPAGGNDLENRTTLGFWAGHRN 254
Query: 180 SPVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCLHVKGFEVNTARIADSL 234
S +R L +VW ND+E+ + R+ Y +KFC+ G +VN+ARIADS+
Sbjct: 255 SKIRVILARVWENDTELDISNNRINRATGHLVYQKRFYRTKFCICPGGSQVNSARIADSI 314
Query: 235 YYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKV 294
+YGCVPVI++++YDLPF D+L+WK FS+++ D+ LK++LK IS +E++ L +++V
Sbjct: 315 HYGCVPVILSDYYDLPFNDILDWKRFSVILKEKDVYRLKQVLKDISDDEFVALHENLVEV 374
Query: 295 RKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
+KHFQW+ P YDAF+MVM++LWLR ++
Sbjct: 375 QKHFQWNSPPIKYDAFHMVMFDLWLRHHVIK 405
>gi|356546741|ref|XP_003541781.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 410
Score = 268 bits (684), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 145/332 (43%), Positives = 202/332 (60%), Gaps = 13/332 (3%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M F+V+VYP DP P G YASE YF K +S F T DP +A LFF
Sbjct: 84 MEEEFKVFVYPD--GDPETYFHTPRKLT--GKYASEGYFFKNIRESRFFTDDPRRAHLFF 139
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
LP S +MR R + E + D + Y+ ++ +YPYWNRT GADHF+V CH IG A +
Sbjct: 140 LPISCHKMR-GRGLTNERMIDEVEKYVEHLKFEYPYWNRTLGADHFFVTCHDIGVKATKG 198
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK-RNKLAFFAGAVN 179
+ N+I+V+CSS Y G+I HKDV+LPQ+ PP K RN LAF+AG +
Sbjct: 199 VPHMMKNSIRVICSSRYDDDGYIPHKDVTLPQVQLPFFHPPGGNDIKNRNTLAFWAGRSD 258
Query: 180 SPVREKLLQVWRNDSEIYAHSGRLK------TPYADELLGSKFCLHVKGFEVNTARIADS 233
S ++E L+ +W ND+EI + R+ Y ++L SKFCL G + ++RIADS
Sbjct: 259 SRLKEDLIAIWDNDTEIDIQNSRVDLRATGPVVYMEKLYKSKFCLCPHG-PIGSSRIADS 317
Query: 234 LYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK 293
+++GCVPVI++ +YDLPF D+L+W FSIV+ D+ LK L+ IS + ++ L ++K
Sbjct: 318 IHFGCVPVIMSKYYDLPFNDILDWTQFSIVLKETDVYQLKYTLRSISEKHFITLNHNIVK 377
Query: 294 VRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
++KHF+W+ P DAF+MVMYELW RR +R
Sbjct: 378 IQKHFKWNTPPVRQDAFHMVMYELWRRRHLIR 409
>gi|297802132|ref|XP_002868950.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314786|gb|EFH45209.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 428
Score = 267 bits (682), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 140/331 (42%), Positives = 201/331 (60%), Gaps = 12/331 (3%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M + F+VY+YP DP N + G YASE YF + +S F T DP +ADLFF
Sbjct: 103 MEKRFKVYIYPD--GDP--NTFYQTPRKVTGKYASEGYFFQNIRESRFRTLDPEEADLFF 158
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
+P S +MR + E + + +Y+ + KYPYWNRT GADHF+V CH +G A +
Sbjct: 159 IPISCHKMR-GKGTSYENMTVIVQNYVDGLIAKYPYWNRTLGADHFFVTCHDVGVRAFEG 217
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS-SKRNKLAFFAGAVN 179
+ + N I+VVCS SY + G I HKDV+LPQ+ P R L F+AG N
Sbjct: 218 SLLLIKNTIRVVCSPSYNV-GFIPHKDVALPQVLQPFALPAGGNDVENRTTLGFWAGHRN 276
Query: 180 SPVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCLHVKGFEVNTARIADSL 234
S +R L +VW ND+E+ + R+ Y +KFC+ G +VN+ARI DS+
Sbjct: 277 SKIRVILARVWENDTELDISNNRINRATGHLVYQKRFYRTKFCICPGGSQVNSARITDSI 336
Query: 235 YYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKV 294
+YGC+PVI++++YDLPF D+LNW+ F++V+ D+ LK+ILK I E++ L + ++KV
Sbjct: 337 HYGCIPVILSDYYDLPFNDILNWRKFAVVLREQDVYNLKQILKNIPHSEFISLHNNLVKV 396
Query: 295 RKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
+KHFQW+ P +DAF+M+MYELWLR ++
Sbjct: 397 QKHFQWNSPPVKFDAFHMIMYELWLRHHVIK 427
>gi|449431924|ref|XP_004133750.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
sativus]
Length = 412
Score = 267 bits (682), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 143/331 (43%), Positives = 203/331 (61%), Gaps = 12/331 (3%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M F+VY+YP DP N + G YASE YF + +S F T+DP +A LFF
Sbjct: 87 MESKFKVYIYPD--GDP--NTFYQTPRKLTGKYASEGYFFQNIRESRFRTEDPDQAHLFF 142
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
+P S +MR + E + + +Y+ + KYPYWNRT GADHF+V CH +G A +
Sbjct: 143 IPISCHKMR-GKGTSYENMTVIVQNYVEGLISKYPYWNRTLGADHFFVTCHDVGVRASEG 201
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK-RNKLAFFAGAVN 179
+ NAI+VVCS SY + G I HKDV+LPQ+ P ++ R L F+AG N
Sbjct: 202 LPFLIKNAIRVVCSPSYDV-GFIPHKDVALPQVLQPFALPAGGNDTENRTTLGFWAGHRN 260
Query: 180 SPVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCLHVKGFEVNTARIADSL 234
S +R L +VW ND+E+ + R+ Y +KFC+ G +VN+ARIADS+
Sbjct: 261 SKIRVILARVWENDTELDISNNRISRATGHLLYQKRFYKTKFCICPGGSQVNSARIADSI 320
Query: 235 YYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKV 294
+YGCVPVI++++YDLPF D+L+W+ FS++V D+ LK+ILK IS E++ L +++V
Sbjct: 321 HYGCVPVILSDYYDLPFNDILDWRKFSVIVKERDVYQLKQILKDISDIEFIKLHKNLMQV 380
Query: 295 RKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
+KHFQW+ P YDAF+MVMY+LWLR ++
Sbjct: 381 QKHFQWNSPPIKYDAFHMVMYDLWLRHHVIK 411
>gi|115444695|ref|NP_001046127.1| Os02g0187200 [Oryza sativa Japonica Group]
gi|46390046|dbj|BAD15422.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
gi|46390077|dbj|BAD15452.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
gi|113535658|dbj|BAF08041.1| Os02g0187200 [Oryza sativa Japonica Group]
gi|215694561|dbj|BAG89554.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 402
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 145/332 (43%), Positives = 205/332 (61%), Gaps = 14/332 (4%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M RSF+VY+YP DP P G YASE YF + +S F T DP KA LFF
Sbjct: 77 MERSFKVYMYPD--GDPKTFYQTPRKLT--GKYASEGYFFQNIRESRFRTGDPDKAHLFF 132
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
+P S +MR + E + + Y+ + KYPYWNRT GADHF+V CH +G A +
Sbjct: 133 VPISPHKMR-GKGTSYENMTIIVKDYVEGLINKYPYWNRTLGADHFFVTCHDVGVRAFEG 191
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNK--LAFFAGAV 178
+ N+I+VVCS SY + I HKD++LPQ+ + P+ G+ N+ L F+AG
Sbjct: 192 LPFMVKNSIRVVCSPSYNVD-FIPHKDIALPQVL-QPFALPEGGNDVENRTILGFWAGHR 249
Query: 179 NSPVREKLLQVWRNDSEIYAHSGRL-----KTPYADELLGSKFCLHVKGFEVNTARIADS 233
NS +R L QVW ND+E+ + R+ + Y + +KFC+ G +VN+ARI+DS
Sbjct: 250 NSKIRVILAQVWENDTELAISNNRISRAIGELVYQKQFYHTKFCICPGGSQVNSARISDS 309
Query: 234 LYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK 293
++YGCVPVI++++YDLPF D+L+W+ F++V+ D+ LK ILK IS EE++ L +++
Sbjct: 310 IHYGCVPVILSDYYDLPFNDILDWRKFAVVLKERDVYQLKSILKSISQEEFVELHKSLVQ 369
Query: 294 VRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
V+KHF WH P YDAF+MVMYELWLR ++
Sbjct: 370 VQKHFVWHSPPLPYDAFHMVMYELWLRHHVIK 401
>gi|15233650|ref|NP_195517.1| Exostosin family protein [Arabidopsis thaliana]
gi|4467110|emb|CAB37544.1| putative protein [Arabidopsis thaliana]
gi|7270787|emb|CAB80469.1| putative protein [Arabidopsis thaliana]
gi|15293111|gb|AAK93666.1| unknown protein [Arabidopsis thaliana]
gi|21280961|gb|AAM45007.1| unknown protein [Arabidopsis thaliana]
gi|332661466|gb|AEE86866.1| Exostosin family protein [Arabidopsis thaliana]
Length = 425
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 141/331 (42%), Positives = 200/331 (60%), Gaps = 12/331 (3%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M + F+VY+YP DP N + G YASE YF + +S F T DP +ADLFF
Sbjct: 100 MEKRFKVYIYPD--GDP--NTFYQTPRKVTGKYASEGYFFQNIRESRFRTLDPDEADLFF 155
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
+P S +MR + E + + +Y+ + KYPYWNRT GADHF+V CH +G A +
Sbjct: 156 IPISCHKMR-GKGTSYENMTVIVQNYVDGLIAKYPYWNRTLGADHFFVTCHDVGVRAFEG 214
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS-SKRNKLAFFAGAVN 179
+ + N I+VVCS SY + G I HKDV+LPQ+ P R L F+AG N
Sbjct: 215 SPLLIKNTIRVVCSPSYNV-GFIPHKDVALPQVLQPFALPAGGNDVENRTTLGFWAGHRN 273
Query: 180 SPVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCLHVKGFEVNTARIADSL 234
S +R L VW ND+E+ + R+ Y +KFC+ G +VN+ARI DS+
Sbjct: 274 SKIRVILAHVWENDTELDISNNRINRATGHLVYQKRFYRTKFCICPGGSQVNSARITDSI 333
Query: 235 YYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKV 294
+YGC+PVI++++YDLPF D+LNW+ F++V+ D+ LK+ILK I E++ L + ++KV
Sbjct: 334 HYGCIPVILSDYYDLPFNDILNWRKFAVVLREQDVYNLKQILKNIPHSEFVSLHNNLVKV 393
Query: 295 RKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
+KHFQW+ P +DAF+M+MYELWLR V+
Sbjct: 394 QKHFQWNSPPVKFDAFHMIMYELWLRHHVVK 424
>gi|356496874|ref|XP_003517290.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 404
Score = 265 bits (677), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 139/331 (41%), Positives = 203/331 (61%), Gaps = 12/331 (3%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M + F+VY+YP DP N + G YASE YF + +S F T++P +A LFF
Sbjct: 79 MEKKFKVYIYPD--GDP--NTFYQTPRKLTGKYASEGYFFQNIRESRFCTENPDEAHLFF 134
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
+P S +MR + E + + +Y+ ++ KYPYWNRT GADHF+V CH +G A +
Sbjct: 135 IPISCHKMR-GKGTSYENMTIIVQNYVESLISKYPYWNRTLGADHFFVTCHDVGVRATEG 193
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS-SKRNKLAFFAGAVN 179
+ N+I+ VCS SY + G I HKDV+LPQ+ P R L F+AG N
Sbjct: 194 LEFLVKNSIRAVCSPSYDV-GFIPHKDVALPQVLQPFALPAGGNDIENRTTLGFWAGHRN 252
Query: 180 SPVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCLHVKGFEVNTARIADSL 234
S +R L +VW ND+E+ + R+ Y SKFC+ G +VN+ARIADS+
Sbjct: 253 SKIRVILARVWENDTELDISNNRISRATGHLVYQKRFYRSKFCICPGGSQVNSARIADSI 312
Query: 235 YYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKV 294
+YGC+PVI++N+YDLPF D+L+W F++V+ D+ LK+ILK IS E++ L + ++KV
Sbjct: 313 HYGCIPVILSNYYDLPFNDILDWNKFAVVLKESDVYQLKQILKNISDAEFVTLHNNLVKV 372
Query: 295 RKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
+KHFQW+ P +DAF++VMY+LWLR +++
Sbjct: 373 QKHFQWNSPPIRFDAFHLVMYDLWLRHHTIK 403
>gi|356557595|ref|XP_003547101.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
At5g03795-like [Glycine max]
Length = 412
Score = 265 bits (677), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 143/332 (43%), Positives = 201/332 (60%), Gaps = 13/332 (3%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M F+++VYP DP P G YASE YF K +S F T DP +A LFF
Sbjct: 86 MEEEFKIFVYPD--GDPETYFHTPRKLT--GKYASEGYFFKNIRESRFFTDDPRRAHLFF 141
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
LP S +MR R + E + D + Y+ ++ KYPYWNRT GADHF+V CH IG A +
Sbjct: 142 LPISCHKMR-GRGLTIERMIDEVEKYVEHLKLKYPYWNRTLGADHFFVTCHDIGVKATKG 200
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK-RNKLAFFAGAVN 179
+ N+I+V CSSSY ++ HKDV+LPQ+ PP K RN AF+AG +
Sbjct: 201 VPHLTKNSIRVACSSSYDDDDYVPHKDVTLPQVQLPFFHPPGENDIKNRNTFAFWAGRSD 260
Query: 180 SPVREKLLQVWRNDSEIYAHSGRLK------TPYADELLGSKFCLHVKGFEVNTARIADS 233
S +++ L+ +W ND+E+ + R+ Y ++L SKFCL G V + IADS
Sbjct: 261 SRLKDDLMAMWDNDTELDIQNXRVDLRATGPVVYMEKLYKSKFCLCPHG-PVGNSLIADS 319
Query: 234 LYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK 293
+++GCVPVI+ N+YDLPF D+L+W FS+V+ +I LLK IL+ IS + ++ L ++K
Sbjct: 320 IHFGCVPVIMPNYYDLPFNDILDWSQFSVVLKETNIYLLKDILRSISEKHFISLNRNIVK 379
Query: 294 VRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
++KHF+W+ P DAF+MVMYE+WLRR +R
Sbjct: 380 IQKHFKWNTPPVRQDAFHMVMYEIWLRRHLIR 411
>gi|357485421|ref|XP_003612998.1| Exostosin-like protein [Medicago truncatula]
gi|355514333|gb|AES95956.1| Exostosin-like protein [Medicago truncatula]
Length = 415
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 139/334 (41%), Positives = 203/334 (60%), Gaps = 18/334 (5%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPR---GNYASESYFKKAFMKSHFVTKDPSKAD 57
M + F+VY+YP + F PR G YASE YF + +S F T DP +A
Sbjct: 90 MEKKFKVYIYPDGDSKTFYQT-------PRKLTGKYASEGYFFQNIRESRFRTLDPDEAH 142
Query: 58 LFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSA 117
LFF+P S +MR + E + + +Y+ ++ KYPYWNRT GADHF+V CH +G A
Sbjct: 143 LFFIPISCHKMR-GKGTSYENMTIIVQNYVESLISKYPYWNRTLGADHFFVTCHDVGVRA 201
Query: 118 MQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS-SKRNKLAFFAG 176
+ + N+I+ VCS SY + G I HKDV+LPQ+ P R L F+AG
Sbjct: 202 TEGLPLLVKNSIRAVCSPSYDV-GFIPHKDVALPQVLQPFALPAGGNDVENRTSLGFWAG 260
Query: 177 AVNSPVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCLHVKGFEVNTARIA 231
NS +R L +VW ND+E+ + R+ Y +KFC+ G +VN+ARIA
Sbjct: 261 HRNSKIRVILARVWENDTELDISNNRISRATGHLVYQKRFYSTKFCICPGGSQVNSARIA 320
Query: 232 DSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYV 291
DS++YGC+PVI++N+YDLPF D+L+W+ F++V+ D+ LK+ILK S +E++ L + +
Sbjct: 321 DSIHYGCIPVILSNYYDLPFNDILDWRKFAVVLKESDVYQLKQILKNKSQDEFVALHNNL 380
Query: 292 LKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
+K++KHFQW+ P YDAF+MVMY+LWLR +++
Sbjct: 381 VKIQKHFQWNSPPVRYDAFHMVMYDLWLRHHTIQ 414
>gi|296088784|emb|CBI38234.3| unnamed protein product [Vitis vinifera]
Length = 326
Score = 264 bits (674), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 141/331 (42%), Positives = 201/331 (60%), Gaps = 12/331 (3%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M F+V+VYP DP P G YASE YF + +S F T DP +A LFF
Sbjct: 1 MEMKFKVFVYPD--GDPNTYYQTPRKLT--GKYASEGYFFQNIRQSQFRTDDPDQAHLFF 56
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
+P S +MR + E + + +Y+ ++ KYPYWNRT GADHF++ CH +G A +
Sbjct: 57 IPISCHKMR-GKGTSYENMTIIVQNYVQSLMSKYPYWNRTLGADHFFLTCHDVGVRATEG 115
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK-RNKLAFFAGAVN 179
+ N+I+VVCS SY + G I HKDV+LPQ+ P K R L F+AG N
Sbjct: 116 VPLLVKNSIRVVCSPSYDV-GFIPHKDVALPQVLQPFALPTGGRDIKNRTTLGFWAGHRN 174
Query: 180 SPVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCLHVKGFEVNTARIADSL 234
S +R L ++W ND+E+ + R+ Y ++ +KFC+ G +VN+ARIADS+
Sbjct: 175 SKIRVILARIWENDTELDIKNNRINRATGHLVYQNKFYRTKFCICPGGSQVNSARIADSI 234
Query: 235 YYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKV 294
+YGCVPVI++++YDLPF D+L+W+ FS+++ D+ LK ILK I E++ L ++KV
Sbjct: 235 HYGCVPVILSDYYDLPFNDILDWRKFSVILKERDVYRLKYILKDIPDAEFIALHDNLVKV 294
Query: 295 RKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
+KHFQW+ P YDAF+MVMYELWLR ++
Sbjct: 295 QKHFQWNTPPIKYDAFHMVMYELWLRHHVIK 325
>gi|168019297|ref|XP_001762181.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686585|gb|EDQ72973.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 344
Score = 264 bits (674), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 149/334 (44%), Positives = 203/334 (60%), Gaps = 17/334 (5%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M F+VYVYP DP P G Y+SE YF + +S FVT D + ADLFF
Sbjct: 18 MEEKFKVYVYPD--GDPETYYQTPRKLT--GKYSSEGYFFQNLRESRFVTNDSAAADLFF 73
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
LP S +MR + + E + D + Y+ ++ KYP+WNRT GADHF+V CH +G A K
Sbjct: 74 LPVSCHKMR-GKGLSYEKMADIVRAYVESLIIKYPFWNRTVGADHFFVTCHDVGVRATAK 132
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSS---KRNKLAFFAGA 177
+ N+I+VVCS SY S I HKDV+LPQ+ Q P G R L F+AG
Sbjct: 133 VEHLVKNSIRVVCSPSYNGS-FIPHKDVALPQVL--QPFPLPAGGDDIHNRTVLGFWAGH 189
Query: 178 VNSPVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCLHVKGFEVNTARIAD 232
NS VR L W+ D ++ + RL Y ++ SKFC+ G +VN+ARIA+
Sbjct: 190 RNSKVRVNLADAWQYDPILFVANNRLNRSTGDYIYQNQFYRSKFCICPAGSQVNSARIAE 249
Query: 233 SLYYGCVP-VIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYV 291
S++YGCVP VI+A+ YDLPF D+L+W+ FS+VV + LKKIL+ ++ ++Y +L + V
Sbjct: 250 SIHYGCVPEVIMADFYDLPFNDILDWRKFSLVVREREYDNLKKILQAVTVQKYRMLHAGV 309
Query: 292 LKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
+VR+HF+WH P YDAF+MVMYELWLRR ++R
Sbjct: 310 RQVRRHFEWHSPPIKYDAFHMVMYELWLRRFTIR 343
>gi|225464075|ref|XP_002268836.1| PREDICTED: probable glycosyltransferase At5g03795-like [Vitis
vinifera]
Length = 417
Score = 263 bits (673), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 141/331 (42%), Positives = 201/331 (60%), Gaps = 12/331 (3%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M F+V+VYP DP P G YASE YF + +S F T DP +A LFF
Sbjct: 92 MEMKFKVFVYPD--GDPNTYYQTPRKLT--GKYASEGYFFQNIRQSQFRTDDPDQAHLFF 147
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
+P S +MR + E + + +Y+ ++ KYPYWNRT GADHF++ CH +G A +
Sbjct: 148 IPISCHKMR-GKGTSYENMTIIVQNYVQSLMSKYPYWNRTLGADHFFLTCHDVGVRATEG 206
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK-RNKLAFFAGAVN 179
+ N+I+VVCS SY + G I HKDV+LPQ+ P K R L F+AG N
Sbjct: 207 VPLLVKNSIRVVCSPSYDV-GFIPHKDVALPQVLQPFALPTGGRDIKNRTTLGFWAGHRN 265
Query: 180 SPVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCLHVKGFEVNTARIADSL 234
S +R L ++W ND+E+ + R+ Y ++ +KFC+ G +VN+ARIADS+
Sbjct: 266 SKIRVILARIWENDTELDIKNNRINRATGHLVYQNKFYRTKFCICPGGSQVNSARIADSI 325
Query: 235 YYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKV 294
+YGCVPVI++++YDLPF D+L+W+ FS+++ D+ LK ILK I E++ L ++KV
Sbjct: 326 HYGCVPVILSDYYDLPFNDILDWRKFSVILKERDVYRLKYILKDIPDAEFIALHDNLVKV 385
Query: 295 RKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
+KHFQW+ P YDAF+MVMYELWLR ++
Sbjct: 386 QKHFQWNTPPIKYDAFHMVMYELWLRHHVIK 416
>gi|224071347|ref|XP_002303415.1| predicted protein [Populus trichocarpa]
gi|222840847|gb|EEE78394.1| predicted protein [Populus trichocarpa]
Length = 326
Score = 263 bits (673), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 142/331 (42%), Positives = 200/331 (60%), Gaps = 12/331 (3%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M +F++Y+YP DP N + G YASE YF + +S F T DP +A LFF
Sbjct: 1 MEHNFKIYIYPD--GDP--NTFYQTPRKLTGKYASEGYFFQNIRESRFRTLDPDQAHLFF 56
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
+P S +MR + E + + +Y+ ++ KY YWNRT GADHF+V CH +G A +
Sbjct: 57 IPISCHKMR-GKGTSYENMTVIVENYVESLIAKYSYWNRTLGADHFFVTCHDVGVRATEG 115
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS-SKRNKLAFFAGAVN 179
+ NAI+VVCS SY + G I HKDV+LPQ+ P R L F+AG N
Sbjct: 116 VPFLIKNAIRVVCSPSYDV-GFIPHKDVALPQVLQPFALPAGGNDVENRTTLGFWAGHRN 174
Query: 180 SPVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCLHVKGFEVNTARIADSL 234
S +R L +VW ND+E+ S R+ Y G+KFC+ G +VN+ARIADS+
Sbjct: 175 SRIRVILARVWENDTELDISSNRINRATGHLVYQKRFYGTKFCICPGGSQVNSARIADSI 234
Query: 235 YYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKV 294
+YGCVPVI++N+YDLPF D+L+W FS+++ D+ LK+ILK I +++ L ++KV
Sbjct: 235 HYGCVPVILSNYYDLPFNDILDWHKFSVILKEQDVYQLKQILKDIPDNQFVSLHKNLIKV 294
Query: 295 RKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
+KHFQW+ P DAF+MVMY+LWLRR ++
Sbjct: 295 QKHFQWNSPPIKNDAFHMVMYDLWLRRHVIK 325
>gi|242064274|ref|XP_002453426.1| hypothetical protein SORBIDRAFT_04g005890 [Sorghum bicolor]
gi|241933257|gb|EES06402.1| hypothetical protein SORBIDRAFT_04g005890 [Sorghum bicolor]
Length = 403
Score = 263 bits (673), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 142/332 (42%), Positives = 204/332 (61%), Gaps = 14/332 (4%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M RSF+VY+YP DP P G YASE YF + +S F T DP +A LFF
Sbjct: 78 MERSFKVYIYPD--GDPKTFYQTPRKLT--GKYASEGYFFQNIRESRFRTDDPDQAHLFF 133
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
+P S +MR + E + + Y+ + KYPYWNRT GADHF+V CH +G A +
Sbjct: 134 VPISPHKMR-GKGTTYENMTVIVKDYVEGLINKYPYWNRTLGADHFFVTCHDVGVRAFEG 192
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNK--LAFFAGAV 178
+ N+I+VVCS SY + I HKD++LPQ+ + P+ G+ N+ L F+AG
Sbjct: 193 LPFMVKNSIRVVCSPSYNVD-FIPHKDIALPQVL-QPFALPEGGNDIENRTILGFWAGHR 250
Query: 179 NSPVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCLHVKGFEVNTARIADS 233
NS +R L +VW ND+E+ + R+ Y + +KFC+ G +VN+ARI+DS
Sbjct: 251 NSKIRVILARVWENDTELAISNNRISRAIGELVYQKQFYRTKFCICPGGSQVNSARISDS 310
Query: 234 LYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK 293
++YGCVPVI++++YDLPF D L+W+ F++++ D+ LK ILK IS EE++ L +++
Sbjct: 311 IHYGCVPVILSDYYDLPFNDALDWRKFAVILRERDVYQLKNILKSISQEEFVSLHKSLVQ 370
Query: 294 VRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
V+KHF WH P YDAF+MVMYELWLR + ++
Sbjct: 371 VQKHFVWHSPPVSYDAFHMVMYELWLRHNVIK 402
>gi|212721192|ref|NP_001132213.1| exostosin-like protein [Zea mays]
gi|194693780|gb|ACF80974.1| unknown [Zea mays]
gi|413935957|gb|AFW70508.1| exostosin-like protein [Zea mays]
Length = 403
Score = 263 bits (672), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 141/332 (42%), Positives = 204/332 (61%), Gaps = 14/332 (4%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M RSF+VY+YP DP P G YASE YF + +S F T DP +A LFF
Sbjct: 78 MERSFKVYIYPD--GDPKTFYQTPRKLT--GKYASEGYFFQNIRESRFRTDDPDQAHLFF 133
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
+P S +MR + E + + Y+ + KYPYWNRT GADHF+V CH +G A +
Sbjct: 134 VPISPHKMR-GKGTSYENMTVIVKDYVEGLINKYPYWNRTLGADHFFVTCHDVGVRAFEG 192
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNK--LAFFAGAV 178
+ N+I+VVCS SY + I HKD++LPQ+ + P+ G+ N+ L F+AG
Sbjct: 193 LPFMVKNSIRVVCSPSYNVD-FIPHKDIALPQVL-QPFALPEGGNDIENRTTLGFWAGHR 250
Query: 179 NSPVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCLHVKGFEVNTARIADS 233
NS +R L +VW ND+E+ + R+ Y + +KFC+ G +VN+ARI+DS
Sbjct: 251 NSKIRVILARVWENDTELAISNNRISRAIGELVYQKQFYRTKFCICPGGSQVNSARISDS 310
Query: 234 LYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK 293
++YGCVPVI++++YDLPF D L+W+ F++++ D+ LK ILK IS EE++ L +++
Sbjct: 311 IHYGCVPVILSDYYDLPFNDALDWRKFAVILRERDVYQLKNILKSISQEEFVSLHKSLVQ 370
Query: 294 VRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
V+KHF WH P YDAF+M+MYELWLR + ++
Sbjct: 371 VQKHFVWHSPPVSYDAFHMIMYELWLRHNVIK 402
>gi|195624236|gb|ACG33948.1| exostosin-like [Zea mays]
Length = 403
Score = 261 bits (668), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 140/332 (42%), Positives = 202/332 (60%), Gaps = 14/332 (4%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M R F+VY+YP DP P G YASE YF + +S F T DP +A LFF
Sbjct: 78 MERXFKVYIYPD--GDPKTFYQTPRKLT--GKYASEGYFFQNIRESRFRTDDPDQAHLFF 133
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
+P S +MR + E + + Y+ + KYPYWNRT GADHF+V CH +G A +
Sbjct: 134 VPISPHKMR-GKGTSYENMTVIVKDYVEGLINKYPYWNRTLGADHFFVTCHDVGVRAFEG 192
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS--SKRNKLAFFAGAV 178
+ N+I+VVCS SY + I HKD++LPQ+ + P+ G+ R L F+AG
Sbjct: 193 LPFMVKNSIRVVCSPSYNVD-FIPHKDIALPQVL-QPFALPEGGNDIENRTTLGFWAGHR 250
Query: 179 NSPVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCLHVKGFEVNTARIADS 233
NS +R L +VW ND+E+ + R+ Y + +KFC+ G +VN+ARI+DS
Sbjct: 251 NSKIRVILARVWENDTELAISNNRISRAIGELVYQKQFYRTKFCICPGGSQVNSARISDS 310
Query: 234 LYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK 293
++YGCVPVI++++YDLPF D L+W+ F++++ D+ LK ILK IS EE++ L +++
Sbjct: 311 IHYGCVPVILSDYYDLPFNDALDWRKFAVILRERDVYQLKNILKSISQEEFVSLHKSLVQ 370
Query: 294 VRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
V+KHF WH P YDAF+M+MYELWLR + ++
Sbjct: 371 VQKHFVWHSPPVSYDAFHMIMYELWLRHNVIK 402
>gi|226498050|ref|NP_001147269.1| exostosin-like [Zea mays]
gi|195609318|gb|ACG26489.1| exostosin-like [Zea mays]
gi|413926327|gb|AFW66259.1| exostosin-like protein [Zea mays]
Length = 403
Score = 261 bits (667), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 141/331 (42%), Positives = 200/331 (60%), Gaps = 12/331 (3%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M RSF+VY+YP DP P G YASE YF + +S F T DP +A LFF
Sbjct: 78 MERSFKVYIYPD--GDPKTFYQTPRKLT--GKYASEGYFFQNIRESRFRTDDPDQAHLFF 133
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
+P S +MR + E + + Y+ + KYPYWNRT GADHF+V CH +G A +
Sbjct: 134 VPISPHKMR-GKGTSYENMTVIVKDYVEGLINKYPYWNRTLGADHFFVTCHDVGVRAFEG 192
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW-PRQEDPPKLGSSKRNKLAFFAGAVN 179
+ N+I+VVCS SY + I HKD++LPQ+ P R L F+AG N
Sbjct: 193 LKFMVKNSIRVVCSPSYNVD-FIPHKDIALPQVLQPFALHEGGNDIDNRVILGFWAGHRN 251
Query: 180 SPVREKLLQVWRNDSEIYAHSGRL-----KTPYADELLGSKFCLHVKGFEVNTARIADSL 234
S +R L +VW ND+E+ + R+ + Y + +KFC+ G +VN+ARI+DS+
Sbjct: 252 SKIRVILARVWENDTELAISNNRISRAIGELVYQKQFYRTKFCICPGGSQVNSARISDSI 311
Query: 235 YYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKV 294
+YGCVPVI++++YDLPF D L+W+ F++++ D+ LK ILK IS EE++ L +++V
Sbjct: 312 HYGCVPVILSDYYDLPFNDALDWRKFAVILRERDVYQLKNILKSISQEEFISLHKSLVQV 371
Query: 295 RKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
+KHF WH P YDAF+MVMYELWLR + ++
Sbjct: 372 QKHFVWHSPPVSYDAFHMVMYELWLRHNVIK 402
>gi|125581086|gb|EAZ22017.1| hypothetical protein OsJ_05674 [Oryza sativa Japonica Group]
Length = 408
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 145/338 (42%), Positives = 204/338 (60%), Gaps = 20/338 (5%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M RSF+VY+YP DP P G YASE YF + +S F T DP KA LFF
Sbjct: 77 MERSFKVYMYPD--GDPKTFYQTPRKLT--GKYASEGYFFQNIRESRFRTGDPDKAHLFF 132
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
+P S +MR + E + + Y+ + KYPYWNRT GADHF+V CH +G A +
Sbjct: 133 VPISPHKMR-GKGTSYENMTIIVKDYVEGLINKYPYWNRTLGADHFFVTCHDVGVRAFEG 191
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNK--LAFFAGAV 178
+ N+I+VVCS SY + I HKD++LPQ+ + P+ G+ N+ L F+AG
Sbjct: 192 LPFMVKNSIRVVCSPSYNVD-FIPHKDIALPQVL-QPFALPEGGNDVENRTILGFWAGHR 249
Query: 179 NSPVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCLHVKGFEVNTARIADS 233
NS +R L QVW ND+E+ + R+ Y + +KFC+ G +VN+ARI+DS
Sbjct: 250 NSKIRVILAQVWENDTELAISNNRISRAIGELVYQKQFYHTKFCICPGGSQVNSARISDS 309
Query: 234 LYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVL- 292
++YGCVPVI++++YDLPF D+L+W+ F++V+ D+ LK ILK IS EE++ L ++
Sbjct: 310 IHYGCVPVILSDYYDLPFNDILDWRKFAVVLKERDVYQLKSILKSISQEEFVELHKSLVQ 369
Query: 293 -----KVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
+V+KHF WH P YDAF+MVMYELWLR ++
Sbjct: 370 NISCPQVQKHFVWHSPPLPYDAFHMVMYELWLRHHVIK 407
>gi|363808264|ref|NP_001242494.1| uncharacterized protein LOC100777368 [Glycine max]
gi|255635742|gb|ACU18220.1| unknown [Glycine max]
Length = 409
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 137/331 (41%), Positives = 201/331 (60%), Gaps = 12/331 (3%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M + F+VY+YP DP N + G YASE YF + S F T++P +A LFF
Sbjct: 84 MEKKFKVYIYPD--GDP--NTFYQTPRKLTGKYASEGYFFQNIRDSRFRTENPDEAHLFF 139
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
+P S +MR + E + + +Y+ ++ KYPYWNRT GADHF+V CH +G A +
Sbjct: 140 IPISCHKMR-GKGTSYENMTIIVQNYVESLISKYPYWNRTLGADHFFVTCHDVGVRATEG 198
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS-SKRNKLAFFAGAVN 179
+ N+I+ VCS SY + G I HKDV+LPQ+ P R L F+AG N
Sbjct: 199 LEFLVKNSIRAVCSPSYDV-GFIPHKDVALPQVLQPFALPAGGNDIENRTTLGFWAGHRN 257
Query: 180 SPVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCLHVKGFEVNTARIADSL 234
S +R L +VW ND+E+ + R+ Y SKFC+ G +VN+ARIADS+
Sbjct: 258 SKIRVILARVWENDTELDISNNRISRATGHLVYQKRFYRSKFCICPGGSQVNSARIADSI 317
Query: 235 YYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKV 294
+YGC+PV+++N+YDLPF D+L+W F++V+ D+ LK+ILK IS E++ L + ++KV
Sbjct: 318 HYGCIPVMLSNYYDLPFNDILDWNKFAVVLKESDVYQLKQILKNISDAEFVTLHNNLVKV 377
Query: 295 RKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
+KHFQW+ +DAF++VMY+LWLR +++
Sbjct: 378 QKHFQWNSPSIRFDAFHLVMYDLWLRHHTIK 408
>gi|125538390|gb|EAY84785.1| hypothetical protein OsI_06153 [Oryza sativa Indica Group]
Length = 332
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 145/338 (42%), Positives = 205/338 (60%), Gaps = 20/338 (5%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M RSF+VY+YP DP P G YASE YF + +S F T DP KA LFF
Sbjct: 1 MERSFKVYMYPD--GDPKTFYQTPRKLT--GKYASEGYFFQNIRESRFRTGDPDKAHLFF 56
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
+P S +MR + E + + Y+ + KYPYWNRT GADHF+V CH +G A +
Sbjct: 57 VPISPHKMR-GKGTSYENMTIIVKDYVEGLINKYPYWNRTLGADHFFVTCHDVGVRAFEG 115
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNK--LAFFAGAV 178
+ N+I+VVCS SY + I HKD++LPQ+ + P+ G+ N+ L F+AG
Sbjct: 116 LPFMVKNSIRVVCSPSYNVD-FIPHKDIALPQVL-QPFALPEGGNDVENRTILGFWAGHR 173
Query: 179 NSPVREKLLQVWRNDSEIYAHSGRL-----KTPYADELLGSKFCLHVKGFEVNTARIADS 233
NS +R L QVW ND+E+ + R+ + Y + +KFC+ G +VN+ARI+DS
Sbjct: 174 NSKIRVILAQVWENDTELAISNNRISRAIGELVYQKQFYRTKFCICPGGSQVNSARISDS 233
Query: 234 LYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVL- 292
++YGCVPVI++++YDLPF D+L+W+ F++V+ D+ LK ILK IS EE++ L ++
Sbjct: 234 IHYGCVPVILSDYYDLPFNDILDWRKFAVVLKERDVYQLKSILKSISQEEFVELHKSLVQ 293
Query: 293 -----KVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
+V+KHF WH P YDAF+MVMYELWLR ++
Sbjct: 294 NISCPQVQKHFVWHSPPLPYDAFHMVMYELWLRHHVIK 331
>gi|358343604|ref|XP_003635890.1| Exostosin-like protein [Medicago truncatula]
gi|358344094|ref|XP_003636128.1| Exostosin-like protein [Medicago truncatula]
gi|355501825|gb|AES83028.1| Exostosin-like protein [Medicago truncatula]
gi|355502063|gb|AES83266.1| Exostosin-like protein [Medicago truncatula]
Length = 435
Score = 253 bits (647), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 137/329 (41%), Positives = 199/329 (60%), Gaps = 24/329 (7%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPR---GNYASESYFKKAFMKSHFVTKDPSKAD 57
M + F+VY+YP + F PR G YASE YF + +S F T DP +A
Sbjct: 126 MEKKFKVYIYPDGDSKTFYQT-------PRKLTGKYASEGYFFQNIRESRFRTLDPDQAH 178
Query: 58 LFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSA 117
LFF+P S +MR + + + + +Y+ ++ KYPYWNRT GADHF+V CH +G A
Sbjct: 179 LFFIPISCHKMR-GKGTSYDNMTIIVQNYVESLISKYPYWNRTLGADHFFVTCHDVGVRA 237
Query: 118 MQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS-SKRNKLAFFAG 176
+ + N+I+ VCS SY + G I HKDV+LPQ+ P R L F+AG
Sbjct: 238 TEGLPLLVKNSIRAVCSPSYDV-GFIPHKDVALPQVLQPFALPAGGNDVENRTSLGFWAG 296
Query: 177 AVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYY 236
NS +R L +VW ND+E+ + R+ T KFC+ G +VN+ARIADS++Y
Sbjct: 297 HRNSKIRVILARVWENDTELDISNNRIYT---------KFCICPGGSQVNSARIADSIHY 347
Query: 237 GCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRK 296
GC+PVI++N+YDLPF D+L+W+ F++V D+ LK+ILK S +E++ L + + ++K
Sbjct: 348 GCIPVILSNYYDLPFNDILDWRKFAVVHNESDVYQLKQILKNKSQDEFIALHNNL--IQK 405
Query: 297 HFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
HFQW+ P YDAF+MVMY+LWLR +++
Sbjct: 406 HFQWNSPPVRYDAFHMVMYDLWLRHHTIQ 434
>gi|302823230|ref|XP_002993269.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300138939|gb|EFJ05690.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 353
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 142/335 (42%), Positives = 197/335 (58%), Gaps = 33/335 (9%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M + +V+VYP DP L G YASE YF + M+S FVT DP KA LFF
Sbjct: 41 MEQQLKVFVYP----DPVVYTKLA------GKYASEGYFFRNLMESRFVTTDPEKAQLFF 90
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
+P S AR+R + + + I D ++ ++ ++ K+PYWNRT GADHF+V CH IG A K
Sbjct: 91 VPISCARLR-EEGLDHDEISDNVASFVESVIAKFPYWNRTMGADHFFVTCHEIGTRATSK 149
Query: 121 AWEVKLNAIQVVCSSSYFISG-HIAHKDVSLPQIW-----PRQEDPPKLGSSKRNKLAFF 174
+ N+I+VVC+SSY SG I HKDV+LPQI PR D + KR L F+
Sbjct: 150 VAHLVKNSIRVVCASSY--SGPFIPHKDVALPQILQPFPSPRGGD----DTEKRETLGFW 203
Query: 175 AGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSL 234
AG NS R L + W+ DS++ + + + SKFC+ G V+TARI +S+
Sbjct: 204 AGPANSKTRILLTKTWQEDSDMVISTKHVGM---QQFYRSKFCICPSGTRVSTARIVESI 260
Query: 235 YYGCVP----VIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSY 290
++GCVP +I+++HYDLPF DVL+W+ F++++ D LK L+ Y L
Sbjct: 261 HFGCVPGKSFLILSDHYDLPFNDVLDWRKFAVILPEQDAGTLKDALE---LAPYATLHRN 317
Query: 291 VLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
+L+ + HF+WH P YD F+MVMYELWLR S+VR
Sbjct: 318 LLQAQAHFEWHSPPIKYDTFHMVMYELWLRHSTVR 352
>gi|302820946|ref|XP_002992138.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300140064|gb|EFJ06793.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 342
Score = 251 bits (640), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 140/345 (40%), Positives = 205/345 (59%), Gaps = 26/345 (7%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M + F++++YP DP N + G YASE YF + +S FVTK+P+KA LFF
Sbjct: 1 MLKRFKIFIYPD--GDP--NTYYQTPRKITGKYASEGYFFQNLRESKFVTKNPNKAHLFF 56
Query: 61 LPFSIARMRHD------------RRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYV 108
+P S +MR + I E + D + Y+ + KYPYWNRT GADHF+V
Sbjct: 57 IPISCHKMRGKVPYYLTSNWNKMQGISYEKMADIVQEYVEGLIVKYPYWNRTLGADHFFV 116
Query: 109 ACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKR 168
CH +G A K + N+I+VVCS SY I HKD+++PQ+ + P+ G+ R
Sbjct: 117 TCHDVGARATNKVANLVKNSIRVVCSPSYN-GDFIPHKDIAMPQVL-QPFALPRGGNDVR 174
Query: 169 NK--LAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRL-----KTPYADELLGSKFCLHVK 221
N+ L F+AG NS +R L ++W D + + R+ + Y + SKFC+
Sbjct: 175 NRTILGFWAGHRNSKIRVVLAKLWEEDDVLAISNNRISRATGELVYQKQFYRSKFCICPG 234
Query: 222 GFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISS 281
G +VN+ARI DS++YGCVPVI+++HYDLPF DVL+WK F++++ D+ LK L+ +S
Sbjct: 235 GSQVNSARIVDSIHYGCVPVILSDHYDLPFNDVLDWKRFALLLRERDVGDLKLKLQSVSK 294
Query: 282 EEYLLLQSYVLK-VRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
E+YL L +++ V+ F+WH P YDAF+MV+YELWLR +V+
Sbjct: 295 EQYLSLHRGLVEVVQDRFEWHTPPRPYDAFHMVVYELWLRHFTVQ 339
>gi|302824287|ref|XP_002993788.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300138384|gb|EFJ05154.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 353
Score = 249 bits (635), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 139/335 (41%), Positives = 197/335 (58%), Gaps = 33/335 (9%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M + +V+VYP DP L G YASE YF + M+S FVT DP +A LFF
Sbjct: 41 MEQQLKVFVYP----DPVVYTKLA------GKYASEGYFFRNLMESRFVTTDPEEAQLFF 90
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
+P S AR++ + + + I D ++ ++ ++ K+PYWNRT GADHF+V CH IG A K
Sbjct: 91 VPISCARLK-EEGLDHDEISDNVASFVESVIAKFPYWNRTMGADHFFVTCHEIGTRATSK 149
Query: 121 AWEVKLNAIQVVCSSSYFISG-HIAHKDVSLPQIW-----PRQEDPPKLGSSKRNKLAFF 174
+ N+I+VVC+SSY SG I HKDV+LPQI PR D + KR L F+
Sbjct: 150 VAHLVKNSIRVVCASSY--SGPFIPHKDVALPQILQPFPSPRGGD----DTEKRETLGFW 203
Query: 175 AGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSL 234
AG NS R L + W+ DS++ + + + SKFC+ G V+TARI +S+
Sbjct: 204 AGPANSKTRILLTKTWQEDSDMVISTKHVGM---QQFYRSKFCICPSGTRVSTARIVESI 260
Query: 235 YYGCVP----VIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSY 290
++GCVP +I+++HYDLPF DVL+W+ F++++ D LK L+ Y L
Sbjct: 261 HFGCVPGKSYLILSDHYDLPFNDVLDWRKFAVILPEQDAGTLKDALE---LAPYATLHRN 317
Query: 291 VLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
+L+ + HF+WH P YD F+MVMYELWLR S++R
Sbjct: 318 LLQAQAHFEWHSPPIKYDTFHMVMYELWLRHSTIR 352
>gi|302790752|ref|XP_002977143.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300155119|gb|EFJ21752.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 342
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 136/345 (39%), Positives = 198/345 (57%), Gaps = 26/345 (7%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M + F++++YP DP N + G YASE YF + +S FVTK+P+KA LFF
Sbjct: 1 MLKRFKIFIYPD--GDP--NTYYQTPRKITGKYASEGYFFQNLRESKFVTKNPNKAHLFF 56
Query: 61 LPFSIARMRHD------------RRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYV 108
+P S +MR + I E + D + Y+ + KYPYWNRT GADHF+V
Sbjct: 57 IPISCHKMRGKVPYYLTSNWNKMQGISYEKMADIVQEYVEGLIVKYPYWNRTLGADHFFV 116
Query: 109 ACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKR 168
CH +G A K + N+I+VVCS SY I HKD+++PQ+ + P+ G+ R
Sbjct: 117 TCHDVGARATNKVANLVKNSIRVVCSPSYN-GDFIPHKDIAMPQVL-QPFALPRGGNDVR 174
Query: 169 NK--LAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRL-----KTPYADELLGSKFCLHVK 221
N+ L F+AG NS +R L ++W D + + R+ + Y + SKFC+
Sbjct: 175 NRTILGFWAGHRNSKIRVVLAKLWEEDDVLAISNNRISRATGELVYQKQFYRSKFCICPG 234
Query: 222 GFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISS 281
G +VN+ARI DS++YGCVPVI+++HYDLPF DVL+WK F++++ D+ LK
Sbjct: 235 GSQVNSARIVDSIHYGCVPVILSDHYDLPFNDVLDWKRFALLLRERDVGDLKLFFFSFFL 294
Query: 282 EEY-LLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
+L+ S +V+ F+WH P YDAF+MV+YELWLR +V+
Sbjct: 295 FSSRVLIASLFRQVQDRFEWHTPPRPYDAFHMVVYELWLRHFTVQ 339
>gi|224093268|ref|XP_002309859.1| predicted protein [Populus trichocarpa]
gi|222852762|gb|EEE90309.1| predicted protein [Populus trichocarpa]
Length = 420
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 139/354 (39%), Positives = 197/354 (55%), Gaps = 45/354 (12%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGN----YASESYFKKAFMKSHFVTKDPSKA 56
M F+V+VYP R DP ++PR YASE YF K + S F T DP+ A
Sbjct: 82 MENDFKVFVYPGR--DPTTC------YDPRDKLKRKYASEHYFLKNLIPSSFFTDDPTVA 133
Query: 57 DLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRS 116
LF +P S ++ G D I +Y+ ++ YPYWNRT GADHFY +CH I
Sbjct: 134 HLFLIPLSC------KKTGGREEKD-IENYVKSLISSYPYWNRTLGADHFYFSCHGIDSG 186
Query: 117 AMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQ-----------IWPR---QEDP-- 160
+++ + N I++VCS SY S +I HKD+SLPQ +W R P
Sbjct: 187 TIEEVPLLMKNVIRLVCSPSY-DSKYIPHKDISLPQTLELSLHDGDDVWSRSTVMSRPLM 245
Query: 161 --PKLGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAH------SGRLKTPYADELL 212
P++ +R KL F+AG++NS VR+ L ++ E H + Y +EL
Sbjct: 246 IYPEMMLPRRTKLGFWAGSLNSDVRKNLQVFYKGAPEFNFHFFDKMKKAAILDAYENELY 305
Query: 213 GSKFCLHVKGFE-VNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPL 271
GSKFC+ +G V + + +S+ +GCVPVI+ ++YD PF DVL+W +FS+++ +P
Sbjct: 306 GSKFCICPRGNNHVGSVCLTESMTFGCVPVILHDYYDFPFNDVLDWNNFSVILKEEHVPD 365
Query: 272 LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
L+KILKGI E Y + +L+VRKHFQW+ P YD F M+MYELWLRR ++
Sbjct: 366 LEKILKGIPEENYKKMHQNLLQVRKHFQWNSLPVKYDLFRMIMYELWLRRHIIK 419
>gi|413947133|gb|AFW79782.1| putative homeodomain-like transcription factor superfamily protein
[Zea mays]
Length = 544
Score = 233 bits (594), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 138/341 (40%), Positives = 196/341 (57%), Gaps = 30/341 (8%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAF-MKSHFVTKDPSKADLF 59
M R +V++Y H P + E +G YASE +F K +FV +DP+ A LF
Sbjct: 209 MERLLKVFIY-HDGAKPIFH-----SPELKGIYASEGWFMKLMETNQNFVVRDPNTAHLF 262
Query: 60 FLPFSIARMRHDRRI----GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
+LP+S ++ H+ + E + F+ +YI IS KYPYWNRT GADHF+VACH G
Sbjct: 263 YLPYSSRQLEHNLYVPGSNTIEPLSIFVKNYIDLISAKYPYWNRTKGADHFFVACHDWGP 322
Query: 116 SAMQKAWEVKLNAIQVVC----SSSYFISGHIAHKDVSLPQIWPRQEDPP--KLG---SS 166
+ E++ N I+ +C S +FI G KDVSLP+ + R P +G ++
Sbjct: 323 YTTKLHDELRKNTIKALCNADLSEGFFIRG----KDVSLPETFLRSPRRPLRDIGGRPAA 378
Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRN---DSEIYA---HSGRLKTPYADELLGSKFCLHV 220
+R LAFFAG ++ VR LL+ W N D IY+ H K Y + SK+C+
Sbjct: 379 QRTILAFFAGQMHGRVRPVLLKYWGNKDDDMRIYSRLPHRITRKRNYVQHMKSSKYCICP 438
Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGIS 280
G+EVN+ RI +++YY CVPVIIA+H+ LPF LNW +FS+VV D+P LK+IL I
Sbjct: 439 MGYEVNSPRIVEAIYYECVPVIIADHFVLPFDAALNWSTFSVVVPESDVPKLKQILLAIP 498
Query: 281 SEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
Y+ LQ+ V +V+KHF WH P YD F+M+++ +W R
Sbjct: 499 ESRYITLQANVKRVQKHFMWHPNPVKYDIFHMILHSVWFSR 539
>gi|115434052|ref|NP_001041784.1| Os01g0107700 [Oryza sativa Japonica Group]
gi|52076197|dbj|BAD44851.1| pectin-glucuronyltransferase -like [Oryza sativa Japonica Group]
gi|52076236|dbj|BAD44890.1| pectin-glucuronyltransferase -like [Oryza sativa Japonica Group]
gi|113531315|dbj|BAF03698.1| Os01g0107700 [Oryza sativa Japonica Group]
gi|215697737|dbj|BAG91731.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222617591|gb|EEE53723.1| hypothetical protein OsJ_00064 [Oryza sativa Japonica Group]
Length = 550
Score = 233 bits (594), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 133/337 (39%), Positives = 195/337 (57%), Gaps = 22/337 (6%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFM-KSHFVTKDPSKADLF 59
M R +V+VY H P + E +G YASE +F K HFV +DP++A LF
Sbjct: 213 MERLLKVFVY-HDGAKPIFH-----SPELKGIYASEGWFMKLMEGNQHFVVRDPNRAHLF 266
Query: 60 FLPFSIARMRHDRRI----GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
+LP+S ++ H+ + E + F+ YI IS K+PYWNRT GADHF+VACH G
Sbjct: 267 YLPYSSRQLEHNLYVPGSNTIEPLSIFVKKYIDFISTKFPYWNRTKGADHFFVACHDWGP 326
Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLG-----SSKRNK 170
+ E++ N I+ +C++ I +DVSLP+ + R P G +++R+
Sbjct: 327 YTTKLHDELRKNTIKALCNADLSEGVFIHGRDVSLPETFLRSPRRPLRGIGGKPAAERSI 386
Query: 171 LAFFAGAVNSPVREKLLQVWRN---DSEIY---AHSGRLKTPYADELLGSKFCLHVKGFE 224
LAFFAG ++ VR LLQ W D IY H + Y + SK+C+ G+E
Sbjct: 387 LAFFAGQMHGRVRPVLLQYWGGKDADMRIYDRLPHRITRRMNYIQHMKSSKYCICPMGYE 446
Query: 225 VNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEY 284
VN+ RI +++YY CVPVIIA+++ LPF D LNW +FS+V+ D+P LK+IL I ++Y
Sbjct: 447 VNSPRIVEAIYYECVPVIIADNFVLPFDDALNWSAFSVVIPEKDVPKLKQILLAIPDDQY 506
Query: 285 LLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
+ +QS V +V+KHF WH P YD F+M+++ +W R
Sbjct: 507 MAMQSNVQRVQKHFIWHPNPIKYDIFHMILHSIWYSR 543
>gi|224093262|ref|XP_002309856.1| predicted protein [Populus trichocarpa]
gi|222852759|gb|EEE90306.1| predicted protein [Populus trichocarpa]
Length = 326
Score = 232 bits (592), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 128/329 (38%), Positives = 191/329 (58%), Gaps = 19/329 (5%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGN--YASESYFKKAFMKSHFVTKDPSKADL 58
M R F+V+VY R N+ D + N Y SE YF S F+T D ++A L
Sbjct: 1 MEREFKVFVYQDR------NITKHCDLPSKHNSRYESEEYFFSNLKMSPFLTDDAAEAHL 54
Query: 59 FFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAM 118
FF+P +M R I + ++ ++ KYPYWNRT GADHF+V C I +A
Sbjct: 55 FFIPIFSQKMTKKRSEDERAIA--VEDFVKSLISKYPYWNRTLGADHFFVTCADINVTAT 112
Query: 119 QKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQ-IWPRQEDPPKLGSSKRNKLAFFAGA 177
+ + N+I+V+C+ SY ++ HKDVSLPQ + P P + R LAF+ G
Sbjct: 113 ARIANLMKNSIKVMCTPSY-NDEYVPHKDVSLPQRVPPLALTPAGNNITNRITLAFWRGL 171
Query: 178 VNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELL------GSKFCLHVKGFEVNTARIA 231
NS +R+KLL+ W ND E++ GR + +L+ SK+C+ G E++ IA
Sbjct: 172 NNSDIRQKLLEAWENDLELFIQKGRKPSLEQGDLVHHEAFNNSKYCICPGGPELDRT-IA 230
Query: 232 DSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYV 291
+++YGCVPVI++++YDLPF D+L+W+ FSI++ + L++ LK + EY +Q+
Sbjct: 231 LAIHYGCVPVIMSDYYDLPFKDILDWRKFSIILEESQVYYLREHLKEMLEHEYRAMQTNT 290
Query: 292 LKVRKHFQWHVFPSDYDAFYMVMYELWLR 320
+ VRKHFQW++ P+ YDAF+M MY+LWLR
Sbjct: 291 VMVRKHFQWNLVPAKYDAFHMTMYDLWLR 319
>gi|102139904|gb|ABF70053.1| exostosin family protein [Musa acuminata]
Length = 585
Score = 231 bits (588), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 124/307 (40%), Positives = 184/307 (59%), Gaps = 16/307 (5%)
Query: 31 GNYASESYFKKAFMK-SHFVTKDPSKADLFFLPFSIARMRHDRRIG----TEGIPDFISH 85
G YASE +F K + + FV KDP+KA LF+LP+S ++R + + + F+
Sbjct: 273 GIYASEGWFMKLMEENTQFVVKDPNKAHLFYLPYSSRQLRTHLYVAGSRSMQPLSIFLRD 332
Query: 86 YIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH 145
Y+ +I+ KYP+WNRT GADHF VACH +++ N I+VVC++ +
Sbjct: 333 YVNSIAAKYPFWNRTRGADHFLVACHDWATYTTNLHEDLRKNTIKVVCNADVSEGVFVRG 392
Query: 146 KDVSLPQIWPRQEDPPKLG-----SSKRNKLAFFAGAVNSPVREKLLQVWRNDS------ 194
KDVSL + + R + P+ +S+R+ LAFFAG ++ VR LL+ WR
Sbjct: 393 KDVSLAETYVRTPNSPRKAIGGRPASRRSILAFFAGQMHGRVRPILLRHWRGRDRDMRIY 452
Query: 195 EIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
E+ K Y + + SKFC+ G+EVN+ RI +++YY CVPVIIAN++ LPF +V
Sbjct: 453 EVLPDEIAAKMSYIEHMKSSKFCICPMGYEVNSPRIVEAIYYDCVPVIIANNFVLPFEEV 512
Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
L+W +FS+VVA DIP LK+IL GIS Y+ +Q+ V ++RKHF W+ P YD F+M++
Sbjct: 513 LDWGAFSVVVAEKDIPKLKQILLGISGRRYVRMQTNVRRLRKHFLWNDKPVKYDLFHMIL 572
Query: 315 YELWLRR 321
+ +W R
Sbjct: 573 HSIWFNR 579
>gi|102140005|gb|ABF70140.1| exostosin family protein [Musa balbisiana]
Length = 533
Score = 230 bits (587), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 135/339 (39%), Positives = 195/339 (57%), Gaps = 26/339 (7%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPR--GNYASESYFKKAFMK-SHFVTKDPSKAD 57
M R +VY+Y + P P+ P G YASE +F K + + FV DP+KA
Sbjct: 197 MERILKVYIY---EDGP-----KPLCHTPHLDGIYASEGWFMKLMEENTQFVVNDPNKAH 248
Query: 58 LFFLPFSIARMR-HDRRIGTEGIPD---FISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
LF+LP+S ++R H G+ + F+ Y+ +IS KYP+WNRT GADHF VACH
Sbjct: 249 LFYLPYSSRQLRTHLYVAGSRSMRPLSIFLRDYVNSISAKYPFWNRTRGADHFLVACHDW 308
Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLG-----SSKR 168
E++ N I+VVC++ + KDVSL + + R + P+ +S+R
Sbjct: 309 ATYTTNLHEELRKNTIKVVCNADVSEGVFVRGKDVSLAETYVRTPNSPRKAIGGRPASRR 368
Query: 169 NKLAFFAGAVNSPVREKLLQVWRNDS------EIYAHSGRLKTPYADELLGSKFCLHVKG 222
+ LAFFAG ++ VR LL+ WR E+ K Y + + SKFC+ G
Sbjct: 369 SILAFFAGQMHGRVRPILLRYWRGRDRDMRIYEVLPDEIAAKMSYIEHMKSSKFCICPMG 428
Query: 223 FEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSE 282
+EVN+ RI +++YY CVPVIIAN++ LPF +VL+W +FS+VVA DIP LK+IL GIS
Sbjct: 429 YEVNSPRIVEAIYYDCVPVIIANNFVLPFEEVLDWGAFSVVVAEKDIPKLKQILLGISGR 488
Query: 283 EYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
Y+ +Q V ++RKHF W+ P YD F+M+++ +W R
Sbjct: 489 RYVRMQRNVRRLRKHFLWNDKPVKYDLFHMILHSIWFNR 527
>gi|218187361|gb|EEC69788.1| hypothetical protein OsI_00072 [Oryza sativa Indica Group]
Length = 536
Score = 230 bits (587), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 132/337 (39%), Positives = 194/337 (57%), Gaps = 22/337 (6%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFM-KSHFVTKDPSKADLF 59
M R +V+VY H P + E +G YASE +F K HFV +DP++A LF
Sbjct: 199 MERLLKVFVY-HDGAKPIFH-----SPELKGIYASEGWFMKLMEGNQHFVVRDPNRAHLF 252
Query: 60 FLPFSIARMRHDRRI----GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
+LP+S ++ H+ + E + F+ YI IS K+PYWNRT GADHF+VACH G
Sbjct: 253 YLPYSSRQLEHNLYVPGSNTIEPLSIFVKKYIDFISTKFPYWNRTKGADHFFVACHDWGP 312
Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLG-----SSKRNK 170
+ E++ N I+ +C++ I +DVSLP+ + R P G +++R+
Sbjct: 313 YTTKLHDELRKNTIKALCNADLSEGVFIHGRDVSLPETFLRSPRRPLRGIGGKPAAERSI 372
Query: 171 LAFFAGAVNSPVREKLLQVWRN---DSEIY---AHSGRLKTPYADELLGSKFCLHVKGFE 224
LAFFAG ++ VR LLQ W D IY H + Y + SK+C+ G+E
Sbjct: 373 LAFFAGQMHGRVRPVLLQYWGGKDADMRIYDRLPHRITRRMNYIQHMKSSKYCICPMGYE 432
Query: 225 VNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEY 284
VN+ RI +++YY CVPVIIA+++ LPF D NW +FS+V+ D+P LK+IL I ++Y
Sbjct: 433 VNSPRIVEAIYYECVPVIIADNFVLPFDDAFNWSAFSVVILEKDVPKLKQILLEIPDDQY 492
Query: 285 LLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
+ +QS V +V+KHF WH P YD F+M+++ +W R
Sbjct: 493 MAMQSNVQRVQKHFIWHPNPIKYDIFHMILHSIWYSR 529
>gi|224068931|ref|XP_002326234.1| predicted protein [Populus trichocarpa]
gi|222833427|gb|EEE71904.1| predicted protein [Populus trichocarpa]
Length = 328
Score = 229 bits (584), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 139/336 (41%), Positives = 194/336 (57%), Gaps = 20/336 (5%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M + F+V+VY F N + GN+ + + HF TKDP KA ++F
Sbjct: 1 MEKQFKVFVYGEGEPPVFHNGPCRSIYSMEGNF-----IHRMEIDGHFRTKDPDKAHVYF 55
Query: 61 LPFSIARM------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIG 114
LPFS+A M R R G I +S YI IS KYP+WNR+ GADHF +ACH G
Sbjct: 56 LPFSVAMMVRFVYERESRDFGP--IRRTVSDYINLISGKYPFWNRSLGADHFMLACHDWG 113
Query: 115 RSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPK----LGSSKRNK 170
A + +I+ +C+++ + KDVSLP+I R L SKR+
Sbjct: 114 PEASFSVPHLGKISIRALCNANTSEKFNPI-KDVSLPEINLRTGSIKGFVGGLSPSKRSI 172
Query: 171 LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLK-TPYADELLGSKFCLHVKGFEVNTA 228
LAFFAG ++ P+R +L+ W N D +I H K Y + + GSKFCL G+EV +
Sbjct: 173 LAFFAGRLHGPIRPVVLEHWENKDDDIKVHQQLPKGVSYYEMMRGSKFCLCPSGYEVASP 232
Query: 229 RIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
RI ++LY GCVPV+I++HY PF+DVLNWKSFS+ V DIP LKKIL IS +Y+ +Q
Sbjct: 233 RIVEALYAGCVPVLISDHYVPPFSDVLNWKSFSVEVPVSDIPSLKKILTSISPRQYIRMQ 292
Query: 289 SYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSV 324
VL+VR+HF+ + P +D F+M+++ +WLRR +V
Sbjct: 293 RRVLQVRRHFEVNSPPKRFDVFHMILHSIWLRRLNV 328
>gi|357125316|ref|XP_003564340.1| PREDICTED: probable glycosyltransferase At5g03795-like
[Brachypodium distachyon]
Length = 553
Score = 229 bits (584), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 132/337 (39%), Positives = 195/337 (57%), Gaps = 22/337 (6%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFM-KSHFVTKDPSKADLF 59
M R +V++Y H P + E +G YASE +F K +FV +DP++A LF
Sbjct: 216 MERLLKVFIY-HDGAKPIFH-----SPELKGIYASEGWFMKLIEGNQNFVVRDPNRAHLF 269
Query: 60 FLPFSIARMRHDRRI----GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
+LP+S ++ H+ + E + F+ +YI IS K+PYWNRT GADHF+VACH G
Sbjct: 270 YLPYSSRQLEHNLYVPGSNTLEPLSIFVKNYIDMISAKFPYWNRTKGADHFFVACHDWGP 329
Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLG---SSKRNK 170
+ E++ N I+ +C++ I +DVSLP+ + R P +G +++R+
Sbjct: 330 YTTKLHDELRRNTIKALCNADLSEGVFIRGRDVSLPETFVRSPRRPLRDIGGKPATERSI 389
Query: 171 LAFFAGAVNSPVREKLLQVW---RNDSEIYAHSGRLKT---PYADELLGSKFCLHVKGFE 224
LAFFAG ++ VR LLQ W D IY R T Y + SK+C+ G+E
Sbjct: 390 LAFFAGQMHGRVRPILLQYWGGKDTDMRIYGRLPRRITRRMNYVQHMKSSKYCICPMGYE 449
Query: 225 VNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEY 284
VN+ RI +++YY C+PVIIA+++ LPF D L+W +FS+VV D+P LK+IL I Y
Sbjct: 450 VNSPRIVEAIYYECIPVIIADNFVLPFDDALDWSTFSVVVPEKDVPRLKEILLRIPESRY 509
Query: 285 LLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
+ +QS V KV+KHF WH P YD F+M+++ +W R
Sbjct: 510 ITMQSNVKKVQKHFLWHAKPVKYDIFHMILHSVWFSR 546
>gi|242056743|ref|XP_002457517.1| hypothetical protein SORBIDRAFT_03g008610 [Sorghum bicolor]
gi|241929492|gb|EES02637.1| hypothetical protein SORBIDRAFT_03g008610 [Sorghum bicolor]
Length = 521
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 133/337 (39%), Positives = 195/337 (57%), Gaps = 22/337 (6%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAF-MKSHFVTKDPSKADLF 59
M R +V++Y H P + E +G YASE +F + +FV +DP++A LF
Sbjct: 186 MERLLKVFIY-HDGAKPIFH-----SPELKGIYASEGWFMRLMETNQNFVVRDPNRAHLF 239
Query: 60 FLPFSIARMRHDRRI----GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
+LP+S ++ H+ + E + F+ +YI IS KYPYWNRT GADHF+VACH G
Sbjct: 240 YLPYSSRQLEHNLYVPGSNSIEPLSIFVKNYIDLISAKYPYWNRTKGADHFFVACHDWGP 299
Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLG---SSKRNK 170
+ E++ N I+ +C++ I KDVSLP+ + R P +G +++R
Sbjct: 300 YTTKLHDELRKNTIKALCNADLSEGIFIRGKDVSLPETFLRSPRRPLRDIGGRPAAQRTI 359
Query: 171 LAFFAGAVNSPVREKLLQVWRN---DSEIYA---HSGRLKTPYADELLGSKFCLHVKGFE 224
LAFFAG ++ VR LL+ W + D IY+ H K Y + SK+C+ G+E
Sbjct: 360 LAFFAGQMHGRVRPVLLKYWGDKDADMRIYSRLPHRITRKRNYVQHMKSSKYCICPMGYE 419
Query: 225 VNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEY 284
VN+ RI +++YY CVPVIIA+++ LPF LNW +FS+VV D+P LK+IL I Y
Sbjct: 420 VNSPRIVEAIYYECVPVIIADNFVLPFDAALNWSAFSVVVPESDVPKLKEILLAIPESRY 479
Query: 285 LLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
+ LQS V +V+KHF WH P YD F+M+++ +W R
Sbjct: 480 ITLQSNVKRVQKHFLWHPNPVKYDIFHMILHSVWFSR 516
>gi|30683790|ref|NP_567512.2| Exostosin family protein [Arabidopsis thaliana]
gi|19347795|gb|AAL86348.1| unknown protein [Arabidopsis thaliana]
gi|26983908|gb|AAN86206.1| unknown protein [Arabidopsis thaliana]
gi|332658395|gb|AEE83795.1| Exostosin family protein [Arabidopsis thaliana]
Length = 542
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 140/353 (39%), Positives = 206/353 (58%), Gaps = 32/353 (9%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEP--RGNYASESYFKKAFMKSH--FVTKDPSKA 56
M +VY+YP P+ EP G YASE +F K M+S+ FVTK+P +A
Sbjct: 196 MELILKVYIYPDGDK--------PIFHEPHLNGIYASEGWFMK-LMESNKQFVTKNPERA 246
Query: 57 DLFFLPFSIARMRHDRRI-GTEGIPD---FISHYIFNISQKYPYWNRTGGADHFYVACHS 112
LF++P+S+ +++ + G+ I F+ Y+ +S KYP+WNRT G+DHF VACH
Sbjct: 247 HLFYMPYSVKQLQKSIFVPGSHNIKPLSIFLRDYVNMLSIKYPFWNRTHGSDHFLVACHD 306
Query: 113 IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLGS----S 166
G + + E+K NAI+ +C++ + KDVSLP+ R P +G+ S
Sbjct: 307 WGPYTVNEHPELKRNAIKALCNADLSDGIFVPGKDVSLPETSIRNAGRPLRNIGNGNRVS 366
Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRN---DSEIYA---HSGRLKTPYADELLGSKFCLHV 220
+R LAFFAG ++ VR KLL+ WRN D +IY H+ K Y + SK+CL
Sbjct: 367 QRPILAFFAGNLHGRVRPKLLKHWRNKDEDMKIYGPLPHNVARKMTYVQHMKSSKYCLCP 426
Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGIS 280
G+EVN+ RI +++YY CVPV+IA+++ LPF+DVL+W +FS+VV +IP LK+IL I
Sbjct: 427 MGYEVNSPRIVEAIYYECVPVVIADNFMLPFSDVLDWSAFSVVVPEKEIPRLKEILLEIP 486
Query: 281 SEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQWSTSLD 333
YL +QS V V++HF W P YD F+M+++ +W ++ Q TS D
Sbjct: 487 MRRYLKMQSNVKMVQRHFLWSPKPRKYDVFHMILHSIWF---NLLNQNQTSFD 536
>gi|302768010|ref|XP_002967425.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300165416|gb|EFJ32024.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 372
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 132/337 (39%), Positives = 197/337 (58%), Gaps = 23/337 (6%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYF-KKAFMKSHFVTKDPSKADLF 59
M R F+VYVY + +P P+ G Y+SE F + S FVT DP +A +F
Sbjct: 28 MERLFKVYVY--KEGEPRLVHKGPLT----GIYSSEGRFIHEMNQNSRFVTHDPQEAHMF 81
Query: 60 FLPFSIARMRHDRRI-GTEG---IPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
FLP+S+A M D + G+ + FI Y+ I+ K+P+WN T G+DHF+ +CH G
Sbjct: 82 FLPYSVAHMVLDLYVPGSHSMLPLATFIKDYVNLIASKHPFWNLTRGSDHFFASCHDWGP 141
Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQ-EDPPKLGS---SKRNKL 171
+ + E++ N+++VVC+S + KD SLP+ + + P KLG SKR L
Sbjct: 142 ATARDHPELRKNSVKVVCNSD-LTEEFVPDKDASLPETYLHAVKLPTKLGGPGPSKRPIL 200
Query: 172 AFFAGAVNSPVREKLLQVWRNDS-------EIYAHSGRLKTPYADELLGSKFCLHVKGFE 224
AFFAG ++ VR L++ W++ E+ +T Y + SKFC+ GFE
Sbjct: 201 AFFAGQMHGRVRPALIKHWKDRGDPDMRIYEVLPPDVARRTSYVQHMKSSKFCICAMGFE 260
Query: 225 VNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEY 284
VN+ RI +S+YY CVPV+IA+++ LPF+DVLNW SFS+ V+ D+P LK++L +S + Y
Sbjct: 261 VNSPRIVESIYYDCVPVLIADNFVLPFSDVLNWGSFSLTVSEKDVPRLKELLLAVSEDRY 320
Query: 285 LLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
+QS + KVRKHF WH +D F+M+++ +W RR
Sbjct: 321 RKMQSRLKKVRKHFLWHDSAERFDMFHMILHSVWTRR 357
>gi|302753728|ref|XP_002960288.1| glycosylransferase-like protein [Selaginella moellendorffii]
gi|300171227|gb|EFJ37827.1| glycosylransferase-like protein [Selaginella moellendorffii]
Length = 372
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 132/337 (39%), Positives = 197/337 (58%), Gaps = 23/337 (6%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYF-KKAFMKSHFVTKDPSKADLF 59
M R F+VYVY + +P P+ G Y+SE F + S FVT DP +A +F
Sbjct: 28 MERLFKVYVY--KEGEPRLVHKGPLT----GIYSSEGRFIHEMNQNSRFVTHDPQEAHMF 81
Query: 60 FLPFSIARMRHDRRI-GTEG---IPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
FLP+S+A M D + G+ + FI Y+ I+ K+P+WN T G+DHF+ +CH G
Sbjct: 82 FLPYSVAHMVLDLYVPGSHTMLPLATFIKDYVNLIASKHPFWNLTRGSDHFFTSCHDWGP 141
Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQ-EDPPKLGS---SKRNKL 171
+ + E++ N+++VVC+S + KD SLP+ + + P KLG SKR L
Sbjct: 142 ATARDHPELRKNSVKVVCNSD-LTEEFVPDKDASLPETYLHAVKLPTKLGGPGPSKRPIL 200
Query: 172 AFFAGAVNSPVREKLLQVWRNDS-------EIYAHSGRLKTPYADELLGSKFCLHVKGFE 224
AFFAG ++ VR L++ W++ E+ +T Y + SKFC+ GFE
Sbjct: 201 AFFAGQMHGRVRPALIKHWKDRGDPDMRIYEVLPPEVARRTSYVQHMKSSKFCICAMGFE 260
Query: 225 VNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEY 284
VN+ RI +S+YY CVPV+IA+++ LPF+DVLNW SFS+ V+ D+P LK++L +S + Y
Sbjct: 261 VNSPRIVESIYYDCVPVLIADNFVLPFSDVLNWGSFSLTVSEKDVPRLKELLLAVSEDRY 320
Query: 285 LLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
+QS + KVRKHF WH +D F+M+++ +W RR
Sbjct: 321 RKMQSRLKKVRKHFLWHDSAERFDMFHMILHSVWTRR 357
>gi|224133768|ref|XP_002321656.1| predicted protein [Populus trichocarpa]
gi|222868652|gb|EEF05783.1| predicted protein [Populus trichocarpa]
Length = 407
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 135/355 (38%), Positives = 199/355 (56%), Gaps = 39/355 (10%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M + +V+VYP +P +D + + NYASE YF F+T++P +A LFF
Sbjct: 61 MEKDLKVFVYP--GGNP-KTCYHSIDKKLKSNYASEHYFFMNLRNGSFLTENPDEAHLFF 117
Query: 61 LPFSIARMRHDRRIGTEGIPDF----ISHYIFNISQKYPYWNRTGGADHFYVACHSIGRS 116
+P S M + +P + I +Y+ ++ KYPYWNRT GADHF+V+CH IG
Sbjct: 118 IPLSCQPME-----DQDALPRYKEMVIQNYVRALTIKYPYWNRTLGADHFFVSCHGIGNR 172
Query: 117 AMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI----WPRQED------------- 159
A + NAI++VCS SY S +I HKDVSLPQI +P + D
Sbjct: 173 ATAAFPFLLKNAIRLVCSPSY-DSNYIPHKDVSLPQILELSFPPEGDGMWNDSTMESLPI 231
Query: 160 --PPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAH------SGRLKTPYADEL 211
P R KL F+AG+ NS VR+ L ++ E H + + E+
Sbjct: 232 QLSPVETHPSRTKLCFWAGSPNSEVRKNLRVHYKGLEEFEIHFVENVKRALVLDTFQKEI 291
Query: 212 LGSKFCLHVKG-FEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIP 270
SKFC+ +G +V +A+S+ +GCVPVI++++YDLPF D+L+W +FS+++ D+P
Sbjct: 292 HRSKFCICPRGKTQVGGVCLAESMAFGCVPVIMSDYYDLPFNDILDWNAFSVILKEHDVP 351
Query: 271 LLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
++ +ILKGI + + ++ VLKV K+F+WH P YD F+MVMYELW RR +R
Sbjct: 352 IMGEILKGIPEDMFEKMRQNVLKVSKYFKWHFRPVKYDEFHMVMYELWKRRHIIR 406
>gi|297804498|ref|XP_002870133.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297315969|gb|EFH46392.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 540
Score = 228 bits (580), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 136/352 (38%), Positives = 202/352 (57%), Gaps = 30/352 (8%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEP--RGNYASESYFKKAF-MKSHFVTKDPSKAD 57
M +VY+YP P+ +P G YASE +F K + FVTK+P +A
Sbjct: 194 MELILKVYIYPDGEK--------PIFHQPHLNGIYASEGWFMKLMESNTQFVTKNPERAH 245
Query: 58 LFFLPFSIARMRHDRRI-GTEGIPD---FISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
LF++P+S+ +++ + G+ I F+ Y+ +S KYP+WNRT G+DHF VACH
Sbjct: 246 LFYMPYSVKQLQTSIFVPGSHNIKPLSIFLRDYVNMLSTKYPFWNRTHGSDHFLVACHDW 305
Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLGS----SK 167
G + + E++ N I+ +C++ I KDVSLP+ R P +G+ S+
Sbjct: 306 GPYTVNEHPELRRNTIKALCNADLADGIFIPGKDVSLPETSIRNAGKPLRNIGNGNRVSQ 365
Query: 168 RNKLAFFAGAVNSPVREKLLQVWRN---DSEIYA---HSGRLKTPYADELLGSKFCLHVK 221
R LAFFAG ++ VR KLL+ WRN D +IY H+ K Y + SK+CL
Sbjct: 366 RPILAFFAGNLHGRVRPKLLKHWRNKDDDMKIYGPLPHNVARKMTYVQHMKSSKYCLCPM 425
Query: 222 GFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISS 281
G+EVN+ RI +++YY CVPV+IA+++ LPF+DVL+W +FS+VV +IP LK+IL I
Sbjct: 426 GYEVNSPRIVEAIYYECVPVVIADNFMLPFSDVLDWSAFSVVVPEKEIPRLKEILLEIPM 485
Query: 282 EEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQWSTSLD 333
YL +QS V V++HF W P YD F+M+++ +W ++ Q TS D
Sbjct: 486 RRYLKMQSNVKMVQRHFLWSPKPRKYDVFHMILHSIWF---NLLNQNQTSFD 534
>gi|356537024|ref|XP_003537031.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 472
Score = 228 bits (580), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 134/338 (39%), Positives = 191/338 (56%), Gaps = 20/338 (5%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M + F+V+VY F N + GN+ M HF TKDP KA +FF
Sbjct: 139 MEKQFKVFVYEEGETPVFHNGPCKSIYSMEGNF-----IHAIEMNDHFRTKDPKKAHVFF 193
Query: 61 LPFSIARM------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIG 114
LPFS+ M R R G I + YI I+ +Y YWNR+ GADHF +ACH G
Sbjct: 194 LPFSVVMMVRFVYQRDSRDFGP--IRKTVIDYINLIAARYSYWNRSLGADHFMLACHDWG 251
Query: 115 RSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKL----GSSKRNK 170
A + N+I+V+C+++ G KDVS P+I + +SKR+
Sbjct: 252 PEASLSLPYLHKNSIRVLCNANT-SEGFKPAKDVSFPEINLQTGSINGFIGGPSASKRSI 310
Query: 171 LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLK-TPYADELLGSKFCLHVKGFEVNTA 228
LAFFAG V+ P+R LL+ W N D +I H K Y D+L SKFCL G+EV +
Sbjct: 311 LAFFAGGVHGPIRPILLEHWENKDEDIQVHKYLPKGVSYYDKLRNSKFCLCPSGYEVASP 370
Query: 229 RIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
R+ +++Y GCVPV+I+ HY PF+DVLNWKSFS+ ++ DIP LK IL IS +Y+ +Q
Sbjct: 371 RVVEAIYTGCVPVLISEHYVPPFSDVLNWKSFSVELSVKDIPNLKDILMSISPRQYIRMQ 430
Query: 289 SYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
V+++++HF+ H P +D F+M+++ +WLRR + R+
Sbjct: 431 RRVIQIQRHFEVHSPPKRFDVFHMILHSVWLRRLNFRM 468
>gi|356546040|ref|XP_003541440.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 491
Score = 226 bits (577), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 134/339 (39%), Positives = 191/339 (56%), Gaps = 20/339 (5%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M + F+V+VY F N + GN+ M HF TKDP KA +FF
Sbjct: 158 MEKQFKVFVYEEGETPVFHNGPCKSIYSMEGNF-----IHAIEMNDHFRTKDPKKAHVFF 212
Query: 61 LPFSIARM------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIG 114
LPFS+ M R R G I + Y+ I+ +YPYWNR+ GADHF +ACH G
Sbjct: 213 LPFSVVMMVRFVYERDSRDFGP--IKKTVIDYVNLIATRYPYWNRSLGADHFMLACHDWG 270
Query: 115 RSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKL----GSSKRNK 170
A + N+I+V+C+++ G KDVS P+I + +SKR+
Sbjct: 271 PEASFSLPYLHKNSIRVLCNANT-SEGFKPAKDVSFPEINLQTGSINGFVGGPSASKRSI 329
Query: 171 LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLK-TPYADELLGSKFCLHVKGFEVNTA 228
LAFFAG V+ P+R LL+ W N D +I H K Y L SKFCL G+EV +
Sbjct: 330 LAFFAGGVHGPIRPILLEHWENKDEDIQVHKYLPKGVSYYGMLRKSKFCLCPSGYEVASP 389
Query: 229 RIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
R+ +++Y GCVPV+I+ HY PF+DVLNWKSFS+ ++ DIP+LK IL IS +++ +Q
Sbjct: 390 RVVEAIYTGCVPVLISEHYVPPFSDVLNWKSFSVELSVKDIPILKDILMSISPRQHIRMQ 449
Query: 289 SYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
V ++R+HF+ H P +D F+M+++ +WLRR + RV+
Sbjct: 450 RRVGQIRRHFEVHSPPKRFDVFHMILHSVWLRRLNFRVR 488
>gi|356570806|ref|XP_003553575.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 537
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 134/339 (39%), Positives = 192/339 (56%), Gaps = 22/339 (6%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAF-MKSHFVTKDPSKADLF 59
M + F+V+VY F N + Y+ E F A M F T+DP +A +F
Sbjct: 204 MEKQFKVFVYEEGEPPVFHNGPC------KSIYSMEGNFIHAIEMNDQFRTRDPEEAHVF 257
Query: 60 FLPFSIARM------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
FLPFS+A + R G I ++ Y+ I +YPYWNR+ GADHFY+ACH
Sbjct: 258 FLPFSVAMLVQFVYVRDSHDFGP--IKKTVTDYVNVIGGRYPYWNRSLGADHFYLACHDW 315
Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKL----GSSKRN 169
G + + N+I+V+C+++ G KDVS P+I + +S+R
Sbjct: 316 GPETSRSIPNLNKNSIRVLCNANTS-EGFKPSKDVSFPEINLQTGSINGFIGGPSASRRP 374
Query: 170 KLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLK-TPYADELLGSKFCLHVKGFEVNT 227
LAFFAG ++ P+R LL+ W N D +I H K Y + L SKFCL G+EV +
Sbjct: 375 LLAFFAGGLHGPIRPVLLEHWENKDEDIQVHKYLPKGVSYYEMLRKSKFCLCPSGYEVAS 434
Query: 228 ARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLL 287
R+ +++Y GCVPV+I++HY PF DVLNWKSFS+ V+ DIP LK+IL IS +Y+ +
Sbjct: 435 PRVVEAIYTGCVPVLISDHYVPPFNDVLNWKSFSVEVSVKDIPRLKEILLSISPRQYIRM 494
Query: 288 QSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
Q V +VR+HF+ H P YD F+M+++ +WLRR + RV
Sbjct: 495 QRRVGQVRRHFEVHSPPKRYDVFHMILHSVWLRRLNFRV 533
>gi|227202794|dbj|BAH56870.1| AT4G38040 [Arabidopsis thaliana]
Length = 407
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 123/302 (40%), Positives = 178/302 (58%), Gaps = 12/302 (3%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M + F+VY+YP DP N + G YASE YF + +S F T DP +ADLFF
Sbjct: 100 MEKRFKVYIYPD--GDP--NTFYQTPRKVTGKYASEGYFFQNIRESRFRTLDPDEADLFF 155
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
+P S +MR + E + + +Y+ + KYPYWNRT GADHF+V CH +G A +
Sbjct: 156 IPISCHKMR-GKGTSYENMTVIVQNYVDGLIAKYPYWNRTLGADHFFVTCHDVGVRAFEG 214
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS-SKRNKLAFFAGAVN 179
+ + N I+VVCS SY + G I HKDV+LPQ+ P R L F+AG N
Sbjct: 215 SPLLIKNTIRVVCSPSYNV-GFIPHKDVALPQVLQPFALPAGGNDVENRTTLGFWAGHRN 273
Query: 180 SPVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCLHVKGFEVNTARIADSL 234
S +R L VW ND+E+ + R+ Y +KFC+ G +VN+ARI DS+
Sbjct: 274 SKIRVILAHVWENDTELDISNNRINRATGHLVYQKRFYRTKFCICPGGSQVNSARITDSI 333
Query: 235 YYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKV 294
+YGC+PVI++++YDLPF D+LNW+ F++V+ D+ LK+ILK I E++ L + ++KV
Sbjct: 334 HYGCIPVILSDYYDLPFNDILNWRKFAVVLREQDVYNLKQILKNIPHSEFVSLHNNLVKV 393
Query: 295 RK 296
++
Sbjct: 394 KR 395
>gi|168059393|ref|XP_001781687.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666856|gb|EDQ53500.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 376
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 129/339 (38%), Positives = 194/339 (57%), Gaps = 26/339 (7%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPR-GNYASESYF-KKAFMKSHFVTKDPSKADL 58
M + F+VY+Y R P V P+ G YASE F ++ S F+T DPS+A +
Sbjct: 29 MEKVFKVYIYKDGRK-PL------VHSGPQLGIYASEGQFIERMEAASEFLTDDPSRAHM 81
Query: 59 FFLPFSIARMRHDRRIGTE----GIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIG 114
FFLP+S+ RM + + FI Y+ ++++YPYWNRT GADHF+V+CH G
Sbjct: 82 FFLPYSVYRMVTHLYVPNSRSMLPLATFIKDYVEALARQYPYWNRTKGADHFFVSCHDWG 141
Query: 115 RSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW---PRQEDPPKLGS---SKR 168
+ + ++ NA++VVC++ + KD SLP+++ + + P KLG +R
Sbjct: 142 PATARDHPTLRSNAVKVVCNAD-LTEEFVVGKDASLPEVYMHKSKTKAPIKLGGPGYDER 200
Query: 169 NKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTP------YADELLGSKFCLHVKG 222
LAFFAG ++ VR LL W++ G L P Y + SK+C+ G
Sbjct: 201 PYLAFFAGQMHGRVRPILLDHWKDKDPDLMIYGVLPKPIAKQISYVQHMKMSKYCICAAG 260
Query: 223 FEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSE 282
+EVN+ RI +S++Y CVPVIIA+++ LPF+DVLNW +FS+ + DIP LK IL I +
Sbjct: 261 YEVNSPRIVESIHYDCVPVIIADNFVLPFSDVLNWDAFSVTMPESDIPKLKAILNDIPEK 320
Query: 283 EYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
Y +Q + K+R+HF WH P YD F+M+++ +W+ R
Sbjct: 321 TYRSMQIRLRKIRQHFVWHKKPEKYDVFHMILHSVWMSR 359
>gi|356533401|ref|XP_003535253.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 523
Score = 225 bits (573), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 139/345 (40%), Positives = 197/345 (57%), Gaps = 32/345 (9%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPV--DFEPRGNYASESYFKKAF-MKSHFVTKDPSKAD 57
M + F+V+VY LPV D Y++E F A M HF T+DP KA+
Sbjct: 182 MEKQFKVFVYEEGE--------LPVFHDGPCSSIYSTEGSFIHAIEMNEHFRTRDPKKAN 233
Query: 58 LFFLPFSIARM------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACH 111
+FFLPFSIA M R+ G I + Y+ I+ +YPYWNR+ GADHF ++CH
Sbjct: 234 VFFLPFSIAWMVRYVYIRNSYDFGP--IKRTVRDYVNVIATRYPYWNRSLGADHFMLSCH 291
Query: 112 SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI--WPRQED-----PPKLG 164
G + ++ N+I+V+C+++ G KD S P+I P +D PP
Sbjct: 292 DWGPETSKSIPYLRKNSIRVLCNANT-SEGFDPIKDASFPEINLQPGLKDSFVGGPP--- 347
Query: 165 SSKRNKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLK-TPYADELLGSKFCLHVKG 222
+SKR+ LAFFAG + P+R LL+ W N D +I H K Y L SKFCL G
Sbjct: 348 ASKRSILAFFAGGNHGPIRPILLEHWENKDEDIQVHKYLPKGVSYYGMLRNSKFCLCPSG 407
Query: 223 FEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSE 282
+EV + R+ +++Y GCVPV+I+ HY PF+DVLNWK FS+ V+ +IP LK IL IS
Sbjct: 408 YEVASPRVVEAIYTGCVPVLISEHYVPPFSDVLNWKMFSVNVSVKEIPNLKDILTSISPR 467
Query: 283 EYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
+Y+ +Q V ++R+HF+ H P YD F+M+++ +WLRR + RV
Sbjct: 468 QYIRMQKRVGQIRRHFEVHSPPKRYDVFHMILHSVWLRRLNFRVH 512
>gi|356548353|ref|XP_003542567.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 505
Score = 223 bits (569), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 138/341 (40%), Positives = 199/341 (58%), Gaps = 26/341 (7%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFE-PRGN-YASESYFKKAF-MKSHFVTKDPSKAD 57
M + F+V+VY LPV E P + Y++E F A M HF T+DP KA
Sbjct: 172 MEKQFKVFVYEEGE--------LPVFHEGPCASIYSTEGSFIHAIEMNEHFRTRDPKKAH 223
Query: 58 LFFLPFSIARM------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACH 111
+FFLPFS+ M R G I + YI I+ +YPYWNR+ GADHF ++CH
Sbjct: 224 VFFLPFSVVMMVRYVYIRDSHDFGP--IKRTVRDYINVIAARYPYWNRSLGADHFMLSCH 281
Query: 112 SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WPRQEDPPKLG---SSK 167
G A + + ++ N+I+V+C+++ G KDVS P+I R LG +S+
Sbjct: 282 DWGPEASKFSPYLRKNSIRVLCNANT-SEGFDPRKDVSFPEINLQRGPIDGLLGGPSASQ 340
Query: 168 RNKLAFFAGAVNSPVREKLLQVW-RNDSEIYAHSGRLK-TPYADELLGSKFCLHVKGFEV 225
R+ LAFFAG ++ P+R LL+ W + D +I H K Y L SKFCL G+EV
Sbjct: 341 RSILAFFAGGIHGPIRPILLEHWEKKDEDIQVHQYLPKGVSYYGMLRKSKFCLCPSGYEV 400
Query: 226 NTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL 285
+ R+ +++Y GCVPV+I++HY PF+DVLNWK FS+ V+ +IP LK IL IS +Y+
Sbjct: 401 ASPRVVEAIYTGCVPVLISDHYVPPFSDVLNWKMFSVEVSMKEIPNLKDILMNISPRKYI 460
Query: 286 LLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
+Q V ++R+HF+ H P YD F+M+++ +WLRR + RV
Sbjct: 461 RMQKRVRQIRRHFEVHSPPKRYDVFHMILHSVWLRRLNFRV 501
>gi|356505300|ref|XP_003521429.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 534
Score = 223 bits (569), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 134/339 (39%), Positives = 190/339 (56%), Gaps = 22/339 (6%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAF-MKSHFVTKDPSKADLF 59
M + F+V+VY F N + Y+ E F A M F T+DP KA +F
Sbjct: 201 MEKQFKVFVYEEGEPPVFHNGPC------KSIYSMEGNFIHAIEMNDQFRTRDPEKAHVF 254
Query: 60 FLPFSIARM------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
FLPFS+A + R G I ++ Y+ I+ +YPYWNR+ GADHFY+ACH
Sbjct: 255 FLPFSVAMLVQFVYVRDSHDFGP--IKKTVTDYVNVIAGRYPYWNRSLGADHFYLACHDW 312
Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKL----GSSKRN 169
G + + N+I+V+C+++ G KDVS P+I + +S R
Sbjct: 313 GPETSRSIPNLNENSIRVLCNANTS-EGFKPSKDVSFPEINLQTGSINGFIGGPSASGRP 371
Query: 170 KLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLK-TPYADELLGSKFCLHVKGFEVNT 227
LAFFAG ++ P+R LL+ W N D +I H K Y + L S+FCL G+EV +
Sbjct: 372 LLAFFAGGLHGPIRPVLLEHWENRDEDIQVHKYLPKGVSYYEMLRKSRFCLCPSGYEVAS 431
Query: 228 ARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLL 287
R+ +++Y GCVPV+I++HY PF DVLNWKSFS+ V+ DIP LK+IL IS Y+ +
Sbjct: 432 PRVVEAIYTGCVPVLISDHYVPPFNDVLNWKSFSVEVSVKDIPRLKEILLSISPRHYIRM 491
Query: 288 QSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
Q V VR+HF+ H P YD F+M+++ +WLRR + RV
Sbjct: 492 QRRVGLVRRHFEVHSPPKRYDVFHMILHSVWLRRLNFRV 530
>gi|414881250|tpg|DAA58381.1| TPA: hypothetical protein ZEAMMB73_758303 [Zea mays]
Length = 474
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 137/351 (39%), Positives = 198/351 (56%), Gaps = 36/351 (10%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH-FVTKDPSKADLF 59
M R +VY+Y R F L G YASE +F K +S V D KA LF
Sbjct: 133 MERILKVYIYQDGRRPIFHTPPL------SGIYASEGWFMKLLKESRRHVVADAGKAHLF 186
Query: 60 FLPFSIARMR--------HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACH 111
+LP+S ++R H+ R + ++ +++ ++ KYP+WNRT GADHF VACH
Sbjct: 187 YLPYSSQQLRLTLYEAGSHNLR----PLAAYLRNFVRGLASKYPFWNRTRGADHFLVACH 242
Query: 112 SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP-----KLGSS 166
G +++ NAI+ +C++ KDVSLP+ R P L S
Sbjct: 243 DWGPYTTTAHRDLRKNAIKALCNADSSEGIFTPGKDVSLPETTIRTPRRPLRYVGGLPVS 302
Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRN----DSEIYAH-----SGRLKTPYADELLGSKFC 217
+R+ LAFFAG V+ VR LL+ W N D +Y+ S R+ Y + S+FC
Sbjct: 303 RRSILAFFAGNVHGRVRPVLLRHWGNGQDDDMRVYSLLPSRVSRRMN--YIQHMKNSRFC 360
Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILK 277
L G+EVN+ RI ++LYY CVPVIIA+++ LPF++VL+W +FS+V+A DIP LKKILK
Sbjct: 361 LCPMGYEVNSPRIVEALYYECVPVIIADNFVLPFSEVLDWSAFSVVIAEKDIPDLKKILK 420
Query: 278 GISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR-SSVRVQ 327
GIS Y+ + V ++++HF WH P YD F+M+++ +WL R + V+VQ
Sbjct: 421 GISLRRYVAMHDSVKRLQRHFLWHARPIKYDLFHMILHSIWLSRVNQVQVQ 471
>gi|359481952|ref|XP_002284018.2| PREDICTED: probable glycosyltransferase At5g03795 [Vitis vinifera]
Length = 546
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 133/339 (39%), Positives = 187/339 (55%), Gaps = 20/339 (5%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M + +VYVY F N + GN+ K M SHF TKDP KA LFF
Sbjct: 210 MEKQLKVYVYDEGEPPLFHNGPCKSIYSMEGNF-----IHKMEMDSHFRTKDPEKAHLFF 264
Query: 61 LPFSIARM------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIG 114
LPFS+A + R +G I + Y+ +S KYPYWNR+ GADHF +ACH G
Sbjct: 265 LPFSVAMLVRFVYVRDSHDLGP--IKQTVIDYVNVVSTKYPYWNRSLGADHFMLACHDWG 322
Query: 115 RSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKL----GSSKRNK 170
+ N+I+V+C+++ G KDVS P+I S R
Sbjct: 323 PETSFSIPYLHKNSIRVLCNANTS-EGFNPSKDVSFPEINLLTGSTDSFIGGPSPSHRTL 381
Query: 171 LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLK-TPYADELLGSKFCLHVKGFEVNTA 228
LAFFAG ++ P+R LL+ W N D ++ H K Y + + SK+CL G+EV +
Sbjct: 382 LAFFAGGLHGPIRPILLEHWENKDEDVKVHKYLPKGVSYYEMMRKSKYCLCPSGYEVASP 441
Query: 229 RIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
R+ ++LY GCVPV+I++HY PF+DVLNWKSFS+ V +IP LK+IL IS +Y+ +Q
Sbjct: 442 RVVEALYTGCVPVLISDHYVPPFSDVLNWKSFSVEVPVREIPNLKRILMDISPRQYIRMQ 501
Query: 289 SYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
++ R+HF+ + P YD F+M+++ LWLRR + RV
Sbjct: 502 RRGIQARRHFEVNSPPKRYDVFHMILHSLWLRRLNFRVH 540
>gi|326519558|dbj|BAK00152.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 564
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 130/337 (38%), Positives = 196/337 (58%), Gaps = 23/337 (6%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAF-MKSHFVTKDPSKADLF 59
M R +V++Y H P + E +G YASE +F K +FV +DP++A LF
Sbjct: 226 MERLLKVFIY-HDGAKPIFH-----SPELKGIYASEGWFMKLIEADQNFVVRDPNRAHLF 279
Query: 60 FLPFSIARMRHDRRIGTEGIPD----FISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
+LP+S ++ H+ + D F+ +YI IS K+PYWNRT GADHF+VACH G
Sbjct: 280 YLPYSSRQLEHNLYVPGSNTLDPLSVFVKNYIDMISAKFPYWNRTKGADHFFVACHDWGP 339
Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLG---SSKRNK 170
+ E++ N I+ +C++ I +DVSLP+ + + P +G +++R+
Sbjct: 340 YTTKLHDELRKNTIKALCNADVSEGVFIRGRDVSLPETYIKSARRPVRDIGGKPAAERSI 399
Query: 171 LAFFAGAVNSPVREKLLQVW---RNDSEIYAHSGRLKT---PYADELLGSKFCLHVKGFE 224
LAFFAG ++ VR +L+ W D IY+ R T YA + SK+C+ G+E
Sbjct: 400 LAFFAGQMHGRVR-PVLKYWGGKDTDMRIYSRIPRQITRRMNYAKHMKSSKYCICPMGYE 458
Query: 225 VNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEY 284
VN+ RI +++YY CVPVIIA+++ LPF D L+W +FS+VVA D+P LK IL I Y
Sbjct: 459 VNSPRIVEAIYYECVPVIIADNFVLPFDDALDWSAFSVVVAEKDVPKLKAILLAIPESRY 518
Query: 285 LLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
+ ++S V KV++HF WH P YD F+M+++ +W R
Sbjct: 519 ITMRSNVKKVQRHFLWHAKPVKYDIFHMILHSVWFSR 555
>gi|255575457|ref|XP_002528630.1| catalytic, putative [Ricinus communis]
gi|223531919|gb|EEF33733.1| catalytic, putative [Ricinus communis]
Length = 574
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 130/339 (38%), Positives = 193/339 (56%), Gaps = 20/339 (5%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M + F+V+VY F N + GN+ + + F TKDP KA ++F
Sbjct: 236 MEKQFKVFVYEEGEPPVFHNGPCKSIYSMEGNF-----IHRMEIDDQFRTKDPEKAHVYF 290
Query: 61 LPFSIARM------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIG 114
LPFS+A M R G I + Y+ ++ KYPYWNR+ GADHF +ACH G
Sbjct: 291 LPFSVAMMVQFVYVRDSHDFGP--IKRTVRDYVNLVAGKYPYWNRSLGADHFMLACHDWG 348
Query: 115 RSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKL----GSSKRNK 170
++ N+I+ +C+++ + KDVS P+I + SKR+
Sbjct: 349 PETSFSLPDLAKNSIRALCNANTSERFNPI-KDVSFPEINLQTGTTKGFIGGPSPSKRSI 407
Query: 171 LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLK-TPYADELLGSKFCLHVKGFEVNTA 228
LAFFAG ++ P+R LL+ W N D+++ H K Y + + SKFCL G+EV +
Sbjct: 408 LAFFAGGLHGPIRPILLEHWENKDNDMKVHRYLPKGVSYYEMMRKSKFCLCPSGYEVASP 467
Query: 229 RIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
R+ ++LY GCVPV+I++HY PF+DVLNWKSFS+ V DIP LK+IL ISS +Y+ +Q
Sbjct: 468 RVVEALYTGCVPVLISDHYVPPFSDVLNWKSFSVEVPVSDIPNLKRILTSISSRQYIRMQ 527
Query: 289 SYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
VL+VR+HF+ + P YD F+M+++ +WLRR +V++
Sbjct: 528 RRVLQVRRHFEVNSPPKRYDVFHMILHSIWLRRLNVKIH 566
>gi|147784355|emb|CAN72733.1| hypothetical protein VITISV_033460 [Vitis vinifera]
Length = 321
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 112/255 (43%), Positives = 163/255 (63%), Gaps = 7/255 (2%)
Query: 77 EGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSS 136
E + + +Y+ ++ KYPYWNRT GADHF++ CH G A + + N+I+VVCS S
Sbjct: 67 ENMTIIVQNYVQSLMSKYPYWNRTLGADHFFLTCHDXGVRATEGVPLLVKNSIRVVCSPS 126
Query: 137 YFISGHIAHKDVSLPQIWPRQEDPPKLGSSK-RNKLAFFAGAVNSPVREKLLQVWRNDSE 195
Y + G I HKDV+LPQ+ P K R L F+AG NS +R L ++W ND+E
Sbjct: 127 YDV-GFIPHKDVALPQVLQPFALPTGGRDIKNRTTLGFWAGHRNSKIRVILARIWENDTE 185
Query: 196 IYAHSGRLKTP-----YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLP 250
+ + R+ Y ++ +KFC+ G +VN+ARIADS++YGCVPVI++++YDLP
Sbjct: 186 LDIKNNRINRATGHLVYQNKFYRTKFCICPGGSQVNSARIADSIHYGCVPVILSDYYDLP 245
Query: 251 FADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAF 310
F D+L+W+ FS+++ D+ LK ILK I E++ L ++KV+KHFQW+ P YDAF
Sbjct: 246 FNDILDWRKFSVILKERDVYRLKYILKDIPDAEFIALHDNLVKVQKHFQWNTPPIKYDAF 305
Query: 311 YMVMYELWLRRSSVR 325
+MVMYELWLR ++
Sbjct: 306 HMVMYELWLRHHVIK 320
>gi|297740031|emb|CBI30213.3| unnamed protein product [Vitis vinifera]
Length = 337
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 133/339 (39%), Positives = 187/339 (55%), Gaps = 20/339 (5%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M + +VYVY F N + GN+ K M SHF TKDP KA LFF
Sbjct: 1 MEKQLKVYVYDEGEPPLFHNGPCKSIYSMEGNF-----IHKMEMDSHFRTKDPEKAHLFF 55
Query: 61 LPFSIARM------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIG 114
LPFS+A + R +G I + Y+ +S KYPYWNR+ GADHF +ACH G
Sbjct: 56 LPFSVAMLVRFVYVRDSHDLGP--IKQTVIDYVNVVSTKYPYWNRSLGADHFMLACHDWG 113
Query: 115 RSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKL----GSSKRNK 170
+ N+I+V+C+++ G KDVS P+I S R
Sbjct: 114 PETSFSIPYLHKNSIRVLCNANT-SEGFNPSKDVSFPEINLLTGSTDSFIGGPSPSHRTL 172
Query: 171 LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLK-TPYADELLGSKFCLHVKGFEVNTA 228
LAFFAG ++ P+R LL+ W N D ++ H K Y + + SK+CL G+EV +
Sbjct: 173 LAFFAGGLHGPIRPILLEHWENKDEDVKVHKYLPKGVSYYEMMRKSKYCLCPSGYEVASP 232
Query: 229 RIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
R+ ++LY GCVPV+I++HY PF+DVLNWKSFS+ V +IP LK+IL IS +Y+ +Q
Sbjct: 233 RVVEALYTGCVPVLISDHYVPPFSDVLNWKSFSVEVPVREIPNLKRILMDISPRQYIRMQ 292
Query: 289 SYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
++ R+HF+ + P YD F+M+++ LWLRR + RV
Sbjct: 293 RRGIQARRHFEVNSPPKRYDVFHMILHSLWLRRLNFRVH 331
>gi|51535578|dbj|BAD37522.1| pectin-glucuronyltransferase-like [Oryza sativa Japonica Group]
Length = 342
Score = 221 bits (563), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 136/347 (39%), Positives = 197/347 (56%), Gaps = 28/347 (8%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M RSFRV+VYP DP P G YASE YF + +S F T D KA LFF
Sbjct: 1 MERSFRVFVYPD--GDPGTFYQTPRKLT--GKYASEGYFFQNIRESRFRTDDLEKAHLFF 56
Query: 61 LPFSIARMRHD-----RRIGTEGIPDFISHYIFNISQKYPYW----------NRTGGADH 105
+P S +MR + + I Y +I YW + GADH
Sbjct: 57 VPISPHKMRGKVPSSLLLVTYAWLILHIRSYDRSILFLDLYWWCPLCSSFRGHWGVGADH 116
Query: 106 FYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS 165
F+V CH +G A + + N+I+VVCS SY +G+I HKDV+LPQI + P G+
Sbjct: 117 FFVTCHDVGVRAFEGLPFIIKNSIRVVCSPSY-NAGYIPHKDVALPQIL-QPFALPAGGN 174
Query: 166 SKRNK--LAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCL 218
N+ L F+AG NS +R L ++W ND+E+ + R+ Y +KFC+
Sbjct: 175 DIENRTILGFWAGHRNSKIRVILARIWENDTELAISNNRINRAIGNLVYQKHFFRTKFCV 234
Query: 219 HVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKG 278
G +VN+ARI+DS++YGC+PVI++++YDL F+ +LNW+ F++V+ D+ LK ILK
Sbjct: 235 CPGGSQVNSARISDSIHYGCMPVILSDYYDLRFSGILNWRKFAVVLKESDVYELKSILKS 294
Query: 279 ISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
+S +E++ L +++V+KHF+WH P YDAF+M+MYELWLR ++
Sbjct: 295 LSQKEFVSLHKSLVQVQKHFEWHSPPVPYDAFHMIMYELWLRHHVIK 341
>gi|326529697|dbj|BAK04795.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 348
Score = 221 bits (563), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 137/342 (40%), Positives = 190/342 (55%), Gaps = 31/342 (9%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH-FVTKDPSKADLF 59
M R +VY+Y R F L G YASE +F K +S FV D +KA LF
Sbjct: 9 MERILKVYIYQDGRRPIFHTPPLS------GIYASEGWFMKLLKESRRFVVADAAKAHLF 62
Query: 60 FLPFSIARMR--------HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACH 111
+LP+S +R H+ R + DF+ ++ KYP+WNRT GADHF VACH
Sbjct: 63 YLPYSSQHLRLSLYVPDSHNLRPLAVYLRDFVK----GLAAKYPFWNRTRGADHFLVACH 118
Query: 112 SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP-----KLGSS 166
G ++ N+I+ +C++ KDVSLP+ R P L S
Sbjct: 119 DWGPYTTTAHRDLSKNSIKALCNADSSEGIFTPGKDVSLPETTIRTPKRPLRYVGGLPVS 178
Query: 167 KRNKLAFFAGAVNSPVREKLLQVW----RNDSEIYAH-SGRLKTP--YADELLGSKFCLH 219
+R LAFFAG V+ VR LLQ W +D +YA GR+ Y + SKFCL
Sbjct: 179 RRRILAFFAGNVHGRVRPVLLQHWGKGQDDDMRVYALLPGRVSRTMNYIQHMKNSKFCLC 238
Query: 220 VKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGI 279
G+EVN+ RI ++LYY CVPVIIA+++ LPF+DVL+W +FS+VVA DIP LK+IL+GI
Sbjct: 239 PMGYEVNSPRIVEALYYECVPVIIADNFVLPFSDVLDWSAFSVVVAEKDIPELKRILQGI 298
Query: 280 SSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
S Y+ + V ++++HF W+ P YD F+M+++ +WL R
Sbjct: 299 SLRRYVAMHDCVKRLQRHFLWYDRPLRYDLFHMILHSIWLSR 340
>gi|242053777|ref|XP_002456034.1| hypothetical protein SORBIDRAFT_03g029210 [Sorghum bicolor]
gi|241928009|gb|EES01154.1| hypothetical protein SORBIDRAFT_03g029210 [Sorghum bicolor]
Length = 517
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 132/342 (38%), Positives = 193/342 (56%), Gaps = 31/342 (9%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH-FVTKDPSKADLF 59
M R +VY+Y R F L G YASE +F K +S V D KA LF
Sbjct: 178 MERILKVYIYQDGRRPIFHTPPL------SGIYASEGWFMKLLKESRRHVVADAGKAHLF 231
Query: 60 FLPFSIARMR--------HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACH 111
+LP+S ++R H+ R + ++ +++ ++ KYP+WNRT GADHF VACH
Sbjct: 232 YLPYSSQQLRLTLYQADSHNLR----PLAAYLRNFVRGLANKYPFWNRTRGADHFLVACH 287
Query: 112 SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP-----KLGSS 166
G +++ NAI+ +C++ KDVSLP+ R P L S
Sbjct: 288 DWGPYTTAAHRDLRKNAIKALCNADSSEGIFTPGKDVSLPETTIRNPRRPLRYVGGLPVS 347
Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRN--DSEIYAHS---GRL--KTPYADELLGSKFCLH 219
+R+ LAFFAG V+ VR LL+ W + D E+ +S R+ + Y + S+FCL
Sbjct: 348 RRSILAFFAGNVHGRVRPVLLRHWGDGQDDEMRVYSLLPNRVSRRMNYIQHMKNSRFCLC 407
Query: 220 VKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGI 279
G+EVN+ RI ++ YY CVPVIIA+++ LP ++VL+W +FS+VVA DIP LKKIL+GI
Sbjct: 408 PMGYEVNSPRIVEAFYYECVPVIIADNFVLPLSEVLDWSAFSVVVAEKDIPDLKKILQGI 467
Query: 280 SSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
S Y+ + S V ++++HF WH P YD F+M+++ +WL R
Sbjct: 468 SPRRYVAMHSCVKRLQRHFLWHARPIKYDLFHMILHSIWLSR 509
>gi|218188734|gb|EEC71161.1| hypothetical protein OsI_03019 [Oryza sativa Indica Group]
Length = 499
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 130/343 (37%), Positives = 190/343 (55%), Gaps = 33/343 (9%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH-FVTKDPSKADLF 59
M R +VY+Y R F L G YASE +F K +S F DP+KA LF
Sbjct: 160 MERILKVYIYQDGRRPIFHTPPL------SGIYASEGWFMKLLKESRRFAVTDPAKAHLF 213
Query: 60 FLPFSIARMR--------HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACH 111
+LP+S ++R H+ R + DF+ ++ KYP+WNRT GADHF VACH
Sbjct: 214 YLPYSSQQLRISLYVPDSHNLRPLAAYLRDFVK----GLAAKYPFWNRTRGADHFLVACH 269
Query: 112 SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP-----KLGSS 166
G +++ N ++ +C++ +DVSLP+ R P L S
Sbjct: 270 DWGSYTTTAHGDLRRNTVKALCNADSSEGIFTPGRDVSLPETTIRTPRRPLRYVGGLPVS 329
Query: 167 KRNKLAFFAGAVNSPVREKLLQVW----RNDSEIY----AHSGRLKTPYADELLGSKFCL 218
+R LAFFAG V+ VR LL+ W +D +Y A R + Y + S+FCL
Sbjct: 330 RRGILAFFAGNVHGRVRPVLLKHWGDGRDDDMRVYGPLPARVSR-RMSYIQHMKNSRFCL 388
Query: 219 HVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKG 278
G+EVN+ RI ++LYY CVPVIIA+++ LP +DVL+W +F++VVA D+P LKKIL+G
Sbjct: 389 CPMGYEVNSPRIVEALYYECVPVIIADNFVLPLSDVLDWSAFAVVVAEKDVPDLKKILQG 448
Query: 279 ISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
I+ +Y+ + V ++++HF WH P YD F+M+++ +WL R
Sbjct: 449 ITLRKYVAMHGCVKRLQRHFLWHARPLRYDLFHMILHSIWLSR 491
>gi|296083497|emb|CBI23466.3| unnamed protein product [Vitis vinifera]
Length = 585
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 128/337 (37%), Positives = 187/337 (55%), Gaps = 22/337 (6%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKS-HFVTKDPSKADLF 59
M +VY+YP F L RG YASE +F K ++ FVT+DP KA LF
Sbjct: 199 METILKVYIYPDGARPIFHAPHL------RGIYASEGWFMKLMEENRQFVTRDPKKAHLF 252
Query: 60 FLPFSIARMRHDRRI----GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
+LP+S ++ + + F+ ++ I+ KYP+WNRT G+DHF VACH G
Sbjct: 253 YLPYSARQLETALYVPNSHNIRPLSIFLRDHVNMIAAKYPFWNRTHGSDHFLVACHDWGP 312
Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLGS---SKRNK 170
+ + E+ N I+ +C++ +A KDVSLP+ R P +G S+R
Sbjct: 313 YTVNEHQELSRNTIKALCNADLSEGIFVAGKDVSLPETTIRNPRRPLRNVGGRRVSQRPI 372
Query: 171 LAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRL------KTPYADELLGSKFCLHVKGFE 224
LAFFAG ++ VR LL+ W + E G L K Y + S+FC+ G+E
Sbjct: 373 LAFFAGNMHGRVRPTLLKYWSDKDEDMRIYGPLPNRISRKMSYIQHMKSSRFCICPMGYE 432
Query: 225 VNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEY 284
VN+ RI +++YY CVPVIIA+++ P DVL+W +FS++VA DIP LK+IL I Y
Sbjct: 433 VNSPRIVEAIYYECVPVIIADNFVPPLNDVLDWTAFSVIVAEKDIPKLKEILLAIPLRRY 492
Query: 285 LLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
L++Q+ V V+KHF W+ P YD F+M+++ +W R
Sbjct: 493 LVMQTNVKMVQKHFLWNPKPVRYDLFHMILHSIWFSR 529
>gi|356565368|ref|XP_003550913.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 534
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 128/315 (40%), Positives = 185/315 (58%), Gaps = 29/315 (9%)
Query: 30 RGNYASESYFKKAFMKS-HFVTKDPSKADLFFLPFSIARM--------RHDRRIGTEGIP 80
+G YASE +F K ++ FVTKDP KA LF+LP+S +M HD + +
Sbjct: 217 KGIYASEGWFMKLMEENKQFVTKDPEKAHLFYLPYSARQMGLTLYVPGSHD----LKPLS 272
Query: 81 DFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS 140
F+ Y+ I+ KYP+WNRT G+DHF VACH G + E+K N I+ +C++
Sbjct: 273 IFLRDYVNKIAAKYPFWNRTQGSDHFLVACHDWGPYTVTGHEELKRNTIKALCNADLSEG 332
Query: 141 GHIAHKDVSLPQIWPRQEDPPK--LGSSK---RNKLAFFAGAVNSPVREKLLQVW----R 191
+A +DVSLP+ R P LG ++ R LAFFAG+++ VR LL W
Sbjct: 333 VFVAGRDVSLPETTIRAPRRPLRYLGGNRVSLRPILAFFAGSMHGRVRPTLLTYWGGGKD 392
Query: 192 NDSEIYAH-----SGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANH 246
D +IY S R+ Y + SK+C+ GFEVN+ RI +++YY CVPVIIA++
Sbjct: 393 EDMKIYKRLPLRVSQRMT--YIQHMKSSKYCVCPMGFEVNSPRIVEAIYYECVPVIIADN 450
Query: 247 YDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSD 306
+ LPF++VL+W +FS+VVA DIP LK+IL I +YL +Q+ V V+KHF W+ P
Sbjct: 451 FVLPFSEVLDWSAFSVVVAEKDIPRLKEILLSIPLRKYLTMQNNVKMVQKHFLWNPRPIR 510
Query: 307 YDAFYMVMYELWLRR 321
YD F+M+++ +W +
Sbjct: 511 YDLFHMILHSIWFNK 525
>gi|413926326|gb|AFW66258.1| hypothetical protein ZEAMMB73_148778 [Zea mays]
Length = 267
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 110/255 (43%), Positives = 161/255 (63%), Gaps = 7/255 (2%)
Query: 77 EGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSS 136
E + + Y+ + KYPYWNRT GADHF+V CH +G A + + N+I+VVCS S
Sbjct: 13 ENMTVIVKDYVEGLINKYPYWNRTLGADHFFVTCHDVGVRAFEGLKFMVKNSIRVVCSPS 72
Query: 137 YFISGHIAHKDVSLPQIW-PRQEDPPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSE 195
Y + I HKD++LPQ+ P R L F+AG NS +R L +VW ND+E
Sbjct: 73 YNVD-FIPHKDIALPQVLQPFALHEGGNDIDNRVILGFWAGHRNSKIRVILARVWENDTE 131
Query: 196 IYAHSGRL-----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLP 250
+ + R+ + Y + +KFC+ G +VN+ARI+DS++YGCVPVI++++YDLP
Sbjct: 132 LAISNNRISRAIGELVYQKQFYRTKFCICPGGSQVNSARISDSIHYGCVPVILSDYYDLP 191
Query: 251 FADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAF 310
F D L+W+ F++++ D+ LK ILK IS EE++ L +++V+KHF WH P YDAF
Sbjct: 192 FNDALDWRKFAVILRERDVYQLKNILKSISQEEFISLHKSLVQVQKHFVWHSPPVSYDAF 251
Query: 311 YMVMYELWLRRSSVR 325
+MVMYELWLR + ++
Sbjct: 252 HMVMYELWLRHNVIK 266
>gi|359496494|ref|XP_002269459.2| PREDICTED: probable glycosyltransferase At5g03795-like [Vitis
vinifera]
Length = 554
Score = 218 bits (554), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 127/335 (37%), Positives = 187/335 (55%), Gaps = 22/335 (6%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKS-HFVTKDPSKADLF 59
M +VY+YP F L RG YASE +F K ++ FVT+DP KA LF
Sbjct: 199 METILKVYIYPDGARPIFHAPHL------RGIYASEGWFMKLMEENRQFVTRDPKKAHLF 252
Query: 60 FLPFSIARMRHDRRI----GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
+LP+S ++ + + F+ ++ I+ KYP+WNRT G+DHF VACH G
Sbjct: 253 YLPYSARQLETALYVPNSHNIRPLSIFLRDHVNMIAAKYPFWNRTHGSDHFLVACHDWGP 312
Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLGS---SKRNK 170
+ + E+ N I+ +C++ +A KDVSLP+ R P +G S+R
Sbjct: 313 YTVNEHQELSRNTIKALCNADLSEGIFVAGKDVSLPETTIRNPRRPLRNVGGRRVSQRPI 372
Query: 171 LAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRL------KTPYADELLGSKFCLHVKGFE 224
LAFFAG ++ VR LL+ W + E G L K Y + S+FC+ G+E
Sbjct: 373 LAFFAGNMHGRVRPTLLKYWSDKDEDMRIYGPLPNRISRKMSYIQHMKSSRFCICPMGYE 432
Query: 225 VNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEY 284
VN+ RI +++YY CVPVIIA+++ P DVL+W +FS++VA DIP LK+IL I Y
Sbjct: 433 VNSPRIVEAIYYECVPVIIADNFVPPLNDVLDWTAFSVIVAEKDIPKLKEILLAIPLRRY 492
Query: 285 LLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWL 319
L++Q+ V V+KHF W+ P YD F+M+++ +W+
Sbjct: 493 LVMQTNVKMVQKHFLWNPKPVRYDLFHMILHSIWV 527
>gi|356511373|ref|XP_003524401.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 643
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 131/337 (38%), Positives = 193/337 (57%), Gaps = 26/337 (7%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKS-HFVTKDPSKADLF 59
M R+ +VY+Y F ++ +G YASE +F K ++ HFV KDP+KA LF
Sbjct: 310 MERTLKVYIYKDGNKPIFHQPIM------KGLYASEGWFMKLMEENKHFVLKDPAKAHLF 363
Query: 60 FLPFSIARMRHDRRI----GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
++PFS + H + + F+ Y IS KY Y+NRTGGADHF VACH
Sbjct: 364 YMPFSSRMLEHALYVRNSHNRTNLRQFLKDYTDKISAKYRYFNRTGGADHFLVACHDWAP 423
Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQ-EDPPK-LGSS---KRNK 170
+ E I+ +C++ G +DVSLP+ + R DP + LG +R
Sbjct: 424 YETRHHMEY---CIKALCNAD-VTQGFKIGRDVSLPEAYVRSVRDPQRDLGGKPPHQRPI 479
Query: 171 LAFFAGAVNSPVREKLLQVWRN---DSEIYA---HSGRLKTPYADELLGSKFCLHVKGFE 224
LAF+AG ++ +R LL+ W++ D +IY H K Y + + SK+C+ KG+E
Sbjct: 480 LAFYAGNMHGYLRPILLKHWKDKDPDMKIYGPMPHGAASKMNYINHMKNSKYCICPKGYE 539
Query: 225 VNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEY 284
VN+ R+ ++++Y CVPVII++++ PF +VLNW +FSI++A DIP LK+IL +S E+Y
Sbjct: 540 VNSPRVVEAIFYECVPVIISDNFVPPFFEVLNWDAFSIILAEKDIPNLKQILLSVSQEKY 599
Query: 285 LLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
L LQ V K +KHF WHV P YD F+M ++ +W R
Sbjct: 600 LKLQLGVRKAQKHFFWHVKPLKYDLFHMTLHSIWYNR 636
>gi|357476811|ref|XP_003608691.1| hypothetical protein MTR_4g100730 [Medicago truncatula]
gi|355509746|gb|AES90888.1| hypothetical protein MTR_4g100730 [Medicago truncatula]
Length = 535
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 133/345 (38%), Positives = 193/345 (55%), Gaps = 37/345 (10%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKS-HFVTKDPSKADLF 59
M +VY+Y F N L +G YASE +F K ++ FVTKDP +A LF
Sbjct: 200 METVLKVYIYRDGSRPIFHNPSL------KGIYASEGWFMKLMQENKQFVTKDPERAHLF 253
Query: 60 FLPFSIARMR--------HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACH 111
+LP+S +M HD + + F+ Y+ I+ KYP+WNRT G+DHF VACH
Sbjct: 254 YLPYSARQMEVTLYVPGSHD----LKPLSIFLRDYVNKIAAKYPFWNRTHGSDHFLVACH 309
Query: 112 SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPK--LGSSK-- 167
G + + E+ N ++ +C++ I +DVSLP+ R P LG ++
Sbjct: 310 DWGPYTVTEHEELARNTLKALCNADLSERIFIEGRDVSLPETTIRAPRRPLRYLGGNRAS 369
Query: 168 -RNKLAFFAGAVNSPVREKLLQVWRN----DSEIYAHSGRL------KTPYADELLGSKF 216
R LAFFAG+++ VR LL+ W D +IY RL K Y + SK+
Sbjct: 370 LRPILAFFAGSMHGRVRPTLLKYWGGEKYEDMKIYK---RLPLRVSKKMTYIQHMKSSKY 426
Query: 217 CLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKIL 276
CL GFEVN+ RI +++YY CVPVIIA+++ LP ++VL+W +FS+VVA DIP LK IL
Sbjct: 427 CLCPMGFEVNSPRIVEAIYYECVPVIIADNFVLPLSEVLDWSAFSVVVAEKDIPRLKDIL 486
Query: 277 KGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
I +Y+ +Q+ V V+KHF W+ P YD F+M+++ +WL +
Sbjct: 487 LSIPMRKYVAMQNNVKMVQKHFLWNPKPIRYDLFHMILHSIWLNK 531
>gi|222618931|gb|EEE55063.1| hypothetical protein OsJ_02778 [Oryza sativa Japonica Group]
Length = 482
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 126/335 (37%), Positives = 184/335 (54%), Gaps = 34/335 (10%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH-FVTKDPSKADLF 59
M R +VY+Y R F L G YASE +F K +S F DP+KA LF
Sbjct: 160 MERILKVYIYQDGRRPIFHTPPL------SGIYASEGWFMKLLKESRRFAVTDPAKAHLF 213
Query: 60 FLPFSIARMR--------HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACH 111
+LP+S ++R H+ R + DF+ ++ KYP+WNRT GADHF VACH
Sbjct: 214 YLPYSSQQLRISLYVPDSHNLRPLAAYLRDFVK----GLAAKYPFWNRTRGADHFLVACH 269
Query: 112 SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP-----KLGSS 166
G +++ N ++ +C++ +DVSLP+ R P L S
Sbjct: 270 DWGSYTTTAHGDLRRNTVKALCNADSSEGIFTPGRDVSLPETTIRTPRRPLRYVGGLPVS 329
Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVN 226
+R LAFFAG V+ VR LL+ W G + + S+FCL G+EVN
Sbjct: 330 RRGILAFFAGNVHGRVRPVLLKHW----------GDGRDDDMRHMKNSRFCLCPMGYEVN 379
Query: 227 TARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLL 286
+ RI ++LYY CVPVIIA+++ LP +DVL+W +F++VVA D+P LKKIL+GI+ +Y+
Sbjct: 380 SPRIVEALYYECVPVIIADNFVLPLSDVLDWSAFAVVVAEKDVPDLKKILQGITLRKYVA 439
Query: 287 LQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
+ V ++++HF WH P YD F+M+++ +WL R
Sbjct: 440 MHGCVKRLQRHFLWHARPLRYDLFHMILHSIWLSR 474
>gi|168033546|ref|XP_001769276.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162679541|gb|EDQ65988.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 408
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 130/348 (37%), Positives = 194/348 (55%), Gaps = 35/348 (10%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH--FVTKDPSKADL 58
MN++F++YVY D F ++ + G YA+E F K S+ F +PSKA +
Sbjct: 61 MNKTFKIYVY----RDGFKPLVHGA--KTGGIYATEGLFLKRMEDSNNRFTVSEPSKAHM 114
Query: 59 FFLPFSIARM---------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVA 109
F LP+S+ +M R R + T FIS+Y+ ++ KYPYWNRT GADHF+V+
Sbjct: 115 FLLPYSVRQMVDILQDPYSRSMRPLKT-----FISNYVDTLASKYPYWNRTHGADHFFVS 169
Query: 110 CHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQE----DPPKLGS 165
CH + E+ N+++VVC++ ++ I KDVS+PQ D LG
Sbjct: 170 CHDWAPLSTMLHGELHTNSMKVVCNADLTVNFDI-EKDVSIPQTLKGGNQSDLDVGSLGP 228
Query: 166 SKRNKLAFFAGAVNSPVREKLLQVWRNDS------EIYAHSGRLKTPYADELLGSKFCLH 219
+R+ LAF+AG ++ VR LL W+ E+ + YA + S++CL
Sbjct: 229 EERDFLAFYAGQMHGTVRPVLLDYWKGKDPTMKVYEVLPSDIAVNISYAQHMKRSRYCLC 288
Query: 220 VKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGI 279
KGFEVN+ RI +++ GCVPVIIA+++ LP+ DVL+W FS+ V DIP LKKIL I
Sbjct: 289 PKGFEVNSPRIVEAILSGCVPVIIADNFVLPYNDVLDWTKFSVTVPEEDIPDLKKILSSI 348
Query: 280 SSEEYLLLQSYVLKVRKHFQWHVFPSD--YDAFYMVMYELWLRRSSVR 325
S+ Y +Q + +R+HF W P D YD+F+M +Y +W + ++R
Sbjct: 349 SNVTYRSMQRRLRYIRRHFLWLEDPEDTQYDSFHMTLYSIWRQSMNLR 396
>gi|449482564|ref|XP_004156325.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
sativus]
Length = 516
Score = 216 bits (549), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 132/330 (40%), Positives = 189/330 (57%), Gaps = 20/330 (6%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAF-MKSHFVTKDPSKADLF 59
M + +++VY F N + Y++E F A M S F TKDP+KA +F
Sbjct: 186 MEKEMKIFVYEEGEPPLFHNGPC------KSIYSTEGNFIHAIEMDSQFRTKDPNKAHVF 239
Query: 60 FLPFSIARMR-----HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIG 114
FLP S+A + HD T I + YI I KYP+WNR+ GADHF ++CH G
Sbjct: 240 FLPLSVAMLVRFVYVHDSHDFTP-IRHTVVDYINVIGTKYPFWNRSLGADHFMLSCHDWG 298
Query: 115 RSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPK-LGS---SKRNK 170
A + + N+I+V+C+++ G KDVS P+I + LG S R
Sbjct: 299 PEASKSVPNLYKNSIRVLCNANT-SEGFNPSKDVSFPEINLQTGHLTGFLGGPSPSHRPI 357
Query: 171 LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLK-TPYADELLGSKFCLHVKGFEVNTA 228
LAFFAG ++ P+R L+Q W N D +I H K Y D + SKFCL G+EV +
Sbjct: 358 LAFFAGGLHGPIRPILIQQWENQDQDIQVHQYLPKGVSYIDMMRKSKFCLCPSGYEVASP 417
Query: 229 RIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
RI +++Y GCVPV+I++HY PF+DV+NWKSFS+ V+ DIP LK IL GIS+ +YL +
Sbjct: 418 RIVEAIYTGCVPVLISDHYVPPFSDVINWKSFSVEVSVDDIPNLKTILTGISTRQYLRMY 477
Query: 289 SYVLKVRKHFQWHVFPSDYDAFYMVMYELW 318
V+KVR+HF+ + P YD ++M+++ +W
Sbjct: 478 RRVVKVRRHFEVNSPPKRYDVYHMILHSVW 507
>gi|115438799|ref|NP_001043679.1| Os01g0640600 [Oryza sativa Japonica Group]
gi|55297176|dbj|BAD68851.1| pectin-glucuronyltransferase-like [Oryza sativa Japonica Group]
gi|113533210|dbj|BAF05593.1| Os01g0640600 [Oryza sativa Japonica Group]
Length = 501
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 130/345 (37%), Positives = 190/345 (55%), Gaps = 35/345 (10%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH-FVTKDPSKADLF 59
M R +VY+Y R F L G YASE +F K +S F DP+KA LF
Sbjct: 160 MERILKVYIYQDGRRPIFHTPPL------SGIYASEGWFMKLLKESRRFAVTDPAKAHLF 213
Query: 60 FLPFSIARMR--------HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACH 111
+LP+S ++R H+ R + DF+ ++ KYP+WNRT GADHF VACH
Sbjct: 214 YLPYSSQQLRISLYVPDSHNLRPLAAYLRDFVK----GLAAKYPFWNRTRGADHFLVACH 269
Query: 112 S--IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP-----KLG 164
G +++ N ++ +C++ +DVSLP+ R P L
Sbjct: 270 DWLQGSYTTTAHGDLRRNTVKALCNADSSEGIFTPGRDVSLPETTIRTPRRPLRYVGGLP 329
Query: 165 SSKRNKLAFFAGAVNSPVREKLLQVW----RNDSEIY----AHSGRLKTPYADELLGSKF 216
S+R LAFFAG V+ VR LL+ W +D +Y A R + Y + S+F
Sbjct: 330 VSRRGILAFFAGNVHGRVRPVLLKHWGDGRDDDMRVYGPLPARVSR-RMSYIQHMKNSRF 388
Query: 217 CLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKIL 276
CL G+EVN+ RI ++LYY CVPVIIA+++ LP +DVL+W +F++VVA D+P LKKIL
Sbjct: 389 CLCPMGYEVNSPRIVEALYYECVPVIIADNFVLPLSDVLDWSAFAVVVAEKDVPDLKKIL 448
Query: 277 KGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
+GI+ +Y+ + V ++++HF WH P YD F+M+++ +WL R
Sbjct: 449 QGITLRKYVAMHGCVKRLQRHFLWHARPLRYDLFHMILHSIWLSR 493
>gi|356563757|ref|XP_003550126.1| PREDICTED: probable glycosyltransferase At5g03795 [Glycine max]
Length = 645
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 127/340 (37%), Positives = 188/340 (55%), Gaps = 33/340 (9%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEP--RGNYASESYFKKAF-MKSHFVTKDPSKAD 57
M ++ +VYVY P+ P G YASE +F K F+T+DP+KA
Sbjct: 313 MEQTLKVYVYREGAR--------PIMHSPFFTGLYASEGWFMKQMEANKRFLTRDPNKAH 364
Query: 58 LFFLPFSIARMRHDRRIGT----EGIPDFISHYIFNISQKYPYWNRTGGADHFYVACH-- 111
LF+LPFS + + + + ++ +Y+ I+ KY +WNRTGGADHF V CH
Sbjct: 365 LFYLPFSSRMLEETLYVQNSHNHKNLVQYLHNYVEMIAGKYTFWNRTGGADHFLVGCHDW 424
Query: 112 SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKL-----GSS 166
+ G + + A N I+ +C++ G + KD SLP+ + R P +S
Sbjct: 425 APGETKVDMA-----NCIRSLCNAD-VKEGFVFGKDASLPETYVRDAKIPTKDLSGNSAS 478
Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCLHVK 221
KR LAFFAG+++ VR LLQ W N GRL Y + SK+C+ K
Sbjct: 479 KRTTLAFFAGSMHGYVRPILLQHWENKDPDMKIFGRLPKSKGNRNYIQYMKSSKYCICAK 538
Query: 222 GFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISS 281
G+EVN+ R+ ++++Y CVPVII++++ PF +VLNW+SF+++V DIP LK IL I
Sbjct: 539 GYEVNSPRVVEAIFYECVPVIISDNFVPPFLEVLNWESFAVIVLEKDIPNLKNILLSIPE 598
Query: 282 EEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
++YL LQ V KV++HF WH P YD F+M+++ +W R
Sbjct: 599 KQYLRLQMRVKKVQQHFLWHKNPVKYDIFHMILHSVWYNR 638
>gi|224092294|ref|XP_002309547.1| predicted protein [Populus trichocarpa]
gi|222855523|gb|EEE93070.1| predicted protein [Populus trichocarpa]
Length = 471
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 134/341 (39%), Positives = 195/341 (57%), Gaps = 33/341 (9%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFM-KSHFVTKDPSKADLF 59
M +VY+Y F LL G YASE +F K FVTKD KA LF
Sbjct: 137 MEDILKVYIYKEGEMPIFHQPLL------NGIYASEGWFMKLLEGNKKFVTKDSKKAHLF 190
Query: 60 FLPFSI----ARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
+LPFS R+ + + +++ Y+ IS+KYP+WNRT GADHF ACH
Sbjct: 191 YLPFSSRYLEIRLYVPNSHSHKNLIEYLKKYLDMISEKYPFWNRTQGADHFLAACHDWAP 250
Query: 116 SAMQKAWEVKLNAIQVVCSSSY---FISGHIAHKDVSLPQIWP-RQEDPPK-LG---SSK 167
S ++ N I+ +C+S F+ G KD SLP+ + QE+P + LG +SK
Sbjct: 251 SETRQHMA---NCIRALCNSDAKEDFVYG----KDASLPETYVLTQENPLRDLGGNRASK 303
Query: 168 RNKLAFFAGAVNSPVREKLLQVWRN---DSEIYAH----SGRLKTPYADELLGSKFCLHV 220
R+ LAFFAG+++ +R LLQ W N D +I+ GR K YA + SK+C+
Sbjct: 304 RSILAFFAGSMHGYLRPILLQHWENKDPDMKIFGRLPKVKGRGKMNYARYMKSSKYCICA 363
Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGIS 280
KG+EVN+ R+ ++++Y CVPVII++++ PF +VLNW+SF++ V DIP LKKIL I
Sbjct: 364 KGYEVNSPRVVEAIFYECVPVIISDNFVPPFLEVLNWESFAVFVLEKDIPNLKKILLSIP 423
Query: 281 SEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
+++Y +Q V +V++HF WH P YD F+M+++ +W R
Sbjct: 424 AKKYRRMQMRVKRVQQHFLWHARPVKYDVFHMILHSIWYNR 464
>gi|449451243|ref|XP_004143371.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
sativus]
Length = 516
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 131/330 (39%), Positives = 189/330 (57%), Gaps = 20/330 (6%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAF-MKSHFVTKDPSKADLF 59
M + +++VY F N + Y++E F A M S F TKDP+KA +F
Sbjct: 186 MEKEMKIFVYEEGEPPLFHNGPC------KSIYSTEGNFIHAIEMDSQFRTKDPNKAHVF 239
Query: 60 FLPFSIARMR-----HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIG 114
FLP S+A + HD T I + YI I KYP+WNR+ GADHF ++CH G
Sbjct: 240 FLPLSVAMLVRFVYVHDSHDFTP-IRHTVVDYINVIGTKYPFWNRSLGADHFMLSCHDWG 298
Query: 115 RSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPK-LGS---SKRNK 170
A + + N+I+V+C+++ G KDVS P+I + LG S R
Sbjct: 299 PEASKSVPNLYKNSIRVLCNANT-SEGFNPSKDVSFPEINLQTGHLTGFLGGPSPSHRPI 357
Query: 171 LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLK-TPYADELLGSKFCLHVKGFEVNTA 228
+AFFAG ++ P+R L+Q W N D +I H K Y D + SKFCL G+EV +
Sbjct: 358 MAFFAGGLHGPIRPILIQRWENQDQDIQVHQYLPKGVSYIDMMRKSKFCLCPSGYEVASP 417
Query: 229 RIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
RI +++Y GCVPV+I++HY PF+DV+NWKSFS+ V+ DIP LK IL GIS+ +YL +
Sbjct: 418 RIVEAIYTGCVPVLISDHYVPPFSDVINWKSFSVEVSVDDIPNLKTILTGISTRQYLRMY 477
Query: 289 SYVLKVRKHFQWHVFPSDYDAFYMVMYELW 318
V+KVR+HF+ + P YD ++M+++ +W
Sbjct: 478 RRVVKVRRHFEVNSPPKRYDVYHMILHSVW 507
>gi|357119191|ref|XP_003561329.1| PREDICTED: probable glycosyltransferase At5g03795-like
[Brachypodium distachyon]
Length = 569
Score = 214 bits (546), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 130/342 (38%), Positives = 190/342 (55%), Gaps = 27/342 (7%)
Query: 1 MNRSFRVYVY-----PHRRNDPFANVLLPVDFEPRGNYASESYFKKAF-MKSHFVTKDPS 54
M + F++YVY P + P N+ Y++E F A M++ T DP
Sbjct: 233 MEKLFKIYVYDEGEPPIYHDGPCHNI-----------YSTEGRFIHAMEMENRMRTTDPG 281
Query: 55 KADLFFLPFSIARMRHDRRI----GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVAC 110
A +FFLPFSIA+M + E + + YI +S K+PYWNR+ GADHF ++C
Sbjct: 282 LAHVFFLPFSIAKMEKTIYVPGSHTMEPLRRTVFDYIDVLSTKHPYWNRSQGADHFMLSC 341
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLG---SSK 167
H G + N+I+V+C+++ G I KDVSLP+I + +G +S
Sbjct: 342 HDWGPYVSSVDGNLFSNSIRVLCNANTS-EGFIPSKDVSLPEINHLNDFKKDIGGPSASG 400
Query: 168 RNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKT--PYADELLGSKFCLHVKGFEV 225
R LAFFAG + PVR LL+ W+ S L Y + + SKFCL GFEV
Sbjct: 401 RPILAFFAGGNHGPVRPLLLKHWKGKDPDVQVSEYLPAGVSYVETMRRSKFCLCPSGFEV 460
Query: 226 NTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL 285
+ R+A+++Y CVPV+IA+ Y LPF+DVL+W +FS+ VA DIP +K+IL +S Y+
Sbjct: 461 ASPRVAEAIYVECVPVVIADDYVLPFSDVLSWPAFSLRVAVRDIPDIKRILSAVSPRRYI 520
Query: 286 LLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
+Q V VR+HF + P YD F+M+++ +WLRR +VR+
Sbjct: 521 RMQRRVRAVRRHFMLNGVPQRYDVFHMILHSIWLRRLNVRIH 562
>gi|168029668|ref|XP_001767347.1| predicted protein [Physcomitrella patens subsp. patens]
gi|168029742|ref|XP_001767384.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162681411|gb|EDQ67838.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162681448|gb|EDQ67875.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 333
Score = 214 bits (545), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 129/345 (37%), Positives = 197/345 (57%), Gaps = 28/345 (8%)
Query: 1 MNRSFRVYVY-----PHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH-FVTKDPS 54
M R F+VYVY P N P + YA E F + + FVT DP
Sbjct: 1 MERRFKVYVYSEGEEPLVHNGPCKEI-----------YAVEGRFIQELQGDNPFVTHDPD 49
Query: 55 KADLFFLPFSIARM---RHDRRIG-TEGIPDFISHYIFNISQKYPYWNRTGGADHFYVAC 110
A ++FLPFS+A M +++ G + + F+ Y+ + KYP+WNR+GGADHF ++C
Sbjct: 50 NAHVYFLPFSVAMMVAYLYEKESGDMDPLRLFVGDYVDVLMHKYPFWNRSGGADHFMLSC 109
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WPRQEDPPKLG---SS 166
H G ++ + +I+V+C+++ G++ KDVSLP+I P +LG +
Sbjct: 110 HDWGPLITRENMNLGTRSIRVLCNANS-SEGYVPWKDVSLPEIHLVGGHIPAELGGPPAK 168
Query: 167 KRNKLAFFAGAVNSPVREKLLQVW--RNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFE 224
R LAFFAG + PVR +L + W ++D I Y + + S++C+ G+E
Sbjct: 169 DRPHLAFFAGRDHGPVRPQLFKHWEGKDDDVIVYQWLPAHLKYHELMKTSRYCICPGGYE 228
Query: 225 VNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEY 284
VN+ RI +++Y CVPVIIA+ + LPF+DVLNW+SFS+ V DIP LK IL+ ++ E Y
Sbjct: 229 VNSPRIVEAIYNECVPVIIADSFILPFSDVLNWESFSLHVKESDIPNLKSILQNVTMETY 288
Query: 285 LLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQWS 329
+Q V +V++HF H P YD F+M+++ +WLRR ++RV +S
Sbjct: 289 TSMQERVSQVQRHFVLHQPPKRYDVFHMILHSVWLRRLNLRVGFS 333
>gi|79326862|ref|NP_001031828.1| putative glycosyltransferase [Arabidopsis thaliana]
gi|292630778|sp|Q9FFN2.2|GLYT3_ARATH RecName: Full=Probable glycosyltransferase At5g03795
gi|332003272|gb|AED90655.1| putative glycosyltransferase [Arabidopsis thaliana]
Length = 518
Score = 214 bits (544), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 130/340 (38%), Positives = 193/340 (56%), Gaps = 22/340 (6%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASE-SYFKKAFMKSHFVTKDPSKADLF 59
M + F++YVY F D + Y+ E S+ + + F T +P KA +F
Sbjct: 186 MEKQFKIYVYKEGEPPLFH------DGPCKSIYSMEGSFIYEIETDTRFRTNNPDKAHVF 239
Query: 60 FLPFSIARM------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
+LPFS+ +M R+ R I + + YI + KYPYWNR+ GADHF ++CH
Sbjct: 240 YLPFSVVKMVRYVYERNSRDFSP--IRNTVKDYINLVGDKYPYWNRSIGADHFILSCHDW 297
Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKL----GSSKRN 169
G A + N+I+ +C+++ KDVS+P+I R L S R
Sbjct: 298 GPEASFSHPHLGHNSIRALCNANTS-ERFKPRKDVSIPEINLRTGSLTGLVGGPSPSSRP 356
Query: 170 KLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLK-TPYADELLGSKFCLHVKGFEVNT 227
LAFFAG V+ PVR LLQ W N D++I H + T Y+D + SKFC+ G+EV +
Sbjct: 357 ILAFFAGGVHGPVRPVLLQHWENKDNDIRVHKYLPRGTSYSDMMRNSKFCICPSGYEVAS 416
Query: 228 ARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLL 287
RI ++LY GCVPV+I + Y PF+DVLNW+SFS++V+ DIP LK IL IS +YL +
Sbjct: 417 PRIVEALYSGCVPVLINSGYVPPFSDVLNWRSFSVIVSVEDIPNLKTILTSISPRQYLRM 476
Query: 288 QSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
VLKVR+HF+ + +D F+M+++ +W+RR +V+++
Sbjct: 477 YRRVLKVRRHFEVNSPAKRFDVFHMILHSIWVRRLNVKIR 516
>gi|297808029|ref|XP_002871898.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297317735|gb|EFH48157.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 610
Score = 214 bits (544), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 127/342 (37%), Positives = 191/342 (55%), Gaps = 36/342 (10%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFM-KSHFVTKDPSKADLF 59
M R +VYVY F +L +G YASE +F K + KDP KA L+
Sbjct: 277 MERILKVYVYKEGNRPIFHTPIL------KGLYASEGWFMKLMEGNKQYTVKDPRKAHLY 330
Query: 60 FLPFSIARM---------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVAC 110
++PFS ARM H+R + F+ Y +IS KYP++NRT GADHF VAC
Sbjct: 331 YMPFS-ARMLEYTLYVRNSHNR----TNLRQFLKEYTEHISSKYPFFNRTDGADHFLVAC 385
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLGS--- 165
H + E + I+ +C++ +G +D+SLP+ + R P LG
Sbjct: 386 HDWAPYETRHHME---HCIKALCNAD-VTAGFKIGRDISLPETYVRAAKNPLRDLGGKPP 441
Query: 166 SKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRL------KTPYADELLGSKFCLH 219
S+R LAF+AG+++ +R+ LLQ W++ GR+ K Y +++ SK+C+
Sbjct: 442 SQRRTLAFYAGSMHGYLRQILLQHWKDKDPDMKIFGRMPFGVASKMNYIEQMKSSKYCIC 501
Query: 220 VKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGI 279
KG+EVN+ R+ +S++Y CVPVII++++ PF +VL+W +FS++VA DIP LK IL I
Sbjct: 502 PKGYEVNSPRVVESIFYECVPVIISDNFVPPFFEVLDWSAFSVIVAEKDIPRLKDILSSI 561
Query: 280 SSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
E+Y+ +Q V K ++HF WH P YD F+MV++ +W R
Sbjct: 562 PEEKYVKMQMAVRKAQRHFLWHAKPEKYDLFHMVLHSIWYNR 603
>gi|334186614|ref|NP_001190743.1| Exostosin family protein [Arabidopsis thaliana]
gi|332658396|gb|AEE83796.1| Exostosin family protein [Arabidopsis thaliana]
Length = 589
Score = 214 bits (544), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 132/330 (40%), Positives = 191/330 (57%), Gaps = 29/330 (8%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEP--RGNYASESYFKKAFMKSH--FVTKDPSKA 56
M +VY+YP P+ EP G YASE +F K M+S+ FVTK+P +A
Sbjct: 196 MELILKVYIYPDGDK--------PIFHEPHLNGIYASEGWFMK-LMESNKQFVTKNPERA 246
Query: 57 DLFFLPFSIARMRHDRRI-GTEGIPD---FISHYIFNISQKYPYWNRTGGADHFYVACHS 112
LF++P+S+ +++ + G+ I F+ Y+ +S KYP+WNRT G+DHF VACH
Sbjct: 247 HLFYMPYSVKQLQKSIFVPGSHNIKPLSIFLRDYVNMLSIKYPFWNRTHGSDHFLVACHD 306
Query: 113 IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLGS----S 166
G + + E+K NAI+ +C++ + KDVSLP+ R P +G+ S
Sbjct: 307 WGPYTVNEHPELKRNAIKALCNADLSDGIFVPGKDVSLPETSIRNAGRPLRNIGNGNRVS 366
Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRN---DSEIYA---HSGRLKTPYADELLGSKFCLHV 220
+R LAFFAG ++ VR KLL+ WRN D +IY H+ K Y + SK+CL
Sbjct: 367 QRPILAFFAGNLHGRVRPKLLKHWRNKDEDMKIYGPLPHNVARKMTYVQHMKSSKYCLCP 426
Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGIS 280
G+EVN+ RI +++YY CVPV+IA+++ LPF+DVL+W +FS+VV +IP LK+IL I
Sbjct: 427 MGYEVNSPRIVEAIYYECVPVVIADNFMLPFSDVLDWSAFSVVVPEKEIPRLKEILLEIP 486
Query: 281 SEEYLLLQSYVLKVRKHFQWHVFPSDYDAF 310
YL +QS V V++HF W P F
Sbjct: 487 MRRYLKMQSNVKMVQRHFLWSPKPRKIKPF 516
>gi|449461995|ref|XP_004148727.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
sativus]
gi|449501299|ref|XP_004161331.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
sativus]
Length = 664
Score = 214 bits (544), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 127/341 (37%), Positives = 189/341 (55%), Gaps = 34/341 (9%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFM-KSHFVTKDPSKADLF 59
M R+ ++YVY + F +L +G YASE +F K FV KDP KA LF
Sbjct: 331 MERTLKIYVYRDGKKPIFHQPIL------KGLYASEGWFMKLMEGNKRFVVKDPRKAHLF 384
Query: 60 FLPFS--------IARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACH 111
++PFS R H+R + F+ Y NI+ KYPYWNRTGGADHF CH
Sbjct: 385 YMPFSSRMLEYTLYVRNSHNR----TNLRQFLKEYAENIAAKYPYWNRTGGADHFLAGCH 440
Query: 112 SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLG---SS 166
+ E + I+ +C++ + I +DVSLP+ + R P LG +S
Sbjct: 441 DWAPYETRHHME---HCIKALCNADVTVGFKIG-RDVSLPETYVRSARNPLRDLGGKPAS 496
Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRL------KTPYADELLGSKFCLHV 220
+R+ LAF+AG ++ VR LL+ W++ + G + K Y + SK+C+
Sbjct: 497 QRHILAFYAGNMHGYVRPILLKYWKDKNPDMKIFGPMPPGVASKMNYIQHMKSSKYCICP 556
Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGIS 280
KG+EVN+ R+ ++++Y CVPVII++++ PF +VL+W++FS++VA DIP L+ IL I
Sbjct: 557 KGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEVLDWEAFSVIVAEKDIPNLQDILLSIP 616
Query: 281 SEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
+ YL +Q V KV+KHF WH P YD F+M ++ +W R
Sbjct: 617 KDRYLEMQLRVRKVQKHFLWHAKPLKYDLFHMTLHSIWYNR 657
>gi|9758008|dbj|BAB08605.1| unnamed protein product [Arabidopsis thaliana]
Length = 408
Score = 213 bits (543), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 130/340 (38%), Positives = 193/340 (56%), Gaps = 22/340 (6%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASE-SYFKKAFMKSHFVTKDPSKADLF 59
M + F++YVY F D + Y+ E S+ + + F T +P KA +F
Sbjct: 76 MEKQFKIYVYKEGEPPLFH------DGPCKSIYSMEGSFIYEIETDTRFRTNNPDKAHVF 129
Query: 60 FLPFSIARM------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
+LPFS+ +M R+ R I + + YI + KYPYWNR+ GADHF ++CH
Sbjct: 130 YLPFSVVKMVRYVYERNSRDFSP--IRNTVKDYINLVGDKYPYWNRSIGADHFILSCHDW 187
Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKL----GSSKRN 169
G A + N+I+ +C+++ KDVS+P+I R L S R
Sbjct: 188 GPEASFSHPHLGHNSIRALCNANTS-ERFKPRKDVSIPEINLRTGSLTGLVGGPSPSSRP 246
Query: 170 KLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLK-TPYADELLGSKFCLHVKGFEVNT 227
LAFFAG V+ PVR LLQ W N D++I H + T Y+D + SKFC+ G+EV +
Sbjct: 247 ILAFFAGGVHGPVRPVLLQHWENKDNDIRVHKYLPRGTSYSDMMRNSKFCICPSGYEVAS 306
Query: 228 ARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLL 287
RI ++LY GCVPV+I + Y PF+DVLNW+SFS++V+ DIP LK IL IS +YL +
Sbjct: 307 PRIVEALYSGCVPVLINSGYVPPFSDVLNWRSFSVIVSVEDIPNLKTILTSISPRQYLRM 366
Query: 288 QSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
VLKVR+HF+ + +D F+M+++ +W+RR +V+++
Sbjct: 367 YRRVLKVRRHFEVNSPAKRFDVFHMILHSIWVRRLNVKIR 406
>gi|357440259|ref|XP_003590407.1| hypothetical protein MTR_1g061760 [Medicago truncatula]
gi|355479455|gb|AES60658.1| hypothetical protein MTR_1g061760 [Medicago truncatula]
Length = 508
Score = 213 bits (543), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 129/340 (37%), Positives = 190/340 (55%), Gaps = 21/340 (6%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M + F+V+VY F N + GN+ + F T+DP KA ++F
Sbjct: 174 MEKQFKVFVYEEGEPPVFHNGPCKSIYSMEGNF-----IHAIELNDQFRTRDPQKAHVYF 228
Query: 61 LPFSIARM------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIG 114
LPFS+ + R R G I ++ YI I+ KYPYWNR+ GADHF +ACH G
Sbjct: 229 LPFSVVMLVRFVYLRDSRDFGP--IRKTVTDYINVIAGKYPYWNRSLGADHFMLACHDWG 286
Query: 115 RSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDP----PKLGSSKRNK 170
+ N+I+V+C+++ + A KDVS P+I + L +SKR
Sbjct: 287 PETSFSVPYLHKNSIRVLCNANTSERFNPA-KDVSFPEINLQTGSINGFLGGLSASKRPI 345
Query: 171 LAFFAGAVNSPVREKLLQVWRN--DSEIYAHSGRLK-TPYADELLGSKFCLHVKGFEVNT 227
LAFFAG ++ +R LL+ W N D ++ K Y + L SKFCL G+EV +
Sbjct: 346 LAFFAGGLHGHIRAILLEHWENNKDQDMMIQKYLPKGVSYYEMLRKSKFCLCPSGYEVAS 405
Query: 228 ARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLL 287
RI +++Y GCVPV+I++HY PF+DVLNWKSFS+ ++ DIP LK IL IS +Y+ +
Sbjct: 406 PRIVEAIYTGCVPVLISDHYVPPFSDVLNWKSFSVEISVEDIPKLKDILMRISPTQYIRM 465
Query: 288 QSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
Q V+++R+HF+ H P +D F+M+++ +WLRR + RV
Sbjct: 466 QRRVVQIRRHFEVHSPPKRFDVFHMILHSVWLRRLNFRVH 505
>gi|297810461|ref|XP_002873114.1| hypothetical protein ARALYDRAFT_487158 [Arabidopsis lyrata subsp.
lyrata]
gi|297318951|gb|EFH49373.1| hypothetical protein ARALYDRAFT_487158 [Arabidopsis lyrata subsp.
lyrata]
Length = 519
Score = 213 bits (543), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 130/340 (38%), Positives = 192/340 (56%), Gaps = 22/340 (6%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASE-SYFKKAFMKSHFVTKDPSKADLF 59
M + F++YVY F D + Y+ E S+ + + F T +P KA F
Sbjct: 187 MEKQFKIYVYKEGEPPLFH------DGPCKSIYSMEGSFIYEMETDTRFRTNNPDKAHAF 240
Query: 60 FLPFSIARM------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
+LPFS+ +M R+ R I + + YI + KYPYWNR+ GADHF ++CH
Sbjct: 241 YLPFSVVKMVRYVYERNSRDFSP--IRNTVRDYINLVGDKYPYWNRSIGADHFILSCHDW 298
Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKL----GSSKRN 169
G A + N+I+ +C+++ KDVS+P+I R L S R
Sbjct: 299 GPEASFSHPHLGHNSIRALCNANTS-EKFKPRKDVSIPEINLRTGSLTGLVGGPSPSSRP 357
Query: 170 KLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLK-TPYADELLGSKFCLHVKGFEVNT 227
LAFFAG V+ PVR LL+ W N D++I H + T Y+D + SKFC+ G+EV +
Sbjct: 358 ILAFFAGGVHGPVRPVLLEHWENKDNDIRVHKYLPRGTSYSDMMRNSKFCICPSGYEVAS 417
Query: 228 ARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLL 287
RI ++LY GCVPV+I + Y PF+DVLNW+SFS++V+ DIP LK IL IS +YL +
Sbjct: 418 PRIVEALYSGCVPVLINSGYVPPFSDVLNWRSFSVIVSVEDIPNLKTILTAISPRQYLRM 477
Query: 288 QSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
VLKVR+HF+ + +D F+M+++ +W+RR +VR++
Sbjct: 478 YRRVLKVRRHFEVNSPAKRFDVFHMILHSIWVRRLNVRIR 517
>gi|359484345|ref|XP_002280595.2| PREDICTED: probable glycosyltransferase At5g03795-like [Vitis
vinifera]
gi|297738776|emb|CBI28021.3| unnamed protein product [Vitis vinifera]
Length = 665
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 129/340 (37%), Positives = 197/340 (57%), Gaps = 32/340 (9%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPR--GNYASESYFKKAFM-KSHFVTKDPSKAD 57
M R ++Y+Y P+ +PR G YASE +F K FV +DP KA
Sbjct: 332 MERVLKIYIYREGEK--------PIFHQPRLRGIYASEGWFMKLIEGNKRFVVRDPRKAH 383
Query: 58 LFFLPFSIARMR---HDRRIGT-EGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHS- 112
LF++PFS +R +++ T + + +Y+ I+ KY +WNRTGGADH VACH
Sbjct: 384 LFYVPFSSKMLRTVFYEQNSSTPRDLEKYFKNYVGLIAGKYRFWNRTGGADHLIVACHDW 443
Query: 113 IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQ-EDPPK-LGS---SK 167
R Q +W N+I+ +C+S+ SG KD +LP + R+ EDP K LG S+
Sbjct: 444 APRITRQCSW----NSIRALCNSN-IASGFKIGKDTTLPVTYIRKSEDPLKYLGGKPPSQ 498
Query: 168 RNKLAFFAGAVNSPVREKLLQVWRN---DSEIYAHSGR---LKTPYADELLGSKFCLHVK 221
R LAFFAG+++ +R LLQ W N D +I+ R K+ Y D + SK+C+ +
Sbjct: 499 RPILAFFAGSMHGYLRPILLQYWENKEQDIKIFGPMSRDDGGKSRYRDHMKSSKYCICAR 558
Query: 222 GFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISS 281
G+EV+T R+ ++++Y CVPVII+++Y PF ++LNW++F++ + D+P L+ IL I
Sbjct: 559 GYEVHTPRVVEAIFYECVPVIISDNYVPPFFEILNWEAFAVFILEKDVPNLRNILLSIPE 618
Query: 282 EEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
E+YL +Q V V++HF WH P YD F+M+++ +W R
Sbjct: 619 EKYLQMQMRVKMVQQHFLWHKKPVKYDLFHMILHSVWYNR 658
>gi|225429942|ref|XP_002281263.1| PREDICTED: probable glycosyltransferase At5g03795 [Vitis vinifera]
Length = 675
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 131/343 (38%), Positives = 189/343 (55%), Gaps = 38/343 (11%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEP--RGNYASESYFKKAFMKSH-FVTKDPSKAD 57
M + +VY Y PV +P +G YASE +F K + FVTK+ KA
Sbjct: 336 MENTLKVYTYREGER--------PVFHQPPIKGIYASEGWFMKLMQANKKFVTKNGRKAH 387
Query: 58 LFFLPFSIARMR--------HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVA 109
LF+LPFS + H R+ + ++ +Y+ I KYP+WNRTGGADHF VA
Sbjct: 388 LFYLPFSSLMLEEALYVPNSHSRK----NLEQYLKNYLDMIGAKYPFWNRTGGADHFLVA 443
Query: 110 CHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLGS-- 165
CH S K + N+I+ +C+S G KDVSLP+ R P +LG
Sbjct: 444 CHDWAPSETLK---LMANSIRALCNSD-IREGFKLGKDVSLPETCVRIPQNPLRQLGGKP 499
Query: 166 -SKRNKLAFFAGAVNSPVREKLLQVWRN---DSEIYAHSGRLKTP---YADELLGSKFCL 218
S+R LAFFAG+++ VR LL+ W N D +IY + K Y + SK+C+
Sbjct: 500 PSQRRILAFFAGSMHGYVRPILLKYWENKDPDMKIYGRMPKAKKGTMNYIQHMKSSKYCI 559
Query: 219 HVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKG 278
KG+EVN+ R+ ++++Y CVPVII++++ PF VLNW+SF++ + DIP LK IL
Sbjct: 560 CAKGYEVNSPRVVEAIFYECVPVIISDNFVPPFFGVLNWESFAVFILEKDIPNLKSILLS 619
Query: 279 ISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
I + YL +Q V +V++HF WH P YD F+M+++ +W R
Sbjct: 620 IPEKSYLEIQMRVKQVQQHFLWHAKPVKYDVFHMILHSVWYNR 662
>gi|224071129|ref|XP_002303362.1| predicted protein [Populus trichocarpa]
gi|222840794|gb|EEE78341.1| predicted protein [Populus trichocarpa]
Length = 386
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 130/345 (37%), Positives = 188/345 (54%), Gaps = 37/345 (10%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPR--GNYASESYFKKAFMKS--HFVTKDPSKA 56
M +VY+YP P+ +P G YASE +F K FM++ FV++DP KA
Sbjct: 46 METILKVYIYPDGDK--------PIFHQPHLYGIYASEGWFMK-FMEASREFVSRDPEKA 96
Query: 57 DLFFLPFSIARMR--------HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYV 108
LF+LP+S ++ H+ R + F+ Y I+ KYPYWNRT G DHF V
Sbjct: 97 HLFYLPYSARQLEVAVYVPNSHNLR----PLSIFMRDYANMIAAKYPYWNRTHGRDHFLV 152
Query: 109 ACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP------K 162
ACH G A+ E+ N ++ +C++ A +DVSLP+ R P
Sbjct: 153 ACHDWGPYALTMHEELTKNTMKALCNADVSEGIFTAGQDVSLPETTIRSPKRPLRNVGGG 212
Query: 163 LGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRL------KTPYADELLGSKF 216
+ S+R LAFFAG ++ VR LL+ W N + G L K Y + SK+
Sbjct: 213 IRVSQRPILAFFAGNLHGRVRPTLLKYWHNKDDDMKIYGPLPIGISRKMTYVQHMKSSKY 272
Query: 217 CLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKIL 276
C+ G+EVN+ RI +++YY CVPVIIA+++ LPF +VL+W +FS+VVA DIP LK+IL
Sbjct: 273 CICPMGYEVNSPRIVEAIYYECVPVIIADNFVLPFNEVLDWSAFSVVVAEKDIPKLKEIL 332
Query: 277 KGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
I YL + + + V+KHF W+ P YD F+M+++ +W R
Sbjct: 333 LAIPLRRYLTMLANLKTVQKHFLWNPRPLRYDLFHMILHSIWFSR 377
>gi|334187775|ref|NP_197468.2| Exostosin family protein [Arabidopsis thaliana]
gi|332005353|gb|AED92736.1| Exostosin family protein [Arabidopsis thaliana]
Length = 610
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 126/342 (36%), Positives = 191/342 (55%), Gaps = 36/342 (10%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFM-KSHFVTKDPSKADLF 59
M R +VYVY F +L +G YASE +F K + KDP KA L+
Sbjct: 277 MERILKVYVYKEGNRPIFHTPIL------KGLYASEGWFMKLMEGNKQYTVKDPRKAHLY 330
Query: 60 FLPFSIARM---------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVAC 110
++PFS ARM H+R + F+ Y +IS KYP++NRT GADHF VAC
Sbjct: 331 YMPFS-ARMLEYTLYVRNSHNR----TNLRQFLKEYTEHISSKYPFFNRTDGADHFLVAC 385
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLGS--- 165
H + E + I+ +C++ +G +D+SLP+ + R P LG
Sbjct: 386 HDWAPYETRHHME---HCIKALCNAD-VTAGFKIGRDISLPETYVRAAKNPLRDLGGKPP 441
Query: 166 SKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRL------KTPYADELLGSKFCLH 219
S+R LAF+AG+++ +R+ LLQ W++ GR+ K Y +++ SK+C+
Sbjct: 442 SQRRTLAFYAGSMHGYLRQILLQHWKDKDPDMKIFGRMPFGVASKMNYIEQMKSSKYCIC 501
Query: 220 VKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGI 279
KG+EVN+ R+ +S++Y CVPVII++++ PF +VL+W +FS++VA DIP LK IL I
Sbjct: 502 PKGYEVNSPRVVESIFYECVPVIISDNFVPPFFEVLDWSAFSVIVAEKDIPRLKDILLSI 561
Query: 280 SSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
++Y+ +Q V K ++HF WH P YD F+MV++ +W R
Sbjct: 562 PEDKYVKMQMAVRKAQRHFLWHAKPEKYDLFHMVLHSIWYNR 603
>gi|255579651|ref|XP_002530666.1| catalytic, putative [Ricinus communis]
gi|223529799|gb|EEF31735.1| catalytic, putative [Ricinus communis]
Length = 528
Score = 211 bits (537), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 129/339 (38%), Positives = 185/339 (54%), Gaps = 25/339 (7%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH--FVTKDPSKADL 58
M +VY+YP + F L G YASE +F K FM+ + FVT+DP KA L
Sbjct: 188 MELILKVYIYPDGKRPIFHVPHL------NGIYASEGWFMK-FMEENRQFVTRDPEKAHL 240
Query: 59 FFLPFSIARMRHDRRI----GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIG 114
F+LP+S +++ + + F+ Y I+ KYP+WNRT G DHF VACH G
Sbjct: 241 FYLPYSARQLQMALYVPNSHNLRPLSIFMRDYANMIATKYPFWNRTHGRDHFLVACHDWG 300
Query: 115 RSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP------KLGSSKR 168
+ E+ N I+ +C++ KDVSLP+ R P + S+R
Sbjct: 301 PYTLTMHEELTKNTIKALCNADASEGIFDPTKDVSLPETTIRIPRRPLKNVGGGIRVSQR 360
Query: 169 NKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRL------KTPYADELLGSKFCLHVKG 222
LAFFAG ++ VR LLQ W+N E G L K Y + S++C+ G
Sbjct: 361 PILAFFAGNMHGRVRPTLLQYWQNKDEDLKIYGPLPARISRKMNYVQHMKSSRYCICPMG 420
Query: 223 FEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSE 282
EVN+ RI +++YY CVPVIIA+++ LPF+DVL+W +FSIVVA DIP LK+IL I
Sbjct: 421 HEVNSPRIVEAIYYECVPVIIADNFVLPFSDVLDWSAFSIVVAEKDIPKLKEILLAIPLR 480
Query: 283 EYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
YL + + + +++HF W+ P YD F+M+++ +W R
Sbjct: 481 RYLTMLTNLKMLQRHFLWNPRPLRYDLFHMILHSIWFSR 519
>gi|359484343|ref|XP_002283936.2| PREDICTED: uncharacterized protein LOC100268163 [Vitis vinifera]
Length = 738
Score = 211 bits (536), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 129/341 (37%), Positives = 190/341 (55%), Gaps = 34/341 (9%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKS-HFVTKDPSKADLF 59
M R +VYVY F +L +G YASE +F K ++ HFV KDP +A LF
Sbjct: 405 MERILKVYVYKDGEKPIFHQPIL------KGLYASEGWFMKLMERNKHFVVKDPRQAQLF 458
Query: 60 FLPFS--------IARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACH 111
++PFS R H+R + ++ Y I+ KY +WNRTGGADHF VACH
Sbjct: 459 YMPFSSRMLEYKLYVRNSHNR----TNLRQYLKQYSEKIAAKYRFWNRTGGADHFLVACH 514
Query: 112 SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLGS---S 166
+ E I+ +C++ +G +DVSLP+ + R P LG S
Sbjct: 515 DWAPYETRHHME---QCIKALCNAD-VTAGFKIGRDVSLPETYVRSARNPLRDLGGKPPS 570
Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRN---DSEIYAH---SGRLKTPYADELLGSKFCLHV 220
+R+ LAF+AG ++ +R LL+ W++ D +IY K Y + SKFC+
Sbjct: 571 ERHILAFYAGNMHGYLRPILLKYWKDKDPDMKIYGPMPPGVASKMNYIQHMKSSKFCICP 630
Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGIS 280
KG+EVN+ R+ ++++Y CVPVII++++ PF DVL+W +FSI++A DIP LK +L I
Sbjct: 631 KGYEVNSPRVVEAIFYECVPVIISDNFVPPFFDVLDWGAFSIILAEKDIPNLKDVLLSIP 690
Query: 281 SEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
+++YL +Q V KV+KHF WH P YD F+M ++ +W R
Sbjct: 691 NDKYLQMQLGVRKVQKHFLWHAKPLKYDLFHMTLHSIWYNR 731
>gi|357130589|ref|XP_003566930.1| PREDICTED: probable glycosyltransferase At5g03795-like
[Brachypodium distachyon]
Length = 510
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 132/349 (37%), Positives = 187/349 (53%), Gaps = 38/349 (10%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH-FVTKDPSKADLF 59
M R +VY+Y R F L G YASE +F K +S FV D +KA LF
Sbjct: 164 MERILKVYIYQDGRRPIFHTPPL------SGIYASEGWFMKLLKESRRFVVADGAKAHLF 217
Query: 60 FLPFSIARMR--------HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACH 111
+LP+S +R H+ R + DF+ ++ KYP+WNR GADHF VACH
Sbjct: 218 YLPYSSQHLRLSLYVPDSHNLRPLAVYLRDFVQ----GLAAKYPFWNRNRGADHFLVACH 273
Query: 112 S-------IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--- 161
G +++ N+I+ +C++ KDVSLP+ R P
Sbjct: 274 DWRKFYYLQGPYTTTAHRDLRRNSIKALCNADSSERIFSPGKDVSLPETTIRTPKRPLRY 333
Query: 162 --KLGSSKRNKLAFFAGAVNSPVREKLLQVW----RNDSEIYA---HSGRLKTPYADELL 212
L S+R LAFFAG V+ VR LL+ W +D +Y + + Y +
Sbjct: 334 VGGLPVSRRRILAFFAGNVHGRVRPVLLKHWGDGRDDDMRVYGPLPNRVSRQMSYIQHMK 393
Query: 213 GSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLL 272
S+FCL G EVN+ RI ++LYY CVPV+IA+++ LPF+DVL+W +FS+VVA DIP L
Sbjct: 394 NSRFCLCPMGHEVNSPRIVEALYYECVPVVIADNFVLPFSDVLDWTAFSVVVAEKDIPDL 453
Query: 273 KKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
KKIL+GIS Y+ + V ++++HF WH P YD F+M+++ +WL R
Sbjct: 454 KKILQGISLRRYVAMHDCVKRLQRHFLWHARPLRYDLFHMILHSIWLSR 502
>gi|356518344|ref|XP_003527839.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 637
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 129/339 (38%), Positives = 187/339 (55%), Gaps = 31/339 (9%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEP--RGNYASESYFKKAFMKS-HFVTKDPSKAD 57
M ++ +VYVY R D P+ P G YASE +F + S FVTKDP KA
Sbjct: 305 MEKTLKVYVY--REGDK------PIMHSPYLLGIYASEGWFMRLMEASKQFVTKDPKKAH 356
Query: 58 LFFLPFSIARMRHD-----RRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHS 112
LF+LPFS +RM + + + ++ +Y+ I+ K+ +WNRTGGADHF VACH
Sbjct: 357 LFYLPFS-SRMLEETLYVPNSHSSRNLIQYLKNYVDMIAGKHRFWNRTGGADHFLVACHD 415
Query: 113 IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLGS---SK 167
+ ++ L A+ C++ G + KD+SLP+ + R P +G SK
Sbjct: 416 WAPTETRQHMARCLRAL---CNAD-VKEGFVLGKDISLPETYVRNAQKPTRNIGGNRVSK 471
Query: 168 RNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCLHVKG 222
R LAFFAG ++ VR LLQ W N G L Y + SK+C+ KG
Sbjct: 472 RKTLAFFAGGMHGYVRPILLQHWENKDPAMKIFGILPKSKGNRNYIQYMKSSKYCICAKG 531
Query: 223 FEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSE 282
+EVN+ R+ +++ Y CVPVI+++++ PF ++LNW+SF++ V DIP LK IL I +
Sbjct: 532 YEVNSPRVVEAILYECVPVILSDNFVPPFFEMLNWESFAVFVLEKDIPNLKNILLSIPQK 591
Query: 283 EYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
YL +Q V KV++HF WH P YD F+MV++ +W R
Sbjct: 592 RYLQMQMMVRKVQQHFLWHRSPVKYDIFHMVLHSIWYNR 630
>gi|115470965|ref|NP_001059081.1| Os07g0188700 [Oryza sativa Japonica Group]
gi|34394611|dbj|BAC83913.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
gi|50508944|dbj|BAD31848.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
gi|113610617|dbj|BAF20995.1| Os07g0188700 [Oryza sativa Japonica Group]
gi|125599387|gb|EAZ38963.1| hypothetical protein OsJ_23384 [Oryza sativa Japonica Group]
gi|215707162|dbj|BAG93622.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 606
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 135/349 (38%), Positives = 193/349 (55%), Gaps = 38/349 (10%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPV--DFEPRGNYASESYFKKAF-MKSHFVTKDPSKAD 57
M + F+V+VY +P PV D R Y++E F A M++ T+DP +A
Sbjct: 268 MEKVFKVFVY--EEGEP------PVFHDGPCRSIYSTEGRFIYAMEMENRMRTRDPDQAH 319
Query: 58 LFFLPFSIARM--------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVA 109
+FFLPFS+ +M HD + + IS YI +S KYP+WNR+ GADHF ++
Sbjct: 320 VFFLPFSVVKMVKMIYEPNSHD----MDPLRRTISDYINVVSTKYPHWNRSLGADHFMLS 375
Query: 110 CHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI------WPRQEDPPKL 163
CH G + N+I+V+C+++ G +DVSLP+I RQ P
Sbjct: 376 CHDWGPYVSSANGHLFSNSIRVLCNANTS-EGFDPSRDVSLPEINLRSDVVDRQVGGPS- 433
Query: 164 GSSKRNKLAFFAGAVNSPVREKLLQVWR--NDSEIYAHS---GRLKTPYADELLGSKFCL 218
+S R LAFFAG + PVR LLQ W D++I R Y D + S+FCL
Sbjct: 434 -ASHRPILAFFAGGDHGPVRPLLLQHWGKGQDADIQVSEYLPRRHGMSYTDMMRRSRFCL 492
Query: 219 HVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKG 278
G+EV + R+ +++Y CVPV+I + Y LPFADVLNW +FS+ VA DIP LK+IL
Sbjct: 493 CPSGYEVASPRVVEAIYLECVPVVIGDDYTLPFADVLNWAAFSVRVAVGDIPRLKEILAA 552
Query: 279 ISSEEYLLLQSYVLKVRKHFQW-HVFPSDYDAFYMVMYELWLRRSSVRV 326
+S +Y+ +Q V VR+HF P +D F+M+++ +WLRR +VRV
Sbjct: 553 VSPRQYIRMQRRVRAVRRHFMVSDGAPRRFDVFHMILHSIWLRRLNVRV 601
>gi|224142972|ref|XP_002324801.1| predicted protein [Populus trichocarpa]
gi|222866235|gb|EEF03366.1| predicted protein [Populus trichocarpa]
Length = 398
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 122/309 (39%), Positives = 180/309 (58%), Gaps = 21/309 (6%)
Query: 30 RGNYASESYFKKAF-MKSHFVTKDPSKADLFFLPFSIARMRHDRRIGT----EGIPDFIS 84
+G YASE +F K FVTKDP K+ LF+LPFS + + + + + ++
Sbjct: 87 KGIYASEGWFMKLLETNKKFVTKDPKKSHLFYLPFSSRNLEVNLYVPNSHSHKNLIQYLK 146
Query: 85 HYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIA 144
+Y+ IS KYP+WNRT GADHF VACH + ++ N I+ +C+S G +
Sbjct: 147 NYLDMISAKYPFWNRTRGADHFLVACHDWAPTETRQHMA---NCIRALCNSDA-KGGFVF 202
Query: 145 HKDVSLPQIWPR--QEDPPKLG---SSKRNKLAFFAGAVNSPVREKLLQVWRN---DSEI 196
KD +LP+ R Q LG +SKR+ LAFFAG+++ +R LLQ W N D ++
Sbjct: 203 GKDAALPETTVRTPQNLLRDLGGKPASKRSILAFFAGSMHGYLRPILLQHWGNKDPDVKV 262
Query: 197 YAH----SGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFA 252
+ GR K Y + SK+C+ KGFEVN+ R+ ++++Y CVPVII++++ PF
Sbjct: 263 FGKLPKVKGRGKMNYPQYMKSSKYCICAKGFEVNSPRVVEAIFYECVPVIISDNFVPPFF 322
Query: 253 DVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYM 312
+VLNW+SF++ V DIP LK IL I +Y +Q V KV++HF WH P YD F+M
Sbjct: 323 EVLNWESFAVFVLEKDIPNLKNILLSIPENKYREMQMRVKKVQQHFLWHARPVKYDIFHM 382
Query: 313 VMYELWLRR 321
+++ +W R
Sbjct: 383 ILHSVWYNR 391
>gi|255546377|ref|XP_002514248.1| catalytic, putative [Ricinus communis]
gi|223546704|gb|EEF48202.1| catalytic, putative [Ricinus communis]
Length = 676
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 125/341 (36%), Positives = 190/341 (55%), Gaps = 34/341 (9%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFM-KSHFVTKDPSKADLF 59
M R+ +VY+Y + F LP+ +G YASE +F K HF+ KDP +A LF
Sbjct: 343 MERTLKVYIYKDGKKPIFH---LPI---MKGLYASEGWFMKLMQGNKHFLVKDPRRAHLF 396
Query: 60 FLPFS--------IARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACH 111
++PFS R H+R + ++ Y I+ KYP+WNRT GADHF VACH
Sbjct: 397 YMPFSSRMLEYTLYVRNSHNR----TNLRQYLKDYSEKIAAKYPFWNRTDGADHFLVACH 452
Query: 112 SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLGS---S 166
+ E + I+ +C++ +G +D+SLP+ + R P LG S
Sbjct: 453 DWAPYETRHHME---HCIKALCNAD-VTAGFKIGRDISLPETYVRSARNPLRDLGGKPPS 508
Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRL------KTPYADELLGSKFCLHV 220
+R+ LAF+AG+++ +R LL+ W++ G + K Y + SK+C+
Sbjct: 509 QRHILAFYAGSMHGYLRPILLKYWKDKDPSMKIFGPMPPGVASKMNYIQHMKSSKYCICP 568
Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGIS 280
KG+EVN+ R+ ++++Y CVPVII++++ PF +V NW +FS+++A DIP LK+IL I
Sbjct: 569 KGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEVFNWGAFSLILAEKDIPNLKEILLSIP 628
Query: 281 SEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
E+YL +Q V KV+KHF WH P YD FYM ++ +W R
Sbjct: 629 EEKYLEMQLGVRKVQKHFLWHPSPMKYDLFYMTLHAIWYNR 669
>gi|302761638|ref|XP_002964241.1| xylosyltransferase-like protein [Selaginella moellendorffii]
gi|300167970|gb|EFJ34574.1| xylosyltransferase-like protein [Selaginella moellendorffii]
Length = 332
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 130/341 (38%), Positives = 188/341 (55%), Gaps = 36/341 (10%)
Query: 1 MNRSFRVYVYPHRR-----NDPFANVLLPVDFEPRGNYASESYFKKAFM-KSHFVTKDPS 54
M + +V+ YP N P + YA E F + K+ ++T DP
Sbjct: 1 MEKRLKVFAYPEGEEPLVHNGPCKEI-----------YAIEGRFIQELQGKNSYLTSDPE 49
Query: 55 KADLFFLPFSIARM--------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHF 106
KA LFFLPFS+A M HD +G G F YI IS +Y WNR+ GADHF
Sbjct: 50 KAHLFFLPFSVAMMVTYLYTPGSHD--MGPLG--RFTRDYIDVISHRYSAWNRSRGADHF 105
Query: 107 YVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WPRQEDPPKLGS 165
V+CH G + ++ N+I+V+C+++ G++ KD SLP+I + P LG
Sbjct: 106 MVSCHDWGPHISRAHPDLMANSIRVLCNANT-SEGYVPSKDASLPEIHLVGGQVPSVLGG 164
Query: 166 ---SKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRL--KTPYADELLGSKFCLHV 220
+R LAFFAG + PVR L + W+ E +L + Y D + SK+CL
Sbjct: 165 PPPEERRYLAFFAGGDHGPVRPVLFKYWKEKDEDVRVFEKLPSRDAYLDYMSHSKYCLCP 224
Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGIS 280
G+EVN+ RI +++Y CVPV+IA+ + LPF+DVL+W +FS+ V DIP LK IL+ I
Sbjct: 225 GGYEVNSPRIVEAIYNDCVPVVIADDFVLPFSDVLDWDAFSVKVLERDIPRLKTILQAIP 284
Query: 281 SEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
+ YL +Q+ V KVR+HF+++ P YD F M+++ +WLRR
Sbjct: 285 TARYLEMQARVSKVRRHFRFNQPPERYDVFNMILHSVWLRR 325
>gi|224092292|ref|XP_002309546.1| predicted protein [Populus trichocarpa]
gi|222855522|gb|EEE93069.1| predicted protein [Populus trichocarpa]
Length = 417
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 129/340 (37%), Positives = 185/340 (54%), Gaps = 33/340 (9%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAF-MKSHFVTKDPSKADLF 59
M + +VYVY F +L G YASE +F K +FVTKDP KA LF
Sbjct: 85 MEKMLKVYVYQDGEKPIFHQPIL------DGIYASEGWFMKHMEANENFVTKDPGKAHLF 138
Query: 60 FLPFS--------IARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACH 111
+LPFS R H R + +++ +Y I+ KY +WNRTGGADHF ACH
Sbjct: 139 YLPFSSRLLELTLYVRHSHSR----TNLIEYMRNYAGMIAAKYHFWNRTGGADHFVAACH 194
Query: 112 SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLG-----SS 166
+ + LN I+ +C++ + I KDVSLP+ + R P S
Sbjct: 195 DWAPAETRGPL---LNCIRALCNADIEVGFSIG-KDVSLPETYVRSAQNPLKNLEGNPPS 250
Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRN---DSEIYAHSGRLK--TPYADELLGSKFCLHVK 221
+R LAFFAG ++ VR LL W N D +I+ +K T Y + SKFC+ +
Sbjct: 251 QRPILAFFAGNMHGYVRPVLLDYWGNKDPDMKIFGPMPHVKGNTNYIQHMKSSKFCICPR 310
Query: 222 GFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISS 281
G EVN+ RI ++++ CVPVII++++ PF +VL+W+SF+++V DIP LK IL IS
Sbjct: 311 GHEVNSPRIVEAIFLECVPVIISDNFVPPFFEVLDWESFAVIVLEKDIPNLKNILVSISE 370
Query: 282 EEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
E+Y+ + V KV++HF WH P YD F+M+++ +W R
Sbjct: 371 EKYIEMHKRVKKVQQHFLWHSKPEKYDLFHMILHSVWYNR 410
>gi|255546379|ref|XP_002514249.1| catalytic, putative [Ricinus communis]
gi|223546705|gb|EEF48203.1| catalytic, putative [Ricinus communis]
Length = 334
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 128/341 (37%), Positives = 188/341 (55%), Gaps = 34/341 (9%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFM-KSHFVTKDPSKADLF 59
M R +VYVY F + RG YASE +F K F+ KDP KA LF
Sbjct: 1 MERLLKVYVYKEGEKPIFHQSKM------RGIYASEGWFMKHIEGNKKFLVKDPRKAHLF 54
Query: 60 FLPFSIARMRH----DRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHS--- 112
FLPFS +R + + + +++ +Y+ +++KY +WNRTGG DHF V CH
Sbjct: 55 FLPFSPQMLRTVIFGQKLQSQKDLEEYLKNYVDLVARKYSFWNRTGGTDHFLVGCHDWAS 114
Query: 113 -IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLGS---S 166
I R MQ N I+V+C+++ G KD +LP + R + P +G S
Sbjct: 115 RITRKYMQ-------NCIRVLCNAN-VAKGFKIGKDTTLPVTYIRSAENPLKDVGGKHPS 166
Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRN---DSEIYAHSGR---LKTPYADELLGSKFCLHV 220
+R LAFFAG ++ +R L+Q W N D +I+ R K Y + + SK+C+
Sbjct: 167 ERYTLAFFAGGMHGYLRPILVQFWENKESDMKIFGPMPRDIEGKRLYREYMKSSKYCICA 226
Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGIS 280
+G+EV+T RI +++ Y CVPVII+++Y PF +VLNW++FS+ V DIP L+ IL I
Sbjct: 227 RGYEVHTPRIVEAILYECVPVIISDNYVPPFFEVLNWEAFSVFVQEKDIPNLRSILLSIP 286
Query: 281 SEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
E+YL +Q V V++HF WH P YD F+M+++ +W R
Sbjct: 287 EEKYLEMQLRVKMVQQHFLWHKNPVKYDLFHMILHSVWHNR 327
>gi|357520911|ref|XP_003630744.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
gi|355524766|gb|AET05220.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
Length = 653
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 128/341 (37%), Positives = 189/341 (55%), Gaps = 34/341 (9%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKS-HFVTKDPSKADLF 59
M + +VY+Y F +L +G YASE +F K ++ FV KDP+KA LF
Sbjct: 318 MELTLKVYIYMEGNKPIFHQPIL------KGLYASEGWFMKLMEENKQFVVKDPAKAHLF 371
Query: 60 FLPFS--------IARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACH 111
++PFS R H+R + ++ Y IS KY Y+NRTGGADHF VACH
Sbjct: 372 YMPFSSRMLEFSVYVRNSHNR----TNLRQYLKEYTDKISAKYRYFNRTGGADHFLVACH 427
Query: 112 SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPK--LGS---S 166
+ E I+ +C+S G +DVSLP+ R P+ LG
Sbjct: 428 DWAPYETRHHMEY---CIKALCNSD-VTQGFKIGRDVSLPETMVRSVRNPQRDLGGKPPQ 483
Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRN---DSEIYA---HSGRLKTPYADELLGSKFCLHV 220
+R+ LAF+AG ++ +R LL+ W+ D +I+ H K Y + + SK+C+
Sbjct: 484 QRSILAFYAGNMHGYLRSILLKHWKEKDPDMKIFGPMPHGVAHKMNYIEHMKSSKYCICP 543
Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGIS 280
KG+EVN+ R+ ++++Y CVPVII++++ PF +VLNW SFS+++A DIP LK+IL +
Sbjct: 544 KGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEVLNWDSFSLILAEKDIPNLKQILLSVP 603
Query: 281 SEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
E+YL LQ V +V+KHF WH P YD F+M ++ +W R
Sbjct: 604 EEKYLKLQLGVRRVQKHFLWHTKPLKYDLFHMTLHSIWYNR 644
>gi|297802718|ref|XP_002869243.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297315079|gb|EFH45502.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 583
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 129/341 (37%), Positives = 189/341 (55%), Gaps = 34/341 (9%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEP--RGNYASESYFKKAFMKSH-FVTKDPSKAD 57
M + +VYVY + PV +P +G YASE +F K S FVTKDP KA
Sbjct: 253 MEKKLKVYVYREGKR--------PVLHKPVLKGIYASEGWFMKQLKSSRTFVTKDPRKAH 304
Query: 58 LFFLPFSIARMRHDRRI----GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
LF+LPFS + + + + F+ +Y+ IS KY +WN+TGG+DHF VACH
Sbjct: 305 LFYLPFSSKMLEETLYVPGSHSDQNLIQFLKNYLDMISSKYNFWNKTGGSDHFLVACHDW 364
Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQ---IWPRQEDPPKLGS--SKR 168
S ++ I+ +C+S G + KDV+LP+ + PR+ G S+R
Sbjct: 365 APSETRQYMA---KCIRALCNSD-VSEGFVFGKDVALPETTILVPRRPLRALGGKPVSQR 420
Query: 169 NKLAFFAGAVNSPVREKLLQVWRND--------SEIYAHSGRLKTPYADELLGSKFCLHV 220
LAFFAG ++ +R LL+ W + SEI G K Y + + SKFC+
Sbjct: 421 QILAFFAGGMHGYLRPLLLRNWGGNRDPDMKIFSEIPKSKG--KKSYMEYMKSSKFCICP 478
Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGIS 280
KG EVN+ R+ ++L+Y CVPVII++++ PF +VLNW++F++ V DIP LK IL I+
Sbjct: 479 KGHEVNSPRVVEALFYECVPVIISDNFVPPFFEVLNWEAFAVFVLEKDIPDLKNILVSIT 538
Query: 281 SEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
E Y +Q+ V V+KHF WH P +D F+M+++ +W R
Sbjct: 539 EERYREMQTRVKMVQKHFLWHSKPERFDIFHMILHSIWYNR 579
>gi|449442160|ref|XP_004138850.1| PREDICTED: probable glycosyltransferase At5g25310-like [Cucumis
sativus]
gi|449530249|ref|XP_004172108.1| PREDICTED: probable glycosyltransferase At5g25310-like [Cucumis
sativus]
Length = 465
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 133/343 (38%), Positives = 191/343 (55%), Gaps = 26/343 (7%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFE-PRGN-YASESYFKKAFMK--SHFVTKDPSKA 56
M + F+VYVYP LP+ P N Y E F + F T DPS+A
Sbjct: 131 MEKRFKVYVYPEGE--------LPITHAGPCKNIYTIEGRFIHEMEDGGNGFRTVDPSRA 182
Query: 57 DLFFLPFSIARM----RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHS 112
+ F+PFS+A M D + F+S Y+ +S+KYP+WN+T GADHF +ACH
Sbjct: 183 HVLFMPFSVAWMVKYLYKDGSYDQTPLRMFVSDYVEVVSKKYPFWNKTNGADHFILACHD 242
Query: 113 IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQ-EDPPKLGSSK---- 167
G A + + +I+V+C+++ G KDVSLP+I E PKL S+
Sbjct: 243 WGPIATEGNRFLYNTSIRVLCNANS-SEGFNPQKDVSLPEIHLYDGEISPKLLSASNSHH 301
Query: 168 -RNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRL---KTPYADELLGSKFCLHVKGF 223
R LAFFAG ++ P+R LL W+N + H + Y DE+L S+FCL G+
Sbjct: 302 HRPHLAFFAGGLHGPIRPILLNHWKNRTHTNIHVYEYLPKQLDYYDEMLQSRFCLCPSGY 361
Query: 224 EVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEE 283
EV + RI +++Y CVPVII+ Y LPF+DVL W+ FSI V+ +IP L++IL G+S E
Sbjct: 362 EVASPRIVEAIYAECVPVIISERYVLPFSDVLRWEGFSIEVSVSEIPRLEEILMGVSEER 421
Query: 284 YLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
Y L + VRKHF + +DAF+M+++ +WLRR +V++
Sbjct: 422 YEKLIQGLRTVRKHFVLNRPAKRFDAFHMILHSVWLRRLNVKL 464
>gi|255550904|ref|XP_002516500.1| catalytic, putative [Ricinus communis]
gi|223544320|gb|EEF45841.1| catalytic, putative [Ricinus communis]
Length = 456
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 127/340 (37%), Positives = 186/340 (54%), Gaps = 33/340 (9%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH-FVTKDPSKADLF 59
M +V++Y F +L G YASE +F K + FVTKDP +A LF
Sbjct: 124 MENMLKVFIYQEGEKPIFHQSIL------EGIYASEGWFIKLMEANEKFVTKDPKEAHLF 177
Query: 60 FLPFS--------IARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACH 111
++PFS R H R + +++ +Y I+ KYP+W+RTGGADHF ACH
Sbjct: 178 YIPFSSRLLELTLYVRKSHSRN----NLIEYMKNYTDMIAAKYPFWSRTGGADHFVAACH 233
Query: 112 SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLG-----SS 166
+ + LN I+ +C++ + I KDVSLP+ + R P S
Sbjct: 234 DWAPAETRGRM---LNCIRALCNADIDVGFRIG-KDVSLPETYVRSAQNPLKNLDGNPPS 289
Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRN---DSEIYAHSGRLK--TPYADELLGSKFCLHVK 221
+R LAFFAG V+ VR LL+ W N + +I+ R+K T Y + SK+C+ +
Sbjct: 290 QRPILAFFAGNVHGFVRPILLEYWENKDPEMKIFGPMPRVKGNTNYIQLMKSSKYCICPR 349
Query: 222 GFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISS 281
G EVN+ RI +S++Y CVPVII+++Y PF +VL+W+SF++ V DIP LKKIL I
Sbjct: 350 GHEVNSPRIVESIFYECVPVIISDNYVPPFFEVLDWESFAVFVLEKDIPNLKKILLSIPE 409
Query: 282 EEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
E Y+ + V KV++HF WH P +D F+M+++ +W R
Sbjct: 410 ETYVEMHKRVKKVQQHFLWHSEPEKHDLFHMILHSVWYNR 449
>gi|147799433|emb|CAN76867.1| hypothetical protein VITISV_012309 [Vitis vinifera]
Length = 1908
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 128/341 (37%), Positives = 188/341 (55%), Gaps = 34/341 (9%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH-FVTKDPSKADLF 59
M R +VYVY F +L +G YASE +F K ++ FV KDP +A LF
Sbjct: 1061 MERILKVYVYKDGEKPIFHQPIL------KGLYASEGWFMKLMERNKXFVVKDPRQAQLF 1114
Query: 60 FLPFS--------IARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACH 111
++PFS R H+R + ++ Y I+ KY +WNRTGG DHF VACH
Sbjct: 1115 YMPFSSRMLEYKLYVRNSHNR----TNLRQYLKQYSEKIAAKYRFWNRTGGXDHFLVACH 1170
Query: 112 SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLGS---S 166
+ E I+ +C++ +G +DVSLP+ + R P LG S
Sbjct: 1171 DWAPYETRHHME---QCIKALCNAD-VTAGFKIGRDVSLPETYVRSARNPLRDLGGKPPS 1226
Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRN---DSEIYAH---SGRLKTPYADELLGSKFCLHV 220
+R+ LAF+AG ++ +R LL+ W++ D +IY K Y + SKFC+
Sbjct: 1227 ERHILAFYAGNMHGYLRPILLKYWKDKDPDMKIYGPMPPGVASKMNYIQHMKSSKFCICP 1286
Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGIS 280
KG+EVN+ R+ ++++Y CVPVII++++ PF DVL+W +FSI++A DIP LK +L I
Sbjct: 1287 KGYEVNSPRVVEAIFYECVPVIISDNFVPPFFDVLDWGAFSIILAEKDIPNLKDVLLSIP 1346
Query: 281 SEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
+E+YL +Q V KV+KHF WH P YD F+M ++ +W R
Sbjct: 1347 NEKYLQMQLGVRKVQKHFLWHAKPLKYDLFHMTLHSIWYNR 1387
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 129/384 (33%), Positives = 199/384 (51%), Gaps = 72/384 (18%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPR--GNYASESYFKKAFMKS-HFVTKDPSKAD 57
M R ++Y+Y P+ +PR G YASE +F K + FV +DP KA
Sbjct: 332 MERVLKIYIYREGEK--------PIFHQPRLRGIYASEGWFMKLIEGNKRFVVRDPRKAH 383
Query: 58 LFFLPFSIARMR---HDRRIGT-EGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHS- 112
LF++PFS +R +++ T + + +Y+ I+ KY +WNRTGGADH VACH
Sbjct: 384 LFYVPFSSKMLRTVFYEQNSSTPRDLEKYFKNYVGLIAGKYRFWNRTGGADHLIVACHDW 443
Query: 113 --IGRSAMQKAWEVKL------------------------------------------NA 128
I R+ +K N+
Sbjct: 444 NPIYRTISTNTIRIKSQAITMPPFIFVGGESTYDLVSGTFSNKGFNSQAPRITRQCSWNS 503
Query: 129 IQVVCSSSYFISGHIAHKDVSLPQIWPRQ-EDPPK-LGS---SKRNKLAFFAGAVNSPVR 183
I+ +C+S+ SG KD +LP + R+ EDP K LG S+R LAFFAG+++ +R
Sbjct: 504 IRALCNSN-IASGFKIGKDTTLPVTYIRKSEDPLKYLGGKPPSQRPILAFFAGSMHGYLR 562
Query: 184 EKLLQVWRN---DSEIYAHSGR---LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYG 237
LLQ W N D +I+ R K+ Y D + SK+C+ +G+EV+T R+ ++++Y
Sbjct: 563 PILLQYWENKEQDIKIFGPMSRDDGGKSRYRDHMKSSKYCICARGYEVHTPRVVEAIFYE 622
Query: 238 CVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKH 297
CVPVII+++Y PF ++LNW++F++ + D+P L+ IL I E+YL +Q V V++H
Sbjct: 623 CVPVIISDNYVPPFFEILNWEAFAVFILEKDVPNLRNILLSIPEEKYLQMQMRVKMVQQH 682
Query: 298 FQWHVFPSDYDAFYMVMYELWLRR 321
F WH P YD F+M+++ +W R
Sbjct: 683 FLWHKKPVKYDLFHMILHSVWYNR 706
>gi|15233924|ref|NP_195005.1| Exostosin family protein [Arabidopsis thaliana]
gi|3063691|emb|CAA18582.1| putative protein [Arabidopsis thaliana]
gi|7270226|emb|CAB79996.1| putative protein [Arabidopsis thaliana]
gi|332660719|gb|AEE86119.1| Exostosin family protein [Arabidopsis thaliana]
Length = 593
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 130/341 (38%), Positives = 188/341 (55%), Gaps = 34/341 (9%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEP--RGNYASESYFKKAFMKSH-FVTKDPSKAD 57
M + +VYVY + PV +P +G YASE +F K S FVTKDP KA
Sbjct: 263 MEKKLKVYVYREGKR--------PVLHKPVLKGIYASEGWFMKQLKSSRTFVTKDPRKAH 314
Query: 58 LFFLPFSIARMRHDRRI----GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
LF+LPFS + + + + F+ +Y+ IS KY +WN+TGG+DHF VACH
Sbjct: 315 LFYLPFSSKMLEETLYVPGSHSDKNLIQFLKNYLDMISSKYSFWNKTGGSDHFLVACHDW 374
Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQ---IWPRQEDPPKLGS--SKR 168
S ++ I+ +C+S G + KDV+LP+ + PR+ G S+R
Sbjct: 375 APSETRQYMA---KCIRALCNSD-VSEGFVFGKDVALPETTILVPRRPLRALGGKPVSQR 430
Query: 169 NKLAFFAGAVNSPVREKLLQVWRND--------SEIYAHSGRLKTPYADELLGSKFCLHV 220
LAFFAG ++ +R LLQ W + SEI G K Y + + SK+C+
Sbjct: 431 QILAFFAGGMHGYLRPLLLQNWGGNRDPDMKIFSEIPKSKG--KKSYMEYMKSSKYCICP 488
Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGIS 280
KG EVN+ R+ ++L+Y CVPVII++++ PF +VLNW+SF++ V DIP LK IL I+
Sbjct: 489 KGHEVNSPRVVEALFYECVPVIISDNFVPPFFEVLNWESFAVFVLEKDIPDLKNILVSIT 548
Query: 281 SEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
E Y +Q V V+KHF WH P +D F+M+++ +W R
Sbjct: 549 EERYREMQMRVKMVQKHFLWHSKPERFDIFHMILHSIWYNR 589
>gi|302815783|ref|XP_002989572.1| xylosyltransferase-like protein [Selaginella moellendorffii]
gi|300142750|gb|EFJ09448.1| xylosyltransferase-like protein [Selaginella moellendorffii]
Length = 332
Score = 207 bits (526), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 129/341 (37%), Positives = 187/341 (54%), Gaps = 36/341 (10%)
Query: 1 MNRSFRVYVYPHRR-----NDPFANVLLPVDFEPRGNYASESYFKKAFM-KSHFVTKDPS 54
M + +V+ YP N P + YA E F + K+ ++T DP
Sbjct: 1 MEKRLKVFAYPEGEEPLVHNGPCKEI-----------YAIEGRFIQELQGKNSYLTSDPE 49
Query: 55 KADLFFLPFSIARM--------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHF 106
KA LFFLPFS+A M HD +G G F YI IS +Y WNR+ GADHF
Sbjct: 50 KAHLFFLPFSVAMMVTYLYTPGSHD--MGPLG--RFTRDYIDVISHRYSSWNRSRGADHF 105
Query: 107 YVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WPRQEDPPKLGS 165
V+CH G + ++ N+I+V+C+++ G++ KD SLP+I + P LG
Sbjct: 106 MVSCHDWGPHISRAHPDLMANSIRVLCNANT-SEGYVPSKDASLPEIHLVGGQVPSVLGG 164
Query: 166 ---SKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRL--KTPYADELLGSKFCLHV 220
+R LAFFAG + PVR L + W+ E +L + Y D + SK+CL
Sbjct: 165 PPPEERRYLAFFAGGDHGPVRPVLFKYWKEKDEDVRVFEKLPSRDAYLDYMSHSKYCLCP 224
Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGIS 280
G+EVN+ RI +++Y CVPV+IA+ + LPF+DVL+W +FS+ V DIP LK IL+ I
Sbjct: 225 GGYEVNSPRIVEAIYNDCVPVVIADDFVLPFSDVLDWDAFSVKVLERDIPRLKTILQAIP 284
Query: 281 SEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
+ YL +Q+ KVR+HF+++ P YD F M+++ +WLRR
Sbjct: 285 TARYLEMQARASKVRRHFRFNQPPERYDVFNMILHSVWLRR 325
>gi|224142181|ref|XP_002324437.1| predicted protein [Populus trichocarpa]
gi|222865871|gb|EEF03002.1| predicted protein [Populus trichocarpa]
Length = 334
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 126/337 (37%), Positives = 187/337 (55%), Gaps = 26/337 (7%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFM-KSHFVTKDPSKADLF 59
M R +VYVY F + RG YASE +F K FV +DP KA LF
Sbjct: 1 MERMLKVYVYKEGEKPIFHQSKM------RGIYASEGWFMKLIEGNKKFVVRDPRKAHLF 54
Query: 60 FLPFSIARMR----HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
+LPFS +R + + + +F+ +Y+ +++KY +WNRTGG DHF V CH
Sbjct: 55 YLPFSPHMLRTALFDHNSLNQKELAEFLKNYVDLVAKKYSFWNRTGGTDHFLVGCHDW-- 112
Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLGS---SKRNK 170
A Q N I+V+C+S+ G KD +LP + R + P +LG S+R
Sbjct: 113 -ASQMTRHHMRNCIRVLCNSN-VAKGFKIGKDTTLPVTYIRSVENPLKELGGKSPSERPI 170
Query: 171 LAFFAGAVNSPVREKLLQVWRN---DSEIYAHSGR---LKTPYADELLGSKFCLHVKGFE 224
LAFFAG ++ +R LL+ W N D +I R K Y + + SK+C+ +G+E
Sbjct: 171 LAFFAGNMHGYLRPILLEYWENKEPDMKILGPMSRDIAGKRRYREYMKRSKYCICARGYE 230
Query: 225 VNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEY 284
V+T R+ +S++Y CVPVII+++Y P +VLNW++FS+ + DIP L+ IL I E+Y
Sbjct: 231 VHTPRVVESIFYECVPVIISDNYVPPLFEVLNWEAFSVFIQEKDIPNLRNILLSIPQEKY 290
Query: 285 LLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
+ +Q V KV++HF WH P YD F+M+++ +W R
Sbjct: 291 VAMQLGVKKVQQHFLWHKKPVKYDLFHMILHSVWHSR 327
>gi|356510025|ref|XP_003523741.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 619
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 121/336 (36%), Positives = 183/336 (54%), Gaps = 25/336 (7%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAF-MKSHFVTKDPSKADLF 59
M +VY+Y F LL G YASE +F K FVT+DP KA LF
Sbjct: 287 MENMLKVYIYQDGDRPIFHEPLL------DGIYASEGWFMKLMEANKQFVTRDPGKAHLF 340
Query: 60 FLPFSIARMRHDRRIGTE----GIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
++PFS ++ + + +++ +Y+ I+ KYP+WNRT GADHF VACH
Sbjct: 341 YIPFSSRLLQQTLYVRNSHRRSNLIEYMKNYVDMIAGKYPFWNRTSGADHFVVACHDWAP 400
Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLGS---SKRNK 170
+ + L+ I+ +C++ + I KDVSLP+ + R + P +G SKR
Sbjct: 401 AETRGRM---LSCIRALCNADIEVGFKIG-KDVSLPETYIRSSENPVKNIGGDPPSKRPI 456
Query: 171 LAFFAGAVNSPVREKLLQVWRNDSEIYAHSG-----RLKTPYADELLGSKFCLHVKGFEV 225
LAFFAG ++ VR LL+ W N SG R Y + SKFC+ +G EV
Sbjct: 457 LAFFAGGLHGYVRPILLKHWENKEPDMKISGPLPHVRGNVNYIQLMKSSKFCICARGHEV 516
Query: 226 NTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL 285
N+ R+ +++++ C+PVII++++ PF ++LNW+SF++ V +IP L+ IL IS E YL
Sbjct: 517 NSPRVVEAIFHECIPVIISDNFIPPFFEILNWESFAVFVKEEEIPNLRNILLSISEERYL 576
Query: 286 LLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
+ KV++HF WH P YD F+M+++ +W R
Sbjct: 577 EMHKRAKKVQEHFLWHAEPVKYDLFHMLLHSIWYNR 612
>gi|125557509|gb|EAZ03045.1| hypothetical protein OsI_25187 [Oryza sativa Indica Group]
Length = 601
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 134/349 (38%), Positives = 193/349 (55%), Gaps = 38/349 (10%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPV--DFEPRGNYASESYFKKAF-MKSHFVTKDPSKAD 57
M + F+V+VY +P PV D Y++E F A M++ T+DP++A
Sbjct: 263 MEKVFKVFVY--EEGEP------PVFHDGPCHSIYSTEGRFIYAMEMENRMRTRDPNQAH 314
Query: 58 LFFLPFSIARM--------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVA 109
+FFLPFS+ +M HD + + IS YI +S KYP+WNR+ GADHF ++
Sbjct: 315 VFFLPFSVVKMVKMIYEPNSHD----MDPLRRTISDYINVVSTKYPHWNRSLGADHFMLS 370
Query: 110 CHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI------WPRQEDPPKL 163
CH G + N+I+V+C+++ G +DVSLP+I RQ P
Sbjct: 371 CHDWGPYVSSANGHLFSNSIRVLCNANTS-EGFDPSRDVSLPEINLRSDVVARQVGGPS- 428
Query: 164 GSSKRNKLAFFAGAVNSPVREKLLQVWR--NDSEIYAHS---GRLKTPYADELLGSKFCL 218
+S R LAFFAG + PVR LLQ W D++I R Y D + S+FCL
Sbjct: 429 -ASHRPILAFFAGGDHGPVRPLLLQHWGKGQDADIQVSEYLPRRHSMSYTDMMRRSRFCL 487
Query: 219 HVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKG 278
G+EV + R+ +++Y CVPV+I + Y LPFADVLNW +FS+ VA DIP LK+IL
Sbjct: 488 CPSGYEVASPRVVEAIYLECVPVVIGDDYALPFADVLNWAAFSVRVAVGDIPRLKEILAA 547
Query: 279 ISSEEYLLLQSYVLKVRKHFQW-HVFPSDYDAFYMVMYELWLRRSSVRV 326
+S +Y+ +Q V VR+HF P +D F+M+++ +WLRR +VRV
Sbjct: 548 VSPRQYIRMQRRVRAVRRHFMVSDGAPWRFDVFHMILHSIWLRRLNVRV 596
>gi|359484716|ref|XP_002263848.2| PREDICTED: probable glycosyltransferase At5g20260-like [Vitis
vinifera]
gi|296084516|emb|CBI25537.3| unnamed protein product [Vitis vinifera]
Length = 477
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 124/335 (37%), Positives = 185/335 (55%), Gaps = 17/335 (5%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M + F+V+ Y F L + G + E F KS F+ K P +A FF
Sbjct: 139 MVKRFKVWTYREGAQPIFHEGPLTNIYAIEGQFIDEMDFIVG--KSPFIAKHPDEAHAFF 196
Query: 61 LPFSIARMRHDRRIGTEGIPDF--------ISHYIFNISQKYPYWNRTGGADHFYVACHS 112
LP S+ ++ + D+ ++ Y+ ++ KYPYWNR+GGADHF V+CH
Sbjct: 197 LPLSVVKVVQFLYLPITSPEDYSRKRLQRIVTDYVKVVADKYPYWNRSGGADHFMVSCHD 256
Query: 113 IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WPRQE-DPPKLGSSKRNK 170
S E+ N I+V+C+++ G +DVSLP++ P E PP LG N+
Sbjct: 257 WAPSVSYANPELFKNFIRVLCNANS-SEGFRPGRDVSLPEVNLPAGELGPPHLGQPSNNR 315
Query: 171 --LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYADELLG-SKFCLHVKGFEVN 226
LAFFAG + +R+ L + W++ D+E+ H K +L+G SKFCL G+EV
Sbjct: 316 PVLAFFAGRAHGNIRKILFEHWKDQDNEVLVHERLHKGQNYAKLMGQSKFCLCPSGYEVA 375
Query: 227 TARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLL 286
+ R+ ++++ GCVPVII+N+Y LPF DVL+W FSI + IP +K IL GIS +YL
Sbjct: 376 SPRVVEAIHAGCVPVIISNNYSLPFNDVLDWSQFSIQIPVAKIPEIKTILLGISKNKYLK 435
Query: 287 LQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
+Q VL+VR+HF + +D +M+++ LWLRR
Sbjct: 436 MQERVLRVRRHFVLNRPARPFDIIHMILHSLWLRR 470
>gi|357466145|ref|XP_003603357.1| hypothetical protein MTR_3g106690 [Medicago truncatula]
gi|355492405|gb|AES73608.1| hypothetical protein MTR_3g106690 [Medicago truncatula]
Length = 393
Score = 204 bits (520), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 129/340 (37%), Positives = 191/340 (56%), Gaps = 21/340 (6%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKS--HFVTKDPSKADL 58
M + F+VYVYP ++ + D + Y++E F + F T DP+ A +
Sbjct: 59 MEKIFKVYVYPD------GDLPIVHDGPCKDIYSTEGRFLHEMERGVGKFRTNDPNAAHV 112
Query: 59 FFLPFSIARMRH----DRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIG 114
+FLPFS+ M + F+S Y+ +S +YP+WNRT GADHF +ACH G
Sbjct: 113 YFLPFSVTWMVKYLYTPSSYDITPLTQFVSDYVRVVSMRYPFWNRTHGADHFMLACHDWG 172
Query: 115 RSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WPRQEDPPKLGS-----SKR 168
A + + +I+V+C+++ G KDVSLP+I E PKL S + R
Sbjct: 173 PHASKGNPFLYNTSIRVLCNANT-SEGFNPLKDVSLPEIHLYGGEVSPKLLSLPPENAPR 231
Query: 169 NKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLK-TPYADELLGSKFCLHVKGFEVN 226
LAFFAG ++ P+R LLQ W+N D +I + K Y +L SKFCL GFEV
Sbjct: 232 RYLAFFAGGMHGPIRPILLQHWKNRDKDILVNEYLPKGIDYYSIMLSSKFCLCPSGFEVA 291
Query: 227 TARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLL 286
+ RI +S+Y CVPVI++N+Y LPF+DVL W++FS+ V DIP LK+IL I +Y
Sbjct: 292 SPRIVESIYAECVPVILSNYYVLPFSDVLQWEAFSVQVDVSDIPRLKEILSAIPESKYKK 351
Query: 287 LQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
L+ V VR+HF + +D F+M+++ +WLRR ++++
Sbjct: 352 LKQGVRAVRRHFTLNQPAKRFDVFHMILHSIWLRRLNIKL 391
>gi|15239118|ref|NP_196722.1| Exostosin family protein [Arabidopsis thaliana]
gi|7573418|emb|CAB87721.1| putative protein [Arabidopsis thaliana]
gi|332004317|gb|AED91700.1| Exostosin family protein [Arabidopsis thaliana]
Length = 546
Score = 204 bits (520), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 126/339 (37%), Positives = 191/339 (56%), Gaps = 31/339 (9%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPR----GNYASESYFKKAFMKSH-FVTKDPSK 55
M ++ +VYVY P+ +P G YASE +F K SH F+TKDP+K
Sbjct: 216 MEQTLKVYVYSEGDR--------PIFHQPEAIMEGIYASEGWFMKLMESSHRFLTKDPTK 267
Query: 56 ADLFFLPFSIARMR-----HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVAC 110
A LF++PFS ++ HD + ++ +YI I+ YP WNRT G+DHF+ AC
Sbjct: 268 AHLFYIPFSSRILQQKLYVHDSH-SRNNLVKYLGNYIDLIASNYPSWNRTCGSDHFFTAC 326
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQ--IWPRQEDPPKLGSS-- 166
H + + + +N I+ +C++ I + KDVSLP+ + Q K+G S
Sbjct: 327 HDWAPTETRGPY---INCIRALCNADVGID-FVVGKDVSLPETKVSSLQNPNGKIGGSRP 382
Query: 167 -KRNKLAFFAGAVNSPVREKLLQVW--RNDSEIYAHSGRLKTPYADELLGSKFCLHVKGF 223
KR LAFFAG+++ VR LL W R + ++ + Y + S+FC+ KG+
Sbjct: 383 SKRTILAFFAGSLHGYVRPILLNQWSSRPEQDMKIFNRIDHKSYIRYMKRSRFCVCAKGY 442
Query: 224 EVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEE 283
EVN+ R+ +S+ YGCVPVII++++ PF ++LNW+SF++ V +IP L+KIL I
Sbjct: 443 EVNSPRVVESILYGCVPVIISDNFVPPFLEILNWESFAVFVPEKEIPNLRKILISIPVRR 502
Query: 284 YLLLQSYVLKVRKHFQWH-VFPSDYDAFYMVMYELWLRR 321
Y+ +Q VLKV+KHF WH P YD F+M+++ +W R
Sbjct: 503 YVEMQKRVLKVQKHFMWHDGEPVRYDIFHMILHSVWYNR 541
>gi|26450497|dbj|BAC42362.1| unknown protein [Arabidopsis thaliana]
Length = 546
Score = 204 bits (520), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 126/339 (37%), Positives = 191/339 (56%), Gaps = 31/339 (9%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPR----GNYASESYFKKAFMKSH-FVTKDPSK 55
M ++ +VYVY P+ +P G YASE +F K SH F+TKDP+K
Sbjct: 216 MEQTLKVYVYSEGDR--------PIFHQPEAIMEGIYASEGWFMKLMESSHRFLTKDPTK 267
Query: 56 ADLFFLPFSIARMR-----HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVAC 110
A LF++PFS ++ HD + ++ +YI I+ YP WNRT G+DHF+ AC
Sbjct: 268 AHLFYIPFSSRILQQKLYVHDSH-SRNNLVKYLGNYIDLIASNYPSWNRTCGSDHFFTAC 326
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQ--IWPRQEDPPKLGSS-- 166
H + + + +N I+ +C++ I + KDVSLP+ + Q K+G S
Sbjct: 327 HDWAPTETRGPY---INCIRALCNADVGID-FVVGKDVSLPETKVSSLQNPNGKIGGSRP 382
Query: 167 -KRNKLAFFAGAVNSPVREKLLQVW--RNDSEIYAHSGRLKTPYADELLGSKFCLHVKGF 223
KR LAFFAG+++ VR LL W R + ++ + Y + S+FC+ KG+
Sbjct: 383 SKRTILAFFAGSLHGYVRPILLNQWSSRPEQDMKIFNRIDHKSYIRYMKRSRFCVCAKGY 442
Query: 224 EVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEE 283
EVN+ R+ +S+ YGCVPVII++++ PF ++LNW+SF++ V +IP L+KIL I
Sbjct: 443 EVNSPRVVESILYGCVPVIISDNFVPPFLEILNWESFAVFVPEKEIPNLRKILISIPVRR 502
Query: 284 YLLLQSYVLKVRKHFQWH-VFPSDYDAFYMVMYELWLRR 321
Y+ +Q VLKV+KHF WH P YD F+M+++ +W R
Sbjct: 503 YVEMQKRVLKVQKHFMWHDGEPVRYDIFHMILHSVWYNR 541
>gi|449469432|ref|XP_004152424.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
sativus]
Length = 472
Score = 204 bits (519), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 120/308 (38%), Positives = 177/308 (57%), Gaps = 16/308 (5%)
Query: 30 RGNYASESYFKKAFMKS-HFVTKDPSKADLFFLPFSIARMRHDRRI----GTEGIPDFIS 84
RG YASE +F K ++ FVTKDP KA LF+L +S +++ + + + ++
Sbjct: 156 RGIYASEGWFMKLMEENRQFVTKDPEKAHLFYLAYSSRQLQTALYVPDSHNMKPLSIYLR 215
Query: 85 HYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIA 144
++ I+ KYPYWNRT G DHF VACH G + + E+ + I+ +C++
Sbjct: 216 DHVNWIAGKYPYWNRTHGYDHFLVACHDWGPYTVNEHRELSQHTIKALCNADLSEGVFKL 275
Query: 145 HKDVSLPQIWPRQEDPP--KLGS---SKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAH 199
KDVSLP+ R P +G S+R LAFFAG ++ VR LL+ W + +
Sbjct: 276 GKDVSLPETTIRTPRKPLRNVGGKRVSQRPILAFFAGNMHGRVRPILLKHWNDKDDDIRV 335
Query: 200 SGRL------KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFAD 253
G L K Y + SK+C+ G+EVN+ RI +++YY CVPVIIA+++ LPF++
Sbjct: 336 YGPLPLRVSRKMTYIQHMKSSKYCICPMGYEVNSPRIIEAIYYECVPVIIADNFVLPFSE 395
Query: 254 VLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
L+W +FS+VVA DIP LK+IL I + YL +Q V V+KHF W+ P YD F+MV
Sbjct: 396 FLDWSAFSVVVAEKDIPKLKEILTAIPLKRYLTMQINVKMVQKHFLWNPKPLKYDLFHMV 455
Query: 314 MYELWLRR 321
++ +W R
Sbjct: 456 LHSIWFSR 463
>gi|15239502|ref|NP_197954.1| Exostosin family protein [Arabidopsis thaliana]
gi|110738111|dbj|BAF00988.1| hypothetical protein [Arabidopsis thaliana]
gi|332006108|gb|AED93491.1| Exostosin family protein [Arabidopsis thaliana]
Length = 654
Score = 204 bits (519), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 132/344 (38%), Positives = 184/344 (53%), Gaps = 38/344 (11%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH--FVTKDPSKADL 58
M + +VY Y + +L RG YASE +F ++ FVTKDP+KA L
Sbjct: 320 MEKILKVYAYKEGNKPIMHSPIL------RGIYASEGWFMNIIESNNNKFVTKDPAKAHL 373
Query: 59 FFLPFSIARM---------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVA 109
F+LPFS +RM H R + ++ YI IS KYP+WNRT GADHF A
Sbjct: 374 FYLPFS-SRMLEVTLYVQDSHSHR----NLIKYLKDYIDFISAKYPFWNRTSGADHFLAA 428
Query: 110 CHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLGSSK 167
CH S +K +I+ +C+S G + KD SLP+ + R P +G
Sbjct: 429 CHDWAPSETRKHMA---KSIRALCNSD-VKEGFVFGKDTSLPETFVRDPKKPLSNMGGKS 484
Query: 168 RNK---LAFFAGAVNSP-VREKLLQVWRN----DSEIYAHSGRLK--TPYADELLGSKFC 217
N+ LAFFAG + +R LL W N D +I+ R K Y + SK+C
Sbjct: 485 ANQRPILAFFAGKPDHGYLRPILLSYWGNNKDPDLKIFGKLPRTKGNKNYLQFMKTSKYC 544
Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILK 277
+ KGFEVN+ R+ ++++Y CVPVII++++ PF +VLNW+SF+I + DIP LKKIL
Sbjct: 545 ICAKGFEVNSPRVVEAIFYDCVPVIISDNFVPPFFEVLNWESFAIFIPEKDIPNLKKILM 604
Query: 278 GISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
I Y +Q V KV+KHF WH P YD F+M+++ +W R
Sbjct: 605 SIPESRYRSMQMRVKKVQKHFLWHAKPEKYDMFHMILHSIWYNR 648
>gi|9758523|dbj|BAB08970.1| unnamed protein product [Arabidopsis thaliana]
Length = 559
Score = 204 bits (518), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 123/342 (35%), Positives = 191/342 (55%), Gaps = 32/342 (9%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH--FVTKDPSKADL 58
M R ++YVY F + PRG YASE +F K M+S+ FV KDP KA L
Sbjct: 222 MERKLKIYVYKEGGKPIFHTPM------PRGIYASEGWFMK-LMESNKKFVVKDPRKAHL 274
Query: 59 FFLPFSIARMRHDRRIG---TEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
F++P SI +R + + + D + Y+ I+ KY +WNRTGGADHF VACH G
Sbjct: 275 FYIPISIKALRSSLGLDFQTPKSLADHLKEYVDLIAGKYKFWNRTGGADHFLVACHDWGN 334
Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPK--LG---SSKRNK 170
K + N+++ +C+S+ G D +LP + R + P LG SS+R
Sbjct: 335 KLTTKTMK---NSVRSLCNSN-VAQGFRIGTDTALPVTYIRSSEAPLEYLGGKTSSERKI 390
Query: 171 LAFFAGAVNSPVREKLLQVWRN---DSEIYA---HSGRLKTPYADELLGS-----KFCLH 219
LAFFAG+++ +R L+++W N D +I+ + K Y + + S ++C+
Sbjct: 391 LAFFAGSMHGYLRPILVKLWENKEPDMKIFGPMPRDPKSKKQYREYMKSSSSHFNRYCIC 450
Query: 220 VKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGI 279
+G+EV+T R+ +++ CVPVIIA++Y PF +VLNW+ F++ V DIP L+ IL I
Sbjct: 451 ARGYEVHTPRVVEAIINECVPVIIADNYVPPFFEVLNWEEFAVFVEEKDIPNLRNILLSI 510
Query: 280 SSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
+ Y+ +Q+ V V++HF WH P +D F+M+++ +W R
Sbjct: 511 PEDRYIGMQARVKAVQQHFLWHKKPVKFDQFHMILHSIWYSR 552
>gi|224129250|ref|XP_002320538.1| predicted protein [Populus trichocarpa]
gi|222861311|gb|EEE98853.1| predicted protein [Populus trichocarpa]
Length = 332
Score = 203 bits (517), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 123/340 (36%), Positives = 190/340 (55%), Gaps = 22/340 (6%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPV--DFEPRGNYASESYFKKAFMKSH-FVTKDPSKAD 57
M + F++Y+Y + DP P+ D + Y+SE F K F T DP +A
Sbjct: 1 MEKLFKIYIY--KEGDP------PMFHDGPCKSIYSSEGRFIHELEKGKSFTTTDPDEAL 52
Query: 58 LFFLPFSIARMRHDRRI----GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
++FLPFS+ + + + I + + YI I+ KYP+WNR+ GADHF ++CH
Sbjct: 53 VYFLPFSVVMLVQYLYVPGSHEIDAIGNTVVDYINVIADKYPFWNRSLGADHFILSCHDW 112
Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKL----GSSKRN 169
G + N+I+V+C+++ G KD S P+I R + L S+R+
Sbjct: 113 GPRTSSYVPHLFNNSIRVLCNANTS-EGFNPKKDASFPEIHLRTGEITGLVGGPSPSRRS 171
Query: 170 KLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSG-RLKTPYADELLGSKFCLHVKGFEVNT 227
LAFFAG ++ +R LL+ W++ D ++ H R Y L S+FCL G+EV +
Sbjct: 172 ILAFFAGRLHGHIRRLLLEQWKDKDQDVQVHDQLRNGMSYDSMLKNSRFCLCPSGYEVAS 231
Query: 228 ARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLL 287
RI +++Y CVPV+I++ Y PF+DVLNWK+FSI V DIP +K IL GIS +YL +
Sbjct: 232 PRIVEAIYAECVPVLISDGYVPPFSDVLNWKAFSIQVQVKDIPKIKDILMGISQRQYLRM 291
Query: 288 QSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
Q V +V++HF + P +D F+M ++ +WLRR ++R+
Sbjct: 292 QRRVKQVQRHFVVNGIPKRFDVFHMTIHSIWLRRLNIRIH 331
>gi|449531189|ref|XP_004172570.1| PREDICTED: probable glycosyltransferase At5g03795-like, partial
[Cucumis sativus]
Length = 344
Score = 203 bits (517), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 120/308 (38%), Positives = 177/308 (57%), Gaps = 16/308 (5%)
Query: 30 RGNYASESYFKKAFMKS-HFVTKDPSKADLFFLPFSIARMRHDRRI----GTEGIPDFIS 84
RG YASE +F K ++ FVTKDP KA LF+L +S +++ + + + ++
Sbjct: 28 RGIYASEGWFMKLMEENRQFVTKDPEKAHLFYLAYSSRQLQTALYVPDSHNMKPLSIYLR 87
Query: 85 HYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIA 144
++ I+ KYPYWNRT G DHF VACH G + + E+ + I+ +C++
Sbjct: 88 DHVNWIAGKYPYWNRTHGYDHFLVACHDWGPYTVNEHRELSQHTIKALCNADLSEGVFKL 147
Query: 145 HKDVSLPQIWPRQEDPP--KLGS---SKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAH 199
KDVSLP+ R P +G S+R LAFFAG ++ VR LL+ W + +
Sbjct: 148 GKDVSLPETTIRTPRKPLRNVGGKRVSQRPILAFFAGNMHGRVRPILLKHWNDKDDDIRV 207
Query: 200 SGRL------KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFAD 253
G L K Y + SK+C+ G+EVN+ RI +++YY CVPVIIA+++ LPF++
Sbjct: 208 YGPLPLRVSRKMTYIQHMKSSKYCICPMGYEVNSPRIIEAIYYECVPVIIADNFVLPFSE 267
Query: 254 VLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
L+W +FS+VVA DIP LK+IL I + YL +Q V V+KHF W+ P YD F+MV
Sbjct: 268 FLDWSAFSVVVAEKDIPKLKEILTAIPLKRYLTMQINVKMVQKHFLWNPKPLKYDLFHMV 327
Query: 314 MYELWLRR 321
++ +W R
Sbjct: 328 LHSIWFSR 335
>gi|51477380|gb|AAU04753.1| EXO [Cucumis melo]
Length = 343
Score = 202 bits (515), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 122/338 (36%), Positives = 189/338 (55%), Gaps = 18/338 (5%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH-FVTKDPSKADLF 59
M R ++YVY + +P P + Y++E F K + + T DP +A L+
Sbjct: 12 MERLLKIYVY--KEGEPPMFHGGPC----KSIYSTEGRFIHEMEKGNLYTTNDPDQALLY 65
Query: 60 FLPFSIARMRHDRRI----GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
FLPFS+ + + I I+ YI IS+K+P+W+R+ GADHF ++CH G
Sbjct: 66 FLPFSVVNLVQYLYVPNSHEVNAIGRAITDYINVISKKHPFWDRSLGADHFMLSCHDWGP 125
Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKL----GSSKRNKL 171
+ N+I+V+C+++ G + KD S P+I R + L S+R+ L
Sbjct: 126 RTTSYVPLLFNNSIRVLCNAN-VSEGFLPSKDASFPEIHLRTGEIDGLIGGLSPSRRSVL 184
Query: 172 AFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKT--PYADELLGSKFCLHVKGFEVNTAR 229
AFFAG ++ +R LLQ W+ E L + Y L S+FCL G+EV + R
Sbjct: 185 AFFAGRLHGHIRYLLLQEWKEKDEDVLVYEELPSGISYNSMLKKSRFCLCPSGYEVASPR 244
Query: 230 IADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQS 289
+ +++Y CVPV+I+ Y PF+DVLNWKSFS+ + DIP +KKILKGIS +YL +Q
Sbjct: 245 VVEAIYAECVPVLISESYVPPFSDVLNWKSFSVQIQVKDIPNIKKILKGISQTQYLRMQR 304
Query: 290 YVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
V +V++HF + P +DAF+M+++ +WLRR ++ +Q
Sbjct: 305 RVKQVQRHFALNGTPKRFDAFHMILHSIWLRRLNIHIQ 342
>gi|358348291|ref|XP_003638181.1| hypothetical protein MTR_121s0013 [Medicago truncatula]
gi|355504116|gb|AES85319.1| hypothetical protein MTR_121s0013 [Medicago truncatula]
Length = 550
Score = 202 bits (515), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 127/341 (37%), Positives = 189/341 (55%), Gaps = 35/341 (10%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEP--RGNYASESYFKKAFM-KSHFVTKDPSKAD 57
M R +VY+Y R + P+ +P RG YASE +F K F+ KDP KA
Sbjct: 222 MERKLKVYIY--REGEK------PIFHQPKMRGIYASEGWFMKLMEGNKRFIVKDPKKAH 273
Query: 58 LFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHS----I 113
LF+LPFS +R + + + ++ Y+ I+ KY +WNRTGGADHF VACH I
Sbjct: 274 LFYLPFSSQMLRANLS-DNKKMEQYLDKYVNIIAGKYRFWNRTGGADHFLVACHDWASRI 332
Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLGS---SKR 168
R M+ N I+ +C+++ G KD +LP + P K+ S+R
Sbjct: 333 TRQPMK-------NCIRSLCNAN-VAKGFQIGKDTTLPATYIHSVMNPLRKIAGKHPSER 384
Query: 169 NKLAFFAGAVNSPVREKLLQVWRN---DSEIY---AHSGRLKTPYADELLGSKFCLHVKG 222
LAFFAG+++ +R LL+ W N D +I+ A K Y D + SK+C+ +G
Sbjct: 385 TILAFFAGSMHGYLRPILLKHWENKEPDMKIFGAMARDAEGKRIYMDYMNSSKYCICARG 444
Query: 223 FEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSE 282
+EV + RI ++++ CVPVII+++Y PF +VL W++FS+ V D+P L+ IL I+ E
Sbjct: 445 YEVYSPRIVEAIFSECVPVIISDNYVPPFFEVLKWEAFSVFVRERDVPNLRSILLSITEE 504
Query: 283 EYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSS 323
+YL L V KV++HF WH P YD F+M+++ +W R S
Sbjct: 505 KYLALHLGVKKVQQHFLWHKVPVKYDLFHMILHSIWNNRLS 545
>gi|297805260|ref|XP_002870514.1| hypothetical protein ARALYDRAFT_915839 [Arabidopsis lyrata subsp.
lyrata]
gi|297316350|gb|EFH46773.1| hypothetical protein ARALYDRAFT_915839 [Arabidopsis lyrata subsp.
lyrata]
Length = 536
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 122/334 (36%), Positives = 185/334 (55%), Gaps = 41/334 (12%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH--FVTKDPSKADL 58
M R +VYVY F L PRG YASE +F K M+S+ FV +DP KA L
Sbjct: 224 MERKLKVYVYKEGGKPIFHKPL------PRGIYASEGWFMK-LMESNKKFVVRDPRKAHL 276
Query: 59 FFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAM 118
F++P + + Y+ I+ KY +WNRTGGADHF VACH G
Sbjct: 277 FYIPI-----------------NHLKEYVDLIAGKYKFWNRTGGADHFIVACHDWGNKLT 319
Query: 119 QKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPK--LGS---SKRNKLAF 173
+K + N+++ +C+S+ G D +LP + R + P LG SKR LAF
Sbjct: 320 KKTMK---NSVRALCNSN-VAQGFRIGTDTALPVTYIRSAESPLEYLGGKTPSKRKILAF 375
Query: 174 FAGAVNSPVREKLLQVWRN---DSEIYAHSGR---LKTPYADELLGSKFCLHVKGFEVNT 227
FAG+++ +R L+++W N D +I R KT Y + + SK+C+ +G+EV+T
Sbjct: 376 FAGSMHGYLRPILVKLWENKEPDMKIVGPMPRDPESKTQYREYMKSSKYCICARGYEVHT 435
Query: 228 ARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLL 287
R+ +++ CVPVIIA++Y PF ++LNW+ F++ V +I L+ IL IS E Y+++
Sbjct: 436 PRVVEAIINECVPVIIADNYVPPFFEILNWEEFAVFVEEKEIANLRNILLSISEERYIVM 495
Query: 288 QSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
Q+ V V++HF WH P +D F+M+++ +W R
Sbjct: 496 QARVKAVQQHFLWHKKPVKFDLFHMILHSIWHSR 529
>gi|449462352|ref|XP_004148905.1| PREDICTED: probable glycosyltransferase At5g25310-like [Cucumis
sativus]
gi|449523501|ref|XP_004168762.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
At5g25310-like [Cucumis sativus]
Length = 684
Score = 200 bits (509), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 128/336 (38%), Positives = 185/336 (55%), Gaps = 25/336 (7%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH-FVTKDPSKADLF 59
M + +VY+Y F L + YASE +F K + FVTK+P KA LF
Sbjct: 351 MESTLKVYIYREGARPIFHQGPL------QSIYASEGWFMKILESNKKFVTKNPRKAHLF 404
Query: 60 FLPFSIARMRHDRRI-GTEGIPDFISH---YIFNISQKYPYWNRTGGADHFYVACHSIGR 115
+LPFS ++ + + + I H Y+ I+ KYP+WNRTGGADHF VACH
Sbjct: 405 YLPFSSRQLEEVLYVRDSHSHKNLIQHLKNYLDFIAAKYPHWNRTGGADHFLVACHDWAP 464
Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLG---SSKRNK 170
+ +K I+ +C+S G + KDVSLP+ + R P +G SSKR
Sbjct: 465 AETRKYMA---KCIRALCNSD-VKEGFVFGKDVSLPETFVRVARNPLRDVGGNPSSKRPI 520
Query: 171 LAFFAGAVNSPVREKLLQVW-RNDSEIYAHSGRLKTPYADELL----GSKFCLHVKGFEV 225
LAFFAG+++ +R LL+ W R D ++ K + L SK+C+ KG+EV
Sbjct: 521 LAFFAGSMHGYLRSTLLEYWERKDPDMKISGPMPKVKGSKNYLWHMKNSKYCICAKGYEV 580
Query: 226 NTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL 285
N+ R+ +S+ Y CVPVII++++ P +VLNW+SF++ VA DIP LKKIL I + Y
Sbjct: 581 NSPRVVESILYECVPVIISDNFVPPLFEVLNWESFAVFVAEKDIPNLKKILLSIPEKRYR 640
Query: 286 LLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
+Q V K++ HF WH P YD F+M+++ +W R
Sbjct: 641 EMQMRVKKLQPHFLWHAKPQKYDMFHMILHSIWYNR 676
>gi|168062371|ref|XP_001783154.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162665352|gb|EDQ52040.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 351
Score = 200 bits (509), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 126/341 (36%), Positives = 186/341 (54%), Gaps = 35/341 (10%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMK--SHFVTKDPSKADL 58
M ++F+VYVY D + ++ + G YA+E F K + + DP++A +
Sbjct: 1 MQKTFKVYVY----KDGYKPLVHAA--KTGGIYATEGLFLKRMDDPGNRYTVSDPTQAHM 54
Query: 59 FFLPFSIARM---------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVA 109
F LP+S+ ++ R R + T FI++Y+ I+ KYPYWNRT GADHF+V+
Sbjct: 55 FLLPYSVRQLVDFIQDPYSRSMRPLKT-----FIANYVERITSKYPYWNRTRGADHFFVS 109
Query: 110 CHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWP----RQEDPPKLGS 165
CH + E+ N+++VVC++ + I KDVS+PQ + D L
Sbjct: 110 CHDWAPLSTILHDELHNNSMKVVCNADLTANFDI-QKDVSIPQAVKGGNQSELDIDNLPP 168
Query: 166 SKRNKLAFFAGAVNSPVREKLLQVWRNDS------EIYAHSGRLKTPYADELLGSKFCLH 219
KR+ LAF+AG ++ VR L+Q WR E+ YA + SKFCL
Sbjct: 169 GKRDYLAFYAGQMHGLVRPVLIQHWRGKDSSMKVYEVLPPEIAKNISYAQHMKRSKFCLC 228
Query: 220 VKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGI 279
KGFEVN+ RI +++ GCVPVIIA+++ LPF++VL+W FSI V DIP LK+IL +
Sbjct: 229 PKGFEVNSPRIVEAILSGCVPVIIADNFVLPFSNVLDWSKFSITVEEKDIPNLKRILTNV 288
Query: 280 SSEEYLLLQSYVLKVRKHFQWHVFPSD--YDAFYMVMYELW 318
Y +QS + +R+HF W D YD+F+M MY +W
Sbjct: 289 PDGTYRSMQSCLKYIRRHFVWLEDQEDTQYDSFHMTMYSIW 329
>gi|224120358|ref|XP_002331028.1| predicted protein [Populus trichocarpa]
gi|222872958|gb|EEF10089.1| predicted protein [Populus trichocarpa]
Length = 334
Score = 199 bits (506), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 124/344 (36%), Positives = 188/344 (54%), Gaps = 30/344 (8%)
Query: 1 MNRSFRVYVY-----PHRRNDPFANVLLPVDFEPRGNYASESYFKKAFM--KSHFVTKDP 53
M + F+++VY P N P ++ Y+ E F KS F+ ++
Sbjct: 1 MVKRFKIWVYREGETPMVHNGPMKHI-----------YSIEGQFIDEMESGKSPFLARNH 49
Query: 54 SKADLFFLPFSIARMRHDRRIGT-----EGIPDFISHYIFNISQKYPYWNRTGGADHFYV 108
+A FFLP S+A + + E + Y+ ++ KYPYWNR+ G DHF V
Sbjct: 50 DEAHAFFLPISVAYIVEFVYLPITTYHRERLVRIFKDYVTVVANKYPYWNRSRGGDHFMV 109
Query: 109 ACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI--WPRQEDPP--KLG 164
+CH + E+ N I+V+C+++ G +D +LP++ P + P L
Sbjct: 110 SCHDWAPQVSRDDPELYKNLIRVMCNANTS-EGFRPRRDATLPELNCPPLKLTPACRGLA 168
Query: 165 SSKRNKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYADELLG-SKFCLHVKG 222
+R AFFAG + +R+ LL+ W+ D EI H K EL+G SKFCL G
Sbjct: 169 PHERKIFAFFAGGAHGDIRKILLRHWKEKDDEIQVHEYLPKDQDYMELMGQSKFCLCPSG 228
Query: 223 FEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSE 282
FEV + R+A+S+Y GCVPVII++HY+LPF+DVL+W FS+ + IP +K IL+GIS +
Sbjct: 229 FEVASPRVAESIYSGCVPVIISDHYNLPFSDVLDWSQFSVQIPVEKIPEIKTILRGISYD 288
Query: 283 EYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
EYL +Q V+KV++HF + YD +MV++ +WLRR ++RV
Sbjct: 289 EYLKMQKGVMKVQRHFVLNRPAKPYDVLHMVLHSVWLRRLNIRV 332
>gi|356518348|ref|XP_003527841.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
At5g03795-like [Glycine max]
Length = 549
Score = 199 bits (506), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 115/318 (36%), Positives = 177/318 (55%), Gaps = 28/318 (8%)
Query: 24 PVDFEP--RGNYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTE---- 77
P+ EP G YASE +F K + VT DP KA LF++PFS ++ +
Sbjct: 233 PIFHEPLLDGIYASEGWFMKLMEANKXVTGDPGKAHLFYIPFSSRLLQQTLYVRNSHRHS 292
Query: 78 GIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSY 137
+ +++ +Y+ I+ KYP+WNRT GADHF VACH + + L++I+ +C++
Sbjct: 293 NLIEYMKNYVKMIAGKYPFWNRTSGADHFVVACHDWAPAETRGRM---LSSIRALCNADI 349
Query: 138 FISGHIAHKDVSLPQIWPR---------QEDPPKLGSSKRNKLAFFAGAVNSPVREKLLQ 188
+ I KDVSLP+ + R + DPP S+R LAFFAG ++ V LL+
Sbjct: 350 EVGFKIG-KDVSLPETYIRSSENPVKNIEGDPP----SQRPILAFFAGGLHVYVXPILLK 404
Query: 189 VWRNDSEIYAHSG-----RLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVII 243
W N SG R Y + SKFC+H +G EVN+ R+ +++++ C+PVII
Sbjct: 405 HWENKEPDMKISGPLPHVRGNVNYIQFMKSSKFCIHARGHEVNSPRVVEAIFHECIPVII 464
Query: 244 ANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVF 303
++++ PF ++LNW+SF++ V +IP L+ IL IS E YL + V KV++HF WH
Sbjct: 465 SDNFIPPFFEILNWESFAVFVTEEEIPNLRNILLSISEERYLEMHKRVKKVQEHFPWHAE 524
Query: 304 PSDYDAFYMVMYELWLRR 321
P D +M+++ +W R
Sbjct: 525 PVKDDLSHMLLHSIWYNR 542
>gi|356518346|ref|XP_003527840.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 633
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 128/336 (38%), Positives = 184/336 (54%), Gaps = 25/336 (7%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKS-HFVTKDPSKADLF 59
M R+ +VYVY R A + P+ G YASE +F K S FVT DP A LF
Sbjct: 301 MERTLKVYVY---REGDKAIMHSPI---LSGLYASEGWFMKHMEASKQFVTTDPKNAHLF 354
Query: 60 FLPFSIARMRHDR---RIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRS 116
+LPFS R+ + + ++S+Y+ I+ K+ +WNRTGGADHF VACH +
Sbjct: 355 YLPFSSQRLVDALWVPKSSYGNLIQYLSNYVDMIAGKHHFWNRTGGADHFLVACHDWAPA 414
Query: 117 AMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLGS---SKRNKL 171
++ L A+ C++ G + KD+SLP+ R P +G SKR L
Sbjct: 415 ETKQHMAKCLRAL---CNAD-VKQGFVFGKDMSLPETVVRSPRNPTRSIGGNQVSKRKTL 470
Query: 172 AFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCLHVKGFEVN 226
AFFAG ++ VR LLQ W N GRL Y + SK+C+ KG+EVN
Sbjct: 471 AFFAGQMHGYVRPILLQHWENKDPDMKIFGRLPKSKGNRNYIQYMKSSKYCICAKGYEVN 530
Query: 227 TARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLL 286
+ R+ +++ Y CVPVI+++++ PF ++LNW+SF++ V DIP LK IL I + YL
Sbjct: 531 SPRVVEAILYECVPVILSDNFVPPFFEMLNWESFAVFVLEKDIPNLKNILLSIPRKRYLQ 590
Query: 287 LQSYVLKVRKHFQWH-VFPSDYDAFYMVMYELWLRR 321
+Q V KV++HF WH P YD F+M+++ +W R
Sbjct: 591 MQMMVRKVQQHFLWHNKSPVKYDIFHMILHSIWYNR 626
>gi|356575096|ref|XP_003555678.1| PREDICTED: probable glycosyltransferase At3g07620-like [Glycine
max]
Length = 493
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 126/344 (36%), Positives = 188/344 (54%), Gaps = 30/344 (8%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGN-YASESYFKKAF-MKSHFVTKDPSKADL 58
M + F+++VY F + P N Y+ E F + + S F T++P +A +
Sbjct: 163 MEKVFKIFVYEEGEPPLF-------HYGPCKNIYSMEGIFINSLEINSQFRTQNPDEAHV 215
Query: 59 FFLPFSIARMRH--------DRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVAC 110
+FLPFS+ + D+ + I D Y+ IS KY YWNR+ GADHF ++C
Sbjct: 216 YFLPFSVVMILEHLFHPVIRDKAVLERTIGD----YVHIISHKYKYWNRSYGADHFMLSC 271
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHI-AHKDVSLPQI-WPRQEDPPKLGS--- 165
H G A E+ AI+V+C+++ IS H KD S P+I E +G
Sbjct: 272 HDWGPRATWYVKELYFIAIRVLCNAN--ISEHFNPKKDASFPEINLVNGETRGLIGGYPP 329
Query: 166 SKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRL--KTPYADELLGSKFCLHVKGF 223
R LAFFAG ++ +R L Q W + +L PY + + SK+C+ GF
Sbjct: 330 CNRTILAFFAGQMHGRIRPVLFQHWEGKDKDVLVYEKLPDGVPYHETMKKSKYCICPSGF 389
Query: 224 EVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEE 283
EV + RI +++Y CVPVII+ Y LPF+DVLNW SFS+ + D+P LK+IL GIS ++
Sbjct: 390 EVASPRIVEAIYAQCVPVIISQQYVLPFSDVLNWDSFSVQILVSDVPKLKEILLGISEDK 449
Query: 284 YLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
Y+ LQ V +V++HF + P YD F+M+++ +WLRR +VRV+
Sbjct: 450 YMRLQEGVKQVQRHFVVNNPPKRYDVFHMIIHSIWLRRLNVRVK 493
>gi|297811281|ref|XP_002873524.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297319361|gb|EFH49783.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 546
Score = 198 bits (504), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 122/342 (35%), Positives = 190/342 (55%), Gaps = 37/342 (10%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPR----GNYASESYFKKAFMKSH-FVTKDPSK 55
M ++ +VY+Y P+ +P G YASE +F K +H F+TKDP+
Sbjct: 216 MEQTLKVYIYSEGDR--------PIFHQPEAIMEGIYASEGWFMKLMESNHRFLTKDPNI 267
Query: 56 ADLFFLPFSIARMR--------HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFY 107
A LF+LPFS ++ H RR + ++ +Y+ I+ YP+WNRT G+DHF+
Sbjct: 268 AHLFYLPFSTRILQQKLYVHDSHSRR----NLVKYLKNYLDLIASNYPFWNRTRGSDHFF 323
Query: 108 VACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQ--IWPRQEDPPKLGS 165
ACH + + + +N I+ +C++ + + KDVSLP+ I Q +G
Sbjct: 324 TACHDWAPAETRGPY---INCIRSLCNADVGVD-FVVGKDVSLPETKISSSQNPNGNIGG 379
Query: 166 ---SKRNKLAFFAGAVNSPVREKLLQVW--RNDSEIYAHSGRLKTPYADELLGSKFCLHV 220
SKR LAFFAG ++ VR LL W R + ++ + Y + S+FC+
Sbjct: 380 NRPSKRTILAFFAGNLHGYVRPILLNQWSSRPEPDMKIFNRIDHKSYIRYMKRSRFCVCA 439
Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGIS 280
KG+EVN+ R+ +S+ YGCVPVII++++ PF ++LNW+SF++ V +IP L+KIL I
Sbjct: 440 KGYEVNSPRVVESVLYGCVPVIISDNFVPPFLEILNWESFAVFVPEKEIPNLRKILISIP 499
Query: 281 SEEYLLLQSYVLKVRKHFQWH-VFPSDYDAFYMVMYELWLRR 321
Y+ +Q V+KV+KHF WH P YD F+M+++ +W R
Sbjct: 500 VRRYVEMQKRVMKVQKHFMWHDGEPVRYDVFHMILHSVWYNR 541
>gi|356554237|ref|XP_003545455.1| PREDICTED: probable glycosyltransferase At5g25310-like [Glycine
max]
Length = 483
Score = 198 bits (504), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 122/344 (35%), Positives = 191/344 (55%), Gaps = 26/344 (7%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYF--KKAFMKSHFVTKDPSKADL 58
M + F+VYVYP ++ + D + Y+ E F + F T DP+ A +
Sbjct: 149 MEKIFKVYVYPD------GDLPIAHDGPCKDIYSIEGRFLHEMEHGAGRFRTNDPNAAHV 202
Query: 59 FFLPFSIARMRH----DRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIG 114
+FLPFS+ M + F+S Y+ IS ++P+WN T GADHF +ACH G
Sbjct: 203 YFLPFSVTWMVKYLYTPLSFNVTPLKQFVSDYVRVISTRHPFWNITHGADHFMLACHDWG 262
Query: 115 RSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW-------PRQEDPPKLGSSK 167
A Q + +I+V+C+++ G KDVSLP+I P+ PP ++
Sbjct: 263 PHASQGNPFLYNTSIRVLCNANT-SEGFNPRKDVSLPEIHLYGGEVSPKLLSPPP-DTAP 320
Query: 168 RNKLAFFAGAVNSPVREKLLQVWRNDSE----IYAHSGRLKTPYADELLGSKFCLHVKGF 223
R LAFF+G ++ P+R LL+ W+ND++ +Y + + Y +L SKFCL G
Sbjct: 321 RRYLAFFSGGLHGPIRPALLRHWKNDNDDDIRVYEYLPK-DLDYYSFMLNSKFCLCPSGH 379
Query: 224 EVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEE 283
EV + RI +++Y CVPVI++ +Y LPF+DVL W++FS+ V DIP LK+IL IS ++
Sbjct: 380 EVASPRIVEAIYAECVPVILSEYYVLPFSDVLQWEAFSVQVDVSDIPRLKEILSAISEDK 439
Query: 284 YLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
Y L+ V VR HF + +D F+M+++ +WLRR +++++
Sbjct: 440 YRKLKEGVKAVRGHFTLNRPAKRFDVFHMILHSIWLRRLNIKLR 483
>gi|356565157|ref|XP_003550811.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
[Glycine max]
Length = 411
Score = 198 bits (503), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 116/300 (38%), Positives = 177/300 (59%), Gaps = 15/300 (5%)
Query: 46 SHFVTKDPSKADLFFLPFSIARMRH------DRRIGTEGI--PDFISHYIFNISQKYPYW 97
SHF + P +A +FFLPFSIA + H + E I + YI I+ KYPYW
Sbjct: 112 SHFRAEHPDQAQVFFLPFSIANVVHYVYKPIRKHSDYEPIRLQRLVEDYIGVIANKYPYW 171
Query: 98 NRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW-PR 156
NR+ GADHF ++CH G ++ N I+V+C+++ G + +KDVS+P+++ P+
Sbjct: 172 NRSEGADHFLLSCHDWGPKVSYGNPKLFKNFIRVLCNANTS-EGFLPNKDVSIPEVYLPK 230
Query: 157 QE-DPPKLGS--SKRNKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYADELL 212
+ PP LG + R+ LAFFAG + +R+ LL W+ D++I H K +L+
Sbjct: 231 GKLGPPNLGQRPNDRSILAFFAGREHGDIRKILLNHWKGKDNDIQVHEYLPKGKNYTQLM 290
Query: 213 G-SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPL 271
G SKFCL G+EV + R+ ++++ GCVPV+I++ Y PF DVLNW FS+ + I
Sbjct: 291 GQSKFCLCPSGYEVASPRVVEAIHAGCVPVLISSSYSPPFTDVLNWSQFSVEIPVEKISE 350
Query: 272 LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQWSTS 331
+K IL+ IS YL L VL+VR+HF + +D +M+++ +WLRR ++R+ STS
Sbjct: 351 IKTILQSISRNRYLRLHMNVLRVRRHFMLNRPAKPFDLMHMILHSIWLRRLNLRLIGSTS 410
>gi|147805375|emb|CAN76361.1| hypothetical protein VITISV_035438 [Vitis vinifera]
Length = 459
Score = 197 bits (502), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 115/300 (38%), Positives = 174/300 (58%), Gaps = 24/300 (8%)
Query: 45 KSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDF--------ISHYIFNISQKYPY 96
KS F+ K P +A FFLP S+ ++ + D+ ++ Y+ ++ KYPY
Sbjct: 154 KSPFIAKHPDEAHAFFLPLSVVKVVQFLYLPITSPEDYSRKRLQRVVTDYVKVVADKYPY 213
Query: 97 WNRTGGADHFYVACHS-IGRSAMQKAW--------EVKLNAIQVVCSSSYFISGHIAHKD 147
WNR+GGADHF V+CH + S+++ E+ N I+V+C+++ G +D
Sbjct: 214 WNRSGGADHFMVSCHDWVSSSSLEPMAPSVSYANPELFKNFIRVLCNANS-SEGFRPGRD 272
Query: 148 VSLPQI-WPRQE-DPPKLGSSKRNK--LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGR 202
VSLP++ P E PP LG N+ LAFFAG + +R+ L + W++ D+E+ H
Sbjct: 273 VSLPEVNLPAGELGPPHLGQPSNNRPILAFFAGRAHGNIRKILFEHWKDQDNEVLVHERL 332
Query: 203 LKTPYADELLG-SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFS 261
K +L+G SKFCL G+EV + R+ ++++ GCVPVII+N Y LPF DVL+W FS
Sbjct: 333 HKGQNYAKLMGQSKFCLCPSGYEVASPRVVEAIHAGCVPVIISNXYSLPFNDVLDWSQFS 392
Query: 262 IVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
I + IP +K IL GIS +YL +Q VL+VR+HF + +D +M+++ LWLRR
Sbjct: 393 IQIPEAKIPEIKTILLGISKNKYLKMQERVLRVRRHFVLNRPARPFDIIHMILHSLWLRR 452
>gi|356548516|ref|XP_003542647.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
[Glycine max]
Length = 340
Score = 197 bits (501), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 112/299 (37%), Positives = 176/299 (58%), Gaps = 24/299 (8%)
Query: 46 SHFVTKDPSKADLFFLPFSIARM------RHDRRIGTEGIPDFISHYIFNISQKYPYWNR 99
SHF ++P++A +F +PFSI + R+ R+ G++ I + YI I+ KYPYWNR
Sbjct: 44 SHFRARNPNQAHVFLIPFSIVNIVQYVYNRNLRQPGSQSIQLLVEDYIRVIAHKYPYWNR 103
Query: 100 TGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI--WPRQ 157
T GADHF ++CH G + ++ N I+V+C+++ G +KDVS+P++ PR
Sbjct: 104 TEGADHFLLSCHDWGPTISYANPKLFKNFIRVLCNANTS-EGFRPNKDVSIPEVNLLPRG 162
Query: 158 EDPPKLGSSKRNK-------LAFFAGAVNSPVREKLLQVWR---NDSEIYAHSGRLKTPY 207
LGS R + LAFFAG + +R LL W+ ND +IY + K Y
Sbjct: 163 ----TLGSPNRGQHPNDRTILAFFAGREHGAIRTILLNHWKDKDNDVQIYESLPKGKV-Y 217
Query: 208 ADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATL 267
+ SKFCL G+EV + R+ +++Y GCVPV+I++ Y PF DVLNW FS+ +
Sbjct: 218 TKLMGQSKFCLCPSGYEVASPRVVEAIYAGCVPVLISSSYSPPFTDVLNWSQFSVEIPVE 277
Query: 268 DIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
IP +K IL+ +S ++YL LQ VL+V++HF + +D +M+++ +WLRR ++++
Sbjct: 278 KIPEIKTILQSVSPKKYLKLQMNVLRVQRHFTINRPAKPFDLMHMILHSIWLRRLNLKL 336
>gi|255567220|ref|XP_002524591.1| catalytic, putative [Ricinus communis]
gi|223536144|gb|EEF37799.1| catalytic, putative [Ricinus communis]
Length = 507
Score = 197 bits (500), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 120/339 (35%), Positives = 189/339 (55%), Gaps = 20/339 (5%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH-FVTKDPSKADLF 59
M + F++YVY P + D + Y+SE F K + T DP +A ++
Sbjct: 173 MEKQFKIYVY-EEGGPPMYH-----DGPCKSIYSSEGRFIHELEKGKLYRTLDPDEALVY 226
Query: 60 FLPFSIARMRHDRRI----GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
FLPFS+ M + T I I YI IS K+P+WNR+ GADHF ++CH G
Sbjct: 227 FLPFSVVMMVEYLYVPDSHETNAIGRAIVDYIHVISNKHPFWNRSLGADHFMLSCHDWGP 286
Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKL----GSSKRNKL 171
A + ++I+V+C+++ G KD S P+I + + L S+R+ L
Sbjct: 287 RASSYVPHLFNSSIRVLCNANTS-EGFNPSKDASFPEIHLKTGEISGLLGGVSPSRRSIL 345
Query: 172 AFFAGAVNSPVREKLLQVWRN---DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTA 228
AFFAG ++ +R+ LL+ W+N D ++Y Y L S+FCL G+EV +
Sbjct: 346 AFFAGRLHGHIRQILLEQWKNKDEDVQVYDQMPN-GVSYESMLKTSRFCLCPSGYEVASP 404
Query: 229 RIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
RI +++Y CVPV+I+++Y PF+DVLNWK+FS+ + DIP +K+IL GIS +YL +Q
Sbjct: 405 RIVEAIYTECVPVLISDNYVPPFSDVLNWKAFSVQIQVRDIPKIKEILMGISQRQYLRMQ 464
Query: 289 SYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
+ +V++HF + P +D F+M ++ +WLRR ++ +Q
Sbjct: 465 RRLKQVQRHFVVNGPPKRFDMFHMTIHSIWLRRLNIHIQ 503
>gi|334187609|ref|NP_196674.2| putative glycosyltransferase [Arabidopsis thaliana]
gi|292630779|sp|Q9LFP3.2|GLYT4_ARATH RecName: Full=Probable glycosyltransferase At5g11130
gi|332004254|gb|AED91637.1| putative glycosyltransferase [Arabidopsis thaliana]
Length = 480
Score = 196 bits (499), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 118/314 (37%), Positives = 174/314 (55%), Gaps = 22/314 (7%)
Query: 33 YASESYFKKAFMK--SHFVTKDPSKADLFFLPFSIARM-----RHDRRIGTEGIPDFISH 85
YA E F S F P +A +F++P I + R + + + +
Sbjct: 171 YAIEGQFMDEIENGNSRFKAASPEEATVFYIPVGIVNIIRFVYRPYTSYARDRLQNIVKD 230
Query: 86 YIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH 145
YI IS +YPYWNR+ GADHF+++CH E+ + I+ +C+++ G
Sbjct: 231 YISLISNRYPYWNRSRGADHFFLSCHDWAPDVSAVDPELYKHFIRALCNANS-SEGFTPM 289
Query: 146 KDVSLPQI--------WPRQEDPPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRN-DSEI 196
+DVSLP+I + +PP+ R LAFFAG + VR+ L Q W+ D ++
Sbjct: 290 RDVSLPEINIPHSQLGFVHTGEPPQ----NRKLLAFFAGGSHGDVRKILFQHWKEKDKDV 345
Query: 197 YAHSGRLKT-PYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVL 255
+ KT Y + +KFCL G+EV + RI +SLY GCVPVIIA++Y LPF+DVL
Sbjct: 346 LVYENLPKTMNYTKMMDKAKFCLCPSGWEVASPRIVESLYSGCVPVIIADYYVLPFSDVL 405
Query: 256 NWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMY 315
NWK+FS+ + +P +KKIL+ I+ EEYL +Q VL+VRKHF + YD +M+M+
Sbjct: 406 NWKTFSVHIPISKMPDIKKILEAITEEEYLNMQRRVLEVRKHFVINRPSKPYDMLHMIMH 465
Query: 316 ELWLRRSSVRVQWS 329
+WLRR +VR+ S
Sbjct: 466 SIWLRRLNVRIPLS 479
>gi|8953375|emb|CAB96648.1| putative protein [Arabidopsis thaliana]
Length = 336
Score = 196 bits (499), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 124/351 (35%), Positives = 185/351 (52%), Gaps = 38/351 (10%)
Query: 1 MNRSFRVYVY-----PHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMK--SHFVTKDP 53
M + F+++ Y P P N+ YA E F S F P
Sbjct: 1 MEKRFKIWTYREGEAPLFHKGPLNNI-----------YAIEGQFMDEIENGNSRFKAASP 49
Query: 54 SKADLFFLPFSIARM-----RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYV 108
+A +F++P I + R + + + + YI IS +YPYWNR+ GADHF++
Sbjct: 50 EEATVFYIPVGIVNIIRFVYRPYTSYARDRLQNIVKDYISLISNRYPYWNRSRGADHFFL 109
Query: 109 ACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI--------WPRQEDP 160
+CH E+ + I+ +C+++ G +DVSLP+I + +P
Sbjct: 110 SCHDWAPDVSAVDPELYKHFIRALCNANSS-EGFTPMRDVSLPEINIPHSQLGFVHTGEP 168
Query: 161 PKLGSSKRNKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKT-PYADELLGSKFCL 218
P+ R LAFFAG + VR+ L Q W+ D ++ + KT Y + +KFCL
Sbjct: 169 PQ----NRKLLAFFAGGSHGDVRKILFQHWKEKDKDVLVYENLPKTMNYTKMMDKAKFCL 224
Query: 219 HVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKG 278
G+EV + RI +SLY GCVPVIIA++Y LPF+DVLNWK+FS+ + +P +KKIL+
Sbjct: 225 CPSGWEVASPRIVESLYSGCVPVIIADYYVLPFSDVLNWKTFSVHIPISKMPDIKKILEA 284
Query: 279 ISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQWS 329
I+ EEYL +Q VL+VRKHF + YD +M+M+ +WLRR +VR+ S
Sbjct: 285 ITEEEYLNMQRRVLEVRKHFVINRPSKPYDMLHMIMHSIWLRRLNVRIPLS 335
>gi|356562965|ref|XP_003549738.1| PREDICTED: probable glycosyltransferase At3g42180-like [Glycine
max]
Length = 473
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 111/298 (37%), Positives = 176/298 (59%), Gaps = 15/298 (5%)
Query: 46 SHFVTKDPSKADLFFLPFSIARM----RHDRRIGTEGIPD----FISHYIFNISQKYPYW 97
S F P +A LF LP+S++++ RR ++ PD ++ YI ++ +YPYW
Sbjct: 176 SPFKATHPEQAHLFLLPYSVSKVIRYVYKPRRSRSDYDPDRLQRLVADYINILANRYPYW 235
Query: 98 NRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW--P 155
NR+ GADHF V+CH G E+ I+ +C+++ G ++DVS+P+++
Sbjct: 236 NRSKGADHFLVSCHDWGPRISDANPELFKYFIRALCNANTS-EGFQPNRDVSIPEVYLPS 294
Query: 156 RQEDPPKLGSSKRNK--LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYADELL 212
+ PP +G N+ LAFFAG + +R+KLL+ W+N D E+ H K +L+
Sbjct: 295 GKLGPPNMGQHPNNRTILAFFAGGAHGKIRKKLLKRWKNKDKEVQVHEYLPKGQDYTKLM 354
Query: 213 G-SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPL 271
G SKFCL G EV + R+ +++Y GCVPVII ++Y LPF DVLNW+ FS+ +A +P
Sbjct: 355 GLSKFCLCPSGHEVASPRVVEAIYAGCVPVIICDNYSLPFIDVLNWRKFSMEIAVERMPE 414
Query: 272 LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQWS 329
+K IL+ +S ++YL L S V +VR+HF + +D +M+++ LWLRR + ++ S
Sbjct: 415 IKTILQSVSKDKYLELYSNVRRVRRHFVINRPAKPFDLIHMILHSLWLRRLNFKLTAS 472
>gi|356562054|ref|XP_003549290.1| PREDICTED: probable glycosyltransferase At5g25310-like [Glycine
max]
Length = 482
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 122/345 (35%), Positives = 188/345 (54%), Gaps = 27/345 (7%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYF--KKAFMKSHFVTKDPSKADL 58
M + F+VYVYP ++ + D + Y+ E F + F T DP+ A +
Sbjct: 147 MEKIFKVYVYPD------GDLPIAHDGPCKDIYSIEGRFLHEMEHGAGRFRTNDPNAAHV 200
Query: 59 FFLPFSIARMRH----DRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIG 114
FFLPFS+ M + F+S Y+ +S ++P+WN T GADHF +ACH G
Sbjct: 201 FFLPFSVTWMVKYLYTPLSFNVTPLKKFVSDYVRVVSTRHPFWNITHGADHFMLACHDWG 260
Query: 115 RSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW-------PRQEDPPKLGSSK 167
A Q + +I+V+C+++ G KDVSLP+I P+ PP ++
Sbjct: 261 PHASQGNPFLYNTSIRVLCNANT-SEGFNPRKDVSLPEIHLYGGEVSPKLLSPPP-DTAP 318
Query: 168 RNKLAFFAGAVNSPVREKLLQVWRNDSE-----IYAHSGRLKTPYADELLGSKFCLHVKG 222
R LAFF+G ++ P+R LL W+N E +Y + + Y +L SKFCL G
Sbjct: 319 RRYLAFFSGGLHGPIRPALLGHWKNHDENDVIRVYEYLPK-DLDYYSFMLTSKFCLCPSG 377
Query: 223 FEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSE 282
EV + RI +++Y CVPVI++ +Y LPF+DVL W++FS+ V DIP LK+IL IS +
Sbjct: 378 HEVASPRIVEAIYAECVPVILSEYYVLPFSDVLQWEAFSVQVDVSDIPRLKEILSAISED 437
Query: 283 EYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
+Y L+ V VR+HF + +D F+M+++ +WLRR ++ ++
Sbjct: 438 KYRKLKEGVKAVRRHFTLNRPAKRFDVFHMILHSIWLRRLNIELR 482
>gi|297808545|ref|XP_002872156.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297317993|gb|EFH48415.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 334
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 121/341 (35%), Positives = 192/341 (56%), Gaps = 23/341 (6%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAF--MKSHFVTKDPSKADL 58
M + F+VYVY P + D + YA E F M++ F T D ++A +
Sbjct: 1 MEKRFKVYVY-EEGEPPLVH-----DGPCKSVYAVEGRFITEIEKMRTKFRTYDANQAYV 54
Query: 59 FFLPFSIA---RMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
+FLPFS+ R ++ + + F+S YI +S +P+WNRT GADHF +ACH G
Sbjct: 55 YFLPFSVTWLVRYLYEGNSDAKPLRTFVSDYIRLVSTNHPFWNRTNGADHFMLACHDWGP 114
Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI--WPRQEDPP-----KLGSSKR 168
Q ++ +I+V+C+++ G KDV+LP+I + + DP L +S R
Sbjct: 115 LTSQADNDLFNTSIRVMCNANS-SEGFNPSKDVTLPEIKLYGGEVDPKLRLSKTLSASPR 173
Query: 169 NKLAFFAGAVNSPVREKLLQVWRN---DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEV 225
L FFAG V+ PVR LL W+ D +Y + + Y D + SKFC G+EV
Sbjct: 174 PYLGFFAGGVHGPVRPILLNHWKQRDPDMPVYEYLPK-HLNYYDFMRSSKFCFCPSGYEV 232
Query: 226 NTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL 285
+ R+ +++Y C+PVI++ ++ LPF DVL W++FS++V +IP LK+IL IS E+Y
Sbjct: 233 ASPRVIEAIYSECIPVILSVNFVLPFTDVLRWETFSVLVDVSEIPRLKEILMSISDEKYE 292
Query: 286 LLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
L+ + VR+HF+ + P +DAF++ ++ +WLRR ++R+
Sbjct: 293 WLKRNLRYVRRHFELNDPPKRFDAFHLTLHSIWLRRLNLRL 333
>gi|357142838|ref|XP_003572711.1| PREDICTED: probable glycosyltransferase At5g20260-like
[Brachypodium distachyon]
Length = 484
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 118/352 (33%), Positives = 186/352 (52%), Gaps = 35/352 (9%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMK--SHFVTKDPSKADL 58
M + +++ Y P A++ D Y+ E F + + F + P +A++
Sbjct: 144 MEKRLKIWTY-SEGEPPLAHLAPGTDI-----YSIEGQFLAEMEEPLNRFAARHPDEANV 197
Query: 59 FFLPFSIARMRH-DRRIGTEG----IPDFISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
F LP S+ + H R+ T + ++ Y+ IS K+PYWNR+GGADH V+CH
Sbjct: 198 FLLPISVCNLVHYVYRLNTTAHLAPLRKLLADYVAVISDKHPYWNRSGGADHVLVSCHDW 257
Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI--------WPRQEDPPKLGS 165
+ + E++ NAI+V+C+++ G + KD +LP++ P Q P
Sbjct: 258 APLVSEGSPELRDNAIRVLCNAN-VSEGFVPRKDATLPEVNLADGVLRLPTQGLP----R 312
Query: 166 SKRNKLAFFAGAVNSPVREKLLQVW--RNDSEIYAHS---GRLKTPYADE---LLG-SKF 216
R LAFFAG + +R LL+ W R D E+ H P D+ L+G ++F
Sbjct: 313 QNRTTLAFFAGGMLGEIRRALLEQWAGREDPEMDVHEYLPPHGGGPGYDDYHALMGRARF 372
Query: 217 CLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKIL 276
CL GFEV + R+ +S++ GCVPVII+ Y LPF DVL+W S+ V IP LK IL
Sbjct: 373 CLCPSGFEVASPRVVESVFAGCVPVIISEGYPLPFGDVLDWSKMSVAVPAARIPELKAIL 432
Query: 277 KGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQW 328
+G+S Y +L++ VL+ ++HF H +D +MV++ +WLRR +VR+ +
Sbjct: 433 RGVSERRYRVLRARVLQAQRHFVLHRPARRFDMIHMVLHSIWLRRLNVRLPY 484
>gi|359484706|ref|XP_002266646.2| PREDICTED: probable glycosyltransferase At5g20260-like [Vitis
vinifera]
Length = 480
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 118/338 (34%), Positives = 186/338 (55%), Gaps = 19/338 (5%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFM--KSHFVTKDPSKADL 58
M + F+++ Y + P + D G YA E F KSHF+ + P +A+
Sbjct: 148 MEKRFKIWAY-REGDQPLMH-----DGPSNGIYAIEGQFMDDIESGKSHFLARRPDEANA 201
Query: 59 FFLPFSIARMRH----DRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIG 114
F++P S+ ++ H + IP ++ YI ++ KYPYWNR+ GADHF V+CH
Sbjct: 202 FYIPMSLTKIVHFIYEPPHYYGKWIPRLVTDYINFVADKYPYWNRSKGADHFLVSCHDWA 261
Query: 115 RSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WPRQE-DPPKLGS--SKRNK 170
++ + I+ +C+++ H +D+S+P+I PR + PP L +KR
Sbjct: 262 PDVSALKPDLYKHFIRALCNANTSERFHPI-RDISIPEINIPRGKLGPPHLDQPPNKRPI 320
Query: 171 LAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRL--KTPYADELLGSKFCLHVKGFEVNTA 228
LAFFAG + VR L + W+ + RL Y+ + SKFCL G+EV +
Sbjct: 321 LAFFAGGAHGYVRSVLFKYWKEKDDEVQVFERLPGNRNYSKSMGDSKFCLCPSGYEVASP 380
Query: 229 RIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
RI +++ GCVP+II +HY LPF+DVL+W FSI + + IP +KKILK + +E YL +Q
Sbjct: 381 RIVEAIAAGCVPMIICDHYSLPFSDVLDWSKFSIYITSDKIPEIKKILKAVPTETYLEMQ 440
Query: 289 SYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
V +V++HF + YD +M+++ +WLRR +VR+
Sbjct: 441 KRVKQVQRHFAINRPARPYDMLHMILHSVWLRRLNVRL 478
>gi|240256346|ref|NP_197913.4| putative glycosyltransferase [Arabidopsis thaliana]
gi|292630776|sp|Q3E7Q9.2|GLYT6_ARATH RecName: Full=Probable glycosyltransferase At5g25310
gi|332006042|gb|AED93425.1| putative glycosyltransferase [Arabidopsis thaliana]
Length = 480
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 120/341 (35%), Positives = 193/341 (56%), Gaps = 23/341 (6%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMK--SHFVTKDPSKADL 58
M + F+VYVY P + D + YA E F K + F T DP++A +
Sbjct: 147 MEKRFKVYVY-EEGEPPLVH-----DGPCKSVYAVEGRFITEMEKRRTKFRTYDPNQAYV 200
Query: 59 FFLPFSIA---RMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
+FLPFS+ R ++ + + F+S YI +S +P+WNRT GADHF + CH G
Sbjct: 201 YFLPFSVTWLVRYLYEGNSDAKPLKTFVSDYIRLVSTNHPFWNRTNGADHFMLTCHDWGP 260
Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI--WPRQED-----PPKLGSSKR 168
Q ++ +I+V+C+++ G KDV+LP+I + + D L +S R
Sbjct: 261 LTSQANRDLFNTSIRVMCNANS-SEGFNPTKDVTLPEIKLYGGEVDHKLRLSKTLSASPR 319
Query: 169 NKLAFFAGAVNSPVREKLLQVWRN---DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEV 225
L FFAG V+ PVR LL+ W+ D +Y + + Y D + SKFC G+EV
Sbjct: 320 PYLGFFAGGVHGPVRPILLKHWKQRDLDMPVYEYLPK-HLNYYDFMRSSKFCFCPSGYEV 378
Query: 226 NTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL 285
+ R+ +++Y C+PVI++ ++ LPF DVL W++FS++V +IP LK+IL IS+E+Y
Sbjct: 379 ASPRVIEAIYSECIPVILSVNFVLPFTDVLRWETFSVLVDVSEIPRLKEILMSISNEKYE 438
Query: 286 LLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
L+S + VR+HF+ + P +DAF++ ++ +WLRR ++++
Sbjct: 439 WLKSNLRYVRRHFELNDPPQRFDAFHLTLHSIWLRRLNLKL 479
>gi|356565155|ref|XP_003550810.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
[Glycine max]
Length = 496
Score = 195 bits (495), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 122/349 (34%), Positives = 191/349 (54%), Gaps = 35/349 (10%)
Query: 1 MNRSFRVYVY-----PHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMK----SHFVTK 51
M + F+V+VY P + P N+ YA E F S F +
Sbjct: 157 MVKRFKVWVYQEGEQPLVHDGPVNNI-----------YAIEGQFMDEIDNNDKWSQFRAR 205
Query: 52 DPSKADLFFLPFSIARMRHDRRIGTEGIPDF--------ISHYIFNISQKYPYWNRTGGA 103
P +A +FFLPFSIA + H D+ + YI I KYPYWNR+ GA
Sbjct: 206 HPEEAHVFFLPFSIANVVHYVYKPILKQSDYEPVRLQLLVEDYISVIEDKYPYWNRSKGA 265
Query: 104 DHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW--PRQEDPP 161
DHF ++CH E+ + I+ +C+++ G ++DVS+P+++ + PP
Sbjct: 266 DHFLLSCHDWAPKVSNGNPELFQSFIRALCNANTS-EGFHPNRDVSIPEVYLPVGKLGPP 324
Query: 162 KLGS--SKRNKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYADELLG-SKFC 217
LG + R LAFFAG V+ +R+ LL+ W++ D+E+ H K+ +L+G SKFC
Sbjct: 325 SLGQHPNSRTILAFFAGGVHGEIRKILLKHWKDKDNEVRVHEYLPKSQNYTKLMGQSKFC 384
Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILK 277
L G EV + R+ ++++ GCVPVII ++Y LPF+DVL+W FS+ V+ IP +K IL+
Sbjct: 385 LCPSGHEVASPRVVEAIHAGCVPVIICDNYSLPFSDVLHWSQFSVKVSVQKIPEIKSILQ 444
Query: 278 GISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
IS ++YL L VL+VR+HF + +D +M+++ +WLRR ++++
Sbjct: 445 SISRKKYLRLHMNVLRVRRHFMINRPAKPFDMMHMILHSIWLRRLNIKL 493
>gi|15239475|ref|NP_198516.1| Exostosin family protein [Arabidopsis thaliana]
gi|332006752|gb|AED94135.1| Exostosin family protein [Arabidopsis thaliana]
Length = 547
Score = 194 bits (494), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 119/320 (37%), Positives = 180/320 (56%), Gaps = 27/320 (8%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH--FVTKDPSKADL 58
M R ++YVY F + PRG YASE +F K M+S+ FV KDP KA L
Sbjct: 236 MERKLKIYVYKEGGKPIFHTPM------PRGIYASEGWFMK-LMESNKKFVVKDPRKAHL 288
Query: 59 FFLPFSIARMRHDRRIG---TEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
F++P SI +R + + + D + Y+ I+ KY +WNRTGGADHF VACH G
Sbjct: 289 FYIPISIKALRSSLGLDFQTPKSLADHLKEYVDLIAGKYKFWNRTGGADHFLVACHDWGN 348
Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPK--LG---SSKRNK 170
K + N+++ +C+S+ G D +LP + R + P LG SS+R
Sbjct: 349 KLTTKTMK---NSVRSLCNSN-VAQGFRIGTDTALPVTYIRSSEAPLEYLGGKTSSERKI 404
Query: 171 LAFFAGAVNSPVREKLLQVWRN---DSEIYAHSGR---LKTPYADELLGSKFCLHVKGFE 224
LAFFAG+++ +R L+++W N D +I+ R K Y + + S++C+ +G+E
Sbjct: 405 LAFFAGSMHGYLRPILVKLWENKEPDMKIFGPMPRDPKSKKQYREYMKSSRYCICARGYE 464
Query: 225 VNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEY 284
V+T R+ +++ CVPVIIA++Y PF +VLNW+ F++ V DIP L+ IL I + Y
Sbjct: 465 VHTPRVVEAIINECVPVIIADNYVPPFFEVLNWEEFAVFVEEKDIPNLRNILLSIPEDRY 524
Query: 285 LLLQSYVLKVRKHFQWHVFP 304
+ +Q+ V V++HF WH P
Sbjct: 525 IGMQARVKAVQQHFLWHKKP 544
>gi|414883870|tpg|DAA59884.1| TPA: hypothetical protein ZEAMMB73_690759, partial [Zea mays]
Length = 577
Score = 194 bits (494), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 133/359 (37%), Positives = 192/359 (53%), Gaps = 46/359 (12%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPV--DFEPRGNYASESYF-------KKAFMKSHFVTK 51
M R +VYVY +P PV D R Y++E F +A T+
Sbjct: 230 MERQLKVYVY--EEGEP------PVFHDGPCRSIYSTEGRFIHSMETETEAEEGRRLRTR 281
Query: 52 DPSKADLFFLPFSIARMRHDRRIGTEGIPDF------ISHYIFNISQKYPYWNRTGGADH 105
DP++A +FFLPFS+ +M I G D ++ Y+ +S KYPYWNR+ GADH
Sbjct: 282 DPARAHVFFLPFSVVKMVQT--IYEPGSRDMAPLKRTVADYVRVLSSKYPYWNRSLGADH 339
Query: 106 FYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI------WPRQED 159
F ++CH G ++ N+I+V+C+++ G +DVSLPQ+ RQ
Sbjct: 340 FMLSCHDWGPYVSSANAQLFGNSIRVLCNANT-SEGFDPARDVSLPQVNLRSDAVERQVG 398
Query: 160 PPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSEI----------YAHSGRLKTPYAD 209
P +S+R LAFFAG + PVR LL W Y G YAD
Sbjct: 399 GPS--ASRRPVLAFFAGGNHGPVRPALLAHWGPGGRRGGDPDVRVSEYLPRGGGAPSYAD 456
Query: 210 ELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIAN-HYDLPFADVLNWKSFSIVVATLD 268
+ S+FCL G+EV + R+A++LY GCVPV++ + Y LPFADVL+W +F++ + D
Sbjct: 457 MMRRSRFCLCPGGYEVASPRLAEALYLGCVPVVVDDGEYALPFADVLDWDAFALRLRVAD 516
Query: 269 IPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPS-DYDAFYMVMYELWLRRSSVRV 326
IP LK+IL +S +Y+ +Q V VR+HF H P YDAF+M+++ +WLRR +VR+
Sbjct: 517 IPRLKEILAAVSPRQYIRMQRRVRMVRRHFMLHGGPPRRYDAFHMILHSVWLRRLNVRI 575
>gi|296085067|emb|CBI28482.3| unnamed protein product [Vitis vinifera]
Length = 634
Score = 194 bits (493), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 120/342 (35%), Positives = 187/342 (54%), Gaps = 28/342 (8%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M + F++++Y F N + G +F +HF T+DP +A ++F
Sbjct: 305 MEKLFKIFIYKEGEPPLFHNGPCKSIYSIEG-----VFFSLMEGDTHFRTQDPDEAHVYF 359
Query: 61 LPFSIARMRH--------DRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHS 112
LPFS+ + H D+ + + +S Y+ ISQKY YWNR+ GADHF ++CH
Sbjct: 360 LPFSVVMIIHHLFDPIVRDKYV----MKHVVSDYVKVISQKYRYWNRSLGADHFMLSCHD 415
Query: 113 IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLG------SS 166
G A ++ N+I+++C+++ + KD S+P+I D +G S
Sbjct: 416 WGPRATWYVPQLYYNSIRLLCNANTSECFN-PRKDASIPEI--NLIDGETIGLTGGLPPS 472
Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKT--PYADELLGSKFCLHVKGFE 224
KR LAFFAG ++ +R LLQ W+ E L Y D + SK+C+ G E
Sbjct: 473 KRTILAFFAGGLHGRIRPALLQHWKEKDEQVQVYETLPEGLSYPDLMKKSKYCICPSGHE 532
Query: 225 VNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEY 284
V + RI +++Y CVPV+I+ HY LPF+DVL+W SFSI V+ +IP LKKIL GI + Y
Sbjct: 533 VASPRIVEAIYAECVPVLISQHYVLPFSDVLDWGSFSIQVSVNEIPNLKKILLGIPQDRY 592
Query: 285 LLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
+ +Q V +V++HF + P +D F+M+++ +WLRR +V +
Sbjct: 593 IRMQERVKQVQQHFVVNNPPKRFDVFHMIIHSIWLRRLNVAI 634
>gi|296084507|emb|CBI25528.3| unnamed protein product [Vitis vinifera]
Length = 862
Score = 194 bits (493), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 118/338 (34%), Positives = 186/338 (55%), Gaps = 19/338 (5%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFM--KSHFVTKDPSKADL 58
M + F+++ Y + P + D G YA E F KSHF+ + P +A+
Sbjct: 530 MEKRFKIWAY-REGDQPLMH-----DGPSNGIYAIEGQFMDDIESGKSHFLARRPDEANA 583
Query: 59 FFLPFSIARMRH----DRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIG 114
F++P S+ ++ H + IP ++ YI ++ KYPYWNR+ GADHF V+CH
Sbjct: 584 FYIPMSLTKIVHFIYEPPHYYGKWIPRLVTDYINFVADKYPYWNRSKGADHFLVSCHDWA 643
Query: 115 RSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WPRQE-DPPKLGS--SKRNK 170
++ + I+ +C+++ H +D+S+P+I PR + PP L +KR
Sbjct: 644 PDVSALKPDLYKHFIRALCNANTSERFHPI-RDISIPEINIPRGKLGPPHLDQPPNKRPI 702
Query: 171 LAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRL--KTPYADELLGSKFCLHVKGFEVNTA 228
LAFFAG + VR L + W+ + RL Y+ + SKFCL G+EV +
Sbjct: 703 LAFFAGGAHGYVRSVLFKYWKEKDDEVQVFERLPGNRNYSKSMGDSKFCLCPSGYEVASP 762
Query: 229 RIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
RI +++ GCVP+II +HY LPF+DVL+W FSI + + IP +KKILK + +E YL +Q
Sbjct: 763 RIVEAIAAGCVPMIICDHYSLPFSDVLDWSKFSIYITSDKIPEIKKILKAVPTETYLEMQ 822
Query: 289 SYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
V +V++HF + YD +M+++ +WLRR +VR+
Sbjct: 823 KRVKQVQRHFAINRPARPYDMLHMILHSVWLRRLNVRL 860
>gi|359485860|ref|XP_002264111.2| PREDICTED: probable glycosyltransferase At3g07620-like [Vitis
vinifera]
Length = 410
Score = 194 bits (493), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 120/342 (35%), Positives = 187/342 (54%), Gaps = 28/342 (8%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M + F++++Y F N + G +F +HF T+DP +A ++F
Sbjct: 81 MEKLFKIFIYKEGEPPLFHNGPCKSIYSIEG-----VFFSLMEGDTHFRTQDPDEAHVYF 135
Query: 61 LPFSIARMRH--------DRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHS 112
LPFS+ + H D+ + + +S Y+ ISQKY YWNR+ GADHF ++CH
Sbjct: 136 LPFSVVMIIHHLFDPIVRDKYV----MKHVVSDYVKVISQKYRYWNRSLGADHFMLSCHD 191
Query: 113 IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLG------SS 166
G A ++ N+I+++C+++ + KD S+P+I D +G S
Sbjct: 192 WGPRATWYVPQLYYNSIRLLCNANTSECFN-PRKDASIPEI--NLIDGETIGLTGGLPPS 248
Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKT--PYADELLGSKFCLHVKGFE 224
KR LAFFAG ++ +R LLQ W+ E L Y D + SK+C+ G E
Sbjct: 249 KRTILAFFAGGLHGRIRPALLQHWKEKDEQVQVYETLPEGLSYPDLMKKSKYCICPSGHE 308
Query: 225 VNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEY 284
V + RI +++Y CVPV+I+ HY LPF+DVL+W SFSI V+ +IP LKKIL GI + Y
Sbjct: 309 VASPRIVEAIYAECVPVLISQHYVLPFSDVLDWGSFSIQVSVNEIPNLKKILLGIPQDRY 368
Query: 285 LLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
+ +Q V +V++HF + P +D F+M+++ +WLRR +V +
Sbjct: 369 IRMQERVKQVQQHFVVNNPPKRFDVFHMIIHSIWLRRLNVAI 410
>gi|357462311|ref|XP_003601437.1| hypothetical protein MTR_3g080690 [Medicago truncatula]
gi|355490485|gb|AES71688.1| hypothetical protein MTR_3g080690 [Medicago truncatula]
Length = 450
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 125/344 (36%), Positives = 189/344 (54%), Gaps = 30/344 (8%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGN-YASESYFKKAFMKSH-FVTKDPSKADL 58
M + F+++VY F + P N Y+ E F + F T++P++A +
Sbjct: 120 MEKLFKIFVYEEGEPPLF-------HYGPCKNIYSMEGIFINLLENNTLFRTQNPNEAHV 172
Query: 59 FFLPFSIARMRH--------DRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVAC 110
+FLPFS+ + D+ + I D Y+ IS KY YWNR+ GADHF ++C
Sbjct: 173 YFLPFSVVMILEHLFHPVIRDKAVLGRTIGD----YVHIISHKYAYWNRSYGADHFMLSC 228
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHI-AHKDVSLPQI-WPRQEDPPKLGSS-- 166
H G A E+ AI+V+C+++ IS H KD S P+I E LG
Sbjct: 229 HDWGPRATWYVKELYFIAIRVLCNAN--ISEHFNPKKDASFPEINLVSGETTGLLGGYPT 286
Query: 167 -KRNKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGR-LKTPYADELLGSKFCLHVKGF 223
R LAFFAG +N +R L Q W+N D ++ + K Y + + SK+C+ G+
Sbjct: 287 WNRTILAFFAGQMNGRIRPVLFQHWKNKDKDVLVYEKLPEKISYHETMKMSKYCICPSGW 346
Query: 224 EVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEE 283
EV + RI +++Y CVP++I+ Y LPF+DVLNW SFS+ + +IP LK+IL GIS E+
Sbjct: 347 EVASPRIVEAIYAECVPILISQQYVLPFSDVLNWDSFSVQIEVSEIPKLKEILLGISEEK 406
Query: 284 YLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
Y+ LQ V +V++HF + P YD F+M+++ +WLRR +V V+
Sbjct: 407 YMRLQEGVKQVQRHFVVNNPPKKYDVFHMIIHSIWLRRLNVLVK 450
>gi|359484710|ref|XP_002266490.2| PREDICTED: probable glycosyltransferase At5g20260 [Vitis vinifera]
Length = 480
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 111/307 (36%), Positives = 177/307 (57%), Gaps = 15/307 (4%)
Query: 33 YASESYFKKAFM--KSHFVTKDPSKADLFFLPFSIARMRH----DRRIGTEGIPDFISHY 86
YA E F KS F+ + P +A+ F++P S+ R+ H + IP ++ Y
Sbjct: 174 YAIEGQFMDEIESGKSQFLARHPDEANAFYIPMSLTRVVHFIYEPPHYHGKWIPRLVTDY 233
Query: 87 IFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHK 146
I ++ KYPYWNR+ GADHF V+CH ++ + I+ +C+++ H +
Sbjct: 234 INFVADKYPYWNRSKGADHFLVSCHDWAPDVSALKPDLYKHFIRALCNANTSERFHPI-R 292
Query: 147 DVSLPQI-WPRQE-DPPKLGS--SKRNKLAFFAGAVNSPVREKLLQVWR---NDSEIYAH 199
D+S+P+I PR + PP L +KR LAFFAG + VR L + W+ ++ +++
Sbjct: 293 DISIPEINIPRGKLGPPHLDQPPNKRPILAFFAGGAHGYVRSVLFKYWKEKDDEVQVFER 352
Query: 200 SGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKS 259
R + Y+ + SKFCL G+EV + RI +++ GCVP+II +HY LPF+DVL+W
Sbjct: 353 LPRNRN-YSKSMGDSKFCLCPSGYEVASPRIVEAIAAGCVPMIICDHYSLPFSDVLDWSK 411
Query: 260 FSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWL 319
FSI + + IP +KKILK + +E YL +Q V +V++HF + YD +M+++ +WL
Sbjct: 412 FSIYITSDKIPEIKKILKAVPTETYLEMQKRVKQVQRHFAINRPARPYDMLHMILHSVWL 471
Query: 320 RRSSVRV 326
RR +VR+
Sbjct: 472 RRLNVRL 478
>gi|255548866|ref|XP_002515489.1| catalytic, putative [Ricinus communis]
gi|223545433|gb|EEF46938.1| catalytic, putative [Ricinus communis]
Length = 481
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 121/343 (35%), Positives = 191/343 (55%), Gaps = 27/343 (7%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPV--DFEPRGNYASESYF--KKAFMKSHFVTKDPSKA 56
M R F+VYVY DP P+ D + Y E F + + T+DP +A
Sbjct: 148 MERRFKVYVY--SEGDP------PIVHDGPCKDIYTIEGRFIHEMEHGARRYRTRDPKRA 199
Query: 57 DLFFLPFSIARMRHD--RRIGTEGIP--DFISHYIFNISQKYPYWNRTGGADHFYVACHS 112
++F+PFS+ M + + + P F++ Y+ +S KYP+WNRT GADHF +ACH
Sbjct: 200 HVYFMPFSVTWMVKYLYKPLTYDHSPLRQFVADYVRVLSTKYPFWNRTHGADHFMLACHD 259
Query: 113 IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW------PRQEDPPKLGSS 166
G A + + +I+V+C+++ G KDVSLP+I P Q P ++
Sbjct: 260 WGPHASRGDHLLYNTSIRVLCNANT-SEGFNPRKDVSLPEIHLYGGNVPPQLLSPPPANT 318
Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRN---DSEIYAHSGRLKTPYADELLGSKFCLHVKGF 223
R LAFFAG ++ P+R LL+ W++ D ++ + + Y +L SKFCL G
Sbjct: 319 TRPHLAFFAGGLHGPIRPLLLKHWKDRESDLRVFEYLPK-HLDYYSFMLRSKFCLCPSGH 377
Query: 224 EVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEE 283
EV + RI +S+Y CVPVI+++HY LPF+DVL W +FSI + +IP L+++L+ + E+
Sbjct: 378 EVASPRIVESIYAECVPVILSDHYVLPFSDVLRWDAFSIQLNVSEIPRLEEVLRSVPEEK 437
Query: 284 YLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
Y L+ + VR HF + +D F+M+++ +WLRR ++R+
Sbjct: 438 YERLKEGLRTVRTHFMLNQPAKRFDVFHMILHSIWLRRLNLRL 480
>gi|449443279|ref|XP_004139407.1| PREDICTED: probable glycosyltransferase At3g42180-like [Cucumis
sativus]
gi|449494024|ref|XP_004159425.1| PREDICTED: probable glycosyltransferase At3g42180-like [Cucumis
sativus]
Length = 447
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 115/305 (37%), Positives = 170/305 (55%), Gaps = 17/305 (5%)
Query: 33 YASESYFKKAF--MKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDF-------- 82
YA E F KS F P +A +F LP SI + H D+
Sbjct: 137 YAIEGQFIDELDCSKSPFRASHPDEAHVFLLPLSITNIIHFIYRPITSPADYNRDRMHRV 196
Query: 83 ISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGH 142
+ YI ++ +YPYWNR+ GADHF V+CH ++ N I+VVC+++ G
Sbjct: 197 TTDYIRVVANRYPYWNRSNGADHFVVSCHDWAPEISDANPQLFKNFIRVVCNAN-ITEGF 255
Query: 143 IAHKDVSLPQI--WPRQEDPPKLGS--SKRNKLAFFAGAVNSPVREKLLQVWRN-DSEIY 197
+ D+ LP+I P PP LG +R LAFFAG + +R+ L++ W+ D+E+
Sbjct: 256 RPNIDIPLPEINIHPGTLGPPDLGQPPERRPILAFFAGGAHGYIRKILIKHWKEKDNEVQ 315
Query: 198 AHSGRLKTPYADELLG-SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLN 256
H KT +L+G SKFCL G+EV + R+ +++Y GCVPVII+++Y LPF+DVL+
Sbjct: 316 VHEYLPKTQNYTKLIGESKFCLCPSGYEVASPRVVEAIYGGCVPVIISDNYSLPFSDVLD 375
Query: 257 WKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYE 316
W FS+ + IP +K ILK IS E+YL L V+KV++HF+ + +D +M+++
Sbjct: 376 WSRFSVQIPVQRIPEIKTILKAISEEKYLKLYKGVIKVKRHFKINRPAKPFDVIHMLLHS 435
Query: 317 LWLRR 321
LWLRR
Sbjct: 436 LWLRR 440
>gi|296084509|emb|CBI25530.3| unnamed protein product [Vitis vinifera]
Length = 387
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 111/307 (36%), Positives = 177/307 (57%), Gaps = 15/307 (4%)
Query: 33 YASESYFKKAFM--KSHFVTKDPSKADLFFLPFSIARMRH----DRRIGTEGIPDFISHY 86
YA E F KS F+ + P +A+ F++P S+ R+ H + IP ++ Y
Sbjct: 81 YAIEGQFMDEIESGKSQFLARHPDEANAFYIPMSLTRVVHFIYEPPHYHGKWIPRLVTDY 140
Query: 87 IFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHK 146
I ++ KYPYWNR+ GADHF V+CH ++ + I+ +C+++ H +
Sbjct: 141 INFVADKYPYWNRSKGADHFLVSCHDWAPDVSALKPDLYKHFIRALCNANTSERFHPI-R 199
Query: 147 DVSLPQI-WPRQE-DPPKLGS--SKRNKLAFFAGAVNSPVREKLLQVWR---NDSEIYAH 199
D+S+P+I PR + PP L +KR LAFFAG + VR L + W+ ++ +++
Sbjct: 200 DISIPEINIPRGKLGPPHLDQPPNKRPILAFFAGGAHGYVRSVLFKYWKEKDDEVQVFER 259
Query: 200 SGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKS 259
R + Y+ + SKFCL G+EV + RI +++ GCVP+II +HY LPF+DVL+W
Sbjct: 260 LPRNRN-YSKSMGDSKFCLCPSGYEVASPRIVEAIAAGCVPMIICDHYSLPFSDVLDWSK 318
Query: 260 FSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWL 319
FSI + + IP +KKILK + +E YL +Q V +V++HF + YD +M+++ +WL
Sbjct: 319 FSIYITSDKIPEIKKILKAVPTETYLEMQKRVKQVQRHFAINRPARPYDMLHMILHSVWL 378
Query: 320 RRSSVRV 326
RR +VR+
Sbjct: 379 RRLNVRL 385
>gi|242043198|ref|XP_002459470.1| hypothetical protein SORBIDRAFT_02g005180 [Sorghum bicolor]
gi|241922847|gb|EER95991.1| hypothetical protein SORBIDRAFT_02g005180 [Sorghum bicolor]
Length = 584
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 135/358 (37%), Positives = 194/358 (54%), Gaps = 45/358 (12%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPV--DFEPRGNYASESYFKKAF-MKSHFVTKDPSKAD 57
M + +VYVY +P PV D R Y++E F A + T DPS+A
Sbjct: 234 MEKQLKVYVY--EEGEP------PVFHDGPCRSIYSTEGRFIHAMETATRLRTSDPSQAH 285
Query: 58 LFFLPFSIARMRHDRRIGTEGIPDF------ISHYIFNISQKYPYWNRTGGADHFYVACH 111
+FFLPFS+ +M + I G D ++ Y+ IS KYP+WNR+ GADHF ++CH
Sbjct: 286 VFFLPFSVVKMV--KTIYEPGSHDMAPLKRTVADYLRVISDKYPFWNRSAGADHFMLSCH 343
Query: 112 SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI------WPRQEDPPKLGS 165
G E+ N+I+V+C+++ +A KDVSLP+I RQ P +
Sbjct: 344 DWGPYVSSANAELFGNSIRVLCNANTSEGFDLA-KDVSLPEINLRSDAVERQVGGPS--A 400
Query: 166 SKRNKLAFFAGAVNSPVREKLLQVW-----RNDSEIYAHSGRLKTP----------YADE 210
S+R LAFFAG + PVR LL W R D S L T Y D
Sbjct: 401 SRRPFLAFFAGGNHGPVRPALLAHWGPGSGREDDPDVRVSEYLPTRGGRAGASAAAYTDM 460
Query: 211 LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIAN-HYDLPFADVLNWKSFSIVVATLDI 269
+ S+FCL G+EV + R+A+++Y CVPV++ + Y LPFADVLNW +F++ V D+
Sbjct: 461 MRRSRFCLCPGGYEVASPRLAEAVYLECVPVVVDDGEYALPFADVLNWDAFAVRVRVADV 520
Query: 270 PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPS-DYDAFYMVMYELWLRRSSVRV 326
P +K+IL +S +Y+ +Q V VR+HF H P YDAF+M+++ +WLRR +VR+
Sbjct: 521 PRIKEILSAVSPRQYIRMQRRVRMVRRHFMVHGGPPRRYDAFHMILHSVWLRRLNVRI 578
>gi|356495238|ref|XP_003516486.1| PREDICTED: probable glycosyltransferase At3g07620-like [Glycine
max]
Length = 610
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 125/345 (36%), Positives = 187/345 (54%), Gaps = 43/345 (12%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEP--RGNYASESYFKKAFM-KSHFVTKDPSKAD 57
M R +V++Y P+ +P RG YASE +F K F+ KDP KA
Sbjct: 282 MERKLKVFIYREGAK--------PIFHQPKMRGIYASEGWFMKLMEGNKRFIVKDPRKAH 333
Query: 58 LFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHS----I 113
LF+LPFS ++M + + + Y+ I+ +Y +WNRT GADHF VACH I
Sbjct: 334 LFYLPFS-SQMLRVTLSNPKQMEQHLEKYVELIAGRYRFWNRTDGADHFLVACHDWASRI 392
Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW------PRQE---DPPKLG 164
R M+ I+ +C+S+ G KD +LP + P +E PP
Sbjct: 393 TRQPMK-------GCIRSLCNSN-VAKGFQIGKDTTLPVTYIHSVMDPLKECAGKPP--- 441
Query: 165 SSKRNKLAFFAGAVNSPVREKLLQVWRN---DSEIYAHSGR---LKTPYADELLGSKFCL 218
S+R+ LAFFAG+++ +R LL+ W N D +I+ R K Y + + SK+C+
Sbjct: 442 -SERSALAFFAGSMHGYLRPILLKHWANKEPDMKIFGPMPRDLEGKKMYMEYMNSSKYCI 500
Query: 219 HVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKG 278
+G+EV+T RI ++++ GCVPVII+++Y P +VL W++FS+ V D+P L+ IL
Sbjct: 501 CARGYEVHTPRIIEAIFSGCVPVIISDNYVPPLFEVLKWEAFSLFVRERDVPSLRDILLS 560
Query: 279 ISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSS 323
I E+YL L V KV++HF WH P YD F+M+++ +W R S
Sbjct: 561 IPEEKYLALHLGVKKVQQHFLWHKVPVKYDLFHMILHAIWKNRLS 605
>gi|224143183|ref|XP_002324873.1| predicted protein [Populus trichocarpa]
gi|222866307|gb|EEF03438.1| predicted protein [Populus trichocarpa]
Length = 335
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 112/295 (37%), Positives = 171/295 (57%), Gaps = 15/295 (5%)
Query: 45 KSHFVTKDPSKADLFFLPFSIARMRH-----DRRIGTEGIPDFISHYIFNISQKYPYWNR 99
KS F K+P +A FF+P SIA + H + I D YI IS KYPYWNR
Sbjct: 41 KSPFSAKNPDEALAFFIPVSIASILHFIYRPYVTYSRKQIQDIAEDYIGLISSKYPYWNR 100
Query: 100 TGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WPRQE 158
+ GADHF ++CH ++ N I+V+C+++ G +DVSLP+ PR +
Sbjct: 101 SSGADHFMISCHDWAPDVSAANPDLYRNFIRVLCNANSS-EGFKPARDVSLPEFKLPRGK 159
Query: 159 DPP----KLGSSKRNKLAFFAGAVNSPVREKLLQVWR---NDSEIYAHSGRLKTPYADEL 211
P + + R+ LAFFAG + VR+ L + W+ ND ++Y + Y +++
Sbjct: 160 LEPEHILQPCDNNRSILAFFAGGSHGSVRKILFKHWKEKDNDIQVYKYLPETLN-YTEQM 218
Query: 212 LGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPL 271
S++CL G+EV + R+ +++Y GCVPVII+++Y LPF+DVL+W FS+ + IP
Sbjct: 219 SKSRYCLCPSGWEVASPRVVEAIYSGCVPVIISDYYVLPFSDVLDWIKFSVHIPVSGIPE 278
Query: 272 LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
+K IL+ I EEYL Q VL+V++HF+ H +D +MVM+ +WLRR ++R+
Sbjct: 279 IKTILQSIPVEEYLEKQKRVLQVQQHFKLHRPAKPFDVVHMVMHSVWLRRLNIRL 333
>gi|147775379|emb|CAN73458.1| hypothetical protein VITISV_022180 [Vitis vinifera]
Length = 333
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 116/342 (33%), Positives = 184/342 (53%), Gaps = 27/342 (7%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFM--KSHFVTKDPSKADL 58
M + F+++ Y + P + D G YA E F KSHF+ + P +A+
Sbjct: 1 MEKRFKIWAY-REGDQPLMH-----DGPSNGIYAIEGQFMDDIESGKSHFLARRPDEANA 54
Query: 59 FFLPFSIARMRH----DRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIG 114
F++P S+ ++ H + IP ++ YI ++ KYPYWNR+ GADHF V+CH
Sbjct: 55 FYIPMSLTKIVHFIYEPPHYYGKWIPRLVTDYINFVADKYPYWNRSKGADHFLVSCHDWA 114
Query: 115 RSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI--------WPRQEDPPKLGSS 166
++ + I+ +C+++ H +D+S+P+I P + PP +
Sbjct: 115 PDVSALKPDLYKHFIRALCNANTSERFHPI-RDISIPEINIPXGKLGPPHLDQPP----N 169
Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRL--KTPYADELLGSKFCLHVKGFE 224
KR LAFFAG + VR L + W+ + RL Y+ + SKFCL G+E
Sbjct: 170 KRPILAFFAGGAHGYVRSVLFKYWKEKDDEVQVFERLPGNRNYSKSMGDSKFCLCPSGYE 229
Query: 225 VNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEY 284
V + RI +++ GCVP+II +HY LPF+DVL+W FSI + + IP +KKILK + +E Y
Sbjct: 230 VASPRIVEAIAAGCVPMIICDHYSLPFSDVLDWSKFSIYITSDKIPEIKKILKAVPTETY 289
Query: 285 LLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
L +Q V +V++HF + YD +M+++ +WLRR +VR+
Sbjct: 290 LEMQKRVKQVQRHFAINRPARPYDMLHMILHSVWLRRLNVRL 331
>gi|297807149|ref|XP_002871458.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297317295|gb|EFH47717.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 336
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 124/347 (35%), Positives = 185/347 (53%), Gaps = 30/347 (8%)
Query: 1 MNRSFRVYVY-----PHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMK--SHFVTKDP 53
M + F+++ Y P P N+ YA E F S + P
Sbjct: 1 MEKRFKIWTYREGEAPLFHKGPLNNI-----------YAIEGQFMDEIENGNSRYKAASP 49
Query: 54 SKADLFFLPFSIARM-----RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYV 108
+A +F++P I + R + + + + YI IS +YPYWNR+ GADHF++
Sbjct: 50 EEATVFYIPVGIVNIIRFVYRPYTSYARDRLQNIVKDYISLISNRYPYWNRSRGADHFFL 109
Query: 109 ACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WPR-QEDPPKLGSS 166
+CH E+ + I+ +C+++ G +DVSLP+I P Q G +
Sbjct: 110 SCHDWAPDVSAVDPELYRHFIRALCNANAS-EGFTPMRDVSLPEINIPHSQLGFVHTGEA 168
Query: 167 KRNK--LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKT-PYADELLGSKFCLHVKG 222
+N+ LAFFAG + VR+ L + W+ D ++ + KT Y + +KFCL G
Sbjct: 169 PQNRKLLAFFAGGSHGEVRKILFEQWKEKDKDVLVYEYLPKTMNYTKMMDKAKFCLCPSG 228
Query: 223 FEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSE 282
+EV + RI +SLY GCVPVIIA+ Y LPF+DVLNWK+FS+ + +P +KKIL+ IS E
Sbjct: 229 WEVASPRIVESLYSGCVPVIIADSYVLPFSDVLNWKTFSVHIPISKMPDIKKILEAISEE 288
Query: 283 EYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQWS 329
EYL +Q VL+VRKHF + YD +M+M+ +WLRR +VR+ S
Sbjct: 289 EYLEMQRRVLEVRKHFVINRPSKPYDMLHMIMHSIWLRRLNVRIPLS 335
>gi|356546126|ref|XP_003541482.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
[Glycine max]
Length = 490
Score = 191 bits (485), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 120/349 (34%), Positives = 185/349 (53%), Gaps = 35/349 (10%)
Query: 1 MNRSFRVYVY-----PHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMK----SHFVTK 51
M + F+V+VY P + P N+ YA E F S F +
Sbjct: 151 MVKRFKVWVYQEGEQPLVHDGPVNNI-----------YAIEGQFMDEMDNNGKWSQFRAR 199
Query: 52 DPSKADLFFLPFSIARMRH--------DRRIGTEGIPDFISHYIFNISQKYPYWNRTGGA 103
P +A +FFLP SIA + H + + YI I KYPYWNR+ GA
Sbjct: 200 HPEEAHVFFLPISIANVVHYVYKPILKQSDYEPVRLQHLVEDYIGVIQDKYPYWNRSIGA 259
Query: 104 DHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW--PRQEDPP 161
DHF ++CH G E+ I+ +C+++ H ++DVS+P+++ + P
Sbjct: 260 DHFLLSCHDWGPKVSYGNPELFQTFIRALCNANTSEGFH-PNRDVSIPEVYLPVGKLGPA 318
Query: 162 KLGS--SKRNKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYADELLG-SKFC 217
LG + R LAFFAG V+ +R+ LL+ W++ D+E+ H K +L+G SKFC
Sbjct: 319 SLGQHPNSRTTLAFFAGGVHGEIRKILLKHWKDKDNEVLVHEYLPKGQDYTKLMGQSKFC 378
Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILK 277
L G EV + R+ ++++ GCVPVII ++Y LPF+DVLNW FS+ + IP +K IL+
Sbjct: 379 LCPSGHEVASPRVVEAIHAGCVPVIICDNYSLPFSDVLNWSQFSVEIPVEKIPEIKSILQ 438
Query: 278 GISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
IS +YL L VL+VR+HF + +D +M+++ +WLRR ++++
Sbjct: 439 SISRNKYLRLHMNVLRVRRHFMINRPTKPFDMMHMILHSIWLRRLNIKL 487
>gi|296082060|emb|CBI21065.3| unnamed protein product [Vitis vinifera]
Length = 402
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 119/343 (34%), Positives = 182/343 (53%), Gaps = 25/343 (7%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M + FRV+ Y F + + + + G + E K F + P +A LF+
Sbjct: 66 MEKRFRVWTYQEGEKPLFHSGPMNLIYGIEGQFMDELESKNC----PFSARHPDEAMLFY 121
Query: 61 LPFSIARM-----------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVA 109
+P S+ + R+ +G+ ++ YI IS+KYP+WNR+ GADHF V+
Sbjct: 122 IPISVVNIIRYVYQPYTNSASYSRVRLQGL---VADYIDVISRKYPFWNRSSGADHFMVS 178
Query: 110 CHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWP--RQEDPPKLGSSK 167
CH + + I+V+C+++ G +DVSLP+I R PP LG
Sbjct: 179 CHDWAPEISAANPKFFRHFIRVLCNANT-SEGFKPVRDVSLPEILVPYRMLGPPYLGQPP 237
Query: 168 RNK--LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYADELLG-SKFCLHVKGF 223
N+ LAFFAG + VR L W+ D +I H T EL+G SKFCL GF
Sbjct: 238 TNRSILAFFAGGAHGKVRSILFHYWKEKDEDIQVHEYLPTTLNYTELMGRSKFCLCPSGF 297
Query: 224 EVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEE 283
EV + R+ +S+Y GCVPVII+++Y LPF+DVL+W FS+ + IP K IL+ I +E
Sbjct: 298 EVASPRVVESIYAGCVPVIISDNYSLPFSDVLDWSQFSVHIPIARIPETKTILQAIPIQE 357
Query: 284 YLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
YL Q V++V++HF + +D +MV++ +WLRR ++++
Sbjct: 358 YLTKQKTVMQVQRHFTLNRPAKRFDVLHMVLHSIWLRRINIQL 400
>gi|359476148|ref|XP_002285239.2| PREDICTED: probable glycosyltransferase At5g11130 [Vitis vinifera]
Length = 472
Score = 191 bits (484), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 119/343 (34%), Positives = 182/343 (53%), Gaps = 25/343 (7%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M + FRV+ Y F + + + + G + E K F + P +A LF+
Sbjct: 136 MEKRFRVWTYQEGEKPLFHSGPMNLIYGIEGQFMDELESKNC----PFSARHPDEAMLFY 191
Query: 61 LPFSIARM-----------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVA 109
+P S+ + R+ +G+ ++ YI IS+KYP+WNR+ GADHF V+
Sbjct: 192 IPISVVNIIRYVYQPYTNSASYSRVRLQGL---VADYIDVISRKYPFWNRSSGADHFMVS 248
Query: 110 CHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWP--RQEDPPKLGSSK 167
CH + + I+V+C+++ G +DVSLP+I R PP LG
Sbjct: 249 CHDWAPEISAANPKFFRHFIRVLCNANT-SEGFKPVRDVSLPEILVPYRMLGPPYLGQPP 307
Query: 168 RNK--LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYADELLG-SKFCLHVKGF 223
N+ LAFFAG + VR L W+ D +I H T EL+G SKFCL GF
Sbjct: 308 TNRSILAFFAGGAHGKVRSILFHYWKEKDEDIQVHEYLPTTLNYTELMGRSKFCLCPSGF 367
Query: 224 EVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEE 283
EV + R+ +S+Y GCVPVII+++Y LPF+DVL+W FS+ + IP K IL+ I +E
Sbjct: 368 EVASPRVVESIYAGCVPVIISDNYSLPFSDVLDWSQFSVHIPIARIPETKTILQAIPIQE 427
Query: 284 YLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
YL Q V++V++HF + +D +MV++ +WLRR ++++
Sbjct: 428 YLTKQKTVMQVQRHFTLNRPAKRFDVLHMVLHSIWLRRINIQL 470
>gi|296085068|emb|CBI28483.3| unnamed protein product [Vitis vinifera]
Length = 416
Score = 190 bits (483), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 120/344 (34%), Positives = 185/344 (53%), Gaps = 30/344 (8%)
Query: 1 MNRSFRVYVY-----PHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMK-SHFVTKDPS 54
M + F++YVY P N P ++ Y++E F K S + T DP
Sbjct: 85 MEKLFKIYVYEEGEPPMFHNGPCKSI-----------YSTEGRFIHEMEKGSVYRTTDPD 133
Query: 55 KADLFFLPFSIARMRHDRRI----GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVAC 110
+A L+FLPFS+ M + I + YI IS +P+WNR+ GADHF ++C
Sbjct: 134 QALLYFLPFSVVMMVQYLYVPDSHEIHAIEKTVIDYINLISHNHPFWNRSLGADHFMLSC 193
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPK----LGSS 166
H G A + N+I+V+C+++ G KDVS P+I R + L S
Sbjct: 194 HDWGPRASTSVPYLYNNSIRVLCNANT-SEGFNPSKDVSFPEIHLRTGEMSGPLGGLSPS 252
Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRN---DSEIYAHSGRLKTPYADELLGSKFCLHVKGF 223
+R L FFAG ++ +R LL+ W++ D ++Y Y L S+FCL G+
Sbjct: 253 RRPILGFFAGRLHGHIRYLLLEQWKDKDKDLQVYDQLPN-GLSYDSMLKKSRFCLCPSGY 311
Query: 224 EVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEE 283
EV + R+ +++Y CVPV+I+++Y PF DVLNWKSF++ V DI +K+IL GIS +
Sbjct: 312 EVASPRVVEAIYAECVPVLISDNYVPPFNDVLNWKSFAVQVQVRDIANIKRILMGISQTQ 371
Query: 284 YLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
YL + V +V++HF + P +D F+M ++ +WLRR ++R+Q
Sbjct: 372 YLRMYRRVKQVQRHFMVNAAPQRFDVFHMTIHSIWLRRLNIRIQ 415
>gi|359485862|ref|XP_002264076.2| PREDICTED: probable glycosyltransferase At5g03795-like [Vitis
vinifera]
Length = 484
Score = 190 bits (483), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 120/344 (34%), Positives = 185/344 (53%), Gaps = 30/344 (8%)
Query: 1 MNRSFRVYVY-----PHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMK-SHFVTKDPS 54
M + F++YVY P N P ++ Y++E F K S + T DP
Sbjct: 153 MEKLFKIYVYEEGEPPMFHNGPCKSI-----------YSTEGRFIHEMEKGSVYRTTDPD 201
Query: 55 KADLFFLPFSIARMRHDRRI----GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVAC 110
+A L+FLPFS+ M + I + YI IS +P+WNR+ GADHF ++C
Sbjct: 202 QALLYFLPFSVVMMVQYLYVPDSHEIHAIEKTVIDYINLISHNHPFWNRSLGADHFMLSC 261
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPK----LGSS 166
H G A + N+I+V+C+++ G KDVS P+I R + L S
Sbjct: 262 HDWGPRASTSVPYLYNNSIRVLCNANTS-EGFNPSKDVSFPEIHLRTGEMSGPLGGLSPS 320
Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRN---DSEIYAHSGRLKTPYADELLGSKFCLHVKGF 223
+R L FFAG ++ +R LL+ W++ D ++Y Y L S+FCL G+
Sbjct: 321 RRPILGFFAGRLHGHIRYLLLEQWKDKDKDLQVYDQLPN-GLSYDSMLKKSRFCLCPSGY 379
Query: 224 EVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEE 283
EV + R+ +++Y CVPV+I+++Y PF DVLNWKSF++ V DI +K+IL GIS +
Sbjct: 380 EVASPRVVEAIYAECVPVLISDNYVPPFNDVLNWKSFAVQVQVRDIANIKRILMGISQTQ 439
Query: 284 YLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
YL + V +V++HF + P +D F+M ++ +WLRR ++R+Q
Sbjct: 440 YLRMYRRVKQVQRHFMVNAAPQRFDVFHMTIHSIWLRRLNIRIQ 483
>gi|449511061|ref|XP_004163851.1| PREDICTED: probable glycosyltransferase At3g07620-like [Cucumis
sativus]
Length = 341
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 115/336 (34%), Positives = 181/336 (53%), Gaps = 25/336 (7%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH-FVTKDPSKADLF 59
M + +VYVY F +L G YASE +F K + FV KDP KA LF
Sbjct: 9 MEKMLKVYVYDEGEKPIFHQPIL------TGIYASEGWFMKLLEDNKKFVVKDPEKAHLF 62
Query: 60 FLPFSIARMRH---DRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRS 116
+LPFS +R ++ + + +YI I +KY +WN+ GG+DHF VACH
Sbjct: 63 YLPFSSQFLRSAFGNKFRNKRDLQKPLKNYIDVIGKKYRFWNKNGGSDHFLVACHDWAPK 122
Query: 117 AMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDP--PKLGS---SKRNKL 171
++ + N I+ +C+++ I KD SLP + + K+G S+R L
Sbjct: 123 LTKRLVK---NCIRALCNANGAGDFEIG-KDTSLPVTFVHSTEDLITKIGGKPPSERTTL 178
Query: 172 AFFAGAVNSPVREKLLQVWRNDSEIYAHSGRL------KTPYADELLGSKFCLHVKGFEV 225
AFFAG+++ +R LL W N G + K Y +++ SK+C+ +G++V
Sbjct: 179 AFFAGSMHGYLRPILLHYWENKEPDMMIVGPMPNSIEGKNAYMEQMKSSKYCICARGYQV 238
Query: 226 NTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL 285
++ R+ +++ C+PVII+++Y P +VLNW+SFS+ V +IP L+ IL I E Y
Sbjct: 239 HSPRVIEAILNECIPVIISDNYVPPLFEVLNWESFSVFVKEREIPYLRDILLSIPEENYR 298
Query: 286 LLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
++ S V V++HF WH P+ YDAF+M+++ +W R
Sbjct: 299 VMHSRVKMVQQHFLWHEKPAKYDAFHMILHSIWYTR 334
>gi|449465629|ref|XP_004150530.1| PREDICTED: probable glycosyltransferase At5g20260-like [Cucumis
sativus]
Length = 299
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 109/298 (36%), Positives = 172/298 (57%), Gaps = 21/298 (7%)
Query: 45 KSHFVTKDPSKADLFFLPFSIARM-----RHDRRIGTEGIPDFISHYIFNISQKYPYWNR 99
KS F +P +A +FFLP SI + + + + + Y+ ++ KYPYWNR
Sbjct: 5 KSPFSAHEPEEAQVFFLPISIVYIVDYIYKPITTYARDRLVRIFTDYVRVVANKYPYWNR 64
Query: 100 TGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI------ 153
T GADHF V+CH ++ + I+V+C+++ G +D SLP+I
Sbjct: 65 TRGADHFMVSCHDWAPEVTKEDPNLFKYFIRVLCNANT-SEGFNPMRDASLPEINLPPTF 123
Query: 154 ---WPRQEDPPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYAD 209
PR PP+ R+ LAFFAG + +R L+Q W++ D EI H +
Sbjct: 124 HLNLPRLGQPPQ----NRSILAFFAGGAHGFIRHILMQHWKDKDHEIQVHEYLPPSQNYT 179
Query: 210 ELLG-SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLD 268
EL+ SKFCL G+EV + R+ ++++ GCVPV+I+++Y LPF DVL+W FS+ + +
Sbjct: 180 ELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVISDYYSLPFDDVLDWSKFSMRIPSER 239
Query: 269 IPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
IP +K IL+G+S ++YL LQ V+KV++HF+ H +D F+MV++ +WLRR +V++
Sbjct: 240 IPEIKTILRGVSMKKYLKLQRGVMKVQRHFEIHRPAKAFDMFHMVLHSVWLRRLNVKL 297
>gi|224142559|ref|XP_002324623.1| predicted protein [Populus trichocarpa]
gi|222866057|gb|EEF03188.1| predicted protein [Populus trichocarpa]
Length = 337
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 109/295 (36%), Positives = 169/295 (57%), Gaps = 15/295 (5%)
Query: 46 SHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDF--------ISHYIFNISQKYPYW 97
S F +DP +A +FFLP S+A + H + D+ ++ Y+ +++KYPYW
Sbjct: 42 SPFRAQDPDEAHVFFLPVSVASIVHFIYLPITAAADYSRDRLRRVVTDYVHIVAKKYPYW 101
Query: 98 NRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWP-- 155
NR+ GADHF V+CH E+ I+V+C+++ I G +DV LP+I+
Sbjct: 102 NRSNGADHFMVSCHDWAPDVSIANSELFNKFIRVLCNANISI-GFRPPRDVLLPEIYLPF 160
Query: 156 RQEDPPKLGSSKRNK--LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYADELL 212
+G + N+ LAFF G + +R+ L + W+N D+E+ H K L+
Sbjct: 161 SGLGTTHMGQAPNNRPILAFFEGRAHGYIRQVLFKHWKNKDNEVQVHELLPKGKNYTRLM 220
Query: 213 G-SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPL 271
G SKFCL GFEV + R+ +++Y GCVPVII+N+Y LPF+DVLNW FS+ + IP
Sbjct: 221 GQSKFCLCPSGFEVASPRVVEAIYQGCVPVIISNNYSLPFSDVLNWSQFSVQIPVEKIPE 280
Query: 272 LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
+K IL+ IS+ +YL + V +V++HF + +D +MV++ LWLRR + R+
Sbjct: 281 IKMILQRISNSKYLRMHERVKRVQRHFVLNRPAKPFDVIHMVLHSLWLRRLNFRL 335
>gi|224143179|ref|XP_002324871.1| predicted protein [Populus trichocarpa]
gi|222866305|gb|EEF03436.1| predicted protein [Populus trichocarpa]
Length = 334
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 119/343 (34%), Positives = 188/343 (54%), Gaps = 27/343 (7%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPV--DFEPRGNYASESYF--KKAFMKSHFVTKDPSKA 56
M + F+V+VY + LP+ D + Y E F + F T+DP +A
Sbjct: 1 MEKRFKVHVY--------SEGELPIVHDGPCKDIYTIEGRFIHEMEHGAKRFKTRDPRRA 52
Query: 57 DLFFLPFSIARM----RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHS 112
++F+PFS+ M + F++ Y+ +S KYP+WNRT GADHF ++CH
Sbjct: 53 HVYFMPFSVTWMVKYLYKPLTYDHTAMKQFVADYVRVVSSKYPFWNRTQGADHFMLSCHD 112
Query: 113 IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQED-PPKL-----GSS 166
G A + +I+V+C+++ G KDVSLP+I + PPKL +S
Sbjct: 113 WGPHASHGNPFLYNTSIRVLCNANS-SEGFSPRKDVSLPEIHLYGGNVPPKLISPPPATS 171
Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRN---DSEIYAHSGRLKTPYADELLGSKFCLHVKGF 223
R LAFF+G ++ P+R LL W+ D ++Y + + Y +L SKFCL G
Sbjct: 172 PRPYLAFFSGGLHGPIRPILLDHWKGRDPDLQVYEYLPK-DLDYYSFMLRSKFCLCPSGH 230
Query: 224 EVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEE 283
EV + RI +++Y CVPVI+++HY LPF+DVL W++F+I V +IP LK++L + E
Sbjct: 231 EVASPRIVEAIYAECVPVILSDHYVLPFSDVLRWEAFAIQVNVSEIPRLKEVLISVPEER 290
Query: 284 YLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
Y L+ + +RKHF + +D F+M+++ +WLRR ++R+
Sbjct: 291 YRRLKEGLRAIRKHFVLNQPAKRFDVFHMILHSIWLRRLNLRL 333
>gi|255582309|ref|XP_002531945.1| catalytic, putative [Ricinus communis]
gi|223528391|gb|EEF30427.1| catalytic, putative [Ricinus communis]
Length = 453
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 106/297 (35%), Positives = 172/297 (57%), Gaps = 13/297 (4%)
Query: 45 KSHFVTKDPSKADLFFLPFSIARM-----RHDRRIGTEGIPDFISHYIFNISQKYPYWNR 99
+SHF+ + P +A +FFLP S+A + + + + ++ Y+ ++ KYPYWNR
Sbjct: 158 RSHFIARRPEEAQVFFLPVSVANIINFIYKPIVTYSRDQLQRLVTDYVGTVANKYPYWNR 217
Query: 100 TGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WPRQE 158
+ GADHF V+CH E+ N I+V+C+++ + +DVS+P+I P +
Sbjct: 218 SSGADHFLVSCHDWAPDISTANPELYRNFIRVLCNANTSERFN-PKRDVSIPEINIPSGK 276
Query: 159 DPPKL----GSSKRNKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYADELLG 213
P L S R LAFFAG + +R+ LL+ W+ DSEI H K +L+G
Sbjct: 277 LGPPLHQASSPSNRTILAFFAGGSHGYIRKLLLEHWKGKDSEIQVHEYLDKKQNYFKLMG 336
Query: 214 -SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLL 272
S+FCL G+EV + R+ ++ GCVPV I+++Y LPF+D+L+W FS+ + + I +
Sbjct: 337 QSRFCLCPSGYEVASPRVVTAIQLGCVPVTISDNYTLPFSDILDWSKFSVHIPSGKIQEI 396
Query: 273 KKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQWS 329
K ILKGIS +YL + V+ VR+HF + +D +M+++ +WLRR ++R+ +S
Sbjct: 397 KTILKGISPRQYLKMHKRVMLVRRHFMLNRPAQPFDMIHMMLHSIWLRRLNIRLPFS 453
>gi|449518133|ref|XP_004166098.1| PREDICTED: probable glycosyltransferase At5g20260-like, partial
[Cucumis sativus]
Length = 362
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 109/298 (36%), Positives = 172/298 (57%), Gaps = 21/298 (7%)
Query: 45 KSHFVTKDPSKADLFFLPFSIARM-----RHDRRIGTEGIPDFISHYIFNISQKYPYWNR 99
KS F +P +A +FFLP SI + + + + + Y+ ++ KYPYWNR
Sbjct: 68 KSPFSAHEPEEAQVFFLPISIVYIVDYIYKPITTYARDRLVRIFTDYVRVVANKYPYWNR 127
Query: 100 TGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI------ 153
T GADHF V+CH ++ + I+V+C+++ G +D SLP+I
Sbjct: 128 TRGADHFMVSCHDWAPEVTKEDPNLFKYFIRVLCNANT-SEGFNPMRDASLPEINLPPTF 186
Query: 154 ---WPRQEDPPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYAD 209
PR PP+ R+ LAFFAG + +R L+Q W++ D EI H +
Sbjct: 187 HLNLPRLGQPPQ----NRSILAFFAGGAHGFIRHILMQHWKDKDHEIQVHEYLPPSQNYT 242
Query: 210 ELLG-SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLD 268
EL+ SKFCL G+EV + R+ ++++ GCVPV+I+++Y LPF DVL+W FS+ + +
Sbjct: 243 ELIDRSKFCLCPSGYEVASPRLVEAIHGGCVPVVISDYYSLPFDDVLDWSKFSMRIPSER 302
Query: 269 IPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
IP +K IL+G+S ++YL LQ V+KV++HF+ H +D F+MV++ +WLRR +V++
Sbjct: 303 IPEIKTILRGVSMKKYLKLQRGVMKVQRHFEIHRPAKAFDMFHMVLHSVWLRRLNVKL 360
>gi|224142567|ref|XP_002324627.1| predicted protein [Populus trichocarpa]
gi|222866061|gb|EEF03192.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 111/301 (36%), Positives = 170/301 (56%), Gaps = 24/301 (7%)
Query: 45 KSHFVTKDPSKADLFFLPFSIARMRH---DRRI--GTEGIPDFISHYIFNISQKYPYWNR 99
KS F+ + P +A FFLP S+A + H RI + + ++ Y+ ++ KY YWNR
Sbjct: 41 KSPFIARHPDEAHAFFLPISVAYIIHYVYKPRITFARDQLQRLVTDYVRVVADKYTYWNR 100
Query: 100 TGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW----- 154
T GADHF V+CH + E+ ++V+C+++ G +DVS+P+I+
Sbjct: 101 TQGADHFSVSCHDWAPDVSRANPELFRYFVRVLCNAN-ISEGFRPQRDVSIPEIFLPVGK 159
Query: 155 ---PRQEDPPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHS-----GRLKT 205
PR+ P SKR+ LAFFAG + +R+ LL W+ D E+ H + T
Sbjct: 160 LGPPREYTKP---PSKRSILAFFAGGAHGHIRKVLLTHWKEKDDEVQVHEYLTQRNKKNT 216
Query: 206 PYADELLG-SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV 264
EL+G SKFCL G EV + R+ ++ GCVPV I+ +Y LPF+DVL+W FS+ +
Sbjct: 217 NLYFELMGQSKFCLCPSGHEVASPRVVTAIQLGCVPVTISANYSLPFSDVLDWSKFSVDI 276
Query: 265 ATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSV 324
IP +K ILKGISS YL +Q V+++++HF + YD +M+++ +WLRR +V
Sbjct: 277 PPEKIPEIKTILKGISSRRYLTMQRRVMQIQRHFMLNRPAQPYDMLHMILHSVWLRRLNV 336
Query: 325 R 325
+
Sbjct: 337 K 337
>gi|224129246|ref|XP_002320537.1| predicted protein [Populus trichocarpa]
gi|222861310|gb|EEE98852.1| predicted protein [Populus trichocarpa]
Length = 395
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 113/294 (38%), Positives = 167/294 (56%), Gaps = 19/294 (6%)
Query: 46 SHFVTKDPSKADLFFLPFS----IARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTG 101
+ F T +P +A ++FLPFS I + H + +S Y+ IS KY YWNR+
Sbjct: 104 TKFRTSNPDEAHVYFLPFSVVMIIEHLFHPIIRDKAVLERTVSDYVRIISHKYLYWNRSL 163
Query: 102 GADHFYVACHSIGRSAMQKAWEVKLNAIQVVC---SSSYFISGHIAHKDVSLPQIWPRQE 158
GADHF ++CH G A ++ N+I+V+C +S YF KD S P+I +
Sbjct: 164 GADHFMLSCHDWGPRATWYVRQLYYNSIRVLCNANTSEYFN----PKKDASFPEINLKTG 219
Query: 159 DPPKLGS----SKRNKLAFFAGAVNSPVREKLLQVWRN---DSEIYAHSGRLKTPYADEL 211
+ L S R LAFFAG ++ +R LLQ W D ++Y + Y + +
Sbjct: 220 EITGLTGGLPPSNRTVLAFFAGKMHGKLRPALLQHWMGKDKDVQVYETLPQ-GISYHEMM 278
Query: 212 LGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPL 271
SK+C+ G EV + RIA+++Y CVPV+I+ HY PF+DVLNW SF+I V +IP
Sbjct: 279 KKSKYCICPSGHEVASPRIAEAIYAECVPVLISQHYIFPFSDVLNWDSFTIQVPVTEIPN 338
Query: 272 LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
LK IL+GI ++YL +Q V +V++HF + P YD F+M+++ +WLRR +VR
Sbjct: 339 LKNILEGIPEDQYLRMQERVRQVQRHFVVNNPPRRYDVFHMIIHSIWLRRLNVR 392
>gi|357478279|ref|XP_003609425.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
gi|355510480|gb|AES91622.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
Length = 617
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 104/295 (35%), Positives = 168/295 (56%), Gaps = 15/295 (5%)
Query: 46 SHFVTKDPSKADLFFLPFSIAR--------MRHDRRIGTEGIPDFISHYIFNISQKYPYW 97
S F K P++A +FFLPFS+A + + + + + Y+ ++ KYPYW
Sbjct: 320 SPFKAKHPNEAHIFFLPFSVANVVQYVYKPIMSKKDFNRDRLHRMVEDYVNVVAHKYPYW 379
Query: 98 NRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW--P 155
NR+ GADHF ++CH + N +V+C+++ G +DVS+P+++
Sbjct: 380 NRSNGADHFLLSCHDWAPEISDANPNLFKNFTRVLCNANTS-EGFQPKRDVSIPEVYLPV 438
Query: 156 RQEDPPKLGSSKRNK--LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYADELL 212
+ PP LG S N+ LAFF+G + +R+ LL W+N D+++ H K EL+
Sbjct: 439 GKLGPPNLGQSPLNRTILAFFSGGAHGDIRKLLLNHWKNKDAQVQVHEYLPKGQNYTELM 498
Query: 213 G-SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPL 271
G SKFCL G+EV + RI +++ GCVPVII+ +Y LPF DVLNW FS+ + I
Sbjct: 499 GLSKFCLCPSGYEVASPRIVEAINAGCVPVIISQNYSLPFNDVLNWSEFSVEIPVEKIVE 558
Query: 272 LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
+K IL+ ++ ++Y+ L V+KV+KHF + +D +M+++ +WLRR + R+
Sbjct: 559 IKNILQNVTKDKYMKLHMNVMKVQKHFVMNRPAKPFDVMHMILHSIWLRRLNFRL 613
>gi|449466083|ref|XP_004150756.1| PREDICTED: probable glycosyltransferase At3g07620-like [Cucumis
sativus]
Length = 341
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 115/336 (34%), Positives = 180/336 (53%), Gaps = 25/336 (7%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH-FVTKDPSKADLF 59
M + +VYVY F +L G YASE +F K + FV KDP KA LF
Sbjct: 9 MEKMLKVYVYDEGEKPIFHQPIL------TGIYASEGWFMKLLEDNKKFVVKDPEKAHLF 62
Query: 60 FLPFSIARMRH---DRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRS 116
+LPFS +R ++ + + +YI I +KY +WN+ GG+DHF VACH
Sbjct: 63 YLPFSSQFLRSAFGNKFRNKRDLQKPLKNYIDVIGKKYRFWNKNGGSDHFLVACHDWAPK 122
Query: 117 AMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDP--PKLGS---SKRNKL 171
++ + N I+ +C+++ I KD SLP + + K+G S+R L
Sbjct: 123 LTKRLVK---NCIRALCNANGAGDFEIG-KDTSLPVTFVHSTEDLITKIGGKPPSERTTL 178
Query: 172 AFFAGAVNSPVREKLLQVWRNDSEIYAHSGRL------KTPYADELLGSKFCLHVKGFEV 225
AFFAG+++ +R LL W N G + K Y +++ SK+C+ +G++V
Sbjct: 179 AFFAGSMHGYLRPILLHYWENKEPDMMIVGPMPNSIEGKNAYMEQMKSSKYCICARGYQV 238
Query: 226 NTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL 285
++ R+ +++ C+PVII+++Y P +VLNW+SFS+ V +IP L+ IL I E Y
Sbjct: 239 HSPRVIEAILNECIPVIISDNYVPPLFEVLNWESFSVFVKEREIPNLRDILLSIPEENYR 298
Query: 286 LLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
+ S V V++HF WH P+ YDAF+M+++ +W R
Sbjct: 299 AMHSRVKMVQQHFLWHEKPAKYDAFHMILHSIWYTR 334
>gi|255548864|ref|XP_002515488.1| catalytic, putative [Ricinus communis]
gi|223545432|gb|EEF46937.1| catalytic, putative [Ricinus communis]
Length = 474
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 116/338 (34%), Positives = 180/338 (53%), Gaps = 16/338 (4%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M + F+V+ Y F N + + G + E F+ KS F + P +A FF
Sbjct: 137 MEKRFKVWTYKEGEPPLFHNGPMKEIYSTEGQFIDE--FESG--KSLFSARRPDEAHAFF 192
Query: 61 LPFSIARM-----RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
LP SI + R + + + Y+ IS KYP+WNR+ GADHF +CH
Sbjct: 193 LPVSIVSIVKYVYRPYSDYSRIRLQNVVKDYVGVISSKYPFWNRSDGADHFLTSCHDWAP 252
Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQE----DPPKLGSSKRNKL 171
E+ +V+C+++ G + +DVSLP+I R +P L R L
Sbjct: 253 DVSAGHPELYKYFTRVLCNANT-SEGFVPERDVSLPEIRLRDRKLSPEPHSLPPKDRRIL 311
Query: 172 AFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYADELLG-SKFCLHVKGFEVNTAR 229
AFFAG + VR KL + W+ D ++ + KT EL+ SKFCL G+EV + R
Sbjct: 312 AFFAGGEHGHVRTKLFEHWKGKDRDVQVYEYLPKTLNYTELMSHSKFCLCPSGWEVASPR 371
Query: 230 IADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQS 289
+ +++Y GCVPVII+++Y LPF+DVL+W FS+ + IP +K +L+ I +YL +Q
Sbjct: 372 VPEAIYSGCVPVIISDYYYLPFSDVLDWSKFSVHIPVARIPEIKTVLQKIPMRKYLTMQK 431
Query: 290 YVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
V++V++HF+ + YD +MV++ +WLRR +VR+
Sbjct: 432 RVIQVQRHFKLNRPAKPYDVLHMVLHSIWLRRLNVRLN 469
>gi|356546130|ref|XP_003541484.1| PREDICTED: probable glycosyltransferase At5g20260-like [Glycine
max]
Length = 468
Score = 187 bits (475), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 118/349 (33%), Positives = 190/349 (54%), Gaps = 30/349 (8%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGN-YASESYFKKAF---MKSHFVTKDPSKA 56
M + F+V+VY + P V + P + YA E F +S F K+P +A
Sbjct: 129 MVKRFKVWVY-EEGDQPL------VHYGPVNDIYAIEGQFIDEMDNSKRSPFKAKNPDEA 181
Query: 57 DLFFLPFSIARMRH----------DRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHF 106
FFLPFS+ + H D R + + + YI ++ KYPYWNR+ GADHF
Sbjct: 182 HAFFLPFSVVNVVHYAYKPYMSQNDYR--RDRLQRLVEDYIVVVADKYPYWNRSNGADHF 239
Query: 107 YVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPR--QEDPPKLG 164
++CH ++ N I+V+C+++ G +DVS+P+++ + PP LG
Sbjct: 240 LLSCHDWAPEISHANPDLFKNFIRVLCNANNS-EGFQPKRDVSIPEVYLSVGKLGPPNLG 298
Query: 165 SSKRNK--LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYADELLG-SKFCLHV 220
N+ LAFF+G + +R+ LL+ W++ D+++ H K EL+G SKFCL
Sbjct: 299 QHPMNRTILAFFSGGAHGDIRKLLLKHWKDKDNQVQVHEYLPKGQNYTELMGLSKFCLCP 358
Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGIS 280
G+EV + R+ +++ CVPVII+ +Y LP +DVLNW FSI ++ +IP +K IL+ ++
Sbjct: 359 SGYEVASPRVVEAINAVCVPVIISENYSLPLSDVLNWSQFSIQISVENIPDIKTILQNVT 418
Query: 281 SEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQWS 329
++Y L V +VR+HF H +D +M+++ +WLRR + R+ S
Sbjct: 419 QKKYKKLYRNVRRVRRHFVMHRPAKPFDLMHMIIHSIWLRRLNFRLMAS 467
>gi|334187794|ref|NP_197526.5| Exostosin family protein [Arabidopsis thaliana]
gi|332005439|gb|AED92822.1| Exostosin family protein [Arabidopsis thaliana]
Length = 458
Score = 187 bits (474), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 124/341 (36%), Positives = 183/341 (53%), Gaps = 23/341 (6%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGN-YASESYFKKAFMK--SHFVTKDPSKAD 57
M + F+V+VY P V P N Y+ E F S F +P +A
Sbjct: 125 MEKKFKVWVY-REGETPL------VHMGPMNNIYSIEGQFMDEIETGMSPFAANNPEEAH 177
Query: 58 LFFLPFSIARMRHD--RRIGT---EGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHS 112
F LP S+A + H R + T E + Y+ ++ KYPYWNR+ GADHFYV+CH
Sbjct: 178 AFLLPVSVANIVHYLYRPLVTYSREQLHKVFLDYVDVVAHKYPYWNRSLGADHFYVSCHD 237
Query: 113 IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WPRQE-DPPKLGSSK--- 167
E+ N I+V+C+++ G + +DVS+P+I P PP+L S
Sbjct: 238 WAPDVSGSNPELMKNLIRVLCNANTS-EGFMPQRDVSIPEINIPGGHLGPPRLSRSSGHD 296
Query: 168 RNKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTP-YADELLGSKFCLHVKGFEV 225
R LAFFAG + +R LLQ W++ D E+ H K Y + ++FCL G+EV
Sbjct: 297 RPILAFFAGGSHGYIRRILLQHWKDKDEEVQVHEYLAKNKDYFKLMATARFCLCPSGYEV 356
Query: 226 NTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL 285
+ R+ ++ GCVPVII++HY LPF+DVL+W F+I V + IP +K ILK IS Y
Sbjct: 357 ASPRVVAAINLGCVPVIISDHYALPFSDVLDWTKFTIHVPSKKIPEIKTILKSISWRRYR 416
Query: 286 LLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
+LQ VL+V++HF + +D M+++ +WLRR ++R+
Sbjct: 417 VLQRRVLQVQRHFVINRPSQPFDMLRMLLHSVWLRRLNLRL 457
>gi|224087074|ref|XP_002308055.1| predicted protein [Populus trichocarpa]
gi|222854031|gb|EEE91578.1| predicted protein [Populus trichocarpa]
Length = 338
Score = 187 bits (474), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 119/352 (33%), Positives = 187/352 (53%), Gaps = 41/352 (11%)
Query: 1 MNRSFRVYVY-----PHRRNDPFANVLLPVDFEPRGNYASESYFKKAFM--KSHFVTKDP 53
M + F+++VY P P N+ Y+ E F KSHF+ + P
Sbjct: 1 MEKRFKIWVYKEGELPVLHGGPVNNI-----------YSVEGQFLDEIERGKSHFIARHP 49
Query: 54 SKADLFFLPFSIARMRH---DRRI--GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYV 108
+A F LP S+A + H R+ + ++ Y+ I+ KY YWNRT GADHF +
Sbjct: 50 DEAHAFLLPLSVAYIMHYIYKPRVTFSRHQLQTLVTDYVRVIADKYTYWNRTNGADHFSI 109
Query: 109 ACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW--------PRQEDP 160
+CH G + E+ I+ +C+++ G +DVS+P+I+ PR+
Sbjct: 110 SCHDWGPDISRTNPELFKYFIRALCNANTS-EGFQPQRDVSVPEIFLHVGKLGLPREGAQ 168
Query: 161 PKLGSSKRNKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYAD----ELLG-S 214
P SKR LAFFAG + +R+ LL+ W++ D EI H + + +L+G S
Sbjct: 169 P---PSKRPILAFFAGGAHGRIRKVLLKRWKDKDGEIQVHEYVTQRKKNNNLYFKLMGQS 225
Query: 215 KFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKK 274
KFCL G EV + R+ ++ GCVPVII+++Y LPF+DVL+W FS+ + + I +K
Sbjct: 226 KFCLCPSGHEVASPRVVTAIQLGCVPVIISDNYSLPFSDVLDWSKFSVNIPSEKIQEIKT 285
Query: 275 ILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
ILKGIS + YL +Q V++ ++HF + YD +M+++ +WLRR + R+
Sbjct: 286 ILKGISHKRYLTMQRRVIQAQRHFTLNRPAKPYDMIHMILHSIWLRRLNHRM 337
>gi|292630785|sp|Q3E9A4.3|GLYT5_ARATH RecName: Full=Probable glycosyltransferase At5g20260
Length = 466
Score = 187 bits (474), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 124/341 (36%), Positives = 183/341 (53%), Gaps = 23/341 (6%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGN-YASESYFKKAFMK--SHFVTKDPSKAD 57
M + F+V+VY P V P N Y+ E F S F +P +A
Sbjct: 133 MEKKFKVWVY-REGETPL------VHMGPMNNIYSIEGQFMDEIETGMSPFAANNPEEAH 185
Query: 58 LFFLPFSIARMRHD--RRIGT---EGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHS 112
F LP S+A + H R + T E + Y+ ++ KYPYWNR+ GADHFYV+CH
Sbjct: 186 AFLLPVSVANIVHYLYRPLVTYSREQLHKVFLDYVDVVAHKYPYWNRSLGADHFYVSCHD 245
Query: 113 IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WPRQE-DPPKLGSSK--- 167
E+ N I+V+C+++ G + +DVS+P+I P PP+L S
Sbjct: 246 WAPDVSGSNPELMKNLIRVLCNANTS-EGFMPQRDVSIPEINIPGGHLGPPRLSRSSGHD 304
Query: 168 RNKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTP-YADELLGSKFCLHVKGFEV 225
R LAFFAG + +R LLQ W++ D E+ H K Y + ++FCL G+EV
Sbjct: 305 RPILAFFAGGSHGYIRRILLQHWKDKDEEVQVHEYLAKNKDYFKLMATARFCLCPSGYEV 364
Query: 226 NTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL 285
+ R+ ++ GCVPVII++HY LPF+DVL+W F+I V + IP +K ILK IS Y
Sbjct: 365 ASPRVVAAINLGCVPVIISDHYALPFSDVLDWTKFTIHVPSKKIPEIKTILKSISWRRYR 424
Query: 286 LLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
+LQ VL+V++HF + +D M+++ +WLRR ++R+
Sbjct: 425 VLQRRVLQVQRHFVINRPSQPFDMLRMLLHSVWLRRLNLRL 465
>gi|224120366|ref|XP_002331030.1| predicted protein [Populus trichocarpa]
gi|222872960|gb|EEF10091.1| predicted protein [Populus trichocarpa]
Length = 339
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 112/303 (36%), Positives = 171/303 (56%), Gaps = 26/303 (8%)
Query: 45 KSHFVTKDPSKADLFFLPFSIARMRH---DRRI--GTEGIPDFISHYIFNISQKYPYWNR 99
KS F+ + P +A F LP S+A + H RI + + ++ Y+ I+ K+PYWNR
Sbjct: 41 KSQFIARHPEEAQAFLLPVSVAYIIHYVYRPRITFSRDQLQRLVTDYVRVIADKHPYWNR 100
Query: 100 TGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW----- 154
T GADHF V+CH + + I+ +C+++ G +DVS+P+I+
Sbjct: 101 THGADHFSVSCHDWAPDVSRADPGLFKYFIRALCNANTS-EGFQPQRDVSIPEIFLPVGK 159
Query: 155 --PRQE--DPPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYAD 209
P QE PP SKR+ LAFFAG + +R+ LL+ W+ D EI H + +
Sbjct: 160 LGPPQEYAQPP----SKRSILAFFAGGAHGHIRKILLERWKEKDDEIQVHEYLTRKNKKN 215
Query: 210 -----ELLG-SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIV 263
EL+G SKFCL G EV + R+ ++ GCVPV I+++Y LPF+DVL+W FS+
Sbjct: 216 NNLYFELMGQSKFCLCPSGHEVASPRVVTAIQLGCVPVTISDNYSLPFSDVLDWSKFSVD 275
Query: 264 VATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSS 323
+ + IP +K ILKGIS YL +Q V+++R+HF + YD +M+++ +WLRR
Sbjct: 276 IPSEKIPDIKIILKGISVRRYLTMQRRVMQIRRHFTLNRPAQPYDMLHMILHSVWLRRLD 335
Query: 324 VRV 326
V++
Sbjct: 336 VKL 338
>gi|302799042|ref|XP_002981280.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300150820|gb|EFJ17468.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 352
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 113/336 (33%), Positives = 185/336 (55%), Gaps = 23/336 (6%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH-FVTKDPSKADLF 59
M R FR++VY D + ++ + G YASE F + + F +P A +F
Sbjct: 21 MQRVFRIFVY----KDGYKPLMH--EGPKTGIYASEGLFIATMERGNPFAVTEPKIATMF 74
Query: 60 FLPFSIARM---RHDRRI-GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
F+PFS+ +M +D + I +I+ Y+ ++ KYPY N T G DHF+V+CH
Sbjct: 75 FIPFSLKQMVDYMYDTNSHSMKNIQSYIAGYLRRLASKYPYMNATNGIDHFFVSCHDWAL 134
Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQ--EDPPKLGSSKRNK--L 171
A++K + + N ++VVC++ G +DVSLP+ RQ P +S ++ L
Sbjct: 135 MALEKQ-DCQRNIVKVVCNADSS-RGFNTSRDVSLPETRVRQGKHSPIIRDTSGMDRPYL 192
Query: 172 AFFAGAVNSPVREKLLQVWRNDS------EIYAHSGRLKTPYADELLGSKFCLHVKGFEV 225
AFFAG ++ +R LL W++ E+ S + Y++ + SK+C+ GFEV
Sbjct: 193 AFFAGQMHGKLRPVLLAHWKDKDPEMKIYEVLPPSVAERISYSEHMRLSKYCICAAGFEV 252
Query: 226 NTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL 285
N+ R+ +++ CVPVI+A+++ LPF++V+NW S S+ VA D+ LK IL GI Y
Sbjct: 253 NSPRLVEAIVNECVPVILADNFVLPFSEVINWDSISVTVAEKDVANLKAILAGIPLRTYK 312
Query: 286 LLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
+Q+ + V++HF+W P YD F M+++ LW ++
Sbjct: 313 EMQARLKHVKRHFEWKNSPEKYDIFNMIVHSLWTQQ 348
>gi|224087070|ref|XP_002308053.1| predicted protein [Populus trichocarpa]
gi|222854029|gb|EEE91576.1| predicted protein [Populus trichocarpa]
Length = 339
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 112/303 (36%), Positives = 171/303 (56%), Gaps = 26/303 (8%)
Query: 45 KSHFVTKDPSKADLFFLPFSIARMRH---DRRI--GTEGIPDFISHYIFNISQKYPYWNR 99
KS F+ + P +A F LP S+A + H RI + + ++ Y+ I+ K+PYWNR
Sbjct: 41 KSQFIARHPEEAQAFLLPVSVAYIIHYVYRPRITFSRDQLQRLVTDYVRVIADKHPYWNR 100
Query: 100 TGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW----- 154
T GADHF V+CH + + I+ +C+++ G +DVS+P+I+
Sbjct: 101 THGADHFSVSCHDWAPDVSRADPGLFKYFIRALCNANTS-EGFQPQRDVSIPEIFLPVGK 159
Query: 155 --PRQE--DPPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYAD 209
P QE PP SKR+ LAFFAG + +R+ LL+ W+ D EI H + +
Sbjct: 160 LGPPQEYAQPP----SKRSILAFFAGGAHGHIRKILLERWKEKDDEIQVHEYLTQKNKKN 215
Query: 210 -----ELLG-SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIV 263
EL+G SKFCL G EV + R+ ++ GCVPV I+++Y LPF+DVL+W FS+
Sbjct: 216 NNLYFELMGQSKFCLCPSGHEVASPRVVTAIQLGCVPVTISDNYSLPFSDVLDWSKFSVD 275
Query: 264 VATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSS 323
+ + IP +K ILKGIS YL +Q V+++R+HF + YD +M+++ +WLRR
Sbjct: 276 IPSEKIPDIKIILKGISVRRYLTMQRRVMQIRRHFTLNRPAQPYDMLHMILHSVWLRRLD 335
Query: 324 VRV 326
V++
Sbjct: 336 VKL 338
>gi|224120362|ref|XP_002331029.1| predicted protein [Populus trichocarpa]
gi|222872959|gb|EEF10090.1| predicted protein [Populus trichocarpa]
Length = 336
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 119/339 (35%), Positives = 183/339 (53%), Gaps = 18/339 (5%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M + FR++VY + L + + G + E KS F P +A F
Sbjct: 1 MEKRFRIWVYREGELPLVHSAPLNLIYSIEGQFLDEM----ESGKSPFAASHPDEAHTFL 56
Query: 61 LPFSIARMRHD--RRIGTEG---IPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
LP S+A + H R + T + + Y+ ++ KYPYWNRT GADHF V+CH
Sbjct: 57 LPISVAYIIHYIYRPLVTFSRVELQRLVQDYVTVVAGKYPYWNRTEGADHFLVSCHDWAP 116
Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WPRQE-DPPKLG--SSKRNKL 171
+ + N I+V+C+++ +DVS+P+I P + PP G SKR+
Sbjct: 117 DISRANPRLYKNFIRVLCNANT-SERFEPRRDVSIPEINIPFGKFGPPGKGLPPSKRSIF 175
Query: 172 AFFAGAVNSPVREKLLQVWRN-DSEIYAHS---GRLKTPYADELLGSKFCLHVKGFEVNT 227
AFFAG + +R+ LL+ W++ D EI H K Y + SKFCL G+EV +
Sbjct: 176 AFFAGGAHGYIRKLLLEHWKDKDDEIQVHEYLDHNKKNDYFKLMGQSKFCLCPSGYEVAS 235
Query: 228 ARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLL 287
R+ ++ GC+PV I+++Y LPF+DVL+W FS+ + + IP +K ILK IS YL+L
Sbjct: 236 PRVVTAIQSGCIPVTISDNYTLPFSDVLDWSKFSVNIPSEKIPEIKTILKKISFRRYLIL 295
Query: 288 QSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
Q V+K+R+HF+ + YD +M+++ +WLRR +VR+
Sbjct: 296 QGRVIKIRRHFKLNRPAQPYDMLHMILHSIWLRRLNVRL 334
>gi|356518828|ref|XP_003528079.1| PREDICTED: probable glycosyltransferase At5g03795-like [Glycine
max]
Length = 505
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 118/339 (34%), Positives = 186/339 (54%), Gaps = 20/339 (5%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKS-HFVTKDPSKADLF 59
M + F+++VY F N L + YA+E F K ++ T DP +A ++
Sbjct: 173 MEKVFKIFVYEEGEPPLFHNGL------SKDIYATEGRFIHEMEKGRYYRTYDPDEAFVY 226
Query: 60 FLPFSIARMR---HDR--RIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIG 114
+LPFS+ + +DR + + + YI I+ K+P+WNR+ G DH ++CH G
Sbjct: 227 YLPFSVVMLVEYVYDRGSNYNLDPLGLVVKDYIQIIAHKHPFWNRSLGHDHVMLSCHDWG 286
Query: 115 RSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS---SKRNKL 171
+ NAI+V+C+++ G KDVS P+I + + LG S+R L
Sbjct: 287 PLVSSYVDHLYNNAIRVLCNANT-SEGFKPAKDVSFPEIKLIKGEVKGLGGYPPSQRTIL 345
Query: 172 AFFAGAVNSPVREKLLQVWRN---DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTA 228
AFFAG ++ +R LL W+N D +IY Y +L SKFCL G+EV +
Sbjct: 346 AFFAGHLHGYIRYLLLSTWKNKDQDMQIYEELPE-GISYYTKLRSSKFCLCPSGYEVASP 404
Query: 229 RIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
R+ ++++ CVPV+I++ Y PF+DVLNW SFS+ V DIP +K+IL IS ++YL +
Sbjct: 405 RVVEAIFAECVPVLISDSYVPPFSDVLNWNSFSVQVNVKDIPNIKRILMEISEKQYLRMH 464
Query: 289 SYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
V +V++HF + P YD F+M ++ +WLRR ++ +Q
Sbjct: 465 KRVKQVQRHFVPNEPPKRYDMFHMTVHSIWLRRLNINIQ 503
>gi|15231488|ref|NP_187419.1| putative glycosyltransferase [Arabidopsis thaliana]
gi|75207507|sp|Q9SSE8.1|GLYT1_ARATH RecName: Full=Probable glycosyltransferase At3g07620
gi|6466945|gb|AAF13080.1|AC009176_7 hypothetical protein [Arabidopsis thaliana]
gi|332641055|gb|AEE74576.1| putative glycosyltransferase [Arabidopsis thaliana]
Length = 470
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 119/340 (35%), Positives = 187/340 (55%), Gaps = 23/340 (6%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFV---TKDPSKAD 57
M + F++YVY + P + L D Y+ E F FM++ + T+DP KA
Sbjct: 139 MEKMFKIYVY-EEGDPPIFHYGLCKDI-----YSMEGLFLN-FMENDVLKYRTRDPDKAH 191
Query: 58 LFFLPFSIARMRH---DRRIGTEGIPD-FISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
++FLPFS+ + H D + + + + I+ Y+ IS+KYPYWN + G DHF ++CH
Sbjct: 192 VYFLPFSVVMILHHLFDPVVRDKAVLERVIADYVQIISKKYPYWNTSDGFDHFMLSCHDW 251
Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHI-AHKDVSLPQIWPRQEDPPKLGSS----KR 168
G A ++ N+I+V+C+++ IS + KD P+I D L R
Sbjct: 252 GHRATWYVKKLFFNSIRVLCNAN--ISEYFNPEKDAPFPEINLLTGDINNLTGGLDPISR 309
Query: 169 NKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLK-TPYADELLGSKFCLHVKGFEVN 226
LAFFAG + +R LL W+ D +I + Y + + S+FC+ G EV
Sbjct: 310 TTLAFFAGKSHGKIRPVLLNHWKEKDKDILVYENLPDGLDYTEMMRKSRFCICPSGHEVA 369
Query: 227 TARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLL 286
+ R+ +++Y GCVPV+I+ +Y LPF+DVLNW+ FS+ V+ +IP LK+IL I E Y+
Sbjct: 370 SPRVPEAIYSGCVPVLISENYVLPFSDVLNWEKFSVSVSVKEIPELKRILMDIPEERYMR 429
Query: 287 LQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
L V KV++H + P YD F M+++ +WLRR +V++
Sbjct: 430 LYEGVKKVKRHILVNDPPKRYDVFNMIIHSIWLRRLNVKL 469
>gi|359476146|ref|XP_002285237.2| PREDICTED: probable glycosyltransferase At5g25310-like [Vitis
vinifera]
gi|296082059|emb|CBI21064.3| unnamed protein product [Vitis vinifera]
Length = 478
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 118/341 (34%), Positives = 184/341 (53%), Gaps = 23/341 (6%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYF--KKAFMKSHFVTKDPSKADL 58
M + +VYVY V + D + Y E F + F T+D +A +
Sbjct: 145 MEKRLKVYVYEE------GEVPMIHDGPCKDIYTIEGRFIHEIEHGDGKFRTRDAERAHV 198
Query: 59 FFLPFSIARM----RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIG 114
+F+PFS+ M + F+S Y+ IS KYP+WN T GADHF +ACH G
Sbjct: 199 YFMPFSVTWMVKYLYKPLTYNLTPLRQFVSDYVSVISTKYPFWNTTQGADHFMLACHDWG 258
Query: 115 RSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI--WPRQEDPPKLGSSKRNK-- 170
A + + +I+V+C+++ G KDVSLP+I + +P L N
Sbjct: 259 PHASRGHPVLYNTSIRVLCNANT-SEGFNPQKDVSLPEIHLYGGNVNPKLLSPPPPNSPR 317
Query: 171 --LAFFAGAVNSPVREKLLQVWRN---DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEV 225
LAFFAG ++ P+R L+Q W D +Y + + Y +L SK+CL G EV
Sbjct: 318 PFLAFFAGGLHGPIRPILIQHWMGRDTDLRVYEYLPK-DMDYYSLMLQSKYCLCPSGHEV 376
Query: 226 NTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL 285
+ RI +++Y CVPVI+++HY LPF+DVL W++FS+ V +IP LK++L+ IS E+Y
Sbjct: 377 ASPRIVEAIYSECVPVILSDHYVLPFSDVLRWEAFSVKVEASEIPRLKEVLQAISEEKYT 436
Query: 286 LLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
L+ V VR+HF+ + +D F+M+++ +WLRR ++++
Sbjct: 437 RLKEGVRAVRRHFELNQPAKRFDVFHMILHSVWLRRINLKL 477
>gi|224142563|ref|XP_002324625.1| predicted protein [Populus trichocarpa]
gi|222866059|gb|EEF03190.1| predicted protein [Populus trichocarpa]
Length = 336
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 115/347 (33%), Positives = 187/347 (53%), Gaps = 31/347 (8%)
Query: 1 MNRSFRVYVY-----PHRRNDPFANVLLPVDFEPRGNYASESYF--KKAFMKSHFVTKDP 53
M + F+++VY P P N+ Y E F + KS F P
Sbjct: 1 MEKRFKIWVYKEGERPLVHGGPLNNI-----------YGVEGQFLDEMEHGKSPFAASHP 49
Query: 54 SKADLFFLPFSIARM-----RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYV 108
+A +F LP S+A + + + + + Y+ ++ KYPYWNR+ GADHF V
Sbjct: 50 DEAHMFLLPISVAYIISYVYKPIVTYSRDELQRLVQDYVGVVADKYPYWNRSKGADHFLV 109
Query: 109 ACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WPRQE-DPPKLG-- 164
+CH ++ N I+V+C+++ +DVS+P+I P + PP G
Sbjct: 110 SCHDWAPDISGANPDLYKNFIRVLCNANTS-ERFEPRRDVSIPEINIPNGKLGPPHKGLP 168
Query: 165 SSKRNKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHS--GRLKTPYADELLGSKFCLHVK 221
SKR+ AFFAG + +R+ LL+ W++ D EI H + T Y + + SKFCL
Sbjct: 169 PSKRSIFAFFAGGAHGYIRKVLLENWKDKDDEIQVHEYLDKKGTDYFELMGQSKFCLCPS 228
Query: 222 GFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISS 281
G+EV + R+ ++ GCVPV I+++Y LPF+DVL+W FS+ + + IP +K ILK IS
Sbjct: 229 GYEVASPRVVTAIQLGCVPVTISDNYTLPFSDVLDWSKFSVHIPSEKIPEIKTILKKISP 288
Query: 282 EEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQW 328
+ YL++Q V++V++HF+ + YD +M+++ +W+RR +V+V +
Sbjct: 289 QRYLMMQMRVIQVQRHFELNRPARPYDLLHMLLHSVWVRRLNVKVPY 335
>gi|302772499|ref|XP_002969667.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300162178|gb|EFJ28791.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 352
Score = 185 bits (469), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 113/336 (33%), Positives = 183/336 (54%), Gaps = 23/336 (6%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSH-FVTKDPSKADLF 59
M R FR++VY D + ++ + G YASE F + + F +P A +F
Sbjct: 21 MQRVFRIFVY----KDGYKPLMH--EGPKTGIYASEGLFIATMERGNPFAVTEPKIATMF 74
Query: 60 FLPFSIARM---RHDRRI-GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
F+PFS+ +M +D + I +I+ Y+ ++ KYPY N T G DHF+V+CH
Sbjct: 75 FIPFSLKQMVDYMYDTNSHSMKNIQSYIAGYLRRLASKYPYMNATNGIDHFFVSCHDWAL 134
Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQ-EDPP---KLGSSKRNKL 171
A++K + + N ++VVC++ G +DVSLP+ RQ + P + R L
Sbjct: 135 MALEKQ-DCQRNIVKVVCNADSS-RGFNTSRDVSLPETRVRQGKHSPIIRDISGMDRPYL 192
Query: 172 AFFAGAVNSPVREKLLQVWRNDS------EIYAHSGRLKTPYADELLGSKFCLHVKGFEV 225
AFFAG ++ +R LL W++ E+ S + Y++ + SK+C+ GFEV
Sbjct: 193 AFFAGQMHGKLRPVLLAHWKDKDPEMKIYEVLPPSVAERISYSEHMRLSKYCICAAGFEV 252
Query: 226 NTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL 285
N+ R+ +++ CVPVI+A+++ LPF++V+NW S S+ VA D+ LK IL GI Y
Sbjct: 253 NSPRLVEAIVNECVPVILADNFVLPFSEVINWDSISVTVAEKDVANLKAILAGIPLRRYK 312
Query: 286 LLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
+Q+ + V++HF W P YD F M+++ LW ++
Sbjct: 313 EMQARLKHVKRHFVWKNSPEKYDIFNMIVHSLWTQQ 348
>gi|168043245|ref|XP_001774096.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674642|gb|EDQ61148.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 351
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 112/338 (33%), Positives = 176/338 (52%), Gaps = 17/338 (5%)
Query: 1 MNRSFRVYVYPHRRNDPFAN-------VLLPVDFEPRGNYASESYFKKAFMKSHFVTKDP 53
M ++ R+Y+YP +N F + + + ++++F FVT D
Sbjct: 17 MAKNLRIYLYPASQNYNFTQYEYGMNPSEMVSELGVETSSTTDTFFNLLVESKRFVTDDA 76
Query: 54 SKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
A L+FLP SI R+ +G + + + HY+ + Y W+ + GADHFY + H+
Sbjct: 77 DGAHLYFLPISIDRVW--AAVGPAKVGEHLRHYLQWLRNTYKLWDLSLGADHFYFSSHAY 134
Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQ--EDPPKLGSSKRNKL 171
+ E+ NAIQV S HKD+SLP + E +G+S+R KL
Sbjct: 135 DPINHRNNLELTKNAIQVASSPLRRNQNFFPHKDISLPSYKSQHIAEVQNLVGASQRPKL 194
Query: 172 AFFAGAVNS--PVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTAR 229
F + P+ ++Q W +DS+ + S +P ++LL S+FC+ V +
Sbjct: 195 VFVSSPPEDIDPIVASVIQKWTSDSDFHVESADQPSPPFEKLLSSRFCVSVSPQAM--LN 252
Query: 230 IADSLYYGCVPVIIANH--YDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLL 287
+ DSL GCVPV+IA+ YDLPF DVLNWK FS+V+ + P LK +L IS++EY +
Sbjct: 253 VVDSLRLGCVPVLIADSIIYDLPFQDVLNWKEFSVVLGVKESPNLKTLLSSISTDEYRKM 312
Query: 288 QSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
Q + KH +W+ P +DAF+M ++ELW+RR S++
Sbjct: 313 QYLGHQASKHMEWNDPPKPWDAFHMTLHELWVRRHSIK 350
>gi|297833558|ref|XP_002884661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297330501|gb|EFH60920.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 470
Score = 184 bits (467), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 119/340 (35%), Positives = 187/340 (55%), Gaps = 23/340 (6%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFV---TKDPSKAD 57
M R F++YVY + P + L D Y+ E F FM++ + T+DP KA
Sbjct: 139 MERMFKIYVY-EEGDPPIFHYGLCKDI-----YSMEGLFLN-FMENDVLKYRTRDPDKAH 191
Query: 58 LFFLPFSIARMRH---DRRIGTEGIPD-FISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
++FLPFS+ + H D + + + + I+ Y+ IS+KYPYWN + G DHF ++CH
Sbjct: 192 VYFLPFSVVMILHHLFDPVVRDKAVLESVIADYVQIISKKYPYWNTSDGFDHFMLSCHDW 251
Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHI-AHKDVSLPQIWPRQEDPPKLGSS----KR 168
G A ++ N+I+V+C+++ IS + KD P+I + L R
Sbjct: 252 GHRATWYVKKLFFNSIRVLCNAN--ISEYFNPEKDAPFPEINLLTGEINNLTGGLDPISR 309
Query: 169 NKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKT-PYADELLGSKFCLHVKGFEVN 226
LAFFAG + +R LL W+ D +I + + Y + + ++FC+ G EV
Sbjct: 310 KTLAFFAGKSHGKIRPVLLNHWKEKDKDILVYENLPEDLNYTEMMRKTRFCICPSGHEVA 369
Query: 227 TARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLL 286
+ RI +++Y GCVPV+I+ +Y LPF+DVLNW+ FS+ V+ +IP LK+IL I E Y+
Sbjct: 370 SPRIPEAIYSGCVPVLISENYVLPFSDVLNWEKFSVSVSVKEIPELKRILMDIPEERYMR 429
Query: 287 LQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
L V KV+ H + P YD F M+++ +WLRR +V++
Sbjct: 430 LYEGVKKVKMHILVNDPPKRYDVFNMIIHSIWLRRLNVKL 469
>gi|2245029|emb|CAB10449.1| limonene cyclase like protein [Arabidopsis thaliana]
gi|7268424|emb|CAB78716.1| limonene cyclase like protein [Arabidopsis thaliana]
Length = 1024
Score = 184 bits (467), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 120/334 (35%), Positives = 182/334 (54%), Gaps = 27/334 (8%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEP--RGNYASESYFKKAFMKSH--FVTKDPSKA 56
M +VY+YP P+ EP G YASE +F K M+S+ FVTK+P +A
Sbjct: 684 MELILKVYIYPDGDK--------PIFHEPHLNGIYASEGWFMK-LMESNKQFVTKNPERA 734
Query: 57 DLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRS 116
LF++P+S+ +++ ++ + P + Q T G + G
Sbjct: 735 HLFYMPYSVKQLQ--KKTTSTCSPSNTPSGTALMGQIISLSLATIGYRKCFYVKDQWGPY 792
Query: 117 AMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLGS----SKRNK 170
+ + E+K NAI+ +C++ + KDVSLP+ R P +G+ S+R
Sbjct: 793 TVNEHPELKRNAIKALCNADLSDGIFVPGKDVSLPETSIRNAGRPLRNIGNGNRVSQRPI 852
Query: 171 LAFFAGAVNSPVREKLLQVWRN---DSEIYA---HSGRLKTPYADELLGSKFCLHVKGFE 224
LAFFAG ++ VR KLL+ WRN D +IY H+ K Y + SK+CL G+E
Sbjct: 853 LAFFAGNLHGRVRPKLLKHWRNKDEDMKIYGPLPHNVARKMTYVQHMKSSKYCLCPMGYE 912
Query: 225 VNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEY 284
VN+ RI +++YY CVPV+IA+++ LPF+DVL+W +FS+VV +IP LK+IL I Y
Sbjct: 913 VNSPRIVEAIYYECVPVVIADNFMLPFSDVLDWSAFSVVVPEKEIPRLKEILLEIPMRRY 972
Query: 285 LLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELW 318
L +QS V V++HF W P YD F+M+++ +W
Sbjct: 973 LKMQSNVKMVQRHFLWSPKPRKYDVFHMILHSIW 1006
>gi|147789666|emb|CAN71921.1| hypothetical protein VITISV_031190 [Vitis vinifera]
Length = 310
Score = 184 bits (466), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 104/291 (35%), Positives = 167/291 (57%), Gaps = 13/291 (4%)
Query: 45 KSHFVTKDPSKADLFFLPFSIARMRH----DRRIGTEGIPDFISHYIFNISQKYPYWNRT 100
KS F+ + P +A+ F++P S+ R+ H + IP + YI ++ KYPYWNR+
Sbjct: 18 KSQFLARHPDEANAFYIPMSLTRVVHFIYEPPHYXGKWIPRLVXDYINFVADKYPYWNRS 77
Query: 101 GGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WPRQE- 158
GADHF V+CH ++ + I+ +C+++ H +D+S+P+I PR +
Sbjct: 78 KGADHFLVSCHDWAPDVSALKPDLYKHFIRALCNANTSERFHPI-RDISIPEINIPRGKL 136
Query: 159 DPPKLGS--SKRNKLAFFAGAVNSPVREKLLQVWR---NDSEIYAHSGRLKTPYADELLG 213
PP L +KR LAFFAG + VR L + W+ ++ +++ R + Y+ +
Sbjct: 137 GPPHLDQPPNKRPILAFFAGGAHGYVRSVLFKYWKEKDDEVQVFERLPRNRN-YSKSMGD 195
Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLK 273
SKFCL G+EV + RI ++ GCVP+II +HY LPF+D L+W FSI + + IP +K
Sbjct: 196 SKFCLCPSGYEVASPRIVKAIAAGCVPMIICDHYSLPFSDXLDWSKFSIYITSDKIPEIK 255
Query: 274 KILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSV 324
KILK + +E YL +Q V +V++HF + YD +M+++ +W RR +V
Sbjct: 256 KILKAVPTETYLEMQKRVKQVQRHFAINRPAXPYDMLHMILHSVWXRRLNV 306
>gi|359484704|ref|XP_002264922.2| PREDICTED: probable glycosyltransferase At5g20260-like [Vitis
vinifera]
Length = 467
Score = 183 bits (465), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 108/294 (36%), Positives = 174/294 (59%), Gaps = 14/294 (4%)
Query: 46 SHFVTKDPSKADLFFLPFSIARMRH---DRRIGTEG--IPDFISHYIFNISQKYPYWNRT 100
SHFV P +A +F++P S+ R+ H + G + ++ YI+ +S KYPYWNR+
Sbjct: 175 SHFVAGHPDEAHVFYIPISVTRIAHYIYSPPVDYSGHMLQRLVTDYIYVVSDKYPYWNRS 234
Query: 101 GGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WPRQE- 158
GADHF V+CH ++ + I+V+C+++ +D+SLP++ P+ +
Sbjct: 235 NGADHFLVSCHDWAPEISIVTPDLYKHFIRVLCNANTSERFQPI-RDISLPEVNIPKGKL 293
Query: 159 DPPKLGS--SKRNKLAFFAGAVNSPVREKLLQVWR-NDSEI--YAHSGRLKTPYADELLG 213
PP L ++R+ LAFFAG + +R L + W+ ND E+ Y H + YA +
Sbjct: 294 GPPHLDKPPNQRHILAFFAGRESGYMRTLLFRSWKENDDEVQVYEHLPSNRD-YAKSMGD 352
Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLK 273
SKFCL G+EV + R+ +++ GCVPVII ++Y LPF++VL W FSI + + IP +K
Sbjct: 353 SKFCLCPSGWEVASPRVVEAIAAGCVPVIICDYYVLPFSEVLVWSKFSINITSDKIPEIK 412
Query: 274 KILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
KILK + +E YL +Q V +V++HF + YD +M+++ +WLRR +VR++
Sbjct: 413 KILKAVPNERYLRMQKRVKQVQRHFVINRPAQPYDMLHMILHSVWLRRLNVRLR 466
>gi|356565153|ref|XP_003550809.1| PREDICTED: probable glycosyltransferase At5g20260-like [Glycine
max]
Length = 398
Score = 183 bits (465), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 102/299 (34%), Positives = 169/299 (56%), Gaps = 15/299 (5%)
Query: 45 KSHFVTKDPSKADLFFLPFSIARMRH--------DRRIGTEGIPDFISHYIFNISQKYPY 96
+S F ++P +A FFLP S+ + H + + + YI ++ KYPY
Sbjct: 100 RSPFKARNPDEAHAFFLPLSVVNVVHYVYKPYMSQNDYSRDRLQRLVEDYIGVVADKYPY 159
Query: 97 WNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW-- 154
WNR+ GADHF ++CH ++ N I+V+C+++ G +DVS+P+++
Sbjct: 160 WNRSNGADHFLLSCHDWAPEISHANPDLFKNFIRVLCNANNS-EGFQPKRDVSIPEVYLP 218
Query: 155 PRQEDPPKLGSSKRNK--LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYADEL 211
+ PP LG N+ LAFF+G + +R+ LL+ W++ D+ + H K EL
Sbjct: 219 VGKLGPPNLGQHPMNRTILAFFSGGAHGDIRKLLLKHWKDKDNHVQVHEYLPKGQNYTEL 278
Query: 212 LG-SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIP 270
+G SKFCL G+EV + R+ +++ GCVPVII+ +Y LPF+DVLNW FSI ++ +I
Sbjct: 279 MGLSKFCLCPSGYEVASPRVVEAINAGCVPVIISENYSLPFSDVLNWSQFSIQISVENIS 338
Query: 271 LLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQWS 329
+K IL+ ++ ++Y L V +V++HF + +D +M+++ +WLRR + RV S
Sbjct: 339 DIKTILQNVTQKKYKKLHRNVRRVQRHFVMNRPAKPFDLMHMILHSIWLRRLNFRVMAS 397
>gi|359484708|ref|XP_002264880.2| PREDICTED: probable glycosyltransferase At5g20260-like [Vitis
vinifera]
Length = 467
Score = 183 bits (465), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 106/294 (36%), Positives = 175/294 (59%), Gaps = 14/294 (4%)
Query: 46 SHFVTKDPSKADLFFLPFSIARMRH---DRRIGTEG--IPDFISHYIFNISQKYPYWNRT 100
SHF+ P A +F++P S+ R+ H + G + ++ YI+ +S KYPYWNR+
Sbjct: 175 SHFMAGHPDVAHVFYIPISVTRIAHYIYSPPVDYSGHMLQRLVTDYIYVVSNKYPYWNRS 234
Query: 101 GGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WPRQE- 158
GADHF V+CH ++ + I+V+C+++ +D+SLP++ P+ +
Sbjct: 235 NGADHFLVSCHDWAPEISIVTPDLYKHFIRVLCNANTSERFQPI-RDISLPEVNIPKGKL 293
Query: 159 DPPKLGS--SKRNKLAFFAGAVNSPVREKLLQVWR-NDSEI--YAHSGRLKTPYADELLG 213
PP L ++R+ LAFF+G + +R L + W+ ND E+ Y H + YA ++
Sbjct: 294 GPPHLDKPPNQRHILAFFSGRESGYMRTLLFRSWKENDDEVQVYEHLPSNRD-YAKSMVD 352
Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLK 273
SKFCL G+EV + R+ +++ GCVPVII ++Y LPF++VL+W FSI + + IP +K
Sbjct: 353 SKFCLCPSGWEVASPRVVEAIAAGCVPVIICDYYVLPFSEVLDWSKFSINITSDKIPEIK 412
Query: 274 KILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
KILK + +E YL +Q V +V++HF + YD +M+++ +WLRR +VR++
Sbjct: 413 KILKAVPNERYLRMQKRVKQVQRHFVINRPAQPYDMLHMILHSVWLRRLNVRLR 466
>gi|413952956|gb|AFW85605.1| hypothetical protein ZEAMMB73_586394 [Zea mays]
Length = 509
Score = 183 bits (465), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 103/291 (35%), Positives = 166/291 (57%), Gaps = 15/291 (5%)
Query: 50 TKDPSKADLFFLPFSIARM-RHDRRIGT---EGIPDFISHYIFNISQKYPYWNRTGGADH 105
T +P++A FFLPFS+++M + R T + ++ Y+ ++ ++PYWNR+ GADH
Sbjct: 218 TSEPARAHAFFLPFSVSQMVQFAYRPNTYDKTPLRAIVADYVRVVASRHPYWNRSAGADH 277
Query: 106 FYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI------WPRQED 159
F +ACH G A E+ N I+ +C+++ G +DVS+P I PRQ
Sbjct: 278 FMLACHDWGPEASTGHPELHANGIRALCNANSS-EGFRPWQDVSVPDINLYDGDMPRQLL 336
Query: 160 PPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSE----IYAHSGRLKTPYADELLGSK 215
P G + R LAFFAG + +R+ LL+ W+ +Y H Y + ++
Sbjct: 337 APAPGVTSRPFLAFFAGGRHGHIRDLLLRHWKGRDPDFFPVYEHRHEDGFDYYSFMRRAR 396
Query: 216 FCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKI 275
FCL G++V + R+ +S++ CVPVI+++ Y LPFADVL W++FS+ V DIP L+++
Sbjct: 397 FCLCPSGYQVASPRVVESIHAECVPVILSDGYVLPFADVLRWEAFSVAVPVADIPRLREV 456
Query: 276 LKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
L+ I + E LQ V V++HF + P D F M+++ +WLRR ++R+
Sbjct: 457 LERIPAPEVEKLQQGVRLVKRHFMLNQPPQRLDMFNMILHSVWLRRLNLRL 507
>gi|296084508|emb|CBI25529.3| unnamed protein product [Vitis vinifera]
Length = 388
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 106/294 (36%), Positives = 175/294 (59%), Gaps = 14/294 (4%)
Query: 46 SHFVTKDPSKADLFFLPFSIARMRH---DRRIGTEG--IPDFISHYIFNISQKYPYWNRT 100
SHF+ P A +F++P S+ R+ H + G + ++ YI+ +S KYPYWNR+
Sbjct: 96 SHFMAGHPDVAHVFYIPISVTRIAHYIYSPPVDYSGHMLQRLVTDYIYVVSNKYPYWNRS 155
Query: 101 GGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WPRQE- 158
GADHF V+CH ++ + I+V+C+++ +D+SLP++ P+ +
Sbjct: 156 NGADHFLVSCHDWAPEISIVTPDLYKHFIRVLCNANTSERFQPI-RDISLPEVNIPKGKL 214
Query: 159 DPPKLGS--SKRNKLAFFAGAVNSPVREKLLQVWR-NDSEI--YAHSGRLKTPYADELLG 213
PP L ++R+ LAFF+G + +R L + W+ ND E+ Y H + YA ++
Sbjct: 215 GPPHLDKPPNQRHILAFFSGRESGYMRTLLFRSWKENDDEVQVYEHLPSNRD-YAKSMVD 273
Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLK 273
SKFCL G+EV + R+ +++ GCVPVII ++Y LPF++VL+W FSI + + IP +K
Sbjct: 274 SKFCLCPSGWEVASPRVVEAIAAGCVPVIICDYYVLPFSEVLDWSKFSINITSDKIPEIK 333
Query: 274 KILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
KILK + +E YL +Q V +V++HF + YD +M+++ +WLRR +VR++
Sbjct: 334 KILKAVPNERYLRMQKRVKQVQRHFVINRPAQPYDMLHMILHSVWLRRLNVRLR 387
>gi|357478275|ref|XP_003609423.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
gi|355510478|gb|AES91620.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
Length = 398
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 108/297 (36%), Positives = 171/297 (57%), Gaps = 15/297 (5%)
Query: 45 KSHFVTKDPSKADLFFLPFSIARM----RHDRRIGTEGIPD----FISHYIFNISQKYPY 96
KS F P A +FFLPFS++++ R+ ++ P + YI ++ KYPY
Sbjct: 96 KSPFKATHPELAHVFFLPFSVSKVIRYVYKPRKSRSDYNPHRLQLLVEDYIKIVANKYPY 155
Query: 97 WNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WP 155
WN + GADHF ++CH G ++ + I+ +C+++ G ++DVS+PQ+ P
Sbjct: 156 WNISQGADHFLLSCHDWGPRVSYANPKLFKHFIRALCNANTS-EGFWPNRDVSIPQLNLP 214
Query: 156 RQE-DPPKLGSSKRNK--LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYADEL 211
+ PP N+ L FFAG + +R+KLL+ W++ D E+ H K +L
Sbjct: 215 VGKLGPPNTDQHPNNRTILTFFAGGAHGKIRKKLLKSWKDKDEEVQVHEYLPKGQDYTKL 274
Query: 212 LG-SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIP 270
+G SKFCL G EV + R+ +++Y GCVPVII ++Y LPF+DVLNW FS+ +A IP
Sbjct: 275 MGLSKFCLCPSGHEVASPRVVEAIYAGCVPVIICDNYSLPFSDVLNWSQFSMEIAVDRIP 334
Query: 271 LLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
+K IL+ I+ +Y +L S V +VRKHF+ + +D +M+++ +WLRR + R+
Sbjct: 335 EIKTILQNITETKYRVLYSNVRRVRKHFEMNRPAKPFDLIHMILHSVWLRRLNFRLH 391
>gi|413937765|gb|AFW72316.1| hypothetical protein ZEAMMB73_504926 [Zea mays]
Length = 472
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 114/346 (32%), Positives = 180/346 (52%), Gaps = 29/346 (8%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFM--KSHFVTKDPSKADL 58
M R F+++ Y P A++ D Y+ E F + ++ F +DP +A
Sbjct: 138 MERKFKIWTY-REGEPPLAHLGPSADI-----YSIEGQFLEEIEDPRNPFAARDPGEAHA 191
Query: 59 FFLPFSIARMRH-DRRIGTEG----IPDFISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
F LP S+ + H R+ T + ++ Y+ ++ KYPYWNR+ GADH V+CH
Sbjct: 192 FLLPVSVCNLVHYIYRLNTTAYMAHMRRALADYVDVVAHKYPYWNRSRGADHVIVSCHDW 251
Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI--------WPRQEDPPKLGS 165
+ E+ NAI+V+C+++ G KD +LP++ P PP+
Sbjct: 252 APLVSEANRELYANAIRVLCNANTS-EGFRPRKDATLPEVNLADGLLRRPTLGLPPE--- 307
Query: 166 SKRNKLAFFAGAVNSPVREKLLQVW--RNDSEIYAHS-GRLKTPYADELLGSKFCLHVKG 222
R LAFFAG ++ +R LL W R D ++ H Y + ++FCL G
Sbjct: 308 -NRTTLAFFAGGMHGHIRRALLGYWLGRKDPDMDIHEYLPAGQDYHALMARARFCLCPSG 366
Query: 223 FEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSE 282
FEV + R+ +S++ GCVPVII++ Y PF+DVL+W S+ V IP LK +LKG+S
Sbjct: 367 FEVASPRVVESVFTGCVPVIISDGYPPPFSDVLDWSKMSVTVPPARIPELKAVLKGVSER 426
Query: 283 EYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQW 328
Y +L++ VL+ ++HF H +D MV++ +WLRR +VR+ +
Sbjct: 427 RYRVLRARVLQAQRHFVVHRPARRFDMIRMVLHSIWLRRINVRLPY 472
>gi|255582307|ref|XP_002531944.1| catalytic, putative [Ricinus communis]
gi|223528390|gb|EEF30426.1| catalytic, putative [Ricinus communis]
Length = 363
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 115/347 (33%), Positives = 184/347 (53%), Gaps = 62/347 (17%)
Query: 33 YASESYFKKAFM--KSHFVTKDPSKADLFFLPFSIARM---------RHDRRIGTEGIPD 81
YA E F +S F+ + +A +FFLP S+A + +DR + +
Sbjct: 26 YAIEGQFIDEMESGESPFIARHADEAHVFFLPVSVAHIVEYIYLPITSYDR----DKLIR 81
Query: 82 FISHYIFNISQKYPYWNRTGGADHFYVACH------------------SIGRSAM----- 118
Y+ ++ KYP+WNR+ G+DHF ++CH ++GR+
Sbjct: 82 VFKDYVKVVADKYPFWNRSSGSDHFMLSCHDWAMVHINSIPVELVRAFALGRNLKFEISF 141
Query: 119 ---------QKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPK------- 162
Q E+ N I+V+C+++ G + +DV+LP++ PP
Sbjct: 142 HSHRAPEISQHDPELYKNLIRVICNANTS-EGFLPTRDVTLPEL----NIPPGGFDHVHH 196
Query: 163 -LGSSKRNKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYADELLG-SKFCLH 219
L S KR LAFFAG + +R+ LL W+N D E+ H K +L+G SKFCL
Sbjct: 197 CLPSHKRRILAFFAGGAHGYIRKILLHHWKNKDDEVQVHEYLSKDEDYRKLMGQSKFCLC 256
Query: 220 VKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGI 279
G+EV + RI +S+Y GC+PVII++HY+LPF+DVL+W S+ + IP +K ILKG+
Sbjct: 257 PSGYEVASPRIVESIYAGCIPVIISDHYNLPFSDVLDWSQISVQIPVEKIPEIKTILKGV 316
Query: 280 SSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
S+++YL +Q V +V++HF+ + +D +MV++ +WLRR +R+
Sbjct: 317 SNDKYLRMQKRVRRVQRHFEINRPSKPFDVLHMVLHSVWLRRLDIRL 363
>gi|326517848|dbj|BAK03842.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 451
Score = 181 bits (459), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 108/309 (34%), Positives = 168/309 (54%), Gaps = 32/309 (10%)
Query: 50 TKDPSKADLFFLPFSIARMRH--DRRIGTEGIP--DFISHYIFNISQKYPYWNRTGGADH 105
T+D +A FFLPFS+A+M R++ + P + Y+ ++ ++P+WNR+ GADH
Sbjct: 143 TRDADRAHAFFLPFSVAQMMQFAYRQLSYDRGPLLSLVGDYVRVVASRHPFWNRSAGADH 202
Query: 106 FYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI------WPRQED 159
F ++CH G A + E+ N I+ +C+++ G KDVS+P+I PRQ
Sbjct: 203 FMLSCHDWGPDASKGDPELYANGIRALCNANT-SEGFRPGKDVSIPEINLYDGDTPRQLL 261
Query: 160 PPKLGSSKRNKLAFFAGAVNSPVREKLLQVWR--------------------NDSEIYAH 199
P G S R LAFFAG + VR+ LL+ W+ N S +
Sbjct: 262 GPSPGLSARPYLAFFAGGRHGHVRDLLLRHWKGRDPATFPVYEYDIPSTTGGNSSGRHNR 321
Query: 200 SGR-LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWK 258
GR ++ Y + S+FCL G EV + R+ ++++ CVPV+++ Y PFADVL W+
Sbjct: 322 RGRDRQSDYFAYMHRSRFCLCPSGHEVASPRVVEAIHAECVPVLVSEGYAPPFADVLRWE 381
Query: 259 SFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELW 318
SFS+ V +DIP LK++L+GI E L+ V V++HF P D F+M+++ +W
Sbjct: 382 SFSVSVPVVDIPRLKEVLEGIPMAEVERLREGVRLVKRHFTLRQPPERLDMFHMILHSVW 441
Query: 319 LRRSSVRVQ 327
LRR + R+
Sbjct: 442 LRRLNFRLD 450
>gi|356507416|ref|XP_003522463.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
At3g07620-like [Glycine max]
Length = 471
Score = 180 bits (457), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 117/335 (34%), Positives = 178/335 (53%), Gaps = 24/335 (7%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKS-HFVTKDPSKADLF 59
M + F+++VY F N L + YA+E F K ++ T DP +A ++
Sbjct: 151 MEKVFKIFVYEEGEPPLFHNGL------SKDIYATEGRFIHEMEKGRYYRTYDPDEAFVY 204
Query: 60 FLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQ 119
+LPFS + + + + YI I+ K+P+WNR+ G DHF ++CH G
Sbjct: 205 YLPFS--------GVYVDPLGLVVKDYIQVIAHKHPFWNRSLGYDHFMLSCHDWGPLVSS 256
Query: 120 KAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS----SKRNKLAFFA 175
NAI+V+C+++ G KDVS P+I + + L S+R LAFFA
Sbjct: 257 YVDHFYNNAIRVLCNAN-VSEGFKPAKDVSFPEIKLIKGEVTNLVGGYPPSQRTILAFFA 315
Query: 176 GAVNSPVREKLLQVWRN---DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIAD 232
G + +R L W+N D +IY Y +L SKFCL G+EV + R+
Sbjct: 316 GHQHGYIRXLLQSTWKNKDQDMQIYEELPE-GISYYTKLRSSKFCLCPSGYEVASPRVVK 374
Query: 233 SLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVL 292
+++ CVPV+I++ Y PF+DVLNW SFS+ V DIP +KKIL GIS +YL + V
Sbjct: 375 AIFAECVPVLISDGYVPPFSDVLNWNSFSVQVDVKDIPNIKKILMGISERQYLRMYKRVK 434
Query: 293 KVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
+V++HF + P YD F+M ++ +WLRR ++ +Q
Sbjct: 435 QVQRHFVPNEPPKRYDMFHMTVHSIWLRRLNIHIQ 469
>gi|297818734|ref|XP_002877250.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323088|gb|EFH53509.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 422
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 103/293 (35%), Positives = 166/293 (56%), Gaps = 15/293 (5%)
Query: 48 FVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFIS---HYIFN-----ISQKYPYWNR 99
F P +A FFLPFS+A + H DF H IFN +++K+P+WN+
Sbjct: 129 FRASRPEEAHAFFLPFSVANIVHYVYQPITSPADFNRARLHRIFNDYVDVVARKHPFWNQ 188
Query: 100 TGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW--PRQ 157
+ GADHF V+CH E + ++ +C+++ G + D+S+P+I R+
Sbjct: 189 SNGADHFMVSCHDWAPDVADSKPEFFKDFMRGLCNANT-TEGFRPNIDISIPEINIPKRK 247
Query: 158 EDPPKLGSSKRNK--LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYADELLG- 213
PP +G + N+ LAFFAG + +RE L W+ D ++ + K EL+G
Sbjct: 248 LKPPFMGQTPENRTILAFFAGRAHGYIREVLFTHWKGKDKDVQVYDHLTKGQNYHELIGH 307
Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLK 273
SKFCL G+EV + R +++Y GCVPV+I+++Y LPF DVL+W FS+ + IP +K
Sbjct: 308 SKFCLCPSGYEVASPREVEAIYSGCVPVVISDNYSLPFNDVLDWSKFSVEIPVDKIPDIK 367
Query: 274 KILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
KIL+ I ++Y+ + V+KVR+HF + +D +M+++ +WLRR ++R+
Sbjct: 368 KILQEIPHDKYIRMYQNVMKVRRHFVVNRPAQPFDVIHMILHSVWLRRLNIRL 420
>gi|297812207|ref|XP_002873987.1| hypothetical protein ARALYDRAFT_326421 [Arabidopsis lyrata subsp.
lyrata]
gi|297319824|gb|EFH50246.1| hypothetical protein ARALYDRAFT_326421 [Arabidopsis lyrata subsp.
lyrata]
Length = 654
Score = 178 bits (452), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 120/332 (36%), Positives = 179/332 (53%), Gaps = 21/332 (6%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMK--SHFVTKDPSKADL 58
M + F+V+VY R D + PV+ Y+ E F S F P +A
Sbjct: 320 MEKKFKVWVY--REGDTPLVHMGPVN----NIYSIEGQFMDEIETGMSPFAASGPEEAHA 373
Query: 59 FFLPFSIARMRHD--RRIGT---EGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
F LP SIA + H R + T E + Y+ ++ KYPYWNR+ GADHF+V+CH
Sbjct: 374 FLLPVSIANVVHYLYRPLVTYSREQLHKVFLDYVNVVAHKYPYWNRSLGADHFFVSCHDW 433
Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WPR-QEDPPKLGSSK---R 168
E+ N I+V+C+++ G + +DVS+P+I PR Q PP+L S R
Sbjct: 434 APDVSGSNPEMMKNLIRVLCNANT-SEGFMPQRDVSIPEINIPRGQLGPPQLSRSSGHDR 492
Query: 169 NKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTP-YADELLGSKFCLHVKGFEVN 226
LAFF+G + +R+ LLQ W++ D E+ H Y + ++FCL G+EV
Sbjct: 493 PILAFFSGGSHGYIRKILLQHWKDKDEEVQVHEYLTNNKDYFKLMATARFCLCPSGYEVA 552
Query: 227 TARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLL 286
+ R+ ++ GCVPVII++HY LPF+DVL+W F+I V + IP +K ILK IS Y +
Sbjct: 553 SPRVVAAINLGCVPVIISDHYALPFSDVLDWTKFTIHVPSEKIPEIKTILKSISWRRYKV 612
Query: 287 LQSYVLKVRKHFQWHVFPSDYDAFYMVMYELW 318
LQ VL+V++HF + +D M+++ +W
Sbjct: 613 LQRRVLQVQRHFVINRPSQPFDMLRMLLHSVW 644
>gi|145339080|ref|NP_189804.4| putative glycosyltransferase [Arabidopsis thaliana]
gi|292630777|sp|Q3EAR7.2|GLYT2_ARATH RecName: Full=Probable glycosyltransferase At3g42180
gi|332644208|gb|AEE77729.1| putative glycosyltransferase [Arabidopsis thaliana]
Length = 470
Score = 178 bits (452), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 107/311 (34%), Positives = 168/311 (54%), Gaps = 15/311 (4%)
Query: 30 RGNYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFIS---HY 86
G + E + F P +A FFLPFS+A + H DF H
Sbjct: 159 EGQFIDELSYVMGGPSGRFRASRPEEAHAFFLPFSVANIVHYVYQPITSPADFNRARLHR 218
Query: 87 IFN-----ISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISG 141
IFN ++ K+P+WN++ GADHF V+CH E N ++ +C+++ G
Sbjct: 219 IFNDYVDVVAHKHPFWNQSNGADHFMVSCHDWAPDVPDSKPEFFKNFMRGLCNANT-SEG 277
Query: 142 HIAHKDVSLPQIW--PRQEDPPKLGSSKRNK--LAFFAGAVNSPVREKLLQVWRN-DSEI 196
+ D S+P+I R+ PP +G + N+ LAFFAG + +RE L W+ D ++
Sbjct: 278 FRRNIDFSIPEINIPKRKLKPPFMGQNPENRTILAFFAGRAHGYIREVLFSHWKGKDKDV 337
Query: 197 YAHSGRLKTPYADELLG-SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVL 255
+ K EL+G SKFCL G+EV + R +++Y GCVPV+I+++Y LPF DVL
Sbjct: 338 QVYDHLTKGQNYHELIGHSKFCLCPSGYEVASPREVEAIYSGCVPVVISDNYSLPFNDVL 397
Query: 256 NWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMY 315
+W FS+ + IP +KKIL+ I ++YL + V+KVR+HF + +D +M+++
Sbjct: 398 DWSKFSVEIPVDKIPDIKKILQEIPHDKYLRMYRNVMKVRRHFVVNRPAQPFDVIHMILH 457
Query: 316 ELWLRRSSVRV 326
+WLRR ++R+
Sbjct: 458 SVWLRRLNIRL 468
>gi|109946595|gb|ABG48476.1| At3g42180 [Arabidopsis thaliana]
Length = 425
Score = 178 bits (452), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 107/311 (34%), Positives = 168/311 (54%), Gaps = 15/311 (4%)
Query: 30 RGNYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFIS---HY 86
G + E + F P +A FFLPFS+A + H DF H
Sbjct: 114 EGQFIDELSYVMGGPSGRFRASRPEEAHAFFLPFSVANIVHYVYQPITSPADFNRARLHR 173
Query: 87 IFN-----ISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISG 141
IFN ++ K+P+WN++ GADHF V+CH E N ++ +C+++ G
Sbjct: 174 IFNDYVDVVAHKHPFWNQSNGADHFMVSCHDWAPDVPDSKPEFFKNFMRGLCNANT-SEG 232
Query: 142 HIAHKDVSLPQIW--PRQEDPPKLGSSKRNK--LAFFAGAVNSPVREKLLQVWRN-DSEI 196
+ D S+P+I R+ PP +G + N+ LAFFAG + +RE L W+ D ++
Sbjct: 233 FRRNIDFSIPEINIPKRKLKPPFMGQNPENRTILAFFAGRAHGYIREVLFSHWKGKDKDV 292
Query: 197 YAHSGRLKTPYADELLG-SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVL 255
+ K EL+G SKFCL G+EV + R +++Y GCVPV+I+++Y LPF DVL
Sbjct: 293 QVYDHLTKGQNYHELIGHSKFCLCPSGYEVASPREVEAIYSGCVPVVISDNYSLPFNDVL 352
Query: 256 NWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMY 315
+W FS+ + IP +KKIL+ I ++YL + V+KVR+HF + +D +M+++
Sbjct: 353 DWSKFSVEIPVDKIPDIKKILQEIPHDKYLRMYRNVMKVRRHFVVNRPAQPFDVIHMILH 412
Query: 316 ELWLRRSSVRV 326
+WLRR ++R+
Sbjct: 413 SVWLRRLNIRL 423
>gi|6735291|emb|CAB68119.1| putative protein [Arabidopsis thaliana]
Length = 340
Score = 178 bits (451), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 107/311 (34%), Positives = 168/311 (54%), Gaps = 15/311 (4%)
Query: 30 RGNYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFIS---HY 86
G + E + F P +A FFLPFS+A + H DF H
Sbjct: 29 EGQFIDELSYVMGGPSGRFRASRPEEAHAFFLPFSVANIVHYVYQPITSPADFNRARLHR 88
Query: 87 IFN-----ISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISG 141
IFN ++ K+P+WN++ GADHF V+CH E N ++ +C+++ G
Sbjct: 89 IFNDYVDVVAHKHPFWNQSNGADHFMVSCHDWAPDVPDSKPEFFKNFMRGLCNANT-SEG 147
Query: 142 HIAHKDVSLPQI--WPRQEDPPKLGSSKRNK--LAFFAGAVNSPVREKLLQVWRN-DSEI 196
+ D S+P+I R+ PP +G + N+ LAFFAG + +RE L W+ D ++
Sbjct: 148 FRRNIDFSIPEINIPKRKLKPPFMGQNPENRTILAFFAGRAHGYIREVLFSHWKGKDKDV 207
Query: 197 YAHSGRLKTPYADELLG-SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVL 255
+ K EL+G SKFCL G+EV + R +++Y GCVPV+I+++Y LPF DVL
Sbjct: 208 QVYDHLTKGQNYHELIGHSKFCLCPSGYEVASPREVEAIYSGCVPVVISDNYSLPFNDVL 267
Query: 256 NWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMY 315
+W FS+ + IP +KKIL+ I ++YL + V+KVR+HF + +D +M+++
Sbjct: 268 DWSKFSVEIPVDKIPDIKKILQEIPHDKYLRMYRNVMKVRRHFVVNRPAQPFDVIHMILH 327
Query: 316 ELWLRRSSVRV 326
+WLRR ++R+
Sbjct: 328 SVWLRRLNIRL 338
>gi|90265152|emb|CAH67778.1| H0201G08.5 [Oryza sativa Indica Group]
gi|125546968|gb|EAY92790.1| hypothetical protein OsI_14593 [Oryza sativa Indica Group]
Length = 441
Score = 178 bits (451), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 113/347 (32%), Positives = 182/347 (52%), Gaps = 26/347 (7%)
Query: 1 MNRSFRVYVY-----PHRRNDPFANVLLPVDFEPRGNYASE-SYFKKAFMKSHFVTKDPS 54
M R F+VYVY P P N+ + G++ + + T DP+
Sbjct: 100 MERRFKVYVYEEGEPPILHEGPCKNI-----YTIEGSFIEQLELMSPSDAGGGVRTWDPT 154
Query: 55 KADLFFLPFSIARM-RHDRRIGTEGIPDF---ISHYIFNISQKYPYWNRTGGADHFYVAC 110
+A FFLPFS+++M + R ++ P ++ Y+ ++ ++P+WNR+ GADHF ++C
Sbjct: 155 RAHAFFLPFSVSQMVKFVYRPPSQDRPPLRAIVADYVRVVAARHPFWNRSAGADHFMLSC 214
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI------WPRQEDPPKLG 164
H G A + E+ NAI+ +C+++ G KDVS+P+I PR+ P G
Sbjct: 215 HDWGPYASRGQPELYTNAIRALCNANT-SEGFRPGKDVSVPEINLYDGDMPRELLAPAPG 273
Query: 165 SSKRNKLAFFAGAVNSPVREKLLQVWRNDSE----IYAHSGRLKTPYADELLGSKFCLHV 220
R LAFFAG + VR+ LL+ W+ +Y + Y + ++FCL
Sbjct: 274 LESRPLLAFFAGGRHGHVRDLLLRHWKGRDAATFPVYEYDLPAAGDYYSFMRRARFCLCP 333
Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGIS 280
G EV + R+ +++ CVPV+IA+ Y LPFADVL W++FS+ VA DIP L++ L+ I
Sbjct: 334 SGHEVASPRVVEAIQAECVPVVIADGYALPFADVLRWEAFSVAVAVGDIPRLRERLERIP 393
Query: 281 SEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
+ E L+ V V++H P D F M+++ +WLR ++R+
Sbjct: 394 AAEVERLRRGVRLVKRHLMLQQPPRRLDMFNMILHSVWLRGLNLRLH 440
>gi|115456850|ref|NP_001052025.1| Os04g0109900 [Oryza sativa Japonica Group]
gi|38345576|emb|CAE01775.2| OSJNBa0027H06.12 [Oryza sativa Japonica Group]
gi|113563596|dbj|BAF13939.1| Os04g0109900 [Oryza sativa Japonica Group]
gi|125589112|gb|EAZ29462.1| hypothetical protein OsJ_13536 [Oryza sativa Japonica Group]
Length = 441
Score = 177 bits (448), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 113/351 (32%), Positives = 180/351 (51%), Gaps = 34/351 (9%)
Query: 1 MNRSFRVYVY-----PHRRNDPFANVLLPVDFEPRGNYASE-SYFKKAFMKSHFVTKDPS 54
M R F+VYVY P P N+ + G++ + + T DP+
Sbjct: 100 MERRFKVYVYEEGEPPILHEGPCKNI-----YTIEGSFIEQLELMSPSDAGGGVRTWDPT 154
Query: 55 KADLFFLPFSIARM--------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHF 106
+A FFLPFS+++M DR + ++ Y+ ++ ++P+WNR+ GADHF
Sbjct: 155 RAHAFFLPFSVSQMVKFVYRPPSQDR----APLRAIVADYVRVVAARHPFWNRSAGADHF 210
Query: 107 YVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI------WPRQEDP 160
++CH G A + E+ NAI+ +C+++ G KDVS+P+I PR+
Sbjct: 211 MLSCHDWGPYASRGQPELYTNAIRALCNANT-SEGFRPGKDVSVPEINLYDGDMPRELLA 269
Query: 161 PKLGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSE----IYAHSGRLKTPYADELLGSKF 216
P G R LAFFAG + VR+ LL+ W+ +Y + Y + ++F
Sbjct: 270 PAPGLESRPLLAFFAGGRHGHVRDLLLRHWKGRDAATFPVYEYDLPAAGDYYSFMRRARF 329
Query: 217 CLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKIL 276
CL G EV + R+ +++ CVPV+IA+ Y LPFADVL W++FS+ VA DIP L++ L
Sbjct: 330 CLCPSGHEVASPRVVEAIQAECVPVVIADGYALPFADVLRWEAFSVAVAVGDIPRLRERL 389
Query: 277 KGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
+ I + E L+ V V++H P D F M+++ +WLR ++R+
Sbjct: 390 ERIPAAEVERLRRGVRLVKRHLMLQQPPRRLDMFNMILHSVWLRGLNLRLH 440
>gi|242065760|ref|XP_002454169.1| hypothetical protein SORBIDRAFT_04g025885 [Sorghum bicolor]
gi|241934000|gb|EES07145.1| hypothetical protein SORBIDRAFT_04g025885 [Sorghum bicolor]
Length = 338
Score = 177 bits (448), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 115/350 (32%), Positives = 180/350 (51%), Gaps = 34/350 (9%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLP-VDFEPRGNYASESYFKKAFM--KSHFVTKDPSKAD 57
M R F+++ Y + +P L P D Y+ E F + ++ F + P +A
Sbjct: 1 MERKFKIWTY--KEGEPPLTHLGPSADI-----YSIEGQFLEEIEDPRNPFAARHPGEAH 53
Query: 58 LFFLPFSIARM--------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVA 109
F LP S+ + R + + ++ Y+ ++ +YPYWNR+ GADH V+
Sbjct: 54 AFLLPVSVCNLVQYIYPFYRRNTTAYMAHMRRALADYVDVVAGRYPYWNRSRGADHVMVS 113
Query: 110 CHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI--------WPRQEDPP 161
CH + E+ NAI+V+C+++ S KD +LP++ P PP
Sbjct: 114 CHDWAPLVSEANGELYANAIRVLCNANTSESFR-PRKDATLPEVNLGDGLLRRPTFGMPP 172
Query: 162 KLGSSKRNKLAFFAGAVNSPVREKLLQVW--RNDSEIYAHSGRLKTPYADELLGS-KFCL 218
+ R LAFFAG ++ +R+ LL W R D ++ H K L+ S +FCL
Sbjct: 173 E----NRTTLAFFAGGMHGHIRKALLGYWLGRKDPDMDIHEYLPKGQDYHALMASARFCL 228
Query: 219 HVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKG 278
GFEV + R+ +S++ GCVPVII++ Y PF+DVL+W S+ V IP LK ILKG
Sbjct: 229 CPSGFEVASPRVVESVFSGCVPVIISDGYPPPFSDVLDWSKMSVTVPPARIPELKDILKG 288
Query: 279 ISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQW 328
+S Y +L++ VL+ ++HF H +D MVM+ +WLRR +VR+ +
Sbjct: 289 VSERRYRVLRARVLQAQRHFVVHRPSQRFDMIRMVMHSIWLRRLNVRLPY 338
>gi|255567222|ref|XP_002524592.1| catalytic, putative [Ricinus communis]
gi|223536145|gb|EEF37800.1| catalytic, putative [Ricinus communis]
Length = 388
Score = 176 bits (447), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 116/349 (33%), Positives = 183/349 (52%), Gaps = 42/349 (12%)
Query: 1 MNRSFRVYVY-----PHRRNDPFANVLLPVDFEPRGNYASESYFKKAF-MKSHFVTKDPS 54
M + F+++VY P P N+ Y+ E F + F T +P
Sbjct: 57 MEKLFKIFVYEEGEPPLLHYGPCKNI-----------YSMEGLFLSLMETDTKFRTLNPD 105
Query: 55 KADLFFLPFSIARMRH--------DRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHF 106
+A +FFLPFS+ + D+ + + D Y+ IS KY YWNR+ GADHF
Sbjct: 106 EAQVFFLPFSVVMIIEHLFHPIIRDKAVLERTVVD----YVRIISHKYTYWNRSLGADHF 161
Query: 107 YVACHSIGRSAMQKAWEVKLNAIQVVC---SSSYFISGHIAHKDVSLPQIWPRQEDPPKL 163
++CH G A ++ N+I+V+C +S YF KD S P+I + L
Sbjct: 162 MLSCHDWGPRATWYERQLYFNSIRVLCNANTSEYFN----PKKDASFPEINLITGEIADL 217
Query: 164 GS----SKRNKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLK-TPYADELLGSKFC 217
S R LAFF+G ++ +R L Q W+ D ++ + + Y + + S++C
Sbjct: 218 TGGLPPSNRTILAFFSGKMHGKLRPLLFQHWKEKDKDVLVYETFPEGLSYQEMMKKSRYC 277
Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILK 277
+ G EV + RI +++Y CVPV+I+ +Y PF+DVLNW+SFSI V+ DI LK IL
Sbjct: 278 ICPSGHEVASPRIVEAIYAECVPVLISQNYVFPFSDVLNWESFSIQVSVSDISNLKNILL 337
Query: 278 GISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
GI ++YL ++ V +V++HF + P YD F+M+++ +WLRR +V++
Sbjct: 338 GIPEDQYLRMRERVKQVQQHFLINNPPKRYDVFHMIIHSIWLRRLNVKL 386
>gi|356565159|ref|XP_003550812.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
At5g20260-like [Glycine max]
Length = 488
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 101/293 (34%), Positives = 165/293 (56%), Gaps = 14/293 (4%)
Query: 48 FVTKDPSKADLFFLPFSIARM-----RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGG 102
F+ + P +A +F LP S+ ++ + + Y I+ +YPYWNRT G
Sbjct: 196 FLARYPDEAHVFMLPISVTQIVRYVYNPLTTYSRDQLMRITVDYTNIIAHRYPYWNRTKG 255
Query: 103 ADHFYVACH--SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI----WPR 156
ADHF +CH + S + E+ N I+V+C+++ G KDV +P++ +
Sbjct: 256 ADHFLASCHDWAPDISREESGRELFKNIIRVLCNANTS-EGFKPEKDVPMPEMNLQGFKL 314
Query: 157 QEDPPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYADELLG-S 214
P + R+ LAFFAG + +R+ LL+ W++ D E+ H K L+G S
Sbjct: 315 SSPIPGFDLNNRSILAFFAGGAHGRIRKILLEHWKDKDEEVQVHEYLPKGVDYQGLMGQS 374
Query: 215 KFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKK 274
KFCL G+EV + RI +S+ GCVPVI++++Y LPF+DVL+W FS+ + + I +K
Sbjct: 375 KFCLCPSGYEVASPRIVESINIGCVPVIVSDYYQLPFSDVLDWSKFSLHIPSRRIAEIKT 434
Query: 275 ILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
ILK + +YL LQ V+KV++HF+ + +D F+M+++ +WLRR ++R+
Sbjct: 435 ILKNVPHAKYLKLQKRVMKVQRHFELNRPAKPFDVFHMILHSIWLRRLNIRLH 487
>gi|42568129|ref|NP_198314.2| xylogalacturonan beta-1,3-xylosyltransferase [Arabidopsis thaliana]
gi|292630939|sp|Q94AA9.2|XGD1_ARATH RecName: Full=Xylogalacturonan beta-1,3-xylosyltransferase;
AltName: Full=Protein XYLOGALACTURONAN DEFICIENT 1
gi|332006509|gb|AED93892.1| xylogalacturonan beta-1,3-xylosyltransferase [Arabidopsis thaliana]
Length = 500
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 115/341 (33%), Positives = 183/341 (53%), Gaps = 18/341 (5%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
MNR F+V+ Y F + + + G + E +S F P A +FF
Sbjct: 161 MNR-FKVWTYTEGEVPLFHDGPVNDIYGIEGQFMDEMCVDGPKSRSRFRADRPENAHVFF 219
Query: 61 LPFSIARMRH---DRRIGTEG-----IPDFISHYIFNISQKYPYWNRTGGADHFYVACHS 112
+PFS+A++ H EG + I Y+ ++ K+PYWNR+ G DHF V+CH
Sbjct: 220 IPFSVAKVIHFVYKPITSVEGFSRARLHRLIEDYVDVVATKHPYWNRSQGGDHFMVSCHD 279
Query: 113 IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW-PRQE-DPPKLGSSKRNK 170
+ ++ I+ +C+++ G + DVS+P+I+ P+ + P LG S R +
Sbjct: 280 WAPDVIDGNPKLFEKFIRGLCNANTS-EGFRPNVDVSIPEIYLPKGKLGPSFLGKSPRVR 338
Query: 171 --LAFFAGAVNSPVREKLLQVWR---NDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEV 225
LAFFAG + +R+ L Q W+ N+ ++Y K Y + SKFCL G+EV
Sbjct: 339 SILAFFAGRSHGEIRKILFQHWKEMDNEVQVYDRLPPGKD-YTKTMGMSKFCLCPSGWEV 397
Query: 226 NTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL 285
+ R +++Y GCVPVII+++Y LPF+DVLNW SFSI + I +K IL+ +S YL
Sbjct: 398 ASPREVEAIYAGCVPVIISDNYSLPFSDVLNWDSFSIQIPVSRIKEIKTILQSVSLVRYL 457
Query: 286 LLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
+ VL+V++HF + YD +M+++ +WLRR ++R+
Sbjct: 458 KMYKRVLEVKQHFVLNRPAKPYDVMHMMLHSIWLRRLNLRL 498
>gi|15146187|gb|AAK83577.1| AT5g33290/F19N2_10 [Arabidopsis thaliana]
gi|18252279|gb|AAL62020.1| AT5g33290/F19N2_10 [Arabidopsis thaliana]
Length = 341
Score = 174 bits (442), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 116/341 (34%), Positives = 184/341 (53%), Gaps = 18/341 (5%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
MNR F+V+ Y F + + + G + E +S F P A +FF
Sbjct: 2 MNR-FKVWTYTEGEVPLFHDGPVNDIYGIEGQFMDEMCVDGPKSRSRFRADRPENAHVFF 60
Query: 61 LPFSIARMRH---DRRIGTEG-----IPDFISHYIFNISQKYPYWNRTGGADHFYVACHS 112
+PFS+A++ H EG + I Y+ ++ K+PYWNR+ G DHF V+CH
Sbjct: 61 IPFSVAKVIHFVYKPITSVEGFSRARLHRLIEDYVDVVATKHPYWNRSQGGDHFMVSCHD 120
Query: 113 IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW-PRQE-DPPKLGSSKRNK 170
+ ++ I+ +C+++ G + DVS+P+I+ P+ + P LG S R +
Sbjct: 121 WAPDEIDGNPKLFEKFIRGLCNANT-SEGFRPNVDVSIPEIYLPKGKLGPSFLGKSPRVR 179
Query: 171 --LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRL--KTPYADELLGSKFCLHVKGFEV 225
LAFFAG + +R+ L Q W+ D+E+ + RL Y + SKFCL G+EV
Sbjct: 180 SILAFFAGRSHGEIRKILFQHWKEMDNEVQVYD-RLPPGKDYTKTMGMSKFCLCPSGWEV 238
Query: 226 NTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL 285
+ R +++Y GCVPVII+++Y LPF+DVLNW SFSI + I +K IL+ +S YL
Sbjct: 239 ASPREVEAIYAGCVPVIISDNYSLPFSDVLNWDSFSIQIPVSRIKEIKTILQSVSLVRYL 298
Query: 286 LLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
+ VL+V++HF + YD +M+++ +WLRR ++R+
Sbjct: 299 KMYKRVLEVKQHFVLNRPAKPYDVMHMMLHSIWLRRLNLRL 339
>gi|75223627|gb|ABA18110.1| exostosin family protein [Arabidopsis arenosa]
Length = 340
Score = 174 bits (441), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 103/293 (35%), Positives = 163/293 (55%), Gaps = 15/293 (5%)
Query: 48 FVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFIS---HYIFN-----ISQKYPYWNR 99
F P +A FFLPFS+A + H DF H IFN ++ K+P+WN+
Sbjct: 47 FRASRPEEAHAFFLPFSVANIVHYVYQPITSPADFNRARLHRIFNDYVDVVACKHPFWNQ 106
Query: 100 TGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI--WPRQ 157
+ GADHF V+CH E + ++ +C+++ G + D S+P+I R+
Sbjct: 107 SNGADHFMVSCHDWAPDVADSKPEFFKDFMRGLCNANT-TEGFRPNIDFSIPEINIPKRK 165
Query: 158 EDPPKLGSSKRNK--LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYADELLG- 213
PP +G + N+ LAFFAG + +RE L W+ D ++ + K EL+G
Sbjct: 166 LKPPFMGQTPENRTILAFFAGRAHGYIREVLFTHWKGKDKDVQVYDHLTKGQNYHELIGH 225
Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLK 273
SKFCL G+EV + R +++Y GCVPV+I+++Y LPF DVL+W FS+ + IP +K
Sbjct: 226 SKFCLCPSGYEVASPREVEAIYSGCVPVVISDNYSLPFKDVLDWSKFSVEIPVDKIPDIK 285
Query: 274 KILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
KIL+ I ++Y + V+KVR+HF + +D +M+++ +WLRR ++R+
Sbjct: 286 KILQEIPHDKYRRMYQNVMKVRRHFVVNRPAQPFDVIHMILHSVWLRRLNIRL 338
>gi|125540270|gb|EAY86665.1| hypothetical protein OsI_08049 [Oryza sativa Indica Group]
Length = 482
Score = 174 bits (441), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 110/359 (30%), Positives = 177/359 (49%), Gaps = 45/359 (12%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAF--MKSHFVTKDPSKADL 58
M R F+++ Y P A++ D Y+ E F +S F + P A
Sbjct: 138 MERRFKIWTY-REGEPPVAHIGPGTDI-----YSIEGQFMYEMDDPRSRFAARRPDDAHA 191
Query: 59 FFLPFSIARMRH-DRRIGTEG----IPDFISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
F LP S+ + H R+ G + ++ Y+ ++++YPYWNR+ GADH V+CH
Sbjct: 192 FLLPISVCNLVHYVYRLNATGDLAPLRGLVADYVRVVAERYPYWNRSRGADHVIVSCHDW 251
Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI--------WPRQEDPPKLGS 165
++ NAI+V+C+++ G KD +LP++ P PP+
Sbjct: 252 APMVTSAHRQLYGNAIRVLCNANTS-EGFRPRKDATLPEVNLADGVLRRPTAGLPPE--- 307
Query: 166 SKRNKLAFFAGAVNSPVREKLLQVWRNDSEI----------------YAHSGRLKTPYAD 209
R LAFFAG + +RE LL+ W ++ Y +G Y
Sbjct: 308 -NRTTLAFFAGGRHGHIRESLLRHWLIGNKGGAAADGDGDGDMRVHEYLPAG---EDYHA 363
Query: 210 ELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI 269
++ ++FCL GFEV + R+ +S++ GCVPVII+ Y PF DVL+W S+ V I
Sbjct: 364 QMAAARFCLCPSGFEVASPRVVESVFAGCVPVIISEGYPPPFGDVLDWGKMSVAVPAARI 423
Query: 270 PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQW 328
P L+ IL+ +S Y +L++ VL+ ++HF H +D +MV++ +WLRR +VR+ +
Sbjct: 424 PELRAILRRVSERRYRVLRARVLQAQRHFVLHRPARRFDMIHMVLHSIWLRRLNVRLPY 482
>gi|357478261|ref|XP_003609416.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
gi|355510471|gb|AES91613.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
Length = 380
Score = 174 bits (440), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 109/311 (35%), Positives = 168/311 (54%), Gaps = 20/311 (6%)
Query: 33 YASESYFKKAFMK--SHFVTKDPSKADLFFLPFSIARMRH-----DRRIGTEGIPDFISH 85
Y E +F S F T +P +A +F LP S+ M H + I
Sbjct: 71 YGIEGHFMTEIENRLSPFSTHNPDEAHVFMLPLSVTNMVHYLYNPLTTYSRDQIMHVTID 130
Query: 86 YIFNISQKYPYWNRTGGADHFYVACH----SIGRSAMQKAWEVKLNAIQVVCSSSYFISG 141
Y I+ KY YWNR+ GADH V+CH I R + K E+ N I+V+C+++ G
Sbjct: 131 YTNIIAHKYRYWNRSKGADHLLVSCHDWAPEISRESSGK--ELFKNLIKVLCNANTS-EG 187
Query: 142 HIAHKDVSLPQI----WPRQEDPPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRN-DSEI 196
+DVS+P++ + P S+ R+ LAFFAG + +R+ LL W+ D E+
Sbjct: 188 FDPKRDVSMPEMNLQGYKLSSPIPSKESNNRSILAFFAGGEHGMIRKTLLDQWKGKDKEV 247
Query: 197 YAHSGRLKTPYADELLG-SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVL 255
+ K +L+G SKFCL G+EV + R+ +S+ GCVPVI++++Y LPF+DVL
Sbjct: 248 LVYEYLPKKLKYFKLMGKSKFCLCPSGYEVASPRLVESINTGCVPVIVSDNYQLPFSDVL 307
Query: 256 NWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMY 315
+W FS+ + + I +K ILK + YL L VLKV++HF + +D F+M+++
Sbjct: 308 DWSKFSLHIPSKRISEIKTILKSVPHARYLKLHRRVLKVQRHFVLNPPAKPFDVFHMILH 367
Query: 316 ELWLRRSSVRV 326
+WLRR ++R+
Sbjct: 368 SIWLRRLNIRL 378
>gi|443732873|gb|ELU17436.1| hypothetical protein CAPTEDRAFT_228334 [Capitella teleta]
Length = 506
Score = 173 bits (439), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 108/339 (31%), Positives = 171/339 (50%), Gaps = 21/339 (6%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M + ++Y+Y + + +++ Y E F S ++T+DPS+A FF
Sbjct: 173 MTKELKIYMYTTK-------IDAHINYVNDWKYGVEELFIHLLKSSPYITQDPSEATFFF 225
Query: 61 LPFSIARMRH---DRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSA 117
LPF R DR + +S ++ I Y +W+RT GADHFYV H G +
Sbjct: 226 LPFRCFAYRKTISDRDRAQRFTEEMVSKILYEIKSNYSFWDRTLGADHFYVCAHDFGPAI 285
Query: 118 MQKAWE-VKLNAIQVVCSSSYFISGHIAHKDVSLP--------QIWPRQEDPPKLGSSKR 168
+ + + NAI +V ++ Y ++ HKD+SLP + + L S R
Sbjct: 286 VAGSDPFLHKNAIAMVNTADYEHIYYVPHKDISLPPHPSHGKNSLANIGKGGHGLNPSDR 345
Query: 169 NKLAFFAGAVN-SPVREKLLQVWRNDSEIYAHSGRLKTP-YADELLGSKFCLHVKGFEVN 226
LAF+AG ++ +R + W D + G L Y L SKFCL ++G E
Sbjct: 346 TVLAFYAGNLDRGRIRPSIKDFWSTDIDFRIFMGHLTDERYQHYLKTSKFCLILRGNEAW 405
Query: 227 TARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLL 286
+ + D++++GCVPVII+++YDLP +L+W F++V+ + LK+IL +S ++
Sbjct: 406 SPCLMDAIWFGCVPVIISDYYDLPLHGMLDWNQFAVVIRESKVKSLKEILLAVSPQKLTS 465
Query: 287 LQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
+Q + +V HF W+ P YDAF VMY+LW RR VR
Sbjct: 466 MQEKLKQVYGHFVWNDPPRPYDAFQSVMYQLWKRRGVVR 504
>gi|47497631|dbj|BAD19700.1| exostosin-like protein [Oryza sativa Japonica Group]
Length = 345
Score = 173 bits (439), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 110/359 (30%), Positives = 177/359 (49%), Gaps = 45/359 (12%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAF--MKSHFVTKDPSKADL 58
M R F+++ Y P A++ D Y+ E F +S F + P A
Sbjct: 1 MERRFKIWTY-REGEPPVAHIGPGTDI-----YSIEGQFMYEMDDPRSRFAARRPDDAHA 54
Query: 59 FFLPFSIARMRH-DRRIGTEG----IPDFISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
F LP S+ + H R+ G + ++ Y+ ++++YPYWNR+ GADH V+CH
Sbjct: 55 FLLPISVCNLVHYVYRLNATGDLAPLRGLVADYVRVVAERYPYWNRSRGADHVIVSCHDW 114
Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI--------WPRQEDPPKLGS 165
++ NAI+V+C+++ G KD +LP++ P PP+
Sbjct: 115 APMVTSAHRQLYGNAIRVLCNANTS-EGFRPRKDATLPEVNLADGVLRRPTAGLPPE--- 170
Query: 166 SKRNKLAFFAGAVNSPVREKLLQVWRNDSEI----------------YAHSGRLKTPYAD 209
R LAFFAG + +RE LL+ W ++ Y +G Y
Sbjct: 171 -NRTTLAFFAGGRHGHIRESLLRHWLIGNKGGAAADGDGDGDMRVHEYLPAG---EDYHA 226
Query: 210 ELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI 269
++ ++FCL GFEV + R+ +S++ GCVPVII+ Y PF DVL+W S+ V I
Sbjct: 227 QMAAARFCLCPSGFEVASPRVVESVFAGCVPVIISEGYPPPFGDVLDWGKMSVAVPAARI 286
Query: 270 PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQW 328
P L+ IL+ +S Y +L++ VL+ ++HF H +D +MV++ +WLRR +VR+ +
Sbjct: 287 PELRAILRRVSERRYRVLRARVLQAQRHFVLHRPARRFDMIHMVLHSIWLRRLNVRLPY 345
>gi|297805042|ref|XP_002870405.1| hypothetical protein ARALYDRAFT_493575 [Arabidopsis lyrata subsp.
lyrata]
gi|297316241|gb|EFH46664.1| hypothetical protein ARALYDRAFT_493575 [Arabidopsis lyrata subsp.
lyrata]
Length = 509
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 115/341 (33%), Positives = 184/341 (53%), Gaps = 18/341 (5%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
MNR F+V+ Y F + + + G + E +S F P A +FF
Sbjct: 170 MNR-FKVWTYTEGEVPLFHDGPVNDIYGIEGQFMDEMCVDGPKSRSRFRADHPEDAHVFF 228
Query: 61 LPFSIARMRH---DRRIGTEG-----IPDFISHYIFNISQKYPYWNRTGGADHFYVACHS 112
+PFS+A++ H EG + I Y+ ++ K+ YWNR+ G DHF V+CH
Sbjct: 229 IPFSVAKVIHFVYKPITSVEGFSRARLHRLIEDYVDVVATKHRYWNRSKGGDHFMVSCHD 288
Query: 113 IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW-PRQE-DPPKLGSSKRNK 170
+ ++ I+ +C+++ G + DVS+P+I+ P+ + P LG S R +
Sbjct: 289 WAPDVIDGNPKLFEKFIRALCNANT-SEGFRPNVDVSIPEIYLPKGKLGPSFLGKSPRIR 347
Query: 171 --LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLK--TPYADELLGSKFCLHVKGFEV 225
LAFFAG + +R+ L + W+ D+E+ + RL Y + SKFCL G+EV
Sbjct: 348 SILAFFAGRSHGEIRKILFKHWKEMDNEVQVYD-RLPPGKDYTKTMGMSKFCLCPSGWEV 406
Query: 226 NTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL 285
+ R +++Y GCVPVII+++Y LPF+DVLNW SFSI + IP +K IL+ +S YL
Sbjct: 407 ASPREVEAIYAGCVPVIISDNYSLPFSDVLNWDSFSIQIPVSRIPEIKTILQSVSLVRYL 466
Query: 286 LLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
+ VL+V++HF + YD +M+++ +WLRR ++R+
Sbjct: 467 KMYKRVLEVKQHFVLNRPAKPYDVMHMMLHSIWLRRLNLRL 507
>gi|26451681|dbj|BAC42936.1| unknown protein [Arabidopsis thaliana]
Length = 270
Score = 173 bits (438), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 98/252 (38%), Positives = 148/252 (58%), Gaps = 15/252 (5%)
Query: 81 DFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS 140
F+ +Y+ IS KY +WN+TGG+DHF VACH S + + I+ +C+S
Sbjct: 19 QFLKNYLDMISSKYSFWNKTGGSDHFLVACHDWAPSETR---QYMAKCIRALCNSD-VSE 74
Query: 141 GHIAHKDVSLPQ---IWPRQEDPPKLGS--SKRNKLAFFAGAVNSPVREKLLQVWRN--- 192
G + KDV+LP+ + PR+ G S+R LAFFAG ++ +R LLQ W
Sbjct: 75 GFVFGKDVALPETTILVPRRPLRALGGKPVSQRQILAFFAGGMHGYLRPLLLQNWGGNRD 134
Query: 193 -DSEIYAH--SGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDL 249
D +I++ + K Y + + SK+C+ KG EVN+ R+ ++L+Y CVPVII++++
Sbjct: 135 PDMKIFSEIPKSKGKKSYMEYMKSSKYCICPKGHEVNSPRVVEALFYECVPVIISDNFVP 194
Query: 250 PFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDA 309
PF +VLNW+SF++ V DIP LK IL I+ E Y +Q V V+KHF WH P +D
Sbjct: 195 PFFEVLNWESFAVFVLEKDIPDLKNILVSITEERYREMQMRVKMVQKHFLWHSKPERFDI 254
Query: 310 FYMVMYELWLRR 321
F+M+++ +W R
Sbjct: 255 FHMILHSIWYNR 266
>gi|356527767|ref|XP_003532479.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
At5g03795-like [Glycine max]
Length = 467
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 106/288 (36%), Positives = 160/288 (55%), Gaps = 20/288 (6%)
Query: 49 VTKDPSKADLFFLPFSIARMRHDRRI----GTEGIPDFISHYIFNISQKYPYWNRTGGAD 104
+ KDP+KA LF++PFS + H + + F+ Y IS K PY+NRTGGAD
Sbjct: 178 IMKDPAKAHLFYMPFSSRMLEHSLYVRNSHNRTNLRQFLKDYTDKISAKIPYFNRTGGAD 237
Query: 105 HFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPK-- 162
HF ACH + E I+ +C++ G +DVSLP+ + R P+
Sbjct: 238 HFLAACHDWAPYETRHHMEY---CIKALCNAD-VTQGFKIGRDVSLPEAYVRSVRDPQRD 293
Query: 163 LGSSKRNK---LAFFAGAVNSPVREKLLQVWRN---DSEIYA---HSGRLKTPYADELLG 213
LG ++ LAF AG ++ + LL+ W++ D +IY H K Y + +
Sbjct: 294 LGGKPPHQXPILAFHAGNMHGYLHPILLKHWKDKDPDMKIYGPMPHGVTSKMNYINHMKN 353
Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLK 273
SK+C+ KG+EVN+ R+ ++++ CVPVII++++ F +VLNW FSI++A DIP LK
Sbjct: 354 SKYCICPKGYEVNSPRMVEAIFXECVPVIISDNFVPHFFEVLNWDVFSIILAEKDIPNLK 413
Query: 274 KILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
+IL + +YL LQ V K +KHF WHV P YD F++ ++ +W R
Sbjct: 414 QILLSV-XHKYLELQLGVRKAQKHFFWHVKPLKYDLFHITLHLIWYNR 460
>gi|75214755|gb|ABA18105.1| exostosin family protein [Capsella rubella]
Length = 423
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 104/295 (35%), Positives = 165/295 (55%), Gaps = 15/295 (5%)
Query: 46 SHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFIS---HYIFN-----ISQKYPYW 97
S F P +A +FFLPFS+A + H DF H IFN +++KYP+W
Sbjct: 128 SRFRAVRPEEAHVFFLPFSVANIVHYVYKPIASPADFNRARLHRIFNDYVDVVARKYPFW 187
Query: 98 NRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WPR 156
++ GADHF V+CH E + ++ +C+++ G D S+P+I P+
Sbjct: 188 KQSNGADHFMVSCHDWAPDVPDSKPEFFKDFMRGLCNANT-SEGFKPSIDFSIPEINIPK 246
Query: 157 QE-DPPKLGSSKRNK--LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYADELL 212
+ PP +G + N+ LAFFAG + +RE L W+ D ++ + K EL
Sbjct: 247 GKLKPPFMGQNPENRTILAFFAGRAHGYIREVLFTHWKGKDKDVQVYDHLTKGQNYHELT 306
Query: 213 G-SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPL 271
G SKFCL G+EV + R +++Y GCVPV+I+++Y LPF+DVL+W FS+ + IP
Sbjct: 307 GHSKFCLCPSGYEVASPREVEAIYSGCVPVVISDNYSLPFSDVLDWSKFSVEIPVDRIPD 366
Query: 272 LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
+K IL+ I ++Y+ + VLKVRKHF + +D +M+++ +WLRR ++++
Sbjct: 367 IKNILQEIPHDKYIRMYQNVLKVRKHFVVNRPAQPFDVIHMILHSVWLRRLNIKL 421
>gi|443705810|gb|ELU02170.1| hypothetical protein CAPTEDRAFT_209273 [Capitella teleta]
Length = 473
Score = 171 bits (434), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 100/273 (36%), Positives = 155/273 (56%), Gaps = 11/273 (4%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRH---DRRIGTEGIPDFISHYIFN 89
Y ES F K KS FVTKD +A FFLPF A R+ DR + +S+ + +
Sbjct: 201 YGVESLFIKLLSKSSFVTKDAEEAHFFFLPFQCATYRNVIRDRAAAQNFTENLVSNILKD 260
Query: 90 ISQKYPYWNRTGGADHFYVACHSIGRSAMQKA-WEVKLNAIQVVCSSSYFISGHIAHKDV 148
IS +Y YW+R+ GADHFYV H +G S++ A ++ NAI +V ++ Y ++ HKD+
Sbjct: 261 ISSRYTYWDRSLGADHFYVCAHDMGASSVAAADANLQKNAIALVNTADYADPFYVPHKDI 320
Query: 149 SLPQIWPRQEDP-PKLG-----SSKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGR 202
+LP + P +G S++R LAF+AG ++S + + W NDS+I+ H G
Sbjct: 321 ALPPHPAHGKGSLPDIGRGGGKSTERPNLAFYAGNLDSGQLRPVFKDWLNDSDIHIHHGH 380
Query: 203 LK-TPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFS 261
+ Y L +KFCL +G V + + D+++ GCVPVII+++YDLP +++W F+
Sbjct: 381 MSDNVYIKNLQSAKFCLVPRGHRVWSPVVMDAVWTGCVPVIISDYYDLPLHGLIDWTHFA 440
Query: 262 IVVATLDIPLLKKILKGISSEEYLLLQSYVLKV 294
+ + ++ LK LK I E+ +QSY+ KV
Sbjct: 441 VFLKEKEVLSLKSKLKSIPEEKLRRMQSYIKKV 473
>gi|75214619|gb|ABA18091.1| exostosin family protein [Olimarabidopsis pumila]
Length = 343
Score = 171 bits (433), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 104/296 (35%), Positives = 165/296 (55%), Gaps = 15/296 (5%)
Query: 48 FVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFIS---HYIFN-----ISQKYPYWNR 99
F P +A FFLPFS+A + H DF H IFN +++K+P+WN+
Sbjct: 48 FRAGRPEEAHAFFLPFSVANIVHYVSQPIASPADFNRARLHRIFNDYVEVVARKHPFWNQ 107
Query: 100 TGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI-WPRQE 158
+ GADHF V+CH E + I+ +C+++ G D S+P+I P+ +
Sbjct: 108 SNGADHFMVSCHDWAPDVAGSKPEFFKDFIRGLCNANT-SEGFRPSIDFSIPEINIPKGK 166
Query: 159 -DPPKLGSSKRNK--LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYADELLG- 213
PP +G + N+ LAFFAG + +RE L W+ D ++ + K EL+G
Sbjct: 167 LKPPFMGQTPENRTILAFFAGRAHGYIREVLFTHWKGKDKDVQVYDHLTKGQNYHELIGH 226
Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLK 273
SKFCL G+EV + R +++Y GCVPV+I+++Y LPF DVL+W FS+ + IP +K
Sbjct: 227 SKFCLCPSGYEVASPREVEAIYSGCVPVVISDNYSLPFNDVLDWSKFSVEIPVDKIPDIK 286
Query: 274 KILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQWS 329
KIL+ I E+Y+ + V+KV +HF + +D +M+++ +WLRR ++++ S
Sbjct: 287 KILQEIPHEKYIKMYHNVMKVGRHFVVNRPAQPFDVIHMILHSVWLRRLNIKLPSS 342
>gi|242094892|ref|XP_002437936.1| hypothetical protein SORBIDRAFT_10g005120 [Sorghum bicolor]
gi|241916159|gb|EER89303.1| hypothetical protein SORBIDRAFT_10g005120 [Sorghum bicolor]
Length = 412
Score = 171 bits (432), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 101/295 (34%), Positives = 162/295 (54%), Gaps = 28/295 (9%)
Query: 50 TKDPSKADLFFLPFSIARM-----RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGAD 104
T DP++A FFLPFS+++M R D T + ++ Y+ ++ ++PYWNR+ GAD
Sbjct: 110 TSDPARAHAFFLPFSVSKMVQFAYRPDTYDKTP-LRAIVADYVRVVASRHPYWNRSAGAD 168
Query: 105 HFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPK-- 162
HF ++CH G A + E+ N I+ +C+++ G +DVS+P+I D P+
Sbjct: 169 HFMLSCHDWGPEASRGHPELHANGIRALCNANT-SEGFRPGQDVSVPEINLYTGDMPRQL 227
Query: 163 -----LGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSE----IYAHS-----GRLKTPYA 208
+ R LAFFAG + VR+ LL+ W+ +Y H GR + A
Sbjct: 228 LAPPAPPLASRPFLAFFAGGRHGHVRDLLLRRWKGHDPDVFPVYEHEHEHSHGRRQQDGA 287
Query: 209 D-----ELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIV 263
+ ++FCL G+EV + R+ ++++ CVPVI+++ Y LPFADVL W++FS+
Sbjct: 288 PLDYYWYMRRARFCLCPSGYEVASPRVVEAIHAECVPVILSDGYALPFADVLRWEAFSVA 347
Query: 264 VATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELW 318
V DIP L+++L+ I + E LQ V V++HF H P D F M+++ +W
Sbjct: 348 VPVADIPRLREVLERIPAPEVERLQRGVRLVKRHFMLHQPPERLDMFNMILHSVW 402
>gi|255550902|ref|XP_002516499.1| catalytic, putative [Ricinus communis]
gi|223544319|gb|EEF45840.1| catalytic, putative [Ricinus communis]
Length = 601
Score = 170 bits (430), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 109/332 (32%), Positives = 160/332 (48%), Gaps = 69/332 (20%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEP--RGNYASESYFKKAFMKSH-FVTKDPSKAD 57
M + +VY+Y P+ +P +G YASE +F K S FVTK KA
Sbjct: 321 MEENLKVYIYKEGEK--------PILHQPVLKGIYASEGWFMKQLEASKKFVTKKSRKAH 372
Query: 58 LFFLPFSIARMR--------HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVA 109
LF+LPFS + H R+ + ++ +Y+ I KYP+WNRT G DHF VA
Sbjct: 373 LFYLPFSSRNLELQLYVPDSHSRK----NLIKYLKNYLDLIVAKYPFWNRTEGVDHFLVA 428
Query: 110 CHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRN 169
CH S + ++ N I+ +C++ G I KD SLP+
Sbjct: 429 CHDWAASETE---QLMFNCIRALCNAD-VKEGFIFGKDASLPE----------------- 467
Query: 170 KLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTAR 229
NS + R K Y + S++C+ +G+EVN+ R
Sbjct: 468 --------TNSDAK-----------------ARGKMNYVQHMKSSRYCICARGYEVNSPR 502
Query: 230 IADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQS 289
I +++ Y CVPVII+++Y PF +VLNW+SF++ V DIP LK IL I + Y +Q
Sbjct: 503 IVEAILYECVPVIISDNYVPPFFEVLNWESFAVFVLEKDIPNLKNILLSIPEKRYREMQM 562
Query: 290 YVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
V V++HF WH P YD F+M+++ +W R
Sbjct: 563 RVKMVQQHFLWHARPVKYDLFHMILHSVWYNR 594
>gi|302814814|ref|XP_002989090.1| hypothetical protein SELMODRAFT_447545 [Selaginella moellendorffii]
gi|300143191|gb|EFJ09884.1| hypothetical protein SELMODRAFT_447545 [Selaginella moellendorffii]
Length = 1522
Score = 169 bits (428), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 106/321 (33%), Positives = 166/321 (51%), Gaps = 17/321 (5%)
Query: 5 FRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLPFS 64
F+VYVYP +N ++ D + + + F + + S FVT DP ADLF LP S
Sbjct: 56 FKVYVYPMIQNASAPDLR---DGKAARPGSIDRVFVDSLLASGFVTDDPEAADLFLLPAS 112
Query: 65 IARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEV 124
I+ + +R +GI + YI + YPYW R+ GADHF+V+CH I + E+
Sbjct: 113 ISAI-WKKRPDPKGIAHSLKSYIQQLRDLYPYWQRSLGADHFFVSCHDITSDWSRNVLEL 171
Query: 125 KLNAIQVVC--SSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAVNSPV 182
K NAIQ+ C + + +AHKD+++P DPP+ +R LA + +
Sbjct: 172 KKNAIQIACFPLARHGAQEFLAHKDITMPPAG-GSIDPPQ---RRRWNLAVYDSSSQGYA 227
Query: 183 REKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVI 242
+ W++D A + ++ L+ ++FCL + + + + ++ GC+PVI
Sbjct: 228 ASDVPASWKSDESFVAGAVKMDLQL---LVTTRFCLSLGSSDRHL--VIPAVRSGCIPVI 282
Query: 243 IANHY--DLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
+ DLPF D+L+W SF+IV++ + K IL+ I E+ LQ + KH +W
Sbjct: 283 FSAGKLSDLPFQDILDWNSFAIVLSRDQLHQTKAILESIDEEKLSRLQENGARAAKHMEW 342
Query: 301 HVFPSDYDAFYMVMYELWLRR 321
H P DAFYMV+Y+LW RR
Sbjct: 343 HSPPQPEDAFYMVLYQLWRRR 363
>gi|302804107|ref|XP_002983806.1| hypothetical protein SELMODRAFT_445668 [Selaginella moellendorffii]
gi|300148643|gb|EFJ15302.1| hypothetical protein SELMODRAFT_445668 [Selaginella moellendorffii]
Length = 1068
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 107/321 (33%), Positives = 165/321 (51%), Gaps = 17/321 (5%)
Query: 5 FRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLPFS 64
F+VYVYP +N ++ D + + + F + + S FVT DP ADLF LP S
Sbjct: 56 FKVYVYPMIQNASAPDLR---DGKAARPGSIDRVFVDSLLASGFVTDDPEAADLFLLPAS 112
Query: 65 IARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEV 124
I+ + +R +GI + YI + YPYW R+ GADHF+V+CH I + E+
Sbjct: 113 ISAI-WKKRPDPKGIAHSLKSYIQQLRDLYPYWQRSLGADHFFVSCHDITSDWSRNVLEL 171
Query: 125 KLNAIQVVC--SSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAVNSPV 182
K NAIQ+ C + + +AHKD+++P DPP+ +R LA + +
Sbjct: 172 KKNAIQIACFPLARHGAQEFLAHKDITMPPAG-GSIDPPQ---RRRWNLAVYDSSSQGYA 227
Query: 183 REKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVI 242
+ W++D A + L L+ ++FCL + + + + ++ GC+PVI
Sbjct: 228 ARDVPASWKSDESFVAGAVALDLQL---LVTTRFCLSLGSSDRHL--VIPAVRSGCIPVI 282
Query: 243 IANH--YDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
+ DLPF D+L+W SF+IV++ + K IL+ I E+ LQ + KH +W
Sbjct: 283 FSAGKLSDLPFQDILDWNSFAIVLSRDQLHQTKGILESIDEEKRSRLQENGARAAKHMEW 342
Query: 301 HVFPSDYDAFYMVMYELWLRR 321
H P DAFYMV+Y+LW RR
Sbjct: 343 HSPPQPEDAFYMVLYQLWRRR 363
>gi|168000699|ref|XP_001753053.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695752|gb|EDQ82094.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 471
Score = 167 bits (423), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 99/302 (32%), Positives = 159/302 (52%), Gaps = 15/302 (4%)
Query: 34 ASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQK 93
++ F + +S FVT+ +A LF LPFSI + D +G + + + Y+ +
Sbjct: 162 STADLFFRLLTRSEFVTEKAKRAQLFLLPFSIDVLWVD--LGPTQVAEKLRRYLEKVRTN 219
Query: 94 YPYWNRTGGADHFYVACHSIGRSAMQK-AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQ 152
YPYW + GADHFY++CH+ ++ + E+ N+IQ C+ HKDV PQ
Sbjct: 220 YPYWESSLGADHFYLSCHAFEHNSKHRNILELGKNSIQAACAPLRHNQKFYPHKDVVFPQ 279
Query: 153 IWPRQEDPPK---LGSSKRNKLAFFAGA--VNSPVREKLLQVWRNDSE--IYAHSGRLKT 205
P E+ + LG R LA+F+G V +P+ W D + + A+ +
Sbjct: 280 YKPVGEEDVRQAILGRRNRTSLAYFSGCPDVTTPLLSAF-HTWETDPDFIVEANPSPHRL 338
Query: 206 PYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIA--NHYDLPFADVLNWKSFSIV 263
L S+FC+ V +T + D+L +GCVPV+++ +DLPF LNW F++V
Sbjct: 339 SVYRNLARSRFCVSV--LPHDTFSLVDALRFGCVPVLLSKLTFHDLPFQGFLNWGQFAVV 396
Query: 264 VATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSS 323
+ D+P LK+IL +SS ++ +Q + KH +W+ P YDAF+M + ELW+RR S
Sbjct: 397 LGIEDLPNLKQILANVSSTKHREMQYLGHQAIKHLEWNNPPVAYDAFHMTLLELWVRRHS 456
Query: 324 VR 325
++
Sbjct: 457 IK 458
>gi|357118563|ref|XP_003561022.1| PREDICTED: probable glycosyltransferase At5g25310-like
[Brachypodium distachyon]
Length = 479
Score = 167 bits (423), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 101/306 (33%), Positives = 167/306 (54%), Gaps = 33/306 (10%)
Query: 50 TKDPSKADLFFLPFSIARM--------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTG 101
T DP +A FFLPFS+++M +DR + ++ Y+ ++ ++ +WNR+
Sbjct: 174 TWDPERAHAFFLPFSVSQMVQFAYVPLSYDR----APLRALVADYVRVVAARHRFWNRSS 229
Query: 102 GADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI------WP 155
GADHF ++CH G A + E+ N I+ +C+++ G KDVS+P+I P
Sbjct: 230 GADHFMLSCHDWGPEASRGDPELYGNGIRALCNANT-SEGFRPGKDVSIPEINLYDGDTP 288
Query: 156 RQEDPPKLGSSKRNKLAFFAGAVNSPVREKLLQVWR----NDSEIYAHS----------G 201
RQ P G S+R LAFFAG + VR+ LL+ W+ ++ +Y + G
Sbjct: 289 RQLLLPAPGLSERPYLAFFAGGRHGHVRDLLLREWKGRDPDNFPVYEYDLPTTTNTTGGG 348
Query: 202 RLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFS 261
+ Y + ++FCL G EV + R+ ++++ GCVPV++++ Y PFADVL W+ FS
Sbjct: 349 DKQHDYYSYMQRARFCLCPSGHEVASPRVVEAIHAGCVPVLVSDGYAPPFADVLRWEGFS 408
Query: 262 IVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
+ V DIP L+++L+ I + E L+ V++HF P D F+M+++ +WLRR
Sbjct: 409 VSVPVADIPRLREVLESIPAAEVERLRDGGRLVKQHFTLRQPPERLDMFHMILHSVWLRR 468
Query: 322 SSVRVQ 327
+ R+
Sbjct: 469 LNFRLN 474
>gi|357436207|ref|XP_003588379.1| Exostosin-2 [Medicago truncatula]
gi|355477427|gb|AES58630.1| Exostosin-2 [Medicago truncatula]
Length = 551
Score = 167 bits (422), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 112/341 (32%), Positives = 171/341 (50%), Gaps = 65/341 (19%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEP--RGNYASESYFKKAF-MKSHFVTKDPSKAD 57
M +VYVY P+ P G YASE +F K FVTK+P KA
Sbjct: 250 MEERLKVYVYREGAR--------PILHSPFLTGIYASEGWFMKLMEANKRFVTKNPKKAH 301
Query: 58 LFFLPFSIARMRHDRRIGT----EGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
LF+LPFS + + + + ++ Y+ I+ ++ +WNRTGGADHF V CH
Sbjct: 302 LFYLPFSSRMLEEALYVKNSHSHKNLIQYLHDYVDMIAARHSFWNRTGGADHFLVGCHDW 361
Query: 114 GRSAMQKAWEVKL---NAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPK--LGS--- 165
S E KL N I+ +C++ G + KD SLP+ + R P LG
Sbjct: 362 APS------ETKLRLANCIRSLCNAD-VKEGFVFGKDASLPETYVRNAQIPTRDLGGNSF 414
Query: 166 SKRNKLAFFAGAVNSPVREKLLQVWRN---DSEIYAH--SGRLKTPYADELLGSKFCLHV 220
SK+ LAFFAG+++ VR LL+ W N D +I+ + + + Y + SK+C+
Sbjct: 415 SKKTTLAFFAGSMHGYVRPILLKHWENKDPDMKIFGKLPNSKGNSNYIHYMKSSKYCICA 474
Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGIS 280
KG+EVN+ R+ ++++Y CVPVII++++ PF +VL+W+SFS++
Sbjct: 475 KGYEVNSPRVVEAIFYECVPVIISDNFVPPFFEVLDWESFSVI----------------- 517
Query: 281 SEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
V+KHF W+ P YD F+M+++ +W R
Sbjct: 518 -------------VQKHFLWNKNPVKYDIFHMILHSIWYNR 545
>gi|388493832|gb|AFK34982.1| unknown [Lotus japonicus]
Length = 229
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 92/226 (40%), Positives = 138/226 (61%), Gaps = 7/226 (3%)
Query: 108 VACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKL---- 163
++CH G A Q + N+I+V+C+++ G KD S P+I R L
Sbjct: 2 LSCHDWGPMASQAVPNLYKNSIRVLCNANTS-EGFKHGKDASFPEINLRTSSIDDLVGGP 60
Query: 164 GSSKRNKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLK-TPYADELLGSKFCLHVK 221
SKR+ LA FAG ++ P+R LL+ W N D ++ +S K Y D L SKFCL
Sbjct: 61 SPSKRSVLASFAGRLHGPIRPPLLEHWENKDGDMQVYSSLPKGVSYYDMLRKSKFCLCPS 120
Query: 222 GFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISS 281
G+EV + R+ +++Y GCVPV+I+ HY PF+DVLNWKSFS+ V+ DIP LK+IL +++
Sbjct: 121 GYEVASPRVVEAIYTGCVPVLISEHYVPPFSDVLNWKSFSLEVSVKDIPRLKEILLSVNT 180
Query: 282 EEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
+Y+ +Q V ++R+HF+ H P +D F+MV++ +WLRR + RV
Sbjct: 181 RQYIRMQRRVGQIRRHFEIHSPPKRFDVFHMVLHSVWLRRLNFRVH 226
>gi|320163884|gb|EFW40783.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
Length = 478
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 109/352 (30%), Positives = 168/352 (47%), Gaps = 64/352 (18%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMR---HDRRIGTEGIPDFISHYIFN 89
Y +E F S F T P KA++FF+PF R +R +G + + Y
Sbjct: 127 YGAEQLFINLLATSAFHTTAPDKANMFFMPFRCTAYRRSVQERVLGDIVAKNVTAQYFDV 186
Query: 90 ISQKYPYWNRTGGADHFYVACHSIGR--SAMQKAWEVKLNAIQVVCSSSY---------- 137
+ KY +WN + G DHFY+ H +G +A+ VK NAI +V ++ Y
Sbjct: 187 VMNKYRWWNVSSGTDHFYICGHDMGTAVTALSHPALVK-NAIGLVNTADYDDARYIPHKD 245
Query: 138 ---------FISGHIA----------------------------HKDVSLPQIWPRQEDP 160
S H+A H D++ + R+
Sbjct: 246 ISLPPNIDVLPSAHVATEEEITADLIRLEMARDRLYRATRQKVAHPDMNFEPLMERRMGK 305
Query: 161 P---KLGS-----SKRNKLAFFAGAVN-SPVREKLLQVWRNDSEIYAHSGRLKTP--YAD 209
LG KR KLA+FAG ++ VR K+ + ND++I GR P Y +
Sbjct: 306 LVQYGLGGLIHPREKRTKLAYFAGPLHYGRVRPKVRDAFANDTDIVLFEGRHAQPILYYN 365
Query: 210 ELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI 269
EL SKFCL ++G+ + R+ D+++ GC+PVII++HYDLP +L+W FSI + I
Sbjct: 366 ELATSKFCLFLRGYRAWSPRLMDAVFMGCIPVIISDHYDLPLGQLLDWSEFSITIPEARI 425
Query: 270 PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
P LK+ L +S + +Q+ + +V +HF W+ P +DAF+MV+++LW RR
Sbjct: 426 PRLKQTLLAVSDAQLSRMQNRLAEVYQHFVWNDPPKPFDAFHMVLWQLWRRR 477
>gi|356546124|ref|XP_003541481.1| PREDICTED: probable glycosyltransferase At5g20260-like, partial
[Glycine max]
Length = 290
Score = 164 bits (416), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 104/289 (35%), Positives = 166/289 (57%), Gaps = 15/289 (5%)
Query: 52 DPSKADLFFLPFSIARM-RHDRRIGTEGIPD----FISHYIFNISQKYPYWNRTGGADHF 106
+P +A +F LP S+A++ R+ T D Y I+ +YPYWNRT GADHF
Sbjct: 1 NPDEAHVFMLPISVAQIVRYVYNPLTTYSRDQLMWITIDYTNIIAHRYPYWNRTRGADHF 60
Query: 107 YVACHSIGRSAMQKAW---EVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPR--QEDPP 161
+CH + +A E+ N I+V+ +++ G KDV +P++ + + P
Sbjct: 61 LASCHDWAPPDISRAESGKELFKNIIRVLYNANKS-EGFKPEKDVPMPEVNLQGFKLSSP 119
Query: 162 KLGSSKRNK--LAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRLKTPYADELLG-SKFC 217
LG N+ LAFFAG V+ +RE LLQ W++ D E+ H K L+G SKFC
Sbjct: 120 ILGLDPNNRSILAFFAGGVHGRIREILLQHWKDKDEEVQVHEYLPKGVDYHGLMGQSKFC 179
Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILK 277
L G+EV + RI +S+ GCVPVI++++Y LPF+DVL+ FS+ + + I +K +LK
Sbjct: 180 LCPSGYEVASPRIVESINIGCVPVIVSDYYQLPFSDVLDRSKFSLHIPSRRIAEIKTMLK 239
Query: 278 GISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
+ +YL LQ V+KV++HF + ++ F+M+++ +WLR+ ++R+
Sbjct: 240 NVPHAKYLKLQKRVMKVQRHFVLNRPAKSFNVFHMILHSIWLRQLNIRL 288
>gi|167517883|ref|XP_001743282.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163778381|gb|EDQ91996.1| predicted protein [Monosiga brevicollis MX1]
Length = 503
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 110/337 (32%), Positives = 171/337 (50%), Gaps = 28/337 (8%)
Query: 3 RSFRVYVYP-HRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFL 61
R R++ YP H + P A P D++ Y E + S + + P +A F +
Sbjct: 175 RQLRIFTYPVHLGSMPRA----PDDYK----YGVERRLPQVLASSPYAVQQPEEATHFLI 226
Query: 62 PFSIARMRH---DRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAM 118
PF R+ DR G ++ +I +IS YPYWNR+ GA+HFYV H +G SA+
Sbjct: 227 PFQCTAHRYTVADRAGGQNAAEAGLASWIASISAAYPYWNRSAGANHFYVCSHDMGSSAV 286
Query: 119 -QKAWEVKLNAIQVVCSSSYFISGHIAHKDVSL-PQIWPRQEDPPKLGS----------- 165
Q + + N I +V ++ H+D++ P I + + G+
Sbjct: 287 AQLSRAAQQNLIGLVNTADRRDGFFNVHRDLATAPHIGDGCDTCLQGGTRLSVTREAWAG 346
Query: 166 SKRNKLAFFAGAVN-SPVREKLLQVWRNDSEIYAHSGRLKTP-YADELLGSKFCLHVKGF 223
+ RN+LAF AG + PVR +L Q + D + G L Y L S+FCL V+GF
Sbjct: 347 TPRNRLAFMAGNLQRGPVRPRLRQFFDGDPDFLLVDGTLAAAHYRQALAESEFCLVVRGF 406
Query: 224 EVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEE 283
V T R+ D+++ GC+PVIIA+ Y+LPF+ +L+W SF++ V D+P LK IL S+
Sbjct: 407 RVWTPRLMDAVWSGCIPVIIADGYELPFSSLLHWPSFAVFVPEHDVPRLKDILLAKLSQA 466
Query: 284 YLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLR 320
LL++ +L ++ +H DAF ++M +L R
Sbjct: 467 P-LLRANLLAASQYLTYHSNWVPLDAFDILMLQLAAR 502
>gi|449528899|ref|XP_004171439.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
At3g07620-like, partial [Cucumis sativus]
Length = 173
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 75/163 (46%), Positives = 110/163 (67%), Gaps = 5/163 (3%)
Query: 168 RNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCLHVKG 222
R L F+AG NS +R L +VW ND+E+ + R+ Y +KFC+ G
Sbjct: 10 RTTLGFWAGHRNSKIRVILARVWENDTELDISNNRISRATGHLLYQKRFYKTKFCICPGG 69
Query: 223 FEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSE 282
+VN+ARIADS++YGCVPVI++++YDLPF D+L+W+ FS++V D+ LK+ILK IS
Sbjct: 70 SQVNSARIADSIHYGCVPVILSDYYDLPFNDILDWRKFSVIVKERDVYQLKQILKDISDI 129
Query: 283 EYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
E++ L +++V+KHFQW+ P YDAF+MVMY+LWLR ++
Sbjct: 130 EFIKLHKNLMQVQKHFQWNSXPIKYDAFHMVMYDLWLRHHVIK 172
>gi|147775380|emb|CAN73459.1| hypothetical protein VITISV_022181 [Vitis vinifera]
Length = 319
Score = 157 bits (398), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 96/299 (32%), Positives = 152/299 (50%), Gaps = 39/299 (13%)
Query: 46 SHFVTKDPSKADLFFLPFSIARMRH---DRRIGTEG--IPDFISHYIFNISQKYPYWNRT 100
SHF+ P A +F++P S+ R+ H + G + ++ YI+ +S KYPYWNR+
Sbjct: 42 SHFMAGHPDVAHVFYIPISVTRIAHYIYSPPVDYSGHMLQRLVTDYIYVVSDKYPYWNRS 101
Query: 101 GGADHFYVACHSIGRS-AMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLP-------- 151
GADHF V+CH +S AM + + +C S + AH + P
Sbjct: 102 NGADHFLVSCHDWVQSFAMPTPPKGSSPSETSLCQKSTYPKAXSAHLTSTNPLTSVTSSP 161
Query: 152 ---QIWPRQEDPPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYA 208
+ P +P G K+ F ++Y H + YA
Sbjct: 162 SSPAVNPATCEPSCSGPGKKMMTKF---------------------QVYEHLPSNRD-YA 199
Query: 209 DELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLD 268
+ SKFCL G+EV + R+ +++ GCVPVII ++Y LPF++VL+W FSI + +
Sbjct: 200 KSMGDSKFCLCPSGWEVASPRVVEAIAAGCVPVIICDYYVLPFSEVLDWSKFSINITSDK 259
Query: 269 IPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
IP +KKILK + +E YL +Q V +V++HF + YD +M+++ +WLRR +VR++
Sbjct: 260 IPEIKKILKAVPNERYLRMQKRVKQVQRHFVINRPAQPYDMLHMILHSVWLRRLNVRLR 318
>gi|224098485|ref|XP_002311191.1| predicted protein [Populus trichocarpa]
gi|222851011|gb|EEE88558.1| predicted protein [Populus trichocarpa]
Length = 345
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 106/333 (31%), Positives = 166/333 (49%), Gaps = 42/333 (12%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M SF++Y+Y + F++ +ES F + S FVT++P +A LFF
Sbjct: 40 MLNSFKIYIYTPSKPFSFSS-------------PTESLFFTSLQASPFVTQNPEEAHLFF 86
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
+PF+ + T I FI ++ ++PYWNRT GADHFYV+C +G + +
Sbjct: 87 VPFA-------SNLSTRSIARFIR----DLRMEFPYWNRTLGADHFYVSCAGLGYESDRN 135
Query: 121 AWEVKLNAIQVVC---SSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGA 177
E+K N++Q+ C F + HKD+SLP + G+ L G
Sbjct: 136 LVELKKNSVQISCFPVPEGKF----VPHKDISLPPLARITRASHAPGNRTVRYLVRHGGV 191
Query: 178 VNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSK-FCLHVKGFEVNTARIADSLYY 236
+S KL RNDS+ S E LGS FCL G +++ I ++L +
Sbjct: 192 KDS----KLANELRNDSDFLMESEPSNEMTLVERLGSSMFCLFEDGADISG--IGEALRF 245
Query: 237 GCVPVIIANH--YDLPFADVLNWKSFSIVVAT-LDIPLLKKIL-KGISSEEYLLLQSYVL 292
GCVPV++ + DLP DVL+W+ ++ V + I +K++L + +E + +
Sbjct: 246 GCVPVMVTDRPMQDLPLMDVLSWQKIAVFVGSGGGIKEMKRVLDRTCKDDECEGTRRLGV 305
Query: 293 KVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
+HF W+ P YD+FYMV+Y+LWLRR ++R
Sbjct: 306 AASQHFGWNEIPQPYDSFYMVVYQLWLRRHTIR 338
>gi|296081850|emb|CBI20855.3| unnamed protein product [Vitis vinifera]
Length = 618
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 120/358 (33%), Positives = 167/358 (46%), Gaps = 76/358 (21%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEP--RGNYASESYFKKAFMKSH-FVTKDPSKAD 57
M + +VY Y PV +P +G YASE +F K + FVTK+ KA
Sbjct: 308 MENTLKVYTYREGER--------PVFHQPPIKGIYASEGWFMKLMQANKKFVTKNGRKAH 359
Query: 58 LFFLPFSIARMR--------HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVA 109
LF+LPFS + H R+ + ++ +Y+ I KYP+WNRTGGADHF VA
Sbjct: 360 LFYLPFSSLMLEEALYVPNSHSRK----NLEQYLKNYLDMIGAKYPFWNRTGGADHFLVA 415
Query: 110 CHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLGS-- 165
CH S K + N+I+ +C+S G KDVSLP+ R P +LG
Sbjct: 416 CHDWAPSETLK---LMANSIRALCNSD-IREGFKLGKDVSLPETCVRIPQNPLRQLGGKP 471
Query: 166 -SKRNKLAFFAGAVNSPVREKLLQVWRN---DSEIYAHSGRLKTP---YADELLGSKFCL 218
S+R LAFFAG+++ VR LL+ W N D +IY + K Y + SK+C+
Sbjct: 472 PSQRRILAFFAGSMHGYVRPILLKYWENKDPDMKIYGRMPKAKKGTMNYIQHMKSSKYCI 531
Query: 219 HVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKG 278
KG+EVN+ R DIP LK IL
Sbjct: 532 CAKGYEVNSPRK-------------------------------------DIPNLKSILLS 554
Query: 279 ISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRS-SVRVQWSTSLDSN 335
I + YL +Q V +V++HF WH P YD F+M+++ +W R +RV+ L N
Sbjct: 555 IPEKSYLEIQMRVKQVQQHFLWHAKPVKYDVFHMILHSVWYNRVLQIRVRPKQLLGRN 612
>gi|218198616|gb|EEC81043.1| hypothetical protein OsI_23835 [Oryza sativa Indica Group]
Length = 250
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 97/292 (33%), Positives = 146/292 (50%), Gaps = 60/292 (20%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M RSFRV+VYP DP P G YASE YF + +S F T D +A LFF
Sbjct: 1 MERSFRVFVYPD--GDPGTFYQTPRKLT--GKYASEGYFFQNIRESRFRTDDLEQAHLFF 56
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
+P S +MR + E + + +Y+ ++ KYPYWNRT G HS K
Sbjct: 57 VPISPHKMR-GKGTSYENMTIIVQNYVESLINKYPYWNRTLGE-------HS-------K 101
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAVNS 180
+ + L + S+ L F+AG NS
Sbjct: 102 GFPLSLRTL------------------------------------SEWTILGFWAGHCNS 125
Query: 181 PVREKLLQVWRNDSEIYAHSGRLKTP-----YADELLGSKFCLHVKGFEVNTARIADSLY 235
+R L ++W ND+E+ + R+ Y + +KFC+ G +VN+ARI+DS++
Sbjct: 126 KIRVILARIWENDTELAISNNRINRAIGNLVYQKQFFWTKFCVCPGGSQVNSARISDSIH 185
Query: 236 YGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLL 287
YGCVPVI++++YDLPF+ +LNW+ F++V+ D+ LK ILK +S +E++ +
Sbjct: 186 YGCVPVILSDYYDLPFSGILNWRKFAVVLKESDVYELKSILKSLSQKEFVFI 237
>gi|255566500|ref|XP_002524235.1| conserved hypothetical protein [Ricinus communis]
gi|223536512|gb|EEF38159.1| conserved hypothetical protein [Ricinus communis]
Length = 337
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 103/331 (31%), Positives = 165/331 (49%), Gaps = 48/331 (14%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M +SF++Y Y + F + PV ES F + SHF+T +P +A LFF
Sbjct: 42 MLQSFKIYTYTPPQPFSFTS---PV----------ESLFFTSLQNSHFITLNPEQAHLFF 88
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
+PF ++ P ++ I ++ ++PYWNRT GADHFY++C +G + +
Sbjct: 89 IPFP-----------SDLSPRSLARVIRDLRTEFPYWNRTLGADHFYISCTGLGYESDRN 137
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAVNS 180
E+K N++Q+ C S + HKD++LP + P K + +R AF
Sbjct: 138 LVELKKNSVQISCFPSP-NGKFVPHKDITLPPLVPSTIH--KSSNKRRPYKAF------- 187
Query: 181 PVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVP 240
V+ ++ R D E+ S +DE S+FCL + N + I ++L GCVP
Sbjct: 188 -VKYDGVEELRGDLEVLIESQP-----SDEKTRSEFCLF--DYAANISGIGEALSSGCVP 239
Query: 241 VIIANH--YDLPFADVLNWKSFSIVVATLD--IPLLKKILKGISS--EEYLLLQSYVLKV 294
++I DLP DVL W+ +++V + D +K++L G S + ++
Sbjct: 240 LVITERPIQDLPLMDVLRWQEIAVIVGSSDDGFKWVKRVLNGTCSRGDTCERMRRLGAGA 299
Query: 295 RKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
+H W+ P YDAF+MVMY+LWLRR ++R
Sbjct: 300 SQHLVWNETPEPYDAFHMVMYQLWLRRHTIR 330
>gi|222635945|gb|EEE66077.1| hypothetical protein OsJ_22095 [Oryza sativa Japonica Group]
Length = 265
Score = 151 bits (382), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 103/326 (31%), Positives = 157/326 (48%), Gaps = 63/326 (19%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M RSFRV+VYP DP P G YASE YF + +S F T D KA LFF
Sbjct: 1 MERSFRVFVYPD--GDPGTFYQTPRKLT--GKYASEGYFFQNIRESRFRTDDLEKAHLFF 56
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
+P S +MR + G G+ F H + + + + F ++ ++ +
Sbjct: 57 VPISPHKMR-GKDTGGWGLTIFSLHAMMLVLEH---------SKGFPLSLRTLSEWTILG 106
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHK-DVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAVN 179
W +GH K V L +IW + LA +N
Sbjct: 107 FW-----------------AGHRNSKIRVILARIWENDTE-----------LAISNNRIN 138
Query: 180 SPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCV 239
+ + Q +KFC+ G +VN+ARI+DS++YGC+
Sbjct: 139 RAIGNLVYQ--------------------KHFFRTKFCVCPGGSQVNSARISDSIHYGCM 178
Query: 240 PVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQ 299
PVI++++YDL F+ +LNW+ F++V+ D+ LK ILK +S +E++ L +++V+KHF+
Sbjct: 179 PVILSDYYDLRFSGILNWRKFAVVLKESDVYELKSILKSLSQKEFVSLHKSLVQVQKHFE 238
Query: 300 WHVFPSDYDAFYMVMYELWLRRSSVR 325
WH P YDAF+M+MYELWLR ++
Sbjct: 239 WHSPPVPYDAFHMIMYELWLRHHVIK 264
>gi|52075645|dbj|BAD44815.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
gi|52075653|dbj|BAD44823.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
Length = 514
Score = 148 bits (373), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 113/386 (29%), Positives = 182/386 (47%), Gaps = 66/386 (17%)
Query: 1 MNRSFRVYVY-----PHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSK 55
M R F+VYVY P P N+ + G + E + T DP++
Sbjct: 134 MERRFKVYVYEEGEPPIAHEGPCKNI-----YAVEGRFIEELELMAPPLGG-VRTWDPAR 187
Query: 56 ADLFFLPFSIARMRHD--RRIGTEGIP--DFISHYIFNISQKYPYWNRTGGADHFYVACH 111
A FLP S+++M R + + P ++ Y+ ++ ++ +WNR+ GADHF ++CH
Sbjct: 188 AHALFLPLSVSQMVQLAYRPLSYDLSPLRAIVADYVAVVASRHRFWNRSAGADHFMLSCH 247
Query: 112 S------------------------IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKD 147
IG A + E+ NAI+ +C+++ G KD
Sbjct: 248 DWAIHTPSVQRDSISGFPTFRVQRLIGPHASRGHPELYANAIRALCNANTS-EGFRPDKD 306
Query: 148 VSLPQIWPRQED-----PPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRN-DSEIYA--- 198
VS+P+I D R LAFFAG + VR+ LL+ W+ D ++
Sbjct: 307 VSIPEINLYDGDMPPELLSPAPPPPRPFLAFFAGGRHGHVRDLLLRHWKGRDPAVFPVYE 366
Query: 199 -----------------HSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPV 241
G PY + S+FCL G EV + R+ ++++ GCVPV
Sbjct: 367 YDLPSIPVSVSGDGDTDAGGEGGNPYYWYMRRSRFCLCPSGHEVASPRVVEAIHAGCVPV 426
Query: 242 IIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
++A+ Y PFADVL W++FS+ VA D+P L+++L+ I + E L+ V V++HF H
Sbjct: 427 VVADGYAPPFADVLRWEAFSVAVAVADVPRLRELLERIPAPEVERLRDGVRLVKRHFMLH 486
Query: 302 VFPSDYDAFYMVMYELWLRRSSVRVQ 327
P D F+M+++ +WLRR ++R+
Sbjct: 487 QPPERLDMFHMILHSVWLRRLNLRLN 512
>gi|357478277|ref|XP_003609424.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
gi|355510479|gb|AES91621.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
Length = 229
Score = 142 bits (357), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 79/205 (38%), Positives = 125/205 (60%), Gaps = 7/205 (3%)
Query: 129 IQVVCSSSYFISGHIAHKDVSLPQI-WPRQE-DPPKLGSSKRNK--LAFFAGAVNSPVRE 184
I+ +C+++ G ++DVS+PQ+ P + PP N+ L FFAG + +R+
Sbjct: 19 IRALCNANTS-EGFWPNRDVSIPQLNLPVGKLGPPNTDQHPNNRTILTFFAGGAHGKIRK 77
Query: 185 KLLQVWRN-DSEIYAHSGRLKTPYADELLG-SKFCLHVKGFEVNTARIADSLYYGCVPVI 242
KLL+ W++ D E+ H K +L+G SKFCL G EV + R+ +++Y GCVPVI
Sbjct: 78 KLLKSWKDKDEEVQVHEYLPKGQDYTKLMGLSKFCLCPSGHEVASPRVVEAIYAGCVPVI 137
Query: 243 IANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHV 302
I ++Y LPF+DVLNW FS+ +A IP +K IL+ I+ +Y +L S V +VRKHF+ +
Sbjct: 138 ICDNYSLPFSDVLNWSQFSMEIAVDRIPEIKTILQNITETKYRVLYSNVRRVRKHFEMNR 197
Query: 303 FPSDYDAFYMVMYELWLRRSSVRVQ 327
+D +M+++ +WLRR + R+
Sbjct: 198 PAKPFDLIHMILHSVWLRRLNFRLH 222
>gi|224112673|ref|XP_002316258.1| predicted protein [Populus trichocarpa]
gi|222865298|gb|EEF02429.1| predicted protein [Populus trichocarpa]
Length = 339
Score = 140 bits (354), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 95/299 (31%), Positives = 151/299 (50%), Gaps = 35/299 (11%)
Query: 35 SESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKY 94
+ES F S FVT++P +A L+F+PFS + T + FI ++ ++
Sbjct: 61 TESNFFTCLQNSPFVTQNPEEAHLYFVPFS-------SNLSTRSVARFIR----DLRMEF 109
Query: 95 PYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGH-IAHKDVSLPQI 153
PYWNRT GADHFYV+C +G + + E+K N++Q+ C + G + HKD++ P
Sbjct: 110 PYWNRTLGADHFYVSCAGLGYESDRNLVELKKNSVQISCFPT--TEGRFVPHKDITFPP- 166
Query: 154 WPRQEDPPKLGSSKRNKLAFFAGAV--NSPVREKLLQVWRNDSEIYAHSGRLKTPYADEL 211
++ N+ A + G V N L+ R DS+ S
Sbjct: 167 -----------HAQGNRTAKYLGFVRYNEVKESNLVNELRKDSDFLIESEPSNGMTLVGR 215
Query: 212 LGSK-FCLHVKGFEVNTARIADSLYYGCVPVIIANH--YDLPFADVLNWKSFSIVVATL- 267
LGS FCL G +V+ I ++L +GCVPV++ + DLP DV+ W+ +I V +
Sbjct: 216 LGSSVFCLFEYGADVSG--IGEALRFGCVPVMVMDRPMQDLPLMDVIGWQKIAIFVGSRG 273
Query: 268 DIPLLKKIL-KGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
+ +K+ L + +E + + +HF W+ P YD+F+MVMY+LWLRR ++R
Sbjct: 274 GVKEVKRELDRTCKDDECAGRRRLGVVASQHFVWNHMPQPYDSFHMVMYQLWLRRHAIR 332
>gi|224142185|ref|XP_002324439.1| predicted protein [Populus trichocarpa]
gi|222865873|gb|EEF03004.1| predicted protein [Populus trichocarpa]
Length = 208
Score = 138 bits (347), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 76/206 (36%), Positives = 121/206 (58%), Gaps = 12/206 (5%)
Query: 127 NAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLGS---SKRNKLAFFAGAVNSP 181
+ I+ +C++ +G +DVS P+ + R P LG S+RN LAF+AG ++
Sbjct: 3 HCIKALCNAD-VTAGFKIGRDVSFPETYVRSARNPLRDLGGKPPSQRNILAFYAGNMHGY 61
Query: 182 VREKLLQVWRNDSEIYAHSGRL------KTPYADELLGSKFCLHVKGFEVNTARIADSLY 235
+R LL+ W++ G + K Y + SK+C+ KG+EVN+ R+ ++++
Sbjct: 62 LRPILLKYWKDKDPDMKIFGPMPPGVASKMNYIHHMQRSKYCICPKGYEVNSPRVVEAIF 121
Query: 236 YGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVR 295
Y CVPVII++++ PF DVL+W +FS+++A DI LK+IL I E+YL +Q V K +
Sbjct: 122 YECVPVIISDNFVPPFFDVLDWGAFSLILAEKDISNLKEILLSIPKEKYLQMQLGVRKAQ 181
Query: 296 KHFQWHVFPSDYDAFYMVMYELWLRR 321
+HF WH P YD FYM ++ +W R
Sbjct: 182 RHFLWHASPMKYDLFYMTLHSIWYNR 207
>gi|125544214|gb|EAY90353.1| hypothetical protein OsI_11935 [Oryza sativa Indica Group]
Length = 218
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 75/188 (39%), Positives = 115/188 (61%), Gaps = 18/188 (9%)
Query: 146 KDVSLPQIWPRQEDPP-----KLGSSKRNKLAFFAGAVNSPVREKLLQVWRN--DSEIYA 198
+DVSLP+ R P L S+R LAFFAG V+ VR LL+ W + D +++
Sbjct: 29 RDVSLPETTIRTPRRPLRYVGGLPVSRRGILAFFAGNVHGRVRPVLLKHWGDGRDDDMWV 88
Query: 199 HS---GRL--KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFAD 253
+ R+ + Y + S+FCL G+EVN+ RI ++LYY CVPVIIA+++
Sbjct: 89 YGPLPARVSRRMSYIQHMKNSRFCLCPMGYEVNSPRIVEALYYECVPVIIADNF------ 142
Query: 254 VLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
VL+W +F++VVA D+P LKKIL+GI+ +Y+ + V ++++HF WH P YD F+M+
Sbjct: 143 VLDWSAFAVVVAEKDVPDLKKILQGITLRKYVAMHGCVKRLQRHFLWHARPLRYDLFHMI 202
Query: 314 MYELWLRR 321
++ +WL R
Sbjct: 203 LHSIWLSR 210
>gi|297725119|ref|NP_001174923.1| Os06g0638350 [Oryza sativa Japonica Group]
gi|255677258|dbj|BAH93651.1| Os06g0638350 [Oryza sativa Japonica Group]
Length = 257
Score = 134 bits (337), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 99/270 (36%), Positives = 136/270 (50%), Gaps = 44/270 (16%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M RSFRV+VYP DP P G YASE YF + +S F T D KA LFF
Sbjct: 1 MERSFRVFVYPD--GDPGTFYQTPRKLT--GKYASEGYFFQNIRESRFRTDDLEKAHLFF 56
Query: 61 LPFSIARMR---------------------HDRRIGTEGIPDFISHYIFN--ISQKYPYW 97
+P S +MR +DR I F+ Y + S +W
Sbjct: 57 VPISPHKMRGKVPSSLLLVTYAWLILHIRSYDRSI------LFLDLYWWCPLCSSFRGHW 110
Query: 98 NRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQ 157
GADHF+V CH +G A + + N+I+VVCS SY +G+I HKDV+LPQI +
Sbjct: 111 GV--GADHFFVTCHDVGVRAFEGLPFIIKNSIRVVCSPSY-NAGYIPHKDVALPQIL-QP 166
Query: 158 EDPPKLGSSKRNK--LAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTP-----YADE 210
P G+ N+ L F+AG NS +R L ++W ND+E+ + R+ Y
Sbjct: 167 FALPAGGNDIENRTILGFWAGHRNSKIRVILARIWENDTELAISNNRINRAIGNLVYQKH 226
Query: 211 LLGSKFCLHVKGFEVNTARIADSLYYGCVP 240
+KFC+ G +VN+ARI+DS++YGC+P
Sbjct: 227 FFRTKFCVCPGGSQVNSARISDSIHYGCMP 256
>gi|449465854|ref|XP_004150642.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
sativus]
Length = 312
Score = 134 bits (336), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 98/297 (32%), Positives = 142/297 (47%), Gaps = 65/297 (21%)
Query: 35 SESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKY 94
+ES F K+ + S + T DP +A LFF+PFS I T + + I +
Sbjct: 65 AESLFYKSLLNSPYTTHDPDQAHLFFIPFS-------PHISTRSL----ARLIRTLRTDL 113
Query: 95 PYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS--GHIAHKDVSLPQ 152
PYWNRT GADHF+++ IG + + E+K NAIQV SS+ +S I HKDVSLP
Sbjct: 114 PYWNRTLGADHFFLSSSGIGYISDRNVVELKKNAIQV---SSFPVSPGKFIPHKDVSLP- 169
Query: 153 IWPRQEDPPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELL 212
P PP+ S Y D+L
Sbjct: 170 --PVSTLPPRTPSC----------------------------------------YGDKLA 187
Query: 213 GSKFCL-HVKGFEVNTARIADSLYYGCVPVIIANHY--DLPFADVLNWKSFSIVVA-TLD 268
S FCL +G +V+ I ++L +GCVPV+I++ + DLP DV+ W+ ++ VA
Sbjct: 188 KSDFCLFEYEGGDVSG--IGEALRFGCVPVVISDRWIQDLPLMDVVRWEEMAVFVAGGGG 245
Query: 269 IPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
I +KK+L+ + E ++ +HF W+ P DAF V Y+LW+RR +VR
Sbjct: 246 IEGVKKVLRRVDGERLDRMKKLGAAAAQHFVWNSPPQPLDAFNTVAYQLWVRRHAVR 302
>gi|449503277|ref|XP_004161922.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
sativus]
Length = 310
Score = 133 bits (335), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 98/297 (32%), Positives = 142/297 (47%), Gaps = 65/297 (21%)
Query: 35 SESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKY 94
+ES F K+ + S + T DP +A LFF+PFS I T + + I +
Sbjct: 63 AESLFFKSLLNSPYATHDPDQAHLFFIPFS-------PHISTRSL----ARLIRTLRTDL 111
Query: 95 PYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS--GHIAHKDVSLPQ 152
PYWNRT GADHF+++ IG + + E+K NAIQV SS+ +S I HKDVSLP
Sbjct: 112 PYWNRTLGADHFFLSSSGIGYISDRNVVELKKNAIQV---SSFPVSPGKFIPHKDVSLP- 167
Query: 153 IWPRQEDPPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELL 212
P PP+ S Y D+L
Sbjct: 168 --PVSTLPPRTPSC----------------------------------------YGDKLA 185
Query: 213 GSKFCL-HVKGFEVNTARIADSLYYGCVPVIIANHY--DLPFADVLNWKSFSIVVA-TLD 268
S FCL +G +V+ I ++L +GCVPV+I++ + DLP DV+ W+ ++ VA
Sbjct: 186 KSDFCLFEYEGGDVSG--IGEALRFGCVPVVISDRWIQDLPLMDVVRWEEMAVFVAGGGG 243
Query: 269 IPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
I +KK+L+ + E ++ +HF W+ P DAF V Y+LW+RR +VR
Sbjct: 244 IEGVKKVLRRVDGERLDRMKKLGAAAAQHFVWNSPPQPLDAFNTVAYQLWVRRHAVR 300
>gi|297738775|emb|CBI28020.3| unnamed protein product [Vitis vinifera]
Length = 665
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 101/335 (30%), Positives = 147/335 (43%), Gaps = 80/335 (23%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKS-HFVTKDPSKADLF 59
M R +VYVY F +L +G YASE +F K ++ HFV KDP +A LF
Sbjct: 390 MERILKVYVYKDGEKPIFHQPIL------KGLYASEGWFMKLMERNKHFVVKDPRQAQLF 443
Query: 60 FLPFS--------IARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACH 111
++PFS R H+R + ++ Y I+ KY +WNRTGGADHF VACH
Sbjct: 444 YMPFSSRMLEYKLYVRNSHNR----TNLRQYLKQYSEKIAAKYRFWNRTGGADHFLVACH 499
Query: 112 SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP--KLGS---S 166
+ E I+ +C++ +G +DVSLP+ + R P LG S
Sbjct: 500 DWAPYETRHHME---QCIKALCNAD-VTAGFKIGRDVSLPETYVRSARNPLRDLGGKPPS 555
Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVN 226
+R+ LAF+AG ++ +R LL+ W++ K P
Sbjct: 556 ERHILAFYAGNMHGYLRPILLKYWKD-----------KDP-------------------- 584
Query: 227 TARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLL 286
D YG +P + D+ N K +L I +++YL
Sbjct: 585 -----DMKIYGPMPPGLRGQQPTE-KDIPNLKD---------------VLLSIPNDKYLQ 623
Query: 287 LQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
+Q V KV+KHF WH P YD F+M ++ +W R
Sbjct: 624 MQLGVRKVQKHFLWHAKPLKYDLFHMTLHSIWYNR 658
>gi|449531319|ref|XP_004172634.1| PREDICTED: probable glycosyltransferase At5g03795-like [Cucumis
sativus]
Length = 307
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 67/153 (43%), Positives = 90/153 (58%), Gaps = 6/153 (3%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M F+VY+YP DP N + G YASE YF + +S F T+DP +A LFF
Sbjct: 147 MESKFKVYIYPD--GDP--NTFYQTPRKLTGKYASEGYFFQNIRESRFRTEDPDQAHLFF 202
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
+P S +MR + E + + +Y+ + KYPYWNRT GADHF+V CH +G A +
Sbjct: 203 IPISCHKMR-GKGTSYENMTVIVQNYVEGLISKYPYWNRTLGADHFFVTCHDVGVRASEG 261
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI 153
+ NAI+VVCS SY + G I HKDV+LPQ+
Sbjct: 262 LPFLIKNAIRVVCSPSYDV-GFIPHKDVALPQV 293
>gi|218197695|gb|EEC80122.1| hypothetical protein OsI_21883 [Oryza sativa Indica Group]
Length = 545
Score = 120 bits (302), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 112/417 (26%), Positives = 181/417 (43%), Gaps = 97/417 (23%)
Query: 1 MNRSFRVYVY-----PHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSK 55
M R F+VYVY P P N+ + G + E + T DP++
Sbjct: 134 MERRFKVYVYEEGEPPIAHEGPCKNI-----YAVEGRFIEELELMAPPLGG-VRTWDPAR 187
Query: 56 ADLFFLPFSIARMRHD--RRIGTEGIP--DFISHYIFNISQKYPYWNRTGGA-------- 103
A FLP S+++M R + + P ++ Y+ ++ ++ +WNR+ GA
Sbjct: 188 AHALFLPLSVSQMVQLAYRPLSYDLSPLRAIVADYVAVVASRHRFWNRSAGADHFMLSCH 247
Query: 104 -----DHFY---------------------------------VACHSI---------GRS 116
+HF+ V SI G
Sbjct: 248 DWASTNHFFPLFFRRKIFSQHQMISIQISKRVQIINSVHTPSVQRDSISGFPTFRVNGPH 307
Query: 117 AMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQED-----PPKLGSSKRNKL 171
A + E+ NAI+ +C+++ G KDVS+P+I D R L
Sbjct: 308 ASRGHPELYANAIRALCNANT-SEGFRPDKDVSIPEINLYDGDMPPELLSPAPPPPRPFL 366
Query: 172 AFFAGAVNSPVREKLLQVWRN-DSEIYA--------------------HSGRLKTPYADE 210
AFFAG + VR+ LL+ W+ D ++ G PY
Sbjct: 367 AFFAGGRHGHVRDLLLRHWKGRDPAVFPVYEYDLPSIPVSVSGDGDTDAGGEGGNPYYWY 426
Query: 211 LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIP 270
+ S+FCL G EV + R+ ++++ GCVPV++A+ Y PFADVL W++FS+ VA D+P
Sbjct: 427 MRRSRFCLCPSGHEVASPRVVEAIHAGCVPVVVADGYAPPFADVLRWEAFSVAVAVADVP 486
Query: 271 LLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
L+++L+ I + E L+ V V++HF H P D F+M+++ +WLRR ++R+
Sbjct: 487 RLRELLERIPAPEVERLRDGVRLVKRHFMLHQPPERLDMFHMILHSVWLRRLNLRLN 543
>gi|115469550|ref|NP_001058374.1| Os06g0680900 [Oryza sativa Japonica Group]
gi|113596414|dbj|BAF20288.1| Os06g0680900 [Oryza sativa Japonica Group]
gi|125556494|gb|EAZ02100.1| hypothetical protein OsI_24187 [Oryza sativa Indica Group]
Length = 477
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 94/289 (32%), Positives = 148/289 (51%), Gaps = 18/289 (6%)
Query: 50 TKDPSKADLFFLPFSIARM-----RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGAD 104
+ P +A FFLP S+A + R D + ++ Y+ ++ YP+WNR+ GAD
Sbjct: 183 ARHPGEAHAFFLPISVASIAGYVYRRDMIDFWDPQLRLVAGYVDGLAAMYPFWNRSRGAD 242
Query: 105 HFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLG 164
HF V+CH E++ NAI+V+C + G DV+LP + P G
Sbjct: 243 HFLVSCHQWAPILSAAKAELRGNAIRVMCDAD-MSDGFDPATDVALPPVVASARATPPQG 301
Query: 165 --SSKRNKLAFFAGAVNSP--VREKLLQVWRNDSEIYAHSGRL--KTPYADELLGSKFCL 218
+S+R LAFFA VRE LL W + GRL + + + ++FCL
Sbjct: 302 RVASERTVLAFFAAGGGGGGAVREALLARWEGRDDRVVVYGRLPAGVDHGELMRRARFCL 361
Query: 219 ----HVKGFEVNTARIADSLYYGCVPVIIANH-YDLPFADVLNWKSFSIVVATLDIPLLK 273
+G + R+ +++ GCVPV++ + Y PF+DVL+W FS+ V + +K
Sbjct: 362 CPCGGGEGAAAASRRVVEAITAGCVPVLVDDGGYSPPFSDVLDWARFSVAVPAERVGEIK 421
Query: 274 KILKGISSEEYLLLQSYVLKVRKHFQWHVFPSD-YDAFYMVMYELWLRR 321
IL G+S Y +L+ VL+VR+HF+ + P+ +D MV++ +WLRR
Sbjct: 422 DILGGVSDRRYGVLRRRVLRVRRHFRLNRPPAKRFDVVNMVIHSIWLRR 470
>gi|222635067|gb|EEE65199.1| hypothetical protein OsJ_20321 [Oryza sativa Japonica Group]
Length = 551
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 79/241 (32%), Positives = 125/241 (51%), Gaps = 27/241 (11%)
Query: 113 IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQED-----PPKLGSSK 167
IG A + E+ NAI+ +C+++ G KDVS+P+I D
Sbjct: 310 IGPHASRGHPELYANAIRALCNANTS-EGFRPDKDVSIPEINLYDGDMPPELLSPAPPPP 368
Query: 168 RNKLAFFAGAVNSPVREKLLQVWRN-DSEIYA--------------------HSGRLKTP 206
R LAFFAG + VR+ LL+ W+ D ++ G P
Sbjct: 369 RPFLAFFAGGRHGHVRDLLLRHWKGRDPAVFPVYEYDLPSIPVSVSGDGDTDAGGEGGNP 428
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y + S+FCL G EV + R+ ++++ GCVPV++A+ Y PFADVL W++FS+ VA
Sbjct: 429 YYWYMRRSRFCLCPSGHEVASPRVVEAIHAGCVPVVVADGYAPPFADVLRWEAFSVAVAV 488
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRV 326
D+P L+++L+ I + E L+ V V++HF H P D F+M+++ +WLRR ++R+
Sbjct: 489 ADVPRLRELLERIPAPEVERLRDGVRLVKRHFMLHQPPERLDMFHMILHSVWLRRLNLRL 548
Query: 327 Q 327
Sbjct: 549 N 549
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 57/120 (47%), Gaps = 15/120 (12%)
Query: 1 MNRSFRVYVY-----PHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSK 55
M R F+VYVY P P N+ + G + E + T DP++
Sbjct: 137 MERRFKVYVYEEGEPPIAHEGPCKNI-----YAVEGRFIEELELMAPPLGG-VRTWDPAR 190
Query: 56 ADLFFLPFSIARMRHD--RRIGTEGIP--DFISHYIFNISQKYPYWNRTGGADHFYVACH 111
A FLP S+++M R + + P ++ Y+ ++ ++ +WNR+ GADHF ++CH
Sbjct: 191 AHALFLPLSVSQMVQLAYRPLSYDLSPLRAIVADYVAVVASRHRFWNRSAGADHFMLSCH 250
>gi|326490197|dbj|BAJ94172.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326526653|dbj|BAK00715.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 422
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 92/345 (26%), Positives = 156/345 (45%), Gaps = 51/345 (14%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIP------DFISHY 86
+A+E + + + S TK+P +AD F+ P + + +G P +
Sbjct: 83 FAAEIFMHQFLLASAVRTKNPEEADWFYTPVYVTC-----DLTQQGFPLPFRAPRIMRSA 137
Query: 87 IFNISQKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISG 141
I I+ +PYWNRT GADHF++A H G ++A E + + + ++
Sbjct: 138 IQYIAATWPYWNRTEGADHFFLAPHDFGACFHYQEERAIERGILPLLRRATLVQTFGQRN 197
Query: 142 HIAHKDVSLPQIWPRQEDPPKLGS------SKRNKLAFFAGAV----NSP-----VREKL 186
H+ ++ S+ P +P K+ + + R+ A+F G N P R
Sbjct: 198 HVCMQEGSI--TIPPYANPQKMQAHLISPGTPRSIFAYFRGLFYDMGNDPEGGYYARGAR 255
Query: 187 LQVWRN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIA 244
VW N D+ ++ S + Y +++ + FCL G+ + R+ +++ +GC+PVIIA
Sbjct: 256 ASVWENFKDNPLFDMSSEHPSTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIA 315
Query: 245 NHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL----LLQSYVLKVRKHFQW 300
+ LPFAD + W+ S+ VA D+P L IL ++ E+ L LL S +K F
Sbjct: 316 DDIVLPFADAIPWEQISVFVAEADVPRLDSILASVAPEDVLRKQRLLASPAMKQAVLFHQ 375
Query: 301 HVFPSDYDAFYMVMYEL----------WLRRSSVRVQWSTSLDSN 335
P DAF V+ L +L+ + W+ LDS+
Sbjct: 376 PARPG--DAFDQVLNGLARKLPHPDGAFLKPGQKVLDWNAGLDSD 418
>gi|357492457|ref|XP_003616517.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
gi|355517852|gb|AES99475.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
Length = 259
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 90/332 (27%), Positives = 141/332 (42%), Gaps = 83/332 (25%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
M ++F+V++Y N + ES F + S ++T+ P +A LFF
Sbjct: 1 MAQNFKVFMYQPNTNTNITQFSFKTE--------QESLFYSSLQNSSYLTQHPEQAHLFF 52
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
LPFS T + FIS I +PYWNR+ GADHFY++C I +
Sbjct: 53 LPFS-------SDTSTRSLARFISR----IRNDFPYWNRSLGADHFYLSCDGISHVNDRN 101
Query: 121 AWEVKLNAIQVVCSSSYFISGH---IAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGA 177
E+K NAIQ+ C F + H I HKD++LP I +P
Sbjct: 102 IVELKKNAIQIAC----FPTRHRSFIPHKDITLPPI----TNP----------------- 136
Query: 178 VNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYG 237
++PVR L +FC+ V+ + + ++L G
Sbjct: 137 -HAPVR---------------------------LSNEEFCV-VEYQNDDVLWLGEALRLG 167
Query: 238 CVPVIIANHY--DLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVR 295
CVPV++ D+PF +VL W+ ++ V + I E ++ +
Sbjct: 168 CVPVVVTEEAVNDMPFMEVLRWREMAVFVKS-----GVNIETDTWRERKGNMRRLGVVGS 222
Query: 296 KHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
KH +W+ +DAF +MY+LWLRR ++R +
Sbjct: 223 KHLRWNRPAQPFDAFNTIMYQLWLRRHTIRYE 254
>gi|168043163|ref|XP_001774055.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674601|gb|EDQ61107.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 541
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 78/271 (28%), Positives = 129/271 (47%), Gaps = 22/271 (8%)
Query: 5 FRVYVYPHRRNDPFANVLLPVDFEPRGNYASE-----SYFKKAFMKSHFVTKDPSKADLF 59
+++VYP + N F+ R + E F + ++S F T+ P KA LF
Sbjct: 147 LQIWVYPSQA----GNTSYEHKFDGRKDVMEELSSTADLFSRLLLRSKFSTELPQKAQLF 202
Query: 60 FLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSI-GRSAM 118
LPFSI +R D +G I D + Y+ N+ YPYWN + GA+HFY++ + +
Sbjct: 203 LLPFSIDALRVD--LGPSRISDHLRRYVQNVRTSYPYWNLSLGANHFYLSSQAFENNNKH 260
Query: 119 QKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQED---PPKLGSSKRNKLAFFA 175
+ E++ N+IQ C+ HKD P+ P + G + R LA+F
Sbjct: 261 RNVLELEKNSIQAACAPLRQNQNFYPHKDFIFPRYKPITQTEFYAALEGRTSRTVLAYFG 320
Query: 176 GAV-NSPVREKLLQVWRNDS--EIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIAD 232
G + ++P +L W++D E+ + +L SKFC++V +T D
Sbjct: 321 GTLADTPALVFILDAWKSDPDFEVEVDPSPHRISVYRQLARSKFCVNVPS--RDTFDFVD 378
Query: 233 SLYYGCVPVIIANH--YDLPFADVLNWKSFS 261
++ +GCV V+++ DLPF L+W+ F+
Sbjct: 379 AIRFGCVLVLLSKSVFLDLPFQGFLDWRQFA 409
>gi|356546128|ref|XP_003541483.1| PREDICTED: LOW QUALITY PROTEIN: xylogalacturonan
beta-1,3-xylosyltransferase-like [Glycine max]
Length = 252
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 84/286 (29%), Positives = 137/286 (47%), Gaps = 50/286 (17%)
Query: 45 KSHFVTKDPSKADLFFLPFSIARMRH----DRRIGTEGIPD----FISHYIFNISQKYPY 96
KS F P +A LF LPFS++++ RR ++ PD + YI ++ +YP
Sbjct: 4 KSPFKATHPEQAHLFLLPFSVSKVIRYVYKPRRSRSDYDPDRLQRLVLDYINIVANRYPN 63
Query: 97 WNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPR 156
WNR+ GADHF V+ H + EV I+ +C+++ G +DVS+ +++
Sbjct: 64 WNRSRGADHFLVSFHD----WLDANPEVFKYFIRALCNANTS-EGFQPSRDVSITEVY-- 116
Query: 157 QEDPPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGS-K 215
L + D E+ H K +L+G K
Sbjct: 117 ------------------------------LPSRKLDKEVQVHEYLPKGLEYTKLMGQRK 146
Query: 216 FCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKI 275
FCL +V + R+ +++Y GCVPVII ++Y LPF+DV+ F +A IP K I
Sbjct: 147 FCL-CPSXQVASPRVVEAIYVGCVPVIICDNYSLPFSDVVKRSKF---IAVERIPETKTI 202
Query: 276 LKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
L+ +S ++Y+ L S V +VR+HF + +D +M+++ L +R
Sbjct: 203 LQNVSKDKYMELYSNVKRVRRHFVINRPAKPFDLIHMILHSLRNKR 248
>gi|23821292|dbj|BAC20928.1| pectin-glucuronyltransferase [Nicotiana plumbaginifolia]
Length = 341
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 86/309 (27%), Positives = 143/309 (46%), Gaps = 25/309 (8%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E Y + + S T +P +AD F+ P ++ + + P + I I+
Sbjct: 2 FAAEIYMHRFLLSSAVRTFNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIA 61
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
+PYWNRT GADHF++ H G +KA E + + + ++ H+ K
Sbjct: 62 SNWPYWNRTEGADHFFITPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLK 121
Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
D +++P P Q+ L + R+ +F G N P R VW N
Sbjct: 122 DGSITIPPYAPPQKMQSHLIPPDTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENF 181
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
D+ ++ S T Y +++ + FCL G+ + R+ +++ +GC+PVIIA+ LPF
Sbjct: 182 KDNPLFDISTEHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPF 241
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
AD + W+ + VA D+P L IL I EE +L + +L Q +FP D
Sbjct: 242 ADAIPWEDIGVFVAEKDVPNLDTILTSIPPEE-ILRKQRLLANPSMKQAMLFPQPAQSGD 300
Query: 309 AFYMVMYEL 317
AF+ ++ L
Sbjct: 301 AFHQILNGL 309
>gi|386576430|gb|AFJ12119.1| glycosyltransferase, partial [Nicotiana tabacum]
Length = 317
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 86/309 (27%), Positives = 143/309 (46%), Gaps = 25/309 (8%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E Y + + S T +P +AD F+ P ++ + + P + I I+
Sbjct: 2 FAAEIYMHRFLLSSAVRTFNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIA 61
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
+PYWNRT GADHF++ H G +KA E + + + ++ H+ K
Sbjct: 62 SNWPYWNRTEGADHFFITPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLK 121
Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
D +++P P Q+ L + R+ +F G N P R VW N
Sbjct: 122 DGSITIPPYAPPQKMQSHLIPPDTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENF 181
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
D+ ++ S T Y +++ + FCL G+ + R+ +++ +GC+PVIIA+ LPF
Sbjct: 182 KDNPLFDISTEHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPF 241
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
AD + W+ + VA D+P L IL I EE +L + +L Q +FP D
Sbjct: 242 ADAIPWEDIGVFVAEKDVPNLDTILTSIPPEE-ILRKQRLLANPSMKQAMLFPQPAQSGD 300
Query: 309 AFYMVMYEL 317
AF+ ++ L
Sbjct: 301 AFHQILNGL 309
>gi|42573339|ref|NP_974766.1| Exostosin family protein [Arabidopsis thaliana]
gi|332004318|gb|AED91701.1| Exostosin family protein [Arabidopsis thaliana]
Length = 453
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 81/242 (33%), Positives = 123/242 (50%), Gaps = 30/242 (12%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPR----GNYASESYFKKAFMKSH-FVTKDPSK 55
M ++ +VYVY P+ +P G YASE +F K SH F+TKDP+K
Sbjct: 216 MEQTLKVYVYSEGDR--------PIFHQPEAIMEGIYASEGWFMKLMESSHRFLTKDPTK 267
Query: 56 ADLFFLPFSIARMR-----HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVAC 110
A LF++PFS ++ HD + ++ +YI I+ YP WNRT G+DHF+ AC
Sbjct: 268 AHLFYIPFSSRILQQKLYVHDSH-SRNNLVKYLGNYIDLIASNYPSWNRTCGSDHFFTAC 326
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQ--IWPRQEDPPKLGS--- 165
H + + + +N I+ +C++ I + KDVSLP+ + Q K+G
Sbjct: 327 HDWAPTETRGPY---INCIRALCNADVGID-FVVGKDVSLPETKVSSLQNPNGKIGGSRP 382
Query: 166 SKRNKLAFFAGAVNSPVREKLLQVW--RNDSEIYAHSGRLKTPYADELLGSKFCLHVKGF 223
SKR LAFFAG+++ VR LL W R + ++ + Y + S+FC+ KG+
Sbjct: 383 SKRTILAFFAGSLHGYVRPILLNQWSSRPEQDMKIFNRIDHKSYIRYMKRSRFCVCAKGY 442
Query: 224 EV 225
E
Sbjct: 443 EC 444
>gi|449459136|ref|XP_004147302.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase F8H-like
[Cucumis sativus]
Length = 447
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 88/311 (28%), Positives = 135/311 (43%), Gaps = 28/311 (9%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGI---PDFISHYIFN 89
+ASE KA + SH T DPS+AD FF+P ++ G I P ++ +
Sbjct: 123 FASEVAIHKALLTSHVRTLDPSEADFFFVPVYVS-CNFSSFNGFPAIAHAPSLLASAVDV 181
Query: 90 ISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH---- 145
IS ++P+WNR+ G DH +VA H G N I +S +
Sbjct: 182 ISGQFPFWNRSRGFDHVFVASHDYGACFHSLEDMAIANGIPEFLKNSIILQTFGVKYKHP 241
Query: 146 -KDVS----LPQIWPRQEDPPKLGSSKRNKLAFFAGAV------------NSPVREKLLQ 188
+DV P I P +P + +R+ AFF G + VR + +
Sbjct: 242 CQDVENILIPPYISPEFMEPAVVDGRRRDIFAFFRGKMEVNPKNVGGRFYGKRVRTTIWK 301
Query: 189 VWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYD 248
+ D Y R Y E+ S FCL G+ + R+ +S+ GCVPVIIA+
Sbjct: 302 KFHRDRRFYLRRHRFAG-YRSEIARSVFCLCPLGWAPWSPRLVESVALGCVPVIIADGIR 360
Query: 249 LPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVL--KVRKHFQWHVFPSD 306
LPF ++W S+ VA D+ L+KIL+ +++ +Q + K R+ +H
Sbjct: 361 LPFPSAVDWPGISLTVAEKDVGKLRKILERVAATNLTAIQKNLWDPKNRRALLFHNPTQP 420
Query: 307 YDAFYMVMYEL 317
DA + V+ L
Sbjct: 421 QDATWQVLSAL 431
>gi|39725600|dbj|BAD04923.1| pectin-glucuronyltransferase [Nicotiana tabacum]
Length = 317
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 80/282 (28%), Positives = 130/282 (46%), Gaps = 21/282 (7%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E Y + + S T +P +AD F+ P ++ + + P + I I+
Sbjct: 2 FAAEIYMHRFLLSSAVRTFNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIA 61
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
+PYWNRT GADHF++ H G +KA E + + + ++ H+ K
Sbjct: 62 SNWPYWNRTEGADHFFITPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLK 121
Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
D +++P P Q+ L + R+ +F G N P R VW N
Sbjct: 122 DGSITIPPYAPPQKMQSHLIPPDTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENF 181
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
D+ + S T Y +++ + FCL G+ + R+ +++ +GC+PVIIA+ LPF
Sbjct: 182 KDNPPFDISTEHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPF 241
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK 293
AD + W+ + VA D+P L IL I EE L Q + K
Sbjct: 242 ADAIPWEDIGVFVAEKDVPTLDTILTSIPPEEILRKQRLLAK 283
>gi|150246869|emb|CAM91796.1| putative glycosyltransferase family 47 [Triticum aestivum]
Length = 420
Score = 107 bits (266), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 84/333 (25%), Positives = 148/333 (44%), Gaps = 35/333 (10%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P + + P + I I+
Sbjct: 81 FAAEIFMHRFLLSSAVRTTNPEEADWFYTPVYPTCDLTPSGLPLPFKSPRMMRSAIELIA 140
Query: 92 QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS-----GHIAHK 146
K+PYWNR+ GADHF+V H G + + I + + + H+ K
Sbjct: 141 TKWPYWNRSEGADHFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLK 200
Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAG----AVNSP-----VREKLLQVWRN- 192
+ +++P P Q+ L G + R+ +F G N P R VW N
Sbjct: 201 EGSITIPPFAPPQKMQNHLIPGETPRSIFVYFRGLFYDTGNDPEGGYYARGARASVWENF 260
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
++ ++ S Y +++ S FCL G+ + R+ +++ +GC+PVIIA+ LPF
Sbjct: 261 KNNPLFDISTDHPPTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 320
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
AD + W+ + V D+P L IL I +E+ +L + +L Q +FP D
Sbjct: 321 ADAIPWEEIGVFVPEEDVPRLDSILTSIPTED-ILRKQRLLANPSMKQAMLFPQPAQAGD 379
Query: 309 AFYMVM----------YELWLRRSSVRVQWSTS 331
AF+ ++ +++L+ VR+ W+
Sbjct: 380 AFHQILNGLARKLPHGEDVFLKPGQVRLNWTAG 412
>gi|242059739|ref|XP_002459015.1| hypothetical protein SORBIDRAFT_03g044510 [Sorghum bicolor]
gi|241930990|gb|EES04135.1| hypothetical protein SORBIDRAFT_03g044510 [Sorghum bicolor]
Length = 415
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 88/340 (25%), Positives = 150/340 (44%), Gaps = 41/340 (12%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T DP +AD F+ P ++ + P + I ++
Sbjct: 76 FAAEIFMHQFLLSSAVRTLDPEEADWFYTPVYTTCDLTPQGFPLPFRAPRIMRSAIRYVA 135
Query: 92 QKYPYWNRTGGADHFYVACHSIG-------RSAMQKAWEVKLNAIQVVCSSSYFISGHIA 144
+P+WNRT GADHF++ H G AM++ L +V ++ H+
Sbjct: 136 TTWPFWNRTDGADHFFLTPHDFGACFHYQEERAMERGILPLLRRATLV--QTFGQRNHVC 193
Query: 145 HKDVSLPQIWPRQEDPPKLGS------SKRNKLAFFAGAV----NSP-----VREKLLQV 189
+D S+ P DP K+ + + R+ +F G N P R V
Sbjct: 194 LQDGSI--TVPPYADPHKMQAHLISPGTPRSIFVYFRGLFYDMGNDPEGGYYARGARASV 251
Query: 190 WRN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHY 247
W N D+ ++ S Y +++ + FCL G+ + R+ +++ +GC+PVIIA+
Sbjct: 252 WENFKDNPLFDISTEHPYTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 311
Query: 248 DLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK--VRKHFQWHVFPS 305
LPFAD + W+ S+ VA D+P L IL I + L Q + + V++ +H
Sbjct: 312 VLPFADAIPWEDISVFVAERDVPRLDSILTSIPLADILRRQRLLARESVKQALLFHQPAR 371
Query: 306 DYDAFYMVMYEL----------WLRRSSVRVQWSTSLDSN 335
DAF+ V+ L +L+ + W LDS+
Sbjct: 372 TGDAFHQVLNGLARKLPHGDGVFLQPGEKVLDWDAGLDSD 411
>gi|386576428|gb|AFJ12118.1| glycosyltransferase, partial [Nicotiana tabacum]
Length = 294
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 79/276 (28%), Positives = 128/276 (46%), Gaps = 21/276 (7%)
Query: 34 ASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNISQ 92
A+E Y + + S T +P +AD F+ P ++ + + P + I I+
Sbjct: 1 AAEIYMHRFLLSSAVRTFNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIAS 60
Query: 93 KYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHKD 147
+PYWNRT GADHF++ H G +KA E + + + ++ H+ KD
Sbjct: 61 NWPYWNRTEGADHFFITPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLKD 120
Query: 148 --VSLPQIWPRQEDPPKL--GSSKRNKLAFFAGAV----NSP-----VREKLLQVWRN-- 192
+++P P Q+ L + R+ +F G N P R VW N
Sbjct: 121 GSITIPPYAPPQKMQSHLIPPDTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENFK 180
Query: 193 DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFA 252
D+ ++ S T Y +++ + FCL G+ + R+ +++ +GC+PVIIA+ LPFA
Sbjct: 181 DNPLFDISTEHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPFA 240
Query: 253 DVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
D + W+ + VA D+P L IL I EE L Q
Sbjct: 241 DAIPWEDIGVFVAEKDVPNLDTILTSIPPEEILRKQ 276
>gi|326488903|dbj|BAJ98063.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 421
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 85/333 (25%), Positives = 146/333 (43%), Gaps = 35/333 (10%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P + + P + I I+
Sbjct: 82 FAAEIFMHRFLLSSAVRTTNPEEADWFYTPVYPTCDLTPSGLPLPFKSPRMMRSAIELIA 141
Query: 92 QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS-----GHIAHK 146
K+PYWNR+ GADHF+V H G + + I + + + H+ K
Sbjct: 142 TKWPYWNRSEGADHFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLK 201
Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAG----AVNSP-----VREKLLQVWRN- 192
D +++P P Q+ L G + R+ +F G N P R VW N
Sbjct: 202 DGSITIPPFAPPQKMQNHLIPGETPRSIFVYFRGLFYDTGNDPEGGYYARGARASVWENF 261
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
++ ++ S Y +++ S FCL G+ + R+ +++ +GC+PVIIA+ LPF
Sbjct: 262 KNNPLFDISTDHPPTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 321
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
AD + W+ + V D+P L IL I +E+ +L + +L Q +FP D
Sbjct: 322 ADAIPWEEIGVFVPEEDVPRLDSILTSIPTED-ILRKQRLLANPSMKQAMLFPQPAQAGD 380
Query: 309 AFYMVMYEL----------WLRRSSVRVQWSTS 331
AF+ ++ L +L+ R+ W+
Sbjct: 381 AFHQILNGLARKLPHGESVFLKPGQARLNWTAG 413
>gi|414878904|tpg|DAA56035.1| TPA: hypothetical protein ZEAMMB73_301237 [Zea mays]
Length = 360
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 87/340 (25%), Positives = 150/340 (44%), Gaps = 41/340 (12%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T DP +AD F+ P ++ + P + I ++
Sbjct: 21 FAAEIFMHQFLLSSAVRTLDPEEADWFYTPAYTTCDLTPQGFPLPFRAPRIMRSAIRYVA 80
Query: 92 QKYPYWNRTGGADHFYVACHSIG-------RSAMQKAWEVKLNAIQVVCSSSYFISGHIA 144
+P+WNRT GADHF++ H G AM++ L +V ++ H+
Sbjct: 81 TTWPFWNRTDGADHFFLTPHDFGACFHYQEERAMERGILPLLRRATLV--QTFGQRNHVC 138
Query: 145 HKDVSLPQIWPRQEDPPKLGS------SKRNKLAFFAGAV----NSP-----VREKLLQV 189
+D S+ P DP K+ + + R+ +F G N P R V
Sbjct: 139 LQDGSI--TVPPYADPGKMQAHLISPGTPRSIFVYFRGLFYDMGNDPEGGYYARGARASV 196
Query: 190 WRN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHY 247
W N D+ ++ S + Y +++ + FCL G+ + R+ +++ +GC+PVIIA+
Sbjct: 197 WENFKDNPLFDISTEHPSTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 256
Query: 248 DLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK--VRKHFQWHVFPS 305
LPFAD + W+ S+ VA D+P L IL I + L Q + + V++ +H
Sbjct: 257 VLPFADAIPWEDISVFVAERDVPRLDSILTSIPLPDILRRQRLLARDSVKQALLFHQPAR 316
Query: 306 DYDAFYMVMYEL----------WLRRSSVRVQWSTSLDSN 335
DAF+ V+ L +L+ + W L S+
Sbjct: 317 PGDAFHQVLNGLARKLPHGEGVFLQPGEKALDWDAGLQSD 356
>gi|414878902|tpg|DAA56033.1| TPA: hypothetical protein ZEAMMB73_301237 [Zea mays]
Length = 387
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 87/340 (25%), Positives = 150/340 (44%), Gaps = 41/340 (12%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T DP +AD F+ P ++ + P + I ++
Sbjct: 48 FAAEIFMHQFLLSSAVRTLDPEEADWFYTPAYTTCDLTPQGFPLPFRAPRIMRSAIRYVA 107
Query: 92 QKYPYWNRTGGADHFYVACHSIG-------RSAMQKAWEVKLNAIQVVCSSSYFISGHIA 144
+P+WNRT GADHF++ H G AM++ L +V ++ H+
Sbjct: 108 TTWPFWNRTDGADHFFLTPHDFGACFHYQEERAMERGILPLLRRATLV--QTFGQRNHVC 165
Query: 145 HKDVSLPQIWPRQEDPPKLGS------SKRNKLAFFAGAV----NSP-----VREKLLQV 189
+D S+ P DP K+ + + R+ +F G N P R V
Sbjct: 166 LQDGSI--TVPPYADPGKMQAHLISPGTPRSIFVYFRGLFYDMGNDPEGGYYARGARASV 223
Query: 190 WRN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHY 247
W N D+ ++ S + Y +++ + FCL G+ + R+ +++ +GC+PVIIA+
Sbjct: 224 WENFKDNPLFDISTEHPSTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 283
Query: 248 DLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK--VRKHFQWHVFPS 305
LPFAD + W+ S+ VA D+P L IL I + L Q + + V++ +H
Sbjct: 284 VLPFADAIPWEDISVFVAERDVPRLDSILTSIPLPDILRRQRLLARDSVKQALLFHQPAR 343
Query: 306 DYDAFYMVMYEL----------WLRRSSVRVQWSTSLDSN 335
DAF+ V+ L +L+ + W L S+
Sbjct: 344 PGDAFHQVLNGLARKLPHGEGVFLQPGEKALDWDAGLQSD 383
>gi|242090769|ref|XP_002441217.1| hypothetical protein SORBIDRAFT_09g022500 [Sorghum bicolor]
gi|241946502|gb|EES19647.1| hypothetical protein SORBIDRAFT_09g022500 [Sorghum bicolor]
Length = 422
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 93/351 (26%), Positives = 150/351 (42%), Gaps = 48/351 (13%)
Query: 24 PVDFEPRG---NYASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGI 79
PV+ +PR +A+E + ++ + S T DP +AD F+ P ++ +
Sbjct: 69 PVEKDPRCLTHMFATEIFVHRSLLSSAVRTLDPEEADWFYTPVYTTCDLTASGHPMPFDS 128
Query: 80 PDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFI 139
P + I I+ ++PYWNR+ GADHF+V H G + + I V + +
Sbjct: 129 PRMMRSAIRLIADRWPYWNRSEGADHFFVTPHDFGACFHFQEEKAMARGILPVLRRATLV 188
Query: 140 S-----GHIAHKD----VSLPQIWP-----RQEDPPKLGSSKRNKLAFFAG----AVNSP 181
H+ KD +++P P Q PP ++ R+ +F G A N P
Sbjct: 189 QTFGQRNHVCLKDGGGSITIPPYAPPWKMEAQLLPP---ATPRSIFVYFRGLFYDAGNDP 245
Query: 182 -----VREKLLQVWRN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSL 234
R VW N + ++ S T Y ++ + FCL G+ + R+ +++
Sbjct: 246 EGGYYARGARASVWENFKSNPLFDISTAHPTTYYQDMQRAVFCLCPLGWAPWSPRLVEAV 305
Query: 235 YYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL----LLQSY 290
+GC+PV+IA+ LPFAD + W + VA D+P L IL I E L LL S
Sbjct: 306 VFGCIPVVIADDIVLPFADAIPWADIGVFVAEDDVPRLDTILTSIPVEVVLRKQRLLASP 365
Query: 291 VLKVRKHFQWHVFPSDYDAFYMVMYEL----------WLRRSSVRVQWSTS 331
+K F P DAF+ ++ L +LR + W+
Sbjct: 366 AMKRAVLFPQPAQPG--DAFHQILNGLARKLPHGDDAFLRNGQTVLNWTAG 414
>gi|414878903|tpg|DAA56034.1| TPA: hypothetical protein ZEAMMB73_301237 [Zea mays]
Length = 393
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 87/340 (25%), Positives = 150/340 (44%), Gaps = 41/340 (12%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T DP +AD F+ P ++ + P + I ++
Sbjct: 54 FAAEIFMHQFLLSSAVRTLDPEEADWFYTPAYTTCDLTPQGFPLPFRAPRIMRSAIRYVA 113
Query: 92 QKYPYWNRTGGADHFYVACHSIG-------RSAMQKAWEVKLNAIQVVCSSSYFISGHIA 144
+P+WNRT GADHF++ H G AM++ L +V ++ H+
Sbjct: 114 TTWPFWNRTDGADHFFLTPHDFGACFHYQEERAMERGILPLLRRATLV--QTFGQRNHVC 171
Query: 145 HKDVSLPQIWPRQEDPPKLGS------SKRNKLAFFAGAV----NSP-----VREKLLQV 189
+D S+ P DP K+ + + R+ +F G N P R V
Sbjct: 172 LQDGSI--TVPPYADPGKMQAHLISPGTPRSIFVYFRGLFYDMGNDPEGGYYARGARASV 229
Query: 190 WRN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHY 247
W N D+ ++ S + Y +++ + FCL G+ + R+ +++ +GC+PVIIA+
Sbjct: 230 WENFKDNPLFDISTEHPSTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 289
Query: 248 DLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK--VRKHFQWHVFPS 305
LPFAD + W+ S+ VA D+P L IL I + L Q + + V++ +H
Sbjct: 290 VLPFADAIPWEDISVFVAERDVPRLDSILTSIPLPDILRRQRLLARDSVKQALLFHQPAR 349
Query: 306 DYDAFYMVMYEL----------WLRRSSVRVQWSTSLDSN 335
DAF+ V+ L +L+ + W L S+
Sbjct: 350 PGDAFHQVLNGLARKLPHGEGVFLQPGEKALDWDAGLQSD 389
>gi|414878901|tpg|DAA56032.1| TPA: secondary cell wall glycosyltransferase family 47 [Zea mays]
Length = 417
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 87/340 (25%), Positives = 150/340 (44%), Gaps = 41/340 (12%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T DP +AD F+ P ++ + P + I ++
Sbjct: 78 FAAEIFMHQFLLSSAVRTLDPEEADWFYTPAYTTCDLTPQGFPLPFRAPRIMRSAIRYVA 137
Query: 92 QKYPYWNRTGGADHFYVACHSIG-------RSAMQKAWEVKLNAIQVVCSSSYFISGHIA 144
+P+WNRT GADHF++ H G AM++ L +V ++ H+
Sbjct: 138 TTWPFWNRTDGADHFFLTPHDFGACFHYQEERAMERGILPLLRRATLV--QTFGQRNHVC 195
Query: 145 HKDVSLPQIWPRQEDPPKLGS------SKRNKLAFFAGAV----NSP-----VREKLLQV 189
+D S+ P DP K+ + + R+ +F G N P R V
Sbjct: 196 LQDGSI--TVPPYADPGKMQAHLISPGTPRSIFVYFRGLFYDMGNDPEGGYYARGARASV 253
Query: 190 WRN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHY 247
W N D+ ++ S + Y +++ + FCL G+ + R+ +++ +GC+PVIIA+
Sbjct: 254 WENFKDNPLFDISTEHPSTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 313
Query: 248 DLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK--VRKHFQWHVFPS 305
LPFAD + W+ S+ VA D+P L IL I + L Q + + V++ +H
Sbjct: 314 VLPFADAIPWEDISVFVAERDVPRLDSILTSIPLPDILRRQRLLARDSVKQALLFHQPAR 373
Query: 306 DYDAFYMVMYEL----------WLRRSSVRVQWSTSLDSN 335
DAF+ V+ L +L+ + W L S+
Sbjct: 374 PGDAFHQVLNGLARKLPHGEGVFLQPGEKALDWDAGLQSD 413
>gi|168000965|ref|XP_001753186.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695885|gb|EDQ82227.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 449
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 86/313 (27%), Positives = 141/313 (45%), Gaps = 33/313 (10%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A E Y +S T +P +AD F+ P ++ + + P + I IS
Sbjct: 106 FAVEEYLHDFLTESPVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRVMRSAISYIS 165
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
+PYWNRT GADHF+V H +KA E + + + ++ + H+ K
Sbjct: 166 SHWPYWNRTDGADHFFVVPHDFAACFHYQEEKAIERGILPLLKRATLIQTFGQNHHVCLK 225
Query: 147 DVSL-------PQIWPRQEDPPKLGSSKRNKLAFFAGAVNSP---------VREKLLQVW 190
+ S+ P+ + +PP S+ R+ A+F G P R +W
Sbjct: 226 EDSIVIPPYAPPERMQTRLNPP---STPRSIFAYFRGLFYDPGNDPEGGYYARGARAAIW 282
Query: 191 RN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYD 248
N D+ ++ S Y +++ + FCL G+ + R+ + + +GC+PVIIA+
Sbjct: 283 ENFKDNPLFDISTEHPATYYEDMQRAIFCLCPLGWAPWSPRLVEGVIFGCIPVIIADDIV 342
Query: 249 LPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL----LLQSYVLKVRKHFQWHVFP 304
LPFAD + W+ + V D+P+L KIL I+ EE L LL + +K F P
Sbjct: 343 LPFADAIPWEKIGVFVEEKDVPILDKILCTINHEEVLEKQRLLANPAMKQAMLFPRPAKP 402
Query: 305 SDYDAFYMVMYEL 317
DAF+ ++ L
Sbjct: 403 G--DAFHQILNGL 413
>gi|195648072|gb|ACG43504.1| secondary cell wall-related glycosyltransferase family 47 [Zea
mays]
Length = 420
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 87/340 (25%), Positives = 150/340 (44%), Gaps = 41/340 (12%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T DP +AD F+ P ++ + P + I ++
Sbjct: 81 FAAEIFMHQFLLSSAVRTLDPEEADWFYTPAYTTCDLTPQGFPLPFRAPRIMRSAIRYVA 140
Query: 92 QKYPYWNRTGGADHFYVACHSIG-------RSAMQKAWEVKLNAIQVVCSSSYFISGHIA 144
+P+WNRT GADHF++ H G AM++ L +V ++ H+
Sbjct: 141 TTWPFWNRTDGADHFFLTPHDFGACFHYQEERAMERGILPLLRRATLV--QTFGQRNHVC 198
Query: 145 HKDVSLPQIWPRQEDPPKLGS------SKRNKLAFFAGAV----NSP-----VREKLLQV 189
+D S+ P DP K+ + + R+ +F G N P R V
Sbjct: 199 LQDGSI--TVPPYADPRKMQAHLISPGTPRSIFVYFRGLFYDMGNDPEGGYYARGARASV 256
Query: 190 WRN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHY 247
W N D+ ++ S + Y +++ + FCL G+ + R+ +++ +GC+PVIIA+
Sbjct: 257 WENFKDNPLFDISTEHPSTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDI 316
Query: 248 DLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK--VRKHFQWHVFPS 305
LPFAD + W+ S+ VA D+P L IL I + L Q + + V++ +H
Sbjct: 317 VLPFADAIPWEDISVFVAERDVPRLDSILTSIPLPDILRRQRLLARDSVKQALLFHQPAR 376
Query: 306 DYDAFYMVMYEL----------WLRRSSVRVQWSTSLDSN 335
DAF+ V+ L +L+ + W L S+
Sbjct: 377 PGDAFHQVLNGLARKLPHGEGVFLQPGEKALDWDAGLQSD 416
>gi|10176877|dbj|BAB10084.1| unnamed protein product [Arabidopsis thaliana]
gi|23821294|dbj|BAC20929.1| NpGUT1 homolog [Arabidopsis thaliana]
Length = 341
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 84/309 (27%), Positives = 140/309 (45%), Gaps = 25/309 (8%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E Y ++ + S T +P +AD F++P ++ + + P + I I+
Sbjct: 2 FAAEIYMQRFLLSSPVRTLNPEEADWFYVPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIA 61
Query: 92 QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS-----GHIAHK 146
+PYWNRT GADHF+V H G + + I + + + H+ K
Sbjct: 62 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQRNHVCLK 121
Query: 147 D--VSLPQIWPRQEDPPKLGSSK--RNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
+ +++P P Q+ L K R+ +F G N P R VW N
Sbjct: 122 EGSITVPPYAPPQKMQSHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENF 181
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
D+ ++ S T Y +++ + FCL G+ + R+ +++ +GC+PVIIA+ LPF
Sbjct: 182 KDNPLFDISTEHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPF 241
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
AD + W+ + V D+P L IL I E +L + +L Q +FP D
Sbjct: 242 ADAIPWEDIGVFVDEKDVPYLDTILTSIPP-EVILRKQRLLANPSMKQAMLFPQPAQPGD 300
Query: 309 AFYMVMYEL 317
AF+ V+ L
Sbjct: 301 AFHQVLNGL 309
>gi|357126594|ref|XP_003564972.1| PREDICTED: probable glucuronosyltransferase Os01g0926400-like
[Brachypodium distachyon]
Length = 423
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 84/312 (26%), Positives = 141/312 (45%), Gaps = 31/312 (9%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T DP +AD F+ P ++ + P + + I+
Sbjct: 84 FAAEIFMHQFLLASAVRTMDPEEADWFYTPVYTTCDLTQQGFPLPFRAPRMMRSAVQYIA 143
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
+PYWNRT GADHF++A H G ++A E + + + ++ H+ +
Sbjct: 144 ATWPYWNRTEGADHFFLAPHDFGACFHYQEERAIERGILPVLRRATLVQTFGQRNHVCMQ 203
Query: 147 DVSLPQIWPRQEDPPKLGS------SKRNKLAFFAGAV----NSP-----VREKLLQVWR 191
+ S+ P +P K+ + + R+ A+F G N P R VW
Sbjct: 204 EGSI--TIPPYANPQKMQAHLISPGTPRSIFAYFRGLFYDMGNDPEGGYYARGARASVWE 261
Query: 192 N--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDL 249
N D+ ++ S + Y +++ + FCL G+ + R+ +++ +GC+PVIIA+ L
Sbjct: 262 NFKDNPLFDMSTEHPSTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL 321
Query: 250 PFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL----LLQSYVLKVRKHFQWHVFPS 305
PFAD + W S+ V D+P L IL + E+ L LL S +K F P
Sbjct: 322 PFADAIPWDQISVFVPEADVPRLDSILASVPPEDVLRKQRLLASPAMKQAVLFHQPAQPR 381
Query: 306 DYDAFYMVMYEL 317
DAF V+ L
Sbjct: 382 --DAFDQVLNGL 391
>gi|297797147|ref|XP_002866458.1| hypothetical protein ARALYDRAFT_496352 [Arabidopsis lyrata subsp.
lyrata]
gi|297312293|gb|EFH42717.1| hypothetical protein ARALYDRAFT_496352 [Arabidopsis lyrata subsp.
lyrata]
Length = 417
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 84/309 (27%), Positives = 140/309 (45%), Gaps = 25/309 (8%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E Y ++ + S T +P +AD F++P ++ + + P + I I+
Sbjct: 78 FAAEIYMQRFLLSSPVRTLNPEEADWFYVPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIA 137
Query: 92 QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS-----GHIAHK 146
+PYWNRT GADHF+V H G + + I + + + H+ K
Sbjct: 138 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQRNHVCLK 197
Query: 147 D--VSLPQIWPRQEDPPKLGSSK--RNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
+ +++P P Q+ L K R+ +F G N P R VW N
Sbjct: 198 EGSITVPPYAPPQKMQSHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENF 257
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
D+ ++ S T Y +++ + FCL G+ + R+ +++ +GC+PVIIA+ LPF
Sbjct: 258 KDNPLFDISTEHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPF 317
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
AD + W+ + V D+P L IL I E +L + +L Q +FP D
Sbjct: 318 ADAIPWEDIGVFVDEKDVPYLDTILTSIPP-EVILRKQRLLANPSMKQAMLFPQPAQPGD 376
Query: 309 AFYMVMYEL 317
AF+ V+ L
Sbjct: 377 AFHQVLNGL 385
>gi|317106607|dbj|BAJ53114.1| JHL07K02.4 [Jatropha curcas]
Length = 417
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 88/310 (28%), Positives = 141/310 (45%), Gaps = 27/310 (8%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + + P + I IS
Sbjct: 78 FAAEIFMHRFLLSSPVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIS 137
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
+PYWNRT GADHF+V H G +KA E + + + ++ H+ K
Sbjct: 138 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLK 197
Query: 147 D--VSLPQIWPRQEDPPKLGSSK--RNKLAFFAG----AVNSP-----VREKLLQVWRN- 192
D +++P P Q+ L K R+ +F G N P R VW N
Sbjct: 198 DGSITIPPYAPPQKMQAHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENF 257
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
D+ ++ S + Y +++ + FCL G+ + R+ +++ +GC+PVIIA+ LPF
Sbjct: 258 KDNPLFDISTEHPSTYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPF 317
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL----LLQSYVLKVRKHFQWHVFPSDY 307
AD + W+ + V D+P L IL I E L LL + +K F V P
Sbjct: 318 ADAIPWEQIGVFVDEKDVPNLDAILTSIPPEVILRKQRLLANPSMKQAMLFPQPVQPG-- 375
Query: 308 DAFYMVMYEL 317
DAF+ V+ L
Sbjct: 376 DAFHQVLNGL 385
>gi|301072486|gb|ADK56173.1| glycosyltransferase 47 [Triticum aestivum]
Length = 422
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 83/333 (24%), Positives = 147/333 (44%), Gaps = 35/333 (10%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P + + P + I I+
Sbjct: 83 FAAEIFMHRFLLSSAVRTTNPEEADWFYTPVYPTCDLTPSGLPLPFKSPRMMRSAIELIA 142
Query: 92 QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS-----GHIAHK 146
K+PYWNR+ GADHF+V H G + + I + + + H+ K
Sbjct: 143 TKWPYWNRSEGADHFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLK 202
Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAG----AVNSP-----VREKLLQVWRN- 192
+ +++P P Q+ L G + R+ +F G N P R VW N
Sbjct: 203 EGSITIPPFAPPQKMQNHLIPGETPRSIFVYFRGLFYDTGNDPEGGYYARGARASVWENF 262
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
++ ++ S Y +++ S FCL G+ + R+ +++ +GC+PVIIA+ LPF
Sbjct: 263 KNNPLFDISTDHPPTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 322
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
AD + W+ + V D+P L IL I +E+ +L + +L Q +FP D
Sbjct: 323 ADAIPWEEIGVFVPEEDVPRLDSILTSIPTED-ILRKQRLLANPSMKQAMLFPQPAQAGD 381
Query: 309 AFYMVM----------YELWLRRSSVRVQWSTS 331
AF+ ++ +++L+ R+ W+
Sbjct: 382 AFHQILNGLARKLPHGEDVFLKPGQARLNWTAG 414
>gi|18424516|ref|NP_568941.1| Exostosin family protein [Arabidopsis thaliana]
gi|75163931|sp|Q940Q8.1|IX10L_ARATH RecName: Full=Probable beta-1,4-xylosyltransferase IRX10L; AltName:
Full=Glucuronoxylan glucuronosyltransferase 1;
Short=AtGUT1; AltName: Full=Glucuronoxylan
glucuronosyltransferase 2; Short=AtGUT2; AltName:
Full=Protein IRREGULAR XYLEM 10-like; AltName:
Full=Xylan xylosyltransferase IRX10L
gi|15809826|gb|AAL06841.1| AT5g61840/mac9_140 [Arabidopsis thaliana]
gi|17978869|gb|AAL47406.1| AT5g61840/mac9_140 [Arabidopsis thaliana]
gi|332010141|gb|AED97524.1| Exostosin family protein [Arabidopsis thaliana]
Length = 415
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 84/309 (27%), Positives = 140/309 (45%), Gaps = 25/309 (8%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E Y ++ + S T +P +AD F++P ++ + + P + I I+
Sbjct: 76 FAAEIYMQRFLLSSPVRTLNPEEADWFYVPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIA 135
Query: 92 QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS-----GHIAHK 146
+PYWNRT GADHF+V H G + + I + + + H+ K
Sbjct: 136 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQRNHVCLK 195
Query: 147 D--VSLPQIWPRQEDPPKLGSSK--RNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
+ +++P P Q+ L K R+ +F G N P R VW N
Sbjct: 196 EGSITVPPYAPPQKMQSHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENF 255
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
D+ ++ S T Y +++ + FCL G+ + R+ +++ +GC+PVIIA+ LPF
Sbjct: 256 KDNPLFDISTEHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPF 315
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
AD + W+ + V D+P L IL I E +L + +L Q +FP D
Sbjct: 316 ADAIPWEDIGVFVDEKDVPYLDTILTSIPP-EVILRKQRLLANPSMKQAMLFPQPAQPGD 374
Query: 309 AFYMVMYEL 317
AF+ V+ L
Sbjct: 375 AFHQVLNGL 383
>gi|357126610|ref|XP_003564980.1| PREDICTED: probable glucuronosyltransferase Os01g0926700-like
[Brachypodium distachyon]
Length = 418
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 82/309 (26%), Positives = 142/309 (45%), Gaps = 25/309 (8%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + P + I I+
Sbjct: 79 FAAEIFMHRFLLSSAVRTFNPEEADWFYTPVYATCDLTPSGLPLPFKSPRMVRSAIQLIA 138
Query: 92 QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS-----GHIAHK 146
+K+PYWNR+ GADHF+VA H G + + I + + + H+ K
Sbjct: 139 EKWPYWNRSEGADHFFVAPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLK 198
Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAG----AVNSP-----VREKLLQVWRN- 192
D +++P P Q+ L G + R+ +F G N P R VW N
Sbjct: 199 DGSITIPPYAPPQKMQNHLIPGDTPRSIFVYFRGLFYDTSNDPEGGYYARGARASVWENF 258
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
++ ++ S Y +++ S FCL G+ + R+ +++ +GC+PVIIA+ LPF
Sbjct: 259 KNNPLFDISTDHPPTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 318
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
AD + W+ + V+ D+P L IL I + + +L + +L Q +FP D
Sbjct: 319 ADAIPWEDIGVFVSEDDVPKLDSILTSIPT-DVILRKQRLLANPSMKQAMLFPQPAEAGD 377
Query: 309 AFYMVMYEL 317
AF+ ++ L
Sbjct: 378 AFHQILNGL 386
>gi|34394612|dbj|BAC83914.1| limonene cyclase like protein [Oryza sativa Japonica Group]
gi|50508943|dbj|BAD31847.1| limonene cyclase like protein [Oryza sativa Japonica Group]
Length = 332
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 53/122 (43%), Positives = 73/122 (59%), Gaps = 1/122 (0%)
Query: 206 PYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVA 265
P A K L G A I +++Y CVPV+I + Y LPFADVLNW +FS+ VA
Sbjct: 206 PQAQRRRDVKLELLELGLAKARATIREAIYLECVPVVIGDDYTLPFADVLNWAAFSVRVA 265
Query: 266 TLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWH-VFPSDYDAFYMVMYELWLRRSSV 324
DIP LK+IL +S +Y+ +Q V VR+HF P +D F+M+++ +WLRR +V
Sbjct: 266 VGDIPRLKEILAAVSPRQYIRMQRRVRAVRRHFMVSDGAPRRFDVFHMILHSIWLRRLNV 325
Query: 325 RV 326
RV
Sbjct: 326 RV 327
>gi|356508876|ref|XP_003523179.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
[Glycine max]
Length = 416
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 86/309 (27%), Positives = 141/309 (45%), Gaps = 25/309 (8%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + + P + I IS
Sbjct: 77 FAAEIFMHRFLLSSPVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIS 136
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
+PYWNRT GADHF+V H G +KA E + + + ++ H+ K
Sbjct: 137 SNWPYWNRTEGADHFFVTPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLK 196
Query: 147 D--VSLPQIWPRQEDPPKLGSSK--RNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
+ +++P P Q+ L K R+ +F G N P R VW N
Sbjct: 197 EGSITIPPYAPPQKMHTHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENF 256
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
D+ ++ S T Y +++ + FCL G+ + R+ +++ +GC+PVIIA+ LPF
Sbjct: 257 KDNPLFDISTEHPTTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 316
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
AD + W+ + V D+P L IL I E +L + +L Q +FP D
Sbjct: 317 ADAIPWEEIGVFVDEKDVPQLDTILTSIPP-EVILRKQRLLANPSMKQAMLFPQPAQPGD 375
Query: 309 AFYMVMYEL 317
AF+ V+ L
Sbjct: 376 AFHQVLNGL 384
>gi|356516468|ref|XP_003526916.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
[Glycine max]
Length = 416
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 86/309 (27%), Positives = 141/309 (45%), Gaps = 25/309 (8%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + + P + I IS
Sbjct: 77 FAAEIFMHRFLLSSPVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIS 136
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
+PYWNRT GADHF+V H G +KA E + + + ++ H+ K
Sbjct: 137 SNWPYWNRTEGADHFFVTPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLK 196
Query: 147 D--VSLPQIWPRQEDPPKLGSSK--RNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
+ +++P P Q+ L K R+ +F G N P R VW N
Sbjct: 197 EGSITIPPYAPPQKMHTHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENF 256
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
D+ ++ S T Y +++ + FCL G+ + R+ +++ +GC+PVIIA+ LPF
Sbjct: 257 KDNPLFDISTEHPTTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 316
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
AD + W+ + V D+P L IL I E +L + +L Q +FP D
Sbjct: 317 ADAIPWEEIGVFVDEKDVPQLDTILTSIPP-EVILRKQRLLANPSMKQAMLFPQPAQPGD 375
Query: 309 AFYMVMYEL 317
AF+ V+ L
Sbjct: 376 AFHQVLNGL 384
>gi|255540029|ref|XP_002511079.1| catalytic, putative [Ricinus communis]
gi|223550194|gb|EEF51681.1| catalytic, putative [Ricinus communis]
Length = 417
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 86/309 (27%), Positives = 141/309 (45%), Gaps = 25/309 (8%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + + P + I IS
Sbjct: 78 FAAEIFMHRFLLSSPVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIS 137
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
+PYWNRT GADHF+V H G +KA E + + + ++ H+ K
Sbjct: 138 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPMLQRATLVQTFGQRNHVCLK 197
Query: 147 D--VSLPQIWPRQEDPPKLGSSK--RNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
+ +++P P Q+ L K R+ +F G N P R VW N
Sbjct: 198 EGSITIPPYAPPQKMQAHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENF 257
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
D+ ++ S T Y +++ + FCL G+ + R+ +++ +GC+PVIIA+ LPF
Sbjct: 258 KDNPLFDISTEHPTTYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPF 317
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
AD + W+ + V D+P L IL I E +L + +L Q +FP D
Sbjct: 318 ADAIPWEEIGVFVDEKDVPNLDTILTSIPP-EVILRKQRLLANPSMKQAMLFPQPAQAGD 376
Query: 309 AFYMVMYEL 317
AF+ V+ L
Sbjct: 377 AFHQVLNGL 385
>gi|359473830|ref|XP_003631364.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like [Vitis
vinifera]
Length = 412
Score = 103 bits (258), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 84/309 (27%), Positives = 141/309 (45%), Gaps = 25/309 (8%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + + P + I IS
Sbjct: 73 FAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYTTCDLTPNGLPLPFKSPRMMRSAIQLIS 132
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
+PYWNRT GADHF+V H G +KA E + + + ++ H+
Sbjct: 133 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLN 192
Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
+ +++P P Q+ L + R+ +F G N P R VW N
Sbjct: 193 EGSITIPPYAPPQKMQAHLIPQETPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENF 252
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
D+ ++ S T Y +++ + FCL G+ + R+ +++ +GC+PVIIA+ LPF
Sbjct: 253 KDNPLFDISTEHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPF 312
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
AD + W+ + VA D+P L IL I E +L + +L Q +FP D
Sbjct: 313 ADAIPWEEIGVFVAEEDVPNLDTILTSIPP-EVILRKQRLLANPSMKQAMLFPQPAQSGD 371
Query: 309 AFYMVMYEL 317
AF+ ++ L
Sbjct: 372 AFHQILNGL 380
>gi|412993472|emb|CCO13983.1| predicted protein [Bathycoccus prasinos]
Length = 614
Score = 103 bits (258), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 93/356 (26%), Positives = 152/356 (42%), Gaps = 35/356 (9%)
Query: 5 FRVYVYPHRRNDPFANVLLPVDFEPR---GNYASESYFKKAFMKSHFVTKDPSKADLFFL 61
F+++VY + P N L D +PR Y +E F + + +T DP +A+ FF+
Sbjct: 260 FKIFVYDLK---PEFNADLARD-QPRCRTDQYGTEIRFHENLLHHSVLTNDPEEAEFFFV 315
Query: 62 PF--SIARMRHDRRIGTEGIPDFISHY----IFNISQKYPYWNRTGGADHFYVACHSIGR 115
P R + GT + + + I +YPYWNRT G DH + + G
Sbjct: 316 PIYGECYLFRETQNSGTNNAMKVTNLWYRDALKTIQTEYPYWNRTDGRDHVWSFPGARG- 374
Query: 116 SAMQKAWE--VKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQE----DPPKLGSSKRN 169
+ + W+ +K + F KD+ +P + P E K S K++
Sbjct: 375 PHIFRDWKKLIKKSIFLTPEGDRSFGEQFNTWKDIVIPGLEPDSEFIDGKLRKQSSLKKD 434
Query: 170 KLAFFAGAV---------NSPVREKLLQVWRNDSEIYAHS---GRLKTPYADELLGSKFC 217
AFF G + + +R K+ ++ ++ + Y EL S FC
Sbjct: 435 IFAFFRGTILNKAGILAYSRGIRPKMEAAFKKHKDVIFTEEIPSCDRDCYRKELRKSTFC 494
Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILK 277
L +G+ T R ++ GC+PVIIA+ +LP+ + L+W S+ +A +D ILK
Sbjct: 495 LCPRGWSPWTLRAYQAMMVGCIPVIIADEIELPYENSLDWTKLSVKIAEVDAEKTIDILK 554
Query: 278 GISSEEYLLLQSYVLKVRKHFQWHVFPSD---YDAFYMVMYELWLRRSSVRVQWST 330
IS E Q + KV K W P DA V++EL ++ +++ T
Sbjct: 555 QISKSEIRNKQKAIEKVWKSVAWGSNPKKLDPMDAMECVLHELGRKKRAMKASTQT 610
>gi|412986347|emb|CCO14773.1| predicted protein [Bathycoccus prasinos]
Length = 445
Score = 103 bits (258), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 96/320 (30%), Positives = 146/320 (45%), Gaps = 32/320 (10%)
Query: 33 YASESYFKKAFMKS-HFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNIS 91
Y S+ + +KS T++PS AD FFLP M G D ++ + +
Sbjct: 118 YGSQVHIADFLLKSKELRTENPSDADFFFLPGWPKCMLDAPPNGAGLTDDELAKRLNGVI 177
Query: 92 QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH------ 145
+K PY ++GG DH +V G + + K W K+ + ++ +
Sbjct: 178 EKLPYIKKSGGRDHVFVWPSGRGPT-LYKNWRCKIPNSIFLTPEGFYTDPYRTLAPYFDP 236
Query: 146 -KDVSLPQIWPRQED---PPKLGSSKRNKLAFFAGAV-------------NSPVREKLLQ 188
KDV LP ++D +SKR KLA FAG V + RE+LL+
Sbjct: 237 WKDVVLPGFMDGRKDSYLETNKRTSKRTKLASFAGTVPDGQALKGDEKHVKAHPRERLLK 296
Query: 189 VWRN-DSEIYAHSGRLKTPYADELLG-SKFCLHVKGFEVNTARIADSLYYGCVPVIIANH 246
+ + ++ A SGR TP E+LG SKFC+ +G T R ++ + GCVPVII++
Sbjct: 297 LSKKYPDDLLAISGR--TPKYAEILGDSKFCIVPRGLSPWTLRTYETFFAGCVPVIISDS 354
Query: 247 YDLPFADVLNWKSFSIV--VATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFP 304
LPF + L+W SI A +D LL LK I EE + +VR F +
Sbjct: 355 VRLPFQEFLDWSLISIKWPEAKIDESLL-TYLKSIPDEEIEKIVRRGEQVRCVFAYQADA 413
Query: 305 SDYDAFYMVMYELWLRRSSV 324
+ +AF +M+ L L+ +V
Sbjct: 414 TKCNAFSAIMWALSLKDRNV 433
>gi|224119858|ref|XP_002318180.1| predicted protein [Populus trichocarpa]
gi|224122230|ref|XP_002318783.1| predicted protein [Populus trichocarpa]
gi|222858853|gb|EEE96400.1| predicted protein [Populus trichocarpa]
gi|222859456|gb|EEE97003.1| predicted protein [Populus trichocarpa]
Length = 417
Score = 103 bits (258), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 86/309 (27%), Positives = 140/309 (45%), Gaps = 25/309 (8%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + + P + I IS
Sbjct: 78 FAAEIFMHRFLLSSPVRTLNPDEADWFYTPVYTTCDLTTNGLPLPFKSPRMMRSAIQLIS 137
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
+PYWNRT GADHF+V H G +KA E + + + ++ H+ K
Sbjct: 138 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLK 197
Query: 147 D--VSLPQIWPRQEDPPKLGSSK--RNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
D +++P P Q+ L K R+ +F G N P R VW N
Sbjct: 198 DGSITVPPYAPPQKMQTHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENF 257
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
D+ ++ S Y +++ + FCL G+ + R+ +++ +GC+PVIIA+ LPF
Sbjct: 258 KDNPLFDISTEHPATYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPF 317
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
AD + W+ + V D+P L IL I E +L + +L Q +FP D
Sbjct: 318 ADAIPWEEIGVYVDEEDVPNLDTILTSIPP-EVILRKQRLLANPSMKQAMLFPQPAQPGD 376
Query: 309 AFYMVMYEL 317
AF+ V+ L
Sbjct: 377 AFHQVLNGL 385
>gi|18642697|gb|AAL76189.1|AC092173_1 Unknown protein [Oryza sativa Japonica Group]
gi|23821296|dbj|BAC20930.1| NpGUT1 homolog [Oryza sativa Japonica Group]
Length = 401
Score = 103 bits (258), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 85/312 (27%), Positives = 143/312 (45%), Gaps = 31/312 (9%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + T P + I IS
Sbjct: 62 FATEIFMHRFLLSSAIRTSNPDEADWFYTPVYTTCDLTPWGHPLTTKSPRMMRSAIKFIS 121
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS-AMQKAWEVKLNAIQVV----CSSSYFISGHIAHK 146
+ +PYWNRT GADHF+V H Q+A ++ + V+ ++ H K
Sbjct: 122 KYWPYWNRTEGADHFFVVPHDFAACFYFQEAKAIERGILPVLRRATLVQTFGQKNHACLK 181
Query: 147 D--VSLPQIWPRQED-----PPKLGSSKRNKLAFFAG----AVNSP-----VREKLLQVW 190
D +++P P + PP+ + R+ +F G N P R VW
Sbjct: 182 DGSITVPPYTPAHKIRAHLVPPE---TPRSIFVYFRGLFYDTSNDPEGGYYARGARASVW 238
Query: 191 RN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYD 248
N ++ ++ S Y +++ + FCL G+ + R+ +++ +GC+PVIIA+ D
Sbjct: 239 ENFKNNPMFDISTDHPQTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDID 298
Query: 249 LPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY- 307
LP +D + W+ ++ VA D+P L IL I +E L Q+ +L Q +FP
Sbjct: 299 LPLSDAIPWEEIAVFVAEDDVPQLDTILTSIPTEVILRKQA-MLAEPSMKQTMLFPQPAE 357
Query: 308 --DAFYMVMYEL 317
D F+ VM L
Sbjct: 358 PGDGFHQVMNAL 369
>gi|356512936|ref|XP_003525170.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
[Glycine max]
Length = 416
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 86/309 (27%), Positives = 141/309 (45%), Gaps = 25/309 (8%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + + P + I IS
Sbjct: 77 FAAEIFMHRFLLSSPVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIS 136
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
+PYWNRT GADHF+V H G +KA E + + + ++ H+ K
Sbjct: 137 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILTLLRRATLVQTFGQRNHVCLK 196
Query: 147 D--VSLPQIWPRQEDPPKLGSSK--RNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
+ +++P P Q+ L K R+ +F G N P R VW N
Sbjct: 197 EGSITIPPYAPPQKMHTHLIPDKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENF 256
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
D+ ++ S T Y +++ + FCL G+ + R+ +++ +GC+PVIIA+ LPF
Sbjct: 257 KDNLLFDISTEHPTTYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPF 316
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
AD + W+ + V D+P L IL I E +L + +L Q +FP D
Sbjct: 317 ADAIPWEEIGVFVDEEDVPKLDTILTSIPP-EVILRKQRLLANPSMKQAMLFPQPAQPGD 375
Query: 309 AFYMVMYEL 317
AF+ V+ L
Sbjct: 376 AFHQVLNGL 384
>gi|224119530|ref|XP_002318097.1| predicted protein [Populus trichocarpa]
gi|222858770|gb|EEE96317.1| predicted protein [Populus trichocarpa]
Length = 243
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 68/192 (35%), Positives = 96/192 (50%), Gaps = 29/192 (15%)
Query: 1 MNRSFRVYVYP--------HRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKD 52
M + F+V+VYP H N+ + N+ASE YF S F+TK+
Sbjct: 55 MEKDFKVFVYPGGNPGTCYHSTNNTL-----------KSNHASEHYFFMNLRDSPFLTKN 103
Query: 53 PSKADLFFLPFSIARMRHDRRIGTEGIPDF----ISHYIFNISQKYPYWNRTGGADHFYV 108
P +A LFF+ S + + E +P + I Y+ + YPYWNRT GADHF+V
Sbjct: 104 PQEAHLFFIFISCLPLSDE-----EPLPGYRERVIKRYVKGLISTYPYWNRTLGADHFFV 158
Query: 109 ACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKR 168
+CH+IG +A ++ + NAI++VCS SY S +I KDV+LPQI P R
Sbjct: 159 SCHNIGSTATKEIPFLLKNAIRLVCSPSY-DSSYIPQKDVALPQILELSLPPDGDDMWNR 217
Query: 169 NKLAFFAGAVNS 180
N + V S
Sbjct: 218 NLMQLARSTVES 229
>gi|242088705|ref|XP_002440185.1| hypothetical protein SORBIDRAFT_09g027450 [Sorghum bicolor]
gi|241945470|gb|EES18615.1| hypothetical protein SORBIDRAFT_09g027450 [Sorghum bicolor]
Length = 416
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 86/314 (27%), Positives = 146/314 (46%), Gaps = 35/314 (11%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + T P + I IS
Sbjct: 77 FATEIFMHRFLLTSAVRTLNPDEADWFYTPVYTTCDLTPWGHPLTTKSPRMMRSAIQYIS 136
Query: 92 QKYPYWNRTGGADHFYVACHSIG-------RSAMQKAWEVKLNAIQVVCSSSYFISGHIA 144
+++PYWNRT GADHF+V H G +A+Q+ L +V ++ H+
Sbjct: 137 KRWPYWNRTEGADHFFVTPHDFGACFYFQEETAIQRGVLPVLRRATLV--QTFGQKHHVC 194
Query: 145 HKD--VSLPQIWPRQED-----PPKLGSSKRNKLAFFAG----AVNSP-----VREKLLQ 188
K+ +++P P + PP+ + R+ +F G N P R
Sbjct: 195 LKEGSITIPPYAPPHKIRTHIVPPE---TPRSIFVYFRGLFYDTANDPEGGYYARGARAS 251
Query: 189 VWRN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANH 246
VW N ++ ++ S Y +++ + FCL G+ + R+ +++ +GC+PVIIA+
Sbjct: 252 VWENFKNNALFDISTEHPPTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 311
Query: 247 YDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSD 306
LPFAD + W+ ++ VA D+ L IL I EE +L + +L Q +FP
Sbjct: 312 IVLPFADAIPWEEIAVFVAEDDVLKLDTILTSIPMEE-ILRKQRLLANPSMKQAMLFPQP 370
Query: 307 Y---DAFYMVMYEL 317
DAF+ V+ L
Sbjct: 371 AEPRDAFHQVLNGL 384
>gi|116793780|gb|ABK26875.1| unknown [Picea sitchensis]
Length = 420
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 83/309 (26%), Positives = 137/309 (44%), Gaps = 25/309 (8%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + + P + I IS
Sbjct: 81 FATEIFMNRFLLGSPVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQYIS 140
Query: 92 QKYPYWNRTGGADHFYVACHSIG-------RSAMQKAWEVKLNAIQVVCSSSYFISGHIA 144
+PYWNRT GADHF+V H G A+++ L +V + +
Sbjct: 141 TNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRYHVCLK 200
Query: 145 HKDVSLPQIWPRQEDPPKL--GSSKRNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
+++P P Q+ L S+ R+ +F G N P R VW N
Sbjct: 201 KGSITVPPYAPPQKMQAHLIPPSTPRSIFVYFRGLFYDVGNDPEGGYYARGARASVWENF 260
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
D+ ++ S Y +++ + FCL G+ + R+ + + +GC+PVIIA+ LPF
Sbjct: 261 KDNPLFDISTEHPATYYEDMQRAIFCLCPLGWAPWSPRLVEGVIFGCIPVIIADDIVLPF 320
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
AD + W+ + VA D+P L IL I E +L + +L Q +FP D
Sbjct: 321 ADAIPWEEIGVFVAEKDVPNLDTILTSIPP-EVILRKQRLLANPAMKQAMLFPQPAQPGD 379
Query: 309 AFYMVMYEL 317
AF+ ++ L
Sbjct: 380 AFHQILNGL 388
>gi|225456511|ref|XP_002284685.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L [Vitis
vinifera]
gi|147860849|emb|CAN83159.1| hypothetical protein VITISV_022554 [Vitis vinifera]
gi|297734521|emb|CBI15768.3| unnamed protein product [Vitis vinifera]
Length = 416
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 86/309 (27%), Positives = 141/309 (45%), Gaps = 25/309 (8%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + + P + I IS
Sbjct: 77 FAAEIFMHRFLLSSPVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIS 136
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVK-LNAIQ-VVCSSSYFISGHIAHK 146
+PYWNRT GADHF+V H G +KA E L+ +Q ++ H+ K
Sbjct: 137 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILHLLQRATLVQTFGQRNHVCLK 196
Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
+ +++P P Q+ L + R+ +F G N P R VW N
Sbjct: 197 EGSITIPPYAPPQKMQSHLIPPDTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENF 256
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
D+ ++ S Y +++ + FCL G+ + R+ + + +GC+PVIIA+ LPF
Sbjct: 257 KDNPLFDISTEHPMTYYEDMQRAVFCLCPLGWAPWSPRLVEGVIFGCIPVIIADDIVLPF 316
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
AD + W+ + VA D+P L IL I E +L + +L Q +FP D
Sbjct: 317 ADAIPWEEIGVFVAEEDVPNLDTILTSIPP-EVILRKQRLLANPSMKQAMLFPQPAQPGD 375
Query: 309 AFYMVMYEL 317
AF+ ++ L
Sbjct: 376 AFHQILNGL 384
>gi|21592991|gb|AAM64940.1| unknown [Arabidopsis thaliana]
Length = 415
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 86/316 (27%), Positives = 141/316 (44%), Gaps = 39/316 (12%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E Y ++ + S T +P +AD F++P ++ + + P + I I+
Sbjct: 76 FAAEIYMQRFLLSSPVRTLNPEEADWFYVPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIA 135
Query: 92 QKYPYWNRTGGADHFYVACH------------SIGRSAMQKAWEVKLNAIQVVCSSSYFI 139
+PYWNRT GADHF+V H +IGR + L ++
Sbjct: 136 SNWPYWNRTEGADHFFVVPHDFRACFHYQEEKAIGRGILPLLQRATL-------VQTFGQ 188
Query: 140 SGHIAHKD--VSLPQIWPRQEDPPKLGSSK--RNKLAFFAGAV----NSP-----VREKL 186
H+ K+ +++P P Q+ L K R+ +F G N P R
Sbjct: 189 RNHVCLKEGSITVPPYAPPQKMQSHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGAR 248
Query: 187 LQVWRN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIA 244
VW N D+ ++ S T Y +++ + FCL G+ + R+ +++ +GC+PVIIA
Sbjct: 249 AAVWENFKDNPLFDISTEHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVIFGCIPVIIA 308
Query: 245 NHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFP 304
+ LPFAD + W+ + V D+P L IL I E +L + +L Q +FP
Sbjct: 309 DDIVLPFADAIPWEDIGVFVDEKDVPYLDTILTSIPP-EVILRKQRLLANPSMKQAMLFP 367
Query: 305 SDY---DAFYMVMYEL 317
DAF+ V+ L
Sbjct: 368 QPAQPGDAFHQVLNGL 383
>gi|302783378|ref|XP_002973462.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300159215|gb|EFJ25836.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 437
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 88/309 (28%), Positives = 141/309 (45%), Gaps = 25/309 (8%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + + P + I IS
Sbjct: 70 FATEIFVHEFLLGSAVRTLNPEEADWFYTPVYTTCDLTRNGLPLPFKSPRMMRSVIQYIS 129
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVK-LNAIQ-VVCSSSYFISGHIAHK 146
++PYWNRT GADHF+V H G +KA E L+ +Q ++ H+ K
Sbjct: 130 NQWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILHLLQRATLVQTFGQRYHVCLK 189
Query: 147 DVSL--PQIWPRQEDPPKL--GSSKRNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
+ S+ P P Q+ L S R+ +F G N P R VW N
Sbjct: 190 EGSIVVPPYCPPQKMQAHLIPPSIPRSIFVYFRGLFYDYGNDPEGGYYARGARAAVWENF 249
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
D+ ++ S Y +++ + FCL G+ + R+ + + +GC+PVIIA+ LPF
Sbjct: 250 KDNPLFDISTEHPITYYEDMQRAIFCLCPLGWAPWSPRLVEGVIFGCIPVIIADDIVLPF 309
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
AD + W+ + VA D+P L IL I E +L + +L Q +FP D
Sbjct: 310 ADAIPWEEIGVFVAEKDVPKLDTILTSIPP-EVILKKQRLLATPAMKQAMLFPQPAQPGD 368
Query: 309 AFYMVMYEL 317
AF+ ++ L
Sbjct: 369 AFHQILNGL 377
>gi|449496198|ref|XP_004160070.1| PREDICTED: LOW QUALITY PROTEIN: probable glucuronoxylan
glucuronosyltransferase IRX7-like [Cucumis sativus]
Length = 459
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 87/317 (27%), Positives = 133/317 (41%), Gaps = 37/317 (11%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIP------DFISHY 86
+ASE +A + SH+ T DP +AD FF+P ++ G P IS
Sbjct: 132 FASEVAIHRALLNSHYRTFDPLEADFFFVPVYVS----CNFSTVNGFPAIGHARSLISSA 187
Query: 87 IFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISG----- 141
+ +IS Y +WNRT G+DH +VA H + + +S +
Sbjct: 188 VSHISSHYSFWNRTNGSDHVFVASHDFASCFHTMEHVAIADGVPSFLKNSIILQTFGVKY 247
Query: 142 -HIAH--KDVSLPQIWPRQEDPPKLGSS----KRNKLAFFAGAV------------NSPV 182
H + V +P P + L S +R+ AFF G + + V
Sbjct: 248 KHPCQDVEHVVIPPYIPPESIENTLERSPVTGRRDIFAFFRGKMEMNPKNVSGRFYSKKV 307
Query: 183 REKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVI 242
R + + + D Y R Y E++ S FCL G+ + R+ +S+ GCVPVI
Sbjct: 308 RTMIWRKFNGDRRFYLQRHRFPG-YQSEIVRSVFCLCPLGWAPWSPRLVESVALGCVPVI 366
Query: 243 IANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVL--KVRKHFQW 300
IA+ LPF +NW SI VA DI L +IL ++ +Q + + R+ +
Sbjct: 367 IADGIRLPFPSAVNWPEISITVAEKDIGKLGRILDHVAGSNLTTIQKNLWDPRNRRALLF 426
Query: 301 HVFPSDYDAFYMVMYEL 317
H D DA + V+ L
Sbjct: 427 HNQVEDGDATWQVIGAL 443
>gi|223974777|gb|ACN31576.1| unknown [Zea mays]
Length = 412
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 83/336 (24%), Positives = 149/336 (44%), Gaps = 33/336 (9%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T DP AD F+ P ++ + P + I ++
Sbjct: 73 FAAEIFMHQFLLSSAVRTLDPEAADWFYTPVYTTCDLTPQGFPLPFRAPRMMRSAIRYVA 132
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
+P+WNRT GADHF++ H G ++A E + + + ++ H+ +
Sbjct: 133 ATWPFWNRTDGADHFFLTPHDFGACFHYQEERAVERGILPLLRRATLVQTFGQRNHVCLQ 192
Query: 147 DVSL---PQIWPRQEDPPKLG-SSKRNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
D S+ P P + +G + R+ +F G N P R VW N
Sbjct: 193 DGSITVPPYASPHRLQAHLVGPGTPRSIFVYFRGLFYDMGNDPEGGYYARGARASVWENF 252
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
D+ ++ S Y +++ + FCL G+ + R+ +++ +GC+PVIIA+ LPF
Sbjct: 253 KDNPLFDISTEHPATYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 312
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK--VRKHFQWHVFPSDYDA 309
AD + W+ S+ VA D+P L IL I + L Q + + V++ +H DA
Sbjct: 313 ADAIPWEDMSVFVAERDVPRLDSILTSIPLPDILRRQRLLARDSVKRALLFHQPARPGDA 372
Query: 310 FYMVM----------YELWLRRSSVRVQWSTSLDSN 335
F+ V+ +++L+ + W L+S+
Sbjct: 373 FHQVLNGLARKLPHGEDVFLQPGEKALDWDAGLESD 408
>gi|110288702|gb|ABB46945.2| exostosin family protein, putative, expressed [Oryza sativa
Japonica Group]
gi|215766610|dbj|BAG98714.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 341
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 85/312 (27%), Positives = 143/312 (45%), Gaps = 31/312 (9%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + T P + I IS
Sbjct: 2 FATEIFMHRFLLSSAIRTSNPDEADWFYTPVYTTCDLTPWGHPLTTKSPRMMRSAIKFIS 61
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS-AMQKAWEVKLNAIQVV----CSSSYFISGHIAHK 146
+ +PYWNRT GADHF+V H Q+A ++ + V+ ++ H K
Sbjct: 62 KYWPYWNRTEGADHFFVVPHDFAACFYFQEAKAIERGILPVLRRATLVQTFGQKNHACLK 121
Query: 147 D--VSLPQIWPRQED-----PPKLGSSKRNKLAFFAG----AVNSP-----VREKLLQVW 190
D +++P P + PP+ + R+ +F G N P R VW
Sbjct: 122 DGSITVPPYTPAHKIRAHLVPPE---TPRSIFVYFRGLFYDTSNDPEGGYYARGARASVW 178
Query: 191 RN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYD 248
N ++ ++ S Y +++ + FCL G+ + R+ +++ +GC+PVIIA+
Sbjct: 179 ENFKNNPMFDISTDHPQTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 238
Query: 249 LPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY- 307
LPF+D + W+ ++ VA D+P L IL I +E L Q+ +L Q +FP
Sbjct: 239 LPFSDAIPWEEIAVFVAEDDVPQLDTILTSIPTEVILRKQA-MLAEPSMKQTMLFPQPAE 297
Query: 308 --DAFYMVMYEL 317
D F+ VM L
Sbjct: 298 PGDGFHQVMNAL 309
>gi|293336613|ref|NP_001170701.1| uncharacterized protein LOC100384778 precursor [Zea mays]
gi|238007038|gb|ACR34554.1| unknown [Zea mays]
gi|413951433|gb|AFW84082.1| hypothetical protein ZEAMMB73_582447 [Zea mays]
Length = 412
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 84/336 (25%), Positives = 148/336 (44%), Gaps = 33/336 (9%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T DP AD F+ P ++ + P + I ++
Sbjct: 73 FAAEIFMHQFLLSSAVRTLDPEAADWFYTPVYTTCDLTPQGFPLPFRAPRMMRSAIRYVA 132
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
+P+WNRT GADHF++ H G ++A E + + + ++ H+ +
Sbjct: 133 ATWPFWNRTDGADHFFLTPHDFGACFHYQEERAVERGILPLLRRATLVQTFGQRNHVCLQ 192
Query: 147 DVSL---PQIWPRQEDPPKLG-SSKRNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
D S+ P P + +G + R+ +F G N P R VW N
Sbjct: 193 DGSITVPPYASPHRLQAHLVGPGTPRSIFVYFRGLFYDMGNDPEGGYYARGARASVWENF 252
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
D+ ++ S Y +++ + FCL G+ + R+ +++ +GC+PVIIA+ LPF
Sbjct: 253 KDNPLFDISTEHPATYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 312
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK--VRKHFQWHVFPSDYDA 309
AD + W+ S+ VA D+P L IL I + L Q + + V++ +H DA
Sbjct: 313 ADAIPWEDMSVFVAERDVPRLDSILTSIPLPDILRRQRLLARDSVKRALLFHQPARPGDA 372
Query: 310 FYMVMYEL----------WLRRSSVRVQWSTSLDSN 335
F+ V+ L +L+ + W L+S+
Sbjct: 373 FHQVLNGLARKLPHREGVFLQPGEKALDWDAGLESD 408
>gi|115481310|ref|NP_001064248.1| Os10g0180000 [Oryza sativa Japonica Group]
gi|122212636|sp|Q33AH8.2|GT101_ORYSJ RecName: Full=Probable glucuronosyltransferase GUT1; AltName:
Full=Glucuronoxylan glucuronosyltransferase 1;
Short=OsGUT1
gi|110288701|gb|ABB46947.2| exostosin family protein, putative, expressed [Oryza sativa
Japonica Group]
gi|113638857|dbj|BAF26162.1| Os10g0180000 [Oryza sativa Japonica Group]
Length = 417
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 85/312 (27%), Positives = 143/312 (45%), Gaps = 31/312 (9%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + T P + I IS
Sbjct: 78 FATEIFMHRFLLSSAIRTSNPDEADWFYTPVYTTCDLTPWGHPLTTKSPRMMRSAIKFIS 137
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS-AMQKAWEVKLNAIQVV----CSSSYFISGHIAHK 146
+ +PYWNRT GADHF+V H Q+A ++ + V+ ++ H K
Sbjct: 138 KYWPYWNRTEGADHFFVVPHDFAACFYFQEAKAIERGILPVLRRATLVQTFGQKNHACLK 197
Query: 147 D--VSLPQIWPRQED-----PPKLGSSKRNKLAFFAG----AVNSP-----VREKLLQVW 190
D +++P P + PP+ + R+ +F G N P R VW
Sbjct: 198 DGSITVPPYTPAHKIRAHLVPPE---TPRSIFVYFRGLFYDTSNDPEGGYYARGARASVW 254
Query: 191 RN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYD 248
N ++ ++ S Y +++ + FCL G+ + R+ +++ +GC+PVIIA+
Sbjct: 255 ENFKNNPMFDISTDHPQTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 314
Query: 249 LPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY- 307
LPF+D + W+ ++ VA D+P L IL I +E L Q+ +L Q +FP
Sbjct: 315 LPFSDAIPWEEIAVFVAEDDVPQLDTILTSIPTEVILRKQA-MLAEPSMKQTMLFPQPAE 373
Query: 308 --DAFYMVMYEL 317
D F+ VM L
Sbjct: 374 PGDGFHQVMNAL 385
>gi|21671946|gb|AAM74308.1|AC083944_26 Unknown protein similar to exostosin-2 [Oryza sativa Japonica
Group]
gi|110288700|gb|ABB46946.2| exostosin family protein, putative, expressed [Oryza sativa
Japonica Group]
Length = 401
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 85/312 (27%), Positives = 143/312 (45%), Gaps = 31/312 (9%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + T P + I IS
Sbjct: 62 FATEIFMHRFLLSSAIRTSNPDEADWFYTPVYTTCDLTPWGHPLTTKSPRMMRSAIKFIS 121
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS-AMQKAWEVKLNAIQVV----CSSSYFISGHIAHK 146
+ +PYWNRT GADHF+V H Q+A ++ + V+ ++ H K
Sbjct: 122 KYWPYWNRTEGADHFFVVPHDFAACFYFQEAKAIERGILPVLRRATLVQTFGQKNHACLK 181
Query: 147 D--VSLPQIWPRQED-----PPKLGSSKRNKLAFFAG----AVNSP-----VREKLLQVW 190
D +++P P + PP+ + R+ +F G N P R VW
Sbjct: 182 DGSITVPPYTPAHKIRAHLVPPE---TPRSIFVYFRGLFYDTSNDPEGGYYARGARASVW 238
Query: 191 RN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYD 248
N ++ ++ S Y +++ + FCL G+ + R+ +++ +GC+PVIIA+
Sbjct: 239 ENFKNNPMFDISTDHPQTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 298
Query: 249 LPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY- 307
LPF+D + W+ ++ VA D+P L IL I +E L Q+ +L Q +FP
Sbjct: 299 LPFSDAIPWEEIAVFVAEDDVPQLDTILTSIPTEVILRKQA-MLAEPSMKQTMLFPQPAE 357
Query: 308 --DAFYMVMYEL 317
D F+ VM L
Sbjct: 358 PGDGFHQVMNAL 369
>gi|302823891|ref|XP_002993593.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300138521|gb|EFJ05285.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 437
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 88/309 (28%), Positives = 141/309 (45%), Gaps = 25/309 (8%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + + P + I IS
Sbjct: 70 FATEIFVHEFLLGSAVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSVIQYIS 129
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVK-LNAIQ-VVCSSSYFISGHIAHK 146
++PYWNRT GADHF+V H G +KA E L+ +Q ++ H+ K
Sbjct: 130 NQWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILHLLQRATLVQTFGQRYHVCLK 189
Query: 147 DVSL--PQIWPRQEDPPKL--GSSKRNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
+ S+ P P Q+ L S R+ +F G N P R VW N
Sbjct: 190 EGSIVVPPYCPPQKMQAHLIPPSIPRSIFVYFRGLFYDYGNDPEGGYYARGARAAVWENF 249
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
D+ ++ S Y +++ + FCL G+ + R+ + + +GC+PVIIA+ LPF
Sbjct: 250 KDNPLFDISTEHPITYYEDMQRAIFCLCPLGWAPWSPRLVEGVIFGCIPVIIADDIVLPF 309
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
AD + W+ + VA D+P L IL I E +L + +L Q +FP D
Sbjct: 310 ADAIPWEEIGVFVAEKDVPKLDTILTSIPP-EVILKKQRLLATPAMKQAMLFPQPAQPGD 368
Query: 309 AFYMVMYEL 317
AF+ ++ L
Sbjct: 369 AFHQILNGL 377
>gi|218184216|gb|EEC66643.1| hypothetical protein OsI_32904 [Oryza sativa Indica Group]
gi|222612524|gb|EEE50656.1| hypothetical protein OsJ_30886 [Oryza sativa Japonica Group]
Length = 400
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 85/312 (27%), Positives = 143/312 (45%), Gaps = 31/312 (9%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + T P + I IS
Sbjct: 61 FATEIFMHRFLLSSAIRTSNPDEADWFYTPVYTTCDLTPWGHPLTTKSPRMMRSAIKFIS 120
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS-AMQKAWEVKLNAIQVV----CSSSYFISGHIAHK 146
+ +PYWNRT GADHF+V H Q+A ++ + V+ ++ H K
Sbjct: 121 KYWPYWNRTEGADHFFVVPHDFAACFYFQEAKAIERGILPVLRRATLVQTFGQKNHACLK 180
Query: 147 D--VSLPQIWPRQED-----PPKLGSSKRNKLAFFAG----AVNSP-----VREKLLQVW 190
D +++P P + PP+ + R+ +F G N P R VW
Sbjct: 181 DGSITVPPYTPAHKIRAHLVPPE---TPRSIFVYFRGLFYDTSNDPEGGYYARGARASVW 237
Query: 191 RN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYD 248
N ++ ++ S Y +++ + FCL G+ + R+ +++ +GC+PVIIA+
Sbjct: 238 ENFKNNPMFDISTDHPQTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 297
Query: 249 LPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY- 307
LPF+D + W+ ++ VA D+P L IL I +E L Q+ +L Q +FP
Sbjct: 298 LPFSDAIPWEEIAVFVAEDDVPQLDTILTSIPTEVILRKQA-MLAEPSMKQTMLFPQPAE 356
Query: 308 --DAFYMVMYEL 317
D F+ VM L
Sbjct: 357 PGDGFHQVMNAL 368
>gi|75122905|sp|Q6H4N0.1|GT21_ORYSJ RecName: Full=Probable glucuronosyltransferase Os02g0520750
gi|49389081|dbj|BAD26319.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
gi|215769122|dbj|BAH01351.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 434
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 81/309 (26%), Positives = 142/309 (45%), Gaps = 25/309 (8%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + H P + I +S
Sbjct: 94 FAAEIFMHRFLLSSAVRTLNPEQADWFYAPVYTTCDLTHAGLPLPFKSPRMMRSAIQFLS 153
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
+K+P+WNRT GADHF+V H G +KA E + + + ++ H+ K
Sbjct: 154 RKWPFWNRTDGADHFFVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQKNHVCLK 213
Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
+ +++P P Q+ L + R+ +F G N P R +W N
Sbjct: 214 EGSITIPPYAPPQKMQAHLIPPDTPRSIFVYFRGLFYDNGNDPEGGYYARGARASLWENF 273
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
++ ++ S Y +++ S FCL G+ + R+ +++ +GC+PVIIA+ LPF
Sbjct: 274 KNNPLFDISTEHPATYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 333
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
AD + W + V D+P L IL I ++ +L + +L Q +FP D
Sbjct: 334 ADAIPWDEIGVFVDEEDVPRLDSILTSIPIDD-ILRKQRLLANPSMKQAMLFPQPAQPRD 392
Query: 309 AFYMVMYEL 317
AF+ ++ L
Sbjct: 393 AFHQILNGL 401
>gi|125539672|gb|EAY86067.1| hypothetical protein OsI_07437 [Oryza sativa Indica Group]
Length = 342
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 81/309 (26%), Positives = 142/309 (45%), Gaps = 25/309 (8%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + H P + I +S
Sbjct: 2 FAAEIFMHRFLLSSAVRTLNPEQADWFYAPVYTTCDLTHAGLPLPFKSPRMMRSAIQFLS 61
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
+K+P+WNRT GADHF+V H G +KA E + + + ++ H+ K
Sbjct: 62 RKWPFWNRTDGADHFFVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQKNHVCLK 121
Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
+ +++P P Q+ L + R+ +F G N P R +W N
Sbjct: 122 EGSITIPPYAPPQKMQAHLIPPDTPRSIFVYFRGLFYDNGNDPEGGYYARGARASLWENF 181
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
++ ++ S Y +++ S FCL G+ + R+ +++ +GC+PVIIA+ LPF
Sbjct: 182 KNNPLFDISTEHPATYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 241
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
AD + W + V D+P L IL I ++ +L + +L Q +FP D
Sbjct: 242 ADAIPWDEIGVFVDEEDVPRLDSILTSIPIDD-ILRKQRLLANPSMKQAMLFPQPAQPRD 300
Query: 309 AFYMVMYEL 317
AF+ ++ L
Sbjct: 301 AFHQILNGL 309
>gi|297851174|ref|XP_002893468.1| hypothetical protein ARALYDRAFT_472948 [Arabidopsis lyrata subsp.
lyrata]
gi|297339310|gb|EFH69727.1| hypothetical protein ARALYDRAFT_472948 [Arabidopsis lyrata subsp.
lyrata]
Length = 412
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 85/313 (27%), Positives = 140/313 (44%), Gaps = 33/313 (10%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T++P +AD F+ P + + P + I IS
Sbjct: 73 FAAEIFMHRFLLSSPVRTRNPDEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSAIQLIS 132
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIA-- 144
+PYWNRT GADHF+V H G +KA E + + + ++ H+
Sbjct: 133 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLD 192
Query: 145 HKDVSLPQIWPRQED-----PPKLGSSKRNKLAFFAGAV----NSP-----VREKLLQVW 190
+++P P Q+ PP + R+ +F G N P R VW
Sbjct: 193 EGSITIPPFAPPQKMQAHFIPPDI---PRSIFVYFRGLFYDVNNDPEGGYYARGARAAVW 249
Query: 191 RN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYD 248
N ++ ++ S T Y +++ + FCL G+ + R+ +++ +GC+PVIIA+
Sbjct: 250 ENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 309
Query: 249 LPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL----LLQSYVLKVRKHFQWHVFP 304
LPFAD + W+ + VA D+P L IL I +E L LL + +K F P
Sbjct: 310 LPFADAIPWEEIGVFVAEKDVPELDTILTSIPTEVILRKQRLLANPSMKRAMLFPQPAQP 369
Query: 305 SDYDAFYMVMYEL 317
DAF+ ++ L
Sbjct: 370 G--DAFHQILNGL 380
>gi|15223522|ref|NP_174064.1| putative glycosyltransferase [Arabidopsis thaliana]
gi|75272002|sp|Q9FZJ1.1|IRX10_ARATH RecName: Full=Probable beta-1,4-xylosyltransferase IRX10; AltName:
Full=Glucuronoxylan glucuronosyltransferase 1;
Short=AtGUT1; AltName: Full=Glucuronoxylan
glucuronosyltransferase 2; AltName: Full=Protein
IRREGULAR XYLEM 10; AltName: Full=Xylan
xylosyltransferase IRX10
gi|9802541|gb|AAF99743.1|AC004557_22 F17L21.23 [Arabidopsis thaliana]
gi|63003872|gb|AAY25465.1| At1g27440 [Arabidopsis thaliana]
gi|98960979|gb|ABF58973.1| At1g27440 [Arabidopsis thaliana]
gi|332192709|gb|AEE30830.1| putative glycosyltransferase [Arabidopsis thaliana]
Length = 412
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 85/313 (27%), Positives = 140/313 (44%), Gaps = 33/313 (10%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T++P +AD F+ P + + P + I IS
Sbjct: 73 FAAEIFMHRFLLSSPVRTRNPDEADWFYTPIYPTCDLTPTGLPLPFKSPRMMRSSIQLIS 132
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIA-- 144
+PYWNRT GADHF+V H G +KA E + + + ++ H+
Sbjct: 133 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLD 192
Query: 145 HKDVSLPQIWPRQED-----PPKLGSSKRNKLAFFAGAV----NSP-----VREKLLQVW 190
+++P P Q+ PP + R+ +F G N P R VW
Sbjct: 193 EGSITIPPFAPPQKMQAHFIPPDI---PRSIFVYFRGLFYDVNNDPEGGYYARGARAAVW 249
Query: 191 RN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYD 248
N ++ ++ S T Y +++ + FCL G+ + R+ +++ +GC+PVIIA+
Sbjct: 250 ENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 309
Query: 249 LPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL----LLQSYVLKVRKHFQWHVFP 304
LPFAD + W+ + VA D+P L IL I +E L LL + +K F P
Sbjct: 310 LPFADAIPWEEIGVFVAEKDVPELDTILTSIPTEVILRKQRLLANPSMKRAMLFPQPAQP 369
Query: 305 SDYDAFYMVMYEL 317
DAF+ ++ L
Sbjct: 370 G--DAFHQILNGL 380
>gi|115441965|ref|NP_001045262.1| Os01g0926400 [Oryza sativa Japonica Group]
gi|75159223|sp|Q8S1X9.1|GT13_ORYSJ RecName: Full=Probable glucuronosyltransferase Os01g0926400;
AltName: Full=OsGT47D
gi|20160726|dbj|BAB89668.1| P0482D04.15 [Oryza sativa Japonica Group]
gi|20805224|dbj|BAB92891.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
gi|113534793|dbj|BAF07176.1| Os01g0926400 [Oryza sativa Japonica Group]
gi|215686656|dbj|BAG88909.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215707279|dbj|BAG93739.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 422
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 85/338 (25%), Positives = 146/338 (43%), Gaps = 37/338 (10%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T DP +AD F+ P ++ + P + + ++
Sbjct: 83 FAAEIFMHQFLLSSPVRTLDPEEADWFYTPAYTTCDLTPQGFPLPFRAPRIMRSAVRYVA 142
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS-AMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSL 150
+PYWNRT GADHF++A H G Q+ ++ + V+ ++ + H
Sbjct: 143 ATWPYWNRTDGADHFFLAPHDFGACFHYQEERAIERGILPVLRRATLVQTFGQRHHPCLQ 202
Query: 151 PQ--IWPRQEDPPKL------GSSKRNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
P P DP K+ ++ R+ +F G N P R VW N
Sbjct: 203 PGSITVPPYADPRKMEAHRISPATPRSIFVYFRGLFYDMGNDPEGGYYARGARASVWENF 262
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
D+ ++ S Y +++ + FCL G+ + R+ +++ +GC+PVIIA+ LPF
Sbjct: 263 KDNPLFDISTEHPATYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 322
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL----LLQSYVLKVRKHFQWHVFPSDY 307
AD + W S+ VA D+P L IL + +E + LL S +K F P
Sbjct: 323 ADAIPWGEISVFVAEEDVPRLDTILASVPLDEVIRKQRLLASPAMKQAVLFHQPARPG-- 380
Query: 308 DAFYMVMYEL----------WLRRSSVRVQWSTSLDSN 335
DAF+ ++ L +L + W L+++
Sbjct: 381 DAFHQILNGLARKLPHPKGVFLEPGEKGIDWDQGLEND 418
>gi|125582313|gb|EAZ23244.1| hypothetical protein OsJ_06939 [Oryza sativa Japonica Group]
Length = 434
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 81/309 (26%), Positives = 142/309 (45%), Gaps = 25/309 (8%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + H P + I +S
Sbjct: 94 FAAEIFMHRFLLSSAVRTLNPEQADWFYAPVYTTCDLTHAGLPLPFKSPRMMRSAIQFLS 153
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
+K+P+WNRT GADHF+V H G +KA E + + + ++ H+ K
Sbjct: 154 RKWPFWNRTDGADHFFVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQKNHVCLK 213
Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
+ +++P P Q+ L + R+ +F G N P R +W N
Sbjct: 214 EGSITIPPYAPPQKMQAHLIPPDTPRSIFVYFRGLFYDNGNDPEGGYYARGARASLWENF 273
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
++ ++ S Y +++ S FCL G+ + R+ +++ +GC+PVIIA+ LPF
Sbjct: 274 KNNPLFDISTEHPATYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 333
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
AD + W + V D+P L IL I ++ +L + +L Q +FP D
Sbjct: 334 ADAIPWDEIGVFVDEEDVPRLDSILTSIPIDD-ILRKQRLLANPSMKQAMLFPQPAQPRD 392
Query: 309 AFYMVMYEL 317
AF+ ++ L
Sbjct: 393 AFHQILNGL 401
>gi|125528940|gb|EAY77054.1| hypothetical protein OsI_05011 [Oryza sativa Indica Group]
Length = 422
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 85/338 (25%), Positives = 146/338 (43%), Gaps = 37/338 (10%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T DP +AD F+ P ++ + P + + ++
Sbjct: 83 FAAEIFMHQFLLSSPVRTLDPEEADWFYTPAYTTCDLTPQGFPLPFRAPRIMRSAVRYVA 142
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS-AMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSL 150
+PYWNRT GADHF++A H G Q+ ++ + V+ ++ + H
Sbjct: 143 ATWPYWNRTDGADHFFLAPHDFGACFHYQEERAIERGILPVLRRATLVQTFGQRHHPCLQ 202
Query: 151 PQ--IWPRQEDPPKL------GSSKRNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
P P DP K+ ++ R+ +F G N P R VW N
Sbjct: 203 PGSITVPPYADPRKMEAHRISPATPRSIFVYFRGLFYDMGNDPEGGYYARGARASVWENF 262
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
D+ ++ S Y +++ + FCL G+ + R+ +++ +GC+PVIIA+ LPF
Sbjct: 263 KDNPLFDISTEHPATYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 322
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL----LLQSYVLKVRKHFQWHVFPSDY 307
AD + W S+ VA D+P L IL + +E + LL S +K F P
Sbjct: 323 ADAIPWGEISVFVAEEDVPRLDTILASVPLDEVIRKQRLLASPAMKQAVLFHQPARPG-- 380
Query: 308 DAFYMVMYEL----------WLRRSSVRVQWSTSLDSN 335
DAF+ ++ L +L + W L+++
Sbjct: 381 DAFHQILNGLARKLPHPKGVFLEPGEKGIDWDQGLEND 418
>gi|357141002|ref|XP_003572039.1| PREDICTED: probable glucuronosyltransferase Os03g0107900-like
[Brachypodium distachyon]
Length = 465
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 88/340 (25%), Positives = 149/340 (43%), Gaps = 58/340 (17%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPF----------SIARMRHDRRIGTEGIPDF 82
+A+E +A ++ H + P +ADLF +P + ++H R
Sbjct: 138 FAAEVAVHEALLRRHLRAR-PEEADLFLVPVYVSCNFSTPTGLPSLKHAR--------GL 188
Query: 83 ISHYIFNISQKYPYWNRTGGADHFYVACHSIGRS--AMQK---AWEVKLNAIQVVCSSSY 137
++ + + + PYWNR+ G DH +VA H G AM+ A + + + ++
Sbjct: 189 LAEAVELVRRDMPYWNRSAGTDHVFVASHDFGACFHAMEDVAIAGGIPEFLKRSILLQTF 248
Query: 138 FISGHIAHKDVS--------LPQIWPRQEDPPKLGSSKRNKLAFFAGAV----------- 178
+ G ++V LP++ +P K S R+ AFF G +
Sbjct: 249 GVQGRHTCQEVEHVVIPPHVLPEVARELPEPEK---SHRDIFAFFRGKMEVHPKNMSGRF 305
Query: 179 -NSPVREKLLQVWRNDSEIY----AHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADS 233
VR KLLQ++ ++ + Y H G Y E+ S FCL G+ + R+ +S
Sbjct: 306 YGKKVRTKLLQLYGHNRKFYLKRKQHDG-----YRLEMARSLFCLCPLGWAPWSPRLVES 360
Query: 234 LYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK 293
+ GC+PVIIA++ LPF VL W S+ VA DI L+ +L ++S +Q +
Sbjct: 361 VLLGCIPVIIADNIRLPFPGVLRWPDISLQVAERDIANLEAMLDHVASTNLTTIQGNLWD 420
Query: 294 V--RKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQWSTS 331
RK ++ + DA + V+ EL + +R + S
Sbjct: 421 PVKRKALVFNQPMEEGDATWQVLKELEAKLGHLRQKGRIS 460
>gi|212275360|ref|NP_001130059.1| uncharacterized protein LOC100191151 precursor [Zea mays]
gi|194688192|gb|ACF78180.1| unknown [Zea mays]
Length = 418
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 80/311 (25%), Positives = 140/311 (45%), Gaps = 29/311 (9%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + P + I ++
Sbjct: 78 FAAEIFMHRFLLSSAVRTFNPEEADWFYTPVYTTCDLTPSGLPLPFKSPRMMRSAIERVA 137
Query: 92 QKYPYWNRTGGADHFYVACHSIG-------RSAMQKAWEVKLNAIQVVCSSSYFISGHIA 144
+PYWNR+ GADHF+V H G A+ + L +V ++ H+
Sbjct: 138 TNWPYWNRSEGADHFFVTPHDFGACFHYQEEKAIGRGIPPLLQRATLV--QTFGQKNHVC 195
Query: 145 HKD--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAG----AVNSP-----VREKLLQVWR 191
KD +++P P Q+ L + R+ +F G N P R VW
Sbjct: 196 LKDGSITIPPFAPPQKMQAHLIPADTPRSIFVYFRGLFYDTSNDPEGGYYARGARASVWE 255
Query: 192 N--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDL 249
N ++ ++ S Y +++ S FCL G+ + R+ +++ +GC+PVI+A+ L
Sbjct: 256 NFKNNPLFDISTDHPATYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIVADDIVL 315
Query: 250 PFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY-- 307
PFAD + W+ + VA D+P L IL I + + +L + +L Q +FP
Sbjct: 316 PFADAIPWEDIGVFVAEEDVPRLDTILTSIPT-DVVLRKQRLLANPSMKQAMLFPQPAQA 374
Query: 308 -DAFYMVMYEL 317
DAF+ ++ L
Sbjct: 375 GDAFHQILNGL 385
>gi|242059741|ref|XP_002459016.1| hypothetical protein SORBIDRAFT_03g044520 [Sorghum bicolor]
gi|241930991|gb|EES04136.1| hypothetical protein SORBIDRAFT_03g044520 [Sorghum bicolor]
Length = 417
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 83/309 (26%), Positives = 141/309 (45%), Gaps = 25/309 (8%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + P + I IS
Sbjct: 78 FAAEIFMHRFLLSSAIRTLNPEEADWFYTPVYTTCDLTPWGHPLPFKSPRIMRSAIQFIS 137
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
++PYWNRT GADHF+V H G +KA E + + + ++ H+ K
Sbjct: 138 NRWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGVLPLLRRATLVQTFGQKDHVCLK 197
Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAG----AVNSP-----VREKLLQVWRN- 192
+ +++P P Q+ L + R+ +F G N P R VW N
Sbjct: 198 EGSITIPPYAPPQKMKTHLVPPGTPRSIFVYFRGLFYDTANDPEGGYYARGARASVWENF 257
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
++ ++ S Y +++ + FCL G+ + R+ +++ +GC+PVIIA+ LPF
Sbjct: 258 KNNPLFDISTDHPPTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 317
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
AD + W+ + VA D+P L IL I E +L + +L Q +FP D
Sbjct: 318 ADAIPWEEIGVFVAEDDVPKLDTILTSIPM-EVILRKQRLLANPSMKQAMLFPQPAQPGD 376
Query: 309 AFYMVMYEL 317
AF+ ++ L
Sbjct: 377 AFHQILNGL 385
>gi|168003473|ref|XP_001754437.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694539|gb|EDQ80887.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 353
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 84/316 (26%), Positives = 132/316 (41%), Gaps = 52/316 (16%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIP------DFISHY 86
+A+E K M S T DP +AD FF+P ++ + G P F+
Sbjct: 31 FAAEVAIHKVLMTSPIRTLDPCEADFFFIPVYVS----CKFTPKTGFPWLGQARKFMEAA 86
Query: 87 IFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFI------- 139
+ ++S + +WNR+GG DH +VA H G E + I S +
Sbjct: 87 VNHVSTRMEFWNRSGGRDHIFVASHDYGACFHTLETEAIAHGIPEFMRKSLILQTFGVQD 146
Query: 140 ------SGHIAHKDVSLPQIWPRQ-EDPPKLGSSKRNKLAFFAGAV------------NS 180
+ HI P + +DPP+ KRN AFF G + +
Sbjct: 147 FHPCQAAEHIQIPPYVSPSVAASYIKDPPE--RQKRNIFAFFRGKMEINPKNVSGLVYSR 204
Query: 181 PVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVP 240
VR L + + ++ R Y E+L S FCL G+ + RI +++ +GCVP
Sbjct: 205 GVRTVLYKKFSHNRRFLLKRHRTDN-YQLEMLRSTFCLCPVGWAPWSPRIVEAVVHGCVP 263
Query: 241 VIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK------- 293
VIIA++ LP++ ++W S+ V D+P L KIL +++ +Q + K
Sbjct: 264 VIIADNISLPYSHAIDWTGISLSVREHDVPKLDKILLNVAATNLSTIQHNLWKEENRRAL 323
Query: 294 ------VRKHFQWHVF 303
V+ WHVF
Sbjct: 324 LFTDPLVKGDATWHVF 339
>gi|195619204|gb|ACG31432.1| secondary cell wall-related glycosyltransferase family 47 [Zea
mays]
Length = 419
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 83/309 (26%), Positives = 141/309 (45%), Gaps = 25/309 (8%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + P + I IS
Sbjct: 80 FAAEIFMHRFLLSSAIRTLNPEEADWFYTPVYTTCDLTPWGHPLPFKSPRIMRSAIQFIS 139
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
++PYWNRT GADHF+V H G +KA E + + + ++ H+ K
Sbjct: 140 NRWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGVLPLLRRATLVQTFGQKDHVCLK 199
Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAG----AVNSP-----VREKLLQVWRN- 192
+ +++P P Q+ L + R+ +F G N P R VW N
Sbjct: 200 EGSITIPPYAPPQKMKTHLVPPGTPRSIFVYFRGLFYDTANDPEGGYYARGARASVWENF 259
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
++ ++ S Y +++ + FCL G+ + R+ +++ +GC+PVIIA+ LPF
Sbjct: 260 KNNPLFDISTDHPPTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 319
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
AD + W+ + VA D+P L IL I E +L + +L Q +FP D
Sbjct: 320 ADAIPWEEIGVFVAEDDVPRLDTILTSIPM-EVILRKQRLLANPSMKQAMLFPQPAQAGD 378
Query: 309 AFYMVMYEL 317
AF+ ++ L
Sbjct: 379 AFHQILNGL 387
>gi|242059743|ref|XP_002459017.1| hypothetical protein SORBIDRAFT_03g044530 [Sorghum bicolor]
gi|241930992|gb|EES04137.1| hypothetical protein SORBIDRAFT_03g044530 [Sorghum bicolor]
Length = 420
Score = 100 bits (249), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 80/309 (25%), Positives = 138/309 (44%), Gaps = 25/309 (8%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + P + I I+
Sbjct: 81 FAAEIFMHRFLLSSAVRTFNPEEADWFYTPVYTTCDLTPSGLPLPFKSPRMMRSAIELIA 140
Query: 92 QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS-----GHIAHK 146
+PYWNR+ GADHF+V H G + + I + + + H+ K
Sbjct: 141 TNWPYWNRSEGADHFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLK 200
Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAG----AVNSP-----VREKLLQVWRN- 192
D +++P P Q+ L + R+ +F G N P R VW N
Sbjct: 201 DGSITIPPFAPPQKMQAHLIPADTPRSIFVYFRGLFYDTSNDPEGGYYARGARASVWENF 260
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
++ ++ S Y +++ S FCL G+ + R+ +++ +GC+PVIIA+ LPF
Sbjct: 261 KNNPLFDISTDHPPTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 320
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
AD + W+ + VA D+P L IL I + + +L + +L Q +FP D
Sbjct: 321 ADAIPWEEIGVFVAEEDVPKLDSILTSIPT-DVILRKQRLLANPSMKQAMLFPQPAQAGD 379
Query: 309 AFYMVMYEL 317
AF+ ++ L
Sbjct: 380 AFHQILNGL 388
>gi|357126596|ref|XP_003564973.1| PREDICTED: probable glucuronosyltransferase Os01g0926600-like
[Brachypodium distachyon]
Length = 411
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 84/312 (26%), Positives = 141/312 (45%), Gaps = 31/312 (9%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + P + I IS
Sbjct: 72 FAAEIFMHRFLLSSAIRTMNPEEADWFYTPVYTTCDLTPWGHPLPFKSPRIMRSAIQFIS 131
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
+PYWNRT GADHF+V H G +KA E + + + ++ H+ K
Sbjct: 132 SHWPYWNRTAGADHFFVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQKDHVCLK 191
Query: 147 D--VSLPQIWPRQED-----PPKLGSSKRNKLAFFAG----AVNSP-----VREKLLQVW 190
+ +++P P Q+ PP+ + R+ +F G N P R VW
Sbjct: 192 EGSINIPPYAPPQKMKTHLVPPE---TPRSIFVYFRGLFYDTANDPEGGYYARGARASVW 248
Query: 191 RN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYD 248
N ++ ++ S Y +++ + FCL G+ + R+ +++ +GC+PVIIA+
Sbjct: 249 ENFKNNPLFDISTDHPPTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 308
Query: 249 LPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY- 307
LPFAD + W + VA D+P L IL I E +L + +L Q +FP
Sbjct: 309 LPFADAIPWDEIGVFVAEDDVPKLDTILTSIPM-EVILRKQRLLANPSMKQAMLFPQPAQ 367
Query: 308 --DAFYMVMYEL 317
DAF+ ++ L
Sbjct: 368 PGDAFHQILNGL 379
>gi|302826405|ref|XP_002994684.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300137154|gb|EFJ04250.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 332
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 74/275 (26%), Positives = 126/275 (45%), Gaps = 28/275 (10%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIP------DFISHY 86
+A+E + ++S T DP +AD FF+P ++ R G P D +
Sbjct: 32 FAAEVAIHQNLLRSPVRTLDPDEADFFFMPVYVSCNFTSR----SGFPTLFHASDILQAA 87
Query: 87 IFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHK 146
+ +S+ P+W+R G DH +VA H G V + Q + +S I K
Sbjct: 88 VGLVSRNMPFWDRHQGRDHVFVATHDFGACFHAMDLAVTMGIPQFLRNS--IILQTFGEK 145
Query: 147 DVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAV------------NSPVREKLLQVWRNDS 194
+ Q + PP + +R LAFF G + + VR + + + +D
Sbjct: 146 NKHPCQNVDHIQIPPYV---RRKILAFFRGKMEIHPKNVSGHMYSRGVRTTIWRRFSHDR 202
Query: 195 EIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
+ R Y E+L S FCL G+ + RI +S+ GC+PVIIA++ LP++ V
Sbjct: 203 RFFIKRKRSDN-YKAEMLRSVFCLCPLGWAPWSPRIVESVIQGCIPVIIADNIQLPYSHV 261
Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQS 289
++W+ S+ VA D+ L +IL +++ ++Q+
Sbjct: 262 IDWRKISVTVAERDVHKLDRILSRVAATNVSMIQA 296
>gi|297808283|ref|XP_002872025.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297317862|gb|EFH48284.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 465
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 88/324 (27%), Positives = 145/324 (44%), Gaps = 49/324 (15%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIA----------RMRHDRRIGTEGIPDF 82
+A+E +A + S T DP +AD FF+P ++ + H R + + + DF
Sbjct: 137 FAAEVAIHRALLSSDVRTLDPEEADFFFVPVYVSCNFSTSNGFPSLSHARSLLSSAV-DF 195
Query: 83 ISHYIFNISQKYPYWNRTGGADHFYVACHSIGRS--AMQK-AWE------VKLNAIQVVC 133
+S + YP+WNRT G+DH +VA H G AM+ A E +K + I
Sbjct: 196 LSDH-------YPFWNRTQGSDHVFVASHDFGACFHAMEDMAIEEGIPEFMKKSIILQTF 248
Query: 134 SSSYFISGHIAHKDVSLPQIWPRQ-----EDPPKLGSSKRNKLAFFAGAV---------- 178
Y V P I P E P G +R+ AFF G +
Sbjct: 249 GVKYKHPCQEVEHVVIPPYIPPESVQRAIEKAPANG--RRDIWAFFRGKMEVNPKNISGR 306
Query: 179 --NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYY 236
+ VR +L+ + Y + R Y E++ S FCL G+ + R+ +S
Sbjct: 307 FYSKGVRTAILKKFGGRRRFYLNRHRFAG-YRSEIVRSVFCLCPLGWAPWSPRLVESAVL 365
Query: 237 GCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK--V 294
GCVPV+IA+ LPF++ + W S+ VA D+ L+KIL+ +++ ++Q +
Sbjct: 366 GCVPVVIADGIKLPFSETVRWPEISLTVAEKDVRSLRKILEHVAATNLSVIQRNLHGPVF 425
Query: 295 RKHFQWHVFPSDYDAFYMVMYELW 318
++ ++V + DA + ++ LW
Sbjct: 426 KRALLYNVPMKEGDATWHILESLW 449
>gi|307135997|gb|ADN33853.1| glucuronoxylan glucuronosyltransferase [Cucumis melo subsp. melo]
Length = 416
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 84/309 (27%), Positives = 140/309 (45%), Gaps = 25/309 (8%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + + P + I IS
Sbjct: 77 FAAEIFMHRFLLTSPVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIS 136
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
+PYWNRT GADHF+V H G +KA E + + + ++ H+ K
Sbjct: 137 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLK 196
Query: 147 D--VSLPQIWPRQEDPPKLGSSK--RNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
+ +++P P Q+ L K R+ +F G N P R VW N
Sbjct: 197 EGSITIPPYAPPQKMHAHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENF 256
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
D+ ++ S T Y +++ + FCL G+ + R+ +++ +GC+PVIIA+ LPF
Sbjct: 257 KDNPLFDISTEHPTTYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPF 316
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
AD + W+ + + D+ L IL I E +L + +L Q +FP D
Sbjct: 317 ADAIPWEEIGVFLDEKDVANLDTILTSIPL-EMILRKQRLLANPSMKQAMLFPQPAQPGD 375
Query: 309 AFYMVMYEL 317
AF+ V+ L
Sbjct: 376 AFHQVLNGL 384
>gi|194697530|gb|ACF82849.1| unknown [Zea mays]
Length = 418
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 78/309 (25%), Positives = 138/309 (44%), Gaps = 25/309 (8%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + P + I ++
Sbjct: 78 FAAEIFMHRFLLSSAVRTFNPEEADWFYTPVYTTCDLTPSGLPLPFKSPRMMRSAIELVA 137
Query: 92 QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS-----GHIAHK 146
+PYWNR+ GADHF+V H G + + I + + + H+ K
Sbjct: 138 TNWPYWNRSEGADHFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLK 197
Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAG----AVNSP-----VREKLLQVWRN- 192
D +++P P Q+ L + R+ +F G N P R VW N
Sbjct: 198 DGSITIPPFAPPQKMQAHLIPADTPRSIFVYFRGLFYDTSNDPEGGYYARGARASVWENF 257
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
++ ++ S Y +++ S FCL G+ + R+ +++ +GC+PVI+A+ LPF
Sbjct: 258 KNNPLFDISTDHPATYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIVADDIVLPF 317
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
AD + W+ + VA D+P L IL I + + +L + +L Q +FP D
Sbjct: 318 ADAIPWEDIGVFVAEEDVPRLDTILTSIPT-DVVLRKQRLLANPSMKQAMLFPQPAQAGD 376
Query: 309 AFYMVMYEL 317
AF+ ++ L
Sbjct: 377 AFHQILNGL 385
>gi|301072484|gb|ADK56172.1| glycosyltransferase 47 [Triticum aestivum]
Length = 415
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 83/312 (26%), Positives = 141/312 (45%), Gaps = 31/312 (9%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + P + + IS
Sbjct: 76 FAAEIFMHRFLLSSAIRTMNPEEADWFYTPVYTTCDLTPWGHPLPFKSPRIMRSAVQFIS 135
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
+PYWNRT GADHF+V H G +KA E + + + ++ H+ K
Sbjct: 136 SHWPYWNRTAGADHFFVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQKDHVCLK 195
Query: 147 D--VSLPQIWPRQED-----PPKLGSSKRNKLAFFAG----AVNSP-----VREKLLQVW 190
+ +++P P Q+ PP+ + R+ +F G N P R VW
Sbjct: 196 EGSINIPPYAPPQKMKTHLVPPE---TPRSIFVYFRGLFYDTANDPEGGYYARGARASVW 252
Query: 191 RN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYD 248
N ++ ++ S Y +++ + FCL G+ + R+ +++ +GC+PVIIA+
Sbjct: 253 ENFKNNPLFDISTDHPPTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 312
Query: 249 LPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY- 307
LPFAD + W + VA D+P L IL I E +L + +L Q +FP
Sbjct: 313 LPFADAIPWDEIGVFVAEDDVPKLDTILTSIPM-EVILRKQRLLANPSMKQAMLFPQPAQ 371
Query: 308 --DAFYMVMYEL 317
DAF+ ++ L
Sbjct: 372 PGDAFHQILNGL 383
>gi|363543255|ref|NP_001241842.1| uncharacterized protein LOC100857042 precursor [Zea mays]
gi|194704652|gb|ACF86410.1| unknown [Zea mays]
gi|224034207|gb|ACN36179.1| unknown [Zea mays]
gi|414878896|tpg|DAA56027.1| TPA: hypothetical protein ZEAMMB73_615997 [Zea mays]
Length = 418
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 78/309 (25%), Positives = 138/309 (44%), Gaps = 25/309 (8%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + P + I ++
Sbjct: 78 FAAEIFMHRFLLSSAVRTFNPEEADWFYTPVYTTCDLTPSGLPLPFKSPRMMRSAIELVA 137
Query: 92 QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS-----GHIAHK 146
+PYWNR+ GADHF+V H G + + I + + + H+ K
Sbjct: 138 TNWPYWNRSEGADHFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLK 197
Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAG----AVNSP-----VREKLLQVWRN- 192
D +++P P Q+ L + R+ +F G N P R VW N
Sbjct: 198 DGSITIPPFAPPQKMQAHLIPADTPRSIFVYFRGLFYDTSNDPEGGYYARGARASVWENF 257
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
++ ++ S Y +++ S FCL G+ + R+ +++ +GC+PVI+A+ LPF
Sbjct: 258 KNNPLFDISTDHPATYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIVADDIVLPF 317
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
AD + W+ + VA D+P L IL I + + +L + +L Q +FP D
Sbjct: 318 ADAIPWEDIGVFVAEEDVPRLDTILTSIPT-DVVLRKQRLLANPSMKQAMLFPQPAQAGD 376
Query: 309 AFYMVMYEL 317
AF+ ++ L
Sbjct: 377 AFHQILNGL 385
>gi|219363227|ref|NP_001136572.1| hypothetical protein precursor [Zea mays]
gi|194696226|gb|ACF82197.1| unknown [Zea mays]
gi|414866559|tpg|DAA45116.1| TPA: hypothetical protein ZEAMMB73_757616 [Zea mays]
Length = 453
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 81/294 (27%), Positives = 139/294 (47%), Gaps = 26/294 (8%)
Query: 54 SKADLFFLPF--SIARMRHDRRI--GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVA 109
+ AD+ F+PF S++ RH R + + + P W RTGG DH +A
Sbjct: 136 ADADVVFVPFFASLSFNRHSRVVPPARNSEDRALQRRLLEFLAARPEWRRTGGRDHVVLA 195
Query: 110 CHSIGR-SAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLP---QIWPRQEDPPKLG 164
H G A + W + V+C Y S KD+ P + D G
Sbjct: 196 HHPNGMLDARYRFWPC----VFVLCDFGRYPPSVANLDKDIIAPYRHLVANFAND--TAG 249
Query: 165 SSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----LLGSKF 216
R L +F GA+ +R++L + +++ +++ G + ++ + SKF
Sbjct: 250 YDDRPTLLYFQGAIYRKDGGSIRQELYYLLKDEKDVHFSFGSVAGNGIEQATQGMRSSKF 309
Query: 217 CLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PLLK 273
CL++ G ++ R+ DS+ CVPVII++ +LPF DVL++ FS++V D LK
Sbjct: 310 CLNIAGDTPSSNRLFDSIVSHCVPVIISDEIELPFEDVLDYSKFSVIVRGADAVKKGFLK 369
Query: 274 KILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
++KGIS EE+ + + + +V KHF++ DA M+ + + S+R++
Sbjct: 370 SLIKGISQEEWTRMWNKLKEVEKHFEYQYPSQTDDAVQMIWKAIARKVPSIRLK 423
>gi|326509565|dbj|BAJ86998.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326509967|dbj|BAJ87200.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 415
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 83/312 (26%), Positives = 141/312 (45%), Gaps = 31/312 (9%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + P + + IS
Sbjct: 76 FAAEIFMHRFLLSSAIRTMNPEEADWFYTPVYTTCDLTPWGHPLPFKSPRIMRSAVQFIS 135
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
+PYWNRT GADHF+V H G +KA E + + + ++ H+ K
Sbjct: 136 SHWPYWNRTAGADHFFVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQKDHVCLK 195
Query: 147 D--VSLPQIWPRQED-----PPKLGSSKRNKLAFFAG----AVNSP-----VREKLLQVW 190
+ +++P P Q+ PP+ + R+ +F G N P R VW
Sbjct: 196 EGSINIPPYAPPQKMKTHLVPPE---TPRSIFVYFRGLFYDTANDPEGGYYARGARASVW 252
Query: 191 RN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYD 248
N ++ ++ S Y +++ + FCL G+ + R+ +++ +GC+PVIIA+
Sbjct: 253 ENFKNNPLFDISTDHPPTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 312
Query: 249 LPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY- 307
LPFAD + W + VA D+P L IL I E +L + +L Q +FP
Sbjct: 313 LPFADAIPWDEIGVFVAEDDVPKLDTILTSIPM-EVILRKQRLLANPSMKQAMLFPQPAQ 371
Query: 308 --DAFYMVMYEL 317
DAF+ ++ L
Sbjct: 372 AGDAFHQILNGL 383
>gi|242088703|ref|XP_002440184.1| hypothetical protein SORBIDRAFT_09g027440 [Sorghum bicolor]
gi|241945469|gb|EES18614.1| hypothetical protein SORBIDRAFT_09g027440 [Sorghum bicolor]
Length = 415
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 80/309 (25%), Positives = 138/309 (44%), Gaps = 25/309 (8%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + P + I I+
Sbjct: 76 FAAEIFMHRFLLSSAVRTFNPEEADWFYTPVYTTCDLTPKGLPLPFKSPRMMRSAIQLIA 135
Query: 92 QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS-----GHIAHK 146
+PYWNR+ GADHF+V H G + + I + + + H+ K
Sbjct: 136 TNWPYWNRSEGADHFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLK 195
Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAG----AVNSP-----VREKLLQVWRN- 192
D +++P P Q+ L + R+ +F G N P R VW N
Sbjct: 196 DGSITIPPYAPPQKMQTHLIPADTPRSIFVYFRGLFYDTGNDPEGGYYARGARASVWENF 255
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
++ ++ S Y +++ S FCL G+ + R+ +++ +GC+PVIIA+ LPF
Sbjct: 256 KNNPLFDISTDHPPTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 315
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
AD + W+ + VA D+P L IL I + + +L + +L Q +FP D
Sbjct: 316 ADAIPWEEIGVFVAEEDVPQLDSILTSIPT-DVVLRKQRLLANPSMKQAMLFPQPAQPGD 374
Query: 309 AFYMVMYEL 317
AF+ ++ L
Sbjct: 375 AFHQILNGL 383
>gi|414878899|tpg|DAA56030.1| TPA: secondary cell wall glycosyltransferase family 47 isoform 1
[Zea mays]
gi|414878900|tpg|DAA56031.1| TPA: secondary cell wall glycosyltransferase family 47 isoform 2
[Zea mays]
Length = 419
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 83/309 (26%), Positives = 140/309 (45%), Gaps = 25/309 (8%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P AD F+ P ++ + P + I IS
Sbjct: 80 FAAEIFMHRFLLSSAIRTLNPEVADWFYTPVYTTCDLTPWGHPLPFKSPRIMRSAIQFIS 139
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
++PYWNRT GADHF+V H G +KA E + + + ++ H+ K
Sbjct: 140 NRWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGVLPLLRRATLVQTFGQKDHVCLK 199
Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAG----AVNSP-----VREKLLQVWRN- 192
+ +++P P Q+ L + R+ +F G N P R VW N
Sbjct: 200 EGSITIPPYAPPQKMKTHLVPPGTPRSIFVYFRGLFYDTANDPEGGYYARGARASVWENF 259
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
++ ++ S Y +++ + FCL G+ + R+ +++ +GC+PVIIA+ LPF
Sbjct: 260 KNNPLFDISTDHPPTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 319
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
AD + W+ + VA D+P L IL I E +L + +L Q +FP D
Sbjct: 320 ADAIPWEEIGVFVAEDDVPRLDTILTSIPM-EVILRKQRLLANPSMKQAMLFPQPAQAGD 378
Query: 309 AFYMVMYEL 317
AF+ ++ L
Sbjct: 379 AFHQILNGL 387
>gi|449456052|ref|XP_004145764.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
IRX7-like [Cucumis sativus]
Length = 459
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 85/323 (26%), Positives = 134/323 (41%), Gaps = 49/323 (15%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIP------DFISHY 86
+ASE +A + S + T DP +AD FF+P ++ G P IS
Sbjct: 132 FASEVAIHRALLNSDYRTFDPLEADFFFVPVYVS----CNFSTVNGFPAIGHARSLISSA 187
Query: 87 IFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHK 146
+ +IS Y +WNRT G+DH +VA H + + + F+ I +
Sbjct: 188 VSHISSHYSFWNRTNGSDHVFVASHDFASCFHT------MEHVAIADGVPSFLKNSIILQ 241
Query: 147 DVSLPQIWPRQE-----DPPKLG-------------SSKRNKLAFFAGAV---------- 178
+ P Q+ PP + + +R+ AFF G +
Sbjct: 242 TFGVKYKHPCQDVEHVVIPPYISPESIENTLERSPVTGRRDIFAFFRGKMEMNPKNVSGR 301
Query: 179 --NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYY 236
+ VR + + + D Y R Y E++ S FCL G+ + R+ +S+
Sbjct: 302 FYSKKVRTMIWRKFNGDRRFYLQRHRFPG-YQSEIVRSVFCLCPLGWAPWSPRLVESVAL 360
Query: 237 GCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVL--KV 294
GCVPVIIA+ LPF +NW SI VA DI L +IL +++ +Q + +
Sbjct: 361 GCVPVIIADGIRLPFPSAVNWPEISITVAEKDIGKLGRILDHVAASNLTTIQKNLWDPRN 420
Query: 295 RKHFQWHVFPSDYDAFYMVMYEL 317
R+ +H D DA + V+ L
Sbjct: 421 RRALLFHNQVEDGDATWQVIGAL 443
>gi|449440484|ref|XP_004138014.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
[Cucumis sativus]
gi|449528307|ref|XP_004171146.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
[Cucumis sativus]
Length = 416
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 84/309 (27%), Positives = 140/309 (45%), Gaps = 25/309 (8%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + + P + I IS
Sbjct: 77 FAAEIFMHRFLLTSPVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIS 136
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
+PYWNRT GADHF+V H G +KA E + + + ++ H+ K
Sbjct: 137 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLK 196
Query: 147 D--VSLPQIWPRQEDPPKLGSSK--RNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
+ +++P P Q+ L K R+ +F G N P R VW N
Sbjct: 197 EGSITIPPYAPPQKMHAHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENF 256
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
D+ ++ S T Y +++ + FCL G+ + R+ +++ +GC+PVIIA+ LPF
Sbjct: 257 KDNPLFDISTEHPTTYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPF 316
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
AD + W+ + + D+ L IL I E +L + +L Q +FP D
Sbjct: 317 ADAIPWEEIGVFLDEKDVANLDTILTSIPL-EMILRKQRLLANPSMKQAMLFPQPAQPGD 375
Query: 309 AFYMVMYEL 317
AF+ V+ L
Sbjct: 376 AFHQVLNGL 384
>gi|326530125|dbj|BAK08342.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 81/296 (27%), Positives = 140/296 (47%), Gaps = 22/296 (7%)
Query: 50 TKDPSKADLFFLPF--SIARMRHDRRIGTEGIPD--FISHYIFNISQKYPYWNRTGGADH 105
+DP++AD+ F+PF S++ RH + + + + + P W R+GG DH
Sbjct: 140 VRDPARADVVFVPFFASLSFNRHSKVVPPARTSEDRTLQRRLIEFLAARPEWRRSGGRDH 199
Query: 106 FYVACHSIGR-SAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLP-QIWPRQEDPPK 162
+A H G A K W + V+C Y S KDV P Q
Sbjct: 200 VVLAHHPNGMLDARYKLWP----CVFVLCDFGRYPHSVANIDKDVIAPYQHVVDDFLNDS 255
Query: 163 LGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----LLGS 214
G R L +F GA+ +R++L + +++ +++ G + +E + S
Sbjct: 256 TGYDDRPTLLYFQGAIYRKDGGFIRQELYYLLKDEKDVHFSFGSVAGNGIEESTRGMRAS 315
Query: 215 KFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PL 271
KFCL++ G ++ R+ DS+ CVPVII++ +LPF D+L++ F I+V D
Sbjct: 316 KFCLNIAGDTPSSNRLFDSIVSHCVPVIISDEIELPFEDMLDYSKFCIIVRGADAVKKGF 375
Query: 272 LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
L ++KGIS EE+ + + + +V HF++ DA M+ + + S+R++
Sbjct: 376 LINLIKGISPEEWTSMWNKLREVEGHFEYQYPSQPEDAVQMIWKTIARKVPSIRLK 431
>gi|75159221|sp|Q8S1X7.1|GT15_ORYSJ RecName: Full=Probable glucuronosyltransferase Os01g0926700
gi|20160728|dbj|BAB89670.1| P0482D04.17 [Oryza sativa Japonica Group]
gi|20805226|dbj|BAB92893.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
gi|125528942|gb|EAY77056.1| hypothetical protein OsI_05014 [Oryza sativa Indica Group]
gi|125573178|gb|EAZ14693.1| hypothetical protein OsJ_04618 [Oryza sativa Japonica Group]
Length = 417
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 80/309 (25%), Positives = 138/309 (44%), Gaps = 25/309 (8%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + P + I I+
Sbjct: 78 FAAEIFMHRFLLSSAVRTFNPEEADWFYTPVYTTCDLTPSGLPLPFKSPRMMRSAIELIA 137
Query: 92 QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS-----GHIAHK 146
+PYWNR+ GADHF+V H G + + I + + + H+ K
Sbjct: 138 TNWPYWNRSEGADHFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLK 197
Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAG----AVNSP-----VREKLLQVWRN- 192
D +++P P Q+ L + R+ +F G N P R VW N
Sbjct: 198 DGSITIPPYAPPQKMQAHLIPPDTPRSIFVYFRGLFYDTSNDPEGGYYARGARASVWENF 257
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
++ ++ S Y +++ S FCL G+ + R+ +++ +GC+PVIIA+ LPF
Sbjct: 258 KNNPLFDISTDHPPTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 317
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
AD + W+ + VA D+P L IL I + + +L + +L Q +FP D
Sbjct: 318 ADAIPWEEIGVFVAEEDVPKLDSILTSIPT-DVILRKQRLLANPSMKQAMLFPQPAQAGD 376
Query: 309 AFYMVMYEL 317
AF+ ++ L
Sbjct: 377 AFHQILNGL 385
>gi|115441967|ref|NP_001045263.1| Os01g0926600 [Oryza sativa Japonica Group]
gi|75159222|sp|Q8S1X8.1|GT14_ORYSJ RecName: Full=Probable glucuronosyltransferase Os01g0926600
gi|20160727|dbj|BAB89669.1| P0482D04.16 [Oryza sativa Japonica Group]
gi|20805225|dbj|BAB92892.1| putative pectin-glucuronyltransferase [Oryza sativa Japonica Group]
gi|113534794|dbj|BAF07177.1| Os01g0926600 [Oryza sativa Japonica Group]
gi|125528941|gb|EAY77055.1| hypothetical protein OsI_05013 [Oryza sativa Indica Group]
gi|125573177|gb|EAZ14692.1| hypothetical protein OsJ_04617 [Oryza sativa Japonica Group]
gi|215687152|dbj|BAG90922.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 415
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 84/312 (26%), Positives = 141/312 (45%), Gaps = 31/312 (9%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + P + I IS
Sbjct: 76 FAAEIFMHRFLLSSAIRTLNPEEADWFYTPVYTTCDLTPWGHPLPFKSPRIMRSAIQFIS 135
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
+PYWNRT GADHF+V H G +KA E + + + ++ H+ K
Sbjct: 136 SHWPYWNRTDGADHFFVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQKDHVCLK 195
Query: 147 D--VSLPQIWPRQED-----PPKLGSSKRNKLAFFAG----AVNSP-----VREKLLQVW 190
+ +++P P Q+ PP+ + R+ +F G N P R VW
Sbjct: 196 EGSITIPPYAPPQKMKTHLVPPE---TPRSIFVYFRGLFYDTANDPEGGYYARGARASVW 252
Query: 191 RN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYD 248
N ++ ++ S Y +++ S FCL G+ + R+ +++ +GC+PVIIA+
Sbjct: 253 ENFKNNPLFDISTDHPPTYYEDMQRSIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 312
Query: 249 LPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY- 307
LPFAD + W + VA D+P L IL I + +L + +L Q +FP
Sbjct: 313 LPFADAIPWDEIGVFVAEDDVPKLDTILTSIPM-DVILRKQRLLANPSMKQAMLFPQPAQ 371
Query: 308 --DAFYMVMYEL 317
DAF+ ++ L
Sbjct: 372 PGDAFHQILNGL 383
>gi|226499284|ref|NP_001142343.1| uncharacterized protein LOC100274514 precursor [Zea mays]
gi|194708308|gb|ACF88238.1| unknown [Zea mays]
Length = 419
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 83/309 (26%), Positives = 140/309 (45%), Gaps = 25/309 (8%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P AD F+ P ++ + P + I IS
Sbjct: 80 FAAEIFMHRFLLSSAIRTLNPEVADWFYTPVYTTCDLTPWGHPLPFRSPRIMRSAIQFIS 139
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
++PYWNRT GADHF+V H G +KA E + + + ++ H+ K
Sbjct: 140 NRWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGVLPLLRRATLVQTFGQKDHVCLK 199
Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAG----AVNSP-----VREKLLQVWRN- 192
+ +++P P Q+ L + R+ +F G N P R VW N
Sbjct: 200 EGSITIPPYAPPQKMKTHLVPPGTPRSIFVYFRGLFYDTANDPEGGYYARGARASVWENF 259
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
++ ++ S Y +++ + FCL G+ + R+ +++ +GC+PVIIA+ LPF
Sbjct: 260 KNNPLFDISTDHPPTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 319
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
AD + W+ + VA D+P L IL I E +L + +L Q +FP D
Sbjct: 320 ADAIPWEEIGVFVAEDDVPRLDTILTSIPM-EVILRKQRLLANPSMKQAMLFPQPAQAGD 378
Query: 309 AFYMVMYEL 317
AF+ ++ L
Sbjct: 379 AFHQILNGL 387
>gi|198429575|ref|XP_002120379.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 482
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 76/251 (30%), Positives = 121/251 (48%), Gaps = 28/251 (11%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPR--GNYASESYFKKAFMKSHFVTKDPSKADL 58
M + +VY+Y + + D P G YA E F++ KS+F T+ P+ A
Sbjct: 224 MVNNLKVYIYETK---------IGTDHHPHRVGGYAVERVFQELLEKSNFRTQHPNLATF 274
Query: 59 FFLPFSIARMRHDRRIGTEGIPD---FISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
FF+P + D EG+ + ++ + I +YPYW+++ GA+HFY+ H +G
Sbjct: 275 FFIPIRCSSYILDYPTEHEGLMEAKRVTANILHEIQTQYPYWSQSSGANHFYICSHDVGA 334
Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLP--------QIWPRQEDPPKLGSSK 167
+ + NAI +V ++ Y I HKD+S+P I + +
Sbjct: 335 KVAEGLMK---NAIGLVSTADYDDPYFIPHKDISIPPTPSSGLSNIHLIGKGGALVDVRG 391
Query: 168 RNKLAFFAGAVNSP-VREKLLQVWRNDSEIYAHSGRLK-TPYADELLGSKFCLHVKGFEV 225
RN LAFFAG + S +R + W +D +I + LK + Y ++L +KFCL +G EV
Sbjct: 392 RNILAFFAGDITSGRIRPLAWRTWYSDQDIEIINRILKPSAYIEKLKKAKFCLIFRGKEV 451
Query: 226 NTARIADSLYY 236
T + SLYY
Sbjct: 452 ITV-MCYSLYY 461
>gi|326511078|dbj|BAJ91886.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 415
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 83/312 (26%), Positives = 141/312 (45%), Gaps = 31/312 (9%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + P + + IS
Sbjct: 76 FAAEIFMHRFLLSSAIRTMNPEEADWFYTPVYTTCDLTPWGHPLPFKSPRIMRSAVQFIS 135
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
+PYWNRT GADHF+V H G +KA E + + + ++ H+ K
Sbjct: 136 SHWPYWNRTAGADHFFVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQKDHVCLK 195
Query: 147 D--VSLPQIWPRQED-----PPKLGSSKRNKLAFFAG----AVNSP-----VREKLLQVW 190
+ +++P P Q+ PP+ + R+ +F G N P R VW
Sbjct: 196 EGSINIPPYAPPQKMKTHLVPPE---TPRSIFVYFRGLFYDTANDPEGGYYARGARASVW 252
Query: 191 RN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYD 248
N ++ ++ S Y +++ + FCL G+ + R+ +++ +GC+PVIIA+
Sbjct: 253 ENFKNNPLFDISTDHPPTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 312
Query: 249 LPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY- 307
LPFAD + W + VA D+P L IL I E +L + +L Q +FP
Sbjct: 313 LPFADAIPWDEIGMFVAEDDVPKLDTILTSIPM-EVILRKQRLLANPSMKQAMLFPQPAQ 371
Query: 308 --DAFYMVMYEL 317
DAF+ ++ L
Sbjct: 372 AGDAFHQILNGL 383
>gi|226499780|ref|NP_001150738.1| secondary cell wall-related glycosyltransferase family 47 precursor
[Zea mays]
gi|195641374|gb|ACG40155.1| secondary cell wall-related glycosyltransferase family 47 [Zea
mays]
gi|223974207|gb|ACN31291.1| unknown [Zea mays]
Length = 415
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 80/309 (25%), Positives = 138/309 (44%), Gaps = 25/309 (8%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + P + I I+
Sbjct: 76 FAAEIFMHRFLLSSAVRTFNPEEADWFYTPVYTTCDLTPKGLPLPFKSPRMMRSAIQLIA 135
Query: 92 QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS-----GHIAHK 146
+PYWNR+ GADHF+V H G + + I + + + H+ K
Sbjct: 136 TNWPYWNRSEGADHFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLK 195
Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAG----AVNSP-----VREKLLQVWRN- 192
D +++P P Q+ L + R+ +F G N P R VW N
Sbjct: 196 DGSITIPPYAPPQKMQTHLIPADTPRSIFVYFRGLFYDTGNDPEGGYYARGARASVWENF 255
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
++ ++ S Y +++ S FCL G+ + R+ +++ +GC+PVIIA+ LPF
Sbjct: 256 KNNPLFDISTDHPPTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 315
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
AD + W+ + VA D+P L IL I + + +L + +L Q +FP D
Sbjct: 316 ADAIPWEEIGVFVAEEDVPRLDSILTSIPT-DVVLRKQRLLANPSMKQAMLFPQPAQAGD 374
Query: 309 AFYMVMYEL 317
AF+ ++ L
Sbjct: 375 AFHQILNGL 383
>gi|357121010|ref|XP_003562215.1| PREDICTED: probable glucuronosyltransferase Os03g0107900-like
[Brachypodium distachyon]
Length = 441
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 86/323 (26%), Positives = 149/323 (46%), Gaps = 46/323 (14%)
Query: 30 RGNYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIA----------RMRHDRRIGTEGI 79
R +A+E +A + H+ P ADLFF+P ++ + H R + + +
Sbjct: 113 RHLFAAEVALHEALL-GHYSAVRPEDADLFFVPVYVSCNFSTPNGFPSLSHARGLLADAV 171
Query: 80 PDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKL-NAI-----QVVC 133
D + ++ PYWNR+ GADH +VA H G + +V + + I + +
Sbjct: 172 -DLVR-------REAPYWNRSAGADHVFVASHDFG-ACFHPMEDVAIADGIPDFLKRSIL 222
Query: 134 SSSYFISG-HIAHKD---VSLPQIWPRQE-DPPKLGSSKRNKLAFFAGAV---------- 178
++ + G H+ + V P + P + +L ++R+ AFF G +
Sbjct: 223 LQTFGVQGPHVCQEAEHVVIPPHVPPEVALEILELEKTRRDIFAFFRGKMEVHPKNISGR 282
Query: 179 --NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYY 236
+ VR +LLQ + +S+ Y R Y E+ S FCL G+ + R+ +S+
Sbjct: 283 FYSKKVRTELLQRYGRNSKFYLKRKRYDN-YRSEMARSLFCLCPLGWAPWSPRLVESVLL 341
Query: 237 GCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKV-- 294
GC+PVIIA++ LPF VL W S+ VA D+ L+K+L + + ++Q +
Sbjct: 342 GCIPVIIADNIRLPFPSVLRWSDISLQVAEKDVASLEKVLDHVVATNLTVIQKNLWDPVK 401
Query: 295 RKHFQWHVFPSDYDAFYMVMYEL 317
RK ++ + DA + V+ EL
Sbjct: 402 RKALVFNRRLEEGDATWQVLREL 424
>gi|357112427|ref|XP_003558010.1| PREDICTED: probable glycosyltransferase At3g07620-like
[Brachypodium distachyon]
Length = 464
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 80/297 (26%), Positives = 146/297 (49%), Gaps = 24/297 (8%)
Query: 50 TKDPSKADLFFLPF--SIARMRHDRRIGTEGIPD--FISHYIFNISQKYPYWNRTGGADH 105
+DP++AD+ F+PF S++ RH + + + + + P W R+GG DH
Sbjct: 143 VRDPARADVVFVPFFASLSFNRHSKVVPPARTSEDRALQRRLIEFLAARPEWRRSGGRDH 202
Query: 106 FYVACHSIGR-SAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLPQIWPRQED-PPK 162
+A H G A K W + V+C Y S KDV P +
Sbjct: 203 VVLAHHPNGMLDARYKLWP----CVFVLCDFGRYPHSVANIDKDVIAPYLHVVGNFFNDS 258
Query: 163 LGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----LLGS 214
G R L +F GA+ +R++L + +++ +++ G + ++ + S
Sbjct: 259 AGYDARPTLLYFQGAIYRKDGGFIRQELYYLLKDEKDVHFSFGSVAGNGIEQSTQGMRAS 318
Query: 215 KFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PL 271
KFCL++ G ++ R+ DS+ CVP+II++ +LPF DVL++ F I+V +D
Sbjct: 319 KFCLNIAGDTPSSNRLFDSIVSHCVPIIISDEIELPFEDVLDYSKFCIIVRGVDAVKKGF 378
Query: 272 LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY-DAFYMVMYELWLRRSSVRVQ 327
L ++KGIS +E+ + + + +V +HF++ +PS + DA M+ + + S+R++
Sbjct: 379 LINLIKGISRQEWTSMWNKLKEVERHFEYQ-YPSQHDDAVQMIWKTIARKVPSIRLK 434
>gi|302791649|ref|XP_002977591.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300154961|gb|EFJ21595.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 345
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 76/285 (26%), Positives = 132/285 (46%), Gaps = 35/285 (12%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIP------DFISHY 86
+A+E + ++S T DP +AD FF+P ++ R G P D +
Sbjct: 32 FAAEVAIHQNLLRSPVRTLDPDEADFFFMPVYVSCNFTSR----SGFPTLFHASDILQAA 87
Query: 87 IFNISQKYPYWNRTGGADHFYVACHSIGRS--AMQKAW------EVKLNAI--QVVCSSS 136
+ +S+ P+W+R G DH +VA H G AM+ + N+I Q +
Sbjct: 88 VGLVSRNMPFWDRHQGRDHVFVATHDFGACFHAMEDLAVAMGIPQFLRNSIILQTFGEKN 147
Query: 137 YFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAV------------NSPVRE 184
++ H + P + P ++ P G +R LAFF G + + VR
Sbjct: 148 KHPCQNVDHIQIP-PYVVPAKKLPDPRGQ-RRKILAFFRGKMEIHPKNVSGHMYSRGVRT 205
Query: 185 KLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIA 244
+ + + +D + R Y E+L S FCL G+ + RI +S+ GC+PVIIA
Sbjct: 206 TIWRRFSHDRRFFIKRKRSDN-YKAEMLRSVFCLCPLGWAPWSPRIVESVIQGCIPVIIA 264
Query: 245 NHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQS 289
++ LP++ V++W+ S+ VA D+ L +IL +++ ++Q+
Sbjct: 265 DNIQLPYSHVIDWRKISVTVAERDVHKLDRILSKVAATNVSMIQA 309
>gi|60657598|gb|AAX33320.1| secondary cell wall-related glycosyltransferase family 47 [Populus
tremula x Populus tremuloides]
Length = 412
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 84/313 (26%), Positives = 132/313 (42%), Gaps = 33/313 (10%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P + + P + I IS
Sbjct: 73 FAAEIFMHRFLLSSPVRTLNPDEADWFYSPIYPTCDLTPMGLPLPFKSPRMMRSAIQLIS 132
Query: 92 QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFI------------ 139
+PYWNRT GADHF+V H G + + I + S +
Sbjct: 133 SNWPYWNRTEGADHFFVVPHDFGACFHCQEEKAVERGILPLLQRSTLVQTFGRRNHVCLN 192
Query: 140 SGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAV----NSP-----VREKLLQVW 190
G I + PQ + PP + R+ +F G N P R VW
Sbjct: 193 EGSITIPPFAPPQKMQAHQIPPDI---PRSIFVYFRGLFYDVNNDPEGGYYARGARAAVW 249
Query: 191 RN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYD 248
N ++ ++ S T Y +++ + FCL G+ + R+ +++ +GC+PVIIA+
Sbjct: 250 ENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 309
Query: 249 LPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL----LLQSYVLKVRKHFQWHVFP 304
LPFAD + W+ + VA D+P L IL I E L LL + +K F P
Sbjct: 310 LPFADAIPWEEIGVFVAEEDVPNLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP 369
Query: 305 SDYDAFYMVMYEL 317
DAF+ ++ L
Sbjct: 370 G--DAFHQILNGL 380
>gi|212276027|ref|NP_001130448.1| uncharacterized protein LOC100191546 precursor [Zea mays]
gi|194689154|gb|ACF78661.1| unknown [Zea mays]
gi|413946366|gb|AFW79015.1| putative Secondary cell wall glycosyltransferase family 47 [Zea
mays]
Length = 415
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 80/309 (25%), Positives = 138/309 (44%), Gaps = 25/309 (8%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + P + I I+
Sbjct: 76 FAAEIFMHRFLLSSAVRTFNPEEADWFYTPVYTTCDLTPKGLPLPFKSPRMMRSAIQLIA 135
Query: 92 QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS-----GHIAHK 146
+PYWNR+ GADHF+V H G + + I + + + H+ K
Sbjct: 136 TNWPYWNRSEGADHFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLK 195
Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAG----AVNSP-----VREKLLQVWRN- 192
D +++P P Q+ L + R+ +F G N P R VW N
Sbjct: 196 DGSITIPPYAPPQKMQTHLIPPDTPRSIFVYFRGLFYDTGNDPEGGYYARGARASVWENF 255
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
++ ++ S Y +++ S FCL G+ + R+ +++ +GC+PVIIA+ LPF
Sbjct: 256 KNNPLFDISTDHPPTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 315
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
AD + W+ + VA D+P L IL I + + +L + +L Q +FP D
Sbjct: 316 ADAIPWEEIGVFVAEDDVPQLDSILTSIPT-DVVLRKQRLLANPSMKQAMLFPQPAQPGD 374
Query: 309 AFYMVMYEL 317
AF+ ++ L
Sbjct: 375 AFHQILNGL 383
>gi|302786830|ref|XP_002975186.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300157345|gb|EFJ23971.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 405
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 85/324 (26%), Positives = 152/324 (46%), Gaps = 41/324 (12%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIP------DFISHY 86
+A+E + ++S T DP +AD FF+P ++ R G P D +
Sbjct: 81 FAAEVAIHQNLLRSPVRTLDPDEADFFFMPVYVSCNFTSR----SGFPTLFHASDILQAA 136
Query: 87 IFNISQKYPYWNRTGGADHFYVACHSIGRS--AMQKAWEVKLNAIQVVCSSSYFIS---- 140
+ +S+ P+W+R G DH +VA H G AM+ V + Q + +S +
Sbjct: 137 VGLVSRNMPFWDRHQGRDHVFVATHDFGACFHAMED-LAVTMGIPQFLRNSIILQTFGEK 195
Query: 141 -----GHIAHKDVSLPQIWPRQEDP-PKLGSSKRNKLAFFAGAV------------NSPV 182
++ H + P + P ++ P P+ S +R LAFF G + + V
Sbjct: 196 NKHPCQNVDHIQIP-PYVVPAKKLPDPR--SQRRKILAFFRGKMEIHPKNVSGHMYSRGV 252
Query: 183 REKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVI 242
R + + + +D + R Y E+L S FCL G+ + RI +S+ GC+PVI
Sbjct: 253 RTTIWRRFSHDRRFFIKRKRSDN-YKAEMLRSVFCLCPLGWAPWSPRIVESVIQGCIPVI 311
Query: 243 IANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK--VRKHFQW 300
IA++ LP++ V++W+ S+ VA D+ L +IL +++ ++Q+ + + VR+ +
Sbjct: 312 IADNIQLPYSHVIDWRKISVTVAERDVHKLDRILSRVAATNVSMIQANLWRDEVRQALVY 371
Query: 301 HVFPSDYDAFYMVMYELWLRRSSV 324
+ DA + V+ L R++ V
Sbjct: 372 NQPLVRGDATWQVLDLLSKRKNKV 395
>gi|194706890|gb|ACF87529.1| unknown [Zea mays]
Length = 418
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 82/314 (26%), Positives = 144/314 (45%), Gaps = 35/314 (11%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + T P + I +S
Sbjct: 79 FATEIFMHRFLLASAVRTLNPDEADWFYTPVYTTCDLTPWGHPLTVKSPRMMRSAIQYVS 138
Query: 92 QKYPYWNRTGGADHFYVACHSIG-------RSAMQKAWEVKLNAIQVVCSSSYFISGHIA 144
+++PYWNRT GADHF+V H G A+Q+ L +V ++ H+
Sbjct: 139 KRWPYWNRTEGADHFFVTPHDFGACFYFQEEKAIQRGVLPVLRRATLV--QTFGQKNHVC 196
Query: 145 HKD--VSLPQIWPRQED-----PPKLGSSKRNKLAFFAG----AVNSP-----VREKLLQ 188
++ +++P P + PP+ + R+ +F G N P R
Sbjct: 197 LREGSITIPPYAPPHKIRAHIVPPE---TPRSIFVYFRGLFYDTANDPEGGYYARGARAS 253
Query: 189 VWRN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANH 246
VW N ++ ++ S Y +++ + FCL G+ + R+ +++ +GC+PVIIA+
Sbjct: 254 VWENFKNNALFDISTEHPPTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 313
Query: 247 YDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSD 306
LPFAD + W+ ++ V D+ L IL I +E +L + +L Q +FP
Sbjct: 314 IVLPFADAIPWEEIAVFVPEDDVLRLDTILTSIPMDE-ILRKQRLLANPSMKQAMLFPQP 372
Query: 307 Y---DAFYMVMYEL 317
DAF+ V+ L
Sbjct: 373 AEPRDAFHQVLNGL 386
>gi|413946367|gb|AFW79016.1| secondary cell wall glycosyltransferase family 47 [Zea mays]
Length = 418
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 82/314 (26%), Positives = 144/314 (45%), Gaps = 35/314 (11%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + T P + I +S
Sbjct: 79 FATEIFMHRFLLASAVRTLNPDEADWFYTPVYTTCDLTPWGHPLTVKSPRMMRSAIQYVS 138
Query: 92 QKYPYWNRTGGADHFYVACHSIG-------RSAMQKAWEVKLNAIQVVCSSSYFISGHIA 144
+++PYWNRT GADHF+V H G A+Q+ L +V ++ H+
Sbjct: 139 KRWPYWNRTEGADHFFVTPHDFGACFYFQEEKAIQRGVLPVLRRATLV--QTFGQKNHVC 196
Query: 145 HKD--VSLPQIWPRQED-----PPKLGSSKRNKLAFFAG----AVNSP-----VREKLLQ 188
++ +++P P + PP+ + R+ +F G N P R
Sbjct: 197 LREGSITIPPYAPPHKIRAHIVPPE---TPRSIFVYFRGLFYDTANDPEGGYYARGARAS 253
Query: 189 VWRN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANH 246
VW N ++ ++ S Y +++ + FCL G+ + R+ +++ +GC+PVIIA+
Sbjct: 254 VWENFKNNALFDISTEHPPTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 313
Query: 247 YDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSD 306
LPFAD + W+ ++ V D+ L IL I +E +L + +L Q +FP
Sbjct: 314 IVLPFADAIPWEEIAVFVPEDDVLRLDTILTSIPMDE-ILRKQRLLANPSMKQAMLFPQP 372
Query: 307 Y---DAFYMVMYEL 317
DAF+ V+ L
Sbjct: 373 AEPRDAFHQVLNGL 386
>gi|224166016|ref|XP_002338879.1| predicted protein [Populus trichocarpa]
gi|222873817|gb|EEF10948.1| predicted protein [Populus trichocarpa]
Length = 208
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 62/201 (30%), Positives = 102/201 (50%), Gaps = 46/201 (22%)
Query: 83 ISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGH 142
+ H++ ++ +YPYWNRT GADHF++ C I A ++ W + N+I+V+CS SY + +
Sbjct: 11 VQHFVNSLISEYPYWNRTLGADHFFITCADIHVIASERIWNLMKNSIRVMCSPSYNVE-Y 69
Query: 143 IAHKDVSLPQ-------------------IWPRQEDPPKLGSSKRN-------------- 169
+ HKDVSLPQ I P + P L ++K N
Sbjct: 70 VPHKDVSLPQSVQPFNVSVSQIMPPLYAFIAPTTQ-PLTLPAAKYNMKSRYRYLLCPWII 128
Query: 170 ---KLAFFAGAVNSPVREKLLQVWRNDSEI-------YAHSGRLKTPYADELLGSKFCLH 219
+ +F+ G + +R+ L+ W NDSE+ A + ++ Y ++ SKFC+
Sbjct: 129 LEQEYSFWRGLKENYIRKSLVNAWENDSELDIKEIQTEASTTEIRRLYHEKFYSSKFCIC 188
Query: 220 VKGFEVNTARIADSLYYGCVP 240
G +++ A IA +++YGCVP
Sbjct: 189 PGGPQIDGA-IAVAIHYGCVP 208
>gi|224053575|ref|XP_002297880.1| predicted protein [Populus trichocarpa]
gi|222845138|gb|EEE82685.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 84/313 (26%), Positives = 132/313 (42%), Gaps = 33/313 (10%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P + + P + I IS
Sbjct: 73 FAAEIFMHRFLLSSPVRTLNPDEADWFYSPIYPTCDLTPMGLPLPFKSPRMMRSAIQLIS 132
Query: 92 QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFI------------ 139
+PYWNRT GADHF+V H G + + I + S +
Sbjct: 133 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRSTLVQTFGQRNHVCLN 192
Query: 140 SGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAV----NSP-----VREKLLQVW 190
G I + PQ + PP + R+ +F G N P R VW
Sbjct: 193 EGSITIPPFAPPQKMQAHQIPPDI---PRSIFVYFRGLFYDVNNDPEGGYYARGARAAVW 249
Query: 191 RN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYD 248
N ++ ++ S T Y +++ + FCL G+ + R+ +++ +GC+PVIIA+
Sbjct: 250 ENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 309
Query: 249 LPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL----LLQSYVLKVRKHFQWHVFP 304
LPFAD + W+ + VA D+P L IL I E L LL + +K F P
Sbjct: 310 LPFADAIPWEEIGVFVAEEDVPNLDTILTSIPPEVILRKQRLLANPSMKRAMLFPQPAQP 369
Query: 305 SDYDAFYMVMYEL 317
DAF+ ++ L
Sbjct: 370 G--DAFHQILNGL 380
>gi|115458146|ref|NP_001052673.1| Os04g0398600 [Oryza sativa Japonica Group]
gi|75143148|sp|Q7XLG3.2|GT42_ORYSJ RecName: Full=Probable glucuronosyltransferase Os04g0398600
gi|38347167|emb|CAE05157.2| OSJNBa0039C07.13 [Oryza sativa Japonica Group]
gi|113564244|dbj|BAF14587.1| Os04g0398600 [Oryza sativa Japonica Group]
gi|116309465|emb|CAH66536.1| H0209H04.3 [Oryza sativa Indica Group]
gi|125548129|gb|EAY93951.1| hypothetical protein OsI_15724 [Oryza sativa Indica Group]
gi|125590243|gb|EAZ30593.1| hypothetical protein OsJ_14642 [Oryza sativa Japonica Group]
gi|215706942|dbj|BAG93402.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215741563|dbj|BAG98058.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 420
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 83/333 (24%), Positives = 149/333 (44%), Gaps = 35/333 (10%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + P + I IS
Sbjct: 81 FAAEIFMHRFLLSSAVRTLNPKEADWFYTPVYTTCDLTPAGLPLPFKSPRVMRSAIQYIS 140
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
K+P+WNRT GADHF+V H G +KA E + + + ++ H+ K
Sbjct: 141 HKWPFWNRTDGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQENHVCLK 200
Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAG----AVNSP-----VREKLLQVWRN- 192
+ +++P P Q+ L + R+ +F G N P R +W N
Sbjct: 201 EGSITIPPYAPPQKMQAHLIPPDTPRSIFVYFRGLFYDTGNDPEGGYYARGARASLWENF 260
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
++ ++ S Y +++ + FCL G+ + R+ +++ +GC+PVIIA+ LPF
Sbjct: 261 KNNPLFDISTDHPPTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 320
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
AD + W+ + V D+P L IL + ++ +L + +L Q +FP D
Sbjct: 321 ADAIPWEEIGVFVEEKDVPKLDTILTSMPIDD-ILRKQRLLANPSMKQAMLFPQPAQPRD 379
Query: 309 AFYMVMYEL----------WLRRSSVRVQWSTS 331
AF+ ++ L +L+ S R+ W+
Sbjct: 380 AFHQILNGLARKLPHPEGVYLQPSDKRLNWTAG 412
>gi|226529361|ref|NP_001147714.1| LOC100281324 precursor [Zea mays]
gi|195613258|gb|ACG28459.1| secondary cell wall-related glycosyltransferase family 47 [Zea
mays]
Length = 418
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 82/314 (26%), Positives = 144/314 (45%), Gaps = 35/314 (11%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + T P + I +S
Sbjct: 79 FATEIFMHRFLLASAVRTLNPDEADWFYTPVYTTCDLTPWGHPLTVKSPRMMRSAIQYVS 138
Query: 92 QKYPYWNRTGGADHFYVACHSIG-------RSAMQKAWEVKLNAIQVVCSSSYFISGHIA 144
+++PYWNRT GADHF+V H G A+Q+ L +V ++ H+
Sbjct: 139 KRWPYWNRTEGADHFFVTPHDFGACFYFQEEKAIQRGVLPVLRRATLV--QTFGQKNHVC 196
Query: 145 HKD--VSLPQIWPRQED-----PPKLGSSKRNKLAFFAG----AVNSP-----VREKLLQ 188
++ +++P P + PP+ + R+ +F G N P R
Sbjct: 197 LREGSITIPPYAPPHKIRAHIVPPE---TPRSIFVYFRGLFYDTANDPEGGYYARGARAS 253
Query: 189 VWRN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANH 246
VW N ++ ++ S Y +++ + FCL G+ + R+ +++ +GC+PVIIA+
Sbjct: 254 VWENFKNNALFDISTEHPPTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 313
Query: 247 YDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSD 306
LPFAD + W+ ++ V D+ L IL I +E +L + +L Q +FP
Sbjct: 314 IVLPFADAIPWEEIAVFVPEDDVLRLDTILTSIPMDE-ILRKQRLLANPSMKQAMLFPQP 372
Query: 307 Y---DAFYMVMYEL 317
DAF+ V+ L
Sbjct: 373 AEPRDAFHQVLNGL 386
>gi|363543513|ref|NP_001241766.1| secondary cell wall-related glycosyltransferase family 47 precursor
[Zea mays]
gi|195634597|gb|ACG36767.1| secondary cell wall-related glycosyltransferase family 47 [Zea
mays]
Length = 417
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 82/314 (26%), Positives = 144/314 (45%), Gaps = 35/314 (11%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + T P + I +S
Sbjct: 78 FATEIFMHRFLLASAVRTLNPDEADWFYTPVYTTCDLTPWGHPLTVKSPRMMRSAIQYVS 137
Query: 92 QKYPYWNRTGGADHFYVACHSIG-------RSAMQKAWEVKLNAIQVVCSSSYFISGHIA 144
+++PYWNRT GADHF+V H G A+Q+ L +V ++ H+
Sbjct: 138 KRWPYWNRTEGADHFFVTPHDFGACFYFQEEKAIQRGVLPVLRRATLV--QTFGQKNHVC 195
Query: 145 HKD--VSLPQIWPRQED-----PPKLGSSKRNKLAFFAG----AVNSP-----VREKLLQ 188
++ +++P P + PP+ + R+ +F G N P R
Sbjct: 196 LREGSITIPPYAPPHKIRAHIVPPE---TPRSIFVYFRGLFYDTANDPEGGYYARGARAS 252
Query: 189 VWRN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANH 246
VW N ++ ++ S Y +++ + FCL G+ + R+ +++ +GC+PVIIA+
Sbjct: 253 VWENFKNNALFDISTEHPPTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 312
Query: 247 YDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSD 306
LPFAD + W+ ++ V D+ L IL I +E +L + +L Q +FP
Sbjct: 313 IVLPFADAIPWEEIAVFVPEDDVLRLDTILTSIPMDE-ILRKQRLLANPSMKQAMLFPQP 371
Query: 307 Y---DAFYMVMYEL 317
DAF+ V+ L
Sbjct: 372 AEPRDAFHQVLNGL 385
>gi|242042569|ref|XP_002468679.1| hypothetical protein SORBIDRAFT_01g050110 [Sorghum bicolor]
gi|241922533|gb|EER95677.1| hypothetical protein SORBIDRAFT_01g050110 [Sorghum bicolor]
Length = 429
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 78/292 (26%), Positives = 131/292 (44%), Gaps = 43/292 (14%)
Query: 30 RGNYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIA----------RMRHDRRIGTEGI 79
R +A+E +A + P ADLFF+P ++ + H R + E +
Sbjct: 99 RHLFAAEVAVHEALLAYAGRAARPEDADLFFVPVYVSCNFSTPNGFPSLSHARGLLAEAV 158
Query: 80 PDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKL-NAI-----QVVC 133
D + + PYWNR+ GADH +VA H G + +V + + I + +
Sbjct: 159 -DLVR-------VRMPYWNRSAGADHVFVASHDFG-ACFHPMEDVAIADGIPEFLKRSIL 209
Query: 134 SSSYFISGHIAHKDVS----LPQIWPR-QEDPPKLGSSKRNKLAFFAGAV---------- 178
++ + GH ++V P + P + P+ ++R+ AFF G +
Sbjct: 210 LQTFGVQGHHVCQEVEHVVIPPHVPPEVAHELPEPEKAQRDIFAFFRGKMEVHPKNISGR 269
Query: 179 --NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYY 236
+ VR +LLQ + + + Y R Y E+ S FCL G+ + R+ +S+
Sbjct: 270 FYSKKVRTELLQHYGRNRKFYLKRKRFDN-YRSEMARSLFCLCPLGWAPWSPRLVESVLL 328
Query: 237 GCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
GC+PVIIA++ LPF VL W S+ VA DI L+ +L + + ++Q
Sbjct: 329 GCIPVIIADNIRLPFPSVLQWPEISLQVAEKDIANLEMVLDHVVATNLTMIQ 380
>gi|242061766|ref|XP_002452172.1| hypothetical protein SORBIDRAFT_04g021180 [Sorghum bicolor]
gi|241932003|gb|EES05148.1| hypothetical protein SORBIDRAFT_04g021180 [Sorghum bicolor]
Length = 432
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 83/312 (26%), Positives = 139/312 (44%), Gaps = 31/312 (9%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + P + I IS
Sbjct: 93 FAAEIFMHRFLLSSAVRTLNPEEADWFYTPVYTTCDLTPAGLPLPFKSPRMMRSAIKFIS 152
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
K+P+WNRT GADHF+V H G +KA E + + + ++ H+ K
Sbjct: 153 NKWPFWNRTDGADHFFVVPHDFGACFHYQEEKATERGILPMLRRATLVQTFGQKNHVCLK 212
Query: 147 DVSLPQIWPRQEDPPKLGS------SKRNKLAFFAGAV----NSP-----VREKLLQVWR 191
+ S+ I P P K+ + + R+ +F G N P R +W
Sbjct: 213 EGSI--IIPPYAPPQKMQAHLVPPDTPRSIFVYFRGLFYDNGNDPEGGYYARGARASLWE 270
Query: 192 N--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDL 249
N ++ ++ S Y +++ + FCL G+ + R+ +++ +GC+PVIIA+ L
Sbjct: 271 NFKNNPLFDISTDHPVTYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL 330
Query: 250 PFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL----LLQSYVLKVRKHFQWHVFPS 305
PFAD + W+ + V D+P L IL I E L LL + +K F P
Sbjct: 331 PFADAIPWEDIGVFVDEEDVPKLDSILTSIPIENILRKQRLLANPSMKKAMLFPQPAQPR 390
Query: 306 DYDAFYMVMYEL 317
DAF+ ++ L
Sbjct: 391 --DAFHQILNGL 400
>gi|357141481|ref|XP_003572240.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
[Brachypodium distachyon]
Length = 543
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 81/297 (27%), Positives = 145/297 (48%), Gaps = 23/297 (7%)
Query: 54 SKADLFFLPF--SIARMRHDRRIGTEGIPD--FISHYIFNISQKYPYWNRTGGADHFYVA 109
S+AD+FF+PF S++ RH + G E + + + + W R GG DH V
Sbjct: 241 SQADVFFVPFFASLSYNRHSKLQGKEKMSRNRLLQAELVKYLARQEEWRRWGGKDHLVVP 300
Query: 110 CHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLP--QIWPRQEDPPKLGS 165
H ++M +A + KL+A V S +A+ KDV P + D
Sbjct: 301 HHP---NSMMQARK-KLSAAMYVLSDFGRYPPDVANLKKDVVAPYKHVVRSLRDDESPTF 356
Query: 166 SKRNKLAFFAGAVN----SPVREKLLQVWRNDSEIYAHSGRLKTP----YADELLGSKFC 217
+R LA+F GA++ VR+KL Q+ +++ +++ G ++ + SKFC
Sbjct: 357 DQRPVLAYFQGAIHRKDGGKVRQKLYQLLKDEKDVHFTYGSVRQNGIRRATKGMASSKFC 416
Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PLLKK 274
L++ G ++ R+ D++ CVPV+I++ +LPF DVL++ F + V D L +
Sbjct: 417 LNIAGDTPSSNRLFDAIVSHCVPVMISDDIELPFEDVLDYSEFCVFVRASDAVRKGFLLR 476
Query: 275 ILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQWSTS 331
+L+GI+ +E+ + + +V HF++ DA M+ + + S+++Q S
Sbjct: 477 LLRGITRDEWNTMWERLKEVAHHFEYQYPSKPDDAVQMIWGAVARKMHSLKLQLHKS 533
>gi|356541948|ref|XP_003539434.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
IRX7-like [Glycine max]
Length = 459
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 78/291 (26%), Positives = 127/291 (43%), Gaps = 45/291 (15%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDF---------I 83
+ASE +A + S T DP +AD FF+P ++ + DF I
Sbjct: 126 FASEVAIHRALLTSEVRTFDPYEADFFFVPVYVS-------CNFSAVNDFPAIGHARTLI 178
Query: 84 SHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISG-H 142
S + +S +YP+WNR+ G+DH +VA H G + I ++ +S +
Sbjct: 179 SSAVNLVSTEYPFWNRSRGSDHVFVASHDFGACFHTLEDVAMADGIPIILKNSIVLQTFG 238
Query: 143 IAHKD--------VSLPQIWPRQ-----EDPPKLGSSKRNKLAFFAGAV----------- 178
+ H+ V P + P E P G +R+ AFF G +
Sbjct: 239 VIHQHPCQEVENVVIPPYVSPESVRSTLEKFPVTG--RRDIFAFFRGKMEVHPKNVSRRF 296
Query: 179 -NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYG 237
+ VR ++ + + D Y R Y E+ S FCL G+ + R+ +S+ G
Sbjct: 297 YSKRVRTEIWRKFNGDRRFYLQRHRFAG-YQLEIARSVFCLCPLGWAPWSPRLVESVALG 355
Query: 238 CVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
CVPV+IA+ LPF+ + W S+ VA D+ L KIL+ +++ ++Q
Sbjct: 356 CVPVVIADGIQLPFSSAVRWSEISLSVAERDVGKLGKILERVAATNLSVIQ 406
>gi|242079279|ref|XP_002444408.1| hypothetical protein SORBIDRAFT_07g021480 [Sorghum bicolor]
gi|241940758|gb|EES13903.1| hypothetical protein SORBIDRAFT_07g021480 [Sorghum bicolor]
Length = 539
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 88/313 (28%), Positives = 151/313 (48%), Gaps = 43/313 (13%)
Query: 52 DPSKADLFFLPF--SIARMRHDRRIGTEGIPD--FISHYIFNISQKYPYWNRTGGADHFY 107
+ S AD+FF+PF S++ RH + E + F+ + + W R GG +H
Sbjct: 219 NASLADVFFVPFFASLSYNRHSKLRRGEKVNRNRFLQAELVRYLMRKEEWRRWGGKNHLI 278
Query: 108 VACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLPQIWPRQE----DPP 161
V H ++M +A + KL+A V S S H+A+ KDV P + + D P
Sbjct: 279 VPHHP---NSMMEARK-KLSAAMFVLSDFGRYSPHVANLKKDVIAPYMHVVRSFGDGDSP 334
Query: 162 KLGSSKRNKLAFFAGAV--------------NSP------VREKLLQVWRNDSEIYAHSG 201
+R LA+F GA+ N P VR+KL Q+ +++ +++ G
Sbjct: 335 AF--DQRPILAYFQGAIHRKAVRALCSVLVANRPAFQGGKVRQKLYQLLKDERDVHFTYG 392
Query: 202 RLKTP----YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNW 257
++ + SKFCL++ G ++ R+ D++ CVPVII++ +LPF DVL++
Sbjct: 393 SVRQNGIRRATAGMSTSKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDDIELPFEDVLDY 452
Query: 258 KSFSIVVATLDIP---LLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
F + V + D L ++L+GIS +E+ + + KV +HF++ DA M+
Sbjct: 453 SEFCVFVRSADAAKRGFLLRLLRGISRDEWTKMWMRLKKVTRHFEYQYPSRSGDAVQMIW 512
Query: 315 YELWLRRSSVRVQ 327
+ + SV++Q
Sbjct: 513 SAVARKMHSVQLQ 525
>gi|357472125|ref|XP_003606347.1| Secondary cell wall-related glycosyltransferase family [Medicago
truncatula]
gi|355507402|gb|AES88544.1| Secondary cell wall-related glycosyltransferase family [Medicago
truncatula]
Length = 427
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 88/319 (27%), Positives = 141/319 (44%), Gaps = 41/319 (12%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIP------DFISHY 86
+ASE +A + S T DP +AD FF+P ++ G P IS
Sbjct: 95 FASEVAIHRALLTSDVRTFDPYEADFFFVPVYVS----CNFSTVNGFPAIGHARSLISSA 150
Query: 87 IFNISQKYPYWNRTGGADHFYVACHSIGRS-------AMQKAW-EVKLNAIQV-VCSSSY 137
+ IS +YP+WNR+ G+DH +VA H G AM+ E+ N+I + +Y
Sbjct: 151 VKLISTEYPFWNRSTGSDHVFVASHDFGSCFHTLEDVAMKDGVPEIMKNSIVLQTFGVTY 210
Query: 138 FISGHIAHKDVSLPQIWPRQ-----EDPPKLGSSKRNKLAFFAGAV------------NS 180
V P + P E+ P G +R+ FF G + +
Sbjct: 211 DHPCQKVEHVVIPPFVSPESVRNTLENFPVNG--RRDIWVFFRGKMEVHPKNVSGRFYSK 268
Query: 181 PVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVP 240
VR + + + D Y R Y E+ S FCL G+ + R+ +S+ GCVP
Sbjct: 269 KVRTVIWKKFNGDRRFYLRRHRFAG-YQSEIARSVFCLCPLGWAPWSPRLVESVALGCVP 327
Query: 241 VIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVL--KVRKHF 298
VIIA+ LPF+ +NW S+ VA D+ L +IL+ +++ ++Q + + RK
Sbjct: 328 VIIADSIRLPFSSAVNWPEISVTVAEKDVWRLGEILEKVAATNLSIIQRNLWDPRTRKAL 387
Query: 299 QWHVFPSDYDAFYMVMYEL 317
++ + DA + V++ L
Sbjct: 388 LFNSRVHEGDATWQVLHSL 406
>gi|255565439|ref|XP_002523710.1| transferase, putative [Ricinus communis]
gi|223537014|gb|EEF38650.1| transferase, putative [Ricinus communis]
Length = 461
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 93/331 (28%), Positives = 141/331 (42%), Gaps = 42/331 (12%)
Query: 33 YASESYFKKAFMKSHFV-TKDPSKADLFFLPFSIARMRHDRRIGTEGIP------DFISH 85
+ASE KA S + T DP +AD FF+P ++ G P +S
Sbjct: 133 FASEVAIHKAISNSDDIRTFDPYEADFFFVPVYVSCNFST----INGFPAIGHARSLLSS 188
Query: 86 YIFNISQKYPYWNRTGGADHFYVACHSIG-------RSAMQKAWE--VKLNAIQVVCSSS 136
+ IS YP+WNR+ GADH +VA H G AMQ +K + I
Sbjct: 189 AVTFISTNYPFWNRSQGADHVFVASHDFGSCFHTLEERAMQDGVPEFLKKSIILQTFGVK 248
Query: 137 YFISGHIAHKDVSLPQIWP---RQEDPPKLGSSKRNKLAFFAGAV------------NSP 181
Y V P I P R + +R+ FF G + +
Sbjct: 249 YDHPCQQVENVVIPPYISPVSVRSTLKKAPLTGRRDIWVFFRGKMEVHPKNVSGRFYSKK 308
Query: 182 VREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPV 241
VR ++ + + D Y R Y E+ S FCL G+ + R+ +S+ GCVPV
Sbjct: 309 VRTEIWRRFNGDRRFYLQRHRFAG-YQSEIARSVFCLCPLGWAPWSPRLVESVALGCVPV 367
Query: 242 IIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVL--KVRKHFQ 299
IIA+ LPF + W + S+ VA D+ L +IL+ +++ L+Q + VR+
Sbjct: 368 IIADGIRLPFPSAVPWPAISLTVAEKDVAKLGRILEDVAATNLTLIQKNIWDPTVRRALL 427
Query: 300 WHVFPSDYDAFYMVMYELWLR----RSSVRV 326
++ + DA + V+Y L + R +VRV
Sbjct: 428 FNDQIEEGDATWQVLYALTKKLDRSRRTVRV 458
>gi|357163140|ref|XP_003579636.1| PREDICTED: probable glucuronosyltransferase Os04g0398600-like
[Brachypodium distachyon]
Length = 429
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 79/311 (25%), Positives = 141/311 (45%), Gaps = 29/311 (9%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + P + I IS
Sbjct: 90 FAAEIFMHRFLLSSAVRTLNPKEADWFYAPVYTTCDLTPAGLPLPFKSPRVMRSAIQYIS 149
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
K+P+WN+T GADHF+V H G +KA E + + + ++ H+ K
Sbjct: 150 NKWPFWNKTDGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQENHVCLK 209
Query: 147 DVSLPQIWPRQEDPPKLGS------SKRNKLAFFAG----AVNSP-----VREKLLQVWR 191
+ S+ I P P K+ + + R+ +F G N P R +W
Sbjct: 210 EGSI--IIPPFAPPQKMQAHLIPPDTPRSIFVYFRGLFYDTGNDPEGGYYARGARASLWE 267
Query: 192 N--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDL 249
N ++ ++ S Y +++ + FCL G+ + R+ +++ +GC+PVIIA+ L
Sbjct: 268 NFKNNPLFDISTEHPATYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL 327
Query: 250 PFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY-- 307
PFAD + W+ + + D+P L IL + E+ +L + +L Q +FP
Sbjct: 328 PFADAIPWEEIGVFIEEKDVPKLDTILTSMPIED-ILRKQRLLANPSMKQAMLFPQPAQA 386
Query: 308 -DAFYMVMYEL 317
DAF+ ++ L
Sbjct: 387 RDAFHQILNGL 397
>gi|413951430|gb|AFW84079.1| secondary cell wall glycosyltransferase family 47 [Zea mays]
Length = 421
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 79/309 (25%), Positives = 137/309 (44%), Gaps = 25/309 (8%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + P + I I+
Sbjct: 82 FAAEIFMHRFLLSSAVRTFNPEEADWFYTPVYATCDLTPSGLPLPFKSPRMMRSAIELIA 141
Query: 92 QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS-----GHIAHK 146
+PYWNR+ GADHF+V H G + + I + + + H+ K
Sbjct: 142 TNWPYWNRSEGADHFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLK 201
Query: 147 --DVSLPQIWPRQEDPPKL--GSSKRNKLAFFAG----AVNSP-----VREKLLQVWRN- 192
+++P P Q+ L + R+ +F G N P R VW N
Sbjct: 202 GGSITIPPFAPPQKMQAHLIPLDTPRSIFVYFRGLFYDTSNDPEGGYYARGARASVWENF 261
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
++ ++ S Y +++ S FCL G+ + R+ +++ +GC+PVIIA+ LPF
Sbjct: 262 KNNPLFDISTDHPPTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 321
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
AD + W+ + VA D+P L IL I + + +L + +L Q +FP D
Sbjct: 322 ADAIPWEEIGVFVAEEDVPKLDSILTSIPT-DVILRKQRLLANPAMKQAMLFPQPAQAGD 380
Query: 309 AFYMVMYEL 317
AF+ ++ L
Sbjct: 381 AFHQILNGL 389
>gi|42568020|ref|NP_197685.2| FRA8-like protein [Arabidopsis thaliana]
gi|75127070|sp|Q6NMM8.1|F8H_ARATH RecName: Full=Probable glucuronoxylan glucuronosyltransferase F8H;
AltName: Full=FRA8 homolog; AltName: Full=Protein
FRAGILE FIBER 8 homolog
gi|44681390|gb|AAS47635.1| At5g22940 [Arabidopsis thaliana]
gi|48958521|gb|AAT47813.1| At5g22940 [Arabidopsis thaliana]
gi|332005716|gb|AED93099.1| FRA8-like protein [Arabidopsis thaliana]
Length = 469
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 86/324 (26%), Positives = 145/324 (44%), Gaps = 49/324 (15%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIA----------RMRHDRRIGTEGIPDF 82
+A+E +A + S T DP +AD FF+P ++ + H R + + + DF
Sbjct: 136 FAAEVAIHRALLSSDVRTLDPDEADYFFVPVYVSCNFSTSNGFPSLSHARSLLSSAV-DF 194
Query: 83 ISHYIFNISQKYPYWNRTGGADHFYVACHSIGRS--AMQK-AWE------VKLNAIQVVC 133
+S + YP+WNR+ G+DH +VA H G AM+ A E +K + I
Sbjct: 195 LSDH-------YPFWNRSQGSDHVFVASHDFGACFHAMEDMAIEEGIPKFMKRSIILQTF 247
Query: 134 SSSYFISGHIAHKDVSLPQIWPRQ-----EDPPKLGSSKRNKLAFFAGAV---------- 178
Y V P I P E P G +R+ AFF G +
Sbjct: 248 GVKYKHPCQEVEHVVIPPYIPPESVQKAIEKAPVNG--RRDIWAFFRGKMEVNPKNISGR 305
Query: 179 --NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYY 236
+ VR +L+ + Y + R Y E++ S FCL G+ + R+ +S
Sbjct: 306 FYSKGVRTAILKKFGGRRRFYLNRHRFAG-YRSEIVRSVFCLCPLGWAPWSPRLVESAVL 364
Query: 237 GCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK--V 294
GCVPV+IA+ LPF++ + W S+ VA D+ L+K+L+ +++ +Q + +
Sbjct: 365 GCVPVVIADGIQLPFSETVQWPEISLTVAEKDVRNLRKVLEHVAATNLSAIQRNLHEPVF 424
Query: 295 RKHFQWHVFPSDYDAFYMVMYELW 318
++ ++V + DA + ++ LW
Sbjct: 425 KRALLYNVPMKEGDATWHILESLW 448
>gi|414882087|tpg|DAA59218.1| TPA: hypothetical protein ZEAMMB73_484283 [Zea mays]
Length = 479
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 81/293 (27%), Positives = 140/293 (47%), Gaps = 20/293 (6%)
Query: 52 DPSKADLFFLPF--SIARMRHDRRIGTE--GIPDFISHYIFNISQKYPYWNRTGGADHFY 107
DP ADL F+PF S++ RH R + E G + + P W R GGADH
Sbjct: 176 DPRDADLVFVPFFASLSYNRHSRPLPPEKVGRDKALQEKLVGYLTARPEWRRFGGADHVI 235
Query: 108 VACHSIGRSAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLP-QIWPRQEDPPKLGS 165
VA H +++ A A+ V+ Y KDV P + + G
Sbjct: 236 VAHHP---NSLLHARAALSPAVFVLSDFGRYPPRVASLEKDVIAPYKHMAKTFVNDSAGF 292
Query: 166 SKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----LLGSKFC 217
R L +F GA+ +R++L + +++ ++Y G ++ A + + SKFC
Sbjct: 293 DDRPTLLYFRGAIYRKEGGTIRQELYYMLKDEKDVYFSFGSVQDHGASKASQGMHSSKFC 352
Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PLLKK 274
L++ G ++ R+ D++ CVPVII++ +LP+ DVL++ FSI V + D L +
Sbjct: 353 LNIAGDTPSSNRMFDAIVSHCVPVIISDDIELPYEDVLDYSKFSIFVRSSDAVEKGHLMR 412
Query: 275 ILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
+L G+S + + + S + +V +HF++ DA M+ L + S++++
Sbjct: 413 LLSGVSKQRWTEMWSRLREVDRHFEYQYPSQKDDAVQMIWRSLSRKVPSIKLK 465
>gi|168059925|ref|XP_001781950.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666596|gb|EDQ53246.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 356
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 78/289 (26%), Positives = 131/289 (45%), Gaps = 31/289 (10%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGI---PDFISHYIFN 89
+ASE K + S T +P +AD FF+P + + + G + P + + +
Sbjct: 31 FASEVAIHKILLSSPIKTLNPYEADFFFMPVYFS-CKFSSKTGFPRLGHAPKLMEDAVNH 89
Query: 90 ISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH---- 145
+S +WNR+GG DH +VA H G E + I + SS + H
Sbjct: 90 VSSMMEFWNRSGGKDHVFVAAHDFGACFHSLESEAIAHGIPEIVQSSLILQTFGVHGFHP 149
Query: 146 ----KDVSLP------QIWPRQEDPPKLGSSKRNKLAFFAG--AVNSPVREKLLQVWRND 193
+++ +P ++ + PP+ +RN AFF G +N L+
Sbjct: 150 CQAAENIQIPPYISPSTVFSYVKKPPE--EQRRNIFAFFRGKMEINPKNVSGLVYSRGVR 207
Query: 194 SEIYAHSGR-----LKTPYAD----ELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIA 244
+ IY R LK AD +LL S FCL G+ + RI +++ YGCVPVIIA
Sbjct: 208 TYIYKKFSRNRRFFLKRHRADNYQLDLLRSTFCLCPLGWAPWSPRIVEAVAYGCVPVIIA 267
Query: 245 NHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK 293
++ LP++ ++W + S+ + D+ L KIL ++++ +Q + K
Sbjct: 268 DNIRLPYSHAIDWSNMSLNIREHDVHKLYKILLNVAAKNLSSIQKNLWK 316
>gi|108863947|gb|ABA91286.2| secondary cell wall-related glycosyltransferase family 47,
putative, expressed [Oryza sativa Japonica Group]
gi|215769393|dbj|BAH01622.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 476
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 79/297 (26%), Positives = 144/297 (48%), Gaps = 28/297 (9%)
Query: 52 DPSKADLFFLPF--SIARMRHDRRIGTE------GIPDFISHYIFNISQKYPYWNRTGGA 103
D AD+ F+PF S++ RH R + E G+ + + Y+ P W R+GGA
Sbjct: 172 DSRDADVVFVPFFASLSYNRHSRVVPPEKVSRDKGLQERLVRYLM----AQPEWKRSGGA 227
Query: 104 DHFYVACHSIGRSAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLP-QIWPRQEDPP 161
DH VA H +++ A V A+ V+ Y KDV P + +
Sbjct: 228 DHVIVAHHP---NSLLHARSVLFPAVFVLSDFGRYHPRVASLEKDVIAPYKHMAKTFVND 284
Query: 162 KLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----LLG 213
G R L +F GA+ +R++L + +++ ++Y G ++ A + +
Sbjct: 285 SAGFDDRPTLLYFRGAIFRKEGGNIRQELHYMLKDEKDVYFAFGSVQDHGASKASQGMHA 344
Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---P 270
SKFCL++ G ++ R+ D++ CVPVII++ +LP+ D L++ FSI V + D
Sbjct: 345 SKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDDIELPYEDALDYSKFSIFVRSSDAVKKG 404
Query: 271 LLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
L ++++G+S ++ ++ + +V KHF++ DA M+ L + ++R++
Sbjct: 405 YLMRLIRGVSKHQWTMMWRRLKEVDKHFEYQYPSQKDDAVQMIWQTLARKVPAIRLK 461
>gi|414864293|tpg|DAA42850.1| TPA: hypothetical protein ZEAMMB73_024068 [Zea mays]
Length = 434
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 85/318 (26%), Positives = 140/318 (44%), Gaps = 48/318 (15%)
Query: 6 RVYVY--PHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLPF 63
R+YVY P R N +A R +A+E +A + P ADLFF+P
Sbjct: 79 RIYVYDLPARFNRDWAAADARCS---RHLFAAEVAVHEALLAYAGRAARPEDADLFFVPV 135
Query: 64 SIA----------RMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
++ + H R + + + D + + PYWNR+ GADH +VA H
Sbjct: 136 YVSCNFSTPNGFPSLSHARGLLADAV-DLVR-------ARMPYWNRSAGADHVFVASHDF 187
Query: 114 GRSAMQKAWEVKL-NAI-----QVVCSSSYFISGHIAHKDVS----LPQIWPR-QEDPPK 162
G + +V + + I + + ++ + GH ++V P + P + P+
Sbjct: 188 G-ACFHPMEDVAIADGIPEFLKRSILLQTFGVQGHHVCQEVEHVVIPPHVPPEVAHELPE 246
Query: 163 LGSSKRNKLAFFAGAV------------NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE 210
++R+ AFF G + + VR +LLQ + + + Y R Y E
Sbjct: 247 PEKAQRDIFAFFRGKMEVHPKNISGRFYSKKVRTELLQHYGRNRKFYLKRKRFDN-YRSE 305
Query: 211 LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIP 270
+ S FCL G+ + R+ +S+ GC+PVIIA+ LPF VL W+ S+ VA DI
Sbjct: 306 MARSLFCLCPLGWAPWSPRLVESVLLGCIPVIIADDIRLPFPPVLQWQEISLQVAEKDIA 365
Query: 271 LLKKILKGISSEEYLLLQ 288
L +L + + ++Q
Sbjct: 366 SLGMVLDHVVATNLTVIQ 383
>gi|10177241|dbj|BAB10615.1| unnamed protein product [Arabidopsis thaliana]
Length = 498
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 86/324 (26%), Positives = 145/324 (44%), Gaps = 49/324 (15%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIA----------RMRHDRRIGTEGIPDF 82
+A+E +A + S T DP +AD FF+P ++ + H R + + + DF
Sbjct: 165 FAAEVAIHRALLSSDVRTLDPDEADYFFVPVYVSCNFSTSNGFPSLSHARSLLSSAV-DF 223
Query: 83 ISHYIFNISQKYPYWNRTGGADHFYVACHSIGRS--AMQK-AWE------VKLNAIQVVC 133
+S + YP+WNR+ G+DH +VA H G AM+ A E +K + I
Sbjct: 224 LSDH-------YPFWNRSQGSDHVFVASHDFGACFHAMEDMAIEEGIPKFMKRSIILQTF 276
Query: 134 SSSYFISGHIAHKDVSLPQIWPRQ-----EDPPKLGSSKRNKLAFFAGAV---------- 178
Y V P I P E P G +R+ AFF G +
Sbjct: 277 GVKYKHPCQEVEHVVIPPYIPPESVQKAIEKAPVNG--RRDIWAFFRGKMEVNPKNISGR 334
Query: 179 --NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYY 236
+ VR +L+ + Y + R Y E++ S FCL G+ + R+ +S
Sbjct: 335 FYSKGVRTAILKKFGGRRRFYLNRHRFAG-YRSEIVRSVFCLCPLGWAPWSPRLVESAVL 393
Query: 237 GCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK--V 294
GCVPV+IA+ LPF++ + W S+ VA D+ L+K+L+ +++ +Q + +
Sbjct: 394 GCVPVVIADGIQLPFSETVQWPEISLTVAEKDVRNLRKVLEHVAATNLSAIQRNLHEPVF 453
Query: 295 RKHFQWHVFPSDYDAFYMVMYELW 318
++ ++V + DA + ++ LW
Sbjct: 454 KRALLYNVPMKEGDATWHILESLW 477
>gi|212722274|ref|NP_001131151.1| uncharacterized protein LOC100192459 [Zea mays]
gi|194690716|gb|ACF79442.1| unknown [Zea mays]
gi|413955856|gb|AFW88505.1| hypothetical protein ZEAMMB73_716681 [Zea mays]
Length = 391
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 80/294 (27%), Positives = 138/294 (46%), Gaps = 26/294 (8%)
Query: 54 SKADLFFLPF--SIARMRHDRRI--GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVA 109
+ AD+ F+PF S++ RH R + + + + P W RTGG DH +A
Sbjct: 73 ADADVVFVPFFASLSFNRHSRVVPPARDSEDRALQRRLLEFLAARPEWRRTGGRDHVVLA 132
Query: 110 CHSIGR-SAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLP---QIWPRQEDPPKLG 164
H G A + W + V+C Y S KDV P + D G
Sbjct: 133 HHPNGMLDARYRFWP----CVFVLCDFGRYPPSVANLDKDVIAPYRHLVANFAND--TAG 186
Query: 165 SSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----LLGSKF 216
R L +F GA+ +R++L + +++ +++ G + ++ + SKF
Sbjct: 187 YDDRPTLLYFQGAIYRKDGGFIRQELYYLLKDEKDVHFSFGSVAGNGIEQATQGMRSSKF 246
Query: 217 CLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PLLK 273
CL++ G ++ R+ DS+ CVPV I++ +LPF DVL++ FS++V D L
Sbjct: 247 CLNIAGDTPSSNRLFDSIVSHCVPVTISDEIELPFEDVLDYSKFSVIVRGADAVKKGFLM 306
Query: 274 KILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
++KGIS EE+ + + + +V KHF++ DA M+ + + S+R++
Sbjct: 307 NLIKGISREEWTRMWNRLKEVEKHFEYQYPSQTDDAVQMIWKAIARKVPSIRLK 360
>gi|224075447|ref|XP_002304637.1| predicted protein [Populus trichocarpa]
gi|222842069|gb|EEE79616.1| predicted protein [Populus trichocarpa]
Length = 413
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 82/312 (26%), Positives = 136/312 (43%), Gaps = 31/312 (9%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P + + P + I +S
Sbjct: 74 FAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYPTCDLTPTGLPLPFNSPRMMRSAIQLLS 133
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
+PYWNRT GADHF+V H G +KA E + + + ++ H+
Sbjct: 134 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQRNHVCLN 193
Query: 147 DVSLPQIWPRQEDPPKLGS------SKRNKLAFFAGAV----NSP-----VREKLLQVWR 191
+ S+ P P K+ + + R+ +F G N P R VW
Sbjct: 194 EGSI--TIPPYAPPQKMQAHQIPLDTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWE 251
Query: 192 N--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDL 249
N ++ ++ S T Y +++ + FCL G+ + R+ +++ +GC+PVIIA+ L
Sbjct: 252 NFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL 311
Query: 250 PFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL----LLQSYVLKVRKHFQWHVFPS 305
PFAD + W+ + VA D+P L L I E L LL + +K F P
Sbjct: 312 PFADAIPWEEIGVFVAEEDVPHLDTFLTSIPPEVILRKQRLLANPSMKRAMLFPQPAQPG 371
Query: 306 DYDAFYMVMYEL 317
DAF+ ++ L
Sbjct: 372 --DAFHQILNGL 381
>gi|226503833|ref|NP_001149319.1| secondary cell wall-related glycosyltransferase family 47 precursor
[Zea mays]
gi|195626366|gb|ACG35013.1| secondary cell wall-related glycosyltransferase family 47 [Zea
mays]
Length = 421
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 79/309 (25%), Positives = 136/309 (44%), Gaps = 25/309 (8%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + P + I I+
Sbjct: 82 FAAEIFMHRFLLSSAVRTFNPEEADWFYTPVYATCDLTPSGLPLPFKSPRMMRSAIELIA 141
Query: 92 QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS-----GHIAHK 146
+PYWNR+ GADHF+V H G + + I + + + H+ K
Sbjct: 142 TNWPYWNRSEGADHFFVTPHDFGACFHYQEEKAIGRGILPLLQRATLVQTFGQKNHVCLK 201
Query: 147 --DVSLPQIWPRQEDPPKL--GSSKRNKLAFFAG----AVNSP-----VREKLLQVWRN- 192
+ +P P Q+ L + R+ +F G N P R VW N
Sbjct: 202 GGSIXIPPFAPPQKMQAHLIPLDTPRSIFVYFRGLFYDTSNDPEGGYYARGARASVWENF 261
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
++ ++ S Y +++ S FCL G+ + R+ +++ +GC+PVIIA+ LPF
Sbjct: 262 KNNPLFDISTDHPPTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 321
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
AD + W+ + VA D+P L IL I + + +L + +L Q +FP D
Sbjct: 322 ADAIPWEEIGVFVAEEDVPKLDSILTSIPT-DVILRKQRLLANPAMKQAMLFPQPAQAGD 380
Query: 309 AFYMVMYEL 317
AF+ ++ L
Sbjct: 381 AFHQILNGL 389
>gi|194701034|gb|ACF84601.1| unknown [Zea mays]
gi|413955857|gb|AFW88506.1| hypothetical protein ZEAMMB73_716681 [Zea mays]
Length = 462
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 80/294 (27%), Positives = 138/294 (46%), Gaps = 26/294 (8%)
Query: 54 SKADLFFLPF--SIARMRHDRRI--GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVA 109
+ AD+ F+PF S++ RH R + + + + P W RTGG DH +A
Sbjct: 144 ADADVVFVPFFASLSFNRHSRVVPPARDSEDRALQRRLLEFLAARPEWRRTGGRDHVVLA 203
Query: 110 CHSIGR-SAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLP---QIWPRQEDPPKLG 164
H G A + W + V+C Y S KDV P + D G
Sbjct: 204 HHPNGMLDARYRFWP----CVFVLCDFGRYPPSVANLDKDVIAPYRHLVANFAND--TAG 257
Query: 165 SSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----LLGSKF 216
R L +F GA+ +R++L + +++ +++ G + ++ + SKF
Sbjct: 258 YDDRPTLLYFQGAIYRKDGGFIRQELYYLLKDEKDVHFSFGSVAGNGIEQATQGMRSSKF 317
Query: 217 CLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PLLK 273
CL++ G ++ R+ DS+ CVPV I++ +LPF DVL++ FS++V D L
Sbjct: 318 CLNIAGDTPSSNRLFDSIVSHCVPVTISDEIELPFEDVLDYSKFSVIVRGADAVKKGFLM 377
Query: 274 KILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
++KGIS EE+ + + + +V KHF++ DA M+ + + S+R++
Sbjct: 378 NLIKGISREEWTRMWNRLKEVEKHFEYQYPSQTDDAVQMIWKAIARKVPSIRLK 431
>gi|356516800|ref|XP_003527081.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
[Glycine max]
Length = 493
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 77/295 (26%), Positives = 140/295 (47%), Gaps = 23/295 (7%)
Query: 50 TKDPSKADLFFLPF--SIARMRHDRRIGTEGIP--DFISHYIFNISQKYPYWNRTGGADH 105
+D S+AD+ F+PF S++ RH + G E + + + W R+GG DH
Sbjct: 184 VQDSSQADVIFVPFFSSLSYNRHSKLNGEEKVSLNKMLQDRLVQFLMGQKEWKRSGGKDH 243
Query: 106 FYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLPQIWPRQEDPPKL 163
VA H ++ KL A +V + +A+ KD+ P P
Sbjct: 244 LIVAHHPNSLLDARR----KLGAAMLVLADFGRYPTELANIKKDIIAPYRHLVSTIPKAK 299
Query: 164 GSS--KRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----LLG 213
+S KR L +F GA+ +R++L + +++ +++ G + ++ +
Sbjct: 300 SASFEKRTTLVYFQGAIYRKDGGAIRQELYYLLKDEKDVHFTFGSIGGNGINQASQGMAM 359
Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---P 270
SKFCL++ G ++ R+ D++ CVPVII++ +LPF DVL++ FSI V D
Sbjct: 360 SKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDEIELPFEDVLDYSDFSIFVRASDSMKKG 419
Query: 271 LLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
L +L+ I+ +E+ + + ++ HF++ DA M+ ++ + SS+R
Sbjct: 420 YLLNLLRSITQKEWSKMWERLKQITHHFEYQYPSQPGDAVNMIWQQVERKISSIR 474
>gi|194700342|gb|ACF84255.1| unknown [Zea mays]
gi|414587472|tpg|DAA38043.1| TPA: secondary cell wall glycosyltransferase family 47 [Zea mays]
Length = 427
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 79/311 (25%), Positives = 140/311 (45%), Gaps = 29/311 (9%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + + P + I IS
Sbjct: 88 FAAEIFMHRFLLSSAVRTLNPKEADWFYTPVYTTCDLTNAGLPLPFKSPRVMRSAIQYIS 147
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
K+P+WNRT GADHF+V H +KA E + + + ++ H+ K
Sbjct: 148 NKWPFWNRTDGADHFFVVPHDFAACFHYQEEKAIERGILPLLRRATLVQTFGQENHVCLK 207
Query: 147 DVSLPQIWPRQEDPPKLGS------SKRNKLAFFAG----AVNSP-----VREKLLQVWR 191
+ S+ I P P K+ + + R+ +F G N P R +W
Sbjct: 208 EGSI--IIPPYAPPQKMQAHLISPDTPRSIFVYFRGLFYDTGNDPEGGYYARGARASLWE 265
Query: 192 N--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDL 249
N + ++ S Y +++ + FCL G+ + R+ +++ +GC+PVIIA+ L
Sbjct: 266 NFKSNPLFDISTDHPATYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL 325
Query: 250 PFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY-- 307
PFAD + W+ + V D+P L IL + ++ +L + +L Q +FP
Sbjct: 326 PFADAIPWEEIGVFVEEKDVPKLDTILTSMPIDD-ILRKQRLLANPSMKQAMLFPQPAQP 384
Query: 308 -DAFYMVMYEL 317
DAF+ ++ L
Sbjct: 385 RDAFHQILNGL 395
>gi|226499262|ref|NP_001149267.1| secondary cell wall-related glycosyltransferase family 47 precursor
[Zea mays]
gi|195625922|gb|ACG34791.1| secondary cell wall-related glycosyltransferase family 47 [Zea
mays]
Length = 427
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 79/311 (25%), Positives = 140/311 (45%), Gaps = 29/311 (9%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + + P + I IS
Sbjct: 88 FAAEIFMHRFLLSSAVRTLNPKEADWFYTPVYTTCDLTNAGLPLPFKSPRVMRSAIQYIS 147
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
K+P+WNRT GADHF+V H +KA E + + + ++ H+ K
Sbjct: 148 NKWPFWNRTDGADHFFVVPHDFAACFHYQEEKAIERGILPLLRRATLVQTFGQENHVCLK 207
Query: 147 DVSLPQIWPRQEDPPKLGS------SKRNKLAFFAG----AVNSP-----VREKLLQVWR 191
+ S+ I P P K+ + + R+ +F G N P R +W
Sbjct: 208 EGSI--IIPPYAPPQKMQAHLISPDTPRSIFVYFRGLFYDTGNDPEGGYYARGARASLWE 265
Query: 192 N--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDL 249
N + ++ S Y +++ + FCL G+ + R+ +++ +GC+PVIIA+ L
Sbjct: 266 NFKSNPLFDISTDHPATYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL 325
Query: 250 PFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY-- 307
PFAD + W+ + V D+P L IL + ++ +L + +L Q +FP
Sbjct: 326 PFADAIPWEEIGVFVEEKDVPKLDTILTSMPIDD-ILRKQRLLANPSMKQAMLFPQPAQP 384
Query: 308 -DAFYMVMYEL 317
DAF+ ++ L
Sbjct: 385 RDAFHQILNGL 395
>gi|242072854|ref|XP_002446363.1| hypothetical protein SORBIDRAFT_06g014730 [Sorghum bicolor]
gi|241937546|gb|EES10691.1| hypothetical protein SORBIDRAFT_06g014730 [Sorghum bicolor]
Length = 430
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 80/309 (25%), Positives = 140/309 (45%), Gaps = 25/309 (8%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + + P + I IS
Sbjct: 91 FAAEIFMHRFLLSSAVRTLNPKEADWFYTPVYTTCDLTNAGLPLPFKSPRVMRSAIQYIS 150
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
K+P+WNRT GADHF+V H +KA E + + + ++ H+ K
Sbjct: 151 NKWPFWNRTDGADHFFVVPHDFAACFHYQEEKAIERGILPLLRRATLVQTFGQENHVCLK 210
Query: 147 DVSL--PQIWPRQEDPPKLGS--SKRNKLAFFAG----AVNSP-----VREKLLQVWRN- 192
+ S+ P P Q+ L S + R+ +F G N P R +W N
Sbjct: 211 EGSIIIPPFAPPQKMQAHLISPDTPRSIFVYFRGLFYDTGNDPEGGYYARGARASLWENF 270
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
+ ++ S Y +++ + FCL G+ + R+ +++ +GC+PVIIA+ LPF
Sbjct: 271 KSNPLFDISTDHPATYYEDMQRAVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPF 330
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
AD + W+ + V D+P L IL + ++ +L + +L Q +FP D
Sbjct: 331 ADAIPWEEIGVFVEEKDVPKLDTILTSMPIDD-ILRKQRLLANPSMKQAMLFPQPAQPRD 389
Query: 309 AFYMVMYEL 317
AF+ ++ L
Sbjct: 390 AFHQILNGL 398
>gi|293334733|ref|NP_001169191.1| uncharacterized protein LOC100383044 [Zea mays]
gi|223975431|gb|ACN31903.1| unknown [Zea mays]
gi|413957212|gb|AFW89861.1| hypothetical protein ZEAMMB73_311893 [Zea mays]
Length = 428
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 75/291 (25%), Positives = 127/291 (43%), Gaps = 41/291 (14%)
Query: 30 RGNYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIA----------RMRHDRRIGTEGI 79
R +A+E +A + P ADLFF+P ++ + H R + + +
Sbjct: 98 RHLFAAEVAVHEALLAYTGRAARPEDADLFFVPVYVSCNFSTPNGFPSLSHARGMLADAV 157
Query: 80 PDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAI-----QVVCS 134
D + PYWNR+ GADH +VA H G + I + +
Sbjct: 158 -DLVQ-------AGMPYWNRSAGADHVFVASHDFGACFHPMEDVAIADGIPEFLKRSILL 209
Query: 135 SSYFISGHIAHKDVS----LPQIWPRQE-DPPKLGSSKRNKLAFFAGAV----------- 178
++ + GH ++V P + P E + P+ ++R+ AFF G +
Sbjct: 210 QTFGVQGHHTCQEVEHVVIPPHVPPEVEHELPEPEKAQRDIFAFFRGKMEVHPKNISGRF 269
Query: 179 -NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYG 237
+ VR +LLQ + + + Y R Y E+ S FCL G+ + R+ +S+ G
Sbjct: 270 YSKKVRTELLQHYGRNRKFYLKRKRFDN-YRSEMARSLFCLCPLGWAPWSPRLVESVLLG 328
Query: 238 CVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
C+PVIIA++ +PF VL W S+ VA D+ L+ +L + + ++Q
Sbjct: 329 CIPVIIADNIRMPFPSVLQWPEISLQVAEKDVANLEVVLDHVVATNLSVIQ 379
>gi|357122399|ref|XP_003562903.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
[Brachypodium distachyon]
Length = 498
Score = 94.4 bits (233), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 85/283 (30%), Positives = 137/283 (48%), Gaps = 44/283 (15%)
Query: 52 DPSKADLFFLPF--SIARMRHDRR---------------IGTEGIPDFISHYIFNISQKY 94
DP++ADLF++PF S++ + + R E + D + ++ ++
Sbjct: 173 DPAEADLFYVPFFSSLSLVVNPIRPLAAANASVAAAEPAYSDEAMQDELVEWL----ERQ 228
Query: 95 PYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLP- 151
PYW R G DH ++ C +A+ + + NA+ +V S + G A KDV LP
Sbjct: 229 PYWRRHRGRDHVFI-CQDP--NALYRVVDRISNAVLLV-SDFGRLRGDQASLVKDVILPY 284
Query: 152 --QIWPRQEDPPKLGSSKRNKLAFFAG----AVNSPVREKLLQVWRNDSEIYAHSG---R 202
+I P Q D + R L FF G VR+ L QV N+ ++ G R
Sbjct: 285 SHRINPFQGD---VSIEARPALLFFMGNRYRKEGGKVRDTLFQVLENEGDVIIKHGTQSR 341
Query: 203 LKTPYADELL-GSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFS 261
+ A + + SKFCLH G + R+ D+L CVPVII++H +LPF DV+++ + S
Sbjct: 342 VSRRMATQGMHSSKFCLHPAGDTPSACRLFDALVSLCVPVIISDHIELPFEDVIDYSNIS 401
Query: 262 IVVAT---LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
I V T + L +L+ +SSE L Q + +V+ +F++
Sbjct: 402 IFVDTSKAVQPGFLTSMLRRVSSERILEYQREIKRVKHYFEYE 444
>gi|242041079|ref|XP_002467934.1| hypothetical protein SORBIDRAFT_01g036720 [Sorghum bicolor]
gi|241921788|gb|EER94932.1| hypothetical protein SORBIDRAFT_01g036720 [Sorghum bicolor]
Length = 462
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 80/294 (27%), Positives = 137/294 (46%), Gaps = 26/294 (8%)
Query: 54 SKADLFFLPF--SIARMRHDRRI--GTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVA 109
+ AD+ F+PF S++ RH R + + + + P W RTGG DH +A
Sbjct: 144 ADADVVFVPFFASLSFNRHSRVVPPARDSEDRALQRRLLEFLAARPEWRRTGGRDHVVLA 203
Query: 110 CHSIGR-SAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLP---QIWPRQEDPPKLG 164
H G A + W + V+C Y S KDV P + D G
Sbjct: 204 HHPNGMLDARYRFWP----CVFVLCDFGRYPPSVANLDKDVIAPYRHLVANFAND--TAG 257
Query: 165 SSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----LLGSKF 216
R L +F GA+ +R++L + +++ +++ G + ++ + SKF
Sbjct: 258 YDDRPTLLYFQGAIYRKDGGSIRQELYYLLKDEKDVHFSFGSVAGNGIEQSTHGMRSSKF 317
Query: 217 CLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PLLK 273
CL++ G ++ R+ DS+ CVPVII++ +LPF DVL++ FS++V D L
Sbjct: 318 CLNIAGDTPSSNRLFDSIVSHCVPVIISDEIELPFEDVLDYSKFSVIVRGADAVKKGFLM 377
Query: 274 KILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
++ GIS EE+ + + + +V KHF + DA M+ + + S+R++
Sbjct: 378 SLITGISQEEWAHMWNKLKEVEKHFVYQYPSQTDDAVQMIWKAIARKVPSIRLK 431
>gi|356547155|ref|XP_003541982.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
IRX7-like [Glycine max]
Length = 458
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 80/289 (27%), Positives = 127/289 (43%), Gaps = 41/289 (14%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIP------DFISHY 86
+ASE +A + S T DP +AD FF+P ++ G P IS
Sbjct: 124 FASEVAIHRALLTSEVRTFDPYEADFFFVPVYVS----CNFSAVNGFPAIGHARTLISSA 179
Query: 87 IFNISQKYPYWNRTGGADHFYVACHSIGRS-------AMQKAWEVKLNAIQVVCSSSYFI 139
+ +S +YP+WNR+ G+DH +VA H G AM K+ +V + I
Sbjct: 180 VNLVSTEYPFWNRSRGSDHVFVASHDFGACFHTLEDVAMADGIP-KILKNSIVLQTFGVI 238
Query: 140 SGHIAH--KDVSLP------QIWPRQEDPPKLGSSKRNKLAFFAGAV------------N 179
H ++V +P + E P G +R+ AFF G + +
Sbjct: 239 HPHPCQDVENVVIPPYVAPESVRSTLEKFPVNG--RRDIWAFFRGKMEVHPKNVSGQFYS 296
Query: 180 SPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCV 239
VR ++ + + D Y R Y E+ S FCL G+ + R+ +S+ GCV
Sbjct: 297 KRVRTEIWRKFNGDRRFYLQRRRFAG-YQLEIARSVFCLCPLGWAPWSPRLVESVALGCV 355
Query: 240 PVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
PV+IA+ LPF+ + W S+ VA D+ L KIL+ +++ ++Q
Sbjct: 356 PVVIADGIRLPFSSAVRWSEISLTVAERDVGKLGKILERVAATNLSVIQ 404
>gi|168049543|ref|XP_001777222.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162671450|gb|EDQ58002.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 368
Score = 94.0 bits (232), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 80/285 (28%), Positives = 128/285 (44%), Gaps = 33/285 (11%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGI---PDFISHYIFN 89
+ASE K + S T DP +AD FF+P ++ + + G + P + + +
Sbjct: 46 FASEVAIHKILLTSPIRTLDPYEADFFFMPVYVS-CKFSPKTGFPWLGHAPKLMQAAVNH 104
Query: 90 ISQKYPYWNRTGGADHFYVACHSIG--------RSAMQKAWEVKLNAI--QVVCSSSYFI 139
+S K +WNR+ G DH +VA H G ++ Q + N++ Q +
Sbjct: 105 VSTKMEFWNRSWGRDHIFVAAHDYGACFHTLETQAIAQGIPQFMRNSLILQTFGVKGFHP 164
Query: 140 SGHIAHKDVSLPQIWPRQ-----EDPPKLGSSKRNKLAFFAG--AVNSPVREKLLQVWRN 192
H + P I P +DP L +R+ A+F G +N LL
Sbjct: 165 CQAAEHIQIP-PYISPSVAVSYVKDP--LEHQQRDIFAYFRGKMEINPKNVSGLLYSKGI 221
Query: 193 DSEIYAHSGR-----LKTPYAD----ELLGSKFCLHVKGFEVNTARIADSLYYGCVPVII 243
+ +Y R LK D E+L S FCL G+ + RI +++ YGC+PVII
Sbjct: 222 RTVLYKRFSRNKRFVLKRHRVDNSQQEMLRSTFCLCPLGWAPWSPRIVEAVTYGCIPVII 281
Query: 244 ANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
A++ LP++ ++W S S+ V D+P L KIL G++ +Q
Sbjct: 282 ADNISLPYSHTIDWSSISLTVPEHDVPKLDKILIGVAVTNLTAIQ 326
>gi|357149222|ref|XP_003575040.1| PREDICTED: probable glucuronosyltransferase Os02g0520750-like
[Brachypodium distachyon]
Length = 428
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 81/312 (25%), Positives = 136/312 (43%), Gaps = 31/312 (9%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + S T +P +AD F+ P ++ + P + I IS
Sbjct: 89 FAAEIFMHRFLFSSAVRTVNPEEADWFYTPVYTTCDLTRAGLPLPFKSPRMMRSAIQFIS 148
Query: 92 QKYPYWNRTGGADHFYVACHSIG-------RSAMQKAWEVKLNAIQVVCSSSYFISGHIA 144
K+P+WNRT G DHF+V H +A+ + L +V ++ H+
Sbjct: 149 NKWPFWNRTDGGDHFFVVPHDFAACFHYQEENAIARGILPLLRHATLV--QTFGQKNHVC 206
Query: 145 HKD--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAGAV----NSP-----VREKLLQVWR 191
KD +++P P Q+ L + R+ +F G N P R +W
Sbjct: 207 LKDGSITIPPYAPPQKMQAHLIPPDTPRSIFVYFRGLFYDNGNDPEGGYYARGARASLWE 266
Query: 192 N--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDL 249
N ++ ++ S Y +++ S FCL G+ + R+ +++ +GC+PVIIA+ L
Sbjct: 267 NFKNNPLFDISTEHPATYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVL 326
Query: 250 PFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL----LLQSYVLKVRKHFQWHVFPS 305
PFAD + W + V D+P L IL I ++ L LL + +K F P
Sbjct: 327 PFADAIPWDEIGVFVDEEDVPKLDSILTSIPIDDILRKQRLLANPSMKKAMLFPQPAQPR 386
Query: 306 DYDAFYMVMYEL 317
DAF+ ++ L
Sbjct: 387 --DAFHQILNGL 396
>gi|225436482|ref|XP_002275679.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase IRX7
[Vitis vinifera]
gi|297734915|emb|CBI17149.3| unnamed protein product [Vitis vinifera]
Length = 460
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 82/319 (25%), Positives = 140/319 (43%), Gaps = 39/319 (12%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPD------FISHY 86
+ASE KA +S T DP +AD FF+P ++ G P ++
Sbjct: 125 FASEVALHKALQESDVRTFDPWEADFFFVPVYVS----CNFSTVNGFPAIGHARPLLASA 180
Query: 87 IFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAI-----QVVCSSSYFISG 141
I +IS + P+WNR+ GADH +VA H G + + I + + ++ +
Sbjct: 181 IQHISTQLPFWNRSLGADHVFVASHDYGACFHAMEDVARADGIPEFLKKSIILQTFGVKH 240
Query: 142 HIAHKDVSLPQIWPRQEDPPKLGSS--------KRNKLAFFAGAV------------NSP 181
+DV I P P K+ S+ +R+ FF G + +
Sbjct: 241 QHPCQDVENVLI-PPYVSPEKVQSTLDSAPANGQRDIWVFFRGKMEVHPKNISGRFYSKA 299
Query: 182 VREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPV 241
VR + Q + + + Y R Y E++ S FCL G+ + R+ +S+ GCVPV
Sbjct: 300 VRTAIWQKYGGNRKFYLKRHRFAG-YQSEIVRSVFCLCPLGWAPWSPRLVESVVLGCVPV 358
Query: 242 IIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVL--KVRKHFQ 299
IIA+ LPF++ + W S+ VA D+ L IL+ +++ +Q + + ++
Sbjct: 359 IIADGIRLPFSEAIRWPEISLTVAEKDVGKLGMILEDVAATNLSTIQKNLWDPENKRALL 418
Query: 300 WHVFPSDYDAFYMVMYELW 318
++ + DA + V+ LW
Sbjct: 419 FNNQVQEGDATWQVLNALW 437
>gi|413915952|gb|AFW55884.1| secondary cell wall glycosyltransferase family 47 [Zea mays]
Length = 484
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 80/293 (27%), Positives = 139/293 (47%), Gaps = 20/293 (6%)
Query: 52 DPSKADLFFLPF--SIARMRHDRRIGTEGIPD--FISHYIFNISQKYPYWNRTGGADHFY 107
D ADL F+PF S++ RH R + E + + + P W R GGADH
Sbjct: 181 DSRDADLVFVPFFASLSYNRHYRPVPPEKVSRDRVLQEKLVRYLAARPEWRRYGGADHVI 240
Query: 108 VACHSIGRSAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLP-QIWPRQEDPPKLGS 165
VA H +++ A V A+ V+ Y KDV P + + G
Sbjct: 241 VAHHP---NSLLHARAVLHPAVFVLSDFGRYPPRVASLEKDVIAPYKHMAKTYANDSAGF 297
Query: 166 SKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----LLGSKFC 217
R L +F GA+ +R++L + + + ++Y G ++ A + + SKFC
Sbjct: 298 DDRPTLLYFRGAIYRKEGGSIRQELYYMLKEEKDVYFSFGSVQDHGASKASQGMHSSKFC 357
Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PLLKK 274
L++ G ++ R+ D++ CVPVII++ +LP+ DVL++ FSI V + D L +
Sbjct: 358 LNIAGDTPSSNRLFDAIVTHCVPVIISDDIELPYEDVLDYSKFSIFVRSSDAVKKGYLMR 417
Query: 275 ILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
+L G+S +++ + + +V KHF++ DA M+ L + S++++
Sbjct: 418 LLSGVSKQQWTKMWDRLKEVDKHFEYQYPSQKDDAVQMIWQALSRKVPSIKLK 470
>gi|218185155|gb|EEC67582.1| hypothetical protein OsI_34941 [Oryza sativa Indica Group]
Length = 483
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 79/304 (25%), Positives = 144/304 (47%), Gaps = 35/304 (11%)
Query: 52 DPSKADLFFLPF--SIARMRHDRRIGTE------GIPDFISHYIFNISQKYPYWNRTGGA 103
D AD+ F+PF S++ RH R + E G+ + + Y+ P W R+GGA
Sbjct: 172 DSRDADVVFVPFFASLSYNRHSRVVPPEKVSRDKGLQERLVRYLM----AQPEWKRSGGA 227
Query: 104 DHFYVACHSIGRSAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLP-QIWPRQEDPP 161
DH VA H +++ A V A+ V+ Y KDV P + +
Sbjct: 228 DHVIVAHHP---NSLLHARSVLFPAVFVLSDFGRYHPRVASLEKDVIAPYKHMAKTFVND 284
Query: 162 KLGSSKRNKLAFFAGAV-----------NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE 210
G R L +F GA+ +R++L + +++ ++Y G ++ A +
Sbjct: 285 SAGFDDRPTLLYFRGAIFRKEVKIDSWKGGNIRQELHYMLKDEKDVYFAFGSVQDHGASK 344
Query: 211 ----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
+ SKFCL++ G ++ R+ D++ CVPVII++ +LP+ D L++ FSI V +
Sbjct: 345 ASQGMHASKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDDIELPYEDALDYSKFSIFVRS 404
Query: 267 LDI---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSS 323
D L ++++G+S ++ ++ + +V KHF++ DA M+ L + +
Sbjct: 405 SDAVKKGYLMRLIRGVSKHQWTMMWRRLKEVDKHFEYQYPSQKDDAVQMIWQTLARKVPA 464
Query: 324 VRVQ 327
+R++
Sbjct: 465 IRLK 468
>gi|356539378|ref|XP_003538175.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
IRX7-like [Glycine max]
Length = 460
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 81/290 (27%), Positives = 124/290 (42%), Gaps = 43/290 (14%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIP------DFISHY 86
+ASE +A + S T DP AD FF+P ++ G P I+
Sbjct: 128 FASEVAIHRALLTSEVRTFDPYDADFFFVPVYVS----CNFSTVNGFPAIGHARSLIASA 183
Query: 87 IFNISQKYPYWNRTGGADHFYVACHSIGRS-------AMQKAW-EVKLNAIQVVCSSSYF 138
+ IS +YP+WNR+ G+DH +VA H G AM E+ N+I V +
Sbjct: 184 VNLISSEYPFWNRSRGSDHVFVASHDFGSCFHTLEDVAMADGVPEIVRNSI--VLQTFGV 241
Query: 139 ISGHIAHKD--------VSLPQIWPRQEDPPKLGSSKRNKLAFFAGAV------------ 178
+ H K VS + E+ P G +R+ AFF G +
Sbjct: 242 VFDHPCQKVEHVVIPPYVSPESVRDTMENFPVDG--RRDIWAFFRGKMEVHPKNVSGRFY 299
Query: 179 NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGC 238
+ VR + + + D Y R Y E+ S FCL G+ + R+ +S+ GC
Sbjct: 300 SKEVRTVIWRKFNGDRRFYLQRHRFAG-YQSEIARSVFCLCPLGWAPWSPRLVESVALGC 358
Query: 239 VPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
VPV+IA+ LPF + W S+ VA D+ L +IL+ +++ +Q
Sbjct: 359 VPVVIADGIRLPFVSAVKWSEISVTVAEKDVGRLAEILERVAATNLSTIQ 408
>gi|449484890|ref|XP_004157009.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10-like [Cucumis
sativus]
Length = 417
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 81/314 (25%), Positives = 140/314 (44%), Gaps = 35/314 (11%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIP------DFISHY 86
+A+E Y + + S T +P +AD F+ P + + G+P +
Sbjct: 78 FAAEIYMHRFLLNSPVRTLNPDEADWFYTPIYVTC-----DLTPNGLPLPFKSPRMMRSA 132
Query: 87 IFNISQKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISG 141
I IS +PYWNRT GADHF+V H G +KA + + + + ++
Sbjct: 133 IQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIDRGILPLLQRATLVQTFGQRN 192
Query: 142 HIAHKD--VSLPQIWPRQEDPPKLGSSK--RNKLAFFAGAV----NSP-----VREKLLQ 188
H+ + +++P P Q+ L S+ R+ +F G N P R
Sbjct: 193 HVCLNEGSITIPPYCPPQKMKTHLIPSETPRSIFVYFRGLFYDVNNDPEGGYYARGARAA 252
Query: 189 VWRN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANH 246
VW N ++ ++ S T Y +++ + FCL G+ + R+ +++ +GC+PVIIA+
Sbjct: 253 VWENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 312
Query: 247 YDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSD 306
LPFAD + W+ + V D+ L IL I + +L + +L + +FP
Sbjct: 313 IVLPFADAIPWEEIGVFVDEKDVSNLDTILTSIPP-DVILRKQRLLANPSMKRAMMFPQP 371
Query: 307 Y---DAFYMVMYEL 317
DAF+ ++ L
Sbjct: 372 AQSGDAFHQILNGL 385
>gi|115487106|ref|NP_001066040.1| Os12g0124400 [Oryza sativa Japonica Group]
gi|77552918|gb|ABA95714.1| secondary cell wall-related glycosyltransferase family 47,
putative, expressed [Oryza sativa Japonica Group]
gi|113648547|dbj|BAF29059.1| Os12g0124400 [Oryza sativa Japonica Group]
gi|215678612|dbj|BAG92267.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218186350|gb|EEC68777.1| hypothetical protein OsI_37312 [Oryza sativa Indica Group]
Length = 475
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 76/293 (25%), Positives = 140/293 (47%), Gaps = 20/293 (6%)
Query: 52 DPSKADLFFLPF--SIARMRHDRRIGTEGIP--DFISHYIFNISQKYPYWNRTGGADHFY 107
D AD+ F+PF S++ RH R + E + + + P W R+GGADH
Sbjct: 172 DSRDADVVFVPFFASLSYNRHSRVVPPEKVSRDKELQEKLVRYLMAQPEWKRSGGADHVI 231
Query: 108 VACHSIGRSAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLP-QIWPRQEDPPKLGS 165
VA H +++ A V + V+ Y KDV P + + G
Sbjct: 232 VAHHP---NSLLHARSVLFPVVFVLSDFGRYHPRVASLEKDVIAPYKHMAKTFVNDSAGF 288
Query: 166 SKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----LLGSKFC 217
R L +F GA+ +R++L + +++ ++Y G ++ A + + SKFC
Sbjct: 289 DDRPTLLYFRGAIFRKEGGNIRQELYYMLKDEKDVYFAFGSVQDHGASKASKGMHASKFC 348
Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PLLKK 274
L++ G ++ R+ D++ CVPVII++ +LP+ D L++ FSI V + D L +
Sbjct: 349 LNIAGDTPSSNRLFDAIVSHCVPVIISDDIELPYEDALDYSKFSIFVRSSDAVKKGYLMR 408
Query: 275 ILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
+++G+S ++ + + + +V KHF++ DA M+ L + ++R++
Sbjct: 409 LIRGVSKHQWTRMWNRLKEVDKHFEYQYPSQKDDAVQMIWQALARKVPAIRLK 461
>gi|356542668|ref|XP_003539788.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
IRX7-like [Glycine max]
Length = 461
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 79/288 (27%), Positives = 124/288 (43%), Gaps = 39/288 (13%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIP------DFISHY 86
+ASE +A + S T DP AD FF+P ++ G P I+
Sbjct: 129 FASEVAIHRALLTSEVRTFDPYDADFFFVPVYVS----CNFSTVNGFPAIGHARSLIASA 184
Query: 87 IFNISQKYPYWNRTGGADHFYVACHSIGRS-------AMQKAW-EVKLNAIQVVCSSSYF 138
+ +S +YP+WNR+ G+DH +VA H G AM E+ N+I + +
Sbjct: 185 VSLVSSEYPFWNRSRGSDHVFVASHDFGSCFHTLEDVAMADGVPEIMRNSIVLQTFGVVY 244
Query: 139 ISGHIAHKDVSLP------QIWPRQEDPPKLGSSKRNKLAFFAGAV------------NS 180
+ + V +P + E+ P G +R+ AFF G + +
Sbjct: 245 DHPCQSVEHVVIPPYVSPESVRDTMENFPVNG--RRDIWAFFRGKMELHPKNVSGRFYSK 302
Query: 181 PVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVP 240
VR + + + D Y R Y E+ S FCL G+ + R+ +S+ GCVP
Sbjct: 303 KVRTVIWRKFNGDRRFYLQRQRFAG-YQSEIARSVFCLCPLGWAPWSPRLVESVALGCVP 361
Query: 241 VIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
VIIA+ LPF + W SI VA D+ L +IL+ +++ +Q
Sbjct: 362 VIIADGIRLPFISAVKWPEISITVAEKDVGRLAEILERVAATNLSTIQ 409
>gi|359491711|ref|XP_002284930.2| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase isoform 1
[Vitis vinifera]
Length = 498
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 76/294 (25%), Positives = 141/294 (47%), Gaps = 21/294 (7%)
Query: 50 TKDPSKADLFFLPF--SIARMRHDRRIGTE--GIPDFISHYIFNISQKYPYWNRTGGADH 105
K+ S+AD+ F+PF S++ RH R G E + + + N W + GG +H
Sbjct: 189 VKNSSQADIIFVPFFSSLSYNRHSRLHGKEKVSVNKMLQDKLVNFLMGQDEWKQLGGKNH 248
Query: 106 FYVACHSIGRSAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLPQIWPRQEDPPKLG 164
VA H ++M A + +A+ V+ Y + KDV P + +P
Sbjct: 249 LIVAHHP---NSMLDARKKLGSAMFVLADFGRYPVEIANIDKDVIAPYKHVLRSNPVADS 305
Query: 165 SS--KRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLK----TPYADELLGS 214
++ R L +F GA+ +R++L + R++ +++ G ++ ++ + S
Sbjct: 306 ATFEGRPLLVYFQGAIYRKDGGAIRQELYYLLRDEKDVHFTFGSVRGNGINGASEGMASS 365
Query: 215 KFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PL 271
KFCL++ G ++ R+ D++ CVPVII++ +LPF DVL++ F I V D
Sbjct: 366 KFCLNIAGDTPSSNRLFDAIVSHCVPVIISDEIELPFEDVLDYSEFCIFVRASDAVKNGF 425
Query: 272 LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
L +L+GI E++ + + ++ HF++ DA M+ + + SS++
Sbjct: 426 LLNLLRGIKREKWTKMWERLKEIAHHFEYQYPSQAGDAVDMIWGAVSRKISSIQ 479
>gi|297733976|emb|CBI15223.3| unnamed protein product [Vitis vinifera]
Length = 455
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 76/294 (25%), Positives = 141/294 (47%), Gaps = 21/294 (7%)
Query: 50 TKDPSKADLFFLPF--SIARMRHDRRIGTE--GIPDFISHYIFNISQKYPYWNRTGGADH 105
K+ S+AD+ F+PF S++ RH R G E + + + N W + GG +H
Sbjct: 146 VKNSSQADIIFVPFFSSLSYNRHSRLHGKEKVSVNKMLQDKLVNFLMGQDEWKQLGGKNH 205
Query: 106 FYVACHSIGRSAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLPQIWPRQEDPPKLG 164
VA H ++M A + +A+ V+ Y + KDV P + +P
Sbjct: 206 LIVAHHP---NSMLDARKKLGSAMFVLADFGRYPVEIANIDKDVIAPYKHVLRSNPVADS 262
Query: 165 SS--KRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLK----TPYADELLGS 214
++ R L +F GA+ +R++L + R++ +++ G ++ ++ + S
Sbjct: 263 ATFEGRPLLVYFQGAIYRKDGGAIRQELYYLLRDEKDVHFTFGSVRGNGINGASEGMASS 322
Query: 215 KFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PL 271
KFCL++ G ++ R+ D++ CVPVII++ +LPF DVL++ F I V D
Sbjct: 323 KFCLNIAGDTPSSNRLFDAIVSHCVPVIISDEIELPFEDVLDYSEFCIFVRASDAVKNGF 382
Query: 272 LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
L +L+GI E++ + + ++ HF++ DA M+ + + SS++
Sbjct: 383 LLNLLRGIKREKWTKMWERLKEIAHHFEYQYPSQAGDAVDMIWGAVSRKISSIQ 436
>gi|226529660|ref|NP_001151894.1| secondary cell wall-related glycosyltransferase family 47 [Zea
mays]
gi|195650683|gb|ACG44809.1| secondary cell wall-related glycosyltransferase family 47 [Zea
mays]
Length = 488
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 80/293 (27%), Positives = 139/293 (47%), Gaps = 20/293 (6%)
Query: 52 DPSKADLFFLPF--SIARMRHDRRIGTEGIPD--FISHYIFNISQKYPYWNRTGGADHFY 107
D ADL F+PF S++ RH R + E + + + P W R GGADH
Sbjct: 185 DSRDADLVFVPFFASLSYNRHYRPVPPEKVSRDRALQEKLVRYLAARPEWRRFGGADHVI 244
Query: 108 VACHSIGRSAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLP-QIWPRQEDPPKLGS 165
VA H +++ A V A+ V+ Y KDV P + + G
Sbjct: 245 VAHHP---NSLLHARAVLHPAVFVLSDFGRYPPRVASLEKDVIAPYKHMAKTYANDSAGF 301
Query: 166 SKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----LLGSKFC 217
R L +F GA+ +R++L + + + ++Y G ++ A + + SKFC
Sbjct: 302 DDRPTLLYFRGAIYRKEGGSIRQELYYMLKEEKDVYFSFGSVQDHGASKASQGMHSSKFC 361
Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PLLKK 274
L++ G ++ R+ D++ CVPVII++ +LP+ DVL++ FSI V + D L +
Sbjct: 362 LNIAGDTPSSNRLFDAIVTHCVPVIISDDIELPYEDVLDYSKFSIFVRSSDAVKKGYLMR 421
Query: 275 ILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
+L G+S +++ + + +V KHF++ DA M+ L + S++++
Sbjct: 422 LLSGVSKQQWTKMWDRLKEVDKHFEYQYPSQKDDAVQMIWQALSRKVPSIKLK 474
>gi|413932967|gb|AFW67518.1| hypothetical protein ZEAMMB73_420245 [Zea mays]
Length = 386
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 80/311 (25%), Positives = 140/311 (45%), Gaps = 30/311 (9%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + P + I I+
Sbjct: 48 FAAEIFMHRLLLSSAVRTFNPEEADWFYTPVYATCDLTPSGLPLPFKSPRMMLSAIELIA 107
Query: 92 QKYPYWNRTGGADHFYVACHSIG-------RSAMQKAWEVKLNAIQVVCSSSYFISGHIA 144
+PYWNR+ GADHF+V H G A+ + L +V ++ H+
Sbjct: 108 TNWPYWNRSEGADHFFVTPHDFGACFHYQDEKAIGRGILPLLQHATLV--QTFGQKNHVC 165
Query: 145 HK--DVSLPQIWPRQEDPPKL--GSSKRNKLAFFAG----AVNSP-----VREKLLQVWR 191
K +++P P Q+ L + R+ +F G N P R VW
Sbjct: 166 LKGGSITIPPFAPPQKMQAHLIPADTPRSIFVYFRGLFYDTSNDPEGGYYARGARASVWE 225
Query: 192 N--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDL 249
N ++ ++ S + Y +++ S FCL G+ + R+ +++ +GC+P+IIA+ L
Sbjct: 226 NFKNNPLFDISTDHPSTYYEDMERSVFCLCPLGWAPWSPRLVEAVVFGCIPLIIADIV-L 284
Query: 250 PFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY-- 307
PFAD + W+ + VA D+P L IL I + + +L + +L Q +FP
Sbjct: 285 PFADAIPWEEIGVFVAEEDVPKLDSILTSIPT-DVILRKQRLLANPSMKQAMLFPQPAQA 343
Query: 308 -DAFYMVMYEL 317
DAF+ ++ L
Sbjct: 344 GDAFHQILNGL 354
>gi|222616547|gb|EEE52679.1| hypothetical protein OsJ_35064 [Oryza sativa Japonica Group]
Length = 528
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 76/293 (25%), Positives = 140/293 (47%), Gaps = 20/293 (6%)
Query: 52 DPSKADLFFLPF--SIARMRHDRRIGTEGIP--DFISHYIFNISQKYPYWNRTGGADHFY 107
D AD+ F+PF S++ RH R + E + + + P W R+GGADH
Sbjct: 225 DSRDADVVFVPFFASLSYNRHSRVVPPEKVSRDKELQEKLVRYLMAQPEWKRSGGADHVI 284
Query: 108 VACHSIGRSAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLP-QIWPRQEDPPKLGS 165
VA H +++ A V + V+ Y KDV P + + G
Sbjct: 285 VAHHP---NSLLHARSVLFPVVFVLSDFGRYHPRVASLEKDVIAPYKHMAKTFVNDSAGF 341
Query: 166 SKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----LLGSKFC 217
R L +F GA+ +R++L + +++ ++Y G ++ A + + SKFC
Sbjct: 342 DDRPTLLYFRGAIFRKEGGNIRQELYYMLKDEKDVYFAFGSVQDHGASKASKGMHASKFC 401
Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PLLKK 274
L++ G ++ R+ D++ CVPVII++ +LP+ D L++ FSI V + D L +
Sbjct: 402 LNIAGDTPSSNRLFDAIVSHCVPVIISDDIELPYEDALDYSKFSIFVRSSDAVKKGYLMR 461
Query: 275 ILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
+++G+S ++ + + + +V KHF++ DA M+ L + ++R++
Sbjct: 462 LIRGVSKHQWTRMWNRLKEVDKHFEYQYPSQKDDAVQMIWQALARKVPAIRLK 514
>gi|356507133|ref|XP_003522325.1| PREDICTED: probable glycosyltransferase At3g42180-like [Glycine
max]
Length = 484
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 77/302 (25%), Positives = 142/302 (47%), Gaps = 37/302 (12%)
Query: 50 TKDPSKADLFFLPF--SIARMRHDRRIGTEGIP------DFISHYIFNISQKYPYWNRTG 101
+D S+AD+ F+PF S++ RH + G E + D + ++ + W R+G
Sbjct: 182 VQDSSQADVIFVPFFSSLSYNRHSKLNGQEKVSLNKRLQDRLVQFLMGRKE----WKRSG 237
Query: 102 GADHFYVACHSIGRSAMQKAWEVKLNAIQVVCS--SSYFISGHIAHKDVSLP-----QIW 154
G DH VA H ++ +L A +V + Y + KD+ P
Sbjct: 238 GKDHLIVAHHPNSLLDARR----RLGAAMLVLADFGRYPVELANIKKDIIAPYRHLVGTI 293
Query: 155 PRQEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE 210
PR E KR L +F GA+ +R++L + +++++++ G + ++
Sbjct: 294 PRAESA---SFEKRTTLVYFQGAIYRKDGGAIRQELYYLLKDENDVHFTFGSIGGNGINQ 350
Query: 211 ----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
+ SKFCL++ G ++ R+ D++ CVPVII++ +LPF D L++ FSI+V
Sbjct: 351 ASQGMALSKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDEIELPFEDDLDYSDFSIIVHA 410
Query: 267 LDI---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSS 323
D L +L+ I +E+ + + ++ HF++ DA M+ ++ + SS
Sbjct: 411 SDAMKKGYLLNLLRSIKRDEWNKMWERLKQITHHFEYQYPSQPGDAVNMIWQQVEHKISS 470
Query: 324 VR 325
+R
Sbjct: 471 IR 472
>gi|115476598|ref|NP_001061895.1| Os08g0438600 [Oryza sativa Japonica Group]
gi|42408650|dbj|BAD09870.1| Exostosin family-like protein [Oryza sativa Japonica Group]
gi|42408898|dbj|BAD10156.1| Exostosin family-like protein [Oryza sativa Japonica Group]
gi|113623864|dbj|BAF23809.1| Os08g0438600 [Oryza sativa Japonica Group]
gi|215715329|dbj|BAG95080.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215740942|dbj|BAG97437.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 566
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 70/232 (30%), Positives = 115/232 (49%), Gaps = 19/232 (8%)
Query: 97 WNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLP--Q 152
W R GGADH V H ++M A +L+A V S +A+ KDV P
Sbjct: 311 WRRWGGADHLVVPHHP---NSMMDARR-RLSAAMFVLSDFGRYPPDVANLRKDVIAPYKH 366
Query: 153 IWPRQEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTP-- 206
+ P D G +R LA+F GA+ VR++L Q+ +++ +++ G ++
Sbjct: 367 VVPSLGDGDSPGFEQRPVLAYFQGAIHRKNGGRVRQRLYQLIKDEKDVHFTYGSVRQNGI 426
Query: 207 --YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV 264
+ SKFCL++ G ++ R+ D++ CVPVII++ +LPF DVL++ +F + V
Sbjct: 427 RRATKGMASSKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDDIELPFEDVLDYSAFCVFV 486
Query: 265 ATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
D L +L+GIS EE+ + + +V HF++ DA M+
Sbjct: 487 RASDAVKRGFLLHLLRGISQEEWTAMWRRLKEVAHHFEYQYPSQPGDAVQMI 538
>gi|42570324|ref|NP_850113.2| exostosin-like protein [Arabidopsis thaliana]
gi|75216857|sp|Q9ZUV3.1|IRX7_ARATH RecName: Full=Probable glucuronoxylan glucuronosyltransferase IRX7;
AltName: Full=Protein FRAGILE FIBER 8; AltName:
Full=Protein IRREGULAR XYLEM 7
gi|4063747|gb|AAC98455.1| hypothetical protein [Arabidopsis thaliana]
gi|77022037|gb|ABA60868.1| putative glucuronyltransferase [Arabidopsis thaliana]
gi|77022039|gb|ABA60869.1| putative glucuronyltransferase [Arabidopsis thaliana]
gi|330252987|gb|AEC08081.1| exostosin-like protein [Arabidopsis thaliana]
Length = 448
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 82/321 (25%), Positives = 135/321 (42%), Gaps = 43/321 (13%)
Query: 33 YASESYFKKAFM--KSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIP------DFIS 84
+A+E KAF+ + T+DP +AD FF+P ++ G P I+
Sbjct: 120 FAAEVALHKAFLSLEGDVRTEDPYEADFFFVPVYVS----CNFSTINGFPAIGHARSLIN 175
Query: 85 HYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISG--- 141
I +S +YP+WNRT G+DH + A H G + + + +S +
Sbjct: 176 DAIKLVSTQYPFWNRTSGSDHVFTATHDFGSCFHTMEDRAIADGVPIFLRNSIILQTFGV 235
Query: 142 ------HIAHKDVSLPQIWPR-----QEDPPKLGSSKRNKLAFFAGAV------------ 178
V P I P Q++ P + +R+ FF G +
Sbjct: 236 TFNHPCQEVENVVIPPYISPESLHKTQKNIPV--TKERDIWVFFRGKMELHPKNISGRFY 293
Query: 179 NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGC 238
+ VR + + + D Y R Y E+ S FCL G+ + R+ +S+ GC
Sbjct: 294 SKRVRTNIWRSYGGDRRFYLQRQRFAG-YQSEIARSVFCLCPLGWAPWSPRLVESVALGC 352
Query: 239 VPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVL--KVRK 296
VPVIIA+ LPF + W S+ VA D+ L IL+ +++ ++Q + VR+
Sbjct: 353 VPVIIADGIRLPFPSTVRWPDISLTVAERDVGKLGDILEHVAATNLSVIQRNLEDPSVRR 412
Query: 297 HFQWHVFPSDYDAFYMVMYEL 317
++V + DA + V+ L
Sbjct: 413 ALMFNVPSREGDATWQVLEAL 433
>gi|27817890|dbj|BAC55656.1| exostosin family protein-like protein [Oryza sativa Japonica Group]
Length = 453
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 69/225 (30%), Positives = 111/225 (49%), Gaps = 21/225 (9%)
Query: 92 QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH-KDVSL 150
++ PYW R G DH ++ C +A+ + + NA+ ++ S + KDV L
Sbjct: 182 ERQPYWRRHQGRDHVFI-CQDP--NALYRVVDRISNAVLLISDFGRLRSEQASLVKDVIL 238
Query: 151 P---QIWPRQEDPPKLGSSKRNKLAFFAG----AVNSPVREKLLQVWRNDSEIY----AH 199
P +I Q D +G R L FF G VR+ L QV N++++ A
Sbjct: 239 PYAHRINSFQGD---VGVESRPSLLFFMGNRYRKEGGKVRDTLFQVLENEADVIIKHGAQ 295
Query: 200 SGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKS 259
S + + SKFCLH G + R+ D+L CVPVI++++ +LPF DV+++++
Sbjct: 296 SRESRRMATRGMHSSKFCLHPAGDTPSACRLFDALVSLCVPVIVSDYIELPFEDVIDYRN 355
Query: 260 FSIVVAT---LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
SI V T + L L+GISS+ L Q + KV+ +F++
Sbjct: 356 ISIFVETSKAVQPGFLTSTLRGISSQRILEYQREIKKVKHYFEYE 400
>gi|326499682|dbj|BAJ86152.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 304
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 73/254 (28%), Positives = 118/254 (46%), Gaps = 30/254 (11%)
Query: 90 ISQKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIA 144
IS +PYWNRT GADHF+V H G +KA E + + + ++ H+
Sbjct: 23 ISSHWPYWNRTAGADHFFVVPHDFGACFHYQEEKAIERGILPLLRRATLVQTFGQKDHVC 82
Query: 145 HKD--VSLPQIWPRQED-----PPKLGSSKRNKLAFFAG----AVNSP-----VREKLLQ 188
K+ +++P P Q+ PP+ + R+ +F G N P R
Sbjct: 83 LKEGSINIPPYAPPQKMKTHLVPPE---TPRSIFVYFRGLFYDTANDPEGGYYARGARAS 139
Query: 189 VWRN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANH 246
VW N ++ ++ S Y +++ + FCL G+ + R+ +++ +GC+PVIIA+
Sbjct: 140 VWENFKNNPLFDISTDHPPTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 199
Query: 247 YDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSD 306
LPFAD + W + VA D+P L IL I E +L + +L Q +FP
Sbjct: 200 IVLPFADAIPWDEIGVFVAEDDVPKLDTILTSIPM-EVILRKQRLLANPSMKQAMLFPQP 258
Query: 307 Y---DAFYMVMYEL 317
DAF+ ++ L
Sbjct: 259 AQAGDAFHQILNGL 272
>gi|218199855|gb|EEC82282.1| hypothetical protein OsI_26516 [Oryza sativa Indica Group]
Length = 393
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 66/222 (29%), Positives = 107/222 (48%), Gaps = 15/222 (6%)
Query: 92 QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH-KDVSL 150
++ PYW R G DH ++ C +A+ + + NA+ ++ S + KDV L
Sbjct: 122 ERQPYWRRHQGRDHVFI-CQDP--NALYRVVDRISNAVLLISDFGRLRSEQASLVKDVIL 178
Query: 151 PQIWPRQEDPPKLGSSKRNKLAFFAG----AVNSPVREKLLQVWRNDSEIY----AHSGR 202
P +G R L FF G VR+ L QV N++++ A S
Sbjct: 179 PYAHRINSFQGDVGVESRPSLLFFMGNRYRKEGGKVRDTLFQVLENEADVIIKHGAQSRE 238
Query: 203 LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
+ + SKFCLH G + R+ D+L CVPVI++++ +LPF DV+++++ SI
Sbjct: 239 SRRMATRGMHSSKFCLHPAGDTPSACRLFDALVSLCVPVIVSDYIELPFEDVIDYRNISI 298
Query: 263 VVAT---LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
V T + L L+GISS+ L Q + KV+ +F++
Sbjct: 299 FVETSKAVQPGFLTSTLRGISSQRILEYQREIKKVKHYFEYE 340
>gi|222637299|gb|EEE67431.1| hypothetical protein OsJ_24780 [Oryza sativa Japonica Group]
Length = 500
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 69/225 (30%), Positives = 111/225 (49%), Gaps = 21/225 (9%)
Query: 92 QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH-KDVSL 150
++ PYW R G DH ++ C +A+ + + NA+ ++ S + KDV L
Sbjct: 229 ERQPYWRRHQGRDHVFI-CQDP--NALYRVVDRISNAVLLISDFGRLRSEQASLVKDVIL 285
Query: 151 P---QIWPRQEDPPKLGSSKRNKLAFFAG----AVNSPVREKLLQVWRNDSEIY----AH 199
P +I Q D +G R L FF G VR+ L QV N++++ A
Sbjct: 286 PYAHRINSFQGD---VGVESRPSLLFFMGNRYRKEGGKVRDTLFQVLENEADVIIKHGAQ 342
Query: 200 SGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKS 259
S + + SKFCLH G + R+ D+L CVPVI++++ +LPF DV+++++
Sbjct: 343 SRESRRMATRGMHSSKFCLHPAGDTPSACRLFDALVSLCVPVIVSDYIELPFEDVIDYRN 402
Query: 260 FSIVVAT---LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
SI V T + L L+GISS+ L Q + KV+ +F++
Sbjct: 403 ISIFVETSKAVQPGFLTSTLRGISSQRILEYQREIKKVKHYFEYE 447
>gi|115472841|ref|NP_001060019.1| Os07g0567000 [Oryza sativa Japonica Group]
gi|113611555|dbj|BAF21933.1| Os07g0567000 [Oryza sativa Japonica Group]
Length = 500
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 69/225 (30%), Positives = 111/225 (49%), Gaps = 21/225 (9%)
Query: 92 QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH-KDVSL 150
++ PYW R G DH ++ C +A+ + + NA+ ++ S + KDV L
Sbjct: 229 ERQPYWRRHQGRDHVFI-CQDP--NALYRVVDRISNAVLLISDFGRLRSEQASLVKDVIL 285
Query: 151 P---QIWPRQEDPPKLGSSKRNKLAFFAG----AVNSPVREKLLQVWRNDSEIY----AH 199
P +I Q D +G R L FF G VR+ L QV N++++ A
Sbjct: 286 PYAHRINSFQGD---VGVESRPSLLFFMGNRYRKEGGKVRDTLFQVLENEADVIIKHGAQ 342
Query: 200 SGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKS 259
S + + SKFCLH G + R+ D+L CVPVI++++ +LPF DV+++++
Sbjct: 343 SRESRRMATRGMHSSKFCLHPAGDTPSACRLFDALVSLCVPVIVSDYIELPFEDVIDYRN 402
Query: 260 FSIVVAT---LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
SI V T + L L+GISS+ L Q + KV+ +F++
Sbjct: 403 ISIFVETSKAVQPGFLTSTLRGISSQRILEYQREIKKVKHYFEYE 447
>gi|218201205|gb|EEC83632.1| hypothetical protein OsI_29363 [Oryza sativa Indica Group]
Length = 566
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 70/232 (30%), Positives = 114/232 (49%), Gaps = 19/232 (8%)
Query: 97 WNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLP--Q 152
W R GGADH V H ++M A +L+A V S +A+ KDV P
Sbjct: 311 WRRWGGADHLVVPHHP---NSMMDARR-RLSAAMFVLSDFGRYPPDVANLRKDVIAPYKH 366
Query: 153 IWPRQEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTP-- 206
+ P D G +R LA+F GA+ VR++L Q+ +++ +++ G ++
Sbjct: 367 VVPSLGDGDSPGFEQRPVLAYFQGAIHRKNGGRVRQRLYQLIKDEKDVHFTYGSVRQNGI 426
Query: 207 --YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV 264
+ SKFCL++ G ++ R+ D++ CVPVII++ +LPF DVL++ F + V
Sbjct: 427 RRATKGMASSKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDDIELPFEDVLDYSDFCVFV 486
Query: 265 ATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
D L +L+GIS EE+ + + +V HF++ DA M+
Sbjct: 487 RASDAVKRGFLLHLLRGISQEEWTAMWRRLKEVAHHFEYQYPSQPGDAVQMI 538
>gi|357157638|ref|XP_003577864.1| PREDICTED: probable glycosyltransferase At3g07620-like
[Brachypodium distachyon]
Length = 485
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 77/294 (26%), Positives = 142/294 (48%), Gaps = 22/294 (7%)
Query: 52 DPSKADLFFLPF--SIARMRHDRRIGTEGIPD--FISHYIFNISQKYPYWNRTGGADHFY 107
D AD+ F+PF S++ RH + + E + + + P W R+GGADH
Sbjct: 181 DAQDADVVFVPFFASLSYNRHSKPVPPEKVSRDRALQEKLVRYLAARPEWKRSGGADHVI 240
Query: 108 VACHSIGRSAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLP-QIWPRQEDPPKLGS 165
VA H +++ A A+ V+ Y KD+ P + + G
Sbjct: 241 VAHHP---NSLLHARSALFPAVFVLSDFGRYHPRVASLEKDLVAPYRHMAKTFVNDTAGF 297
Query: 166 SKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLK----TPYADELLGSKFC 217
R L +F GA+ +R++L + +++ +++ G ++ + + + SKFC
Sbjct: 298 DDRPTLLYFRGAIYRKEGGNIRQELYNMLKDEKDVFFSFGSVQDHGVSKASQGMHSSKFC 357
Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PLLKK 274
L++ G ++ R+ D++ CVPVII++ +LP+ DVL++ FSI V + D L K
Sbjct: 358 LNIAGDTPSSNRLFDAIVSHCVPVIISDDIELPYEDVLDYSKFSIFVRSSDAVKRGYLMK 417
Query: 275 ILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY-DAFYMVMYELWLRRSSVRVQ 327
+++G++ + + + +V KHF++ FPS DA M+ L + S+R++
Sbjct: 418 LIRGVTKHRWTRMWKRLKEVDKHFEYQ-FPSRKDDAVQMIWQALARKVPSIRLK 470
>gi|412994079|emb|CCO14590.1| predicted protein [Bathycoccus prasinos]
Length = 553
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 80/323 (24%), Positives = 152/323 (47%), Gaps = 42/323 (13%)
Query: 30 RGNYASESYFKKAFM--KSHFVTKDPSKADLFFLP-------FSIARMRHDRR-IGTEGI 79
R Y +E +F + F K+ T +P +ADLFF+P +S +RH+ + E
Sbjct: 222 RDQYGTEIWFHRNFRDDKNGVRTMNPEEADLFFVPQYGECFLWSREMLRHENQGQAMEET 281
Query: 80 PDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKL-NAIQVVCSSSYF 138
++ + ++ K PY+NRT G DH +V + G + + W+ ++ ++I +
Sbjct: 282 NEYFLEVLSHVKGKLPYFNRTDGRDHIFVFAGARGPTIF-RDWQKEIPHSIYLTPEGDRT 340
Query: 139 ISGHIAHKDVSLPQIW---------PRQE---DPPKLGSSKRNKLAFFAGAVNSP----- 181
+ KD+ +P + R E +PPK R LA F G ++ P
Sbjct: 341 LPQFDTWKDIVIPGLEYDKRMYLEEHRNELVTNPPK-----RKILAMFRGTIDHPAGFAY 395
Query: 182 ---VREKLLQVWRNDSEIYAHSGRLKTP----YADELLGSKFCLHVKGFEVNTARIADSL 234
+R KL ++++N +++ + ++K Y E+ S FCL+ G+ T R ++
Sbjct: 396 SKGLRPKLKKIFQNATDV-IYDTKIKDCDRDCYVREMTESVFCLNPLGWTPWTLRFYQAV 454
Query: 235 YYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKV 294
C+P+IIA++ + PF +N+ F++ + D+ + + ++ + EE + Y+ K+
Sbjct: 455 MTRCIPIIIADNIEFPFESEINYSEFALKIPEKDVSDILETMRHMPEEERERRRRYMDKI 514
Query: 295 RKHFQWHVFPSDYDAFYMVMYEL 317
K F + DA+Y + EL
Sbjct: 515 WKQFTYQRPAEIGDAYYSTVKEL 537
>gi|125586098|gb|EAZ26762.1| hypothetical protein OsJ_10674 [Oryza sativa Japonica Group]
Length = 364
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 75/290 (25%), Positives = 138/290 (47%), Gaps = 24/290 (8%)
Query: 57 DLFFLPF--SIARMRHDRRIGTEGIPD--FISHYIFNISQKYPYWNRTGGADHFYVACHS 112
D+ F+PF S++ RH + + + + + + P W R+GG DH +A H
Sbjct: 50 DVVFVPFFASLSFNRHSKVVPPARASEDRALQRRLLDYLAARPEWRRSGGRDHVVLAHHP 109
Query: 113 IGR-SAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLP--QIWPRQEDPPKLGSSKR 168
G A K W + V+C Y S KDV P + P + G R
Sbjct: 110 NGMLDARYKLWP----CVFVLCDFGRYPPSVAGLDKDVIAPYRHVVPNFAND-SAGYDDR 164
Query: 169 NKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----LLGSKFCLHV 220
L +F GA+ +R++L + +++ +++ G + ++ + SKFCL++
Sbjct: 165 PTLLYFQGAIYRKDGGFIRQELYYLLKDEKDVHFSFGSVVGNGIEQATQGMRASKFCLNI 224
Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PLLKKILK 277
G ++ R+ DS+ CVP+II++ +LPF DVL++ F I+V D L ++
Sbjct: 225 AGDTPSSNRLFDSIVSHCVPIIISDEIELPFEDVLDYSKFCIIVRGADAVKKGFLMNLIN 284
Query: 278 GISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
GIS E++ + + + +V +HF++ + DA M+ + + S+R++
Sbjct: 285 GISREDWTRMWNRLKEVERHFEYQYPSQNDDAVQMIWKAIARKAPSIRLK 334
>gi|307111456|gb|EFN59690.1| hypothetical protein CHLNCDRAFT_133230 [Chlorella variabilis]
Length = 569
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 80/306 (26%), Positives = 133/306 (43%), Gaps = 28/306 (9%)
Query: 39 FKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWN 98
F + F++S T+DPS+A LF++P + + G E + H I +PYW+
Sbjct: 271 FMEEFLESPVRTEDPSEASLFYIPAFLYSYSGNMAGGDEHTQLLLDH----IRATWPYWD 326
Query: 99 RTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS----GHIAHKDVS----L 150
R GG DHF G + + + S+ GH H + L
Sbjct: 327 RHGGRDHFLFVPADRGTCPWGSRFSDLIRIVHFGMHSTRTNHNPHFGHQGHPEFGCYNPL 386
Query: 151 PQIWPRQEDPPKLGSSKRNKLAFFAGAVNSP----------VREKLLQVWRNDSEIYAHS 200
I P L L FFAG++ + + +L+ W ND E ++ S
Sbjct: 387 RDIVAAGTGAP-LSLPWAGWLFFFAGSIRTDDNVYSGRTRLILSELVAQW-NDPE-FSFS 443
Query: 201 GRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSF 260
G Y +KFCL G+ R+ S+ GCVPV+I H P+ +VL +++F
Sbjct: 444 GGYVNNYPAGFREAKFCLAPWGYGFGM-RLHQSILGGCVPVVIQEHVFQPYEEVLPYETF 502
Query: 261 SIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW--HVFPSDYDAFYMVMYELW 318
S+ ++ D+P L++ L+ ++ E+Y L V++ ++ F W H+ +D + W
Sbjct: 503 SLRLSNEDLPQLRETLRSVTDEQYRELLEGVVRYKEAFSWERHLGGRAFDYTIASLRRRW 562
Query: 319 LRRSSV 324
L S+
Sbjct: 563 LNSLSL 568
>gi|242046000|ref|XP_002460871.1| hypothetical protein SORBIDRAFT_02g036640 [Sorghum bicolor]
gi|241924248|gb|EER97392.1| hypothetical protein SORBIDRAFT_02g036640 [Sorghum bicolor]
Length = 500
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 76/273 (27%), Positives = 124/273 (45%), Gaps = 26/273 (9%)
Query: 52 DPSKADLFFLPF--SIARMRHDRRIGTEGI---------PDFISHYIFNISQKYPYWNRT 100
DPS ADLF++PF S++ + + R D + + ++ PYW R
Sbjct: 176 DPSDADLFYVPFFSSLSLVVNPIRSPPAANASGAAAAYSDDAMQEELLEWLERQPYWRRH 235
Query: 101 GGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH-KDVSLPQIWPRQED 159
G DH ++ C +A+ + + NA+ +V S + KDV LP
Sbjct: 236 MGRDHVFI-CQDP--NALYRVIDRISNAVLLVSDFGRLRSDQASLVKDVILPYSHRINSF 292
Query: 160 PPKLGSSKRNKLAFFAG----AVNSPVREKLLQVWRNDSEIYAHSG----RLKTPYADEL 211
++G R L FF G VR+ L Q+ N+ ++ G + +
Sbjct: 293 KGEVGVDGRPLLLFFMGNRYRKEGGKVRDALFQILENEDDVTIKHGTQSRESRRAARQGM 352
Query: 212 LGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT---LD 268
SKFCLH G + R+ D+L CVPVI++++ +LPF D++++ SI V T +
Sbjct: 353 HSSKFCLHPAGDTPSACRLFDALVSLCVPVIVSDYIELPFEDIIDYNKISIFVGTSKAVQ 412
Query: 269 IPLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
L +L+ ISSE L Q KV+++F++
Sbjct: 413 PGYLTSMLRRISSERILEYQRETKKVKRYFEYE 445
>gi|115452759|ref|NP_001049980.1| Os03g0324700 [Oryza sativa Japonica Group]
gi|108707908|gb|ABF95703.1| secondary cell wall-related glycosyltransferase family 47,
putative, expressed [Oryza sativa Japonica Group]
gi|113548451|dbj|BAF11894.1| Os03g0324700 [Oryza sativa Japonica Group]
gi|125543687|gb|EAY89826.1| hypothetical protein OsI_11372 [Oryza sativa Indica Group]
gi|215768347|dbj|BAH00576.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 468
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 75/290 (25%), Positives = 138/290 (47%), Gaps = 24/290 (8%)
Query: 57 DLFFLPF--SIARMRHDRRIGTEGIPD--FISHYIFNISQKYPYWNRTGGADHFYVACHS 112
D+ F+PF S++ RH + + + + + + P W R+GG DH +A H
Sbjct: 154 DVVFVPFFASLSFNRHSKVVPPARASEDRALQRRLLDYLAARPEWRRSGGRDHVVLAHHP 213
Query: 113 IGR-SAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLP--QIWPRQEDPPKLGSSKR 168
G A K W + V+C Y S KDV P + P + G R
Sbjct: 214 NGMLDARYKLWP----CVFVLCDFGRYPPSVAGLDKDVIAPYRHVVPNFAND-SAGYDDR 268
Query: 169 NKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----LLGSKFCLHV 220
L +F GA+ +R++L + +++ +++ G + ++ + SKFCL++
Sbjct: 269 PTLLYFQGAIYRKDGGFIRQELYYLLKDEKDVHFSFGSVVGNGIEQATQGMRASKFCLNI 328
Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PLLKKILK 277
G ++ R+ DS+ CVP+II++ +LPF DVL++ F I+V D L ++
Sbjct: 329 AGDTPSSNRLFDSIVSHCVPIIISDEIELPFEDVLDYSKFCIIVRGADAVKKGFLMNLIN 388
Query: 278 GISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
GIS E++ + + + +V +HF++ + DA M+ + + S+R++
Sbjct: 389 GISREDWTRMWNRLKEVERHFEYQYPSQNDDAVQMIWKAIARKAPSIRLK 438
>gi|159478058|ref|XP_001697121.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158274595|gb|EDP00376.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 802
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 85/342 (24%), Positives = 149/342 (43%), Gaps = 75/342 (21%)
Query: 34 ASESYFKKAFMKSHFVTKDPSKADLFFLPFSI-----------------ARMRHDRRIGT 76
A E+Y + ++S T DP +AD F++P + A + H R T
Sbjct: 400 AVEAYLHETLLQSEHRTFDPDEADFFYVPVYVTCYMWPILGWADGPWWYAPLAHTR---T 456
Query: 77 EGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKA----------W---E 123
+ + +S +IS +P+WNR GG DH ++ G M A W +
Sbjct: 457 MHVSNMLSEVHAHISSTFPWWNRRGGRDHIWLMAADEGACYMPTAIYNTSIILTHWGRMD 516
Query: 124 VKLNAIQVVCSSSYFIS------------------GHIAH---KDVSLPQIWPRQ--EDP 160
++ + +Y ++ GH + KD+ +P D
Sbjct: 517 LEHQSNTAYQQDNYNMAMPGEFKAWPGMDWQSRMRGHPCYDPRKDLVIPAFKSVDHFRDS 576
Query: 161 PKLGSSK--RNKLAFFAGAVNSP--------VREKLLQVW-RND----SEIYAHSGRL-K 204
P LG + R+ L +F G + +R+KL +W +ND +IY +G + +
Sbjct: 577 PLLGGAPLVRDLLCYFRGDIGQARFPQYSRGLRQKLFHLWHKNDWAAKHKIYIGNGEMVR 636
Query: 205 TPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV 264
PY++ LL S+FCL + G + + R D++ +GC+PV+I ++ F +L+W+SFSI +
Sbjct: 637 GPYSEHLLRSRFCLVLPG-DGWSPRAEDAVLHGCIPVVIMDNVHAVFESILDWESFSIRI 695
Query: 265 ATLDIPL--LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFP 304
D L L ++L+ + E +Q + +V F + P
Sbjct: 696 REDDAALEALPQLLEAVPPERVAKMQRNLARVWHRFAYATGP 737
>gi|326514612|dbj|BAJ96293.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 428
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 77/324 (23%), Positives = 140/324 (43%), Gaps = 51/324 (15%)
Query: 33 YASESYFKKAFM-KSHFVTKDPSKADLFFLPF----------SIARMRHDRRIGTEGIPD 81
+A+E +A + + P +ADLF +P + + H R
Sbjct: 100 FAAEVAVHEALLLRQRRAGLRPEEADLFLVPVYACCNFSTPTGLPSLAHAR--------G 151
Query: 82 FISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNA------IQVVCSS 135
++ + + + P+WNR+ GADH +VA H G + +V + A +
Sbjct: 152 LLADAVGLVRAQMPFWNRSAGADHVFVASHDFG-ACFHPMEDVAMAAGIPEFLKGSILLQ 210
Query: 136 SYFISGHIAHKDVS--------LPQIWPRQEDPPKLGSSKRNKLAFFAGAV--------- 178
++ + G +DV P++ PR+ P+ + R+ AFF G +
Sbjct: 211 TFGVQGRHPCQDVEHVVIPPYVPPELAPRELPEPE--KAHRDIFAFFRGKMEVHPKNISG 268
Query: 179 ---NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLY 235
+ VR +LL+++ + + Y R Y E+ S FC+ G+ + R+ +S+
Sbjct: 269 HFYSRKVRTELLRLYGRNRKFYLKRKR-NDGYRSEMARSLFCICPLGWAPWSPRLVESVL 327
Query: 236 YGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKV- 294
GC+PV+IA+ LPF VL W S+ VA D+ L+ +L +++ +Q +
Sbjct: 328 LGCIPVVIADDIRLPFPGVLRWPDISLQVAERDVAGLEAVLDHVAATNLTTIQGNLWDPV 387
Query: 295 -RKHFQWHVFPSDYDAFYMVMYEL 317
RK ++ + DA + V+ EL
Sbjct: 388 KRKALVFNRPMEEGDATWQVLKEL 411
>gi|356565097|ref|XP_003550781.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
[Glycine max]
Length = 472
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 77/300 (25%), Positives = 143/300 (47%), Gaps = 33/300 (11%)
Query: 50 TKDPSKADLFFLPF--SIARMRHDRRIGTE--GIPDFISHYIFNISQKYPYWNRTGGADH 105
++ +AD+ F+PF S++ RH + G E + + + + + W R+GG DH
Sbjct: 170 VQNSRQADVVFVPFFSSLSYNRHSKIHGKEKVSVNRMLQQRLVQLLMEREEWKRSGGRDH 229
Query: 106 FYVACH--SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLP-----QIWPR 156
VA H SI R+ KL + +V + +A+ KD+ P PR
Sbjct: 230 VIVAHHPNSILRARR------KLGSAMLVLADFGRYPSQLANIKKDIIAPYRHLVSTVPR 283
Query: 157 QEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE-- 210
E +R+ L +F GA+ +R+KL + +++ +++ G ++ ++
Sbjct: 284 AESA---SYEERSTLLYFQGAIYRKDGGAIRQKLYYLLKDEKDVHFAFGSIRKNGINQAS 340
Query: 211 --LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLD 268
+ SKFCL+V G ++ R+ D++ CVPVII++ +LPF DVL++ F + V D
Sbjct: 341 QGMALSKFCLNVAGDTPSSNRLFDAIVSHCVPVIISDEIELPFEDVLDYSEFGLFVHASD 400
Query: 269 I---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
L +L+ I E++ + + + +HF++ DA M+ E+ + SS++
Sbjct: 401 AVRKGYLLNLLRSIKPEKWTQMWERLKDITQHFEYQYPSQPGDAVNMIWEEVAHKISSLQ 460
>gi|297822463|ref|XP_002879114.1| hypothetical protein ARALYDRAFT_481698 [Arabidopsis lyrata subsp.
lyrata]
gi|297324953|gb|EFH55373.1| hypothetical protein ARALYDRAFT_481698 [Arabidopsis lyrata subsp.
lyrata]
Length = 452
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 82/321 (25%), Positives = 136/321 (42%), Gaps = 43/321 (13%)
Query: 33 YASESYFKKAFM--KSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIP------DFIS 84
+A+E KAF+ + T+DP +AD FF+P ++ G P I+
Sbjct: 124 FAAEVALHKAFLSLEGDIRTEDPYEADFFFVPVYVSCNFST----INGFPAIGHARTLIN 179
Query: 85 HYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS---- 140
I +S +YP+WNR G+DH + A H G + + + SS +
Sbjct: 180 DAIKFVSTQYPFWNRNNGSDHVFTATHDFGSCFHTMEDRAIADGVPKILRSSIVLQTFGV 239
Query: 141 --GHIAHKDVSLPQIWPRQEDPPKLGSS--------KRNKLAFFAGAV------------ 178
H + ++ + P P L + +R+ AFF G +
Sbjct: 240 TFNHPCQEVENV--VIPPYISPESLHKTLKNIPVNKERDIWAFFRGKMELHPKNISGRFY 297
Query: 179 NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGC 238
+ VR K+ + + D Y R + Y E+ S FCL G+ + R+ +S+ GC
Sbjct: 298 SKRVRTKIWRSYGGDRRFYLQRQRF-SGYQLEIARSVFCLCPLGWAPWSPRLVESVALGC 356
Query: 239 VPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVL--KVRK 296
VPVIIA+ LPF + W S+ VA D+ L IL+ + + ++Q + VR+
Sbjct: 357 VPVIIADGIRLPFPSAVRWPDISLTVAERDVGKLGDILEHVVATNLSVIQRNLEDPSVRR 416
Query: 297 HFQWHVFPSDYDAFYMVMYEL 317
++V + DA + V+ L
Sbjct: 417 ALMFNVPSREGDATWQVLEAL 437
>gi|307106650|gb|EFN54895.1| hypothetical protein CHLNCDRAFT_135016 [Chlorella variabilis]
Length = 584
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 87/328 (26%), Positives = 138/328 (42%), Gaps = 74/328 (22%)
Query: 32 NYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPD---------- 81
NY + F + F+ S T+DPS+A+LF++P + GT G P
Sbjct: 244 NYIAYQQFLEQFLLSPVRTEDPSEANLFYIPMLLYGYS-----GTPGGPSRAPQVDSLCN 298
Query: 82 ------FISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSS 135
I + I+ K+PYWNRT G DHFY A A + A K A Q + S
Sbjct: 299 MMPGQAHIDLVLDQIAHKWPYWNRTRGRDHFYWA------PADRGACYHKGLAEQAIKVS 352
Query: 136 SYFISGHIAHKDVSLPQIWPRQEDPP-----------------KLGSS------------ 166
+ + H + + L ++ + P KL +S
Sbjct: 353 HFGL--HATNNSIDLGDLYSHNQMSPDHGCYHPLRDVVAPPFEKLAASWLNTTLRLGLDG 410
Query: 167 ---KRNKLAFFAGAV-------NSPVREKL---LQVWRNDSEIYAHSGRLKT-PYADELL 212
+N +F+G V + R+KL ++ W +D E GRL+ Y +
Sbjct: 411 NIKGKNATFYFSGNVQGINLMYSGGTRQKLQALIKQW-DDPEFGFVEGRLQEGAYEQRIR 469
Query: 213 GSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLL 272
S+FCL G R+ ++ G +PVI+ H P DVL +++FSI + D+P L
Sbjct: 470 ESRFCLAPYGHGYGM-RLGQCIFAGSIPVIVQEHVFQPLEDVLPYEAFSIRLTNDDLPQL 528
Query: 273 KKILKGISSEEYLLLQSYVLKVRKHFQW 300
++IL+GI+ +Y L + +L+ W
Sbjct: 529 REILRGITEAQYRELMTGLLRYSLALSW 556
>gi|114325715|gb|ABI64067.1| glycosyltransferase GT47C [Populus tremula x Populus alba]
Length = 442
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 87/330 (26%), Positives = 134/330 (40%), Gaps = 62/330 (18%)
Query: 33 YASESYFKKAFMKSHFV-TKDPSKADLFFLPFSIARMRHDRRIGTEGIP------DFISH 85
+ASE KA S V T DP +AD FF+P ++ G P +S
Sbjct: 114 FASEVAIHKALSSSLDVRTFDPYEADFFFVPVYVSCNFST----VNGFPAIGHARSLLSS 169
Query: 86 YIFNISQKYPYWNRTGGADHFYVACHSIG--------------------RSAMQKAWEVK 125
+ IS YP+WNR+ G+DH +VA H G RS + + + VK
Sbjct: 170 AVQLISSNYPFWNRSQGSDHVFVASHDYGACFHAMEERAMEDGIPEFLKRSIILQTFGVK 229
Query: 126 LN----AIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAV--- 178
N ++ V Y G + P + +R+ AFF G +
Sbjct: 230 FNHPCQDVENVVIPPYISPGSVRATLEKYPL------------TGRRDIWAFFRGKMEVH 277
Query: 179 ---------NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTAR 229
+ VR +L+ + D Y R Y E++ S FCL G+ + R
Sbjct: 278 PKNISGRYYSKKVRTVILRKYSGDRRFYLQRHRFAG-YQSEIVRSVFCLCPLGWAPWSPR 336
Query: 230 IADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQS 289
+ +S+ GCVPVIIA+ LPF + W S+ VA D+ L +L +++ +Q
Sbjct: 337 LVESVALGCVPVIIADGIRLPFPTAVRWSEISLTVAEKDVANLGTLLDQVAATNLSAIQK 396
Query: 290 YVL--KVRKHFQWHVFPSDYDAFYMVMYEL 317
+ VR+ ++ DA + V+Y L
Sbjct: 397 NLWDPDVRRALLFNDPVQGGDATWQVLYAL 426
>gi|168059257|ref|XP_001781620.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666934|gb|EDQ53576.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 460
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 81/303 (26%), Positives = 139/303 (45%), Gaps = 41/303 (13%)
Query: 52 DPSKADLFFLPF--SIARMRHDRRIGTEGIPD-------------------FISHYIFNI 90
DP +AD+FF+PF +++ H + D + + +I
Sbjct: 139 DPEQADVFFVPFFSALSFNSHGHGMSEGAAADKRLQIVLLTFGRHVNASCHLVQAGLVDI 198
Query: 91 SQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVS 149
K +W + G DH VA H +A++ ++ +I +V Y + KDV
Sbjct: 199 LSKSKWWQASQGRDHILVAHHP---NALRHYRDMLNQSIFIVADFGRYDKTVARLSKDVV 255
Query: 150 LP--QIWPR-QEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIY----- 197
P + P +D P S R L FF G + + VR KL ++ N+S+++
Sbjct: 256 APYVHVLPSYDQDNPADPFSLRKTLLFFQGRIHRKGDGIVRTKLAELLANNSDVHYVDSL 315
Query: 198 AHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNW 257
A + + T A + S+FCLH G ++ R+ D++ CVPVII++ +LPF D LN+
Sbjct: 316 ASAEAIATSTAG-MRTSRFCLHPAGDTPSSCRLFDAIVSHCVPVIISDRIELPFEDDLNY 374
Query: 258 KSFSIVVA---TLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
K FSI + ++ L + L+ I+ E +L + + + V HF++ P DA M+
Sbjct: 375 KDFSIFFSSEESVKPGHLLRTLRSITRERWLRMWNALKTVSHHFEYQHPPKKDDAVNMIF 434
Query: 315 YEL 317
++
Sbjct: 435 KQV 437
>gi|224106838|ref|XP_002314302.1| predicted protein [Populus trichocarpa]
gi|222850710|gb|EEE88257.1| predicted protein [Populus trichocarpa]
Length = 462
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 87/320 (27%), Positives = 137/320 (42%), Gaps = 42/320 (13%)
Query: 33 YASESYFKKAFMKSHFV-TKDPSKADLFFLPFSIARMRHDRRIGTEGIP------DFISH 85
+ASE KA S + T DP +AD FF+P ++ G P +S
Sbjct: 134 FASEVAIHKALSNSLDIRTFDPYEADFFFVPVYVS----CNFSTVNGFPAIGHARSLLSS 189
Query: 86 YIFNISQKYPYWNRTGGADHFYVACHSIG---RSAMQKAWE------VKLNAIQVVCSSS 136
+ IS YP+WNR+ G+DH +VA H G + ++A E +K + I
Sbjct: 190 AVQLISSNYPFWNRSQGSDHVFVASHDYGACFHAMEERAMEDGIPEFLKRSIILQTFGVK 249
Query: 137 YFISGHIAHKDVSLPQIWPRQ-----EDPPKLGSSKRNKLAFFAGAV------------N 179
+ V P I P + E+ P G +R+ AFF G + +
Sbjct: 250 FNHPCQDVENVVIPPYISPERVRTTLENYPLNG--RRDIWAFFRGKMEVHPKNISGRYYS 307
Query: 180 SPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCV 239
VR + + + D Y R Y E++ S FCL G+ + R+ +S+ GCV
Sbjct: 308 KKVRTVIWRKYSGDRRFYLQRHRF-AGYQSEIVRSVFCLCPLGWAPWSPRLVESVALGCV 366
Query: 240 PVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVL--KVRKH 297
PVIIA+ LPF + W S+ VA D+ L +L +++ +Q + VR+
Sbjct: 367 PVIIADGIRLPFPTAVRWSEISLTVAEKDVANLGTLLDHVAATNLSAIQKNLWDPDVRRA 426
Query: 298 FQWHVFPSDYDAFYMVMYEL 317
++ + DA + V+Y L
Sbjct: 427 LLFNDRVQEGDATWQVLYAL 446
>gi|115450193|ref|NP_001048697.1| Os03g0107900 [Oryza sativa Japonica Group]
gi|122247627|sp|Q10SX7.1|GT31_ORYSJ RecName: Full=Probable glucuronosyltransferase Os03g0107900
gi|108705764|gb|ABF93559.1| exostosin family protein, putative, expressed [Oryza sativa
Japonica Group]
gi|113547168|dbj|BAF10611.1| Os03g0107900 [Oryza sativa Japonica Group]
gi|125542077|gb|EAY88216.1| hypothetical protein OsI_09667 [Oryza sativa Indica Group]
gi|215766485|dbj|BAG98793.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222624042|gb|EEE58174.1| hypothetical protein OsJ_09104 [Oryza sativa Japonica Group]
Length = 427
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 81/320 (25%), Positives = 141/320 (44%), Gaps = 45/320 (14%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIA----------RMRHDRRIGTEGIPDF 82
+A+E +A + P A LFF+P ++ + H R + + + D
Sbjct: 101 FAAEVALHEALLAYAGRAARPDDATLFFVPVYVSCNFSTDNGFPSLSHARALLADAV-DL 159
Query: 83 ISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKL-NAI-----QVVCSSS 136
+ + PYWNR+ GADH +VA H G + +V + + I + + +
Sbjct: 160 VR-------AQMPYWNRSAGADHVFVASHDFG-ACFHPMEDVAIADGIPEFLKRSILLQT 211
Query: 137 YFISG-HIAHKD---VSLPQIWPRQE-DPPKLGSSKRNKLAFFAGAV------------N 179
+ + G H+ + V P + P + P+ ++R+ AFF G + +
Sbjct: 212 FGVQGTHVCQEADHVVIPPHVPPEVALELPEPEKAQRDIFAFFRGKMEVHPKNISGRFYS 271
Query: 180 SPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCV 239
VR +LLQ + + + Y R Y E+ S FCL G+ + R+ +S+ GC+
Sbjct: 272 KKVRTELLQKYGRNRKFYLKRKRYGN-YRSEMARSLFCLCPLGWAPWSPRLVESVLLGCI 330
Query: 240 PVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK--VRKH 297
PVIIA+ LPF VL W S+ VA D+ L+ +L + + ++Q + RK
Sbjct: 331 PVIIADDIRLPFPSVLQWLDISLQVAEKDVASLEMVLDHVVATNLTVIQKNLWDPVKRKA 390
Query: 298 FQWHVFPSDYDAFYMVMYEL 317
++ + DA + V+ EL
Sbjct: 391 LVFNRPMEEGDATWQVLREL 410
>gi|428181360|gb|EKX50224.1| hypothetical protein GUITHDRAFT_104038 [Guillardia theta CCMP2712]
Length = 723
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 94/344 (27%), Positives = 146/344 (42%), Gaps = 61/344 (17%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARM--RHDRRIGTEGIPD--------- 81
YA+E Y + + S +T DP ADLFF+P A H R G G PD
Sbjct: 372 YAAEVYIHEQLLLSDSLTLDPGAADLFFIPLYAACFLSSHFVRPGP-GWPDNDVDIGKTY 430
Query: 82 -FISHYIFNISQKYPYWNRTGGADHFYV------ACH--------------SIGRSAMQK 120
+ + ++ Q YP+++R+ GADH V +C S R+ ++
Sbjct: 431 QAVQLVLEHVRQTYPFFDRSAGADHVLVLSSDWGSCQGPFLELHNSILLVTSGDRTLVRP 490
Query: 121 AWEVKLNAIQVVCSSSYFISGHIA----HKDVSLPQIWPRQEDPPKLGSS-------KRN 169
AW A + S + + + KDV +P + P P L +S R+
Sbjct: 491 AWYAARAADHMGSSEEFAVRSRLPCFQLFKDVVIPPLVPH----PALTASYMGERTRGRD 546
Query: 170 KLAFFAGAVNSPVREKLLQ----------VWRNDSEI--YAHSGRLKTP-YADELLGSKF 216
L +F G V+ L + R S + + S R+ + Y DELL S F
Sbjct: 547 ILVYFRGTAAGSVKALLYNKDYSLGIRQLLLRRYSRVRGWVVSDRINSSSYHDELLRSVF 606
Query: 217 CLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKIL 276
CL G+E+ + R +++ GC+PV++ + LPF L++ F++ V I L+ IL
Sbjct: 607 CLAPAGWELWSVRFFEAILLGCIPVLLTDDVQLPFQQRLDYSRFTVKVEQRRILELESIL 666
Query: 277 KGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLR 320
I+ Q + +V K + P D DAF +M EL R
Sbjct: 667 SSINETVIRRKQEGLKEVWKRMTYQRPPEDGDAFTGIMDELARR 710
>gi|307111406|gb|EFN59640.1| hypothetical protein CHLNCDRAFT_133110 [Chlorella variabilis]
Length = 767
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 83/337 (24%), Positives = 139/337 (41%), Gaps = 72/337 (21%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFN--- 89
Y ES F + M+S T DP +AD F++P + + R G + D+ + N
Sbjct: 372 YGLESAFLEMLMQSEHRTLDPEEADFFYVPVFTSCFIYPVRSGASSLQDWFYAPVQNRVQ 431
Query: 90 ------------ISQKYPYWNRTGGADHFYVACHSIG----------------------- 114
I +P+W R GG DH ++ H G
Sbjct: 432 GAANMLLEAFHWIQSHHPWWERRGGRDHIWLVTHDEGSCWVPAAIRPSIILSHWGRMDLN 491
Query: 115 RSAMQKAWE-----VKLNAIQVVCSSSYF---------ISGHIAH---KDVSLPQIWP-- 155
S+ WE +Q + I+GH + KD+ +P I
Sbjct: 492 HSSTTGYWEDDYRQANARKLQHAEQQQHLFEPDGFQQKIAGHACYDPVKDLVVPLIKTPN 551
Query: 156 RQEDPPKLGSSKRNK--LAFFAGAVNSP-------VREKLLQVWRNDSEIYAHSGR---- 202
R + P G+ RN+ LAF G VN VR+++ R + + +
Sbjct: 552 RNKHSPLFGAPTRNRTWLAFHRGRVNHEFPRYSRGVRQRVDNASREHQWLENYGSKFGDE 611
Query: 203 -LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFS 261
L+ Y++ L S FCL ++G + +AR+ D++ +GC+PV+I + + F VL+ FS
Sbjct: 612 SLQGDYSELLASSIFCLVLQG-DGWSARMDDAMSHGCIPVVIIDDVHVSFESVLDLSQFS 670
Query: 262 IVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
+ V + D+ L +IL+ +S E LQ + +V + +
Sbjct: 671 LRVKSADVERLPEILQAVSQERREELQRNLARVWQRY 707
>gi|224136520|ref|XP_002322350.1| predicted protein [Populus trichocarpa]
gi|222869346|gb|EEF06477.1| predicted protein [Populus trichocarpa]
Length = 343
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 79/274 (28%), Positives = 120/274 (43%), Gaps = 29/274 (10%)
Query: 72 RRIGTEGI-----PDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRS---AMQKAWE 123
RR+ E + P I IS +PYWNRT GADHF+V H G +KA E
Sbjct: 40 RRVKEEPLFAFQSPRMIRSATQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIE 99
Query: 124 VKLNAI--QVVCSSSYFISGHIAHKD--VSLPQIWPRQEDPPKLGSSK--RNKLAFFAGA 177
+ + ++ H+ KD +++P P Q+ L K R+ +F G
Sbjct: 100 RGILPLLQHATLVQTFGQRNHVCLKDGSITVPSYAPPQKMQTHLIPEKTPRSIFVYFRGL 159
Query: 178 V----NSP-----VREKLLQVWRN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVN 226
N P R VW N D+ ++ S T Y +++ + FCL +
Sbjct: 160 FYDVGNDPEGGYYARGARAAVWENFKDNPLFDISTEHPTTYYEDMQQAVFCLCPLSWAPW 219
Query: 227 TARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLL 286
+ R+ ++L +G +PVII + LPFAD + W+ + V D+P L IL I E +L
Sbjct: 220 SPRLVEALIFGYIPVIIVDDIVLPFADAIPWEEIGVFVDEKDVPNLDTILTSIPP-EVIL 278
Query: 287 LQSYVLKVRKHFQWHVFP---SDYDAFYMVMYEL 317
+ +L Q +FP DAF+ V+ L
Sbjct: 279 RKQRLLANPSMKQAMLFPQLAQAGDAFHQVLNGL 312
>gi|168042843|ref|XP_001773896.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674740|gb|EDQ61244.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 373
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 73/301 (24%), Positives = 134/301 (44%), Gaps = 24/301 (7%)
Query: 56 ADLFFLPFSIAR-----MRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVAC 110
AD+ F+PF + R ++R E + + +K P W +GG DH V
Sbjct: 75 ADVIFVPFFASLSYNKFTRAEQRALGEDKNQELQEKLMQFLEKQPAWQASGGVDHVIVIH 134
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLP--QIWPRQEDPPKLGSS 166
H M+ A+ VV + S +A+ KD+ P + E +
Sbjct: 135 HPNSGYFMRDHLR---KAMFVVADFGRYASD-VANIGKDIVAPYKHVVNDFEAEATISYE 190
Query: 167 KRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPY----ADELLGSKFCL 218
KR L FF GA+ +R +L ++ + +++ G ++ + SKFCL
Sbjct: 191 KRKTLLFFQGAIMRKEGGIIRLQLYKLLNGEPDVHFEGGNTTNSAIRSASEGMQNSKFCL 250
Query: 219 HVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PLLKKI 275
++ G ++ R+ D++ CVPVII++ ++PF D LN+ +FSI + + D + +
Sbjct: 251 NLAGDTPSSNRLFDAIASHCVPVIISDDIEVPFEDTLNYSTFSIFIKSSDALKSNFIIDL 310
Query: 276 LKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQWSTSLDSN 335
L+G+S E++ + + + +V HF++ DA +M + + VR+ +
Sbjct: 311 LRGVSREKWTKMWATLKQVEHHFKYQYPTQPDDAVHMTWKAIARKIHKVRLHLNKERRYQ 370
Query: 336 W 336
W
Sbjct: 371 W 371
>gi|159490314|ref|XP_001703124.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158270754|gb|EDO96589.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 490
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 90/338 (26%), Positives = 136/338 (40%), Gaps = 59/338 (17%)
Query: 30 RGNYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFN 89
R + ++ +F + + S D KAD +++P RM D R F+S +
Sbjct: 122 RLDRSTHIHFYQRLLGSGARIADGDKADWYYIPIR-QRMTADSR--------FLSEAVAY 172
Query: 90 ISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFIS--------- 140
IS YP+WNRTGG+ HF + +G Q +L A + + + ++
Sbjct: 173 ISATYPWWNRTGGSRHFVIHTGDLGADETQLG--ARLQAPNITWLTHWGLTMDKVFSGWK 230
Query: 141 -GHIAHKDVSLP--------QIWPRQEDP--PKLGSSKRNKLAFFAGAVNSPVREKLLQV 189
H KDV +P + + + P P + +R FFAG + +
Sbjct: 231 KAHRPDKDVVIPVFLTPGHFKHFGLERTPLHPLMDKQERTTTFFFAGRICGDRKPPKTGS 290
Query: 190 WRN---DSEIYAHSGR----------------LKTP-YADELLGSKFCLHVKGFEVNTAR 229
W N S Y+ R L P Y L SKFCL G +
Sbjct: 291 WPNCGPRSPGYSAGVRQLVHHHHWDPPGFKVVLHEPNYGAALGSSKFCLAPLGGGHGQRQ 350
Query: 230 IADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQS 289
I S + GC+PV IA+ PF NW F + A DIP L IL+ +S++EY Q
Sbjct: 351 IIVS-FMGCLPVCIADDVYEPFEPQYNWTQFGVRPAESDIPELHTILESVSAKEYAAKQR 409
Query: 290 YVLKVRKHFQWH-----VFPSD--YDAFYMVMYELWLR 320
+ +HF + +F D YDAF + L ++
Sbjct: 410 ALRCAAQHFVYSSIVGGLFGEDGRYDAFETTLEVLRVK 447
>gi|412991334|emb|CCO16179.1| predicted protein [Bathycoccus prasinos]
Length = 558
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 77/311 (24%), Positives = 134/311 (43%), Gaps = 27/311 (8%)
Query: 32 NYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHY----- 86
Y +E M+S T DP +A+ F++P + I T G +
Sbjct: 234 QYGTEIRIHANIMQSKMYTLDPLEAEFFYVPV-YGECKLFENIATLGAKKGLQETNAWWL 292
Query: 87 --IFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLN-AIQVVCSSSYFISGHI 143
+ ++ +YP+WNRT G DH + + G + K W+ + +I + +S
Sbjct: 293 EAMKLVTDQYPFWNRTQGRDHVFTFAGARG-PHIFKDWKRHIKKSIFLTPEGDRSLSEQF 351
Query: 144 AH-KDVSLPQIWPRQE----DPPKLGSSKRNK-LAFFAGAV--------NSPVREKLLQV 189
KD+ +P + P + K KR K A+F G + + +R K+ +
Sbjct: 352 NTWKDIVIPGLEPEKAFWSGSLRKQKEVKRAKTFAYFRGTIANKLGKQYSKGIRIKMKEA 411
Query: 190 WRNDSEIY---AHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANH 246
+++ ++ HS KT Y +E+ S FCL +G+ T R +L GC+PVIIA+
Sbjct: 412 FKDIKDVVFTEQHSSCDKTCYREEMRASTFCLCPRGWSPWTLRAYQALMVGCIPVIIADE 471
Query: 247 YDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSD 306
+ P+ + +W+ SI + IL+ + + + + K W +D
Sbjct: 472 IEFPYENSFDWRQVSIKIPEKRHLETIDILRSVPDDVVERKRKAMAKFWPSVAWKKPAAD 531
Query: 307 YDAFYMVMYEL 317
DAF++VM EL
Sbjct: 532 DDAFHLVMKEL 542
>gi|168032656|ref|XP_001768834.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162679946|gb|EDQ66387.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 368
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 67/236 (28%), Positives = 115/236 (48%), Gaps = 21/236 (8%)
Query: 95 PYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLP- 151
P W + GADH + H +AM E +A+ VV + +A+ KDV P
Sbjct: 109 PAWRASNGADHVLIIHHP---NAMVYKREQFRSAMFVVADFGRY-DAEVANIAKDVVAPY 164
Query: 152 -QIWPRQEDPPKLGSS--KRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLK 204
I P +D SS R L FF GA+ +R+KL ++ R++S++ +G
Sbjct: 165 KHIIPNFDDDIDSVSSFNTRTTLLFFQGAIVRKEGGIIRQKLYELLRDESDVVFVNGTTT 224
Query: 205 TP----YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSF 260
+ + SKFCLH++G ++ R+ D++ CVP+I+++ +LPF DV+N+ F
Sbjct: 225 SAGIRSATSGMRQSKFCLHMEGDTPSSNRLFDAVASHCVPLIVSDDIELPFEDVINYTEF 284
Query: 261 SIVVATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
+ V + D L +L+ +E+ + + +V+KHF++ + DA M
Sbjct: 285 CLFVNSSDALRKGFLTNLLRNFGEKEWTRMHDRMREVQKHFEYQLPSEIGDAVQMT 340
>gi|326490259|dbj|BAJ84793.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 498
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 82/285 (28%), Positives = 137/285 (48%), Gaps = 46/285 (16%)
Query: 52 DPSKADLFFLPF--SIARMRHDRR-------------IGT----EGIPDFISHYIFNISQ 92
DP +ADLF++PF S++ + + R +G E + D + ++ +
Sbjct: 171 DPVEADLFYVPFFSSLSLVVNPIRPPAAANASEAAATVGPWYSDEAMQDELVEWL----E 226
Query: 93 KYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSL 150
+ YW R G DH ++ C +A+ + + NA+ +V S + G A KDV L
Sbjct: 227 RQSYWRRYRGRDHVFI-CQDP--NALYRVVDRISNAVLLV-SDFGRLRGDQASLVKDVIL 282
Query: 151 P---QIWPRQEDPPKLGSSKRNKLAFFAG----AVNSPVREKLLQVWRNDSEIYAHSG-- 201
P +I P + D + R L FF G +R+ L QV N+ ++ G
Sbjct: 283 PYSHRINPFKGD---VNVDSRPALLFFMGNRYRKEGGKIRDTLFQVLENEGDVIIKHGAQ 339
Query: 202 -RLKTPYADELL-GSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKS 259
R+ A + + SKFCLH G + R+ D+L CVPVI+++H +LPF DV+++ +
Sbjct: 340 SRVSRRMATQGMHSSKFCLHPAGDTPSACRLFDALVSLCVPVIVSDHIELPFEDVIDYSN 399
Query: 260 FSIVVAT---LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
SI V T + L +L+ +SSE L Q + +V+ +F++
Sbjct: 400 ISIFVDTSKAIQPGFLTSMLRKVSSERILEYQREIQRVKHYFEYE 444
>gi|326437066|gb|EGD82636.1| hypothetical protein PTSG_03293 [Salpingoeca sp. ATCC 50818]
Length = 366
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 54/157 (34%), Positives = 80/157 (50%), Gaps = 15/157 (9%)
Query: 2 NRSFRVYVYPHRRNDPFANVLLPV-DFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
+R RVY+YP P + L P D++ YA+E+ F + S F T P +A LFF
Sbjct: 103 SRHLRVYMYPL----PESLQLPPTRDYK----YAAEATFTRMLRASTFSTDSPEEAQLFF 154
Query: 61 LPFSIARMR---HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIG--- 114
+ S A R DR G + + ++ Q+YPYWNRT G DHF+V H +G
Sbjct: 155 VRVSCAEARFTQRDREAGQRAADAHATAVLAHVQQRYPYWNRTQGRDHFFVCGHDMGAAP 214
Query: 115 RSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLP 151
R+A + + N I +V ++ ++ HKD+SLP
Sbjct: 215 RTAAARMFPSARNMIALVNTADVTEPDYVVHKDISLP 251
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 52/106 (49%), Gaps = 7/106 (6%)
Query: 222 GFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISS 281
G T R+ D+++ GCVPV IA+HYD P A ++W ++ +A D +K L+ +
Sbjct: 255 GDGCPTPRLMDAIWAGCVPVFIADHYDPPLAKYVDWALLAVFIAEADAAHIKAHLEMDAR 314
Query: 282 EEYLLLQSYVLKVRKHFQW-------HVFPSDYDAFYMVMYELWLR 320
Y +Y+ +VR W H AF +VM EL LR
Sbjct: 315 TMYAHRSAYIARVRDRLTWWDPAQRQHTMGRSTSAFDLVMLELALR 360
>gi|60657602|gb|AAX33322.1| secondary cell wall-related glycosyltransferase family 47 [Populus
tremula x Populus tremuloides]
Length = 442
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 86/330 (26%), Positives = 133/330 (40%), Gaps = 62/330 (18%)
Query: 33 YASESYFKKAFMKSHFV-TKDPSKADLFFLPFSIARMRHDRRIGTEGIP------DFISH 85
+ASE KA S V T DP +AD FF+P ++ G P +S
Sbjct: 114 FASEVAIHKALSSSLDVRTFDPYEADFFFVPVYVS----CNFSTVNGFPAIGHARSLLSS 169
Query: 86 YIFNISQKYPYWNRTGGADHFYVACHSIG--------------------RSAMQKAWEVK 125
+ IS YP+WNR+ G+DH +VA H G RS + + + VK
Sbjct: 170 AVQLISSNYPFWNRSQGSDHVFVASHDYGACFHAMEERAMEDGIPEFLKRSIILQTFGVK 229
Query: 126 LN----AIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAV--- 178
N ++ V Y G + P + +R+ AFF G +
Sbjct: 230 FNHPCQDVENVVIPPYISPGSVRTTLEKYPL------------TGRRDIWAFFRGKMEVH 277
Query: 179 ---------NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTAR 229
+ VR + + + D Y R Y E++ S FCL G+ + R
Sbjct: 278 PKNISGRYYSKKVRTVIWRKYSGDRRFYLQRHRF-AGYQSEIVRSVFCLCPLGWAPWSPR 336
Query: 230 IADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQS 289
+ +S+ GCVPVIIA+ LPF + W S+ VA D+ L +L +++ +Q
Sbjct: 337 LVESVALGCVPVIIADGIRLPFPTAVRWSEISLTVAEKDVANLGTLLDQVAATNLSAIQK 396
Query: 290 YVL--KVRKHFQWHVFPSDYDAFYMVMYEL 317
+ VR+ ++ DA + V+Y L
Sbjct: 397 NLWDPDVRRALLFNDPVQGGDATWQVLYAL 426
>gi|222619798|gb|EEE55930.1| hypothetical protein OsJ_04615 [Oryza sativa Japonica Group]
Length = 401
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 82/331 (24%), Positives = 136/331 (41%), Gaps = 44/331 (13%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T DP +AD F+ P ++ + P + + ++
Sbjct: 83 FAAEIFMHQFLLSSPVRTLDPEEADWFYTPAYTTCDLTPQGFPLPFRAPRIMRSAVRYVA 142
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS-AMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSL 150
+PYWNRT GADHF++A H G Q+ ++ + V+ ++ + H
Sbjct: 143 ATWPYWNRTDGADHFFLAPHDFGACFHYQEERAIERGILPVLRRATLVQTFGQRHHPCLQ 202
Query: 151 PQ--IWPRQEDPPKL------GSSKRNKLAFFAGAVNSPVREKLLQVWRNDSE--IYAHS 200
P P DP K+ ++ R+ +F G L ND E YA
Sbjct: 203 PGSITVPPYADPRKMEAHRISPATPRSIFVYFRG---------LFYDMGNDPEGGYYARG 253
Query: 201 GRLKT--PYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWK 258
R + D L + T R+ +++ +GC+PVIIA+ LPFAD + W
Sbjct: 254 ARASVWENFKDNPL-----FDISTEHPATPRLVEAVVFGCIPVIIADDIVLPFADAIPWG 308
Query: 259 SFSIVVATLDIPLLKKILKGISSEEYL----LLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
S+ VA D+P L IL + +E + LL S +K F P DAF+ ++
Sbjct: 309 EISVFVAEEDVPRLDTILASVPLDEVIRKQRLLASPAMKQAVLFHQPARPG--DAFHQIL 366
Query: 315 YEL----------WLRRSSVRVQWSTSLDSN 335
L +L + W L+++
Sbjct: 367 NGLARKLPHPKGVFLEPGEKGIDWDQGLEND 397
>gi|357475227|ref|XP_003607899.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
gi|355508954|gb|AES90096.1| Xylogalacturonan beta-1,3-xylosyltransferase [Medicago truncatula]
Length = 481
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 69/272 (25%), Positives = 129/272 (47%), Gaps = 27/272 (9%)
Query: 52 DPSKADLFFLPF--SIARMRHD-RRIGTEGIPDFISH-------YIFNISQKYPYWNRTG 101
DP +ADLFF+PF S++ + + R G+ +P+ ++ + + +W R+
Sbjct: 163 DPEEADLFFVPFFSSLSLIVNPVRPAGSGSVPEKTAYSDEENQEALMEWLEMQEFWKRSK 222
Query: 102 GADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYF--ISGHIAHKDVSLPQIWPRQED 159
G DH VA +AM + + N + +V G + KDV +P +
Sbjct: 223 GRDHVIVASDP---NAMYRVVDRVKNCVLLVSDFGRLRPDQGSLV-KDVIVPYSHRIRTY 278
Query: 160 PPKLGSSKRNKLAFFAG----AVNSPVREKLLQVWRNDSEIY----AHSGRLKTPYADEL 211
+G KRN L FF G +R+ L Q+ + ++ A S + + +
Sbjct: 279 DGGIGVDKRNTLLFFMGNRYRKEGGKIRDTLFQILEKEDDVIIKHGAQSRESRRAASQGM 338
Query: 212 LGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT---LD 268
SKFCLH G + R+ D++ CVPVI+++ +LPF D ++++ ++ V T +
Sbjct: 339 HTSKFCLHPAGDTPSACRLFDAIVSLCVPVIVSDSIELPFEDTIDYRKIAVFVETAAAIQ 398
Query: 269 IPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
L IL+G++ + + Q + +V+++F++
Sbjct: 399 PGYLVSILRGMAPDRIVEYQKELKEVKRYFKY 430
>gi|223975191|gb|ACN31783.1| unknown [Zea mays]
gi|414887161|tpg|DAA63175.1| TPA: exostosin-like protein [Zea mays]
Length = 500
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 78/273 (28%), Positives = 123/273 (45%), Gaps = 26/273 (9%)
Query: 52 DPSKADLFFLPF--SIARMRHDRR---------IGTEGIPDFISHYIFNISQKYPYWNRT 100
DPS ADLF++PF S++ + + R GT + + + ++ YW R
Sbjct: 176 DPSDADLFYVPFFSSLSLVVNPIRSPLAANASGAGTAYSDEALQEELLEWLERQLYWQRH 235
Query: 101 GGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH-KDVSLPQIWPRQED 159
G DH ++ C +A+ + + NA+ +V S + KDV LP
Sbjct: 236 RGRDHVFI-CQDP--NALYRVVDRISNAVLLVSDFGRLRSDQASLVKDVILPYSHRINSF 292
Query: 160 PPKLGSSKRNKLAFFAG----AVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----L 211
++G R L FF G VR+ L Q+ N+ ++ G E +
Sbjct: 293 KGEVGVDGRPSLLFFMGNRYRKEGGKVRDALFQILENEDDVTIKHGTQSRESRREATQGM 352
Query: 212 LGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT---LD 268
SKFCLH G + R+ D+L CVPVI +++ +LPF D++++ SI V T +
Sbjct: 353 HSSKFCLHPAGDTPSACRLFDALVSLCVPVIASDYIELPFEDIIDYNKISIFVGTSKAVQ 412
Query: 269 IPLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
L L+ ISSE L Q + KVR +F++
Sbjct: 413 PGYLTSTLRRISSERILEYQREIKKVRHYFEYE 445
>gi|449469226|ref|XP_004152322.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10-like [Cucumis
sativus]
Length = 388
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 78/306 (25%), Positives = 129/306 (42%), Gaps = 48/306 (15%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIP------DFISHY 86
+A+E Y + + S T +P +AD F+ P + + G+P +
Sbjct: 78 FAAEIYMHRFLLNSPVRTLNPDEADWFYTPIYVTC-----DLTPNGLPLPFKSPRMMRSA 132
Query: 87 IFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHK 146
I IS +PYWNRT GADHF+V H G + F
Sbjct: 133 IQLISSNWPYWNRTEGADHFFVVPHDFG---------------------ACFHYQEEKAI 171
Query: 147 DVSLPQIWPRQEDPPKLGS-SKRNKLAFFAGAV----NSP-----VREKLLQVWRN--DS 194
D +P P++ + S + R+ +F G N P R VW N ++
Sbjct: 172 DRGIPYCPPQKMKTHLIPSETPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENFKNN 231
Query: 195 EIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
++ S T Y +++ + FCL G+ + R+ +++ +GC+PVIIA+ LPFAD
Sbjct: 232 PLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADA 291
Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---DAFY 311
+ W+ + V D+ L IL I + +L + +L + +FP DAF+
Sbjct: 292 IPWEEIGVFVDEKDVSNLDTILTSIPP-DVILRKQRLLANPSMKRAMMFPQPAQSGDAFH 350
Query: 312 MVMYEL 317
++ L
Sbjct: 351 QILNGL 356
>gi|449439621|ref|XP_004137584.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
IRX7-like [Cucumis sativus]
Length = 494
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 73/283 (25%), Positives = 133/283 (46%), Gaps = 23/283 (8%)
Query: 50 TKDPSKADLFFLPF--SIARMRHDRRIGTEGIP--DFISHYIFNISQKYPYWNRTGGADH 105
KD S+AD+ F+PF S++ +H + G E I + + + W RTGG +H
Sbjct: 185 VKDSSQADVIFVPFFSSLSYNQHSKSHGKEKINVNKILQQKLIDFLFGQKEWRRTGGKNH 244
Query: 106 FYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLPQIWPRQEDPPKL 163
+A H ++M A + KL + V + IA+ KD+ P + P
Sbjct: 245 LVIAHHP---NSMLDARK-KLGSAMFVLADFGRYPAAIANIEKDIIAPYRHIVKTVPSSK 300
Query: 164 GSS--KRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLK----TPYADELLG 213
++ +R L +F GA+ VR++L + +++ +++ G +K +
Sbjct: 301 SATFDERPILVYFQGAIYRKDGGVVRQELYYLLKDEEDVHFTFGSVKGNGINKAGQGMAS 360
Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---P 270
SKFCL++ G ++ R+ DS+ CVPVII++ +LP+ D+L++ F + V D
Sbjct: 361 SKFCLNIAGDTPSSNRLFDSIASHCVPVIISDDIELPYEDILDYSEFCVFVRAADSIRKG 420
Query: 271 LLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
L +L+GI E + + + ++ F++ DA M+
Sbjct: 421 YLLNLLRGIGRERWTKMWDRIKEIVHEFEYQYPSQSGDAVDMI 463
>gi|449533617|ref|XP_004173769.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
IRX7-like, partial [Cucumis sativus]
Length = 330
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 73/283 (25%), Positives = 133/283 (46%), Gaps = 23/283 (8%)
Query: 50 TKDPSKADLFFLPF--SIARMRHDRRIGTEGIP--DFISHYIFNISQKYPYWNRTGGADH 105
KD S+AD+ F+PF S++ +H + G E I + + + W RTGG +H
Sbjct: 21 VKDSSQADVIFVPFFSSLSYNQHSKSHGKEKINVNKILQQKLIDFLFGQKEWRRTGGKNH 80
Query: 106 FYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLPQIWPRQEDPPKL 163
+A H ++M A + KL + V + IA+ KD+ P + P
Sbjct: 81 LVIAHHP---NSMLDARK-KLGSAMFVLADFGRYPAAIANIEKDIIAPYRHIVKTVPSSK 136
Query: 164 GSS--KRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLK----TPYADELLG 213
++ +R L +F GA+ VR++L + +++ +++ G +K +
Sbjct: 137 SATFDERPILVYFQGAIYRKDGGVVRQELYYLLKDEEDVHFTFGSVKGNGINKAGQGMAS 196
Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---P 270
SKFCL++ G ++ R+ DS+ CVPVII++ +LP+ D+L++ F + V D
Sbjct: 197 SKFCLNIAGDTPSSNRLFDSIASHCVPVIISDDIELPYEDILDYSEFCVFVRAADSIRKG 256
Query: 271 LLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
L +L+GI E + + + ++ F++ DA M+
Sbjct: 257 YLLNLLRGIGRERWTKMWDRIKEIVHEFEYQYPSQSGDAVDMI 299
>gi|195611602|gb|ACG27631.1| exostosin-like [Zea mays]
Length = 456
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 77/273 (28%), Positives = 123/273 (45%), Gaps = 26/273 (9%)
Query: 52 DPSKADLFFLPF--SIARMRHDRR---------IGTEGIPDFISHYIFNISQKYPYWNRT 100
DPS ADLF++PF S++ + + R GT + + + ++ YW R
Sbjct: 132 DPSDADLFYVPFFSSLSLVVNPIRSPLAANASGAGTAYSDEALQEELLEWLERQLYWQRH 191
Query: 101 GGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH-KDVSLPQIWPRQED 159
G DH ++ C +A+ + + NA+ +V S + KDV LP
Sbjct: 192 RGRDHVFI-CQDP--NALYRVVDRISNAVLLVSDFGRLRSDQASLVKDVILPYSHRINSF 248
Query: 160 PPKLGSSKRNKLAFFAG----AVNSPVREKLLQVWRNDSEIYAHSG----RLKTPYADEL 211
++G R L FF G VR+ L Q+ N+ ++ G + +
Sbjct: 249 KGEVGVDGRPSLLFFMGNRYRKEGGKVRDALFQILENEDDVTIKHGTQSRESRRAATQGM 308
Query: 212 LGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT---LD 268
SKFCLH G + R+ D+L CVPVI +++ +LPF D++++ SI V T +
Sbjct: 309 HSSKFCLHPAGDTPSACRLFDALVSLCVPVIASDYIELPFEDIIDYNKISIFVGTSKAVQ 368
Query: 269 IPLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
L L+ ISSE L Q + KVR +F++
Sbjct: 369 PGYLTSTLRRISSERILEYQREIKKVRHYFEYE 401
>gi|255547405|ref|XP_002514760.1| catalytic, putative [Ricinus communis]
gi|223546364|gb|EEF47866.1| catalytic, putative [Ricinus communis]
Length = 490
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 70/280 (25%), Positives = 135/280 (48%), Gaps = 21/280 (7%)
Query: 52 DPSKADLFFLPF--SIARMRHDRRIGTEGIP--DFISHYIFNISQKYPYWNRTGGADHFY 107
+ S+AD+ F+P+ S++ RH + G E + + + + W R+GG DH
Sbjct: 187 NSSQADIIFVPYFSSLSYNRHSKLHGKEKVSMNKMLQNRLVEFLMGQDEWKRSGGRDHLI 246
Query: 108 VACHSIGRSAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLPQIWPRQEDPPKLGS- 165
VA H ++M A ++ A+ V+ Y + KDV P + P +
Sbjct: 247 VAHHP---NSMLDARKMLGAAMFVLADFGRYPVEIANLKKDVIAPYKHVVRTIPSGESAQ 303
Query: 166 -SKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----LLGSKF 216
+R L FF GA+ +R++L + +++ +++ G ++ ++ + SKF
Sbjct: 304 FEERPILVFFQGAIYRKDGGIIRQELYYLLKDEKDVHFTFGTVRKNGVNKAGQGMASSKF 363
Query: 217 CLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PLLK 273
CL++ G ++ R+ D++ CVPVII++ +LPF DVL++ FS+ V D L
Sbjct: 364 CLNIAGDTPSSNRLFDAIVSHCVPVIISDDIELPFEDVLDYSEFSVFVRASDAVKEGYLL 423
Query: 274 KILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
+L+ I +++ ++ + ++ HF++ DA M+
Sbjct: 424 NLLQSIDRDKWTMMWERLKEIAPHFEYQYPSQSGDAVDMI 463
>gi|255565307|ref|XP_002523645.1| catalytic, putative [Ricinus communis]
gi|223537097|gb|EEF38731.1| catalytic, putative [Ricinus communis]
Length = 452
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 79/291 (27%), Positives = 131/291 (45%), Gaps = 37/291 (12%)
Query: 52 DPSKADLFFLP-FSIARMRHD--RRIGTEGIPDFISHY--------IFNISQKYPYWNRT 100
DP +ADLF++P FS + + R GTE P + HY + ++ YW R
Sbjct: 131 DPDEADLFYVPVFSSLSLIVNPVRPAGTE--PGLVQHYSDEEMQEQLVEWLEQQEYWKRN 188
Query: 101 GGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYF--ISGHIAHKDVSLPQIWPRQE 158
G DH +A +A+ + + NAI ++ G + KD+ +P
Sbjct: 189 NGRDHVIIAGDP---NALYRVLDRVKNAILLLSDFGRVRPDQGSLV-KDIIVPYSHRINV 244
Query: 159 DPPKLGSSKRNKLAFFAG----AVNSPVREKLLQVWRNDSEIYAHSGRL----KTPYADE 210
+G RN L FF G +R+ L Q+ ++ ++ G + +
Sbjct: 245 YNGDIGVRDRNTLLFFMGNRYRKDGGKIRDLLFQMLESEEDVVIKHGTQSRENRRAASRG 304
Query: 211 LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI- 269
+ SKFCL+ G + R+ DS+ CVPVI+++ +LPF DV+++ +I V T D
Sbjct: 305 MHTSKFCLNPAGDTPSACRLFDSIVSLCVPVIVSDSIELPFEDVIDYTKIAIFVETTDSL 364
Query: 270 --PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELW 318
L K+L+ ++SE L Q + KV ++F+ YD + E+W
Sbjct: 365 KPGYLVKLLREVTSERILEYQKELKKVTRYFE-------YDNSNGTVNEIW 408
>gi|224129654|ref|XP_002320639.1| predicted protein [Populus trichocarpa]
gi|222861412|gb|EEE98954.1| predicted protein [Populus trichocarpa]
Length = 375
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 77/297 (25%), Positives = 138/297 (46%), Gaps = 23/297 (7%)
Query: 50 TKDPSKADLFFLPF--SIARMRHDRRIGTE--GIPDFISHYIFNISQKYPYWNRTGGADH 105
++ S+AD+ F+PF S++ RH + G E + + + W R GG DH
Sbjct: 73 VQNSSQADIVFVPFFSSLSYNRHSKLHGKEKVSVNKMLQTKLVQFLTARDEWKRFGGNDH 132
Query: 106 FYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLPQIWPRQEDPPKL 163
VA H ++M A + KL + V + IA+ KD+ P + P
Sbjct: 133 LIVAHHP---NSMLHARK-KLGSAMFVLADFGRYPVEIANLGKDIIAPYKHVVRTIPSGE 188
Query: 164 GSS--KRNKLAFFAGAV----NSPVREKLLQVWRNDSEIY----AHSGRLKTPYADELLG 213
+ +R L F GA+ +R++L + +++ +++ + G A +
Sbjct: 189 SAQFDRRPILMHFQGAIYRKDGGAIRQELYYLLKDEKDVHFTFGTYRGNGIKKAAQGMAS 248
Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---P 270
SKFCL++ G ++ R+ D++ CVPVII++ +LPF DVL++ F + V D
Sbjct: 249 SKFCLNIAGDTPSSNRLFDAIASHCVPVIISDDIELPFEDVLDYSEFCLFVRASDAVKKG 308
Query: 271 LLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
L +L+GI +++ L + ++ HF++ DA MV + + SSV+ +
Sbjct: 309 YLLDLLRGIEKDQWTKLWERLKEIAPHFEYSYPSQPGDAVDMVWKAVLRKTSSVQFK 365
>gi|168051833|ref|XP_001778357.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162670236|gb|EDQ56808.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 406
Score = 84.3 bits (207), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 74/313 (23%), Positives = 144/313 (46%), Gaps = 23/313 (7%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEG----IPDFISHYIF 88
Y + + +K ++ +DP +AD+FF+PF + ++ G EG + + +
Sbjct: 85 YLLDGWDRKDGRRAAIRVRDPYQADVFFVPFFASLSFNNYGYGMEGPGAELDKNLQECVV 144
Query: 89 NISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIA--HK 146
NI +W + G DH V H + + LN+ ++ + S +A K
Sbjct: 145 NILLNSKWWKASQGRDHVIVLHHPNAF----RHYRHLLNSSMLIVADFGRFSTDVACLQK 200
Query: 147 DVSLP--QIWPRQEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEI-YAH 199
D+ P + D S+R+ L +F G + + VR KL + N+ ++ Y
Sbjct: 201 DIVAPYEHVVQSYVDDHSNSFSQRHILLYFQGRIHRKADGIVRAKLAKALMNEKDVHYMD 260
Query: 200 SGRLKTPYADELLG---SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLN 256
S A+ G S+FCLH G ++ R+ D++ CVPVI+++ +LPF D ++
Sbjct: 261 SEASSEALAEATSGMRSSRFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDRIELPFEDDID 320
Query: 257 WKSFSIVVATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
+ FS+ ++ + L +IL+GI+ ++ + + + V HF++ DA M+
Sbjct: 321 YNEFSLFFSSEEAVRPQYLLRILRGINETKWTQMWTKLKAVSHHFEFQHPAKKDDAVNMI 380
Query: 314 MYELWLRRSSVRV 326
++ + S+++
Sbjct: 381 FKQVQRKLPSMKL 393
>gi|168010648|ref|XP_001758016.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162690893|gb|EDQ77258.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 481
Score = 84.3 bits (207), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 77/292 (26%), Positives = 133/292 (45%), Gaps = 32/292 (10%)
Query: 48 FVTKDPSKADLFFLPFSIARMRHDRRIGTE----GIPDFIS-------HYIFNISQKYPY 96
F D ADL F+PF A + ++R +E G D + + ++ P
Sbjct: 169 FRVNDWRAADLMFVPF-FASLAYNRYTKSEHKVGGELDLVGDKNQKLQEKLLKFLEQQPA 227
Query: 97 WNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLP--Q 152
W +GG+DH V H AM+ + AI +V + S +A+ KDV P
Sbjct: 228 WQASGGSDHIVVIHHPNSFHAMRNFFS---KAIFIVADFGRYPS-EVANLRKDVVAPYKH 283
Query: 153 IWPRQEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTP-- 206
+ P D +R L FF G + +R++L ++ +N+ ++ G +
Sbjct: 284 VIPSFVDD-STPFEEREILLFFQGTIVRKQGGVIRQQLYEMLKNEKGVHFEEGSAGSAGI 342
Query: 207 --YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV 264
+ SK CL++ G ++ R+ D++ CVPVII++ +LPF D L++ FSI +
Sbjct: 343 HSATTGMRRSKCCLNIAGDTPSSNRLFDAIASHCVPVIISDEIELPFEDELDYSGFSIFI 402
Query: 265 ATLDIPLLK---KILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
+ D K +++ +S +E++ L + +V HF++ YDA MV
Sbjct: 403 NSTDAVQEKFVINLIRSVSRKEWMRLWKRLKEVSLHFEYQHPTKPYDAVNMV 454
>gi|356577103|ref|XP_003556667.1| PREDICTED: probable glucuronosyltransferase Os02g0520750-like
[Glycine max]
Length = 459
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 71/274 (25%), Positives = 126/274 (45%), Gaps = 29/274 (10%)
Query: 52 DPSKADLFFLPF--SIARMRHDRRI--GTEGIPDFI------SHYIFNISQKYPYWNRTG 101
DP +ADLFF+PF S++ + + R G+ + + +K YW R
Sbjct: 140 DPEEADLFFVPFFSSLSLIVNPVRPPGSNSGLEKPVYSDEENQEALVEWLEKQEYWKRNN 199
Query: 102 GADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYF--ISGHIAHKDVSLPQIWPRQED 159
G DH VA +AM + + NA+ +V G + KDV +P +
Sbjct: 200 GRDHVIVASDP---NAMYRVIDRVRNAVLLVSDFGRLRPDQGSLV-KDVVVPYSHRIRTY 255
Query: 160 PPKLGSSKRNKLAFFAG----AVNSPVREKLLQVWRNDSEIY----AHSGRLKTPYADEL 211
P +G R L FF G +R+ L Q+ N+ ++ A S + + +
Sbjct: 256 PGDVGVEDRKTLLFFMGNRYRKEGGKIRDLLFQILENEKDVIIKHGAQSRESRRAASHGM 315
Query: 212 LGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI-- 269
SKFCLH G + R+ D++ C+PVI++++ +LPF D ++++ ++ V T
Sbjct: 316 HTSKFCLHPAGDTPSACRLFDAIVSLCIPVIVSDNIELPFEDTIDYRKIAVFVETSSAIK 375
Query: 270 --PLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
LL K L+ ++ + L Q + +V+++F++
Sbjct: 376 PGHLLSK-LRAVTPDRVLEYQKKLKEVKRYFEYE 408
>gi|168014691|ref|XP_001759885.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689015|gb|EDQ75389.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 383
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 68/250 (27%), Positives = 122/250 (48%), Gaps = 20/250 (8%)
Query: 92 QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSL 150
+K+P W + G +H V H ++MQ + NA+ VV Y KDV
Sbjct: 131 EKHPAWKASNGKNHVMVIHHP---NSMQAVRDRLRNALYVVSDFGRYENETANIRKDVVA 187
Query: 151 P--QIWPRQEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLK 204
P + P D R+ + +F G++ +R +L + +++ +++ +G
Sbjct: 188 PYKHVLPTFTDDSS-SFHTRSTVVYFQGSIVRKEGGKIRHELYDLLKDEPDVHFTTGITA 246
Query: 205 TP----YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSF 260
+ + S+FCL++ G ++ R+ DS+ CVPVII++ +LPF D LN+ SF
Sbjct: 247 SEGFHSATRGMRSSRFCLNLAGDTPSSNRLFDSIASHCVPVIISDDLELPFEDDLNYSSF 306
Query: 261 SIVVAT---LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPS-DYDAFYMVMYE 316
I + + L + +L+ +SSEE+ L+ +L V +HF++ FPS DA MV
Sbjct: 307 CIFINSTRALQPGYVINLLRNVSSEEWTLMWERLLVVERHFEYQ-FPSVANDAVNMVWKA 365
Query: 317 LWLRRSSVRV 326
+ + ++R+
Sbjct: 366 IARKLPAIRL 375
>gi|224120296|ref|XP_002318294.1| predicted protein [Populus trichocarpa]
gi|222858967|gb|EEE96514.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 69/269 (25%), Positives = 126/269 (46%), Gaps = 23/269 (8%)
Query: 52 DPSKADLFFLPF--SIARMRHDRRIGTEGIPDFI------SHYIFNISQKYPYWNRTGGA 103
DP +ADLF++P S++ + ++G + D + + +K YW R G
Sbjct: 62 DPEEADLFYVPVFSSLSLTVNPVQVGKVPVSDPVYSDEKMQDELVEWLEKQEYWRRNNGR 121
Query: 104 DHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH-KDVSLPQIWPRQEDPPK 162
DH A +A+ + + NA+ ++ S + KDV +P
Sbjct: 122 DHVLFAGDP---NALYRVLDRVKNAVLLLSDFGRVRSDQGSLVKDVIVPYAHRINVYNGD 178
Query: 163 LGSSKRNKLAFFAG----AVNSPVREKLLQVWRNDSEIY-AHSGRLKTPYADELLG---S 214
+G +R L FF G +R+ L Q+ + ++ +H + + LG S
Sbjct: 179 IGVDERKTLLFFMGNRYRKDGGKIRDMLFQLLEKEEDVLISHGTQSRESRRTATLGMHTS 238
Query: 215 KFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT---LDIPL 271
KFCL+ G + R+ DS+ CVP+I+++ +LPF DV++++ +I V T L
Sbjct: 239 KFCLNPAGDTPSACRLFDSIVSLCVPLIVSDSIELPFEDVIDYRKIAIFVDTESSLKPGY 298
Query: 272 LKKILKGISSEEYLLLQSYVLKVRKHFQW 300
L K+L+ +S+E L Q + +V+++F++
Sbjct: 299 LVKLLRAVSTERILEYQKEMREVKRYFEY 327
>gi|15230692|ref|NP_190126.1| exostosin family protein [Arabidopsis thaliana]
gi|6967106|emb|CAB72489.1| putative protein [Arabidopsis thaliana]
gi|332644505|gb|AEE78026.1| exostosin family protein [Arabidopsis thaliana]
Length = 475
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 65/240 (27%), Positives = 122/240 (50%), Gaps = 21/240 (8%)
Query: 97 WNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLPQ--- 152
W R+GG DH +A H ++M A A+ ++ Y + KDV P
Sbjct: 206 WKRSGGRDHVVLAHHP---NSMLDARNKLFPAMFILSDFGRYPPTVANVEKDVIAPYKHV 262
Query: 153 IWPRQEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYA 208
I + D G R L +F GA+ VR++L + +++ +++ G ++
Sbjct: 263 IKAYENDTS--GFDSRPILLYFQGAIYRKDGGFVRQELFYLLQDEKDVHFSFGSVRNGGI 320
Query: 209 DE----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV 264
++ + SKFCL++ G ++ R+ D++ CVPVII++ +LPF DV+++ FS+ V
Sbjct: 321 NKASQGMHNSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDDIELPFEDVIDYSEFSVFV 380
Query: 265 ATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
T D L +++GI+ EE+ + + + +V K++++H FPS D ++++ R+
Sbjct: 381 RTSDALKENFLVNLIRGITKEEWTRMWNRLKEVEKYYEFH-FPSKVDDAVQMIWQAIARK 439
>gi|24476038|gb|AAN62780.1| Unknown protein [Oryza sativa Japonica Group]
Length = 449
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 82/341 (24%), Positives = 141/341 (41%), Gaps = 65/341 (19%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIA----------RMRHDRRIGTEGIPDF 82
+A+E +A + P A LFF+P ++ + H R + + + D
Sbjct: 101 FAAEVALHEALLAYAGRAARPDDATLFFVPVYVSCNFSTDNGFPSLSHARALLADAV-DL 159
Query: 83 ISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKA------WEVKLNAI------- 129
+ + PYWNR+ GADH +VA H G +E+ +NA
Sbjct: 160 VR-------AQMPYWNRSAGADHVFVASHDFGACFHPMELFVIIHFELGVNAKSNLALGQ 212
Query: 130 --------------QVVCSSSYFISG-HIAHKD---VSLPQIWPRQE-DPPKLGSSKRNK 170
+ + ++ + G H+ + V P + P + P+ ++R+
Sbjct: 213 EDVAIADGIPEFLKRSILLQTFGVQGTHVCQEADHVVIPPHVPPEVALELPEPEKAQRDI 272
Query: 171 LAFFAGAV------------NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCL 218
AFF G + + VR +LLQ + + + Y R Y E+ S FCL
Sbjct: 273 FAFFRGKMEVHPKNISGRFYSKKVRTELLQKYGRNRKFYLKRKRYGN-YRSEMARSLFCL 331
Query: 219 HVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKG 278
G+ + R+ +S+ GC+PVIIA+ LPF VL W S+ VA D+ L+ +L
Sbjct: 332 CPLGWAPWSPRLVESVLLGCIPVIIADDIRLPFPSVLQWLDISLQVAEKDVASLEMVLDH 391
Query: 279 ISSEEYLLLQSYVLK--VRKHFQWHVFPSDYDAFYMVMYEL 317
+ + ++Q + RK ++ + DA + V+ EL
Sbjct: 392 VVATNLTVIQKNLWDPVKRKALVFNRPMEEGDATWQVLREL 432
>gi|302782213|ref|XP_002972880.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
gi|300159481|gb|EFJ26101.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
Length = 459
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 77/286 (26%), Positives = 132/286 (46%), Gaps = 30/286 (10%)
Query: 52 DPSKADLFFLPF--SIARMRHDRR----IGTEGI--PDFISHYIFNISQKYPYWNRTGGA 103
DP+ AD+FF+PF S++ R+ R G G D + + + W R GGA
Sbjct: 156 DPATADIFFVPFFSSLSYNRYCRTGHRFQGGRGCVENDRLEKRLVEFLRGQELWRRNGGA 215
Query: 104 DHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLP--QIWPR-QE 158
DH V H +++ A + A+ VV F S +A+ KD+ P + P
Sbjct: 216 DHVIVMHHP---NSLMVARSLLKEAMFVVADFGRF-SRAVANMRKDIVAPYKHVIPSFAR 271
Query: 159 DPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKT----PYADE 210
D S R L FF GA+ +R+KL ++ ++ ++ +G +
Sbjct: 272 DATTFES--RETLLFFQGAIVRKEGGIIRQKLYEILKDSPGVHFVTGNTQKDGIRSATAG 329
Query: 211 LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT---L 267
+ +KFCLH+ G ++ R+ D++ CVPVII++ +LPF D L++ F + V + L
Sbjct: 330 MRNAKFCLHLAGDTPSSNRLFDAIASHCVPVIISDEIELPFEDELDYSQFCVFVESDKAL 389
Query: 268 DIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
+ + L+ I +E+ + + V +HF++ DA +M
Sbjct: 390 RKGFVVRALERIGRDEWTRKWAMLKSVERHFEYQHPSLPEDAVHMT 435
>gi|356553158|ref|XP_003544925.1| PREDICTED: probable glucuronosyltransferase Os01g0926700-like
[Glycine max]
Length = 440
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 92/362 (25%), Positives = 159/362 (43%), Gaps = 40/362 (11%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEP------RGNYASESYFKKAFMKSHFVTKDPSKAD 57
S ++YVY D +L D + +G + S+ K ++S T +AD
Sbjct: 66 SLKIYVYQEDEIDGLKELLRGRDAKITDEACLKGQWGSQVKIHKLLLQSKQRTWKKEEAD 125
Query: 58 LFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSA 117
LFF+P + ++ R +G + S Y+ ISQ PY+ +GG +H +V G +
Sbjct: 126 LFFVP---SYVKCARMMGGLNDKEINSTYVKVISQ-MPYFRLSGGRNHIFVFPSGAG-AH 180
Query: 118 MQKAWEVKLNAIQVVC-----SSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS------- 165
+ K+W +N ++ + S KD+ +P + K G
Sbjct: 181 LFKSWATYINRSIILTPEGDRTDKRDTSAFNTWKDIIIPG--NIDDGMTKTGDTTVQPLP 238
Query: 166 -SKRNKLAFFAG-AVNSPVREKLLQVWRN--------DSEIYAHSGRLKTPYADELLGSK 215
SKR LA + G A R KL+++ + D + + Y + L SK
Sbjct: 239 LSKRKYLANYLGRAQGKAGRLKLIELSKQFPEKLECPDLKFSGPDKLGRKEYFEHLRNSK 298
Query: 216 FCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI-PLLKK 274
FCL +G T R +S + CVPVI+++ +LPF +V+++ SI + I P L +
Sbjct: 299 FCLAPRGESSWTLRFYESFFVECVPVILSDQIELPFQNVIDYSQISIKWPSSQIGPELLQ 358
Query: 275 ILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQWSTSLDS 334
L+ I EE ++ + + R+ W V+ SD ++ + +W + VR Q+ S ++
Sbjct: 359 YLESIPDEE---IEKIIARGRQVRCWWVYASDSESCSAMRGIMWELQRKVR-QFHHSAET 414
Query: 335 NW 336
W
Sbjct: 415 FW 416
>gi|297826983|ref|XP_002881374.1| hypothetical protein ARALYDRAFT_482474 [Arabidopsis lyrata subsp.
lyrata]
gi|297327213|gb|EFH57633.1| hypothetical protein ARALYDRAFT_482474 [Arabidopsis lyrata subsp.
lyrata]
Length = 446
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 78/289 (26%), Positives = 126/289 (43%), Gaps = 37/289 (12%)
Query: 52 DPSKADLFFLP-FS----IARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHF 106
DP ADLF++P FS I G+ + + + + +W R GG DH
Sbjct: 129 DPDDADLFYVPVFSSLSLIVNAGRPVEPGSGYSDEKMQEGLMEWLEGQEWWRRNGGRDHV 188
Query: 107 YVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH------KDVSLPQIWPRQEDP 160
A +A+ + + N++ +V G + H KDV +P
Sbjct: 189 IPAGDP---NALYRILDRVKNSVLLVADF-----GRLRHDQGSFVKDVVIPYSHRVNLFN 240
Query: 161 PKLGSSKRNKLAFFAG----AVNSPVREKLLQVWRNDSEIYAHSGRL----KTPYADELL 212
++G RN L FF G VR+ L QV + ++ G + +
Sbjct: 241 GEIGVQDRNTLLFFMGNRYRKDGGKVRDLLFQVLEKEDDVTIKHGTQSRENRRAATKGMH 300
Query: 213 GSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV---ATLDI 269
SKFCL+ G + R+ DS+ CVPVI+++ +LPF DV++++ FSI V A L
Sbjct: 301 TSKFCLNPAGDTPSACRLFDSIVSLCVPVIVSDSIELPFEDVIDYRKFSIFVEANAALQP 360
Query: 270 PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELW 318
L ++L+ I +++ L Q + VR++F DYD + E+W
Sbjct: 361 GFLVQMLRKIKTKKILEYQREMQPVRRYF-------DYDNPNGAVKEIW 402
>gi|302852375|ref|XP_002957708.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300257002|gb|EFJ41257.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 600
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 77/312 (24%), Positives = 137/312 (43%), Gaps = 55/312 (17%)
Query: 33 YASESYFKKAFMKSHFV-TKDPSKADLFFLP----FSIARMRHDRRIGTEGIPDFISHYI 87
YA+ YF F++ H V T++P +A+LF++P F IA +R+ +P ++ +
Sbjct: 260 YAAYEYFMTYFLQDHAVRTENPYEANLFYIPMLAYFYIANVRNP-------VPQ-VTLAL 311
Query: 88 FNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVV---CSSSYFISGHIA 144
+ K+P++NRTGG DHFY + R A ++ + I++V +
Sbjct: 312 DYVRTKWPFYNRTGGRDHFYFL--TGDRGACSTPRWLQDSCIKLVHFGLQGEELPGTGVP 369
Query: 145 HKDVSLPQIWPRQEDPP---------------------KLGSSKRNKLAFF--------- 174
+++ Q+ PP K G KL FF
Sbjct: 370 NREYGCVQVKRDLVIPPINLFTDLVPSETQAYYKWLVSKKGYDSNRKLLFFFAGGVGQVP 429
Query: 175 --AGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIAD 232
+G V ++ L + ++ GR+ Y + L SKFC+ GF R+
Sbjct: 430 EYSGGVRQAIKGLLSSLTPKPEDVEFFEGRVHN-YKELLQSSKFCIAPYGFGWGL-RLIQ 487
Query: 233 SLYYGCVPVIIANHYDLPFA---DVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQS 289
++ YGC+P+II +H PF D L ++ FS+ + +DIP + ++L+ + + L+
Sbjct: 488 AIEYGCIPLIIQDHVYQPFERPKDFLPYEEFSVRMGLVDIPYMIELLRSYTEAQLAQLRL 547
Query: 290 YVLKVRKHFQWH 301
+ K + F W+
Sbjct: 548 GMAKYYQAFIWN 559
>gi|297815694|ref|XP_002875730.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297321568|gb|EFH51989.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 479
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 73/293 (24%), Positives = 144/293 (49%), Gaps = 33/293 (11%)
Query: 52 DPSKADLFFLPF--SIARMR------HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGA 103
+ S+AD+ F+PF S++ R H + + + + ++ + W R+GG
Sbjct: 161 NSSEADVIFVPFFSSLSYNRFSKVNPHQKTSRNKDLQGKLVTFLTGQEE----WKRSGGR 216
Query: 104 DHFYVACHSIGRSAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLPQ---IWPRQED 159
DH +A H ++M A A+ ++ Y + KD+ P I + D
Sbjct: 217 DHVVLAHHP---NSMLDARNKLFPAMFILSDFGRYPPTVANVEKDIIAPYKHVIKAYEND 273
Query: 160 PPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----L 211
G R L +F GA+ VR++L + +++ +++ G ++ ++ +
Sbjct: 274 TS--GFDSRPILLYFQGAIYRKDGGFVRQELFYLLQDEKDVHFSFGSVRNGGINKASQGM 331
Query: 212 LGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI-- 269
SKFCL++ G ++ R+ D++ CVPVII++ +LPF DV+++ F++ V T D
Sbjct: 332 HNSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDDIELPFEDVIDYSEFAVFVRTSDALK 391
Query: 270 -PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
L +++GIS EE+ + + + +V K++++H FPS D ++++ R+
Sbjct: 392 ENFLVNLIRGISKEEWTRMWNRLKEVEKYYEFH-FPSKVDDAVQMIWQAIARK 443
>gi|159485716|ref|XP_001700890.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158281389|gb|EDP07144.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 704
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 83/334 (24%), Positives = 134/334 (40%), Gaps = 62/334 (18%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRR-------IGTEGIP----- 80
Y E+ + + S T DP +AD F++PF A M + T G P
Sbjct: 310 YGLEAALHEYLLISEHRTFDPEEADYFYVPFYGACMIYPVAGWADYPWFWTPGGPRVMQV 369
Query: 81 -DFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRS-----AMQKAWEVKLNAIQVV-C 133
+ I + I ++YP+W R GG DH ++ H G W +
Sbjct: 370 INMIREIVEWIDKQYPFWKRRGGRDHIWLFTHDEGACWAPSVIKDSVWLTHWGRLDPEHT 429
Query: 134 SSSYFISGHIAHKDVSLPQ--------------------IWPRQEDPP--------KLGS 165
S++ F+ + H V+ Q + P + PP S
Sbjct: 430 SNTAFVGDNYTHDMVNWRQPEGYIKYIKGHPCYDPQKDLVVPNFKSPPHYVRSPLQSTPS 489
Query: 166 SKRNKLAFFAGAVNS--------PVREKLL-----QVWRNDSE-IYAHSGRLKTPYADEL 211
R+ FF G V +R+K+ Q W N + + G + Y+D L
Sbjct: 490 KPRDIFFFFKGDVGKHRLSHYSRGIRQKIYKMAMEQDWANTQKSLIGDGGNVHGDYSDLL 549
Query: 212 LGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPL 271
S FCL G + + R+ D++ +GC+PVIIA+ F VL+ SF++ VA D+P
Sbjct: 550 SRSLFCLVAPG-DGWSPRLEDAVLHGCIPVIIADRVHAVFESVLDIDSFAVRVAEADVPR 608
Query: 272 LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPS 305
+ IL+ +S + L QS + +V +++ P
Sbjct: 609 VMDILRAVSDIKIRLKQSRLGQVWHRYRYGALPG 642
>gi|168017794|ref|XP_001761432.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687438|gb|EDQ73821.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 377
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 78/304 (25%), Positives = 131/304 (43%), Gaps = 30/304 (9%)
Query: 48 FVTKDPSKADLFFLPFSIARMRHDRRIGTE----GIPDFISHYIFNISQKY-------PY 96
F D AD F+PF A + +++ TE G D + + +K P
Sbjct: 71 FRVSDWKAADYMFVPF-FASVAYNKYTKTEHHAGGELDLVGDKNQKLQEKLLEYLKQQPA 129
Query: 97 WNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLP--QI 153
W + G DH V H AM+ ++ N + V+ Y KDV P I
Sbjct: 130 WQASDGCDHILVMHHPNSMHAMRDSFR---NVLFVLADFGRYPPDVANVEKDVVAPYKHI 186
Query: 154 WPRQEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTP--- 206
P D R L FF G + +R++L ++ +++ ++ G +
Sbjct: 187 IP-SFDNDSSSFEDRETLLFFQGTIVRKQGGVIRQQLYEMLKDEEGVHFEEGSSGSEGVH 245
Query: 207 -YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVA 265
+ GSKFCL++ G ++ R+ DS+ CVPVII++ +LPF D L++ F + +
Sbjct: 246 SATSGMRGSKFCLNIAGDTPSSNRLFDSIASHCVPVIISDDIELPFEDELDYSEFCVFIK 305
Query: 266 TLDIPLLK---KILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRS 322
+ D K +L+ I+ ++ L + V +HF++ YDA MV + R
Sbjct: 306 SEDALKEKYVINLLRSITRVQWTFLWKRLKAVARHFEYQHPTKPYDAVNMVWRAIARRAP 365
Query: 323 SVRV 326
SV++
Sbjct: 366 SVKL 369
>gi|42571577|ref|NP_973879.1| exostosin-like protein [Arabidopsis thaliana]
gi|332191986|gb|AEE30107.1| exostosin-like protein [Arabidopsis thaliana]
Length = 410
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 75/285 (26%), Positives = 126/285 (44%), Gaps = 31/285 (10%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPR------GNYASESYFKKAFMKSHFVTKDPSKAD 57
S ++YVY D +L D + G + S+ K ++S F T +AD
Sbjct: 88 SLKIYVYDENEIDGLKELLYGRDGSVKTTACLKGQWGSQVKIHKLLLESKFRTIKKDEAD 147
Query: 58 LFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSA 117
LFF+P A ++ R +G + Y+ +SQ PY+ R+GG DH +V G +
Sbjct: 148 LFFVP---AYVKCVRMLGGLNDKEINQTYVKVLSQ-MPYFRRSGGRDHIFVFPSGAG-AH 202
Query: 118 MQKAWEVKLNAIQVVC-----SSSYFISGHIAHKDVSLP------QIWPRQEDPPKLGSS 166
+ ++W +N ++ + + + KD+ +P Q D L S
Sbjct: 203 LFRSWSTFINRSIILTPEADRTDKKDTTAFNSWKDIIIPGNVDDAMTKNGQPDVQPLPLS 262
Query: 167 KRNKLAFFAG-AVNSPVREKLLQVWRN--------DSEIYAHSGRLKTPYADELLGSKFC 217
KR LA + G A R KL+ + + D + +T Y + L +KFC
Sbjct: 263 KRKYLANYLGRAQGKAGRLKLIDLSKQFPDKLECPDLKFSGTEKFGRTTYFEHLRNAKFC 322
Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
L +G T R +S + CVPV++++H +LPF +V+++ SI
Sbjct: 323 LAPRGESSWTLRFYESFFVECVPVLLSDHAELPFQNVIDYAQVSI 367
>gi|224075401|ref|XP_002304618.1| predicted protein [Populus trichocarpa]
gi|222842050|gb|EEE79597.1| predicted protein [Populus trichocarpa]
Length = 105
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 35/74 (47%), Positives = 52/74 (70%)
Query: 211 LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIP 270
L G ++G+EVNT+R++D+++YGC+PV+I+N+ DLPFADVL+W FS+V+ DI
Sbjct: 12 LRGENIASMLEGYEVNTSRVSDAIHYGCIPVVISNNRDLPFADVLDWSKFSVVINQRDIA 71
Query: 271 LLKKILKGISSEEY 284
LK L + E Y
Sbjct: 72 FLKTKLLSRTRETY 85
>gi|21537341|gb|AAM61682.1| unknown [Arabidopsis thaliana]
Length = 462
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 75/285 (26%), Positives = 126/285 (44%), Gaps = 31/285 (10%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPR------GNYASESYFKKAFMKSHFVTKDPSKAD 57
S ++YVY D +L D + G + S+ K ++S F T +AD
Sbjct: 88 SLKIYVYDENEIDGLKELLYGRDGSVKTTACLKGQWGSQVKIHKLLLESKFRTIKKDEAD 147
Query: 58 LFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSA 117
LFF+P A ++ R +G + Y+ +SQ PY+ R+GG DH +V G +
Sbjct: 148 LFFVP---AYVKCVRMLGGLNDKEINQTYVKVLSQ-MPYFRRSGGRDHIFVFPSGAG-AH 202
Query: 118 MQKAWEVKLNAIQVVC-----SSSYFISGHIAHKDVSLP------QIWPRQEDPPKLGSS 166
+ ++W +N ++ + + + KD+ +P Q D L S
Sbjct: 203 LFRSWSTFINRSIILTPEADRTDKKDTTAFNSWKDIIIPGNVDDAMTKNGQPDVQPLPLS 262
Query: 167 KRNKLAFFAG-AVNSPVREKLLQVWRN--------DSEIYAHSGRLKTPYADELLGSKFC 217
KR LA + G A R KL+ + + D + +T Y + L +KFC
Sbjct: 263 KRKYLANYLGRAQGKAGRLKLIDLSKQFPDKLECPDLKFSGTEKFGRTTYFEHLRNAKFC 322
Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
L +G T R +S + CVPV++++H +LPF +V+++ SI
Sbjct: 323 LAPRGESSWTLRFYESFFVECVPVLLSDHAELPFQNVIDYAQVSI 367
>gi|18394994|ref|NP_564141.1| exostosin-like protein [Arabidopsis thaliana]
gi|332191985|gb|AEE30106.1| exostosin-like protein [Arabidopsis thaliana]
Length = 462
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 75/285 (26%), Positives = 126/285 (44%), Gaps = 31/285 (10%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPR------GNYASESYFKKAFMKSHFVTKDPSKAD 57
S ++YVY D +L D + G + S+ K ++S F T +AD
Sbjct: 88 SLKIYVYDENEIDGLKELLYGRDGSVKTTACLKGQWGSQVKIHKLLLESKFRTIKKDEAD 147
Query: 58 LFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSA 117
LFF+P A ++ R +G + Y+ +SQ PY+ R+GG DH +V G +
Sbjct: 148 LFFVP---AYVKCVRMLGGLNDKEINQTYVKVLSQ-MPYFRRSGGRDHIFVFPSGAG-AH 202
Query: 118 MQKAWEVKLNAIQVVC-----SSSYFISGHIAHKDVSLP------QIWPRQEDPPKLGSS 166
+ ++W +N ++ + + + KD+ +P Q D L S
Sbjct: 203 LFRSWSTFINRSIILTPEADRTDKKDTTAFNSWKDIIIPGNVDDAMTKNGQPDVQPLPLS 262
Query: 167 KRNKLAFFAG-AVNSPVREKLLQVWRN--------DSEIYAHSGRLKTPYADELLGSKFC 217
KR LA + G A R KL+ + + D + +T Y + L +KFC
Sbjct: 263 KRKYLANYLGRAQGKAGRLKLIDLSKQFPDKLECPDLKFSGTEKFGRTTYFEHLRNAKFC 322
Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
L +G T R +S + CVPV++++H +LPF +V+++ SI
Sbjct: 323 LAPRGESSWTLRFYESFFVECVPVLLSDHAELPFQNVIDYAQVSI 367
>gi|357508931|ref|XP_003624754.1| Secondary cell wall-related glycosyltransferase family [Medicago
truncatula]
gi|355499769|gb|AES80972.1| Secondary cell wall-related glycosyltransferase family [Medicago
truncatula]
Length = 510
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 74/287 (25%), Positives = 137/287 (47%), Gaps = 34/287 (11%)
Query: 51 KDPSKADLFFLPF--SIARMRH------DRRIGTEGIPDFISHYIFNISQKYPYWNRTGG 102
++ ++AD+ F+PF S+ RH +RR + + + + Y+ N + W R+GG
Sbjct: 190 RNSTEADVIFVPFFSSLTYNRHSKTGPHERRSRNKVLQEKLVRYLMNQEE----WKRSGG 245
Query: 103 ADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLPQ---IWPRQ 157
DH +A H ++M A +KL + S +IA+ KDV P I
Sbjct: 246 RDHLILAHHP---NSMLDA-RMKLWPATFILSDFGRYPPNIANVDKDVIAPYKHVIASYV 301
Query: 158 EDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTP----YAD 209
+D S R L +F GA+ R++L + + + +++ G ++ +
Sbjct: 302 DDQSTFDS--RKTLLYFQGAIYRKDGGYARQELFYLLKEEKDVHFSFGSVQKGGVRNATN 359
Query: 210 ELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI 269
+ SKFCL++ G ++ R+ D++ CVPVII++ +LP+ DVL++ F + V T D
Sbjct: 360 GMRSSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDEIELPYEDVLDYSKFCVFVRTRDA 419
Query: 270 ---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
L ++ I +E+ + + + +V K F++ + DA M+
Sbjct: 420 VKKKYLINFIRSIGKDEWTRMWNRLKEVEKFFEFQFPSKEGDAVEMI 466
>gi|18650643|gb|AAL75891.1| At1g21480/F24J8_23 [Arabidopsis thaliana]
Length = 462
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 75/285 (26%), Positives = 126/285 (44%), Gaps = 31/285 (10%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPR------GNYASESYFKKAFMKSHFVTKDPSKAD 57
S ++YVY D +L D + G + S+ K ++S F T +AD
Sbjct: 88 SLKIYVYDENEIDGLKELLYGRDGSVKTTACLKGQWGSQVKIHKLLLESKFRTIKKDEAD 147
Query: 58 LFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSA 117
LFF+P A ++ R +G + Y+ +SQ PY+ R+GG DH +V G +
Sbjct: 148 LFFVP---AYVKCVRMLGGLNDKEINQTYVKVLSQ-MPYFRRSGGRDHIFVFPSGAG-AH 202
Query: 118 MQKAWEVKLNAIQVVC-----SSSYFISGHIAHKDVSLP------QIWPRQEDPPKLGSS 166
+ ++W +N ++ + + + KD+ +P Q D L S
Sbjct: 203 LFRSWSTFINRSIILTPEADRTDKKDTTAFNSWKDIIIPGNVDDAMTKNGQPDVQPLPLS 262
Query: 167 KRNKLAFFAG-AVNSPVREKLLQVWRN--------DSEIYAHSGRLKTPYADELLGSKFC 217
KR LA + G A R KL+ + + D + +T Y + L +KFC
Sbjct: 263 KRKYLANYLGRAQGKAGRLKLIDLSKQFPDKLECPDLKFSGTEKFGRTTYFEHLRNAKFC 322
Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
L +G T R +S + CVPV++++H +LPF +V+++ SI
Sbjct: 323 LAPRGESSWTLRFYESFFVECVPVLLSDHAELPFQNVIDYAQVSI 367
>gi|359472749|ref|XP_002276440.2| PREDICTED: probable glycosyltransferase At5g11130-like [Vitis
vinifera]
gi|147815974|emb|CAN68074.1| hypothetical protein VITISV_007510 [Vitis vinifera]
gi|297737986|emb|CBI27187.3| unnamed protein product [Vitis vinifera]
Length = 461
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 67/270 (24%), Positives = 120/270 (44%), Gaps = 23/270 (8%)
Query: 52 DPSKADLFFLPF--------SIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGA 103
DP +ADLF++ F + R + GT + + + ++ YW R G
Sbjct: 141 DPEEADLFYVSFFSSLSLVVNPIRPANGEGAGTGYSDEEMQESLMEWLEQQEYWKRNNGR 200
Query: 104 DHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH-KDVSLPQIWPRQEDPPK 162
DH ++ C +A+ + N + +V S + KDV LP + +
Sbjct: 201 DHVFI-CQD--PNALHLIVDRVKNGVLLVSDFGRLRSDTASLVKDVILPYAHRIKSYSGE 257
Query: 163 LGSSKRNKLAFFAG----AVNSPVREKLLQVWRNDSEIY----AHSGRLKTPYADELLGS 214
+G R L FF G +R+ L Q+ + ++ A S + + + S
Sbjct: 258 IGVENRKSLLFFMGNRYRKEGGKIRDLLFQILEQEEDVIIKHGAQSRESRRMASQGMHSS 317
Query: 215 KFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV---ATLDIPL 271
KFCLH G + R+ D++ CVPVI+++ +LPF DV++++ +I V + +
Sbjct: 318 KFCLHPAGDTPSACRLFDAIVSLCVPVIVSDQIELPFEDVIDYRKIAIFVDSTSAVKPGF 377
Query: 272 LKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
L K L+ I+ E L Q + +V ++F++
Sbjct: 378 LVKNLRKITRERILEYQREMQEVTRYFEYE 407
>gi|326511980|dbj|BAJ95971.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 784
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 85/336 (25%), Positives = 140/336 (41%), Gaps = 72/336 (21%)
Query: 30 RGNYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIA------------RMRHDRRIGTE 77
R Y ++ ++ + S T + +AD F++P + RM D R+ +
Sbjct: 379 RQLYGAQMALYESILASPHRTLNGDEADYFYVPVLDSCLITRSDDAPHLRMPEDLRLRSY 438
Query: 78 GIPDFISHYIFNISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAWEVKLNA 128
++ +I+Q+YPYWNRT G DH + AC++ I S M W N
Sbjct: 439 HTLEYYRKAYDHIAQRYPYWNRTSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWG-NTNT 497
Query: 129 IQVVCSSSYFI-----------SGHIA---HKDVSLPQ--------IWPRQEDPPKLGSS 166
+++Y+ H KD+ LP IW + PK+
Sbjct: 498 KHEKSTTAYWADNWDDIPFDRRGNHPCFDPRKDLVLPAWKEPNPGAIWLKLWARPKI--- 554
Query: 167 KRNKLAFFAGAVNSP-------------VREKLLQVWRNDSEIYAHSGRLKTP------- 206
R L +F G + +R+KL + + GR +T
Sbjct: 555 NRTTLFYFNGNLGPAYEEGRPEDTYSMGIRQKLAAEFGSTPNKQGKLGRQQTANVTVTYL 614
Query: 207 ----YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
Y +EL S FC + G + + R+ DS+ GC+PVII + LP+ +VLN+ SFS+
Sbjct: 615 KSEMYYEELASSIFCGVLPG-DGWSGRMEDSMLQGCIPVIIQDGIFLPYENVLNYNSFSV 673
Query: 263 VVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
+ DIP L K+L+G++ + + V +V + F
Sbjct: 674 RIQEDDIPNLIKVLQGLNGTQIDFMLGNVRQVWQRF 709
>gi|297850568|ref|XP_002893165.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297339007|gb|EFH69424.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 462
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 75/285 (26%), Positives = 125/285 (43%), Gaps = 31/285 (10%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPR------GNYASESYFKKAFMKSHFVTKDPSKAD 57
S ++YVY D +L D + G + S+ K ++S F T +AD
Sbjct: 88 SLKIYVYDENEIDGLKELLYGRDGSVKTTACLKGQWGSQVKIHKLLLESKFRTIKKDEAD 147
Query: 58 LFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSA 117
LFF+P A ++ R +G + Y+ +SQ PY+ R+GG DH +V G +
Sbjct: 148 LFFVP---AYVKCVRMLGGLNDKEINQTYVKVLSQ-MPYFRRSGGRDHIFVFPSGAG-AH 202
Query: 118 MQKAWEVKLNAIQVVC-----SSSYFISGHIAHKDVSLP------QIWPRQEDPPKLGSS 166
+ ++W +N ++ + + KD+ +P Q D L S
Sbjct: 203 LFRSWSTFINRSIILTPEADRTDKKDTTAFNTWKDIIIPGNVDDAMTKNGQPDVQPLPLS 262
Query: 167 KRNKLAFFAG-AVNSPVREKLLQVWRN--------DSEIYAHSGRLKTPYADELLGSKFC 217
KR LA + G A R KL+ + + D + +T Y + L +KFC
Sbjct: 263 KRKYLANYLGRAQGKAGRLKLIDLSKQYPDKLECPDLKFSGTEKFGRTTYFEHLRNAKFC 322
Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
L +G T R +S + CVPV++++H +LPF +V+++ SI
Sbjct: 323 LAPRGESSWTLRFYESFFVECVPVLLSDHAELPFQNVIDYAQVSI 367
>gi|159470363|ref|XP_001693329.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158277587|gb|EDP03355.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 626
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 81/316 (25%), Positives = 140/316 (44%), Gaps = 61/316 (19%)
Query: 33 YASESYFKKAFMKSHFV-TKDPSKADLFFLP----FSIARMRHDRRIGTEGIPDFISHYI 87
Y++ F + F++ + T++P +A+LF++P F I +R + +P + I
Sbjct: 271 YSAYELFMRYFLQDNVTRTENPWEANLFYVPMLLYFYIGNVR-------DAVPQ-TAWAI 322
Query: 88 FNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVV-----CSSSYFISGH 142
++ ++P+W+R+GG DHFY G + + E++ AI+VV + + +I
Sbjct: 323 NHVRSRWPFWDRSGGRDHFYFMTGDRGTCHLPR--ELQDQAIKVVHWGMQVAGTDWIG-- 378
Query: 143 IAHKDVSLPQIWPRQEDPP---------------------KLGSS-KRNKLAFFAGAV-- 178
+ +KD + Q+ PP G R L FFAG +
Sbjct: 379 LDNKDYACIQLKRDLVVPPINMFAEILPTDTVKHYQTVVANGGQDFGRTLLFFFAGGIAQ 438
Query: 179 --------NSPVREKLLQVW-----RNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEV 225
++E L V +++ GR + Y LL SKFC+ GF
Sbjct: 439 SMEYSGGTRQAIKELLTSVHIANGNSTPADVVFVEGRTQE-YKKLLLTSKFCIAPYGFGW 497
Query: 226 NTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL 285
R+ ++ +GC+PVII +H F D L ++ FS+ + D+P L IL+ S E+
Sbjct: 498 GL-RLVQAIEFGCIPVIIQDHVYQAFEDFLPYEEFSVRLPLRDVPRLLDILRSYSPEQQA 556
Query: 286 LLQSYVLKVRKHFQWH 301
L+ + K + F WH
Sbjct: 557 ALRLGMAKYYRAFVWH 572
>gi|356519776|ref|XP_003528545.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
[Glycine max]
Length = 452
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 71/279 (25%), Positives = 129/279 (46%), Gaps = 33/279 (11%)
Query: 49 VTKDPSKADLFFLPF--SIARMRHDRR-----IGTEG---IPDFISHYIFNISQKYPYWN 98
+ DP +ADLFF+PF S++ + + R G+E + + +K YW
Sbjct: 131 LVADPEEADLFFVPFFSSLSLIVNPVRPPGSNSGSEKPVYSDEENQEALVEWLEKQEYWK 190
Query: 99 RTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYF--ISGHIAHKDVSLP---QI 153
R G DH VA +AM + + NA+ +V G + KDV +P +I
Sbjct: 191 RNSGRDHVIVASDP---NAMYRVIDRVRNAVLLVSDFGRLRPDQGSLV-KDVVVPYSHRI 246
Query: 154 WPRQEDPPKLGSSKRNKLAFFAG----AVNSPVREKLLQVWRNDSEIY----AHSGRLKT 205
Q D G RN L FF G +R+ L ++ N+ ++ A S +
Sbjct: 247 RTYQGDA---GVEDRNTLLFFMGNRYRKEGGKIRDILFKILENEKDVIIKHGAQSRESRR 303
Query: 206 PYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVA 265
+ + SKFCLH G + R+ D++ C+PVI++++ +LPF D ++++ ++ +
Sbjct: 304 AASQGMHTSKFCLHPAGDTPSACRLFDAIVSLCIPVIVSDNIELPFEDTIDYRKLAVFIE 363
Query: 266 T---LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
T + L L+ ++ + L Q + +V+++F++
Sbjct: 364 TSSAIKPGYLVSKLRALTPDRVLAYQKELKEVKRYFEYE 402
>gi|168004467|ref|XP_001754933.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694037|gb|EDQ80387.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 378
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 72/296 (24%), Positives = 133/296 (44%), Gaps = 40/296 (13%)
Query: 50 TKDPSKADLFFLPF-------------SIARMRHDRRIGTEGIPDFISHYIFNISQKYPY 96
KDP +A++FF+PF + DR + EG+ + +S+ +
Sbjct: 68 VKDPEQAEVFFVPFFASLSFNSFGRNMAAPNAAKDREL-QEGVVEMLSNS--------KW 118
Query: 97 WNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCS-SSYFISGHIAHKDVSLP--QI 153
W ++ G DH V H +A + ++ ++ +V Y + KD+ P +
Sbjct: 119 WQKSQGRDHIIVIHHP---NAFRYYRDMMNQSMFIVADFGRYNQTVARLKKDIVAPYAHV 175
Query: 154 WPR-QEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPY- 207
P ED P S R L FF G V + +R KL ++ N +++Y +T
Sbjct: 176 VPSYNEDNPSDPFSARKTLLFFQGRVRRKADGVIRAKLGKLLMNQTDVYYEDSLARTEAI 235
Query: 208 ---ADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV 264
+ S+FCLH G ++ R+ D++ CVPVI+++ +LPF D L++ FSI
Sbjct: 236 AMSTQGMRFSRFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDRIELPFEDDLDYSEFSIFF 295
Query: 265 ATLDIPL---LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
+ + + L L+ I+ E +L + + + + HF++ + DA ++ ++
Sbjct: 296 SAKEAIIPGHLLGTLRSITRERWLQMWNKLKAISHHFEYQNPSKEDDAVNLIFKQV 351
>gi|449462180|ref|XP_004148819.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase F8H-like
[Cucumis sativus]
gi|449524512|ref|XP_004169266.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase F8H-like
[Cucumis sativus]
Length = 458
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 84/328 (25%), Positives = 150/328 (45%), Gaps = 30/328 (9%)
Query: 30 RGNYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFN 89
+G + ++ + ++S F T++ +AD FF+P A ++ R +G + YI
Sbjct: 116 KGQWGTQVKIHRLLLQSRFRTRNKEEADFFFVP---AYVKCVRMLGGLNDKEINEAYIQV 172
Query: 90 ISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVC-----SSSYFISGHIA 144
+ Q PY+ +GG DH +V G + + K+W +N ++ + S
Sbjct: 173 LGQ-MPYFRLSGGRDHIFVFPSGAG-AHLFKSWATYINRSIILTPEGDRTDKKDFSAFNT 230
Query: 145 HKDVSLP------QIWPRQEDPPKLGSSKRNKLAFFAGAVNSPV-REKLLQVWRN----- 192
KD+ +P P + L SKR LA + G V R KL+++ +
Sbjct: 231 WKDIIIPGNVDDGMTSPGAKIVQPLPLSKRKHLANYLGRDQGKVGRLKLIELAKQFPEKL 290
Query: 193 DSEIYAHSG--RL-KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDL 249
+S + SG +L K Y + L +KFCL +G T R +S + CVPV++++ +L
Sbjct: 291 ESPVLKFSGPDKLGKLEYFEHLRNAKFCLAPRGESSWTLRFYESFFVECVPVVLSDQVEL 350
Query: 250 PFADVLNWKSFSIVVATLDI-PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYD 308
PF +V+++ SI + +I P L + L+ I E + + + R+ V+ SD +
Sbjct: 351 PFQNVIDYSQISIKWPSSEIGPQLLEYLESIPDET---IDKMIARGRRVRCLWVYASDSE 407
Query: 309 AFYMVMYELWLRRSSVRVQWSTSLDSNW 336
+ LW + VR Q+ S ++ W
Sbjct: 408 PCSTMQGILWELQRKVR-QFHQSTETFW 434
>gi|302852377|ref|XP_002957709.1| hypothetical protein VOLCADRAFT_119761 [Volvox carteri f.
nagariensis]
gi|300257003|gb|EFJ41258.1| hypothetical protein VOLCADRAFT_119761 [Volvox carteri f.
nagariensis]
Length = 1481
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 85/354 (24%), Positives = 151/354 (42%), Gaps = 64/354 (18%)
Query: 12 HRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHD 71
HRR+D N + D YA+++ + + S T DP +AD F++P S + +
Sbjct: 332 HRRHDLPGNQTVWSDGW---VYAADTLLHELLLISEHRTFDPEEADFFYVPHSASCLPFP 388
Query: 72 RR--------IGTEG-----IPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAM 118
+G G + + + + I + YP+W R GG DH ++ H G
Sbjct: 389 MGSWADYPWFLGPGGPRIRQMVNMLREVVDWIDKTYPFWRRRGGRDHIWLFTHDEGACWA 448
Query: 119 QKAWE------------VKLNAIQVVCSSSY---FISGH-----IAH----------KDV 148
K E ++ + + Y F+S H + H KD+
Sbjct: 449 PKVLENSTWLTHWGRMGLEHRSGTAFLADKYDIDFVSPHQPEGFLTHIKGHPCYDSTKDL 508
Query: 149 SLPQI-WPRQ-EDPPKLGSS--KRNKLAFFAGAVNS--------PVREKLLQV-----WR 191
+P PR P LGS+ +R+ FF G V VR+KL ++ W+
Sbjct: 509 VVPAFKQPRHYRSSPLLGSATKQRDIFLFFRGDVGKHRMAHYSRGVRQKLYKLSVENNWK 568
Query: 192 NDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
+ + + + ++ Y+D L S+FCL G + +AR+ D++ +GC+PVI+ + + F
Sbjct: 569 SKNVLIGGTHEVRGEYSDLLSRSQFCLVAAG-DGWSARLEDAVLHGCIPVIVIDEVHVVF 627
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPS 305
+LN SF++ + +P + IL I + Q+++ V F++ P
Sbjct: 628 ESILNVDSFAVRIDEQQLPQILDILAAIPERKIRAKQAHLGHVWHRFRYGSLPG 681
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 66/292 (22%), Positives = 116/292 (39%), Gaps = 29/292 (9%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARM----------RHDRRIGTEGIPDF 82
Y +ES + + S T DP +AD F++P+ M + G I
Sbjct: 1043 YGAESALHEYLLLSEHRTFDPEEADFFYVPYYGTCMIWPVLHWADFPYFHTTGGPRILQV 1102
Query: 83 ISHYIFN---ISQKYPYWNRTGGADHFYVACHSIGRS-----AMQKAWEVKLNAIQVVCS 134
I+ I I++ YP+W R GG DH ++ H G + W ++
Sbjct: 1103 INMLIDTVDWINKMYPFWGRRGGRDHIFLFPHDEGACWAPNVLVNATWLTHWGRTDMIHE 1162
Query: 135 SSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRNDS 194
S +D + RQ P + R + + +E W++
Sbjct: 1163 SKTSFDADNYTRDY----VGWRQ--PGGFVNLIRGHPCYDPVKIYRLAKEN---NWQDKH 1213
Query: 195 EIY-AHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFAD 253
I + + Y+D L S FCL G + +AR D++ +GC+PVII + + F
Sbjct: 1214 NILIGDAADVPGDYSDLLSRSLFCLVATG-DGWSARTEDAVLHGCIPVIIIDGVHIKFET 1272
Query: 254 VLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPS 305
V + FSI + + + +ILK I + +Q+++ +V +++ P
Sbjct: 1273 VFSVDEFSIRIPEANASRILEILKEIPKTKIRSIQAHLGRVWHRYRYANLPG 1324
>gi|168060641|ref|XP_001782303.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666233|gb|EDQ52893.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 426
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 64/237 (27%), Positives = 112/237 (47%), Gaps = 23/237 (9%)
Query: 95 PYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLP- 151
P W + G++H V H +AM E K ++ V + +A+ KDV P
Sbjct: 162 PAWRASEGSNHVVVIHHP---NAMLHTRE-KFRSVMFVVADFGRYGAEVANMAKDVVAPY 217
Query: 152 -QIWPR--QEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGR-- 202
+ P ++ L R L FF GA+ +R++L ++ + I +G
Sbjct: 218 KHVIPNFDEDVDAALSFKSRTTLLFFQGAIARKEGGIIRQQLYELLGEEPNIIFSNGTTS 277
Query: 203 ---LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKS 259
+++ A + SKFCLH+ G ++ R+ D++ CVP+II+N +LPF DVLN+
Sbjct: 278 NAGIRSATAG-MRQSKFCLHLAGDTPSSNRLFDAVASHCVPLIISNEIELPFEDVLNYSE 336
Query: 260 FSIVVATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
FS+ V + D + +L + +E+ + + +V +HFQ+ + DA +M
Sbjct: 337 FSLFVNSSDALRKGFVTDLLSNVGEKEWTRMHDRLRQVERHFQYQLPAQIGDAVHMT 393
>gi|302812737|ref|XP_002988055.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
gi|300144161|gb|EFJ10847.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
Length = 459
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 76/286 (26%), Positives = 131/286 (45%), Gaps = 30/286 (10%)
Query: 52 DPSKADLFFLPF--SIARMRHDRR----IGTEGI--PDFISHYIFNISQKYPYWNRTGGA 103
DP+ AD+FF+PF S++ R+ R G G D + + + W R GG
Sbjct: 156 DPATADIFFVPFFSSLSYNRYCRTGHRFQGGRGCVENDRLEKRLVEFLRGQELWRRNGGV 215
Query: 104 DHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLP--QIWPR-QE 158
DH V H +++ A + A+ VV F S +A+ KD+ P + P
Sbjct: 216 DHVIVMHHP---NSLMVARSLLKEAMFVVADFGRF-SRAVANMRKDIVAPYKHVIPSFAR 271
Query: 159 DPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKT----PYADE 210
D S R L FF GA+ +R+KL ++ ++ ++ +G +
Sbjct: 272 DATTFES--RETLLFFQGAIVRKEGGIIRQKLYEILKDSPGVHFVTGNTQKDGIRSATAG 329
Query: 211 LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT---L 267
+ +KFCLH+ G ++ R+ D++ CVPVII++ +LPF D L++ F + V + L
Sbjct: 330 MRNAKFCLHLAGDTPSSNRLFDAIASHCVPVIISDEIELPFEDELDYSQFCVFVESDKAL 389
Query: 268 DIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
+ + L+ I +E+ + + V +HF++ DA +M
Sbjct: 390 RKGFVVRALERIGRDEWTRKWAMLKSVERHFEYQHPSLPEDAVHMT 435
>gi|302773760|ref|XP_002970297.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
gi|300161813|gb|EFJ28427.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
Length = 437
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 76/287 (26%), Positives = 129/287 (44%), Gaps = 45/287 (15%)
Query: 52 DPSKADLFFLPFSIA-------------RMRHDRRIGTEGIPDFISHYIFNISQKYPYWN 98
DP +AD+FF+P+ + HD+++ G+ +++S K P++
Sbjct: 128 DPGEADVFFVPYFASLSFNVFGVSMRDPETEHDKKLQV-GMIEYLS--------KSPWYQ 178
Query: 99 RTGGADHFYVACHSIGRSAMQKAWEVKLN-AIQVVCSSSYFISGHIA-HKDVSLP---QI 153
R+GG DH V H ++ +LN ++ VV F G A HKDV P +
Sbjct: 179 RSGGRDHVLVLHHPNAFRFLKD----RLNLSLLVVADFGRFPKGVAALHKDVVAPYSHMV 234
Query: 154 WPRQEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYAD 209
D +R L FF G V + VR +L + N ++ G + T +
Sbjct: 235 PTYNGDDGTDPFEERTTLLFFQGRVKRKDDGVVRTQLAAILENQPRVHFEEG-IATNFTV 293
Query: 210 E-----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI-- 262
E + S+FCLH G ++ R+ D++ CVPVI+++ +LPF D L++ FS+
Sbjct: 294 EQAMQGMRSSRFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDKIELPFEDELDYSEFSLFF 353
Query: 263 -VVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYD 308
V + L L+ S ++ + + +V +HF++ PS D
Sbjct: 354 SVDEAVRPGFLLGALEKFSKRRWMKMWRRLKQVTRHFEYQ-HPSQRD 399
>gi|302793388|ref|XP_002978459.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
gi|300153808|gb|EFJ20445.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
Length = 437
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 76/287 (26%), Positives = 129/287 (44%), Gaps = 45/287 (15%)
Query: 52 DPSKADLFFLPFSIA-------------RMRHDRRIGTEGIPDFISHYIFNISQKYPYWN 98
DP +AD+FF+P+ + HD+++ G+ +++S K P++
Sbjct: 128 DPGEADVFFVPYFASLSFNVFGVSMRDPETEHDKKLQV-GMIEYLS--------KSPWYQ 178
Query: 99 RTGGADHFYVACHSIGRSAMQKAWEVKLNA-IQVVCSSSYFISGHIA-HKDVSLP---QI 153
R+GG DH V H ++ +LN+ + VV F G A HKDV P +
Sbjct: 179 RSGGRDHVLVLHHPNAFRFLKD----RLNSSLLVVADFGRFPKGVAALHKDVVAPYSHMV 234
Query: 154 WPRQEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYAD 209
D +R L FF G V + VR +L + N ++ G + T +
Sbjct: 235 PTYNGDDGSDPFEERTTLLFFQGRVKRKDDGVVRTQLAAILENQPRVHFEEG-IATNFTV 293
Query: 210 E-----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI-- 262
E + S+FCLH G ++ R+ D++ CVPVI+++ +LPF D L++ FS+
Sbjct: 294 EQAMQGMRSSRFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDKIELPFEDELDYSEFSLFF 353
Query: 263 -VVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYD 308
V + L L+ S ++ + + +V +HF++ PS D
Sbjct: 354 SVDEAVRPGFLLGALEKFSKRRWMKMWRRLKQVTRHFEYQ-HPSQRD 399
>gi|5882750|gb|AAD55303.1|AC008263_34 F25A4.34 [Arabidopsis thaliana]
gi|12324821|gb|AAG52383.1|AC011765_35 unknown protein; 115857-117304 [Arabidopsis thaliana]
Length = 458
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 74/283 (26%), Positives = 135/283 (47%), Gaps = 25/283 (8%)
Query: 51 KDPSKADLFFLPF--SIARMRHDRRIGTEGIPD--FISHYIFNISQKYPYWNRTGGADHF 106
K+ ++AD+ F+PF S++ R + G E D + + + W R G DH
Sbjct: 148 KNSNEADIVFVPFFASLSYNRKSKLRGNETSSDDRLLQERLVEFLKSQDEWKRFDGKDHL 207
Query: 107 YVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLPQIWPRQEDPPKLG 164
VA H +++ A L + V S S IA+ KD+ P + +
Sbjct: 208 IVAHHP---NSLLYARNF-LGSAMFVLSDFGRYSSAIANLEKDIIAPYVHVVKTISNNES 263
Query: 165 SS--KRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----LLGS 214
+S KR LA+F GA+ +R++L + +++ +++ G ++ + + S
Sbjct: 264 ASFEKRPVLAYFQGAIYRKDGGTIRQELYNLLKDEKDVHFAFGTVRGNGTKQTGKGMASS 323
Query: 215 KFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV---ATLDIPL 271
KFCL++ G ++ R+ D++ CVPVII++ +LPF D L++ FS+ V +
Sbjct: 324 KFCLNIAGDTPSSNRLFDAIVSHCVPVIISDQIELPFEDTLDYSGFSVFVHASEAVKKEF 383
Query: 272 LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY-DAFYMV 313
L IL+GI+ +++ + +V F++ FPS D+ M+
Sbjct: 384 LVNILRGITEDQWKKKWGRLKEVAGCFEYR-FPSQVGDSVNMI 425
>gi|18410670|ref|NP_565089.1| exostosin-like protein [Arabidopsis thaliana]
gi|16209709|gb|AAL14411.1| At1g74680/F1M20_36 [Arabidopsis thaliana]
gi|27363220|gb|AAO11529.1| At1g74680/F1M20_36 [Arabidopsis thaliana]
gi|332197500|gb|AEE35621.1| exostosin-like protein [Arabidopsis thaliana]
Length = 461
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 74/283 (26%), Positives = 135/283 (47%), Gaps = 25/283 (8%)
Query: 51 KDPSKADLFFLPF--SIARMRHDRRIGTEGIPD--FISHYIFNISQKYPYWNRTGGADHF 106
K+ ++AD+ F+PF S++ R + G E D + + + W R G DH
Sbjct: 151 KNSNEADIVFVPFFASLSYNRKSKLRGNETSSDDRLLQERLVEFLKSQDEWKRFDGKDHL 210
Query: 107 YVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLPQIWPRQEDPPKLG 164
VA H +++ A L + V S S IA+ KD+ P + +
Sbjct: 211 IVAHHP---NSLLYARNF-LGSAMFVLSDFGRYSSAIANLEKDIIAPYVHVVKTISNNES 266
Query: 165 SS--KRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----LLGS 214
+S KR LA+F GA+ +R++L + +++ +++ G ++ + + S
Sbjct: 267 ASFEKRPVLAYFQGAIYRKDGGTIRQELYNLLKDEKDVHFAFGTVRGNGTKQTGKGMASS 326
Query: 215 KFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV---ATLDIPL 271
KFCL++ G ++ R+ D++ CVPVII++ +LPF D L++ FS+ V +
Sbjct: 327 KFCLNIAGDTPSSNRLFDAIVSHCVPVIISDQIELPFEDTLDYSGFSVFVHASEAVKKEF 386
Query: 272 LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY-DAFYMV 313
L IL+GI+ +++ + +V F++ FPS D+ M+
Sbjct: 387 LVNILRGITEDQWKKKWGRLKEVAGCFEYR-FPSQVGDSVNMI 428
>gi|255570432|ref|XP_002526175.1| catalytic, putative [Ricinus communis]
gi|223534552|gb|EEF36251.1| catalytic, putative [Ricinus communis]
Length = 453
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 86/345 (24%), Positives = 149/345 (43%), Gaps = 40/345 (11%)
Query: 4 SFRVYVYPHRRNDPFANVL------LPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKAD 57
S ++YVY + D +L + D +G + ++ + S + T+ +AD
Sbjct: 79 SMKIYVYEEKEIDGLKELLRGRDGKISADTCVKGQWGTQVKIHRLLQNSRYRTRKKEEAD 138
Query: 58 LFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSA 117
LFF+P A ++ R +G + Y+ +SQ PY+ R+GG DH +V G +
Sbjct: 139 LFFVP---AYVKCVRMLGGLNDKEINLTYVKVLSQ-MPYFRRSGGRDHIFVFPSGAG-AH 193
Query: 118 MQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW-----PRQEDP--PKLGS----- 165
+ ++W +N ++ KD S W P D K+G+
Sbjct: 194 LFRSWATYINRSVILTPEG----DRTDKKDTSAFNTWKDIIIPGNVDDGMTKIGTTIVKP 249
Query: 166 ---SKRNKLAFFAGAVNSPV-REKLLQVWRN-----DSEIYAHSGRLK---TPYADELLG 213
SKR LA + G V R KL+++ + + SG K Y + L
Sbjct: 250 LPLSKRKFLANYLGRAQGKVGRLKLIELAKQYPDKLECPELKFSGPEKFGKMEYFEHLRN 309
Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI-PLL 272
+KFCL +G T R +S + CVPV++++ +LPF +V+++ SI + I P L
Sbjct: 310 AKFCLAPRGESSWTLRFYESFFVECVPVLLSDQAELPFQNVIDYTHVSIKWPSTKIGPEL 369
Query: 273 KKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
+ L+ I E+ + + +VR + + A +M+EL
Sbjct: 370 LEYLESIPDEDIERMIANGRQVRCLWVYAPESEQCSAMQGIMWEL 414
>gi|223943427|gb|ACN25797.1| unknown [Zea mays]
Length = 380
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 82/332 (24%), Positives = 143/332 (43%), Gaps = 62/332 (18%)
Query: 43 FMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQ---------- 92
+KS + +AD+F++PF IS+++ +
Sbjct: 55 LLKSVIRVQQQEEADIFYVPFFTT----------------ISYFLLEKQECKALYREALK 98
Query: 93 ---KYPYWNRTGGADHFYVACH-----SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHI- 143
P W R+ G DH H S+ RS + W + + + +++ G +
Sbjct: 99 WVTDQPAWQRSEGRDHVIPVHHPWSFKSVRRSVKKAIWLLP----DMDSTGNWYKPGQVY 154
Query: 144 AHKDVSLPQIWPRQEDPPKL---GSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEI 196
KDV LP + K SKR+ L FF G + +R KL++ ++ +I
Sbjct: 155 LEKDVILPYVPNVDLCDHKCVLETQSKRSILLFFRGRLKRNAGGKIRSKLVEELKSAKDI 214
Query: 197 YAHSGRL----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFA 252
G K D + S FCL G ++AR+ D++ GC+PVII++ +LPF
Sbjct: 215 VIEEGSTGAQGKAAAQDGMRKSFFCLSPAGDTPSSARLFDAIVSGCIPVIISDELELPFE 274
Query: 253 DVLNWKSFSIVVATLDIPL---LKKILKGISSEEYLLLQSYVLKVRKHFQWH-----VFP 304
+L+++ ++ V+ D L K L+GI+++ +QS ++K +HF + + P
Sbjct: 275 GILDYREIALFVSASDAVQPGWLLKYLRGINAKRIREIQSNLVKYSRHFLYSSPAQPLGP 334
Query: 305 SDYD----AFYMVMYELWLRRSSVRVQWSTSL 332
D A +V +L +RRS V+ S S+
Sbjct: 335 EDLTWRMIAGKLVNIKLQIRRSQRLVKESRSI 366
>gi|226496701|ref|NP_001141657.1| uncharacterized protein LOC100273782 [Zea mays]
gi|194705444|gb|ACF86806.1| unknown [Zea mays]
Length = 497
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 82/332 (24%), Positives = 143/332 (43%), Gaps = 62/332 (18%)
Query: 43 FMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQ---------- 92
+KS + +AD+F++PF IS+++ +
Sbjct: 172 LLKSVIRVQQQEEADIFYVPFFTT----------------ISYFLLEKQECKALYREALK 215
Query: 93 ---KYPYWNRTGGADHFYVACH-----SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHI- 143
P W R+ G DH H S+ RS + W + + + +++ G +
Sbjct: 216 WVTDQPAWQRSEGRDHVIPVHHPWSFKSVRRSVKKAIWLLP----DMDSTGNWYKPGQVY 271
Query: 144 AHKDVSLPQIWPRQEDPPKL---GSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEI 196
KDV LP + K SKR+ L FF G + +R KL++ ++ +I
Sbjct: 272 LEKDVILPYVPNVDLCDHKCVLETQSKRSILLFFRGRLKRNAGGKIRSKLVEELKSAKDI 331
Query: 197 YAHSGRL----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFA 252
G K D + S FCL G ++AR+ D++ GC+PVII++ +LPF
Sbjct: 332 VIEEGSTGAQGKAAAQDGMRKSFFCLSPAGDTPSSARLFDAIVTGCIPVIISDELELPFE 391
Query: 253 DVLNWKSFSIVVATLDIPL---LKKILKGISSEEYLLLQSYVLKVRKHFQWH-----VFP 304
+L+++ ++ V+ D L K L+GI+++ +QS ++K +HF + + P
Sbjct: 392 GILDYREIALFVSASDAVQPGWLLKYLRGINAKRIREIQSNLVKYSRHFLYSSPAQPLGP 451
Query: 305 SDYD----AFYMVMYELWLRRSSVRVQWSTSL 332
D A +V +L +RRS V+ S S+
Sbjct: 452 EDLTWRMIAGKLVNIKLQIRRSQRLVKESRSI 483
>gi|224062946|ref|XP_002300942.1| predicted protein [Populus trichocarpa]
gi|222842668|gb|EEE80215.1| predicted protein [Populus trichocarpa]
Length = 460
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 89/365 (24%), Positives = 157/365 (43%), Gaps = 46/365 (12%)
Query: 4 SFRVYVYPHRRNDPFANVL------LPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKAD 57
S ++YVY D +L + D +G + ++ ++S F T+ +AD
Sbjct: 86 SLKIYVYEEDEIDGLKELLRGRDGKISADACLKGQWGTQVKIHGLLLESRFRTRKKEEAD 145
Query: 58 LFFLPFSIARMRHDRRIGTEGIPD-FISHYIFNISQKYPYWNRTGGADHFYVACHSIGRS 116
LFF+P A ++ R +G G+ D I+H + + PY+ R+GG DH +V G +
Sbjct: 146 LFFVP---AYVKCVRMMG--GLNDKEINHTYVKVLSQMPYFRRSGGRDHIFVFPSGAG-A 199
Query: 117 AMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPK-------------- 162
+ ++W +N ++ + + KD S W P
Sbjct: 200 HLFRSWATYINRSIILTTEA----DRTDKKDTSAFNTWKDIIIPGNVEDGMTKRRIAMVQ 255
Query: 163 -LGSSKRNKLAFFAGAVNSPV-REKLLQVWRN-----DSEIYAHSGRLK---TPYADELL 212
L SKR LA + G V R KL+++ + +S SG K Y L
Sbjct: 256 PLPLSKRKYLANYLGRAQGKVGRLKLIELAKQYPDKLESPELKFSGPGKFGRMEYFQHLR 315
Query: 213 GSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPL- 271
+KFCL +G T R +S + CVPVI+++ + PF +V+++ SI + I L
Sbjct: 316 NAKFCLAPRGESSWTLRFYESFFVECVPVILSDQAEFPFQNVIDYTQISIKWPSTRIGLE 375
Query: 272 LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQWSTS 331
L + L+ I E ++ + R+ V+ ++++ + +W + VR Q+ S
Sbjct: 376 LLEYLESIPDEN---IEQMIAAGRQIRCLWVYAPEFESCSAMQGIMWELQRKVR-QFHQS 431
Query: 332 LDSNW 336
++ W
Sbjct: 432 AETFW 436
>gi|168016669|ref|XP_001760871.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687880|gb|EDQ74260.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 378
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 77/290 (26%), Positives = 132/290 (45%), Gaps = 29/290 (10%)
Query: 48 FVTKDPSKADLFFLPF--SIARMRHDRRIGTEGIPDFISHY-------IFNISQKYPYWN 98
F D AD+ F+PF S++ R + + + D I + ++ P W
Sbjct: 67 FRVADWRDADVIFVPFFASLSYNRFGKASEEKRLTDLIKDQNDVLQLKLVKFLEEQPAWK 126
Query: 99 RTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLP--QIW 154
+GG DH +V H ++MQ N++ +V + S +A+ KDV P +
Sbjct: 127 ASGGRDHVFVIHHP---NSMQATRNRLRNSLFIVSDFGRYDS-EVANIQKDVVAPYKHVI 182
Query: 155 PRQEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTP-YAD 209
P D R L FF GA+ +R +L ++ ++ + +G +
Sbjct: 183 P-TFDFDDSSFHTRKILLFFQGAIVRKEGGKIRHELYRLLKDKPGVRFTTGNTALDGFQS 241
Query: 210 ELLG---SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
+G SKFCL++ G ++ R+ DS+ CVPVII++ +LPF D L++ +F I + +
Sbjct: 242 ATIGMRSSKFCLNMAGDTPSSNRLFDSIVSHCVPVIISDDIELPFEDTLDYSNFCIFINS 301
Query: 267 ---LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
L + +L+ +S EE+ L + +L V HF++ DA MV
Sbjct: 302 SLALKPGYVINMLRNVSEEEWTQLWNQLLLVEHHFEYQHPTRKNDAVNMV 351
>gi|15241619|ref|NP_199306.1| Exostosin family protein [Arabidopsis thaliana]
gi|30694651|ref|NP_851132.1| Exostosin family protein [Arabidopsis thaliana]
gi|10177484|dbj|BAB10875.1| unnamed protein product [Arabidopsis thaliana]
gi|15081733|gb|AAK82521.1| AT5g44930/K21C13_11 [Arabidopsis thaliana]
gi|27363262|gb|AAO11550.1| At5g44930/K21C13_11 [Arabidopsis thaliana]
gi|332007793|gb|AED95176.1| Exostosin family protein [Arabidopsis thaliana]
gi|332007794|gb|AED95177.1| Exostosin family protein [Arabidopsis thaliana]
Length = 443
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 70/281 (24%), Positives = 126/281 (44%), Gaps = 24/281 (8%)
Query: 52 DPSKADLFFLPF--SIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVA 109
DP++ADLF++ S++ + R G + + + + + +W R G DH VA
Sbjct: 129 DPAEADLFYVSAFSSLSLIVDSGRPGFGYSDEEMQESLVSWLESQEWWRRNNGRDHVIVA 188
Query: 110 CHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH-KDVSLPQIWPRQEDPPKLGSSKR 168
+A+++ + NA+ +V + + KDV +P +LG +R
Sbjct: 189 GDP---NALKRVMDRVKNAVLLVTDFDRLRADQGSLVKDVIIPYSHRIDAYEGELGVKQR 245
Query: 169 NKLAFFAG----AVNSPVREKLLQVWRNDSEIYAHSGRLKT----PYADELLGSKFCLHV 220
L FF G VR+ L ++ + ++ G + SKFCLH+
Sbjct: 246 TNLLFFMGNRYRKDGGKVRDLLFKLLEKEEDVVIKRGTQSRENMRAVKQGMHTSKFCLHL 305
Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV---ATLDIPLLKKILK 277
G + R+ D++ CVPVI+++ +LPF DV++++ FSI + A L + K L+
Sbjct: 306 AGDTSSACRLFDAIASLCVPVIVSDGIELPFEDVIDYRKFSIFLRRDAALKPGFVVKKLR 365
Query: 278 GISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELW 318
+ + L Q + +VR++F DY + E+W
Sbjct: 366 KVKPGKILKYQKVMKEVRRYF-------DYTHLNGSVNEIW 399
>gi|30686300|ref|NP_850241.1| Exostosin family protein [Arabidopsis thaliana]
gi|50253448|gb|AAT71926.1| At2g35100 [Arabidopsis thaliana]
gi|53828615|gb|AAU94417.1| At2g35100 [Arabidopsis thaliana]
gi|110737872|dbj|BAF00874.1| hypothetical protein [Arabidopsis thaliana]
gi|330253970|gb|AEC09064.1| Exostosin family protein [Arabidopsis thaliana]
Length = 447
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 75/284 (26%), Positives = 124/284 (43%), Gaps = 27/284 (9%)
Query: 52 DPSKADLFFLP-FS----IARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHF 106
DP+ ADLF++P FS I G+ + + + + +W R G DH
Sbjct: 130 DPADADLFYVPVFSSLSLIVNAGRPVEAGSGYSDEKMQEGLVEWLEGQEWWRRNAGRDHV 189
Query: 107 YVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH-KDVSLPQIWPRQEDPPKLGS 165
A +A+ + + NA+ +V + KDV +P ++G
Sbjct: 190 IPAGDP---NALYRILDRVKNAVLLVSDFGRLRPDQGSFVKDVVIPYSHRVNLFNGEIGV 246
Query: 166 SKRNKLAFFAG----AVNSPVREKLLQVWRNDSEIYAHSGRL----KTPYADELLGSKFC 217
RN L FF G VR+ L QV + ++ G + + SKFC
Sbjct: 247 EDRNTLLFFMGNRYRKDGGKVRDLLFQVLEKEDDVTIKHGTQSRENRRAATKGMHTSKFC 306
Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV---ATLDIPLLKK 274
L+ G + R+ DS+ CVP+I+++ +LPF DV++++ FSI V A L L +
Sbjct: 307 LNPAGDTPSACRLFDSIVSLCVPLIVSDSIELPFEDVIDYRKFSIFVEANAALQPGFLVQ 366
Query: 275 ILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELW 318
+L+ I +++ L Q + VR++F DYD + E+W
Sbjct: 367 MLRKIKTKKILEYQREMKSVRRYF-------DYDNPNGAVKEIW 403
>gi|414880038|tpg|DAA57169.1| TPA: hypothetical protein ZEAMMB73_490377 [Zea mays]
Length = 497
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 82/332 (24%), Positives = 143/332 (43%), Gaps = 62/332 (18%)
Query: 43 FMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQ---------- 92
+KS + +AD+F++PF IS+++ +
Sbjct: 172 LLKSVIRVQQQEEADIFYVPFFTT----------------ISYFLLEKQECKALYREALK 215
Query: 93 ---KYPYWNRTGGADHFYVACH-----SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHI- 143
P W R+ G DH H S+ RS + W + + + +++ G +
Sbjct: 216 WVTDQPAWQRSEGRDHVIPVHHPWSFKSVRRSVKKAIWLLP----DMDSTGNWYKPGQVY 271
Query: 144 AHKDVSLPQIWPRQEDPPKL---GSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEI 196
KDV LP + K SKR+ L FF G + +R KL++ ++ +I
Sbjct: 272 LEKDVILPYVPNVDLCDHKCVLETQSKRSILLFFRGRLKRNAGGKIRSKLVEELKSAKDI 331
Query: 197 YAHSGRL----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFA 252
G K D + S FCL G ++AR+ D++ GC+PVII++ +LPF
Sbjct: 332 VIEEGSTGAQGKAAAQDGMRKSFFCLSPAGDTPSSARLFDAIVSGCIPVIISDELELPFE 391
Query: 253 DVLNWKSFSIVVATLDIPL---LKKILKGISSEEYLLLQSYVLKVRKHFQWH-----VFP 304
+L+++ ++ V+ D L K L+GI+++ +QS ++K +HF + + P
Sbjct: 392 GILDYREIALFVSASDAVQPGWLLKYLRGINAKRIREIQSNLVKYSRHFLYSSPAQPLGP 451
Query: 305 SDYD----AFYMVMYELWLRRSSVRVQWSTSL 332
D A +V +L +RRS V+ S S+
Sbjct: 452 EDLTWRMIAGKLVNIKLQIRRSQRLVKESRSI 483
>gi|356570361|ref|XP_003553358.1| PREDICTED: probable glycosyltransferase At3g07620-like [Glycine
max]
Length = 520
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 77/289 (26%), Positives = 137/289 (47%), Gaps = 38/289 (13%)
Query: 51 KDPSKADLFFLPF--SIARMR------HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGG 102
++ S+AD+ F+PF S+ R H++R + + + + Y+ + W R+GG
Sbjct: 202 RNSSEADVIFVPFFSSLCYNRLSKTGPHEKRSRNKLLQEKLVKYVTAQEE----WKRSGG 257
Query: 103 ADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLPQ---IWPRQ 157
DH +A H ++M A +KL + S +IA+ KDV P +
Sbjct: 258 KDHVILAHHP---NSMLDA-RMKLWPGTFILSDFGRYPTNIANVEKDVIAPYKHVVGSYD 313
Query: 158 EDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKT----PYAD 209
D S R L +F GA+ VR +L + +N+ +++ G ++ +
Sbjct: 314 NDQSSFDS--RTTLLYFQGAIYRKDGGHVRHELYYLLKNEKDVHFSFGSVQKGGVRKATE 371
Query: 210 ELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI 269
+ SKFCL++ G ++ R+ D++ CVPVII++ +LP+ DVL++ F I V T D
Sbjct: 372 GMRSSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDDIELPYEDVLDYSQFCIFVRTRDA 431
Query: 270 PLLKK-----ILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
LKK ++ I EE+ + + + +V F++ + DA M+
Sbjct: 432 --LKKRYLINFIRSIGKEEWTRMWNRLKEVESFFEFQFPSKEGDAVQMI 478
>gi|356497977|ref|XP_003517832.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
[Glycine max]
Length = 509
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 77/302 (25%), Positives = 145/302 (48%), Gaps = 34/302 (11%)
Query: 50 TKDPSKADLFFLPF--SIARMR------HDRRIGTEGIPDFISHYIFNISQKYPYWNRTG 101
++ S+AD+ F+PF S++ R H ++ + + + + Y+ + W R+G
Sbjct: 190 VQNSSEADIIFVPFFSSLSYNRYSKSKPHVKKSKNKILQEKLVTYLMAQEE----WKRSG 245
Query: 102 GADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLP---QIWPR 156
G DH +A H ++M A +KL + S +IA+ KDV P I
Sbjct: 246 GKDHLILAHHP---NSMLDA-RMKLWPATFILSDFGRYPPNIANVEKDVIAPYKHLISSY 301
Query: 157 QEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEI---YAHSGRLKTPYAD 209
D S R L +F GA+ R++L + +++ ++ + G+ A
Sbjct: 302 VNDNSNFDS--RPTLLYFQGAIYRKDGGLARQELFYLLKDEKDVHFSFGSIGKDGIKKAT 359
Query: 210 E-LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLD 268
E + SKFCL++ G ++ R+ D++ CVPVII++ +LP+ DV+++ F I V T D
Sbjct: 360 EGMRASKFCLNIAGDTPSSNRLFDAIASHCVPVIISDKIELPYEDVIDYSEFCIFVRTSD 419
Query: 269 I---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
L ++GI+ EE+ + + + +V F++H + DA M+ + + ++R
Sbjct: 420 AIKEKFLINFIRGIAKEEWTRMWNKLKEVEHFFEFHFPSKENDAVQMIWQAVARKVPAMR 479
Query: 326 VQ 327
++
Sbjct: 480 LK 481
>gi|297839341|ref|XP_002887552.1| F25A4.34 [Arabidopsis lyrata subsp. lyrata]
gi|297333393|gb|EFH63811.1| F25A4.34 [Arabidopsis lyrata subsp. lyrata]
Length = 458
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 69/276 (25%), Positives = 130/276 (47%), Gaps = 28/276 (10%)
Query: 51 KDPSKADLFFLPF--SIARMRHDRRIGTEGIPD--FISHYIFNISQKYPYWNRTGGADHF 106
K+ ++AD+ F+PF S++ R + G E I + + + W R G DH
Sbjct: 148 KNSNEADIVFVPFFASLSYNRKSKLRGNETISGDRLLQERLVEFLKSQDEWKRFDGKDHL 207
Query: 107 YVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIA-HKDVSLP-----QIWPRQEDP 160
+A H +++ A +A+ V+ + S + KD+ P + E
Sbjct: 208 IIAHHP---NSLLYAKNFLGSAMFVLSDFGRYSSANANLEKDIIAPYLHVVKTISNNESA 264
Query: 161 PKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----LL 212
P KR LA+F GA+ +R++L + R++ +++ G ++ + +
Sbjct: 265 P---FEKRPVLAYFQGAIYRKDGGTIRQELYNLLRDEKDVHFAFGTVRRNGTKQTGKGMA 321
Query: 213 GSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV---ATLDI 269
SKFCL++ G ++ R+ D++ CVPVII++ +LPF D L++ FS+ V +
Sbjct: 322 SSKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDQIELPFEDSLDYSGFSVFVHASEAVKK 381
Query: 270 PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPS 305
L +L+GI+ +++ + +V F++ FPS
Sbjct: 382 GFLVNLLRGITEDQWKKKWGRLKEVAGCFEYR-FPS 416
>gi|384251887|gb|EIE25364.1| exostosin-like glycosyltransferase [Coccomyxa subellipsoidea C-169]
Length = 705
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 86/336 (25%), Positives = 133/336 (39%), Gaps = 65/336 (19%)
Query: 32 NYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRH-------DRRIGTEGIPDFIS 84
Y E + ++S T DP AD F++P + H G P +
Sbjct: 316 TYQIEPALHEMLLQSPHRTLDPEAADFFYVPVYTSCFIHPVYGWADTPWFHNPGSPRVMH 375
Query: 85 HYIFNISQK------YPYWNRTGGADHFYVACHSIGR---------SAMQKAWEVKL--- 126
+ K PYWNRTGG DH ++ H G S + W K
Sbjct: 376 AATMMLEAKRWLETELPYWNRTGGRDHIWLISHDEGSCWAPSEIRSSIILSHWGRKALDH 435
Query: 127 -------------NAIQVVCSS---SYFISGHIAH---KDVSLPQIWP--RQEDPPKLGS 165
NA+ + I GH + KD+ +P P R P G+
Sbjct: 436 ESYSAYPFDNYSDNAVHPEWRPHGWRHIIEGHPCYDPDKDLIIPAFVPPARIVPSPLTGA 495
Query: 166 SK--RNKLAFFAGAV--------NSPVREKLL-----QVWRNDSEIYAHSGRLKTP--YA 208
+ R L FF G V + +R+++ Q WR I+ + + TP Y+
Sbjct: 496 REDPRPLLLFFRGDVGLNRRPHYSRGIRQRIYALSKEQRWREKYRIWIGT-KEDTPGGYS 554
Query: 209 DELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLD 268
+ L SKFCL V G + + R D++ +GCVPV++ + D F +L+W+ F++ + +
Sbjct: 555 ELLSSSKFCLVVPG-DGWSPRAEDAMLHGCVPVVVNDGVDQVFETLLDWEEFAVRIPERE 613
Query: 269 IPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFP 304
+ L +IL IS LQ V +V F + P
Sbjct: 614 MEFLPEILLSISPSRLQQLQKGVRRVWHRFMYRALP 649
>gi|307110230|gb|EFN58466.1| hypothetical protein CHLNCDRAFT_140476 [Chlorella variabilis]
Length = 632
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 78/341 (22%), Positives = 137/341 (40%), Gaps = 74/341 (21%)
Query: 32 NYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDF---ISHYIF 88
YA E+ F + ++S T DP +AD F++P + + R G + + DF +SH
Sbjct: 235 TYALEAGFLEMLLQSEHRTLDPEEADFFYVPVFTSCFINPVRDGADSLRDFFYGVSHNRV 294
Query: 89 N------------ISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCS-- 134
+ +PYW R GG DH ++ H + A + + ++ S
Sbjct: 295 QGAANMLLEAYHWVQAMFPYWERRGGRDHIWLVTHDEASCWVPAA----IRSTSIILSHW 350
Query: 135 --------SSYFISGHIAHKDVSLPQIWPRQ-----------EDP--------------- 160
S S + DV+ PQ P DP
Sbjct: 351 GRMDAHHTSGTGYSADVYSNDVTHPQFEPDGFLGKLNLTQPCYDPVKDLVVPLMKTPEHY 410
Query: 161 ---PKLGSSKRNK--LAFFAGAV---NSP----VREKLLQVWRNDSEIYAHS------GR 202
P +G+ R + LAF G V N P +R++L + + H
Sbjct: 411 RLSPLVGAPPRQRTWLAFHRGRVQADNPPYSRGIRQRLAKAAAEGGWLEKHKIAVGEYDT 470
Query: 203 LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
L+ Y++ L S FC + G + +AR+ D++ +GC+PV+I + + F V++ +F+I
Sbjct: 471 LQGDYSELLASSVFCPVIPG-DGWSARMDDAMLHGCIPVLIMDEVQVSFESVVDLSTFTI 529
Query: 263 VVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVF 303
+ D L IL+ ++ E +Q + +V + F + +
Sbjct: 530 RIPEADAEKLPDILQAVTQERREEMQRALARVWQRFTYSSY 570
>gi|145352087|ref|XP_001420390.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580624|gb|ABO98683.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 517
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 47/116 (40%), Positives = 69/116 (59%), Gaps = 6/116 (5%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L SK+CL+V G +TAR+ D + +GCVPVI+A+ YDLPF+ + +W FS+ V
Sbjct: 401 YMTMLSKSKYCLYVYGDRAHTARLYDIITFGCVPVIVADGYDLPFSWLFDWSKFSVRVLE 460
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY-DAFYMVMYELWLRR 321
D+ L IL +Y L+ ++KV FQ+H S + DAF++ M L +RR
Sbjct: 461 DDVATLPSIL---DRADYDSLRRELVKVHSFFQYHNRGSIFGDAFWITM--LGVRR 511
>gi|168017876|ref|XP_001761473.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687479|gb|EDQ73862.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 67/252 (26%), Positives = 115/252 (45%), Gaps = 24/252 (9%)
Query: 30 RGNYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFN 89
RG + ++ K MKS + T D S+A+ FF+P + +R + + + + +
Sbjct: 47 RGQWGTQVKIHKLLMKSRYRTLDKSRANFFFVPVYVKCVRIFGGLNEKEV----NEHFLK 102
Query: 90 ISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVC-----SSSYFISGHIA 144
I ++ PY++R+GG DH +V G + + K W LN + + S
Sbjct: 103 ILRQMPYFHRSGGRDHIFVFPSGAG-AHLVKGWPNFLNRSIFLTPEGDRTDKKAFSSFNT 161
Query: 145 HKDVSLPQI-----WPRQEDPPKLGSSKRNKLAFFAG-AVNSPVREKLLQVWRN-----D 193
KD+ +P P L SKR +A + G A R +L+++ + D
Sbjct: 162 WKDIIIPGNVDIINHPSNSATSPLPLSKRKYVANYLGRAQGKKGRLQLIELAKQFPAELD 221
Query: 194 SEIYAHSGRLK---TPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLP 250
+ A G K Y + L +KFCL +G T R ++ + CVPVI+++ +LP
Sbjct: 222 APELAFQGSAKLGRIEYYNRLRNAKFCLAPRGESSWTLRFYEAFFVECVPVILSDEIELP 281
Query: 251 FADVLNWKSFSI 262
+ +VL++ FSI
Sbjct: 282 YQNVLDYSGFSI 293
>gi|356506196|ref|XP_003521873.1| PREDICTED: probable glucuronosyltransferase GUT1-like [Glycine max]
Length = 505
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 76/290 (26%), Positives = 138/290 (47%), Gaps = 39/290 (13%)
Query: 51 KDPSKADLFFLPF--SIARMR-------HDRRIGTEGIPDFISHYIFNISQKYPYWNRTG 101
++ S++D+ F+PF S+ R H++R + + + + Y+ + W R+G
Sbjct: 196 RNSSESDVVFVPFFSSLCYNRFSSKTNPHEKRSMNKVLQEKLVKYVTEQEE----WKRSG 251
Query: 102 GADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLPQ---IWPR 156
G DH VA H ++M A +KL + S +IA+ KDV P +
Sbjct: 252 GKDHVIVAHHP---NSMLDA-RMKLWPGTFILSDFGRYPTNIANVEKDVIAPYKHVVGSY 307
Query: 157 QEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTP----YA 208
D S R L +F GA+ VR +L + +N+ +++ G ++ A
Sbjct: 308 DNDQSSFDS--RPTLLYFQGAIYRKDGGHVRHELYYLVKNEKDVHFSFGNVEKGGVRNAA 365
Query: 209 DELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLD 268
+ + SKFCL++ G ++ R+ D++ CVPVII++ +LP+ DV+++ F + V T D
Sbjct: 366 EGMRSSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDEIELPYEDVIDYSQFCVFVRTRD 425
Query: 269 IPLLKK-----ILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
LKK ++ I EE+ + + + +V F++ + DA M+
Sbjct: 426 A--LKKRYLINFIRSIGKEEWTRMWNRLKEVESFFEFQFPSKEGDAVQMI 473
>gi|307103001|gb|EFN51266.1| hypothetical protein CHLNCDRAFT_141180 [Chlorella variabilis]
Length = 404
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 72/294 (24%), Positives = 132/294 (44%), Gaps = 37/294 (12%)
Query: 39 FKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWN 98
F + + S T+DPS+A+LFF+P +++ + + D ++ +I + YPYWN
Sbjct: 90 FIEQLLMSPVRTEDPSEANLFFVP-ALSWSYGGNALNAVHL-DLVADHI---ASHYPYWN 144
Query: 99 RTGGADHFY------VACHSIGRS-AMQKAWEVKLNAIQVV------CSSSYFISGHIAH 145
R+ G DH + AC GR+ A K LN I + +++ + +
Sbjct: 145 RSQGRDHIFWLTNDRGACALTGRTEAAIKLTHFGLNTINISVGWGPGAATNPENACYNPL 204
Query: 146 KDVSLPQIWPRQEDPPKLGS--------SKRNKLAFFAGAVNS---------PVREKLLQ 188
+DV P + ++ + + L FF+GAV++ + +L++
Sbjct: 205 RDVVAPPFDDMARELMEVSRKLSVEDIIAAKTSLFFFSGAVSNDSEYSGNTRQLLRELVK 264
Query: 189 VWRNDSEIYAHSGRLK-TPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHY 247
W + I+ G Y L SKFC V G+ R+ ++ G VP++I
Sbjct: 265 RWNDPEIIFETEGDTGLGDYVKRLRASKFCPAVFGYGFGM-RLLTCVFSGSVPLVIQERV 323
Query: 248 DLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
P D+L +++FS+ + +P L +IL+ I+ ++Y L +++ R F W
Sbjct: 324 AQPLEDLLPYETFSLRLNNGHLPDLPRILRSITDQQYQRLVQGLVRYRDAFHWE 377
>gi|60657600|gb|AAX33321.1| secondary cell wall-related glycosyltransferase family 47 [Populus
tremula x Populus tremuloides]
Length = 509
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 76/302 (25%), Positives = 145/302 (48%), Gaps = 34/302 (11%)
Query: 50 TKDPSKADLFFLPF--SIARMR------HDRRIGTEGIPDFISHYIFNISQKYPYWNRTG 101
++ S+AD+ F+PF SI+ R H ++ + + + + ++ SQK W R+G
Sbjct: 190 VQNSSEADVIFVPFFSSISYNRYSKVNPHQKKSKNKSLEEKLVKFV--TSQKE--WKRSG 245
Query: 102 GADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLPQ---IWPR 156
G DH +A H ++M A +KL + + S +IA+ KDV P I
Sbjct: 246 GRDHIILAHHP---NSMLYA-RMKLWTAMFILADFGRYSPNIANVGKDVIAPYKHVIKSY 301
Query: 157 QEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLK----TPYA 208
D S R L +F GA+ R++L +++ +++ G ++ + +
Sbjct: 302 ANDSSNFDS--RPTLLYFQGAIYRKDGGFARQELFYALKDEKDVHFQFGSVQKDGVSKAS 359
Query: 209 DELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLD 268
+ SKFCL++ G ++ R+ D++ CVPVII++ +LP+ DVL++ F I V T D
Sbjct: 360 QGMHSSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDDIELPYEDVLDYSQFCIFVRTSD 419
Query: 269 I---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
L +++ I +E+ + + +V F++ + DA M+ + + ++R
Sbjct: 420 AVREKFLINLVRSIKKDEWTRMWQRLKEVENFFEFQYPSKEGDAVQMIWQAVARKVPAIR 479
Query: 326 VQ 327
++
Sbjct: 480 LK 481
>gi|159479930|ref|XP_001698039.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158273838|gb|EDO99624.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 810
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 92/350 (26%), Positives = 143/350 (40%), Gaps = 95/350 (27%)
Query: 32 NYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIA---------------------RMRH 70
Y E+ + ++S T DP +AD F++P I R+ H
Sbjct: 413 TYGIETLMHELMLQSEHRTFDPEEADFFYVPMYITCYFWPILGWADGPWWHAPNGLRVMH 472
Query: 71 DRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKA--------- 121
+ TE + D++ K PYW+R GG DH ++ G M KA
Sbjct: 473 GANMITE-LHDWLR-------TKLPYWDRRGGRDHIWLMAADEGACWMPKAVYDTSIVLT 524
Query: 122 -W-----EVKLNAIQV-----VCSSSYF-----------ISGHIA---HKDVSLPQI--- 153
W E K N + S F I GH KD+ +P
Sbjct: 525 HWGRLDPEHKSNTAYLQDNYTAKPESAFDAWRGVDFGDRIKGHPCFDPRKDLVVPAFKSP 584
Query: 154 --WPRQEDPPKLGSS--KRNKLAFFAGAVNSP--------VREKLLQV-----WRNDSEI 196
+PR P +G+ +R+ L FF G V + +R++L + W N +I
Sbjct: 585 NHFPRS---PLIGAPPLERDLLLFFRGDVGASRLPHYSRGIRQRLFHLAHKHDWYNRFKI 641
Query: 197 YAHSG-RLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVL 255
SG LK Y+++L SKFCL G + + R D++ +GC+PV++ + F +L
Sbjct: 642 AIGSGDSLKGDYSEQLARSKFCLVAPG-DGWSPRAEDAILHGCIPVVVMDGVQAVFESIL 700
Query: 256 NWKSFSIVVATLDIPL--LKKILKGISSEEYLLLQSYVLKVRKHFQWHVF 303
+W SFS+ + D L L ++L IS E +Q ++ +V WH F
Sbjct: 701 DWDSFSLRIREDDAALEALPQLLASISPERLAHMQRHLARV-----WHRF 745
>gi|224123938|ref|XP_002319201.1| predicted protein [Populus trichocarpa]
gi|222857577|gb|EEE95124.1| predicted protein [Populus trichocarpa]
Length = 450
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 76/302 (25%), Positives = 145/302 (48%), Gaps = 34/302 (11%)
Query: 50 TKDPSKADLFFLPF--SIARMR------HDRRIGTEGIPDFISHYIFNISQKYPYWNRTG 101
++ S+AD+ F+PF SI+ R H ++ + + + + ++ SQK W R+G
Sbjct: 151 VQNSSEADVIFVPFFSSISYNRYSKVNPHQKKSKNKSLEEKLVKFV--TSQKE--WKRSG 206
Query: 102 GADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLPQ---IWPR 156
G DH +A H ++M A +KL + + S +IA+ KDV P I
Sbjct: 207 GRDHIILAHHP---NSMLYA-RMKLWTAMFILADFGRYSPNIANVGKDVIAPYKHVIKSY 262
Query: 157 QEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLK----TPYA 208
D S R L +F GA+ R++L +++ +++ G ++ + +
Sbjct: 263 ANDSSNFDS--RPTLLYFQGAIYRKDGGFARQELFYALKDEKDVHFQFGSVQKDGVSKAS 320
Query: 209 DELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLD 268
+ SKFCL++ G ++ R+ D++ CVPVII++ +LP+ DVL++ F I V T D
Sbjct: 321 QGMHSSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDDIELPYEDVLDYSQFCIFVRTSD 380
Query: 269 I---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
L +++ I +E+ + + +V F++ + DA M+ + + ++R
Sbjct: 381 AVREKFLINLVRSIKKDEWTRMWQRLKEVENFFEFQYPSKEGDAVQMIWQAVARKVPAIR 440
Query: 326 VQ 327
++
Sbjct: 441 LK 442
>gi|125547211|gb|EAY93033.1| hypothetical protein OsI_14833 [Oryza sativa Indica Group]
Length = 429
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 70/287 (24%), Positives = 128/287 (44%), Gaps = 30/287 (10%)
Query: 2 NRSFRVYVYPHRRNDPFANVLLPVDFEP------RGNYASESYFKKAFMKSHFVTKDPSK 55
+R R+YVY D +L + + +G + ++ + + S F T D +
Sbjct: 52 HRDLRIYVYAEDEVDGLRALLRGRNDDVTAATCIKGQWGTQVKIHQLLLSSRFRTFDKDE 111
Query: 56 ADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGR 115
ADLFF+P + +R ++ + I Y+ + + PY+ R+GG DH +V G
Sbjct: 112 ADLFFVPTYVKCVRMTGKLNDKEIN---QTYVKVVLSQMPYFRRSGGRDHIFVFPSGAG- 167
Query: 116 SAMQKAWEVKLNAIQVVC-----SSSYFISGHIAHKDVSLP------QIWPRQEDPPKLG 164
+ + ++W LN ++ + IS KD+ +P + + +
Sbjct: 168 AHLFRSWATFLNRSIILTPEGDRTDKRGISAFNTWKDIIIPGNVDDSMVKSDRLAVKPIP 227
Query: 165 SSKRNKLAFFAGAVNSPV-REKLLQVWRN-DSEIYAHSGRLKTP-------YADELLGSK 215
+KR LA F G V R +L+++ + ++ + +L P Y L +K
Sbjct: 228 LTKRKYLANFLGRAQGKVGRLQLVKLAKQYPDKLESPELKLSGPDKLGRIDYFKHLRNAK 287
Query: 216 FCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
FCL +G T R +S + CVPVI+++ +LPF +V+++ SI
Sbjct: 288 FCLAPRGESSWTLRFYESFFVECVPVILSDEVELPFQNVIDYTEVSI 334
>gi|22330483|ref|NP_176908.2| exostosin-like protein [Arabidopsis thaliana]
gi|115311405|gb|ABI93883.1| At1g67410 [Arabidopsis thaliana]
gi|332196520|gb|AEE34641.1| exostosin-like protein [Arabidopsis thaliana]
Length = 430
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 70/285 (24%), Positives = 130/285 (45%), Gaps = 27/285 (9%)
Query: 52 DPSKADLFFLPF--SIARMRHDRRIGTEGIPDF---ISHYIFNISQKYPYWNRTGGADHF 106
DP AD+F++PF S++ H + + T+ +F + + + YWNR+GG DH
Sbjct: 122 DPDLADVFYVPFFSSLSFNTHGKNM-TDPDTEFDRLLQVELMEFLENSKYWNRSGGKDHV 180
Query: 107 YVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWP----RQEDPPK 162
H +++ +V + + VV Y KDV P + +E
Sbjct: 181 IPMTHPNAFRFLRQ--QVNASILIVVDFGRYSKDMARLSKDVVSPYVHVVESLNEEGDDG 238
Query: 163 LGS--SKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIY-----AHSGRLKTPYADEL 211
+G R L +F G +R +L ++ +S+++ A + +K + +
Sbjct: 239 MGDPFEARTTLLYFRGNTVRKDEGKIRLRLEKLLAGNSDVHFEKSVATTQNIKVS-TEGM 297
Query: 212 LGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI---VVATLD 268
SKFCLH G ++ R+ D++ C+PVII++ +LPF D +++ FS+ + +L+
Sbjct: 298 RSSKFCLHPAGDTPSSCRLFDAIVSHCIPVIISDKIELPFEDEIDYSEFSLFFSIKESLE 357
Query: 269 IPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
+ L+ E++L + + V HF++ P DA M+
Sbjct: 358 PGYILNNLRQFPKEKWLEMWKRLKNVSHHFEFQYPPKREDAVNML 402
>gi|21592312|gb|AAM64263.1| unknown [Arabidopsis thaliana]
Length = 273
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 66/118 (55%), Gaps = 15/118 (12%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEP--RGNYASESYFKKAF-MKSHFVTKDPSKAD 57
M +VY+YP P+ EP G YASE +F K + FVTK+P KA
Sbjct: 155 MELILKVYIYPDGDK--------PIFHEPHLNGIYASEGWFMKLMESNTQFVTKNPEKAH 206
Query: 58 LFFLPFSIARMRHDRRI-GTEGIPD---FISHYIFNISQKYPYWNRTGGADHFYVACH 111
LF++P+S+ +++ + G+ I F+ Y+ +S KYP+WNRT G+DHF VACH
Sbjct: 207 LFYMPYSVKQLQKSIFVPGSHNIKPLSIFLRDYVNMLSIKYPFWNRTHGSDHFLVACH 264
>gi|15228598|ref|NP_187015.1| exostosin-like protein [Arabidopsis thaliana]
gi|6091757|gb|AAF03467.1|AC009327_6 hypothetical protein [Arabidopsis thaliana]
gi|30102722|gb|AAP21279.1| At3g03650 [Arabidopsis thaliana]
gi|110736553|dbj|BAF00242.1| hypothetical protein [Arabidopsis thaliana]
gi|332640446|gb|AEE73967.1| exostosin-like protein [Arabidopsis thaliana]
Length = 499
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 73/288 (25%), Positives = 136/288 (47%), Gaps = 34/288 (11%)
Query: 50 TKDPSKADLFFLPF--SIARMR------HDRRIGTEGIPDFISHYIFNISQKYPYWNRTG 101
K+ S+AD+ F+PF S++ R ++ + + + + Y+ SQK W +G
Sbjct: 194 VKNSSEADVVFVPFFSSLSYNRFSKVNQKQKKSQDKELQENVVKYV--TSQKE--WKTSG 249
Query: 102 GADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPP 161
G DH +A H ++M A A+ VV + S H+A+ D + + P + P
Sbjct: 250 GKDHVIMAHHP---NSMSTARHKLFPAMFVVADFGRY-SPHVANVDKDI--VAPYKHLVP 303
Query: 162 KL-----GSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLK----TPYA 208
G R L +F GA+ VR++L + + + +++ G ++ +
Sbjct: 304 SYVNDTSGFDGRPILLYFQGAIYRKAGGFVRQELYNLLKEEKDVHFSFGSVRNHGISKAG 363
Query: 209 DELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLD 268
+ + SKFCL++ G ++ R+ D++ C+PVII++ +LP+ DVLN+ F + V + D
Sbjct: 364 EGMRSSKFCLNIAGDTPSSNRLFDAIASHCIPVIISDDIELPYEDVLNYNEFCLFVRSSD 423
Query: 269 I---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
L +++ I EEY + + +V ++F D + Y V
Sbjct: 424 ALKKGFLMGLVRSIGREEYNKMWLRLKEVERYFDLRFPVKDDEGDYAV 471
>gi|225457795|ref|XP_002265632.1| PREDICTED: probable glucuronosyltransferase Os03g0107900 [Vitis
vinifera]
gi|302142755|emb|CBI19958.3| unnamed protein product [Vitis vinifera]
Length = 459
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 86/348 (24%), Positives = 148/348 (42%), Gaps = 46/348 (13%)
Query: 4 SFRVYVYPHRRNDPFANVL------LPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKAD 57
S ++YVY D ++L +P + G + ++ + +KS F T+ +AD
Sbjct: 85 SLKIYVYEEDEIDGLKSLLYGRDGSIPTEVCVTGQWGTQVKIHRLLLKSRFRTRRKEEAD 144
Query: 58 LFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSA 117
LFF+P I +R + + I Y+ +SQ PY+ +GG +H +V G
Sbjct: 145 LFFVPTYIKCVRMKGGLNDKEIDQM---YVKVLSQ-MPYFRLSGGRNHIFVFPSGAG-PH 199
Query: 118 MQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS------------ 165
+ K+W LN ++ KD S W P +
Sbjct: 200 LFKSWATYLNRSIILTPEG----DRTDKKDTSAFNTWKDIIIPGNVADEMTTNGATFVQP 255
Query: 166 ---SKRNKLAFFAGAVNSPV-REKLLQVWRN-----DSEIYAHSGRLK---TPYADELLG 213
SKR LA F G + R +L+++ + +S SG K Y L
Sbjct: 256 LPLSKRKFLANFLGRAQRKLGRLQLIELAKQYPDKLESPELQFSGPDKLGRIEYFHHLRN 315
Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI-PLL 272
+KFC +G T R +S + CVPVI+++ +LPF +V+++ SI + I P L
Sbjct: 316 AKFCFAPRGESSWTLRFYESFFVECVPVILSDQVELPFQNVIDYTQVSIKWPSSQIGPQL 375
Query: 273 KKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYD---AFYMVMYEL 317
+ L+ I + +++ + + R+ W V+ S+ + A +M+EL
Sbjct: 376 LEYLESIPDK---VIEEMISRGREVRCWWVYASESEPCSAMRGIMWEL 420
>gi|297841429|ref|XP_002888596.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297334437|gb|EFH64855.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 429
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 71/285 (24%), Positives = 129/285 (45%), Gaps = 27/285 (9%)
Query: 52 DPSKADLFFLPF--SIARMRHDRRIGTEGIPDF---ISHYIFNISQKYPYWNRTGGADHF 106
DP AD F++PF S++ H + + T+ +F + + + YWNR+GG DH
Sbjct: 121 DPDLADAFYVPFFSSLSFNTHGKNM-TDPDTEFDRQLQVELMEFLEGSEYWNRSGGKDHV 179
Query: 107 YVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWP----RQEDPPK 162
H +++ +V + + VV Y KDV P + +ED
Sbjct: 180 IPMTHPNAFRFLRQ--QVNASILIVVDFGRYAKDMARLSKDVVSPYVHVVESLNEEDDDG 237
Query: 163 LGS--SKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIY-----AHSGRLKTPYADEL 211
L R L +F G +R +L ++ +S+++ A + +K + +
Sbjct: 238 LTDPFEARTTLLYFRGNTVRKDEGKIRLRLEKLLAGNSDVHFEKSVATTQNIKVS-TEGM 296
Query: 212 LGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI---VVATLD 268
SKFCLH G ++ R+ D++ C+PVII++ +LPF D +++ FS+ + +L+
Sbjct: 297 RSSKFCLHPAGDTPSSCRLFDAIVSHCIPVIISDKIELPFEDEIDYSEFSLFFSIKESLE 356
Query: 269 IPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
+ L+ E++L + + V HF++ P DA M+
Sbjct: 357 PGYILNKLRQFPKEKWLEMWKRLKNVSHHFEFQYPPKREDAVNML 401
>gi|326491429|dbj|BAJ94192.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 400
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 81/326 (24%), Positives = 140/326 (42%), Gaps = 31/326 (9%)
Query: 5 FRVYVYPHRRNDPFANVLLPVDFEP------RGNYASESYFKKAFMKSHFVTKDPSKADL 58
RVYVY D +L D +G + ++ + ++S + T D +A+L
Sbjct: 54 LRVYVYAEDEVDGLRALLRGRDGAVSAATCLKGQWGTQVKIHQLLLRSRYRTLDKDEANL 113
Query: 59 FFLPFSIARMRHDRRIGTEGIPD-FISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSA 117
FF+P + +R T G+ D I+ + + PY+ R+GG DH +V G +
Sbjct: 114 FFVPSYVKCVRM-----TGGLTDKEINQTYVKVLSQMPYFRRSGGRDHIFVFPSGAG-AH 167
Query: 118 MQKAWEVKLNAIQVVC-----SSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLA 172
+ ++W LN ++ + IS KD+ +P +D +G R KL
Sbjct: 168 LFRSWATFLNRSIILTPEGDRTDKRGISAFNTWKDIIIPG---NVDD--SMGKVGRLKLV 222
Query: 173 FFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIAD 232
A + L++ D GR+ Y L +KFCL +G T R +
Sbjct: 223 ELAKQYPDKLESPELKLSGPDK-----LGRID--YFKHLRNAKFCLAPRGESSWTLRFYE 275
Query: 233 SLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI-PLLKKILKGISSEEYLLLQSYV 291
S + CVPV++++ +LPF +V+++ SI I P L + L+ I E + +
Sbjct: 276 SFFVECVPVLLSDEVELPFQNVIDYTKISIKWPASKIGPELFQYLESIPEERIEEMIARG 335
Query: 292 LKVRKHFQWHVFPSDYDAFYMVMYEL 317
+VR + + + A +M+EL
Sbjct: 336 REVRCMWVYALDTEPCSAMTAIMWEL 361
>gi|326432404|gb|EGD77974.1| hypothetical protein PTSG_09607 [Salpingoeca sp. ATCC 50818]
Length = 377
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 71/277 (25%), Positives = 118/277 (42%), Gaps = 10/277 (3%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPFSI------ARMRHDRRIGTEGIPDFISHY 86
+ASE +A S T D ADL + + + D R E P +
Sbjct: 52 FASELLLPQAIRNSALHTDDAEAADLILIQYCVMIWGMQGEKMMDIRPLLEKRPVLARRF 111
Query: 87 IFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKL-NAIQVVCSSSYFISGHIAH 145
N ++ + G H + H GR +++ + L NA ++ S +
Sbjct: 112 QQNPARFLVVLSSDHGPCHNFNE-HLTGRRLVKRWIDRSLENATLLMNDGSLVNKCYRPG 170
Query: 146 KDVSLP-QIWPRQED-PPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRL 203
KDV +P W + R AFFAGA +S +RE ++ N+ ++
Sbjct: 171 KDVVIPPSTWIGNATFACSRPITDRKHFAFFAGAASSLIREYIINELGNEDWLFIPHDLQ 230
Query: 204 KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIV 263
Y E+ + FCL +G + R+ ++L GC+PVIIA+ PF DVL++ +F++
Sbjct: 231 HEEYMCEMGNAVFCLAPRGRAAWSPRLVEALEAGCIPVIIADMNHEPFHDVLDYSTFTVQ 290
Query: 264 VATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
V + L + L ISS + L + + R HF++
Sbjct: 291 VHEDKLETLGEQLHSISSGQVARLHANGQRARAHFRY 327
>gi|159480086|ref|XP_001698117.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158273916|gb|EDO99702.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 821
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 82/340 (24%), Positives = 144/340 (42%), Gaps = 68/340 (20%)
Query: 32 NYASESYFKKAFMKSHFVTKDPSKADLFFLP------------FSIARMRHDRRIGTEGI 79
Y+ E+ F + ++S T DP +AD F++P ++ A H + + +
Sbjct: 390 TYSVETMFHELLLQSEHRTFDPEEADFFYVPHYVTCYFWPIMGWADAPWWHAPYVDSRPM 449
Query: 80 --PDFISHYIFNISQKYPYWNRTGGADHFYV------AC--------------------- 110
+ ++ + PYW+R GG DH ++ AC
Sbjct: 450 HGANMLTELHGWLRTNLPYWDRRGGRDHIWLMAADEGACWMPTAIYNTSIVLTHWGRLEA 509
Query: 111 -HSIGRSAMQKAWEVKLNAIQVVCSSSYF--ISGHIA---HKDVSLPQIWP--RQEDPPK 162
H+ G + +Q ++ + Q Y I GH KD+ +P P P
Sbjct: 510 NHTSGTAYLQDVYDRPVYGFQRWPGVDYHHDIEGHPCFDPKKDLVIPAFKPPFHFARSPL 569
Query: 163 LGSS--KRNKLAFF---AGAVNSP-----VREKLL-----QVWRNDSEIY-AHSGRLKTP 206
LG+ +R+ L +F +GA P +R+++ Q W N +I +H G +
Sbjct: 570 LGAPPLQRDILLYFRGDSGAFRLPQYSRGIRQRITDLSNRQDWFNRYKIVISHGGMVGGD 629
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y++ L SKFCL G + + R D++ +GC+PV++ + F +L+W SFS+ +
Sbjct: 630 YSEHLARSKFCLVAPG-DGWSPRAEDAILHGCIPVVVMDGVQAVFESILDWDSFSLRIRE 688
Query: 267 LDIPL--LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFP 304
D L L ++L IS E +Q ++ +V F + P
Sbjct: 689 DDAALEALPQLLASISPERLAHMQRHLARVWHRFAYTTGP 728
>gi|255542540|ref|XP_002512333.1| catalytic, putative [Ricinus communis]
gi|223548294|gb|EEF49785.1| catalytic, putative [Ricinus communis]
Length = 434
Score = 77.4 bits (189), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 70/286 (24%), Positives = 120/286 (41%), Gaps = 24/286 (8%)
Query: 52 DPSKADLFFLPF----SIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFY 107
DP KAD FF+PF S H I + + ++ K YW ++GG DH
Sbjct: 127 DPEKADAFFVPFFSSLSFNTHGHTMTDPETEIDRQLQVDVIDMLYKSKYWQKSGGRDHVI 186
Query: 108 VACHSIGRSAMQKAWEVKLNAIQVVCS--SSYFISGHIAHKDVSLPQIW---PRQEDPPK 162
H +++ +LNA ++ + Y S KDV P + +D
Sbjct: 187 PMTHPNAFRFLRQ----QLNASILIVADFGRYPKSMSTLSKDVVAPYVHVVDSFTDDEVS 242
Query: 163 LGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIY----AHSGRLKTPYADELLGS 214
R L FF G VR KL ++ +I+ + + + + S
Sbjct: 243 NPFESRTTLLFFRGNTIRKDEGKVRAKLAKILTGYDDIHFERSSATAETIKASTEGMRSS 302
Query: 215 KFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI---VVATLDIPL 271
KFCLH G ++ R+ D++ CVPVI+++ +LP+ D +++ FS+ V +
Sbjct: 303 KFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPYEDEIDYSQFSVFFSVNEAIQPGY 362
Query: 272 LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
+ L+ + E +L + + + HF++ P DA M+ E+
Sbjct: 363 MVDQLRQLPKERWLEMWRKLKSISHHFEFQYPPEKEDAVDMLWREV 408
>gi|242055419|ref|XP_002456855.1| hypothetical protein SORBIDRAFT_03g044100 [Sorghum bicolor]
gi|241928830|gb|EES01975.1| hypothetical protein SORBIDRAFT_03g044100 [Sorghum bicolor]
Length = 432
Score = 77.4 bits (189), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 77/288 (26%), Positives = 126/288 (43%), Gaps = 32/288 (11%)
Query: 50 TKDPSKADLFFLPF--SIARMRHDRRIGTEGIPD-----FISHYIFNISQKYPYWNRTGG 102
+DP AD FF+PF S++ H R + PD + I +I K YW R+ G
Sbjct: 125 VRDPDAADAFFVPFFSSLSFNVHGRNMTD---PDTEADRLLQVEIVDILWKSKYWQRSAG 181
Query: 103 ADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIA--HKDVSLPQIWP----R 156
DH H ++ +NA ++ S + +A KDV P +
Sbjct: 182 RDHVIPMHHPNAFRFLR----AMVNASILIVSDFGRYTKELASLRKDVVAPYVHVVDSFL 237
Query: 157 QEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDS----EIYAHSGRLKTPYA 208
+DPP + R+ L FF G +R KL +V + E +G
Sbjct: 238 DDDPPDPFEA-RHTLLFFRGRTVRKDEGKIRAKLGKVLKGKEGVRFEDSIATGDGIKIST 296
Query: 209 DELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI---VVA 265
+ + SKFCLH G ++ R+ D++ CVPVI+++ +LPF D +++ FS+ V
Sbjct: 297 EGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSSRIELPFEDEIDYSEFSLFFSVEE 356
Query: 266 TLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
L L L+ I ++++ + S + V H+++ P DA M+
Sbjct: 357 ALRPDYLLNQLRQIPKKKWVDMWSKLKNVSHHYEFQYPPRKGDAVNMI 404
>gi|302829555|ref|XP_002946344.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300268090|gb|EFJ52271.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 785
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 85/352 (24%), Positives = 137/352 (38%), Gaps = 90/352 (25%)
Query: 32 NYASESYFKKAFMKSHFVTKDPSKADLFFLPFSI-----------------ARMRHDRRI 74
Y+ E Y + ++S T DP +AD F++P I A + H R
Sbjct: 382 TYSVEVYLHEMMLQSEHRTFDPEEADFFYVPMYITCFMWPVMGWADFPWWHAPLAHTR-- 439
Query: 75 GTEGIPDFISHYIFN----ISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQ 130
P +S+ I +S +P+WNR GG DH ++ G M V N+
Sbjct: 440 -----PMHVSNMILEAYEWLSTTFPWWNRRGGRDHIWLMAPDEGACYMPT---VVYNSSI 491
Query: 131 VVCSSSYFISGH---------IAHKDVSLPQI-------WPRQEDP-------------- 160
++ H I KD+ + Q W + P
Sbjct: 492 ILTHWGRMDPDHKSGSAFDQDIYDKDLPVAQFKGWRGLDWMEKSRPHLCYNPEKDLVIPA 551
Query: 161 ----------PKLGSS--KRNKLAFFAGAVNSP--------VREKLLQV-----WRNDSE 195
P LG+ +R+ L +F G V +R+KL Q W +
Sbjct: 552 FKSPDHFQESPLLGAPPLERDILLYFRGDVGEGRRDHYSRGIRQKLFQFAHWGKWAEKYK 611
Query: 196 IYAHSGR-LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
IY +G + Y++ L SKFCL G + +AR D++ +GCVP+++ + F +
Sbjct: 612 IYIGTGETIGGSYSEHLARSKFCLVAPG-DGWSARAEDAILHGCVPLVVMDGVHAVFESI 670
Query: 255 LNWKSFSIVVATLDIPL--LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFP 304
L+W SFSI + + L + ++L IS E +Q + +V F + P
Sbjct: 671 LDWDSFSIRIREDNQALQAIPELLTAISPERLAKMQRNLARVWHRFAYATGP 722
>gi|159470401|ref|XP_001693348.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158277606|gb|EDP03374.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 356
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 86/315 (27%), Positives = 134/315 (42%), Gaps = 65/315 (20%)
Query: 33 YASESYFKKAFMKSHFV-TKDPSKADLFFLP----FSIARMRHDRRIGTEGIPDF-ISHY 86
Y + YF K F+ + V T++P +A LF++P F +R P++ +
Sbjct: 25 YTAYEYFLKYFITDNIVRTENPYEAHLFYVPALNFFYSGNLR---------PPEYHLEAV 75
Query: 87 IFNISQKYPYWNRTGGADHFYV------ACH----------SIGRSAMQKAWEVKLNAIQ 130
+ ++ +P++NR+GG DHF ACH + MQK LN
Sbjct: 76 MDHVKTAWPFYNRSGGRDHFIFLTGDRGACHMPRDMQDSMIKVVHFGMQKQ---GLNWTS 132
Query: 131 VVCSSSYFISGHI-AHKDVSLPQ-------IWPR------QEDPPKLG-SSKRNKLAFFA 175
+ + Y G I +D+ +P +WP Q G + RN FA
Sbjct: 133 MEHNKEY---GCIRMRQDLVVPPHPNDHKPLWPVGAAAYFQRIAAAGGHDAGRNITFLFA 189
Query: 176 GAVNS---------PVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVN 226
G V VR LL + D I GR + Y D L S+FCL G
Sbjct: 190 GGVGEGEYSGGTRQAVRALLLNI--TDPAIMFVEGR-RDDYVDLLWRSQFCLAAYGHGWG 246
Query: 227 TARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLL 286
R+ S+ +GC+PVII +H F D L ++ FS+ + D+P L ++L+ S E+
Sbjct: 247 I-RVMQSIQFGCIPVIIQDHVYQAFEDFLPYEEFSVRLPLRDVPRLLELLRSYSPEQLAA 305
Query: 287 LQSYVLKVRKHFQWH 301
L+ + K + F W+
Sbjct: 306 LRLGMAKYFRAFIWN 320
>gi|255580236|ref|XP_002530948.1| catalytic, putative [Ricinus communis]
gi|223529463|gb|EEF31420.1| catalytic, putative [Ricinus communis]
Length = 512
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 73/307 (23%), Positives = 148/307 (48%), Gaps = 34/307 (11%)
Query: 50 TKDPSKADLFFLPF--SIARMR------HDRRIGTEGIPDFISHYIFNISQKYPYWNRTG 101
++ S+AD+ F+PF S++ R H +R + + + + Y+ + + W R+
Sbjct: 181 VRNSSEADVIFVPFFSSLSYNRYSKVNPHQKRSKNKLLQEKLVRYVTSQME----WKRSQ 236
Query: 102 GADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLP---QIWPR 156
G DH +A H ++M A +KL + + +IA+ KD+ P I
Sbjct: 237 GQDHIILAHHP---NSMLDA-RMKLWPALFILADFGRYPPNIANVDKDLIAPYKHVIRSY 292
Query: 157 QEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLK----TPYA 208
+D S R L +F GA+ R++L + +++ +++ G ++ +
Sbjct: 293 ADDSSTFDS--RPTLLYFQGAIYRKDGGFARQELFYLLKDEKDVHFQFGSVQKDGINKAS 350
Query: 209 DELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLD 268
+ SKFCL++ G ++ R+ D++ CVPVII++ +LP+ DVL++ F I V T D
Sbjct: 351 QGMHTSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDDIELPYEDVLDYSQFCIFVRTSD 410
Query: 269 I---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
L +++GI +E+ + + +V + F++ + DA M+ + + ++R
Sbjct: 411 AIKEKFLINLIRGIGKDEWTQMWQKLKEVERFFEFQYPSKEGDAVQMIWQAVARKVPAIR 470
Query: 326 VQWSTSL 332
++ + S+
Sbjct: 471 MKINKSM 477
>gi|307104488|gb|EFN52741.1| hypothetical protein CHLNCDRAFT_138314 [Chlorella variabilis]
Length = 647
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 84/348 (24%), Positives = 137/348 (39%), Gaps = 88/348 (25%)
Query: 32 NYASESYFKKAFMKSHFVTKDPSKADLFFLPF----------SIARMRHDRRIG-----T 76
Y+ E + ++S T DP +AD F++P S A D G
Sbjct: 233 TYSLEFGLLEMMLQSEHRTLDPEEADFFYVPVFPSCFIWPVRSTADSLRDFYYGWAQSRV 292
Query: 77 EGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCS-- 134
+G + + + YPYW+R GG DH ++ H + A + + ++ S
Sbjct: 293 QGAANLLLEAYHWLRAHYPYWDRRGGRDHIWLVTHDEASCYVPAA----IKSASIILSHW 348
Query: 135 --------SSYFISGHIAHKDVSLPQIWPRQE-----------DP--------------- 160
S G++ H +VS P P DP
Sbjct: 349 GRKDPNHTSGTGFPGNVYHLNVSHPHWEPEGSMAKLDLSQPCHDPVKDLVLPLMKTPDHY 408
Query: 161 ---PKLGSSKRNK--LAFFAG---AVNSP-----VREKL--------------LQVWRND 193
P +G+ RN+ LAF G ++P VR++L + + N
Sbjct: 409 HQSPLVGAPTRNRTWLAFHRGRQHKTDAPEYSRGVRQRLWSASQEHGWLDKYGILLGENP 468
Query: 194 SEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFAD 253
S A +L Y+ L S FCL + G + +AR+ D+ +GC+PVI+ + D+ F
Sbjct: 469 SSPGAEEVKLAGDYSQLLASSIFCLVLPG-DGWSARMDDATLHGCIPVIVMDEVDVSFES 527
Query: 254 VLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
V++ + F++ VA D+ L +IL IS E +Q + +V WH
Sbjct: 528 VIDLQQFTVRVAQADVERLPEILLEISQERRQEMQRALGRV-----WH 570
>gi|218192182|gb|EEC74609.1| hypothetical protein OsI_10218 [Oryza sativa Indica Group]
Length = 737
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 86/332 (25%), Positives = 143/332 (43%), Gaps = 70/332 (21%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP----FSIAR--------MRHDRRIGTEGIP 80
Y ++ ++ + S T + +AD F++P I R M D R+ +
Sbjct: 337 YGAQIALYESILASPHRTLNGDEADYFYVPALDSCLITRSDDAPHLQMPRDLRLRSYHTL 396
Query: 81 DFISHYIFNISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAWEVKLNAIQV 131
++ +I+Q+YPYWNRT G DH + AC++ I S M W N
Sbjct: 397 EYYRMTYDHIAQRYPYWNRTSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWG-NTNTKHK 455
Query: 132 VCSSSYF--------ISGHIAH------KDVSLPQIWPRQEDPP----KLGSSKRNK--- 170
+++Y+ I H KD+ LP W +Q +P KL + RN
Sbjct: 456 NSTTAYWADNWNYIPIDRRGNHPCFDPRKDLVLP-AW-KQPNPAAIWLKLWARTRNNRTT 513
Query: 171 LAFFAGAVNSP-------------VREKLLQVWRNDSEIYAHSGRLKTP----------- 206
L +F G + +R+KL + + + GR T
Sbjct: 514 LFYFNGNLGPAYKDGRHEDTYSMGIRQKLAAEFGSTPDKQGKLGRQHTANVTVTYLRTEK 573
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y +EL S FC + G + + R+ DS+ GC+PVII + LP+ ++LN+ SF++ +
Sbjct: 574 YYEELASSIFCGVLPG-DGWSGRMEDSMLQGCIPVIIQDGILLPYENMLNYNSFAVRIQE 632
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
DIP L +IL+GI+ + + V ++ + F
Sbjct: 633 DDIPNLIRILRGINETQVEFMLRNVRQIWQRF 664
>gi|125585107|gb|EAZ25771.1| hypothetical protein OsJ_09611 [Oryza sativa Japonica Group]
Length = 779
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 86/332 (25%), Positives = 143/332 (43%), Gaps = 70/332 (21%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP----FSIAR--------MRHDRRIGTEGIP 80
Y ++ ++ + S T + +AD F++P I R M D R+ +
Sbjct: 379 YGAQIALYESILASPHRTLNGDEADYFYVPALDSCLITRSDDAPHLQMPRDLRLRSYHTL 438
Query: 81 DFISHYIFNISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAWEVKLNAIQV 131
++ +I+Q+YPYWNRT G DH + AC++ I S M W N
Sbjct: 439 EYYRMTYDHIAQRYPYWNRTSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWG-NTNTKHK 497
Query: 132 VCSSSYF--------ISGHIAH------KDVSLPQIWPRQEDPP----KLGSSKRNK--- 170
+++Y+ I H KD+ LP W +Q +P KL + RN
Sbjct: 498 NSTTAYWADNWNYIPIDRRGNHPCFDPRKDLVLPA-W-KQPNPAAIWLKLWARTRNNRTT 555
Query: 171 LAFFAGAVNSP-------------VREKLLQVWRNDSEIYAHSGRLKTP----------- 206
L +F G + +R+KL + + + GR T
Sbjct: 556 LFYFNGNLGPAYKDGRHEDTYSMGIRQKLAAEFGSTPDKQGKLGRQHTANVTVTYLRTEK 615
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y +EL S FC + G + + R+ DS+ GC+PVII + LP+ ++LN+ SF++ +
Sbjct: 616 YYEELASSIFCGVLPG-DGWSGRMEDSMLQGCIPVIIQDGILLPYENMLNYNSFAVRIQE 674
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
DIP L +IL+GI+ + + V ++ + F
Sbjct: 675 DDIPNLIRILRGINETQVEFMLRNVRQIWQRF 706
>gi|302845214|ref|XP_002954146.1| hypothetical protein VOLCADRAFT_118607 [Volvox carteri f.
nagariensis]
gi|300260645|gb|EFJ44863.1| hypothetical protein VOLCADRAFT_118607 [Volvox carteri f.
nagariensis]
Length = 401
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 83/324 (25%), Positives = 122/324 (37%), Gaps = 57/324 (17%)
Query: 41 KAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRT 100
+ + S T D AD FF+P + R G + ++ + I Q +P+W R
Sbjct: 2 QRLLSSGVRTADGDAADFFFIPLVM------RTKGQSA--NHLTAVVSYIQQHWPWWGRY 53
Query: 101 GGAD-HFYVACHSIGRSAMQKAWEVKLNAIQVVC-----------SSSYFISGHIAHKDV 148
GG H V +GR + E L ++ V S ++ H KD+
Sbjct: 54 GGGHRHLLVVPADLGRRMLP---EELLKLVENVTFLTHWGSHTNHSEGAWVESHRPGKDI 110
Query: 149 SLPQIWPRQEDPP-------KLGSSKRNKLA---FFAGAVNSPVREKLLQVWRNDSEIYA 198
+P + D P L S +R + FF+G + S E R +S+
Sbjct: 111 VVPPL--HNADEPIVFSPLHTLHSKRRRQRTSGLFFSGRICSDGSEPHRGRCRTNSQGNV 168
Query: 199 HSGRLK--------------TPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIA 244
LK YA L FCL G R + GCVPV+I
Sbjct: 169 RHKVLKHHWNRTTWTLTTRAKAYASALSSHTFCLSPGGGGYGR-RSVQAAVMGCVPVLIG 227
Query: 245 NHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW---- 300
+ PF L+W FS+ V DIP L IL+ ++S +Q + +H +
Sbjct: 228 DGLHQPFEPELDWSQFSMSVPEQDIPHLHTILESMNSSTIAAMQEQLRCAAQHLYYSTTF 287
Query: 301 -HVFPSD--YDAFYMVMYELWLRR 321
V D YDAF +M L +RR
Sbjct: 288 GEVMGEDGRYDAFETLMEVLRMRR 311
>gi|302825130|ref|XP_002994198.1| glycosyltransferase-like protein [Selaginella moellendorffii]
gi|300137940|gb|EFJ04733.1| glycosyltransferase-like protein [Selaginella moellendorffii]
Length = 762
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 82/335 (24%), Positives = 143/335 (42%), Gaps = 71/335 (21%)
Query: 34 ASESYFKKAFMKSHFVTKDPSKADLFFLPF--SIARMRHD--------RRIGTEGI--PD 81
+E F + + S T + +AD FF P + A R D + +G G +
Sbjct: 364 GAELAFLEGLLASPHRTMNGDEADYFFAPVLGACAITRADDAPHFSMEKHMGLRGYFSGE 423
Query: 82 FISHYIFNISQKYPYWNRTGGADHFYVACHSIGR-SAMQKAWEVKL-------NAIQVVC 133
+ +I ++YP+WNR+ G DH ++ G SA ++ W + N+
Sbjct: 424 LYKNAYMHIKEQYPFWNRSSGRDHIWLFPWDEGACSAPKEIWNGTMLVHWGNTNSKHKKS 483
Query: 134 SSSYFISG-----------HIAH---KDVSLPQIWPRQEDPPKLGS-------SKRNKLA 172
++ YF H + KD+ LP W + DP + +R L
Sbjct: 484 TTGYFADSWDDIPKEWRGDHPCYDPLKDIVLP-AW-KNPDPRSVAERFWSRPREERKTLF 541
Query: 173 FFAGAVNSP-------------VREKLLQVWRNDSEIYAHSGRLKTP-----------YA 208
+F G + +R+++ + + + + GR P YA
Sbjct: 542 YFNGNLGKGYDFGRPEDRYSMGIRQRVAEEFGSTPNNHGKLGRQAAPDVVVTPQRSDDYA 601
Query: 209 DELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLD 268
EL S+FC G + + R+ D++ +GC+PVII + LP+ +L+++SF++ VA
Sbjct: 602 KELSSSRFCGVFPG-DGWSGRMEDAVLHGCIPVIIQDGIHLPYESLLDYESFTVRVAEDK 660
Query: 269 IPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVF 303
IP L IL+ IS+ E ++S + VR +Q V+
Sbjct: 661 IPELITILRNISNAE---VESKLEAVRGLWQRFVY 692
>gi|449502905|ref|XP_004161776.1| PREDICTED: probable glucuronosyltransferase Os01g0926700-like
[Cucumis sativus]
Length = 482
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 72/289 (24%), Positives = 125/289 (43%), Gaps = 42/289 (14%)
Query: 52 DPSKADLFFLPFSIARMRHDRRIGTE--------GIPDFISHY-IFNISQKYPYWNRTGG 102
+ +AD+ F+PF A M + ++G G D+ + + + W ++GG
Sbjct: 144 EAEEADVIFVPF-FATMSAEMQLGMAKGAFRKKVGNEDYERQRNVMDFLKSTDAWKKSGG 202
Query: 103 ADHFYVACHSIGRSAMQKAWEVKLNAIQVVC--------------SSSYFISGHIAHKDV 148
DH + HS+ W VK V SS+ I H V
Sbjct: 203 RDHVLFSLHSLTDPVAM--WHVKAEIAPAVLLVVDFGGWFRLDTKSSNGSSPDMIQHTQV 260
Query: 149 SLPQ--IWPRQEDPPKL---GSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAH 199
S+ + I P P+L + KR L +F GA VREKL + N+ ++
Sbjct: 261 SVLKDVIVPYTHLLPRLHLSANKKRQTLLYFKGAKHRHRGGLVREKLWDLLVNEPDVIME 320
Query: 200 SGRLKTPYADE----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVL 255
G ++ + S+FCLH G + R+ D++ C+PV+++++ +LPF D++
Sbjct: 321 EGFPNATGKEQSIKGMRSSEFCLHPAGDTPTSCRLFDAIQSLCIPVVVSDNIELPFEDMV 380
Query: 256 NWKSFSIVVATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
++ FS+ VA D L K L+ I E+ + Y+ +V+ F++
Sbjct: 381 DYSEFSVFVAVNDALKPNWLVKHLRTIPEEQRNRFRLYMARVQSVFEYE 429
>gi|224136688|ref|XP_002322391.1| predicted protein [Populus trichocarpa]
gi|222869387|gb|EEF06518.1| predicted protein [Populus trichocarpa]
Length = 457
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 66/271 (24%), Positives = 124/271 (45%), Gaps = 27/271 (9%)
Query: 52 DPSKADLFFLPF--SIARMRHDRRIGTEGIP--------DFISHYIFNISQKYPYWNRTG 101
DP +ADLF++P S++ + + R GT +P + + + ++ YW R
Sbjct: 137 DPEEADLFYVPVFSSLSLIVNPARAGT--VPGSDPVYSDEKMQEELVEWLEEQEYWRRNN 194
Query: 102 GADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH-KDVSLPQIWPRQEDP 160
G DH A +A+ + + N + ++ S + KDV +P
Sbjct: 195 GRDHVVFAGDP---NALYRVLDRVKNVVLLLSDFGRVRSDQGSLIKDVIVPYSHRINVYN 251
Query: 161 PKLGSSKRNKLAFFAG----AVNSPVREKLLQVWRNDSEIYAHSGRL----KTPYADELL 212
+G +R L FF G +R+ L Q+ + ++ G + +
Sbjct: 252 GDIGVEERKTLLFFMGNRYRKDGGKIRDLLFQMLEKEEDVVIRHGTQSRENRRTATRGMH 311
Query: 213 GSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT---LDI 269
SKFCL+ G + R+ DS+ CVP+I+++ +LPF DV++++ +I V T L
Sbjct: 312 TSKFCLNPAGDTPSACRLFDSIVSLCVPLIVSDSIELPFEDVIDYRKIAIFVDTESSLKP 371
Query: 270 PLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
L ++L+ +S+E+ L Q + +V+++F +
Sbjct: 372 GYLVRMLRAVSTEKILEYQKQMREVKRYFVY 402
>gi|357125540|ref|XP_003564451.1| PREDICTED: probable glucuronosyltransferase GUT1-like [Brachypodium
distachyon]
Length = 495
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 61/224 (27%), Positives = 106/224 (47%), Gaps = 24/224 (10%)
Query: 95 PYWNRTGGADHFYVACH-----SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHI-AHKDV 148
P W R+ G DH H S+ R + W + + + +++ G + KDV
Sbjct: 219 PAWQRSEGRDHIIPVHHPWSFKSVRRFVKKAIWLLP----DMDSTGNWYKPGQVYLEKDV 274
Query: 149 SLPQIWPRQEDPPKLGS---SKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSG 201
LP + K S SKR+ L FF G + VR KL+ ++ ++ G
Sbjct: 275 ILPYVPNVDLCDYKCASETQSKRSMLLFFRGRLKRNAGGKVRSKLVTELKDAEDVVIEEG 334
Query: 202 RL----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNW 257
K + + S FCL+ G ++AR+ D++ GC+PVII++ +LPF +L++
Sbjct: 335 TAGAEGKVAAQNGMRKSLFCLNPAGDTPSSARLFDAIVSGCIPVIISDELELPFEGILDY 394
Query: 258 KSFSIVVATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHF 298
+ ++ V++ D L K L+GI ++ +QS ++K +HF
Sbjct: 395 RKIALFVSSSDALQPGWLVKYLRGIDAKRVREMQSNLVKYSRHF 438
>gi|357463031|ref|XP_003601797.1| hypothetical protein MTR_3g085480 [Medicago truncatula]
gi|355490845|gb|AES72048.1| hypothetical protein MTR_3g085480 [Medicago truncatula]
Length = 425
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 67/293 (22%), Positives = 125/293 (42%), Gaps = 22/293 (7%)
Query: 52 DPSKADLFFLPF----SIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFY 107
DP D++F+PF S H R I + + + + YW R+GG DH +
Sbjct: 119 DPEVVDVYFVPFFSSLSFNTHGHHMRDPETEIDHQLQIDLMGLLGQSKYWQRSGGRDHIF 178
Query: 108 VACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISG-HIAHKDVSLPQIW---PRQEDPPKL 163
H +A + + +IQVV + G +KDV P + +D P
Sbjct: 179 PMTHP---NAFRFLRDQLNESIQVVVDFGRYPKGVSNLNKDVVSPYVHFVDSYVDDEPHD 235
Query: 164 GSSKRNKLAFFAGAVNSP----VREKLLQVWRNDSEIY----AHSGRLKTPYADELLGSK 215
R L FF G + VR K ++ +++ + +G + + SK
Sbjct: 236 PFESRTTLLFFRGGTHRKDKGIVRAKFTKILAGFDDVHYERSSATGENIKLSSKGMRSSK 295
Query: 216 FCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVA---TLDIPLL 272
FCLH G ++ R+ D++ CVPVI+++ +LPF + +++ FS+ + L+ +
Sbjct: 296 FCLHPAGDTPSSCRLFDAIVSHCVPVIVSDKIELPFENEIDYSQFSLFFSFKEALEPGYM 355
Query: 273 KKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
L+ + + + + + H+++H P DA M+ ++ + +R
Sbjct: 356 INQLRSFPKQNWTEMWRQLKNISHHYEFHYPPEREDAVNMLWRQIKHKLPGIR 408
>gi|241997492|ref|XP_002433395.1| exostosin-2, putative [Ixodes scapularis]
gi|215490818|gb|EEC00459.1| exostosin-2, putative [Ixodes scapularis]
Length = 714
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 70/272 (25%), Positives = 118/272 (43%), Gaps = 35/272 (12%)
Query: 41 KAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRT 100
+A S + TKD ++A +F R+ R + + Q P+WN
Sbjct: 129 EAIQHSPYYTKDAARACVFVPSIDTLNQRNLNR-----------KALSRVLQSLPHWN-- 175
Query: 101 GGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHK-DVSLPQIWP---- 155
G++H + + S + A +++N + + + F S D+ +P P
Sbjct: 176 NGSNHLLF--NMLPGSLPEYATSLEVNTGRALVAGGGFDSWTFRRSHDLGIPVFNPARPS 233
Query: 156 ------RQEDPPKLGSSKRNKLAF-FAGAVNS--PVREKLLQV------WRNDSEIYAHS 200
R+ D P L S + ++ F F V S V +L + W ++
Sbjct: 234 LDEVSQRRVDRPWLVVSSQTRIHFEFRSEVASLAAVHPGVLNLSTCGRPWDGENNGVRCR 293
Query: 201 GRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSF 260
G Y D L KFCL V+ + + ++D+L GCVPVI+A+ Y LPF++VL+WK
Sbjct: 294 GDTLYKYPDILAEGKFCLVVRAARLGQSVLSDALMAGCVPVIVADEYILPFSEVLDWKRA 353
Query: 261 SIVVATLDIPLLKKILKGISSEEYLLLQSYVL 292
+I + D+ L +LKG+S ++S L
Sbjct: 354 AIQIREDDLEDLVTVLKGVSKARLFEMRSQAL 385
>gi|356525152|ref|XP_003531191.1| PREDICTED: probable glycosyltransferase At3g07620-like isoform 1
[Glycine max]
Length = 427
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 71/283 (25%), Positives = 122/283 (43%), Gaps = 26/283 (9%)
Query: 52 DPSKADLFFLPF----SIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFY 107
DP A FF+PF S H + I + + + +K YW R+GG DH +
Sbjct: 120 DPELAQAFFVPFFSSLSFNTHGHTMKDPATQIDRQLQVDLMELLKKSKYWQRSGGRDHVF 179
Query: 108 VACHSIGRSAMQKAWEVKLN-AIQVVCSSSYFISGHI-AHKDVSLPQIW---PRQEDPPK 162
H ++ +LN +IQVV + G +KDV P + +D P+
Sbjct: 180 PMTHPNAFRFLRG----QLNESIQVVVDFGRYPRGMSNLNKDVVSPYVHVVDSFTDDEPQ 235
Query: 163 LGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIY-----AHSGRLKTPYADELLG 213
R+ L FF G VR KL ++ +++ A +K + +
Sbjct: 236 DPYESRSTLLFFRGRTYRKDEGIVRVKLAKILAGYDDVHYERSVATEENIKAS-SKGMRS 294
Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVA---TLDIP 270
SKFCLH G ++ R+ D++ CVPVI+++ +LPF D +++ FS+ + L
Sbjct: 295 SKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPFEDDIDYSQFSVFFSFKEALQPG 354
Query: 271 LLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
+ L+ E++ + + + H+++ P DA M+
Sbjct: 355 YMIDQLRKFPKEKWTEMWRQLKSISHHYEFEYPPKREDAVDML 397
>gi|116787500|gb|ABK24530.1| unknown [Picea sitchensis]
Length = 446
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 69/280 (24%), Positives = 117/280 (41%), Gaps = 20/280 (7%)
Query: 52 DPSKADLFFLPF----SIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFY 107
DP +AD+F++PF S + R + + ++ ++ W R+GG DH
Sbjct: 139 DPDQADVFYVPFFASLSFNTHGQNMRDPETEFDKKLQIEVVDMLKRSKSWQRSGGRDHVI 198
Query: 108 VACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQ---EDPPKLG 164
V H ++ EV + V Y S KDV P + D
Sbjct: 199 VIHHPNAFRFLRD--EVNASIFVVADFGRYPRSVSFLRKDVVAPYVHVVDTYVNDDSSDP 256
Query: 165 SSKRNKLAFFAGAVNSP----VREKLLQVWRNDSEIYAHSGRLKTP----YADELLGSKF 216
R L +F G VR KL ++ N ++ T + S+F
Sbjct: 257 FESRTMLLYFRGRTKRKDEGFVRLKLAKILGNHKRVHFEDSLATTEGFEVAKQGMRSSRF 316
Query: 217 CLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI---VVATLDIPLLK 273
CLH G ++ R+ D++ CVPVI+++ +LPF D ++++ FS+ V L L
Sbjct: 317 CLHPAGDTPSSCRLFDAIVSHCVPVIVSDRIELPFEDEIDYQEFSLFFSVKEALRPGYLM 376
Query: 274 KILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
+ L+ E++L + + + +V HF++ P DA M+
Sbjct: 377 QKLETFPKEKWLKMWNKLKQVAHHFEYQYPPIKDDAVNML 416
>gi|326525254|dbj|BAK07897.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 440
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 76/299 (25%), Positives = 127/299 (42%), Gaps = 54/299 (18%)
Query: 50 TKDPSKADLFFLPF--SIARMRHDRRIGTEGIPD-----FISHYIFNISQKYPYWNRTGG 102
+DP+ A+ FF+PF S++ H R + PD + + +I K YW R+ G
Sbjct: 134 VRDPAAAEAFFVPFFSSLSFNVHGRNMTD---PDTEADRLLQVELMDILGKSEYWQRSAG 190
Query: 103 ADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIA--HKDVSLPQIW------ 154
DH H M+ +NA ++ S + +A KDV P +
Sbjct: 191 RDHVIPMHHPNAFRFMRDM----VNASVLIVSDFGRYTKELASLRKDVVAPYVHVVDSFL 246
Query: 155 ------PRQEDPPKLGSSKRNKLAFFAG----AVNSPVREKLLQVWRN-------DSEIY 197
P + DP L FF G +R KL ++ ++ DS
Sbjct: 247 DDNASDPFEADP---------TLLFFRGRPVRKAEGKIRGKLAKILKDRDGVRFEDSLAI 297
Query: 198 AHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNW 257
++ T D + SKFCLH G ++ R+ D++ C+PVII++ +LPF D +++
Sbjct: 298 GDGIKIST---DGMRSSKFCLHPAGDTPSSCRLFDAIVSHCIPVIISSRIELPFEDEIDY 354
Query: 258 KSFS---IVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
FS V L+ L L+ + E+++ + S + V H+++ P DA M+
Sbjct: 355 SEFSPFFSVEEALEPDYLLNQLRQMPKEKWVEMWSKLKNVSSHYEFQYPPRKDDAVNMI 413
>gi|194707922|gb|ACF88045.1| unknown [Zea mays]
Length = 240
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 55/187 (29%), Positives = 95/187 (50%), Gaps = 20/187 (10%)
Query: 166 SKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRL----KTPYADELLGSKFC 217
SKR+ L FF G + +R KL++ ++ +I G K D + S FC
Sbjct: 40 SKRSILLFFRGRLKRNAGGKIRSKLVEELKSAKDIVIEEGSTGAQGKAAAQDGMRKSFFC 99
Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPL---LKK 274
L G ++AR+ D++ GC+PVII++ +LPF +L+++ ++ V+ D L K
Sbjct: 100 LSPAGDTPSSARLFDAIVSGCIPVIISDELELPFEGILDYREIALFVSASDAVQPGWLLK 159
Query: 275 ILKGISSEEYLLLQSYVLKVRKHFQWH-----VFPSDYD----AFYMVMYELWLRRSSVR 325
L+GI+++ +QS ++K +HF + + P D A +V +L +RRS
Sbjct: 160 YLRGINAKRIREIQSNLVKYSRHFLYSSPAQPLGPEDLTWRMIAGKLVNIKLQIRRSQRL 219
Query: 326 VQWSTSL 332
V+ S S+
Sbjct: 220 VKESRSI 226
>gi|145355370|ref|XP_001421936.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582175|gb|ABP00230.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 490
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 73/277 (26%), Positives = 125/277 (45%), Gaps = 31/277 (11%)
Query: 36 ESYFKKAFMKSHFVTKDPSKADL---FFLPFS------IARMRHDRRIGTEGIPDFISHY 86
E + + F F + S D+ + LP + + ++RH G + + H
Sbjct: 150 EDFLNRNFQNGPFSSCASSVDDVNDAYLLPLNPYFDRVMDKLRHFSEKGRSAMARGVKHA 209
Query: 87 IFNISQKYP-YWNRTG-GADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIA 144
I S + P W R+G F V H ++ + +V L+ VV SS IS
Sbjct: 210 IDYASAEDPNAWVRSGESCSRFSVITHGSSTMSLLRHKDVILSTTFVVASSET-ISPSRP 268
Query: 145 H---------KDVS----LPQIWP-RQEDPPKLGSSKRNKLAFFAGAVNS-PVREK---- 185
H KDVS L + P +++R L F GA+NS P R +
Sbjct: 269 HTSQAPFHPDKDVSAINSLSFVIPLHAARRRAFANAERETLLMFRGAINSFPNRREIADF 328
Query: 186 LLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIAN 245
LL+ R S Y ++ S+FCL+++G V++ R+ +S+ +GCVPVI+A+
Sbjct: 329 LLKNTRGQMYDLGPSCSTSKEYTAKMKNSRFCLYMRGTRVHSPRLIESMLFGCVPVILAD 388
Query: 246 HYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSE 282
Y+LP + +++W +FS+++ D + L+ +S+
Sbjct: 389 DYELPLSWLVDWSAFSVMIPERDFQTIPDALERANSD 425
>gi|326522384|dbj|BAK07654.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 496
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 62/224 (27%), Positives = 104/224 (46%), Gaps = 24/224 (10%)
Query: 95 PYWNRTGGADHFYVACH-----SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHI-AHKDV 148
P W R+ G DH H S+ R + W + + + +++ G + KDV
Sbjct: 220 PAWQRSEGRDHVIPVHHPWSFKSVRRFVKKAIWLLP----DMDSTGNWYKPGQVYLEKDV 275
Query: 149 SLPQIWPRQEDPPKLGS---SKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSG 201
LP + K S SKR+ L FF G + +R KL+ +N +I G
Sbjct: 276 ILPYVPNVDLCDYKCVSETQSKRSTLLFFRGRLKRNAGGKIRSKLVTELQNIEDIIIEEG 335
Query: 202 ----RLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNW 257
+ K + S FCL+ G ++AR+ D++ GC+PVII++ +LPF +L++
Sbjct: 336 SAGAKGKVAALTGMRKSLFCLNPAGDTPSSARLFDAIVSGCIPVIISDELELPFEGILDY 395
Query: 258 KSFSIVVATLDIPL---LKKILKGISSEEYLLLQSYVLKVRKHF 298
++ V++ D L K L+G+ + +QS +LK +HF
Sbjct: 396 SKIALFVSSTDAVQPGWLVKYLRGVDGKRVREMQSNLLKYSRHF 439
>gi|356512371|ref|XP_003524893.1| PREDICTED: probable glycosyltransferase At3g07620-like [Glycine
max]
Length = 427
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 69/282 (24%), Positives = 121/282 (42%), Gaps = 24/282 (8%)
Query: 52 DPSKADLFFLPF----SIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFY 107
DP A FF+PF S H + I + + + +K YW R+GG DH +
Sbjct: 120 DPELAQAFFVPFFSSLSFNTHGHTMKDPATQIDRQLQVDLMELLKKSNYWQRSGGRDHVF 179
Query: 108 VACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHI-AHKDVSLPQIW---PRQEDPPKL 163
H +A + + +IQVV + G +KDV P + +D P+
Sbjct: 180 PMTHP---NAFRFLRDQLNESIQVVVDFGRYPRGMSNLNKDVVSPYVHVVDSFTDDEPQD 236
Query: 164 GSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIY-----AHSGRLKTPYADELLGS 214
R+ L FF G VR KL ++ +++ A +K + + S
Sbjct: 237 PYESRSTLLFFRGRTYRKDEGIVRVKLAKILAGYDDVHYERSVATEENIKAS-SKGMRSS 295
Query: 215 KFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVA---TLDIPL 271
KFCLH G ++ R+ D++ C+PVI+++ +LPF D +++ FS+ + L
Sbjct: 296 KFCLHPAGDTPSSCRLFDAIVSHCIPVIVSDQIELPFEDEIDYSQFSVFFSFKEALQPGY 355
Query: 272 LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
+ L+ E++ + + + H+++ P DA M+
Sbjct: 356 MIDQLRKFPKEKWTEMWRQLKSISHHYEFRYPPKREDAVDML 397
>gi|449444196|ref|XP_004139861.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
[Cucumis sativus]
Length = 478
Score = 74.3 bits (181), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 72/287 (25%), Positives = 122/287 (42%), Gaps = 46/287 (16%)
Query: 54 SKADLFFLPFSIARMRHDRRIGTE--------GIPDFISHY-IFNISQKYPYWNRTGGAD 104
+AD+ F+PF A M + ++G G D+ + + + W ++GG D
Sbjct: 146 EEADVIFVPF-FATMSAEMQLGMAKGAFRKKVGNEDYERQRNVMDFLKSTDAWKKSGGRD 204
Query: 105 HFYVACHSIGRSAMQKAWEVKLNAIQVVC--------------SSSYFISGHIAHKDVSL 150
H +V + W VK V SS+ I H VS+
Sbjct: 205 HVFVLTDPVA------MWHVKTEIAPAVLLVVDFGGWFRLDTKSSNGSSPDMIQHTQVSV 258
Query: 151 PQ--IWPRQEDPPKL---GSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSG 201
+ I P P+L + KR L +F GA VREKL + N+ ++ G
Sbjct: 259 LKDVIVPYTHLLPRLHLSANKKRQTLLYFKGAKRRHRGGLVREKLWDLLVNEPDVIMEEG 318
Query: 202 ----RLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNW 257
K + S+FCLH G + R+ D++ C+PV+++++ +LPF D++++
Sbjct: 319 FPNATGKEQSIKGMRSSEFCLHPAGDTPTSCRLFDAIQSLCIPVVVSDNIELPFEDMVDY 378
Query: 258 KSFSIVVATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
FS+ VA D L K L+ I E+ + Y+ +V+ F++
Sbjct: 379 SEFSVFVAVNDALKPNWLVKHLRTIPEEQRNGFRLYMARVQSVFEYE 425
>gi|384252594|gb|EIE26070.1| exostosin-domain-containing protein [Coccomyxa subellipsoidea
C-169]
Length = 898
Score = 74.3 bits (181), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 81/357 (22%), Positives = 136/357 (38%), Gaps = 87/357 (24%)
Query: 32 NYASESYFKKAFMK---SHFVTKDPSKADLFFLPFSIARMRHD----------------- 71
Y ES F + ++ S T DP +AD FF+P + H
Sbjct: 488 TYGIESAFHERLLQALSSEHRTLDPEEADFFFMPVYTSCFLHPVWGYVDHPWYYGPTIDC 547
Query: 72 RRIGTEGIPDFISHYIFN-----------ISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
RR G I ++ + + +P+W R GG DH ++ H G K
Sbjct: 548 RRDGDMQICQTGANRVMQAMFMLLEAQKWVEVNHPWWRRKGGRDHIWLITHDEGSCWAPK 607
Query: 121 AWEVKLNAI------------------------QVVCSSSYFISGHIAH----------K 146
E++L+ I Q V ++ G+ H K
Sbjct: 608 --EIRLSIILSHWGRKDVNHTSNSAFKPWDNYTQEVIHPEWWPEGYTHHIKGHACYDPIK 665
Query: 147 DVSLPQIWPRQEDP---PKLGSSK--RNKLAFFAGAVNS--------PVREKLLQVWRND 193
D+ +P + E P +G + R+ L F G V +R++L + +
Sbjct: 666 DLIIPNLKHPAEFANFSPLVGHPQPPRDILFLFRGDVGKHRLPHYSRGIRQRLFALAQEH 725
Query: 194 SEIYAHS------GRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHY 247
H+ + Y++ L SKFCL G + + R D++ +GCVPV++ +
Sbjct: 726 DWAGRHAILIGDRDDVAGDYSELLTRSKFCLVAPG-DGFSPRAEDAILHGCVPVVVMDEV 784
Query: 248 DLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFP 304
D F+ +L+W +FS+ +A DI L +IL + +Q + V + F+W P
Sbjct: 785 DPVFSSILDWSAFSLRIAEADIEQLPQILLAVPEARLQAMQRSLRNVWQRFKWSSLP 841
>gi|125576050|gb|EAZ17272.1| hypothetical protein OsJ_32791 [Oryza sativa Japonica Group]
Length = 506
Score = 74.3 bits (181), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 43/170 (25%), Positives = 92/170 (54%), Gaps = 7/170 (4%)
Query: 165 SSKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----LLGSKFCLHV 220
+SK + F + +R++L + +++ ++Y G ++ A + + SKFCL++
Sbjct: 322 TSKLTDVPTFELSKGGNIRQELHYMLKDEKDVYFAFGSVQDHGASKASQGMHASKFCLNI 381
Query: 221 KGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PLLKKILK 277
G ++ R+ D++ CVPVII++ +LP+ D L++ FSI V + D L ++++
Sbjct: 382 AGDTPSSNRLFDAIVSHCVPVIISDDIELPYEDALDYSKFSIFVRSSDAVKKGYLMRLIR 441
Query: 278 GISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
G+S ++ ++ + +V KHF++ DA M+ L + ++R++
Sbjct: 442 GVSKHQWTMMWRRLKEVDKHFEYQYPSQKDDAVQMIWQTLARKVPAIRLK 491
Score = 37.4 bits (85), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 34/68 (50%), Gaps = 12/68 (17%)
Query: 52 DPSKADLFFLPF--SIARMRHDRRIGTE------GIPDFISHYIFNISQKYPYWNRTGGA 103
D AD+ F+PF S++ RH R + E G+ + + Y+ P W R+GGA
Sbjct: 140 DSRDADVVFVPFFASLSYNRHSRVVPPEKVSRDKGLQERLVRYLM----AQPEWKRSGGA 195
Query: 104 DHFYVACH 111
DH VA H
Sbjct: 196 DHVIVAHH 203
>gi|55297487|dbj|BAD68203.1| exostosin family protein-like [Oryza sativa Japonica Group]
gi|55297674|dbj|BAD68245.1| exostosin family protein-like [Oryza sativa Japonica Group]
Length = 512
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 71/269 (26%), Positives = 123/269 (45%), Gaps = 33/269 (12%)
Query: 95 PYWNRTGGADHFYVACH-----SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHI-AHKDV 148
P W R+ G DH H S+ R + W + + + +++ G + KDV
Sbjct: 223 PAWQRSEGRDHVIPVHHPWSFKSVRRFVKKAIWLLP----DMDSTGNWYKPGQVYLEKDV 278
Query: 149 SLPQIWPRQEDPPKLGS---SKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSG 201
LP + K S S+R+ L FF G + +R KL+ ++ I G
Sbjct: 279 ILPYVPNVDLCDSKCVSETQSRRSTLLFFRGRLRRNAGGKIRSKLVTELKDAEGIIIEEG 338
Query: 202 RL----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNW 257
K + + S FCL+ G ++AR+ D++ GC+PVI+++ +LPF +L++
Sbjct: 339 TAGADGKAAAQNGMRKSLFCLNPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEGILDY 398
Query: 258 KSFSIVVATLDIPL---LKKILKGISSEEYLLLQSYVLKVRKHFQWH-----VFPSDYD- 308
+ ++ V++ D L K L+ I ++ +QS +LK +HF + + P D
Sbjct: 399 RKIALFVSSNDAVQPGWLVKYLRSIDAKRIRQMQSNLLKYSRHFLYSSPARPLGPEDLTW 458
Query: 309 ---AFYMVMYELWLRRSSVRVQWSTSLDS 334
A +V +L +RRS V+ S S+ S
Sbjct: 459 RMIAGKLVNIKLHIRRSQRVVRESRSICS 487
>gi|302835860|ref|XP_002949491.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300265318|gb|EFJ49510.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 554
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 70/270 (25%), Positives = 109/270 (40%), Gaps = 28/270 (10%)
Query: 39 FKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWN 98
F + + S D AD +++P R+ T F+ + I I + YP+WN
Sbjct: 147 FYQRLLGSGARVADGDLADWYYIPI---------RLRTATDSAFLKYAIEYIREAYPWWN 197
Query: 99 RTGGADHFYVACHSIGR-SAMQKAWEVKLN-------AIQVVCSSSYFISGHIAHKDVSL 150
RTGGA HF + +G M + + N + V ++S + H K +
Sbjct: 198 RTGGARHFVIHTGDLGADEVMDDVYGMAANMTWLTHWGLTVDKNTSGWWKAHRPDKARAG 257
Query: 151 PQIWPRQEDPPKLGSSKRNKLAFFAGAVNSPVREKLLQV-----W-RNDSEIYAHSGRLK 204
+ R ++ ++R + +P R + Q W R I +
Sbjct: 258 ARWGTRGGYYTRVSVNRRRGSHMWGPPSPAPHRAGVRQKVHFHHWNRTGFRIVT----FE 313
Query: 205 TPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV 264
Y L+ SKFCL G +I S Y GC+PV IA+ PF +W F++
Sbjct: 314 RNYGKALVSSKFCLAPLGGGHGQRQIIVS-YMGCIPVCIADGVYEPFEPQTDWTEFAVRP 372
Query: 265 ATLDIPLLKKILKGISSEEYLLLQSYVLKV 294
A DIP L +IL+GIS+ L L+
Sbjct: 373 AEADIPRLHEILEGISAGNKLAEMQVALRC 402
>gi|125528116|gb|EAY76230.1| hypothetical protein OsI_04166 [Oryza sativa Indica Group]
Length = 513
Score = 73.9 bits (180), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 71/269 (26%), Positives = 123/269 (45%), Gaps = 33/269 (12%)
Query: 95 PYWNRTGGADHFYVACH-----SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHI-AHKDV 148
P W R+ G DH H S+ R + W + + + +++ G + KDV
Sbjct: 224 PAWQRSEGRDHVIPVHHPWSFKSVRRFVKKAIWLLP----DMDSTGNWYKPGQVYLEKDV 279
Query: 149 SLPQIWPRQEDPPKLGS---SKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSG 201
LP + K S S+R+ L FF G + +R KL+ ++ I G
Sbjct: 280 ILPYVPNVDLCDSKCVSETQSRRSTLLFFRGRLRRNAGGKIRSKLVTELKDAEGIIIEEG 339
Query: 202 RL----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNW 257
K + + S FCL+ G ++AR+ D++ GC+PVI+++ +LPF +L++
Sbjct: 340 TAGADGKAAAQNGMRKSLFCLNPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEGILDY 399
Query: 258 KSFSIVVATLDIPL---LKKILKGISSEEYLLLQSYVLKVRKHFQWH-----VFPSDYD- 308
+ ++ V++ D L K L+ I ++ +QS +LK +HF + + P D
Sbjct: 400 RKIALFVSSNDAVQPGWLVKYLRSIDAKRIRQMQSNLLKYSRHFLYSSPARPLGPEDLTW 459
Query: 309 ---AFYMVMYELWLRRSSVRVQWSTSLDS 334
A +V +L +RRS V+ S S+ S
Sbjct: 460 RMIAGKLVNIKLHIRRSQRVVRESRSICS 488
>gi|296085534|emb|CBI29266.3| unnamed protein product [Vitis vinifera]
Length = 390
Score = 73.9 bits (180), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 77/309 (24%), Positives = 127/309 (41%), Gaps = 47/309 (15%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + + P + I IS
Sbjct: 73 FAAEIFMHRFLLSSPVRTLNPEEADWFYTPIYTTCDLTPNGLPLPFKSPRMMRSAIQLIS 132
Query: 92 QKYPYWNRTGGADHFYVACHSIGRS---AMQKAWEVKLNAI--QVVCSSSYFISGHIAHK 146
+PYWNRT GADHF+V H G +KA E + + + ++ H+
Sbjct: 133 SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLN 192
Query: 147 D--VSLPQIWPRQEDPPKL--GSSKRNKLAFFAGAV----NSP-----VREKLLQVWRN- 192
+ +++P P Q+ L + R+ +F G N P R VW N
Sbjct: 193 EGSITIPPYAPPQKMQAHLIPQETPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVWENF 252
Query: 193 -DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
D+ ++ S T Y +++ + FCL +AD + LPF
Sbjct: 253 KDNPLFDISTEHPTTYYEDMQRAIFCL---------CPLADDIV-------------LPF 290
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---D 308
AD + W+ + VA D+P L IL I E +L + +L Q +FP D
Sbjct: 291 ADAIPWEEIGVFVAEEDVPNLDTILTSIPP-EVILRKQRLLANPSMKQAMLFPQPAQSGD 349
Query: 309 AFYMVMYEL 317
AF+ ++ L
Sbjct: 350 AFHQILNGL 358
>gi|374922021|gb|AFA26188.1| hypothetical protein, partial [Lolium perenne]
Length = 282
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 69/264 (26%), Positives = 110/264 (41%), Gaps = 60/264 (22%)
Query: 90 ISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAWEVKLNAIQVVCSSSYFIS 140
I+Q+YPYWNRT G DH + AC++ I S M W N +++Y+
Sbjct: 8 IAQRYPYWNRTSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWG-NTNTKHENSTTAYWAD 66
Query: 141 G--------------HIAHKDVSLPQ--------IWPRQEDPPKLGSSKRNKLAFFAGAV 178
KD+ LP IW + P+ S R L +F G +
Sbjct: 67 NWDNIPLDRRGDHPCFDPTKDLVLPAWKDPDPAAIWLKLWARPR---SNRRTLFYFNGNL 123
Query: 179 NSP-------------VREKLLQVWRNDSEIYAHSGRL-----------KTPYADELLGS 214
S +R+KL + + GR Y +EL S
Sbjct: 124 GSAYEQGRPEDTYSMGIRQKLAAEFGSTPNKQGKLGRQHVANVTVTHLRSEKYYEELASS 183
Query: 215 KFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKK 274
FC + G + + R+ DS+ GC+PVII + LP+ +VLN+ SF++ + DIP L
Sbjct: 184 IFCGVLPG-DGWSGRMEDSMLQGCIPVIIQDGIFLPYENVLNYNSFAVRIQEDDIPNLIT 242
Query: 275 ILKGISSEEYLLLQSYVLKVRKHF 298
+L+G++ + + V ++ + F
Sbjct: 243 VLRGMNETQIEFMLGNVRQIWQRF 266
>gi|224284379|gb|ACN39924.1| unknown [Picea sitchensis]
Length = 787
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 87/332 (26%), Positives = 141/332 (42%), Gaps = 70/332 (21%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIA------------RMRHDRRIGTEGIP 80
Y S+ ++ + S + T + +AD F++P A M++ + +
Sbjct: 381 YGSQMALYESLLASPYRTLNGEEADYFYVPVLDACLITRADDAPHLSMKNHMGLRSYLTL 440
Query: 81 DFISHYIFNISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAWEVKLNAIQV 131
DF +I + Y YWNR+ G DH + AC++ I S M W N+
Sbjct: 441 DFYKKAYDHIMEHYTYWNRSSGHDHIWFFAWDEGACYAPKEIWNSMMLVHWG-NTNSKHN 499
Query: 132 VCSSSYFISG--HIA------------HKDVSLPQIWPRQEDPPKLGS-----SKRNK-- 170
+++Y HI KD+ LP W R DP + + S+R +
Sbjct: 500 HSTTAYLADNWDHIPIERRGRHPCFDPEKDLVLP-AWKR-PDPYNVKARFWARSRRERFT 557
Query: 171 LAFFAGAV------NSP-------VREKLLQVWRNDSEIYAHSGRLKTP----------- 206
L +F G + N P +R+KL + ++ GR T
Sbjct: 558 LFYFNGNLGASFKNNRPEPTYSLGIRQKLAAEFASEPNKEGKFGRQSTKDVIVVSQKSPN 617
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y EL S FC G + + R+ DS+ GC+PVII + + + +VLN+ SF++ +A
Sbjct: 618 YYSELGSSLFCGVFPG-DGWSGRMEDSVLQGCIPVIIQDGIQVAYENVLNYDSFAVRIAE 676
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
DIP L +IL+GI+ E + V K+R+ F
Sbjct: 677 DDIPHLVQILRGINETELEFKLANVQKLRQRF 708
>gi|222640628|gb|EEE68760.1| hypothetical protein OsJ_27458 [Oryza sativa Japonica Group]
Length = 518
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 46/171 (26%), Positives = 87/171 (50%), Gaps = 12/171 (7%)
Query: 155 PRQEDPP-----KLGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKT---- 205
PR+ PP + +++R + VR++L Q+ +++ +++ G ++
Sbjct: 320 PRRAAPPEQHDGRPAAAQRRHVRPLRLRQGGRVRQRLYQLIKDEKDVHFTYGSVRQNGIR 379
Query: 206 PYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVA 265
+ SKFCL++ G ++ R+ D++ CVPVII++ +LPF DVL++ +F + V
Sbjct: 380 RATKGMASSKFCLNIAGDTPSSNRLFDAIVSHCVPVIISDDIELPFEDVLDYSAFCVFVR 439
Query: 266 TLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
D L +L+GIS EE+ + + +V HF++ DA M+
Sbjct: 440 ASDAVKRGFLLHLLRGISQEEWTAMWRRLKEVAHHFEYQYPSQPGDAVQMI 490
>gi|125572389|gb|EAZ13904.1| hypothetical protein OsJ_03829 [Oryza sativa Japonica Group]
Length = 504
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 71/269 (26%), Positives = 123/269 (45%), Gaps = 33/269 (12%)
Query: 95 PYWNRTGGADHFYVACH-----SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHI-AHKDV 148
P W R+ G DH H S+ R + W + + + +++ G + KDV
Sbjct: 215 PAWQRSEGRDHVIPVHHPWSFKSVRRFVKKAIWLLP----DMDSTGNWYKPGQVYLEKDV 270
Query: 149 SLPQIWPRQEDPPKLGS---SKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSG 201
LP + K S S+R+ L FF G + +R KL+ ++ I G
Sbjct: 271 ILPYVPNVDLCDSKCVSETQSRRSTLLFFRGRLRRNAGGKIRSKLVTELKDAEGIIIEEG 330
Query: 202 RL----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNW 257
K + + S FCL+ G ++AR+ D++ GC+PVI+++ +LPF +L++
Sbjct: 331 TAGADGKAAAQNGMRKSLFCLNPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEGILDY 390
Query: 258 KSFSIVVATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWH-----VFPSDYD- 308
+ ++ V++ D L K L+ I ++ +QS +LK +HF + + P D
Sbjct: 391 RKIALFVSSNDAVQPGWLVKYLRSIDAKRIRQMQSNLLKYSRHFLYSSPARPLGPEDLTW 450
Query: 309 ---AFYMVMYELWLRRSSVRVQWSTSLDS 334
A +V +L +RRS V+ S S+ S
Sbjct: 451 RMIAGKLVNIKLHIRRSQRVVRESRSICS 479
>gi|224085069|ref|XP_002307477.1| predicted protein [Populus trichocarpa]
gi|222856926|gb|EEE94473.1| predicted protein [Populus trichocarpa]
Length = 460
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 79/323 (24%), Positives = 138/323 (42%), Gaps = 40/323 (12%)
Query: 4 SFRVYVYPHRRNDPFANVL------LPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKAD 57
S ++YVY D +L + D +G + ++ + ++S F T+ +A+
Sbjct: 86 SLKIYVYEEDEIDGLKELLRGREGKISADACVKGQWGTQVKIHRLLLQSRFRTRKKGEAN 145
Query: 58 LFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSA 117
LFF+P A + R +G + Y+ +SQ PY+ R+GG DH +V G +
Sbjct: 146 LFFVP---AYAKCVRMMGGLNDKEINHTYVKALSQ-MPYFRRSGGRDHIFVFPSGAG-AH 200
Query: 118 MQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPR-------QEDPPKLGS----- 165
+ ++W +N ++ KD S W ++ K G+
Sbjct: 201 LFRSWATYINRSIILSPEG----DRTDKKDTSSFNTWKDIIIPGNVEDGMTKRGAAMAQP 256
Query: 166 ---SKRNKLAFFAGAVNSPV-REKLLQVWRN-----DSEIYAHSGRLK---TPYADELLG 213
SKR LA + G V R KL+++ + + SG K Y L
Sbjct: 257 LPLSKRKYLANYLGRAQGKVGRLKLIELAKQYPDKLECPELKFSGPEKFGRMEYFQHLRN 316
Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPL-L 272
+KFC+ +G T R +S + CVPVI+++ + PF +V+++ SI + I L L
Sbjct: 317 AKFCVAPRGESSWTLRFYESFFVECVPVILSDQAEFPFQNVIDYTQISIKWPSTRIGLEL 376
Query: 273 KKILKGISSEEYLLLQSYVLKVR 295
+ L+ I E+ + + +VR
Sbjct: 377 LEYLESIPDEDVERMIAAGRQVR 399
>gi|255070683|ref|XP_002507423.1| glycosyltransferase family 47 protein [Micromonas sp. RCC299]
gi|226522698|gb|ACO68681.1| glycosyltransferase family 47 protein [Micromonas sp. RCC299]
Length = 698
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 43/151 (28%), Positives = 77/151 (50%), Gaps = 20/151 (13%)
Query: 174 FAGAVNSPVREKLL---------QVW--RNDSEIYAHSGRLKTPYADELLGSKFCLHVKG 222
F G + VRE++L + W R+D ++ Y + SKFCLHV+G
Sbjct: 492 FRGTLRGGVRERILGHYLSVGKSRNWDLRSDGQVSPSR------YMRLMRDSKFCLHVRG 545
Query: 223 FEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSE 282
V + R+ + + +GCVPVI+A+ Y P + + +W FS+ + ++ L ++L+G+
Sbjct: 546 TRVQSPRLIEGMLFGCVPVIVADGYVPPLSWLFDWSKFSVRLPEVEHERLPEVLQGV--- 602
Query: 283 EYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
++ LQ+ + +V F +H P DA +
Sbjct: 603 DWATLQANLRRVAPFFVYHRTPIPGDALWTT 633
>gi|449681334|ref|XP_002163001.2| PREDICTED: exostosin-1a-like [Hydra magnipapillata]
Length = 469
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 44/141 (31%), Positives = 75/141 (53%), Gaps = 5/141 (3%)
Query: 163 LGSSKRNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFC 217
+GS RN L + + + EK Q + + +S + Y + L S FC
Sbjct: 135 IGSETRNSLYLIHNNEDIILLTTCIHEKNWQKFADSRCEEDNSNYDRFNYTELLANSTFC 194
Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILK 277
L +G + + R +++ YGC+PVI++N +DLPF DV++W FSIV+ + L IL+
Sbjct: 195 LIPRGRRLASFRFLEAIQYGCIPVIMSNGWDLPFNDVIDWVKFSIVLDESLLLQLPSILR 254
Query: 278 GISSEEYLLLQSYVLKVRKHF 298
GIS ++ L ++ + V K++
Sbjct: 255 GISFDQVLAMKQQTIFVWKNY 275
>gi|302837063|ref|XP_002950091.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300264564|gb|EFJ48759.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 593
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 82/377 (21%), Positives = 148/377 (39%), Gaps = 97/377 (25%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFF-------LPFSI-------------ARMRH-- 70
YA+++ ++ + S T DP +AD F+ LPF I AR H
Sbjct: 214 YAADTLLHESLLISEHRTFDPEEADFFYVPHYASCLPFPIGVWADAPWFPEPGARAHHHL 273
Query: 71 ------------------DRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHS 112
+R + + + + + I+ +PYW R GG DH ++ H
Sbjct: 274 CEVLPATRVITVTCRLPVNRGPRVKQMVNMVRDTLDWINSTFPYWRRRGGRDHIWLFTHD 333
Query: 113 IGRSAMQKA-----WEVKLNAIQVVCSSSY----------------------FISGHIA- 144
G A W ++ +S+ +I GH
Sbjct: 334 EGACWAPTAINASIWLTHWGRTELNHTSNTAFLADKYDSDFAGPLQPEGFVKYIKGHPCF 393
Query: 145 --HKDVSLP--------QIWPRQEDPPKLGSSKRNKLAFFAGAVNS--------PVREKL 186
KD+ +P P Q +P + +R+ L FF G V VR+++
Sbjct: 394 NPEKDLVIPAFKAPSHYHASPLQGNPAR----ERDLLFFFRGDVGKNRLPNYSRGVRQQI 449
Query: 187 LQV-----WRNDSEIYAHSGR-LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVP 240
++ W Y G ++ Y++ L + FCL G + +AR+ D++ +GC+P
Sbjct: 450 YKMAKEGGWAEKYRFYIGDGSDVEGDYSEMLSRAIFCLVAPG-DGWSARMEDAVLHGCIP 508
Query: 241 VIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
V+IA+ + F +VL +F++ + +P L +L+ + Q+++ +V + ++W
Sbjct: 509 VVIADGVEAVFENVLELDAFALRLPQEAVPRLLDVLRAVPQRAIRSKQAHLGRVWQRYRW 568
Query: 301 HVFPSDYDAFYMVMYEL 317
P DAF +M L
Sbjct: 569 ASLPKLDDAFATIMQWL 585
>gi|297597828|ref|NP_001044591.2| Os01g0811400 [Oryza sativa Japonica Group]
gi|255673802|dbj|BAF06505.2| Os01g0811400 [Oryza sativa Japonica Group]
Length = 497
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 70/266 (26%), Positives = 121/266 (45%), Gaps = 33/266 (12%)
Query: 95 PYWNRTGGADHFYVACH-----SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHI-AHKDV 148
P W R+ G DH H S+ R + W + + + +++ G + KDV
Sbjct: 223 PAWQRSEGRDHVIPVHHPWSFKSVRRFVKKAIWLLP----DMDSTGNWYKPGQVYLEKDV 278
Query: 149 SLPQIWPRQEDPPKLGS---SKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSG 201
LP + K S S+R+ L FF G + +R KL+ ++ I G
Sbjct: 279 ILPYVPNVDLCDSKCVSETQSRRSTLLFFRGRLRRNAGGKIRSKLVTELKDAEGIIIEEG 338
Query: 202 RL----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNW 257
K + + S FCL+ G ++AR+ D++ GC+PVI+++ +LPF +L++
Sbjct: 339 TAGADGKAAAQNGMRKSLFCLNPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEGILDY 398
Query: 258 KSFSIVVATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWH-----VFPSDYD- 308
+ ++ V++ D L K L+ I ++ +QS +LK +HF + + P D
Sbjct: 399 RKIALFVSSNDAVQPGWLVKYLRSIDAKRIRQMQSNLLKYSRHFLYSSPARPLGPEDLTW 458
Query: 309 ---AFYMVMYELWLRRSSVRVQWSTS 331
A +V +L +RRS V+ S S
Sbjct: 459 RMIAGKLVNIKLHIRRSQRVVRESRS 484
>gi|449487654|ref|XP_004157734.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
[Cucumis sativus]
Length = 464
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 65/276 (23%), Positives = 118/276 (42%), Gaps = 34/276 (12%)
Query: 52 DPSKADLFFLPF-------------SIARMRHDRRI--GTEGIPDFISHYIFNISQKYPY 96
DP +ADLF++PF + + R++ E D ++ +K Y
Sbjct: 141 DPEEADLFYVPFFSSLSLIVNPIRPATGSDQQQRKLVYSDEETQDAFMEWL----EKQEY 196
Query: 97 WNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH-KDVSLPQIWP 155
W R+ G DH +A +A+ + + N+I +V + + KDV +P
Sbjct: 197 WKRSNGRDHVIIAQDP---NALYRLIDRVKNSILLVSDFGRLRADQASLVKDVIVPYSHR 253
Query: 156 RQEDPPKLGSSKRNKLAFFAG----AVNSPVREKLLQVWRNDSEIYAHSG----RLKTPY 207
+G R L FF G +R+ L + + ++ G +
Sbjct: 254 INTYTGDIGVENRKTLLFFMGNRYRKEGGKIRDMLFNILELEQDVIIKHGTQSRESRRAA 313
Query: 208 ADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIV---V 264
+ SKFCL+ G + R+ DS+ CVPVI+++ +LPF DV+++ ++ V
Sbjct: 314 THGMHTSKFCLNPAGDTPSACRLFDSVVSLCVPVIVSDSIELPFEDVIDYSKIAVFFDSV 373
Query: 265 ATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
+ + L L+ IS E L Q + K++++F++
Sbjct: 374 SAVKPEFLISKLRRISEERILDYQREMKKIKRYFEY 409
>gi|449455387|ref|XP_004145434.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10L-like
[Cucumis sativus]
Length = 464
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 65/276 (23%), Positives = 117/276 (42%), Gaps = 34/276 (12%)
Query: 52 DPSKADLFFLPF-------------SIARMRHDRRI--GTEGIPDFISHYIFNISQKYPY 96
DP ADLF++PF + + R++ E D ++ +K Y
Sbjct: 141 DPEVADLFYVPFFSSLSLIVNPIRPATGSDQQQRKLVYSDEETQDAFMEWL----EKQEY 196
Query: 97 WNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH-KDVSLPQIWP 155
W R+ G DH +A +A+ + + N+I +V + + KDV +P
Sbjct: 197 WKRSNGRDHVIIAQDP---NALYRLIDRVKNSILLVSDFGRLRADQASLVKDVIVPYSHR 253
Query: 156 RQEDPPKLGSSKRNKLAFFAG----AVNSPVREKLLQVWRNDSEIYAHSG----RLKTPY 207
+G R L FF G +R+ L + + ++ G +
Sbjct: 254 INTYTGDIGVENRKTLLFFMGNRYRKEGGKIRDMLFNILEQEQDVIIKHGTQSRESRRAA 313
Query: 208 ADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIV---V 264
+ SKFCL+ G + R+ DS+ CVPVI+++ +LPF DV+++ ++ V
Sbjct: 314 THGMHTSKFCLNPAGDTPSACRLFDSVVSLCVPVIVSDSIELPFEDVIDYSKIAVFFDSV 373
Query: 265 ATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
+ + L L+ IS E L Q + K++++F++
Sbjct: 374 SAVKPEFLISKLRRISEERILDYQREMKKIKRYFEY 409
>gi|308799499|ref|XP_003074530.1| Acetylglucosaminyltransferase EXT1/exostosin 1 (ISS) [Ostreococcus
tauri]
gi|116000701|emb|CAL50381.1| Acetylglucosaminyltransferase EXT1/exostosin 1 (ISS), partial
[Ostreococcus tauri]
Length = 439
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 37/92 (40%), Positives = 56/92 (60%), Gaps = 3/92 (3%)
Query: 211 LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIP 270
L S++CL+V G +TAR+ D + +GCVPVI+A+ YDLPF+ + +W FS+ V D+
Sbjct: 345 LSKSRYCLYVYGDRAHTARLYDIITFGCVPVIVADGYDLPFSWLFDWSKFSVRVPEDDVA 404
Query: 271 LLKKILKGISSEEYLLLQSYVLKVRKHFQWHV 302
L IL +Y L+ ++KV FQ+H
Sbjct: 405 KLPGIL---DQADYDSLRGELVKVHSFFQYHA 433
>gi|226508962|ref|NP_001152546.1| exostosin-like [Zea mays]
gi|195657373|gb|ACG48154.1| exostosin-like [Zea mays]
gi|414878975|tpg|DAA56106.1| TPA: exostosin-like protein [Zea mays]
Length = 427
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 73/288 (25%), Positives = 124/288 (43%), Gaps = 32/288 (11%)
Query: 50 TKDPSKADLFFLPF--SIARMRHDRRIGTEGIPD-----FISHYIFNISQKYPYWNRTGG 102
+DP AD FF+PF S++ H R + PD + + +I K YW R+ G
Sbjct: 120 VRDPDAADAFFVPFFSSLSFNVHGRNMTD---PDTEADRLLQVELVDILWKSKYWQRSAG 176
Query: 103 ADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIA--HKDVSLPQIWP----R 156
DH H ++ +NA ++ S + +A KDV P +
Sbjct: 177 RDHVIPMHHPNAFRFLR----AMVNASILIVSDFGRYTKELASLRKDVVAPYVHVVGSFL 232
Query: 157 QEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDS----EIYAHSGRLKTPYA 208
+DPP + R+ L FF G +R KL ++ + E +G
Sbjct: 233 DDDPPDPFEA-RHTLLFFRGRTVRKDEGKIRSKLEKILKGKEGVRFEDSIATGDGINIST 291
Query: 209 DELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI---VVA 265
+ + SKFCLH G ++ R+ D++ CVPVI+++ +LPF D +++ FS+ V
Sbjct: 292 EGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSSRIELPFEDEIDYSEFSLFFSVEE 351
Query: 266 TLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
L L L+ + +++ + + V H+++ P DA M+
Sbjct: 352 ALRPDYLLNELRQVPKRKWVDMWLKLKNVSHHYEFQYPPRKGDAVNMI 399
>gi|242059273|ref|XP_002458782.1| hypothetical protein SORBIDRAFT_03g040220 [Sorghum bicolor]
gi|241930757|gb|EES03902.1| hypothetical protein SORBIDRAFT_03g040220 [Sorghum bicolor]
Length = 426
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 77/279 (27%), Positives = 130/279 (46%), Gaps = 36/279 (12%)
Query: 30 RGNYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFN 89
+G + ++ + +KS F T + +A+LFF+P + ++ R G + Y+
Sbjct: 84 KGQWGTQVKIHQFLLKSRFRTFNKDQANLFFVP---SYVKCVRMTGALSDKEINQTYVKV 140
Query: 90 ISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVC---------SSSYF-- 138
+SQ PY+ R+GG DH +V G + + ++W LN ++ +S F
Sbjct: 141 LSQ-MPYFRRSGGRDHIFVFPSGAG-AHLFRSWATFLNRSIILTPEGDRTDKRGTSAFNT 198
Query: 139 -----ISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAG-AVNSPVREKLLQVWRN 192
I G++ D S+ + R P L +KR LA F G A R +L+++ +
Sbjct: 199 WKDIIIPGNV---DDSMVKSDARAVQPIPL--TKRKYLANFLGRAQGKAGRLQLVELAKQ 253
Query: 193 -----DSEIYAHSGRLK---TPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIA 244
+S + SG K Y L +KFCL +G T R +S + CVPVI++
Sbjct: 254 YPDKLESPVLKLSGPNKLGRIEYFKHLRNAKFCLAPRGESSWTLRFYESFFVECVPVILS 313
Query: 245 NHYDLPFADVLNWKSFSIVVATLDI-PLLKKILKGISSE 282
+ +LPF +V+++ SI + I P L + L+ IS E
Sbjct: 314 DEVELPFQNVIDYGEISIKWPSSRIGPELLEYLESISDE 352
>gi|224144904|ref|XP_002325457.1| predicted protein [Populus trichocarpa]
gi|222862332|gb|EEE99838.1| predicted protein [Populus trichocarpa]
Length = 463
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 75/302 (24%), Positives = 143/302 (47%), Gaps = 34/302 (11%)
Query: 50 TKDPSKADLFFLPF--SIARMR------HDRRIGTEGIPDFISHYIFNISQKYPYWNRTG 101
++ S+AD+ F+PF S+ R H ++ + + + + ++ SQK W R+G
Sbjct: 144 VRNSSEADVIFVPFFSSLCYNRYSKVNPHQKKSKDKLLQEKLVKFL--TSQKE--WKRSG 199
Query: 102 GADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLPQ---IWPR 156
G DH +A H ++M A VKL + + +IA+ KDV P I
Sbjct: 200 GRDHVLLAHHP---NSMLDA-RVKLWPAIFILADFGRYPPNIANVAKDVIAPYKHVIRSY 255
Query: 157 QEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKT----PYA 208
D S R L +F GA+ R++L + +++ E++ G ++ +
Sbjct: 256 VNDSSNFDS--RPTLLYFQGAIYRKDGGFARQELFYLLKDEKEVHFQFGSVQKDGVGKAS 313
Query: 209 DELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLD 268
+ SKFCL++ G ++ R+ D++ CVPVII++ +LP+ +VL++ F I V T D
Sbjct: 314 QGMHSSKFCLNIAGDTPSSNRLFDAIASHCVPVIISDDIELPYENVLDYSQFCIFVRTSD 373
Query: 269 I---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
L +++ I +E+ + + +V F++ + DA M+ + + ++R
Sbjct: 374 AVREKFLVNLIRSIKKDEWTRMWKRLKEVENFFEFQYPSREGDAVQMIWQAVARKVPAIR 433
Query: 326 VQ 327
++
Sbjct: 434 LK 435
>gi|3668093|gb|AAC61825.1| unknown protein [Arabidopsis thaliana]
Length = 460
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 67/254 (26%), Positives = 111/254 (43%), Gaps = 20/254 (7%)
Query: 52 DPSKADLFFLP-FS----IARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHF 106
DP+ ADLF++P FS I G+ + + + + +W R G DH
Sbjct: 130 DPADADLFYVPVFSSLSLIVNAGRPVEAGSGYSDEKMQEGLVEWLEGQEWWRRNAGRDHV 189
Query: 107 YVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH-KDVSLPQIWPRQEDPPKLGS 165
A +A+ + + NA+ +V + KDV +P ++G
Sbjct: 190 IPAGDP---NALYRILDRVKNAVLLVSDFGRLRPDQGSFVKDVVIPYSHRVNLFNGEIGV 246
Query: 166 SKRNKLAFFAG----AVNSPVREKLLQVWRNDSEIYAHSGRL----KTPYADELLGSKFC 217
RN L FF G VR+ L QV + ++ G + + SKFC
Sbjct: 247 EDRNTLLFFMGNRYRKDGGKVRDLLFQVLEKEDDVTIKHGTQSRENRRAATKGMHTSKFC 306
Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV---ATLDIPLLKK 274
L+ G + R+ DS+ CVP+I+++ +LPF DV++++ FSI V A L L +
Sbjct: 307 LNPAGDTPSACRLFDSIVSLCVPLIVSDSIELPFEDVIDYRKFSIFVEANAALQPGFLVQ 366
Query: 275 ILKGISSEEYLLLQ 288
+L+ I +++ L Q
Sbjct: 367 MLRKIKTKKILEYQ 380
>gi|289166878|gb|ADC84489.1| glycosyltransferase family 47C [Salix sachalinensis]
Length = 252
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 63/233 (27%), Positives = 99/233 (42%), Gaps = 29/233 (12%)
Query: 82 FISHYIFNISQKYPYWNRTGGADHFYVACHSIG--------RSAMQKAWEVKLNAIQVVC 133
+S + IS +P+WNR+ G+DH +VA H G R+A E +I +
Sbjct: 5 LLSSAVNLISSNHPFWNRSRGSDHVFVASHDYGACFHAMEERAAEDGIPEFLKRSIILQT 64
Query: 134 SSSYFISGHIAHKDVSLPQ-IWPRQ-----EDPPKLGSSKRNKLAFFAGAV--------- 178
F ++V +P I P E P G +R+ FF G +
Sbjct: 65 FGVKFDHPCQDVENVVIPPFITPESVQTTLEKYPLTG--RRDIWVFFRGKMEVHPKNISG 122
Query: 179 ---NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLY 235
+ VR + + + D Y R Y E+ S FCL G+ + R+ +S+
Sbjct: 123 RYYSKKVRTVIWRKYSGDPRFYLRRHRFAG-YQSEIARSVFCLCPLGWAPWSPRLVESIA 181
Query: 236 YGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
GCVPVIIA+ LPF + W S+ VA D+ L+ +L +++ +Q
Sbjct: 182 LGCVPVIIADGIRLPFPAAVRWSDISLTVAEKDVADLRTLLDHVAASNLSAIQ 234
>gi|242059031|ref|XP_002458661.1| hypothetical protein SORBIDRAFT_03g037670 [Sorghum bicolor]
gi|241930636|gb|EES03781.1| hypothetical protein SORBIDRAFT_03g037670 [Sorghum bicolor]
Length = 499
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 72/268 (26%), Positives = 127/268 (47%), Gaps = 35/268 (13%)
Query: 95 PYWNRTGGADHFYVACH-----SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHI-AHKDV 148
P W R+ G DH H S+ R + W + + + +++ G + KDV
Sbjct: 223 PAWQRSEGRDHVIPVHHPWSFKSVRRFVKKAIWLLP----DMDSTGNWYKPGQVYLEKDV 278
Query: 149 SLPQIWPRQE--DPPKLGSS--KRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHS 200
LP + P + D + + KR+ L FF G + +R KL++ ++ +I
Sbjct: 279 ILPYV-PNVDLCDHKCVLETQFKRSILLFFRGRLKRNAGGKIRSKLVEELKSAEDIVIEE 337
Query: 201 G----RLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLN 256
G + K D + S FCL G ++AR+ D++ GC+PVII++ +LPF +L+
Sbjct: 338 GSAGAQGKAAAQDGMRKSLFCLSPAGDTPSSARLFDAIVSGCIPVIISDELELPFEGILD 397
Query: 257 WKSFSIVVATLDIPL---LKKILKGISSEEYLLLQSYVLKVRKHFQWH-----VFPSDYD 308
++ ++ V++ D L K L+GI ++ +QS ++K +HF + + P D
Sbjct: 398 YREIALFVSSSDAVQPGWLVKYLRGIDAKRIREIQSNLVKYSRHFLYSSPAQPLGPEDLT 457
Query: 309 ----AFYMVMYELWLRRSSVRVQWSTSL 332
A +V +L +RRS V+ S S+
Sbjct: 458 WRMIAGKVVNIKLQIRRSQRLVRESRSV 485
>gi|269785149|ref|NP_001161530.1| exostosin 1-like protein [Saccoglossus kowalevskii]
gi|268054045|gb|ACY92509.1| exostosin 1-like protein [Saccoglossus kowalevskii]
Length = 737
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 91/369 (24%), Positives = 142/369 (38%), Gaps = 100/369 (27%)
Query: 5 FRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFLP 62
F+VYVYP + GN SE+Y K A +S F T DP KA LF +P
Sbjct: 93 FKVYVYPTQE----------------GNKVSEAYDKILSAIRESRFYTSDPKKACLF-IP 135
Query: 63 FSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHS---------- 112
SI + D PD++ + I Q P WN G +H +S
Sbjct: 136 -SIDTLDRDHLS-----PDYVKNAQSKI-QSLPLWN--NGQNHLIFVLYSGTWPEYSDLD 186
Query: 113 ----IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK- 167
+G++ + KA +N F G D+S+P +D + G S+
Sbjct: 187 LGFELGQAMLAKASTTSIN----------FRPGF----DISIPLF---SKDHAQKGGSRG 229
Query: 168 ----------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHS---GRLKTPYAD 209
R L F G + S R L + I + G+ AD
Sbjct: 230 DLQTNNFPVARKYLLVFKGKRYLSGIGSETRNALYHIHNGQDIILLTTCKHGKSWEKNAD 289
Query: 210 E-----------------LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFA 252
L S FCL +G + + R +SL C+PV+++N ++LPF+
Sbjct: 290 SRCEQDNAEFDRYDFHILLHNSTFCLVPRGRRLGSFRFLESLQAACIPVLLSNGWELPFS 349
Query: 253 DVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYM 312
+V++W SI+ + + I++ +S++E L L+ + F W + S D M
Sbjct: 350 EVIDWNRASIIGDERLLLQIPSIVRTVSNDEIL-----SLRQQTQFLWETYFSSVDKIVM 404
Query: 313 VMYELWLRR 321
E+ R
Sbjct: 405 TTLEIIQER 413
>gi|255585230|ref|XP_002533317.1| catalytic, putative [Ricinus communis]
gi|223526861|gb|EEF29074.1| catalytic, putative [Ricinus communis]
Length = 478
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 73/272 (26%), Positives = 114/272 (41%), Gaps = 49/272 (18%)
Query: 32 NYASESYFKKAFMKSHFVTKDPSKADLFFLPFSI---ARMRHDRRIGT----EGIPDFIS 84
N S+S+ K+ F D ++AD+ F+PF A M R GT EG D+
Sbjct: 132 NLRSQSFAKRVF--------DFNQADVVFVPFFATLSAEMELARGEGTFRKKEGNEDYKR 183
Query: 85 H-YIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVC---------- 133
+ + W R+GG DH +V + W V+ V
Sbjct: 184 QKEVIEFVKSSDAWKRSGGKDHVFVLTDPVA------MWHVRAEIAPAVLLVVDFGGWYR 237
Query: 134 ----SSSYFISGHIAHKDVSLPQ--IWPRQEDPPKLGSS---KRNKLAFFAGAV----NS 180
SS S I H VSL + I P P+L S KR L +F GA
Sbjct: 238 LDSKSSDGNSSNIIRHTQVSLLKDVIVPYTHLLPQLPLSENKKRQTLLYFKGAKYRHRGG 297
Query: 181 PVREKLLQVWRNDSEIYAHSGRLKTPYADE----LLGSKFCLHVKGFEVNTARIADSLYY 236
VREKL + N+ + G ++ + S+FCLH G + R+ D++
Sbjct: 298 MVREKLWDLLVNEPGVIMEEGFPNATGREQSIKGMRTSEFCLHPAGDTPTSCRLFDAIQS 357
Query: 237 GCVPVIIANHYDLPFADVLNWKSFSIVVATLD 268
C+P+I++++ +LPF ++++ FS+ +A D
Sbjct: 358 LCIPIIVSDNIELPFEGIVDYLEFSVFMAVDD 389
>gi|168033234|ref|XP_001769121.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162679650|gb|EDQ66095.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 765
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 83/334 (24%), Positives = 138/334 (41%), Gaps = 67/334 (20%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMR-----------HDRRIGTEG--I 79
Y + ++ + S T + +AD F++PF A + + +G
Sbjct: 367 YGAGIALYESLLASEHRTTNGDEADFFYVPFLQACIVEQGDAAPHLTFQGKYMGLRQYFA 426
Query: 80 PDFISHYIFNISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAWEVKLNAIQ 130
D+ F+I Q YPYWNR+ G DH + AC + I S M W NA
Sbjct: 427 GDYSKQIYFHIQQNYPYWNRSAGRDHIWFFPWDEGACSAPKEIWNSMMLSHWG-NTNAKH 485
Query: 131 VVCSSSY-----------FISGHIAH---KDVSLP-----QIWPRQEDPPKLGSSKRNKL 171
+++Y + H + KD+ LP +P ++ R L
Sbjct: 486 KASTTAYRADNWDLIPPEWRGDHPCYDPAKDLVLPAWKFPDPYPIVQNLSSRHRQDRPTL 545
Query: 172 AFFAGAVNSP-------------VREKLLQVWRNDSEIYAHSGRL----------KTP-Y 207
+F G + S +R+KL + + GR ++P Y
Sbjct: 546 FYFNGNLGSAYDNGRPEPGYSMGIRQKLAAEFGSQPNKKGLLGRQAVDDVVVQAQRSPQY 605
Query: 208 ADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATL 267
EL S+FC + G + + R+ DS+ GC+PVII + LPF +VL+++SF++ VA
Sbjct: 606 KLELSKSRFCGVLPG-DGWSGRMEDSILSGCIPVIIQDGIHLPFENVLDYESFTVRVAED 664
Query: 268 DIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
+I L ILK I+ + + + V + + F +H
Sbjct: 665 NIHNLITILKAINEAQVDSMLAVVRGLWQRFTYH 698
>gi|413956860|gb|AFW89509.1| hypothetical protein ZEAMMB73_873038 [Zea mays]
Length = 783
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 72/265 (27%), Positives = 115/265 (43%), Gaps = 60/265 (22%)
Query: 89 NISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAWEVKLNAIQVVCSSSYFI 139
+I+Q+YPYWNRT G DH + AC++ I +S M W N +++Y+
Sbjct: 451 HIAQQYPYWNRTSGRDHIWFFSWDEGACYAPKEIWKSMMLVHWG-NTNTKHKNSTTAYWA 509
Query: 140 -----------SGHIA---HKDVSLPQ--------IWPRQEDPPKLGSSKRNKLAFFAGA 177
H KD+ LP IW + P+ + R L +F G
Sbjct: 510 DNWDDIPLDKRGNHPCFDPRKDLVLPAWKEPNPGAIWLKLWARPR---NNRTTLFYFNGN 566
Query: 178 VNSP-------------VREKLLQVWRNDSEIYAHSGR----------LKTP-YADELLG 213
+ S +R+KL + + GR L+T Y +EL
Sbjct: 567 LGSAYEGGRPEDTYSMGIRQKLAAEFGSTPNKQGRLGRQHAADVTVTYLRTEKYYEELAS 626
Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLK 273
S FC + G + + R+ DS+ GC+PVII + LP+ +VLN+ SF++ + DIP L
Sbjct: 627 SVFCGVLPG-DGWSGRMEDSMLQGCIPVIIQDGIFLPYENVLNYNSFAVRIQEDDIPGLI 685
Query: 274 KILKGISSEEYLLLQSYVLKVRKHF 298
L+GI+ + + V ++ + F
Sbjct: 686 STLRGINDTQVEFMLGNVRQMWQRF 710
>gi|212722910|ref|NP_001131485.1| uncharacterized protein LOC100192822 [Zea mays]
gi|194691662|gb|ACF79915.1| unknown [Zea mays]
Length = 426
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 72/274 (26%), Positives = 127/274 (46%), Gaps = 26/274 (9%)
Query: 30 RGNYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFN 89
+G + ++ + +KS F T + A+LFF+P + ++ R G + Y+
Sbjct: 84 KGQWGTQVKVHQFLLKSRFRTFNKDHANLFFVP---SYVKCVRMTGALSDKEINQTYVKV 140
Query: 90 ISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVC-----SSSYFISGHIA 144
+SQ PY+ R+GG DH +V G + + ++W + LN ++ + S
Sbjct: 141 LSQ-MPYFRRSGGRDHIFVFPSGAG-AHLFRSWAIFLNRSIILTPEGDRTDKRGTSAFNT 198
Query: 145 HKDVSLPQIWPR---QEDPPKLGS---SKRNKLAFFAG-AVNSPVREKLLQVWRN-DSEI 196
KD+ +P + D P + +KR LA F G A R +L+++ + ++
Sbjct: 199 WKDIIIPGNVDDSMVKSDAPAVQPIPLTKRKYLANFLGRAQGKAGRLQLVELAKQYPDKL 258
Query: 197 YAHSGRLKTP-------YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDL 249
+ +L P Y L +KFCL +G T R +S + CVPVI+++ +L
Sbjct: 259 ESPELKLSGPNKLGRIEYFKHLRNAKFCLAPRGESSWTLRFYESFFVECVPVILSDEVEL 318
Query: 250 PFADVLNWKSFSIVVATLDI-PLLKKILKGISSE 282
PF +V+++ SI + I P L + L+ IS E
Sbjct: 319 PFQNVIDYSEISIKWPSSRIGPELLEYLESISDE 352
>gi|449452903|ref|XP_004144198.1| PREDICTED: probable glycosyltransferase At3g07620-like [Cucumis
sativus]
Length = 429
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 68/285 (23%), Positives = 124/285 (43%), Gaps = 30/285 (10%)
Query: 52 DPSKADLFFLPF--SIARMRHDRRIGTEGIPDFISHYI----FNISQKYPYWNRTGGADH 105
DP AD FF+PF S++ H R + + H + + YW R+ G DH
Sbjct: 122 DPENADAFFVPFFSSLSFNSHGRNMTDPATE--VDHQLQIDLMKFLSESKYWQRSKGRDH 179
Query: 106 FYVACHSIGRSAMQKAWEVKLNA-IQVVCS-SSYFISGHIAHKDVSLPQIWPRQ---EDP 160
H ++ ++NA IQ+V Y + KDV P + +D
Sbjct: 180 VIPMTHPNAFRFLRN----QVNASIQIVVDFGRYPKTMSNLGKDVVAPYVHVVSSFIDDN 235
Query: 161 PKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIY-----AHSGRLKTPYADEL 211
P R L FF G + +R KL ++ +++ A +KT + +
Sbjct: 236 PPDPFESRPTLLFFQGKTFRKDDGIIRVKLAKILDGYDDVHYERSAATEKSIKT-SSQGM 294
Query: 212 LGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVA---TLD 268
SKFCLH G ++ R+ D++ CVPVI+++ +LP+ D +++ F++ + L
Sbjct: 295 RSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPYEDEIDYSQFTLFFSFEEALQ 354
Query: 269 IPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
+ + L+ E ++ + + ++ +H+++ P DA M+
Sbjct: 355 PGYMVEKLREFPKERWIEMWKQLKEISRHYEFQYPPKKEDAVNML 399
>gi|357126548|ref|XP_003564949.1| PREDICTED: probable glycosyltransferase At3g42180-like
[Brachypodium distachyon]
Length = 432
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 70/284 (24%), Positives = 120/284 (42%), Gaps = 28/284 (9%)
Query: 52 DPSKADLFFLPF--SIARMRHDRRIGTEGIPD-----FISHYIFNISQKYPYWNRTGGAD 104
DP A+ FF+PF S++ H R + PD + + ++ K YW R+ G D
Sbjct: 128 DPEAAEAFFVPFFSSLSFNVHGRNMTD---PDTEADRLLQVELIDVLWKSKYWQRSAGRD 184
Query: 105 HFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPR----QEDP 160
H H ++ V + + V Y KDV P + +DP
Sbjct: 185 HVIPMHHPNAFRFLRDM--VNASVLIVADFGRYTQELASLRKDVVAPYVHVVDSFINDDP 242
Query: 161 PKLGSSKRNKLAFFAG----AVNSPVREKLLQVWRNDS----EIYAHSGRLKTPYADELL 212
P + R L FF G +R KL ++ ++ E +G + +
Sbjct: 243 PDPFEA-RPTLLFFRGRTVRKAEGKIRAKLAKILKDKDGVRFEDSLATGEGINTSTEGMR 301
Query: 213 GSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI---VVATLDI 269
SKFCLH G ++ R+ D++ C+PVI+++ +LPF D +++ FS+ V L
Sbjct: 302 SSKFCLHPAGDTPSSCRLFDAIVSHCIPVIVSSRIELPFEDEIDYSEFSLFFSVEEALKP 361
Query: 270 PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
L L+ + E+++ + S + V H+++ DA M+
Sbjct: 362 DYLLDQLRQMPKEKWVEMWSKLKNVSSHYEFQYPTRKGDAVNMI 405
>gi|357168395|ref|XP_003581626.1| PREDICTED: probable beta-1,4-xylosyltransferase IRX10-like
[Brachypodium distachyon]
Length = 348
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 75/291 (25%), Positives = 129/291 (44%), Gaps = 26/291 (8%)
Query: 41 KAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRT 100
+ +KS F T D +A LFF+P + ++ R G + Y+ +SQ PY+ R+
Sbjct: 44 QLLLKSRFRTLDKDEAHLFFVP---SYVKCVRMTGALTDKEINQTYVKVLSQ-MPYFRRS 99
Query: 101 GGADHFYVACHSIGRSAMQKAWEVKLNAIQVVC-----SSSYFISGHIAHKDVSLPQIWP 155
GG DH +V G + + ++W LN ++ + IS KD+ +P
Sbjct: 100 GGRDHIFVFPSGAG-AHLFRSWATFLNRSIILTPEGDRTDKRGISAFNTWKDIIIPG--- 155
Query: 156 RQEDPPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSK 215
+D +G + R KL A + L++ D GR+ Y L +K
Sbjct: 156 NVDD--SMGKAGRLKLVELAKQYPDKLESPELKLSGPDK-----LGRID--YFKHLRNAK 206
Query: 216 FCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI-PLLKK 274
FCL +G T R +S + CVPVI+++ +LPF +++++ SI + I P L +
Sbjct: 207 FCLAPRGESSWTLRFYESFFVECVPVILSDEVELPFQNMIDYTEISIKWPSSKISPELFE 266
Query: 275 ILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
L+ I E ++ + + R+ V+ D + + LW + VR
Sbjct: 267 YLESIPEER---IEEMIARGREVRCLWVYAPDLEPCSAMAAILWELQRKVR 314
>gi|255558009|ref|XP_002520033.1| catalytic, putative [Ricinus communis]
gi|223540797|gb|EEF42357.1| catalytic, putative [Ricinus communis]
Length = 507
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 58/224 (25%), Positives = 98/224 (43%), Gaps = 24/224 (10%)
Query: 95 PYWNRTGGADHFYVACH-----SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHI-AHKDV 148
P W R+GG DH H S+ R W + + + +++ G + KD+
Sbjct: 229 PAWKRSGGRDHILPVHHPWSFKSVRRYVKNAIWLLP----DMDSTGNWYKPGQVFLEKDL 284
Query: 149 SLPQIWPRQEDPPKLGS---SKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSG 201
LP + K S SKR L FF G + +R KL+ + G
Sbjct: 285 ILPYVPNVDLCDAKCASENESKRTTLLFFRGRLKRNAGGKIRAKLVAELSGAEGVVVEEG 344
Query: 202 RL----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNW 257
K + S FCL G ++AR+ D++ GC+PVI+++ +LPF +L++
Sbjct: 345 TAGEGGKAAAQTGMRKSIFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEGILDY 404
Query: 258 KSFSIVVATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHF 298
+ ++ V++ D L K LK +S + +Q ++K +HF
Sbjct: 405 RKIAVFVSSSDAIQPGWLIKFLKDVSPAQTREMQRNLVKYSRHF 448
>gi|392575274|gb|EIW68408.1| hypothetical protein TREMEDRAFT_32575 [Tremella mesenterica DSM 1558]
Length = 1176
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 53/92 (57%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y E+ G++FC G + RI D++Y GC+PV+ A PFAD L+W FSI +
Sbjct: 1032 YMKEISGARFCPQPTGIAGWSPRINDAIYAGCIPVLTAEGTHYPFADFLDWSKFSIRIKP 1091
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
++ L++IL I E+ +Q+ ++ VR+ F
Sbjct: 1092 TELDQLERILSAIPLEQLEEMQANLMLVREAF 1123
>gi|357113818|ref|XP_003558698.1| PREDICTED: uncharacterized protein LOC100844507 [Brachypodium
distachyon]
Length = 781
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 71/265 (26%), Positives = 111/265 (41%), Gaps = 60/265 (22%)
Query: 89 NISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAWEVKLNAIQVVCSSSYFI 139
+ISQ+Y YWNRT G DH + AC++ I S M W N +++Y+
Sbjct: 450 HISQRYAYWNRTSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWG-NTNTKHENSTTAYWA 508
Query: 140 -----------SGHIA---HKDVSLPQ--------IWPRQEDPPKLGSSKRNKLAFFAGA 177
H KD+ LP IW + P++ R L +F G
Sbjct: 509 DNWDDIPLDRRGNHPCFDPRKDLVLPAWKVPEPGAIWLKLWARPRI---NRTTLFYFNGN 565
Query: 178 VNSP-------------VREKLLQVWRNDSEIYAHSGRLKTP-----------YADELLG 213
+ +R+KL + + GR T Y +EL
Sbjct: 566 LGPAYEQGRPEDTYSMGIRQKLAAEFGSTPSKEGKLGRQHTANVTVTYLRSEKYYEELAS 625
Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLK 273
S FC + G + + R+ DS+ GC+PVII + LP+ +VLN+ SF++ + DIP L
Sbjct: 626 SVFCGALPG-DGWSGRMEDSMLQGCIPVIIQDGIFLPYENVLNYNSFAVRIQEHDIPNLI 684
Query: 274 KILKGISSEEYLLLQSYVLKVRKHF 298
+IL GI+ + + V ++ + F
Sbjct: 685 RILGGINETQIEFMLGNVRQIWQRF 709
>gi|289166880|gb|ADC84490.1| glycosyltransferase family 47C [Salix miyabeana]
Length = 252
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 63/233 (27%), Positives = 98/233 (42%), Gaps = 29/233 (12%)
Query: 82 FISHYIFNISQKYPYWNRTGGADHFYVACHSIG--------RSAMQKAWEVKLNAIQVVC 133
+S + IS +P+WNR+ G+DH +VA H G R+A E +I +
Sbjct: 5 LLSSAVNLISSNHPFWNRSRGSDHVFVASHDYGACFHAMEERAAEDGIPEFLKRSIILQT 64
Query: 134 SSSYFISGHIAHKDVSLPQ-IWPRQ-----EDPPKLGSSKRNKLAFFAGAV--------- 178
F ++V +P I P E P G +R+ FF G +
Sbjct: 65 FGVKFDHPCQDVENVVIPPFITPESVQTTLEKYPLTG--RRDIWVFFRGKMEVHPKNISG 122
Query: 179 ---NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLY 235
+ VR + + + D Y R Y E+ S FCL G+ + R+ +S+
Sbjct: 123 RYYSKKVRTVIWRKYSGDPRFYLRRHRFAG-YQSEIARSVFCLCPLGWAPWSPRLVESIA 181
Query: 236 YGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
GCVPVIIA+ LPF + W S+ VA D+ L +L +++ +Q
Sbjct: 182 LGCVPVIIADGIRLPFPAAVRWSDISLTVAEKDVADLGTLLDHVAASNLSAIQ 234
>gi|449489243|ref|XP_004158257.1| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
At3g07620-like [Cucumis sativus]
Length = 429
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 68/285 (23%), Positives = 123/285 (43%), Gaps = 30/285 (10%)
Query: 52 DPSKADLFFLPF--SIARMRHDRRIGTEGIPDFISHYI----FNISQKYPYWNRTGGADH 105
DP AD FF+PF S++ H R + + H + + YW R+ G DH
Sbjct: 122 DPENADAFFVPFFSSLSFNSHGRNMTDPATE--VDHQLQIDLMKFLSESKYWQRSKGRDH 179
Query: 106 FYVACHSIGRSAMQKAWEVKLNA-IQVVCS-SSYFISGHIAHKDVSLPQIWPRQ---EDP 160
H ++ ++NA IQ+V Y + KDV P + +D
Sbjct: 180 VIPMTHPNAFRFLRN----QVNASIQIVVDFGRYPKTMSNLGKDVVAPYVHVVSSFIDDN 235
Query: 161 PKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIY-----AHSGRLKTPYADEL 211
P R L FF G + +R KL ++ +++ A +KT + +
Sbjct: 236 PPDPFESRPTLLFFQGKTFRKDDGIIRVKLAKILDGYDDVHYERSAATEKSIKT-SSQGM 294
Query: 212 LGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV---ATLD 268
SKFCLH G ++ R+ D++ CVPVI+++ +LP+ D +++ F++ L
Sbjct: 295 RSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPYEDEIDYSQFTLFFXFEEALQ 354
Query: 269 IPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
+ + L+ E ++ + + ++ +H+++ P DA M+
Sbjct: 355 PGYMVEKLREFPKERWIEMWKQLKEISRHYEFQYPPKKEDAVNML 399
>gi|225461772|ref|XP_002285599.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
IRX7-like [Vitis vinifera]
Length = 513
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 59/223 (26%), Positives = 101/223 (45%), Gaps = 22/223 (9%)
Query: 95 PYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVV----CSSSYFISGHIA-HKDVS 149
P W R+ G DH H ++K+ + NAI ++ + +++ G ++ KD+
Sbjct: 238 PAWKRSEGRDHILPVHHPWSFKTVRKSMK---NAIWLLPDMDSTGNWYKPGQVSLEKDLI 294
Query: 150 LPQIWPRQEDPPKLGS---SKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGR 202
LP + K S SKR L FF G + +R KL+ D + G
Sbjct: 295 LPYVPNVDLCDAKCSSESESKRKTLLFFRGRLKRNAGGKIRAKLMAELSGDDGVVIQEGT 354
Query: 203 L----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWK 258
K + S FCL G ++AR+ D++ GC+PVI+++ +LPF +L+++
Sbjct: 355 AGEGGKEAAQRGMRKSIFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEGILDYR 414
Query: 259 SFSIVVATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHF 298
++ V++ D L LK IS + +Q + K +HF
Sbjct: 415 KIALFVSSSDAMQPGWLLTFLKSISPAQIKEMQRNLAKYSRHF 457
>gi|302142837|emb|CBI20132.3| unnamed protein product [Vitis vinifera]
Length = 379
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 59/223 (26%), Positives = 101/223 (45%), Gaps = 22/223 (9%)
Query: 95 PYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVV----CSSSYFISGHIA-HKDVS 149
P W R+ G DH H ++K+ + NAI ++ + +++ G ++ KD+
Sbjct: 104 PAWKRSEGRDHILPVHHPWSFKTVRKSMK---NAIWLLPDMDSTGNWYKPGQVSLEKDLI 160
Query: 150 LPQIWPRQEDPPKLGS---SKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGR 202
LP + K S SKR L FF G + +R KL+ D + G
Sbjct: 161 LPYVPNVDLCDAKCSSESESKRKTLLFFRGRLKRNAGGKIRAKLMAELSGDDGVVIQEGT 220
Query: 203 L----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWK 258
K + S FCL G ++AR+ D++ GC+PVI+++ +LPF +L+++
Sbjct: 221 AGEGGKEAAQRGMRKSIFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEGILDYR 280
Query: 259 SFSIVVATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHF 298
++ V++ D L LK IS + +Q + K +HF
Sbjct: 281 KIALFVSSSDAMQPGWLLTFLKSISPAQIKEMQRNLAKYSRHF 323
>gi|356503460|ref|XP_003520526.1| PREDICTED: uncharacterized protein LOC100775594 [Glycine max]
Length = 761
Score = 70.9 bits (172), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 81/337 (24%), Positives = 136/337 (40%), Gaps = 81/337 (24%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPFS----IARMRHDRRIGT---EGIPDFISH 85
Y ++ ++ + S T + +AD FF+P I R H + T EG+ F++
Sbjct: 360 YGAQIALYESILASPHRTLNGDEADFFFVPVLDSCLIDRADHAPHLSTQNHEGLRSFLTL 419
Query: 86 YIF-----NISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAW----EVKLN 127
+ +I ++YPYWN + G DH + AC++ I S M W +
Sbjct: 420 DFYKNAYNHIVEQYPYWNCSSGRDHIWFFSWDEGACYAPKEIWSSMMLVHWGNTNTKHYH 479
Query: 128 AIQVVCSSSY---------FISGHIAHKDVSLPQ--------------IWPRQEDPPKLG 164
+ C ++ F KD+ +P WP +
Sbjct: 480 STTAYCPDNWDGIPSDRRGFHPCFDPEKDLVIPAWKVTHVHVLSSKLWAWPLE------- 532
Query: 165 SSKRNKLAFFAGAVNSP------------VREKLLQVWRNDSEIYAHSGRLKTP------ 206
KR L +F G + +R+KL + + + G+ +
Sbjct: 533 --KRKTLFYFNGNLGPAYPYGRNEWYSMGIRQKLAEEFGSKPNKEGKLGKQRAKDVVVTA 590
Query: 207 -----YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFS 261
Y EL S FC + G + + R+ DS+ GC+PVII + LP+ +VLN+ SF+
Sbjct: 591 ERSENYEVELASSVFCGVLPG-DGWSGRMEDSVLQGCIPVIIQDGIFLPYENVLNYDSFA 649
Query: 262 IVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
+ + +IP L KIL+GI+ E + V K+ + F
Sbjct: 650 VRIPEDEIPNLIKILRGINDTEIKFKLANVQKIWQRF 686
>gi|297611147|ref|NP_001065633.2| Os11g0128000 [Oryza sativa Japonica Group]
gi|255679743|dbj|BAF27478.2| Os11g0128000, partial [Oryza sativa Japonica Group]
Length = 199
Score = 70.9 bits (172), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 90/166 (54%), Gaps = 10/166 (6%)
Query: 169 NKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----LLGSKFCLHVKGFE 224
N + F G +R++L + +++ ++Y G ++ A + + SKFCL++ G
Sbjct: 22 NCVIFVQGG---NIRQELHYMLKDEKDVYFAFGSVQDHGASKASQGMHASKFCLNIAGDT 78
Query: 225 VNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PLLKKILKGISS 281
++ R+ D++ CVPVII++ +LP+ D L++ FSI V + D L ++++G+S
Sbjct: 79 PSSNRLFDAIVSHCVPVIISDDIELPYEDALDYSKFSIFVRSSDAVKKGYLMRLIRGVSK 138
Query: 282 EEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQ 327
++ ++ + +V KHF++ DA M+ L + ++R++
Sbjct: 139 HQWTMMWRRLKEVDKHFEYQYPSQKDDAVQMIWQTLARKVPAIRLK 184
>gi|297833038|ref|XP_002884401.1| hypothetical protein ARALYDRAFT_477609 [Arabidopsis lyrata subsp.
lyrata]
gi|297330241|gb|EFH60660.1| hypothetical protein ARALYDRAFT_477609 [Arabidopsis lyrata subsp.
lyrata]
Length = 495
Score = 70.5 bits (171), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 75/290 (25%), Positives = 138/290 (47%), Gaps = 35/290 (12%)
Query: 50 TKDPSKADLFFLPF--SIARMR----HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGA 103
K+ S+AD+ F+PF S++ R + ++ ++ ++ + SQK W +GG
Sbjct: 196 VKNSSEADVVFVPFFSSLSYNRFSKVNQKQKKSQDKELQVNVVKYVTSQKE--WKISGGK 253
Query: 104 DHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLP--QIWPRQED 159
DH +A H ++M A A+ VV + S H+A+ KD+ P + P +
Sbjct: 254 DHVIMAHHP---NSMSTARHKLYPAMFVVADFGRY-SPHVANIDKDIVAPYKHLVPSYAN 309
Query: 160 PPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLK----TPYADEL 211
G R L +F GA+ VR++L + +++ G ++ T + +
Sbjct: 310 DTS-GFDGRPILLYFQGAIYRKAGGFVRQELYK------DVHFSFGSVRNHGITKAGEGM 362
Query: 212 LGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI-- 269
SKFCL++ G ++ R+ D++ C+PVII++ +LP+ DVLN+ F + V + D
Sbjct: 363 RSSKFCLNIAGDTPSSNRLFDAIASHCIPVIISDDIELPYEDVLNYNEFCLFVRSSDALK 422
Query: 270 -PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELW 318
L ++K I +EY + + +V ++F FP+ D + +W
Sbjct: 423 KGFLMGLVKSIGRDEYNKMWLRLKEVERYFDLR-FPTKDDEGDYAVQMIW 471
>gi|384490457|gb|EIE81679.1| hypothetical protein RO3G_06384 [Rhizopus delemar RA 99-880]
Length = 793
Score = 70.5 bits (171), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 67/308 (21%), Positives = 131/308 (42%), Gaps = 47/308 (15%)
Query: 46 SHFVTKDPSKADLFFLPFSIA----------RMRHDRRIGTEGIPDFISHYIFNISQKYP 95
H+VT++P +AD FF+PF + + D R + ++ + + Q+YP
Sbjct: 481 EHYVTENPEEADFFFIPFFGSCYLYNCWYENKWNWDERCEVDA--KYVDPLMDMVIQEYP 538
Query: 96 YWNRTGGADHFYVACHSIGRS----AMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLP 151
YWN+TGG +H + H + ++ ++ + V + ++S H H+D+ +P
Sbjct: 539 YWNKTGGRNHIMI--HPMDKTFTYYQSNPRFQSAIFLKTVGDKRNKWMSRHRYHRDIVIP 596
Query: 152 QI-------------WPRQEDPPKLGSSKRNKLAFFAGAV---------NSPVREKLLQV 189
+ + PK S KR+ A F G ++ +R
Sbjct: 597 SATRMIHHLRANPLDYLNAQGQPK--SGKRDIFALFQGCCPDVQPTDEYSNGIRSLFFNH 654
Query: 190 WRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDL 249
+ + Y ++L +K+ L G+ ++T RI + + +G VPV+IA+
Sbjct: 655 FAHYPGYEIGQSVADEEYLEKLSRAKYGLSPMGWTLDTTRIWEFMAFGVVPVVIADGIIE 714
Query: 250 PFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDA 309
PF ++W F + + ++ L +ILK I + Y Q + + + + DA
Sbjct: 715 PFEFDVDWDKFIVRIRRDEVHRLDEILKSIDDKTYEYKQKNLWEFGRRVGLEM-----DA 769
Query: 310 FYMVMYEL 317
++ ++ EL
Sbjct: 770 WHFIVREL 777
>gi|412986477|emb|CCO14903.1| predicted protein [Bathycoccus prasinos]
Length = 574
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 40/108 (37%), Positives = 61/108 (56%), Gaps = 5/108 (4%)
Query: 161 PKLGSSKRNKLAFFAGAVNSPVREKLLQ--VWRNDSEIYAHSGRLKTP--YADELLGSKF 216
PKL ++RN F G P+REK+ + + E + + +P Y + SK+
Sbjct: 389 PKL-EAERNIRLMFRGNNRGPLREKVFRYLIENGSPEDSIETTGVASPQAYMSLMEHSKY 447
Query: 217 CLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV 264
CLHV+G V + R+ + + +GCVPVI+A+ Y+LP A L+W FSI V
Sbjct: 448 CLHVRGTRVMSPRLIELMLFGCVPVIVADAYELPLAWFLDWTKFSIRV 495
>gi|359359217|gb|AEV41121.1| putative exostosin family domain-containing protein [Oryza
officinalis]
Length = 460
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 66/250 (26%), Positives = 112/250 (44%), Gaps = 36/250 (14%)
Query: 49 VTKDPSKADLFFLPFSIARMRHDRRIG---------TEGIPDFISHY-IFNISQKYPYWN 98
V D AD+ F+PF A + + +G EG D+ + + +P W
Sbjct: 122 VVADWRDADVVFVPF-FATLSAEMELGWGAKGAFRRKEGNEDYRRQREVVDRVTAHPAWR 180
Query: 99 RTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVV----------CSSSYFISGHIAHKDV 148
R+GG DH +V + ++K E+ + + VV S+S +S I H V
Sbjct: 181 RSGGRDHVFVLTDPVAMWHVRK--EIAPSILLVVDFGGWYKLDSNSASSNVSHMIQHTQV 238
Query: 149 SLPQ--IWPRQEDPPKLGSSK---RNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAH 199
SL + I P P + S+ R L +F GA VREKL + N+ ++
Sbjct: 239 SLLKDVIVPYTHLLPTMQLSENKDRLTLLYFKGAKHRHRGGLVREKLWDLMVNEPDVVME 298
Query: 200 SGRLKTPYADE----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVL 255
G ++ + S+FCLH G + R+ D++ C+PVI+++ +LPF ++
Sbjct: 299 EGYPNATGREQSIKGMRTSEFCLHPAGDTPTSCRLFDAVASLCIPVIVSDEIELPFEGMI 358
Query: 256 NWKSFSIVVA 265
++ F+I V+
Sbjct: 359 DYTEFTIFVS 368
>gi|357474287|ref|XP_003607428.1| Exostosin family protein-like protein [Medicago truncatula]
gi|355508483|gb|AES89625.1| Exostosin family protein-like protein [Medicago truncatula]
Length = 502
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 59/224 (26%), Positives = 101/224 (45%), Gaps = 24/224 (10%)
Query: 95 PYWNRTGGADHFYVACH-----SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHI-AHKDV 148
P W R+GG DH H ++ R + W + + + +++ G + KD+
Sbjct: 222 PAWKRSGGRDHILPVHHPWSFKTVRRYVKKAIWLLP----DMDSTGNWYKPGQVYLEKDL 277
Query: 149 SLPQIWPRQ-EDPPKLG--SSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSG 201
LP + D L + KRN L FF G + +R KL+ R + G
Sbjct: 278 ILPYVANVDFCDATCLSEINPKRNTLLFFRGRLKRNAGGKIRSKLVDQLRGADGVVIEEG 337
Query: 202 RL----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNW 257
K + + S FCL+ G ++AR+ D++ GC+PVI+++ +LPF +L++
Sbjct: 338 TSGEGGKEAAQNGMRKSLFCLNPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEGILDY 397
Query: 258 KSFSIVVATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHF 298
+ ++ V++ D L K LK I S +Q + K +HF
Sbjct: 398 RKIALFVSSNDALKPSWLLKYLKDIRSAHIKEMQQNLAKYSRHF 441
>gi|359473738|ref|XP_002272591.2| PREDICTED: LOW QUALITY PROTEIN: probable glycosyltransferase
At5g25310-like [Vitis vinifera]
Length = 437
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 71/284 (25%), Positives = 121/284 (42%), Gaps = 25/284 (8%)
Query: 52 DPSKADLFFLPF--SIARMRHDRRIGTEGIPDFISHY---IFNISQKYPYWNRTGGADHF 106
DP AD FF+PF S++ H + T+ +F I I ++ YW R+GG DH
Sbjct: 126 DPEMADAFFVPFFSSLSFNTHGHNM-TDPDTEFDRQLQIDILKILRESKYWQRSGGRDHV 184
Query: 107 YVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW---PRQEDPPKL 163
H ++ +V + + V Y KDV P + +D
Sbjct: 185 IPMHHPNAFRFFRE--QVNTSILIVADFGRYPKEISNLRKDVVAPYVHVVDSFTDDNSPD 242
Query: 164 GSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEI----YAHSGRLKTPYADELLG-- 213
R L FF G VR+KL+++ + + H L G
Sbjct: 243 PYESRTTLLFFRGRTIRKDEGIVRDKLVKLLAGXDDYLQLHFHHRSYLSFLVXQSTQGMR 302
Query: 214 -SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVA---TLDI 269
SKFCLH G ++ R+ D++ CVPVI+++ +LP+ D +++ FSI + L+
Sbjct: 303 SSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPYEDEIDYTQFSIFFSDKEALEP 362
Query: 270 PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
+ + L+ I E ++ + ++ + H+++ P DA M+
Sbjct: 363 GYMIEQLRQIPKERWVEMWRHLKYISHHYEFQYPPKKGDAIDML 406
>gi|193687008|ref|XP_001944271.1| PREDICTED: exostosin-2-like [Acyrthosiphon pisum]
Length = 703
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 79/301 (26%), Positives = 127/301 (42%), Gaps = 41/301 (13%)
Query: 2 NRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLF 59
N F+VYVYP R+ + DF P G S+ Y A ++S + T +P +A +F
Sbjct: 86 NGDFKVYVYPMARH-------VDEDFIPIGGKMSKEYHTILSAIVESQYYTTNPEEACVF 138
Query: 60 FLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGA--DHFYVACHSIGRSA 117
R T + H+ N Q + +N G D+ V SIG+S
Sbjct: 139 VSSIDTLNQNRFRVKETSQALALLPHW--NDGQNHLIFNMVPGTAPDYKTVVDLSIGKSM 196
Query: 118 MQKA----WEVKLNAIQVVCSSSYFISGHI-AHKDVSLPQIWPRQEDPPKLGSSKRNKLA 172
+ A W + SS+ IS I ++ +SL + + + + + N
Sbjct: 197 VAGAGFDSWTYR---------SSFDISIAIYSNLAISLSNNYTFKHRTTFISTVQTNLHN 247
Query: 173 FFAGAVNSPVREKLLQVWRNDSEI-----YAHSGRLKT---------PYADELLGSKFCL 218
F ++ S ++K + S I +HSG +T YAD L S FCL
Sbjct: 248 DFITSLKSIEKQKSMTQNITMSMIRVIEPCSHSGHDRTIICHENITYNYADILADSVFCL 307
Query: 219 HVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKG 278
+ G + DSL GC+P++ NH LPF +V++WK I+ + ++ L ++ G
Sbjct: 308 ILPGPRLMDMVFIDSLAAGCIPIVAINHVVLPFFEVIDWKRAVIMWSETELNTLLDVVSG 367
Query: 279 I 279
I
Sbjct: 368 I 368
>gi|34394613|dbj|BAC83915.1| limonene cyclase like protein [Oryza sativa Japonica Group]
gi|50508945|dbj|BAD31849.1| limonene cyclase like protein [Oryza sativa Japonica Group]
Length = 417
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 46/122 (37%), Positives = 66/122 (54%), Gaps = 23/122 (18%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPV--DFEPRGNYASESYFKKAF-MKSHFVTKDPSKAD 57
M + F+V+VY +P PV D R Y++E F A M++ T+DP +A
Sbjct: 268 MEKVFKVFVY--EEGEP------PVFHDGPCRSIYSTEGRFIYAMEMENRMRTRDPDQAH 319
Query: 58 LFFLPFSIARM--------RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVA 109
+FFLPFS+ +M HD + + IS YI +S KYP+WNR+ GADHF ++
Sbjct: 320 VFFLPFSVVKMVKMIYEPNSHD----MDPLRRTISDYINVVSTKYPHWNRSLGADHFMLS 375
Query: 110 CH 111
CH
Sbjct: 376 CH 377
>gi|449501885|ref|XP_004161484.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
LOC101226446 [Cucumis sativus]
Length = 859
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 84/334 (25%), Positives = 140/334 (41%), Gaps = 70/334 (20%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPF--SIARMRHDRRIGTE-----GIPDFISH 85
Y +E F ++ + S T + +AD FF+P S R D G+ F++
Sbjct: 391 YGAEMAFYESILASPHRTLNGEEADFFFVPVLDSCIITRADDAPHLSLRDYMGLRSFLTL 450
Query: 86 YIF-----NISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAWEVKLNAIQV 131
+ +I ++YPYWNR+ G DH + AC++ I S M W N+
Sbjct: 451 DFYKKAHDHIVEQYPYWNRSSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWG-NTNSKHN 509
Query: 132 VCSSSYFI-----------SGHIA---HKDVSLPQIWPRQEDPPKLGS-------SKRNK 170
+++Y+ H KD+ +P W R D +L +R
Sbjct: 510 HSTTAYWGDNWDNIPSSKRGNHPCFDPEKDLVVPA-WKRP-DGSRLSKKLWARPREERKT 567
Query: 171 LAFFAGAVNSP-------------VREKLLQVWRNDSEIYAHSGR------LKTP----- 206
FF G + +R+K+ + + + G+ + TP
Sbjct: 568 FFFFNGNLGPAYERGRPESTYSMGIRQKVAEEFGSSPNKEGKLGKQHAADVIVTPLRSEN 627
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y ++L S FC + G + + R+ DS+ GC+PVII + LP+ +VLN+ SF++ +
Sbjct: 628 YHEDLASSVFCGVMPG-DGWSGRMEDSILQGCIPVIIQDGIFLPYENVLNYDSFAVRIGE 686
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
DIP L IL+G + E S V K+ + F +
Sbjct: 687 DDIPNLINILRGFNESEIEFKLSNVRKIWQRFMY 720
>gi|449437619|ref|XP_004136589.1| PREDICTED: uncharacterized protein LOC101206674 [Cucumis sativus]
Length = 791
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 84/334 (25%), Positives = 140/334 (41%), Gaps = 70/334 (20%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPF--SIARMRHDRRIGTE-----GIPDFISH 85
Y +E F ++ + S T + +AD FF+P S R D G+ F++
Sbjct: 391 YGAEMAFYESILASPHRTLNGEEADFFFVPVLDSCIITRADDAPHLSLRDYMGLRSFLTL 450
Query: 86 YIF-----NISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAWEVKLNAIQV 131
+ +I ++YPYWNR+ G DH + AC++ I S M W N+
Sbjct: 451 DFYKKAHDHIVEQYPYWNRSSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWG-NTNSKHN 509
Query: 132 VCSSSYFI-----------SGHIA---HKDVSLPQIWPRQEDPPKLGS-------SKRNK 170
+++Y+ H KD+ +P W R D +L +R
Sbjct: 510 HSTTAYWGDNWDNIPSSKRGNHPCFDPEKDLVVP-AWKRP-DGSRLSKKLWARPREERKT 567
Query: 171 LAFFAGAVNSP-------------VREKLLQVWRNDSEIYAHSGR------LKTP----- 206
FF G + +R+K+ + + + G+ + TP
Sbjct: 568 FFFFNGNLGPAYERGRPESTYSMGIRQKVAEEFGSSPNKEGKLGKQHAADVIVTPLRSEN 627
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y ++L S FC + G + + R+ DS+ GC+PVII + LP+ +VLN+ SF++ +
Sbjct: 628 YHEDLASSVFCGVMPG-DGWSGRMEDSILQGCIPVIIQDGIFLPYENVLNYDSFAVRIGE 686
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
DIP L IL+G + E S V K+ + F +
Sbjct: 687 DDIPNLINILRGFNESEIEFKLSNVRKIWQRFMY 720
>gi|348519397|ref|XP_003447217.1| PREDICTED: exostosin-1c-like [Oreochromis niloticus]
Length = 740
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 80/356 (22%), Positives = 133/356 (37%), Gaps = 81/356 (22%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFL 61
FRVY+YP +ND SESY K + +S + T DP +A LF L
Sbjct: 108 GFRVYIYPSEKND----------------RVSESYRKILASIAESRYYTSDPREACLFVL 151
Query: 62 PFSIARMRHDRRIGTEGIPDFISHYIFNISQK---YPYWNRTGGADHFYVACHSIGRSAM 118
I + D+ G ++ N+ ++ YP WN G +H +S
Sbjct: 152 --GIDTLDRDQLSG---------QFVPNVDERIRGYPLWN--DGRNHLIFNLYSGTWPNY 198
Query: 119 QKAWEVKLNAIQVVCSSSYFISGHIAHK-DVSLPQIWPRQEDPPKLGSSK---------- 167
+ ++ N Q + + + + H DVS+P +D P+ G +
Sbjct: 199 TE--DLGFNIGQAILAKASLNTEHFRPGFDVSIPLF---SKDHPQKGGERGWLVRNTVPP 253
Query: 168 -RNKLAFFAG-----AVNSPVREKLLQV-----------------WRNDSEIYAHSGRL- 203
R L F G + S R L + W + L
Sbjct: 254 RRKYLLMFKGKRYLTGIGSDTRNALHHIHNGKDIVSLTTCRHGKDWEKHKDARCDHDNLE 313
Query: 204 --KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFS 261
+ Y + L S FCL +G + + R +SL C+PV+++N ++LPF+DV+ W
Sbjct: 314 YERFDYQELLHNSTFCLVPRGRRLGSFRFLESLQAACIPVLLSNGWELPFSDVIQWNQAV 373
Query: 262 IVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
I + + ++ + +E L L+ R W + S D + E+
Sbjct: 374 IEGDERLLLQVPSTVRAVGNERVL-----ALRQRTQMLWEAYFSSVDKIVLTTLEI 424
>gi|224135879|ref|XP_002327326.1| predicted protein [Populus trichocarpa]
gi|222835696|gb|EEE74131.1| predicted protein [Populus trichocarpa]
Length = 449
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 83/307 (27%), Positives = 131/307 (42%), Gaps = 49/307 (15%)
Query: 34 ASESYFKKAFMKSHFVTKDPSKADLFFLPFSI---ARMRHDRRIGT----EGIPDFISH- 85
SE ++F K F D ++AD+ F+PF A M + G+ EG D+
Sbjct: 100 TSEKLKSRSFAKRVF---DFNEADVVFVPFFATLSAEMELAKGKGSFRRKEGNEDYQRQK 156
Query: 86 YIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLN---AIQVVCS-------- 134
+ + + W R+GG DH +V + W V+ AI +V
Sbjct: 157 EVVDFVRNSEAWKRSGGKDHVFVLTDPVA------MWHVRAEIAPAILLVVDFGGWYRLD 210
Query: 135 ---SSYFISGHIAHKDVSLPQ--IWPRQEDPPKLGSS---KRNKLAFFAGAV----NSPV 182
S+ S I H VSL + I P P+ S KRN L +F GA V
Sbjct: 211 SKSSNGSSSDMIRHTQVSLLKDVIVPYTHLLPRFQFSENKKRNTLLYFKGAKHRHRGGIV 270
Query: 183 REKLLQVWRNDSEIYAHSGRLKTPYADEL-----LGSKFCLHVKGFEVNTARIADSLYYG 237
RE L + N+ + G EL S+FCLH G + R+ D++
Sbjct: 271 RENLWDLLVNEPGVIMEEG-FPNATGRELSIRGMRTSEFCLHPAGDTPTSCRLFDAIQSL 329
Query: 238 CVPVIIANHYDLPFADVLNWKSFSIVVA---TLDIPLLKKILKGISSEEYLLLQSYVLKV 294
C+PVI++++ +LPF +L++ FS+ VA L L L+ IS ++ L+ + K+
Sbjct: 330 CIPVIVSDNIELPFEGILDYTEFSVFVAGDDALKPTWLMDHLRSISEKQKEELRRNMAKI 389
Query: 295 RKHFQWH 301
+ +Q+
Sbjct: 390 QLIYQYE 396
>gi|115451051|ref|NP_001049126.1| Os03g0174300 [Oryza sativa Japonica Group]
gi|108706455|gb|ABF94250.1| exostosin family protein, putative, expressed [Oryza sativa
Japonica Group]
gi|113547597|dbj|BAF11040.1| Os03g0174300 [Oryza sativa Japonica Group]
gi|215695509|dbj|BAG90700.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 576
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 82/311 (26%), Positives = 133/311 (42%), Gaps = 70/311 (22%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP----FSIAR--------MRHDRRIGTEGIP 80
Y ++ ++ + S T + +AD F++P I R M D R+ +
Sbjct: 258 YGAQIALYESILASPHRTLNGDEADYFYVPALDSCLITRSDDAPHLQMPRDLRLRSYHTL 317
Query: 81 DFISHYIFNISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAWEVKLNAIQV 131
++ +I+Q+YPYWNRT G DH + AC++ I S M W N
Sbjct: 318 EYYRMTYDHIAQRYPYWNRTSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWG-NTNTKHK 376
Query: 132 VCSSSYF--------ISGHIAH------KDVSLPQIWPRQEDPP----KLGSSKRNK--- 170
+++Y+ I H KD+ LP W +Q +P KL + RN
Sbjct: 377 NSTTAYWADNWNYIPIDRRGNHPCFDPRKDLVLPA-W-KQPNPAAIWLKLWARTRNNRTT 434
Query: 171 LAFFAGAVNSP-------------VREKLLQVWRNDSEIYAHSGRLKTP----------- 206
L +F G + +R+KL + + + GR T
Sbjct: 435 LFYFNGNLGPAYKDGRHEDTYSMGIRQKLAAEFGSTPDKQGKLGRQHTANVTVTYLRTEK 494
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y +EL S FC + G + + R+ DS+ GC+PVII + LP+ ++LN+ SF++ +
Sbjct: 495 YYEELASSIFCGVLPG-DGWSGRMEDSMLQGCIPVIIQDGILLPYENMLNYNSFAVRIQE 553
Query: 267 LDIPLLKKILK 277
DIP L +IL+
Sbjct: 554 DDIPNLIRILR 564
>gi|432883098|ref|XP_004074204.1| PREDICTED: exostosin-1c-like [Oryzias latipes]
Length = 740
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 82/353 (23%), Positives = 133/353 (37%), Gaps = 75/353 (21%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFL 61
FRVY+YP +ND SESY K + +S + T DP +A LF L
Sbjct: 108 GFRVYIYPSEKND----------------RVSESYRKILASIAESRYYTSDPREACLFVL 151
Query: 62 PFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKA 121
I + D+ G F+S+ I + YP WN G +H +S +
Sbjct: 152 --GIDTLDRDQLSG-----QFVSNVDDRI-RGYPLWNE--GRNHLIFNLYSGTWPNYTE- 200
Query: 122 WEVKLNAIQVVCSSSYFISGHIAHK-DVSLPQIWPRQEDPPKLGSSK-----------RN 169
++ N Q + + + + H DVS+P +D P+ G + R
Sbjct: 201 -DLGFNIGQAILAKASLNTEHFRPGFDVSIPLF---SKDHPQKGGERGWLVRNSTPPRRK 256
Query: 170 KLAFFAG-----AVNSPVREKLLQV-----------------WRNDSEIYAHSGRL---K 204
L F G + S R L + W + L +
Sbjct: 257 YLLMFKGKRYLTGIGSDTRNALHHIHNGKDIVSLTTCRHGKDWEKHKDARCDHDNLEYER 316
Query: 205 TPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV 264
Y + L S FCL +G + + R +SL C+PV+++N ++LPF+DV+ W I
Sbjct: 317 FDYQELLHNSTFCLVPRGRRLGSFRFLESLQAACIPVLLSNGWELPFSDVIQWNQAVIEG 376
Query: 265 ATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
+ + ++ + +E L L+ R W + S D + E+
Sbjct: 377 DERLLLQVPSTVRAVGNERVL-----ALRQRTQMLWDAYFSSVDKIVLTTLEI 424
>gi|388512347|gb|AFK44235.1| unknown [Lotus japonicus]
Length = 267
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 64/254 (25%), Positives = 113/254 (44%), Gaps = 27/254 (10%)
Query: 92 QKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLN-AIQVVCSSSYFISGHIAHKDVSL 150
+K +W R+ G DH + H ++ +LN +IQVV + G +KDV
Sbjct: 5 KKSVHWQRSRGRDHVFPMTHPNAFRFLRN----QLNESIQVVVDFGRYPKGSNLNKDVVS 60
Query: 151 PQIW---PRQEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAH---- 199
P + +D P+ R L FF G VR KL ++ +++
Sbjct: 61 PYVHVVDSFTDDEPQDPYESRPTLLFFRGRTFRKDEGIVRAKLAKILTGFDDVHYERSFA 120
Query: 200 SGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKS 259
+G + + SKFCLH G ++ R+ D++ CVPVI+++ +LPF D +++
Sbjct: 121 TGENIKLSSQGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPFEDEIDYSQ 180
Query: 260 FSIVVATLD-------IPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYM 312
FS+ + + I L+K K SE + L++ + H+++ P DA M
Sbjct: 181 FSLFFSFKEALQPGYMIDQLRKFPKDKWSEMWRQLKN----ISHHYEFQYPPKKEDAVNM 236
Query: 313 VMYELWLRRSSVRV 326
+ ++ + VR+
Sbjct: 237 LWRQVKHKLPGVRL 250
>gi|297820564|ref|XP_002878165.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297324003|gb|EFH54424.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 792
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 84/337 (24%), Positives = 144/337 (42%), Gaps = 73/337 (21%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIA------------RMRHDRRIGTEGIP 80
Y S+ F + + + T + +AD FF+P + M++ + +
Sbjct: 390 YGSQMAFYENILATAHRTLNGEEADFFFVPVLDSCIINRADDAPHINMQNHTGLRSSFTL 449
Query: 81 DFISHYIFNISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAWEVKLNAIQV 131
+F +I +KYPYWNR+ G DH + AC++ I S M W N+
Sbjct: 450 EFYKRAYEHIVEKYPYWNRSAGRDHIWFFSWDEGACYAPKEIWNSMMLVHWG-NTNSKHN 508
Query: 132 VCSSSYF------ISGH--------IAHKDVSLPQIWPRQEDPPKLGSS-------KRNK 170
+++Y+ IS KD+ +P W + DP + ++ KR
Sbjct: 509 HSTTAYWGDNWDDISDERRGDHPCFDPRKDLVIP-AW-KVPDPYSMRANYWARPREKRKT 566
Query: 171 LAFFAGAVNSP-------------VREKLLQVWRNDSEIYAHSGR------LKTP----- 206
L +F G + +R+KL + + + G+ + TP
Sbjct: 567 LFYFNGNLGPAYEKGRPEDSYSMGIRQKLAEEFGSSPNKEGKLGKQHAEDVIVTPLRSDN 626
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y ++ S FC G + + R+ DS+ GCVPVII + LP+ ++LN++SF++ V+
Sbjct: 627 YHKDIANSIFCGAFPG-DGWSGRMEDSILQGCVPVIIQDGIYLPYENMLNYESFAVRVSE 685
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVF 303
DIP L L+G S E +Q + V+K +Q +F
Sbjct: 686 DDIPNLINTLRGFSETE---IQFRLANVKKLWQRFLF 719
>gi|32488405|emb|CAE02830.1| OSJNBa0043A12.35 [Oryza sativa Japonica Group]
gi|90265244|emb|CAH67697.1| H0624F09.5 [Oryza sativa Indica Group]
Length = 464
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 66/251 (26%), Positives = 112/251 (44%), Gaps = 36/251 (14%)
Query: 49 VTKDPSKADLFFLPFSIARMRHDRRIG---------TEGIPDFISHY-IFNISQKYPYWN 98
V D AD+ F+PF A + + +G EG D+ + + +P W
Sbjct: 126 VVADWRDADVVFVPF-FATLSAEMELGWGAKGAFRRKEGNEDYRRQREVVDRVTAHPAWR 184
Query: 99 RTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVV----------CSSSYFISGHIAHKDV 148
R+GG DH +V + ++K E+ + VV S+S +S I H V
Sbjct: 185 RSGGRDHVFVLTDPVAMWHVRK--EIAPAILLVVDFGGWYKLDSNSASSNVSHMIQHTQV 242
Query: 149 SLPQ--IWPRQEDPPKLGSSK---RNKLAFFAGAVN----SPVREKLLQVWRNDSEIYAH 199
SL + I P P + S+ R L +F GA + VREKL + N+ ++
Sbjct: 243 SLLKDVIVPYTHLLPTMHLSENKDRPTLLYFKGAKHRHRGGLVREKLWDLMVNEPDVVME 302
Query: 200 SGRLKTPYADE----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVL 255
G ++ + S+FCLH G + R+ D++ C+PVI+++ +LPF ++
Sbjct: 303 EGYPNATGREQSIKGMRTSEFCLHPAGDTPTSCRLFDAVASLCIPVIVSDEIELPFEGMI 362
Query: 256 NWKSFSIVVAT 266
++ F+I V+
Sbjct: 363 DYTEFAIFVSV 373
>gi|147818414|emb|CAN68950.1| hypothetical protein VITISV_039537 [Vitis vinifera]
Length = 488
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 43/175 (24%), Positives = 89/175 (50%), Gaps = 11/175 (6%)
Query: 168 RNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLK----TPYADELLGSKFCLH 219
R L +F GA+ +R++L + +++ +++ G + + + SKFCL+
Sbjct: 290 RPTLLYFQGAIYRKDGGFIRQELFYLLKDEKDVHFAFGNTQGNGINKASQGMHSSKFCLN 349
Query: 220 VKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PLLKKIL 276
+ G ++ R+ D++ CVPVII++ +LP+ DVL++ F I V T D L K++
Sbjct: 350 IAGDTPSSNRLFDAIASHCVPVIISDEIELPYEDVLDYSQFCIFVRTSDALKDKFLXKLI 409
Query: 277 KGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQWSTS 331
+ I +E+ + + +V F++ + DA M+ + + ++R++ S
Sbjct: 410 RSIKKDEWTRMWRRLKEVENFFEFQYPSKEGDAVQMIWQAITRKVPAIRMKLHKS 464
>gi|359493064|ref|XP_002270238.2| PREDICTED: probable glucuronosyltransferase Os01g0926700-like
[Vitis vinifera]
Length = 483
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 67/253 (26%), Positives = 111/253 (43%), Gaps = 43/253 (16%)
Query: 52 DPSKADLFFLPFSIARMRHDRRIG--------TEGIPDFISH-YIFNISQKYPYWNRTGG 102
D ++AD+ F+PF A + + ++G EG D+ + + W R+GG
Sbjct: 149 DVNEADVVFVPF-FATISAEIQLGGGKGVFRKKEGNEDYERQRQVMEFVRGTEAWKRSGG 207
Query: 103 ADHFYVACHSIGRSAMQKAWEVKLN---AIQVVC-----------SSSYFISGHIAHKDV 148
DH +V + W VK AI +V +S+ +S I H V
Sbjct: 208 RDHVFVLTDPVA------MWHVKAEIAPAILLVVDFGGWYKLDSKASNNSLSEMIQHTQV 261
Query: 149 SLPQ--IWPRQEDPPKLGSSK---RNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAH 199
SL + I P P+L S+ R L +F GA VREKL + + +
Sbjct: 262 SLLKDVIVPYTHLLPRLHLSENQIRQTLLYFKGAKHRHRGGLVREKLWDLLVYEQGVIME 321
Query: 200 SGRLKTPYADE----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVL 255
G ++ + S+FCLH G + R+ D++ C+PVI++++ +LPF ++
Sbjct: 322 EGFPNATGREQSIKGMRTSEFCLHPAGDTPTSCRLFDAIQSLCIPVIVSDNIELPFEGMV 381
Query: 256 NWKSFSIVVATLD 268
++ FS+ VA D
Sbjct: 382 DYSEFSVFVAVRD 394
>gi|356538648|ref|XP_003537813.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase F8H-like
[Glycine max]
Length = 502
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 59/225 (26%), Positives = 100/225 (44%), Gaps = 26/225 (11%)
Query: 95 PYWNRTGGADHFYVACH-----SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHI-AHKDV 148
P W R+GG DH H S+ R W + + + +++ G + KD+
Sbjct: 222 PAWKRSGGRDHILPVHHPWSFKSVRRYVKNAIWLLP----DMDSTGNWYKPGQVYLEKDL 277
Query: 149 SLPQIWPRQE--DPPKLGSS--KRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHS 200
LP + P + D L + KR+ L FF G + +R KL +
Sbjct: 278 ILPYV-PNVDLCDAKCLSETNPKRSTLLFFRGRLKRNAGGKIRSKLGAELSGVDGVVIEE 336
Query: 201 GRL----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLN 256
G K + S FCL G ++AR+ D++ GC+PVII++ +LPF +L+
Sbjct: 337 GTAGDGGKEAAQRGMRKSLFCLSPAGDTPSSARLFDAIVSGCIPVIISDELELPFEGILD 396
Query: 257 WKSFSIVVATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHF 298
++ ++ ++++D L K LKGI +Q ++K +HF
Sbjct: 397 YRKIAVFISSIDAVKPGWLLKYLKGIRPAHIKAMQQNLVKYSRHF 441
>gi|357162592|ref|XP_003579459.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase-like
[Brachypodium distachyon]
Length = 474
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 75/289 (25%), Positives = 128/289 (44%), Gaps = 44/289 (15%)
Query: 52 DPSKADLFFLPFSIARMRHDRRIG----------TEGIPDFISHY-IFNISQKYPYWNRT 100
D ++AD+ F+PF A + + +G EG D+ + + +P W R+
Sbjct: 138 DWTEADVVFVPF-FATLSAELELGWGATKGAFRRKEGNADYRRQREVVDRVTAHPAWRRS 196
Query: 101 GGADHFYVACHSIG----RSAMQKAWEV--------KLNAIQVVCSSSYFISGHIAHKDV 148
GG DH +V + R+ + A + KL++ +SS+ I H V
Sbjct: 197 GGRDHVFVLTDPMAMWHVRAEIAPAILLVVDFGGWYKLDSKSAGSNSSHMIQ----HTQV 252
Query: 149 SLPQ--IWPRQEDPPKLGSSK---RNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAH 199
SL + I P P L S+ R L +F GA VREKL V N+ +
Sbjct: 253 SLLKDVIIPYTHLLPTLQLSENMDRPTLLYFKGAKHRHRGGLVREKLWDVMINEPGVVME 312
Query: 200 SGRLKTPYADE----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVL 255
G ++ + S+FCLH G ++ R+ D++ C+PVI+++ +LPF ++
Sbjct: 313 EGFPNATGREQSIKGMRTSEFCLHPAGDTPSSCRLFDAVASLCIPVIVSDDIELPFEGMI 372
Query: 256 NWKSFSIVVA---TLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
++ FSI V+ + L LK IS ++ + + KV+ F++
Sbjct: 373 DYTEFSIFVSVGNAMRPKWLASYLKTISKQQKDEFRRNLAKVQHIFEYE 421
>gi|224097754|ref|XP_002311068.1| predicted protein [Populus trichocarpa]
gi|222850888|gb|EEE88435.1| predicted protein [Populus trichocarpa]
Length = 380
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 72/280 (25%), Positives = 117/280 (41%), Gaps = 20/280 (7%)
Query: 52 DPSKADLFFLPF--SIARMRHDRRIGT-EGIPDF-ISHYIFNISQKYPYWNRTGGADHFY 107
DP A+ +F+PF S++ H R + E D + + + QK YW R+GG DH
Sbjct: 76 DPEIAEAYFVPFFSSLSFNTHGRNMTDPETEKDRQLQVDLIDFLQKSKYWQRSGGRDHVI 135
Query: 108 VACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLP---QIWPRQEDPPKLG 164
H +++ V + + V Y S KDV P + ++D
Sbjct: 136 PMTHPNAFRFLRQL--VNASILIVADFGRYPKSLSTLSKDVVSPYVHNVDSFKDDDLLDP 193
Query: 165 SSKRNKLAFFAGAV----NSPVREKLLQVWRN-DSEIYAHSGRLKTPYADELLG---SKF 216
R L FF G VR KL ++ D Y S G SKF
Sbjct: 194 FESRKTLLFFRGNTVRKDKGKVRAKLEKILAGYDDVRYERSSPTAEAIQASTQGMRSSKF 253
Query: 217 CLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI---VVATLDIPLLK 273
CLH G ++ R+ D++ CVPVI+++ +LP+ D +++ FSI + + L
Sbjct: 254 CLHPAGDTPSSCRLFDAIVSHCVPVIVSDLIELPYEDEIDYSQFSIFFSINEAIQPDYLV 313
Query: 274 KILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
L+ + ++ + + K+ HF++ P DA M+
Sbjct: 314 NQLRKFPKDRWIEMWRQLKKISHHFEFQYPPVKEDAVNML 353
>gi|18399194|ref|NP_564443.1| exostosin-like protein [Arabidopsis thaliana]
gi|5091619|gb|AAD39607.1|AC007454_6 F23M19.7 [Arabidopsis thaliana]
gi|15450928|gb|AAK96735.1| Unknown protein [Arabidopsis thaliana]
gi|20148711|gb|AAM10246.1| unknown protein [Arabidopsis thaliana]
gi|332193570|gb|AEE31691.1| exostosin-like protein [Arabidopsis thaliana]
Length = 477
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 67/281 (23%), Positives = 118/281 (41%), Gaps = 39/281 (13%)
Query: 54 SKADLFFLPFSIARMRHDRRIGT--------EGIPDFISH-YIFNISQKYPYWNRTGGAD 104
S AD+ F+PF A + + +G G D+ + + + W R+ G D
Sbjct: 144 SDADVVFVPF-FATLSAEMELGNGKGSFRKKSGNEDYQRQRQVLDFVKNTKAWKRSNGRD 202
Query: 105 HFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQ------------ 152
H +V + +++ E+ L+ + VV +F + SLP+
Sbjct: 203 HVFVLTDPVAMWHVRE--EIALSILLVVDFGGWFRQDSKSSNGTSLPERIQHTQVSVIKD 260
Query: 153 -IWPRQEDPPKLGSS---KRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLK 204
I P P+L S +R+ L +F GA +REKL + N+ + G
Sbjct: 261 VIVPYTHLLPRLDLSQNQRRHSLLYFKGAKHRHRGGLIREKLWDLLVNEPGVVMEEGFPN 320
Query: 205 TPYADE----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSF 260
++ + S+FCLH G + R+ D++ C+PVI+++ +LPF ++++ F
Sbjct: 321 ATGREQSIRGMRNSEFCLHPAGDTPTSCRLFDAIQSLCIPVIVSDTIELPFEGIIDYSEF 380
Query: 261 SIVVATLDIPL---LKKILKGISSEEYLLLQSYVLKVRKHF 298
S+ + D L L S E L+S + KV+ F
Sbjct: 381 SVFASVSDALTPKWLANHLGRFSEREKETLRSRIAKVQSVF 421
>gi|359359122|gb|AEV41028.1| putative exostosin family domain-containing protein [Oryza minuta]
Length = 459
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 66/250 (26%), Positives = 110/250 (44%), Gaps = 36/250 (14%)
Query: 49 VTKDPSKADLFFLPFSIARMRHDRRIG---------TEGIPDFISHY-IFNISQKYPYWN 98
V D AD+ F+PF A + + +G EG D+ + + +P W
Sbjct: 121 VVADWRDADVVFVPF-FATLSAEMELGWGAKGAFRRKEGNEDYCRQREVVDRVTAHPAWR 179
Query: 99 RTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVV----------CSSSYFISGHIAHKDV 148
R+GG DH +V + ++K E+ + VV S+S S I H V
Sbjct: 180 RSGGRDHVFVLTDPVAMWHVRK--EIAPAILLVVDFGGWYKLDSNSASSNFSHMIQHTQV 237
Query: 149 SLPQ--IWPRQEDPPKLGSSK---RNKLAFFAGAVN----SPVREKLLQVWRNDSEIYAH 199
SL + I P P + S+ R L +F GA + VREKL + N+ ++
Sbjct: 238 SLLKDVIVPYTHLLPTMQLSENKERTTLLYFKGAKHRHRGGLVREKLWDLMVNEPDVVME 297
Query: 200 SGRLKTPYADE----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVL 255
G ++ + S+FCLH G + R+ D++ C+PVI+++ +LPF ++
Sbjct: 298 EGYPNATGREQSIKGMRTSEFCLHPAGDTPTSCRLFDAVASLCIPVIVSDEIELPFEGMI 357
Query: 256 NWKSFSIVVA 265
++ F I V+
Sbjct: 358 DYTEFVIFVS 367
>gi|359359070|gb|AEV40977.1| putative exostosin family domain-containing protein [Oryza
punctata]
Length = 459
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 66/250 (26%), Positives = 110/250 (44%), Gaps = 36/250 (14%)
Query: 49 VTKDPSKADLFFLPFSIARMRHDRRIG---------TEGIPDFISHY-IFNISQKYPYWN 98
V D AD+ F+PF A + + +G EG D+ + + +P W
Sbjct: 121 VVADWRDADVVFVPF-FATLSAEMELGWGAKGAFRRKEGNEDYCRQREVVDRVTAHPAWR 179
Query: 99 RTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVV----------CSSSYFISGHIAHKDV 148
R+GG DH +V + ++K E+ + VV S+S S I H V
Sbjct: 180 RSGGRDHVFVLTDPVAMWHVRK--EIAPAILLVVDFGGWYKLDSNSASSNFSHMIQHTQV 237
Query: 149 SLPQ--IWPRQEDPPKLGSSK---RNKLAFFAGAVN----SPVREKLLQVWRNDSEIYAH 199
SL + I P P + S+ R L +F GA + VREKL + N+ ++
Sbjct: 238 SLLKDVIVPYTHLLPTMQLSENKERTTLLYFKGAKHRHRGGLVREKLWDLMVNEPDVVME 297
Query: 200 SGRLKTPYADE----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVL 255
G ++ + S+FCLH G + R+ D++ C+PVI+++ +LPF ++
Sbjct: 298 EGYPNATGREQSIKGMRTSEFCLHPAGDTPTSCRLFDAVASLCIPVIVSDEIELPFEGMI 357
Query: 256 NWKSFSIVVA 265
++ F I V+
Sbjct: 358 DYTEFVIFVS 367
>gi|224145217|ref|XP_002325567.1| predicted protein [Populus trichocarpa]
gi|222862442|gb|EEE99948.1| predicted protein [Populus trichocarpa]
Length = 476
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 68/259 (26%), Positives = 118/259 (45%), Gaps = 36/259 (13%)
Query: 41 KAFMKSHFVTKDPSKADLFFLPFSI---ARMRHDRRIGT----EGIPDFISH-YIFNISQ 92
++F K F D ++AD+ F+PF A M + G+ EG D+ + +I +
Sbjct: 134 RSFAKRVF---DCNEADVVFVPFFATLSAEMELAKGKGSFRRKEGNEDYRRQKQVVDIVR 190
Query: 93 KYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYF----------ISGH 142
W R+GG DH +V + ++ E+ + VV ++ S
Sbjct: 191 NSDAWKRSGGKDHVFVLTDPVAMWHLRA--EIAPAILLVVDFGGWYRLDSKSSNGSSSDM 248
Query: 143 IAHKDVSLPQ--IWPRQEDPPKLGSS---KRNKLAFFAGAV----NSPVREKLLQVWRND 193
I H VSL + I P P+L S KR+ L +F GA VREKL + N+
Sbjct: 249 IQHTQVSLLKDVIVPYTHLLPRLQLSENKKRSTLLYFKGAKHRHRGGIVREKLWDLLVNE 308
Query: 194 SEIYAHSGRLKTPYADE----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDL 249
+ G ++ + S+FCLH G ++ R+ D++ C+PV+++++ +L
Sbjct: 309 PGVIIEEGFPNATGREQSIRGMRSSEFCLHPAGDTPSSCRLFDAIQSLCIPVVVSDNIEL 368
Query: 250 PFADVLNWKSFSIVVATLD 268
PF ++++ F++ VA D
Sbjct: 369 PFEGMVDYTEFAVFVAVDD 387
>gi|302800594|ref|XP_002982054.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
gi|300150070|gb|EFJ16722.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
Length = 453
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 71/297 (23%), Positives = 127/297 (42%), Gaps = 43/297 (14%)
Query: 52 DPSKADLFFLPF-----SIARMRHDR-----RIGTEGIPDFISHYIFNISQKYPYWNRTG 101
DP +AD+ F+PF + A++R+ + R E + I W R+G
Sbjct: 116 DPDQADVVFVPFFAALSAEAQLRNGKGHFRHRKDNEDYER--QKAVMEIVTSSSRWQRSG 173
Query: 102 GADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQ--------- 152
G DH +V + + E+ + + VV +++ + +++S PQ
Sbjct: 174 GRDHVFVLTDPMAMYHFRA--EIANSILLVVDFGGWYMEDAKSSRNLSSPQPIYHTQVSL 231
Query: 153 ----IWPRQEDPPKLGSSK----RNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHS 200
I P P L S+ R L +F GA VR++L V + +
Sbjct: 232 IKDVIVPYTHLLPTLALSQDNAVRTTLLYFKGARYRHRTGLVRDQLWSVLDGEPGVLLEE 291
Query: 201 G----RLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLN 256
G + + S FCLH G ++ R+ D++ C+PVI+++ +LPF +L+
Sbjct: 292 GFPNRTGQVQAVQGMRNSHFCLHPAGDTPSSCRLFDAVASLCIPVIVSDSIELPFEGMLD 351
Query: 257 WKSFSIVVATLDIPL---LKKILKGISSEEYLLLQSYVLKVRKHFQW-HVFPSDYDA 309
+ F+I V+ D L L + L +SS+ ++ + V+ HF++ + FP A
Sbjct: 352 YTQFAIFVSVHDALLPKWLVRHLSSLSSKVRNQMRHNLASVQHHFEYENGFPGGRGA 408
>gi|42566010|ref|NP_191322.3| exostosin family protein [Arabidopsis thaliana]
gi|44917463|gb|AAS49056.1| At3g57630 [Arabidopsis thaliana]
gi|46931284|gb|AAT06446.1| At3g57630 [Arabidopsis thaliana]
gi|332646159|gb|AEE79680.1| exostosin family protein [Arabidopsis thaliana]
Length = 793
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 82/332 (24%), Positives = 137/332 (41%), Gaps = 70/332 (21%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIA------------RMRHDRRIGTEGIP 80
Y S+ F + + + T + +AD FF+P + M++ + +
Sbjct: 391 YGSQMAFYENILATAHRTMNGEEADFFFVPVLDSCIINRADDAPHINMQNHTGLRSSLTL 450
Query: 81 DFISHYIFNISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAWEVKLNAIQV 131
+F +I +KYPYWNR+ G DH + AC++ I S M W N+
Sbjct: 451 EFYKRAYEHIVEKYPYWNRSAGRDHIWFFSWDEGACYAPKEIWNSMMLVHWG-NTNSKHN 509
Query: 132 VCSSSYF------ISGH--------IAHKDVSLPQIWPRQEDPPKLGSS-------KRNK 170
+++YF IS KD+ +P W + DP + + KR
Sbjct: 510 HSTTAYFGDNWDDISDERRGDHPCFDPRKDLVIP-AW-KVPDPYSMRKNYWERPREKRKT 567
Query: 171 LAFFAGAVNSP-------------VREKLLQVWRNDSEIYAHSGR------LKTP----- 206
L +F G + +R+KL + + + G+ + TP
Sbjct: 568 LFYFNGNLGPAYEKGRPEDSYSMGIRQKLAEEFGSSPNKEGKLGKQHAEDVIVTPLRSDN 627
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y ++ S FC G + + R+ DS+ GCVPVII + LP+ ++LN++SF++ V
Sbjct: 628 YHKDIANSIFCGAFPG-DGWSGRMEDSILQGCVPVIIQDGIYLPYENMLNYESFAVRVNE 686
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
DIP L L+G S E V ++ + F
Sbjct: 687 DDIPNLINTLRGFSEAEIQFRLGNVKELWQRF 718
>gi|42572713|ref|NP_974452.1| exostosin family protein [Arabidopsis thaliana]
gi|110740929|dbj|BAE98560.1| hypothetical protein [Arabidopsis thaliana]
gi|332646160|gb|AEE79681.1| exostosin family protein [Arabidopsis thaliana]
Length = 791
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 82/332 (24%), Positives = 137/332 (41%), Gaps = 70/332 (21%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIA------------RMRHDRRIGTEGIP 80
Y S+ F + + + T + +AD FF+P + M++ + +
Sbjct: 389 YGSQMAFYENILATAHRTMNGEEADFFFVPVLDSCIINRADDAPHINMQNHTGLRSSLTL 448
Query: 81 DFISHYIFNISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAWEVKLNAIQV 131
+F +I +KYPYWNR+ G DH + AC++ I S M W N+
Sbjct: 449 EFYKRAYEHIVEKYPYWNRSAGRDHIWFFSWDEGACYAPKEIWNSMMLVHWG-NTNSKHN 507
Query: 132 VCSSSYF------ISGH--------IAHKDVSLPQIWPRQEDPPKLGSS-------KRNK 170
+++YF IS KD+ +P W + DP + + KR
Sbjct: 508 HSTTAYFGDNWDDISDERRGDHPCFDPRKDLVIP-AW-KVPDPYSMRKNYWERPREKRKT 565
Query: 171 LAFFAGAVNSP-------------VREKLLQVWRNDSEIYAHSGR------LKTP----- 206
L +F G + +R+KL + + + G+ + TP
Sbjct: 566 LFYFNGNLGPAYEKGRPEDSYSMGIRQKLAEEFGSSPNKEGKLGKQHAEDVIVTPLRSDN 625
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y ++ S FC G + + R+ DS+ GCVPVII + LP+ ++LN++SF++ V
Sbjct: 626 YHKDIANSIFCGAFPG-DGWSGRMEDSILQGCVPVIIQDGIYLPYENMLNYESFAVRVNE 684
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
DIP L L+G S E V ++ + F
Sbjct: 685 DDIPNLINTLRGFSEAEIQFRLGNVKELWQRF 716
>gi|296085881|emb|CBI31205.3| unnamed protein product [Vitis vinifera]
Length = 455
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 43/175 (24%), Positives = 89/175 (50%), Gaps = 11/175 (6%)
Query: 168 RNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLK----TPYADELLGSKFCLH 219
R L +F GA+ +R++L + +++ +++ G + + + SKFCL+
Sbjct: 257 RPTLLYFQGAIYRKDGGFIRQELFYLLKDEKDVHFAFGNTQGNGINKASQGMHSSKFCLN 316
Query: 220 VKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PLLKKIL 276
+ G ++ R+ D++ CVPVII++ +LP+ DVL++ F I V T D L K++
Sbjct: 317 IAGDTPSSNRLFDAIASHCVPVIISDEIELPYEDVLDYSQFCIFVRTSDALKDKFLIKLI 376
Query: 277 KGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQWSTS 331
+ I +E+ + + +V F++ + DA M+ + + ++R++ S
Sbjct: 377 RSIKKDEWTRMWRRLKEVENFFEFQYPSKEGDAVQMIWQAITRKVPAIRMKLHKS 431
>gi|218195798|gb|EEC78225.1| hypothetical protein OsI_17865 [Oryza sativa Indica Group]
gi|222629749|gb|EEE61881.1| hypothetical protein OsJ_16573 [Oryza sativa Japonica Group]
Length = 452
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 64/244 (26%), Positives = 110/244 (45%), Gaps = 36/244 (14%)
Query: 56 ADLFFLPFSIARMRHDRRIG---------TEGIPDFISHY-IFNISQKYPYWNRTGGADH 105
AD+ F+PF A + + +G EG D+ + + +P W R+GG DH
Sbjct: 121 ADVVFVPF-FATLSAEMELGWGAKGAFRRKEGNEDYRRQREVVDRVTAHPAWRRSGGRDH 179
Query: 106 FYVACHSIGRSAMQKAWEVKLNAIQVV----------CSSSYFISGHIAHKDVSLPQ--I 153
+V + ++K E+ + VV S+S +S I H VSL + I
Sbjct: 180 VFVLTDPVAMWHVRK--EIAPAILLVVDFGGWYKLDSNSASSNVSHMIQHTQVSLLKDVI 237
Query: 154 WPRQEDPPKLGSSK---RNKLAFFAGAVN----SPVREKLLQVWRNDSEIYAHSGRLKTP 206
P P + S+ R L +F GA + VREKL + N+ ++ G
Sbjct: 238 VPYTHLLPTMHLSENKDRPTLLYFKGAKHRHRGGLVREKLWDLMVNEPDVVMEEGYPNAT 297
Query: 207 YADE----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
++ + S+FCLH G + R+ D++ C+PVI+++ +LPF ++++ F+I
Sbjct: 298 GREQSIKGMRTSEFCLHPAGDTPTSCRLFDAVASLCIPVIVSDEIELPFEGMIDYTEFAI 357
Query: 263 VVAT 266
V+
Sbjct: 358 FVSV 361
>gi|357474305|ref|XP_003607437.1| Cation proton exchanger [Medicago truncatula]
gi|355508492|gb|AES89634.1| Cation proton exchanger [Medicago truncatula]
Length = 1198
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 59/226 (26%), Positives = 102/226 (45%), Gaps = 24/226 (10%)
Query: 95 PYWNRTGGADHFYVACH-----SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHI-AHKDV 148
P W R+GG DH H ++ R + W + + + +++ G + KD+
Sbjct: 222 PAWKRSGGRDHILPVHHPWSFKTVRRYVKKAIWLLP----DMDSTGNWYKPGQVYLEKDL 277
Query: 149 SLPQIWPRQ-EDPPKLG--SSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSG 201
LP + D L + KRN L FF G + +R KL+ R + G
Sbjct: 278 ILPYVANVDFCDATCLSEINPKRNTLLFFRGRLKRNAGGKIRSKLVDQLRGADGVVIEEG 337
Query: 202 RL----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNW 257
K + + S FCL+ G ++AR+ D++ GC+PVI+++ +LPF +L++
Sbjct: 338 TSGEGGKEAAQNGMRKSLFCLNPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEGILDY 397
Query: 258 KSFSIVVATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
+ ++ V++ D L K LK I S +Q + K +HF +
Sbjct: 398 RKIALFVSSNDALKPSWLLKYLKDIRSAHIKEMQQNLAKYSRHFLY 443
>gi|225439168|ref|XP_002268382.1| PREDICTED: xylogalacturonan beta-1,3-xylosyltransferase [Vitis
vinifera]
Length = 488
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 72/300 (24%), Positives = 137/300 (45%), Gaps = 26/300 (8%)
Query: 52 DPSKADLFFLPF--SIARMRHDRRIGTE--GIPDFISHYIFNISQKYPYWNRTGGADHFY 107
+ S+AD+ F+PF S++ + G + + + + W R+ G DH
Sbjct: 171 NSSEADVIFVPFFSSLSYNHFGKFKGRQKKNENNLLQDKLVKFLTAQEEWIRSEGRDHII 230
Query: 108 VACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH--KDVSLPQ---IWPRQEDPPK 162
+A H ++M A +KL + S +IA+ KDV P I D
Sbjct: 231 MAHHP---NSMLDA-RMKLWPAIFILSDFGRYPPNIANVGKDVIAPYKHVIKSFINDTSD 286
Query: 163 LGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLK----TPYADELLGS 214
S R L +F GA+ +R++L + +++ +++ G + + + S
Sbjct: 287 FDS--RPTLLYFQGAIYRKDGGFIRQELFYLLKDEKDVHFAFGNTQGNGINKASQGMHSS 344
Query: 215 KFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PL 271
KFCL++ G ++ R+ D++ CVPVII++ +LP+ DVL++ F I V T D
Sbjct: 345 KFCLNIAGDTPSSNRLFDAIASHCVPVIISDEIELPYEDVLDYSQFCIFVRTSDALKDKF 404
Query: 272 LKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVRVQWSTS 331
L K+++ I +E+ + + +V F++ + DA M+ + + ++R++ S
Sbjct: 405 LIKLIRSIKKDEWTRMWRRLKEVENFFEFQYPSKEGDAVQMIWQAITRKVPAIRMKLHKS 464
>gi|255644422|gb|ACU22716.1| unknown [Glycine max]
Length = 189
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 43/161 (26%), Positives = 85/161 (52%), Gaps = 10/161 (6%)
Query: 172 AFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADE----LLGSKFCLHVKGFEVNT 227
AF G +R++L + +++ +++ G + ++ + SKFCL++ G ++
Sbjct: 13 AFLKGGA---IRQELYYLLKDEKDVHFTFGSIGGNGINQASQGMAMSKFCLNIAGDTPSS 69
Query: 228 ARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PLLKKILKGISSEEY 284
R+ D++ CVPVII++ +LPF DVL++ FSI V D L +L+ I+ +E+
Sbjct: 70 NRLFDAIVSHCVPVIISDEIELPFEDVLDYSDFSIFVRASDSMKKGYLLNLLRSITQKEW 129
Query: 285 LLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
+ + ++ HF++ DA M+ ++ + SS+R
Sbjct: 130 SKMWERLKQITHHFEYQYPSQPGDAVNMIWQQVERKISSIR 170
>gi|302792172|ref|XP_002977852.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
gi|300154555|gb|EFJ21190.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
Length = 434
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 66/269 (24%), Positives = 122/269 (45%), Gaps = 24/269 (8%)
Query: 52 DPSKADLFFLPF--SIARMRHDRRIGTEGIP---DFISHYIFNISQKYPYWNRTGGADHF 106
+P +AD+F++PF S++ + + + + P + I + ++ W + G DH
Sbjct: 128 NPQEADVFYVPFFSSLSLVVGNGKSEDDEDPYSDEDIQEELMAWLEEQESWKKNKGRDHV 187
Query: 107 YVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH-KDVSLPQIWPRQEDPPKLGS 165
V C +A+++ + N + ++ F + KDV LP + +
Sbjct: 188 -VICQDP--NALKRLRDRLKNTVLLLSDFERFKPDQASLVKDVVLPYTHRIDSYFNENVT 244
Query: 166 SKRNKLAFFAG----AVNSPVREKLLQVWRNDSEIY------AHSGRLKTPYADELLGSK 215
R+ L FF G +R++L QV + ++ + GR + SK
Sbjct: 245 LDRDTLLFFMGNRYRKEGGKIRDQLFQVLDVEPDMVMKHGTQSREGRRLAKVG--MQTSK 302
Query: 216 FCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT---LDIPLL 272
FCLH G + R+ D++ CVPVI+++ +LPF D L++ F+I V + L+ L
Sbjct: 303 FCLHPAGDTPSACRLFDAIVSVCVPVIVSDDIELPFEDELDYSEFAIFVPSINALEPGYL 362
Query: 273 KKILKGISSEEYLLLQSYVLKVRKHFQWH 301
L+ IS + Q + +VRK+F++
Sbjct: 363 GSYLRSISPDLLKQKQQRLREVRKYFEYE 391
>gi|303271929|ref|XP_003055326.1| glycosyltransferase family 47 protein [Micromonas pusilla CCMP1545]
gi|226463300|gb|EEH60578.1| glycosyltransferase family 47 protein [Micromonas pusilla CCMP1545]
Length = 595
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 63/123 (51%)
Query: 191 RNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLP 250
R D ++ + + Y L SKFCLHV+G V R+ +++ +GCVPVIIA+ YDLP
Sbjct: 451 RTDWDLDSDGATTPSGYMKLLARSKFCLHVRGTRVYAPRLVEAMLFGCVPVIIADGYDLP 510
Query: 251 FADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAF 310
+ L+W +FS+ + + + + + + ++ + +V F +H P DA
Sbjct: 511 LSWFLDWDAFSVRMTEREGVNATRAAEIVDAADWREKHEALRRVVGFFMYHDPPVFGDAL 570
Query: 311 YMV 313
+
Sbjct: 571 WAT 573
>gi|3176669|gb|AAC18793.1| End is cut off [Arabidopsis thaliana]
Length = 440
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 72/314 (22%), Positives = 132/314 (42%), Gaps = 55/314 (17%)
Query: 52 DPSKADLFFLPF--SIARMRHDRRIGTEGIPDF---ISHYIFNISQKYPYWNRTGGADHF 106
DP AD+F++PF S++ H + + T+ +F + + + YWNR+GG DH
Sbjct: 122 DPDLADVFYVPFFSSLSFNTHGKNM-TDPDTEFDRLLQVELMEFLENSKYWNRSGGKDHV 180
Query: 107 YVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWP----RQEDPPK 162
H +++ +V + + VV Y KDV P + +E
Sbjct: 181 IPMTHPNAFRFLRQ--QVNASILIVVDFGRYSKDMARLSKDVVSPYVHVVESLNEEGDDG 238
Query: 163 LGS--SKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIY-----------------AH 199
+G R L +F G +R +L ++ +S+++ +
Sbjct: 239 MGDPFEARTTLLYFRGNTVRKDEGKIRLRLEKLLAGNSDVHFEKSVATTQNIKVSDLEQN 298
Query: 200 SGR-LKTPYADE----------------LLGSKFCLHVKGFEVNTARIADSLYYGCVPVI 242
GR L Y ++ + SKFCLH G ++ R+ D++ C+PVI
Sbjct: 299 RGRYLMLTYQNDSNCSTCVLFCYQSTEGMRSSKFCLHPAGDTPSSCRLFDAIVSHCIPVI 358
Query: 243 IANHYDLPFADVLNWKSFSI---VVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQ 299
I++ +LPF D +++ FS+ + +L+ + L+ E++L + + V HF+
Sbjct: 359 ISDKIELPFEDEIDYSEFSLFFSIKESLEPGYILNNLRQFPKEKWLEMWKRLKNVSHHFE 418
Query: 300 WHVFPSDYDAFYMV 313
+ P DA M+
Sbjct: 419 FQYPPKREDAVNML 432
>gi|293336592|ref|NP_001167834.1| uncharacterized protein LOC100381534 [Zea mays]
gi|223944319|gb|ACN26243.1| unknown [Zea mays]
Length = 241
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 48/171 (28%), Positives = 80/171 (46%), Gaps = 18/171 (10%)
Query: 135 SSYFISGHIAHKDVS----LPQIWPR-QEDPPKLGSSKRNKLAFFAGAV----------- 178
++ + GH ++V P + P + P+ ++R+ AFF G +
Sbjct: 21 QTFGVQGHHVCQEVEHVVIPPHVPPEVAHELPEPEKAQRDIFAFFRGKMEVHPKNISGRF 80
Query: 179 -NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYG 237
+ VR +LLQ + + + Y R Y E+ S FCL G+ + R+ +S+ G
Sbjct: 81 YSKKVRTELLQHYGRNRKFYLKRKRFDN-YRSEMARSLFCLCPLGWAPWSPRLVESVLLG 139
Query: 238 CVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
C+PVIIA+ LPF VL W+ S+ VA DI L +L + + ++Q
Sbjct: 140 CIPVIIADDIRLPFPPVLQWQEISLQVAEKDIASLGMVLDHVVATNLTVIQ 190
>gi|307107123|gb|EFN55367.1| hypothetical protein CHLNCDRAFT_52572 [Chlorella variabilis]
Length = 815
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 78/339 (23%), Positives = 139/339 (41%), Gaps = 75/339 (22%)
Query: 25 VDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLP-----------------FSIAR 67
++E G Y +E+ +A ++S T DP +AD F++P F A
Sbjct: 409 TNWEDSGGYLAETGLHEALLQSKHRTLDPEEADYFYIPVYSSCYMYPIHGFADTPFFHAF 468
Query: 68 MRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIG--------RSAMQ 119
+ R T + + H++ +PYW+R+GG DH + H G R A
Sbjct: 469 HKIPRVHATTNMLIEVYHWL---RAHHPYWDRSGGRDHIILQSHDEGSCWLPAVLRPATM 525
Query: 120 KAWEVKLNAIQVVCSSSYFI------------------SGHIAH-------KDVSLPQIW 154
+++ SS+ +I G + KD+ +P +
Sbjct: 526 LTHWGRMDLGHT--SSTGYIDDVYSRPARHPIYMPEGTEGKLGDFPCYDPAKDLVVPPMT 583
Query: 155 P--RQEDPPKLGSSKRNK--LAFFAGAV---NSP----VREKLLQVWRNDS-----EIYA 198
+ E P +G+ RN+ LAFF G N P +R+ L + R+ +I+
Sbjct: 584 SPLKYELSPLVGAFTRNRTTLAFFKGRTQQNNKPYSRGIRQTLENLCRDKDWWGKFKIWI 643
Query: 199 HSGR---LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVL 255
G + Y+ L S FC + G + + R D++ +GC+PVII + L F ++
Sbjct: 644 GEGNPPDMDRTYSQLLASSTFCFVLPG-DGFSPRFEDAVQHGCLPVIIQDEVHLAFESII 702
Query: 256 NWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKV 294
+++ F + + D+ + +IL I E+ +Q + V
Sbjct: 703 DYRKFVVRIQQKDMERVPEILGAIPPEKVQTMQKALATV 741
>gi|302837496|ref|XP_002950307.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300264312|gb|EFJ48508.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 368
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 86/349 (24%), Positives = 147/349 (42%), Gaps = 55/349 (15%)
Query: 4 SFRVYVY--PHRRNDPFANVLLPVDFEPRGN-YASESYFKKAFMKSHFV-TKDPSKADLF 59
F++Y+Y P P+ L D R N Y + F K F+ V T++P +A+LF
Sbjct: 10 EFKIYMYELPWEIAFPYE---LGEDVHTRDNIYTAYEEFMKYFLVDDMVRTQNPYEANLF 66
Query: 60 FLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQ 119
++P + R G I + I K+P++NRTGG DHF Q
Sbjct: 67 YVPALTYFYATNVRNGQWQAEAVIEY----IRTKWPFYNRTGGRDHFVFFTGDRASCHFQ 122
Query: 120 KAWEVKLNAIQVV---CSSSYFISGHIAHKDVSLPQIWPRQEDPPK---LG--------- 164
+ W ++ + I+VV I+++D + Q PP+ LG
Sbjct: 123 R-W-IQDSVIKVVHFGMQHRNLTWNEISNRDYACIQNKRDLVVPPRTVNLGPLLPSFSTP 180
Query: 165 -----------SSKRNKLAFFAGAV-----NSPVREKLLQVWRNDSEIYAH----SGRL- 203
R L FFAG V + VR + Q+ + + + A GR+
Sbjct: 181 YYKWLVSNQGYDGNRTLLFFFAGGVADGEYSGGVRLAIKQMLSSITHLPADVKFVEGRVG 240
Query: 204 --KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFS 261
+ Y + SKFC+ G R+ +++ GCVPVII ++ F D L ++ FS
Sbjct: 241 GGEDEYFAMIRASKFCIAPYGHGWGN-RLVQAVHLGCVPVIIQDYVYQAFEDFLPYEDFS 299
Query: 262 IVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAF 310
+ + D+P + +L+ S + L+ L + ++++ ++ +YD
Sbjct: 300 VRMRLADVPHMIDLLRSYSEADLARLR---LGLARYYRAFIWEREYDGL 345
>gi|125582866|gb|EAZ23797.1| hypothetical protein OsJ_07509 [Oryza sativa Japonica Group]
Length = 322
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 45/160 (28%), Positives = 76/160 (47%), Gaps = 14/160 (8%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAF--MKSHFVTKDPSKADL 58
M R F+++ Y P A++ D Y+ E F +S F + P A
Sbjct: 102 MERRFKIWTY-REGEPPVAHIGPGTDI-----YSIEGQFMYEMDDPRSRFAARRPDDAHA 155
Query: 59 FFLPFSIARMRH-DRRIGTEG----IPDFISHYIFNISQKYPYWNRTGGADHFYVACHSI 113
F LP S+ + H R+ G + ++ Y+ ++++YPYWNR+ GADH V+CH
Sbjct: 156 FLLPISVCNLVHYVYRLNATGDLAPLRGLVADYVRVVAERYPYWNRSRGADHVIVSCHDW 215
Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQI 153
++ NAI+V+C+++ G KD +LP++
Sbjct: 216 APMVTSAHRQLYGNAIRVLCNANTS-EGFRPRKDATLPEM 254
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 57/124 (45%), Gaps = 22/124 (17%)
Query: 226 NTARIADSLYYGC---VPVIIANHYDL------------------PFADVLNWKSFSIVV 264
N +R AD + C P++ + H L P D S+ V
Sbjct: 200 NRSRGADHVIVSCHDWAPMVTSAHRQLYGNAIRVLCNANTSEGFRPRKDA-TLPEMSVAV 258
Query: 265 ATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSV 324
IP L+ IL+ +S Y +L++ VL+ ++HF H +D +MV++ +WLRR +V
Sbjct: 259 PAARIPELRAILRRVSERRYRVLRARVLQAQRHFVLHRPARRFDMIHMVLHSIWLRRLNV 318
Query: 325 RVQW 328
R+ +
Sbjct: 319 RLPY 322
>gi|302847711|ref|XP_002955389.1| hypothetical protein VOLCADRAFT_96311 [Volvox carteri f.
nagariensis]
gi|300259231|gb|EFJ43460.1| hypothetical protein VOLCADRAFT_96311 [Volvox carteri f.
nagariensis]
Length = 1222
Score = 67.8 bits (164), Expect = 7e-09, Method: Composition-based stats.
Identities = 71/287 (24%), Positives = 111/287 (38%), Gaps = 73/287 (25%)
Query: 52 DPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACH 111
D AD +F+P +RH T +S + I + +P+WNRT G HF +A
Sbjct: 616 DGDTADWYFIPVV---LRHFCDATT------LSAAVSYIREMHPWWNRTQGHRHFVIATG 666
Query: 112 SIGRSAMQKAWE------VKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS 165
+GRS ++ V + SS + + H D+ LP LGS
Sbjct: 667 DMGRSESERGHLTANVTFVSYWGLHAPKLSSGWRASHRNATDIVLPVF---------LGS 717
Query: 166 SKRNKLAFFAGAVNSPVREKLLQVWRN-DSEIYAHSGRL----------------KTP-- 206
K +++ F ++ K R + I+ +GR+ K+P
Sbjct: 718 PKLSRMGIFTSRLHPKFATKAPHELRERNGPIFFFAGRICGDHSKPQVDGVWPNCKSPHN 777
Query: 207 -----------------------------YADELLGSKFCLHVKGFEVNTARIADSLYYG 237
YA LL SKFC G + R ++ G
Sbjct: 778 MGYSGGTRQKIHFHHWNRTGYFIQLGDRHYAKHLLTSKFCFGPTGGG-HGQRQMQAVQAG 836
Query: 238 CVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEY 284
CVPV+I++ F L+W +F + +A DIP + ++L+ IS EEY
Sbjct: 837 CVPVVISDDVLEAFEPFLDWNTFGVRLAEADIPRMHEVLEAISPEEY 883
Score = 66.6 bits (161), Expect = 1e-08, Method: Composition-based stats.
Identities = 62/233 (26%), Positives = 104/233 (44%), Gaps = 40/233 (17%)
Query: 52 DPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACH 111
D AD +F+P + +RH T +S + I + +P+WNRT G HF +A
Sbjct: 139 DGDTADWYFIPVT---LRHTADAYT------LSAAVSYIREMHPWWNRTHGHRHFVIAIG 189
Query: 112 SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKL 171
+GR ++ + ++ F++ H + P+++ + +S RN
Sbjct: 190 DMGRLESERGRQ---------STNVTFVT----HWGLHAPKLFSGWK------ASHRN-- 228
Query: 172 AFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIA 231
A + PV W N + + G YA LL SK+C G + R
Sbjct: 229 ---ATDIVLPVH---FHHW-NRTGYFIQLG--DRHYAKHLLTSKYCFGPTGGG-HGQRQM 278
Query: 232 DSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEY 284
++ GCVPV+I++ F L+W +F + +A DIP + ++L+ IS EEY
Sbjct: 279 QAVQAGCVPVVISDDVLEAFEPFLDWNTFGVRLAEADIPRMHEVLEAISPEEY 331
>gi|302795426|ref|XP_002979476.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
gi|300152724|gb|EFJ19365.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
Length = 434
Score = 67.4 bits (163), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 68/288 (23%), Positives = 125/288 (43%), Gaps = 24/288 (8%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPF--SIARMRHDRRIGTEGIP---DFISHYI 87
Y E + S +P +AD+F++PF S++ + + + + P + +
Sbjct: 109 YDLEQGEDRRLSDSSVRVMNPQEADVFYVPFFSSLSLVVGNGKSEDDEDPYSDEDTQEEL 168
Query: 88 FNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH-K 146
++ W + G DH V C +A+++ + N + ++ F + K
Sbjct: 169 MAWLEEQESWKKNKGRDHV-VICQDP--NALKRLRDRLKNTVLLLSDFERFKPDQASLVK 225
Query: 147 DVSLPQIWPRQEDPPKLGSSKRNKLAFFAG----AVNSPVREKLLQVWRNDSEI------ 196
DV LP + + R+ L FF G +R++L QV + ++
Sbjct: 226 DVVLPYTHRIDSYSNENVTLDRDTLLFFMGNRYRKEGGKIRDQLFQVLDVEPDMVMKHGT 285
Query: 197 YAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLN 256
+ GR + SKFCLH G + R+ D++ CVPVI+++ +LPF D L+
Sbjct: 286 QSREGRRLAKVG--MQTSKFCLHPAGDTPSACRLFDAIVSVCVPVIVSDDIELPFEDELD 343
Query: 257 WKSFSIVVAT---LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
+ F+I V + L+ L L+ IS + Q + +VRK+F++
Sbjct: 344 YSEFAIFVPSINALEPGYLGSYLRSISPDLLKQKQQRLREVRKYFEYE 391
>gi|256088012|ref|XP_002580154.1| exostosin-1 [Schistosoma mansoni]
gi|353230126|emb|CCD76297.1| putative exostosin-1 [Schistosoma mansoni]
Length = 766
Score = 67.4 bits (163), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 89/370 (24%), Positives = 148/370 (40%), Gaps = 78/370 (21%)
Query: 5 FRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLPFS 64
F+VYVYP AN+ L + A S +T DP++A F+P S
Sbjct: 95 FKVYVYPLSSKQ--ANLSL-----------TYQKILTALHNSKLLTSDPNEA-CIFIP-S 139
Query: 65 IARMRHDRRIGTEGIPDFISHYIFNISQKYPYWN--------RTGGADHFYVACHSIGRS 116
+ + DR G +I+H + N+ P+WN + G +H H+ G
Sbjct: 140 LDTLDRDRLSPHFG--QYIAHELVNL----PFWNSLPRRDLDKYAGRNHLIFNLHA-GTW 192
Query: 117 AMQKAWEVKLNAIQVVCSSSYFISGHIAHK-DVSLPQIWPRQEDPPKLGSSKRNKLAF-- 173
E +L Q + + + F + H K D+SLP I + P + GSS+ N+L
Sbjct: 193 PYYYEDEYRLWLGQAMLAKASFSTKHFRPKFDISLPLI--HSQHPLQSGSSQLNQLVSSE 250
Query: 174 ------------------FAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTP---YADE-- 210
+ + S R+ L + I + R T YAD+
Sbjct: 251 HLRGRLDLPYLLSFKGKRYVSGIGSASRDILFHLHNGKDIIMLTTCRHGTDWTRYADKRC 310
Query: 211 ---------------LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVL 255
+ S FCL +G + + R + L GC+PV+++N +LPF++V+
Sbjct: 311 ATDMALYDAYDYWELMYNSTFCLVPRGRRLGSYRFLEVLQAGCIPVMLSNDLELPFSEVI 370
Query: 256 NWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMY 315
+W I LL L+ I+S + + + V+ F WH + S ++ +
Sbjct: 371 DWNRAVIWADERLPLLLPLSLRRITSHQIIQYRQQVM-----FLWHTYLSSIESIVLTTL 425
Query: 316 ELWLRRSSVR 325
E+ R S+R
Sbjct: 426 EIIRDRVSLR 435
>gi|255543228|ref|XP_002512677.1| catalytic, putative [Ricinus communis]
gi|223548638|gb|EEF50129.1| catalytic, putative [Ricinus communis]
Length = 253
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 52/192 (27%), Positives = 90/192 (46%), Gaps = 18/192 (9%)
Query: 140 SGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAV----NSP-----VREKLLQVW 190
G I + PQ ++ PP+ + R+ +F G N P R VW
Sbjct: 34 EGSITIPPYAPPQKMQARQIPPE---TPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVW 90
Query: 191 RN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYD 248
N ++ ++ S T Y +++ + FCL G+ + R+ +++ +GC+PVIIA+
Sbjct: 91 ENFKNNPLFDISTEHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIV 150
Query: 249 LPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY- 307
LPFAD + W+ + VA D+P L IL I + + +L + +L + +FP
Sbjct: 151 LPFADAIPWEEIGVFVAEEDVPNLDTILTSIPT-QVVLRKQRLLANPSMKRAMLFPQPAQ 209
Query: 308 --DAFYMVMYEL 317
DAF+ ++ L
Sbjct: 210 SGDAFHQILNGL 221
>gi|81298858|ref|YP_399066.1| TPR repeat-containing protein [Synechococcus elongatus PCC 7942]
gi|81167739|gb|ABB56079.1| TPR repeat [Synechococcus elongatus PCC 7942]
Length = 788
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 53/191 (27%), Positives = 94/191 (49%), Gaps = 31/191 (16%)
Query: 131 VVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLG-----SSKRNKLAFFAGAVNSPVREK 185
VV S + + + DVS P + P L S R+ L F GA + PVRE+
Sbjct: 114 VVVSHCHTLGNYRVGIDVSFPPM-------PLLDQHCYPSRDRSTLLSFRGANSHPVREQ 166
Query: 186 LLQVWRNDSEIYAHS------GRL------------KTPYADELLGSKFCLHVKGFEVNT 227
L Q EI A G L + Y D + S+F + +G ++ +
Sbjct: 167 L-QRLHQPPEIAAELIQQSYWGTLNYVDEAEGLSAEQQVYTDLIARSRFSVAPRGHDIFS 225
Query: 228 ARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLL 287
R+ + + G +PVI+A+ + LPF+++L+W FS+ VA L ++L+ IS++++ ++
Sbjct: 226 YRLLEVMAGGAIPVILADDWVLPFSELLDWSEFSLSVAEDRCWELPQLLQAISTDQWQVM 285
Query: 288 QSYVLKVRKHF 298
Q ++ +V +H+
Sbjct: 286 QQHLQQVYQHY 296
>gi|56751461|ref|YP_172162.1| hypothetical protein syc1452_c [Synechococcus elongatus PCC 6301]
gi|56686420|dbj|BAD79642.1| hypothetical protein [Synechococcus elongatus PCC 6301]
Length = 788
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 53/191 (27%), Positives = 94/191 (49%), Gaps = 31/191 (16%)
Query: 131 VVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLG-----SSKRNKLAFFAGAVNSPVREK 185
VV S + + + DVS P + P L S R+ L F GA + PVRE+
Sbjct: 114 VVVSHCHTLGNYRVGIDVSFPPM-------PLLDQHCYPSRDRSTLLSFRGANSHPVREQ 166
Query: 186 LLQVWRNDSEIYAHS------GRL------------KTPYADELLGSKFCLHVKGFEVNT 227
L Q EI A G L + Y D + S+F + +G ++ +
Sbjct: 167 L-QRLHQPPEIAAELIQQSYWGTLNYVDEAEGLSAEQQVYTDLIARSRFSVAPRGHDIFS 225
Query: 228 ARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLL 287
R+ + + G +PVI+A+ + LPF+++L+W FS+ VA L ++L+ IS++++ ++
Sbjct: 226 YRLLEVMAGGAIPVILADDWVLPFSELLDWSEFSLSVAEDRCWELPQLLQAISTDQWQVM 285
Query: 288 QSYVLKVRKHF 298
Q ++ +V +H+
Sbjct: 286 QQHLQQVYQHY 296
>gi|414866628|tpg|DAA45185.1| TPA: hypothetical protein ZEAMMB73_313698 [Zea mays]
Length = 588
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 42/140 (30%), Positives = 71/140 (50%), Gaps = 6/140 (4%)
Query: 183 REKLLQVWRN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVP 240
R VW N ++ ++ S Y +++ S FCL G+ + R+ +++ +GC+P
Sbjct: 418 RGACASVWENFKNNPLFDISTDHPPTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIP 477
Query: 241 VIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
VIIA+ LPFAD + W+ + VA D+P L IL I + + +L + +L Q
Sbjct: 478 VIIADDIVLPFADAIPWEEIGVFVAEEDVPKLDSILMSIPT-DVILRKQRLLANPSMKQA 536
Query: 301 HVFPSDY---DAFYMVMYEL 317
+FP DAF+ ++ L
Sbjct: 537 MLFPQPAQAGDAFHQILNGL 556
>gi|302766093|ref|XP_002966467.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
gi|300165887|gb|EFJ32494.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
Length = 453
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 64/265 (24%), Positives = 115/265 (43%), Gaps = 39/265 (14%)
Query: 52 DPSKADLFFLPF-----SIARMRHDR-----RIGTEGIPDFISHYIFNISQKYPYWNRTG 101
DP +AD+ F+PF + A++R+ + R E + I W R+G
Sbjct: 116 DPDQADVVFVPFFAALSAEAQLRNGKGHFRHRKDNEDYER--QRAVMEIVTSSSRWQRSG 173
Query: 102 GADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQ--------- 152
G DH +V + + E+ + + VV +++ + +++S PQ
Sbjct: 174 GRDHVFVLTDPMAMYHFRA--EIANSILLVVDFGGWYMEDAKSSRNLSSPQPIYHTQVSL 231
Query: 153 ----IWPRQEDPPKLGSSK----RNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHS 200
I P P L S+ R+ L +F GA VR++L V + +
Sbjct: 232 IKDVIVPYTHLLPTLALSQDNAVRSTLLYFKGARYRHRTGLVRDQLWSVLDGEPGVLLEE 291
Query: 201 G----RLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLN 256
G + + S FCLH G ++ R+ D++ C+PVI+++ +LPF +L+
Sbjct: 292 GFPNRTGQVQAVQGMRNSHFCLHPAGDTPSSCRLFDAVASLCIPVIVSDSIELPFEGMLD 351
Query: 257 WKSFSIVVATLDIPLLKKILKGISS 281
+ F+I V+ D L K +++ +SS
Sbjct: 352 YTQFAIFVSVHDALLPKWLVRHLSS 376
>gi|414866629|tpg|DAA45186.1| TPA: hypothetical protein ZEAMMB73_313698 [Zea mays]
Length = 206
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 42/140 (30%), Positives = 71/140 (50%), Gaps = 6/140 (4%)
Query: 183 REKLLQVWRN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVP 240
R VW N ++ ++ S Y +++ S FCL G+ + R+ +++ +GC+P
Sbjct: 36 RGACASVWENFKNNPLFDISTDHPPTYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIP 95
Query: 241 VIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
VIIA+ LPFAD + W+ + VA D+P L IL I + + +L + +L Q
Sbjct: 96 VIIADDIVLPFADAIPWEEIGVFVAEEDVPKLDSILMSIPT-DVILRKQRLLANPSMKQA 154
Query: 301 HVFPSDY---DAFYMVMYEL 317
+FP DAF+ ++ L
Sbjct: 155 MLFPQPAQAGDAFHQILNGL 174
>gi|224093354|ref|XP_002334839.1| predicted protein [Populus trichocarpa]
gi|222875141|gb|EEF12272.1| predicted protein [Populus trichocarpa]
Length = 175
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 42/134 (31%), Positives = 68/134 (50%), Gaps = 6/134 (4%)
Query: 189 VWRN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANH 246
VW N D+ ++ S Y +++ + FCL G+ + R+ +++ +GC+PVIIA+
Sbjct: 11 VWENFKDNPLFDISTEHPATYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGCIPVIIADD 70
Query: 247 YDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSD 306
LPFAD + W+ + V D+P L IL I E +L + +L Q +FP
Sbjct: 71 IVLPFADAIPWEEIGVYVDEEDVPNLDTILTSIPP-EVILRKQRLLANPSMKQAMLFPQP 129
Query: 307 Y---DAFYMVMYEL 317
DAF+ V+ L
Sbjct: 130 AQPGDAFHQVLNGL 143
>gi|302848153|ref|XP_002955609.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300259018|gb|EFJ43249.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 572
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 72/320 (22%), Positives = 118/320 (36%), Gaps = 59/320 (18%)
Query: 50 TKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFN-ISQKYPYWNRTGGADHFYV 108
T DP KAD F++P S ++ F+ + N I + +P WN+TGGA H
Sbjct: 179 TADPEKADFFYIPASARDLKRA----------FLLEPLLNYIIEAWPIWNQTGGARHIMP 228
Query: 109 ACHSIGRSAM-QKAWEVKLNAIQVVCSSSYFISGH------------IAHKDVSLPQIWP 155
A +G + K + N + Y H + +D+ +P +
Sbjct: 229 AEGDVGTCELPMKIRNMTANVTWLQFWGMYDFHPHWTQIFHNRVPCMVPGRDIVVPFMAM 288
Query: 156 RQED------PPKLGSSKRNKLA--FFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTP- 206
D P + KRN+ FFAG + ++ L + +SG ++
Sbjct: 289 SSHDRFVIETPLHPRNQKRNRTNTFFFAGGICGSGNKRALPPHCTYYKQVRYSGGVRQAV 348
Query: 207 ------------------YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYD 248
YA + S FCL G I ++Y GC+PV +
Sbjct: 349 YYHYHKRPGWRVVPGTDDYARDYASSIFCLAAAGGGWGKRGIVATMY-GCIPVAATDMLY 407
Query: 249 LPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVF----- 303
F ++W F + V+ IP L +L+ + E+ +Q +H W
Sbjct: 408 EAFEPEMDWNRFGVRVSQAQIPQLGDMLEAFTPEQIRQMQIRTACAAQHLHWSTNLGGIM 467
Query: 304 --PSDYDAFYMVMYELWLRR 321
++DAF +M L +RR
Sbjct: 468 GETGEFDAFNTIMAILRMRR 487
>gi|296084506|emb|CBI25527.3| unnamed protein product [Vitis vinifera]
Length = 275
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 62/112 (55%), Gaps = 6/112 (5%)
Query: 46 SHFVTKDPSKADLFFLPFSIARMRH---DRRIGTEG--IPDFISHYIFNISQKYPYWNRT 100
SHFV P +A +F++P S+ R+ H + G + ++ YI+ +S KYPYWNR+
Sbjct: 96 SHFVAGHPDEAHVFYIPISVTRIAHYIYSPPVDYSGHMLQRLVTDYIYVVSDKYPYWNRS 155
Query: 101 GGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQ 152
GADHF V+CH ++ + I+V+C+++ +D+SLP+
Sbjct: 156 NGADHFLVSCHDWAPEISIVTPDLYKHFIRVLCNANTSERFQPI-RDISLPE 206
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 29/78 (37%), Positives = 49/78 (62%), Gaps = 1/78 (1%)
Query: 250 PFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDA 309
P D+ + FSI + + IP +KKILK + +E YL +Q V +V++HF + YD
Sbjct: 198 PIRDI-SLPEFSINITSDKIPEIKKILKAVPNERYLRMQKRVKQVQRHFVINRPAQPYDM 256
Query: 310 FYMVMYELWLRRSSVRVQ 327
+M+++ +WLRR +VR++
Sbjct: 257 LHMILHSVWLRRLNVRLR 274
>gi|159489064|ref|XP_001702517.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158280539|gb|EDP06296.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 489
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 70/307 (22%), Positives = 116/307 (37%), Gaps = 69/307 (22%)
Query: 41 KAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRT 100
+ + S T D AD +F+P + H + ++ + + + +P+W+RT
Sbjct: 149 QRMLSSGVRTTDGDSADYYFIPLLMRTRTH--------TVNHLAAVVHYVRKHWPWWDRT 200
Query: 101 GGAD-HFYVACHSIGRSAMQKA-------------WEVKLNAIQVVCSSSYFISGHIAHK 146
GG H VA IGR + W + N S +++ H K
Sbjct: 201 GGGHRHLLVAPGDIGRRILTPELLHMTENCTFLTHWGLHRNH-----SGGNWLASHRPGK 255
Query: 147 DVSLPQIWPRQED---PPKLGSSKRNKLA-----FFAGAVNSPVREKLLQVWRNDSEIYA 198
D+ +P + P E P + K+N+ + FFAG + ++ + Y+
Sbjct: 256 DIVVPPLTPPDEPIVYSPLHATLKKNRKSRLGELFFAGRICGDNQKPTDGKCSEKRQDYS 315
Query: 199 HSGRLK----------------TP-YADELLGSKFCLHVKG--------FEVNT------ 227
R + TP YA+ L FCL G +++T
Sbjct: 316 AGTRQQVAHHHWNRPNWTITTHTPAYAEALSTHIFCLSPTGTARLSHARLDLSTLQRPGG 375
Query: 228 ---ARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEY 284
R SL GC+PV + +H PF ++W FS+ V DI L +L G+ + +
Sbjct: 376 GYGRRSVQSLLMGCIPVTVTDHVHQPFEPEVDWARFSVPVREDDIAQLHHVLTGLRASPH 435
Query: 285 LLLQSYV 291
L Q V
Sbjct: 436 TLAQMQV 442
>gi|297851878|ref|XP_002893820.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297339662|gb|EFH70079.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 478
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 61/248 (24%), Positives = 106/248 (42%), Gaps = 36/248 (14%)
Query: 54 SKADLFFLPFSIARMRHDRRIGT--------EGIPDFISH-YIFNISQKYPYWNRTGGAD 104
S AD+ F+PF A + + +G G D+ + + + W R+ G D
Sbjct: 145 SDADVVFVPF-FATLSAEMELGNGKGSFRKKNGNEDYQRQRQVLDFVKNTEAWKRSNGRD 203
Query: 105 HFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQ------------ 152
H +V + +++ E+ L+ + VV +F + SLP+
Sbjct: 204 HVFVLTDPVAMWHVRE--EIALSILLVVDFGGWFRQDSKSSNGTSLPERIEHTQVSVIKD 261
Query: 153 -IWPRQEDPPKLGSS---KRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSG-RL 203
I P P L S +R+ L +F GA +REKL + ++ I G
Sbjct: 262 VIVPYTHLLPSLDLSQNQRRHSLLYFKGAKHRHRGGLIREKLWDLLVDEQGIVMEEGFPN 321
Query: 204 KTPYADELLG---SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSF 260
T ++G S+FCLH G + R+ D++ C+PVI+++ +LPF ++++ F
Sbjct: 322 ATGREQSIIGMRNSEFCLHPAGDTPTSCRLFDAIQSLCIPVIVSDTIELPFEGIIDYSEF 381
Query: 261 SIVVATLD 268
S+ V D
Sbjct: 382 SVFVPVSD 389
>gi|302842849|ref|XP_002952967.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300261678|gb|EFJ45889.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 728
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 81/330 (24%), Positives = 125/330 (37%), Gaps = 63/330 (19%)
Query: 43 FMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRT-G 101
+ + D + AD F +P + RM +D D + + I +PYWN T G
Sbjct: 263 LLSAGLRVADAASADFFLVPIRV-RMAYDS--------DRVIQAVSYIRSVWPYWNATRG 313
Query: 102 GADHFYVACHSIGRSAM-QKAWEVKLNAIQVV---CSSSYFISG----HIAHKDVSLPQI 153
G H +V GR + + A + NA + + + +G H KDV LP +
Sbjct: 314 GGRHIFVHTGDWGRDELSEDAQLLTRNATWLTHWGLARDHEFAGWRQSHRPGKDVVLPLM 373
Query: 154 W----------PRQED-PPKLGSSKRNKLAFFAGAV-NSPVREKLLQVWRNDSEIYA--- 198
PR P +R FFAG + S L + N +
Sbjct: 374 LAASLLSTYQLPRASPLHPAGPRPERTTTLFFAGRICGSRATPSLNGTYPNCPNVLGSED 433
Query: 199 -------------HSGRL--------KTPYADELLGSKFCLHVKGFEVNTARIADSLYYG 237
H GR + P A E+ +KFCL G + L G
Sbjct: 434 AYSAATRQRAYFYHHGRANWKLVTASRAPAA-EMATAKFCLAPSGGGQGKRSVLAPLM-G 491
Query: 238 CVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKH 297
CVPV + + PF L W+ F++ V D+P++ ++L + E+ Q+ + +H
Sbjct: 492 CVPVPVTDGLMQPFEPELRWERFAVGVRERDLPVMHELLDRLMPEQVAGFQAELTCAAQH 551
Query: 298 FQW-----HVFPSD--YDAFYMVMYELWLR 320
W VF D YDAF ++ L +R
Sbjct: 552 LFWSSLYGSVFGEDGAYDAFETLVQVLRMR 581
>gi|302783725|ref|XP_002973635.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
gi|300158673|gb|EFJ25295.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
Length = 452
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 60/232 (25%), Positives = 108/232 (46%), Gaps = 33/232 (14%)
Query: 97 WNRTGGADHFYVACHSIG----RSAMQKA---------WEVKLNAIQVVCSSSYFISGHI 143
W R+ G DH +V + R + A W ++ +A + SS+ +
Sbjct: 177 WRRSNGVDHVFVLADPMAMWHVREQISTAMFLVVDFGGWYLE-DAKNKLNSSTIIQHSQV 235
Query: 144 AH-KDVSLPQ--IWPRQEDPPKLGSSK-RNKLAFFAGAV----NSPVREKLLQVWRNDSE 195
+ KDV +P + P P K+ + R L +F GA + VREKL ++ N+ E
Sbjct: 236 SPIKDVIIPHTHLLP----PLKIADDQHRTVLLYFRGARHRHRSGLVREKLWKILDNEPE 291
Query: 196 IYAHSGRLKTPYADE----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
+ G E + S+FCL G ++ R+ D++ C+PVI+++ LPF
Sbjct: 292 VLLEKGLPDDAGLAEATRGMRSSEFCLTPAGDTPSSCRLYDAIASLCIPVIVSDDIQLPF 351
Query: 252 ADVLNWKSFSIVVATLDIPL---LKKILKGISSEEYLLLQSYVLKVRKHFQW 300
+N++ F + V+T D L + L+ I SEE ++ + +V+++F++
Sbjct: 352 EGFVNYEEFCVFVSTRDATQPGWLVQKLRSIGSEERSTMRQTLSRVQRYFEY 403
>gi|359359170|gb|AEV41075.1| putative exostosin family domain-containing protein [Oryza minuta]
Length = 317
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 54/195 (27%), Positives = 92/195 (47%), Gaps = 25/195 (12%)
Query: 94 YPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVV----------CSSSYFISGHI 143
+P W R+GG DH +V + ++K E+ + + VV S+S +S I
Sbjct: 33 HPAWRRSGGRDHVFVLTDPVAMWHVRK--EIAPSILLVVDFGGWYKLDSNSASSNVSHMI 90
Query: 144 AHKDVSLPQ--IWPRQEDPPKLGSSK---RNKLAFFAGAV----NSPVREKLLQVWRNDS 194
H VSL + I P P + S+ R L +F GA VREKL + N+
Sbjct: 91 QHTQVSLLKDVIVPYTHLLPTMQLSENKDRLTLLYFKGAKHRHRGGLVREKLWDLMVNEP 150
Query: 195 EIYAHSGRLKTPYADE----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLP 250
++ G ++ + S+FCLH G + R+ D++ C+PVI+++ +LP
Sbjct: 151 DVVMEEGYPNATGREQSIKGMRTSEFCLHPAGDTPTSCRLFDAVASLCIPVIVSDEIELP 210
Query: 251 FADVLNWKSFSIVVA 265
F ++++ F+I V+
Sbjct: 211 FEGMIDYTEFTIFVS 225
>gi|255571564|ref|XP_002526728.1| catalytic, putative [Ricinus communis]
gi|223533917|gb|EEF35642.1| catalytic, putative [Ricinus communis]
Length = 728
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 81/332 (24%), Positives = 138/332 (41%), Gaps = 70/332 (21%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPF----SIAR--------MRHDRRIGTEGIP 80
Y ++ ++ + S + T + +AD FF+P I R M+ + +
Sbjct: 328 YGAQMALYESLLASPYRTLNGEEADFFFVPILDSCIITRADDAPHLSMQDHMGLRSSLTL 387
Query: 81 DFISHYIFNISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAWEVKLNAIQV 131
++ +I + YPYWNRT G DH + AC++ I S M W N+
Sbjct: 388 EYYRKAYDHIVEHYPYWNRTSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWG-NTNSKHN 446
Query: 132 VCSSSYFISG--------------HIAHKDVSLPQIWPRQEDPPKLGS-------SKRNK 170
+++Y+ KD+ LP W R D L + +R
Sbjct: 447 HSTTAYWADNWDKISSDRRGRHPCFDPDKDLVLP-AWKRP-DVSALSTKLWARPLERRKT 504
Query: 171 LAFFAGAVNSP-------------VREKLLQVWRNDSEIYAHSGR------LKTP----- 206
L FF G + +R+KL + + + G+ + TP
Sbjct: 505 LFFFNGNLGPAYPNGRPELSYSMGIRQKLAEEFGSSPNKDGKLGKQHAEDVIVTPLRSEN 564
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y ++L S FC + G + + R+ DS+ GC+PVII + LP+ +VLN++SF++ +
Sbjct: 565 YHEDLASSIFCGVLPG-DGWSGRMEDSILQGCIPVIIQDGIFLPYENVLNYESFAVRIRE 623
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
+I L KIL+G + E + V K+ + F
Sbjct: 624 DEISNLLKILRGFNETEKEFKLANVRKIWQRF 655
>gi|159470791|ref|XP_001693540.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158283043|gb|EDP08794.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 510
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 69/297 (23%), Positives = 109/297 (36%), Gaps = 58/297 (19%)
Query: 35 SESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKY 94
+ +F + + DP+ AD F+LP + R ++G + + +
Sbjct: 144 TTRHFLEMLTATGARVGDPAAADWFYLPVRL-------RSSSDG--HVLRRALEYVQAAQ 194
Query: 95 PYWNRTGGADHFYVACHSIGRSAMQKA-----------WEVKLNAIQVVCSSSYFISGHI 143
P++N TGG DHF +A +GR ++ W + + + + S ++ + H
Sbjct: 195 PWFNATGGKDHFVLAVGDMGRLESERGPLSANVTFVSHWGLYRSKAEQL-QSPHWRASHR 253
Query: 144 AHKDVSLP-----------QIWPRQEDP------PKLGSSKRNKLAFFAGAVNSPVREKL 186
D+ LP I + P P + L +FAG V
Sbjct: 254 NATDIVLPVYLTLRKLQKFGILGSRHHPKFATVAPPDVRERNGPLFWFAGRVCQDSSPPR 313
Query: 187 LQVWRNDSEIYAHSG-------------------RLKTPYADELLGSKFCLHVKGFEVNT 227
VW N + +S R YA +L +KFC G
Sbjct: 314 TDVWPNCPKAMGYSAMTRQAVYFHHWNRTGFAVLRGDKQYAKHMLTAKFCFGPMGGGHGQ 373
Query: 228 ARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEY 284
+ +L GCVPV+I + + L+W F + VA DIP L IL I EEY
Sbjct: 374 RQFQAALA-GCVPVVIGDGVLEAWEPYLDWNDFGVRVAEADIPRLHTILGAIGPEEY 429
>gi|259155096|ref|NP_001158790.1| exostosin-1c [Salmo salar]
gi|223647436|gb|ACN10476.1| Exostosin-1c [Salmo salar]
Length = 759
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 81/353 (22%), Positives = 130/353 (36%), Gaps = 75/353 (21%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFL 61
FRVYVYP + G SESY K + +S + T DP +A LF L
Sbjct: 108 GFRVYVYPSEK----------------GERGSESYRKILTSIGESRYYTTDPREACLFVL 151
Query: 62 PFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKA 121
I + D+ G + +P+ + YP WN G +H +S +
Sbjct: 152 --GIDTLDRDQLSG-QFVPNLDDRI-----KGYPLWNE--GQNHLIFNLYSGTWPNYTE- 200
Query: 122 WEVKLNAIQVVCSSSYFISGHIAHK-DVSLPQIWPRQEDPPKLGSSK-----------RN 169
++ N Q + + + + H DVS+P +D P+ G + R
Sbjct: 201 -DLGFNIGQAILAKASLNTEHFRPGFDVSIPLF---SKDHPQKGGERGWLVRNTVPPRRK 256
Query: 170 KLAFFAG-----AVNSPVREKLLQV-----------------WRNDSEIYAHSGRL---K 204
L F G + S R L + W + L +
Sbjct: 257 YLLMFKGKRYLTGIGSDTRNALHHIHNGKDIVSLTTCRHGKDWEKHKDARCDHDNLEYER 316
Query: 205 TPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV 264
Y + L S FCL +G + + R +SL CVPV+++N ++LPF+DV+ W I
Sbjct: 317 FDYQELLHNSTFCLVPRGRRLGSFRFLESLQAACVPVLLSNGWELPFSDVIQWNQAVIEG 376
Query: 265 ATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
+ + + + +E L L+ R W + S D + E+
Sbjct: 377 DERLLLQVPSTVHAVGNERVL-----ALRQRTQMLWDAYFSSVDKIVLTTLEI 424
>gi|291190070|ref|NP_001167078.1| Exostosin-2 [Salmo salar]
gi|223647990|gb|ACN10753.1| Exostosin-2 [Salmo salar]
Length = 724
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/82 (39%), Positives = 49/82 (59%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L S FC+ ++G + A ++D L GCVPVI+A+ Y LPF++VL+WK S+V+
Sbjct: 314 YPQILQESSFCVVLRGARLGQATLSDVLQAGCVPVILADSYILPFSEVLDWKRASVVIPE 373
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+P + ILK I + +Q
Sbjct: 374 EKLPEMYTILKSIPHRQVEEMQ 395
>gi|356544337|ref|XP_003540609.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase F8H-like
[Glycine max]
Length = 494
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 59/225 (26%), Positives = 98/225 (43%), Gaps = 26/225 (11%)
Query: 95 PYWNRTGGADHFYVACH-----SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHI-AHKDV 148
P W R+GG DH H S+ R W + + + +++ G + KD+
Sbjct: 214 PAWKRSGGRDHILPVHHPWSFKSVRRYVKNAIWLLP----DMDSTGNWYKPGQVYLEKDL 269
Query: 149 SLPQIWPRQE--DPPKLGSS--KRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHS 200
LP + P + D L + KR+ L FF G + +R KL +
Sbjct: 270 ILPYV-PNVDLCDAKCLSETNPKRSTLLFFRGRLKRNAGGKIRSKLGAELSGADGVVIEE 328
Query: 201 GRL----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLN 256
G K + S FCL G ++AR+ D++ GC+PVII++ +LPF +L+
Sbjct: 329 GTAGEGGKEAAQRGMRKSLFCLSPAGDTPSSARLFDAIVSGCIPVIISDELELPFEGILD 388
Query: 257 WKSFSIVVATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHF 298
++ ++ +++ D L K LKGI +Q + K +HF
Sbjct: 389 YRKIAVFISSNDAVKPGWLLKYLKGIRPAHIKEMQQNLAKYSRHF 433
>gi|56606002|ref|NP_001008400.1| exostosin-2 [Danio rerio]
gi|55247892|gb|AAV48783.1| exostosin-2 [Danio rerio]
Length = 719
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/82 (39%), Positives = 49/82 (59%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L S FC+ ++G + A ++D L GCVPVI+A+ Y LPF++VL+WK S+V+
Sbjct: 309 YPQILQESSFCVVLRGARLGQATLSDVLQAGCVPVIMADSYILPFSEVLDWKRASVVIPE 368
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+P + ILK I + +Q
Sbjct: 369 EKLPEMYTILKSIPHRQVEEMQ 390
>gi|224051022|ref|XP_002199808.1| PREDICTED: exostosin-2 [Taeniopygia guttata]
Length = 718
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/82 (39%), Positives = 49/82 (59%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L S FC+ ++G + A ++D L GCVPVIIA+ Y LPF++VL+WK S+V+
Sbjct: 308 YPQVLQESTFCVVLRGARLGQAVLSDVLQAGCVPVIIADSYILPFSEVLDWKRASVVIPE 367
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+P + IL+ I + +Q
Sbjct: 368 EKMPEMYSILQSIPQRQIEEMQ 389
>gi|321463593|gb|EFX74608.1| hypothetical protein DAPPUDRAFT_307216 [Daphnia pulex]
Length = 724
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 59/257 (22%), Positives = 108/257 (42%), Gaps = 29/257 (11%)
Query: 41 KAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRT 100
KA KS + T DP++A LF + + RI + + +S PYWN
Sbjct: 143 KAITKSMYFTSDPNEACLFI---PTIDLLNQNRIRPKDVGKALS--------SLPYWNE- 190
Query: 101 GGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDP 160
G +H EV + ++ S + + + DVS+P P +
Sbjct: 191 -GRNHLIFNILPGSMPEYHPFLEVDI-GYAMMTSGGFSSLTYRSGFDVSVPVYSPLAAEL 248
Query: 161 PKLGSSKRNKLAFFAGA-VNSPVREKLLQVWRNDSEIYAHSGRLKTP------------- 206
S RN L + ++ RE + ++ + +P
Sbjct: 249 RTQSSQDRNWLVISSQPYIHEDFREVISEMAAEHPGFLVLNSCGSSPLDTKLRCRDEETY 308
Query: 207 -YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVA 265
+ D L+ + FCL V+G + + +SL GC+PV++++ Y LP+ +V++WKS + +
Sbjct: 309 NFPDILMNATFCLVVRGARLGQPTLMESLAAGCIPVVVSDSYVLPYEEVIDWKSAVLQLY 368
Query: 266 TLDIPLLKKILKGISSE 282
D+ + +L+G+SS+
Sbjct: 369 EDDLSKMMDLLRGVSSD 385
>gi|387015840|gb|AFJ50039.1| Exostosin-2-like [Crotalus adamanteus]
Length = 718
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 50/82 (60%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D+L GC+PV+IA+ Y LPF++VL+WK S+V+
Sbjct: 308 YPQVLQEATFCIVLRGARLGQAVLSDALQAGCIPVVIADSYVLPFSEVLDWKRASVVIPE 367
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+P L IL+ I + +Q
Sbjct: 368 EKMPELYNILQSIPQRQIEEMQ 389
>gi|224144703|ref|XP_002325382.1| predicted protein [Populus trichocarpa]
gi|222862257|gb|EEE99763.1| predicted protein [Populus trichocarpa]
Length = 505
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 56/224 (25%), Positives = 100/224 (44%), Gaps = 24/224 (10%)
Query: 95 PYWNRTGGADHFYVACH-----SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHI-AHKDV 148
P W R+ G +H + H S+ R W + + + +++ G + KD+
Sbjct: 228 PAWKRSEGRNHIFPIHHPWSFKSVRRYVKNAIWLLP----DMDSTGNWYKPGQVFLEKDL 283
Query: 149 SLPQIWPRQEDPPKLGS---SKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSG 201
LP + K S SKR+ L +F G + +R KL+ ++ G
Sbjct: 284 ILPYVPNVNLCDTKCISESESKRSTLLYFRGRLKRNAGGKIRAKLVAELSGAEGVFIEEG 343
Query: 202 RL----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNW 257
K + S FCL G ++AR+ D++ GC+PV++++ +LPF +L++
Sbjct: 344 TAGEGGKAAAQIGMRKSIFCLSPAGDTPSSARLFDAIVSGCIPVVVSDELELPFEGILDY 403
Query: 258 KSFSIVVATLDIPL---LKKILKGISSEEYLLLQSYVLKVRKHF 298
+ ++ V++ D L K LKGIS + +Q + K +HF
Sbjct: 404 RKIALFVSSSDAVQPGWLLKFLKGISLAQIRGMQRNLAKYSRHF 447
>gi|443721868|gb|ELU10993.1| hypothetical protein CAPTEDRAFT_157657 [Capitella teleta]
Length = 712
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 58/100 (58%)
Query: 192 NDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
N ++ G Y L S+FCL ++ + ++D++ GCVPVI+A+ + LPF
Sbjct: 288 NMTDRCTQEGNTALEYPQVLQESEFCLVIRTSRLGQLTLSDAMKAGCVPVIVADSFILPF 347
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYV 291
++V++WK +IVVA ++ + ++++ IS + L ++ V
Sbjct: 348 SEVIDWKRAAIVVAEDNLSTVNEVVRAISRDSLLQMRRQV 387
>gi|125528898|gb|EAY77012.1| hypothetical protein OsI_04969 [Oryza sativa Indica Group]
Length = 416
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 62/245 (25%), Positives = 106/245 (43%), Gaps = 24/245 (9%)
Query: 87 IFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIA-- 144
+ I K YW R+ G DH H ++ +NA ++ + + +A
Sbjct: 150 LMEILWKSKYWQRSAGRDHVIPMHHPNAFRFLRDM----VNASILIVADFGRYTKELASL 205
Query: 145 HKDVSLPQIWP----RQEDPPKLGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEI 196
KDV P + +DPP + R L FF G +R KL ++ + +
Sbjct: 206 RKDVVAPYVHVVDSFLNDDPPDPFDA-RPTLLFFRGRTVRKDEGKIRAKLAKILKGKDGV 264
Query: 197 -----YAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
A +KT + + SKFCLH G ++ R+ D++ CVPVI+++ +LPF
Sbjct: 265 RFEDSLATGEGIKT-STEGMRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSSRIELPF 323
Query: 252 ADVLNWKSFSI---VVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYD 308
D +++ FS+ V L L L+ I +++ + S + V H+++ P D
Sbjct: 324 EDEIDYSEFSLFFSVEEALRPDYLLNQLRQIQKTKWVEIWSKLKNVSHHYEFQNPPRKGD 383
Query: 309 AFYMV 313
A M+
Sbjct: 384 AVNMI 388
>gi|332030617|gb|EGI70305.1| Exostosin-2 [Acromyrmex echinatior]
Length = 594
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 50/83 (60%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y D L S FCL ++G + + + D++ GC+PVIIA+ +PF DV++W ++ +
Sbjct: 187 YPDSLQTSTFCLIIRGARLAQSALLDAMAAGCIPVIIADSLMMPFHDVIDWTKAAVFIRE 246
Query: 267 LDIPLLKKILKGISSEEYLLLQS 289
+DI L ++LK IS + + +Q
Sbjct: 247 VDILLTIQLLKKISPQRIMDMQE 269
>gi|307215037|gb|EFN89864.1| Exostosin-2 [Harpegnathos saltator]
Length = 594
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 30/83 (36%), Positives = 52/83 (62%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y D L + FCL ++G + + + D++ GC+PVIIA+ +PF DV++W +I+V
Sbjct: 187 YPDILHSATFCLIIRGARLAQSVLLDAMAAGCIPVIIADSLIMPFHDVIDWTKAAILVRE 246
Query: 267 LDIPLLKKILKGISSEEYLLLQS 289
+DI L+ ++LK IS + + +Q
Sbjct: 247 VDILLIIQLLKKISHQRIVEMQE 269
>gi|227206252|dbj|BAH57181.1| AT3G03650 [Arabidopsis thaliana]
Length = 244
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 42/161 (26%), Positives = 81/161 (50%), Gaps = 11/161 (6%)
Query: 164 GSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLK----TPYADELLGSK 215
G R L +F GA+ VR++L + + + +++ G ++ + + + SK
Sbjct: 56 GFDGRPILLYFQGAIYRKAGGFVRQELYNLLKEEKDVHFSFGSVRNHGISKAGEGMRSSK 115
Query: 216 FCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI---PLL 272
FCL++ G ++ R+ D++ C+PVII++ +LP+ DVLN+ F + V + D L
Sbjct: 116 FCLNIAGDTPSSNRLFDAIASHCIPVIISDDIELPYEDVLNYNEFCLFVRSSDALKKGFL 175
Query: 273 KKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
+++ I EEY + + +V ++F D + Y V
Sbjct: 176 MGLVRSIGREEYNKMWLRLKEVERYFDLRFPVKDDEGDYAV 216
>gi|449270126|gb|EMC80844.1| Exostosin-2, partial [Columba livia]
Length = 714
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 49/82 (59%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L S FC+ ++G + A ++D L GCVPVIIA+ Y LPF++VL+WK S+V+
Sbjct: 304 YPQVLQESTFCVVLRGARLGQAVLSDVLQAGCVPVIIADSYILPFSEVLDWKRASVVIPE 363
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+P + IL+ + + +Q
Sbjct: 364 EKMPEMYSILQSVPQRQIEEMQ 385
>gi|342320455|gb|EGU12395.1| Proteophosphoglycan ppg4 [Rhodotorula glutinis ATCC 204091]
Length = 1322
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 41/137 (29%), Positives = 64/137 (46%), Gaps = 6/137 (4%)
Query: 168 RNKLAFFAGAVNS--PVREKLLQVWRN----DSEIYAHSGRLKTPYADELLGSKFCLHVK 221
R LAFFAG V + + R +S I Y L SKFCL +
Sbjct: 1149 RKHLAFFAGGVRGFGAIARTKIGCGRTGQDPNSAILYQQFSPGQRYLGTLNASKFCLLPR 1208
Query: 222 GFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISS 281
G R +++Y GC+P I + PF D+L++ FS+ + D +++IL +
Sbjct: 1209 GIPAWMTRTFEAIYAGCIPAFIVDRNLFPFQDILDYSRFSVTIPEADAHRIEEILSAYTP 1268
Query: 282 EEYLLLQSYVLKVRKHF 298
E+ LQ+ ++KVR+ F
Sbjct: 1269 EQLSELQANLVKVREAF 1285
>gi|224090294|ref|XP_002308967.1| predicted protein [Populus trichocarpa]
gi|222854943|gb|EEE92490.1| predicted protein [Populus trichocarpa]
Length = 793
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 77/332 (23%), Positives = 139/332 (41%), Gaps = 70/332 (21%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPF----------SIARMRHDRRIGTEG--IP 80
Y ++ ++ + S + T + +AD FF+P + ++ +G
Sbjct: 392 YGAQMALYESILASPYRTLNGEEADFFFVPVLDSCIITRADDAPHLSMEQHLGLRSSLTL 451
Query: 81 DFISHYIFNISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAWEVKLNAIQV 131
+F +I + YP+WNR+ G DH + AC++ I S M W N+
Sbjct: 452 EFYRKAYDHIVEHYPFWNRSSGRDHLWSFSWDEGACYAPKEIWNSMMVVHWG-NTNSKHN 510
Query: 132 VCSSSYFISG--------------HIAHKDVSLPQIWPRQEDPPKLGS-------SKRNK 170
+++Y+ KD+ LP W R D L + KR
Sbjct: 511 HSTTAYWADNWDKISSDRRGKHPCFDPDKDLVLPA-WKRP-DVNALSTKLWARPLEKRKT 568
Query: 171 LAFFAGAVNSP-------------VREKLLQVWRNDSEIYAHSGR------LKTP----- 206
L +F G + +R+KL + + + + G+ + +P
Sbjct: 569 LFYFNGNLGPAYLNGRPEALYSMGIRQKLAEEFGSTPNKDGNLGKQHAENVIVSPLRSES 628
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y ++L S FC + G + + R+ DS+ GC+PV+I + LP+ +VLN++SF++ +
Sbjct: 629 YHEDLASSVFCGVMPG-DGWSGRMEDSILQGCIPVVIQDGIYLPYENVLNYESFAVRILE 687
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
+IP L KIL+G + E + V K+ + F
Sbjct: 688 DEIPNLIKILQGFNETEIENKLTSVQKIGQRF 719
>gi|260825668|ref|XP_002607788.1| hypothetical protein BRAFLDRAFT_64151 [Branchiostoma floridae]
gi|229293137|gb|EEN63798.1| hypothetical protein BRAFLDRAFT_64151 [Branchiostoma floridae]
Length = 733
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 66/269 (24%), Positives = 110/269 (40%), Gaps = 49/269 (18%)
Query: 46 SHFVTKDPSKADLF-----FLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWN-- 98
S + T DP+KA LF L +I R+R ++ + P WN
Sbjct: 149 SEYNTNDPAKACLFVPSIDLLNQNILRLRETGQV----------------LGRLPRWNHG 192
Query: 99 ----RTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW 154
+ GA+H A EV ++ + + DVS+P
Sbjct: 193 HHDFQLQGANHLLFNMLPGMEPDYNTALEVP-RGKAILAGGGFSSWTYRPGYDVSIPVFN 251
Query: 155 PRQED---PPKLGSSKRNKLAFFAGAV-NSPVREKLLQVWRNDSEIY------------- 197
P D P K R L A V + R +L V R++ ++
Sbjct: 252 PFTADMELPGKPDGYSRPWLIVSAQTVIHMDFRAELDAVARDNDDVLVLDRCNDLPEGIP 311
Query: 198 AHSGRLKTP----YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFAD 253
AH R K Y D L + FC+ ++G + + ++D++ GC+PVI + Y +PF++
Sbjct: 312 AHRRRCKEDRVYNYPDILQEATFCMILRGARMGQSALSDAMMAGCIPVIAIDTYVMPFSE 371
Query: 254 VLNWKSFSIVVATLDIPLLKKILKGISSE 282
VL+WK ++++ D+P + +L+ IS E
Sbjct: 372 VLDWKRAAVILREEDLPDVHNVLRRISQE 400
>gi|302829362|ref|XP_002946248.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300269063|gb|EFJ53243.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 656
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 86/352 (24%), Positives = 150/352 (42%), Gaps = 73/352 (20%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGN------YASESYFKKAFMK--SHFVTKDPSK 55
S ++Y+Y D N++ P FE GN Y S F+ F+ S T++P +
Sbjct: 290 SLKIYMY-----DIPPNIVGPHQFE-DGNGGIHPQYESFLRFQGLFLNDVSGIRTENPHE 343
Query: 56 ADLFFLP----FSIARMRHDRRIGTEGIPDFISHYIFN-ISQKYPYWNRTGGADHFYVAC 110
A+LF++P +S + + G P + N ++ +P++NRTGG DHF +
Sbjct: 344 ANLFYIPAFTYYSSSNL---------GDPTGAAVRAVNWVAATFPFFNRTGGRDHFVLLS 394
Query: 111 HSIGRSAMQKAWEVKLNAIQV----------------VCSSSYFISGHIAHKDVSLP--- 151
G ++ + + N I+V V ++ Y A +DV +P
Sbjct: 395 GDRGACYLKTLPQTE-NLIRVTHFGYERPNITDMGPLVTNTEY--GCFKAGRDVVMPPYV 451
Query: 152 --------QIWPRQEDPPKLGS--SKRNKLAFFAGAV--NSP-----VREKLLQVWRNDS 194
+ + E+P + + ++ L FF+G + N P VR+ L + N S
Sbjct: 452 KSNVAGIQGVRAKLEEPGGAEALLAGKDTLLFFSGDIRHNEPEYSGGVRQALALLLANTS 511
Query: 195 EI-----YAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDL 249
+ Y L SKFCL G R+ ++ + C+PVII +
Sbjct: 512 YPDVVFKGGYMMMGMGEYESLLRRSKFCLAPYGHGWGI-RLIHAITHACIPVIIQDKVRQ 570
Query: 250 PFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
PF D+L++ FS+ V+ ++P L +IL+ + + L + +V + F W
Sbjct: 571 PFEDILHYPDFSVRVSKAELPRLVEILRAVPEPDLLRMIKENSRVYRAFLWQ 622
>gi|297739695|emb|CBI29877.3| unnamed protein product [Vitis vinifera]
Length = 822
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 80/332 (24%), Positives = 139/332 (41%), Gaps = 70/332 (21%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPF--SIARMRHDRR----------IGTEGIP 80
Y ++ ++ + S T D +AD FF+P S +R D + +
Sbjct: 421 YGAQMAIYESILASPHRTLDGEEADFFFVPVLDSCIIVRADDAPHLNMHAHGGLRSSLTL 480
Query: 81 DFISHYIFNISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAWEVKLNAIQV 131
+F +I ++YP+WNR+ G DH + AC++ I S M W N+
Sbjct: 481 EFYKTAYDHIVEQYPFWNRSSGRDHIWFFSWDEGACYAPKEIWDSMMLVHWG-NTNSKHN 539
Query: 132 VCSSSYFISGHIA--------------HKDVSLPQIWPRQEDPPKLGS-------SKRNK 170
+++Y+ + +KD+ LP W R D L S +R
Sbjct: 540 HSTTAYWADNWDSVSSDRRGNHPCFDPYKDLVLPA-WKRP-DVVSLSSKLWSRPREQRKT 597
Query: 171 LAFFAGAVNSP-------------VREKLLQVWRNDSEIYAHSGR------LKTP----- 206
L +F G + +R+K+ + + + G+ + TP
Sbjct: 598 LFYFNGNLGPAYEGGRPETTYSMGIRQKVAEEFGSSPNKEGKLGKQHAEDVIVTPLRSGN 657
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y + L S FC + G + + R DS+ GC+PV+I + LPF ++LN++SF++ +
Sbjct: 658 YHESLASSVFCGVMPG-DGWSGRFEDSILQGCIPVVIQDGIFLPFENMLNYESFAVRIRE 716
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
+IP L KIL+G++ E V K+ + F
Sbjct: 717 DEIPNLIKILRGMNETEIEFKLENVRKIWQRF 748
>gi|225441752|ref|XP_002277596.1| PREDICTED: uncharacterized protein LOC100267584 [Vitis vinifera]
Length = 794
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 80/332 (24%), Positives = 139/332 (41%), Gaps = 70/332 (21%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPF--SIARMRHDRR----------IGTEGIP 80
Y ++ ++ + S T D +AD FF+P S +R D + +
Sbjct: 393 YGAQMAIYESILASPHRTLDGEEADFFFVPVLDSCIIVRADDAPHLNMHAHGGLRSSLTL 452
Query: 81 DFISHYIFNISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAWEVKLNAIQV 131
+F +I ++YP+WNR+ G DH + AC++ I S M W N+
Sbjct: 453 EFYKTAYDHIVEQYPFWNRSSGRDHIWFFSWDEGACYAPKEIWDSMMLVHWG-NTNSKHN 511
Query: 132 VCSSSYFISGHIA--------------HKDVSLPQIWPRQEDPPKLGS-------SKRNK 170
+++Y+ + +KD+ LP W R D L S +R
Sbjct: 512 HSTTAYWADNWDSVSSDRRGNHPCFDPYKDLVLPA-WKR-PDVVSLSSKLWSRPREQRKT 569
Query: 171 LAFFAGAVNSP-------------VREKLLQVWRNDSEIYAHSGR------LKTP----- 206
L +F G + +R+K+ + + + G+ + TP
Sbjct: 570 LFYFNGNLGPAYEGGRPETTYSMGIRQKVAEEFGSSPNKEGKLGKQHAEDVIVTPLRSGN 629
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y + L S FC + G + + R DS+ GC+PV+I + LPF ++LN++SF++ +
Sbjct: 630 YHESLASSVFCGVMPG-DGWSGRFEDSILQGCIPVVIQDGIFLPFENMLNYESFAVRIRE 688
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
+IP L KIL+G++ E V K+ + F
Sbjct: 689 DEIPNLIKILRGMNETEIEFKLENVRKIWQRF 720
>gi|405969796|gb|EKC34747.1| Exostosin-2 [Crassostrea gigas]
Length = 698
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 59/101 (58%), Gaps = 5/101 (4%)
Query: 201 GRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSF 260
G ++ Y D L SKFC+ ++ + ++D+L GC+PVI+A+ Y LPF++VL+WK
Sbjct: 284 GTVEYTYPDILQDSKFCMVLRSARLGHTALSDALRTGCIPVIVADGYVLPFSEVLDWKRA 343
Query: 261 SIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
++V+ ++ + ++LK S E + Y ++ + F W
Sbjct: 344 AVVIREENLKDVVEVLKSYSME-----RIYQMRRQARFFWE 379
>gi|159478515|ref|XP_001697348.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158274506|gb|EDP00288.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 620
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 67/281 (23%), Positives = 106/281 (37%), Gaps = 55/281 (19%)
Query: 43 FMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGG 102
M T +P +AD FFLP G + + + + YWNR G
Sbjct: 187 LMSLGLRTVNPEEADYFFLPGC-----------GRGCNKWDQKFKYIMEHYAKYWNRRNG 235
Query: 103 ADHFYVACHSIGRS---------------AMQKAWEVKLNAIQVVCSSSYFISGHIAHKD 147
DH GR M W + ++ Q F + HI +D
Sbjct: 236 RDHLMTHAGDWGRCEKAWGPDSAPFIANLTMLNHWGITVDRSQET-EHDLFNACHIPDQD 294
Query: 148 VSLP----QIWPRQEDP---PKLGSSKRNK--LAFFAGAV---NSP-------------V 182
+ +P ++P+ E PK + NK LA AG++ NS V
Sbjct: 295 IQVPVLCGDLYPQFEHNVWHPKRRAHPVNKTVLASVAGSICGWNSAEEPPCKNKYYSFGV 354
Query: 183 REKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVI 242
R L R+ H + + S+FC G ++ S+ GC+PVI
Sbjct: 355 RAALWTTLRDKPGF--HIAKRVPVLGQSMAESEFCFAPTGAGHGKRQVV-SVTLGCMPVI 411
Query: 243 IANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEE 283
I++H PF L+W F + +A D+P ++ IL+G + ++
Sbjct: 412 ISDHVAQPFEPFLDWNDFGVWIAEADLPDVEAILRGFTPQQ 452
>gi|242077606|ref|XP_002448739.1| hypothetical protein SORBIDRAFT_06g032380 [Sorghum bicolor]
gi|241939922|gb|EES13067.1| hypothetical protein SORBIDRAFT_06g032380 [Sorghum bicolor]
Length = 456
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 69/288 (23%), Positives = 125/288 (43%), Gaps = 47/288 (16%)
Query: 55 KADLFFLPFSIARMRHDRRIG---------TEGIPDFISHY-IFNISQKYPYWNRTGGAD 104
+AD+ F+PF A + + +G +G D+ + + +P W R+GG D
Sbjct: 124 EADVVFVPF-FATLSAEMELGWGTKGAFRKKDGNEDYRRQREVVDRVTSHPAWRRSGGRD 182
Query: 105 HFYVACHSIGRSAMQKAWEVKLN---AIQVVC-----------SSSYFISGHIAHKDVSL 150
H +V + W V+ AI +V S++ S I H VSL
Sbjct: 183 HVFVMTDPVA------MWHVRAEIAPAILLVVDFGGWYKVDSKSANRNSSRMIQHTQVSL 236
Query: 151 PQ--IWPRQEDPPKLGSSK---RNKLAFFAGAVN----SPVREKLLQVWRNDSEIYAHSG 201
+ I P P L S+ R L +F GA + VREKL + N+ ++ G
Sbjct: 237 LKDVIVPYTHLLPTLLLSENKDRPTLLYFKGAKHRHRGGLVREKLWDLLGNEPDVIMEEG 296
Query: 202 RLKTPYADE----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNW 257
++ + S+FCLH G + R+ D++ C+PVI+++ +LP+ ++++
Sbjct: 297 FPNATGREQSIKGMQTSEFCLHPAGDTPTSCRLFDAIASLCIPVIVSDEVELPYEGMIDY 356
Query: 258 KSFSIVVATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHV 302
FSI V+ + L L+ I ++ + + +V+ F+++
Sbjct: 357 TEFSIFVSVRNAMRPKWLTSYLRNIPKQQKDEFRKNLARVQPIFEYNT 404
>gi|115441893|ref|NP_001045226.1| Os01g0921300 [Oryza sativa Japonica Group]
gi|19386797|dbj|BAB86176.1| OJ1485_B09.5 [Oryza sativa Japonica Group]
gi|57899432|dbj|BAD88370.1| exostosin-like [Oryza sativa Japonica Group]
gi|113534757|dbj|BAF07140.1| Os01g0921300 [Oryza sativa Japonica Group]
gi|125573139|gb|EAZ14654.1| hypothetical protein OsJ_04578 [Oryza sativa Japonica Group]
gi|215741014|dbj|BAG97509.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767487|dbj|BAG99715.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 437
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 73/283 (25%), Positives = 121/283 (42%), Gaps = 26/283 (9%)
Query: 52 DPSKADLFFLPF--SIARMRHDRRIGT-EGIPD-FISHYIFNISQKYPYWNRTGGADHFY 107
DP A+ FF+PF S++ H R + E D + + I K YW R+ G DH
Sbjct: 132 DPDAAEAFFVPFFSSLSFNVHGRNMTDPETEADRLLQVELMEILWKSKYWQRSAGRDHVI 191
Query: 108 VACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIA--HKDVSLPQIWPRQ---EDPPK 162
H ++ +NA ++ + + +A KDV P + D P
Sbjct: 192 PMHHPNAFRFLRDM----VNASILIVADFGRYTKELASLRKDVVAPYVHVVDSFLNDDPP 247
Query: 163 LGSSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEI-----YAHSGRLKTPYADELLG 213
R L FF G +R KL ++ + + A +KT + +
Sbjct: 248 DPFDDRPTLLFFRGRTVRKDEGKIRAKLAKILKGKDGVRFEDSLATGEGIKT-STEGMRS 306
Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI---VVATLDIP 270
SKFCLH G ++ R+ D++ CVPVI+++ +LPF D +++ FS+ V L
Sbjct: 307 SKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSSRIELPFEDEIDYSEFSLFFSVEEALRPD 366
Query: 271 LLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
L L+ I +++ + S + V H+++ P DA M+
Sbjct: 367 YLLNQLRQIQKTKWVEIWSKLKNVSHHYEFQNPPRKGDAVNMI 409
>gi|302787839|ref|XP_002975689.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
gi|300156690|gb|EFJ23318.1| arabinosyltransferase-like protein [Selaginella moellendorffii]
Length = 452
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 59/232 (25%), Positives = 107/232 (46%), Gaps = 33/232 (14%)
Query: 97 WNRTGGADHFYVACHSIG----RSAMQKA---------WEVKLNAIQVVCSSSYFISGHI 143
W R+ G DH +V + R + A W ++ +A + SS+ +
Sbjct: 177 WRRSNGVDHVFVLADPMAMWHVREQISTAVFLVVDFGGWYLE-DAKNKLNSSTIIQHSQV 235
Query: 144 AH-KDVSLPQ--IWPRQEDPPKLGSSK-RNKLAFFAGAV----NSPVREKLLQVWRNDSE 195
+ KDV +P + P P K+ + R L +F GA + VREKL ++ N+ E
Sbjct: 236 SPIKDVIIPHTHLLP----PLKIADDQHRTVLLYFRGARHRHRSGLVREKLWKILDNEPE 291
Query: 196 IYAHSGRLKTPYADE----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
+ G E + S+FCL G ++ R+ D++ C+PVI+++ LPF
Sbjct: 292 VLLEEGLPDDAGLAEATRGMRSSEFCLTPAGDTPSSCRLYDAIASLCIPVIVSDDIQLPF 351
Query: 252 ADVLNWKSFSIVVATLDIPL---LKKILKGISSEEYLLLQSYVLKVRKHFQW 300
+N++ F + V+ D L + L+ I SEE ++ + +V+++F++
Sbjct: 352 EGFVNYEEFCVFVSARDATQPGWLVQKLRSIGSEERSTMRQTLSRVQRYFEY 403
>gi|449453962|ref|XP_004144725.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
IRX7-like [Cucumis sativus]
Length = 518
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 59/224 (26%), Positives = 102/224 (45%), Gaps = 24/224 (10%)
Query: 95 PYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVV----CSSSYFISGHI-AHKDVS 149
P W R+ G DH H ++K + NAI ++ + +++ G + KD+
Sbjct: 238 PAWKRSEGRDHILPVHHPWSFKTVRKFMK---NAIWLLPDMDSTGNWYKPGQVFLEKDLI 294
Query: 150 LPQIWPRQE--DPPKLG--SSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSG 201
LP + P E D L SKR+ L FF G + +R KL ++ G
Sbjct: 295 LPYV-PNVELCDSKCLSYQQSKRSILLFFRGRLKRNAGGKIRAKLGGELSGADDVLIEEG 353
Query: 202 RL----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNW 257
K + S FCL G ++AR+ D++ GC+PVI+++ +LPF +L++
Sbjct: 354 TAGEGGKAAAQTGMRKSIFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEGILDY 413
Query: 258 KSFSIVVATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHF 298
+ ++ V++ D L L+ S+ + LQ + K+ +HF
Sbjct: 414 RKIALFVSSSDALKSGWLLTYLRSFSAADIRRLQQNLAKLSRHF 457
>gi|148229160|ref|NP_001083108.1| uncharacterized protein LOC398750 [Xenopus laevis]
gi|37805422|gb|AAH60367.1| MGC68803 protein [Xenopus laevis]
Length = 718
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 32/82 (39%), Positives = 49/82 (59%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L S FC+ ++G + ++D L GCVPVIIA+ Y LPF++VL+WK S+V+
Sbjct: 308 YPQILQESTFCIVLRGARLGQGVLSDVLQAGCVPVIIADSYVLPFSEVLDWKRASVVIPE 367
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+GIS + +Q
Sbjct: 368 EKMFEMYSILQGISQRQVEEMQ 389
>gi|302835858|ref|XP_002949490.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300265317|gb|EFJ49509.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 499
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 76/341 (22%), Positives = 118/341 (34%), Gaps = 72/341 (21%)
Query: 38 YFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPD--FISHYIFNISQKYP 95
YF + + S D AD +F+P + D F+ + I +P
Sbjct: 95 YFWQRLLGSGARVADGDLADWYFIP-----------VRQRSFSDSWFLREALSYIRTHHP 143
Query: 96 YWNRTGGADHFYVACHSIGRSAMQK-------------------------AWEVKLNAIQ 130
+WNRT G H + G + K W +
Sbjct: 144 WWNRTEGHRHMVLHTGDWGLGEVAKDVRQMSLNVTWLTHWGLSTDRPNIQRWTRAFRPER 203
Query: 131 VVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNK-LAFFAGAV----------- 178
V Y GH H ++ + P + + RN+ L FFAG +
Sbjct: 204 DVVIPVYISPGHFVHFGINRSPLHPVTAASRRTAARPRNESLLFFAGRICHDAKRPNPDT 263
Query: 179 ------------NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVN 226
VREK N S H R + Y+ + S FCL G +
Sbjct: 264 FPACGDDTAEWYGGGVREKFFVSHWNRSGF--HVVRSEPRYSHYMSRSVFCLAPPG-AGH 320
Query: 227 TARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLL 286
R +L+ GCVPV +A+ PF L+W+ + + +A DIP ++L G++ E+
Sbjct: 321 GQRQIQALFMGCVPVTVADGVYEPFEPALSWEEWGLRIAEQDIPRAHELLGGLTREQLAE 380
Query: 287 LQSYVLKVRKHFQWHVFP-------SDYDAFYMVMYELWLR 320
QS + +H + YDAF + L +R
Sbjct: 381 KQSRMHCAAQHMLYSTITGAVLGEDGRYDAFETTLEVLRVR 421
>gi|321261385|ref|XP_003195412.1| hypothetical protein CGB_G5240C [Cryptococcus gattii WM276]
gi|317461885|gb|ADV23625.1| Conserved hypothetical protein [Cryptococcus gattii WM276]
Length = 1130
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 44/171 (25%), Positives = 84/171 (49%), Gaps = 17/171 (9%)
Query: 145 HKDVSLPQIWPR----QEDPPKLGSSK----RNKLAFFAGAVNSPVREKLLQVWRN---- 192
H+DV +P R +E P + + K R+ L ++G + + + +++ N
Sbjct: 921 HQDVVIPARTCRSNTLRETFPNVEAIKPMRERSNLLMWSGTYSGTGKSERIRLTCNRGGA 980
Query: 193 -DSEIYAHSGR----LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHY 247
D E+ G+ + Y +L ++FC +G + + +D++Y GC+PV I+
Sbjct: 981 GDRELIKGGGKQSNFASSDYMKDLNNARFCAQPRGIAGWSPQTSDAIYAGCIPVFISEGT 1040
Query: 248 DLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
PFAD L+W S+ VA ++ ++K+L I + LQ+ ++ VR+ F
Sbjct: 1041 HYPFADFLDWSKLSVRVAPTELDKIEKVLAAIPLSKVEELQANLVCVREAF 1091
>gi|326920358|ref|XP_003206441.1| PREDICTED: exostosin-2-like [Meleagris gallopavo]
Length = 718
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 49/82 (59%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPVIIA+ Y LPF++VL+WK S+V+
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVIIADSYILPFSEVLDWKRASVVIPE 367
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+P + IL+ + + +Q
Sbjct: 368 EKMPEMYSILQSVPQRQIEEMQ 389
>gi|159489402|ref|XP_001702686.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158280708|gb|EDP06465.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 615
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 77/337 (22%), Positives = 128/337 (37%), Gaps = 65/337 (19%)
Query: 39 FKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWN 98
F + M + T + +AD FF+P + + + +I YI N +PYW+
Sbjct: 207 FWQRLMSAGLRTVNGDEADYFFIPLNTRTLMAPEQAA------WILPYIRNT---WPYWD 257
Query: 99 RTGGADHFYVACHSIGRSAMQKAWEVKLN------------AIQVVCSSSYFISGHIAHK 146
R G H + +G + K+N + + H K
Sbjct: 258 RDNGHRHLIIHTGDMGLHELPLGLRRKMNETLSNITWLTHWGLHTYHPIGTWFPAHRPGK 317
Query: 147 DVSLP--------QIWPRQ----EDPPKLGSS-KRNKLAFFAGAV--------------- 178
D+ +P Q+ P E K G R + FFAG +
Sbjct: 318 DIVIPVMITTPGFQLSPLNPAVAEKAAKRGRPYTREQTFFFAGRICGDRKPPDPLTHECA 377
Query: 179 ------NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIAD 232
++ VR+++ N + +G K Y E+ KFCL G ++
Sbjct: 378 PKRTDYSASVRQRVYFHHHNRTGFKVLTGTSK--YMQEITSHKFCLAPTGGGHGKRQVLV 435
Query: 233 SLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVL 292
+L GC+PV I + PF L W FS+ VA DIP L ++L+ + E+ +QS +
Sbjct: 436 ALM-GCIPVTITDGVYQPFEPELPWADFSVPVAEDDIPRLHEVLEALPPEQVEQMQSRLH 494
Query: 293 KVRKHFQWH-----VFPSD--YDAFYMVMYELWLRRS 322
+H + + D YDAF ++ L +R++
Sbjct: 495 CAAQHMFYSSSLGAIIGEDGRYDAFETMIEILRVRKA 531
>gi|289166872|gb|ADC84486.1| glycosyltransferase family 47A [Salix miyabeana]
Length = 215
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 48/162 (29%), Positives = 75/162 (46%), Gaps = 15/162 (9%)
Query: 142 HIAHKD--VSLPQIWPRQEDPPKLGSSK--RNKLAFFAGAV----NSP-----VREKLLQ 188
H+ KD +++P P Q+ L K R+ +F G N P R
Sbjct: 41 HVCLKDGSITIPPYAPPQKMQTHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAA 100
Query: 189 VWRN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANH 246
VW N D+ ++ S Y +++ + FCL G+ + R+ +++ +GC+PVIIA+
Sbjct: 101 VWENFKDNPLFDISTEHPATYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGCIPVIIADD 160
Query: 247 YDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
LPFAD + W+ + V D+P L IL I E L Q
Sbjct: 161 IVLPFADAIPWEDIGVYVDEEDVPNLDTILTSIPPEVILRKQ 202
>gi|449520748|ref|XP_004167395.1| PREDICTED: probable glucuronoxylan glucuronosyltransferase
IRX7-like [Cucumis sativus]
Length = 517
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 59/224 (26%), Positives = 102/224 (45%), Gaps = 24/224 (10%)
Query: 95 PYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVV----CSSSYFISGHI-AHKDVS 149
P W R+ G DH H ++K + NAI ++ + +++ G + KD+
Sbjct: 238 PAWKRSEGRDHILPVHHPWSFKTVRKFMK---NAIWLLPDMDSTGNWYKPGQVFLEKDLI 294
Query: 150 LPQIWPRQE--DPPKLG--SSKRNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSG 201
LP + P E D L SKR+ L FF G + +R KL ++ G
Sbjct: 295 LPYV-PNVELCDRKCLSYQQSKRSILLFFRGRLKRNAGGKIRAKLGGELSGADDVLIEEG 353
Query: 202 RL----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNW 257
K + S FCL G ++AR+ D++ GC+PVI+++ +LPF +L++
Sbjct: 354 TAGEGGKAAAQTGMRKSIFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELELPFEGILDY 413
Query: 258 KSFSIVVATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHF 298
+ ++ V++ D L L+ S+ + LQ + K+ +HF
Sbjct: 414 RKIALFVSSSDALKSGWLLTYLRSFSAADIRRLQQNLAKLSRHF 457
>gi|71897059|ref|NP_001026520.1| exostosin-2 [Gallus gallus]
gi|60098757|emb|CAH65209.1| hypothetical protein RCJMB04_7p22 [Gallus gallus]
Length = 567
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 49/82 (59%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPVIIA+ Y LPF++VL+WK S+V+
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVIIADSYILPFSEVLDWKRASVVIPE 367
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+P + IL+ + + +Q
Sbjct: 368 DKMPEMYSILQSVPQRQIEEMQ 389
>gi|405121937|gb|AFR96705.1| hypothetical protein CNAG_03480 [Cryptococcus neoformans var. grubii
H99]
Length = 1125
Score = 63.9 bits (154), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 42/151 (27%), Positives = 72/151 (47%), Gaps = 13/151 (8%)
Query: 161 PKLGS----SKRNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR----LKTPY 207
P +GS S+R+ L +AG + +R + D E+ G+ Y
Sbjct: 936 PNVGSIKPMSERSNLLMWAGTHWVTGKSERIRLTCDRGGAGDRELIKGGGKQSNFANGDY 995
Query: 208 ADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATL 267
++L ++FC +G + + D++Y GC+PV IA PFA L+W S+ VA
Sbjct: 996 INDLNNARFCPQPRGITGWSPQTNDAIYAGCIPVFIAEGTHYPFAGFLDWSKLSVRVAPT 1055
Query: 268 DIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
++ ++KIL I + LQ+ ++ VR+ F
Sbjct: 1056 ELDKIEKILAAIPLSKVEELQANLVSVREAF 1086
>gi|296088686|emb|CBI38136.3| unnamed protein product [Vitis vinifera]
Length = 291
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 56/199 (28%), Positives = 89/199 (44%), Gaps = 33/199 (16%)
Query: 97 WNRTGGADHFYVACHSIGRSAMQKAWEVKLN---AIQVVC-----------SSSYFISGH 142
W R+GG DH +V + W VK AI +V +S+ +S
Sbjct: 10 WKRSGGRDHVFVLTDPVA------MWHVKAEIAPAILLVVDFGGWYKLDSKASNNSLSEM 63
Query: 143 IAHKDVSLPQ--IWPRQEDPPKLGSSK---RNKLAFFAGAV----NSPVREKLLQVWRND 193
I H VSL + I P P+L S+ R L +F GA VREKL + +
Sbjct: 64 IQHTQVSLLKDVIVPYTHLLPRLHLSENQIRQTLLYFKGAKHRHRGGLVREKLWDLLVYE 123
Query: 194 SEIYAHSGRLKTPYADE----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDL 249
+ G ++ + S+FCLH G + R+ D++ C+PVI++++ +L
Sbjct: 124 QGVIMEEGFPNATGREQSIKGMRTSEFCLHPAGDTPTSCRLFDAIQSLCIPVIVSDNIEL 183
Query: 250 PFADVLNWKSFSIVVATLD 268
PF ++++ FS+ VA D
Sbjct: 184 PFEGMVDYSEFSVFVAVRD 202
>gi|289166870|gb|ADC84485.1| glycosyltransferase family 47A [Salix sachalinensis]
Length = 215
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 47/160 (29%), Positives = 73/160 (45%), Gaps = 14/160 (8%)
Query: 140 SGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAV----NSP-----VREKLLQVW 190
G IA + PQ + PP + R+ +F G N P R VW
Sbjct: 46 EGSIAIPPFAPPQKMQAHQIPP---DTPRSIFVYFRGLFYDVNNDPEGGYYARGARAAVW 102
Query: 191 RN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYD 248
N ++ ++ S T Y +++ + FCL G+ + R+ +++ +GC+ VIIA+
Sbjct: 103 ENFKNNPLFDISTDHPTTYYEDMQRAIFCLCPLGWAPWSPRLVEAVVFGCISVIIADDIV 162
Query: 249 LPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
LPFAD + W+ + VA D+P L IL I E L Q
Sbjct: 163 LPFADAIPWEEIGVFVAEEDVPNLDTILTTIPPEVILRKQ 202
>gi|324508547|gb|ADY43609.1| Exostosin-1b, partial [Ascaris suum]
Length = 628
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 74/323 (22%), Positives = 130/323 (40%), Gaps = 70/323 (21%)
Query: 5 FRVYVYPHRRNDP----FANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
FRVYVYP +AN+L K +S + T DPSKA LF
Sbjct: 82 FRVYVYPDNNESTVSIVYANIL------------------KVLRESVYYTDDPSKACLFV 123
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYP-YWNRTGGADHFYV----------A 109
L SI + DR+ +++ H I + WN GG +H +
Sbjct: 124 L--SIDTIDRDRKS-----ENYVKHVDEQIEALHSDLWN--GGRNHIIFNLYHGTYPDYS 174
Query: 110 CHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSL----PQIWPRQEDPPKLGS 165
H +G + A + +A + ++ +S + HK+ SL +WP + L S
Sbjct: 175 DHDLGFD-VGYAMVARASANAQIFRPNFDLSFPLFHKEHSLRTVVESVWPLKLKDEYLVS 233
Query: 166 SKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKT---PYADE------------ 210
K + + + S R+ L + S + + + Y DE
Sbjct: 234 FKGKRYVY---GIGSETRDSLYHLHNAHSVVMVTTCKHNNDWKKYEDERCDEDNIEYERW 290
Query: 211 -----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVA 265
+ S FCL +G + + R +SL GC+PVI+++ ++LPF+++++W +++
Sbjct: 291 DYETTMSNSTFCLTPRGRRLGSFRFLESLRLGCIPVILSDDWELPFSEIIDWSQAAVIAH 350
Query: 266 TLDIPLLKKILKGISSEEYLLLQ 288
+ + +L I E L ++
Sbjct: 351 EDTVLTISDVLNAIPLERVLYMK 373
>gi|224128538|ref|XP_002320357.1| predicted protein [Populus trichocarpa]
gi|222861130|gb|EEE98672.1| predicted protein [Populus trichocarpa]
Length = 137
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 65/118 (55%), Gaps = 3/118 (2%)
Query: 211 LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI- 269
+ SKFCL++ G ++ R+ D++ CVPVII++ +LPF DVL++ F + V D
Sbjct: 1 MASSKFCLNIAGDTPSSNRLFDAIASQCVPVIISDGIELPFEDVLDYSEFGVFVRASDAV 60
Query: 270 --PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSSVR 325
L +L+GI +++ +L + ++ F++ DA MV + ++SSV+
Sbjct: 61 KKGYLLYLLRGIKKDQWTILWERLKEIAPQFEYRYPSQPGDAVDMVWEAVLRKKSSVQ 118
>gi|356576131|ref|XP_003556187.1| PREDICTED: probable glucuronosyltransferase Os01g0926400-like
[Glycine max]
Length = 481
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 62/254 (24%), Positives = 107/254 (42%), Gaps = 44/254 (17%)
Query: 52 DPSKADLFFLPFSIARMRHDRRIGTEGIPDFISH---------YIFNISQKYPYWNRTGG 102
DP AD+ F+PF A + + ++G H + + + WNR+GG
Sbjct: 146 DPLLADVVFVPF-FATLSAEMQLGANKGAFRKKHDNDDYKRQRQVMDAVKNTHAWNRSGG 204
Query: 103 ADHFYVACHSIGRSAMQKAWEVKLNAIQVVC---------------SSSYFISGHIAHKD 147
DH +V + W VK V S+ S I H
Sbjct: 205 RDHVFVLTDPVA------MWHVKDEIAPAVLLVVDFGGWYRLDSRGGSNCSESDVIPHTQ 258
Query: 148 VSLPQ--IWPRQEDPPKLGSS---KRNKLAFFAGAVN----SPVREKLLQVWRNDSEIYA 198
VS+ + I P P+L S +R++L +F GA + +REKL + ++ +
Sbjct: 259 VSVIKDVIVPYTHLLPRLDLSDNKERHQLLYFKGAKHRHRGGIIREKLWDLLVSEPGVIM 318
Query: 199 HSGRLKTPYADE----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
G ++ + S+FCLH G + R+ D++ C+PVI++++ +LPF +
Sbjct: 319 EEGFPNATGREQSIKGMQTSEFCLHPAGDTPTSCRLFDAIQSLCIPVIVSDNIELPFEGM 378
Query: 255 LNWKSFSIVVATLD 268
+++ FS+ A D
Sbjct: 379 VDYAEFSVFAAVSD 392
>gi|432908332|ref|XP_004077815.1| PREDICTED: exostosin-1b-like [Oryzias latipes]
Length = 741
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 77/357 (21%), Positives = 136/357 (38%), Gaps = 83/357 (23%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
F+VYVYP ++ G SESY + S F T DP+KA LF L
Sbjct: 108 GFKVYVYPQQK----------------GEKLSESYQNILSSIEGSRFHTPDPAKACLFVL 151
Query: 62 PFSIARMRHDRRIGTEGIPDFISHYIFNIS---QKYPYWNRTGGADHFYVACHSIGRSAM 118
S+ + D+ Y+ N+ Q P WN G +H +S
Sbjct: 152 --SLDTLDRDQ---------LSPQYVHNLKAKIQSLPLWNE--GRNHIIFNLYSGTWPDY 198
Query: 119 QKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK----------- 167
+ + ++ +S + DVS+P +D P+ G +
Sbjct: 199 TEDLGFDI-GFAMLAKASISTENFRPNFDVSVPLF---SKDHPRTGGERGFLKHNTIPPY 254
Query: 168 RNKLAFFAG-----AVNSPVREKLLQV-----------------WRNDSEIYAHSGRL-- 203
R + F G + S R L V W+ ++ +
Sbjct: 255 RKYMLVFKGKRYLTGIGSDTRNALYHVHNSEDVVLLTTCKHGKDWQKHKDVRCDKDNVEY 314
Query: 204 -KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
K Y + L S FCL +G + + R ++L CVPV+++N ++LPF++++NW + ++
Sbjct: 315 DKYDYREMLYNSTFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEIINWNTAAV 374
Query: 263 VVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQ--WHVFPSDYDAFYMVMYEL 317
+ +++L I S + Q +L +R+ Q W + + + + E+
Sbjct: 375 IGD-------ERLLLQIPSTVRSIHQDKILSLRQQTQFLWEAYFNSLEKIVLTTLEI 424
>gi|159471277|ref|XP_001693783.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158283286|gb|EDP09037.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 611
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 69/296 (23%), Positives = 110/296 (37%), Gaps = 60/296 (20%)
Query: 38 YFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYW 97
+F + S D +KAD +F+P + R D + + I ++ +P++
Sbjct: 183 HFLEMLTASGARVADGAKADWYFIPVRL-RSSSDAYV--------LQRAIHHLRHAHPWF 233
Query: 98 NRTGGADHFYVACHSIGRSAMQKA-----------WEVKLNAIQVVCSSSYFISGHIAHK 146
N TGG HF +A +GR ++ W + + +SS + + H
Sbjct: 234 NATGGGRHFVIAVGDMGRLESERGPLSANVTFVSHWGLASSKKAERLNSSPWRASHRNAT 293
Query: 147 DVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAVNSPVREK----------LLQ-------- 188
D+ LP ++ K G ++ FA +RE+ + Q
Sbjct: 294 DIVLP-VYISLRKLEKYGITRSRHHPKFATMAPPEIRERNGPLFWFAGRICQDRSKPATD 352
Query: 189 -VWRNDSEIYAHSGRLKTP-------------------YADELLGSKFCLHVKGFEVNTA 228
VW N +S + Y+ LL SKFC G
Sbjct: 353 GVWPNCPGAMGYSAMTRQAVHFHHWDRPGYFVAPGDKNYSRHLLTSKFCFGAMGGGHGQR 412
Query: 229 RIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEY 284
++ +L GCVPV+I + + L+W F + VA DIP L IL I EEY
Sbjct: 413 QLQAALA-GCVPVVIGDGVLEAWEPYLDWNDFGVRVAEADIPRLHTILGAIGPEEY 467
>gi|291384884|ref|XP_002708900.1| PREDICTED: exostosin 2 [Oryctolagus cuniculus]
Length = 765
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 51/163 (31%), Positives = 77/163 (47%), Gaps = 21/163 (12%)
Query: 147 DVSLPQIWP--RQEDPPKLGSSKRNKLAFFAG-AVNSPVREKL--LQVWRNDSEIYAH-- 199
DVS+P P + D P+ G R + AV+ RE+L LQ +S + H
Sbjct: 274 DVSIPVYSPLSAEVDLPEKGPGPRRYFLLSSQMAVHPEFREELEALQAKHGESVLVLHQC 333
Query: 200 ----SGRLKTP----------YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIAN 245
G L Y L + FC+ ++G + A ++D L GCVPV+IA+
Sbjct: 334 TNLSEGALSVRKRCHKHQVFHYPQVLQEATFCVVLRGARLGQAALSDVLQAGCVPVVIAD 393
Query: 246 HYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQ 288
Y LPF++VL+WK S+VV + + KIL+ I + +Q
Sbjct: 394 SYILPFSEVLDWKRASVVVPEEKMADVYKILQSIPQRQIQEMQ 436
>gi|159470093|ref|XP_001693194.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158277452|gb|EDP03220.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 673
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 66/303 (21%), Positives = 118/303 (38%), Gaps = 29/303 (9%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFF-------LPFSIARMRHDRRIGTEGIP----- 80
YA+++ + + S T DP +AD F+ LPF I G P
Sbjct: 297 YAADTLLHELLLVSPHRTFDPEEADFFYVPHQASCLPFPIGNWADWPWFKGPGGPRIRQM 356
Query: 81 -DFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFI 139
+ I I Q YP+W R GG DH + H G W + + + +
Sbjct: 357 LNMIMETRDWIDQHYPFWKRRGGRDHIWTFTHDEG-----ACWAPNVLNTSIWLTHWGRM 411
Query: 140 SGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAH 199
+P + R D + ++ P ++ ++ ++ A
Sbjct: 412 DPDHTSNTAFVPDRYDR--DFKSAYQPEGYRVHMQGHPCYRPGQDLVIPAFKRPDHYRAS 469
Query: 200 SGRLKTPYADELLG--------SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPF 251
T EL G S FCL G + +AR+ D++ +GC+PVII ++ + F
Sbjct: 470 PLAAATSKPRELPGDYSDMLSRSLFCLVAAG-DGWSARLEDAVLHGCIPVIIIDNVHVVF 528
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFY 311
+L+ SFS+ +A D+ + +IL+ I + Q+++ V +++ P +
Sbjct: 529 ESILDIDSFSVRIAEADVDRILEILQAIPERKIRFKQAHLGHVFHRYRYAALPGLENNLR 588
Query: 312 MVM 314
+M
Sbjct: 589 QLM 591
>gi|297721259|ref|NP_001172992.1| Os02g0520750 [Oryza sativa Japonica Group]
gi|255670949|dbj|BAH91721.1| Os02g0520750 [Oryza sativa Japonica Group]
Length = 213
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 39/134 (29%), Positives = 68/134 (50%), Gaps = 6/134 (4%)
Query: 189 VWRN--DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANH 246
+W N ++ ++ S Y +++ S FCL G+ + R+ +++ +GC+PVIIA+
Sbjct: 48 LWENFKNNPLFDISTEHPATYYEDMQRSVFCLCPLGWAPWSPRLVEAVVFGCIPVIIADD 107
Query: 247 YDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSD 306
LPFAD + W + V D+P L IL I ++ +L + +L Q +FP
Sbjct: 108 IVLPFADAIPWDEIGVFVDEEDVPRLDSILTSIPIDD-ILRKQRLLANPSMKQAMLFPQP 166
Query: 307 Y---DAFYMVMYEL 317
DAF+ ++ L
Sbjct: 167 AQPRDAFHQILNGL 180
>gi|255076909|ref|XP_002502118.1| glycosyltransferase family 47 protein [Micromonas sp. RCC299]
gi|226517383|gb|ACO63376.1| glycosyltransferase family 47 protein [Micromonas sp. RCC299]
Length = 717
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 48/168 (28%), Positives = 77/168 (45%), Gaps = 19/168 (11%)
Query: 168 RNKLAFFAGA-VNSP-------VREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLH 219
R L FFAGA V+ P + E + + + H+G+ Y S FC+
Sbjct: 449 RRWLLFFAGAWVDKPAYADRRAIAEAMAGREQEGIHVVQHAGQF---YEKNYASSTFCIA 505
Query: 220 VKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGI 279
G R+ + GC+PVI+ ++ P+ DVL + FS+ VA DIP + I+K I
Sbjct: 506 PTGSGWGR-RMNLATQSGCIPVIVQDNIAAPYDDVLPYDEFSVRVAKADIPKIPDIVKAI 564
Query: 280 SSEEYLLLQSYVLKVRKHFQW-HVFPSDY------DAFYMVMYELWLR 320
+ E+ ++ + + QW + SD+ DAF ++M L R
Sbjct: 565 TPEKLDRMRQQLACAARALQWSSILGSDFGEGGENDAFALLMLTLQHR 612
>gi|345305619|ref|XP_001510258.2| PREDICTED: exostosin-2 [Ornithorhynchus anatinus]
Length = 615
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 49/82 (59%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + + ++D L GCVPVI+A+ Y LPF++VL+WK S+V+
Sbjct: 308 YPQVLQEATFCVVLRGARLGQSVLSDVLQAGCVPVILADSYILPFSEVLDWKRASVVIPE 367
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+P + IL+ I + +Q
Sbjct: 368 EKMPEMYSILQSIPQRQIEEMQ 389
>gi|356500491|ref|XP_003519065.1| PREDICTED: uncharacterized protein LOC100783624 [Glycine max]
Length = 795
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 80/334 (23%), Positives = 135/334 (40%), Gaps = 70/334 (20%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPF----SIAR--------MRHDRRIGTEGIP 80
Y ++ ++ + S T + +AD FF+P I R M+ + +
Sbjct: 395 YGAQIALYESLLASPHRTLNGEEADFFFVPVLDSCIITRADDAPHLSMQEHMGLRSSLTL 454
Query: 81 DFISHYIFNISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAWEVKLNAIQV 131
++ +I ++YPYWNR+ G DH + AC++ I S M W N
Sbjct: 455 EYYKKAYIHIVEQYPYWNRSSGRDHVWSFSWDEGACYAPKEIWNSMMLVHWG-NTNTKHN 513
Query: 132 VCSSSYFISG--------------HIAHKDVSLPQIWPRQEDPPKLGS-------SKRNK 170
+++Y+ KD+ LP W + D L S KR
Sbjct: 514 HSTTAYWADNWDKISSDKRGTHPCFDPDKDLVLPA-W-KVPDANVLTSKLWAWSHEKRKT 571
Query: 171 LAFFAGAVNSP-------------VREKLLQVWRNDSEIYAHSGR------LKTP----- 206
L +F G + +R+KL + + + G+ + TP
Sbjct: 572 LFYFNGNLGPAYPHGRPEDTYSMGIRQKLAEEFGSSPNKDGKLGKQHAKDVIVTPERSEN 631
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y +L S FC G + + R+ DS+ GC+PV+I + LP+ +VLN+ SF++ +
Sbjct: 632 YHLDLASSVFCGVFPG-DGWSGRMEDSILQGCIPVVIQDGIFLPYENVLNYDSFAVRIPE 690
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
+IP L KIL+G + E V K+ + F +
Sbjct: 691 AEIPNLIKILRGFNDTEIEFKLENVQKIWQRFMY 724
>gi|355686834|gb|AER98200.1| exostoses 2 [Mustela putorius furo]
Length = 717
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 49/82 (59%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPVI+A+ Y LPF++VL+WK S+VV
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAALSDVLRAGCVPVIVADSYVLPFSEVLDWKRASVVVPE 367
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+GI + +Q
Sbjct: 368 EKLSDVYGILQGIPRRQIEEMQ 389
>gi|148235505|ref|NP_001080448.1| exostosin 2 [Xenopus laevis]
gi|27881752|gb|AAH44703.1| Ext2-prov protein [Xenopus laevis]
Length = 718
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L S FC+ ++G + ++D L GCVPVIIA+ Y LPF++VL+WK S+V+
Sbjct: 308 YPQILQESTFCIVLRGARLGQGLLSDVLQAGCVPVIIADSYVLPFSEVLDWKRASVVIPE 367
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+GI + +Q
Sbjct: 368 EKMFEMYSILQGIPQRQVEEMQ 389
>gi|222636093|gb|EEE66225.1| hypothetical protein OsJ_22377 [Oryza sativa Japonica Group]
Length = 341
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 47/152 (30%), Positives = 69/152 (45%), Gaps = 10/152 (6%)
Query: 50 TKDPSKADLFFLPFSIARM-----RHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGAD 104
+ P +A FFLP S+A + R D + ++ Y+ ++ YP+WNR+ GAD
Sbjct: 183 ARHPGEAHAFFLPISVASIAGYVYRRDMIDFWDPQLRLVAGYVDGLAAMYPFWNRSRGAD 242
Query: 105 HFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLG 164
HF V+CH E++ NAI+V+C + G DV+LP + P G
Sbjct: 243 HFLVSCHQWAPILSAAKAELRGNAIRVMCDAD-MSDGFDPATDVALPPVVASARATPPQG 301
Query: 165 --SSKRNKL--AFFAGAVNSPVREKLLQVWRN 192
+S+R L G VRE LL W
Sbjct: 302 RVASERTVLAFFAAGGGGGGAVREVLLTRWEG 333
>gi|307176975|gb|EFN66281.1| Exostosin-2 [Camponotus floridanus]
Length = 563
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 49/79 (62%)
Query: 211 LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIP 270
L + FCL ++G + + + D++ GC+PVIIA+ +PF DV++W ++ V +DI
Sbjct: 160 LPSATFCLIIRGARLAQSSLLDAMAAGCIPVIIADSLTMPFHDVIDWTKAAVFVREVDIL 219
Query: 271 LLKKILKGISSEEYLLLQS 289
L+ ++LK IS + + +Q
Sbjct: 220 LIIQLLKKISHQRIMEMQE 238
>gi|432851754|ref|XP_004067068.1| PREDICTED: exostosin-2-like [Oryzias latipes]
Length = 719
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L S FC+ ++G + A ++D L GCVPVI+A+ Y LPF++VL+WK S+V+
Sbjct: 309 YPQILQESSFCVVLRGARLGQATLSDVLQAGCVPVILADSYILPFSEVLDWKRASVVIPE 368
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + ILK I + +Q
Sbjct: 369 EKLSEMYTILKSIPHRQVEEMQ 390
>gi|348558689|ref|XP_003465149.1| PREDICTED: exostosin-2-like isoform 2 [Cavia porcellus]
Length = 669
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/82 (39%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPVIIA+ Y LPF++VL+WK S+VV
Sbjct: 307 YPQVLQEATFCVVLRGARLGQAALSDVLQAGCVPVIIADSYILPFSEVLDWKRASVVVPE 366
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
I + IL+ I + +Q
Sbjct: 367 EKIADVYSILQSIPRRQMEEMQ 388
>gi|348558687|ref|XP_003465148.1| PREDICTED: exostosin-2-like isoform 1 [Cavia porcellus]
Length = 717
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/82 (39%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPVIIA+ Y LPF++VL+WK S+VV
Sbjct: 307 YPQVLQEATFCVVLRGARLGQAALSDVLQAGCVPVIIADSYILPFSEVLDWKRASVVVPE 366
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
I + IL+ I + +Q
Sbjct: 367 EKIADVYSILQSIPRRQMEEMQ 388
>gi|118481039|gb|ABK92473.1| unknown [Populus trichocarpa]
Length = 239
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 45/167 (26%), Positives = 76/167 (45%), Gaps = 15/167 (8%)
Query: 165 SSKRNKLAFFAGAV------------NSPVREKLLQVWRNDSEIYAHSGRLKTPYADELL 212
+ +R+ AFF G + + VR + + + D Y R Y E++
Sbjct: 58 NGRRDIWAFFRGKMEVHPKNISGRYYSKKVRTVIWRKYSGDRRFYLQRHRFAG-YQSEIV 116
Query: 213 GSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLL 272
S FCL G+ + R+ +S+ GCVPVIIA+ LPF + W S+ VA D+ L
Sbjct: 117 RSVFCLCPLGWAPWSPRLVESVALGCVPVIIADGIRLPFPTAVRWSEISLTVAEKDVANL 176
Query: 273 KKILKGISSEEYLLLQSYVL--KVRKHFQWHVFPSDYDAFYMVMYEL 317
+L +++ +Q + VR+ ++ + DA + V+Y L
Sbjct: 177 GTLLDHVAATNLSAIQKNLWDPDVRRALLFNDRVQEGDATWQVLYAL 223
>gi|348509623|ref|XP_003442347.1| PREDICTED: exostosin-2-like [Oreochromis niloticus]
Length = 719
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L S FC+ ++G + A ++D L GCVPVI+A+ Y LPF++VL+WK S+V+
Sbjct: 309 YPQILQESSFCVVLRGARLGQAALSDVLQAGCVPVILADSYILPFSEVLDWKRASVVIPE 368
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + ILK I + +Q
Sbjct: 369 EKLSEMYTILKSIPHRQVEEMQ 390
>gi|410911648|ref|XP_003969302.1| PREDICTED: exostosin-1b-like [Takifugu rubripes]
Length = 740
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 74/342 (21%), Positives = 139/342 (40%), Gaps = 53/342 (15%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
F+VYVYP ++ G SESY S F T DP +A LF L
Sbjct: 107 GFKVYVYPQQK----------------GEKISESYQNILSTIEGSRFYTSDPGQACLFVL 150
Query: 62 PFSI-------ARMRHDRRIGTEGIP---DFISHYIFNI-SQKYPYWNRTGGAD--HFYV 108
+ H+ + + +P D +H IFN+ S +P + G D +
Sbjct: 151 SLDTLDRDQLSPQYVHNMKTKVQNLPLWNDGRNHLIFNLYSGTWPDYTEDLGFDIGQAML 210
Query: 109 ACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQ------IWPRQEDPPK 162
A SI + ++V + + G++ + S+P ++ +
Sbjct: 211 AKASISTDNFRPNFDVSIPLFSKEHPRTGGERGYLKYN--SIPPFRKYMLVFKGKRYLTG 268
Query: 163 LGSSKRNKLAFFAGAVNSPVRE--KLLQVWRNDSEIYAHSGRL---KTPYADELLGSKFC 217
+GS RN L A + + K + W+ + K Y + L S FC
Sbjct: 269 IGSDTRNALHHVHNAEDVVLLTTCKHGKDWQKHKDARCDKDNAEYDKYDYKEMLHNSTFC 328
Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILK 277
L +G + + R ++L CVPV+++N ++LPF+++++W + +++ +++L
Sbjct: 329 LVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEIIDWNTAAVIGD-------ERLLL 381
Query: 278 GISSEEYLLLQSYVLKVRKHFQ--WHVFPSDYDAFYMVMYEL 317
I + + + Q +L +R+ Q W + S + + E+
Sbjct: 382 QIPTTVHSIHQDKILSLRQQTQFLWEAYFSSVEKIVLTTLEI 423
>gi|58269252|ref|XP_571782.1| hypothetical protein [Cryptococcus neoformans var. neoformans JEC21]
gi|57228018|gb|AAW44475.1| conserved hypothetical protein [Cryptococcus neoformans var.
neoformans JEC21]
Length = 1132
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 52/92 (56%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y ++L ++FC +G + + D++Y GC+PV I+ PFAD L+W S+ VA
Sbjct: 1002 YMNDLNNARFCPQPRGITGWSPQTNDAIYAGCIPVFISEGTHYPFADFLDWSKLSVRVAP 1061
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
++ ++KIL I + LQ+ ++ +R+ F
Sbjct: 1062 TELDKIEKILAAIPLSKVEELQANLVSMREAF 1093
>gi|134114275|ref|XP_774385.1| hypothetical protein CNBG3660 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50257020|gb|EAL19738.1| hypothetical protein CNBG3660 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 1132
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 52/92 (56%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y ++L ++FC +G + + D++Y GC+PV I+ PFAD L+W S+ VA
Sbjct: 1002 YMNDLNNARFCPQPRGITGWSPQTNDAIYAGCIPVFISEGTHYPFADFLDWSKLSVRVAP 1061
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
++ ++KIL I + LQ+ ++ +R+ F
Sbjct: 1062 TELDKIEKILAAIPLSKVEELQANLVSMREAF 1093
>gi|47224077|emb|CAG12906.1| unnamed protein product [Tetraodon nigroviridis]
Length = 743
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 74/338 (21%), Positives = 129/338 (38%), Gaps = 81/338 (23%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
F+VYVYP ++ G SESY + S F T DP +A LF L
Sbjct: 110 GFKVYVYPQQK----------------GEKMSESYQNILSSIEGSRFYTSDPEQACLFVL 153
Query: 62 PFSIARMRHDRRIGTEGIPDFISHYIFNIS---QKYPYWNRTGGADHFYVACHSIGRSAM 118
S+ + D+ Y+ N+ Q P WN GG +H +S
Sbjct: 154 --SLDTLDRDQ---------LSPQYVHNLKGRIQSLPLWN--GGKNHIIFNLYSGTWPDY 200
Query: 119 QKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK----------- 167
+ + ++ +S + DVS+P +D P+ G +
Sbjct: 201 TEDLGFDI-GFAMLAKASISTESFRPNFDVSIPLF---SKDHPRTGGERGYLRHNTIPPF 256
Query: 168 RNKLAFFAG-----AVNSPVREKLLQV-----------------WRNDSEIYAHSGRL-- 203
R + F G + S R L V W+ + +
Sbjct: 257 RKYILVFKGKRYLTGIGSDTRNALYHVHNSEDVVLLTTCKHGKDWQKHKDARCDRDNMEY 316
Query: 204 -KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
K Y + L S FCL +G + + R ++L CVPV+++N ++LPF+++++W + ++
Sbjct: 317 DKYDYKEMLYNSTFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEIIDWNTAAV 376
Query: 263 VVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
+ +++L I S + Q +L +R+ Q+
Sbjct: 377 IGD-------ERLLLQIPSTVRSIHQDQILSLRQQTQF 407
>gi|322790909|gb|EFZ15575.1| hypothetical protein SINV_11283 [Solenopsis invicta]
Length = 676
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 27/82 (32%), Positives = 49/82 (59%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y D L + FCL ++G + + + D++ GC+P IIA+ +PF DV++W ++ +
Sbjct: 269 YPDSLQTATFCLIIRGARLAQSVLLDAMAAGCIPAIIADSLMMPFHDVIDWTKAAVFIRE 328
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+DI L ++LK IS + + +Q
Sbjct: 329 VDILLTIQLLKKISHQRIMEMQ 350
>gi|242020116|ref|XP_002430502.1| Exostosin-2, putative [Pediculus humanus corporis]
gi|212515659|gb|EEB17764.1| Exostosin-2, putative [Pediculus humanus corporis]
Length = 696
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 73/302 (24%), Positives = 130/302 (43%), Gaps = 45/302 (14%)
Query: 5 FRVYVYPHRR--NDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFF 60
+VYVYP + N+ +V+ + E Y+K KA + S + + +P +A
Sbjct: 95 IQVYVYPLKTYINEKAEHVVPQMSLE---------YYKILKAIINSKYYSPNPDEA-CIL 144
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
+P SI + ++ E + + P+W G +H + + I ++
Sbjct: 145 VP-SIDTLNQNKLKLKE---------VSQVLASLPHWR--NGENH--LIWNFIPGTSPDY 190
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHK-DVSLPQIWPRQEDPPKLGSSKRNKLAFF----- 174
+ V+LN + + + + F S + D+SLP P + L S+ K
Sbjct: 191 STVVELNVGKSIIAGAGFSSWSFRYGFDISLPLFSPYAKSIKTLNKSQEKKWFVINSQAN 250
Query: 175 --------AGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVN 226
GA++ V+ +L + D H G + Y L + FC+ ++G +
Sbjct: 251 LHSDYERELGALSGVVKLELCE--NGDPRFRCHQG-IAYNYPSVLQHATFCIIIRGARLA 307
Query: 227 TARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLL 286
+ +SL GC+PVI A+ LPF DV++WK SI + D+ L + L +S ++ L
Sbjct: 308 QQALLESLSAGCIPVIAADLMVLPFQDVIDWKRASITILESDLSSLIEKLSSVSDDKKLE 367
Query: 287 LQ 288
LQ
Sbjct: 368 LQ 369
>gi|348512501|ref|XP_003443781.1| PREDICTED: exostosin-1a-like [Oreochromis niloticus]
Length = 743
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 75/357 (21%), Positives = 135/357 (37%), Gaps = 83/357 (23%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
F+VYVYP ++ G SESY + S F T DP +A LF L
Sbjct: 110 GFKVYVYPQQK----------------GEKLSESYQNILSSIEGSRFYTSDPGQACLFVL 153
Query: 62 PFSIARMRHDRRIGTEGIPDFISHYIFNIS---QKYPYWNRTGGADHFYVACHSIGRSAM 118
S+ + D+ Y+ N+ Q P WN GG +H +S
Sbjct: 154 --SLDTLDRDQ---------LSPQYVHNLKAKIQNLPLWN--GGKNHIIFNLYSGTWPDY 200
Query: 119 QKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK----------- 167
+ + + ++ +S DVS+P +D P+ G K
Sbjct: 201 TEDLGFDI-GLAMLAKASISTENFRPDFDVSIPLF---SKDHPRTGGEKGYLKYNTIPPY 256
Query: 168 RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGRL------------------- 203
R + F G + S R L + ++ + + +
Sbjct: 257 RKYMLVFKGKRYLTGIGSDTRNALYHIHNSEDVVLLTTCKHGKDWQKHKDARCDKDNAEY 316
Query: 204 -KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
K Y + L S FCL +G + + R ++L CVPV+++N ++LPF+++++W ++
Sbjct: 317 DKYDYREMLYNSTFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEIIDWNRAAV 376
Query: 263 VVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQ--WHVFPSDYDAFYMVMYEL 317
+ +++L I S + Q +L +R+ Q W + + + + E+
Sbjct: 377 IGD-------ERLLLQIPSTVRSIHQDKILSLRQQTQLLWEAYFNSVEKIVLTTLEI 426
>gi|327280286|ref|XP_003224883.1| PREDICTED: exostosin-1-like [Anolis carolinensis]
Length = 753
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 77/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
F+VYVYP ++ + A ESY A S F T DPS+A LF L
Sbjct: 120 GFKVYVYPQQKGEKIA----------------ESYQNILAAVEGSRFYTSDPSQACLFIL 163
Query: 62 PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
+ H+ R +G+ + +H +FN+ S +P + G D
Sbjct: 164 SLDTLDRDQLSPQYVHNLRSKVQGLHLWNNGRNHLVFNLYSGTWPDYTEDVGFD------ 217
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
IG + + KA S + DVS+P +D P+ G K
Sbjct: 218 --IGHAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGEKGFL 258
Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
R + F G + S R L V + + + +
Sbjct: 259 RFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSR 318
Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
K Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V
Sbjct: 319 CDRDNAEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 378
Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
++W +++ + + ++ I ++ L L+ + F W + S + +
Sbjct: 379 IDWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 433
Query: 315 YEL 317
E+
Sbjct: 434 LEI 436
>gi|363731065|ref|XP_003640902.1| PREDICTED: exostosin-1-like [Gallus gallus]
Length = 740
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 78/363 (21%), Positives = 136/363 (37%), Gaps = 95/363 (26%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFL 61
F+VYVYP ++ + A ESY A S F T DPS+A LF L
Sbjct: 107 GFKVYVYPQQKGEKIA----------------ESYQNVLAAIEGSRFYTSDPSQACLFVL 150
Query: 62 PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
+ H+ R + + + +H IFN+ S +P + G D
Sbjct: 151 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 204
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
IG++ + KA S + DVS+P +D P+ G K
Sbjct: 205 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGEKGFL 245
Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
R + F G + S R L V + + + +
Sbjct: 246 RFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSR 305
Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
K Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V
Sbjct: 306 CDRDNAEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 365
Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
++WK +++ + + ++ I ++ L L+ + F W + S + +
Sbjct: 366 IDWKQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 420
Query: 315 YEL 317
E+
Sbjct: 421 LEI 423
>gi|347963135|ref|XP_311070.5| AGAP000081-PA [Anopheles gambiae str. PEST]
gi|333467342|gb|EAA06302.6| AGAP000081-PA [Anopheles gambiae str. PEST]
Length = 761
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 27/93 (29%), Positives = 55/93 (59%), Gaps = 1/93 (1%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L FCL +G + + +++ GC+PV++A++Y LPFAD+L+W+ ++ +
Sbjct: 353 YPALLTTGTFCLVARGVRLGQPALLEAMAAGCIPVVMADNYVLPFADLLDWELLAVRLPE 412
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKV-RKHF 298
++ + +L+ IS+E +Q+ + V R++F
Sbjct: 413 ANLHTIVPVLRAISAERVAEMQAQIRSVYRRYF 445
>gi|351696937|gb|EHA99855.1| Exostosin-2 [Heterocephalus glaber]
Length = 717
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 49/85 (57%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 307 YPQVLQEATFCVVLRGARLGQAALSDVLRAGCVPVVIADSYILPFSEVLDWKRASVVVPE 366
Query: 267 LDIPLLKKILKGISSEEYLLLQSYV 291
I + +L+ I + +Q V
Sbjct: 367 EKIADMYGVLQSIPRRQMEEMQRQV 391
>gi|348538380|ref|XP_003456670.1| PREDICTED: exostosin-1a-like [Oreochromis niloticus]
Length = 776
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 80/357 (22%), Positives = 144/357 (40%), Gaps = 83/357 (23%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFL 61
F+VY+YP +R G+ SE+Y K + +S F T DP +A LF L
Sbjct: 142 GFKVYIYPQQR----------------GSEISETYKKILTSIEESRFHTTDPLRACLFIL 185
Query: 62 PFSIARMRHDR---------RIGTEGIP---DFISHYIFNI-SQKYPYWNRTGGADHFYV 108
++ + D+ R + +P D +H IFN+ S +P + G +
Sbjct: 186 --AVDTLDRDQLSVQYVQNIRSRIQNLPTWNDGRNHLIFNLYSGSWPDYTEDLGFE---- 239
Query: 109 ACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQ----IWPRQEDPPKLG 164
+G++ + KA +A V S+Y IS + KD L + PP
Sbjct: 240 ----VGQAMLAKA-----SADVVNFRSNYDISIPLFSKDHPLKGGGIGYLTLNDAPP--- 287
Query: 165 SSKRNKLAF----FAGAVNSPVREKLLQVWRNDSEIYAHSGR------------------ 202
S++ +L F + + S R L + + I + +
Sbjct: 288 -SRKYQLVFKGKRYLTGIGSETRNALYHIHNGEDIILLTTCKHGKDWEKHKDSRCDRDNE 346
Query: 203 --LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSF 260
K Y + L S FCL +G + + R ++L C+PVI++N ++LPF++V++W+
Sbjct: 347 DYSKFDYQELLHNSTFCLVPRGRRLGSFRFLEALQAACIPVILSNGWELPFSEVIDWRKA 406
Query: 261 SIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
+I+ + + I + + + L L+ + F W + S + E+
Sbjct: 407 AIIGDERLLLQVPSITRSVGRDRIL-----ALRQQTQFLWDAYFSSVAKIVLTTLEI 458
>gi|255633862|gb|ACU17292.1| unknown [Glycine max]
Length = 141
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 54/106 (50%), Gaps = 6/106 (5%)
Query: 216 FCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKI 275
FCL G+ + R+ +++ +GC+PVIIA+ LPFAD + W+ + V D+P L I
Sbjct: 6 FCLCPLGWAPWSPRLVEAVVFGCIPVIIADDIVLPFADAIPWEEIGVFVDEKDVPQLDTI 65
Query: 276 LKGISSEEYL----LLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
L I E L LL + +K F P DAF+ V+ L
Sbjct: 66 LTSIPPEVILRKQRLLANPFMKQAMLFPQPAQPG--DAFHQVLNGL 109
>gi|344281096|ref|XP_003412316.1| PREDICTED: LOW QUALITY PROTEIN: exostosin-2-like [Loxodonta
africana]
Length = 728
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 49/85 (57%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 267 LDIPLLKKILKGISSEEYLLLQSYV 291
+ + IL+ I + +Q V
Sbjct: 368 EKMSDMYSILQSIPQRQIEEMQRQV 392
>gi|168012366|ref|XP_001758873.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162690010|gb|EDQ76379.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 426
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 54/221 (24%), Positives = 102/221 (46%), Gaps = 17/221 (7%)
Query: 97 WNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSS-SYFISGHIA-HKDVSLPQIW 154
W R+GG DH H + ++ + + + + SS +++ G ++ KDV +P +
Sbjct: 150 WQRSGGRDHVLAVHHPWSMKSHRRFLKSAIWLLSDLDSSGNWYKEGEVSLEKDVIMPYVA 209
Query: 155 PRQE-DPPKLGSSK--RNKLAFFAGAV----NSPVREKLLQVWRNDSE-IYAHSG----R 202
D L +SK R L FF G + VR +L V R++ E I G
Sbjct: 210 NVDACDDNCLATSKPSRKTLLFFQGRIVRGSAGKVRSRLAAVLRDEKERIVFQEGFSGAE 269
Query: 203 LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
K + S FCL G ++AR+ D++ GC+PV++++ +LPF +L+++ ++
Sbjct: 270 GKATAQHGMRSSVFCLSPAGDTPSSARLFDAIVSGCIPVVVSDELELPFEGILDYRQVAL 329
Query: 263 VVATLDIPL---LKKILKGISSEEYLLLQSYVLKVRKHFQW 300
V L L+ + ++ +Q + + +HF++
Sbjct: 330 FVPAARAAQKGWLVAHLRNKTPQDVAAMQQRLAQYGRHFRY 370
>gi|355686831|gb|AER98199.1| exostoses 1 [Mustela putorius furo]
Length = 702
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 77/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
F+VYVYP ++ G +ESY A S F T DPS+A LF L
Sbjct: 69 GFKVYVYPQQK----------------GEKIAESYQNILAAIEGSRFYTSDPSQACLFVL 112
Query: 62 PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
+ H+ R + + + +H IFN+ S +P + G D
Sbjct: 113 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 166
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
IG++ + KA S + DVS+P +D P+ G +
Sbjct: 167 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGERGFL 207
Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
R + F G + S R L V + + + +
Sbjct: 208 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSR 267
Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
K Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V
Sbjct: 268 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 327
Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
+NW +++ + + ++ I ++ L L+ + F W + S + +
Sbjct: 328 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 382
Query: 315 YEL 317
E+
Sbjct: 383 LEI 385
>gi|166183792|gb|ABY84155.1| exostosin 1 (predicted) [Callithrix jacchus]
Length = 746
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 77/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
F+VYVYP ++ + A ESY A S F T DPS+A LF L
Sbjct: 113 GFKVYVYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFVL 156
Query: 62 PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
+ H+ R + + + +H IFN+ S +P + G D
Sbjct: 157 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
IG++ + KA S + DVS+P +D P+ G +
Sbjct: 211 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGERGFL 251
Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
R + F G + S R L V + + + +
Sbjct: 252 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSR 311
Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
K Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V
Sbjct: 312 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 371
Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
+NW +++ + + ++ I ++ L L+ + F W + S + +
Sbjct: 372 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426
Query: 315 YEL 317
E+
Sbjct: 427 LEI 429
>gi|147902262|ref|NP_001091564.1| exostosin-1 [Bos taurus]
gi|238686639|sp|A5D7I4.1|EXT1_BOVIN RecName: Full=Exostosin-1; AltName:
Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
acetylglucosaminyl-proteoglycan
4-alpha-N-acetylglucosaminyltransferase; AltName:
Full=Multiple exostoses protein 1 homolog
gi|146186820|gb|AAI40569.1| EXT1 protein [Bos taurus]
gi|296480555|tpg|DAA22670.1| TPA: exostosin-1 [Bos taurus]
Length = 746
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 77/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
F+VYVYP ++ + A ESY A S F T DPS+A LF L
Sbjct: 113 GFKVYVYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFVL 156
Query: 62 PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
+ H+ R + + + +H IFN+ S +P + G D
Sbjct: 157 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
IG++ + KA S + DVS+P +D P+ G +
Sbjct: 211 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGERGFL 251
Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
R + F G + S R L V + + + +
Sbjct: 252 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSR 311
Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
K Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V
Sbjct: 312 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 371
Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
+NW +++ + + ++ I ++ L L+ + F W + S + +
Sbjct: 372 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426
Query: 315 YEL 317
E+
Sbjct: 427 LEI 429
>gi|190402228|gb|ACE77644.1| exostosin 1 (predicted) [Sorex araneus]
Length = 746
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 77/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
F+VYVYP ++ + A ESY A S F T DPS+A LF L
Sbjct: 113 GFKVYVYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFVL 156
Query: 62 PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
+ H+ R + + + +H IFN+ S +P + G D
Sbjct: 157 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
IG++ + KA S H DVS+P ++ P+ G +
Sbjct: 211 --IGQAMLAKA--------------SISTENFRPHFDVSIPLF---SKEHPRTGGERGFL 251
Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
R + F G + S R L V + + + +
Sbjct: 252 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDILLLTTCKHGKDWQKHKDSR 311
Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
K Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V
Sbjct: 312 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 371
Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
+NW +++ + + ++ I ++ L L+ + F W + S + +
Sbjct: 372 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426
Query: 315 YEL 317
E+
Sbjct: 427 LEI 429
>gi|403283504|ref|XP_003933159.1| PREDICTED: exostosin-1 [Saimiri boliviensis boliviensis]
Length = 746
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 77/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
F+VYVYP ++ + A ESY A S F T DPS+A LF L
Sbjct: 113 GFKVYVYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFVL 156
Query: 62 PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
+ H+ R + + + +H IFN+ S +P + G D
Sbjct: 157 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
IG++ + KA S + DVS+P +D P+ G +
Sbjct: 211 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGERGFL 251
Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
R + F G + S R L V + + + +
Sbjct: 252 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSR 311
Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
K Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V
Sbjct: 312 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 371
Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
+NW +++ + + ++ I ++ L L+ + F W + S + +
Sbjct: 372 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426
Query: 315 YEL 317
E+
Sbjct: 427 LEI 429
>gi|73974379|ref|XP_539145.2| PREDICTED: exostosin-1 isoform 1 [Canis lupus familiaris]
Length = 746
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 77/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
F+VYVYP ++ + A ESY A S F T DPS+A LF L
Sbjct: 113 GFKVYVYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFVL 156
Query: 62 PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
+ H+ R + + + +H IFN+ S +P + G D
Sbjct: 157 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
IG++ + KA S + DVS+P +D P+ G +
Sbjct: 211 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGERGFL 251
Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
R + F G + S R L V + + + +
Sbjct: 252 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSR 311
Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
K Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V
Sbjct: 312 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 371
Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
+NW +++ + + ++ I ++ L L+ + F W + S + +
Sbjct: 372 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426
Query: 315 YEL 317
E+
Sbjct: 427 LEI 429
>gi|301774588|ref|XP_002922714.1| PREDICTED: exostosin-1-like [Ailuropoda melanoleuca]
Length = 746
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 77/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
F+VYVYP ++ + A ESY A S F T DPS+A LF L
Sbjct: 113 GFKVYVYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFVL 156
Query: 62 PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
+ H+ R + + + +H IFN+ S +P + G D
Sbjct: 157 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
IG++ + KA S + DVS+P +D P+ G +
Sbjct: 211 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGERGFL 251
Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
R + F G + S R L V + + + +
Sbjct: 252 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSR 311
Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
K Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V
Sbjct: 312 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 371
Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
+NW +++ + + ++ I ++ L L+ + F W + S + +
Sbjct: 372 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426
Query: 315 YEL 317
E+
Sbjct: 427 LEI 429
>gi|388503508|gb|AFK39820.1| unknown [Lotus japonicus]
Length = 141
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 55/105 (52%), Gaps = 4/105 (3%)
Query: 216 FCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKI 275
FCL G+ + R+ +++ +GC+PVI+A+ LPFAD + W+ + V D+P L I
Sbjct: 6 FCLCPLGWAPWSPRLVEAVVFGCIPVIVADDIVLPFADAIPWEDIGVFVDEEDVPKLDTI 65
Query: 276 LKGISSEEYLLLQSYVLKVRKHFQWHVFPSDY---DAFYMVMYEL 317
L I E +L + +L Q +FP DAF+ V+ L
Sbjct: 66 LTSIPP-EIILRKQRLLANPSMKQAMLFPQPAQPGDAFHQVLNGL 109
>gi|46370066|ref|NP_000118.2| exostosin-1 [Homo sapiens]
gi|20141422|sp|Q16394.2|EXT1_HUMAN RecName: Full=Exostosin-1; AltName:
Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
acetylglucosaminyl-proteoglycan
4-alpha-N-acetylglucosaminyltransferase; AltName:
Full=Multiple exostoses protein 1; AltName:
Full=Putative tumor suppressor protein EXT1
gi|12654671|gb|AAH01174.1| Exostoses (multiple) 1 [Homo sapiens]
gi|119612378|gb|EAW91972.1| exostoses (multiple) 1 [Homo sapiens]
gi|123981930|gb|ABM82794.1| exostoses (multiple) 1 [synthetic construct]
gi|123996761|gb|ABM85982.1| exostoses (multiple) 1 [synthetic construct]
gi|189053697|dbj|BAG35949.1| unnamed protein product [Homo sapiens]
gi|261859910|dbj|BAI46477.1| exostoses (multiple) 1 [synthetic construct]
Length = 746
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 77/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
F+VYVYP ++ + A ESY A S F T DPS+A LF L
Sbjct: 113 GFKVYVYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFVL 156
Query: 62 PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
+ H+ R + + + +H IFN+ S +P + G D
Sbjct: 157 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
IG++ + KA S + DVS+P +D P+ G +
Sbjct: 211 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGERGFL 251
Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
R + F G + S R L V + + + +
Sbjct: 252 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSR 311
Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
K Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V
Sbjct: 312 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 371
Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
+NW +++ + + ++ I ++ L L+ + F W + S + +
Sbjct: 372 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426
Query: 315 YEL 317
E+
Sbjct: 427 LEI 429
>gi|281183009|ref|NP_001162444.1| exostosin-1 [Papio anubis]
gi|384475833|ref|NP_001245062.1| exostosin 1 [Macaca mulatta]
gi|114621433|ref|XP_001141496.1| PREDICTED: exostosin-1 isoform 1 [Pan troglodytes]
gi|397505670|ref|XP_003823375.1| PREDICTED: exostosin-1 [Pan paniscus]
gi|238687365|sp|A9X1C8.1|EXT1_PAPAN RecName: Full=Exostosin-1; AltName:
Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
acetylglucosaminyl-proteoglycan
4-alpha-N-acetylglucosaminyltransferase; AltName:
Full=Multiple exostoses protein 1 homolog
gi|163781066|gb|ABY40823.1| exostoses 1 (predicted) [Papio anubis]
gi|355698182|gb|EHH28730.1| Exostosin-1 [Macaca mulatta]
gi|383410151|gb|AFH28289.1| exostosin-1 [Macaca mulatta]
gi|387541472|gb|AFJ71363.1| exostosin-1 [Macaca mulatta]
gi|410224816|gb|JAA09627.1| exostosin 1 [Pan troglodytes]
gi|410259738|gb|JAA17835.1| exostosin 1 [Pan troglodytes]
gi|410302704|gb|JAA29952.1| exostosin 1 [Pan troglodytes]
gi|410353629|gb|JAA43418.1| exostosin 1 [Pan troglodytes]
Length = 746
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 77/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
F+VYVYP ++ + A ESY A S F T DPS+A LF L
Sbjct: 113 GFKVYVYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFVL 156
Query: 62 PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
+ H+ R + + + +H IFN+ S +P + G D
Sbjct: 157 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
IG++ + KA S + DVS+P +D P+ G +
Sbjct: 211 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGERGFL 251
Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
R + F G + S R L V + + + +
Sbjct: 252 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSR 311
Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
K Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V
Sbjct: 312 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 371
Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
+NW +++ + + ++ I ++ L L+ + F W + S + +
Sbjct: 372 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426
Query: 315 YEL 317
E+
Sbjct: 427 LEI 429
>gi|1168162|gb|AAB62283.1| putative tumour suppressor/hereditary multiple exostoses candidate
gene [Homo sapiens]
gi|1586817|prf||2204384A EXT1 gene
Length = 746
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 77/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
F+VYVYP ++ + A ESY A S F T DPS+A LF L
Sbjct: 113 GFKVYVYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFVL 156
Query: 62 PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
+ H+ R + + + +H IFN+ S +P + G D
Sbjct: 157 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
IG++ + KA S + DVS+P +D P+ G +
Sbjct: 211 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGERGFL 251
Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
R + F G + S R L V + + + +
Sbjct: 252 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSR 311
Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
K Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V
Sbjct: 312 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 371
Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
+NW +++ + + ++ I ++ L L+ + F W + S + +
Sbjct: 372 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426
Query: 315 YEL 317
E+
Sbjct: 427 LEI 429
>gi|426235700|ref|XP_004011818.1| PREDICTED: exostosin-1 [Ovis aries]
Length = 746
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 77/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
F+VYVYP ++ + A ESY A S F T DPS+A LF L
Sbjct: 113 GFKVYVYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFVL 156
Query: 62 PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
+ H+ R + + + +H IFN+ S +P + G D
Sbjct: 157 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
IG++ + KA S + DVS+P +D P+ G +
Sbjct: 211 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGERGFL 251
Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
R + F G + S R L V + + + +
Sbjct: 252 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSR 311
Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
K Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V
Sbjct: 312 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 371
Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
+NW +++ + + ++ I ++ L L+ + F W + S + +
Sbjct: 372 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426
Query: 315 YEL 317
E+
Sbjct: 427 LEI 429
>gi|348588241|ref|XP_003479875.1| PREDICTED: exostosin-1 [Cavia porcellus]
Length = 746
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 77/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
F+VYVYP ++ + A ESY A S F T DPS+A LF L
Sbjct: 113 GFKVYVYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFVL 156
Query: 62 PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
+ H+ R + + + +H IFN+ S +P + G D
Sbjct: 157 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
IG++ + KA S + DVS+P +D P+ G +
Sbjct: 211 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGERGFL 251
Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
R + F G + S R L V + + + +
Sbjct: 252 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVLLLTTCKHGKDWQKHKDSR 311
Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
K Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V
Sbjct: 312 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 371
Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
+NW +++ + + ++ I ++ L L+ + F W + S + +
Sbjct: 372 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426
Query: 315 YEL 317
E+
Sbjct: 427 LEI 429
>gi|395818023|ref|XP_003782438.1| PREDICTED: exostosin-1 [Otolemur garnettii]
gi|197215619|gb|ACH53015.1| exostosin 1 (predicted) [Otolemur garnettii]
Length = 746
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 78/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
F+VYVYP ++ + A ESY A S F T DPS+A LF L
Sbjct: 113 GFKVYVYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFVL 156
Query: 62 PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
+ H+ R + + + +H IFN+ S +P + G D
Sbjct: 157 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
IG++ + KA S + DVS+P +D P+ G +
Sbjct: 211 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGERGFL 251
Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQV-----------------WRNDSEIY 197
R + F G + S R L V W+ +
Sbjct: 252 RFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSR 311
Query: 198 AHSGRL---KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
+ K Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V
Sbjct: 312 CDRDNMEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 371
Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
+NW +++ + + ++ I ++ L L+ + F W + S + +
Sbjct: 372 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426
Query: 315 YEL 317
E+
Sbjct: 427 LEI 429
>gi|348519387|ref|XP_003447212.1| PREDICTED: exostosin-1b-like [Oreochromis niloticus]
Length = 740
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 77/342 (22%), Positives = 139/342 (40%), Gaps = 53/342 (15%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
F+VYVYP ++ G SESY S F T DP +A LF L
Sbjct: 107 GFKVYVYPQQK----------------GEKISESYQNILSTIEGSRFYTSDPGQACLFVL 150
Query: 62 PFSI-------ARMRHDRRIGTEGIP---DFISHYIFNI-SQKYPYWNRTGGAD--HFYV 108
+ H+ + + +P D +H IFN+ S +P + G D +
Sbjct: 151 SLDTLDRDQLSPQYVHNLKTKVQNLPLWNDGRNHLIFNLYSGTWPDYTEDLGFDIGQAML 210
Query: 109 ACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQ------IWPRQEDPPK 162
A SI + ++V + + G++ + S+P ++ +
Sbjct: 211 AKASISTENFRPNFDVSIPLFSKEHPRTGGERGYLKYN--SIPPFRKYMLVFKGKRYLTG 268
Query: 163 LGSSKRNKLAFFAGAVNSPVRE--KLLQVWRNDSEIYAHSGRL---KTPYADELLGSKFC 217
+GS RN L A + + K + W+ + K Y + L S FC
Sbjct: 269 IGSDTRNALYHVHNAEDVVLLTTCKHGKDWQKHKDARCDKDNAEYDKYDYKEMLHNSTFC 328
Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILK 277
L +G + + R ++L CVPV+++N ++LPF+++++W + +++ D LL +I
Sbjct: 329 LVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEIIDWNTAAVIG---DERLLLQIPT 385
Query: 278 GISSEEYLLLQSYVLKVRKHFQ--WHVFPSDYDAFYMVMYEL 317
+ S + Q +L +R+ Q W + S + + E+
Sbjct: 386 TVRS----IHQDKILSLRQQTQFLWEAYFSSVEKIVLTTLEI 423
>gi|431915722|gb|ELK16055.1| Exostosin-2 [Pteropus alecto]
Length = 1849
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 33/94 (35%), Positives = 53/94 (56%), Gaps = 7/94 (7%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 1439 YPQVLQEATFCVVLRGARLGQAVLSDVLRAGCVPVVIADSYILPFSEVLDWKRASVVVPE 1498
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
I + IL+ I Q + +++K +W
Sbjct: 1499 EKISDVYSILQSIP-------QRQIEEMQKQARW 1525
>gi|350537983|ref|NP_001233696.1| exostosin-1 [Cricetulus griseus]
gi|20138355|sp|Q9JK82.1|EXT1_CRIGR RecName: Full=Exostosin-1; AltName: Full=Heparan sulfate
copolymerase; AltName: Full=Multiple exostoses protein 1
homolog
gi|7960285|gb|AAF71276.1|AF252858_1 exostosin 1 [Cricetulus griseus]
Length = 746
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 77/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
F+VYVYP ++ + A ESY A S F T DPS+A LF L
Sbjct: 113 GFKVYVYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFVL 156
Query: 62 PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
+ H+ R + + + +H IFN+ S +P + G D
Sbjct: 157 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
IG++ + KA S + DVS+P +D P+ G +
Sbjct: 211 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGERGFL 251
Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
R + F G + S R L V + + + +
Sbjct: 252 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVLLLTTCKHGKDWQKHKDSR 311
Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
K Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V
Sbjct: 312 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 371
Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
+NW +++ + + ++ I ++ L L+ + F W + S + +
Sbjct: 372 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426
Query: 315 YEL 317
E+
Sbjct: 427 LEI 429
>gi|19909908|dbj|BAB87180.1| XEXT1 [Xenopus laevis]
Length = 735
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 71/359 (19%), Positives = 131/359 (36%), Gaps = 87/359 (24%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLPF 63
F+VY+YP ++ D ++ S + A S F T DPS+A +F L
Sbjct: 102 GFKVYIYPQQKGDKLSD--------------SYQHILAAIETSRFYTSDPSQACVFVL-- 145
Query: 64 SIARMRHDRRIGTEGIPDFISHYIFNISQKYP---YWNRTGGADHFYVACHSIGRSAMQK 120
S+ + D+ HY+ N+ K WN G +H +S
Sbjct: 146 SLDTLDRDQ---------LSPHYVHNLKSKVQNLHLWN--NGRNHLIFNLYS-------G 187
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHK------DVSLPQIWPRQEDPPKLGSSK------- 167
W + + I+ + DVS+P +D P+ G K
Sbjct: 188 TWPDYTEDVGFDIGQAMLAKASISTENFRPNFDVSIPLF---SKDHPRTGGDKGFLRFNN 244
Query: 168 ----RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR---------------- 202
R + F G + S R L V + + + +
Sbjct: 245 IPPMRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDARCDKD 304
Query: 203 ----LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWK 258
K Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V++W
Sbjct: 305 NAEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWN 364
Query: 259 SFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
+++ + + ++ I ++ L L+ + F W + S + + E+
Sbjct: 365 QAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTTLEI 418
>gi|50416406|gb|AAH77234.1| XEXT1 protein [Xenopus laevis]
Length = 735
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 71/359 (19%), Positives = 131/359 (36%), Gaps = 87/359 (24%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLPF 63
F+VY+YP ++ D ++ S + A S F T DPS+A +F L
Sbjct: 102 GFKVYIYPQQKGDKLSD--------------SYQHILAAIETSRFYTSDPSQACVFVL-- 145
Query: 64 SIARMRHDRRIGTEGIPDFISHYIFNISQKYP---YWNRTGGADHFYVACHSIGRSAMQK 120
S+ + D+ HY+ N+ K WN G +H +S
Sbjct: 146 SLDTLDRDQ---------LSPHYVHNLKSKVQNLHLWN--NGRNHLIFNLYS-------G 187
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHK------DVSLPQIWPRQEDPPKLGSSK------- 167
W + + I+ + DVS+P +D P+ G K
Sbjct: 188 TWPDYTEDVGFDIGQAMLAKASISTENFRPNFDVSIPLF---SKDHPRTGGDKGFLRFNN 244
Query: 168 ----RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR---------------- 202
R + F G + S R L V + + + +
Sbjct: 245 IPPMRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDARCDKD 304
Query: 203 ----LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWK 258
K Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V++W
Sbjct: 305 NAEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWN 364
Query: 259 SFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
+++ + + ++ I ++ L L+ + F W + S + + E+
Sbjct: 365 QAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTTLEI 418
>gi|221041480|dbj|BAH12417.1| unnamed protein product [Homo sapiens]
Length = 337
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 189 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 248
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 249 EKMSDVYSILQSIPQRQIEEMQ 270
>gi|156378657|ref|XP_001631258.1| predicted protein [Nematostella vectensis]
gi|156218295|gb|EDO39195.1| predicted protein [Nematostella vectensis]
Length = 563
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 51/88 (57%), Gaps = 5/88 (5%)
Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLK 273
+ FCL ++G + + DSL GC+P+++++ Y LPF++VL+WK ++VV+ +I +
Sbjct: 237 ATFCLVIRGARLGQTALLDSLMMGCIPIVVSDDYILPFSEVLDWKRAAVVVSENEIDRIP 296
Query: 274 KILKGISSEEYLLLQSYVLKVRKHFQWH 301
ILK S Q ++++ F W
Sbjct: 297 LILKDYSQN-----QIKDMRLQGKFMWE 319
>gi|440897880|gb|ELR49485.1| Exostosin-2, partial [Bos grunniens mutus]
Length = 731
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPVIIA+ Y LPF++VL+WK S+VV
Sbjct: 321 YPQVLQEATFCMVLRGARLGQAVLSDVLRAGCVPVIIADSYVLPFSEVLDWKRASVVVPE 380
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 381 EKMSDVYSILQSIPRRQIEEMQ 402
>gi|354491358|ref|XP_003507822.1| PREDICTED: exostosin-2 [Cricetulus griseus]
Length = 718
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 368 EKMSDMYSILQNIPQRQIEEMQ 389
>gi|117935049|ref|NP_803462.2| exostosin-2 [Bos taurus]
gi|117306659|gb|AAI26571.1| Exostoses (multiple) 2 [Bos taurus]
gi|296479641|tpg|DAA21756.1| TPA: exostosin-2 [Bos taurus]
Length = 718
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPVIIA+ Y LPF++VL+WK S+VV
Sbjct: 308 YPQVLQEATFCMVLRGARLGQAVLSDVLRAGCVPVIIADSYVLPFSEVLDWKRASVVVPE 367
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 368 EKMSDVYSILQSIPRRQIEEMQ 389
>gi|338711993|ref|XP_001489915.2| PREDICTED: LOW QUALITY PROTEIN: exostosin-2 [Equus caballus]
Length = 728
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 50/87 (57%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 308 YPQVLQEATFCMVLRGARLGQAVLSDVLRAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLK 293
+ + IL+ I + +Q + +
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQRQLFQ 394
>gi|149721624|ref|XP_001496484.1| PREDICTED: exostosin-1 [Equus caballus]
Length = 746
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 77/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
F+VYVYP ++ + A ESY A S F T DPS+A LF L
Sbjct: 113 GFKVYVYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFVL 156
Query: 62 PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
+ H+ R + + + +H IFN+ S +P + G D
Sbjct: 157 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
IG++ + KA S + DVS+P +D P+ G +
Sbjct: 211 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGERGFL 251
Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
R + F G + S R L V + + + +
Sbjct: 252 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVLLLTTCKHGKDWQKHKDSR 311
Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
K Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V
Sbjct: 312 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 371
Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
+NW +++ + + ++ I ++ L L+ + F W + S + +
Sbjct: 372 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426
Query: 315 YEL 317
E+
Sbjct: 427 LEI 429
>gi|20138161|sp|O77783.1|EXT2_BOVIN RecName: Full=Exostosin-2; AltName:
Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
acetylglucosaminyl-proteoglycan
4-alpha-N-acetylglucosaminyltransferase; AltName:
Full=HS-polymerase; Short=HS-POL; AltName: Full=Multiple
exostoses protein 2 homolog
gi|3599568|gb|AAC35386.1| glucuronyl/N-acetylglucosaminyl transferase [Bos taurus]
Length = 718
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPVIIA+ Y LPF++VL+WK S+VV
Sbjct: 308 YPQVLQEATFCMVLRGARLGQAVLSDVLRAGCVPVIIADSYVLPFSEVLDWKRASVVVPE 367
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 368 EKMSDVYSILQSIPRRQIEEMQ 389
>gi|148234643|ref|NP_001083782.1| exostosin 1 [Xenopus laevis]
gi|62871603|gb|AAH94398.1| XEXT1 protein [Xenopus laevis]
Length = 738
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 71/359 (19%), Positives = 131/359 (36%), Gaps = 87/359 (24%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLPF 63
F+VY+YP ++ D ++ S + A S F T DPS+A +F L
Sbjct: 105 GFKVYIYPQQKGDKLSD--------------SYQHILAAIETSRFYTSDPSQACVFVL-- 148
Query: 64 SIARMRHDRRIGTEGIPDFISHYIFNISQKYP---YWNRTGGADHFYVACHSIGRSAMQK 120
S+ + D+ HY+ N+ K WN G +H +S
Sbjct: 149 SLDTLDRDQ---------LSPHYVHNLKSKVQNLHLWN--NGRNHLIFNLYS-------G 190
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHK------DVSLPQIWPRQEDPPKLGSSK------- 167
W + + I+ + DVS+P +D P+ G K
Sbjct: 191 TWPDYTEDVGFDIGQAMLAKASISTENFRPNFDVSIPLF---SKDHPRTGGDKGFLRFNN 247
Query: 168 ----RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR---------------- 202
R + F G + S R L V + + + +
Sbjct: 248 IPPMRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDARCDKD 307
Query: 203 ----LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWK 258
K Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V++W
Sbjct: 308 NAEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWN 367
Query: 259 SFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
+++ + + ++ I ++ L L+ + F W + S + + E+
Sbjct: 368 QAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTTLEI 421
>gi|344257014|gb|EGW13118.1| Exostosin-2 [Cricetulus griseus]
Length = 701
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 291 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 350
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 351 EKMSDMYSILQNIPQRQIEEMQ 372
>gi|291388448|ref|XP_002710791.1| PREDICTED: exostosin 1 [Oryctolagus cuniculus]
Length = 746
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 77/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
F+VYVYP ++ + A ESY A S F T DPS+A LF L
Sbjct: 113 GFKVYVYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFVL 156
Query: 62 PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
+ H+ R + + + +H IFN+ S +P + G D
Sbjct: 157 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
IG++ + KA S + DVS+P +D P+ G +
Sbjct: 211 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGERGFL 251
Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
R + F G + S R L V + + + +
Sbjct: 252 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVLLLTTCKHGKDWQKHKDSR 311
Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
K Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V
Sbjct: 312 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 371
Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
+NW +++ + + ++ I ++ L L+ + F W + S + +
Sbjct: 372 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426
Query: 315 YEL 317
E+
Sbjct: 427 LEI 429
>gi|183637125|gb|ACC64545.1| exostosin 1 (predicted) [Rhinolophus ferrumequinum]
Length = 746
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 77/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
F+VYVYP ++ + A ESY A S F T DPS+A LF L
Sbjct: 113 GFKVYVYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFVL 156
Query: 62 PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
+ H+ R + + + +H IFN+ S +P + G D
Sbjct: 157 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
IG++ + KA S + DVS+P +D P+ G +
Sbjct: 211 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGERGFL 251
Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
R + F G + S R L V + + + +
Sbjct: 252 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVLLLTTCKHGKDWQKHKDSR 311
Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
K Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V
Sbjct: 312 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 371
Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
+NW +++ + + ++ I ++ L L+ + F W + S + +
Sbjct: 372 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426
Query: 315 YEL 317
E+
Sbjct: 427 LEI 429
>gi|94536924|ref|NP_001035420.1| exostosin-1c [Danio rerio]
gi|92098257|gb|AAI15217.1| Exostoses (multiple) 1c [Danio rerio]
Length = 737
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 82/357 (22%), Positives = 135/357 (37%), Gaps = 83/357 (23%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFL 61
FRVY+YP + G SESY K + +S + T DP +A LF L
Sbjct: 105 GFRVYIYPPEK----------------GERVSESYRKILTSVSESRYYTSDPREACLFVL 148
Query: 62 PFSIARMRHDRRIGTEGIPDFISHYIFNISQK---YPYWNRTGGADHFYVACHSIGRSAM 118
I + D+ ++ N+ ++ YP WN G +H +S
Sbjct: 149 --GIDTLDRDQ---------LSQQFVPNVDERIRGYPLWN--DGRNHVIFNLYSGTWPNY 195
Query: 119 QKAWEVKLNAIQVVCSSSYFISGHIAHK-DVSLPQIWPRQEDPPKLGSS----------K 167
+ ++ N Q + + + + H D+S+P +E P K G +
Sbjct: 196 TE--DLGFNVGQAILAKASLNTEHFRPGFDISIPLF--SKEHPQKGGKRGWLVRNSVPPR 251
Query: 168 RNKLAFFAG-----AVNSPVREKL-----------LQVWRNDSEIYAHSGRL-------- 203
R L F G + S R L L R+ + H
Sbjct: 252 RKYLLMFKGKRYLTGIGSDTRNALHHIHNGKDIVSLTTCRHGKDWEKHKDARCDHDNQEY 311
Query: 204 -KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
+ Y + L S FCL +G + + R +SL C+PV+++N ++LPF+DV+ W
Sbjct: 312 ERFDYQELLHNSTFCLVPRGRRLGSFRFLESLQAACIPVLLSNGWELPFSDVIQWNQ--- 368
Query: 263 VVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQ--WHVFPSDYDAFYMVMYEL 317
V D +++L + S + VL +R+ Q W + S D + E+
Sbjct: 369 AVVEGD----ERLLLQVPSTVRAVGMDRVLALRQQTQTLWDAYFSSVDKIVLTTLEI 421
>gi|1235559|emb|CAA65443.1| ext1 [Mus musculus]
Length = 745
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 78/363 (21%), Positives = 136/363 (37%), Gaps = 95/363 (26%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
F+VYVYP ++ G +ESY A S F T DPS+A LF L
Sbjct: 113 GFKVYVYPQQK----------------GEKIAESYQNILAAIEGSRFYTSDPSQACLFVL 156
Query: 62 PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
+ H+ R + + + +H IFN+ S +P + G D
Sbjct: 157 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
IG++ + KA S+ F DVS+P +D P+ G +
Sbjct: 211 --IGQAMLAKA----------TISTENFRPNF----DVSIPLF---SKDHPRTGGERGFL 251
Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
R + F G + S R L V + + + +
Sbjct: 252 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVLLLTTCKHGKDWQKHKDSR 311
Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
K Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V
Sbjct: 312 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 371
Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
+NW +++ + + ++ I ++ L L+ + F W + S + +
Sbjct: 372 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426
Query: 315 YEL 317
E+
Sbjct: 427 LEI 429
>gi|112807209|ref|NP_034292.2| exostosin-1 [Mus musculus]
gi|283837898|ref|NP_001124012.1| exostosin 1 [Rattus norvegicus]
gi|3023731|sp|P97464.1|EXT1_MOUSE RecName: Full=Exostosin-1; AltName:
Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
acetylglucosaminyl-proteoglycan
4-alpha-N-acetylglucosaminyltransferase; AltName:
Full=Multiple exostoses protein 1 homolog
gi|1813640|gb|AAB41728.1| Ext1 [Mus musculus]
gi|13435765|gb|AAH04741.1| Exostoses (multiple) 1 [Mus musculus]
gi|74205633|dbj|BAE21106.1| unnamed protein product [Mus musculus]
gi|148697307|gb|EDL29254.1| exostoses (multiple) 1, isoform CRA_a [Mus musculus]
gi|149066396|gb|EDM16269.1| similar to Ext1 [Rattus norvegicus]
Length = 746
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 77/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
F+VYVYP ++ + A ESY A S F T DPS+A LF L
Sbjct: 113 GFKVYVYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFVL 156
Query: 62 PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
+ H+ R + + + +H IFN+ S +P + G D
Sbjct: 157 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
IG++ + KA S + DVS+P +D P+ G +
Sbjct: 211 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGERGFL 251
Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
R + F G + S R L V + + + +
Sbjct: 252 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVLLLTTCKHGKDWQKHKDSR 311
Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
K Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V
Sbjct: 312 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 371
Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
+NW +++ + + ++ I ++ L L+ + F W + S + +
Sbjct: 372 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426
Query: 315 YEL 317
E+
Sbjct: 427 LEI 429
>gi|148697308|gb|EDL29255.1| exostoses (multiple) 1, isoform CRA_b [Mus musculus]
Length = 566
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 77/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
F+VYVYP ++ + A ESY A S F T DPS+A LF L
Sbjct: 127 GFKVYVYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFVL 170
Query: 62 PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
+ H+ R + + + +H IFN+ S +P + G D
Sbjct: 171 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 224
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
IG++ + KA S + DVS+P +D P+ G +
Sbjct: 225 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGERGFL 265
Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
R + F G + S R L V + + + +
Sbjct: 266 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVLLLTTCKHGKDWQKHKDSR 325
Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
K Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V
Sbjct: 326 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 385
Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
+NW +++ + + ++ I ++ L L+ + F W + S + +
Sbjct: 386 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 440
Query: 315 YEL 317
E+
Sbjct: 441 LEI 443
>gi|302845050|ref|XP_002954064.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300260563|gb|EFJ44781.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 1122
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 68/277 (24%), Positives = 108/277 (38%), Gaps = 40/277 (14%)
Query: 39 FKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWN 98
F + M + T D +AD +F+P + TE P + + I + YP+W+
Sbjct: 637 FWQRLMSAGIRTVDGDEADYYFIPVNTR---------TELAPGMVEWVLSYIRRTYPWWS 687
Query: 99 RTGGADHFYVACHSIG--------RSAMQKA-----WEVKLNAIQVVCSSSYFISGHIAH 145
+ G H + +G RS ++ A W Q + ++ H
Sbjct: 688 KDNGNRHLIIHTGDMGIADLPADMRSRLKSAFSNITWLTHWGIYQYHPVAKWY-PAHRPG 746
Query: 146 KDVSLP-----------QIWPRQEDPPKL--GSSKRNKLAFFAGAVNSPVREKLLQVWRN 192
KDV LP + PR E + + R+ FFAG + +
Sbjct: 747 KDVVLPVMVTTQGFHLSPLNPRVEARARRRNQTMARSGTFFFAGRICGDRKPPDPATGDC 806
Query: 193 DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFA 252
+SG ++ ++ KFCL G ++ SL GCVPV+I N PF
Sbjct: 807 SRTRPDYSGGVRQ---LDISSHKFCLAPLGGGHGKRQVLVSLM-GCVPVLIGNGVLQPFE 862
Query: 253 DVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQS 289
++W FS+ V DIP L +IL IS + +Q+
Sbjct: 863 PEIDWSRFSVSVPEADIPDLPRILANISDQRVADMQA 899
>gi|126332604|ref|XP_001362716.1| PREDICTED: exostosin-2 [Monodelphis domestica]
Length = 718
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPVIIA+ Y LPF++VL+WK S+V+
Sbjct: 308 YPQVLQEASFCMVLRGARLGQAVLSDVLQAGCVPVIIADSYILPFSEVLDWKRASVVIPE 367
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + +L+ I + +Q
Sbjct: 368 EKMRDMYSVLRSIPQRQIEEMQ 389
>gi|302845052|ref|XP_002954065.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300260564|gb|EFJ44782.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 600
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 78/335 (23%), Positives = 127/335 (37%), Gaps = 60/335 (17%)
Query: 39 FKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWN 98
F + M S T D +KAD F++P + R G+ + + + I + YP+W+
Sbjct: 191 FWQRLMSSGMRTLDGNKADYFYIPINT-------RTGSLAREE-LEWTLPYIKKTYPWWS 242
Query: 99 RTGGADHFYVACHSIGRSAMQKAWEVKLN------------AIQVVCSSSYFISGHIAHK 146
+ G H + +G + A +LN + + + H K
Sbjct: 243 KDNGNRHLIIHTGDMGINDFPLATRRELNESLSNITWLTHWGLHEYHPIAKWYPAHRPGK 302
Query: 147 DVSLP-----------QIWPRQEDPPKLGSSKR--NKLAFFAGAVNS------PVREKLL 187
D+ +P + PR E K + R N FFAG + P K
Sbjct: 303 DIVIPVMIMTQGFHLSPMNPRMEAEIKAQGAPRLRNGTLFFAGRICGDRDLPDPKTGKCG 362
Query: 188 QVWRNDS-----EIYAHSGRLK--------TPYADELLGSKFCLHVKGFEVNTARIADSL 234
+ S +Y +K + Y +++ KFCL G I +
Sbjct: 363 PGHEDYSFGVRQAVYLQHRNVKGFRIVAWTSTYLEDISSHKFCLAPVGGGHGKRNILVA- 421
Query: 235 YYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKV 294
+ GC+PV+I +H PF ++W FSI V DIP L +IL + + E Q +
Sbjct: 422 FMGCLPVLIGDHVLQPFEPEIDWSRFSISVPEADIPDLPRILANVPASEVASKQKRLRCA 481
Query: 295 RKHFQWH-----VFPSD--YDAFYMVMYELWLRRS 322
+H + + D YDAF +M L +R+
Sbjct: 482 AQHMFYSSTLGAILGEDGRYDAFETLMEILRVRKE 516
>gi|410912512|ref|XP_003969733.1| PREDICTED: exostosin-2-like [Takifugu rubripes]
Length = 719
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 47/82 (57%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L S FC+ ++G + A ++D L GCVPVI+A+ Y LPF++VL+WK S+ +
Sbjct: 309 YPQVLQDSSFCVVLRGARLGQAALSDVLQAGCVPVILADSYILPFSEVLDWKRASVFIPE 368
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + ILK I + +Q
Sbjct: 369 EKLSEMYGILKSIPHRQVEEMQ 390
>gi|297791307|ref|XP_002863538.1| hypothetical protein ARALYDRAFT_494490 [Arabidopsis lyrata subsp.
lyrata]
gi|297309373|gb|EFH39797.1| hypothetical protein ARALYDRAFT_494490 [Arabidopsis lyrata subsp.
lyrata]
Length = 476
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 70/282 (24%), Positives = 118/282 (41%), Gaps = 51/282 (18%)
Query: 52 DPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFN---ISQKYPYW-------NRT 100
DP++ADLFF+ FS + I P+F S + ++ + + W R
Sbjct: 158 DPAEADLFFVAAFSSLSL-----IVNSDRPEFGSGFGYSEEVMQESLVSWLEGQEWCRRN 212
Query: 101 GGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH-KDVSLPQIWPRQED 159
G DH VA +A+ + + NA+ +V +F + + KDV +P
Sbjct: 213 NGRDHVIVAGDP---NALNRVMDRVKNAVLLVTDLGWFRADQGSLVKDVIIPYSHRVDAY 269
Query: 160 PPKLGSSKRNKLAF------FAGAV-----------NSPVREKLLQVWRNDSEIYAHSGR 202
+LG +RN L + G+V VR+ L ++ N+ ++ G
Sbjct: 270 EGELGVKQRNNLLYRETSHNLLGSVLVYGLALNVKYGGRVRDLLFKLLENEEDVVIKHGT 329
Query: 203 L----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWK 258
+ + SKFCLH G + R+ D+L CVPVI+++ +LPF D
Sbjct: 330 QSRENRRAAKQGMHTSKFCLHSAGDTHSACRLFDALASLCVPVIVSDGIELPFED----- 384
Query: 259 SFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
A L + K L+ + E+ L Q + +VR++F +
Sbjct: 385 -----DAALKPGFVVKKLRKVKPEKILKYQKAMKEVRRYFDY 421
>gi|395543711|ref|XP_003773757.1| PREDICTED: exostosin-2 [Sarcophilus harrisii]
Length = 917
Score = 60.8 bits (146), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 47/78 (60%)
Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLK 273
+ FC+ ++G + A ++D L GCVPVIIA+ Y LPF++VL+WK S+V+ + +
Sbjct: 294 ASFCIVLRGARLGQAALSDVLQAGCVPVIIADSYILPFSEVLDWKRASVVIPEEKMRDMY 353
Query: 274 KILKGISSEEYLLLQSYV 291
+L+ I + +Q V
Sbjct: 354 SVLRSIPQRQIEEMQRQV 371
>gi|226526925|gb|ACO71282.1| exostoses 1 (predicted) [Dasypus novemcinctus]
Length = 744
Score = 60.8 bits (146), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 77/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
F+VYVYP ++ + A ESY A S F T DPS+A LF L
Sbjct: 111 GFKVYVYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFVL 154
Query: 62 PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
+ H+ R + + + +H IFN+ S +P + G D
Sbjct: 155 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 208
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
IG++ + KA S + DVS+P +D P+ G +
Sbjct: 209 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGERGFL 249
Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
R + F G + S R L V + + + +
Sbjct: 250 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDILLLTTCKHGKDWQKHKDSR 309
Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
K Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V
Sbjct: 310 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 369
Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
+NW +++ + + ++ I ++ L L+ + F W + S + +
Sbjct: 370 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 424
Query: 315 YEL 317
E+
Sbjct: 425 LEI 427
>gi|390332053|ref|XP_003723408.1| PREDICTED: LOW QUALITY PROTEIN: exostosin-2-like
[Strongylocentrotus purpuratus]
Length = 707
Score = 60.8 bits (146), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 52/95 (54%), Gaps = 5/95 (5%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++ + A ++D+L GC+PVII++ Y LPF++V++WK S+VV
Sbjct: 297 YPHILQDATFCIVLRRTRLGQAALSDALQAGCIPVIISDAYILPFSEVIDWKRASLVVRE 356
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
IP L IL + E Y ++ + F W
Sbjct: 357 DRIPDLPDILHAVELEHI-----YEMRQQVRFLWQ 386
>gi|219362519|ref|NP_001137073.1| uncharacterized protein LOC100217246 [Zea mays]
gi|194691480|gb|ACF79824.1| unknown [Zea mays]
gi|413919898|gb|AFW59830.1| hypothetical protein ZEAMMB73_270023 [Zea mays]
Length = 458
Score = 60.8 bits (146), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 66/286 (23%), Positives = 117/286 (40%), Gaps = 47/286 (16%)
Query: 55 KADLFFLPFSIARMRHDRRIG---------TEGIPDFISHY-IFNISQKYPYWNRTGGAD 104
+AD+ F+PF A + + +G +G D+ + + +P W R+ G D
Sbjct: 126 EADVVFVPF-FATLSAEMELGWGTKGAFRKKDGNEDYRRQREVVDRVTSHPAWRRSSGRD 184
Query: 105 HFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGH--------------IAHKDVSL 150
H +V + W V+ + F + I H VSL
Sbjct: 185 HIFVLTDPVA------MWHVRAEIAPAILLVVDFGGWYKVDSKSSSKNSSRVIQHTQVSL 238
Query: 151 PQ--IWPRQEDPPKLGSSK---RNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSG 201
+ I P P L S+ R L +F GA VREKL + N+ ++ G
Sbjct: 239 LKDVIVPYTHLLPTLLLSENKDRRTLLYFKGAKHRHRGGLVREKLWDLLGNEPDVIMEEG 298
Query: 202 RLKTPYADE----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNW 257
++ L S+FCLH G + R+ D++ C+PVI+++ +LPF ++++
Sbjct: 299 FPNATGREQSIKGLRTSEFCLHPAGDTPTSCRLFDAIASLCIPVIVSDEVELPFEGIIDY 358
Query: 258 KSFSIVVA---TLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
SI V+ + L L+ IS ++ + + +V+ F++
Sbjct: 359 TEISIFVSVSNAMRPKWLTSYLRNISKQQKDEFRRNLARVQPIFEY 404
>gi|355752199|gb|EHH56319.1| Exostosin-2, partial [Macaca fascicularis]
Length = 733
Score = 60.8 bits (146), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 323 YPQVLQEATFCVVLRGARLGQAMLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 382
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 383 EKMSDVYSILQSIPQRQIEEMQ 404
>gi|148230947|ref|NP_001082080.1| exostosin [Xenopus laevis]
gi|13183631|gb|AAK15278.1|AF319538_1 exostosin [Xenopus laevis]
gi|37921194|gb|AAO84329.1| exostosin 1 [Xenopus laevis]
Length = 738
Score = 60.8 bits (146), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 71/359 (19%), Positives = 131/359 (36%), Gaps = 87/359 (24%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLPF 63
F+VY+YP ++ D ++ S + A S F T DPS+A +F L
Sbjct: 105 GFKVYIYPQQKGDKLSD--------------SYQHILAAIETSRFYTSDPSQACVFVL-- 148
Query: 64 SIARMRHDRRIGTEGIPDFISHYIFNISQKYP---YWNRTGGADHFYVACHSIGRSAMQK 120
S+ + D+ HY+ N+ K WN G +H +S
Sbjct: 149 SLDTLDRDQ---------LSPHYVHNLKSKVQNLHLWN--NGRNHLIFNLYS-------G 190
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHK------DVSLPQIWPRQEDPPKLGSSK------- 167
W + + I+ + DVS+P +D P+ G K
Sbjct: 191 TWPDYTEDVGFDIGQAMLAKASISTENFRPNFDVSIPLF---SKDHPRTGGDKGFLRFNN 247
Query: 168 ----RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR---------------- 202
R + F G + S R L V + + + +
Sbjct: 248 IPPMRKYMLVFKGKRYLTGIGSDTRNALYHVIMGEDVVLLTTCKHGKKWQKHKDARCDKD 307
Query: 203 ----LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWK 258
K Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V++W
Sbjct: 308 NAEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWN 367
Query: 259 SFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
+++ + + ++ I ++ L L+ + F W + S + + E+
Sbjct: 368 QAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTTLEI 421
>gi|148695683|gb|EDL27630.1| exostoses (multiple) 2, isoform CRA_b [Mus musculus]
Length = 706
Score = 60.8 bits (146), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 47/82 (57%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 372 YPQVLQEATFCTVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 431
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 432 EKMSDVYSILQNIPQRQIEEMQ 453
>gi|2251238|gb|AAB62718.1| multiple exostoses type II protein EXT2.I [Homo sapiens]
Length = 728
Score = 60.8 bits (146), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 53/93 (56%), Gaps = 2/93 (2%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 267 LDIPLLKKILKGISSE--EYLLLQSYVLKVRKH 297
+ + IL+ I E + Q ++ VR+
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQRQLFMEPVRRE 400
>gi|297688867|ref|XP_002821894.1| PREDICTED: exostosin-2 isoform 2 [Pongo abelii]
Length = 728
Score = 60.8 bits (146), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQ 389
>gi|296010875|ref|NP_001171554.1| exostosin-2 isoform 3 [Homo sapiens]
gi|119588476|gb|EAW68070.1| exostoses (multiple) 2, isoform CRA_c [Homo sapiens]
Length = 728
Score = 60.8 bits (146), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQ 389
>gi|74214457|dbj|BAE31083.1| unnamed protein product [Mus musculus]
Length = 690
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 47/82 (57%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 412 YPQVLQEATFCTVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 471
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 472 EKMSDVYSILQNIPQRQIEEMQ 493
>gi|426368047|ref|XP_004051026.1| PREDICTED: exostosin-2 isoform 2 [Gorilla gorilla gorilla]
Length = 728
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQ 389
>gi|426245345|ref|XP_004016473.1| PREDICTED: exostosin-2 [Ovis aries]
Length = 718
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPVIIA+ Y LPF++VL+WK S+VV
Sbjct: 308 YPQVLQEATFCMVLRGARLGQAVLSDVLRAGCVPVIIADSYVLPFSEVLDWKRASVVVPE 367
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 368 EKMLDVYSILQSIPRRQIEEMQ 389
>gi|196014406|ref|XP_002117062.1| hypothetical protein TRIADDRAFT_1026 [Trichoplax adhaerens]
gi|190580284|gb|EDV20368.1| hypothetical protein TRIADDRAFT_1026 [Trichoplax adhaerens]
Length = 657
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 37/146 (25%), Positives = 71/146 (48%), Gaps = 10/146 (6%)
Query: 163 LGSSKRNKLAFFAGAVNSPVRE--KLLQVWRNDSEIYAHSGRL---KTPYADELLGSKFC 217
+GS RN L +N + K WR + L + Y D L+ S FC
Sbjct: 179 IGSETRNALHHLHDDLNYIMVTTCKHGNTWREHQDSRCVVDELTYGEYDYQDLLINSTFC 238
Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILK 277
L +G + + R ++L +GC+P++++N + LPF++V++WK + + + + ++++
Sbjct: 239 LVPRGRRLGSFRFLEALQFGCIPIVLSNGWVLPFSEVIDWKKACVQIDERQLFDVPELIE 298
Query: 278 GISSEEYLLLQSYVLKVRKHFQWHVF 303
IS E+ L +K + F W +
Sbjct: 299 SISDEKIL-----AMKQQSIFLWQTY 319
>gi|332836218|ref|XP_003313041.1| PREDICTED: exostosin-2 isoform 1 [Pan troglodytes]
Length = 728
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQ 389
>gi|332210817|ref|XP_003254509.1| PREDICTED: exostosin-2 isoform 2 [Nomascus leucogenys]
Length = 728
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQ 389
>gi|357508935|ref|XP_003624756.1| Cysteine synthase [Medicago truncatula]
gi|355499771|gb|AES80974.1| Cysteine synthase [Medicago truncatula]
Length = 407
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 38/139 (27%), Positives = 76/139 (54%), Gaps = 9/139 (6%)
Query: 183 REKLLQVWRNDSEIYAHSGRLKTP----YADELLGSKFCLHVKGFEVNTARIADSLYYGC 238
R++L + + + +++ G ++ + + SKFCL++ G ++ R+ D++ C
Sbjct: 13 RQELFYLLKEEKDVHFSFGSVQKGGVRNATNGMRSSKFCLNIAGDTPSSNRLFDAIASHC 72
Query: 239 VPVIIANHYDLPFADVLNWKSFSIVVATLDI---PLLKKILKGISSEEYLLLQSYVLKVR 295
VPVII++ +LP+ DVL++ F + V T D L ++ I +E+ + + + +V
Sbjct: 73 VPVIISDEIELPYEDVLDYSKFCVFVRTRDAVKKKYLINFIRSIGKDEWTRMWNRLKEVE 132
Query: 296 KHFQWHVFPS-DYDAFYMV 313
K F++ FPS + DA M+
Sbjct: 133 KFFEFQ-FPSKEGDAVEMI 150
>gi|159470095|ref|XP_001693195.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158277453|gb|EDP03221.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 659
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 70/285 (24%), Positives = 113/285 (39%), Gaps = 47/285 (16%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARM------RHDRRIGTEGIPDFISHY 86
Y ES + + S T DP +AD F++P I+ + D + + H
Sbjct: 338 YGVESGLHEYLLLSEHRTFDPEEADFFYVPVYISCLIWPVLNYADFPVFYSNGGTRVMHA 397
Query: 87 IFNISQ-------KYPYWNRTGGADHFYVACHSIG-----RSAMQKAWEVKLNAIQVV-C 133
+ +S+ YP+W R GG DH + H G S + W +
Sbjct: 398 VNMLSEARDWIDANYPFWKRRGGRDHIWTFPHDEGACWAPNSIVSSIWLTHWGRMDPDHT 457
Query: 134 SSSYFISGHIAHKDVSLPQIWPRQEDPPKL--------GSSKRNKLAFFAGAVNSPVREK 185
S S F + + VS PRQ PK G K+ + A N
Sbjct: 458 SKSSFDADNYTRDFVS-----PRQ---PKGYTHLIQGHGCYDPKKIYNMSIANN------ 503
Query: 186 LLQVWRNDSEIYAHSGR-LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIA 244
WR + G+ ++ Y+D L S FCL G + +AR D++ +GC+PV++
Sbjct: 504 ----WRQKYNVLVGDGQDVQGDYSDLLSRSLFCLVATG-DGWSARTEDAVLHGCIPVVVI 558
Query: 245 NHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQS 289
+ + F + + SFSI + D+ + ILK + E +Q+
Sbjct: 559 DGVHMKFETLFDVDSFSIRIPEADVANILTILKALPEERVRAMQA 603
>gi|46250443|gb|AAH68545.1| Exostoses (multiple) 2 [Homo sapiens]
Length = 718
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQ 389
>gi|390470425|ref|XP_002755247.2| PREDICTED: exostosin-2 isoform 1 [Callithrix jacchus]
Length = 718
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQ 389
>gi|46370069|ref|NP_997005.1| exostosin-2 isoform 2 [Homo sapiens]
gi|3023739|sp|Q93063.1|EXT2_HUMAN RecName: Full=Exostosin-2; AltName:
Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
acetylglucosaminyl-proteoglycan
4-alpha-N-acetylglucosaminyltransferase; AltName:
Full=Multiple exostoses protein 2; AltName:
Full=Putative tumor suppressor protein EXT2
gi|1518042|gb|AAB07008.1| EXT2 [Homo sapiens]
gi|1519605|gb|AAC51219.1| multiple exostosis 2 [Homo sapiens]
gi|1621113|gb|AAC50764.1| hereditary multiple exostoses gene 2 protein [Homo sapiens]
gi|14603196|gb|AAH10058.1| Exostoses (multiple) 2 [Homo sapiens]
gi|119588474|gb|EAW68068.1| exostoses (multiple) 2, isoform CRA_b [Homo sapiens]
gi|189065454|dbj|BAG35293.1| unnamed protein product [Homo sapiens]
gi|325463523|gb|ADZ15532.1| exostoses (multiple) 2 [synthetic construct]
Length = 718
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQ 389
>gi|432090368|gb|ELK23794.1| Exostosin-2 [Myotis davidii]
Length = 718
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLRAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 368 EKMADVYSILQSIPQRQIEEMQ 389
>gi|402893736|ref|XP_003910045.1| PREDICTED: exostosin-2-like isoform 1 [Papio anubis]
Length = 535
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 341 YPQVLQEATFCVVLRGARLGQAILSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 400
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 401 EKMSDVYSILQSIPQRQIEEMQ 422
>gi|332210815|ref|XP_003254508.1| PREDICTED: exostosin-2 isoform 1 [Nomascus leucogenys]
Length = 718
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQ 389
>gi|74213823|dbj|BAE29346.1| unnamed protein product [Mus musculus]
Length = 786
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 47/82 (57%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 376 YPQVLQEATFCTVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 435
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 436 EKMSDVYSILQNIPQRQIEEMQ 457
>gi|395742863|ref|XP_003777829.1| PREDICTED: exostosin-2 [Pongo abelii]
Length = 751
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 341 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 400
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 401 EKMSDVYSILQSIPQRQIEEMQ 422
>gi|426368045|ref|XP_004051025.1| PREDICTED: exostosin-2 isoform 1 [Gorilla gorilla gorilla]
Length = 718
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQ 389
>gi|397473541|ref|XP_003808267.1| PREDICTED: exostosin-2 [Pan paniscus]
gi|410305316|gb|JAA31258.1| exostosin 2 [Pan troglodytes]
Length = 751
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 341 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 400
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 401 EKMSDVYSILQSIPQRQIEEMQ 422
>gi|168277694|dbj|BAG10825.1| exostosin-2 [synthetic construct]
Length = 746
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 336 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 395
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 396 EKMSDVYSILQSIPQRQIEEMQ 417
>gi|147846684|emb|CAN80640.1| hypothetical protein VITISV_016911 [Vitis vinifera]
Length = 1363
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 79/320 (24%), Positives = 136/320 (42%), Gaps = 71/320 (22%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPF--SIARMRHDRR----------IGTEGIP 80
Y ++ ++ + S T D +AD FF+P S +R D + +
Sbjct: 409 YGAQMAIYESILASPHRTLDGEEADFFFVPVLDSCIIVRADDAPHLNMHAHGGLRSSLTL 468
Query: 81 DFISHYIFNISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAWEVKLNAIQV 131
+F +I ++YP+WNR+ G DH + AC++ I S M W N+
Sbjct: 469 EFYKTAYDHIVEQYPFWNRSSGRDHIWFFSWDEGACYAPKEIWDSMMLVHWG-NTNSKHN 527
Query: 132 VCSSSYFI-----------SGHIA---HKDVSLPQIWPRQEDPPKLGS-------SKRNK 170
+++Y+ H +KD+ LP W R D L S +R
Sbjct: 528 HSTTAYWADNWDSVSSDRRGNHPCFDPYKDLVLPA-WKR-PDVVSLSSKLWSRPREQRKT 585
Query: 171 LAFFAGAVNSP-------------VREKLLQVWRNDSEIYAHSGR------LKTP----- 206
L +F G + +R+K+ + + + G+ + TP
Sbjct: 586 LFYFNGNLGPAYEGGRPETTYSMGIRQKVAEEFGSSPNKEGKLGKQHAEDVIVTPLRSGN 645
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y + L S FC + G + + R DS+ GC+PV+I + LPF ++LN++SF++ +
Sbjct: 646 YHESLASSVFCGVMPG-DGWSGRFEDSILQGCIPVVIQDGIFLPFENMLNYESFAVRIRE 704
Query: 267 LDIPLLKKILKGISSEEYLL 286
+IP L KIL+ +S + Y+L
Sbjct: 705 DEIPNLIKILR-LSGDPYVL 723
>gi|332836216|ref|XP_508383.3| PREDICTED: exostosin-2 isoform 3 [Pan troglodytes]
Length = 718
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQ 389
>gi|332210819|ref|XP_003254510.1| PREDICTED: exostosin-2 isoform 3 [Nomascus leucogenys]
Length = 751
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 341 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 400
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 401 EKMSDVYSILQSIPQRQIEEMQ 422
>gi|296010873|ref|NP_000392.3| exostosin-2 isoform 1 [Homo sapiens]
gi|119588477|gb|EAW68071.1| exostoses (multiple) 2, isoform CRA_d [Homo sapiens]
Length = 751
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 341 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 400
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 401 EKMSDVYSILQSIPQRQIEEMQ 422
>gi|73982548|ref|XP_533196.2| PREDICTED: exostosin-2 isoform 2 [Canis lupus familiaris]
Length = 718
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLRAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQ 389
>gi|403254631|ref|XP_003920065.1| PREDICTED: exostosin-2 [Saimiri boliviensis boliviensis]
Length = 718
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQ 389
>gi|332836220|ref|XP_003313042.1| PREDICTED: exostosin-2 isoform 2 [Pan troglodytes]
Length = 751
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 341 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 400
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 401 EKMSDVYSILQSIPQRQIEEMQ 422
>gi|355566589|gb|EHH22968.1| Exostosin-2 [Macaca mulatta]
Length = 751
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/94 (32%), Positives = 53/94 (56%), Gaps = 7/94 (7%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 341 YPQVLQEATFCVVLRGARLGQAMLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 400
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
+ + IL+ I Q + ++++ +W
Sbjct: 401 EKMSDVYSILQSIP-------QRQIEEMKRQARW 427
>gi|119588473|gb|EAW68067.1| exostoses (multiple) 2, isoform CRA_a [Homo sapiens]
gi|119588475|gb|EAW68069.1| exostoses (multiple) 2, isoform CRA_a [Homo sapiens]
Length = 731
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 321 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 380
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 381 EKMSDVYSILQSIPQRQIEEMQ 402
>gi|124301269|gb|ABN04853.1| Exostosin-like [Medicago truncatula]
Length = 196
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 32/126 (25%), Positives = 68/126 (53%), Gaps = 7/126 (5%)
Query: 182 VREKLLQVWRNDSEIY----AHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYG 237
+R+ L Q+ + ++ A S + + + SKFCLH G + R+ D++
Sbjct: 20 IRDTLFQILEKEDDVIIKHGAQSRESRRAASQGMHTSKFCLHPAGDTPSACRLFDAIVSL 79
Query: 238 CVPVIIANHYDLPFADVLNWKSFSIVVAT---LDIPLLKKILKGISSEEYLLLQSYVLKV 294
CVPVI+++ +LPF D ++++ ++ V T + L IL+G++ + + Q + +V
Sbjct: 80 CVPVIVSDSIELPFEDTIDYRKIAVFVETAAAIQPGYLVSILRGMAPDRIVEYQKELKEV 139
Query: 295 RKHFQW 300
+++F++
Sbjct: 140 KRYFKY 145
>gi|297688871|ref|XP_002821896.1| PREDICTED: exostosin-2 isoform 4 [Pongo abelii]
Length = 718
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQ 389
>gi|335281986|ref|XP_003122905.2| PREDICTED: exostosin-2-like [Sus scrofa]
Length = 718
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLRAGCVPVVIADSYVLPFSEVLDWKRASVVVPE 367
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 368 EKMSDVYSILQSIPRRQMEEMQ 389
>gi|402893738|ref|XP_003910046.1| PREDICTED: exostosin-2-like isoform 2 [Papio anubis]
Length = 515
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 321 YPQVLQEATFCVVLRGARLGQAILSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 380
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 381 EKMSDVYSILQSIPQRQIEEMQ 402
>gi|311253457|ref|XP_001925015.2| PREDICTED: exostosin-1 [Sus scrofa]
Length = 746
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 76/363 (20%), Positives = 135/363 (37%), Gaps = 95/363 (26%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
F+VYVYP ++ + A ESY A S F T DPS+A LF L
Sbjct: 113 GFKVYVYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFVL 156
Query: 62 PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
+ H+ + + + + +H IFN+ S +P + G D
Sbjct: 157 SLDTLDRDQLSPQYVHNLKSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
IG++ + KA S + DVS+P +D P+ G +
Sbjct: 211 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGERGFL 251
Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
R + F G + S R L V + + + +
Sbjct: 252 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSR 311
Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
K Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V
Sbjct: 312 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 371
Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
+NW +++ + + ++ I ++ L L+ + F W + S + +
Sbjct: 372 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426
Query: 315 YEL 317
E+
Sbjct: 427 LEI 429
>gi|297268065|ref|XP_001106468.2| PREDICTED: exostosin-2 [Macaca mulatta]
Length = 718
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAILSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQ 389
>gi|387015838|gb|AFJ50038.1| Exostosin-1-like [Crotalus adamanteus]
Length = 751
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 75/363 (20%), Positives = 134/363 (36%), Gaps = 95/363 (26%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
F+VYVYP ++ + A ESY A S F T DPS+A LF L
Sbjct: 118 GFKVYVYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFVL 161
Query: 62 PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
+ H+ R + + + +H +FN+ S +P + G D
Sbjct: 162 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLVFNLYSGTWPDYTEDVGFD------ 215
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
IG++ + KA S DVS+P +D P+ G +
Sbjct: 216 --IGQAMLAKA--------------SISTENFRPRFDVSIPLF---SKDHPRTGGERGFL 256
Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
R + F G + S R L V + + + +
Sbjct: 257 RFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSR 316
Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
K Y + L + FCL +G + + R ++L C+PV+++N ++LPF++V
Sbjct: 317 CDRDNAEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACIPVMLSNGWELPFSEV 376
Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
+NW +++ + + ++ I ++ L L+ + F W + S + +
Sbjct: 377 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 431
Query: 315 YEL 317
E+
Sbjct: 432 LEI 434
>gi|161612233|gb|AAI55813.1| Exostoses (multiple) 1c [Danio rerio]
Length = 737
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 82/357 (22%), Positives = 135/357 (37%), Gaps = 83/357 (23%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFL 61
FRVY+YP + G SESY K + +S + T DP +A LF L
Sbjct: 105 GFRVYIYPPEK----------------GERVSESYRKILTSVSESRYYTSDPREACLFVL 148
Query: 62 PFSIARMRHDRRIGTEGIPDFISHYIFNISQK---YPYWNRTGGADHFYVACHSIGRSAM 118
I + D+ ++ N+ ++ YP WN G +H +S
Sbjct: 149 --GIDTLDRDQ---------LSQQFVPNVDERIRGYPLWN--DGRNHVIFNLYSGTWPNY 195
Query: 119 QKAWEVKLNAIQVVCSSSYFISGHIAHK-DVSLPQIWPRQEDPPKLGSS----------K 167
+ ++ N Q + + + + H D+S+P +E P K G +
Sbjct: 196 TE--DLGFNVGQAILAKASLNTEHFRPGFDISIPLF--SKEHPQKGGKRGWLVRNSVPPR 251
Query: 168 RNKLAFFAG-----AVNSPVREKL-----------LQVWRNDSEIYAHSGRL-------- 203
R L F G + S R L L R+ + H
Sbjct: 252 RKYLLMFKGKRYLTGIGSDTRNALHHIHNGKDIVSLTTCRHGKDWEKHKDARCDHDNQEY 311
Query: 204 -KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
+ Y + L S FCL +G + + R +SL C+PV+++N ++LPF+DV+ W
Sbjct: 312 ERFDYQELLHNSTFCLVPRGRRLGSFRFLESLQAACIPVLLSNGWELPFSDVIQWNQ--- 368
Query: 263 VVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQ--WHVFPSDYDAFYMVMYEL 317
V D +++L + S + VL +R+ Q W + S D + E+
Sbjct: 369 AVVEGD----ERLLLQVPSTVRAVGIDRVLALRQQTQTLWDAYFSSVDKIVLTTLEI 421
>gi|297688865|ref|XP_002821893.1| PREDICTED: exostosin-2 isoform 1 [Pongo abelii]
Length = 731
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 321 YPQVLQEATFCVVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 380
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 381 EKMSDVYSILQSIPQRQIEEMQ 402
>gi|410973605|ref|XP_003993238.1| PREDICTED: exostosin-2 [Felis catus]
Length = 718
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 308 YPQVLQEATFCVVLRGARLGQAVLSDVLRAGCVPVVIADSYVLPFSEVLDWKRASVVVPE 367
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ + + +Q
Sbjct: 368 EKMSEVYSILQSVPQRQIEEMQ 389
>gi|196004286|ref|XP_002112010.1| hypothetical protein TRIADDRAFT_24279 [Trichoplax adhaerens]
gi|190585909|gb|EDV25977.1| hypothetical protein TRIADDRAFT_24279 [Trichoplax adhaerens]
Length = 668
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 47/76 (61%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L S FCL ++G+ + + D+L +GC+PV++++ Y LPF++VL+WK ++V
Sbjct: 251 YPTILQDSTFCLMLRGYRLIQSNFLDALKFGCIPVVLSDEYILPFSEVLDWKRAALVFRE 310
Query: 267 LDIPLLKKILKGISSE 282
+ L +L IS++
Sbjct: 311 DQLLSLPAVLSSISTK 326
>gi|47230478|emb|CAF99671.1| unnamed protein product [Tetraodon nigroviridis]
Length = 722
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 47/82 (57%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L S FC+ ++G + A ++D L GCVPVI+A+ Y LPF++VL+WK S+ +
Sbjct: 309 YPQILQESSFCVVLRGARLGQAVLSDVLQAGCVPVILADSYILPFSEVLDWKRASVFIPE 368
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + ILK I + +Q
Sbjct: 369 EKLSEMYSILKSIPHRQVEEMQ 390
>gi|307111637|gb|EFN59871.1| hypothetical protein CHLNCDRAFT_133704 [Chlorella variabilis]
Length = 833
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 69/323 (21%), Positives = 123/323 (38%), Gaps = 74/323 (22%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPF--------------------SIARMRHDR 72
Y E+ + ++S T DP +AD F+LP A R
Sbjct: 413 YQPETGIHEMLLQSEHRTLDPEEADYFYLPVYASCAIWPVLYSNDFPYFHGGPAAQR--- 469
Query: 73 RIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVV 132
T G + + +PYW+R GG DH + H G W + +V
Sbjct: 470 ---THGATNMFMEVQSWVRSHFPYWDRNGGRDHIVLTVHDEG-----SCWLPAVLRPAIV 521
Query: 133 CSSSYFISGHIAHKDVSLPQ----------------IWPRQEDPPKLG---SSKRNKLAF 173
S H DV+ P +W + KLG +K+ +
Sbjct: 522 MS-------HWGRTDVNPPAGTGYDADTYSNEVRHPVWQPEGHLSKLGEFPCYDPSKVTY 574
Query: 174 FAGAVNSP--------VREKLLQV-----WRNDSEIYAHSGRLKTP---YADELLGSKFC 217
G P R+ L + W + I+ +G Y++ + S FC
Sbjct: 575 ILGGRIQPENARYSRGTRQFLANISEAEGWWDKYRIHVGAGSPPGGPGDYSECMARSVFC 634
Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILK 277
L + G + ++R D++ +GC+PVI+ + +L + +L+ ++S+ V D+ + +IL+
Sbjct: 635 LALMG-DGYSSRFDDAVLHGCIPVIVQDGIELTWHSLLDIPAYSLRVPQADMARIPQILQ 693
Query: 278 GISSEEYLLLQSYVLKVRKHFQW 300
+ E+ +Q+ + KV + W
Sbjct: 694 AVPQEDIARMQANLAKVWRRHIW 716
>gi|188528923|ref|NP_001120887.1| exostosin 2 [Xenopus (Silurana) tropicalis]
gi|183986340|gb|AAI66243.1| ext2 protein [Xenopus (Silurana) tropicalis]
Length = 718
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L S FC+ ++G + + ++D L GCVPVIIA+ Y LPF++VL+WK S+V+
Sbjct: 308 YPQILQESTFCIVLRGARLGQSVLSDVLQAGCVPVIIADSYVLPFSEVLDWKRASVVIPE 367
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ + + +Q
Sbjct: 368 EKMFEMYSILQAVPQRQLEEMQ 389
>gi|74201817|dbj|BAE28510.1| unnamed protein product [Mus musculus]
Length = 701
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 47/82 (57%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 291 YPQVLQEATFCTVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 350
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 351 EKMSDVYSILQNIPQRQIEEMQ 372
>gi|226442845|ref|NP_034293.2| exostosin-2 [Mus musculus]
gi|341940670|sp|P70428.2|EXT2_MOUSE RecName: Full=Exostosin-2; AltName:
Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
acetylglucosaminyl-proteoglycan
4-alpha-N-acetylglucosaminyltransferase; AltName:
Full=Multiple exostoses protein 2 homolog
gi|74216635|dbj|BAE37749.1| unnamed protein product [Mus musculus]
gi|148695682|gb|EDL27629.1| exostoses (multiple) 2, isoform CRA_a [Mus musculus]
Length = 718
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 47/82 (57%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 308 YPQVLQEATFCTVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 368 EKMSDVYSILQNIPQRQIEEMQ 389
>gi|13879260|gb|AAH06597.1| Exostoses (multiple) 2 [Mus musculus]
Length = 718
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 47/82 (57%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 308 YPQVLQEATFCTVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 368 EKMSDVYSILQNIPQRQIEEMQ 389
>gi|260826381|ref|XP_002608144.1| hypothetical protein BRAFLDRAFT_91377 [Branchiostoma floridae]
gi|229293494|gb|EEN64154.1| hypothetical protein BRAFLDRAFT_91377 [Branchiostoma floridae]
Length = 595
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 46/69 (66%)
Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLK 273
+ FC+ ++G + + ++D++ GC+PVI + Y +PF++VL+WK ++++ D+P +
Sbjct: 194 ATFCMILRGARMGQSALSDAMMAGCIPVIAIDTYVMPFSEVLDWKRAAVILREEDLPDVH 253
Query: 274 KILKGISSE 282
+L+ IS E
Sbjct: 254 NVLRRISQE 262
>gi|395512319|ref|XP_003760388.1| PREDICTED: exostosin-1 [Sarcophilus harrisii]
Length = 746
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 77/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
F+VYVYP ++ G +ESY A S F T DPS+A LF L
Sbjct: 113 GFKVYVYPQQK----------------GEKIAESYQNILAAIEGSRFYTSDPSQACLFVL 156
Query: 62 PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
+ H+ + + + + +H IFN+ S +P + G D
Sbjct: 157 SLDTLDRDQLSPQYVHNLKSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
IG++ + KA S + DVS+P +D P+ G K
Sbjct: 211 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGEKGFL 251
Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
R + F G + S R L V + + + +
Sbjct: 252 RFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSR 311
Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
K Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V
Sbjct: 312 CDKDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 371
Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
++W +I+ + + ++ I ++ L L+ + F W + S + +
Sbjct: 372 IDWNQAAIIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426
Query: 315 YEL 317
E+
Sbjct: 427 LEI 429
>gi|187469677|gb|AAI66748.1| Ext2 protein [Rattus norvegicus]
Length = 718
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 76/307 (24%), Positives = 120/307 (39%), Gaps = 44/307 (14%)
Query: 5 FRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFLP 62
+VY+YP ++ A V P + S Y A S + T D S+A LF
Sbjct: 104 IKVYIYPLKKYVDDAGV-------PVSSAISREYNELLTAISDSDYYTDDISRACLFVPS 156
Query: 63 FSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAW 122
+ P I ++Q P W+R G +H G A
Sbjct: 157 IDVLNQN----------PLRIKETAQALAQ-LPRWDR--GTNHLLFNMLPGGPPDYNTAL 203
Query: 123 EVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQED---PPKLGSSKRNKLAFFAGAVN 179
+V + ++ + + DVS+P P + P K S+R L A++
Sbjct: 204 DVPRDR-ALLAGGGFSTWTYRQGYDVSIPVFSPLSAEVALPEKAPGSRRYFLLSSQMAIH 262
Query: 180 SPVREKL--LQVWRNDSEIYAH------SGRLKT----------PYADELLGSKFCLHVK 221
RE+L LQ +S + G L Y L + FC ++
Sbjct: 263 PEYREELEALQAKHRESVLVLDKCTNLSEGVLSVRKRCHQHQVFDYPQVLQEATFCTVLR 322
Query: 222 GFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISS 281
G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+V+ + + IL+ I
Sbjct: 323 GARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVIPEEKMSDVYSILQNIPQ 382
Query: 282 EEYLLLQ 288
+ +Q
Sbjct: 383 RQIEEMQ 389
>gi|74144073|dbj|BAE22143.1| unnamed protein product [Mus musculus]
Length = 435
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 47/82 (57%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 25 YPQVLQEATFCTVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 84
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 85 EKMSDVYSILQNIPQRQIEEMQ 106
>gi|197101852|ref|NP_001125538.1| exostosin-1 [Pongo abelii]
gi|75042009|sp|Q5RBC3.1|EXT1_PONAB RecName: Full=Exostosin-1; AltName:
Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
acetylglucosaminyl-proteoglycan
4-alpha-N-acetylglucosaminyltransferase; AltName:
Full=Multiple exostoses protein 1 homolog
gi|55728386|emb|CAH90937.1| hypothetical protein [Pongo abelii]
Length = 746
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 76/363 (20%), Positives = 134/363 (36%), Gaps = 95/363 (26%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
F+VY YP ++ + A ESY A S F T DPS+A LF L
Sbjct: 113 GFKVYAYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFVL 156
Query: 62 PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
+ H+ R + + + +H IFN+ S +P + G D
Sbjct: 157 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
IG++ + KA S + DVS+P +D P+ G +
Sbjct: 211 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGERGFL 251
Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
R + F G + S R L V + + + +
Sbjct: 252 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSR 311
Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
K Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V
Sbjct: 312 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 371
Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
+NW +++ + + ++ I ++ L L+ + F W + S + +
Sbjct: 372 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426
Query: 315 YEL 317
E+
Sbjct: 427 LEI 429
>gi|126322251|ref|XP_001370057.1| PREDICTED: exostosin-1 [Monodelphis domestica]
Length = 746
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 77/363 (21%), Positives = 135/363 (37%), Gaps = 95/363 (26%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
F+VYVYP ++ G +ESY A S F T DPS+A LF L
Sbjct: 113 GFKVYVYPQQK----------------GEKIAESYQNILAAIEGSRFYTSDPSQACLFVL 156
Query: 62 PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
+ H+ + + + + +H IFN+ S +P + G D
Sbjct: 157 SLDTLDRDQLSPQYVHNLKSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
IG++ + KA S + DVS+P +D P+ G K
Sbjct: 211 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGEKGFL 251
Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
R + F G + S R L V + + + +
Sbjct: 252 RFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSR 311
Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
K Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V
Sbjct: 312 CDKDNTEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 371
Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
++W +I+ + + ++ I ++ L L+ + F W + S + +
Sbjct: 372 IDWNQAAIIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426
Query: 315 YEL 317
E+
Sbjct: 427 LEI 429
>gi|444707556|gb|ELW48821.1| Exostosin-2 [Tupaia chinensis]
Length = 740
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 47/82 (57%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FCL ++G + A ++D L GCVPVI+A+ Y LPF++VL+WK S+ V
Sbjct: 330 YPQVLQEATFCLVLRGARLGQAVLSDVLQAGCVPVIVADSYILPFSEVLDWKRASVAVPE 389
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ + + +Q
Sbjct: 390 EKLSDVYSILQSVPQRQIEEMQ 411
>gi|327259699|ref|XP_003214673.1| PREDICTED: exostosin-2-like [Anolis carolinensis]
Length = 718
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 47/82 (57%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GC+PV+IA+ Y LPF++VL+WK S+V+
Sbjct: 308 YPQVLQEATFCIVLRGARLGQAVLSDVLQAGCIPVVIADSYILPFSEVLDWKRASVVIPE 367
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL I + +Q
Sbjct: 368 EKMSEMYSILHSIPQRQIEEMQ 389
>gi|302830910|ref|XP_002947021.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300268065|gb|EFJ52247.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 834
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 78/340 (22%), Positives = 138/340 (40%), Gaps = 80/340 (23%)
Query: 32 NYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEG----IPD-----F 82
Y E F + ++S T DP AD F++P + +G +P
Sbjct: 431 TYGLEVLFHEMLLQSEHRTFDPEAADYFYVPVYGSCFIFPLHCYADGPWWHVPSGPRVMH 490
Query: 83 ISHYIFN----ISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVV------ 132
+++ + I + +PYW+R GG DH ++ H G A E+ ++I +
Sbjct: 491 VTNMMLEVRDWIRKHFPYWDRRGGRDHIWLMTHDEG--ACYAPTEIYNSSIFLTHWGRID 548
Query: 133 ---CSSSYF---------------------ISGHIAH---KDVSLPQI--------WPRQ 157
S++ F I GH + KD+ +P + P
Sbjct: 549 KHHASNTAFTPDNYTQEYVHPEQPGGWLHLIDGHPCYTPGKDLVVPALKLPHHFRQSPLL 608
Query: 158 EDPPKLGSSKRNKLAFFAGAVNS--------PVREKLLQVWRNDSEIYAHS------GRL 203
PP+ +R+ L + G V +R++L ++WR+ + ++ +
Sbjct: 609 FHPPR----QRDILLYLRGDVGKHRLPNYSRGIRQRLYRLWRDQQWLQGYNVMIGDGSDV 664
Query: 204 KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIV 263
Y++ L SKFCL V G + + R+ D++ +GCVPVII + + D L + FSI
Sbjct: 665 PGDYSEHLSRSKFCLVVPG-DGWSPRLEDAVLHGCVPVIIMDGVHGVWEDQLELERFSIR 723
Query: 264 VATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVF 303
V ++ L + L + +L+ K+RK WH +
Sbjct: 724 VGEDELEGLPQQLAVVPQR---VLEDMQRKLRK--VWHRY 758
>gi|356533217|ref|XP_003535163.1| PREDICTED: uncharacterized protein LOC100807663 [Glycine max]
Length = 795
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 75/337 (22%), Positives = 134/337 (39%), Gaps = 80/337 (23%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPF----SIAR--------MRHDRRIGTEGIP 80
Y ++ ++ + S T + +AD FF+P I R M+ + +
Sbjct: 395 YGAQIALYESLLASPHRTLNGEEADFFFVPVLDSCIITRADDAPHLSMQEHMGLRSSLTL 454
Query: 81 DFISHYIFNISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAWEVKLNAIQV 131
++ + +I ++YPYW+ + G DH + AC++ I S M W N
Sbjct: 455 EYYKNTYTHIVEQYPYWSHSSGRDHIWSFSWDEGACYAPKEIWNSMMLVHWG-NTNTKHN 513
Query: 132 VCSSSYFISG--------------HIAHKDVSLP------------QIWPRQEDPPKLGS 165
+++Y+ KD+ LP ++W R +
Sbjct: 514 HSTTAYWADNWDKISSDRRGIHPCFDPDKDLVLPAWKVPDAYVLTSKLWARSHE------ 567
Query: 166 SKRNKLAFFAGAVNSP-------------VREKLLQVWRNDSEIYAHSGR------LKTP 206
KR L +F G + +R+KL + + + G+ + TP
Sbjct: 568 -KRKTLFYFNGNLGPAYPHGRPEDTYSMGIRQKLAEEFGSSPNKDGKLGKQHAKDVIVTP 626
Query: 207 -----YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFS 261
Y +L S FC G + + R+ DS+ GC+PV+I + LP+ +VLN+ SF+
Sbjct: 627 ERSEDYHMDLASSVFCGVFPG-DGWSGRMEDSILQGCIPVVIQDGIFLPYENVLNYDSFA 685
Query: 262 IVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
+ + +IP L K L+G + E + V K+ + F
Sbjct: 686 VRIPEAEIPNLIKTLRGFNDTEIEFKLANVQKIWQRF 722
>gi|323456573|gb|EGB12440.1| hypothetical protein AURANDRAFT_70660 [Aureococcus anophagefferens]
Length = 1731
Score = 59.7 bits (143), Expect = 2e-06, Method: Composition-based stats.
Identities = 35/145 (24%), Positives = 69/145 (47%), Gaps = 20/145 (13%)
Query: 175 AGAVNSPVREKLLQVWRNDSEIYAHSGRLKTP-----------------YADELLGSKFC 217
AG + +R L W + A G + P Y + + +K+C
Sbjct: 1465 AGGLEYDLRRALTDAWDPGGPLRAKRGVARAPPPLDPRVDVRFLMAKANYTESMRRAKYC 1524
Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILK 277
+ +GF + R+++++ GCVP ++ P+A VL+W +FS+ +A D+ L ++L
Sbjct: 1525 VVTEGFSPWSPRLSEAVALGCVPCFLSPSLAPPYATVLDWSAFSVEIAEADVGRLPEVL- 1583
Query: 278 GISSEEYLLLQSYVLKVRKHFQWHV 302
++ ++ L + +L+VR F + V
Sbjct: 1584 --AAYDWAYLHANLLRVRPLFAFCV 1606
>gi|417404013|gb|JAA48784.1| Putative glycosyl transferase family 64 domain protein [Desmodus
rotundus]
Length = 701
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 291 YPQVLQEATFCVVLRGARLGQAVLSDILRAGCVPVVIADSYILPFSEVLDWKRASVVVPE 350
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 351 EKMSDVYGILQSIPQRQIGEMQ 372
>gi|224046654|ref|XP_002200457.1| PREDICTED: exostosin-1 [Taeniopygia guttata]
Length = 741
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 77/363 (21%), Positives = 134/363 (36%), Gaps = 95/363 (26%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFL 61
F+VYVYP ++ + A ESY A S F T DP +A LF L
Sbjct: 108 GFKVYVYPQQKGEKIA----------------ESYQNVLAAIEGSRFYTSDPGQACLFVL 151
Query: 62 PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
+ H+ R + + + +H IFN+ S +P + G D
Sbjct: 152 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 205
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
IG++ + KA S + DVS+P +D P+ G K
Sbjct: 206 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGEKGFL 246
Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
R + F G + S R L V + + + +
Sbjct: 247 RFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSR 306
Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
K Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V
Sbjct: 307 CDRDNAEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 366
Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
++W +I+ + + ++ I ++ L L+ + F W + S + +
Sbjct: 367 IDWNQAAIIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 421
Query: 315 YEL 317
E+
Sbjct: 422 LEI 424
>gi|157820411|ref|NP_001101221.1| exostosin-2 [Rattus norvegicus]
gi|149022701|gb|EDL79595.1| exostoses (multiple) 2 (predicted) [Rattus norvegicus]
Length = 670
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 47/82 (57%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK S+V+
Sbjct: 308 YPQVLQEATFCTVLRGARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKRASVVIPE 367
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 368 EKMSDVYSILQNIPQRQIEEMQ 389
>gi|395815565|ref|XP_003781296.1| PREDICTED: exostosin-2 [Otolemur garnettii]
Length = 718
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 48/82 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC+ ++G + A +++ L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 308 YPQVLQDATFCVVLRGARLGQAVLSEVLQAGCVPVVIADSYILPFSEVLDWKRASVVVPE 367
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 368 EKMSDVYSILQSIPQRQIEEMQ 389
>gi|322511376|gb|ADX06684.1| putative exostosin-like protein [Organic Lake phycodnavirus]
Length = 290
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 46/144 (31%), Positives = 70/144 (48%), Gaps = 15/144 (10%)
Query: 156 RQEDPPKLGSSKRNKLAFFAGAVNSP-----VREKLLQVW-RNDSEIYAHSGRLKTPYAD 209
R E PK S+++ L F G + S VR+K+ V RN + +SG K +
Sbjct: 119 RLERMPKQSFSQKSILCSFVGNITSNNVVPNVRKKMFDVLSRNANFTMINSGGWKPVVTE 178
Query: 210 EL--------LGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYD-LPFADVLNWKSF 260
L L SKF L +G+ + R + L G +PV I N + LPF DV+N+K
Sbjct: 179 TLQDIFIKTTLKSKFALAPRGYGRGSFRFFECLQLGTIPVYIWNDVNWLPFQDVINYKQL 238
Query: 261 SIVVATLDIPLLKKILKGISSEEY 284
+V+ +I L+++L IS +Y
Sbjct: 239 CVVIHADEIDTLEQVLLNISETQY 262
>gi|308491052|ref|XP_003107717.1| CRE-RIB-1 protein [Caenorhabditis remanei]
gi|308249664|gb|EFO93616.1| CRE-RIB-1 protein [Caenorhabditis remanei]
Length = 347
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 67/273 (24%), Positives = 111/273 (40%), Gaps = 42/273 (15%)
Query: 41 KAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRT 100
K F + T DP++A LF L DR + + ++ YI ++ Q WN
Sbjct: 39 KHFQEREHYTNDPNEACLFLLGIDTT----DRDVRSLNYVKNVNEYIESLDQ--AVWN-- 90
Query: 101 GGADHF-YVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLP---QIWPR 156
G +H Y H K ++ +S + D+SLP + P
Sbjct: 91 NGRNHLIYNFYHGTFPDYDDHNLNFKTGE-AIIARASSSENNFFKDFDISLPLFHENHPY 149
Query: 157 QEDPPKLGSS----KRNKLAFFAG-----AVNSPVREKLL-------------------- 187
Q + +L + KR LA F G + S R +
Sbjct: 150 QIESQRLHNEPKEEKRRYLASFKGKRYVYGIGSGTRNLVHHLHNGDDIVMVTTCKHNNDW 209
Query: 188 QVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHY 247
QV+++D + + Y D L S FCL +G + + R ++L GC+PV+I++ +
Sbjct: 210 QVYQDDRCQRDNEEYDRWEYDDLLSNSTFCLVPRGRRLGSFRFLETLRSGCIPVVISDSW 269
Query: 248 DLPFADVLNWKSFSIVVATLDIPLLKKILKGIS 280
LPF++ ++W S +IVVA D + ++L S
Sbjct: 270 ILPFSETIDWHSAAIVVAERDALSIPELLMSTS 302
>gi|410905063|ref|XP_003966011.1| PREDICTED: exostosin-1b-like [Takifugu rubripes]
Length = 743
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 71/322 (22%), Positives = 134/322 (41%), Gaps = 51/322 (15%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
F+V+VYP ++ G SESY + S F T DP +A LF L
Sbjct: 110 GFKVFVYPQQK----------------GEKMSESYQNILSSIEGSRFYTSDPEQACLFVL 153
Query: 62 PFSI-------ARMRHDRRIGTEGIP---DFISHYIFNI-SQKYPYWNRTGGADHFY--V 108
+ H+ + + +P + +H IFN+ S +P + G D Y +
Sbjct: 154 SLDTLDRDQLSPQYVHNLKGRIQSLPLWNEGKNHIIFNLYSGTWPNYTEDLGFDIGYAML 213
Query: 109 ACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQ------IWPRQEDPPK 162
A SI + ++V + + G++ H S+P ++ +
Sbjct: 214 AKASISTENFRPNFDVSIPLFSKDHPRTGGERGYLRHN--SIPPFRKYMLVFKGKRYLTG 271
Query: 163 LGSSKRNKLAFFAGAVNSPVRE--KLLQVWRNDSEIYAHSGRL---KTPYADELLGSKFC 217
+GS RN L + + + K + W+ + + K Y + L S FC
Sbjct: 272 IGSDTRNALYHVHNSEDVVLLTTCKHGKDWQKHKDARCDRDNMEYDKYDYREMLYNSTFC 331
Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILK 277
L +G + + R ++L CVPV+++N ++LPF+++++W + +++ +++L
Sbjct: 332 LVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEIIDWNTAAVIGD-------ERLLL 384
Query: 278 GISSEEYLLLQSYVLKVRKHFQ 299
I S + Q +L +R+ Q
Sbjct: 385 QIPSTVRSIHQDQILSLRQQTQ 406
>gi|297807715|ref|XP_002871741.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
gi|297317578|gb|EFH48000.1| exostosin family protein [Arabidopsis lyrata subsp. lyrata]
Length = 510
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 67/267 (25%), Positives = 119/267 (44%), Gaps = 33/267 (12%)
Query: 95 PYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVV----CSSSYFISGHIA-HKDVS 149
P W R+ G DH + H +++K + NAI ++ + +++ G ++ KD+
Sbjct: 227 PAWKRSEGRDHIFPIHHPWSFKSVRKFVK---NAIWLLPDMDSTGNWYKPGQVSLEKDLI 283
Query: 150 LPQIWPRQE--DPPKLGSSK--RNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSG 201
LP + P + D L S R L FF G + +R KL + G
Sbjct: 284 LPYV-PNVDICDAKCLSESAPMRTTLLFFRGRLKRNAGGKIRAKLGAELSGVKGVIISEG 342
Query: 202 RL----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNW 257
K + S FCL G ++AR+ D++ GC+PVI+++ + PF +L++
Sbjct: 343 TAGEGGKLAAQGGMRRSLFCLCPAGDTPSSARLFDAIVSGCIPVIVSDELEFPFEGILDY 402
Query: 258 KSFSIVVATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWH-----VFPSDYD- 308
K +++V++ D+ L L+ ++ + LQ + + +HF + + P D
Sbjct: 403 KKVAVLVSSNDVVQPGWLVNHLRSLTPFQIKELQKNLAQYSRHFLYSSPAQPLGPEDLTW 462
Query: 309 ---AFYMVMYELWLRRSSVRVQWSTSL 332
A +V +L RRS V+ S SL
Sbjct: 463 RMMAGKLVNIKLHTRRSQRVVKGSRSL 489
>gi|401886285|gb|EJT50333.1| hypothetical protein A1Q1_00388 [Trichosporon asahii var. asahii
CBS 2479]
Length = 1041
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 41/148 (27%), Positives = 74/148 (50%), Gaps = 11/148 (7%)
Query: 166 SKRNKLAFFAGAVNSPVREKLLQVW-------RNDSEIYAHSGRLKTPYADELLGSKFCL 218
++R LA F G+ N L+ + ++AH KT Y D L ++FC
Sbjct: 845 AERQTLATFKGSPNGQGTSLRLKTTCPRLLPPGSLEPVWAHIPEGKT-YLDLLGDTRFCP 903
Query: 219 HVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKG 278
G T R++D +Y GC+PV++ + + + + +W FS+ V ++ L++IL G
Sbjct: 904 IPFGTAGWTYRLSDVVYAGCIPVLVGDQSHMTYWSMFDWSLFSVQVFEHELDHLERILSG 963
Query: 279 ISSEEYLLLQSYVLKVRKHFQWHVFPSD 306
I+ E+ Q ++ VR+ F ++PS+
Sbjct: 964 ITEEDAQRKQDALMLVREAF---LYPSE 988
>gi|68052297|sp|Q5IGR6.1|EXT1C_DANRE RecName: Full=Exostosin-1c; AltName:
Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
acetylglucosaminyl-proteoglycan
4-alpha-N-acetylglucosaminyltransferase 1c; AltName:
Full=Multiple exostoses protein 1 homolog c
gi|56785795|gb|AAW29035.1| EXT1c [Danio rerio]
Length = 737
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 81/357 (22%), Positives = 134/357 (37%), Gaps = 83/357 (23%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFL 61
FRVY+YP + G SE Y K + +S + T DP +A LF L
Sbjct: 105 GFRVYIYPPEK----------------GERVSEGYRKILTSVSESRYYTSDPREACLFVL 148
Query: 62 PFSIARMRHDRRIGTEGIPDFISHYIFNISQK---YPYWNRTGGADHFYVACHSIGRSAM 118
I + D+ ++ N+ ++ YP WN G +H +S
Sbjct: 149 --GIDTLDRDQ---------LSQQFVPNVDERIRGYPLWN--DGRNHVIFNLYSGTWPNY 195
Query: 119 QKAWEVKLNAIQVVCSSSYFISGHIAHK-DVSLPQIWPRQEDPPKLGSS----------K 167
+ ++ N Q + + + + H D+S+P +E P K G +
Sbjct: 196 TE--DLGFNVGQAILAKASLNTEHFRPGFDISIPLF--SKEHPQKGGKRGWLVRNSVPPR 251
Query: 168 RNKLAFFAG-----AVNSPVREKL-----------LQVWRNDSEIYAHSGRL-------- 203
R L F G + S R L L R+ + H
Sbjct: 252 RKYLLMFKGKRYLTGIGSDTRNALHHIHNGKDIVSLTTCRHGKDWEKHKDARCDHDNQEY 311
Query: 204 -KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
+ Y + L S FCL +G + + R +SL C+PV+++N ++LPF+DV+ W
Sbjct: 312 ERFDYQELLHNSTFCLVPRGRRLGSFRFLESLQAACIPVLLSNGWELPFSDVIQWNQ--- 368
Query: 263 VVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQ--WHVFPSDYDAFYMVMYEL 317
V D +++L + S + VL +R+ Q W + S D + E+
Sbjct: 369 AVVEGD----ERLLLQVPSTVRAVGIDRVLALRQQTQTLWDAYFSSVDKIVLTTLEI 421
>gi|55742053|ref|NP_001006730.1| exostosin 1 [Xenopus (Silurana) tropicalis]
gi|49523045|gb|AAH75481.1| exostoses (multiple) 1 [Xenopus (Silurana) tropicalis]
Length = 738
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 75/363 (20%), Positives = 134/363 (36%), Gaps = 95/363 (26%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
F+VY+YP ++ D SESY A S T DPS+A LF L
Sbjct: 105 GFKVYIYPQQKGDKL----------------SESYQHILAAIESSRLYTSDPSQACLFVL 148
Query: 62 PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
+ H+ + + + + +H IFN+ S +P + G D
Sbjct: 149 SLDTLDRDQLSPQYVHNLKSKVQNLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 202
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
IG++ + KA S + DVS+P +D P+ G K
Sbjct: 203 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGEKGFL 243
Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
R + F G + S R L V + + + +
Sbjct: 244 RFNNIPPMRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDAR 303
Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
K Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V
Sbjct: 304 CDKDNAEYEKYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 363
Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
++W +++ + + ++ I ++ L L+ + F W + S + +
Sbjct: 364 IDWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 418
Query: 315 YEL 317
E+
Sbjct: 419 LEI 421
>gi|405975819|gb|EKC40363.1| Exostosin-1b [Crassostrea gigas]
Length = 714
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 84/358 (23%), Positives = 132/358 (36%), Gaps = 89/358 (24%)
Query: 3 RSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFF 60
R F+VYVYP VL V SESY K +S + T +P +A +F
Sbjct: 85 RGFKVYVYP---------VLERV---------SESYSKIISTIQESKYYTTNPDEACIFI 126
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
L SI + DR I + I I ++ WN D+ V G + M +
Sbjct: 127 L--SIDTL--DRDINSVNSAKNIMPKIAHLGA---LWNNGTWPDYSEVLDFDTGEAIMAR 179
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK------------R 168
A S S F D+SLP + PPK G R
Sbjct: 180 A----------SISESRFRPNF----DISLPLVGSTH--PPKGGERGYMYTSINNIPPLR 223
Query: 169 NKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGRL-------------------- 203
+ L F G V S R L + D + + R
Sbjct: 224 HYLLGFKGKRYLTGVGSETRNSLYHMHNGDDIVLLTTCRHGKFWQKKAKELNDTRCDIDN 283
Query: 204 ----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKS 259
+ Y L + FCL +G + + R ++L GC+PV+++N ++LPF +V++WK
Sbjct: 284 REFDRYDYKKLLYNATFCLVPRGRRLGSFRFLETLQAGCIPVLLSNGWELPFGEVIDWKK 343
Query: 260 FSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
++ + + I+ G+S E + ++ + F W + S D + E+
Sbjct: 344 AAVWADERLLFQVPSIVHGLSQPEI-----FAMRQQTQFLWEAYFSSMDKIINTVLEI 396
>gi|341881773|gb|EGT37708.1| CBN-RIB-1 protein [Caenorhabditis brenneri]
Length = 383
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 68/297 (22%), Positives = 115/297 (38%), Gaps = 61/297 (20%)
Query: 27 FEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHY 86
FE A S K F +S T +P++A +F L DR + ++ ++ Y
Sbjct: 59 FEVAPQSAIYSKILKYFQESEHYTNNPNEACIFLLGIDTT----DRDVRSQNYVKNVNEY 114
Query: 87 IFNISQKYPYWNRTGGADH----FYVAC------HSIGRSAMQKAWEVKLNAIQVVCSSS 136
I ++ WN G +H FY H++G + + S +
Sbjct: 115 IDSLDP--AVWN--NGRNHLIFNFYHGTFPDYDDHNLGFDTGEAM------IARASSSEN 164
Query: 137 YFISGHIAHKDVSLP--------QIWPRQEDPPKLGSSKRNKLAFFAG-----AVNSPVR 183
F G DVSLP + P+ + + S+R L F G + S R
Sbjct: 165 NFFKGF----DVSLPLFHENHPDDVEPKVKIDDQRNESRRKYLVSFKGKRYVYGIGSGTR 220
Query: 184 EKLLQVWRNDSEIYAHSGRLKTP--------------------YADELLGSKFCLHVKGF 223
+ + D + + + Y D L S FCL +G
Sbjct: 221 NLVHHLHNGDDMVMVTTCKHNNDWQVYQDDRCQRDNDEYDQWDYEDLLTNSTFCLVPRGR 280
Query: 224 EVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGIS 280
+ + R ++L GC+PV+I++ + LPF + ++W S +IVVA D + ++L +S
Sbjct: 281 RLGSFRFLETLRSGCIPVVISDSWILPFTETIDWSSAAIVVAERDALSIPELLMSMS 337
>gi|326932982|ref|XP_003212589.1| PREDICTED: exostosin-1c-like [Meleagris gallopavo]
Length = 734
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 81/372 (21%), Positives = 141/372 (37%), Gaps = 84/372 (22%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFL 61
F+V+ YP +P SESY K + S + T P +A LF L
Sbjct: 103 GFKVFTYPREHGEPM----------------SESYSKILASIESSRYYTPHPEEACLFVL 146
Query: 62 PFSIARMRHDRRIGTEGIPDFISHYIFNISQK---YPYWNRTGGADHFYVACHSIGRSAM 118
SI + D +HYI ++ Q+ +P WN G +H ++
Sbjct: 147 --SIDTLDRD---------PLSAHYIRSVEQRIRSFPLWN--SGRNHLIFNLYAGTWPNY 193
Query: 119 QKAWEVKLNAIQVVCSSSYFISGHIAHK-DVSLPQIWPRQEDPPKLGSS----------K 167
+ E+ + + + + F S + DVS+P ++PR E P + G S K
Sbjct: 194 TE--ELGFDIGHAMLAKASFHSKNFRPGFDVSIP-LFPR-EHPQRGGQSGWLHHNSVPPK 249
Query: 168 RNKLAFFAG-----AVNSPVREKLLQV-----------------WRNDSEIYAHSGRL-- 203
+ L F G + S R L + W + +
Sbjct: 250 KKYLLVFKGKRYLTGIGSGTRNALHHIHNGRDIVSLTTCRHGKDWERHKDTRCDQDNVDY 309
Query: 204 -KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
K Y + L S FC+ +G + + R ++L C+PV++++ ++LPF++ ++W ++
Sbjct: 310 EKFDYQELLHNSTFCIVPRGRRLGSFRFLEALQAACIPVLLSDGWELPFSEAIDWGKAAV 369
Query: 263 VVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYE-----L 317
V + + + ++ I E L Q + F W + S D E L
Sbjct: 370 VGSERLLLQIPSAVRCIRPERVLAFQQ-----QTQFLWDAYFSSVDKIVHTTLEIIRDRL 424
Query: 318 WLRRSSVRVQWS 329
RS R+ W+
Sbjct: 425 LPHRSRSRILWN 436
>gi|15237870|ref|NP_197191.1| Exostosin family protein [Arabidopsis thaliana]
gi|9755690|emb|CAC01702.1| putative protein [Arabidopsis thaliana]
gi|15810401|gb|AAL07088.1| unknown protein [Arabidopsis thaliana]
gi|23296585|gb|AAN13125.1| unknown protein [Arabidopsis thaliana]
gi|332004972|gb|AED92355.1| Exostosin family protein [Arabidopsis thaliana]
Length = 511
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 67/267 (25%), Positives = 120/267 (44%), Gaps = 33/267 (12%)
Query: 95 PYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVV----CSSSYFISGHIA-HKDVS 149
P W R+ G DH + H +++K + NAI ++ + +++ G ++ KD+
Sbjct: 228 PAWKRSEGRDHIFPIHHPWSFKSVRKFVK---NAIWLLPDMDSTGNWYKPGQVSLEKDLI 284
Query: 150 LPQIWPRQE--DPPKLGSSK--RNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSG 201
LP + P + D L S R L FF G + +R KL +I G
Sbjct: 285 LPYV-PNVDICDTKCLSESAPMRTTLLFFRGRLKRNAGGKIRAKLGAELSGIKDIIISEG 343
Query: 202 RL----KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNW 257
K + S FCL G ++AR+ D++ GC+PVI+++ + PF +L++
Sbjct: 344 TAGEGGKLAAQRGMRRSLFCLCPAGDTPSSARLFDAIVSGCIPVIVSDELEFPFEGILDY 403
Query: 258 KSFSIVVATLDI---PLLKKILKGISSEEYLLLQSYVLKVRKHFQWH-----VFPSDYD- 308
K +++V++ D L L+ ++ + LQ+ + + +HF + + P D
Sbjct: 404 KKVAVLVSSSDAIQPGWLVNHLRSLTPFQVKGLQNNLAQYSRHFLYSSPAQPLGPEDLTW 463
Query: 309 ---AFYMVMYELWLRRSSVRVQWSTSL 332
A +V +L RRS V+ S S+
Sbjct: 464 RMIAGKLVNIKLHTRRSQRVVKGSRSI 490
>gi|256079045|ref|XP_002575801.1| exostosin-2 [Schistosoma mansoni]
gi|353230856|emb|CCD77273.1| putative exostosin-2 [Schistosoma mansoni]
Length = 1001
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 51/92 (55%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y+ L S+FCL + +N + DSL GC+PVI+ + LPF+++L+W +I +
Sbjct: 607 YSVSLCSSEFCLIIDADNLNRFNLYDSLACGCIPVIVNDDIVLPFSEILDWYKIAIRIPQ 666
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
+ + IL SS+E LL+ ++ + + +
Sbjct: 667 VKFQKIPSILSTYSSKEKFLLRKQIMFIYQRY 698
>gi|393907069|gb|EFO19790.2| exostosin-1 [Loa loa]
Length = 677
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 68/319 (21%), Positives = 127/319 (39%), Gaps = 60/319 (18%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLPF 63
+FRVYVYP + + V + K +S + T DP KA LF L
Sbjct: 80 AFRVYVYPSDNSSAMSIVY--------------NNILKVIRESMYYTDDPQKACLFVL-- 123
Query: 64 SIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYV----------ACHSI 113
I + DRR +E +++ I N+ + WNR G +H + H +
Sbjct: 124 GIDTVDRDRR--SENYVKYVNELIGNLPTEI--WNR--GRNHVIFNLYHGTYPDYSDHDL 177
Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKD----VSLPQIWPRQEDPPKLGSSKRN 169
G A + +A V ++ +S + HK+ ++ W + L S K
Sbjct: 178 GFDT-GYALIARASANTRVFRENFDLSFPLFHKEHPLRTTIKAEWSLKIKDKYLVSFKGK 236
Query: 170 KLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTP--------------------YAD 209
+ + + S R+ L + S I + + T Y
Sbjct: 237 RYVY---GIGSETRDSLYHLHNGQSVIMVTTCKHNTDWKKYEDGRCEEDNVEYDHWDYEM 293
Query: 210 ELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI 269
+ S FCL +G + + R ++L GC+PV++++ ++LPF++V++W+ ++ +
Sbjct: 294 TMANSTFCLTPRGRRLGSFRFLEALRLGCIPVVLSDDWELPFSEVIDWRQAVVIGHEDTV 353
Query: 270 PLLKKILKGISSEEYLLLQ 288
+ +L I + L ++
Sbjct: 354 LTISDVLSAIPFDRILFMK 372
>gi|256079047|ref|XP_002575802.1| exostosin-2 [Schistosoma mansoni]
gi|353230857|emb|CCD77274.1| putative exostosin-2 [Schistosoma mansoni]
Length = 1022
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 51/92 (55%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y+ L S+FCL + +N + DSL GC+PVI+ + LPF+++L+W +I +
Sbjct: 628 YSVSLCSSEFCLIIDADNLNRFNLYDSLACGCIPVIVNDDIVLPFSEILDWYKIAIRIPQ 687
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
+ + IL SS+E LL+ ++ + + +
Sbjct: 688 VKFQKIPSILSTYSSKEKFLLRKQIMFIYQRY 719
>gi|312084451|ref|XP_003144281.1| exostosin-1 [Loa loa]
Length = 694
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 68/319 (21%), Positives = 127/319 (39%), Gaps = 60/319 (18%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLPF 63
+FRVYVYP + + V + K +S + T DP KA LF L
Sbjct: 80 AFRVYVYPSDNSSAMSIVY--------------NNILKVIRESMYYTDDPQKACLFVL-- 123
Query: 64 SIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYV----------ACHSI 113
I + DRR +E +++ I N+ + WNR G +H + H +
Sbjct: 124 GIDTVDRDRR--SENYVKYVNELIGNLPTEI--WNR--GRNHVIFNLYHGTYPDYSDHDL 177
Query: 114 GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKD----VSLPQIWPRQEDPPKLGSSKRN 169
G A + +A V ++ +S + HK+ ++ W + L S K
Sbjct: 178 GFDT-GYALIARASANTRVFRENFDLSFPLFHKEHPLRTTIKAEWSLKIKDKYLVSFKGK 236
Query: 170 KLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTP--------------------YAD 209
+ + + S R+ L + S I + + T Y
Sbjct: 237 RYVY---GIGSETRDSLYHLHNGQSVIMVTTCKHNTDWKKYEDGRCEEDNVEYDHWDYEM 293
Query: 210 ELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDI 269
+ S FCL +G + + R ++L GC+PV++++ ++LPF++V++W+ ++ +
Sbjct: 294 TMANSTFCLTPRGRRLGSFRFLEALRLGCIPVVLSDDWELPFSEVIDWRQAVVIGHEDTV 353
Query: 270 PLLKKILKGISSEEYLLLQ 288
+ +L I + L ++
Sbjct: 354 LTISDVLSAIPFDRILFMK 372
>gi|441647993|ref|XP_004090847.1| PREDICTED: LOW QUALITY PROTEIN: exostosin-1 [Nomascus leucogenys]
Length = 746
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 76/363 (20%), Positives = 134/363 (36%), Gaps = 95/363 (26%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
F+VYVYP ++ + A ESY A S F T DPS+A LF L
Sbjct: 113 GFKVYVYPQQKGEKIA----------------ESYQNILAAIEGSRFYTSDPSQACLFXL 156
Query: 62 PFSI-------ARMRHDRRIGTEGI---PDFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
+ H+ R + + + +H IFN+ S +P + G D
Sbjct: 157 SLDTLDRDQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFD------ 210
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
IG++ + KA S + DVS+P +D P+ G +
Sbjct: 211 --IGQAMLAKA--------------SISTENFRPNFDVSIPLF---SKDHPRTGGERGFL 251
Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR------------ 202
R + F G + S R L V + + + +
Sbjct: 252 KFNTIPPLRKYMLVFKGKRYLTGIGSDTRNALYHVHNGEDVVLLTTCKHGKDWQKHKDSR 311
Query: 203 --------LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
K Y + L + FCL +G + + ++L CVPV+++N ++LPF++V
Sbjct: 312 CDRDNTEYEKYDYREMLHNATFCLVPRGRRLGSFIFLEALQAACVPVMLSNGWELPFSEV 371
Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
+NW +++ + + ++ I ++ L L+ + F W + S + +
Sbjct: 372 INWNQAAVIGDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 426
Query: 315 YEL 317
E+
Sbjct: 427 LEI 429
>gi|17541994|ref|NP_502180.1| Protein RIB-1 [Caenorhabditis elegans]
gi|6434267|emb|CAB61014.1| Protein RIB-1 [Caenorhabditis elegans]
gi|130381603|dbj|BAF48989.1| heparan sulfate polymerase [Caenorhabditis elegans]
Length = 382
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 76/325 (23%), Positives = 126/325 (38%), Gaps = 81/325 (24%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLPF 63
S +VY++P + FE S K F++S+ T DP++A +F L
Sbjct: 44 SKKVYIHPMEKR-----------FEESPQSVIYSKILKHFLESNHYTNDPNEACIFLLGI 92
Query: 64 SIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADH----FYVAC------HSI 113
DR + ++ ++ YI ++ WN G +H FY H++
Sbjct: 93 DTT----DRDVRSQNYVKNVNDYIESLDP--SVWN--NGRNHLIFNFYHGTFPDYDDHNL 144
Query: 114 ----GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRN 169
G + + +A + N I+V DVSLP K S+RN
Sbjct: 145 NFDTGEAMIARASSSENNFIKVF--------------DVSLPLFHENHPYEIKESKSERN 190
Query: 170 K---------LAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGRLKTPYA------- 208
L F G + S R + + D + + + +
Sbjct: 191 DDRIENQRKYLVSFKGKRYVYGIGSGTRNLVHHLHNGDDIVMVTTCKHNNDWQVYQDDRC 250
Query: 209 ------------DELLG-SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVL 255
DELL S FCL +G + + R ++L GCVPV+I++ + LPF++ +
Sbjct: 251 QRDNDEYDRWEYDELLANSTFCLVPRGRRLGSFRFLETLRSGCVPVVISDSWILPFSETI 310
Query: 256 NWKSFSIVVATLDIPLLKKILKGIS 280
+W S +IVVA D + ++L S
Sbjct: 311 DWNSAAIVVAERDALSIPELLMSTS 335
>gi|159483641|ref|XP_001699869.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158281811|gb|EDP07565.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 427
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 68/304 (22%), Positives = 111/304 (36%), Gaps = 54/304 (17%)
Query: 39 FKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWN 98
F + + T +P +AD F++P S ++ + + IS +P+WN
Sbjct: 125 FMERILSGGHRTHNPEEADFFYIPGSSRDLKKAF---------LLQPLLAYISTTWPFWN 175
Query: 99 RTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYF---------------ISGHI 143
TGGA H A +G + +V+L V + I +
Sbjct: 176 ATGGARHIMPAEGDVGTCEL--PLKVRLFTANVTWLQFWGMYDFHPHWTQIFHNRIPCMV 233
Query: 144 AHKDVSLPQIWPRQED------PPKLGSSKRNKLA--FFAGAVNSPVREKLL-------- 187
+D+ +P + D P + KRN+ FFAG V ++ L
Sbjct: 234 PGRDIVVPFMAMSSHDRFVIETPLHPRNQKRNRTNTFFFAGGVCGSGNKRALPPHCTYYK 293
Query: 188 QVWRNDS---EIYAH----SGRLKTP----YADELLGSKFCLHVKGFEVNTARIADSLYY 236
QV + +Y H +G P YA + S+FCL G I ++Y
Sbjct: 294 QVRYSGGVRQAVYLHFHNRTGWRVVPGTDDYARDYASSRFCLAAAGGGWGKRGIVAAMY- 352
Query: 237 GCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRK 296
GC+PV + F ++W F + + +IP L L+ S E +Q +
Sbjct: 353 GCIPVAATDMLYEAFEPEMDWGRFGVRITQAEIPQLADKLEAYSEAEVARMQERTACAAQ 412
Query: 297 HFQW 300
H W
Sbjct: 413 HLHW 416
>gi|343171904|gb|AEL98656.1| exostosin-like protein, partial [Silene latifolia]
Length = 250
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 42/160 (26%), Positives = 76/160 (47%), Gaps = 17/160 (10%)
Query: 168 RNKLAFFAGAV----NSPVREKLLQVWRNDSEIY-------AHSGRLKTPYADELLGSKF 216
R L FF G VR +L ++ + +I+ S +L T + SKF
Sbjct: 67 RTTLLFFRGRTVRKDEGVVRARLAKILVDYEDIHYERSVASEESIKLST---QGMRSSKF 123
Query: 217 CLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT---LDIPLLK 273
CL G ++ R+ D++ CVPVI+++ +LPF D L++ +FSI +T L +
Sbjct: 124 CLDPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPFEDELDYSNFSIFFSTEEALKPGYMV 183
Query: 274 KILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
+ L+ + E++L + + + H+++ P D M+
Sbjct: 184 EELRKVPEEKWLNMYKKLKNISHHYEFQYPPKREDGVNML 223
>gi|308491504|ref|XP_003107943.1| hypothetical protein CRE_12808 [Caenorhabditis remanei]
gi|308249890|gb|EFO93842.1| hypothetical protein CRE_12808 [Caenorhabditis remanei]
Length = 847
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 76/312 (24%), Positives = 124/312 (39%), Gaps = 56/312 (17%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLPF 63
S +VY++P + FE S K F +S T DP++A LF L
Sbjct: 13 SRKVYIHPMEKR-----------FEESPQSVIYSKILKHFQESEHYTNDPNEACLFLLGI 61
Query: 64 SIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHF-YVACHSIGRSAMQKAW 122
DR I + ++ YI ++ Q WN G +H Y H G
Sbjct: 62 DTT----DRDIRSLNYVKNVNEYIESLDQAV--WN--NGRNHLIYNFYH--GTFPDYDDH 111
Query: 123 EVKLNAIQ-VVCSSSYFISGHIAHKDVSLP--------QIWPRQEDPPKLGSSKRNKLAF 173
+ N + ++ +S + D+SLP QI ++ + KR LA
Sbjct: 112 NLNFNTGEAIIARASSSENNFFKDFDISLPLFHENHPYQIESQRALHNEPKEEKRRYLAS 171
Query: 174 FAG-----AVNSPVREKLL--------------------QVWRNDSEIYAHSGRLKTPYA 208
F G + S R + QV+++D + + Y
Sbjct: 172 FKGKRYVYGIGSGTRNLVHHLHNGDDIVMVTTCKHNNDWQVYQDDRCQRDNEEYDRWEYD 231
Query: 209 DELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLD 268
D L S FCL +G + + R ++L GC+PV+I++ + LPF++ ++W S +IVVA D
Sbjct: 232 DLLSNSTFCLVPRGRRLGSFRFLETLRSGCIPVVISDSWILPFSETIDWHSAAIVVAERD 291
Query: 269 IPLLKKILKGIS 280
+ ++L S
Sbjct: 292 ALSIPELLMSTS 303
>gi|3023724|sp|O01704.1|EXT1_CAEEL RecName: Full=Multiple exostoses homolog 1; AltName: Full=Related
to mammalian RIB protein 1
gi|2058697|gb|AAC47509.1| multiple exostoses homolog 1 [Caenorhabditis elegans]
Length = 378
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 76/325 (23%), Positives = 126/325 (38%), Gaps = 81/325 (24%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLPF 63
S +VY++P + FE S K F++S+ T DP++A +F L
Sbjct: 40 SKKVYIHPMEKR-----------FEESPQSVIYSKILKHFLESNHYTNDPNEACIFLLGI 88
Query: 64 SIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADH----FYVAC------HSI 113
DR + ++ ++ YI ++ WN G +H FY H++
Sbjct: 89 DTT----DRDVRSQNYVKNVNDYIESLDP--SVWN--NGRNHLIFNFYHGTFPDYDDHNL 140
Query: 114 ----GRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRN 169
G + + +A + N I+V DVSLP K S+RN
Sbjct: 141 NFDTGEAMIARASSSENNFIKVF--------------DVSLPLFHENHPYEIKESKSERN 186
Query: 170 K---------LAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGRLKTPYA------- 208
L F G + S R + + D + + + +
Sbjct: 187 DDRIENQRKYLVSFKGKRYVYGIGSGTRNLVHHLHNGDDIVMVTTCKHNNDWQVYQDDRC 246
Query: 209 ------------DELLG-SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVL 255
DELL S FCL +G + + R ++L GCVPV+I++ + LPF++ +
Sbjct: 247 QRDNDEYDRWEYDELLANSTFCLVPRGRRLGSFRFLETLRSGCVPVVISDSWILPFSETI 306
Query: 256 NWKSFSIVVATLDIPLLKKILKGIS 280
+W S +IVVA D + ++L S
Sbjct: 307 DWNSAAIVVAERDALSIPELLMSTS 331
>gi|401888173|gb|EJT52138.1| hypothetical protein A1Q1_06676 [Trichosporon asahii var. asahii CBS
2479]
Length = 1189
Score = 57.8 bits (138), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 26/103 (25%), Positives = 56/103 (54%), Gaps = 3/103 (2%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y +EL ++FC +G + R+ D+++ GC+PV+ + PFA +++W S+ V
Sbjct: 1063 YLEELNTARFCPQPRGIAGWSPRVNDAIFAGCIPVLTSEDTHYPFAGLIDWSQISVRVHP 1122
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDA 309
++ ++++L I +Q+ ++ +R F ++P+D A
Sbjct: 1123 TELDHVEELLASIPLARLEQIQANIVAIRDAF---MYPADGKA 1162
>gi|198437606|ref|XP_002128024.1| PREDICTED: similar to Exostosin-1
(Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
acetylglucosaminyl-proteoglycan
4-alpha-N-acetylglucosaminyltransferase) (Putative tumor
suppressor protein EXT1) (Multiple exostoses protein 1)
[Ciona intestinalis]
Length = 766
Score = 57.4 bits (137), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 77/356 (21%), Positives = 129/356 (36%), Gaps = 82/356 (23%)
Query: 4 SFRVYVYP---HRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
F+VYVYP + + F N+L + S + T DP KA LF
Sbjct: 126 GFKVYVYPDVGEKTSTNFQNIL------------------ASLRASQYYTSDPEKACLFV 167
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQK---YPYWNRTGGADHFYVACHSIGRSA 117
+ H + YI N+ K YWN G +H +S +
Sbjct: 168 PAYDTLDRDH-----------LSADYIHNLGAKISRLKYWN--NGKNHIIFNLYS--GTW 212
Query: 118 MQKAWEVKLNAIQVVCSSSYFISGHIAHK-DVSLPQIWPRQEDPPKLGSS---------- 166
+ +V N + + + + F + H D+S P I + P G+
Sbjct: 213 PEYLEDVGFNLGEAILAKASFGDNYYRHGFDISFPLI--GKTHPHMQGTQGFLKANYFPP 270
Query: 167 KRNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHS---GRLKTPYADE-------- 210
+R L F G + S R L + D I + G+ ++D+
Sbjct: 271 RRKYLLSFKGKRYTYGIGSSTRNALYHIHNGDDIIILTTCKHGKNWQSFSDQKCETDNEE 330
Query: 211 ---------LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFS 261
L S FC+ +G + + R +SL C+PV++AN + LPF +V++W S
Sbjct: 331 YEKWDYQSLLHNSTFCMVPRGRRLGSFRFLESLQAACIPVVLANGWKLPFDEVIDWSKAS 390
Query: 262 IVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
+ + + IL+ + +LL+ + F W + S D E+
Sbjct: 391 LAWEERLLLQVPGILREVQDNRIMLLRQ-----QSQFLWDKYFSSMDVIIRSTLEI 441
>gi|326429784|gb|EGD75354.1| hypothetical protein PTSG_06430 [Salpingoeca sp. ATCC 50818]
Length = 351
Score = 57.4 bits (137), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 34/97 (35%), Positives = 57/97 (58%), Gaps = 7/97 (7%)
Query: 204 KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGC--VPVIIANHYDLPFADVLNWKSFS 261
K Y + L +KF L V+GF ++ R Y+GC +PVI+ +HY LP+ D+L+W++FS
Sbjct: 193 KYTYTELALETKFGLIVEGFGYHSLR-----YHGCGRLPVIVVDHYVLPYQDLLDWETFS 247
Query: 262 IVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
+ + + L +IL+ I E ++Q V+ V + F
Sbjct: 248 MRIPEHRLLELPRILRSIPDEVVEMIQRRVVFVFEDF 284
>gi|302843170|ref|XP_002953127.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300261514|gb|EFJ45726.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 616
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/122 (31%), Positives = 58/122 (47%), Gaps = 10/122 (8%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y ++ SKFCL G ++ YGC+PV I ++ PF L+W +FS+ V
Sbjct: 413 YVRDMSTSKFCLAPTGGGHGKRQVLVG-RYGCIPVPITDYVLQPFEPELDWPAFSVTVKE 471
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWH------VFPSD--YDAFYMVMYELW 318
D+P L IL I+ + +Q + KH W+ +F D YDAF +M L
Sbjct: 472 EDVPNLHTILAAINDTKLAEMQRALACAAKHL-WYSSMWGAIFGEDSRYDAFATLMEILR 530
Query: 319 LR 320
+R
Sbjct: 531 VR 532
>gi|402589373|gb|EJW83305.1| hypothetical protein WUBG_05784 [Wuchereria bancrofti]
Length = 689
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 73/325 (22%), Positives = 128/325 (39%), Gaps = 72/325 (22%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLPF 63
+FRVYVYP + + V S K +S + T DP KA LF L
Sbjct: 81 AFRVYVYPSDNSSAMSVVY--------------SNILKVIRESIYYTDDPQKACLFVL-- 124
Query: 64 SIARMRHDRRIGT---------EGIPDFI-----SHYIFNISQ-KYP-YWNRTGGADHFY 107
I + DRR E +P I +H IFN+ YP Y + G D Y
Sbjct: 125 GIDTVDRDRRSENYVKYVNELIENLPSEIWNHGRNHIIFNLYHGTYPDYSDHDLGFDTGY 184
Query: 108 VACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKD----VSLPQIWPRQEDPPKL 163
I R++ A + ++ +S + HK+ ++ W + L
Sbjct: 185 AV---IARAS----------ANTRIFRENFDLSFPLFHKEHPLRTTVKAKWSLKIKDKYL 231
Query: 164 GSSKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKT---PYADE---------- 210
+ K + + + S R+ L + S + + + T Y DE
Sbjct: 232 VTFKGKRYVY---GIGSETRDSLYHLHNGQSVVMVTTCKHNTDWKKYEDERCEEDNVEYD 288
Query: 211 -------LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIV 263
+ S FCL +G + + R ++L GC+PV++++ ++LPF++V++W+ I+
Sbjct: 289 HWDYEMIMSNSTFCLTPRGRRLGSFRFLEALRLGCIPVVLSDDWELPFSEVIDWRQAVII 348
Query: 264 VATLDIPLLKKILKGISSEEYLLLQ 288
+ + +L I + L ++
Sbjct: 349 GHEDTVLTISDVLSAIPLDRILFMK 373
>gi|343171902|gb|AEL98655.1| exostosin-like protein, partial [Silene latifolia]
Length = 250
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 41/160 (25%), Positives = 76/160 (47%), Gaps = 17/160 (10%)
Query: 168 RNKLAFFAGAV----NSPVREKLLQVWRNDSEIY-------AHSGRLKTPYADELLGSKF 216
R L FF G VR +L ++ + +++ S +L T + SKF
Sbjct: 67 RTTLLFFRGRTVRKDEGVVRARLAKILVDYEDVHYERSVASEESIKLST---QGMRSSKF 123
Query: 217 CLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT---LDIPLLK 273
CL G ++ R+ D++ CVPVI+++ +LPF D L++ +FSI +T L +
Sbjct: 124 CLDPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPFEDELDYSNFSIFFSTEEALKPGYMV 183
Query: 274 KILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
+ L+ + E++L + + + H+++ P D M+
Sbjct: 184 EELRKVPEEKWLNMYKKLKNISHHYEFQYPPKREDGVNML 223
>gi|1619954|gb|AAB17006.1| multiple exostosis 2 protein [Mus musculus]
Length = 718
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 46/82 (56%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC ++ + A ++D L GCVPV+IA+ Y LPF++VL+WK S+VV
Sbjct: 308 YPQVLQEATFCTVLRRARLGQAVLSDVLQAGCVPVVIADSYILPFSEVLDWKKASVVVPE 367
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+ + IL+ I + +Q
Sbjct: 368 EKMSDVYSILQNIPQRQIEEMQ 389
>gi|110637313|ref|YP_677520.1| hypothetical protein CHU_0899 [Cytophaga hutchinsonii ATCC 33406]
gi|110279994|gb|ABG58180.1| conserved hypothetical protein [Cytophaga hutchinsonii ATCC 33406]
Length = 330
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 47/147 (31%), Positives = 80/147 (54%), Gaps = 15/147 (10%)
Query: 167 KRNKLAF-FAGAVNSPVREKLLQVWRNDS------EI---YAHSGRLKTPYADELLGSKF 216
+ KL F F+G+ + P R KL +RN+S EI Y HS K Y +++L S F
Sbjct: 131 QNKKLLFTFSGSCSHPFRIKLFNAYRNESSEYKVAEIKRWYNHSDFEKETYLEDILSSYF 190
Query: 217 CLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKIL 276
L +G + RI +++ G VPVIIA+ + +PF+ + ++ + +A D+ + IL
Sbjct: 191 VLCPRGIASYSHRIIETMALGSVPVIIADEW-VPFS--IEEDNYYVRIAESDVENIYAIL 247
Query: 277 KGISSEEYLLLQSYVLKV-RKHFQWHV 302
K + +Y L++ V V +K+F+ H+
Sbjct: 248 KAKQT-DYENLRNNVSDVYKKYFESHI 273
>gi|383847805|ref|XP_003699543.1| PREDICTED: exostosin-2-like [Megachile rotundata]
Length = 707
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 70/303 (23%), Positives = 128/303 (42%), Gaps = 44/303 (14%)
Query: 7 VYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAF--MKSHFVTKDPSKADLFFLPFS 64
VY+YP + + + P + S+ +++ F + S F T +P +A F+P S
Sbjct: 102 VYIYPPK-------IYVDALERPVTSQMSKEFYQILFTIISSKFYTPNPYEA-CIFIP-S 152
Query: 65 IARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGA--DHFYVACHSIGRSAMQKAW 122
+ + +R E S +N + + +N G+ D+ V S+G++ + A
Sbjct: 153 LDTLNQNRLKLQEVSQALKSLPFWNNGENHVVFNMVSGSVPDYNTVIDVSVGKAMIAGAG 212
Query: 123 EVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGA-VNSP 181
SS + SG D+SLP P + S R LA + +NS
Sbjct: 213 ----------MSSLTYRSGF----DISLPVYSPLTNNFKPNISDTRLWLAISSQININSA 258
Query: 182 VREKLLQV----------------WRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEV 225
+ LL++ + + + +G Y D L + FCL ++G +
Sbjct: 259 FEQDLLEIKSLSPKDILILGPCLQYSSMNSTIRCAGIDVYKYPDVLQTATFCLVIRGARL 318
Query: 226 NTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYL 285
+ + + + G +PVIIA+ +PF DV++W +I + +DI + +LK +S +
Sbjct: 319 GQSTLLECMATGSIPVIIADSLAMPFHDVIDWTRAAIFIREVDILSVISVLKKVSPKRIT 378
Query: 286 LLQ 288
LQ
Sbjct: 379 ELQ 381
>gi|157110617|ref|XP_001651176.1| exostosin-2 [Aedes aegypti]
gi|108878646|gb|EAT42871.1| AAEL005626-PA [Aedes aegypti]
Length = 711
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 44/76 (57%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L +FCL +G ++ + D+L GC+PVI+A++ LPF+D+L+W SI +
Sbjct: 303 YPGVLENGQFCLIARGVRLSQPTLMDALAAGCIPVIMADNLILPFSDILDWDLISIRIYE 362
Query: 267 LDIPLLKKILKGISSE 282
++ + LK +S E
Sbjct: 363 NNLHSVITTLKAVSKE 378
>gi|302794690|ref|XP_002979109.1| hypothetical protein SELMODRAFT_418827 [Selaginella moellendorffii]
gi|300153427|gb|EFJ20066.1| hypothetical protein SELMODRAFT_418827 [Selaginella moellendorffii]
Length = 435
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/101 (27%), Positives = 60/101 (59%), Gaps = 3/101 (2%)
Query: 203 LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
L+T ++ S FCL G ++AR+ D++ GC+PVI+++ + PF +++++ ++
Sbjct: 279 LETSWSQRHTFSIFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELEPPFEGLVDYRKVAL 338
Query: 263 V---VATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
V T + L L+ I++ + +L+S++L+ +HFQ+
Sbjct: 339 FVPSVKTTEKGWLVSYLRAITARQLSMLRSHMLEFSRHFQY 379
>gi|47222207|emb|CAG11086.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1040
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/111 (26%), Positives = 52/111 (46%), Gaps = 5/111 (4%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y + L S FCL +G + + R +SL C+PV+++N ++LPF+DV+ W I
Sbjct: 568 YQELLHNSTFCLVPRGRRLGSFRFLESLQAACIPVLLSNGWELPFSDVIQWNQAVIEGDE 627
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
+ + ++ + +E L L+ R W + S D + E+
Sbjct: 628 RLLLQVPSTVRAVGNERVL-----ALRQRTQMLWEAYFSSVDKIVLTTLEI 673
>gi|345480513|ref|XP_001603840.2| PREDICTED: exostosin-1-like isoform 1 [Nasonia vitripennis]
gi|345480515|ref|XP_003424163.1| PREDICTED: exostosin-1-like isoform 2 [Nasonia vitripennis]
Length = 713
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 76/322 (23%), Positives = 122/322 (37%), Gaps = 56/322 (17%)
Query: 45 KSHFVTKDPSKADLFFLPFSIARMRHDRR-IGTEGIPDFISHYIFNISQKYPYWNRTGGA 103
+S + T DPS+A LF L DR + TE + H + + + PYWN G
Sbjct: 115 ESRYYTSDPSRACLFVLALDTL----DRDPLSTEFV-----HNLPSKVARLPYWN--NGK 163
Query: 104 DHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLP---QIWPRQEDP 160
+H +S + ++ +S I DVS+P + P +
Sbjct: 164 NHLIFNLYSGTWPDYIEDAMAFDYGYAMLAKASMSIMKLREDFDVSIPLFTKQHPERGGE 223
Query: 161 PKLG-----SSKRNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGRLKTPY--- 207
P L +K+ LA F G + S R L + I+ + R +
Sbjct: 224 PGLAIHNHFPNKKKYLAAFKGKRYVHGIGSETRNALHHLHNGKDLIFVTTCRHGKSWREL 283
Query: 208 ADE-----------------LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLP 250
DE LL S FCL +G + + R ++L GC+PVI++N + LP
Sbjct: 284 QDEHCPQDIREYDMYDYDVLLLNSTFCLVPRGRRLGSFRFLEALRAGCIPVILSNGWALP 343
Query: 251 FADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAF 310
F D ++W I + + IL+ + E+ + VL+ F W + S +
Sbjct: 344 FHDRIDWFQAVIYADERLLFQVPDILRSVVEEKIM-----VLRQTTQFLWERYFSSLEKI 398
Query: 311 YMVMYEL------WLRRSSVRV 326
+E+ W R RV
Sbjct: 399 IFTTFEIIRERIPWEGRREYRV 420
>gi|68052299|sp|Q5IGR8.1|EXT1A_DANRE RecName: Full=Exostosin-1a; AltName:
Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
acetylglucosaminyl-proteoglycan
4-alpha-N-acetylglucosaminyltransferase 1a; AltName:
Full=Multiple exostoses protein 1 homolog a
gi|56785791|gb|AAW29033.1| EXT1a [Danio rerio]
Length = 730
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 75/365 (20%), Positives = 137/365 (37%), Gaps = 99/365 (27%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
F+VYVYP ++ G SESY + S F T DP +A LF L
Sbjct: 97 GFKVYVYPQQK----------------GEKISESYQNILSSIEGSRFYTSDPGQACLFVL 140
Query: 62 PFSI-------ARMRHDRRIGTEGIP---DFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
+ H+ + + + + +H IFN+ S +P + G D
Sbjct: 141 NLDTLDRDQLSPQYVHNLKTKIQNLNLWNNGRNHLIFNLYSGTWPDYTEDLGFD------ 194
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
IG++ + KA S + D+S+P +D P+ G +
Sbjct: 195 --IGQAMLAKA--------------SISTESFRPNFDISIPLF---SKDHPRTGGERGFL 235
Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQV-----------------WRNDSEIY 197
R + F G + S R L + W+ +
Sbjct: 236 KYNTIPPFRKYMLVFKGKRYLTGIGSDTRNALYHIHNAEDVVLLTTCKHGKDWQKHKDAR 295
Query: 198 AHSGRL---KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
+ Y + L S FCL +G + + R ++L CVPV+++N ++LPF+++
Sbjct: 296 CDKDNAEYDRYDYKEMLHNSTFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEI 355
Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRK--HFQWHVFPSDYDAFYM 312
++W++ +++ +++L I S + Q +L +R+ F W + S + +
Sbjct: 356 IDWRTAAVIGD-------ERLLLQIPSTVRSIHQDRILSLRQQTQFLWEAYFSSVEKIVL 408
Query: 313 VMYEL 317
E+
Sbjct: 409 TTLEI 413
>gi|321468156|gb|EFX79142.1| hypothetical protein DAPPUDRAFT_304945 [Daphnia pulex]
Length = 729
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/111 (25%), Positives = 58/111 (52%), Gaps = 5/111 (4%)
Query: 211 LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIP 270
L S FCL +G + + R + L GC+PV+++N++ +PF+++++WK+ +I +
Sbjct: 307 LHNSTFCLVPRGRRLGSFRFIEVLQAGCIPVLLSNNWVIPFSEIIDWKTSAIWADERLLL 366
Query: 271 LLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
+ I++ I +E + L+ + WH++ S D E+ +R
Sbjct: 367 QVPDIVRSIEAERVM-----ALRQQSQLLWHMYFSSIDRIIFTTLEIVRQR 412
>gi|159490316|ref|XP_001703125.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158270755|gb|EDO96590.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 528
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 37/128 (28%), Positives = 58/128 (45%), Gaps = 8/128 (6%)
Query: 199 HSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWK 258
H R + Y L S +CL G +I +L+ GCVPV IA+ PF +NW
Sbjct: 371 HVARSEKRYGQYLARSLYCLAPPGAGHGQRQI-QALFMGCVPVTIADGVAEPFEPAVNWT 429
Query: 259 SFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWH-----VFPSD--YDAFY 311
+ + VA D+P + +L I E+ + Q+ + +H + VF D +DAF
Sbjct: 430 DWGVRVAEADVPQMHTLLDDIGPEQLAVKQARMRCAAQHMLYSSITGGVFGEDGAFDAFE 489
Query: 312 MVMYELWL 319
+ + L
Sbjct: 490 TTLEVMRL 497
>gi|129270192|ref|NP_001012368.2| exostosin-1a [Danio rerio]
gi|126632203|gb|AAI33118.1| Exostoses (multiple) 1a [Danio rerio]
Length = 730
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 75/365 (20%), Positives = 137/365 (37%), Gaps = 99/365 (27%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
F+VYVYP ++ G SESY + S F T DP +A LF L
Sbjct: 97 GFKVYVYPQQK----------------GEKISESYQNILSSIEGSRFYTSDPGQACLFVL 140
Query: 62 PFSI-------ARMRHDRRIGTEGIP---DFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
+ H+ + + + + +H IFN+ S +P + G D
Sbjct: 141 NLDTLDRDQLSPQYVHNLKTKIQNLNLWNNGRNHLIFNLYSGTWPDYTEDLGFD------ 194
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
IG++ + KA S + D+S+P +D P+ G +
Sbjct: 195 --IGQAMLAKA--------------SISTESFRPNFDISIPLF---SKDHPRTGGERGFL 235
Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQV-----------------WRNDSEIY 197
R + F G + S R L + W+ +
Sbjct: 236 KYNTIPPFRKYMLVFKGKRYLTGIGSDTRNALYHIHNAEDVVLLTTCKHGKDWQKHKDAR 295
Query: 198 AHSGRL---KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
+ Y + L S FCL +G + + R ++L CVPV+++N ++LPF+++
Sbjct: 296 CDKDNAEYDRYDYKEMLHNSTFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEI 355
Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRK--HFQWHVFPSDYDAFYM 312
++W++ +++ +++L I S + Q +L +R+ F W + S + +
Sbjct: 356 IDWRTAAVIGD-------ERLLLQIPSTVRSIHQDRLLSLRQQTQFLWEAYFSSVETIVL 408
Query: 313 VMYEL 317
E+
Sbjct: 409 TTLEI 413
>gi|297738432|emb|CBI27633.3| unnamed protein product [Vitis vinifera]
Length = 441
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/106 (27%), Positives = 58/106 (54%), Gaps = 3/106 (2%)
Query: 211 LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVA---TL 267
+ SKFCLH G ++ R+ D++ CVPVI+++ +LP+ D +++ FSI + L
Sbjct: 305 MRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSDQIELPYEDEIDYTQFSIFFSDKEAL 364
Query: 268 DIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
+ + + L+ I E ++ + ++ + H+++ P DA M+
Sbjct: 365 EPGYMIEQLRQIPKERWVEMWRHLKYISHHYEFQYPPKKGDAIDML 410
>gi|380025736|ref|XP_003696624.1| PREDICTED: LOW QUALITY PROTEIN: exostosin-2-like [Apis florea]
Length = 700
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/82 (32%), Positives = 44/82 (53%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y D L + FCL ++G + + + + G +PVIIA+ +PF +++W I +
Sbjct: 295 YPDVLQTATFCLVIRGARLGQTTLLECMAAGSIPVIIADSLTMPFHGIIDWTRAVIFIRE 354
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+DI L +LK IS E + LQ
Sbjct: 355 VDILSLISVLKKISQERIIELQ 376
>gi|115461230|ref|NP_001054215.1| Os04g0670600 [Oryza sativa Japonica Group]
gi|113565786|dbj|BAF16129.1| Os04g0670600, partial [Oryza sativa Japonica Group]
Length = 275
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 42/145 (28%), Positives = 72/145 (49%), Gaps = 13/145 (8%)
Query: 134 SSSYFISGHIAHKDVSLPQ--IWPRQEDPPKLGSSK---RNKLAFFAGAVN----SPVRE 184
S+S +S I H VSL + I P P + S+ R L +F GA + VRE
Sbjct: 39 SASSNVSHMIQHTQVSLLKDVIVPYTHLLPTMHLSENKDRPTLLYFKGAKHRHRGGLVRE 98
Query: 185 KLLQVWRNDSEIYAHSGRLKTPYADE----LLGSKFCLHVKGFEVNTARIADSLYYGCVP 240
KL + N+ ++ G ++ + S+FCLH G + R+ D++ C+P
Sbjct: 99 KLWDLMVNEPDVVMEEGYPNATGREQSIKGMRTSEFCLHPAGDTPTSCRLFDAVASLCIP 158
Query: 241 VIIANHYDLPFADVLNWKSFSIVVA 265
VI+++ +LPF ++++ F+I V+
Sbjct: 159 VIVSDEIELPFEGMIDYTEFAIFVS 183
>gi|215686890|dbj|BAG89740.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215693850|dbj|BAG89049.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 264
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 42/145 (28%), Positives = 72/145 (49%), Gaps = 13/145 (8%)
Query: 134 SSSYFISGHIAHKDVSLPQ--IWPRQEDPPKLGSSK---RNKLAFFAGAVN----SPVRE 184
S+S +S I H VSL + I P P + S+ R L +F GA + VRE
Sbjct: 28 SASSNVSHMIQHTQVSLLKDVIVPYTHLLPTMHLSENKDRPTLLYFKGAKHRHRGGLVRE 87
Query: 185 KLLQVWRNDSEIYAHSGRLKTPYADE----LLGSKFCLHVKGFEVNTARIADSLYYGCVP 240
KL + N+ ++ G ++ + S+FCLH G + R+ D++ C+P
Sbjct: 88 KLWDLMVNEPDVVMEEGYPNATGREQSIKGMRTSEFCLHPAGDTPTSCRLFDAVASLCIP 147
Query: 241 VIIANHYDLPFADVLNWKSFSIVVA 265
VI+++ +LPF ++++ F+I V+
Sbjct: 148 VIVSDEIELPFEGMIDYTEFAIFVS 172
>gi|1524415|gb|AAC53143.1| multiple exostosis protein [Mus musculus]
Length = 718
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/77 (35%), Positives = 44/77 (57%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC ++ + A ++D L GCVPV+IA+ Y LPF+++L+WK S+VV
Sbjct: 308 YPQVLQEATFCTVLRRARLGQAVLSDVLQAGCVPVVIADSYILPFSEILDWKKASVVVPE 367
Query: 267 LDIPLLKKILKGISSEE 283
+ + IL+ I +
Sbjct: 368 EKMSDVYSILQNIPQRQ 384
>gi|159466264|ref|XP_001691329.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158279301|gb|EDP05062.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 815
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 76/348 (21%), Positives = 135/348 (38%), Gaps = 94/348 (27%)
Query: 32 NYASESYFKKAFMKSHFVTKDPSKADLFFLPF---------------------SIARMRH 70
Y E + ++S T DP AD F++P S R+ H
Sbjct: 410 TYGLEVLMHEMLLQSEHRTFDPEAADFFYVPVYGSCFIFPLHCYADGPWWYAPSGPRVMH 469
Query: 71 DRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHS----------IGRSAMQK 120
+ E + D I + +PYW+R GG DH ++ H G S
Sbjct: 470 VTNMMLE-VRDLIRKH-------FPYWDRRGGRDHIWLMTHDEGACYAPSDIYGSSIFLT 521
Query: 121 AWEVKLNAIQVVCSSS------------------YFISGHIAH---KDVSLPQI------ 153
W + + + + + + I GH + KD+ +P +
Sbjct: 522 HWGRRDSDHKSNTAFTPDNYTQEYVHPMQDKGWLHLIEGHPCYTPGKDLIVPALKLPHHF 581
Query: 154 --WPRQEDPPKLGSSKRNKLAFFAGAVNS--------PVREKLLQVWRND------SEIY 197
P PP+ R+ L + G V +R++L ++W++ + +
Sbjct: 582 SGSPLLFHPPR----PRDILLYLRGDVGKHRLPNYSRGIRQRLYKLWKDHDWQNKYNAMI 637
Query: 198 AHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNW 257
+ Y++ L SKFC+ G + +AR+ D++ +GCVPVI+ ++ F + L++
Sbjct: 638 GDGSDVPGGYSEHLASSKFCVVAPG-DGWSARLEDAVLHGCVPVIVMDNVSAVFEEQLDF 696
Query: 258 KSFSIVV--ATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVF 303
SFSI V A ++ L + LK + +L+ K+R WH +
Sbjct: 697 NSFSIRVGEAEAELAYLPERLKSVPPR---ILEGMQKKLRT--IWHRY 739
>gi|356534007|ref|XP_003535549.1| PREDICTED: LOW QUALITY PROTEIN: xylogalacturonan
beta-1,3-xylosyltransferase-like [Glycine max]
Length = 373
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 59/241 (24%), Positives = 104/241 (43%), Gaps = 31/241 (12%)
Query: 52 DPSKADLFFLPF----SIARMRHDRRIGTEGIPDFISH-YIFNISQKYPYWNRTGGADHF 106
DP AD+ F+PF S + ++ G + D+ + + + WNR+GG DH
Sbjct: 51 DPLLADVVFVPFFATLSANKGAFRKKHGND---DYKRQRQVVDAVKSTQVWNRSGGRDHV 107
Query: 107 YVA----CHSIGRSAMQKA--WEVKLNAIQVVCSSSYFISGHIAHKDVSLPQ--IWPRQE 158
+V C + S + C S + H VS+ + I P
Sbjct: 108 FVLTGAFCKNPSFSFVPGGDFGGWSRGGGGSNCGESDVV----PHTQVSVIKDVIVPYMH 163
Query: 159 DPPKLGSSK---RNKLAFFAGAV----NSPVREKLLQVWRNDSEIYAHSGRLKTPYADE- 210
P+L S+ R++L +F GA +REKL + ++ + G ++
Sbjct: 164 LLPRLDLSENKVRHQLLYFKGAKHRHRGGIIREKLWDLLVSEPGVIMEEGFPNATGREQS 223
Query: 211 ---LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATL 267
+ S+FCLH G + R+ D++ C+PVI+++ +LPF ++++ FS+ A
Sbjct: 224 IKGMRTSEFCLHPAGDTPTSCRLFDAIQSLCIPVIVSDIIELPFEGMVDYAEFSVFPAVN 283
Query: 268 D 268
D
Sbjct: 284 D 284
>gi|268536482|ref|XP_002633376.1| C. briggsae CBR-RIB-1 protein [Caenorhabditis briggsae]
Length = 349
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 43/74 (58%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y D L S FCL +G + + R ++L GC+PV+I++ + LPF++ +W S IVVA
Sbjct: 230 YDDLLANSTFCLVPRGRRLGSFRFLETLRSGCIPVVISDSWVLPFSETTDWNSAVIVVAE 289
Query: 267 LDIPLLKKILKGIS 280
D + ++L S
Sbjct: 290 RDALSIPELLMSTS 303
>gi|384245403|gb|EIE18897.1| hypothetical protein COCSUDRAFT_45043 [Coccomyxa subellipsoidea
C-169]
Length = 420
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 87/380 (22%), Positives = 138/380 (36%), Gaps = 70/380 (18%)
Query: 7 VYVY--PHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLPFS 64
VYVY P + N LP + P Y + K S TKDPS A +F++P
Sbjct: 34 VYVYDLPEKFNKDIKE--LPTIWHPE-QYDIDQVLHKHLTHSEVNTKDPSVAKVFYIPVY 90
Query: 65 IARMRHDR------------------------------RIGTEGIPDFISHYIFNISQKY 94
+ R + + ++ D + I ++ + Y
Sbjct: 91 LGRYFNAQWQRFSDPSDAWLINKECHGLDSVDCWAEKWKVAENATSDLVRSAIAHVKENY 150
Query: 95 PYWNRTGGADHFYVACHSIGRSAMQKAWEV----KLNAIQVVCSSSYFISGHIAHKDV-- 148
PYWN + GADHF V + G+ M KA ++ +IQ S Y + + D
Sbjct: 151 PYWNASNGADHFMVFSYDHGKCEMAKALRFEEFGEMFSIQAYGSLVYRSNAKVQAVDRGD 210
Query: 149 ----SLPQIWPRQEDP-----PKLGSSKRNK-LAFFAGAVNSPVREKLLQVWRNDSEIYA 198
S P W P N L+ FA N + + + + A
Sbjct: 211 SYSWSGPSTWACYRPDADVLVPVFSPYGHNTILSPFAVERNISLLMRFDYPLNDGKSLVA 270
Query: 199 HSG-RLKTPYAD-----ELLGSKFCLH-VKGFEV-------NTARIADSLYYGCVPVIIA 244
H G RL+ D L GS L K + ++AR+ ++ +GC+PV
Sbjct: 271 HHGHRLRKELIDYWQEQPLNGSDLGLRSTKARPIPAARNTQDSARVWRAITFGCIPVTFF 330
Query: 245 NHYDLPFADVLN--WKSFSIVVATLDIPLLKKILKGISSEEYLL--LQSYVLKVRKHFQW 300
+DLPF + + F + + D L ++ + L +Q + + ++ F W
Sbjct: 331 RAFDLPFQRHMGMPYADFVLNIQPDDYRQLNARIQALLDNPSQLRRMQEALERHQRFFVW 390
Query: 301 HVFPSDYDAFYMVMYELWLR 320
D + MV EL LR
Sbjct: 391 DE-AGDGGIYTMVEKELTLR 409
>gi|389744786|gb|EIM85968.1| hypothetical protein STEHIDRAFT_168993 [Stereum hirsutum FP-91666
SS1]
Length = 1111
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 38/117 (32%), Positives = 55/117 (47%), Gaps = 4/117 (3%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L + FC G R+ DS+Y GC+PV+I PF D+L+W SI V
Sbjct: 972 YMSLLNDTVFCPQPAGTTGWATRLVDSMYAGCIPVLIGQASHFPFYDMLDWGKISIRVEP 1031
Query: 267 LDIPLLKKIL-KGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRS 322
D+ L+ IL S ++ LQ+ ++ VR V+P D +V E+ RR
Sbjct: 1032 SDLAQLEDILFSRYSLDDIERLQANIMLVRDAL---VYPLDDVDDTVVREEMIERRG 1085
>gi|296227312|ref|XP_002759355.1| PREDICTED: exostosin-1 [Callithrix jacchus]
Length = 475
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 70/147 (47%), Gaps = 12/147 (8%)
Query: 176 GAVNSPVREKLLQVWRNDSEIYAHSGRLKT---PYADELLGSKFCLHVKGFEVNTARIAD 232
G N P+ L+Q + +I + Y + L + FCL +G + + R +
Sbjct: 19 GMPNIPLIPGLIQAGLAEKDILDQERKWTEHWYDYREMLHNATFCLVPRGRRLGSFRFLE 78
Query: 233 SLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVL 292
+L CVPV+++N ++LPF++V+NW +++ D LL +I I S + Q +L
Sbjct: 79 ALQAACVPVMLSNGWELPFSEVINWNQAAVIG---DERLLLQIPSTIRS----IHQDKIL 131
Query: 293 KVRK--HFQWHVFPSDYDAFYMVMYEL 317
+R+ F W + S + + E+
Sbjct: 132 ALRQQTQFLWEAYFSSVEKIVLTTLEI 158
>gi|291238132|ref|XP_002738985.1| PREDICTED: tout-velu-like, partial [Saccoglossus kowalevskii]
Length = 403
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 22/67 (32%), Positives = 43/67 (64%)
Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLK 273
+ FC+ ++G + ++D+L GC+PV++++ Y LPF+ VL+WK +I + D+ +
Sbjct: 2 ATFCIVLRGARLGQTALSDALKAGCIPVVLSDTYVLPFSQVLDWKRAAIRLNEEDLDQVA 61
Query: 274 KILKGIS 280
+L+ IS
Sbjct: 62 SVLRSIS 68
>gi|170054218|ref|XP_001863025.1| exostosin-2 [Culex quinquefasciatus]
gi|167874545|gb|EDS37928.1| exostosin-2 [Culex quinquefasciatus]
Length = 758
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 43/76 (56%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L +FCL +G ++ + D+L GC+PVI+A++ LPF +VL+W SI +
Sbjct: 350 YPSVLENGQFCLVARGVRLSQPTLMDALASGCIPVIMADNLVLPFGEVLDWDLVSIRIHE 409
Query: 267 LDIPLLKKILKGISSE 282
++ + LK +S E
Sbjct: 410 NNLHSVISTLKAVSKE 425
>gi|326429052|gb|EGD74622.1| hypothetical protein PTSG_05987 [Salpingoeca sp. ATCC 50818]
Length = 452
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 43/176 (24%), Positives = 76/176 (43%), Gaps = 22/176 (12%)
Query: 161 PKLGSSKRNKLAFFAGAV-NSPVREKLLQVWRNDSEIYAHSGRLK-----TPYAD----- 209
P+ + R+ FFAG + N+ VR L + H ++K P +
Sbjct: 268 PQTSARARDTFLFFAGTLSNNRVRTVLKKAVEGHPHCVIHDAQVKMMQDKNPRSSHSLLV 327
Query: 210 ---------ELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV-LNWKS 259
E+ S+FCL +G T RI +++ GC+PVI+++ Y PF + +
Sbjct: 328 RQVVRDTLAEMATSEFCLCPRGLTAGTRRIFEAVLVGCIPVIVSDGYTWPFPHLAAELDA 387
Query: 260 FSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHV-FPSDYDAFYMVM 314
S+ V D + IL +S E + + + + + +H+ P DAFY ++
Sbjct: 388 ASVRVPEKDAARVLDILGHVSRRERVAKRVRLAHLAHNVTYHLPAPQPGDAFYNII 443
>gi|302809300|ref|XP_002986343.1| hypothetical protein SELMODRAFT_425371 [Selaginella moellendorffii]
gi|300145879|gb|EFJ12552.1| hypothetical protein SELMODRAFT_425371 [Selaginella moellendorffii]
Length = 421
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 27/101 (26%), Positives = 59/101 (58%), Gaps = 3/101 (2%)
Query: 203 LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
L+T ++ S FCL G ++AR+ D++ GC+PVI+++ + PF +++++ ++
Sbjct: 279 LETSWSQRHTFSIFCLSPAGDTPSSARLFDAIVSGCIPVIVSDELEPPFEGLVDYRKVAL 338
Query: 263 V---VATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
V T + L L+ I++ + +L+ ++L+ +HFQ+
Sbjct: 339 FVPSVKTTEKGWLVSYLRAITARQLSMLRGHMLEFSRHFQY 379
>gi|358256597|dbj|GAA50186.1| glucuronyl/N-acetylglucosaminyl transferase EXT1 [Clonorchis
sinensis]
Length = 802
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 50/195 (25%), Positives = 86/195 (44%), Gaps = 42/195 (21%)
Query: 123 EVKLNAIQVVCSSSYFISGHIAHK-DVSLPQIWPRQEDPPKLGSS--------------- 166
E +L+ Q + + + F + I H D+SLP I P+ + L +S
Sbjct: 226 EYRLSLGQAILAKASFSTTKIRHTFDISLPLIHPQHPEKISLEASGHDASYHRVHQRTKV 285
Query: 167 KRNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGRLKTP---YAD--------- 209
+R L F G + S R L + D I + R T YAD
Sbjct: 286 RRPILLSFKGKRYVSGIGSASRNTLFHLHNGDDVIMVTTCRHGTDWIRYADKRCSVDMAT 345
Query: 210 -------ELL-GSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFS 261
EL+ S FCL +G + + R ++L C+PV+++N ++LPF++V++W S +
Sbjct: 346 YDQYDYNELMHNSTFCLVPRGRRLGSYRFLEALEASCIPVMLSNDWELPFSEVIDW-SKA 404
Query: 262 IVVATLDIPLLKKIL 276
++ A +PL ++
Sbjct: 405 VIWADEHLPLTLSLM 419
>gi|117606177|ref|NP_001012369.2| exostosin-1b [Danio rerio]
gi|116487511|gb|AAI25901.1| Exostoses (multiple) 1b [Danio rerio]
gi|182891996|gb|AAI65657.1| Ext1b protein [Danio rerio]
Length = 741
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 73/363 (20%), Positives = 133/363 (36%), Gaps = 95/363 (26%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
F+VYVYP ++ G SESY S F T DP +A +F L
Sbjct: 108 GFKVYVYPQQK----------------GEKISESYQNILSTIEGSRFYTSDPGQACVFVL 151
Query: 62 PFSI-------ARMRHDRRIGTEGIP---DFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
+ H+ + + + + +H IFN+ S +P + G D
Sbjct: 152 SLDTLDRDQLSPQYVHNLKTKVQSLALWNNGRNHLIFNLYSGTWPDYTEDLGFD------ 205
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
IG++ + KA S + D+S+P ++ P+ G +
Sbjct: 206 --IGQAMLAKA--------------SISTENFRPNFDISIPLF---SKEHPRTGGDRGYL 246
Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQV-----------------WRNDSEIY 197
R + F G + S R L V W+ +
Sbjct: 247 KYNTIPPFRKYMLVFKGKRYLTGIGSDTRNALYHVHNAEDVVLLTTCKHGKDWQKHKDAR 306
Query: 198 AHSGRL---KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
K Y + L S FCL +G + + R ++L CVPV+++N ++LPF++V
Sbjct: 307 CDKDNAEYDKYDYREMLHNSTFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 366
Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
++W + +++ + + ++ I ++ L L+ + F W + S + +
Sbjct: 367 IDWNTAAVIGDERLLLQIPSTVRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 421
Query: 315 YEL 317
E+
Sbjct: 422 LEI 424
>gi|68052298|sp|Q5IGR7.1|EXT1B_DANRE RecName: Full=Exostosin-1b; AltName:
Full=Glucuronosyl-N-acetylglucosaminyl-proteoglycan/N-
acetylglucosaminyl-proteoglycan
4-alpha-N-acetylglucosaminyltransferase 1b; AltName:
Full=Multiple exostoses protein 1 homolog b
gi|56785793|gb|AAW29034.1| EXT1b [Danio rerio]
Length = 741
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 73/363 (20%), Positives = 133/363 (36%), Gaps = 95/363 (26%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESY--FKKAFMKSHFVTKDPSKADLFFL 61
F+VYVYP ++ G SESY S F T DP +A +F L
Sbjct: 108 GFKVYVYPQQK----------------GEKISESYQNILSTIEGSRFYTSDPGQACVFVL 151
Query: 62 PFSI-------ARMRHDRRIGTEGIP---DFISHYIFNI-SQKYPYWNRTGGADHFYVAC 110
+ H+ + + + + +H IFN+ S +P + G D
Sbjct: 152 SLDTLDRDQLSPQYVHNLKTKVQSLALWNNGRNHLIFNLYSGTWPDYTEDLGFD------ 205
Query: 111 HSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSK--- 167
IG++ + KA S + D+S+P ++ P+ G +
Sbjct: 206 --IGQAMLAKA--------------SISTENFRPNFDISIPLF---SKEHPRTGGDRGYL 246
Query: 168 --------RNKLAFFAG-----AVNSPVREKLLQV-----------------WRNDSEIY 197
R + F G + S R L V W+ +
Sbjct: 247 KYNTIPPFRKYMLVFKGKRYLTGIGSDTRNALYHVHNAEDVVLLTTCKHGKDWQKHKDAR 306
Query: 198 AHSGRL---KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
K Y + L S FCL +G + + R ++L CVPV+++N ++LPF++V
Sbjct: 307 CDKDNAEYDKYDYREMLHNSTFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEV 366
Query: 255 LNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVM 314
++W + +++ + + ++ I ++ L L+ + F W + S + +
Sbjct: 367 IDWNTAAVIGDERLLLQIPSTVRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTT 421
Query: 315 YEL 317
E+
Sbjct: 422 LEI 424
>gi|9454585|gb|AAF87908.1|AC015447_18 Hypothetical protein [Arabidopsis thaliana]
Length = 414
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 64/276 (23%), Positives = 107/276 (38%), Gaps = 61/276 (22%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPR------GNYASESYFKKAFMKSHFVTKDPSKAD 57
S ++YVY D +L D + G + S+ K ++S F T +AD
Sbjct: 88 SLKIYVYDENEIDGLKELLYGRDGSVKTTACLKGQWGSQVKIHKLLLESKFRTIKKDEAD 147
Query: 58 LFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSA 117
LFF+P A ++ R +G + Y+ +SQ PY+ R+GG DH +V G +
Sbjct: 148 LFFVP---AYVKCVRMLGGLNDKEINQTYVKVLSQ-MPYFRRSGGRDHIFVFPSGAG-AH 202
Query: 118 MQKAWEVKLNAIQVVC-----SSSYFISGHIAHKDVSLP------QIWPRQEDPPKLGSS 166
+ ++W +N ++ + + + KD+ +P Q D L S
Sbjct: 203 LFRSWSTFINRSIILTPEADRTDKKDTTAFNSWKDIIIPGNVDDAMTKNGQPDVQPLPLS 262
Query: 167 KRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVN 226
KR LA + G +GRLK + K
Sbjct: 263 KRKYLANYLGRAQ------------------GKAGRLKLIDLSKQFPDK----------- 293
Query: 227 TARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
CVPV++++H +LPF +V+++ SI
Sbjct: 294 ----------ECVPVLLSDHAELPFQNVIDYAQVSI 319
>gi|340369783|ref|XP_003383427.1| PREDICTED: exostosin-2-like [Amphimedon queenslandica]
Length = 703
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 38/93 (40%), Positives = 51/93 (54%), Gaps = 3/93 (3%)
Query: 204 KTPYADELLGSKFCLHVK-GFEVNTARIADSLYYGCVPVIIANH-YDLPFADVLNWKSFS 261
K PY D L SK+C+ V G T + D L GCVPVII N+ LPF++V++W+ F+
Sbjct: 304 KLPYPDILQDSKYCMIVSDGTGRGTPDLMDVLMMGCVPVIIRNYELVLPFSEVIDWQRFA 363
Query: 262 IVVATLDIPLLKKILKGISSEEYLLLQSYVLKV 294
+ V + L IL G S +L Q VL V
Sbjct: 364 VFVWLEQLFQLMPIL-GSSRNGLILKQKQVLHV 395
>gi|380799435|gb|AFE71593.1| exostosin-1, partial [Macaca mulatta]
Length = 542
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 33/116 (28%), Positives = 60/116 (51%), Gaps = 9/116 (7%)
Query: 204 KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIV 263
K Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V+NW +++
Sbjct: 117 KYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVI 176
Query: 264 VATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQ--WHVFPSDYDAFYMVMYEL 317
D LL +I I S + Q +L +R+ Q W + S + + E+
Sbjct: 177 G---DERLLLQIPSTIRS----IHQDKILALRQQTQFLWEAYFSSVEKIVLTTLEI 225
>gi|349603640|gb|AEP99427.1| Exostosin-1-like protein, partial [Equus caballus]
Length = 435
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 61/120 (50%), Gaps = 9/120 (7%)
Query: 200 SGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKS 259
SG Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V+NW
Sbjct: 6 SGVCLYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQ 65
Query: 260 FSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRK--HFQWHVFPSDYDAFYMVMYEL 317
+++ D LL +I I S + Q +L +R+ F W + S + + E+
Sbjct: 66 AAVIG---DERLLLQIPSTIRS----IHQDKILALRQQTQFLWEAYFSSVEKIVLTTLEI 118
>gi|307106602|gb|EFN54847.1| hypothetical protein CHLNCDRAFT_134888 [Chlorella variabilis]
Length = 833
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 45/176 (25%), Positives = 84/176 (47%), Gaps = 20/176 (11%)
Query: 147 DVSLPQIWPRQE--DPPKLGSSKRNK--LAFFAGAV---NSPVREKLLQVWRNDSEIYAH 199
D+ LP + Q+ + P LG+ R + LAFF G N + Q N + +
Sbjct: 590 DLVLPVMHSAQKYLESPMLGAPTRERRILAFFKGRTQQSNPEYSRGIRQTLENLTREHDW 649
Query: 200 SGRLKTPYADE------------LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHY 247
G+ K +E L S FC + G + ++R D++ +GC+PV+I +
Sbjct: 650 WGKHKVHVGEEMPEGESDSYSAMLAQSVFCFALMG-DGFSSRTDDAIIHGCIPVLIQDGV 708
Query: 248 DLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVF 303
+ ++++L+ S+S+ + D+ + +IL+ IS E+ +Q+ + KV + W F
Sbjct: 709 EPTWSNLLDTGSYSVRILQKDMERVPEILQAISKEDVARMQANLGKVWRRHLWSGF 764
Score = 38.9 bits (89), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 27/96 (28%), Positives = 40/96 (41%), Gaps = 14/96 (14%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTE------GIPDFISHY 86
Y E +A ++S T DP +AD F++P ++ T+ G + +H
Sbjct: 414 YNPEPGLHEALLQSEHRTLDPDEADFFYIPAFVSCFLFPVLSATDFPYFHGGPVAWRTHA 473
Query: 87 IFN--------ISQKYPYWNRTGGADHFYVACHSIG 114
N I YPYW+R GG DH + H G
Sbjct: 474 AANMFIEVYHWIRSHYPYWDRNGGRDHIVGSFHDEG 509
>gi|431901712|gb|ELK08589.1| Exostosin-1 [Pteropus alecto]
Length = 452
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 32/113 (28%), Positives = 59/113 (52%), Gaps = 9/113 (7%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V+NW +++
Sbjct: 30 YREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIG-- 87
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRK--HFQWHVFPSDYDAFYMVMYEL 317
D LL +I I S + Q +L +R+ F W + S + + E+
Sbjct: 88 -DERLLLQIPSTIRS----IHQDKILALRQQTQFLWEAYFSSVEKIVLTTLEI 135
>gi|198416199|ref|XP_002119284.1| PREDICTED: similar to exostoses (multiple) 2, partial [Ciona
intestinalis]
Length = 659
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 25/85 (29%), Positives = 47/85 (55%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y + L FCL + + + + +S+ GC+PV + Y LPF++VL+W S+++
Sbjct: 298 YPEILQEGTFCLLLPTSRLGQSALMESMQAGCIPVFACDTYILPFSEVLDWSRASVLIRE 357
Query: 267 LDIPLLKKILKGISSEEYLLLQSYV 291
+P + IL+ I E+ +L++ V
Sbjct: 358 DSLPDIMNILRRIPHEQVVLMKKQV 382
>gi|440904844|gb|ELR55304.1| Exostosin-1, partial [Bos grunniens mutus]
Length = 426
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 26/111 (23%), Positives = 56/111 (50%), Gaps = 5/111 (4%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V+NW +++
Sbjct: 4 YREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDE 63
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
+ + ++ I ++ L L+ + F W + S + + E+
Sbjct: 64 RLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTTLEI 109
>gi|355757951|gb|EHH61385.1| Exostosin-1, partial [Macaca fascicularis]
Length = 431
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 26/111 (23%), Positives = 56/111 (50%), Gaps = 5/111 (4%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V+NW +++
Sbjct: 9 YREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDE 68
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
+ + ++ I ++ L L+ + F W + S + + E+
Sbjct: 69 RLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTTLEI 114
>gi|281354411|gb|EFB29995.1| hypothetical protein PANDA_011713 [Ailuropoda melanoleuca]
Length = 426
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 32/113 (28%), Positives = 59/113 (52%), Gaps = 9/113 (7%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V+NW +++
Sbjct: 4 YREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIG-- 61
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRK--HFQWHVFPSDYDAFYMVMYEL 317
D LL +I I S + Q +L +R+ F W + S + + E+
Sbjct: 62 -DERLLLQIPSTIRS----IHQDKILALRQQTQFLWEAYFSSVEKIVLTTLEI 109
>gi|410987732|ref|XP_004000149.1| PREDICTED: exostosin-1 [Felis catus]
Length = 488
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 32/113 (28%), Positives = 59/113 (52%), Gaps = 9/113 (7%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V+NW +++
Sbjct: 66 YREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIG-- 123
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRK--HFQWHVFPSDYDAFYMVMYEL 317
D LL +I I S + Q +L +R+ F W + S + + E+
Sbjct: 124 -DERLLLQIPSTIRS----IHQDKILALRQQTQFLWEAYFSSVEKIVLTTLEI 171
>gi|326918032|ref|XP_003205297.1| PREDICTED: exostosin-1-like, partial [Meleagris gallopavo]
Length = 535
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 33/116 (28%), Positives = 61/116 (52%), Gaps = 9/116 (7%)
Query: 204 KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIV 263
K Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V++WK +++
Sbjct: 159 KYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWKQAAVI 218
Query: 264 VATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQ--WHVFPSDYDAFYMVMYEL 317
D LL +I I S + Q +L +R+ Q W + S + + E+
Sbjct: 219 G---DERLLLQIPSTIRS----IHQDKILALRQQTQFLWEAYFSSVEKIVLTTLEI 267
>gi|77748248|gb|AAI05840.1| Ext1 protein [Rattus norvegicus]
Length = 557
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 27/114 (23%), Positives = 57/114 (50%), Gaps = 5/114 (4%)
Query: 204 KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIV 263
K Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V+NW +++
Sbjct: 132 KYDYREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVI 191
Query: 264 VATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
+ + ++ I ++ L L+ + F W + S + + E+
Sbjct: 192 GDERLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTTLEI 240
>gi|194698238|gb|ACF83203.1| unknown [Zea mays]
gi|413919897|gb|AFW59829.1| hypothetical protein ZEAMMB73_270023 [Zea mays]
Length = 264
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 41/137 (29%), Positives = 65/137 (47%), Gaps = 13/137 (9%)
Query: 143 IAHKDVSLPQ--IWPRQEDPPKLGSSK---RNKLAFFAGAV----NSPVREKLLQVWRND 193
I H VSL + I P P L S+ R L +F GA VREKL + N+
Sbjct: 37 IQHTQVSLLKDVIVPYTHLLPTLLLSENKDRRTLLYFKGAKHRHRGGLVREKLWDLLGNE 96
Query: 194 SEIYAHSGRLKTPYADE----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDL 249
++ G ++ L S+FCLH G + R+ D++ C+PVI+++ +L
Sbjct: 97 PDVIMEEGFPNATGREQSIKGLRTSEFCLHPAGDTPTSCRLFDAIASLCIPVIVSDEVEL 156
Query: 250 PFADVLNWKSFSIVVAT 266
PF ++++ SI V+
Sbjct: 157 PFEGIIDYTEISIFVSV 173
>gi|363731063|ref|XP_418396.3| PREDICTED: exostosin-1 [Gallus gallus]
Length = 581
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 32/113 (28%), Positives = 60/113 (53%), Gaps = 9/113 (7%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V++WK +++
Sbjct: 159 YREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWKQAAVIG-- 216
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRK--HFQWHVFPSDYDAFYMVMYEL 317
D LL +I I S + Q +L +R+ F W + S + + E+
Sbjct: 217 -DERLLLQIPSTIRS----IHQDKILALRQQTQFLWEAYFSSVEKIVLTTLEI 264
>gi|427796253|gb|JAA63578.1| Putative tout-velu, partial [Rhipicephalus pulchellus]
Length = 835
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 77/347 (22%), Positives = 135/347 (38%), Gaps = 54/347 (15%)
Query: 2 NRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFL 61
R F+VYVYP + P A+ PV + +S + T D S+A +F
Sbjct: 184 GRDFKVYVYPSESDAPPAS---PV----------YQRILRVIRQSSYATADASEACVFVP 230
Query: 62 --------PFSIARMRHDRRIGTEGIPDFISHYIFNI-SQKYPYWNRTGGAD--HFYVAC 110
P S R R + +H IFN+ S +P ++ G D +A
Sbjct: 231 AVDTVDRDPLSPDYARTARLTDSPLWNGGQNHLIFNLFSGTWPDYSEELGLDIGLAMLAK 290
Query: 111 HSIGRSAMQKAWEVKL----------NAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDP 160
SI SA + +++ L + S+ G++ ++ +
Sbjct: 291 SSIPESAFRPGFDIALPLFPKAHPERGGKPAIQSAGPVDKGYLL--------VFKGKRYV 342
Query: 161 PKLGSSKRNKLAFFAGAVN----SPVREKLLQVWRNDSEIYAHSGRL--KTPYADELLGS 214
+GS RN L + + R + R D A + RL + Y + +
Sbjct: 343 YGIGSDTRNALHHLNNGRDVLLLTTCRHGKQWMERRDERCEADN-RLYDRYDYGSLMENA 401
Query: 215 KFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKK 274
FCL +G + + R +SL GCVPV++AN ++LPF + L W+ ++ D LL +
Sbjct: 402 TFCLVPRGRRLGSFRFLESLQAGCVPVLLANGWELPFGESLRWEGAALRA---DERLLLQ 458
Query: 275 ILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
+ + S + + ++ R W + S D + + ++ R
Sbjct: 459 VPDTLRSMPRRRV--HAMRQRSQLLWETYFSSVDKIVLTVLQILRER 503
>gi|406700155|gb|EKD03339.1| hypothetical protein A1Q2_02368 [Trichosporon asahii var. asahii
CBS 8904]
Length = 1041
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 73/148 (49%), Gaps = 11/148 (7%)
Query: 166 SKRNKLAFFAGAVNSPVREKLLQVW-------RNDSEIYAHSGRLKTPYADELLGSKFCL 218
++R LA F G+ N L+ + ++AH +T Y D L ++FC
Sbjct: 845 AERQTLATFKGSPNGQGTSLRLKTTCPRLLPPGSLEPVWAHIPEGQT-YLDLLGDTRFCP 903
Query: 219 HVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKG 278
G T R++D +Y GC+PV++ + + + + +W FS+ V ++ L++IL
Sbjct: 904 IPFGTAGWTYRLSDVVYAGCIPVLVGDQSHMTYWSMFDWSLFSVQVFEHELDHLERILGA 963
Query: 279 ISSEEYLLLQSYVLKVRKHFQWHVFPSD 306
++ E+ Q ++ VR+ F ++PS+
Sbjct: 964 VTEEDAQRKQDALMLVREAF---LYPSE 988
>gi|242018853|ref|XP_002429885.1| Exostosin-1, putative [Pediculus humanus corporis]
gi|212514919|gb|EEB17147.1| Exostosin-1, putative [Pediculus humanus corporis]
Length = 725
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 68/297 (22%), Positives = 114/297 (38%), Gaps = 61/297 (20%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLPF 63
F++YVYP +P+ P +Y ++S + T DPS+A LF L
Sbjct: 91 GFKIYVYPQTDESG-----IPLPLVPSASYQK---IINVIIESRYYTSDPSQACLFVL-- 140
Query: 64 SIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWE 123
+I + D D++ + I QK PYWN+ G +H +S E
Sbjct: 141 AIDTLDRDSLSN-----DYVRNLPLRI-QKLPYWNK--GRNHVIFNLYS---GTWPDYAE 189
Query: 124 VKLN---AIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSS-----------KRN 169
L ++ +S + DVS+P + K G S K+
Sbjct: 190 ENLGFDYGFSILAKASMSTVHFRPNFDVSIPLFHKNHRE--KGGESGYVSSNNFPVIKKY 247
Query: 170 KLAF----FAGAVNSPVREKLLQVWRNDSEIYAHSGR-------LKTPYADE-------- 210
LAF + + S R L + + + + +K DE
Sbjct: 248 VLAFKGKRYVHGIGSETRNSLYHLHNEKDMVLVTTCKHGKSWKEMKDERCDEDNQEYDRY 307
Query: 211 -----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
L S FCL +G + + R ++L GC+PV+++N + LPF +V++W ++
Sbjct: 308 DYEVLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNGWVLPFQEVIDWTKAAV 364
>gi|168041166|ref|XP_001773063.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675610|gb|EDQ62103.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 395
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 67/288 (23%), Positives = 117/288 (40%), Gaps = 52/288 (18%)
Query: 50 TKDPSKADLFFLPF-----SIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGAD 104
T +P +ADLFF+PF ++ + D + + + + N +Q W R GG D
Sbjct: 80 TSNPGEADLFFIPFFSGLEALPYLYTDGKRRLQQGRELVEWLEANATQT---WRRHGGHD 136
Query: 105 HFYVACHSIGRSAMQKA--------WEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPR 156
HF +A GR+A W L + + +++ + + + + +P
Sbjct: 137 HFLIA----GRTAWDFCRPLTAVTWWGTSLFSNPEMENTTAMLLERRSWRGDEMAVPYPV 192
Query: 157 QEDPPK----------LGSSKRNKLAFFAGAVNS----PVREKLLQVWRNDSEIYAH--S 200
P + SS R L F+GA+ +RE L Q +
Sbjct: 193 GFHPSTSASLQSWIKLVRSSTRKYLFSFSGALRPQLVFSIREILSQQCTQAGSACSRLDC 252
Query: 201 GRLKT-----PYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVL 255
G++K P LL +KFCL +G + DS+ GC+PV H D F
Sbjct: 253 GKIKCSHEPQPIYTSLLQAKFCLQPRGDTATRRSVIDSIVSGCIPVFF--HKDTAFTQYR 310
Query: 256 -----NWKSFSIVVATLDI----PLLKKILKGISSEEYLLLQSYVLKV 294
++ +FS+ + DI +KKIL+G S+++ ++ ++ +
Sbjct: 311 WHLPNDYDNFSVFIDEEDIKNGKADVKKILEGYSAKQVEQMRERLIGI 358
>gi|328788198|ref|XP_001121972.2| PREDICTED: exostosin-2 [Apis mellifera]
Length = 708
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 26/82 (31%), Positives = 44/82 (53%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y + L + FCL ++G + + + + G +PVIIA+ +PF +++W I +
Sbjct: 301 YPNVLQTATFCLVIRGARLGQTTLLECMAAGSIPVIIADSLTMPFHGIIDWTRAVIFIRE 360
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+DI L +LK IS E + LQ
Sbjct: 361 VDILSLISVLKKISQERIIELQ 382
>gi|351697844|gb|EHB00763.1| Exostosin-like 1 [Heterocephalus glaber]
Length = 674
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 31/117 (26%), Positives = 60/117 (51%), Gaps = 6/117 (5%)
Query: 201 GRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSF 260
G +T ++L + FCL + G +R +L GC+PV+++ H++LPF++V++W
Sbjct: 256 GPKQTHPGEKLPNATFCL-IPGRRAEASRFLQALQAGCIPVLLSPHWELPFSEVIDWTKA 314
Query: 261 SIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
+I VA +PL ++L + +E + L+ + F W + S + E+
Sbjct: 315 AI-VADERLPL--QVLAAL--QEMAPARVLALRQQTQFLWDAYFSSVEKVIHTTLEI 366
>gi|351697330|gb|EHB00249.1| Exostosin-1, partial [Heterocephalus glaber]
Length = 426
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 26/111 (23%), Positives = 56/111 (50%), Gaps = 5/111 (4%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V+NW +++
Sbjct: 4 YREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDE 63
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
+ + ++ I ++ L L+ + F W + S + + E+
Sbjct: 64 RLLLQIPSTIRSIHQDKIL-----ALRQQTQFLWEAYFSSVEKIVLTTLEI 109
>gi|432094719|gb|ELK26199.1| Exostosin-1 [Myotis davidii]
Length = 544
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 32/113 (28%), Positives = 59/113 (52%), Gaps = 9/113 (7%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V+NW +++
Sbjct: 122 YREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIG-- 179
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRK--HFQWHVFPSDYDAFYMVMYEL 317
D LL +I I S + Q +L +R+ F W + S + + E+
Sbjct: 180 -DERLLLQIPSTIRS----IHQDKILALRQQTQFLWEAYFSSVEKIVLTTLEI 227
>gi|410931012|ref|XP_003978890.1| PREDICTED: exostosin-1c-like, partial [Takifugu rubripes]
Length = 424
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 28/111 (25%), Positives = 52/111 (46%), Gaps = 5/111 (4%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y + L S FCL +G + + R +SL C+PV+++N ++LPF+DV+ W I
Sbjct: 3 YQELLHNSTFCLVPRGRRLGSFRFLESLQAACIPVLLSNGWELPFSDVIQWNQAVIEGDE 62
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
+ + ++ + ++ L L+ R W + S D + E+
Sbjct: 63 RLLLQVPSTVRAVGNDRVL-----ALRQRTQMLWEAYFSSVDKIVLTTLEI 108
>gi|307177266|gb|EFN66444.1| Exostosin-1 [Camponotus floridanus]
Length = 711
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 79/371 (21%), Positives = 140/371 (37%), Gaps = 77/371 (20%)
Query: 3 RSFRVYVYP-HRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFL 61
+ F VYVYP P +L V E R + T DP++A +F L
Sbjct: 87 QGFTVYVYPIEEAISPLYQKILNVITESR-----------------YYTSDPARACIFVL 129
Query: 62 PFSIARMRHDRR-IGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
DR + TE + + S + PYWN G +H +S +
Sbjct: 130 ALDTL----DRDPLSTEFVHNLPSKLL-----HLPYWN--NGRNHLIFNLYSGTWPDYSE 178
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS----------SKRNK 170
++ +S I H + DVS+P ++ P + G + +
Sbjct: 179 ESLAFDMGYAILAKASMSIFRHRSDFDVSIPLF--GKQHPERGGEPGQALENNFPNNKKY 236
Query: 171 LAFFAG-----AVNSPVREKLLQVWRNDSEIYAHS---GRLKTPYADE------------ 210
+A F G + S R L + ++ + G+ + DE
Sbjct: 237 VAAFKGKRYVHGIGSETRNALYHLHNGKDLVFVTTCRHGKAWREFQDEHCQQNNQEYDTY 296
Query: 211 -----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVA 265
L+ + FCL +G + + R ++L GC+PVI++N + LPF + ++W +I
Sbjct: 297 DYEILLMNATFCLVPRGRRLGSFRFLEALRAGCIPVILSNGWALPFHERIDWTQAAIFSD 356
Query: 266 TLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSD-----YDAFYMVMYELWLR 320
+ + I++ +S+ +Q L+ + F W + S + AF + L
Sbjct: 357 ERLLLQIPDIVRSVSN-----VQILKLRQQTQFLWERYFSSIEKIVFTAFENIRERLPWE 411
Query: 321 RSSVRVQWSTS 331
+ R+ W+TS
Sbjct: 412 GARERIVWNTS 422
>gi|449278615|gb|EMC86416.1| Exostosin-1 [Columba livia]
Length = 1015
Score = 53.9 bits (128), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 31/113 (27%), Positives = 59/113 (52%), Gaps = 9/113 (7%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V++W +++
Sbjct: 38 YREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWNQAAVIG-- 95
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQ--WHVFPSDYDAFYMVMYEL 317
D LL +I I S + Q +L +R+ Q W + S + + E+
Sbjct: 96 -DERLLLQIPSTIRS----IHQDKILALRQQTQFLWEAYFSSVEKIVLTTLEI 143
>gi|402218829|gb|EJT98904.1| hypothetical protein DACRYDRAFT_110241 [Dacryopinax sp. DJM-731 SS1]
Length = 1153
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 34/151 (22%), Positives = 65/151 (43%), Gaps = 8/151 (5%)
Query: 162 KLGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVK 221
KL KR + ++L+ VW + + ++ L + +C +
Sbjct: 976 KLNCEKRLRTLEDVATPRLETEQRLMTVWDSLGDYESYPAILND--------TIWCPLPE 1027
Query: 222 GFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISS 281
G R+ D +Y GC+PV + + PF D+L+W SI + D+ ++++L +
Sbjct: 1028 GVTGWATRLEDVVYGGCIPVFVGHASQYPFYDMLDWSKLSIAIERKDLQRIEEVLMSYTM 1087
Query: 282 EEYLLLQSYVLKVRKHFQWHVFPSDYDAFYM 312
EE Q+ ++ VR F + + + D M
Sbjct: 1088 EEIERFQTNLMLVRDAFLYPLDGNHKDQLTM 1118
>gi|348571154|ref|XP_003471361.1| PREDICTED: exostosin-like 1-like [Cavia porcellus]
Length = 674
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 31/117 (26%), Positives = 60/117 (51%), Gaps = 6/117 (5%)
Query: 201 GRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSF 260
G +T ++L + FCL + G +R +L GC+PV+++ H++LPF++V++W
Sbjct: 256 GPKQTRPEEKLPNATFCL-IPGQRAEASRFLQALQAGCIPVLLSPHWELPFSEVIDWTKA 314
Query: 261 SIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
+I VA +PL ++L + +E + L+ + F W + S + E+
Sbjct: 315 AI-VADKRLPL--QVLAAL--QEMAPARVLALRQQTQFLWAAYFSSVEKVIHTTLEI 366
>gi|168023962|ref|XP_001764506.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162684370|gb|EDQ70773.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 438
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 66/291 (22%), Positives = 111/291 (38%), Gaps = 52/291 (17%)
Query: 38 YFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYW 97
YFK A +S + T DP KA L+F+ DRR E IP + PYW
Sbjct: 123 YFKDALERSPYYTADPEKACLYFVTV-------DRR--AEKIPSLPT---------LPYW 164
Query: 98 NRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQ 157
N G +H V+ +KA V+ + +S + + A D+S+ R
Sbjct: 165 NH--GLNHVIVSISDFW--VQRKATPVETIEMASTMTSIAHHASYRAGFDISVALPQKRS 220
Query: 158 -EDPPKLGSSKRNKLAFFAG-----------------------------AVNSPVREKLL 187
D L + +R F G P E LL
Sbjct: 221 YSDVQGLKAFERKYFLTFTGMQFLGSSGSRINPVLRSMHNGEDVIIAITCKQDPNSEALL 280
Query: 188 QVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHY 247
+E + EL+ S F L G +++R+ + L G +PV+I++++
Sbjct: 281 SRPELRAECVQDQSVYEQYTTRELMDSTFGLVQAGRGSSSSRLLEVLSAGSIPVVISDNF 340
Query: 248 DLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
LPF +L+W+ +V + + + + L+ +S E + + L + + F
Sbjct: 341 VLPFESLLDWRRCLLVFPSSQMQRIVRTLRSLSKGEIEFRREHCLYIYRDF 391
>gi|312378632|gb|EFR25154.1| hypothetical protein AND_09769 [Anopheles darlingi]
Length = 732
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/99 (28%), Positives = 54/99 (54%)
Query: 200 SGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKS 259
SG + Y L FCL +G + + +++ GC+PVI+A++ LPF+++L+W+
Sbjct: 316 SGGNEYEYPAVLESGVFCLIARGVRLAQPVLLEAMATGCIPVIVADNLVLPFSNILDWEL 375
Query: 260 FSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
S+ V + + +LK +S + LQ++V V + +
Sbjct: 376 LSVRVYESQLHSVLALLKRVSDQRIRELQAHVRYVYERY 414
>gi|223975397|gb|ACN31886.1| unknown [Zea mays]
Length = 134
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/106 (28%), Positives = 55/106 (51%), Gaps = 3/106 (2%)
Query: 211 LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI---VVATL 267
+ SKFCLH G ++ R+ D++ CVPVI+++ +LPF D +++ FS+ V L
Sbjct: 1 MRSSKFCLHPAGDTPSSCRLFDAIVSHCVPVIVSSRIELPFEDEIDYSEFSLFFSVEEAL 60
Query: 268 DIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMV 313
L L+ + +++ + + V H+++ P DA M+
Sbjct: 61 RPDYLLNELRQVPKRKWVDMWLKLKNVSHHYEFQYPPRKGDAVNMI 106
>gi|260782087|ref|XP_002586123.1| hypothetical protein BRAFLDRAFT_105910 [Branchiostoma floridae]
gi|229271215|gb|EEN42134.1| hypothetical protein BRAFLDRAFT_105910 [Branchiostoma floridae]
Length = 551
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 38/116 (32%), Positives = 55/116 (47%), Gaps = 10/116 (8%)
Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDL--PFADVLNWKSFSIVVATLDIP- 270
S FCL G D++ GCVPV + + PF VLN+ FS+++ D+
Sbjct: 417 SVFCLQPPGDSPTRKSFYDAVACGCVPVTFTLEHPVRYPFDQVLNYSDFSVIIDGKDVTD 476
Query: 271 ---LLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPS---DYDAFYMVMYELWLR 320
+ IL+ I SE +LQ + KV Q+ +PS DAF MV+ E+ R
Sbjct: 477 RNITILNILRKIPSERIKMLQDNLKKVAPLLQYS-YPSTVPSQDAFTMVLEEMAQR 531
>gi|395521813|ref|XP_003765009.1| PREDICTED: exostosin-like 1 [Sarcophilus harrisii]
Length = 694
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 26/107 (24%), Positives = 52/107 (48%), Gaps = 5/107 (4%)
Query: 211 LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIP 270
L S FC + R+ +L GCVPV+++ ++LPFA+V++W + ++++ +
Sbjct: 286 LHNSTFCFIPSSCHAGSFRLLQALKAGCVPVLLSRGWELPFAEVIDWGTAAVIIDERHLL 345
Query: 271 LLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
+K +L+G+ L L+ + F W + S + E+
Sbjct: 346 QIKSVLQGLPPARVL-----ALRQQTQFLWDAYFSSVEKIVHSTLEI 387
>gi|391340944|ref|XP_003744793.1| PREDICTED: exostosin-1a-like [Metaseiulus occidentalis]
Length = 1047
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 73/302 (24%), Positives = 120/302 (39%), Gaps = 77/302 (25%)
Query: 5 FRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFLP 62
FRVY+YP G S +Y K A +S F T +P +A LF
Sbjct: 359 FRVYIYPLED----------------GQAVSATYMKLLSAIRRSRFYTSNPEEACLFVPN 402
Query: 63 FSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADH--FYVACHSIGRSAMQK 120
DR I ++ ++N+ P+WN GG +H F + S +
Sbjct: 403 VDTL----DRDILSDEYVRSAQARLWNL----PHWN--GGRNHLIFNLFSGSWPDYSQDL 452
Query: 121 AWEVKLNAI-QVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLG-----SSKRNK---- 170
++ L + + + F G+ D+S+P ++PR P++G SS +K
Sbjct: 453 GFDPGLAMLAKSSAPETIFRPGY----DISIP-LFPRTH--PEIGGEPGFSSSESKIITP 505
Query: 171 -----LAFFAG-----AVNSPVREKLLQV-----------------WRNDSEIYAHSGRL 203
L F G + S +R L + W+ + S
Sbjct: 506 LRKRFLLTFKGKRYLYGIGSEIRNSLFHLNNVNDVLLLTTCKHGKQWKLKKDERCDSDNA 565
Query: 204 ---KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSF 260
K Y + S FCL +G + + R +SL GC+P++++N + LPF +V++WKS
Sbjct: 566 DYDKQDYTVLMQNSTFCLVPRGRRLGSFRFLESLQAGCIPIVLSNDWRLPFDEVIDWKSA 625
Query: 261 SI 262
+I
Sbjct: 626 TI 627
>gi|91076108|ref|XP_968944.1| PREDICTED: similar to tout-velu CG10117-PA [Tribolium castaneum]
gi|270014706|gb|EFA11154.1| hypothetical protein TcasGA2_TC004758 [Tribolium castaneum]
Length = 719
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 73/302 (24%), Positives = 116/302 (38%), Gaps = 73/302 (24%)
Query: 2 NRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLF 59
+ FRVYVYP N + S SY K M+S + T DP A LF
Sbjct: 79 QQGFRVYVYPPDDN----------------STPSPSYQKLLNVLMESRYYTADPRLACLF 122
Query: 60 FLPFSIARMRHDRRIGTEGIPDFISHYIFNIS---QKYPYWNRTGGADHFYVACHSIGRS 116
L SI + DR + Y+ N+ Q P+WN G +H +S G
Sbjct: 123 VL--SIDTLDRDR---------LSADYVRNMQSRLQHLPHWN--NGLNHVIFNLYS-GTW 168
Query: 117 AMQKAWEVKLNAIQVVCSSSYFISGHIAHK-DVSLP---QIWPRQEDPPKLGSS------ 166
+ + + + + H+ D+S+P ++ P E ++GS
Sbjct: 169 PNYTENNLDFDYGMAILAKASMSDSHMRPGFDISIPLFHKVHP--EKGGEVGSVLANSLP 226
Query: 167 -KRNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR-------LKTPYADE--- 210
++N L F G + S R L + I + R +K D+
Sbjct: 227 LQKNYLLAFKGKRYVHGIGSDTRNSLYHLHNRKDMIMVTTCRHGKSWKDMKDERCDQDNK 286
Query: 211 ----------LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSF 260
L S FCL +G + + R ++L GC+PV+++N + LPFA ++W
Sbjct: 287 EYDKYDYEVLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNGWALPFAQKIDWSKA 346
Query: 261 SI 262
+I
Sbjct: 347 AI 348
>gi|260786761|ref|XP_002588425.1| hypothetical protein BRAFLDRAFT_153069 [Branchiostoma floridae]
gi|229273587|gb|EEN44436.1| hypothetical protein BRAFLDRAFT_153069 [Branchiostoma floridae]
Length = 107
Score = 53.9 bits (128), Expect = 1e-04, Method: Composition-based stats.
Identities = 32/110 (29%), Positives = 52/110 (47%), Gaps = 5/110 (4%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L S FCL +G + + R ++L C+PVI AN ++LPF++V+ W +I
Sbjct: 3 YNTLLHNSTFCLVPRGRRLGSFRFLEALQAACIPVIEANGWELPFSEVIEWDRATITADE 62
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYE 316
+ L IL+ I E+ L L+ + F W + S + + E
Sbjct: 63 RLLFQLPSILRAIPPEKIL-----ALRQQTQFLWETYFSSVERIVLTTLE 107
>gi|47207620|emb|CAG13862.1| unnamed protein product [Tetraodon nigroviridis]
Length = 426
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/113 (26%), Positives = 60/113 (53%), Gaps = 9/113 (7%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y + L S FCL +G + + R ++L CVPV+++N ++LPF+++++W + +++
Sbjct: 4 YKEMLHNSTFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEIIDWNTAAVIG-- 61
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRK--HFQWHVFPSDYDAFYMVMYEL 317
D LL +I + S + Q +L +R+ F W + S + + E+
Sbjct: 62 -DERLLLQIPTTVRS----IHQDRILSLRQQTQFLWEAYFSSVEKIVLTTLEI 109
>gi|322795454|gb|EFZ18199.1| hypothetical protein SINV_05251 [Solenopsis invicta]
Length = 711
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 88/378 (23%), Positives = 153/378 (40%), Gaps = 91/378 (24%)
Query: 3 RSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLP 62
+ F VYVYP +V+ P+ ++ N +ES + T DP++A +F L
Sbjct: 87 QGFTVYVYP------VEDVISPL-YQKILNVITES---------RYYTSDPARACIFVLA 130
Query: 63 FSIARMRHDRR-IGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHS---IGRSAM 118
DR + TE + + S + PYWN G +H +S +
Sbjct: 131 LDTL----DRDPLSTEFVHNLPSKLL-----HLPYWN--NGRNHLIFNLYSGTWPDYAEE 179
Query: 119 QKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKR--------NK 170
A++V ++ +S I H DVS+P ++ P + G S + NK
Sbjct: 180 SLAFDV---GYAMLAKASMSIFRHRPDFDVSIPLF--GKQHPERGGESGQALENNFPNNK 234
Query: 171 --LAFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR-------LKTPYADE------ 210
+A F G + S R L + ++ + R L+ + +
Sbjct: 235 KYVAAFKGKRYVHGIGSETRNALYHLHNGKDLVFVTTCRHGKAWRELQDEHCQQDNQEYD 294
Query: 211 -------LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKS---F 260
L+ + FCL +G + + R ++L GC+PVI++N + LPF + ++W F
Sbjct: 295 TYDYEILLMNATFCLVPRGRRLGSFRFLEALRSGCIPVILSNGWALPFHERIDWTQAVIF 354
Query: 261 SIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRK--HFQWHVFPSDYDAFYMVMYELW 318
S L IP IL+ +S+ + +LKVR+ F W + S + ++E
Sbjct: 355 SDERLLLQIP---DILRSVSNVQ-------ILKVRQQTQFLWERYFSSIEKIVFTVFENI 404
Query: 319 LRR-----SSVRVQWSTS 331
R + R+ W+T+
Sbjct: 405 RERLPWEGARERIVWNTN 422
>gi|390338044|ref|XP_783281.3| PREDICTED: exostosin-1b-like [Strongylocentrotus purpuratus]
Length = 702
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/111 (25%), Positives = 57/111 (51%), Gaps = 5/111 (4%)
Query: 211 LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIP 270
L S FCL +G + + R +SL C+P++++N ++LPF++V++W + +
Sbjct: 276 LHNSTFCLVPRGRRLGSFRFLESLQAACIPMLLSNGWELPFSEVIDWSKAVVFGDERLLL 335
Query: 271 LLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
+ I++ I++E+ LLL+ + F W+ + S + E+ R
Sbjct: 336 QVPSIVRSITAEQILLLRQ-----QTQFLWNSYFSSTEKIVHTTLEIIKER 381
>gi|149410585|ref|XP_001509292.1| PREDICTED: exostosin-1 [Ornithorhynchus anatinus]
Length = 443
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 32/113 (28%), Positives = 59/113 (52%), Gaps = 9/113 (7%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V++W +I+
Sbjct: 21 YREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVIDWNQAAIIG-- 78
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRK--HFQWHVFPSDYDAFYMVMYEL 317
D LL +I I S + Q +L +R+ F W + S + + E+
Sbjct: 79 -DERLLLQIPSTIRS----IHQDKILALRQQTQFLWEAYFSSVEKIVLTTLEI 126
>gi|332017043|gb|EGI57842.1| Exostosin-1 [Acromyrmex echinatior]
Length = 911
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 86/379 (22%), Positives = 150/379 (39%), Gaps = 93/379 (24%)
Query: 3 RSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLP 62
+ F VYVYP +V+ P+ ++ N +ES + T DP++A +F L
Sbjct: 87 QGFTVYVYP------VEDVISPL-YQKILNVITES---------RYYTSDPARACIFVLA 130
Query: 63 FSIARMRHDRR-IGTEGIPDFISHYIFNISQKY---PYWNRTGGADHFYVACHS---IGR 115
DR + TE ++ N+ K PYWN G +H +S
Sbjct: 131 LDTL----DRDPLSTE--------FVHNLPLKLLHLPYWN--NGRNHLIFNLYSGTWPDY 176
Query: 116 SAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS---------- 165
+ A++V ++ +S I H DVS+P ++ P + G
Sbjct: 177 AEESLAFDV---GYAMLAKASMSIFKHRPDFDVSIPLFG--KQHPERGGEPGQALENNFP 231
Query: 166 SKRNKLAFFAG-----AVNSPVREKLLQVWRNDSEIYAHS---GRLKTPYADE------- 210
+ + +A F G + S R L + ++ + G+ + DE
Sbjct: 232 NNKKYVAAFKGKRYVHGIGSETRNALYHLHNGKDLVFVTTCRHGKTWREFQDEHCQQDNQ 291
Query: 211 ----------LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKS- 259
L+ + FCL +G + + R ++L GC+PVI++N + LPF + ++W
Sbjct: 292 EYDTYDYEILLMNATFCLVPRGRRLGSFRFLEALRSGCIPVILSNSWALPFHERIDWNQA 351
Query: 260 --FSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
FS L IP IL+ +S+ +Q L+ + F W + S + ++E
Sbjct: 352 VIFSDERLLLQIP---DILRSVSN-----VQILKLRQQTQFLWERYFSSIEKIVFTVFEN 403
Query: 318 WLRR-----SSVRVQWSTS 331
R + R+ W+TS
Sbjct: 404 IRERLPWEGARERIVWNTS 422
>gi|449662559|ref|XP_002154865.2| PREDICTED: exostosin-2-like [Hydra magnipapillata]
Length = 729
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 64/280 (22%), Positives = 114/280 (40%), Gaps = 42/280 (15%)
Query: 6 RVYVYPHR--RNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLPF 63
+VY+YP+ R + + FE + K+A KS + T++ ++A LF P
Sbjct: 144 KVYIYPNNEYRTSNNEKIFMEESFE-------FNLIKEAIEKSKYFTENITEACLFIPPL 196
Query: 64 SIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWE 123
+ + EGI ++ + ++ N + F + SI +AM A+
Sbjct: 197 DL--------LTEEGIDLKLASAVLKFLSRWESGNNHVIFNLFPLHPTSIN-TAMSLAYG 247
Query: 124 VKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS-SKRNKLAFFAGAVNSPV 182
NAI V +++ S + D+S+P K + S K V+ +
Sbjct: 248 ---NAI--VAGANFISSNYRQGFDISIPLFTKAMTSTQKFSTFSGTRKWKLIVSQVSMDI 302
Query: 183 RE-----KLLQVWRNDSEIYAHSGRLKTPYADELLG-------------SKFCLHVKGFE 224
+ KL + + I A G Y + L FCL + G+
Sbjct: 303 EKRNVLNKLKDLHPDLLMIQACPGTFDMMYPEVLCKDSKPFYYPAVLQEGTFCLLLPGYY 362
Query: 225 VNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV 264
++ + D++ GC+PVI+ N Y LPF +V++W +I+V
Sbjct: 363 YGSSLLLDAMMMGCIPVIMMNDYVLPFNEVIDWSRAAIIV 402
>gi|444706299|gb|ELW47642.1| Exostosin-like 1, partial [Tupaia chinensis]
Length = 595
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 31/118 (26%), Positives = 59/118 (50%), Gaps = 6/118 (5%)
Query: 200 SGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKS 259
+G +T D L + FCL + G R +L GC+PV+++ ++LPF++V++W
Sbjct: 175 AGAEQTHPGDTLPSATFCL-IPGRRPGAGRFLQALQAGCIPVLLSPRWELPFSEVIDWTK 233
Query: 260 FSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
+I VA +PL ++L + +E + L+ + F W + S + + E+
Sbjct: 234 AAI-VADERLPL--QVLAAL--QEMPPARVLALRQQTQFLWGAYFSSVEKVVLTTLEV 286
>gi|390340500|ref|XP_003725255.1| PREDICTED: xyloglucan galactosyltransferase KATAMARI1 homolog
[Strongylocentrotus purpuratus]
Length = 495
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/139 (28%), Positives = 64/139 (46%), Gaps = 13/139 (9%)
Query: 189 VWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVII---AN 245
VW N E + + P D + S FCL G+ DS+ GC+PV +
Sbjct: 355 VWFNTPECHQD---IHLPIIDWMRHSIFCLQPPGYSNIRKSFYDSIMSGCIPVTFRSKRS 411
Query: 246 HYDLPFADVLNWKSFSIVVATLDI----PLLKKILKGISSEEYLLLQSYVLKVRKHFQWH 301
H PF L+++ F++ + ++ + ILKGI+ + LQ+ + +V FQ+
Sbjct: 412 HVIYPFERTLDYRRFTVNIPIDEVLSGKTNVTNILKGITKWKIAELQTELAEVAPKFQYS 471
Query: 302 VFP---SDYDAFYMVMYEL 317
P DYDAF ++ E+
Sbjct: 472 YPPIRGPDYDAFAAIIEEM 490
>gi|116786286|gb|ABK24052.1| unknown [Picea sitchensis]
Length = 208
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 55/106 (51%), Gaps = 9/106 (8%)
Query: 166 SKRNKLAFFAGAVNSPV-REKLLQVWRN-DSEIYAHSGRLKTP-------YADELLGSKF 216
SKR LA F G V V R +LL++ + ++ A + P Y L +KF
Sbjct: 8 SKRKYLANFLGRVQGKVGRLQLLKLSQQFPDKLEAPELKFSGPEKFGRIEYFQHLRNAKF 67
Query: 217 CLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
CL +G T R ++ + CVPVI+++ +LPF +VL++ FSI
Sbjct: 68 CLAPRGESSWTLRFYEAFFVECVPVILSDQIELPFQNVLDYSQFSI 113
>gi|384251773|gb|EIE25250.1| hypothetical protein COCSUDRAFT_40557 [Coccomyxa subellipsoidea
C-169]
Length = 280
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/98 (33%), Positives = 53/98 (54%), Gaps = 3/98 (3%)
Query: 205 TPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV 264
T Y + S FCL G R+ +L +GC+PVIIA++ +PF DVL ++ F++ V
Sbjct: 145 TQYMADFGRSTFCLAATGAGWG-VRLKLALMHGCIPVIIADNVQMPFEDVLPYQDFAVHV 203
Query: 265 ATLDIPLLKKILKGISSEEYLL--LQSYVLKVRKHFQW 300
+ L ++L I S E L+ +Q V + ++F W
Sbjct: 204 REHALYRLPEVLDAILSTEGLVKRMQINVSCIWRYFTW 241
>gi|307197832|gb|EFN78943.1| Exostosin-1 [Harpegnathos saltator]
Length = 711
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 83/373 (22%), Positives = 146/373 (39%), Gaps = 81/373 (21%)
Query: 3 RSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLP 62
+ F VYVYP +V+ P+ ++ N +ES + T DP++A +F L
Sbjct: 87 QGFTVYVYP------VEDVISPL-YQKILNVITES---------RYYTSDPARACIFVLA 130
Query: 63 FSIARMRHDRR-IGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKA 121
DR + TE + + S + PYWN G +H +S +
Sbjct: 131 LDTL----DRDPLSTEFVHNLPSKLL-----HLPYWN--NGRNHLIFNLYSGTWPDYAEE 179
Query: 122 WEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS----------SKRNKL 171
V ++ +S I + DVS+P ++ P + G S + +
Sbjct: 180 SLVFDVGYAILAKASMSIFRYRPDFDVSIPLF--GKQHPERGGEPGQALENNFPSNKKYV 237
Query: 172 AFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR-------LKTPYADE--------- 210
A F G + S R L + ++ + R L+ + +
Sbjct: 238 AAFKGKRYVHGIGSETRNALYHLHNGKDLVFVTTCRHGKAWRELQDEHCQQDNQEYDTYD 297
Query: 211 ----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKS---FSIV 263
L+ + FCL +G + + R ++L GC+PVI++N + LPF + ++W FS
Sbjct: 298 YEILLMNATFCLVPRGRRLGSFRFLEALRAGCIPVILSNGWALPFHERIDWTQAVIFSDE 357
Query: 264 VATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR-- 321
L IP I++ +S+ +Q L+ + F W + S + ++E R
Sbjct: 358 RLLLQIP---DIVRSVSN-----VQILKLRQQTQFLWERYFSSIEKIVFTVFENIRERLP 409
Query: 322 ---SSVRVQWSTS 331
+ RV W+TS
Sbjct: 410 WEGARERVVWNTS 422
>gi|159473595|ref|XP_001694919.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158276298|gb|EDP02071.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 674
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/125 (29%), Positives = 61/125 (48%), Gaps = 3/125 (2%)
Query: 177 AVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYY 236
A + VR+ LL ++ N + + P +L S+FC GF R+ +
Sbjct: 526 AYSGGVRQGLLALFGNTTRPDVSINKGGGP--SLMLRSRFCFTPMGFGWGV-RLTQAAMT 582
Query: 237 GCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRK 296
GCVPV++ +H DVL ++ FSI V+ ++ L +IL I++EE LQ+ + +
Sbjct: 583 GCVPVMVQDHVWPTLWDVLPYEKFSIRVSRHNLYRLFEILDSITAEELASLQAGLAHWHR 642
Query: 297 HFQWH 301
F W
Sbjct: 643 AFVWQ 647
>gi|388496634|gb|AFK36383.1| unknown [Medicago truncatula]
Length = 316
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 31/88 (35%), Positives = 44/88 (50%), Gaps = 10/88 (11%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIP------DFISHY 86
+ASE +A + S T DP +AD FF+P ++ G P IS
Sbjct: 133 FASEVAIHRALLTSDVRTFDPYEADFFFVPVYVS----CNFSTVNGFPAIGHARSLISSA 188
Query: 87 IFNISQKYPYWNRTGGADHFYVACHSIG 114
+ IS +YP+WNR+ G+DH +VA H G
Sbjct: 189 VKLISTEYPFWNRSTGSDHVFVASHDFG 216
>gi|339247053|ref|XP_003375160.1| exostosin-1 [Trichinella spiralis]
gi|316971539|gb|EFV55297.1| exostosin-1 [Trichinella spiralis]
Length = 1019
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 74/303 (24%), Positives = 118/303 (38%), Gaps = 70/303 (23%)
Query: 1 MNRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADL 58
++R F+VY+YP N D E + S SY K + +S + T DP +A L
Sbjct: 377 IDRPFKVYIYPDISN---------FDEESK---TSASYRKILQILRQSKYFTDDPDQACL 424
Query: 59 FFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYP-----YWN--RTGGADHFYVACH 111
F L + T + Y+ N++ K WN + G +H +
Sbjct: 425 FVLSYD-----------TLSRDSLSAEYVENMNAKIKSLPTNLWNNGKVSGMNHLIFNLY 473
Query: 112 SIGRSAMQKAWEVKLNAIQVVCSSSYFISGHI-AHKDVSLPQI----------------- 153
S G E+ Q + + + F + H +H D+SL
Sbjct: 474 S-GTWPDYDLTELGFEPGQAILAKASFSTRHYRSHFDISLALFHDILPLRGLNATDVENV 532
Query: 154 ---WPRQEDPPKL-----------GSSKRNKLAFFAGAVNSPVRE--KLLQVW-RNDSEI 196
WPR L GS RN L A + + K + W +N E
Sbjct: 533 DLNWPRSNWSYTLVFKGKRYVFGIGSETRNALYHLHNAKDIIMLTTCKHGKDWMKNQDER 592
Query: 197 YAHSGRLKTP--YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV 254
+ L Y + + SKFCL +G + + R ++L GC+PVI++N + LPF++V
Sbjct: 593 CSIDNDLYDNWNYEELMANSKFCLVPRGRRLGSFRFLEALEKGCIPVILSNDWVLPFSEV 652
Query: 255 LNW 257
++W
Sbjct: 653 IDW 655
>gi|195431447|ref|XP_002063753.1| GK15839 [Drosophila willistoni]
gi|194159838|gb|EDW74739.1| GK15839 [Drosophila willistoni]
Length = 776
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 78/326 (23%), Positives = 127/326 (38%), Gaps = 59/326 (18%)
Query: 5 FRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFLP 62
F VYVYP P++ S +Y K A +S + T DPS A LF L
Sbjct: 111 FLVYVYPPE----------PLNSLGAAPPTSANYQKILTAIQESRYYTTDPSIACLFVLA 160
Query: 63 FSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYV----------ACHS 112
DR +E D++ + ++ + P+WN G +H A HS
Sbjct: 161 IDTL----DRDSLSE---DYVRNVPSRLA-RLPHWN--NGRNHIIFNLYSGTWPDYAEHS 210
Query: 113 IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLA 172
+G A + A K + + + +S + HK L ++K+ LA
Sbjct: 211 LGFDAGE-AILAKASMSVLQLRPGFDVSIPLFHKQFPLRAGATGSVQSNNFPANKKYLLA 269
Query: 173 F----FAGAVNSPVREKLLQVWRNDSEIYAHSGR-------LKTPYADE----------- 210
F + + S R L + + + R L+ DE
Sbjct: 270 FKGKRYVHGIGSETRNSLFHLHNGRDMVLVTTCRHGKSWRELQDNRCDEDNREYDRYDYE 329
Query: 211 --LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLD 268
L S FCL +G + + R ++L GC+PV+++N + LPF ++WK +I
Sbjct: 330 TLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIWADERL 389
Query: 269 IPLLKKILKGISSEEYLLL--QSYVL 292
+ + I++ IS+E L Q+ VL
Sbjct: 390 LLQVPDIVRSISAERIFALRQQTQVL 415
>gi|156362595|ref|XP_001625861.1| predicted protein [Nematostella vectensis]
gi|156212714|gb|EDO33761.1| predicted protein [Nematostella vectensis]
Length = 728
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 77/357 (21%), Positives = 144/357 (40%), Gaps = 77/357 (21%)
Query: 5 FRVYVYP----HRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFF 60
FR+YVYP + + + N++ K +S + T+ P +A LF
Sbjct: 101 FRIYVYPVHPGAKLSTTYTNII------------------KVIKESRYYTEFPEEACLFI 142
Query: 61 LPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQK 120
+I + D+ D++ H I+N ++ PYW G +H + + +
Sbjct: 143 T--AIDTLDRDKLSA-----DYV-HNIYNKIRQLPYWK--NGENH--IIFNLFAGTWPDY 190
Query: 121 AWEVKLNAIQVVCSSSYFISGHIAHK-DVSLPQIWPRQEDPPKLGS----------SKRN 169
+ +V + + + + S I DVSLP ++P+ LG+ ++
Sbjct: 191 SEDVGFDFGKAILVKASLSSDLIRPGFDVSLP-LFPKTHPHKDLGNLPHSCSAFPLERKY 249
Query: 170 KLAF----FAGAVNSPVREKLLQVWRNDSEIY----AHSGRLKTPYADE----------- 210
KLAF + + S R L + N +I G+ + DE
Sbjct: 250 KLAFKGKRYLNGIGSESRNALYHI-HNGRDIVLLTTCKHGKAWHKHKDERCDGDNALYDR 308
Query: 211 ------LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV 264
LL + FCL +G + + R +SL GC+P ++++ ++LPFA+V++WK I
Sbjct: 309 YSYDELLLNATFCLVPRGRRLGSFRFLESLKVGCIPFLLSDGWELPFAEVIDWKKAVIDG 368
Query: 265 ATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
+ + + I++ S + L ++ L F W+ + S + E+ R
Sbjct: 369 SERLLMQVPGIVRSYSRSQVLAMKQQSL-----FLWNAYFSSVEKIVHTTIEVIRSR 420
>gi|380027202|ref|XP_003697318.1| PREDICTED: exostosin-1-like [Apis florea]
Length = 711
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 79/355 (22%), Positives = 142/355 (40%), Gaps = 80/355 (22%)
Query: 3 RSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLP 62
+ F VYVYP +V+ P+ ++ N +ES + T DP++A +F L
Sbjct: 87 QGFTVYVYP------IEDVISPL-YQKILNVITES---------RYYTSDPTRACIFVLA 130
Query: 63 FSIARMRHDRR-IGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKA 121
DR + TE + + S + + PYWN G +H +S +
Sbjct: 131 LDTL----DRDPLSTEFVHNLPSKLM-----RLPYWN--NGRNHLIFNLYSGTWPDYAEE 179
Query: 122 WEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKR--------NK--L 171
++ +S I H D+S+P ++ P + G S + NK +
Sbjct: 180 SLAFDLGYAMLAKASMSIFRHRPEFDISIPLF--GKQHPERGGESGQALENNFPNNKKYI 237
Query: 172 AFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR-------LKTPYADE--------- 210
A F G + S R L + ++ + R L+ + +
Sbjct: 238 AAFKGKRYVHGIGSETRNALYHLHNGKDLVFVTTCRHGKAWRELQDEHCQQDNQEYDMYD 297
Query: 211 ----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKS---FSIV 263
L+ + FCL +G + + R ++L GC+PVI++N + LPF + ++W FS
Sbjct: 298 YEILLMNATFCLVPRGRRLGSFRFLEALRAGCIPVILSNGWALPFHERIDWTQAVIFSDE 357
Query: 264 VATLDIPLLKKILKGISSEEYLLLQSYVLKVRK--HFQWHVFPSDYDAFYMVMYE 316
L IP I++ +S+ ++LK+R+ F W + S + ++E
Sbjct: 358 RLLLQIP---DIVRSVSN-------VHILKLRQQTQFLWERYFSSIEKIVFTVFE 402
>gi|281347217|gb|EFB22801.1| hypothetical protein PANDA_019397 [Ailuropoda melanoleuca]
Length = 360
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 23/52 (44%), Positives = 35/52 (67%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWK 258
Y L + FC+ ++G + A ++D L GCVPV+IA+ Y LPF++VL+WK
Sbjct: 308 YPQVLQEAIFCVVLRGARLGQAVLSDVLRAGCVPVVIADSYILPFSEVLDWK 359
>gi|302835519|ref|XP_002949321.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
gi|300265623|gb|EFJ49814.1| acetylglucosaminyltransferase [Volvox carteri f. nagariensis]
Length = 855
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 78/338 (23%), Positives = 127/338 (37%), Gaps = 72/338 (21%)
Query: 32 NYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIA----RMRHDRRIGTEGIPDFISHYI 87
Y+ E+YF + S T DP +AD F++P + + G P Y
Sbjct: 444 TYSVEAYFHEVLSISPHRTFDPEEADFFYVPVYYTCWMWPINGWADMPFYGAPTSWHRYS 503
Query: 88 FN----------ISQKYPYWNRTGGADHFYVACHSIG----------RSAMQKAW-EVKL 126
I +P+W+R GG DH ++ H G S M W + L
Sbjct: 504 NAANLWLKAKTWIQSNFPFWDRRGGRDHIWMTNHDEGACYMPTEIYQTSIMLTHWGRMDL 563
Query: 127 NAIQVVC------SSSYFISGHIAHKDVSL-----PQIWPRQE-------------DPPK 162
N S G + KDV P PR++ P
Sbjct: 564 NHTSNTAYRPDNYSDGITWKGVLDGKDVKTLYQGHPCYDPRKDLVIPAFKTPDHFSQSPL 623
Query: 163 LGS--SKRNKLAFFAGAVNS--------PVREKLLQV-----WRNDSEIY-AHSGRLKTP 206
LGS +R+ L + G V +R+KL ++ W I+ ++
Sbjct: 624 LGSWPRQRDILLYLRGDVGKHREPNYSRGIRQKLYKLAVDNEWAKKHRIFIGEQFEIQGS 683
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y + L S FC V G + + R D++ +GC+P+II ++ + F +++ SFS+ ++
Sbjct: 684 YGEHLSRSLFCAVVPG-DGYSPRFEDAVLHGCLPLIIVDNTHVLFESIIDVDSFSLRISE 742
Query: 267 LDI-PLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVF 303
+ L +L IS ++ +Q R WH F
Sbjct: 743 AALNEYLPHLLTAISPDQIARMQR-----RLSLVWHRF 775
>gi|432882349|ref|XP_004073986.1| PREDICTED: exostosin-1b-like [Oryzias latipes]
Length = 429
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 25/111 (22%), Positives = 56/111 (50%), Gaps = 5/111 (4%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y + L S FCL +G + + R ++L CVPV+++N ++LPF+++++W +++
Sbjct: 7 YKEMLHNSTFCLVPRGRRLGSFRFLEALQAACVPVVLSNGWELPFSEIIDWNRATVIGDE 66
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
+ + ++ I ++ L L+ + F W + S + + E+
Sbjct: 67 RLLLQIPTTVRSIHPDKIL-----SLRQQTQFLWEAYFSSVEKIVLTTLEI 112
>gi|302791425|ref|XP_002977479.1| galactosyltransferase-like protein [Selaginella moellendorffii]
gi|300154849|gb|EFJ21483.1| galactosyltransferase-like protein [Selaginella moellendorffii]
Length = 466
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 71/305 (23%), Positives = 109/305 (35%), Gaps = 46/305 (15%)
Query: 32 NYASESYFKKAFMKSHFVTKDPSKADLFFLPF----SIARMRHDRRIGTEGIPDFISHYI 87
++ E F + + T DP A FF+PF I+R + + +
Sbjct: 133 QFSGEIIFHRRMLDHRCRTLDPDGASAFFVPFYAGLDISRNLWASGKSSSDVDSLGEQLL 192
Query: 88 FNISQKYPYWNRTGGADHFYVACHSIGR--------SAMQKAWEVKLNAIQVVCSSSYFI 139
+ +++P++NR+GGADHF VA GR + W L + S +
Sbjct: 193 HWLQRQHPHFNRSGGADHFLVA----GRISWDFRRMPSAAGEWGSSLFHQIEMRSVKRLV 248
Query: 140 SGHIAHKDVSLPQIWPRQEDPPK----------LGSSKRNKLAFFAGAVNSPVREKLLQV 189
D L +P P + S R L FAG+ R QV
Sbjct: 249 IERNPWDDSELGVPYPTSFHPSSDEDLAQWVEFVQGSPRPHLVAFAGSPRPGYRSDFRQV 308
Query: 190 W--------RNDSEIYAHSGRLKTPYADEL------LGSKFCLHVKGFEVNTARIADSLY 235
R S + +D L L S FCL +G + DSL
Sbjct: 309 LLGQCRAAPRGISRCLDCTADTAGCTSDPLRVTKLFLSSVFCLQPRGDSFTRKSLFDSLI 368
Query: 236 YGCVPVIIAN-----HYDLPFA-DVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQS 289
GC+PV+ N Y+L D + F + + + +L+GIS E +Q
Sbjct: 369 SGCIPVLFWNQSAYWQYELYLPRDPEEYSVFIPHQSVKNGTNVLDVLQGISRERIGRMQR 428
Query: 290 YVLKV 294
VL++
Sbjct: 429 AVLRI 433
>gi|159483755|ref|XP_001699926.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158281868|gb|EDP07622.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 785
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 40/140 (28%), Positives = 69/140 (49%), Gaps = 9/140 (6%)
Query: 168 RNKLAFFAG------AVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVK 221
R+ L +F G A ++ VR+ LL ++ N + A K + +L S+FC
Sbjct: 619 RDTLFYFNGFTKPDLAYSAGVRQGLLALFGNSTR--ADLSINKGGGSQRMLRSRFCFTPM 676
Query: 222 GFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISS 281
GF R++ ++ GCVP+++ +H DVL ++ FSI V+ ++ L L+ I+
Sbjct: 677 GFGWGI-RLSQAMLTGCVPIMVHDHVWPTLWDVLPYEQFSIRVSRHNMYRLLDYLESITP 735
Query: 282 EEYLLLQSYVLKVRKHFQWH 301
++ LQ V + K F W
Sbjct: 736 QQLARLQDGVAQWHKAFVWQ 755
>gi|358335371|dbj|GAA53901.1| exostosin-2 [Clonorchis sinensis]
Length = 1030
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 47/95 (49%)
Query: 197 YAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLN 256
Y R Y + L S FCL ++ V + DS+ GC+PVI +++ LPF++VL+
Sbjct: 683 YDPPSRTIVDYGEALARSLFCLIIQIPPVGQFALFDSMNAGCIPVIADDNFILPFSEVLD 742
Query: 257 WKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYV 291
W +I V ++ + L +SEE Q V
Sbjct: 743 WSKIAIRVRHSELHKIVTTLTSFTSEEIAQFQRQV 777
>gi|307108297|gb|EFN56537.1| hypothetical protein CHLNCDRAFT_51514 [Chlorella variabilis]
Length = 810
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 37/134 (27%), Positives = 66/134 (49%), Gaps = 16/134 (11%)
Query: 161 PKLGSSKRNK--LAFFAGAV---NSPVREKLLQVWRNDSE----------IYAHSGRLKT 205
P LG++ R++ LAF G V N P L Q N S + ++
Sbjct: 585 PLLGATPRSRTWLAFHRGRVQQDNPPYSRGLRQRLANASAAGGWLEKHKIVMGEHDVVEG 644
Query: 206 PYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVA 265
Y++ L S FCL + G + +AR+ D++ +GC+PVII ++ + F +L+ +F++ +
Sbjct: 645 DYSELLASSTFCLVLPG-DGWSARMDDAMLHGCIPVIIMDNVHVSFESILDLAAFTVRIP 703
Query: 266 TLDIPLLKKILKGI 279
D L ++L +
Sbjct: 704 QADAERLPEVLAAV 717
Score = 40.0 bits (92), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 26/91 (28%), Positives = 39/91 (42%), Gaps = 15/91 (16%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-------FSIARMRHDRRIGTEGIPDFISH 85
YA+ES F + ++S T DP +AD ++P + R R G+ + H
Sbjct: 381 YAAESGFLEMLLQSEHRTLDPEEADFLYVPAYTSCLITPVQRTADSLRDMWYGVENLRVH 440
Query: 86 YIFN--------ISQKYPYWNRTGGADHFYV 108
+ I PYWNR GG DH ++
Sbjct: 441 AATHMLLEAYYWIKAHAPYWNRRGGWDHIWL 471
>gi|440797580|gb|ELR18663.1| exostosin, putative [Acanthamoeba castellanii str. Neff]
Length = 423
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 56/276 (20%), Positives = 104/276 (37%), Gaps = 53/276 (19%)
Query: 40 KKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNR 99
+ A S + +DP KA + F ++ +GT S I + P W R
Sbjct: 99 RAALRASKYFVEDPEKACVLVPNFEHTLASNE--MGT-------SLVIAQTLRSLPTWER 149
Query: 100 TGGA-----------------DHFYVACHSIGRSA--MQKAWEVKLNAIQVVCSSSYFIS 140
GG D Y +G S + A+++ LN C
Sbjct: 150 YGGPGFNHLLFNKHDDVGVEYDPAYAMVAKVGWSTGHYRPAFDISLNP---PCGK----- 201
Query: 141 GHIAHKDVS---LPQIWPRQEDPPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRND---- 193
G KD + +P W P + L F N P+R + + + +
Sbjct: 202 GRPGLKDAAGHVVPTWWAN----PDRKRENKYFLTFLGTMRNYPLRRAIAERFHDPDNGV 257
Query: 194 ------SEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHY 247
E ++ Y D L ++F L +G + + R +++ G +PVI+ + Y
Sbjct: 258 IIQTSVEEQIGGKPSVEVEYLDTLFHTQFTLCPRGRALYSYRTTEAIAAGAIPVILGDGY 317
Query: 248 DLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEE 283
P+ ++++W+SF++++ + +L+ +SEE
Sbjct: 318 AFPYNELIDWRSFAVILPESSWETMMDVLRSFTSEE 353
>gi|383847108|ref|XP_003699197.1| PREDICTED: exostosin-1-like [Megachile rotundata]
Length = 711
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 77/350 (22%), Positives = 136/350 (38%), Gaps = 72/350 (20%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLPF 63
F VYVYP +V+ P+ ++ N +ES + T DP++A +F L
Sbjct: 88 GFTVYVYP------VEDVISPL-YQKILNVITES---------RYYTSDPARACIFVLAL 131
Query: 64 SIARMRHDRR-IGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAW 122
DR + TE I + + + + PYWN G +H +S +
Sbjct: 132 DTL----DRDPLSTEFIHNLPAKLM-----RLPYWN--NGRNHLIFNLYSGTWPDYAEES 180
Query: 123 EVKLNAIQVVCSSSYFISGHIAHKDVSLPQIW---------PRQEDPPKLGSSKRNKLAF 173
++ +S I H + DVS+P P Q +SK+ AF
Sbjct: 181 LAFDIGYAMLAKASMSIFRHRPNFDVSIPLFGKQHPERGGEPGQALENNFPNSKKYVAAF 240
Query: 174 ----FAGAVNSPVREKLLQVWRNDSEIYAHSGR-------LKTPYADE------------ 210
+ + S R L + ++ + R L+ + +
Sbjct: 241 KGKRYVHGIGSETRNALYHLHNGKDLVFVTTCRHGKAWRELQDEHCQQDNQEYDMYDYEI 300
Query: 211 -LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKS---FSIVVAT 266
L+ + FCL +G + + R ++L GC+PVI++N + LPF + ++W FS
Sbjct: 301 LLMNATFCLVPRGRRLGSFRFLEALRAGCIPVILSNGWALPFHERIDWTQAVIFSDERLL 360
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYE 316
L IP I++ +S+ +Q L+ + F W + S + ++E
Sbjct: 361 LQIP---DIVRSVSN-----VQILKLRQQTQFLWERYFSSIEKIVFTVFE 402
>gi|170583869|ref|XP_001896766.1| exostosin-1 [Brugia malayi]
gi|158595933|gb|EDP34387.1| exostosin-1, putative [Brugia malayi]
Length = 411
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 20/76 (26%), Positives = 44/76 (57%)
Query: 213 GSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLL 272
S FCL +G + + R ++L GC+PV++++ ++LPF++V++W+ I+ + +
Sbjct: 158 NSTFCLTPRGRRLGSFRFLEALRLGCIPVVLSDDWELPFSEVIDWRQAVIIGHEDTVLTI 217
Query: 273 KKILKGISSEEYLLLQ 288
+L I + L ++
Sbjct: 218 SDVLNAIPLDRILFMK 233
>gi|340727245|ref|XP_003401958.1| PREDICTED: exostosin-2-like isoform 2 [Bombus terrestris]
Length = 710
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 23/82 (28%), Positives = 46/82 (56%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y + L + FCL ++G + + + + + G +P+IIA+ +P+ +++W +I V
Sbjct: 301 YPNVLQTATFCLVIRGARLGQSTLLECMAAGSIPIIIADSLTMPYHGIIDWSRAAIFVRE 360
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+DI + +LK IS + + LQ
Sbjct: 361 VDILSIISVLKKISPQRIIELQ 382
>gi|350423159|ref|XP_003493402.1| PREDICTED: exostosin-2-like [Bombus impatiens]
Length = 708
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 23/82 (28%), Positives = 46/82 (56%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y + L + FCL ++G + + + + + G +P+IIA+ +P+ +++W +I V
Sbjct: 301 YPNVLQTATFCLVIRGARLGQSTLLECMAAGSIPIIIADSLTMPYHGIIDWSRAAIFVRE 360
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+DI + +LK IS + + LQ
Sbjct: 361 VDILSIISVLKKISPQRIIELQ 382
>gi|340727243|ref|XP_003401957.1| PREDICTED: exostosin-2-like isoform 1 [Bombus terrestris]
Length = 708
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 23/82 (28%), Positives = 46/82 (56%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y + L + FCL ++G + + + + + G +P+IIA+ +P+ +++W +I V
Sbjct: 301 YPNVLQTATFCLVIRGARLGQSTLLECMAAGSIPIIIADSLTMPYHGIIDWSRAAIFVRE 360
Query: 267 LDIPLLKKILKGISSEEYLLLQ 288
+DI + +LK IS + + LQ
Sbjct: 361 VDILSIISVLKKISPQRIIELQ 382
>gi|195381971|ref|XP_002049706.1| GJ20607 [Drosophila virilis]
gi|194144503|gb|EDW60899.1| GJ20607 [Drosophila virilis]
Length = 757
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 77/326 (23%), Positives = 128/326 (39%), Gaps = 59/326 (18%)
Query: 5 FRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFLP 62
F VYVYP P++ +S +Y K A +S + T DP+ A LF L
Sbjct: 100 FLVYVYPPE----------PLNSLGAAPPSSANYQKILTAIQESRYHTSDPAAACLFVLG 149
Query: 63 FSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYV----------ACHS 112
DR +E D++ + ++ + P+WN G +H A HS
Sbjct: 150 IDTL----DRDSLSE---DYVRNVPSRLA-RLPHWN--NGRNHIIFNLYSGTWPDYAEHS 199
Query: 113 IGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLA 172
+G A + A K + + + +S + HK L ++K+ LA
Sbjct: 200 LGFDAGE-AILAKASMSVLQMRHGFDVSIPLFHKQFPLRAGATGSVQSNNFPANKKYLLA 258
Query: 173 F----FAGAVNSPVREKLLQVWRNDSEIYAHSGR-------LKTPYADE----------- 210
F + + S R L + + + R L+ DE
Sbjct: 259 FKGKRYVHGIGSETRNSLFHLHNGRDMVMVTTCRHGKSWRELQDNRCDEDNREYDRYDYE 318
Query: 211 --LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLD 268
L S FCL +G + + R ++L GC+PV+++N + LPF ++WK +I
Sbjct: 319 TLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIWADERL 378
Query: 269 IPLLKKILKGISSEEYLLL--QSYVL 292
+ + I++ IS+E L Q+ VL
Sbjct: 379 LLQVPDIVRSISAERIFALRQQTQVL 404
>gi|198457239|ref|XP_001360599.2| GA10084 [Drosophila pseudoobscura pseudoobscura]
gi|198135910|gb|EAL25174.2| GA10084 [Drosophila pseudoobscura pseudoobscura]
Length = 765
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 77/328 (23%), Positives = 130/328 (39%), Gaps = 63/328 (19%)
Query: 5 FRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFLP 62
F VYVYP P++ S +Y K A +S + T DPS A LF L
Sbjct: 108 FLVYVYPPE----------PLNSLGAAPPTSANYQKILTAIQESRYHTSDPSAACLFVLG 157
Query: 63 FSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAW 122
DR +E D++ + ++ + P+WN G +H +S G
Sbjct: 158 MDTL----DRDSLSE---DYVRNVPSRLA-RLPHWN--NGRNHIIFNLYS-GTWPDYAEN 206
Query: 123 EVKLNAIQVVCSSSYFISGHIAHK-DVSLPQIWPRQEDPPKLGSS-----------KRNK 170
+ +A + + + + H DVS+P ++ P + G++ K+
Sbjct: 207 SLGFDAGHAILAKASMSVLQVRHGFDVSIPLF--HKQFPLRAGATGSVQSNNFPANKKYL 264
Query: 171 LAF----FAGAVNSPVREKLLQVWRNDSEIYAHSGR-------LKTPYADE--------- 210
LAF + + S R L + + + R L+ DE
Sbjct: 265 LAFKGKRYVHGIGSETRNSLFHLHNGRDMVLVTTCRHGKSWRELQDNRCDEDNREYDRYD 324
Query: 211 ----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
L S FCL +G + + R ++L GC+PV+++N + LPF ++WK +I
Sbjct: 325 YETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIWADE 384
Query: 267 LDIPLLKKILKGISSEEYLLL--QSYVL 292
+ + I++ IS+E L Q+ VL
Sbjct: 385 RLLLQVPDIVRSISAERIFALRQQTQVL 412
>gi|357628255|gb|EHJ77645.1| hypothetical protein KGM_04618 [Danaus plexippus]
Length = 655
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 68/328 (20%), Positives = 125/328 (38%), Gaps = 50/328 (15%)
Query: 29 PRGNYASESYFK--KAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHY 86
P S +Y K +S + T DP++A LF ++ + D + E IPD S
Sbjct: 20 PTDGPVSATYRKVLSVVRESRYATHDPAEACLFIP--AVDTLDADP-LSPEHIPDVASRL 76
Query: 87 IFNISQKYPYWNRTGGADHFYVACHS-----IGRSAMQ----KAWEVKLNAIQVVCSSSY 137
+ PYW G +H ++ A+ A + +A + + +
Sbjct: 77 -----SRLPYWK--NGRNHLLFNLYAGTWPDYAEGALGFDPGDAILARASASETIFRDGF 129
Query: 138 FISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAF----FAGAVNSPVREKLLQVWRND 193
IS + HK+ P + +++ LAF + + S R L + +
Sbjct: 130 DISLPLFHKEHPERGGVPPSATGNPFPAPRKHLLAFKGKRYVHGIGSETRNSLWHLHDGN 189
Query: 194 SEIYAHSGR-------LKTPYADE-------------LLGSKFCLHVKGFEVNTARIADS 233
+ I + R L+ DE L S FCL +G + + R ++
Sbjct: 190 NLILVTTCRHGKSWKDLRDERCDEDNREYDKFDYEQLLANSTFCLVARGRRLGSYRFLEA 249
Query: 234 LYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK 293
L GCVPV+++N + LPF + ++W+ I + + ++++ + E L L+
Sbjct: 250 LAAGCVPVLLSNGWRLPFDERIDWRRAVIWADERLLLQVPELVRSVPPERIL-----ALR 304
Query: 294 VRKHFQWHVFPSDYDAFYMVMYELWLRR 321
+ W + S + E+ L R
Sbjct: 305 QQTQLLWEQYFSSIEKIVFTTIEILLER 332
>gi|195025057|ref|XP_001985992.1| GH20787 [Drosophila grimshawi]
gi|193901992|gb|EDW00859.1| GH20787 [Drosophila grimshawi]
Length = 754
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 77/329 (23%), Positives = 124/329 (37%), Gaps = 65/329 (19%)
Query: 5 FRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFLP 62
F VYVYP P++ +S +Y K A +S + T DP+ A LF L
Sbjct: 98 FLVYVYPPE----------PLNSLGAAPPSSANYQKILTAIQESRYHTSDPAAACLFVL- 146
Query: 63 FSIARMRHDRRIGTEGIPDFISHYIFNISQKY---PYWNRTGGADHFYV----------A 109
I T Y+ N+ + P+WN G +H A
Sbjct: 147 ----------GIDTLDRDSLSDDYVRNVPSRLARLPHWN--NGRNHIIFNLYSGTWPDYA 194
Query: 110 CHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRN 169
HS+G A + A K + + + +S + HK L ++K+
Sbjct: 195 EHSLGFDAGE-AILAKASMSVLQMRHGFDVSIPLFHKQFPLRAGATGSVQSNNFPANKKY 253
Query: 170 KLAF----FAGAVNSPVREKLLQVWRNDSEIYAHSGR-------LKTPYADE-------- 210
LAF + + S R L + + + R L+ DE
Sbjct: 254 LLAFKGKRYVHGIGSETRNSLFHLHNGRDMVMVTTCRHGKSWRELQDNRCDEDNREYDRY 313
Query: 211 -----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVA 265
L S FCL +G + + R ++L GC+PV+++N + LPF ++WK +I
Sbjct: 314 DYETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIWAD 373
Query: 266 TLDIPLLKKILKGISSEEYLLL--QSYVL 292
+ + I++ IS+E L Q+ VL
Sbjct: 374 ERLLLQVPDIVRSISAERIFALRQQTQVL 402
>gi|412988024|emb|CCO19420.1| exostosin-like glycosyltransferase [Bathycoccus prasinos]
Length = 909
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 42/156 (26%), Positives = 77/156 (49%), Gaps = 24/156 (15%)
Query: 165 SSKRNKLAFFAG--AVNSP--------VREKLLQVWRN--DSEIYAHSGRLKTPYADELL 212
+++R FF+G + P +R+K+ + W+N +I +H+ Y +
Sbjct: 695 ATQRTTFCFFSGNLGLEKPWGEDYSRGLRQKVARRWQNVYGFDILSHT----DDYLGRIR 750
Query: 213 GSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV-LNWKSFSIVVATLDIP- 270
SKFCL + G + + ++ + GC+PVI+ + D+P+ L++ FSI V D+
Sbjct: 751 SSKFCLALPG-DGWSGGLSVYIRNGCIPVIVQDGVDMPWEGTFLDYSKFSIRVREGDVEN 809
Query: 271 LLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSD 306
L+ +L+ ++ EE LQ+ + V WH F D
Sbjct: 810 RLQSVLETVTPEELQNLQNGLKNV-----WHFFSYD 840
>gi|302817463|ref|XP_002990407.1| galactosyltransferase-like protein [Selaginella moellendorffii]
gi|300141792|gb|EFJ08500.1| galactosyltransferase-like protein [Selaginella moellendorffii]
Length = 530
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 72/311 (23%), Positives = 127/311 (40%), Gaps = 48/311 (15%)
Query: 27 FEPRGNYASESY-----FKKAFMKSHFVTKDPSKADLFFLPF----SIARMRHDRRIGTE 77
F RG YA+ + F + +T D S A F+P+ I+R + T
Sbjct: 146 FSDRGWYATNQFAVEVIFHNRMKQYDCLTNDSSIAAAIFVPYYAGLDISRYLWGVNVSTR 205
Query: 78 GIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSI----GRSAMQKAWEVKLNAI-QVV 132
D + + + + P W R GG DHF VA ++ + W KL I +V
Sbjct: 206 ---DSGALRMVDWLVRQPEWRRMGGRDHFMVAGRITWDFRRKTEKEDDWGNKLFIIPEVK 262
Query: 133 CSSSYFISGHIAH-KDVSLP---QIWPRQED-----PPKLGSSKRNKLAFFAGA----VN 179
+S I H D ++P P Q+ ++ +R L FAGA +
Sbjct: 263 NITSLVIEASPWHFNDFAIPYPTYFHPTQDSDVVDWQVRMRGMERPALFSFAGAPRQQLR 322
Query: 180 SPVREKLLQVWRNDSE---IYAHSGRLKTPYADELLG----SKFCLHVKGFEVNTARIAD 232
+RE+++ R + + G K ++ S FCL +G I D
Sbjct: 323 KSIRERIMDQCRESPQCKLLECDFGESKCHVPSAVMKLFEESVFCLQPQGDSFTRRSIFD 382
Query: 233 SLYYGCVPVIIANHYDLPFADVL-----NWKSFSIVVATLDIPL----LKKILKGISSEE 283
S+ GC+PV H D ++ + N + +S+ ++ +DI ++ +L+ I ++E
Sbjct: 383 SMLAGCIPVFF--HPDSAYSQFVWHLPRNHRKYSVFISEIDIRRGNVSIESVLRQIPADE 440
Query: 284 YLLLQSYVLKV 294
L ++ V+++
Sbjct: 441 VLRMREEVIQL 451
>gi|195334376|ref|XP_002033859.1| GM21554 [Drosophila sechellia]
gi|194125829|gb|EDW47872.1| GM21554 [Drosophila sechellia]
Length = 760
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 75/328 (22%), Positives = 130/328 (39%), Gaps = 63/328 (19%)
Query: 5 FRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFLP 62
F VY+YP P++ S +Y K A +S + T DP+ A LF L
Sbjct: 104 FLVYIYPPE----------PLNSLGAAPPTSANYQKILTAIQESRYYTSDPTAACLFVLG 153
Query: 63 FSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAW 122
DR +E D++ + ++ + PYWN G +H +S G
Sbjct: 154 IDTL----DRDSLSE---DYVRNVPSRLA-RLPYWN--NGRNHIIFNLYS-GTWPDYAEN 202
Query: 123 EVKLNAIQVVCSSSYFISGHIAHK-DVSLPQIWPRQEDPPKLGSS-----------KRNK 170
+ +A + + + + + H DVS+P ++ P + G++ K+
Sbjct: 203 SLGFDAGEAILAKASMGVLQLRHGFDVSIPLF--HKQFPLRAGATGTVQSNNFPANKKYL 260
Query: 171 LAF----FAGAVNSPVREKLLQVWRNDSEIYAHSGR-------LKTPYADE--------- 210
LAF + + S R L + + + R L+ DE
Sbjct: 261 LAFKGKRYVHGIGSETRNSLFHLHNGRDMVLVTTCRHGKSWRELQDNRCDEDNREYDRYD 320
Query: 211 ----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
L S FCL +G + + R ++L GC+PV+++N + LPF ++WK +I
Sbjct: 321 YETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIWADE 380
Query: 267 LDIPLLKKILKGISSEEYLLL--QSYVL 292
+ + I++ I +E L Q+ VL
Sbjct: 381 RLLLQVPDIVRSIPAERIFALRQQTQVL 408
>gi|3435314|gb|AAC32397.1| putative tumor suppressor homolog [Drosophila melanogaster]
Length = 760
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 75/355 (21%), Positives = 136/355 (38%), Gaps = 66/355 (18%)
Query: 5 FRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFLP 62
F VY+YP P++ S +Y K A +S + T DP+ A LF L
Sbjct: 104 FLVYIYPPE----------PLNSLGAAPPTSANYQKILTAIQESRYYTSDPTAACLFVLG 153
Query: 63 FSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAW 122
DR +E D++ + ++ + PYWN G +H +S G
Sbjct: 154 IDTL----DRDSLSE---DYVRNVPSRLA-RLPYWN--NGRNHIIFNLYS-GTWPDYAEN 202
Query: 123 EVKLNAIQVVCSSSYFISGHIAHK-DVSLPQIWPRQEDPPKLGSS-----------KRNK 170
+ +A + + + + + H DVS+P ++ P + G++ K+
Sbjct: 203 SLGFDAGEAILAKASMGVLQLRHGFDVSIPLF--HKQFPLRAGATGTVQSNNFPANKKYL 260
Query: 171 LAF----FAGAVNSPVREKLLQVWRNDSEIYAHSGR-------LKTPYADE--------- 210
LAF + + S R L + + + R L+ DE
Sbjct: 261 LAFKGKRYVHGIGSETRNSLFHLHNGRDMVLVTTCRHGKSWRELQDNRCDEDNREYDRYD 320
Query: 211 ----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
L S FCL +G + + R ++L GC+PV+++N + LPF ++WK +I
Sbjct: 321 YETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIWADE 380
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRR 321
+ + I++ I +E + L+ + W + + +E+ R
Sbjct: 381 RLLLQVPDIVRSIPAERI-----FALRQQTQVLWERYFGSIEKIVFTTFEIIRER 430
>gi|302812249|ref|XP_002987812.1| galactosyltransferase-like protein [Selaginella moellendorffii]
gi|300144431|gb|EFJ11115.1| galactosyltransferase-like protein [Selaginella moellendorffii]
Length = 530
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 72/311 (23%), Positives = 127/311 (40%), Gaps = 48/311 (15%)
Query: 27 FEPRGNYASESY-----FKKAFMKSHFVTKDPSKADLFFLPF----SIARMRHDRRIGTE 77
F RG YA+ + F + +T D S A F+P+ I+R + T
Sbjct: 146 FSDRGWYATNQFAVEVIFHNRMKQYDCLTNDSSIAAAIFVPYYAGLDISRYLWGVNVSTR 205
Query: 78 GIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSI----GRSAMQKAWEVKLNAI-QVV 132
D + + + + P W R GG DHF VA ++ + W KL I +V
Sbjct: 206 ---DSGALRMVDWLVRQPEWRRMGGRDHFMVAGRITWDFRRKTEKEDDWGNKLFIIPEVK 262
Query: 133 CSSSYFISGHIAH-KDVSLP---QIWPRQED-----PPKLGSSKRNKLAFFAGA----VN 179
+S I H D ++P P Q+ ++ +R L FAGA +
Sbjct: 263 NITSLVIEASPWHFNDFAIPYPTYFHPTQDSDVVDWQVRMRGMERPALFSFAGAPRQQLR 322
Query: 180 SPVREKLLQVWRNDSE---IYAHSGRLKTPYADELLG----SKFCLHVKGFEVNTARIAD 232
+RE+++ R + + G K ++ S FCL +G I D
Sbjct: 323 KSIRERIMDQCRESPQCKLLECDFGESKCHVPSAVMKLFEESVFCLQPQGDSFTRRSIFD 382
Query: 233 SLYYGCVPVIIANHYDLPFADVL-----NWKSFSIVVATLDIPL----LKKILKGISSEE 283
S+ GC+PV H D ++ + N + +S+ ++ +DI ++ +L+ I ++E
Sbjct: 383 SMLAGCIPVFF--HPDSAYSQFVWHLPRNHRKYSVFISEIDIRRGNVSIESVLRQIPADE 440
Query: 284 YLLLQSYVLKV 294
L ++ V+++
Sbjct: 441 VLRMREEVIQL 451
>gi|195583406|ref|XP_002081513.1| GD11060 [Drosophila simulans]
gi|194193522|gb|EDX07098.1| GD11060 [Drosophila simulans]
Length = 760
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 75/328 (22%), Positives = 130/328 (39%), Gaps = 63/328 (19%)
Query: 5 FRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFLP 62
F VY+YP P++ S +Y K A +S + T DP+ A LF L
Sbjct: 104 FLVYIYPPE----------PLNSLGAAPPTSANYQKILTAIQESRYYTSDPTAACLFVLG 153
Query: 63 FSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAW 122
DR +E D++ + ++ + PYWN G +H +S G
Sbjct: 154 IDTL----DRDSLSE---DYVRNVPSRLA-RLPYWN--NGRNHIIFNLYS-GTWPDYAEN 202
Query: 123 EVKLNAIQVVCSSSYFISGHIAHK-DVSLPQIWPRQEDPPKLGSS-----------KRNK 170
+ +A + + + + + H DVS+P ++ P + G++ K+
Sbjct: 203 SLGFDAGEAILAKASMGVLQLRHGFDVSIPLF--HKQFPLRAGATGTVQSNNFPANKKYL 260
Query: 171 LAF----FAGAVNSPVREKLLQVWRNDSEIYAHSGR-------LKTPYADE--------- 210
LAF + + S R L + + + R L+ DE
Sbjct: 261 LAFKGKRYVHGIGSETRNSLFHLHNGRDMVLVTTCRHGKSWRELQDNRCDEDNREYDRYD 320
Query: 211 ----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
L S FCL +G + + R ++L GC+PV+++N + LPF ++WK +I
Sbjct: 321 YETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIWADE 380
Query: 267 LDIPLLKKILKGISSEEYLLL--QSYVL 292
+ + I++ I +E L Q+ VL
Sbjct: 381 RLLLQVPDIVRSIPAERIFALRQQTQVL 408
>gi|66517433|ref|XP_391845.2| PREDICTED: exostosin-1 [Apis mellifera]
Length = 711
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 76/355 (21%), Positives = 140/355 (39%), Gaps = 80/355 (22%)
Query: 3 RSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLP 62
+ F VYVYP +V+ P+ ++ N +ES + T DP++A +F L
Sbjct: 87 QGFTVYVYP------IEDVISPL-YQKILNVITES---------RYYTSDPTRACIFVLA 130
Query: 63 FSIARMRHDRR-IGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKA 121
DR + TE + + S + + PYWN G +H +S +
Sbjct: 131 LDTL----DRDPLSTEFVHNLPSKLM-----RLPYWN--NGRNHLIFNLYSGTWPDYAEE 179
Query: 122 WEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS----------SKRNKL 171
++ +S I H D+S+P ++ P + G + + +
Sbjct: 180 SLAFDLGYAMLAKASMSIFRHRPEFDISIPLF--GKQHPERGGEPGQALENNFPNNKKYI 237
Query: 172 AFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR-------LKTPYADE--------- 210
A F G + S R L + ++ + R L+ + +
Sbjct: 238 AAFKGKRYVHGIGSETRNALYHLHNGKDLVFVTTCRHGKAWRELQDEHCQQDNQEYDMYD 297
Query: 211 ----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKS---FSIV 263
L+ + FCL +G + + R ++L GC+PVI++N + LPF + ++W FS
Sbjct: 298 YEILLMNATFCLVPRGRRLGSFRFLEALRAGCIPVILSNGWALPFHERIDWTQAVIFSDE 357
Query: 264 VATLDIPLLKKILKGISSEEYLLLQSYVLKVRK--HFQWHVFPSDYDAFYMVMYE 316
L IP I++ +S+ ++LK+R+ F W + S + ++E
Sbjct: 358 RLLLQIP---DIVRSVSN-------VHILKLRQQTQFLWERYFSSIEKIVFTVFE 402
>gi|17137330|ref|NP_477231.1| tout-velu, isoform A [Drosophila melanogaster]
gi|442623684|ref|NP_001260971.1| tout-velu, isoform B [Drosophila melanogaster]
gi|61212925|sp|Q9V730.1|EXT1_DROME RecName: Full=Exostosin-1; AltName: Full=Protein tout-velu
gi|7303172|gb|AAF58236.1| tout-velu, isoform A [Drosophila melanogaster]
gi|60678089|gb|AAX33551.1| LD10920p [Drosophila melanogaster]
gi|85701308|dbj|BAE78509.1| Tout-velu [Drosophila melanogaster]
gi|220960240|gb|ACL92656.1| ttv-PA [synthetic construct]
gi|440214386|gb|AGB93503.1| tout-velu, isoform B [Drosophila melanogaster]
Length = 760
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 75/328 (22%), Positives = 130/328 (39%), Gaps = 63/328 (19%)
Query: 5 FRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFLP 62
F VY+YP P++ S +Y K A +S + T DP+ A LF L
Sbjct: 104 FLVYIYPPE----------PLNSLGAAPPTSANYQKILTAIQESRYYTSDPTAACLFVLG 153
Query: 63 FSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAW 122
DR +E D++ + ++ + PYWN G +H +S G
Sbjct: 154 IDTL----DRDSLSE---DYVRNVPSRLA-RLPYWN--NGRNHIIFNLYS-GTWPDYAEN 202
Query: 123 EVKLNAIQVVCSSSYFISGHIAHK-DVSLPQIWPRQEDPPKLGSS-----------KRNK 170
+ +A + + + + + H DVS+P ++ P + G++ K+
Sbjct: 203 SLGFDAGEAILAKASMGVLQLRHGFDVSIPLF--HKQFPLRAGATGTVQSNNFPANKKYL 260
Query: 171 LAF----FAGAVNSPVREKLLQVWRNDSEIYAHSGR-------LKTPYADE--------- 210
LAF + + S R L + + + R L+ DE
Sbjct: 261 LAFKGKRYVHGIGSETRNSLFHLHNGRDMVLVTTCRHGKSWRELQDNRCDEDNREYDRYD 320
Query: 211 ----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
L S FCL +G + + R ++L GC+PV+++N + LPF ++WK +I
Sbjct: 321 YETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIWADE 380
Query: 267 LDIPLLKKILKGISSEEYLLL--QSYVL 292
+ + I++ I +E L Q+ VL
Sbjct: 381 RLLLQVPDIVRSIPAERIFALRQQTQVL 408
>gi|194883052|ref|XP_001975618.1| GG20465 [Drosophila erecta]
gi|190658805|gb|EDV56018.1| GG20465 [Drosophila erecta]
Length = 760
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 75/328 (22%), Positives = 130/328 (39%), Gaps = 63/328 (19%)
Query: 5 FRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFLP 62
F VY+YP P++ S +Y K A +S + T DP+ A LF L
Sbjct: 104 FLVYIYPPE----------PLNSLGAAPPTSANYQKILTAIQESRYYTSDPTAACLFVLG 153
Query: 63 FSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAW 122
DR +E D++ + ++ + PYWN G +H +S G
Sbjct: 154 IDTL----DRDSLSE---DYVRNVPSRLA-RLPYWN--NGRNHIIFNLYS-GTWPDYAEN 202
Query: 123 EVKLNAIQVVCSSSYFISGHIAHK-DVSLPQIWPRQEDPPKLGSS-----------KRNK 170
+ +A + + + + + H DVS+P ++ P + G++ K+
Sbjct: 203 SLGFDAGEAILAKASMGVLQLRHGFDVSIPLF--HKQFPLRAGATGSVQSNNFPANKKYL 260
Query: 171 LAF----FAGAVNSPVREKLLQVWRNDSEIYAHSGR-------LKTPYADE--------- 210
LAF + + S R L + + + R L+ DE
Sbjct: 261 LAFKGKRYVHGIGSETRNSLFHLHNGRDMVLVTTCRHGKSWRELQDNRCDEDNREYDRYD 320
Query: 211 ----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
L S FCL +G + + R ++L GC+PV+++N + LPF ++WK +I
Sbjct: 321 YETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIWADE 380
Query: 267 LDIPLLKKILKGISSEEYLLL--QSYVL 292
+ + I++ I +E L Q+ VL
Sbjct: 381 RLLLQVPDIVRSIPAERIFALRQQTQVL 408
>gi|195486003|ref|XP_002091324.1| GE13596 [Drosophila yakuba]
gi|194177425|gb|EDW91036.1| GE13596 [Drosophila yakuba]
Length = 760
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 75/328 (22%), Positives = 130/328 (39%), Gaps = 63/328 (19%)
Query: 5 FRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFLP 62
F VY+YP P++ S +Y K A +S + T DP+ A LF L
Sbjct: 104 FLVYIYPPE----------PLNSLGAAPPTSANYQKILTAIQESRYYTSDPTAACLFVLG 153
Query: 63 FSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAW 122
DR +E D++ + ++ + PYWN G +H +S G
Sbjct: 154 IDTL----DRDSLSE---DYVRNVPSRLA-RLPYWN--NGRNHIIFNLYS-GTWPDYAEN 202
Query: 123 EVKLNAIQVVCSSSYFISGHIAHK-DVSLPQIWPRQEDPPKLGSS-----------KRNK 170
+ +A + + + + + H DVS+P ++ P + G++ K+
Sbjct: 203 SLGFDAGEAILAKASMGVLQLRHGFDVSIPLF--HKQFPLRAGATGSVQSNNFPANKKYL 260
Query: 171 LAF----FAGAVNSPVREKLLQVWRNDSEIYAHSGR-------LKTPYADE--------- 210
LAF + + S R L + + + R L+ DE
Sbjct: 261 LAFKGKRYVHGIGSETRNSLFHLHNGRDMVLVTTCRHGKSWRELQDNRCDEDNREYDRYD 320
Query: 211 ----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
L S FCL +G + + R ++L GC+PV+++N + LPF ++WK +I
Sbjct: 321 YETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIWADE 380
Query: 267 LDIPLLKKILKGISSEEYLLL--QSYVL 292
+ + I++ I +E L Q+ VL
Sbjct: 381 RLLLQVPDIVRSIPAERIFALRQQTQVL 408
>gi|444732432|gb|ELW72727.1| Exostosin-1 [Tupaia chinensis]
Length = 436
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 26/92 (28%), Positives = 47/92 (51%), Gaps = 17/92 (18%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y + L + FCL +G + + R ++L CVPV+++N ++LPF++V+NW +++
Sbjct: 54 YREMLHNATFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVI--- 110
Query: 267 LDIPLLKKILKGISSEEYLLLQSYVLKVRKHF 298
+E LLLQ ++ KH
Sbjct: 111 --------------GDERLLLQIIQDRIFKHI 128
>gi|350421075|ref|XP_003492724.1| PREDICTED: exostosin-1-like isoform 1 [Bombus impatiens]
gi|350421078|ref|XP_003492725.1| PREDICTED: exostosin-1-like isoform 2 [Bombus impatiens]
Length = 711
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 77/355 (21%), Positives = 140/355 (39%), Gaps = 80/355 (22%)
Query: 3 RSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLP 62
+ F VYVYP +V+ P+ ++ N +ES + T DP++A +F L
Sbjct: 87 QGFTVYVYP------VEDVISPL-YQKILNVITES---------RYYTSDPTRACIFVLA 130
Query: 63 FSIARMRHDRR-IGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKA 121
DR + TE + + S I + PYWN G +H +S +
Sbjct: 131 LDTL----DRDPLSTEFVHNLPSKLI-----RLPYWN--NGRNHLIFNLYSGTWPDYAEE 179
Query: 122 WEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS----------SKRNKL 171
++ +S I H D+S+P ++ P + G + + +
Sbjct: 180 SLAFDLGYAMLAKASMSIFRHRPDFDISIPLF--GKQHPERGGEPGQALENNFPNNKKYV 237
Query: 172 AFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR-------LKTPYADE--------- 210
A F G + S R L + ++ + R L+ + +
Sbjct: 238 AAFKGKRYVHGIGSETRNALYHLHNGKDLVFVTTCRHGKAWRELQDEHCQQDNQEYDTYD 297
Query: 211 ----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKS---FSIV 263
L+ + FCL +G + + R ++L GC+PVI++N + LPF + ++W FS
Sbjct: 298 YEILLMNATFCLVPRGRRLGSFRFLEALRAGCIPVILSNGWALPFHERIDWTQAVIFSDE 357
Query: 264 VATLDIPLLKKILKGISSEEYLLLQSYVLKVRK--HFQWHVFPSDYDAFYMVMYE 316
L IP I++ +S+ ++LK+R+ F W + S + ++E
Sbjct: 358 RLLLQIP---DIVRSVSN-------VHILKLRQQTQFLWERYFSSIEKIVFTVFE 402
>gi|118781611|ref|XP_311558.3| AGAP010388-PA [Anopheles gambiae str. PEST]
gi|116130033|gb|EAA07205.3| AGAP010388-PA [Anopheles gambiae str. PEST]
Length = 744
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 81/360 (22%), Positives = 143/360 (39%), Gaps = 70/360 (19%)
Query: 2 NRSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLF 59
+++F VYVYP +P ++ P S++Y K + +S + T DP +A LF
Sbjct: 98 DKNFYVYVYPP---EPLNSLGAPPPI-------SQNYQKIISSIQESRYYTTDPEQACLF 147
Query: 60 FLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQ 119
L DR +E DF+ + + Q+ P+WN G +H +S G
Sbjct: 148 VLGIDTL----DRDSLSE---DFVRNVPSRL-QRLPHWN--NGRNHIIFNLYS-GTWPDY 196
Query: 120 KAWEVKLNAIQ-VVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSS-----------K 167
+ + Q ++ +S I DVS+P ++ P + G++ K
Sbjct: 197 NENGLGFDPGQAILAKASMSIQSLRPGFDVSIPLF--HKQFPLRGGNTGFVISNNFPANK 254
Query: 168 RNKLAF----FAGAVNSPVREKLLQV-----------------WRNDSEIYAHSGRL--- 203
+ LAF + + S R L + WR+ +
Sbjct: 255 KYLLAFKGKRYVHGIGSETRNSLFHLHNARDFVLVTTCKHGKSWRDLQDARCDEDNREYD 314
Query: 204 KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIV 263
+ Y L S FCL +G + + R + L GC+PV+++N + LPF ++WK +I
Sbjct: 315 RYDYETLLQNSTFCLVPRGRRLGSFRFLEVLQAGCIPVLLSNSWVLPFQSKIDWKQAAIW 374
Query: 264 VATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQ--WHVFPSDYDAFYMVMYELWLRR 321
+ + I++ IS+ S +L +R+ Q W + S + +E+ R
Sbjct: 375 ADERLLLQVPDIVRSIST-------SRILALRQQTQVLWERYFSSIEKIIFTTFEIIRER 427
>gi|340719750|ref|XP_003398310.1| PREDICTED: exostosin-1-like isoform 1 [Bombus terrestris]
gi|340719752|ref|XP_003398311.1| PREDICTED: exostosin-1-like isoform 2 [Bombus terrestris]
Length = 711
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 77/355 (21%), Positives = 140/355 (39%), Gaps = 80/355 (22%)
Query: 3 RSFRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLP 62
+ F VYVYP +V+ P+ ++ N +ES + T DP++A +F L
Sbjct: 87 QGFTVYVYP------VEDVISPL-YQKILNVITES---------RYYTSDPTRACIFVLA 130
Query: 63 FSIARMRHDRR-IGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKA 121
DR + TE + + S I + PYWN G +H +S +
Sbjct: 131 LDTL----DRDPLSTEFVHNLPSKLI-----RLPYWN--NGRNHLIFNLYSGTWPDYAEE 179
Query: 122 WEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGS----------SKRNKL 171
++ +S I H D+S+P ++ P + G + + +
Sbjct: 180 SLAFDLGYAMLAKASMSIFRHRPDFDISIPLF--GKQHPERGGEPGQALENNFPNNKKYV 237
Query: 172 AFFAG-----AVNSPVREKLLQVWRNDSEIYAHSGR-------LKTPYADE--------- 210
A F G + S R L + ++ + R L+ + +
Sbjct: 238 AAFKGKRYVHGIGSETRNALYHLHNGKDLVFVTTCRHGKAWRELQDEHCQQDNQEYDTYD 297
Query: 211 ----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKS---FSIV 263
L+ + FCL +G + + R ++L GC+PVI++N + LPF + ++W FS
Sbjct: 298 YEILLMNATFCLVPRGRRLGSFRFLEALRAGCIPVILSNGWALPFHERIDWTQAVIFSDE 357
Query: 264 VATLDIPLLKKILKGISSEEYLLLQSYVLKVRK--HFQWHVFPSDYDAFYMVMYE 316
L IP I++ +S+ ++LK+R+ F W + S + ++E
Sbjct: 358 RLLLQIP---DIVRSVSN-------VHILKLRQQTQFLWERYFSSIEKIVFTVFE 402
>gi|195150443|ref|XP_002016164.1| GL11446 [Drosophila persimilis]
gi|194110011|gb|EDW32054.1| GL11446 [Drosophila persimilis]
Length = 676
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 47/88 (53%), Gaps = 2/88 (2%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L S FCL +G + + R ++L GC+PV+++N + LPF ++WK +I
Sbjct: 325 YETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIWADE 384
Query: 267 LDIPLLKKILKGISSEEYLLL--QSYVL 292
+ + I++ IS+E L Q+ VL
Sbjct: 385 RLLLQVPDIVRSISAERIFALRQQTQVL 412
>gi|195123279|ref|XP_002006135.1| GI20872 [Drosophila mojavensis]
gi|193911203|gb|EDW10070.1| GI20872 [Drosophila mojavensis]
Length = 761
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 47/88 (53%), Gaps = 2/88 (2%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
Y L S FCL +G + + R ++L GC+PV+++N + LPF ++WK +I
Sbjct: 321 YGTLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIWADE 380
Query: 267 LDIPLLKKILKGISSEEYLLL--QSYVL 292
+ + I++ IS+E L Q+ VL
Sbjct: 381 RLLLQVPDIVRSISAERIFALRQQTQVL 408
>gi|299116802|emb|CBN74915.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 1052
Score = 51.2 bits (121), Expect = 8e-04, Method: Composition-based stats.
Identities = 60/248 (24%), Positives = 106/248 (42%), Gaps = 19/248 (7%)
Query: 36 ESYFKKAFMKSHFVTKDPSKADLFFLPF--SIARMRHDRRIGTEGIPDFISHYIFNISQK 93
E + +A + + +P +ADLF++P ++ + R G + + I+ + ++
Sbjct: 200 EIWLHRAMLAHPWRVANPEEADLFYVPMYPVLSTKLGNNRCGGKTHDELINTSVEYLALS 259
Query: 94 YPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSS---SYFISGHIAHKDVSL 150
Y+ R GGADH V +SA+ + L V + + G K V++
Sbjct: 260 SVYFRRFGGADHTLVCAWWNCKSALGPKPRMLLRRTVVGINEKMLEWTRWGCGLDKMVTI 319
Query: 151 P----QIWPRQEDPPKLGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSE-----IYAHS- 200
P + E + R+ FF G + L V +E + H
Sbjct: 320 PYTASSVLTTSEMIGGRAAEDRDIPFFFVGTARGRPERQNLDVVTGMAEGSVMMLGDHQS 379
Query: 201 --GRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVII-ANHYDLPFAD-VLN 256
G T YA + S+FC +G ++ RI D++ GC P++ A+ LPF++ VLN
Sbjct: 380 DWGMNSTQYAAHIARSRFCFCPRGDTESSRRIFDAVAAGCTPIVTEASVAVLPFSEHVLN 439
Query: 257 WKSFSIVV 264
+ F++VV
Sbjct: 440 YSDFAVVV 447
>gi|302780765|ref|XP_002972157.1| galactosyltransferase-like protein [Selaginella moellendorffii]
gi|300160456|gb|EFJ27074.1| galactosyltransferase-like protein [Selaginella moellendorffii]
Length = 470
Score = 50.8 bits (120), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 70/305 (22%), Positives = 109/305 (35%), Gaps = 46/305 (15%)
Query: 32 NYASESYFKKAFMKSHFVTKDPSKADLFFLPF----SIARMRHDRRIGTEGIPDFISHYI 87
++ E F + + T DP A FF+PF I+R + + +
Sbjct: 133 QFSGEIIFHRRMLDHRCRTLDPDGASAFFVPFYAGLDISRNLWASGKSSSDVDSLGEQLL 192
Query: 88 FNISQKYPYWNRTGGADHFYVACHSIGR--------SAMQKAWEVKLNAIQVVCSSSYFI 139
+ +++P++NR+GGADHF VA GR + W L + S +
Sbjct: 193 HWLQRQHPHFNRSGGADHFLVA----GRISWDFRRMPSAAGEWGSSLFHQIEMRSVKRLV 248
Query: 140 SGHIAHKDVSLPQIWPRQEDPPK----------LGSSKRNKLAFFAGAVNSPVREKLLQV 189
D L +P P + S R L FAG+ R QV
Sbjct: 249 IERNPWDDSELGVPYPTSFHPSSDEDLAQWVEFVQGSPRPHLVAFAGSPRPGYRSDFRQV 308
Query: 190 W--------RNDSEIYAHSGRLKTPYADEL------LGSKFCLHVKGFEVNTARIADSLY 235
R S + +D L L S FCL +G + DSL
Sbjct: 309 LLGQCRAAQRGISGCLDCTADTAGCTSDPLRVTKLFLSSVFCLQPRGDSFTRKSLFDSLI 368
Query: 236 YGCVPVIIAN-----HYDLPFA-DVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQS 289
GC+PV+ N Y+L D + F + + + +L+G+S E +Q
Sbjct: 369 SGCIPVLFWNQSAYWQYELYLPRDPEEYSVFIPHQSVKNGTNVLDVLQGVSRERIGRMQR 428
Query: 290 YVLKV 294
VL++
Sbjct: 429 AVLRI 433
>gi|123474050|ref|XP_001320210.1| Exostosin family protein [Trichomonas vaginalis G3]
gi|121903010|gb|EAY07987.1| Exostosin family protein [Trichomonas vaginalis G3]
Length = 353
Score = 50.8 bits (120), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 62/269 (23%), Positives = 109/269 (40%), Gaps = 49/269 (18%)
Query: 41 KAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYP---YW 97
K+ + F KDP +ADLF++P A G+ D+ + I Q Y+
Sbjct: 37 KSLERYEFRVKDPEEADLFYVPLFAALF--------NGLKDYANIDTIIIPQLRAFGKYF 88
Query: 98 NRTGGADHFYV-ACHSIGRSAMQKAWEVKLNAIQVVCSSSYFISGHIAH---KDVSLP-- 151
+R GG D+ ++ S + + L ++ + +Y S + ++V+ P
Sbjct: 89 DRYGGVDYAFIQMLFSQDNIPITVHQQKTLASMITLGDLNYNYSKYQMRESWRNVNFPLT 148
Query: 152 -QIWPRQEDPPKLGSSKRNKLAFFAGAVN--------SPVREKLLQVWRNDSEIYAHSGR 202
I + E P+ S R+ FF G +N +P+RE + V R + HS
Sbjct: 149 SNIAQQFEIKPE---SSRHISTFFIGQINLTDFDTVAAPIREGMANVMR----VIPHSIV 201
Query: 203 LKTPYADELLG------------SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLP 250
+ D + G SKFC G T R+ D+ C+P+++++ P
Sbjct: 202 IDARRYDPITGVYSYNFSRMMSNSKFCCVPHGDGPTTKRLFDTFRTLCIPIVLSDEIKFP 261
Query: 251 FADV-LNWKSFSIVVATLD---IPLLKKI 275
F D+ +N+ I + + IP+ I
Sbjct: 262 FEDLFINYPEILIQIPAFEPDRIPIAMSI 290
>gi|302838103|ref|XP_002950610.1| hypothetical protein VOLCADRAFT_104742 [Volvox carteri f.
nagariensis]
gi|300264159|gb|EFJ48356.1| hypothetical protein VOLCADRAFT_104742 [Volvox carteri f.
nagariensis]
Length = 1119
Score = 50.8 bits (120), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 30/92 (32%), Positives = 49/92 (53%), Gaps = 1/92 (1%)
Query: 209 DELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLD 268
D + SKFCL G R+A+++ GCVPV+I +H P DV+ ++ FS+ + D
Sbjct: 587 DLMARSKFCLAPMGAGWGI-RLAEAMVRGCVPVVIQDHVYQPLWDVVPFEEFSLRFSRRD 645
Query: 269 IPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
+ L L ++SE+ LQ V + + +W
Sbjct: 646 VADLVDHLDDVTSEQLARLQGGVERYHRWEEW 677
>gi|195431325|ref|XP_002063693.1| GK15814 [Drosophila willistoni]
gi|194159778|gb|EDW74679.1| GK15814 [Drosophila willistoni]
Length = 728
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/102 (27%), Positives = 53/102 (51%), Gaps = 9/102 (8%)
Query: 199 HSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWK 258
H RL P+ L SKFCL + + + + + C+PVI ++Y LPF DV++W
Sbjct: 313 HQKRLDYPHL--LAKSKFCLVARSLRLGQPDLLEIMSQNCIPVIAIDNYILPFEDVVDWS 370
Query: 259 SFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
S+ + ++ + + L+ IS+ + +++++K QW
Sbjct: 371 LASVRIRESELHSVLRKLESISNVK-------IVEMQKQVQW 405
>gi|302774815|ref|XP_002970824.1| galactosyltransferase-like protein [Selaginella moellendorffii]
gi|300161535|gb|EFJ28150.1| galactosyltransferase-like protein [Selaginella moellendorffii]
Length = 454
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 60/248 (24%), Positives = 100/248 (40%), Gaps = 36/248 (14%)
Query: 31 GNYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIP--DFISHYIF 88
G +A E + F + +T DP A+LF++P+ A + + + T+ + D + +
Sbjct: 125 GQFALEIMIHERFRRYQCLTDDPHLANLFYIPY-YAGLDVSQYLFTKQVQMRDKLGQRLL 183
Query: 89 NISQKYPYWNRTGGADHFYVACHSI---GRSAMQ-KAWEVKLNAIQVVCSSSYFISGHIA 144
Q +WNR G DH V + GRS ++W L +IQ + +++ +
Sbjct: 184 GYLQGNRHWNRKRGRDHVLVLGRIVWDFGRSEENHESWGSSLLSIQELDNATKLLIERDV 243
Query: 145 HKDVSLPQIWP-------RQEDPPKLG---SSKRNKLAFFAGAV------NSPVR----- 183
+ + +P RQE L S R+ L FAGA+ + +R
Sbjct: 244 WRSSQMALPYPTGFHPDSRQEIDEWLAVVNGSSRDLLVSFAGALRDGNGSTATMRRSLRR 303
Query: 184 -----EKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGC 238
E+L + R + + + T A L S FCL G D L GC
Sbjct: 304 QCQRHERLCTILRCERINCEENPEIVTCVA---LRSVFCLMPPGDSPTRKAFFDGLVAGC 360
Query: 239 VPVIIANH 246
+PV+ + H
Sbjct: 361 IPVVFSEH 368
>gi|340377881|ref|XP_003387457.1| PREDICTED: exostosin-1b-like [Amphimedon queenslandica]
Length = 729
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 48/228 (21%), Positives = 88/228 (38%), Gaps = 40/228 (17%)
Query: 131 VVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSS----------KRNKLAFFAG---- 176
++ +S+ + + D+SLP I + K +R L F G
Sbjct: 195 IIAKASFALENYRTGFDISLPLIHKLHSEKGKFTGGVSAHGNLFPIRRKYLLIFKGKRYL 254
Query: 177 -AVNSPVREKLLQVWRNDSEIYAHS---GRLKTPYADE-----------------LLGSK 215
+ S R + + D I + G+ + Y DE + S
Sbjct: 255 WGLGSATRNAIYHLDNGDDVIMLTTCKHGKFWSRYRDEKCKKDNIFFEKYNYNHLISNST 314
Query: 216 FCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKI 275
FCL +G + + R +SL+ GC+PV ++N LPF ++++W V + + +
Sbjct: 315 FCLVPRGRRLGSFRFLESLHAGCIPVSLSNGLVLPFHELIDWSKALFVFDERQLFQVPHM 374
Query: 276 LKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYELWLRRSS 323
L+ I ++ L +++ F W + S DA E+ +R S
Sbjct: 375 LRHIPEDKIL-----SMRLHTQFYWEQYFSSVDAIMHTTLEIVKQRIS 417
>gi|299472640|emb|CBN78292.1| Glycosyltransferase, family GT47 [Ectocarpus siliculosus]
Length = 587
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/90 (27%), Positives = 43/90 (47%)
Query: 205 TPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVV 264
+P +L+ + F L G T R+A++L G +PV I + PF + W FS
Sbjct: 470 SPSYGDLMNTTFALLPAGRSPATYRLAEALSAGALPVFIHQDFVKPFPGKIPWSDFSFSF 529
Query: 265 ATLDIPLLKKILKGISSEEYLLLQSYVLKV 294
++P + K L+ + E +Q+ L+V
Sbjct: 530 PPEEVPRMLKTLRAVPDRELAQMQATALEV 559
>gi|168017355|ref|XP_001761213.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687553|gb|EDQ73935.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 442
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 74/337 (21%), Positives = 132/337 (39%), Gaps = 66/337 (19%)
Query: 11 PHRRNDPFANVLLPV------DFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLPF- 63
PH +N F V+ D+ Y E F + T DP++ADLFF+P+
Sbjct: 71 PHHKNHGFGAVVNATVEVFRQDWYGTDAYMLEVIFYERMQTYSCRTSDPAEADLFFIPYF 130
Query: 64 ----SIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQ 119
++ + D + + + + N + W R GG DHFY+A GR+
Sbjct: 131 AGLDALPYLYTDSKRELQQGREVVEWLEENAPKT---WRRHGGHDHFYIA----GRT--- 180
Query: 120 KAWEVKLNAIQVV-CSSSYFISGHIAHKDVSLPQIWPRQED----PPKLG---------- 164
AW+ +V +S F + + + + + P ++D P +G
Sbjct: 181 -AWDFCRPLTKVNWWGTSLFNNPEMENTTAMVLERRPWRDDEVAIPYPVGFHPSTSATLH 239
Query: 165 -------SSKRNKLAFFAGAVNSPVREKLLQVW-RNDSEIYAHSGRLKT----------P 206
SS R L F+GA+ + + ++ R SE RL P
Sbjct: 240 SWIEVVRSSPRKHLFSFSGALRPHLTISIREILSRQCSEAGNACSRLDCGKIKCSHEPEP 299
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADV-----LNWKSFS 261
LL + FCL +G + DS+ GC+PV H D + ++++FS
Sbjct: 300 IYTSLLQATFCLQPRGDTSTRRSVIDSIVSGCIPVFF--HEDTAYTQYHWFLPKDYENFS 357
Query: 262 IVVATLDI----PLLKKILKGISSEEYLLLQSYVLKV 294
+ + D+ + KIL ++++ ++ ++K+
Sbjct: 358 VFIDEKDMKDGNADVSKILGAYTAKQVEQIRERLIKI 394
>gi|194755878|ref|XP_001960206.1| GF13249 [Drosophila ananassae]
gi|190621504|gb|EDV37028.1| GF13249 [Drosophila ananassae]
Length = 765
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 76/328 (23%), Positives = 130/328 (39%), Gaps = 63/328 (19%)
Query: 5 FRVYVYPHRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKADLFFLP 62
F VYVYP P++ +S +Y K A +S + T DPS A LF L
Sbjct: 105 FLVYVYPPE----------PLNSLGAAPPSSANYQKILTAIQESRYHTNDPSAACLFVLG 154
Query: 63 FSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAW 122
DR +E D++ + ++ + P+WN G +H +S G
Sbjct: 155 IDTL----DRDSLSE---DYVRNVPSRLA-RLPHWN--NGRNHIIFNLYS-GTWPDYAEN 203
Query: 123 EVKLNAIQVVCSSSYFISGHIAHK-DVSLPQIWPRQEDPPKLGSS-----------KRNK 170
+ +A + + + + + H DVS+P ++ P + G++ K+
Sbjct: 204 SLGFDAGEAILAKASMSVLQVRHGFDVSIPLF--HKQFPLRAGATGSVQSNNFPANKKYL 261
Query: 171 LAF----FAGAVNSPVREKLLQVWRNDSEIYAHSGR-------LKTPYADE--------- 210
LAF + + S R L + + + R L+ DE
Sbjct: 262 LAFKGKRYVHGIGSETRNSLFHLHNGRDMVLVTTCRHGKSWRELQDNRCDEDNREYDRYD 321
Query: 211 ----LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVAT 266
L S FCL +G + + R ++L GC+PV+++N + LPF ++WK +I
Sbjct: 322 YETLLQNSTFCLVPRGRRLGSFRFLEALQAGCIPVLLSNAWVLPFESKIDWKQAAIWADE 381
Query: 267 LDIPLLKKILKGISSEEYLLL--QSYVL 292
+ + ++ IS E L Q+ VL
Sbjct: 382 RLLLQVPDTVRSISVERIFALRQQTQVL 409
>gi|363742350|ref|XP_003642625.1| PREDICTED: exostosin-1b-like, partial [Gallus gallus]
Length = 456
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/114 (22%), Positives = 54/114 (47%), Gaps = 5/114 (4%)
Query: 204 KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIV 263
K Y + L S FC+ +G + + R ++L C+PV++++ ++LPF++ ++W ++V
Sbjct: 37 KFDYQELLHNSTFCIVPRGRRLGSFRFLEALQAACIPVLLSDGWELPFSEAIDWGKAAVV 96
Query: 264 VATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
+ + + ++ I E L Q + F W + S D E+
Sbjct: 97 GSERLLLQIPSAVRCIRPERVLAFQQ-----QTQFLWDAYFSSVDKIVHTTLEI 145
>gi|302769322|ref|XP_002968080.1| galactosyltransferase-like protein [Selaginella moellendorffii]
gi|300163724|gb|EFJ30334.1| galactosyltransferase-like protein [Selaginella moellendorffii]
Length = 494
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 54/237 (22%), Positives = 93/237 (39%), Gaps = 28/237 (11%)
Query: 31 GNYASESYFKKAFMKSHFVTKDPSKADLFFLPF--SIARMRHDRRIGTEGIPDFISHYIF 88
YA + F + + +T P +A LF++P+ + +R+ T + + +
Sbjct: 171 NQYALDVLFHQRLLHYPCLTDSPEEASLFYVPYYAGLDVLRYHYTNETLEQKNELGLEVM 230
Query: 89 NISQKYPYWNRTGGADHFYVACHSIG--RSAMQKAWEVKL-------NAIQVVCSSSYFI 139
++ ++ +W R G DH V R + W L N +++ F
Sbjct: 231 DLLKRQQWWWRRNGRDHLLVMGKITWDFRRNNETMWGNTLLKMAEFENMTKLLLERDPFE 290
Query: 140 SGHIAHKDVSLPQIWPRQEDP------PKLGSSKRNKLAFFAGAVNSP--VREKLLQVWR 191
IA V P + D ++ SS R+ L FAG P +R L+ +
Sbjct: 291 PNEIA---VPHPTYFHPSSDSDISTWISRIASSSRDNLVSFAGMPRDPEHLRTHLINQCK 347
Query: 192 NDSE---IYAHSGRL---KTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVI 242
+ + + A SG L P + L S+FC+ G + DSL GC+PV+
Sbjct: 348 DRPDRCKLLACSGNLCDSPEPTMELFLSSQFCMQPPGDSATRRSVFDSLIAGCIPVL 404
>gi|326429779|gb|EGD75349.1| hypothetical protein PTSG_06425 [Salpingoeca sp. ATCC 50818]
Length = 401
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 23/65 (35%), Positives = 40/65 (61%)
Query: 234 LYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLK 293
+ G +PVI+ +HY LP+ D+L+W++FSI + + L +IL+ I E ++Q V+
Sbjct: 195 MAAGAIPVIVVDHYVLPYQDLLDWETFSIRIPEHRLLELPRILRSIPDEVVEMMQRRVVF 254
Query: 294 VRKHF 298
V + F
Sbjct: 255 VFEEF 259
>gi|388510202|gb|AFK43167.1| unknown [Medicago truncatula]
Length = 216
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 42/83 (50%), Gaps = 1/83 (1%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFFLP-FSIARMRHDRRIGTEGIPDFISHYIFNIS 91
+A+E + + + S T +P +AD F+ P ++ + + P + I IS
Sbjct: 77 FAAEIFMHRFLLSSAVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLIS 136
Query: 92 QKYPYWNRTGGADHFYVACHSIG 114
+PYWNRT GADHF+V H G
Sbjct: 137 SNWPYWNRTEGADHFFVVPHDFG 159
>gi|159478873|ref|XP_001697525.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158274404|gb|EDP00187.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 597
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 49/92 (53%), Gaps = 1/92 (1%)
Query: 203 LKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSI 262
+K Y+D L +KFCL G + +AR+ D++ +GC+PVIIA+ F +L+ F +
Sbjct: 500 VKGEYSDMLSRAKFCLVAPG-DGWSARMEDAVLHGCIPVIIADGVHAVFESILDIDGFGL 558
Query: 263 VVATLDIPLLKKILKGISSEEYLLLQSYVLKV 294
+ +P + IL + Q+++ +V
Sbjct: 559 RIPQEQVPRILDILLAVPPRAIRSKQAHLGRV 590
Score = 37.7 bits (86), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 25/95 (26%), Positives = 39/95 (41%), Gaps = 13/95 (13%)
Query: 33 YASESYFKKAFMKSHFVTKDPSKADLFF-------LPFSIARMRHDRRIGTEGIP----- 80
YA ++ ++ + S T DP +AD F+ LPF I G P
Sbjct: 252 YAMDTLLHESLLGSPHRTFDPEEADFFYVPHQASCLPFPIGAWADFPWFPDAGGPRTRQM 311
Query: 81 -DFISHYIFNISQKYPYWNRTGGADHFYVACHSIG 114
+ + + I+ +P+W R GG DH + H G
Sbjct: 312 LNLVIDTVQWINATFPFWQRRGGRDHIFTFTHDEG 346
>gi|194755024|ref|XP_001959792.1| GF13049 [Drosophila ananassae]
gi|190621090|gb|EDV36614.1| GF13049 [Drosophila ananassae]
Length = 717
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 68/324 (20%), Positives = 127/324 (39%), Gaps = 65/324 (20%)
Query: 5 FRVYVYP-----HRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKAD 57
+VY+YP R+D A+ L S YF+ +A +KS + T +P++A
Sbjct: 108 LKVYIYPLQDFVDERSDNVASTL------------SSEYFQILEAVLKSRYYTSNPNEAC 155
Query: 58 LFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSA 117
LF + +F H +W+R GA+H + + + +A
Sbjct: 156 LFLPSLDLLNQN-----------NFDKHLAGAALASLDFWDR--GANH--IIFNMLPGAA 200
Query: 118 MQKAWEVKLNAIQVVCSSSYFIS-GHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAG 176
+ +N + F S + DVS+P +W + P ++ + K
Sbjct: 201 PSYNTVLDVNTDNAIIFGGGFDSWSYRPGFDVSIP-VWSPRLVPQHAHATAQRKFLLLVA 259
Query: 177 AVN------SPVREK---------LLQVWRN-----DSEIYAHSGRLKTPYADELLGSKF 216
+N +RE LL N + H L+ P L KF
Sbjct: 260 QLNILPRFVRTMREVALAHNDQMLLLGACENMDLTSRCPVSQHHKSLEYPRL--LSRGKF 317
Query: 217 CLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKIL 276
CL K + + + + C+PV+ ++Y +PF DV++W S+ + ++ + + L
Sbjct: 318 CLITKSLRLGQPDLVEIMSQHCIPVVAVDNYIMPFEDVIDWSLASVRIRESELHSVMQKL 377
Query: 277 KGISSEEYLLLQSYVLKVRKHFQW 300
+ IS+ + +++++K QW
Sbjct: 378 QAISNIK-------IVEMQKQVQW 394
>gi|326435600|gb|EGD81170.1| hypothetical protein PTSG_11210 [Salpingoeca sp. ATCC 50818]
Length = 107
Score = 49.7 bits (117), Expect = 0.002, Method: Composition-based stats.
Identities = 23/62 (37%), Positives = 39/62 (62%)
Query: 237 GCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRK 296
G +PVI+ +HY LP+ D+L+W++FSI + + L +IL+ I E ++Q V+ V +
Sbjct: 4 GAIPVIVVDHYVLPYQDLLDWETFSIRIPEHRLLELPRILRSIPDEVVEMMQRRVVFVFE 63
Query: 297 HF 298
F
Sbjct: 64 EF 65
>gi|47203855|emb|CAG13860.1| unnamed protein product [Tetraodon nigroviridis]
Length = 72
Score = 49.7 bits (117), Expect = 0.002, Method: Composition-based stats.
Identities = 18/57 (31%), Positives = 38/57 (66%)
Query: 207 YADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIV 263
Y + L S FCL +G + + R ++L CVPV+++N ++LPF+++++W + +++
Sbjct: 4 YKEMLHNSTFCLVPRGRRLGSFRFLEALQAACVPVMLSNGWELPFSEIIDWNTAAVI 60
>gi|326521044|dbj|BAJ92885.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 289
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 64/270 (23%), Positives = 103/270 (38%), Gaps = 72/270 (26%)
Query: 41 KAFMKSHFVTKDPSKADLFFLPFSIA------------RMRHDRRIGTEGIPDFISHYIF 88
++ + S T + +AD F++P + RM D R+ + ++
Sbjct: 15 ESILASPHRTLNGDEADYFYVPVLDSCLITRSDDAPHLRMPEDLRLRSYHTLEYYRKAYD 74
Query: 89 NISQKYPYWNRTGGADHFYV------ACHS---IGRSAMQKAWEVKLNAIQVVCSSSYFI 139
+I+Q+YPYWNRT G DH + AC++ I S M W N +++Y+
Sbjct: 75 HIAQRYPYWNRTSGRDHIWFFSWDEGACYAPKEIWNSMMLVHWG-NTNTKHEKSTTAYWA 133
Query: 140 SG--------------HIAHKDVSLP--------QIWPRQEDPPKLGSSKRNKLAFFAGA 177
KD+ LP IW + PK+ R L +F G
Sbjct: 134 DNWDDIPFDRRGNHPCFDPRKDLVLPAWKEPNPGAIWLKLWARPKI---NRTTLFYFNGN 190
Query: 178 VNSP-------------VREKLLQVWRNDSEIYAHSGRLKTP-----------YADELLG 213
+ +R+KL + + GR +T Y +EL
Sbjct: 191 LGPAYEEGRPEDTYSMGIRQKLAAEFGSTPNKQGKLGRQQTANVTVTYLKSEMYYEELAS 250
Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVII 243
S FC + G + + R+ DS+ GC+PVII
Sbjct: 251 SIFCGVLPG-DGWSGRMEDSMLQGCIPVII 279
>gi|159473731|ref|XP_001694987.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
gi|158276366|gb|EDP02139.1| exostosin-like glycosyltransferase [Chlamydomonas reinhardtii]
Length = 721
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 49/92 (53%), Gaps = 1/92 (1%)
Query: 209 DELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLD 268
D + +KFCL G R+A+++ GCVPVII +H D+L ++ FSI + +
Sbjct: 586 DSMKLAKFCLAPMGAGWGI-RLAEAMVSGCVPVIIQDHIYQAHWDILPFEEFSIRIGRNE 644
Query: 269 IPLLKKILKGISSEEYLLLQSYVLKVRKHFQW 300
+ L IL +S ++ LQ+ + + + F W
Sbjct: 645 LHQLVDILDDVSPQQLDSLQAGIERYHRAFFW 676
>gi|255642058|gb|ACU21295.1| unknown [Glycine max]
Length = 340
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 71/280 (25%), Positives = 115/280 (41%), Gaps = 43/280 (15%)
Query: 4 SFRVYVYPHRRNDPFANVLLPVDFEP------RGNYASESYFKKAFMKSHFVTKDPSKAD 57
S ++YVY D +L D + +G + S+ K ++S T +AD
Sbjct: 66 SLKIYVYQENEIDGLKELLRGRDAKITDEACLKGQWGSQVKIHKLLLQSKQRTWKKEEAD 125
Query: 58 LFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSA 117
LFF+P + ++ R +G + S Y+ ISQ PY+ +GG +H +V G +
Sbjct: 126 LFFVP---SYVKCARMMGGLNDKEINSTYVKVISQ-MPYFRLSGGRNHIFVFPSGAG-AH 180
Query: 118 MQKAWEVKLNAIQVVC---------SSSYF-------ISGHIAHKDVSLPQIWPRQEDPP 161
+ K+W +N ++ +S F I G+I D + + P
Sbjct: 181 LFKSWATYINRSIILTPEGDRTDKRDTSAFNTWKDIIIPGNI---DDGMTKTGDTTVQPL 237
Query: 162 KLGSSKRNKLAFFAG-AVNSPVREKLLQVWRN--------DSEIYAHSGRLKTPYADELL 212
L SKR LA + G A R KL+++ + D + + Y + L
Sbjct: 238 PL--SKRKYLANYLGRAQGKAGRLKLIELSKQFPEKLECPDLKFSGPDKLGRKEYFEHLR 295
Query: 213 GSKFCLHVKGFEVNTARIADSLYYG-CVPVII-ANHYDLP 250
SKFCL +G T R +S + G C II +N +P
Sbjct: 296 NSKFCLAPRGESSWTLRFYESFFCGMCSSYIIRSNRIAIP 335
>gi|326435785|gb|EGD81355.1| hypothetical protein PTSG_11833 [Salpingoeca sp. ATCC 50818]
Length = 944
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 50/98 (51%), Gaps = 9/98 (9%)
Query: 213 GSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLL 272
G +H+ +V+ A I + G +PVI+ +HY LP+ D+L+W +FSI + + L
Sbjct: 630 GPGSLVHLLKPDVDPAYI---MATGAIPVIVVDHYVLPYQDLLDWGTFSIRIPEHRLLEL 686
Query: 273 KKILKGISSEEYLLLQSYVLKVRKHF------QWHVFP 304
+IL+ I E ++Q V+ V F Q H P
Sbjct: 687 PRILRSIPDEVVEMMQRRVVFVYGEFFKSLSTQVHTAP 724
>gi|302772274|ref|XP_002969555.1| hypothetical protein SELMODRAFT_451122 [Selaginella moellendorffii]
gi|300163031|gb|EFJ29643.1| hypothetical protein SELMODRAFT_451122 [Selaginella moellendorffii]
Length = 453
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 58/245 (23%), Positives = 98/245 (40%), Gaps = 30/245 (12%)
Query: 31 GNYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIP--DFISHYIF 88
G +A E + F + +T DP A+LF++P+ A + + + T+ + D + +
Sbjct: 129 GQFAVEIMIHERFRRYQCLTDDPHLANLFYIPY-YAGLDVSQYLFTKQVQMRDKLGQRLL 187
Query: 89 NISQKYPYWNRTGGADHFYVACHSI---GRSAMQ-KAWEVKLNAIQVVCSSSYFISGHIA 144
Q +WNR G DH V + GRS ++W L +IQ + +++ +
Sbjct: 188 GYLQGNRHWNRKRGRDHVLVLGRIVWDFGRSEENHESWGSSLLSIQELDNATKLLIERDV 247
Query: 145 HKDVSLPQIWPRQEDPPK----------LGSSKRNKLAFFAGAV------NSPVREKL-L 187
+ + +P P + S R+ L FAGA+ + +R L
Sbjct: 248 WRSSQMALPYPTGFHPDSRREIDEWLAVVNGSSRDLLVSFAGALRDGNGSTATMRRSLRR 307
Query: 188 QVWRNDSEI-YAHSGRLKTPYADEL-----LGSKFCLHVKGFEVNTARIADSLYYGCVPV 241
Q R++S R+ E+ L S FCL G D L GC+PV
Sbjct: 308 QCQRHESLCTILRCERINCEENPEIVTCVALRSIFCLMPPGDSPTRKGFFDGLVAGCIPV 367
Query: 242 IIANH 246
+ + H
Sbjct: 368 VFSEH 372
>gi|19699001|gb|AAL91236.1| unknown protein [Arabidopsis thaliana]
gi|30023662|gb|AAP13364.1| At2g28110 [Arabidopsis thaliana]
Length = 214
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 45/90 (50%), Gaps = 12/90 (13%)
Query: 33 YASESYFKKAFM--KSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIP------DFIS 84
+A+E KAF+ + T+DP +AD FF+P ++ G P I+
Sbjct: 120 FAAEVALHKAFLSLEGDVRTEDPYEADFFFVPVYVSC----NFSTINGFPAIGHARSLIN 175
Query: 85 HYIFNISQKYPYWNRTGGADHFYVACHSIG 114
I +S +YP+WNRT G+DH + A H G
Sbjct: 176 DAIKLVSTQYPFWNRTSGSDHVFTATHDFG 205
>gi|260813939|ref|XP_002601673.1| hypothetical protein BRAFLDRAFT_94550 [Branchiostoma floridae]
gi|229286975|gb|EEN57685.1| hypothetical protein BRAFLDRAFT_94550 [Branchiostoma floridae]
Length = 786
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 86/367 (23%), Positives = 142/367 (38%), Gaps = 68/367 (18%)
Query: 25 VDFEPRGNYASESYFKKAFMKSHFVTKDPSKADLFFLPFSIARMRHDRRIGTEGIPDFIS 84
V F ++ E + + S F T DP KAD+F++P+ A + T P
Sbjct: 415 VSFRSTHMFSLEVILHQKLLSSTFRTLDPEKADVFYIPYYPALAAACEPVSTIDSPALDR 474
Query: 85 HYIFNISQKYPYWNRTGGADHFYVACHSIGRSAMQKAWEVKLNAIQVVCSSSY-FISGHI 143
I+ YPY+ + G H ++GR + A +V ++ S S F++ I
Sbjct: 475 ELWQFITSNYPYFQQ--GKPHMM----ALGRIEREHA-DVTGGILKTRESRSVTFVA--I 525
Query: 144 AHKDVSLPQIWPRQEDPPKL-----------------GSSK---------RNKLAFFAGA 177
H+ + R+ P + G SK R+ L FAG+
Sbjct: 526 EHESDPKTLKFIRRSGLPMVVAPYPSCGHLLSDNKFGGESKSERTQLDIPRDVLVLFAGS 585
Query: 178 --VNSPVREKLLQVWRNDSEIYAHSGRLKT-------------PYADELLG----SKFCL 218
++ +R L Q R SE Y + L + + L+ S FCL
Sbjct: 586 RRMSHDIRRILSQQLRPTSEKYDATSSLNKQNVWFITQECRDRSWQENLVEWMHHSVFCL 645
Query: 219 HVKGFEVNTARIADSLYYGCVPVIIANHYD--LPFADVLNWKSFSIVVATLDIPLLKK-- 274
G D++ GC+PVI ++ PF DVL++ F++ V D K+
Sbjct: 646 QPPGDSPTRKSFFDAVQCGCIPVIFKLDHEPVYPFDDVLDYSKFTVKVTDGDFFQEKRSI 705
Query: 275 --ILKGISSEEYLLLQSYVLKVRKHFQWHVFPSD----YDAFYMVMYELWLR---RSSVR 325
IL+ I ++ + +V Q+ P DAF M+M E+ RS+VR
Sbjct: 706 VDILQDIPEAVIAAKRAELRQVTPLLQYSYPPLPETHVQDAFDMIMQEIGRTRGGRSNVR 765
Query: 326 VQWSTSL 332
++ + +
Sbjct: 766 RRYGSKV 772
>gi|224131234|ref|XP_002328488.1| predicted protein [Populus trichocarpa]
gi|222838203|gb|EEE76568.1| predicted protein [Populus trichocarpa]
Length = 90
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 24/69 (34%), Positives = 41/69 (59%)
Query: 213 GSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLL 272
SKFCL + ++ R+ D++ CVPVII++ + P+ DV+++ F I V T ++
Sbjct: 3 SSKFCLDIASDTPSSNRLIDAIASHCVPVIISDDIEFPYEDVIDYSQFCISVRTSNVVRE 62
Query: 273 KKILKGISS 281
K ++ ISS
Sbjct: 63 KFLVNLISS 71
>gi|57044145|ref|XP_544489.1| PREDICTED: exostoses (multiple)-like 1 [Canis lupus familiaris]
Length = 672
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 29/125 (23%), Positives = 56/125 (44%), Gaps = 6/125 (4%)
Query: 193 DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFA 252
D + G +TP L + FC+ + G +L GC+PV+++ ++LPF+
Sbjct: 249 DGRCEQNRGPEQTPLGATLPNATFCV-IPGHRPEALHFLQALQAGCIPVLLSPRWELPFS 307
Query: 253 DVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYM 312
+V++W +IV D L ++L + +E + L+ + F W + S +
Sbjct: 308 EVIDWTKAAIVA---DKRLPFQVLAAL--QEMPPTRVLALRQQTQFLWDAYFSSVEKVIH 362
Query: 313 VMYEL 317
E+
Sbjct: 363 TTLEI 367
>gi|297814922|ref|XP_002875344.1| hypothetical protein ARALYDRAFT_904882 [Arabidopsis lyrata subsp.
lyrata]
gi|297321182|gb|EFH51603.1| hypothetical protein ARALYDRAFT_904882 [Arabidopsis lyrata subsp.
lyrata]
Length = 64
Score = 49.3 bits (116), Expect = 0.003, Method: Composition-based stats.
Identities = 24/62 (38%), Positives = 38/62 (61%)
Query: 214 SKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLK 273
S FCL G+ + + R+ +S GCVPV+IAN LPF++++ W +++A D L+
Sbjct: 3 SIFCLCPFGWGIWSPRLVESAVSGCVPVVIANGIQLPFSEIVRWPEILLMMAKKDDMNLQ 62
Query: 274 KI 275
KI
Sbjct: 63 KI 64
>gi|291399546|ref|XP_002716175.1| PREDICTED: exostoses-like 1 [Oryctolagus cuniculus]
Length = 671
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 29/118 (24%), Positives = 57/118 (48%), Gaps = 7/118 (5%)
Query: 200 SGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKS 259
G L P + L + FCL + G + R +L GC+PV+++ ++LPF++ ++W
Sbjct: 253 GGELSQP-TETLPNATFCL-IPGHRSDAWRFLQALQAGCIPVLLSPLWELPFSERIDWTK 310
Query: 260 FSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
+I+ D +++L + +E L + L+ + F W + S + M E+
Sbjct: 311 ATIIA---DERQPRQVLAAL--QEMPLARVLSLRQQTQFLWAAYFSSVEKVIHTMLEI 363
>gi|440798055|gb|ELR19126.1| hypothetical protein ACA1_335890 [Acanthamoeba castellanii str.
Neff]
Length = 350
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 36/138 (26%), Positives = 64/138 (46%), Gaps = 18/138 (13%)
Query: 162 KLGSSKRNKLAFFAGAVNSPVREKLLQVWRNDSEIYAHSGRLKTPYADELLGSKFCLHVK 221
K G+++R++L VR L R DS Y D +L +KF L V+
Sbjct: 204 KGGNNRRHRL-------RPKVRAGLDDSSRGDS----------GDYHDLMLNTKFALIVQ 246
Query: 222 GFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISS 281
G +++ R+ +++ VPVI+A++Y LPF++ + W +I V + ++ I
Sbjct: 247 GNGLHSYRLTEAMRANAVPVILADNYVLPFSEAVRWDEIAIFVPESQWASIPDVIGRIDD 306
Query: 282 EEYLLLQSYVLKV-RKHF 298
E ++ + V HF
Sbjct: 307 EALARMREKLATVYEAHF 324
>gi|302855272|ref|XP_002959133.1| hypothetical protein VOLCADRAFT_121779 [Volvox carteri f.
nagariensis]
gi|300255495|gb|EFJ39797.1| hypothetical protein VOLCADRAFT_121779 [Volvox carteri f.
nagariensis]
Length = 771
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 55/274 (20%), Positives = 98/274 (35%), Gaps = 60/274 (21%)
Query: 52 DPSKADLFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACH 111
DP++AD FF+P G + + F ++ YW + G DH
Sbjct: 302 DPARADYFFIPGC-----------GRGCDKWDDKFHFILAHYGQYWTQNQGRDHIMTHPG 350
Query: 112 SIGRS---------------AMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLP----- 151
GR M + W + ++ V + F + H ++D+ +P
Sbjct: 351 DWGRCEHSWDAFADKFISNVTMLQHWGMTVDRSSEV-EHNLFNTCHKPNQDILVPPMCGD 409
Query: 152 -------QIW--PRQEDPPKLGSSKRNKLAFFAGAV-------NSPVREKLLQV------ 189
IW R+E+P + LA AG++ P R + +
Sbjct: 410 LYPQFEYNIWHPNRKENP-----ITKTNLASVAGSICGWNSVEEPPCRNRYYSLGVRAAL 464
Query: 190 WRNDSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDL 249
W+ H + + S+FC G + + GC+PVII++H
Sbjct: 465 WQLRDVPGFHIAKRVAMMGQSMAESEFCFAPTGAGYGKRNVMATTL-GCMPVIISDHVAQ 523
Query: 250 PFADVLNWKSFSIVVATLDIPLLKKILKGISSEE 283
P+ LNW F + + ++ IL+G + ++
Sbjct: 524 PYEPFLNWNEFGVWIPESQAKDVEIILRGFTPQQ 557
>gi|326434432|gb|EGD80002.1| hypothetical protein PTSG_10278 [Salpingoeca sp. ATCC 50818]
Length = 583
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/59 (35%), Positives = 37/59 (62%)
Query: 234 LYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVL 292
+ G +PVI+ +HY LP+ D+L+W++FSI + + L +IL+ I E ++Q V+
Sbjct: 201 MAAGTIPVIVVDHYVLPYQDLLDWETFSIRIPEHRLLELPRILRSIPDEVVEMMQRRVV 259
>gi|302760669|ref|XP_002963757.1| galactosyltransferase-like protein [Selaginella moellendorffii]
gi|300169025|gb|EFJ35628.1| galactosyltransferase-like protein [Selaginella moellendorffii]
Length = 411
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 66/284 (23%), Positives = 108/284 (38%), Gaps = 43/284 (15%)
Query: 32 NYASESYFKKAFMKSHFVTKDPSKADLFFLPF--SIARMRHDRRIGTEGIPDFISHYIFN 89
++ E F ++ VT D A++F++PF + R+ R D + H +
Sbjct: 77 QFSGEVIFHTRILRHPCVTNDSDSANVFYVPFYAGLDVSRYLWRPSKAEDRDHLGHKLVE 136
Query: 90 ISQKYPYWNRTGGADHFYVACHSIG------RSAMQKAWEVKL-------NAIQVVCSSS 136
P W R G DHF + IG R + AW L N ++ S+
Sbjct: 137 WLSTQPAWTRARGRDHFTM----IGRITWDFRRPEENAWGSGLLNMAEMKNMTRLAIESN 192
Query: 137 YFISGH--IAHKDVSLPQIWPRQEDPPKLGSSKRNKLAF-FAGAVNSPV----REKLLQV 189
+ G + + PQ + ++ + +K L F FAGA + R +LL
Sbjct: 193 PWEGGEYGVPYPTSFHPQNEHQLQEWQEFVRNKERGLVFSFAGATRKRIPNDFRLELLAQ 252
Query: 190 WRND----SEIYAHSGRLKT--PYADELLGSKFCLHVKGFEVNTARIADSLYYGCVPVII 243
+ S + + +T P L S FCL +G I DS+ GC+PV
Sbjct: 253 CSDSRGACSAMDCSDSKCETPEPVVQLFLNSTFCLQPRGDGYTRRSIFDSVLAGCIPVFF 312
Query: 244 ANHYD-------LPFADVLNWKSFSIVVATLDIPLLKKILKGIS 280
N P D +S+S+ + D+ KI++ +S
Sbjct: 313 WNQSSYWQYKWFFPEED----ESYSVFIDREDVRKGTKIMEVLS 352
>gi|443712570|gb|ELU05824.1| hypothetical protein CAPTEDRAFT_92231 [Capitella teleta]
Length = 668
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/107 (25%), Positives = 51/107 (47%), Gaps = 5/107 (4%)
Query: 211 LLGSKFCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIP 270
L S FCL +G + + R ++L C+PV ++N++ LPF++V++W +I +
Sbjct: 253 LHNSTFCLVPRGRRLGSYRFLEALQAACIPVFLSNNWVLPFSEVIDWNQAAIWGDERLLL 312
Query: 271 LLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFYMVMYEL 317
+ I++ I + L L+ + F W + S D E+
Sbjct: 313 QIPSIVRSIRHADLL-----ALRQQTQFLWETYFSSIDKIVATTLEI 354
>gi|297282579|ref|XP_001108035.2| PREDICTED: exostoses (multiple)-like 1 isoform 1 [Macaca mulatta]
Length = 675
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 32/126 (25%), Positives = 60/126 (47%), Gaps = 7/126 (5%)
Query: 193 DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTA-RIADSLYYGCVPVIIANHYDLPF 251
D+ G +T + L + FCL + G A R +L GC+PV+++ ++LPF
Sbjct: 248 DARCEQDPGPGQTQRGETLPNATFCL-ISGHRPEAASRFLQALQAGCIPVLLSPRWELPF 306
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFY 311
++V++W +I VA +PL ++L + +E + L+ + F W + S +
Sbjct: 307 SEVIDWTKAAI-VADERLPL--QVLAAL--QEMSPARVLALRQQTQFLWDAYFSSVEKVI 361
Query: 312 MVMYEL 317
E+
Sbjct: 362 HTTLEV 367
>gi|355557695|gb|EHH14475.1| hypothetical protein EGK_00405 [Macaca mulatta]
Length = 675
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 32/126 (25%), Positives = 60/126 (47%), Gaps = 7/126 (5%)
Query: 193 DSEIYAHSGRLKTPYADELLGSKFCLHVKGFEVNTA-RIADSLYYGCVPVIIANHYDLPF 251
D+ G +T + L + FCL + G A R +L GC+PV+++ ++LPF
Sbjct: 248 DARCEQDPGPGQTQRGETLPNATFCL-ISGHRPEAASRFLQALQAGCIPVLLSPRWELPF 306
Query: 252 ADVLNWKSFSIVVATLDIPLLKKILKGISSEEYLLLQSYVLKVRKHFQWHVFPSDYDAFY 311
++V++W +I VA +PL ++L + +E + L+ + F W + S +
Sbjct: 307 SEVIDWTKAAI-VADERLPL--QVLAAL--QEMSPARVLALRQQTQFLWDAYFSSVEKVI 361
Query: 312 MVMYEL 317
E+
Sbjct: 362 HTTLEV 367
>gi|195334813|ref|XP_002034071.1| GM20090 [Drosophila sechellia]
gi|194126041|gb|EDW48084.1| GM20090 [Drosophila sechellia]
Length = 717
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 69/325 (21%), Positives = 125/325 (38%), Gaps = 67/325 (20%)
Query: 5 FRVYVYP-----HRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKAD 57
+VY+YP ++D A L S YF+ +A +KS + T +P++A
Sbjct: 108 LKVYIYPLQEFVDEQSDKTATTL------------SSEYFQILEAVLKSRYYTSNPNEAC 155
Query: 58 LFFLPFSIARMRHDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRSA 117
LF + F H + +W+R GA+H G +
Sbjct: 156 LFLPSLDLLNQNV-----------FDKHLAGSALASLDFWDR--GANHLIFNMLPGGAPS 202
Query: 118 MQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFAGA 177
+V + ++ + + DV++P PR +++R K
Sbjct: 203 YNTVLDVNTDN-AIIFGGGFDSWSYRPGFDVAIPVWSPRLVRQHAHATAQR-KFLLVVAQ 260
Query: 178 VNSPVREKLLQVWRNDSEIYAHSGRL-------------KTP---------YADELLGSK 215
+N + + ++ R S AHS +L + P Y L K
Sbjct: 261 LN--ILPRFVRTLRELS--LAHSEQLLLLGACENLDLTMRCPLSQHHKSLEYPRLLSRGK 316
Query: 216 FCLHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKI 275
FCL + + + + + C+PVI ++Y LPF DV++W S+ V ++ + +
Sbjct: 317 FCLLGRSLRMGQPDLVEIMSQHCIPVIAVDNYVLPFEDVIDWSLASVRVRENELHSVMQK 376
Query: 276 LKGISSEEYLLLQSYVLKVRKHFQW 300
LK ISS + +++++K QW
Sbjct: 377 LKAISSVK-------IVEMQKQVQW 394
>gi|195488335|ref|XP_002092270.1| GE14097 [Drosophila yakuba]
gi|194178371|gb|EDW91982.1| GE14097 [Drosophila yakuba]
Length = 717
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 79/366 (21%), Positives = 142/366 (38%), Gaps = 73/366 (19%)
Query: 5 FRVYVYP-----HRRNDPFANVLLPVDFEPRGNYASESYFK--KAFMKSHFVTKDPSKAD 57
+VY+YP ++D A L S YF+ +A +KS + T +P++A
Sbjct: 108 LKVYIYPLQEFVDEQSDKTATTL------------SSEYFQILEAVLKSRYYTSNPNEAC 155
Query: 58 LFFLPFSIARMR-HDRRIGTEGIPDFISHYIFNISQKYPYWNRTGGADHFYVACHSIGRS 116
LF + D+ + + +W+R GA+H G
Sbjct: 156 LFLPSLDLLNQNVFDKHLAGAALASL------------DFWDR--GANHIIFNMLPGGAP 201
Query: 117 AMQKAWEVKLNAIQVVCSSSYFISGHIAHKDVSLPQIWPRQEDPPKLGSSKRNKLAFFA- 175
+ +V + ++ + + DV++P PR +++R L A
Sbjct: 202 SYNTVLDVNTDN-AIIFGGGFDSWSYRPGFDVAIPVWSPRLVRQHAHATAQRKFLLVVAQ 260
Query: 176 -----------GAVNSPVREKLLQVWRNDS-------EIYAHSGRLKTPYADELLGSKFC 217
V+ E+LL + +S + H L+ P L KFC
Sbjct: 261 LNILPRFLRTLKEVSLANSEQLLLLGACESLDLTMRCPLSQHHKSLEYPRL--LSRGKFC 318
Query: 218 LHVKGFEVNTARIADSLYYGCVPVIIANHYDLPFADVLNWKSFSIVVATLDIPLLKKILK 277
L + + + + + C+PVI ++Y LPF DV++W SI + ++ + + LK
Sbjct: 319 LLGRSLRMGQPDLVEIMSQHCIPVIAVDNYVLPFEDVIDWSLASIRIRENELHSVMQKLK 378
Query: 278 GISSEEYLLLQSYVLKVRKHFQW--HVFPSDYDAFYMVMYELW------LRRSSVRVQWS 329
ISS + +++++K QW + D + E+ LR S R QW+
Sbjct: 379 AISSVK-------IVEMQKQVQWLFSKYFKDLKTVTLTALEVLESRIFPLRARSSR-QWN 430
Query: 330 TSLDSN 335
T +D+N
Sbjct: 431 T-IDTN 435
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.324 0.138 0.439
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,569,800,435
Number of Sequences: 23463169
Number of extensions: 227970700
Number of successful extensions: 535364
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1092
Number of HSP's successfully gapped in prelim test: 152
Number of HSP's that attempted gapping in prelim test: 532326
Number of HSP's gapped (non-prelim): 1666
length of query: 336
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 193
effective length of database: 9,003,962,200
effective search space: 1737764704600
effective search space used: 1737764704600
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 77 (34.3 bits)