BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 016223
(393 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224075287|ref|XP_002304587.1| predicted protein [Populus trichocarpa]
gi|222842019|gb|EEE79566.1| predicted protein [Populus trichocarpa]
Length = 379
Score = 467 bits (1202), Expect = e-129, Method: Compositional matrix adjust.
Identities = 230/390 (58%), Positives = 288/390 (73%), Gaps = 13/390 (3%)
Query: 3 SSSFLPSRSSASQRRTAQASDNPNTDGTHGPVLEQAPTPKPIFQFIALGLIVFLGLLQFL 62
SSSFLPS S A+QRRT+Q NP + G P + AP P+PIFQF+A GLI+FL LLQ L
Sbjct: 2 SSSFLPSTSFATQRRTSQKLGNPKSAGPLDPCITPAPNPRPIFQFLAFGLIIFLALLQLL 61
Query: 63 PATHFRHPSDPFRIWVPFNSNTSFSKTRDSGDRNSGSISSISQDDGMVHIVSWMQCLDLR 122
PATHFR P DPFR W P +SN S S + SGD DGMVH+VSWM CLDLR
Sbjct: 62 PATHFRDPFDPFRNWAPLHSNPS-SPVKFSGD------------DGMVHVVSWMDCLDLR 108
Query: 123 LLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQEEVKKVIR 182
LAVL NSTLS S +PDL+ FH F+P G+ED VSFYKLKVLFPHSNLE HGQE+VK+++R
Sbjct: 109 ALAVLANSTLSSSSHPDLVFFHFFIPGGNEDKVSFYKLKVLFPHSNLEIHGQEQVKEIMR 168
Query: 183 TASTGVKYSVQNFEEIVPFVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAA 242
T +G +Y+ ++EEIVPF+I +VH L KFIY+S +VI+K RVEELIG+DL+NYAIA A
Sbjct: 169 TVFSGGQYAEPSYEEIVPFIIPTVHNFLRKFIYVSANVIMKARVEELIGVDLNNYAIATA 228
Query: 243 DDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEKYILEAF 302
+DCS RL +YVN DVLDAIQRSAS+ WVS PYA + C+PD +L+I+ARKLEK +E
Sbjct: 229 EDCSRRLKNYVNLDVLDAIQRSASKAWVSETPYAKDFCLPDFSVLVINARKLEKDFVEIV 288
Query: 303 LWWKKVINQRDRSIGRSPAIALALYDRYLKLSSSWLVTDSTSSVVNKSLAIRYDGPMTAC 362
LWW K +N R+R+ ++ A+ALALY+ YLKLSSSWLV D +SS VN S+ I YDGP T+C
Sbjct: 289 LWWSKALNLRERTNKKNVAVALALYNSYLKLSSSWLVKDFSSSQVNNSMIIHYDGPKTSC 348
Query: 363 SEFGDGANMEPARGDLWKQHLPPLFYQMVG 392
E +G E + G++W ++LP +++G
Sbjct: 349 KESINGTASEYSHGNVWTRYLPSTSDRILG 378
>gi|224053693|ref|XP_002297932.1| predicted protein [Populus trichocarpa]
gi|222845190|gb|EEE82737.1| predicted protein [Populus trichocarpa]
Length = 387
Score = 443 bits (1139), Expect = e-122, Method: Compositional matrix adjust.
Identities = 229/390 (58%), Positives = 290/390 (74%), Gaps = 5/390 (1%)
Query: 3 SSSFLPSRSSASQRRTAQASDNPNTDGTHGPVLEQAPTPKPIFQFIALGLIVFLGLLQFL 62
+SS+LPS SA+QRR++Q SDNP T G P L AP P+PIFQF+A GL + LGLLQ L
Sbjct: 2 ASSYLPSAYSATQRRSSQTSDNPITAGPLYPGLVPAPNPRPIFQFLAFGLFILLGLLQLL 61
Query: 63 PATHFRHPSDPFRIWVPFNSNTSFSKTRDSGDRNSGSISSISQDDGMVHIVSWMQCLDLR 122
PATHFR P DP R WVP +SN + S R+ RN MVH+VSWM CLDLR
Sbjct: 62 PATHFRDPFDPSRNWVPLHSNLA-SPLREFDARNGDGGGDDG----MVHVVSWMDCLDLR 116
Query: 123 LLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQEEVKKVIR 182
+LAVL NSTLS S YP+L+ FH F+P G+ED V FYKLKVLFPHSNLE HGQEEVK+++R
Sbjct: 117 VLAVLANSTLSSSSYPELVSFHFFIPGGNEDKVPFYKLKVLFPHSNLEIHGQEEVKEIVR 176
Query: 183 TASTGVKYSVQNFEEIVPFVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAA 242
A + +Y+ +EEIVPF+I +VHQ LSKFIY+S +VI+K RVEELIG+DL +YAIA A
Sbjct: 177 IAFSDEQYAKPRYEEIVPFIIPTVHQFLSKFIYVSANVIMKARVEELIGVDLDDYAIATA 236
Query: 243 DDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEKYILEAF 302
+DCS+RL +YVN +VLDAIQRS S+PWVS PYA ++C+PDL +L+I+ARKL K I+E
Sbjct: 237 EDCSQRLKTYVNSEVLDAIQRSVSKPWVSETPYAKDTCLPDLSVLVINARKLGKDIVETV 296
Query: 303 LWWKKVINQRDRSIGRSPAIALALYDRYLKLSSSWLVTDSTSSVVNKSLAIRYDGPMTAC 362
LWW K +N R+R+ ++ A+ALALY+RYLKLSSSWLV D TS VN S+ I YDGP T+C
Sbjct: 297 LWWSKALNLRERTDQKNLALALALYNRYLKLSSSWLVKDITSPEVNNSMIIYYDGPKTSC 356
Query: 363 SEFGDGANMEPARGDLWKQHLPPLFYQMVG 392
+ GA E + G++W Q+LP + +++G
Sbjct: 357 IKSISGAASEYSHGNVWTQYLPSISDRILG 386
>gi|225465111|ref|XP_002270401.1| PREDICTED: uncharacterized protein LOC100242682 [Vitis vinifera]
gi|302143265|emb|CBI20560.3| unnamed protein product [Vitis vinifera]
Length = 382
Score = 425 bits (1093), Expect = e-116, Method: Compositional matrix adjust.
Identities = 209/380 (55%), Positives = 284/380 (74%), Gaps = 10/380 (2%)
Query: 13 ASQRRTAQASDNPNTDGTHGPVLEQAPTPKPIFQFIALGLIVFLGLLQFLPATHFRHPSD 72
A+ RR QAS + + G P ++ +P P+PI Q +A ++VFLGLLQFLPATHFRHPSD
Sbjct: 12 ATYRRPHQASTSVSPPGPPTPPVDASPNPRPIAQLLAFVVVVFLGLLQFLPATHFRHPSD 71
Query: 73 PFRIWVPFNSNTSFSKTRDSGDRNSGSISSISQDDGMVHIVSWMQCLDLRLLAVLVNSTL 132
P R WVPF+S + S + S++ +DG +HIVSWM CLDLR+L VL NSTL
Sbjct: 72 PLRKWVPFDSGSPVSSS---------FSRSVASEDGKIHIVSWMDCLDLRVLVVLANSTL 122
Query: 133 SGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSV 192
S SRYPD +HFH F+P+G +D VS++KLKVLFPHS+LE GQEEVK+ +RTA++G+ Y+
Sbjct: 123 SNSRYPDKVHFHFFIPEGHDDKVSYFKLKVLFPHSDLEIFGQEEVKEKVRTANSGIVYAG 182
Query: 193 QNFEEIVPFVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSY 252
++FEEIVPFVI ++H+S KFIY+SP+VIVKGRVEEL+ +L++YA+A +DCS+RL++Y
Sbjct: 183 RSFEEIVPFVIPTIHRSWGKFIYISPNVIVKGRVEELLEANLTSYAVAVIEDCSKRLDNY 242
Query: 253 VNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEKYILEAFLWWKKVINQR 312
VN +VL AIQR+AS+ W+SG PYA+ +CMPD +LLID RKL+K ++EA LWW KV+N
Sbjct: 243 VNSEVLAAIQRTASKSWISGTPYAMKACMPDSSILLIDPRKLDKDLVEAILWWSKVLNWS 302
Query: 313 DRSIGRSPAIALALYDRYLKLSSSWLVTDSTSSVVNKSLAIRYDGPMTACSEFGDGANME 372
+RS ++PAIALAL+ RY KLSSSWL+ S + + I YDGP CS FG+G + +
Sbjct: 303 ERSSPKNPAIALALHGRYSKLSSSWLLGHSRRGTGKEIMIIPYDGPTNMCSGFGNGPS-Q 361
Query: 373 PARGDLWKQHLPPLFYQMVG 392
G++WKQ+LPP+ +++G
Sbjct: 362 SVPGNIWKQYLPPMADRILG 381
>gi|242037931|ref|XP_002466360.1| hypothetical protein SORBIDRAFT_01g006383 [Sorghum bicolor]
gi|241920214|gb|EER93358.1| hypothetical protein SORBIDRAFT_01g006383 [Sorghum bicolor]
Length = 382
Score = 257 bits (657), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 140/347 (40%), Positives = 204/347 (58%), Gaps = 24/347 (6%)
Query: 39 PTPKPIFQFIALGLIVFLGLLQFLPATHFRHPSDPFRIWVPFNSNTSFSKTRDSGDRNSG 98
P P P Q +A+ IV LG LQFLPATHFR P+DP R W+P D +G
Sbjct: 48 PRPWPWMQKLAIVAIVVLGCLQFLPATHFRDPNDPQRNWIPI----------DGSGNPTG 97
Query: 99 SISSISQDDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFY 158
S++ + G V + SW+ CLDLR LAV+ NSTLS S P + FH +P+G D S++
Sbjct: 98 SLNYV----GSVDVFSWISCLDLRTLAVMTNSTLSSSSDPQNISFHFLIPEGGNDKSSYH 153
Query: 159 KLKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVPFVIASVHQSLSKFIYMSP 218
KLK L P S+L Q+++K + A+ + E+ PF+IA S +++Y++
Sbjct: 154 KLKALLPDSDLTVTSQKQIKDKLNVATPEGNFLWLFHMELSPFLIAKSQLSKKRYVYIAA 213
Query: 219 SVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRS-ASQPWVSGKPYAV 277
I+KG+VE+L +DL +YAI A DCS+RLN Y++ DVL AIQR+ A+Q WVS +PY
Sbjct: 214 DSIIKGKVEDLGRMDLGSYAIGAVLDCSKRLNDYISMDVLSAIQRAGAAQSWVSKEPYNK 273
Query: 278 NSCMPDLGMLLIDARKLEKYILEAFLWWKKVINQRDRSIGRSPAIALALYDRYLKLSSSW 337
++C+ D +LLI+ RKL+K ++ + +WW IN ++ A+ LA D+Y+KL S+W
Sbjct: 274 DACLLDFDVLLIEPRKLDKNLVASIMWWINAINLANQRDQIRLAVTLAFNDKYMKLPSNW 333
Query: 338 LVTDSTSSVVNKSLAIRYDGPMTACSEFGDGANMEPA-RGDLWKQHL 383
D + ++N +DGP CSE DG E + G W+Q+L
Sbjct: 334 KRKDGDTDILN------FDGPKIVCSE--DGRQHEQSGYGATWQQYL 372
>gi|147819623|emb|CAN76562.1| hypothetical protein VITISV_010091 [Vitis vinifera]
Length = 838
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 113/200 (56%), Positives = 161/200 (80%), Gaps = 1/200 (0%)
Query: 135 SRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSVQN 194
+RYPD +HFH F+P+G +D VS++KLKVLFPHS+LE GQEEVK+ +RTA++G+ Y+ ++
Sbjct: 620 NRYPDKVHFHFFIPEGHDDKVSYFKLKVLFPHSDLEIFGQEEVKEKVRTANSGIVYAGRS 679
Query: 195 FEEIVPFVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVN 254
FEEIVPFVI ++H+S KFIY+SP+VIVKGRVEEL+ +L++YA+A +DCS+RL++YVN
Sbjct: 680 FEEIVPFVIPTIHRSWXKFIYISPNVIVKGRVEELLEANLTSYAVAVIEDCSKRLDNYVN 739
Query: 255 PDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEKYILEAFLWWKKVINQRDR 314
+VL AIQR+AS+ W+SG PYA+ +CMPD +LLID RKL+K ++EA LWW KV+N +R
Sbjct: 740 SEVLAAIQRTASKSWISGTPYAMKACMPDSSILLIDPRKLDKDLVEAILWWSKVLNWSER 799
Query: 315 SIGRSPAIALALYDRYLKLS 334
+ +P I +Y ++L+
Sbjct: 800 FV-IAPTIFFVVYFGKIRLA 818
>gi|20303602|gb|AAM19029.1|AC084748_19 hypothetical protein [Oryza sativa Japonica Group]
gi|24899455|gb|AAN65025.1| hypothetical protein [Oryza sativa Japonica Group]
gi|108711353|gb|ABF99148.1| expressed protein [Oryza sativa Japonica Group]
Length = 373
Score = 240 bits (613), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 133/356 (37%), Positives = 204/356 (57%), Gaps = 28/356 (7%)
Query: 31 HGPVLEQAPTPKPIFQFIALGLIVFLGLLQFLPATHFRHPSDPFRIWVPFNSNTSFSKTR 90
HG + + P +P Q +A+ IV LG LQFLPATHFR P+DP R W+PF
Sbjct: 34 HGDI-DPPPRARPAVQKLAIAAIVVLGCLQFLPATHFRDPADPQRNWIPF---------- 82
Query: 91 DSGDRNSGSISSISQDDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKG 150
DR+ + + + G V++ SW+ CLDLR LAVL NST+S S P + F+ +P+
Sbjct: 83 ---DRSRKPVD-LPDEVGSVNVFSWISCLDLRTLAVLTNSTISSSSDPHNISFNFLIPER 138
Query: 151 SEDMVSFYKLKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVPFVIASVHQSL 210
D +YKLK + P SN+ Q++++ + A+ E+ P +I + S
Sbjct: 139 GTDKSPYYKLKAVLPDSNVTVASQKKIEDKLNVATPEGNLFWSFPNELSPIIIGTTQFSQ 198
Query: 211 SKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWV 270
+++Y+S IVKG++E+L IDL YAIAA +DC++R+ YV+ DVL+A+Q++A + V
Sbjct: 199 KRYVYISADSIVKGKIEDLGRIDLGTYAIAAVEDCTKRIGDYVSIDVLNAVQKTAPKNLV 258
Query: 271 SGKPYAVNSCMPDLGMLLIDARKLEKYILEAFLWWKK---VINQRDRSIGRSPAIALALY 327
+PY ++C+ D +L+++ RKL K ++++ + W + + N RD+ AI LALY
Sbjct: 259 YTEPYDKDACLLDFDVLVVEPRKLRKDLIDSIMVWFRAFSLANPRDQI---RLAITLALY 315
Query: 328 DRYLKLSSSWLVTDSTSSVVNKSLAIRYDGPMTACSEFGDGANMEPARGDLWKQHL 383
D YLKL S+W D+ S ++N Y+GP CSE G E GD W+ +L
Sbjct: 316 DNYLKLPSNWKRADANSDILN------YNGPKNVCSEGGRQLQEE-GNGDEWQLYL 364
>gi|413932915|gb|AFW67466.1| hypothetical protein ZEAMMB73_148709 [Zea mays]
Length = 382
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 150/383 (39%), Positives = 217/383 (56%), Gaps = 35/383 (9%)
Query: 10 RSSASQRRTAQASDNPNTDGT---HGPVLEQAPTPKPIFQFIALGLIVFLGLLQFLPATH 66
R A+ R+ + P G H V E P P P Q +A+ IV LG LQFLPATH
Sbjct: 16 RRPAAAARSGVGAQEPPPRGQLAIHADV-EPPPRPWPGMQKLAIVAIVVLGCLQFLPATH 74
Query: 67 FRHPSDPFRIWVPFNSNTSFSKTRDSGDRN-SGSISSISQDDGMVHIVSWMQCLDLRLLA 125
FR P+DP R W+PF+ G RN +GS++ + G V + SW+ CLDLR LA
Sbjct: 75 FRDPNDPQRNWIPFD-----------GSRNPTGSLNDV----GSVDVFSWISCLDLRTLA 119
Query: 126 VLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQEEVKKVIRTAS 185
VL NSTLS S P + FH +P+G D +S+ KLKVL P S+L Q+++K + A+
Sbjct: 120 VLTNSTLSSSSDPHNISFHYLIPEGGNDKLSYLKLKVLLPDSDLTVTSQKQIKDKLNVAT 179
Query: 186 TGVKYSVQNFEEIVPFVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDC 245
+ E+ PF+IA S +F+Y++ I+KG+VE+L ++L YAI A +DC
Sbjct: 180 PEGNFLWLFHTELSPFLIAKSQLSKKRFVYIAADSIIKGKVEDLGRMELGGYAIGAVEDC 239
Query: 246 SERLNSYVNPDVLDAIQRS-ASQPWVSGKPYAVNSCMPDLGMLLIDARKLEKYILEAFLW 304
S+RL+ Y++ DVL AIQR+ A+ WV +PY ++C+ D +LL++ RKLEK ++ + +W
Sbjct: 240 SKRLSDYISMDVLSAIQRAGAAFSWVPKEPYNEDACLLDFDVLLMEPRKLEKTLITSIMW 299
Query: 305 WKKVI---NQRDRSIGRSPAIALALYDRYLKLSSSWLVTDSTSSVVNKSLAIRYDGPMTA 361
W I NQRD A+ L+ +Y +L S+W D ++ + + +DGP
Sbjct: 300 WADAIRLANQRDLI---RLAVTLSFNGKYTRLPSNWKRADDDNADI-----LNFDGPKNV 351
Query: 362 CSEFGDGANME-PARGDLWKQHL 383
CSE DG E P G W+Q+L
Sbjct: 352 CSE--DGRQHEQPGYGATWQQYL 372
>gi|357113463|ref|XP_003558522.1| PREDICTED: uncharacterized protein LOC100841880 [Brachypodium
distachyon]
Length = 362
Score = 233 bits (594), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 131/349 (37%), Positives = 198/349 (56%), Gaps = 24/349 (6%)
Query: 35 LEQAPTPKPIFQFIALGLIVFLGLLQFLPATHFRHPSDPFRIWVPFNSNTSFSKTRDSGD 94
+E AP P P Q +A+ ++ LG LQFLPATHFRHP+DP R WVPF+
Sbjct: 28 VEPAPRPWPGMQMLAIAAVLVLGGLQFLPATHFRHPADPGRNWVPFDPT----------- 76
Query: 95 RNSGSISSISQDDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDM 154
R+ IS G + I SW+ CLDLR LAVL NSTLS S P ++F+ +P+G D
Sbjct: 77 RHPLDIS------GRIEIFSWISCLDLRTLAVLTNSTLSSSSEPHDVYFNFLIPEGGNDK 130
Query: 155 VSFYKLKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVPFVIASVHQSLSKFI 214
+ FYK+K + P SN+ Q+++K + A+ + E+ +IA+ S +++
Sbjct: 131 LPFYKIKSVLPDSNITVTSQKQIKDKLNVATPEGNFFWSFHNELSSIIIATTQPSKKRYV 190
Query: 215 YMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKP 274
Y+S ++KG++EEL IDL YAI A +DCS+R+ Y N DVL+A+ R+A S +P
Sbjct: 191 YISADSVIKGKIEELARIDLGIYAIGAVEDCSKRVGDYTNMDVLNAVHRTAPTGLASTEP 250
Query: 275 YAVNSCMPDLGMLLIDARKLEKYILEAFLWWKKVINQRDRSIGRSPAIALALYDRYLKLS 334
Y ++C+ D+ +L+++ R L++ ++A W ++ + G AI LA YD YLKL
Sbjct: 251 YNKDTCLLDIDVLVVEPRNLKRNTIDAIKVWVTGLSLANPRDGIQLAITLAFYDNYLKLP 310
Query: 335 SSWLVTDSTSSVVNKSLAIRYDGPMTACSEFGDGANMEPARGDLWKQHL 383
SSW ++ + + + YDGP CS G + E G+ W+Q+L
Sbjct: 311 SSWKRGNANADI------LHYDGPKNVCSADGR-QHQEQGSGETWRQYL 352
>gi|125588106|gb|EAZ28770.1| hypothetical protein OsJ_12790 [Oryza sativa Japonica Group]
Length = 302
Score = 194 bits (494), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 106/281 (37%), Positives = 168/281 (59%), Gaps = 17/281 (6%)
Query: 108 GMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHS 167
G V++ SW+ CLDLR LAVL NST+S S P + F+ +P+ D +YKLK + P S
Sbjct: 25 GSVNVFSWISCLDLRTLAVLTNSTISSSSDPHNISFNFLIPERGTDKSPYYKLKAVLPDS 84
Query: 168 NLEFHGQEEVKKVIRTAS--TGVKYSVQNFEEIVPFVIASVHQSLSKFIYMSPSVIVKGR 225
N+ Q++++ + A+ + +S N E+ P +I + S +++Y+S IVKG+
Sbjct: 85 NVTVASQKKIEDKLNVATPEGNLFWSFPN--ELSPIIIGTTQFSQKRYVYISADSIVKGK 142
Query: 226 VEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLG 285
+E+L IDL YAIAA +DC++R+ YV+ DVL+A+Q++A + V +PY ++C+ D
Sbjct: 143 IEDLGRIDLGTYAIAAVEDCTKRIGDYVSIDVLNAVQKTAPKNLVYTEPYDKDACLLDFD 202
Query: 286 MLLIDARKLEKYILEAFLWWKK---VINQRDRSIGRSPAIALALYDRYLKLSSSWLVTDS 342
+L+++ RKL K ++++ + W + + N RD+ AI LALYD YLKL S+W D+
Sbjct: 203 VLVVEPRKLRKDLIDSIMVWFRAFSLANPRDQI---RLAITLALYDNYLKLPSNWKRADA 259
Query: 343 TSSVVNKSLAIRYDGPMTACSEFGDGANMEPARGDLWKQHL 383
S ++N Y+GP CSE G E GD W+ +L
Sbjct: 260 NSDILN------YNGPKNVCSEGGRQL-QEEGNGDEWQLYL 293
>gi|115455681|ref|NP_001051441.1| Os03g0777600 [Oryza sativa Japonica Group]
gi|113549912|dbj|BAF13355.1| Os03g0777600, partial [Oryza sativa Japonica Group]
Length = 289
Score = 193 bits (491), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 106/281 (37%), Positives = 167/281 (59%), Gaps = 17/281 (6%)
Query: 108 GMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHS 167
G V++ SW+ CLDLR LAVL NST+S S P + F+ +P+ D +YKLK + P S
Sbjct: 12 GSVNVFSWISCLDLRTLAVLTNSTISSSSDPHNISFNFLIPERGTDKSPYYKLKAVLPDS 71
Query: 168 NLEFHGQEEVKKVIRTAS--TGVKYSVQNFEEIVPFVIASVHQSLSKFIYMSPSVIVKGR 225
N+ Q++++ + A+ + +S N E+ P +I + S +++Y+S IVKG+
Sbjct: 72 NVTVASQKKIEDKLNVATPEGNLFWSFPN--ELSPIIIGTTQFSQKRYVYISADSIVKGK 129
Query: 226 VEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLG 285
+E+L IDL YAIAA +DC++R+ YV+ DVL+A+Q++A + V +PY ++C+ D
Sbjct: 130 IEDLGRIDLGTYAIAAVEDCTKRIGDYVSIDVLNAVQKTAPKNLVYTEPYDKDACLLDFD 189
Query: 286 MLLIDARKLEKYILEAFLWWKKVI---NQRDRSIGRSPAIALALYDRYLKLSSSWLVTDS 342
+L+++ RKL K ++++ + W + N RD+ AI LALYD YLKL S+W D+
Sbjct: 190 VLVVEPRKLRKDLIDSIMVWFRAFSLANPRDQI---RLAITLALYDNYLKLPSNWKRADA 246
Query: 343 TSSVVNKSLAIRYDGPMTACSEFGDGANMEPARGDLWKQHL 383
S ++N Y+GP CSE G E GD W+ +L
Sbjct: 247 NSDILN------YNGPKNVCSEGGRQL-QEEGNGDEWQLYL 280
>gi|326515044|dbj|BAJ99883.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 370
Score = 189 bits (481), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 123/355 (34%), Positives = 203/355 (57%), Gaps = 29/355 (8%)
Query: 35 LEQAPTPKPIFQFIALGLIVFLGLLQFLPATHFRHPSDPFRIWVPFNSNTSFSKTRDSGD 94
+E AP P Q +AL ++ LG LQ LPATH RHP+D R WVP + +R S D
Sbjct: 33 VEPAPRAWPGMQMLALAAVLVLGGLQLLPATHTRHPADRSRTWVPVDP------SRHSQD 86
Query: 95 RNSGSISSISQDDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDM 154
+S + V + S + CLDLR LAVL NSTLS S P + F+ +P+G D
Sbjct: 87 --------LSHEVASVDVFSSISCLDLRTLAVLTNSTLSSSSDPHHVSFNFVIPEGGNDQ 138
Query: 155 VSFYKLKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVPFVIASVHQSLSKFI 214
V +YK+K + P SN+ Q+++K + A+ + E+ P V+A S +++
Sbjct: 139 VPYYKIKAVLPDSNITVTSQKQIKDKLNVATPEGNFFASFPNELSPIVVART-LSRKRYV 197
Query: 215 YMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWV-SGK 273
Y+S I+KG++E+L+ IDL ++A+ A++DCS+RL Y++ DVL+A+QR+A + V +
Sbjct: 198 YISADSIIKGKIEDLVRIDLGSHAVGASEDCSKRLGDYISMDVLNAVQRTAPRGLVHHTE 257
Query: 274 PYAVNSCMPDLGMLLIDARKLEKYILEAFLWWK---KVINQRDRSIGRSPAIALALYDRY 330
+ ++C+ D +LL++ R +++ +++ +W K+ N RD + A+ LA Y+ Y
Sbjct: 258 SFDKDACLLDFDVLLVEPRNIKRNPIDSIAFWTMAVKLANPRDSVML---AMTLAFYNDY 314
Query: 331 LKLSSSWLVTDSTSSVVNKSLAIRYDGPMTACSEFGDGANMEPARGDLWKQHLPP 385
LKL ++W ++ + + + YDGP C+E G + E G++W+Q+L P
Sbjct: 315 LKLPTNWERANADTDI------LYYDGPKNVCTEDG-RQHQEKGFGEIWQQYLGP 362
>gi|116790957|gb|ABK25803.1| unknown [Picea sitchensis]
Length = 381
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 116/330 (35%), Positives = 176/330 (53%), Gaps = 21/330 (6%)
Query: 32 GPV---LEQAPTPKPIFQFIALGLIVFLGLLQFLPATHFRHPSDPFRIWVPFNSNTSFSK 88
GP+ +++AP P IAL +++ LQ LPATH+R P D R W S +
Sbjct: 31 GPLHIDIDEAPKPWRFAPLIALLIMIIFLFLQLLPATHYRDPFDSHRQW------RSITP 84
Query: 89 TRDSGDRNSGSISSISQDDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVP 148
+ +R I ++ VHI SW LDLR LAVLVNST+S SR P ++F+LF+P
Sbjct: 85 DVLTANRTITHIPMQAKPLEHVHIFSWSDDLDLRPLAVLVNSTISNSRDPGNIYFYLFIP 144
Query: 149 KGSEDMVSFYKLKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVPFVIASVHQ 208
E+ S +LF SN+ + +++ + ++G + S +F +PF I ++Q
Sbjct: 145 DDWEEERSRSIFSLLFRKSNIFIYRHSVIRE--KAFASGGEDSKSDFPYFLPFYIPKIYQ 202
Query: 209 SLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQP 268
+L +FIY P +IVKG+VEEL ++L+N +AA +DCS Y+N +S+P
Sbjct: 203 NLRRFIYAVPDIIVKGKVEELFQVNLTNSPVAAVEDCSHNF-EYINA--------KSSRP 253
Query: 269 WVSGKPYAVNSCMPDLGMLLIDARKLEKYILEAFLWWKKVINQRDRSIGRSPAIALALYD 328
WV+ PYA N+C+ D +LL++ L K WKK+ + R+ AI LAL
Sbjct: 254 WVAQTPYAKNTCILDHSVLLVNVELLAKENFMEVTKWKKLFHTGGRNDQSDQAIMLALNG 313
Query: 329 RYLKLSSSWLVTDSTSSVVNKSLAI-RYDG 357
Y KL +SW +S S ++ + I +DG
Sbjct: 314 NYTKLDASWNSRESRFSGIDSDVKIFHFDG 343
>gi|168026280|ref|XP_001765660.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683086|gb|EDQ69499.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 527
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 107/346 (30%), Positives = 177/346 (51%), Gaps = 43/346 (12%)
Query: 35 LEQAPTPKPIFQFIALGLIVFLGLLQFLPATHFRHPSDPFRIWVP--------------F 80
LE AP P P ++AL L+ LQ P+TH+RHP D + W+P F
Sbjct: 90 LEPAPQPWPGLTYVALFLLCGFVALQMYPSTHWRHPQDRTKTWIPHDRTETDQASSTDPF 149
Query: 81 NSNTSFSKTRDSGDRNS-GSISSISQDDG---------------MVHIVSWMQCLDLRLL 124
S+ S + + + D S G++S++S D+ +VHI D R L
Sbjct: 150 YSSLSEEQGQQAQDLASQGTVSNLSNDEKHDEYTLAFLKKASEQVVHIFVSTDGADFRPL 209
Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPKG----SEDMVSFYK-LKVLFPHSNLEFHGQEEVKK 179
AVLVNST+S + +P+ LHFHL +P ++ + +F++ K+ N++F E K
Sbjct: 210 AVLVNSTISNAVHPERLHFHLVLPASHHSRAKHLAAFFQDTKIDIVSENIDFKDME--KH 267
Query: 180 VIRTASTGVKYSVQNFEEIVPFVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAI 239
+ ++ + +Q+ PF++ + + +FIY+ ++VKG +EELI IDL N A
Sbjct: 268 ITFRKNSKARPELQSVYNFAPFLLPLHFKDVGRFIYLDADIVVKGNIEELIQIDLGNRAA 327
Query: 240 AAADDCSERLNSYVNPDVLDAIQRSASQP-WVSGKPYAVNSCMPDLGMLLIDARK-LEKY 297
AA +DCS+ +Y + + L IQ +P WV +P ++C+ + G+L+ID + +++
Sbjct: 328 AAVEDCSQTFETYFDFNELAKIQARPEKPTWVPTEPIKPDACVFNRGVLVIDTNQWIKQQ 387
Query: 298 ILEAFLWWKKVINQRDRSIGR----SPAIALALYDRYLKLSSSWLV 339
+ EA LWW + + + P LALY +Y+KL + W V
Sbjct: 388 VTEAILWWMDEFQSAESVLYKYGLSQPPFLLALYGKYMKLDTPWNV 433
>gi|414876209|tpg|DAA53340.1| TPA: hypothetical protein ZEAMMB73_753685 [Zea mays]
Length = 403
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 89/214 (41%), Positives = 136/214 (63%), Gaps = 1/214 (0%)
Query: 105 QDDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLF 164
D G V + SW++CLDLR LAVL NSTLS S P + FH +P+G D +S++KLKVL
Sbjct: 120 NDVGSVDVFSWIRCLDLRTLAVLTNSTLSSSSDPHNISFHYLIPEGGNDKLSYFKLKVLL 179
Query: 165 PHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVPFVIASVHQSLSKFIYMSPSVIVKG 224
P S+L Q+++K + A+ + E+ PF+IA S +F+Y++ I+KG
Sbjct: 180 PDSDLTVTSQKQIKDKLNVATPKGNFLWLFHIELSPFLIAKSQLSKKRFVYIAADSIIKG 239
Query: 225 RVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRS-ASQPWVSGKPYAVNSCMPD 283
+VE+L ++L YAI A +DCS+RL+ Y++ DVL AIQR+ A+ WV +PY ++C+ D
Sbjct: 240 KVEDLGRMELGGYAIGAVEDCSKRLSDYISMDVLSAIQRAGATFSWVPKEPYNEDACLLD 299
Query: 284 LGMLLIDARKLEKYILEAFLWWKKVINQRDRSIG 317
+LL++ RKLEK ++ + +WW Q+ R +G
Sbjct: 300 FDVLLMEPRKLEKTLITSIMWWATGCLQQCRVMG 333
>gi|414876210|tpg|DAA53341.1| TPA: hypothetical protein ZEAMMB73_753685, partial [Zea mays]
Length = 218
Score = 160 bits (406), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 86/202 (42%), Positives = 131/202 (64%), Gaps = 1/202 (0%)
Query: 105 QDDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLF 164
D G V + SW++CLDLR LAVL NSTLS S P + FH +P+G D +S++KLKVL
Sbjct: 10 NDVGSVDVFSWIRCLDLRTLAVLTNSTLSSSSDPHNISFHYLIPEGGNDKLSYFKLKVLL 69
Query: 165 PHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVPFVIASVHQSLSKFIYMSPSVIVKG 224
P S+L Q+++K + A+ + E+ PF+IA S +F+Y++ I+KG
Sbjct: 70 PDSDLTVTSQKQIKDKLNVATPKGNFLWLFHIELSPFLIAKSQLSKKRFVYIAADSIIKG 129
Query: 225 RVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRS-ASQPWVSGKPYAVNSCMPD 283
+VE+L ++L YAI A +DCS+RL+ Y++ DVL AIQR+ A+ WV +PY ++C+ D
Sbjct: 130 KVEDLGRMELGGYAIGAVEDCSKRLSDYISMDVLSAIQRAGATFSWVPKEPYNEDACLLD 189
Query: 284 LGMLLIDARKLEKYILEAFLWW 305
+LL++ RKLEK ++ + +WW
Sbjct: 190 FDVLLMEPRKLEKTLITSIMWW 211
>gi|302767718|ref|XP_002967279.1| hypothetical protein SELMODRAFT_439806 [Selaginella moellendorffii]
gi|300165270|gb|EFJ31878.1| hypothetical protein SELMODRAFT_439806 [Selaginella moellendorffii]
Length = 1342
Score = 158 bits (400), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 115/373 (30%), Positives = 183/373 (49%), Gaps = 53/373 (14%)
Query: 1 MSSSSFLPSRSSASQRRTAQASDNPNTDGTHGPVL---EQAPTPKPIFQ---FIALGLIV 54
S + LP R +RR N + DG P+L E AP PKP AL +
Sbjct: 12 FSPAGALPVRRPLGKRR------NGDGDGEVDPLLPVPELAPAPKPWKGAPIVAALICVA 65
Query: 55 FLGLLQFLPATHFRHPSDPFRIWVPFNSNTSFSKTRDSGDRNSGSISSISQDDGMVHI-- 112
FL +LQ LPATH+RHP+DP R W+P D G+ + + + + + I
Sbjct: 66 FL-MLQMLPATHYRHPADPTRTWIPIRQEP------DQGE-SGKEVEEVDAREFVTGIKS 117
Query: 113 --VSWMQCL---DLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHS 167
+ C DLR L V++NSTL+ +R+ + FH+ + S+ KLK LFP +
Sbjct: 118 TSIDLFVCTDEKDLRPLVVVINSTLANTRHTQRIRFHIITTE-SQREAWLSKLKALFPLA 176
Query: 168 NLE---------FHGQEEV------KKVIRTASTGVKYSVQNFEEIVPFVIASVHQSLSK 212
++ FHG E++ V + + ++ + +PF + + + +
Sbjct: 177 AIDMVSFLDIVLFHGSEKIDFEEIGNHVFYRKDSKAREALTSPYNFLPFYLPRMFPGMQR 236
Query: 213 FIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQ-RSASQ--PW 269
IY+ V+ G +EEL DL ++ +AA +DCS+ SY N D+L IQ R AS+ PW
Sbjct: 237 IIYLDSDVV--GDIEELFNTDLEDHPVAAVEDCSQIFGSYFNFDLLHRIQSREASESTPW 294
Query: 270 VSGKPYAVNSCMPDLGMLLIDARK-LEKYILEAFLWWKKVINQRDRSIGR----SPAIAL 324
+ +P+ ++C+ + G+L+ID RK +E+ EA WW +Q + + + P L
Sbjct: 295 IPSQPFDPSACIFNRGVLVIDPRKWIEQNSTEAIEWWLDEFHQAQKPLYKYGVSQPPFLL 354
Query: 325 ALYDRYLKLSSSW 337
ALY+ Y KL ++W
Sbjct: 355 ALYNHYKKLDTAW 367
>gi|302754034|ref|XP_002960441.1| hypothetical protein SELMODRAFT_437525 [Selaginella moellendorffii]
gi|300171380|gb|EFJ37980.1| hypothetical protein SELMODRAFT_437525 [Selaginella moellendorffii]
Length = 1342
Score = 158 bits (399), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 115/373 (30%), Positives = 183/373 (49%), Gaps = 53/373 (14%)
Query: 1 MSSSSFLPSRSSASQRRTAQASDNPNTDGTHGPVL---EQAPTPKPIFQ---FIALGLIV 54
S + LP R +RR N + DG P+L E AP PKP AL +
Sbjct: 12 FSPAGALPVRRPLGKRR------NGDGDGEVDPLLPVPELAPAPKPWKGAPIVAALICVA 65
Query: 55 FLGLLQFLPATHFRHPSDPFRIWVPFNSNTSFSKTRDSGDRNSGSISSISQDDGMVHI-- 112
FL +LQ LPATH+RHP+DP R W+P + +D G+ + + + + + I
Sbjct: 66 FL-MLQMLPATHYRHPADPTRTWIPIR------QEQDQGE-SGKEVEEVDAREFVTGIKS 117
Query: 113 --VSWMQCL---DLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHS 167
+ C DLR L V++NSTL+ +R+ + FH+ + S+ KLK LFP +
Sbjct: 118 TSIDLFVCTDEKDLRPLVVVINSTLANTRHTQRIRFHIITTE-SQREAWLSKLKALFPLA 176
Query: 168 NLE---------FHGQEEV------KKVIRTASTGVKYSVQNFEEIVPFVIASVHQSLSK 212
++ FHG E++ V + + ++ + +PF + + + +
Sbjct: 177 AIDMVSFLDIVLFHGSEKIDFEEISNHVFYRKDSKAREALTSPYNFLPFYLPRMFPGMQR 236
Query: 213 FIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQ-RSASQ--PW 269
IY+ V+ G +EEL DL ++ +AA +DCS+ SY N D+L IQ R AS+ PW
Sbjct: 237 IIYLDSDVV--GDIEELFNTDLEDHPVAAVEDCSQIFGSYFNFDLLHRIQSREASESTPW 294
Query: 270 VSGKPYAVNSCMPDLGMLLIDARK-LEKYILEAFLWWKKVINQRDRSIGR----SPAIAL 324
+ +P+ +C+ + G+L+ID RK +E EA WW +Q + + + P L
Sbjct: 295 IPRQPFDPTACIFNRGVLVIDPRKWIEHNSTEAIEWWLDEFHQAQKPLYKYGVSQPPFLL 354
Query: 325 ALYDRYLKLSSSW 337
ALY+ Y KL ++W
Sbjct: 355 ALYNHYKKLDTAW 367
>gi|168016352|ref|XP_001760713.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162688073|gb|EDQ74452.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1016
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 68/245 (27%), Positives = 127/245 (51%), Gaps = 13/245 (5%)
Query: 79 PFNSNTSFSKTRDSGDRNSGSISSISQDDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYP 138
P + S T S R I + ++D +H+ DLR LAVL+NS+++ +P
Sbjct: 381 PILNAESEEGTNYSLKREDEPIDVVKRED--IHVFVCTDEADLRPLAVLINSSMANCPHP 438
Query: 139 DLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQ----EEVKKVIR-TASTGVKYSVQ 193
+ L +HL +P + +LK LFP++ +E + EV++ I TG + +
Sbjct: 439 ERLFYHLVMPYSQRNAAK--RLKHLFPNARVEMAEKYIDIREVEEHITFRNDTGARKELV 496
Query: 194 NFEEIVPFVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYV 253
+ +PF + + + + IY+ ++VKG +E L +DL +++AA +DCS+R Y
Sbjct: 497 SPYNFLPFYLPKTYSEIRRIIYLDSDIVVKGNLEVLNDVDLEGHSVAAIEDCSQRFQVYF 556
Query: 254 NPDVLDAIQRSAS--QP-WVSGKPYAVNSCMPDLGMLLIDARK-LEKYILEAFLWWKKVI 309
+ LD I + +P W+ +P+ ++C+ + G+L+ID + +E+ I +A +WW
Sbjct: 557 DFAQLDEIHKRQGPDRPKWLPDEPFNKSACVFNRGVLIIDTNQWIEQNITKAIVWWMDEF 616
Query: 310 NQRDR 314
+ D+
Sbjct: 617 RKADK 621
>gi|168001842|ref|XP_001753623.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695030|gb|EDQ81375.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 565
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 78/292 (26%), Positives = 145/292 (49%), Gaps = 35/292 (11%)
Query: 79 PFNSNTSFSKTRDSGDRNSGSISSISQDDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYP 138
P S S + DS ++ ++ + + +D +H+ DLR LAVL+NS+++ +P
Sbjct: 178 PIRSTESEDENIDSSNKGDETVEASTLED--IHVFVCTDEADLRPLAVLINSSMANCPHP 235
Query: 139 DLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQ----EEVKKVIR-TASTGVKYSVQ 193
+ L +HL +P + +LK L P + +E + EV++ I TG + +
Sbjct: 236 ERLFYHLVMPHNQRNAAK--RLKHLLPKARIEMAEKYIDIREVEEHITFRNDTGARKELV 293
Query: 194 NFEEIVPF-------------VIAS----VHQSLSKFIYMSPSVIVKGRVEELIGIDLSN 236
+ +PF VI S + Q + I +P ++++G +E L +DL
Sbjct: 294 SPYNFLPFYLPKTIFKLLRATVICSFCLAIGQRFIQLISSTPLIVLQGNLEVLNDVDLEG 353
Query: 237 YAIAAADDCSERLNSYVNPDVLDAIQRSAS--QP-WVSGKPYAVNSCMPDLGMLLIDARK 293
+++AA +DCS+R Y + LD IQ+ +P W+ +P+ ++C+ + G+L+ID ++
Sbjct: 354 HSVAAIEDCSQRFQVYFDFAQLDEIQKRQGPDRPSWLPDEPFNKSACVFNRGVLVIDTKE 413
Query: 294 -LEKYILEAFLWWKKVINQRDR----SIGRS-PAIALALYDRYLKLSSSWLV 339
+++ I +A +WW + D+ G S P LALY ++ L +W V
Sbjct: 414 WIDQNITKAIVWWMDEFRKADKKALYKAGMSQPPFLLALYGKHKVLDETWNV 465
>gi|168010261|ref|XP_001757823.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691099|gb|EDQ77463.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 264
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 57/153 (37%), Positives = 83/153 (54%), Gaps = 6/153 (3%)
Query: 193 QNFEEIVPFVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSY 252
Q+ VPF++ + + + IY+ V+VKG +EEL+ IDL N AIAA +DCS++L +Y
Sbjct: 18 QSMYSFVPFLLPQYFKDVGRLIYLDADVVVKGNIEELMHIDLENKAIAAVEDCSQKLETY 77
Query: 253 VNPDVLDAIQRSASQP-WVSGKPYAVNSCMPDLGMLLIDARKLEK-YILEAFLWWKKVIN 310
+ D L IQ +P WV +P N+C + G+L+ID K + +A WW
Sbjct: 78 FDLDRLAKIQARPEKPAWVPAEPINPNACGLNEGVLVIDTNPWNKQQVTKAIFWWMDEFR 137
Query: 311 QRDRSIGR----SPAIALALYDRYLKLSSSWLV 339
D ++ + P LALY RY KL S W V
Sbjct: 138 SADSALYKHGFSQPLFLLALYGRYKKLDSPWNV 170
>gi|219363365|ref|NP_001136812.1| uncharacterized protein LOC100216958 [Zea mays]
gi|194697210|gb|ACF82689.1| unknown [Zea mays]
Length = 126
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 45/109 (41%), Positives = 63/109 (57%)
Query: 105 QDDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLF 164
D G V + SW+ CLDLR LAVL NSTLS S P + FH +P+G D +S++KLKVL
Sbjct: 10 NDVGSVDVFSWISCLDLRSLAVLTNSTLSSSSDPHNISFHYLIPEGGNDKLSYFKLKVLL 69
Query: 165 PHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVPFVIASVHQSLSKF 213
P S+L Q+++K + A+ + E+ PF+IA S +F
Sbjct: 70 PDSDLTVTSQKQIKDKLNVATPEGNFLWLFHTELSPFLIAKSQLSKKRF 118
>gi|323452866|gb|EGB08739.1| hypothetical protein AURANDRAFT_5202, partial [Aureococcus
anophagefferens]
Length = 153
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 45/136 (33%), Positives = 68/136 (50%), Gaps = 12/136 (8%)
Query: 211 SKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWV 270
K +Y+ V+V+G V EL ID+ +A AA +DCS+++ YVN ++L + W
Sbjct: 19 EKILYLDTDVVVRGDVGELAAIDMEGFAAAAVEDCSQQVAKYVNLELLADV-----DAWG 73
Query: 271 SGKPYAVN--SCMPDLGMLLID-AR----KLEKYILEAFLWWKKVINQRDRSIGRSPAIA 323
G + +C+ + G++L D AR +L + I E + K + R P
Sbjct: 74 LGARVREHGGACVFNRGVVLFDPARWRNLRLTETIEELVAAFTKSSARLWRGGISQPPFL 133
Query: 324 LALYDRYLKLSSSWLV 339
LAL RYLKL SW V
Sbjct: 134 LALAGRYLKLDISWNV 149
>gi|449469050|ref|XP_004152234.1| PREDICTED: probable galacturonosyltransferase-like 7-like [Cucumis
sativus]
gi|449515897|ref|XP_004164984.1| PREDICTED: probable galacturonosyltransferase-like 7-like [Cucumis
sativus]
Length = 367
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 75/310 (24%), Positives = 118/310 (38%), Gaps = 36/310 (11%)
Query: 49 ALGLIVFLGLLQ-FLPATHFRHPSDPFRIWVPFNSNTS-----FSKTRDSGDRNSGSISS 102
A+ +IV LQ F PA R F + +P ++ +S FS + S RN+G
Sbjct: 14 AMVMIVLAPALQSFPPAEAIRSSHLDFYLRLPVDTASSRSLNRFSFRKASTFRNAGQCGG 73
Query: 103 ISQ---------DDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGS-E 152
D +VH+ + LR VNS L S P+ + FH V S E
Sbjct: 74 GDSRFSGKFGVCDPNLVHVAITLDVEYLRGSVAAVNSILRNSLCPESVFFHFLVSDTSLE 133
Query: 153 DMVSFYKLKVLFPHSNLE--FHGQEEVKKVIRTASTGVKYSVQNFEEIVPFVIASVHQS- 209
D V + FP N + + E V+ +I ST V+ +++ +A + +S
Sbjct: 134 DFV-----RSTFPQMNFKVYYFDPEIVRNLI---STSVRQALEQPLNYARNYLAGLLESC 185
Query: 210 LSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPW 269
+ K IY+ +IV + +L +L + I A + C + Y R S
Sbjct: 186 VKKVIYLDSDLIVVDDIRKLWTTNLGEWTIGAPEYCHANFSKYFT-------TRFWSDER 238
Query: 270 VSGKPYAVNSCMPDLGMLLIDARKLEK-YILEAFLWWKKV-INQRDRSIGRSPAIALALY 327
G C + G+++ID K E WW K+ + R +G P L
Sbjct: 239 FFGTFAGRKPCYFNTGVMVIDLVKWRNGGYTEKIEWWMKLQKSNRIYELGSLPPFLLVFA 298
Query: 328 DRYLKLSSSW 337
+ W
Sbjct: 299 GNVATIEHRW 308
>gi|323447251|gb|EGB03182.1| hypothetical protein AURANDRAFT_34357 [Aureococcus anophagefferens]
Length = 185
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 46/181 (25%), Positives = 82/181 (45%), Gaps = 41/181 (22%)
Query: 192 VQNFEEIVPFVIASVHQSL-----SKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCS 246
V+ E + PF A+ + + +Y+ IV+G V EL +DL AA +DC+
Sbjct: 9 VKRKELLSPFNFAAFYLPYVLLESRRVLYLDTDAIVEGDVGELAHLDLGGAPAAAVEDCT 68
Query: 247 ERLNSYVNPDVLDA-----------IQRSASQPW----VSGKPYAVNSCMPDLGMLLIDA 291
+++ Y+N ++L+ + R+A + W + Y+ +C+ + G++L D
Sbjct: 69 QKVFKYINYELLERYDSGGRSKLGPMNRNAPRLWSRFGFTADAYSNETCVFNRGVVLFDC 128
Query: 292 ---RKLE-----KYILEAFL-----WWKKVINQRDRSIGRSPAIALALYDRYLKLSSSWL 338
R+L + +++AF+ W+ I+Q P LAL RY KL W
Sbjct: 129 PRWRELRLTETIEDLVDAFVASRAKLWRGGISQ--------PPFLLALAGRYFKLDMEWN 180
Query: 339 V 339
V
Sbjct: 181 V 181
>gi|218193839|gb|EEC76266.1| hypothetical protein OsI_13731 [Oryza sativa Indica Group]
Length = 83
Score = 50.1 bits (118), Expect = 0.002, Method: Composition-based stats.
Identities = 27/63 (42%), Positives = 35/63 (55%), Gaps = 7/63 (11%)
Query: 321 AIALALYDRYLKLSSSWLVTDSTSSVVNKSLAIRYDGPMTACSEFGDGANMEPARGDLWK 380
A+ LALYD YLKL S+W D+ S ++N Y+GP CSE G E GD W+
Sbjct: 19 AVTLALYDNYLKLPSNWKRADANSDILN------YNGPKNVCSEGGRQLQ-EEGNGDEWQ 71
Query: 381 QHL 383
+L
Sbjct: 72 LYL 74
>gi|15217851|ref|NP_171772.1| galacturonosyltransferase 5 [Arabidopsis thaliana]
gi|42571307|ref|NP_973744.1| galacturonosyltransferase 5 [Arabidopsis thaliana]
gi|75172933|sp|Q9FWY9.1|GATL5_ARATH RecName: Full=Probable galacturonosyltransferase-like 5
gi|9972380|gb|AAG10630.1|AC022521_8 Unknown protein [Arabidopsis thaliana]
gi|25083419|gb|AAN72073.1| Unknown protein [Arabidopsis thaliana]
gi|26452192|dbj|BAC43184.1| unknown protein [Arabidopsis thaliana]
gi|31711862|gb|AAP68287.1| At1g02720 [Arabidopsis thaliana]
gi|332189341|gb|AEE27462.1| galacturonosyltransferase 5 [Arabidopsis thaliana]
gi|332189342|gb|AEE27463.1| galacturonosyltransferase 5 [Arabidopsis thaliana]
Length = 361
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 78/338 (23%), Positives = 135/338 (39%), Gaps = 36/338 (10%)
Query: 49 ALGLIVFLGLLQ-FLPATHFR--HP-SDPFRIWVPFNSNTSFSKTRDSGD-RNSGSISSI 103
AL +I+ LQ F PA R HP +D F+ N+ +S R+S RN+ S
Sbjct: 14 ALAMILLSPSLQSFSPAAAIRSSHPYADEFKP----QQNSDYSSFRESPMFRNAEQCRSS 69
Query: 104 SQDDG-----MVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFY 158
+D G +VH+ + LR VNS L S P + FH V S+++ S
Sbjct: 70 GEDSGVCNPNLVHVAITLDIDYLRGSIAAVNSILQHSMCPQSVFFHFLVSSESQNLESL- 128
Query: 159 KLKVLFPH-SNLE--FHGQEEVKKVIRTASTGVKYSV-QNFEEIVPFVIASVHQSLSKFI 214
++ FP +NL+ + E V+ +I S+ V+ ++ Q ++ + + + I
Sbjct: 129 -IRSTFPKLTNLKIYYFAPETVQSLI---SSSVRQALEQPLNYARNYLADLLEPCVKRVI 184
Query: 215 YMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKP 274
Y+ ++V + +L L I A + C Y S +G
Sbjct: 185 YLDSDLVVVDDIVKLWKTGLGQRTIGAPEYCHANFTKYFTGGFW-------SDKRFNGTF 237
Query: 275 YAVNSCMPDLGMLLIDARKLEKY----ILEAFLWWKKVINQRDRSIGRSPAIALALYDRY 330
N C + G+++ID +K ++ +E ++ +K+ +R +G P L
Sbjct: 238 KGRNPCYFNTGVMVIDLKKWRQFRFTKRIEKWMEIQKI--ERIYELGSLPPFLLVFAGHV 295
Query: 331 LKLSSSWLVTDSTSSVVNKSLAIRYDGPMTACSEFGDG 368
+S W V S + GP++ G G
Sbjct: 296 APISHRWNQHGLGGDNVRGSCRDLHSGPVSLLHWSGSG 333
>gi|224089503|ref|XP_002308736.1| glycosyltransferase [Populus trichocarpa]
gi|222854712|gb|EEE92259.1| glycosyltransferase [Populus trichocarpa]
Length = 648
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 61/280 (21%), Positives = 110/280 (39%), Gaps = 49/280 (17%)
Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKL-----KVLFPHSNL---------- 169
AV+VNST++ S++P L FHL + S + + L K N+
Sbjct: 354 AVVVNSTVTNSKHPSKLVFHLVSDRLSYAAMRMWFLVNPPGKATIQVQNIDEFTWLNSSY 413
Query: 170 -----EFHGQEEVKKVIRTAS----TGVKYSVQNFEEIVP---FVIASVHQSLSKFIYMS 217
+ H Q + R S + +KY + I+ F + + L+K +++
Sbjct: 414 SPVLKQLHSQSMIDYYFRAHSANSDSNLKYRNPKYLSILNHLRFYLPEIFPKLNKVLFLD 473
Query: 218 PSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWVSGKP 274
++V+ + L +DL A + C E R ++Y+N S P +S
Sbjct: 474 DDIVVQKDLTGLWSLDLKGKVNGAVETCRESFHRFDTYLN----------FSNPLISNN- 522
Query: 275 YAVNSCMPDLGMLLIDARKLEKY-ILEAFLWWKKVINQRDR-SIGRSPAIALALYDRYLK 332
+ +C GM L D + ++ I + + W+K+ + R +G P + L+ R
Sbjct: 523 FDPRACGWAYGMNLFDLEEWKRQNITDVYHSWQKLNHDRQLWKLGTLPPGLITLWKRTHP 582
Query: 333 LSSSWLVT------DSTSSVVNKSLAIRYDGPMTACSEFG 366
L W V + + + + I Y+G M E G
Sbjct: 583 LDRRWHVLGLGYNPNVSQIEIERGAVIHYNGNMKPWLEIG 622
>gi|115475918|ref|NP_001061555.1| Os08g0327100 [Oryza sativa Japonica Group]
gi|38423965|dbj|BAD01674.1| putative glycosyltransferase [Oryza sativa Japonica Group]
gi|38637194|dbj|BAD03445.1| putative glycosyltransferase [Oryza sativa Japonica Group]
gi|113623524|dbj|BAF23469.1| Os08g0327100 [Oryza sativa Japonica Group]
gi|222640351|gb|EEE68483.1| hypothetical protein OsJ_26894 [Oryza sativa Japonica Group]
Length = 643
Score = 47.8 bits (112), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 64/282 (22%), Positives = 113/282 (40%), Gaps = 53/282 (18%)
Query: 125 AVLVNSTLSGSRYPDLLHFHLFV---------------PKGS-----EDMVSFYKLKVLF 164
AV+VNST+ +++P FH+ P G ++ F L +
Sbjct: 349 AVVVNSTVLNAKHPSHHVFHIVTDRLNYAPMRMWFLSNPPGKATIEVRNIEEFTWLNASY 408
Query: 165 PHSNLEFHGQEEVKKVIRT----ASTGVKYSVQNFEEIVP---FVIASVHQSLSKFIYMS 217
+ Q + RT + + +KY + I+ F + ++ +L K +++
Sbjct: 409 SPVLKQLESQSMIDYYFRTHRANSDSNLKYRNPKYLSILNHLRFYLPEIYPNLHKIVFLD 468
Query: 218 PSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWVSGKP 274
V++K + L ID+ I + C E R + Y+N S P V K
Sbjct: 469 DDVVIKKDLTSLWSIDMKGKVIGVVETCGESFHRFDRYLN----------FSNP-VIVKN 517
Query: 275 YAVNSCMPDLGMLLID-ARKLEKYILEAFLWWKKVINQRDR---SIGRSPAIALALYDRY 330
+ ++C GM + D A + I E + W+K +NQ DR +G P + +++
Sbjct: 518 FDPHACGWAFGMNVFDLAEWRRQNITEIYHSWQK-LNQ-DRLLWKLGTLPPGLITFWNKT 575
Query: 331 LKLSSSWLVT------DSTSSVVNKSLAIRYDGPMTACSEFG 366
L L+ SW V +S + ++ I Y+G M E G
Sbjct: 576 LPLNRSWHVLGLGYNPHVSSRDIERAAVIHYNGNMKPWLEIG 617
>gi|449458506|ref|XP_004146988.1| PREDICTED: probable galacturonosyltransferase 6-like [Cucumis
sativus]
Length = 603
Score = 47.4 bits (111), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 66/276 (23%), Positives = 114/276 (41%), Gaps = 30/276 (10%)
Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQEEVKKVIRTA 184
AV+VNST+S + P+ + FHL + +S + L + +E E+ K +
Sbjct: 327 AVVVNSTISSATEPEKIVFHLVTNSLNLPAMSMWFLLNPPGKATIEVLSMEDFKWLSNEY 386
Query: 185 STGVKYSVQNFEE--------IVPFVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSN 236
G + +QN + + F + ++ SL K I + V+V+ + L + +
Sbjct: 387 DLG--WKMQNSSDPRFTSELNYLRFYLPNIFPSLDKVILLDHDVVVQKDLSGLWHVGMKG 444
Query: 237 YAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEK 296
A + C + S++ D+ S P V K + +C GM L D R+ +
Sbjct: 445 KVNGAVETCQDTEVSFLRMDMFINF----SDP-VINKKFNNKACTWAFGMNLFDLRRWRE 499
Query: 297 YILEAFLWWKKVINQRDRSI---GRSPAIALALYDRYLKLSSSWLVT----DSTS--SVV 347
L A L+ K + +R I G P + Y++ L W V DST ++
Sbjct: 500 ENLTA-LYHKYLRLSNERPILKGGSLPLGWVTFYNQTTALERRWHVLGLGHDSTVLLDII 558
Query: 348 NKSLAIRYDGPMTACSEFGDGANMEPARGDLWKQHL 383
K+ I YDG + G G E LW++++
Sbjct: 559 RKAAVIHYDGVRKPWLDIGFGEYKE-----LWRKYI 589
>gi|218200956|gb|EEC83383.1| hypothetical protein OsI_28798 [Oryza sativa Indica Group]
Length = 621
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 64/282 (22%), Positives = 113/282 (40%), Gaps = 53/282 (18%)
Query: 125 AVLVNSTLSGSRYPDLLHFHLFV---------------PKGS-----EDMVSFYKLKVLF 164
AV+VNST+ +++P FH+ P G ++ F L +
Sbjct: 327 AVVVNSTVLNAKHPSHHVFHIVTDRLNYAPMRMWFLSNPPGKATIEVRNIEEFTWLNASY 386
Query: 165 PHSNLEFHGQEEVKKVIRT----ASTGVKYSVQNFEEIVP---FVIASVHQSLSKFIYMS 217
+ Q + RT + + +KY + I+ F + ++ +L K +++
Sbjct: 387 SPVLKQLESQSMIDYYFRTHRANSDSNLKYRNPKYLSILNHLRFYLPEIYPNLHKIVFLD 446
Query: 218 PSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWVSGKP 274
V++K + L ID+ I + C E R + Y+N S P V K
Sbjct: 447 DDVVIKKDLTSLWSIDMKGKVIGVVETCGESFHRFDRYLN----------FSNP-VIVKN 495
Query: 275 YAVNSCMPDLGMLLID-ARKLEKYILEAFLWWKKVINQRDR---SIGRSPAIALALYDRY 330
+ ++C GM + D A + I E + W+K +NQ DR +G P + +++
Sbjct: 496 FDPHACGWAFGMNVFDLAEWRRQNITEIYHSWQK-LNQ-DRLLWKLGTLPPGLITFWNKT 553
Query: 331 LKLSSSWLVT------DSTSSVVNKSLAIRYDGPMTACSEFG 366
L L+ SW V +S + ++ I Y+G M E G
Sbjct: 554 LPLNRSWHVLGLGYNPHVSSRDIERAAVIHYNGNMKPWLEIG 595
>gi|334182350|ref|NP_001184925.1| alpha-1,4-galacturonosyltransferase [Arabidopsis thaliana]
gi|332189915|gb|AEE28036.1| alpha-1,4-galacturonosyltransferase [Arabidopsis thaliana]
Length = 602
Score = 47.4 bits (111), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 60/284 (21%), Positives = 109/284 (38%), Gaps = 44/284 (15%)
Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQEEVKKVIRTA 184
+V+VNST+S S+ P+ + FH+ + +S + L + + ++ +++ + R
Sbjct: 326 SVVVNSTISSSKEPERIVFHVVTDSLNYPAISMWFLLNIQSKATIQILNIDDMDVLPRDY 385
Query: 185 STGVKYSVQNFEEIVP------FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYA 238
+ N + F + + L+K + + V+V+ + L ID+
Sbjct: 386 DQLLMKQNSNDPRFISTLNHARFYLPDIFPGLNKMVLLDHDVVVQRDLSRLWSIDMKGKV 445
Query: 239 IAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLLID-----ARK 293
+ A + C E +S+ + S WV+GK ++ +C GM LID RK
Sbjct: 446 VGAVETCLEGESSFRSMSTFINF----SDTWVAGK-FSPRACTWAFGMNLIDLEEWRIRK 500
Query: 294 LEKYILEAF-------LWWKKVINQRDRSIGRSPAIALALYDRYLKLSSSWLV------T 340
L ++ F LW G P L Y + L L W V +
Sbjct: 501 LTSTYIKYFNLGTKRPLW----------KAGSLPIGWLTFYRQTLALDKRWHVMGLGRES 550
Query: 341 DSTSSVVNKSLAIRYDGPMTACSEFGDGANMEPARGDLWKQHLP 384
+ + ++ I YDG M + G + W H+P
Sbjct: 551 GVKAVDIEQAAVIHYDGVMKPWLDIG-----KENYKRYWNIHVP 589
>gi|18390688|ref|NP_563771.1| alpha-1,4-galacturonosyltransferase [Arabidopsis thaliana]
gi|75191689|sp|Q9M9Y5.1|GAUT6_ARATH RecName: Full=Probable galacturonosyltransferase 6
gi|7523701|gb|AAF63140.1|AC011001_10 Unknown protein [Arabidopsis thaliana]
gi|15028087|gb|AAK76574.1| unknown protein [Arabidopsis thaliana]
gi|20259311|gb|AAM14391.1| unknown protein [Arabidopsis thaliana]
gi|332189914|gb|AEE28035.1| alpha-1,4-galacturonosyltransferase [Arabidopsis thaliana]
Length = 589
Score = 47.4 bits (111), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 60/284 (21%), Positives = 109/284 (38%), Gaps = 44/284 (15%)
Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQEEVKKVIRTA 184
+V+VNST+S S+ P+ + FH+ + +S + L + + ++ +++ + R
Sbjct: 313 SVVVNSTISSSKEPERIVFHVVTDSLNYPAISMWFLLNIQSKATIQILNIDDMDVLPRDY 372
Query: 185 STGVKYSVQNFEEIVP------FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYA 238
+ N + F + + L+K + + V+V+ + L ID+
Sbjct: 373 DQLLMKQNSNDPRFISTLNHARFYLPDIFPGLNKMVLLDHDVVVQRDLSRLWSIDMKGKV 432
Query: 239 IAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLLID-----ARK 293
+ A + C E +S+ + S WV+GK ++ +C GM LID RK
Sbjct: 433 VGAVETCLEGESSFRSMSTFINF----SDTWVAGK-FSPRACTWAFGMNLIDLEEWRIRK 487
Query: 294 LEKYILEAF-------LWWKKVINQRDRSIGRSPAIALALYDRYLKLSSSWLV------T 340
L ++ F LW G P L Y + L L W V +
Sbjct: 488 LTSTYIKYFNLGTKRPLW----------KAGSLPIGWLTFYRQTLALDKRWHVMGLGRES 537
Query: 341 DSTSSVVNKSLAIRYDGPMTACSEFGDGANMEPARGDLWKQHLP 384
+ + ++ I YDG M + G + W H+P
Sbjct: 538 GVKAVDIEQAAVIHYDGVMKPWLDIG-----KENYKRYWNIHVP 576
>gi|357111850|ref|XP_003557723.1| PREDICTED: probable galacturonosyltransferase 11-like [Brachypodium
distachyon]
Length = 539
Score = 46.6 bits (109), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 63/275 (22%), Positives = 109/275 (39%), Gaps = 54/275 (19%)
Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQEE-------- 176
+V+VNST+S + +P L FH+ + +S + L F +E +E
Sbjct: 247 SVVVNSTVSNANHPQQLVFHVVTDRIHYGAMSTWFLMNDFKGCTVEVRCIDEFPWLNAAS 306
Query: 177 ---VKKV------------IRTASTGVKYSVQNFEEIVP---FVIASVHQSLSKFIYMSP 218
V+++ ++T +K+ F ++ F I + +L K I++
Sbjct: 307 SPLVRRLSEMETKGYYYGGLKTPEREIKFHNPKFVSLLNHLRFYIPQILPNLEKVIFLDD 366
Query: 219 SVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWVSGK-- 273
V+V+ + +L I+L I A + C E R + Y+N SQP +S K
Sbjct: 367 DVVVQKDLTQLFSIELHGNVIGAVETCLESFHRYHKYLN----------FSQPIISSKID 416
Query: 274 PYAVNSCMPDLGMLLIDA---RKLEKYILEAFLWWKKVINQRDRSIGRSPAIALALYDRY 330
P+ +C GM + D RK L + W ++ ++Q G PA L Y
Sbjct: 417 PH---TCGWAFGMNIFDLIAWRKANATALYHY-WEEQNMDQLLWRTGTLPAGLLTFYGLM 472
Query: 331 LKLSSSWLV------TDSTSSVVNKSLAIRYDGPM 359
L W V D ++ + + Y+G M
Sbjct: 473 EPLDRRWHVLGLGYDVDIDDRLIESAAVVHYNGNM 507
>gi|356536655|ref|XP_003536852.1| PREDICTED: probable galacturonosyltransferase 6-like [Glycine max]
Length = 585
Score = 46.2 bits (108), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 39/169 (23%), Positives = 72/169 (42%), Gaps = 25/169 (14%)
Query: 201 FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE------RLNSYVN 254
F + + +L+K + + V+V+ + EL I++ I A C E R++ ++N
Sbjct: 395 FYLPDIFPALNKIVLLDHDVVVQQDLSELWNINMKGNVIGAVGTCQEGKIPFYRIDMFIN 454
Query: 255 PDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEKYILEAFLWWKKVINQRDR 314
S P + GK + N+C GM L D ++ ++ L A ++ +
Sbjct: 455 ----------LSDPLI-GKRFDANACTWAFGMNLFDLQQWRRHNLTAV--YQNYVQMGLW 501
Query: 315 SIGRSPAIALALYDRYLKLSSSWLV------TDSTSSVVNKSLAIRYDG 357
+IG P L Y++ L W V +D + + ++ I YDG
Sbjct: 502 NIGSLPLGWLTFYNKTELLDRQWHVLGLGYSSDVDRNEIEQASVIHYDG 550
>gi|356504981|ref|XP_003521271.1| PREDICTED: probable galacturonosyltransferase 6-like [Glycine max]
Length = 625
Score = 45.8 bits (107), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 62/256 (24%), Positives = 111/256 (43%), Gaps = 29/256 (11%)
Query: 120 DLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQE---- 175
+L AV+VNST+S ++ + L FH V S + + + +L P H Q
Sbjct: 346 NLLACAVVVNSTVSNAKKQEKLVFH--VVTNSLNFPAIWMWFLLNPPGKATVHIQSIENF 403
Query: 176 ----EVKKVIRTASTGVKYSVQ-NFEEIVPFVIASVHQSLSKFIYMSPSVIVKGRVEELI 230
+ + S+ +Y+ + N+ + F + + +L+K ++ V+V+ + L
Sbjct: 404 EWLPKYNTFNKHNSSDPRYTSELNY---LRFYLPDIFPTLNKILFFDHDVVVQQDLSGLW 460
Query: 231 GIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLLID 290
++ IAA C E S+ D+ S P+++ K + VN+C GM L D
Sbjct: 461 NANMKGKVIAAVGTCQEGGTSFHRMDMFINF----SDPFIA-KRFDVNACTWAFGMNLFD 515
Query: 291 ARKLEKYILEAFLWWKKVINQRDR---SIGRSPAIALALYDRYLKLSSSWLVTD-STSSV 346
++ ++ L A L+ + + R +IG P L Y++ L W + SV
Sbjct: 516 LQQWRRHNLTA-LYHRYLQMGSKRPLWNIGSLPLGWLTFYNKTKVLDRRWHILGLGYDSV 574
Query: 347 VNK-----SLAIRYDG 357
V+K + I YDG
Sbjct: 575 VDKNEIERAAIIHYDG 590
>gi|297843072|ref|XP_002889417.1| hypothetical protein ARALYDRAFT_470231 [Arabidopsis lyrata subsp.
lyrata]
gi|297335259|gb|EFH65676.1| hypothetical protein ARALYDRAFT_470231 [Arabidopsis lyrata subsp.
lyrata]
Length = 361
Score = 45.8 bits (107), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 77/336 (22%), Positives = 128/336 (38%), Gaps = 32/336 (9%)
Query: 49 ALGLIVFLGLLQ-FLPATHFR--HP-SDPFRIWVPFNSNTSFSKTRDSG-DRNSGSISSI 103
AL +I+ LQ F PA R HP +D F+ N+ S R+S RN+ S
Sbjct: 14 ALAMILLSPSLQSFSPAAAIRSSHPYADEFKP----QQNSDHSSFRESPLFRNAEQCRSS 69
Query: 104 SQDDG-----MVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFY 158
+D G +VH+ + LR VNS L S P + FH V +++ S
Sbjct: 70 GEDSGVCNPNLVHVAITLDIDYLRGSIAAVNSILQHSMCPQSVFFHFLVSSEIQNLESL- 128
Query: 159 KLKVLFPH-SNLE--FHGQEEVKKVIRTASTGVKYSV-QNFEEIVPFVIASVHQSLSKFI 214
++ FP +NL+ + E V+ +I S+ V+ ++ Q ++ + + + I
Sbjct: 129 -IRSTFPKLTNLKIYYFAPETVQSLI---SSSVRQALEQPLNYARNYLADLLEPCVKRVI 184
Query: 215 YMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKP 274
Y+ ++V + +L L I A + C Y S +G
Sbjct: 185 YLDSDLVVVDDIVKLWKTGLGQRTIGAPEYCHANFTKYFTGGFW-------SDKRFNGTF 237
Query: 275 YAVNSCMPDLGMLLIDARKLE--KYILEAFLWWKKVINQRDRSIGRSPAIALALYDRYLK 332
N C + G+++ID +K +Y W + +R +G P L
Sbjct: 238 KGRNPCYFNTGVMVIDLKKWRRFRYTKRIEKWMEIQKMERIYELGSLPPFLLVFAGHVAP 297
Query: 333 LSSSWLVTDSTSSVVNKSLAIRYDGPMTACSEFGDG 368
+S W V S + GP++ G G
Sbjct: 298 ISHRWNQHGLGGDNVRGSCRDLHSGPVSLLHWSGSG 333
>gi|224124222|ref|XP_002330135.1| glycosyltransferase [Populus trichocarpa]
gi|222871269|gb|EEF08400.1| glycosyltransferase [Populus trichocarpa]
Length = 531
Score = 45.4 bits (106), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 57/272 (20%), Positives = 101/272 (37%), Gaps = 48/272 (17%)
Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQEE-------- 176
+V+VNST+ + +P L FH+ S + + L F + +E EE
Sbjct: 239 SVVVNSTVCNADHPKQLVFHIVTNGISYGSMQAWFLTNDFKGATVEVQNIEEFSWLNASY 298
Query: 177 ---VKKVIRTASTGVKYSVQNFEEIVP---------------FVIASVHQSLSKFIYMSP 218
+K++I S + ++ P F I ++ L K +++
Sbjct: 299 APVIKQIIHQDSRAYYFGADQDMKVEPKLRNPKYLSLLNHLRFYIPEIYPLLEKIVFLDD 358
Query: 219 SVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWVSGKPY 275
V+V+ + L +DL A + C E R Y+N S P +S K +
Sbjct: 359 DVVVQKDLTRLFSLDLHGNVNGAVETCLETFHRYYKYIN----------FSNPIISSK-F 407
Query: 276 AVNSCMPDLGMLLID--ARKLEKYILEAFLWWKKVINQRDRSIGRSPAIALALYDRYLKL 333
+C GM + D A + E + W ++ +Q +G P LA Y L
Sbjct: 408 DPQACGWAFGMNIFDLIAWRKENVTAQYHYWQEQNADQTLWKLGTLPPALLAFYGLTEPL 467
Query: 334 SSSWLV------TDSTSSVVNKSLAIRYDGPM 359
W V + +++ + I ++G M
Sbjct: 468 DRRWHVLGLGYDMNIDDRLIDSAAVIHFNGNM 499
>gi|240254105|ref|NP_564057.4| alpha-1,4-galacturonosyltransferase [Arabidopsis thaliana]
gi|75164846|sp|Q949Q1.1|GAUTB_ARATH RecName: Full=Probable galacturonosyltransferase 11
gi|15293067|gb|AAK93644.1| unknown protein [Arabidopsis thaliana]
gi|17064736|gb|AAL32522.1| Unknown protein [Arabidopsis thaliana]
gi|332191610|gb|AEE29731.1| alpha-1,4-galacturonosyltransferase [Arabidopsis thaliana]
Length = 537
Score = 45.4 bits (106), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 57/273 (20%), Positives = 105/273 (38%), Gaps = 48/273 (17%)
Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQEE-------- 176
+V+VNST+S + +P L FH+ + S + + L F S +E EE
Sbjct: 243 SVVVNSTVSNADHPKQLVFHIVTNRVSYKAMQAWFLSNDFKGSAIEIRSVEEFSWLNASY 302
Query: 177 ---VKKVIRTASTGVKYSVQNFEEIVP-----------------FVIASVHQSLSKFIYM 216
VK+++ T + + Q ++ + F I ++ L K +++
Sbjct: 303 SPVVKQLLDTDARAYYFGEQTSQDTISEPKVRNPKYLSLLNHLRFYIPEIYPQLEKIVFL 362
Query: 217 SPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYA 276
V+V+ + L +DL A + C E + Y + S P +S K +
Sbjct: 363 DDDVVVQKDLTPLFSLDLHGNVNGAVETCLEAFHRYYK-------YLNFSNPLISSK-FD 414
Query: 277 VNSCMPDLGMLLIDARKLEKYILEA-FLWWKKVINQRDRS---IGRSPAIALALYDRYLK 332
+C GM + D + A + +W+ R+R+ +G P L+ Y
Sbjct: 415 PQACGWAFGMNVFDLIAWRNANVTARYHYWQD--QNRERTLWKLGTLPPGLLSFYGLTEP 472
Query: 333 LSSSWLV------TDSTSSVVNKSLAIRYDGPM 359
L W V + + ++ + I Y+G M
Sbjct: 473 LDRRWHVLGLGYDVNIDNRLIETAAVIHYNGNM 505
>gi|218200953|gb|EEC83380.1| hypothetical protein OsI_28795 [Oryza sativa Indica Group]
Length = 650
Score = 45.1 bits (105), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 62/275 (22%), Positives = 111/275 (40%), Gaps = 53/275 (19%)
Query: 125 AVLVNSTLSGSRYPDLLHFHLFV---------------PKGS-----EDMVSFYKLKVLF 164
AV+VNST+ +++P FH+ P G ++ F L +
Sbjct: 349 AVVVNSTVLNAKHPSHHVFHIVTDRLNYAPMRMWFLSNPPGKATIEVRNIEEFTWLNASY 408
Query: 165 PHSNLEFHGQEEVKKVIRT----ASTGVKYSVQNFEEIVP---FVIASVHQSLSKFIYMS 217
+ Q + RT + + +KY + I+ F + ++ +L K +++
Sbjct: 409 SPVLKQLESQSMIDYYFRTHRANSDSNLKYRNPKYLSILNHLRFYLPEIYPNLHKIVFLD 468
Query: 218 PSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWVSGKP 274
V++K + L ID+ I + C E R + Y+N S P V K
Sbjct: 469 DDVVIKKDLTSLWSIDMKGKVIGVVETCGESFHRFDRYLN----------FSNP-VIVKN 517
Query: 275 YAVNSCMPDLGMLLID-ARKLEKYILEAFLWWKKVINQRDR---SIGRSPAIALALYDRY 330
+ ++C GM + D A + I E + W+K +NQ DR +G P + +++
Sbjct: 518 FDPHACGWAFGMNVFDLAEWRRQNITEIYHSWQK-LNQ-DRLLWKLGTLPPGLITFWNKT 575
Query: 331 LKLSSSWLVT------DSTSSVVNKSLAIRYDGPM 359
L L+ SW V +S + ++ I Y+G M
Sbjct: 576 LPLNRSWHVLGLGYNPHVSSRDIERAAVIHYNGNM 610
>gi|224101685|ref|XP_002312381.1| glycosyltransferase, CAZy family GT8 [Populus trichocarpa]
gi|222852201|gb|EEE89748.1| glycosyltransferase, CAZy family GT8 [Populus trichocarpa]
Length = 346
Score = 44.7 bits (104), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 48/229 (20%), Positives = 82/229 (35%), Gaps = 21/229 (9%)
Query: 105 QDDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDM-----VSFYK 159
D ++HI + LR V S L + P+ + FH D+ +F
Sbjct: 57 HDPSIIHIAMTLDATYLRGSVAGVLSVLQHAACPEHIVFHFIATHRRADLRRTITSTFPY 116
Query: 160 LKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVPFVIASVHQSLSKFIYMSPS 219
L H N + + + R + Y+ +++PF + + IY
Sbjct: 117 LTFHLYHFNTDLVRGKISSSIRRALDQPLNYARIYLADLLPFTV-------RRIIYFDSD 169
Query: 220 VIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNS 279
+IV V +L I+L + + A + C + Y N R S P + +
Sbjct: 170 LIVVDDVAKLWNINLGAHVLGAPEYCHVNFSYYFN-------SRFWSSPVYATSFTGRRA 222
Query: 280 CMPDLGMLLIDARKLE--KYILEAFLWWKKVINQRDRSIGRSPAIALAL 326
C + G+++ID RK KY + W + R +G P L
Sbjct: 223 CYFNTGVMVIDLRKWREGKYTEKLEYWMRVQKKNRIYELGSLPPFLLVF 271
>gi|356534929|ref|XP_003536003.1| PREDICTED: probable galacturonosyltransferase-like 7-like [Glycine
max]
Length = 359
Score = 44.7 bits (104), Expect = 0.089, Method: Compositional matrix adjust.
Identities = 71/335 (21%), Positives = 127/335 (37%), Gaps = 27/335 (8%)
Query: 47 FIALGLIVFLG--LLQFLPATHFR---HPSDPFRIWVPFNSNTSFSKTRDSGDRNSGSIS 101
F A L++ L L F PA R H R+ P S + R++ D N + S
Sbjct: 11 FSAAMLVILLSPSLQSFHPAEAIRSSHHLDGLLRLPPPRLSFRPAPRFRNAADANKCASS 70
Query: 102 SISQ---DDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFY 158
S+S D +VH+ + LR V+S L S+ P+ + FH V + + + +
Sbjct: 71 SVSTSVCDPSLVHVAITLDVEYLRGSIAAVHSILQHSQCPENIFFHFLVSETNLESL--- 127
Query: 159 KLKVLFPHSNLE--FHGQEEVKKVIRTASTGVKYSV-QNFEEIVPFVIASVHQSLSKFIY 215
+K FP N + + E V+ +I ST V+ ++ Q ++ + + + IY
Sbjct: 128 -VKSTFPQLNFKVYYFDPEIVRNLI---STSVRQALEQPLNYARNYLADLLEPCVERVIY 183
Query: 216 MSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPY 275
+ ++V + +L L + I A + C Y + ++ + +P
Sbjct: 184 LDSDLVVVDDIAKLWSTSLGSRTIGAPEYCHANFTKYFTAAFWSDTR--FARAFAGRRP- 240
Query: 276 AVNSCMPDLGMLLIDARKLEK--YILEAFLWWKKVINQRDRSIGRSPAIALALYDRYLKL 333
C + G+++ID + + Y W + N R +G P L +
Sbjct: 241 ----CYFNTGVMVIDLVRWRRIGYSKRIERWMEIQKNDRIYELGSLPPFLLVFAGHVAPI 296
Query: 334 SSSWLVTDSTSSVVNKSLAIRYDGPMTACSEFGDG 368
W V S + GP++ G G
Sbjct: 297 EHRWNQHGLGGDNVKGSCRDLHAGPVSLLHWSGSG 331
>gi|356502770|ref|XP_003520189.1| PREDICTED: probable galacturonosyltransferase 6-like [Glycine max]
Length = 585
Score = 44.3 bits (103), Expect = 0.092, Method: Compositional matrix adjust.
Identities = 54/252 (21%), Positives = 104/252 (41%), Gaps = 35/252 (13%)
Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKL------KVLFPHSNLEFHGQEEVK 178
A +VNST+S ++ + L FH+ + +S + L + S F +
Sbjct: 315 AAVVNSTISTAKEQEKLVFHVLTKSLNLPSISMWFLINPPGKATVHILSIDNFEWSSKYN 374
Query: 179 KVIRTASTGVKYSVQ-NFEEIVPFVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNY 237
S+ +Y+ + N+ + F + + +L+K + V+V+ + EL I++
Sbjct: 375 TYQENNSSDPRYTSELNY---LRFYLPDIFPALNKIVLFDHDVVVQRDLSELWNINMKGK 431
Query: 238 AIAAADDCSE------RLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDA 291
I A C E R++ ++N S P + GK + VN+C GM L D
Sbjct: 432 VIGAIGTCQEGKIPFHRIDMFIN----------LSDPLI-GKRFDVNACTWAFGMNLFDL 480
Query: 292 RKLEKYILEAFLWWKKVINQRDRSIGRSPAIALALYDRYLKLSSSWLV------TDSTSS 345
++ ++ L ++ + +IG P L Y++ L W V ++ +
Sbjct: 481 QQWRRHNLTVV--YQNYLQMGLWNIGSLPLGWLTFYNKTELLDRQWHVLGLGYSSNVDRN 538
Query: 346 VVNKSLAIRYDG 357
+ ++ I YDG
Sbjct: 539 EIEQAAVIHYDG 550
>gi|147771376|emb|CAN62993.1| hypothetical protein VITISV_021618 [Vitis vinifera]
Length = 367
Score = 44.3 bits (103), Expect = 0.092, Method: Compositional matrix adjust.
Identities = 60/296 (20%), Positives = 112/296 (37%), Gaps = 22/296 (7%)
Query: 80 FNSNTSFSKTRDSG--DRNSGSISSISQDDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRY 137
F ++F+ + G DRN SS+ D +VH+ + LR V+S L S+
Sbjct: 59 FRKASAFNNAEECGFRDRNFSGKSSVC-DPFLVHVAITLDVHYLRGSMAAVHSILQHSQC 117
Query: 138 PDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLE--FHGQEEVKKVIRTASTGVKYSVQNF 195
P+ + FH V + +++ ++ FP + + E V+ +I ST V+ ++++
Sbjct: 118 PEDIFFHFLVSETHLEIL----VRSTFPQLKFKVYYFNPEIVRNLI---STSVREALEHP 170
Query: 196 EEIVPFVIASVHQ-SLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVN 254
+A + + + + IY+ +IV + +L L I A + C Y
Sbjct: 171 LNYARNYLADLLEPCVRRVIYLDSDLIVVDDIYKLWSTSLGTRTIGAPEYCHANFTRYFT 230
Query: 255 PDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEKYILEAFL--WWKKVINQR 312
+ S+ G C + G+++ID K ++ + W + N R
Sbjct: 231 -------DKFWSEKRYYGTFDGRKPCYFNTGVIVIDLAKWRRFGFTKRIERWMEVQKNNR 283
Query: 313 DRSIGRSPAIALALYDRYLKLSSSWLVTDSTSSVVNKSLAIRYDGPMTACSEFGDG 368
+G P L + W V S + GP++ G G
Sbjct: 284 IYELGSLPPYLLVFAGHVAPIEHRWNQHGLGGDNVKGSCRELHPGPVSLLHWSGSG 339
>gi|449491241|ref|XP_004158838.1| PREDICTED: probable galacturonosyltransferase 4-like [Cucumis
sativus]
Length = 641
Score = 43.9 bits (102), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 43/194 (22%), Positives = 79/194 (40%), Gaps = 27/194 (13%)
Query: 201 FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDV 257
F + + L K +++ ++V+ + L IDL A + C E R + Y+N
Sbjct: 450 FYLPQLFPKLKKVLFLDDDIVVQKDLTGLWSIDLKGNVNGAVETCGESFHRFDRYLN--- 506
Query: 258 LDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEKY-ILEAFLWWKKVINQRDR-S 315
S P +S K + ++C GM + D + ++ I E + W+K+ R
Sbjct: 507 -------FSNPLIS-KSFDPHACGWAYGMNIFDLDEWKRQNITEVYHSWQKLNYDRQLWK 558
Query: 316 IGRSPAIALALYDRYLKLSSSWLV------TDSTSSVVNKSLAIRYDGPMTACSEFGDGA 369
+G P + + R +L SW V T+ ++++ I Y+G M E
Sbjct: 559 LGTLPPGLITFWKRTYQLDKSWHVLGLGYNTNVGQKEIDRAAVIHYNGNMKPWLEIA--- 615
Query: 370 NMEPARGDLWKQHL 383
P + W +H+
Sbjct: 616 --IPKYRNYWTKHV 627
>gi|449434420|ref|XP_004134994.1| PREDICTED: probable galacturonosyltransferase 4-like [Cucumis
sativus]
Length = 625
Score = 43.9 bits (102), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 43/194 (22%), Positives = 79/194 (40%), Gaps = 27/194 (13%)
Query: 201 FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDV 257
F + + L K +++ ++V+ + L IDL A + C E R + Y+N
Sbjct: 434 FYLPQLFPKLKKVLFLDDDIVVQKDLTGLWSIDLKGNVNGAVETCGESFHRFDRYLN--- 490
Query: 258 LDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEKY-ILEAFLWWKKVINQRDR-S 315
S P +S K + ++C GM + D + ++ I E + W+K+ R
Sbjct: 491 -------FSNPLIS-KSFDPHACGWAYGMNIFDLDEWKRQNITEVYHSWQKLNYDRQLWK 542
Query: 316 IGRSPAIALALYDRYLKLSSSWLV------TDSTSSVVNKSLAIRYDGPMTACSEFGDGA 369
+G P + + R +L SW V T+ ++++ I Y+G M E
Sbjct: 543 LGTLPPGLITFWKRTYQLDKSWHVLGLGYNTNVGQKEIDRAAVIHYNGNMKPWLEIA--- 599
Query: 370 NMEPARGDLWKQHL 383
P + W +H+
Sbjct: 600 --IPKYRNYWTKHV 611
>gi|297833366|ref|XP_002884565.1| hypothetical protein ARALYDRAFT_477926 [Arabidopsis lyrata subsp.
lyrata]
gi|297330405|gb|EFH60824.1| hypothetical protein ARALYDRAFT_477926 [Arabidopsis lyrata subsp.
lyrata]
Length = 351
Score = 43.9 bits (102), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 51/244 (20%), Positives = 94/244 (38%), Gaps = 26/244 (10%)
Query: 104 SQDDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFH-LFVPKGSEDMVSFYKLKV 162
+++ +HI + LR V S L S P+ L FH L +P D+ F +K
Sbjct: 60 TREADQIHIAMTLDTNYLRGTMAAVLSLLQHSTCPENLSFHFLSLPHFENDL--FTSIKS 117
Query: 163 LFPHSNLEFHG------QEEVKKVIRTA-STGVKYSVQNFEEIVPFVIASVHQSLSKFIY 215
FP+ N + + + ++ K IR A + Y+ +I+P S+ + IY
Sbjct: 118 TFPYLNFKIYQFDPNLVRSKISKSIRQALDQPLNYARIYLADIIP-------SSVDRIIY 170
Query: 216 MSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPY 275
+ ++V +E+L +++ +AA + C Y + S P +
Sbjct: 171 LDSDLVVVDDIEKLWHVEMEGKVVAAPEYCHANFTHYFT-------KTFWSDPVLVKVLE 223
Query: 276 AVNSCMPDLGMLLIDARKLEK--YILEAFLWWKKVINQRDRSIGRSPAIALALYDRYLKL 333
C + G++++D K K Y + W +R +G P L +
Sbjct: 224 GKRPCYFNTGVMVVDVNKWRKGMYTQKVEEWMTIQKQKRIYHLGSLPPFLLIFAGDIKAV 283
Query: 334 SSSW 337
+ W
Sbjct: 284 NHRW 287
>gi|224100589|ref|XP_002311936.1| glycosyltransferase, CAZy family GT8 [Populus trichocarpa]
gi|222851756|gb|EEE89303.1| glycosyltransferase, CAZy family GT8 [Populus trichocarpa]
Length = 347
Score = 43.5 bits (101), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 53/238 (22%), Positives = 94/238 (39%), Gaps = 27/238 (11%)
Query: 110 VHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNL 169
+HIV + LR V S L S P+ + FH + ++ F +K FP+ N
Sbjct: 63 IHIVMTLDANYLRGTMAAVLSILQHSTCPENMEFHFLWSRFEREV--FSSIKSTFPYLNF 120
Query: 170 EFHG------QEEVKKVIRTA-STGVKYSVQNFEEIVPFVIASVHQSLSKFIYMSPSVIV 222
+F+ + ++ K IR A + Y+ +I+P + + IY+ ++V
Sbjct: 121 KFYRFDSNRVRGKISKSIRQALDQPLNYARIYLADIIP-------SDVKRVIYLDSDLVV 173
Query: 223 KGRVEELIGIDLSNYAIAAADDCSERLNSYV-NPDVLDAIQRSASQPWVSGKPYAVNSCM 281
+ +L +DL +AA + C +Y N LD P ++ + C
Sbjct: 174 VDDIAKLWEVDLEEKVLAAPEYCHANFTNYFSNLFWLD--------PVLAKTFHGRRPCY 225
Query: 282 PDLGMLLIDARKLEK-YILEAFLWWKKVINQ-RDRSIGRSPAIALALYDRYLKLSSSW 337
+ G++++D K + I + W V Q R +G P L L + W
Sbjct: 226 FNTGVMVVDVEKWRQGGITQKVEEWMTVQKQKRIYHLGSLPPFLLVLAGNIKGVDHRW 283
>gi|225437483|ref|XP_002274175.1| PREDICTED: probable galacturonosyltransferase-like 7-like [Vitis
vinifera]
Length = 450
Score = 43.5 bits (101), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 60/296 (20%), Positives = 112/296 (37%), Gaps = 22/296 (7%)
Query: 80 FNSNTSFSKTRDSG--DRNSGSISSISQDDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRY 137
F ++F+ + G DRN SS+ D +VH+ + LR V+S L S+
Sbjct: 142 FRKASAFNNAEECGFRDRNFSGKSSVC-DPFLVHVAITLDVHYLRGSMAAVHSILQHSQC 200
Query: 138 PDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLE--FHGQEEVKKVIRTASTGVKYSVQNF 195
P+ + FH V + +++ ++ FP + + E V+ +I ST V+ ++++
Sbjct: 201 PEDIFFHFLVSETHLEIL----VRSTFPQLKFKVYYFNPEIVRNLI---STSVREALEHP 253
Query: 196 EEIVPFVIASVHQ-SLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVN 254
+A + + + + IY+ +IV + +L L I A + C Y
Sbjct: 254 LNYARNYLADLLEPCVRRVIYLDSDLIVVDDIYKLWSTSLGTRTIGAPEYCHANFTRYFT 313
Query: 255 PDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEKYILEAFL--WWKKVINQR 312
+ S+ G C + G+++ID K ++ + W + N R
Sbjct: 314 -------DKFWSEKRYYGTFDGRKPCYFNTGVIVIDLAKWRRFGFTKRIERWMEVQKNNR 366
Query: 313 DRSIGRSPAIALALYDRYLKLSSSWLVTDSTSSVVNKSLAIRYDGPMTACSEFGDG 368
+G P L + W V S + GP++ G G
Sbjct: 367 IYELGSLPPYLLVFAGHVAPIEHRWNQHGLGGDNVKGSCRELHPGPVSLLHWSGSG 422
>gi|15230679|ref|NP_187277.1| putative galacturonosyltransferase-like 4 [Arabidopsis thaliana]
gi|75186391|sp|Q9M8J2.1|GATL4_ARATH RecName: Full=Probable galacturonosyltransferase-like 4; AltName:
Full=Galactinol synthase 9; Short=AtGolS9; Short=GolS-9
gi|6862930|gb|AAF30319.1|AC018907_19 putative glycosyl transferase [Arabidopsis thaliana]
gi|332640845|gb|AEE74366.1| putative galacturonosyltransferase-like 4 [Arabidopsis thaliana]
Length = 351
Score = 43.5 bits (101), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 50/231 (21%), Positives = 91/231 (39%), Gaps = 26/231 (11%)
Query: 104 SQDDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFH-LFVPKGSEDMVSFYKLKV 162
+++ +HI + LR V S L S P+ L FH L +P D+ F +K
Sbjct: 60 TREADQIHIAMTLDTNYLRGTMAAVLSLLQHSTCPENLSFHFLSLPHFENDL--FTSIKS 117
Query: 163 LFPHSNLEFHG------QEEVKKVIRTA-STGVKYSVQNFEEIVPFVIASVHQSLSKFIY 215
FP+ N + + + ++ K IR A + Y+ +I+P S+ + IY
Sbjct: 118 TFPYLNFKIYQFDPNLVRSKISKSIRQALDQPLNYARIYLADIIP-------SSVDRIIY 170
Query: 216 MSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPY 275
+ ++V +E+L +++ +AA + C Y + S P +
Sbjct: 171 LDSDLVVVDDIEKLWHVEMEGKVVAAPEYCHANFTHYFT-------RTFWSDPVLVKVLE 223
Query: 276 AVNSCMPDLGMLLIDARKLEK--YILEAFLWWKKVINQRDRSIGRSPAIAL 324
C + G++++D K K Y + W +R +G P L
Sbjct: 224 GKRPCYFNTGVMVVDVNKWRKGMYTQKVEEWMTIQKQKRIYHLGSLPPFLL 274
>gi|297843472|ref|XP_002889617.1| hypothetical protein ARALYDRAFT_470701 [Arabidopsis lyrata subsp.
lyrata]
gi|297335459|gb|EFH65876.1| hypothetical protein ARALYDRAFT_470701 [Arabidopsis lyrata subsp.
lyrata]
Length = 595
Score = 43.5 bits (101), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 61/291 (20%), Positives = 112/291 (38%), Gaps = 58/291 (19%)
Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQEEV------- 177
+V+VNST+S S+ P+ + FH+ + +S + L + + ++ +++
Sbjct: 319 SVVVNSTISSSKEPERIVFHVVTDSLNYPAISMWFLLNIQSRATIQILNIDDMDVLPPDY 378
Query: 178 -KKVIRTASTGVKYSVQNFEEIVP-----FVIASVHQSLSKFIYMSPSVIVKGRVEELIG 231
+ +++ S ++ I P F + + L+K + V+V+ + L
Sbjct: 379 DQLLMKQNSNDPRF-------ISPLNHARFYLPDIFPGLNKIVLFDHDVVVQRDLSRLWS 431
Query: 232 IDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLLID- 290
ID+ + A + C E +S+ + S WV+GK ++ +C GM LID
Sbjct: 432 IDMKGKVVGAVETCLEGESSFRSMSTFINF----SDTWVAGK-FSPRACTWAFGMNLIDL 486
Query: 291 ----ARKLEKYILEAF-------LWWKKVINQRDRSIGRSPAIALALYDRYLKLSSSWLV 339
RKL ++ F LW G P L Y + L L W V
Sbjct: 487 EEWRIRKLTSTYIKYFNLGTKRPLW----------KAGSLPIGWLTFYRQTLALDKRWHV 536
Query: 340 ------TDSTSSVVNKSLAIRYDGPMTACSEFGDGANMEPARGDLWKQHLP 384
+ + + ++ I YDG M + G + W H+P
Sbjct: 537 MGLGRESGVKAVDIEQAAVIHYDGVMKPWLDIG-----KENYKRYWNIHVP 582
>gi|357464109|ref|XP_003602336.1| Glycosyltransferase CAZy family GT8 [Medicago truncatula]
gi|355491384|gb|AES72587.1| Glycosyltransferase CAZy family GT8 [Medicago truncatula]
Length = 343
Score = 43.5 bits (101), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 56/268 (20%), Positives = 94/268 (35%), Gaps = 34/268 (12%)
Query: 80 FNSNTSFSKTRDSGDRNSGSISSISQDDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPD 139
F +F R+ R + SI +HI + LR V S L + P+
Sbjct: 34 FREAPAFRNGRECPPRETSSI---------IHIAMTLDATYLRGSTAGVFSVLQHASCPE 84
Query: 140 LLHFHLFVPKGSEDMVSFYKLKV-LFPHSNLE-FHGQEEVKK------VIRTASTGVKYS 191
+ FH FV ++ + FP+ N +H + + + R + Y+
Sbjct: 85 NIAFH-FVTTTHRRRQELRRIIISTFPYLNFHIYHFDSNLVRGKISYSIRRALDQPLNYA 143
Query: 192 VQNFEEIVPFVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNS 251
++VP + + IY +IV V +L IDL N+ + A + C +
Sbjct: 144 RMYLADLVP-------ATAQRIIYFDSDLIVVDDVAKLWSIDLGNHVLGAPEYCHANFTT 196
Query: 252 YVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLE--KYILEAFLWWKKVI 309
Y R S P S +C + G+++ID K KY + W +
Sbjct: 197 YFT-------HRFWSNPSYSASFKGREACYFNTGVMVIDLWKWREGKYTEKLENWMRIQK 249
Query: 310 NQRDRSIGRSPAIALALYDRYLKLSSSW 337
R +G P L ++ W
Sbjct: 250 RSRIYELGSLPPFLLVFAGDVERVEHRW 277
>gi|356500226|ref|XP_003518934.1| PREDICTED: probable galacturonosyltransferase-like 7-like [Glycine
max]
Length = 357
Score = 43.5 bits (101), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 69/333 (20%), Positives = 129/333 (38%), Gaps = 25/333 (7%)
Query: 47 FIALGLIVFLG--LLQFLPATHFR---HPSDPFRIWVPFNSNTSFSKTRDSGD-RNSGSI 100
F A L++ L L F PA R H R+ P S + R++ D + + S+
Sbjct: 11 FSAAMLVILLSPSLQSFHPAEAIRSSHHLDGLLRLPPPRLSFRPAAPFRNAADGKCASSV 70
Query: 101 SSISQDDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKL 160
+ D +VH+ + LR V+S L S+ P+ + FH V + + + + +
Sbjct: 71 PTSVCDPSLVHVAITLDVEYLRGSIAAVHSILQHSQCPENIFFHFLVSETNLESL----V 126
Query: 161 KVLFPHSNLE--FHGQEEVKKVIRTASTGVKYSV-QNFEEIVPFVIASVHQSLSKFIYMS 217
K FP N + + E V+ +I ST V+ ++ Q ++ + + + IY+
Sbjct: 127 KSTFPQLNFKVYYFDPEIVRNLI---STSVRQALEQPLNYARNYLADLLEPCVERVIYLD 183
Query: 218 PSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAV 277
+++ + +L L + I A + C Y ++ +++ +G+
Sbjct: 184 SDLVLVDDIAKLWSTSLGSRTIGAPEYCHANFTKYFTAGFWSDMRFASA---FAGR---- 236
Query: 278 NSCMPDLGMLLIDARKLEK--YILEAFLWWKKVINQRDRSIGRSPAIALALYDRYLKLSS 335
C + G+++ID + K Y W + N R +G P L R +
Sbjct: 237 RPCYFNTGVMVIDLVRWRKIGYSKRIERWMEIQKNDRIYELGSLPPFLLVFAGRVAPIEH 296
Query: 336 SWLVTDSTSSVVNKSLAIRYDGPMTACSEFGDG 368
W V S + GP++ G G
Sbjct: 297 RWNQHGLGGDNVKGSCRDLHAGPVSLLHWSGSG 329
>gi|225432474|ref|XP_002277334.1| PREDICTED: probable galacturonosyltransferase-like 4 [Vitis
vinifera]
Length = 352
Score = 43.5 bits (101), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 48/243 (19%), Positives = 93/243 (38%), Gaps = 25/243 (10%)
Query: 104 SQDDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVL 163
++D +H+ + LR V S L S P+ L FH + ++VS ++
Sbjct: 62 TRDVDGIHVSMTLDANYLRGTMAAVLSILQHSTCPENLSFHFLSAQHVPEIVS--TIQAT 119
Query: 164 FPHSNLEFHG------QEEVKKVIRTA-STGVKYSVQNFEEIVPFVIASVHQSLSKFIYM 216
FP+ N + + ++ K IR A + Y+ +I+P ++ + IY+
Sbjct: 120 FPYLNFRVYRFDSNRVRGKISKSIRRALDQPLNYARIYLADILP-------ANVRRVIYL 172
Query: 217 SPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYA 276
+++ + L G+DL + +AA + C Y + S P ++
Sbjct: 173 DSDLVMVDDISNLWGVDLGDKVVAAPEYCHANFTKYFTDEFW-------SSPEMAKTFKG 225
Query: 277 VNSCMPDLGMLLIDARKLEK--YILEAFLWWKKVINQRDRSIGRSPAIALALYDRYLKLS 334
+ C + G++++D + K Y + W R +G P L L +
Sbjct: 226 RSPCYFNTGVMVVDVDRWRKGGYTQKVEEWMAVQKQNRIYDLGSLPPFLLVLAGNIKAVD 285
Query: 335 SSW 337
W
Sbjct: 286 HRW 288
>gi|296080851|emb|CBI18781.3| unnamed protein product [Vitis vinifera]
Length = 638
Score = 43.1 bits (100), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 39/177 (22%), Positives = 73/177 (41%), Gaps = 22/177 (12%)
Query: 201 FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDV 257
F + + L+K +++ ++V+ + L IDL A + C E R + Y+N
Sbjct: 447 FYLPEIFPKLNKVLFLDDDIVVQKDLTGLWSIDLKGNVNGAVETCGESFHRFDRYLN--- 503
Query: 258 LDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEK-YILEAFLWWKKVINQRDR-S 315
S P +S K + ++C GM + D + +K +I E + W+K+ + R
Sbjct: 504 -------FSNPLIS-KNFDSHACGWAYGMNIFDLDQWKKQHITEVYHTWQKLNHDRQLWK 555
Query: 316 IGRSPAIALALYDRYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPMTACSEFG 366
+G P + + R + SW V + ++ I Y+G + E G
Sbjct: 556 LGTLPPGLITFWKRTFPIDRSWHVLGLGYNPSVNRREIERAAVIHYNGNLKPWLEIG 612
>gi|413920988|gb|AFW60920.1| transferase [Zea mays]
Length = 648
Score = 43.1 bits (100), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 42/179 (23%), Positives = 75/179 (41%), Gaps = 26/179 (14%)
Query: 201 FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDV 257
F + ++ L K +++ ++VK + L I++ A + C E R + Y+N
Sbjct: 457 FYLPEIYPKLDKMVFLDDDIVVKKDLAGLWSINMKGKVNGAVETCGESFHRYDRYLN--- 513
Query: 258 LDAIQRSASQPWVSGKPYAVNSCMPDLGMLLID-ARKLEKYILEAFLWWKKVINQRDRS- 315
S P ++ K + ++C+ GM + D A + I E + W+K+ DRS
Sbjct: 514 -------FSNPIIT-KSFDPHACVWAFGMNVFDLAEWRRQNITEIYHSWQKL--NEDRSL 563
Query: 316 --IGRSPAIALALYDRYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPMTACSEFG 366
+G P + +++ LS SW V S + + I Y+G M E G
Sbjct: 564 WKLGTLPPGLVTFWNKTFPLSRSWHVLGLGYNPHVNSRDIEHAAVIHYNGNMKPWLEIG 622
>gi|168016061|ref|XP_001760568.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162688265|gb|EDQ74643.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 275
Score = 43.1 bits (100), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 46/234 (19%), Positives = 93/234 (39%), Gaps = 18/234 (7%)
Query: 106 DDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFP 165
++ +VHI + LR + S L + + FH K ++ + ++
Sbjct: 7 NESLVHIAMTLDANYLRGSMAAIYSILLHAECASNVRFHFVATKEKKNKCKSFCRSAMYF 66
Query: 166 HSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVPFVIASVHQSLSKFIYMSPSVIVKGR 225
+S E+ K+I ++ + N+ ++ + + + IY+ V+V GR
Sbjct: 67 YSC-------ELLKLIYSSDFVITQEPLNYARF--YLAHMIDSCVKRIIYLDLDVLVLGR 117
Query: 226 VEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLG 285
+EEL ++ N + + C SY + I S + + + +P NS G
Sbjct: 118 IEELWMTNMGNSTVGTPEYCHANFPSYFTENFW--INSSLASTFANKQPCYFNS-----G 170
Query: 286 MLLIDARKLEKYILEAFL-WWKKVINQRD-RSIGRSPAIALALYDRYLKLSSSW 337
M+LI+ + K + L +W +V Q+ +G P + L + + W
Sbjct: 171 MMLINLERWRKTRCTSTLEYWMEVQKQQHIYELGSLPPLLLTFAGSIQAIDNRW 224
>gi|223974051|gb|ACN31213.1| unknown [Zea mays]
gi|413920987|gb|AFW60919.1| hypothetical protein ZEAMMB73_694281 [Zea mays]
Length = 649
Score = 43.1 bits (100), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 42/179 (23%), Positives = 75/179 (41%), Gaps = 26/179 (14%)
Query: 201 FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDV 257
F + ++ L K +++ ++VK + L I++ A + C E R + Y+N
Sbjct: 458 FYLPEIYPKLDKMVFLDDDIVVKKDLAGLWSINMKGKVNGAVETCGESFHRYDRYLN--- 514
Query: 258 LDAIQRSASQPWVSGKPYAVNSCMPDLGMLLID-ARKLEKYILEAFLWWKKVINQRDRS- 315
S P ++ K + ++C+ GM + D A + I E + W+K+ DRS
Sbjct: 515 -------FSNPIIT-KSFDPHACVWAFGMNVFDLAEWRRQNITEIYHSWQKL--NEDRSL 564
Query: 316 --IGRSPAIALALYDRYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPMTACSEFG 366
+G P + +++ LS SW V S + + I Y+G M E G
Sbjct: 565 WKLGTLPPGLVTFWNKTFPLSRSWHVLGLGYNPHVNSRDIEHAAVIHYNGNMKPWLEIG 623
>gi|226529842|ref|NP_001148932.1| transferase, transferring glycosyl groups [Zea mays]
gi|195623384|gb|ACG33522.1| transferase, transferring glycosyl groups [Zea mays]
Length = 648
Score = 43.1 bits (100), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 42/179 (23%), Positives = 75/179 (41%), Gaps = 26/179 (14%)
Query: 201 FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDV 257
F + ++ L K +++ ++VK + L I++ A + C E R + Y+N
Sbjct: 457 FYLPEIYPKLDKMVFLDDDIVVKKDLAGLWSINMKGKVNGAVETCGESFHRYDRYLN--- 513
Query: 258 LDAIQRSASQPWVSGKPYAVNSCMPDLGMLLID-ARKLEKYILEAFLWWKKVINQRDRS- 315
S P ++ K + ++C+ GM + D A + I E + W+K+ DRS
Sbjct: 514 -------FSNPVIT-KSFDPHACVWAFGMNVFDLAEWRRQNITEIYHSWQKL--NEDRSL 563
Query: 316 --IGRSPAIALALYDRYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPMTACSEFG 366
+G P + +++ LS SW V S + + I Y+G M E G
Sbjct: 564 WKLGTLPPGLVTFWNKTFPLSRSWHVLGLGYNPHVNSRDIEHAAVIHYNGNMKPWLEIG 622
>gi|357145596|ref|XP_003573698.1| PREDICTED: probable galacturonosyltransferase 4-like isoform 1
[Brachypodium distachyon]
Length = 660
Score = 43.1 bits (100), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 43/179 (24%), Positives = 73/179 (40%), Gaps = 26/179 (14%)
Query: 201 FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDV 257
F + ++ L K +++ V+VK + L ID+ A + C E R + Y+N
Sbjct: 469 FYLPEIYPKLDKMVFLDDDVVVKKDLTGLWSIDMKGKVNGAVETCGESFHRFDRYLN--- 525
Query: 258 LDAIQRSASQPWVSGKPYAVNSCMPDLGMLLID-ARKLEKYILEAFLWWKKVINQRDR-- 314
S P V K + ++C GM + D A + I E + W+K+ DR
Sbjct: 526 -------FSNP-VIAKNFDPHACGWAFGMNVFDLAEWRRQDITEIYHSWQKL--NEDRLL 575
Query: 315 -SIGRSPAIALALYDRYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPMTACSEFG 366
+G P + +++ L+ SW V S + ++ I Y+G M E G
Sbjct: 576 WKLGTLPPGLITFWNKTFPLNRSWHVLGLGYNPHVNSRDIERAAVIHYNGNMKPWLEIG 634
>gi|359496846|ref|XP_002272372.2| PREDICTED: uncharacterized protein LOC100258406 [Vitis vinifera]
Length = 1286
Score = 43.1 bits (100), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 39/177 (22%), Positives = 73/177 (41%), Gaps = 22/177 (12%)
Query: 201 FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDV 257
F + + L+K +++ ++V+ + L IDL A + C E R + Y+N
Sbjct: 1095 FYLPEIFPKLNKVLFLDDDIVVQKDLTGLWSIDLKGNVNGAVETCGESFHRFDRYLN--- 1151
Query: 258 LDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEK-YILEAFLWWKKVINQRDR-S 315
S P +S K + ++C GM + D + +K +I E + W+K+ + R
Sbjct: 1152 -------FSNPLIS-KNFDSHACGWAYGMNIFDLDQWKKQHITEVYHTWQKLNHDRQLWK 1203
Query: 316 IGRSPAIALALYDRYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPMTACSEFG 366
+G P + + R + SW V + ++ I Y+G + E G
Sbjct: 1204 LGTLPPGLITFWKRTFPIDRSWHVLGLGYNPSVNRREIERAAVIHYNGNLKPWLEIG 1260
>gi|168057091|ref|XP_001780550.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162668028|gb|EDQ54644.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 280
Score = 42.7 bits (99), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 46/219 (21%), Positives = 90/219 (41%), Gaps = 16/219 (7%)
Query: 123 LLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQEEVKKVIR 182
+ ++L ++ SG+ + FH G + F V L+F + +++
Sbjct: 15 IYSILRHAECSGN-----IRFHFVATNGKVKYIRFPAFVVAETLPFLQFQTYPFDESLVK 69
Query: 183 T-ASTGVKYSVQNFEEIVPFVIAS-VHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIA 240
+ S V+++++ F +A + + + IY+ V+V R+EEL I++ N +
Sbjct: 70 SRISYAVRHALEEPLNYARFYLAHMIDPCVKRIIYLDSDVLVIDRIEELWMINMGNSTVG 129
Query: 241 AADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEKYILE 300
+ C +SY S + + + KP NS G++LI+ + K
Sbjct: 130 TPEYCHANFHSYFTERFWR--NSSLASIFANKKPCYFNS-----GVMLINLDRWRKEACT 182
Query: 301 AFL-WWKKVINQRD-RSIGRSPAIALALYDRYLKLSSSW 337
A L +W +V +R +G P + L + S W
Sbjct: 183 ATLEYWMEVQKERHIYELGSLPPLLLTFAGSIQAIDSRW 221
>gi|255559941|ref|XP_002520989.1| Glycosyltransferase QUASIMODO1, putative [Ricinus communis]
gi|223539826|gb|EEF41406.1| Glycosyltransferase QUASIMODO1, putative [Ricinus communis]
Length = 633
Score = 42.7 bits (99), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 60/265 (22%), Positives = 107/265 (40%), Gaps = 33/265 (12%)
Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQEEVKKVIRTA 184
AV+VNST+S ++ + + FH+ + +S + L L P S +++ +
Sbjct: 357 AVVVNSTVSSAKDAESIIFHVVTDSLNLPAISMWFL--LNPPSKATI----QIQSIDNFG 410
Query: 185 STGVKYSV----QNFEE--------IVPFVIASVHQSLSKFIYMSPSVIVKGRVEELIGI 232
KYS QN + + F + + L+K + V+V+ + L +
Sbjct: 411 WLSTKYSSTVKQQNSHDPSYVSALNHLRFYLPDLFPLLNKIVLFDHDVVVQKDLTGLWSL 470
Query: 233 DLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDAR 292
D++ A + C E SY D+ S P+V+ K + N+C GM L D +
Sbjct: 471 DMNGKVNGAVETCQESDTSYRQMDMFINF----SDPFVT-KRFDANACTWAFGMNLFDLK 525
Query: 293 KLEKYILEAFLWWKKVINQRDRSI---GRSPAIALALYDRYLKLSSSW------LVTDST 343
+ + L A L+ K + R + G P Y++ + L W +D
Sbjct: 526 EWRRQNLTA-LYHKYLQEGYKRPLWKAGSLPVGWATFYNQTVALDKRWHRLGLGYESDVG 584
Query: 344 SSVVNKSLAIRYDGPMTACSEFGDG 368
+N++ + YDG M + G G
Sbjct: 585 QDDINQAAVLHYDGVMKPWMDIGVG 609
>gi|147766034|emb|CAN70213.1| hypothetical protein VITISV_038741 [Vitis vinifera]
Length = 759
Score = 42.4 bits (98), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 39/177 (22%), Positives = 73/177 (41%), Gaps = 22/177 (12%)
Query: 201 FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDV 257
F + + L+K +++ ++V+ + L IDL A + C E R + Y+N
Sbjct: 568 FYLPEIFPKLNKVLFLDDDIVVQKDLTGLWSIDLKGNVNGAVETCGESFHRFDRYLN--- 624
Query: 258 LDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEK-YILEAFLWWKKVINQRDR-S 315
S P +S K + ++C GM + D + +K +I E + W+K+ + R
Sbjct: 625 -------FSNPLIS-KNFDSHACGWAYGMNIFDLDQWKKQHITEVYHTWQKLNHDRQLWK 676
Query: 316 IGRSPAIALALYDRYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPMTACSEFG 366
+G P + + R + SW V + ++ I Y+G + E G
Sbjct: 677 LGTLPPGLITFWKRTXPIDRSWHVLGLGYNPSVNRREIERAAVIHYNGNLKPWLEIG 733
>gi|255547123|ref|XP_002514619.1| Glycosyltransferase QUASIMODO1, putative [Ricinus communis]
gi|223546223|gb|EEF47725.1| Glycosyltransferase QUASIMODO1, putative [Ricinus communis]
Length = 535
Score = 42.4 bits (98), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 53/269 (19%), Positives = 103/269 (38%), Gaps = 42/269 (15%)
Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQEE-------- 176
+V+VNS +S + +P L FH+ S + + L F + +E +E
Sbjct: 243 SVVVNSAISNADHPTQLVFHIVTNGVSYGAMQAWFLSDDFKGATIEVQNVKEFSWLNASY 302
Query: 177 ---VKKVIRTASTGVKYSVQNFEEIVP---------------FVIASVHQSLSKFIYMSP 218
VK+++ S +S ++ P F I ++ L K +++
Sbjct: 303 APVVKQLLAEDSRSYYFSGYQDMKVEPKLRNPKYLSLLNHLRFYIPEIYPQLEKIVFLDD 362
Query: 219 SVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVN 278
V+V+ + +L +DL A + C E + Y + S P +S K +
Sbjct: 363 DVVVQKDLTQLFSLDLHGNVNGAVETCLEAFHRYYK-------YLNFSNPIISSK-FDPQ 414
Query: 279 SCMPDLGMLLID--ARKLEKYILEAFLWWKKVINQRDRSIGRSPAIALALYDRYLKLSSS 336
+C GM + D A + + W ++ +++ +G P LA Y L
Sbjct: 415 ACGWAFGMNVFDLIAWRKANVTAQYHYWQEQNVDRTLWKLGTLPPALLAFYGLTEPLDRR 474
Query: 337 WLV------TDSTSSVVNKSLAIRYDGPM 359
W V T+ + ++ + + ++G M
Sbjct: 475 WHVLGLGYDTNIDNRLIESAAVVHFNGNM 503
>gi|357112099|ref|XP_003557847.1| PREDICTED: probable galacturonosyltransferase-like 4-like
[Brachypodium distachyon]
Length = 351
Score = 42.4 bits (98), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 44/222 (19%), Positives = 87/222 (39%), Gaps = 17/222 (7%)
Query: 121 LRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQEEVKKV 180
LR V S L + P+ + FH + D+V ++ FP+ +L + + + +V
Sbjct: 78 LRGTMAGVLSILQHTACPESVAFHFLATRMDGDLVGM--VRATFPYLDLRVY-RFDPSRV 134
Query: 181 IRTASTGVKYSVQ---NFEEIVPFVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNY 237
S +++++ N+ I ++ ++ + + IY+ VIV + L +DL +
Sbjct: 135 RGRISRSIRHALDQPLNYARI--YLADTLPPDVRRVIYLDSDVIVVDDIRTLFSVDLGGH 192
Query: 238 AIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEK- 296
+ A + C +Y DA + P +SG C + G++++D +
Sbjct: 193 VVGAPEYCHANFTNYFT----DAFW---TDPALSGTFRGRRPCYFNTGVMVMDVDRWRNG 245
Query: 297 -YILEAFLWWKKVINQRDRSIGRSPAIALALYDRYLKLSSSW 337
Y W +R +G P L L + W
Sbjct: 246 GYTRRVEGWMAVQKQKRIYHLGSLPPFLLVLAGDIQAVDHRW 287
>gi|242078863|ref|XP_002444200.1| hypothetical protein SORBIDRAFT_07g014890 [Sorghum bicolor]
gi|241940550|gb|EES13695.1| hypothetical protein SORBIDRAFT_07g014890 [Sorghum bicolor]
Length = 648
Score = 42.0 bits (97), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 41/179 (22%), Positives = 75/179 (41%), Gaps = 26/179 (14%)
Query: 201 FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDV 257
F + ++ L K +++ ++VK + L I++ A + C E R + Y+N
Sbjct: 457 FYLPEIYPKLDKMVFLDDDIVVKKDLTGLWSINMKGKVNGAVETCGESFHRYDRYLN--- 513
Query: 258 LDAIQRSASQPWVSGKPYAVNSCMPDLGMLLID-ARKLEKYILEAFLWWKKVINQRDRS- 315
S P ++ K + ++C GM + D A + I + + W+K+ DRS
Sbjct: 514 -------FSNPIIA-KSFDPHACGWAFGMNVFDLAEWRRQNITQIYHSWQKL--NEDRSL 563
Query: 316 --IGRSPAIALALYDRYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPMTACSEFG 366
+G P + +++ LS SW V S + ++ I Y+G M E G
Sbjct: 564 WKLGTLPPGLITFWNKTFPLSRSWHVLGLGYNPHVNSRDIERAAVIHYNGNMKPWLEIG 622
>gi|357158970|ref|XP_003578298.1| PREDICTED: probable galacturonosyltransferase 4-like isoform 1
[Brachypodium distachyon]
Length = 696
Score = 41.2 bits (95), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 40/179 (22%), Positives = 73/179 (40%), Gaps = 26/179 (14%)
Query: 201 FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDV 257
F + + L+K +++ +V+ + L IDL A + C E R + Y+N
Sbjct: 505 FYLPEIFPKLNKVLFLDDDTVVQQDLSALWSIDLKGKVNGAVETCGETFHRFDKYLN--- 561
Query: 258 LDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEKY-ILEAFLWWKKVINQRDR-- 314
S P ++ + +C GM + D + K I + + W+K+ DR
Sbjct: 562 -------FSNPIIANN-FHPRACGWAYGMNMFDLSEWRKQNITDVYHTWQKL--NEDRLL 611
Query: 315 -SIGRSPAIALALYDRYLKLSSSWLV------TDSTSSVVNKSLAIRYDGPMTACSEFG 366
+G PA + ++R L SSW + T+ + ++ I Y+G + E G
Sbjct: 612 WKLGTLPAGLVTFWNRTFPLDSSWHLLGLGYNTNVNERDIRRASVIHYNGNLKPWLEIG 670
>gi|357158973|ref|XP_003578299.1| PREDICTED: probable galacturonosyltransferase 4-like isoform 2
[Brachypodium distachyon]
Length = 690
Score = 41.2 bits (95), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 40/179 (22%), Positives = 73/179 (40%), Gaps = 26/179 (14%)
Query: 201 FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDV 257
F + + L+K +++ +V+ + L IDL A + C E R + Y+N
Sbjct: 499 FYLPEIFPKLNKVLFLDDDTVVQQDLSALWSIDLKGKVNGAVETCGETFHRFDKYLN--- 555
Query: 258 LDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEKY-ILEAFLWWKKVINQRDR-- 314
S P ++ + +C GM + D + K I + + W+K+ DR
Sbjct: 556 -------FSNPIIANN-FHPRACGWAYGMNMFDLSEWRKQNITDVYHTWQKL--NEDRLL 605
Query: 315 -SIGRSPAIALALYDRYLKLSSSWLV------TDSTSSVVNKSLAIRYDGPMTACSEFG 366
+G PA + ++R L SSW + T+ + ++ I Y+G + E G
Sbjct: 606 WKLGTLPAGLVTFWNRTFPLDSSWHLLGLGYNTNVNERDIRRASVIHYNGNLKPWLEIG 664
>gi|413933399|gb|AFW67950.1| hypothetical protein ZEAMMB73_846223 [Zea mays]
Length = 399
Score = 41.2 bits (95), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 59/277 (21%), Positives = 109/277 (39%), Gaps = 53/277 (19%)
Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPK---GSEDM----------------VSFYK------ 159
+V+VNST+ ++ P+ FHL K G+ +M V +K
Sbjct: 101 SVVVNSTIMNAKEPEKHVFHLVTDKLNFGAMNMWFLLNPPGDATMHVENVDDFKWLNSSY 160
Query: 160 ---LKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVP---FVIASVHQSLSKF 213
LK L + E++ + + K + S+ +KY + ++ F + V+ L+K
Sbjct: 161 CPVLKQLESAAMKEYYFKADRPKTLSAGSSNLKYRNPKYLSMLNHLRFYLPQVYPKLNKI 220
Query: 214 IYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWV 270
+++ ++V+ + L +DL+ A + C E R + Y+N S P +
Sbjct: 221 LFLDDDIVVQRDLTGLWEVDLNGNVNGAVETCGESFHRFDKYLN----------FSNPNI 270
Query: 271 SGKPYAVNSCMPDLGMLLIDARKLEKYILEAFL--WWKKVINQRDRSIGRSPAIALALYD 328
+ + + N+C GM + D + +K + W N+ +G P L Y
Sbjct: 271 A-QNFDPNACGWAYGMNMFDLEEWKKKDITGIYHKWQNMNENRLLWKLGTLPPGLLTFYK 329
Query: 329 RYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPM 359
L SW V S ++ + I Y+G M
Sbjct: 330 LTHPLDKSWHVLGLGYNPTVERSEIDNAAVIHYNGNM 366
>gi|326527369|dbj|BAK04626.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 343
Score = 41.2 bits (95), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 43/222 (19%), Positives = 87/222 (39%), Gaps = 17/222 (7%)
Query: 121 LRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQEEVKKV 180
LR V S L + P+ + FH + D+V+ L+V FP+ +L + + + +V
Sbjct: 70 LRGTMAGVLSILQHTACPENVVFHFLAARMDGDLVAM--LRVTFPYLDLRVY-RFDPSRV 126
Query: 181 IRTASTGVKYSVQ---NFEEIVPFVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNY 237
S +++++ N+ I ++ ++ + + IY+ VIV + L + L+ +
Sbjct: 127 RGRISRSIRHALDQPLNYARI--YLADTLPPDVRRVIYLDSDVIVVDDIRTLFSVHLAGH 184
Query: 238 AIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEK- 296
+ A + C +Y P +SG + C + G++++D +
Sbjct: 185 VVGAPEYCHTNFTNYFTDTFW-------MDPALSGTFHGRRPCYFNTGVMVMDVDQWRTG 237
Query: 297 -YILEAFLWWKKVINQRDRSIGRSPAIALALYDRYLKLSSSW 337
Y W +R +G P L L + W
Sbjct: 238 GYTRRVEGWMAVQKQKRIYHLGSLPPFLLVLAGDIQAVDHRW 279
>gi|242049946|ref|XP_002462717.1| hypothetical protein SORBIDRAFT_02g030820 [Sorghum bicolor]
gi|241926094|gb|EER99238.1| hypothetical protein SORBIDRAFT_02g030820 [Sorghum bicolor]
Length = 683
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 60/286 (20%), Positives = 113/286 (39%), Gaps = 57/286 (19%)
Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPK---GSEDM----------------VSFYK------ 159
+V+VNST+ ++ P+ FHL K G+ +M V +K
Sbjct: 385 SVVVNSTIMNAKEPEKHVFHLVTDKLNFGAMNMWFLLNPPGKATIHVENVDEFKWLNSSY 444
Query: 160 ---LKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVP---FVIASVHQSLSKF 213
L+ L + E++ + + + S+ +KY + ++ F + V+ L K
Sbjct: 445 CPVLRQLESAAMKEYYFKADRPTTLSAGSSNLKYRNPKYLSMLNHLRFYLPEVYPKLDKI 504
Query: 214 IYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWV 270
+++ ++V+ + L +DL+ A + C E R + Y+N S P +
Sbjct: 505 LFLDDDIVVQKDLTGLWDVDLNGKVNGAVETCGESFHRFDKYLN----------FSNPHI 554
Query: 271 SGKPYAVNSCMPDLGMLLIDARKLEKY-ILEAFLWWKKVINQRDRS---IGRSPAIALAL 326
+ + + N+C GM + D R+ +K I + W+ + DR+ +G P L
Sbjct: 555 A-RNFDPNACGWAYGMNIFDLREWKKKDITGIYHKWQNL--NEDRALWKLGTLPPGLLTF 611
Query: 327 YDRYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPMTACSEFG 366
Y L SW V S ++ + + Y+G M E
Sbjct: 612 YKLTHPLDKSWHVLGLGYNPSIDRSEIDSAAVVHYNGNMKPWLELA 657
>gi|357118841|ref|XP_003561157.1| PREDICTED: alpha-1,4-galacturonosyltransferase 1-like [Brachypodium
distachyon]
Length = 589
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 58/277 (20%), Positives = 110/277 (39%), Gaps = 53/277 (19%)
Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPK---GSEDM----------------VSFYK------ 159
+V+VNST+ ++ P+ FHL K G+ +M V +K
Sbjct: 291 SVVVNSTIVNAKEPEKHVFHLVTDKLNFGAMNMWFLLNPPGDATIHVENVDDFKWLNSSY 350
Query: 160 ---LKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVP---FVIASVHQSLSKF 213
LK L + E++ + + +K + S+ +KY + ++ F + V+ L+K
Sbjct: 351 CPVLKQLESAAMKEYYFKADRQKTLSAGSSNLKYRNPKYLSMLNHLRFYLPQVYPKLNKI 410
Query: 214 IYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWV 270
+++ ++V+ + L +DL+ A + C E R + Y+N S P +
Sbjct: 411 LFLDDDIVVQKDLTGLWEVDLNGNVNGAVETCGESFHRFDKYLN----------FSNPNI 460
Query: 271 SGKPYAVNSCMPDLGMLLIDARKLEKYILEAFL--WWKKVINQRDRSIGRSPAIALALYD 328
+ + + N+C GM + D + +K + W N+ +G P + Y
Sbjct: 461 A-QNFDPNACGWAYGMNMFDLEEWKKKDITGIYHKWQNMNENRLLWKLGTLPPGLMTFYK 519
Query: 329 RYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPM 359
L SW V S ++ + I Y+G M
Sbjct: 520 LTHPLDKSWHVLGLGYNPTVEHSEIDSAAVIHYNGNM 556
>gi|293331885|ref|NP_001169332.1| uncharacterized protein LOC100383199 precursor [Zea mays]
gi|224028751|gb|ACN33451.1| unknown [Zea mays]
gi|413933401|gb|AFW67952.1| hypothetical protein ZEAMMB73_846223 [Zea mays]
Length = 590
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 59/277 (21%), Positives = 109/277 (39%), Gaps = 53/277 (19%)
Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPK---GSEDM----------------VSFYK------ 159
+V+VNST+ ++ P+ FHL K G+ +M V +K
Sbjct: 292 SVVVNSTIMNAKEPEKHVFHLVTDKLNFGAMNMWFLLNPPGDATMHVENVDDFKWLNSSY 351
Query: 160 ---LKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVP---FVIASVHQSLSKF 213
LK L + E++ + + K + S+ +KY + ++ F + V+ L+K
Sbjct: 352 CPVLKQLESAAMKEYYFKADRPKTLSAGSSNLKYRNPKYLSMLNHLRFYLPQVYPKLNKI 411
Query: 214 IYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWV 270
+++ ++V+ + L +DL+ A + C E R + Y+N S P +
Sbjct: 412 LFLDDDIVVQRDLTGLWEVDLNGNVNGAVETCGESFHRFDKYLN----------FSNPNI 461
Query: 271 SGKPYAVNSCMPDLGMLLIDARKLEKYILEAFL--WWKKVINQRDRSIGRSPAIALALYD 328
+ + + N+C GM + D + +K + W N+ +G P L Y
Sbjct: 462 A-QNFDPNACGWAYGMNMFDLEEWKKKDITGIYHKWQNMNENRLLWKLGTLPPGLLTFYK 520
Query: 329 RYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPM 359
L SW V S ++ + I Y+G M
Sbjct: 521 LTHPLDKSWHVLGLGYNPTVERSEIDNAAVIHYNGNM 557
>gi|356522095|ref|XP_003529685.1| PREDICTED: probable galacturonosyltransferase 4-like [Glycine max]
Length = 661
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 37/170 (21%), Positives = 72/170 (42%), Gaps = 22/170 (12%)
Query: 201 FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDV 257
F + + L+K +++ ++V+ + L IDL A + C E R + Y+N
Sbjct: 470 FYLPEIFPKLNKVLFLDDDIVVQKDLTGLWSIDLKGNVNGAVETCGERFHRFDRYLN--- 526
Query: 258 LDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEKY-ILEAFLWWKKVINQRDR-S 315
S P ++ K + +C GM + D + ++ I E + W+K+ + R
Sbjct: 527 -------FSNPHIA-KNFDPRACGWAYGMNVFDLVQWKRQNITEVYHNWQKLNHDRQLWK 578
Query: 316 IGRSPAIALALYDRYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPM 359
+G P + + R +L+ SW V + + ++ I Y+G M
Sbjct: 579 LGTLPPGLITFWKRTFQLNRSWHVLGLGYNPNINQKEIERAAVIHYNGNM 628
>gi|326502964|dbj|BAJ99110.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 591
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 58/277 (20%), Positives = 110/277 (39%), Gaps = 53/277 (19%)
Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPK---GSEDM----------------VSFYK------ 159
+V+VNST+ ++ P+ FHL K G+ +M V +K
Sbjct: 293 SVVVNSTIVNAKEPEKHVFHLVTDKLNFGAMNMWFLLNPPGDATIHVENVDDFKWLNSSY 352
Query: 160 ---LKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVP---FVIASVHQSLSKF 213
LK L + E++ + + +K + S+ +KY + ++ F + V+ L+K
Sbjct: 353 CPVLKQLESAAMKEYYFKADRQKTLSAGSSNLKYRNPKYLSMLNHLRFYLPQVYPKLNKI 412
Query: 214 IYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWV 270
+++ ++V+ + L +DL+ A + C E R + Y+N S P +
Sbjct: 413 LFLDDDIVVQKDLTGLWEVDLNGNVNGAVETCGESFHRFDKYLN----------FSNPNI 462
Query: 271 SGKPYAVNSCMPDLGMLLIDARKLEKYILEAFL--WWKKVINQRDRSIGRSPAIALALYD 328
S + + N+C GM + D + +K + W N+ +G P + Y
Sbjct: 463 S-QNFDPNACGWAYGMNMFDLEEWKKKDITGIYHKWQNMNENRLLWKLGTLPPGLMTFYK 521
Query: 329 RYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPM 359
L SW V + ++ + I Y+G M
Sbjct: 522 LTHPLDKSWHVLGLGYNPTVEHAEIDTAAVIHYNGNM 558
>gi|147846492|emb|CAN79512.1| hypothetical protein VITISV_014158 [Vitis vinifera]
Length = 648
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 48/232 (20%), Positives = 94/232 (40%), Gaps = 50/232 (21%)
Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQEEVKKVIRTA 184
+V++NST+ + P+ FH+ D +SF +K+ F + +
Sbjct: 391 SVVINSTMLXASEPEKHVFHIVT-----DKLSFAAMKMWF----------------LVNS 429
Query: 185 STGVKYSVQNFEEI-----------VPFVIASVHQSLSKFIYMSPSVIVKGRVEELIGID 233
V V+N ++ + F + V+ L K +++ ++V+ + L +D
Sbjct: 430 PAKVTIQVENIDDFKNPKYLSMLNHLRFYLPEVYPKLEKILFLDDDIVVQKDLTPLWSLD 489
Query: 234 LSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLLID 290
+ AA + C E R + Y+N S P +S + + N+C GM + D
Sbjct: 490 MQGMVNAAVETCKESFHRFDKYLN----------FSHPKIS-ENFDPNACGWAFGMNMFD 538
Query: 291 ARKLEKYILEAFLWWKKVINQRDRS---IGRSPAIALALYDRYLKLSSSWLV 339
++ K + + + +N+ DR+ +G P + Y+ L SW V
Sbjct: 539 LKEWRKRNMTGIYHYWQDMNE-DRTLWKLGSLPPGLITFYNLTYPLDRSWHV 589
>gi|212721098|ref|NP_001131608.1| uncharacterized protein LOC100192958 [Zea mays]
gi|194692022|gb|ACF80095.1| unknown [Zea mays]
Length = 228
Score = 40.4 bits (93), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 44/213 (20%), Positives = 85/213 (39%), Gaps = 29/213 (13%)
Query: 170 EFHGQEEVKKVIRTASTGVKYSVQNFEEIVP---FVIASVHQSLSKFIYMSPSVIVKGRV 226
E++ + + + S+ +KY + ++ F + V+ L K +++ ++V+ +
Sbjct: 3 EYYFKADRPTTLSAGSSNLKYRNPKYLSMLNHLRFYLPEVYPKLDKILFLDDDIVVQKDL 62
Query: 227 EELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPD 283
L +DL+ A + C E R + Y+N S P ++ + + N+C
Sbjct: 63 TGLWDVDLNGKVNGAVETCGESFHRFDKYLN----------FSNPHIA-RNFDPNACGWA 111
Query: 284 LGMLLIDARKLEKY-ILEAFLWWKKVINQRDR---SIGRSPAIALALYDRYLKLSSSWLV 339
GM + D R+ +K I + W+ + DR +G P L Y L SW V
Sbjct: 112 YGMNIFDLREWKKKDITGIYHKWQNM--NEDRVLWKLGTLPPGLLTFYKLTHPLDKSWHV 169
Query: 340 T------DSTSSVVNKSLAIRYDGPMTACSEFG 366
S ++ + + Y+G M E
Sbjct: 170 LGLGYNPSIDRSEIDNAAVVHYNGNMKPWLELA 202
>gi|414872193|tpg|DAA50750.1| TPA: hypothetical protein ZEAMMB73_194342 [Zea mays]
Length = 593
Score = 40.4 bits (93), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 59/277 (21%), Positives = 109/277 (39%), Gaps = 53/277 (19%)
Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPK---GSEDM----------------VSFYK------ 159
+V+VNST+ ++ P+ FHL K G+ +M V +K
Sbjct: 295 SVVVNSTIMNAKEPEKHVFHLVTDKLNFGAMNMWFLLNPPGDATIHVENVDDFKWLNSSY 354
Query: 160 ---LKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVP---FVIASVHQSLSKF 213
LK L + E++ + + K + S+ +KY + ++ F + V+ L+K
Sbjct: 355 CPVLKQLESAAMKEYYFKADRPKTLSAGSSNLKYRNPKYLSMLNHLRFYLPQVYPKLNKI 414
Query: 214 IYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWV 270
+++ ++V+ + L +DL+ A + C E R + Y+N S P +
Sbjct: 415 LFLDDDIVVQRDLTGLWEVDLNGNVNGAVETCGESFHRFDKYLN----------FSNPNI 464
Query: 271 SGKPYAVNSCMPDLGMLLIDARKLEKYILEAFL--WWKKVINQRDRSIGRSPAIALALYD 328
+ + + N+C GM + D + +K + W N+ +G P L Y
Sbjct: 465 A-QNFDPNACGWAYGMNMFDLEEWKKKDITGIYHKWQNMNENRLLWKLGTLPPGLLTFYK 523
Query: 329 RYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPM 359
L SW V S ++ + I Y+G M
Sbjct: 524 LTHPLDKSWHVLGLGYNPTIERSEIDNAAVIHYNGNM 560
>gi|212275396|ref|NP_001130678.1| uncharacterized protein LOC100191781 precursor [Zea mays]
gi|194688930|gb|ACF78549.1| unknown [Zea mays]
gi|414872194|tpg|DAA50751.1| TPA: hypothetical protein ZEAMMB73_194342 [Zea mays]
Length = 588
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 59/277 (21%), Positives = 109/277 (39%), Gaps = 53/277 (19%)
Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPK---GSEDM----------------VSFYK------ 159
+V+VNST+ ++ P+ FHL K G+ +M V +K
Sbjct: 290 SVVVNSTIMNAKEPEKHVFHLVTDKLNFGAMNMWFLLNPPGDATIHVENVDDFKWLNSSY 349
Query: 160 ---LKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVP---FVIASVHQSLSKF 213
LK L + E++ + + K + S+ +KY + ++ F + V+ L+K
Sbjct: 350 CPVLKQLESAAMKEYYFKADRPKTLSAGSSNLKYRNPKYLSMLNHLRFYLPQVYPKLNKI 409
Query: 214 IYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWV 270
+++ ++V+ + L +DL+ A + C E R + Y+N S P +
Sbjct: 410 LFLDDDIVVQRDLTGLWEVDLNGNVNGAVETCGESFHRFDKYLN----------FSNPNI 459
Query: 271 SGKPYAVNSCMPDLGMLLIDARKLEKYILEAFL--WWKKVINQRDRSIGRSPAIALALYD 328
+ + + N+C GM + D + +K + W N+ +G P L Y
Sbjct: 460 A-QNFDPNACGWAYGMNMFDLEEWKKKDITGIYHKWQNMNENRLLWKLGTLPPGLLTFYK 518
Query: 329 RYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPM 359
L SW V S ++ + I Y+G M
Sbjct: 519 LTHPLDKSWHVLGLGYNPTIERSEIDNAAVIHYNGNM 555
>gi|413933400|gb|AFW67951.1| hypothetical protein ZEAMMB73_846223 [Zea mays]
Length = 507
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 59/277 (21%), Positives = 109/277 (39%), Gaps = 53/277 (19%)
Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPK---GSEDM----------------VSFYK------ 159
+V+VNST+ ++ P+ FHL K G+ +M V +K
Sbjct: 209 SVVVNSTIMNAKEPEKHVFHLVTDKLNFGAMNMWFLLNPPGDATMHVENVDDFKWLNSSY 268
Query: 160 ---LKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVP---FVIASVHQSLSKF 213
LK L + E++ + + K + S+ +KY + ++ F + V+ L+K
Sbjct: 269 CPVLKQLESAAMKEYYFKADRPKTLSAGSSNLKYRNPKYLSMLNHLRFYLPQVYPKLNKI 328
Query: 214 IYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWV 270
+++ ++V+ + L +DL+ A + C E R + Y+N S P +
Sbjct: 329 LFLDDDIVVQRDLTGLWEVDLNGNVNGAVETCGESFHRFDKYLN----------FSNPNI 378
Query: 271 SGKPYAVNSCMPDLGMLLIDARKLEKYILEAFL--WWKKVINQRDRSIGRSPAIALALYD 328
+ + + N+C GM + D + +K + W N+ +G P L Y
Sbjct: 379 A-QNFDPNACGWAYGMNMFDLEEWKKKDITGIYHKWQNMNENRLLWKLGTLPPGLLTFYK 437
Query: 329 RYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPM 359
L SW V S ++ + I Y+G M
Sbjct: 438 LTHPLDKSWHVLGLGYNPTVERSEIDNAAVIHYNGNM 474
>gi|414886435|tpg|DAA62449.1| TPA: hypothetical protein ZEAMMB73_004043 [Zea mays]
Length = 683
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 60/286 (20%), Positives = 112/286 (39%), Gaps = 57/286 (19%)
Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPK---GSEDM----------------VSFYK------ 159
+V+VNST+ ++ P+ FHL K G+ +M V +K
Sbjct: 385 SVVVNSTIMNAKEPEKHVFHLVTDKLNFGAMNMWFLLNPPGKATIHVENVDEFKWLNSSY 444
Query: 160 ---LKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVP---FVIASVHQSLSKF 213
L+ L + E++ + + + S+ +KY + ++ F + V+ L K
Sbjct: 445 CPVLRQLESAAMKEYYFKADRPTTLSAGSSNLKYRNPKYLSMLNHLRFYLPEVYPKLDKI 504
Query: 214 IYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWV 270
+++ ++V+ + L +DL+ A + C E R + Y+N S P +
Sbjct: 505 LFLDDDIVVQKDLTGLWDVDLNGKVNGAVETCGESFHRFDKYLN----------FSNPHI 554
Query: 271 SGKPYAVNSCMPDLGMLLIDARKLEKY-ILEAFLWWKKVINQRDR---SIGRSPAIALAL 326
+ + + N+C GM + D R+ +K I + W+ + DR +G P L
Sbjct: 555 A-RNFDPNACGWAYGMNIFDLREWKKKDITGIYHKWQNM--NEDRVLWKLGTLPPGLLTF 611
Query: 327 YDRYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPMTACSEFG 366
Y L SW V S ++ + + Y+G M E
Sbjct: 612 YKLTHPLDKSWHVLGLGYNPSIDRSEIDNAAVVHYNGNMKPWLELA 657
>gi|297844810|ref|XP_002890286.1| hypothetical protein ARALYDRAFT_312807 [Arabidopsis lyrata subsp.
lyrata]
gi|297336128|gb|EFH66545.1| hypothetical protein ARALYDRAFT_312807 [Arabidopsis lyrata subsp.
lyrata]
Length = 538
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 55/270 (20%), Positives = 101/270 (37%), Gaps = 48/270 (17%)
Query: 137 YPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQEE-----------VKKVIRTAS 185
+P L FH+ + S + + + L F S +E EE VK+++ T +
Sbjct: 256 HPKQLVFHIVTNRVSYNAMQAWFLSNDFKGSAIEIRSIEEFSWLNASYSPVVKQLLDTDA 315
Query: 186 TGVKYSVQNFEEIVP-----------------FVIASVHQSLSKFIYMSPSVIVKGRVEE 228
+ Q ++ + F I ++ L K I++ V+V+ +
Sbjct: 316 RAYYFGEQTSQDTISEPKVRNPKYLSLLNHLRFYIPEIYPQLEKIIFLDDDVVVQKDLTP 375
Query: 229 LIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLL 288
L +DL A + C E + Y + S P +S K + +C GM +
Sbjct: 376 LFSLDLHGNVNGAVETCLEAFHRYYK-------YLNFSNPLISSK-FDPQACGWAFGMNV 427
Query: 289 IDARKLEKYILEA-FLWWKKVINQRDRS---IGRSPAIALALYDRYLKLSSSWLV----- 339
D K + A + +W++ R+R+ +G P L+ Y L W V
Sbjct: 428 FDLIAWRKANVTARYHYWQE--QNRERTLWKLGTLPPGLLSFYGLTEPLDRRWHVLGLGY 485
Query: 340 -TDSTSSVVNKSLAIRYDGPMTACSEFGDG 368
+ + ++ + I Y+G M + G G
Sbjct: 486 DVNIDNRLIETAAVIHYNGNMKPWLKLGIG 515
>gi|225436890|ref|XP_002271296.1| PREDICTED: probable galacturonosyltransferase-like 4 [Vitis
vinifera]
Length = 356
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 47/232 (20%), Positives = 91/232 (39%), Gaps = 25/232 (10%)
Query: 104 SQDDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVL 163
S D +H+V + LR + S L S P+ + FH D+ F +
Sbjct: 66 SSDADRIHVVMTLDANYLRGTIAALLSILQHSTCPENIDFHFLWSHFESDI--FSSINST 123
Query: 164 FPHSNLEFHG------QEEVKKVIRTA-STGVKYSVQNFEEIVPFVIASVHQSLSKFIYM 216
FP N + + + ++ K IR A + Y+ +I+P + + IY+
Sbjct: 124 FPFLNFKVYRFDSNRVRGKISKSIRRALDQPLNYARIYLGDILP-------ADVRRVIYL 176
Query: 217 SPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYA 276
+++ + +L G++L + +AA + C +Y R+ ++ + KP
Sbjct: 177 DSDLVMVDDIAKLWGVELGDKVLAAPEYCHANFTNYFTSAFWS--DRALARTFDGRKP-- 232
Query: 277 VNSCMPDLGMLLIDARKLEK--YILEAFLWWKKVINQRDRSIGRSPAIALAL 326
C + G++++D K + Y + W +R +G P L L
Sbjct: 233 ---CYFNTGVMVVDVEKWREGGYTKKVEEWMAVQKKKRIYQLGSLPPFLLVL 281
>gi|255566847|ref|XP_002524407.1| transferase, transferring glycosyl groups, putative [Ricinus
communis]
gi|223536368|gb|EEF38018.1| transferase, transferring glycosyl groups, putative [Ricinus
communis]
Length = 344
Score = 40.4 bits (93), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 54/246 (21%), Positives = 92/246 (37%), Gaps = 35/246 (14%)
Query: 95 RNSGSISSISQDDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDM 154
RN + +HI + LR V S L + P+ + FH ++
Sbjct: 45 RNGRECLQTTSSSTFIHIAMTLDSTYLRGSVAGVFSVLQHASCPENIIFHFIATHRRAEL 104
Query: 155 VSFYKLKVLFPHSNLE-FHGQEEVKK------VIRTASTGVKYSVQNFEEIVPFVIASVH 207
+ V FP+ N +H ++ K V R + Y+ +++P
Sbjct: 105 RR--TITVTFPYLNFHLYHFDSDLVKGKISSSVRRALDQPLNYARFYLADLLP------- 155
Query: 208 QSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQ 267
S+ + IY +IV V +L I+L + + A + C +Y N S+
Sbjct: 156 ASVPRIIYFDSDLIVVDDVAKLWNINLGVHVLGAPEYCHANFTNYFN-----------SR 204
Query: 268 PWVSGKPYAVN-----SCMPDLGMLLIDARKL-EKYILEAFLWWKKVINQ-RDRSIGRSP 320
W S + YA + +C + G+++ID K E E +W KV + R +G P
Sbjct: 205 FW-SNEGYAASFRERRACYFNTGVMVIDLMKWREGKYREKLEYWMKVQKKYRIYELGSLP 263
Query: 321 AIALAL 326
L
Sbjct: 264 PFLLVF 269
>gi|409894769|gb|AFV46220.1| hypothetical protein, partial [Scutellaria baicalensis]
Length = 258
Score = 40.4 bits (93), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 43/173 (24%), Positives = 73/173 (42%), Gaps = 28/173 (16%)
Query: 201 FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDC---SERLNSYVNPDV 257
F + ++ L K +++ V+V+ + L IDL A + C RL+ Y+N
Sbjct: 67 FYMPEMYPKLHKILFLDDDVVVQKDLTALWKIDLDGKVNGAINTCFGSFHRLSEYLN--- 123
Query: 258 LDAIQRSASQPWVSGKPYAVNSCMPDLGMLL--IDARKLEKYILEAFLWWKKVINQRDRS 315
S P + K + +C LGM + +DA +LEK E + +W+ + DR+
Sbjct: 124 -------FSHPLIKEK-FNPRACAWALGMNIFDLDAWRLEK-CTEQYHYWQNL--NEDRT 172
Query: 316 I---GRSPAIALALYDRYLKLSSSWLVTDSTSS------VVNKSLAIRYDGPM 359
+ G PA + Y L +W V S+ + K+ I + G M
Sbjct: 173 LWKMGTLPAGLVTFYSTTKPLDKAWHVLGLGSNPSISMEEIEKAAVIHFSGDM 225
>gi|357486289|ref|XP_003613432.1| Glycosyltransferase CAZy family GT8 [Medicago truncatula]
gi|355514767|gb|AES96390.1| Glycosyltransferase CAZy family GT8 [Medicago truncatula]
Length = 395
Score = 40.0 bits (92), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 60/321 (18%), Positives = 115/321 (35%), Gaps = 18/321 (5%)
Query: 56 LGLLQFLPATHFRHPSDPFRIWVPFNSNTSFSKTRDSGDRNSGSISSISQDDGMVHIVSW 115
LG+ F H PFR F+ + + S ++ S D +VHI
Sbjct: 25 LGIRSFPTTNTAEHSPTPFRFSGAFSEAPEYRNGAGCPKQKSKTLFPSSCDPSLVHIAMT 84
Query: 116 MQCLDLRLLAVLVNSTLSGSRYPDLLHFHL----FVPKGSEDMVSFYKLKVLFPHSNLEF 171
+ LR V+S L S P+ + FH F P + + +FP N +
Sbjct: 85 LDSGYLRGSIAAVHSVLRHSSCPENIFFHFISAEFDPTTPRTLTRL--VASVFPSLNFKV 142
Query: 172 HGQEEVKKVIRTASTGVKYSVQN-FEEIVPFVIASVHQSLSKFIYMSPSVIVKGRVEELI 230
+ E VI S+ ++ +++N ++ + + + IY+ ++V + +L
Sbjct: 143 YIFRE-DTVINLISSSIRLALENPLNYARNYLGDMLDTCVERVIYLDSDIVVVDDISKLW 201
Query: 231 GIDL-SNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLLI 289
+ + + I A + C Y + + P +S A +C + G++++
Sbjct: 202 SVKMDAKKVIGAPEYCHANFTKYFTDEFWN-------DPLLSRVFKARKACYFNTGVMVM 254
Query: 290 DARKLEK--YILEAFLWWKKVINQRDRSIGRSPAIALALYDRYLKLSSSWLVTDSTSSVV 347
D K + Y + W + +R +G P L + W +
Sbjct: 255 DLMKWREGNYRRKIENWMELQKKRRIYELGSLPPFLLVFAGNVEAIDHRWNQHGLGGDNL 314
Query: 348 NKSLAIRYDGPMTACSEFGDG 368
N + GP++ G G
Sbjct: 315 NGVCRSLHPGPVSLLHWSGKG 335
>gi|413933398|gb|AFW67949.1| hypothetical protein ZEAMMB73_846223 [Zea mays]
Length = 269
Score = 40.0 bits (92), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 43/209 (20%), Positives = 83/209 (39%), Gaps = 25/209 (11%)
Query: 170 EFHGQEEVKKVIRTASTGVKYSVQNFEEIVP---FVIASVHQSLSKFIYMSPSVIVKGRV 226
E++ + + K + S+ +KY + ++ F + V+ L+K +++ ++V+ +
Sbjct: 44 EYYFKADRPKTLSAGSSNLKYRNPKYLSMLNHLRFYLPQVYPKLNKILFLDDDIVVQRDL 103
Query: 227 EELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPD 283
L +DL+ A + C E R + Y+N S P ++ + + N+C
Sbjct: 104 TGLWEVDLNGNVNGAVETCGESFHRFDKYLN----------FSNPNIA-QNFDPNACGWA 152
Query: 284 LGMLLIDARKLEKYILEAFL--WWKKVINQRDRSIGRSPAIALALYDRYLKLSSSWLVTD 341
GM + D + +K + W N+ +G P L Y L SW V
Sbjct: 153 YGMNMFDLEEWKKKDITGIYHKWQNMNENRLLWKLGTLPPGLLTFYKLTHPLDKSWHVLG 212
Query: 342 ------STSSVVNKSLAIRYDGPMTACSE 364
S ++ + I Y+G M E
Sbjct: 213 LGYNPTVERSEIDNAAVIHYNGNMKPWLE 241
>gi|357151818|ref|XP_003575914.1| PREDICTED: probable galacturonosyltransferase 6-like [Brachypodium
distachyon]
Length = 536
Score = 40.0 bits (92), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 62/286 (21%), Positives = 111/286 (38%), Gaps = 51/286 (17%)
Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPH-----SNLEFHGQEEVKK 179
AV++NST++ S+ P + FH+ D +SF + + F + ++ +E K
Sbjct: 261 AVVINSTIAASKDPKRIMFHVVA-----DALSFPAMMMWFLSNPPSPATIQIENLDEFKW 315
Query: 180 V-----IRTASTGVK----YSVQNFEEIVPFVIASVHQSLSKFIYMSPSVIVKGRVEELI 230
+ R G++ S N + F + V SLSK + + V+V+ + L
Sbjct: 316 LPSDFSSRFKQKGIRDPRYTSALNH---LRFYLPQVFPSLSKVLLLDHDVVVQKDLSGLW 372
Query: 231 GIDLSNYAIAAADDCSE-----RLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLG 285
ID+ + A + C+ RL ++VN S P + K + +C+ G
Sbjct: 373 EIDMKHKVNGALETCTSGYGYLRLENFVN----------FSDPSIFNK-FNAKACIYAFG 421
Query: 286 MLLIDARKLEKYILEAFL--WWKKVINQRDRSIGRSPAIALALYDRYLKLSSSWLV---- 339
M + D + L A W++ +R G P L Y++ + L + W V
Sbjct: 422 MNIFDLTEWRNKGLTATYDKWFQMGKRRRLWKAGSLPLGQLVFYNQTVPLDNRWHVLGLG 481
Query: 340 --TDSTSSVVNKSLAIRYDGPMTACSEFGDGANMEPARGDLWKQHL 383
++ + + I Y G + E P D W + L
Sbjct: 482 RDSNMEREEIESAAVIHYSGNLKPWLEISI-----PKYRDYWNRFL 522
>gi|145356270|ref|XP_001422356.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582597|gb|ABP00673.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 259
Score = 39.7 bits (91), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 63/266 (23%), Positives = 113/266 (42%), Gaps = 32/266 (12%)
Query: 109 MVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGS----EDMVSFYKLKVLF 164
+VHI L L +++S LS + P + FH+F + + ++ Y + F
Sbjct: 2 LVHIAFACDPTQLFTLGPVISSVLSATASPHRIRFHIFTARDALTDASVQLNCYSRAIPF 61
Query: 165 PHSNLEFHGQEEVKKVIR---TASTGVKYSVQNFEEIVPFVIASVHQSLSKFIYMSPSVI 221
E H E K +IR T + ++ +QN F A + + K +Y+ +I
Sbjct: 62 I---WELH--EFSKDMIRANITVHSRKEWRLQNAFNYARFYFAEILSDVQKVVYLDTDII 116
Query: 222 VKGRVEELIGIDL--SNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNS 279
VKG + L +L S+ ++ AA S L S +N +A +S+ + K ++ N+
Sbjct: 117 VKGDICRLHDANLRSSSTSVIAAVKRSVPLGSLLNFS--NAAVKSSG---LREKMHSFNA 171
Query: 280 CMPDLGMLLIDARKLEKYILEAFL--WWKKVINQRDRSIGRSPAIALALYDRYLKLSSSW 337
G+LLID + + + + W K + S G P + L D + + S W
Sbjct: 172 -----GVLLIDLESWRRKRITSTVETWLKMNSVSKLYSHGSQPPLLLVFGDSFESIPSHW 226
Query: 338 LVTDS------TSSVVNKSLAIRYDG 357
V +SV+N++ + + G
Sbjct: 227 NVDGVGYKKGLRASVLNEARVLHWSG 252
>gi|194689812|gb|ACF78990.1| unknown [Zea mays]
gi|414872195|tpg|DAA50752.1| TPA: hypothetical protein ZEAMMB73_194342 [Zea mays]
Length = 291
Score = 39.7 bits (91), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 43/209 (20%), Positives = 83/209 (39%), Gaps = 25/209 (11%)
Query: 170 EFHGQEEVKKVIRTASTGVKYSVQNFEEIVP---FVIASVHQSLSKFIYMSPSVIVKGRV 226
E++ + + K + S+ +KY + ++ F + V+ L+K +++ ++V+ +
Sbjct: 66 EYYFKADRPKTLSAGSSNLKYRNPKYLSMLNHLRFYLPQVYPKLNKILFLDDDIVVQRDL 125
Query: 227 EELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPD 283
L +DL+ A + C E R + Y+N S P ++ + + N+C
Sbjct: 126 TGLWEVDLNGNVNGAVETCGESFHRFDKYLN----------FSNPNIA-QNFDPNACGWA 174
Query: 284 LGMLLIDARKLEKYILEAFL--WWKKVINQRDRSIGRSPAIALALYDRYLKLSSSWLVTD 341
GM + D + +K + W N+ +G P L Y L SW V
Sbjct: 175 YGMNMFDLEEWKKKDITGIYHKWQNMNENRLLWKLGTLPPGLLTFYKLTHPLDKSWHVLG 234
Query: 342 ------STSSVVNKSLAIRYDGPMTACSE 364
S ++ + I Y+G M E
Sbjct: 235 LGYNPTIERSEIDNAAVIHYNGNMKPWLE 263
>gi|242097150|ref|XP_002439065.1| hypothetical protein SORBIDRAFT_10g030870 [Sorghum bicolor]
gi|241917288|gb|EER90432.1| hypothetical protein SORBIDRAFT_10g030870 [Sorghum bicolor]
Length = 697
Score = 39.7 bits (91), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 33/144 (22%), Positives = 63/144 (43%), Gaps = 16/144 (11%)
Query: 201 FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDV 257
F + +H L K +++ V+V+ + L IDL A + C E R ++Y+N
Sbjct: 505 FYMPEIHPKLEKILFLDDDVVVQKDLTPLWDIDLKGMVNGAVETCKESFHRFDTYLN--- 561
Query: 258 LDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEKY-ILEAFLWWKKVINQRDR-S 315
S P +S + + ++C GM + D ++ +K I + +W+ + R
Sbjct: 562 -------FSHPKIS-ENFDPHACGWAFGMNMFDLKEWKKRNITGIYHYWQDLNEDRKLWK 613
Query: 316 IGRSPAIALALYDRYLKLSSSWLV 339
+G P + Y+ L+ +W V
Sbjct: 614 LGTLPPGLITFYNLTYPLNRTWHV 637
>gi|356570614|ref|XP_003553480.1| PREDICTED: probable galacturonosyltransferase 6-like [Glycine max]
Length = 625
Score = 39.7 bits (91), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 60/256 (23%), Positives = 107/256 (41%), Gaps = 29/256 (11%)
Query: 120 DLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQE---- 175
+L AV+VNST+S ++ + L FH V S + + + +L P H Q
Sbjct: 346 NLLACAVVVNSTVSNAKKKEKLVFH--VVTNSLNFPAIWMWFLLNPPGKATVHIQSIENF 403
Query: 176 ----EVKKVIRTASTGVKYSVQ-NFEEIVPFVIASVHQSLSKFIYMSPSVIVKGRVEELI 230
+ S+ +Y+ + N+ + F + + +L+K + V+V+ + L
Sbjct: 404 EWLPMYNTFNKHNSSDPRYTSELNY---LRFYLPDIFPTLNKILLFDHDVVVQQDLSGLW 460
Query: 231 GIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLLID 290
+L IAA C E S+ D+L S P+++ + + N+C GM L D
Sbjct: 461 NANLKGKVIAAVGTCQEGGTSFHRMDMLINF----SDPFIAER-FDANACTWAFGMNLFD 515
Query: 291 ARKLEKYILEAFLWWKKVINQRDR---SIGRSPAIALALYDRYLKLSSSW----LVTDS- 342
++ ++ L L+ + + R +IG P L Y++ L W L DS
Sbjct: 516 LQQWRRHNLTT-LYHRYLQMGSKRPLWNIGSLPLGWLTFYNKTKVLDRRWHILGLGYDSG 574
Query: 343 -TSSVVNKSLAIRYDG 357
+ + + I YDG
Sbjct: 575 VDKNEIEGAAVIHYDG 590
>gi|218198875|gb|EEC81302.1| hypothetical protein OsI_24438 [Oryza sativa Indica Group]
gi|222636212|gb|EEE66344.1| hypothetical protein OsJ_22634 [Oryza sativa Japonica Group]
Length = 588
Score = 39.7 bits (91), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 59/277 (21%), Positives = 109/277 (39%), Gaps = 53/277 (19%)
Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPK---GSEDM----------------VSFYK------ 159
+V+VNST+ ++ P+ FHL K G+ +M V +K
Sbjct: 290 SVVVNSTIMNAKEPEKHVFHLVTDKLNFGAMNMWFLLNPPGDATIHVENVDDFKWLNSSY 349
Query: 160 ---LKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVP---FVIASVHQSLSKF 213
LK L + E++ + + K + S+ +KY + ++ F + V+ L+K
Sbjct: 350 CPVLKQLESVAMKEYYFKADRPKTLSAGSSNLKYRNPKYLSMLNHLRFYLPQVYPKLNKI 409
Query: 214 IYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWV 270
+++ ++V+ + L +DL+ A + C E R + Y+N S P +
Sbjct: 410 LFLDDDIVVQKDLTGLWEVDLNGNVNGAVETCGESFHRFDKYLN----------FSNPNI 459
Query: 271 SGKPYAVNSCMPDLGMLLIDARKLEKYILEAFL--WWKKVINQRDRSIGRSPAIALALYD 328
+ + + N+C GM + D + +K + W N+ +G P L Y
Sbjct: 460 A-QNFDPNACGWAYGMNMFDLEEWKKKDITGIYHKWQNMNENRLLWKLGTLPPGLLTFYK 518
Query: 329 RYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPM 359
L SW V S ++ + I Y+G M
Sbjct: 519 LTHPLDKSWHVLGLGYNPSIERSEIDNAAVIHYNGNM 555
>gi|356532892|ref|XP_003535003.1| PREDICTED: probable galacturonosyltransferase 4-like isoform 2
[Glycine max]
Length = 663
Score = 39.7 bits (91), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 47/222 (21%), Positives = 91/222 (40%), Gaps = 39/222 (17%)
Query: 150 GSEDMVSFY-KLKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVPFVIASVHQ 208
GS+ M+ +Y K + SNL+F + + S+ N + F + +
Sbjct: 436 GSQSMIDYYFKAHRVTSDSNLKFRNPKYL-------------SILNH---LRFYLPEIFP 479
Query: 209 SLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSA 265
L+K +++ ++V+ + +L IDL A + C E R + Y+N
Sbjct: 480 KLNKVLFLDDDIVVQKDLTDLWSIDLKGNVNGAVETCGESFHRFDRYLN----------F 529
Query: 266 SQPWVSGKPYAVNSCMPDLGMLLID-ARKLEKYILEAFLWWKKVINQRDR-SIGRSPAIA 323
S P ++ K + ++C GM + D A + I E + W+ + + R +G P
Sbjct: 530 SNPLIA-KNFDPHACGWAYGMNVFDLAEWKRQNITEVYHNWQNLNHDRQLWKLGTLPPGL 588
Query: 324 LALYDRYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPM 359
+ + R L+ SW + + + +S + Y+G M
Sbjct: 589 ITFWKRTFPLNRSWHILGLGYNPNVNQRDIEQSAVVHYNGNM 630
>gi|356532890|ref|XP_003535002.1| PREDICTED: probable galacturonosyltransferase 4-like isoform 1
[Glycine max]
Length = 657
Score = 39.3 bits (90), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 47/222 (21%), Positives = 91/222 (40%), Gaps = 39/222 (17%)
Query: 150 GSEDMVSFY-KLKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVPFVIASVHQ 208
GS+ M+ +Y K + SNL+F + + S+ N + F + +
Sbjct: 430 GSQSMIDYYFKAHRVTSDSNLKFRNPKYL-------------SILNH---LRFYLPEIFP 473
Query: 209 SLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSA 265
L+K +++ ++V+ + +L IDL A + C E R + Y+N
Sbjct: 474 KLNKVLFLDDDIVVQKDLTDLWSIDLKGNVNGAVETCGESFHRFDRYLN----------F 523
Query: 266 SQPWVSGKPYAVNSCMPDLGMLLID-ARKLEKYILEAFLWWKKVINQRDR-SIGRSPAIA 323
S P ++ K + ++C GM + D A + I E + W+ + + R +G P
Sbjct: 524 SNPLIA-KNFDPHACGWAYGMNVFDLAEWKRQNITEVYHNWQNLNHDRQLWKLGTLPPGL 582
Query: 324 LALYDRYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPM 359
+ + R L+ SW + + + +S + Y+G M
Sbjct: 583 ITFWKRTFPLNRSWHILGLGYNPNVNQRDIEQSAVVHYNGNM 624
>gi|297809943|ref|XP_002872855.1| hypothetical protein ARALYDRAFT_490360 [Arabidopsis lyrata subsp.
lyrata]
gi|297318692|gb|EFH49114.1| hypothetical protein ARALYDRAFT_490360 [Arabidopsis lyrata subsp.
lyrata]
Length = 346
Score = 39.3 bits (90), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 70/327 (21%), Positives = 122/327 (37%), Gaps = 29/327 (8%)
Query: 49 ALGLIVFLGLLQ-FLPATHFRHPSDPFRIWVPFNSNTSFSKTRDSGDR--NSGSISSISQ 105
A+ +IV LQ F PA R P P N +GD +SG + + S
Sbjct: 14 AMAVIVLSPSLQSFPPAAAIRSSPSPIFRKAPVVFN--------NGDECLSSGGVCNPS- 64
Query: 106 DDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFP 165
+VH+ + LR VNS L S P+ + FH V +++ ++ +FP
Sbjct: 65 ---LVHVAITLDVEYLRGSIAAVNSILQHSICPESVFFHFIVVSEETNLLESL-VRSIFP 120
Query: 166 HSNLEFHGQEEVKKVIR-TASTGVKYSV-QNFEEIVPFVIASVHQSLSKFIYMSPSVIVK 223
L+F+ + + +R S+ V+ ++ Q ++ + +S+ IY+ ++V
Sbjct: 121 --GLKFNIYDFAPETVRGLISSSVRQALEQPLNYARNYLADLLEPCVSRVIYLDSDLVVV 178
Query: 224 GRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPD 283
+ +L L + I A + C Y S+ SG C +
Sbjct: 179 DDIAKLWKTSLGSRIIGAPEYCHANFTKYFTGGFW-------SEERFSGAFRGRKPCYFN 231
Query: 284 LGMLLIDARKLEK--YILEAFLWWKKVINQRDRSIGRSPAIALALYDRYLKLSSSWLVTD 341
G+++ID +K + Y W + +R +G P L +S W
Sbjct: 232 TGVMVIDLKKWRRGGYTKRIEKWMEIQRTERIYDLGSLPPFLLVFAGHVAPISHRWNQHG 291
Query: 342 STSSVVNKSLAIRYDGPMTACSEFGDG 368
V S + GP++ G G
Sbjct: 292 LGGDNVRGSCRDLHPGPVSLLHWSGSG 318
>gi|224124762|ref|XP_002329942.1| glycosyltransferase, CAZy family GT8 [Populus trichocarpa]
gi|222871964|gb|EEF09095.1| glycosyltransferase, CAZy family GT8 [Populus trichocarpa]
Length = 665
Score = 39.3 bits (90), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 37/177 (20%), Positives = 73/177 (41%), Gaps = 22/177 (12%)
Query: 201 FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDV 257
F + + L+K +++ ++V+ + L +DL A + C E R + Y+N
Sbjct: 474 FYLPEIFPKLNKVLFLDDDIVVQKDLTGLWSLDLKGNVNGAVETCGENFHRFDRYLN--- 530
Query: 258 LDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEKY-ILEAFLWWKKVINQRDR-S 315
S P +S K + +C GM + D ++ ++ I + + W+K+ + R
Sbjct: 531 -------FSNPHIS-KNFDPRACGWAYGMNIFDLKEWKRQNITDVYHTWQKLNHDRQLWK 582
Query: 316 IGRSPAIALALYDRYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPMTACSEFG 366
+G P + + R L W V + + + ++ I Y+G M E G
Sbjct: 583 LGTLPPGLITFWKRTHPLDRRWHVLGLGYNPNVSQREIERAAVIHYNGNMKPWLEIG 639
>gi|356520019|ref|XP_003528664.1| PREDICTED: probable galacturonosyltransferase-like 4-like [Glycine
max]
Length = 350
Score = 39.3 bits (90), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 46/224 (20%), Positives = 91/224 (40%), Gaps = 19/224 (8%)
Query: 121 LRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQEEVKKV 180
LR V S L S P+ L FH F+ + F ++ FP+ N++ + + + +V
Sbjct: 75 LRGTMAAVLSILQHSTCPENLAFH-FLSAHDDAPELFSSIRSTFPYLNMKIY-RFDSNRV 132
Query: 181 IRTASTGVKYSVQ---NFEEIVPFVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNY 237
S ++ ++ N+ I ++ ++ + + + IY ++V + +L G+D+
Sbjct: 133 RGKISKSIRQALDQPLNYARI--YLADTIPEDVKRVIYFDSDLVVVDDIAKLWGVDMEGK 190
Query: 238 AIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLL--IDARKLE 295
+AA + C Y + S P ++ C + G+++ +D + E
Sbjct: 191 LVAAPEYCHANFTLYFTDNFW-------SDPVLAKTFEGRKPCYFNTGVMVMDVDTWRKE 243
Query: 296 KYILEAFLWWKKVINQRDR--SIGRSPAIALALYDRYLKLSSSW 337
+Y E W V Q+ R +G P L L + W
Sbjct: 244 RYT-EKVEEWMAVQKQQKRIYHLGSLPPFLLVLAGNIKAVDHRW 286
>gi|326437108|gb|EGD82678.1| hypothetical protein PTSG_03339 [Salpingoeca sp. ATCC 50818]
Length = 336
Score = 39.3 bits (90), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 17/42 (40%), Positives = 26/42 (61%)
Query: 211 SKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSY 252
S+FIY+ VIV+G + EL +D+ + IA + DCS+ Y
Sbjct: 151 SRFIYLDDDVIVQGDILELWEVDMLSRGIAVSTDCSDTAQQY 192
>gi|291527266|emb|CBK92852.1| Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases
[Eubacterium rectale M104/1]
Length = 723
Score = 39.3 bits (90), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 21/64 (32%), Positives = 35/64 (54%), Gaps = 1/64 (1%)
Query: 209 SLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQP 268
L K IY+ + V +EEL +++ NY +AAA DCS + ++ P + A Q S +
Sbjct: 440 DLKKIIYLDSDLFVNTDIEELWNLNIDNYCLAAAQDCS-TIRNWGTPYAVAAGQTSRDRY 498
Query: 269 WVSG 272
+ +G
Sbjct: 499 FNAG 502
>gi|288800023|ref|ZP_06405482.1| lipopolysaccharide 1,2-glucosyltransferase/general stress protein
[Prevotella sp. oral taxon 299 str. F0039]
gi|288333271|gb|EFC71750.1| lipopolysaccharide 1,2-glucosyltransferase/general stress protein
[Prevotella sp. oral taxon 299 str. F0039]
Length = 309
Score = 39.3 bits (90), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 24/107 (22%), Positives = 51/107 (47%), Gaps = 2/107 (1%)
Query: 143 FHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVPFV 202
FHL + K +D + L +++++F+ + T G ++ + + F+
Sbjct: 33 FHLIIDKSVDDKCKNKLQQCLVGNTSIQFYHFNDSILTKYTIGKGHHLTIATYYRL--FI 90
Query: 203 IASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERL 249
+ Q++S+ IY+ +IV+ + +L DL+N A+ A D E +
Sbjct: 91 CNLLPQNISRIIYLDCDLIVRHPLNKLWNTDLTNKALGAVTDMGEAM 137
>gi|238924869|ref|YP_002938385.1| Glycosyltransferase Family 2 modular protein [Eubacterium rectale
ATCC 33656]
gi|238876544|gb|ACR76251.1| Glycosyltransferase Family 2 modular protein [Eubacterium rectale
ATCC 33656]
Length = 723
Score = 39.3 bits (90), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 21/63 (33%), Positives = 35/63 (55%), Gaps = 1/63 (1%)
Query: 210 LSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPW 269
L K IY+ + V +EEL +++ NY +AAA DCS + ++ P + A Q S + +
Sbjct: 441 LKKIIYLDSDLFVNTDIEELWNLNIDNYCLAAAQDCS-TIRNWGTPYAVAAGQTSRDRYF 499
Query: 270 VSG 272
+G
Sbjct: 500 NAG 502
>gi|296125750|ref|YP_003633002.1| glycosyl transferase [Brachyspira murdochii DSM 12563]
gi|296017566|gb|ADG70803.1| glycosyl transferase family 8 [Brachyspira murdochii DSM 12563]
Length = 331
Score = 38.9 bits (89), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 33/143 (23%), Positives = 67/143 (46%), Gaps = 15/143 (10%)
Query: 122 RLLAVLVNSTLSGSRYPDLLHFHLF----VPKGSEDMVSFYKLKVLFPHSNLEFH--GQE 175
+ + + S LS S+ + ++FHL + ++S K+K + +++F+ E
Sbjct: 15 KYMGTTIASILSNSKEDEEIYFHLLDGGITEENKNKLLSLKKIK----NCDMKFYVPDFE 70
Query: 176 EVKKVIRTASTGVKYSVQNFEEIVPFVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLS 235
+ KK + ++S F + ++ + K +Y+ IV+ ++EL ID++
Sbjct: 71 KYKKWYELGNYNGRFSPAMFYRLDIHILMP---DIDKVLYLDCDTIVRSSLKELFEIDIN 127
Query: 236 NYAIAAADDCSERLNSYVNPDVL 258
NY AD+ R N+Y N V+
Sbjct: 128 NYYALVADNA--RNNNYFNSGVI 148
>gi|291526000|emb|CBK91587.1| Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases
[Eubacterium rectale DSM 17629]
Length = 723
Score = 38.9 bits (89), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 16/38 (42%), Positives = 23/38 (60%)
Query: 209 SLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCS 246
L K IY+ + V +EEL +++ NY +AAA DCS
Sbjct: 440 DLKKIIYLDSDLFVNTDIEELWNLNIDNYCLAAAQDCS 477
>gi|356564442|ref|XP_003550463.1| PREDICTED: uncharacterized protein LOC100794732 [Glycine max]
Length = 1469
Score = 38.9 bits (89), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 36/170 (21%), Positives = 71/170 (41%), Gaps = 22/170 (12%)
Query: 201 FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDV 257
F + + L+K +++ ++V+ + L IDL A + C E R + Y+N
Sbjct: 1278 FYLPEIFPKLNKVLFLDDDIVVQKDLTGLWSIDLKGNVNGAVETCGERFHRFDRYLN--- 1334
Query: 258 LDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEKY-ILEAFLWWKKVINQRDR-S 315
S P ++ K + +C GM + D + ++ I + + W+K+ + R
Sbjct: 1335 -------FSNPLIA-KNFDPRACGWAYGMNVFDLVQWKRQNITDVYHKWQKMNHDRQLWK 1386
Query: 316 IGRSPAIALALYDRYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPM 359
+G P + + R +L SW V + + ++ I Y+G M
Sbjct: 1387 LGTLPPGLITFWKRTFQLHRSWHVLGLGYNPNINQKEIERAAVIHYNGNM 1436
>gi|413935035|gb|AFW69586.1| hypothetical protein ZEAMMB73_786543 [Zea mays]
Length = 693
Score = 38.9 bits (89), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 32/144 (22%), Positives = 63/144 (43%), Gaps = 16/144 (11%)
Query: 201 FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDV 257
F + +H L K +++ V+V+ + L +DL A + C E R ++Y+N
Sbjct: 501 FYMPEIHPKLEKILFLDDDVVVQKDLTPLWDVDLKGMVNGAVETCKESFHRFDTYLN--- 557
Query: 258 LDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEKY-ILEAFLWWKKVINQRDR-S 315
S P +S + + ++C GM + D ++ +K I + +W+ + R
Sbjct: 558 -------FSHPKIS-ENFDPHACGWAFGMNMFDLKEWKKRNITGIYHYWQDLNEDRKLWK 609
Query: 316 IGRSPAIALALYDRYLKLSSSWLV 339
+G P + Y+ L+ +W V
Sbjct: 610 LGTLPPGLITFYNLTYPLNRTWHV 633
>gi|319639377|ref|ZP_07994128.1| glycosyl transferase family 8 [Neisseria mucosa C102]
gi|317399561|gb|EFV80231.1| glycosyl transferase family 8 [Neisseria mucosa C102]
Length = 342
Score = 38.9 bits (89), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 37/141 (26%), Positives = 62/141 (43%), Gaps = 11/141 (7%)
Query: 111 HIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSE--DMVSFYKLKVLFPHSN 168
HIV +R + + S L + +P +HFHL V GSE D F L+ + P+
Sbjct: 25 HIVYAADQNYIRHIGTSMLSVLENNHFP--IHFHLLV-SGSESYDFSIFNHLETINPNYA 81
Query: 169 LE-FHGQEEVKKVIRTASTGVKYSVQNFEEIVPFVIASVHQSLSKFIYMSPSVIVKGRVE 227
+ +H E ++T G + +P ++ + + +Y+ V+ G +
Sbjct: 82 VTVYHLNTEYFTTLQT--NGYFTIAMYYRMCIPAILGGISDTA---LYLDTDVLCLGDIS 136
Query: 228 ELIGIDLSNYAIAAADDCSER 248
EL IDL+ +AAA D R
Sbjct: 137 ELFTIDLTGQLVAAAPDTMFR 157
>gi|255567642|ref|XP_002524800.1| transferase, transferring glycosyl groups, putative [Ricinus
communis]
gi|223535984|gb|EEF37643.1| transferase, transferring glycosyl groups, putative [Ricinus
communis]
Length = 350
Score = 38.9 bits (89), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 46/226 (20%), Positives = 88/226 (38%), Gaps = 25/226 (11%)
Query: 110 VHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNL 169
+H+ + LR V S L S P+ + FH ++ F +K FP+ N
Sbjct: 66 IHVTMTLDANYLRGTMAAVLSILQHSTCPENVEFHFLWAHFEPEV--FSNIKSTFPYLNF 123
Query: 170 EFHG------QEEVKKVIRTA-STGVKYSVQNFEEIVPFVIASVHQSLSKFIYMSPSVIV 222
+ + + ++ K IR A + Y+ +I+P +++ IY+ ++V
Sbjct: 124 KIYRFDSNRVRGKISKSIRQALDQPLNYARIYLADILPL-------DVNRVIYLDSDLVV 176
Query: 223 KGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMP 282
+ +L +DL +AA + C +Y ++ + + GK C
Sbjct: 177 VDDIAKLWQVDLEGKVLAAPEYCHANFTNYFTELFWSNLEWAKT---FEGK----RPCYF 229
Query: 283 DLGMLLIDARKLEK--YILEAFLWWKKVINQRDRSIGRSPAIALAL 326
+ G++++D K Y + W +QR +G P L L
Sbjct: 230 NTGVMVVDVDKWRTGGYTQKVEEWMMVQKHQRLYDLGSLPPFLLVL 275
>gi|326494626|dbj|BAJ94432.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 538
Score = 38.9 bits (89), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 63/274 (22%), Positives = 110/274 (40%), Gaps = 53/274 (19%)
Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQEE-------- 176
+V+VNST+S + +P L FH+ + + +S + L F +E H +E
Sbjct: 247 SVVVNSTVSNANHPQQLVFHVVTDRINFGAMSTWFLINDFKGCTVEVHCIDEFSWFNATA 306
Query: 177 ---VKKV-----------IRTASTGVKYSVQNFEEIVP---FVIASVHQSLSKFIYMSPS 219
V+++ ++T +K+ F ++ F I + +L K +++
Sbjct: 307 SSLVRRLSDMETKGSSGGLKTQEREIKFHNPKFVSLLNHLRFYIPQILPNLEKVVFLDDD 366
Query: 220 VIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWVSGK--P 274
V+V+ + +L I+L I A + C E R + Y+N SQP +S K P
Sbjct: 367 VVVQKDLTQLFSIELHGNVIGAVETCLESFHRYHKYLN----------FSQPIISSKIDP 416
Query: 275 YAVNSCMPDLGMLLIDA---RKLEKYILEAFLWWKKVINQRDRSIGRSPAIALALYDRYL 331
Y +C GM + D RK L + W ++ +Q G PA L Y
Sbjct: 417 Y---TCGWAFGMNIFDLIAWRKANATSLYHY-WQEQNADQLLWRTGTLPAGLLTFYGLME 472
Query: 332 KLSSSWLV------TDSTSSVVNKSLAIRYDGPM 359
L W + D ++ + + Y+G M
Sbjct: 473 PLDRRWHLLGLGYDVDIDDRLIESAAVVHYNGNM 506
>gi|33152214|ref|NP_873567.1| glycosyl transferase family protein [Haemophilus ducreyi 35000HP]
gi|6760462|gb|AAF28363.1|AF224467_2 putative glycosyl transferase [Haemophilus ducreyi]
gi|33148436|gb|AAP95956.1| putative glycosyl transferase [Haemophilus ducreyi 35000HP]
Length = 269
Score = 38.9 bits (89), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 38/150 (25%), Positives = 63/150 (42%), Gaps = 37/150 (24%)
Query: 212 KFIYMSPSVIVKGRVEELIGIDLSNYAIAAADD-CSERLNSYVNPDVLDAIQRSASQPWV 270
K IY+ ++V G + EL D+SNY +AA D SE++ YVN + +A
Sbjct: 102 KVIYLDADIVVNGSLTELYQTDISNYFLAAVKDIISEKI--YVNNHIFNA---------- 149
Query: 271 SGKPYAVNSCMPDLGMLLIDARKLEKYILEAFLWWKKVINQRDRSIGRSP-----AIALA 325
GMLLI+ +K ++ + F ++ ++ I P + L
Sbjct: 150 --------------GMLLINNKKWREHNITQF-----CLSLSEKYINSLPDADQSILNLI 190
Query: 326 LYDRYLKLSSSWLVTDSTSSVVNKSLAIRY 355
D++LKL+ + T + K RY
Sbjct: 191 FKDKWLKLNRGYNYLIGTDYLFFKYGKTRY 220
>gi|86438644|emb|CAJ26362.1| glycosyl transferase-like protein [Brachypodium sylvaticum]
Length = 689
Score = 38.9 bits (89), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 39/179 (21%), Positives = 71/179 (39%), Gaps = 26/179 (14%)
Query: 201 FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDV 257
F + + L+K +++ +V+ + L IDL A + C E R + Y+N
Sbjct: 498 FYLPEIFPKLNKVLFLDDDTVVQQDLSALWSIDLKGKVNGAVETCGETFHRFDKYLN--- 554
Query: 258 LDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEKY-ILEAFLWWKKVINQRDR-- 314
S P V+ + +C GM + D + K I + + W+K+ DR
Sbjct: 555 -------FSNPIVANN-FHPQACGWAFGMNMFDLSEWRKQNITDVYHTWQKL--NEDRLL 604
Query: 315 -SIGRSPAIALALYDRYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPMTACSEFG 366
+G PA + ++R L SW + + + ++ I Y+G + E G
Sbjct: 605 WKLGTLPAGLVTFWNRTFPLDRSWHLLGLGYNPNVNERDIRRASVIHYNGNLKPWLEIG 663
>gi|357145600|ref|XP_003573699.1| PREDICTED: probable galacturonosyltransferase 4-like isoform 2
[Brachypodium distachyon]
Length = 660
Score = 38.9 bits (89), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 41/177 (23%), Positives = 72/177 (40%), Gaps = 22/177 (12%)
Query: 201 FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDV 257
F + ++ L K +++ V+VK + L ID+ A + C E R + Y+N
Sbjct: 469 FYLPEIYPKLDKMVFLDDDVVVKKDLTGLWSIDMKGKVNGAVETCGESFHRFDRYLN--- 525
Query: 258 LDAIQRSASQPWVSGKPYAVNSCMPDLGMLLID-ARKLEKYILEAFLWWKKVINQRDR-S 315
S P V K + ++C GM + D A + I E + W+K+ +
Sbjct: 526 -------FSNP-VIAKNFDPHACGWAFGMNVFDLAEWRRQDITEIYHSWQKLSSGLLLWK 577
Query: 316 IGRSPAIALALYDRYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPMTACSEFG 366
+G P + +++ L+ SW V S + ++ I Y+G M E G
Sbjct: 578 LGTLPPGLITFWNKTFPLNRSWHVLGLGYNPHVNSRDIERAAVIHYNGNMKPWLEIG 634
>gi|297606545|ref|NP_001058636.2| Os06g0727300 [Oryza sativa Japonica Group]
gi|54291141|dbj|BAD61814.1| putative glycosyl transferase protein A [Oryza sativa Japonica
Group]
gi|125598568|gb|EAZ38348.1| hypothetical protein OsJ_22722 [Oryza sativa Japonica Group]
gi|255677417|dbj|BAF20550.2| Os06g0727300 [Oryza sativa Japonica Group]
Length = 601
Score = 38.5 bits (88), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 33/146 (22%), Positives = 64/146 (43%), Gaps = 20/146 (13%)
Query: 201 FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDV 257
F + +H L K +++ V+V+ + L +DL A + C E R N+Y+N
Sbjct: 409 FYMPEIHPKLDKILFLDDDVVVQKDLTPLWDVDLKGIVNGAVETCKESFHRFNTYLN--- 465
Query: 258 LDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEKY-ILEAFLWWKKVINQRDRSI 316
S P +S + + ++C GM + D ++ +K I + +W+ + DR +
Sbjct: 466 -------FSHPKIS-ENFDPHACGWAFGMNMFDLKEWKKQNITGIYHYWQDL--NEDRKL 515
Query: 317 GR---SPAIALALYDRYLKLSSSWLV 339
+ P + Y+ L+ +W V
Sbjct: 516 WKLDTLPPGLITFYNLTYPLNRTWHV 541
>gi|224085587|ref|XP_002307628.1| glycosyltransferase, CAZy family GT8 [Populus trichocarpa]
gi|222857077|gb|EEE94624.1| glycosyltransferase, CAZy family GT8 [Populus trichocarpa]
Length = 605
Score = 38.5 bits (88), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 58/269 (21%), Positives = 110/269 (40%), Gaps = 41/269 (15%)
Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKL------KVLFPHSNLEFHG----Q 174
AV+VNST+S + P+ + FH+ + +S + L + S ++F G
Sbjct: 329 AVVVNSTVSSAMEPEKIVFHIVTDSLNLPTISMWFLLNPPGKATIQIQSLVDFKGLSANY 388
Query: 175 EEVKKVIRTASTGVKYSVQNFEEIVPFVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDL 234
K + + + ++ + F + V L+K + V+V+ + L +++
Sbjct: 389 NSTLKQLNSRDSRYTSALNHLR----FYLPDVFPQLNKIVLFDHDVVVQKDLAGLWSLNM 444
Query: 235 SNYAIAAADDCSE------RLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLL 288
I A D C E R++ ++N S P+V K + +C GM L
Sbjct: 445 KGKVIGAVDTCREGEPSFRRMDKFIN----------FSDPFVI-KRFDAKACTWAFGMNL 493
Query: 289 IDARKLEKYILEAFLWWKKVINQRDRSIGRSPAIALA---LYDRYLKLSSSW--LVTDST 343
D ++ ++ L A L+ K + R + ++ ++ L Y+R + L W L
Sbjct: 494 FDLQEWRRHKLTA-LYNKYLQLGHTRQLWKAGSLPLGWATFYNRTVILDRRWHKLGLGHE 552
Query: 344 SSV----VNKSLAIRYDGPMTACSEFGDG 368
+ V V ++ + YDG M + G G
Sbjct: 553 AGVGHDGVEQAAVLHYDGVMKPWLDIGIG 581
>gi|115480327|ref|NP_001063757.1| Os09g0531900 [Oryza sativa Japonica Group]
gi|52075938|dbj|BAD46018.1| glycosyl transferase family 8 protein-like [Oryza sativa Japonica
Group]
gi|52077221|dbj|BAD46265.1| glycosyl transferase family 8 protein-like [Oryza sativa Japonica
Group]
gi|113631990|dbj|BAF25671.1| Os09g0531900 [Oryza sativa Japonica Group]
gi|222641970|gb|EEE70102.1| hypothetical protein OsJ_30110 [Oryza sativa Japonica Group]
Length = 695
Score = 38.5 bits (88), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 58/285 (20%), Positives = 112/285 (39%), Gaps = 55/285 (19%)
Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPK---GSEDM----------------VSFYK------ 159
+V+VNST+ ++ P+ FHL K G+ +M V +K
Sbjct: 397 SVVVNSTIMNAKEPEKHVFHLVTDKLNFGAMNMWFLLNPPGKATIHVENVDEFKWLNSSY 456
Query: 160 ---LKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVP---FVIASVHQSLSKF 213
L+ L + E++ + + + S+ +KY + ++ F + V+ L K
Sbjct: 457 CPVLRQLESAAMKEYYFKADRPTTLSAGSSNLKYRNPKYLSMLNHLRFYLPQVYPKLDKI 516
Query: 214 IYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWV 270
+++ ++V+ + L +DL+ A + C E R + Y+N S P +
Sbjct: 517 LFLDDDIVVQKDLTGLWDVDLNGKVNGAVETCGESFHRFDKYLN----------FSNPHI 566
Query: 271 SGKPYAVNSCMPDLGMLLIDARKLEKYILEAFLWWKKVINQRDR---SIGRSPAIALALY 327
+ + + N+C GM + D ++ +K + + +N+ DR +G P L Y
Sbjct: 567 A-RNFDPNACGWAYGMNIFDLKEWKKKDITGIYHKWQSMNE-DRVLWKLGTLPPGLLTFY 624
Query: 328 DRYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPMTACSEFG 366
L SW V S ++ + + Y+G M E
Sbjct: 625 KLTHPLDKSWHVLGLGYNPSIDRSEIDNAAVVHYNGNMKPWLELA 669
>gi|90657589|gb|ABD96888.1| hypothetical protein [Cleome spinosa]
Length = 351
Score = 38.5 bits (88), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 50/240 (20%), Positives = 92/240 (38%), Gaps = 28/240 (11%)
Query: 95 RNSGSISSISQDDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFH-LFVPKGSED 153
RN + D +HI + LR V S L S P+ L FH L +P +
Sbjct: 53 RNGDQCGTGESDQ--IHIAMTLDTNYLRGTMAAVLSLLQHSTCPENLSFHFLSLPHFETE 110
Query: 154 MVSFYKLKVLFPHSNLEFHG------QEEVKKVIRTA-STGVKYSVQNFEEIVPFVIASV 206
+ F +K FP+ + + + + + K IR A + Y+ +I+P
Sbjct: 111 L--FTSIKSTFPYLSFKIYQFDPNLVRSRISKSIRQALDQPLNYARIYLSDILP------ 162
Query: 207 HQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSAS 266
++ + IY+ ++V +E+L +++ +AA + C +Y S
Sbjct: 163 -SNVDRIIYLDSDLVVVDDIEKLWHVEMEEKVVAAPEYCHANFTNYFTDTFW-------S 214
Query: 267 QPWVSGKPYAVNSCMPDLGMLLIDARKLEK--YILEAFLWWKKVINQRDRSIGRSPAIAL 324
P ++ + C + G++++D K K Y W +R +G P L
Sbjct: 215 DPELAKVLEGRHPCYFNTGVMVVDVEKWRKGGYTQRVEEWMAVQKQKRIYHLGSLPPFLL 274
>gi|449436453|ref|XP_004136007.1| PREDICTED: probable galacturonosyltransferase-like 7-like [Cucumis
sativus]
gi|449505333|ref|XP_004162438.1| PREDICTED: probable galacturonosyltransferase-like 7-like [Cucumis
sativus]
Length = 367
Score = 38.5 bits (88), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 53/267 (19%), Positives = 96/267 (35%), Gaps = 17/267 (6%)
Query: 106 DDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFP 165
D +VH+ + LR VNS L S P+ + FH V + + + V ++ FP
Sbjct: 86 DPSLVHVAITLDVEYLRGSIAAVNSILQHSLCPESVFFHFLVSETNLEAV----VRSAFP 141
Query: 166 HSNLE--FHGQEEVKKVIRTASTGVKYSVQNFEEIVPFVIASVHQSLSKFIYMSPSVIVK 223
+ + V+ +I T+ N+ ++ + + + IY+ ++V
Sbjct: 142 QLKFKVYYFNPAIVQNLISTSVRQALEEPLNYAR--NYLAELLEPCVRRVIYLDSDLVVV 199
Query: 224 GRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPD 283
+ +L +L + I A + C Y + + S ++ KP NS
Sbjct: 200 DDISKLWSTNLGSKTIGAPEYCHANFTKYFTSRFW--LDKRFSGTFLGRKPCYFNS---- 253
Query: 284 LGMLLIDARKLEK--YILEAFLWWKKVINQRDRSIGRSPAIALALYDRYLKLSSSWLVTD 341
G+++ID K + Y W + N R +G P L + W
Sbjct: 254 -GVMVIDLAKWRRAGYTKRIERWMEIQKNNRIYELGSLPPFLLVFAGDVSPIEHRWNQHG 312
Query: 342 STSSVVNKSLAIRYDGPMTACSEFGDG 368
V S + GP++ G G
Sbjct: 313 LGGDNVKGSCRNLHAGPVSLLHWSGSG 339
>gi|218202507|gb|EEC84934.1| hypothetical protein OsI_32147 [Oryza sativa Indica Group]
Length = 695
Score = 38.5 bits (88), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 58/285 (20%), Positives = 112/285 (39%), Gaps = 55/285 (19%)
Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPK---GSEDM----------------VSFYK------ 159
+V+VNST+ ++ P+ FHL K G+ +M V +K
Sbjct: 397 SVVVNSTIMNAKEPEKHVFHLVTDKLNFGAMNMWFLLNPPGKATIHVENVDEFKWLNSSY 456
Query: 160 ---LKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVP---FVIASVHQSLSKF 213
L+ L + E++ + + + S+ +KY + ++ F + V+ L K
Sbjct: 457 CPVLRQLESAAMKEYYFKADRPTTLSAGSSNLKYRNPKYLSMLNHLRFYLPQVYPKLDKI 516
Query: 214 IYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWV 270
+++ ++V+ + L +DL+ A + C E R + Y+N S P +
Sbjct: 517 LFLDDDIVVQKDLTGLWDVDLNGKVNGAVETCGESFHRFDKYLN----------FSNPHI 566
Query: 271 SGKPYAVNSCMPDLGMLLIDARKLEKYILEAFLWWKKVINQRDR---SIGRSPAIALALY 327
+ + + N+C GM + D ++ +K + + +N+ DR +G P L Y
Sbjct: 567 A-RNFDPNACGWAYGMNIFDLKEWKKKDITGIYHKWQSMNE-DRVLWKLGTLPPGLLTFY 624
Query: 328 DRYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPMTACSEFG 366
L SW V S ++ + + Y+G M E
Sbjct: 625 KLTHPLDKSWHVLGLGYNPSIDRSEIDNAAVVHYNGNMKPWLELA 669
>gi|338732825|ref|YP_004671298.1| hypothetical protein SNE_A09300 [Simkania negevensis Z]
gi|336482208|emb|CCB88807.1| hypothetical protein SNE_A09300 [Simkania negevensis Z]
Length = 699
Score = 38.1 bits (87), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 50/190 (26%), Positives = 82/190 (43%), Gaps = 34/190 (17%)
Query: 113 VSW-MQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKG-----SEDMVSFYKLKVL-FP 165
V W ++CL + A+ + + L+G FH F K S +M S Y L++L FP
Sbjct: 213 VDWKIECLGFLIFAIALGAALNG--------FHSFFLKKISHKISGNMTSRYLLQLLKFP 264
Query: 166 HSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVPFVIA---SVHQSLSKFIY--MSPSV 220
L F GQ + +++ K+S + +EEIVP I S+ + F Y + SV
Sbjct: 265 ---LTFFGQRDPREIADRIKEIGKFSQRYYEEIVPTFIQLTLSIFYAAIMFSYSPLVASV 321
Query: 221 IVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVL------DAIQRSASQ-----PW 269
+ G L+ + + + A D ER + + L DAI+ S+ W
Sbjct: 322 TIFGAFLSLLVLQTVSRSRLALYDTWERARNQSDEASLEQMHSIDAIKAVGSEQFFFHKW 381
Query: 270 VSGKPYAVNS 279
+ Y +N+
Sbjct: 382 IKHYTYLLNT 391
>gi|423295529|ref|ZP_17273656.1| hypothetical protein HMPREF1070_02321 [Bacteroides ovatus
CL03T12C18]
gi|392672453|gb|EIY65921.1| hypothetical protein HMPREF1070_02321 [Bacteroides ovatus
CL03T12C18]
Length = 324
Score = 38.1 bits (87), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 22/36 (61%)
Query: 209 SLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADD 244
+L K +Y+ +IV GR+E L IDL Y I A +D
Sbjct: 105 TLDKILYLDCDIIVNGRIESLWNIDLKYYTIGAVED 140
>gi|156066430|gb|ABU43074.1| homogalacturonan alpha-1,4-galacturonosyltransferase [Gossypium
barbadense]
Length = 421
Score = 37.7 bits (86), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 58/276 (21%), Positives = 110/276 (39%), Gaps = 30/276 (10%)
Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQE-EVKKVIRT 183
+V+VNST+S ++ P+ + FH+ + +S + L L P H Q E + T
Sbjct: 145 SVVVNSTISSAKEPEKIVFHVVTDSLNLPAISMWFL--LNPPGKATIHVQSIENFDWLST 202
Query: 184 ASTGVKYSVQNFE-------EIVPFVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSN 236
++++ + F + + +L+K + V+V+ + E+ ID+
Sbjct: 203 KYNSTLNEQKSYDPRYSSALNHLRFYLPDIFPALNKIVLFDHDVVVQRDLTEIWSIDMKG 262
Query: 237 YAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEK 296
A + C E S+ + + + S P+++ + + N C GM L D + +
Sbjct: 263 KVNGAVETCLESEASFRSIQMF----MNFSDPFLARR-FNANVCTWAFGMNLFDLHEWRR 317
Query: 297 YILEAFLWWKKVINQRDRSI---GRSPAIALALYDRYLKLSSSWLV------TDSTSSVV 347
L L+ + RS+ G P + Y++ + L W +D +
Sbjct: 318 KNL-TMLYRNYLQLGLKRSLWKGGSLPIGWITFYNQTVALEKRWHTLGLGYNSDVPPGDI 376
Query: 348 NKSLAIRYDGPMTACSEFGDGANMEPARGDLWKQHL 383
+ I YDG M E G + +G W +HL
Sbjct: 377 ENAAVIHYDGVMKPWLETG----IAKYKG-YWSKHL 407
>gi|15235247|ref|NP_192122.1| putative galacturonosyltransferase-like 6 [Arabidopsis thaliana]
gi|30679061|ref|NP_849285.1| putative galacturonosyltransferase-like 6 [Arabidopsis thaliana]
gi|79324977|ref|NP_001031573.1| putative galacturonosyltransferase-like 6 [Arabidopsis thaliana]
gi|75317781|sp|O04253.1|GATL6_ARATH RecName: Full=Probable galacturonosyltransferase-like 6; AltName:
Full=Like glycosyl transferase 10
gi|2104536|gb|AAC78704.1| predicted glycosyl transferase [Arabidopsis thaliana]
gi|7268597|emb|CAB80706.1| predicted glycosyl transferase [Arabidopsis thaliana]
gi|24030376|gb|AAN41350.1| putative glycosyl transferase [Arabidopsis thaliana]
gi|26453088|dbj|BAC43620.1| putative glycosyl transferase [Arabidopsis thaliana]
gi|110738563|dbj|BAF01207.1| glycosyl transferase like protein [Arabidopsis thaliana]
gi|332656727|gb|AEE82127.1| putative galacturonosyltransferase-like 6 [Arabidopsis thaliana]
gi|332656728|gb|AEE82128.1| putative galacturonosyltransferase-like 6 [Arabidopsis thaliana]
gi|332656729|gb|AEE82129.1| putative galacturonosyltransferase-like 6 [Arabidopsis thaliana]
Length = 346
Score = 37.7 bits (86), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 58/285 (20%), Positives = 107/285 (37%), Gaps = 22/285 (7%)
Query: 91 DSGDR--NSGSISSISQDDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVP 148
++GD +SG + + S +VH+ + LR VNS L S P+ + FH F+
Sbjct: 49 NNGDECLSSGGVCNPS----LVHVAITLDVEYLRGSIAAVNSILQHSVCPESVFFH-FIA 103
Query: 149 KGSEDMVSFYKLKVLFPHSNLEFH--GQEEVKKVIRTASTGVKYSV-QNFEEIVPFVIAS 205
E + ++ +FP + E V+ +I S+ V+ ++ Q ++
Sbjct: 104 VSEETNLLESLVRSVFPRLKFNIYDFAPETVRGLI---SSSVRQALEQPLNYARSYLADL 160
Query: 206 VHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSA 265
+ +++ IY+ ++V + +L L + I A + C Y
Sbjct: 161 LEPCVNRVIYLDSDLVVVDDIAKLWKTSLGSRIIGAPEYCHANFTKYFTGGFW------- 213
Query: 266 SQPWVSGKPYAVNSCMPDLGMLLIDARKLEK--YILEAFLWWKKVINQRDRSIGRSPAIA 323
S+ SG C + G+++ID +K + Y W + +R +G P
Sbjct: 214 SEERFSGTFRGRKPCYFNTGVMVIDLKKWRRGGYTKRIEKWMEIQRRERIYELGSLPPFL 273
Query: 324 LALYDRYLKLSSSWLVTDSTSSVVNKSLAIRYDGPMTACSEFGDG 368
L +S W V S + GP++ G G
Sbjct: 274 LVFSGHVAPISHRWNQHGLGGDNVRGSCRDLHPGPVSLLHWSGSG 318
>gi|326532202|dbj|BAK01477.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 697
Score = 37.7 bits (86), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 38/179 (21%), Positives = 71/179 (39%), Gaps = 26/179 (14%)
Query: 201 FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDV 257
F + + L+K +++ +V+ + L IDL A + C E R + Y+N
Sbjct: 506 FYLPEIFPKLNKVLFLDDDTVVQQDLSALWSIDLKGKVNGAVETCGESFHRFDKYLN--- 562
Query: 258 LDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEKY-ILEAFLWWKKVINQRDR-- 314
S P ++ + +SC GM + D + K I + + W+ + DR
Sbjct: 563 -------FSNPLIASN-FNPHSCGWAYGMNMFDLSEWRKQNITDVYHTWQNL--NEDRLL 612
Query: 315 -SIGRSPAIALALYDRYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPMTACSEFG 366
+G PA + ++R L SW + + + ++ I Y+G + E G
Sbjct: 613 WKLGSLPAGLVTFWNRTFPLDRSWHLLGLGYNPNVNEKEIRRASVIHYNGNLKPWLEIG 671
>gi|387133452|ref|YP_006299424.1| glycosyltransferase family protein [Prevotella intermedia 17]
gi|386376300|gb|AFJ09183.1| glycosyltransferase family 8 [Prevotella intermedia 17]
Length = 328
Score = 37.7 bits (86), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 25/99 (25%), Positives = 41/99 (41%), Gaps = 18/99 (18%)
Query: 166 HSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVPFVIASVHQSLS-------------- 211
H L +GQ+ + ++ ++Y + + + F IA H +LS
Sbjct: 22 HHGLSQNGQQLISNMVLRYGAKIRYYDVDVDRLSSFSIAEYHPNLSIATYFRFFLPSFLD 81
Query: 212 ----KFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCS 246
K +Y+ VIV V EL IDL+ Y +A D +
Sbjct: 82 SSIKKVLYLDCDVIVLKDVSELFHIDLAGYGVAGVKDVT 120
>gi|224110116|ref|XP_002315420.1| glycosyltransferase, CAZy family GT8 [Populus trichocarpa]
gi|222864460|gb|EEF01591.1| glycosyltransferase, CAZy family GT8 [Populus trichocarpa]
Length = 348
Score = 37.7 bits (86), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 50/238 (21%), Positives = 95/238 (39%), Gaps = 27/238 (11%)
Query: 110 VHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNL 169
+HIV + LR + S L S P+ + FH + ++ F +K FP+ N
Sbjct: 64 IHIVMTLDANYLRGTMAAIFSILRHSTCPENMEFHFLWARFDREV--FSSIKSTFPYLNF 121
Query: 170 EFHG------QEEVKKVIRTA-STGVKYSVQNFEEIVPFVIASVHQSLSKFIYMSPSVIV 222
+F+ + ++ K IR + + Y+ +I+P ++ + IY+ +++
Sbjct: 122 KFYRFDSNRVRGKISKSIRQSLDQPLNYARIYLADIIP-------SNVKRVIYLDSDLLL 174
Query: 223 KGRVEELIGIDLSNYAIAAADDCSERLNSYV-NPDVLDAIQRSASQPWVSGKPYAVNSCM 281
+ +L +DL + +AA + C Y N LD P ++ + C
Sbjct: 175 VDDIAKLWEVDLEDRVLAAPEYCHANFTYYFSNLFWLD--------PVLARTFHGRRPCY 226
Query: 282 PDLGMLLIDARKLEKYIL-EAFLWWKKVINQ-RDRSIGRSPAIALALYDRYLKLSSSW 337
+ G++++D K + L + W V Q R +G P L L + W
Sbjct: 227 FNTGVMVVDVEKWRQVQLTQKVEGWMTVQKQKRIYHLGSLPPFLLVLAGNIKGVDHRW 284
>gi|12003394|gb|AAG43554.1|AF211536_1 Avr9/Cf-9 rapidly elicited protein 231 [Nicotiana tabacum]
Length = 353
Score = 37.7 bits (86), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 49/267 (18%), Positives = 103/267 (38%), Gaps = 12/267 (4%)
Query: 106 DDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFP 165
D ++HI + R V+S L + P+ ++FH K + + +FP
Sbjct: 65 DPNLIHIAMTLDSHYFRGSIAAVHSVLKHTSCPENIYFHFVTSKDFDFQQLTQTVMSIFP 124
Query: 166 HSNLEFHGQEEVKKVIRTASTGVKYSVQN-FEEIVPFVIASVHQSLSKFIYMSPSVIVKG 224
+ + + +E+ +V S+ ++ ++ N ++ + + + IY+ VI+
Sbjct: 125 SLSFKVYSFDEL-RVKNLISSSIRQALDNPLNYARTYLAEIIEHCVERVIYLDSDVILVD 183
Query: 225 RVEELIGIDLS-NYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPD 283
+++L I L+ + I A + C +Y N + + S+ + KP C +
Sbjct: 184 DIQKLWSISLTGSRIIGAPEYCHANFRTYFNDNFWSDTK--FSKVFQGKKP-----CYFN 236
Query: 284 LGMLLIDARKLEK--YILEAFLWWKKVINQRDRSIGRSPAIALALYDRYLKLSSSWLVTD 341
G++++D K K Y + W + +R +G P L + W
Sbjct: 237 TGVMVMDLGKWRKGDYTEKIENWMEIQKKKRIYELGSLPPFMLVFGGEIEGIDHKWNQHG 296
Query: 342 STSSVVNKSLAIRYDGPMTACSEFGDG 368
+ S + GP++ G G
Sbjct: 297 LGGDNLVNSCRTLHPGPVSLLHWSGKG 323
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.320 0.134 0.403
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,392,448,583
Number of Sequences: 23463169
Number of extensions: 267765726
Number of successful extensions: 581666
Number of sequences better than 100.0: 180
Number of HSP's better than 100.0 without gapping: 27
Number of HSP's successfully gapped in prelim test: 153
Number of HSP's that attempted gapping in prelim test: 581496
Number of HSP's gapped (non-prelim): 224
length of query: 393
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 249
effective length of database: 8,980,499,031
effective search space: 2236144258719
effective search space used: 2236144258719
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 78 (34.7 bits)