BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 016223
         (393 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|224075287|ref|XP_002304587.1| predicted protein [Populus trichocarpa]
 gi|222842019|gb|EEE79566.1| predicted protein [Populus trichocarpa]
          Length = 379

 Score =  467 bits (1202), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 230/390 (58%), Positives = 288/390 (73%), Gaps = 13/390 (3%)

Query: 3   SSSFLPSRSSASQRRTAQASDNPNTDGTHGPVLEQAPTPKPIFQFIALGLIVFLGLLQFL 62
           SSSFLPS S A+QRRT+Q   NP + G   P +  AP P+PIFQF+A GLI+FL LLQ L
Sbjct: 2   SSSFLPSTSFATQRRTSQKLGNPKSAGPLDPCITPAPNPRPIFQFLAFGLIIFLALLQLL 61

Query: 63  PATHFRHPSDPFRIWVPFNSNTSFSKTRDSGDRNSGSISSISQDDGMVHIVSWMQCLDLR 122
           PATHFR P DPFR W P +SN S S  + SGD            DGMVH+VSWM CLDLR
Sbjct: 62  PATHFRDPFDPFRNWAPLHSNPS-SPVKFSGD------------DGMVHVVSWMDCLDLR 108

Query: 123 LLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQEEVKKVIR 182
            LAVL NSTLS S +PDL+ FH F+P G+ED VSFYKLKVLFPHSNLE HGQE+VK+++R
Sbjct: 109 ALAVLANSTLSSSSHPDLVFFHFFIPGGNEDKVSFYKLKVLFPHSNLEIHGQEQVKEIMR 168

Query: 183 TASTGVKYSVQNFEEIVPFVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAA 242
           T  +G +Y+  ++EEIVPF+I +VH  L KFIY+S +VI+K RVEELIG+DL+NYAIA A
Sbjct: 169 TVFSGGQYAEPSYEEIVPFIIPTVHNFLRKFIYVSANVIMKARVEELIGVDLNNYAIATA 228

Query: 243 DDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEKYILEAF 302
           +DCS RL +YVN DVLDAIQRSAS+ WVS  PYA + C+PD  +L+I+ARKLEK  +E  
Sbjct: 229 EDCSRRLKNYVNLDVLDAIQRSASKAWVSETPYAKDFCLPDFSVLVINARKLEKDFVEIV 288

Query: 303 LWWKKVINQRDRSIGRSPAIALALYDRYLKLSSSWLVTDSTSSVVNKSLAIRYDGPMTAC 362
           LWW K +N R+R+  ++ A+ALALY+ YLKLSSSWLV D +SS VN S+ I YDGP T+C
Sbjct: 289 LWWSKALNLRERTNKKNVAVALALYNSYLKLSSSWLVKDFSSSQVNNSMIIHYDGPKTSC 348

Query: 363 SEFGDGANMEPARGDLWKQHLPPLFYQMVG 392
            E  +G   E + G++W ++LP    +++G
Sbjct: 349 KESINGTASEYSHGNVWTRYLPSTSDRILG 378


>gi|224053693|ref|XP_002297932.1| predicted protein [Populus trichocarpa]
 gi|222845190|gb|EEE82737.1| predicted protein [Populus trichocarpa]
          Length = 387

 Score =  443 bits (1139), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 229/390 (58%), Positives = 290/390 (74%), Gaps = 5/390 (1%)

Query: 3   SSSFLPSRSSASQRRTAQASDNPNTDGTHGPVLEQAPTPKPIFQFIALGLIVFLGLLQFL 62
           +SS+LPS  SA+QRR++Q SDNP T G   P L  AP P+PIFQF+A GL + LGLLQ L
Sbjct: 2   ASSYLPSAYSATQRRSSQTSDNPITAGPLYPGLVPAPNPRPIFQFLAFGLFILLGLLQLL 61

Query: 63  PATHFRHPSDPFRIWVPFNSNTSFSKTRDSGDRNSGSISSISQDDGMVHIVSWMQCLDLR 122
           PATHFR P DP R WVP +SN + S  R+   RN            MVH+VSWM CLDLR
Sbjct: 62  PATHFRDPFDPSRNWVPLHSNLA-SPLREFDARNGDGGGDDG----MVHVVSWMDCLDLR 116

Query: 123 LLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQEEVKKVIR 182
           +LAVL NSTLS S YP+L+ FH F+P G+ED V FYKLKVLFPHSNLE HGQEEVK+++R
Sbjct: 117 VLAVLANSTLSSSSYPELVSFHFFIPGGNEDKVPFYKLKVLFPHSNLEIHGQEEVKEIVR 176

Query: 183 TASTGVKYSVQNFEEIVPFVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAA 242
            A +  +Y+   +EEIVPF+I +VHQ LSKFIY+S +VI+K RVEELIG+DL +YAIA A
Sbjct: 177 IAFSDEQYAKPRYEEIVPFIIPTVHQFLSKFIYVSANVIMKARVEELIGVDLDDYAIATA 236

Query: 243 DDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEKYILEAF 302
           +DCS+RL +YVN +VLDAIQRS S+PWVS  PYA ++C+PDL +L+I+ARKL K I+E  
Sbjct: 237 EDCSQRLKTYVNSEVLDAIQRSVSKPWVSETPYAKDTCLPDLSVLVINARKLGKDIVETV 296

Query: 303 LWWKKVINQRDRSIGRSPAIALALYDRYLKLSSSWLVTDSTSSVVNKSLAIRYDGPMTAC 362
           LWW K +N R+R+  ++ A+ALALY+RYLKLSSSWLV D TS  VN S+ I YDGP T+C
Sbjct: 297 LWWSKALNLRERTDQKNLALALALYNRYLKLSSSWLVKDITSPEVNNSMIIYYDGPKTSC 356

Query: 363 SEFGDGANMEPARGDLWKQHLPPLFYQMVG 392
            +   GA  E + G++W Q+LP +  +++G
Sbjct: 357 IKSISGAASEYSHGNVWTQYLPSISDRILG 386


>gi|225465111|ref|XP_002270401.1| PREDICTED: uncharacterized protein LOC100242682 [Vitis vinifera]
 gi|302143265|emb|CBI20560.3| unnamed protein product [Vitis vinifera]
          Length = 382

 Score =  425 bits (1093), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 209/380 (55%), Positives = 284/380 (74%), Gaps = 10/380 (2%)

Query: 13  ASQRRTAQASDNPNTDGTHGPVLEQAPTPKPIFQFIALGLIVFLGLLQFLPATHFRHPSD 72
           A+ RR  QAS + +  G   P ++ +P P+PI Q +A  ++VFLGLLQFLPATHFRHPSD
Sbjct: 12  ATYRRPHQASTSVSPPGPPTPPVDASPNPRPIAQLLAFVVVVFLGLLQFLPATHFRHPSD 71

Query: 73  PFRIWVPFNSNTSFSKTRDSGDRNSGSISSISQDDGMVHIVSWMQCLDLRLLAVLVNSTL 132
           P R WVPF+S +  S +            S++ +DG +HIVSWM CLDLR+L VL NSTL
Sbjct: 72  PLRKWVPFDSGSPVSSS---------FSRSVASEDGKIHIVSWMDCLDLRVLVVLANSTL 122

Query: 133 SGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSV 192
           S SRYPD +HFH F+P+G +D VS++KLKVLFPHS+LE  GQEEVK+ +RTA++G+ Y+ 
Sbjct: 123 SNSRYPDKVHFHFFIPEGHDDKVSYFKLKVLFPHSDLEIFGQEEVKEKVRTANSGIVYAG 182

Query: 193 QNFEEIVPFVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSY 252
           ++FEEIVPFVI ++H+S  KFIY+SP+VIVKGRVEEL+  +L++YA+A  +DCS+RL++Y
Sbjct: 183 RSFEEIVPFVIPTIHRSWGKFIYISPNVIVKGRVEELLEANLTSYAVAVIEDCSKRLDNY 242

Query: 253 VNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEKYILEAFLWWKKVINQR 312
           VN +VL AIQR+AS+ W+SG PYA+ +CMPD  +LLID RKL+K ++EA LWW KV+N  
Sbjct: 243 VNSEVLAAIQRTASKSWISGTPYAMKACMPDSSILLIDPRKLDKDLVEAILWWSKVLNWS 302

Query: 313 DRSIGRSPAIALALYDRYLKLSSSWLVTDSTSSVVNKSLAIRYDGPMTACSEFGDGANME 372
           +RS  ++PAIALAL+ RY KLSSSWL+  S      + + I YDGP   CS FG+G + +
Sbjct: 303 ERSSPKNPAIALALHGRYSKLSSSWLLGHSRRGTGKEIMIIPYDGPTNMCSGFGNGPS-Q 361

Query: 373 PARGDLWKQHLPPLFYQMVG 392
              G++WKQ+LPP+  +++G
Sbjct: 362 SVPGNIWKQYLPPMADRILG 381


>gi|242037931|ref|XP_002466360.1| hypothetical protein SORBIDRAFT_01g006383 [Sorghum bicolor]
 gi|241920214|gb|EER93358.1| hypothetical protein SORBIDRAFT_01g006383 [Sorghum bicolor]
          Length = 382

 Score =  257 bits (657), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 140/347 (40%), Positives = 204/347 (58%), Gaps = 24/347 (6%)

Query: 39  PTPKPIFQFIALGLIVFLGLLQFLPATHFRHPSDPFRIWVPFNSNTSFSKTRDSGDRNSG 98
           P P P  Q +A+  IV LG LQFLPATHFR P+DP R W+P           D     +G
Sbjct: 48  PRPWPWMQKLAIVAIVVLGCLQFLPATHFRDPNDPQRNWIPI----------DGSGNPTG 97

Query: 99  SISSISQDDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFY 158
           S++ +    G V + SW+ CLDLR LAV+ NSTLS S  P  + FH  +P+G  D  S++
Sbjct: 98  SLNYV----GSVDVFSWISCLDLRTLAVMTNSTLSSSSDPQNISFHFLIPEGGNDKSSYH 153

Query: 159 KLKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVPFVIASVHQSLSKFIYMSP 218
           KLK L P S+L    Q+++K  +  A+    +      E+ PF+IA    S  +++Y++ 
Sbjct: 154 KLKALLPDSDLTVTSQKQIKDKLNVATPEGNFLWLFHMELSPFLIAKSQLSKKRYVYIAA 213

Query: 219 SVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRS-ASQPWVSGKPYAV 277
             I+KG+VE+L  +DL +YAI A  DCS+RLN Y++ DVL AIQR+ A+Q WVS +PY  
Sbjct: 214 DSIIKGKVEDLGRMDLGSYAIGAVLDCSKRLNDYISMDVLSAIQRAGAAQSWVSKEPYNK 273

Query: 278 NSCMPDLGMLLIDARKLEKYILEAFLWWKKVINQRDRSIGRSPAIALALYDRYLKLSSSW 337
           ++C+ D  +LLI+ RKL+K ++ + +WW   IN  ++      A+ LA  D+Y+KL S+W
Sbjct: 274 DACLLDFDVLLIEPRKLDKNLVASIMWWINAINLANQRDQIRLAVTLAFNDKYMKLPSNW 333

Query: 338 LVTDSTSSVVNKSLAIRYDGPMTACSEFGDGANMEPA-RGDLWKQHL 383
              D  + ++N      +DGP   CSE  DG   E +  G  W+Q+L
Sbjct: 334 KRKDGDTDILN------FDGPKIVCSE--DGRQHEQSGYGATWQQYL 372


>gi|147819623|emb|CAN76562.1| hypothetical protein VITISV_010091 [Vitis vinifera]
          Length = 838

 Score =  249 bits (635), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 113/200 (56%), Positives = 161/200 (80%), Gaps = 1/200 (0%)

Query: 135 SRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSVQN 194
           +RYPD +HFH F+P+G +D VS++KLKVLFPHS+LE  GQEEVK+ +RTA++G+ Y+ ++
Sbjct: 620 NRYPDKVHFHFFIPEGHDDKVSYFKLKVLFPHSDLEIFGQEEVKEKVRTANSGIVYAGRS 679

Query: 195 FEEIVPFVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVN 254
           FEEIVPFVI ++H+S  KFIY+SP+VIVKGRVEEL+  +L++YA+A  +DCS+RL++YVN
Sbjct: 680 FEEIVPFVIPTIHRSWXKFIYISPNVIVKGRVEELLEANLTSYAVAVIEDCSKRLDNYVN 739

Query: 255 PDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEKYILEAFLWWKKVINQRDR 314
            +VL AIQR+AS+ W+SG PYA+ +CMPD  +LLID RKL+K ++EA LWW KV+N  +R
Sbjct: 740 SEVLAAIQRTASKSWISGTPYAMKACMPDSSILLIDPRKLDKDLVEAILWWSKVLNWSER 799

Query: 315 SIGRSPAIALALYDRYLKLS 334
            +  +P I   +Y   ++L+
Sbjct: 800 FV-IAPTIFFVVYFGKIRLA 818


>gi|20303602|gb|AAM19029.1|AC084748_19 hypothetical protein [Oryza sativa Japonica Group]
 gi|24899455|gb|AAN65025.1| hypothetical protein [Oryza sativa Japonica Group]
 gi|108711353|gb|ABF99148.1| expressed protein [Oryza sativa Japonica Group]
          Length = 373

 Score =  240 bits (613), Expect = 8e-61,   Method: Compositional matrix adjust.
 Identities = 133/356 (37%), Positives = 204/356 (57%), Gaps = 28/356 (7%)

Query: 31  HGPVLEQAPTPKPIFQFIALGLIVFLGLLQFLPATHFRHPSDPFRIWVPFNSNTSFSKTR 90
           HG + +  P  +P  Q +A+  IV LG LQFLPATHFR P+DP R W+PF          
Sbjct: 34  HGDI-DPPPRARPAVQKLAIAAIVVLGCLQFLPATHFRDPADPQRNWIPF---------- 82

Query: 91  DSGDRNSGSISSISQDDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKG 150
              DR+   +  +  + G V++ SW+ CLDLR LAVL NST+S S  P  + F+  +P+ 
Sbjct: 83  ---DRSRKPVD-LPDEVGSVNVFSWISCLDLRTLAVLTNSTISSSSDPHNISFNFLIPER 138

Query: 151 SEDMVSFYKLKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVPFVIASVHQSL 210
             D   +YKLK + P SN+    Q++++  +  A+           E+ P +I +   S 
Sbjct: 139 GTDKSPYYKLKAVLPDSNVTVASQKKIEDKLNVATPEGNLFWSFPNELSPIIIGTTQFSQ 198

Query: 211 SKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWV 270
            +++Y+S   IVKG++E+L  IDL  YAIAA +DC++R+  YV+ DVL+A+Q++A +  V
Sbjct: 199 KRYVYISADSIVKGKIEDLGRIDLGTYAIAAVEDCTKRIGDYVSIDVLNAVQKTAPKNLV 258

Query: 271 SGKPYAVNSCMPDLGMLLIDARKLEKYILEAFLWWKK---VINQRDRSIGRSPAIALALY 327
             +PY  ++C+ D  +L+++ RKL K ++++ + W +   + N RD+      AI LALY
Sbjct: 259 YTEPYDKDACLLDFDVLVVEPRKLRKDLIDSIMVWFRAFSLANPRDQI---RLAITLALY 315

Query: 328 DRYLKLSSSWLVTDSTSSVVNKSLAIRYDGPMTACSEFGDGANMEPARGDLWKQHL 383
           D YLKL S+W   D+ S ++N      Y+GP   CSE G     E   GD W+ +L
Sbjct: 316 DNYLKLPSNWKRADANSDILN------YNGPKNVCSEGGRQLQEE-GNGDEWQLYL 364


>gi|413932915|gb|AFW67466.1| hypothetical protein ZEAMMB73_148709 [Zea mays]
          Length = 382

 Score =  239 bits (610), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 150/383 (39%), Positives = 217/383 (56%), Gaps = 35/383 (9%)

Query: 10  RSSASQRRTAQASDNPNTDGT---HGPVLEQAPTPKPIFQFIALGLIVFLGLLQFLPATH 66
           R  A+  R+   +  P   G    H  V E  P P P  Q +A+  IV LG LQFLPATH
Sbjct: 16  RRPAAAARSGVGAQEPPPRGQLAIHADV-EPPPRPWPGMQKLAIVAIVVLGCLQFLPATH 74

Query: 67  FRHPSDPFRIWVPFNSNTSFSKTRDSGDRN-SGSISSISQDDGMVHIVSWMQCLDLRLLA 125
           FR P+DP R W+PF+           G RN +GS++ +    G V + SW+ CLDLR LA
Sbjct: 75  FRDPNDPQRNWIPFD-----------GSRNPTGSLNDV----GSVDVFSWISCLDLRTLA 119

Query: 126 VLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQEEVKKVIRTAS 185
           VL NSTLS S  P  + FH  +P+G  D +S+ KLKVL P S+L    Q+++K  +  A+
Sbjct: 120 VLTNSTLSSSSDPHNISFHYLIPEGGNDKLSYLKLKVLLPDSDLTVTSQKQIKDKLNVAT 179

Query: 186 TGVKYSVQNFEEIVPFVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDC 245
               +      E+ PF+IA    S  +F+Y++   I+KG+VE+L  ++L  YAI A +DC
Sbjct: 180 PEGNFLWLFHTELSPFLIAKSQLSKKRFVYIAADSIIKGKVEDLGRMELGGYAIGAVEDC 239

Query: 246 SERLNSYVNPDVLDAIQRS-ASQPWVSGKPYAVNSCMPDLGMLLIDARKLEKYILEAFLW 304
           S+RL+ Y++ DVL AIQR+ A+  WV  +PY  ++C+ D  +LL++ RKLEK ++ + +W
Sbjct: 240 SKRLSDYISMDVLSAIQRAGAAFSWVPKEPYNEDACLLDFDVLLMEPRKLEKTLITSIMW 299

Query: 305 WKKVI---NQRDRSIGRSPAIALALYDRYLKLSSSWLVTDSTSSVVNKSLAIRYDGPMTA 361
           W   I   NQRD       A+ L+   +Y +L S+W   D  ++ +     + +DGP   
Sbjct: 300 WADAIRLANQRDLI---RLAVTLSFNGKYTRLPSNWKRADDDNADI-----LNFDGPKNV 351

Query: 362 CSEFGDGANME-PARGDLWKQHL 383
           CSE  DG   E P  G  W+Q+L
Sbjct: 352 CSE--DGRQHEQPGYGATWQQYL 372


>gi|357113463|ref|XP_003558522.1| PREDICTED: uncharacterized protein LOC100841880 [Brachypodium
           distachyon]
          Length = 362

 Score =  233 bits (594), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 131/349 (37%), Positives = 198/349 (56%), Gaps = 24/349 (6%)

Query: 35  LEQAPTPKPIFQFIALGLIVFLGLLQFLPATHFRHPSDPFRIWVPFNSNTSFSKTRDSGD 94
           +E AP P P  Q +A+  ++ LG LQFLPATHFRHP+DP R WVPF+             
Sbjct: 28  VEPAPRPWPGMQMLAIAAVLVLGGLQFLPATHFRHPADPGRNWVPFDPT----------- 76

Query: 95  RNSGSISSISQDDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDM 154
           R+   IS      G + I SW+ CLDLR LAVL NSTLS S  P  ++F+  +P+G  D 
Sbjct: 77  RHPLDIS------GRIEIFSWISCLDLRTLAVLTNSTLSSSSEPHDVYFNFLIPEGGNDK 130

Query: 155 VSFYKLKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVPFVIASVHQSLSKFI 214
           + FYK+K + P SN+    Q+++K  +  A+    +      E+   +IA+   S  +++
Sbjct: 131 LPFYKIKSVLPDSNITVTSQKQIKDKLNVATPEGNFFWSFHNELSSIIIATTQPSKKRYV 190

Query: 215 YMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKP 274
           Y+S   ++KG++EEL  IDL  YAI A +DCS+R+  Y N DVL+A+ R+A     S +P
Sbjct: 191 YISADSVIKGKIEELARIDLGIYAIGAVEDCSKRVGDYTNMDVLNAVHRTAPTGLASTEP 250

Query: 275 YAVNSCMPDLGMLLIDARKLEKYILEAFLWWKKVINQRDRSIGRSPAIALALYDRYLKLS 334
           Y  ++C+ D+ +L+++ R L++  ++A   W   ++  +   G   AI LA YD YLKL 
Sbjct: 251 YNKDTCLLDIDVLVVEPRNLKRNTIDAIKVWVTGLSLANPRDGIQLAITLAFYDNYLKLP 310

Query: 335 SSWLVTDSTSSVVNKSLAIRYDGPMTACSEFGDGANMEPARGDLWKQHL 383
           SSW   ++ + +      + YDGP   CS  G   + E   G+ W+Q+L
Sbjct: 311 SSWKRGNANADI------LHYDGPKNVCSADGR-QHQEQGSGETWRQYL 352


>gi|125588106|gb|EAZ28770.1| hypothetical protein OsJ_12790 [Oryza sativa Japonica Group]
          Length = 302

 Score =  194 bits (494), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 106/281 (37%), Positives = 168/281 (59%), Gaps = 17/281 (6%)

Query: 108 GMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHS 167
           G V++ SW+ CLDLR LAVL NST+S S  P  + F+  +P+   D   +YKLK + P S
Sbjct: 25  GSVNVFSWISCLDLRTLAVLTNSTISSSSDPHNISFNFLIPERGTDKSPYYKLKAVLPDS 84

Query: 168 NLEFHGQEEVKKVIRTAS--TGVKYSVQNFEEIVPFVIASVHQSLSKFIYMSPSVIVKGR 225
           N+    Q++++  +  A+    + +S  N  E+ P +I +   S  +++Y+S   IVKG+
Sbjct: 85  NVTVASQKKIEDKLNVATPEGNLFWSFPN--ELSPIIIGTTQFSQKRYVYISADSIVKGK 142

Query: 226 VEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLG 285
           +E+L  IDL  YAIAA +DC++R+  YV+ DVL+A+Q++A +  V  +PY  ++C+ D  
Sbjct: 143 IEDLGRIDLGTYAIAAVEDCTKRIGDYVSIDVLNAVQKTAPKNLVYTEPYDKDACLLDFD 202

Query: 286 MLLIDARKLEKYILEAFLWWKK---VINQRDRSIGRSPAIALALYDRYLKLSSSWLVTDS 342
           +L+++ RKL K ++++ + W +   + N RD+      AI LALYD YLKL S+W   D+
Sbjct: 203 VLVVEPRKLRKDLIDSIMVWFRAFSLANPRDQI---RLAITLALYDNYLKLPSNWKRADA 259

Query: 343 TSSVVNKSLAIRYDGPMTACSEFGDGANMEPARGDLWKQHL 383
            S ++N      Y+GP   CSE G     E   GD W+ +L
Sbjct: 260 NSDILN------YNGPKNVCSEGGRQL-QEEGNGDEWQLYL 293


>gi|115455681|ref|NP_001051441.1| Os03g0777600 [Oryza sativa Japonica Group]
 gi|113549912|dbj|BAF13355.1| Os03g0777600, partial [Oryza sativa Japonica Group]
          Length = 289

 Score =  193 bits (491), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 106/281 (37%), Positives = 167/281 (59%), Gaps = 17/281 (6%)

Query: 108 GMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHS 167
           G V++ SW+ CLDLR LAVL NST+S S  P  + F+  +P+   D   +YKLK + P S
Sbjct: 12  GSVNVFSWISCLDLRTLAVLTNSTISSSSDPHNISFNFLIPERGTDKSPYYKLKAVLPDS 71

Query: 168 NLEFHGQEEVKKVIRTAS--TGVKYSVQNFEEIVPFVIASVHQSLSKFIYMSPSVIVKGR 225
           N+    Q++++  +  A+    + +S  N  E+ P +I +   S  +++Y+S   IVKG+
Sbjct: 72  NVTVASQKKIEDKLNVATPEGNLFWSFPN--ELSPIIIGTTQFSQKRYVYISADSIVKGK 129

Query: 226 VEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLG 285
           +E+L  IDL  YAIAA +DC++R+  YV+ DVL+A+Q++A +  V  +PY  ++C+ D  
Sbjct: 130 IEDLGRIDLGTYAIAAVEDCTKRIGDYVSIDVLNAVQKTAPKNLVYTEPYDKDACLLDFD 189

Query: 286 MLLIDARKLEKYILEAFLWWKKVI---NQRDRSIGRSPAIALALYDRYLKLSSSWLVTDS 342
           +L+++ RKL K ++++ + W +     N RD+      AI LALYD YLKL S+W   D+
Sbjct: 190 VLVVEPRKLRKDLIDSIMVWFRAFSLANPRDQI---RLAITLALYDNYLKLPSNWKRADA 246

Query: 343 TSSVVNKSLAIRYDGPMTACSEFGDGANMEPARGDLWKQHL 383
            S ++N      Y+GP   CSE G     E   GD W+ +L
Sbjct: 247 NSDILN------YNGPKNVCSEGGRQL-QEEGNGDEWQLYL 280


>gi|326515044|dbj|BAJ99883.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 370

 Score =  189 bits (481), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 123/355 (34%), Positives = 203/355 (57%), Gaps = 29/355 (8%)

Query: 35  LEQAPTPKPIFQFIALGLIVFLGLLQFLPATHFRHPSDPFRIWVPFNSNTSFSKTRDSGD 94
           +E AP   P  Q +AL  ++ LG LQ LPATH RHP+D  R WVP +       +R S D
Sbjct: 33  VEPAPRAWPGMQMLALAAVLVLGGLQLLPATHTRHPADRSRTWVPVDP------SRHSQD 86

Query: 95  RNSGSISSISQDDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDM 154
                   +S +   V + S + CLDLR LAVL NSTLS S  P  + F+  +P+G  D 
Sbjct: 87  --------LSHEVASVDVFSSISCLDLRTLAVLTNSTLSSSSDPHHVSFNFVIPEGGNDQ 138

Query: 155 VSFYKLKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVPFVIASVHQSLSKFI 214
           V +YK+K + P SN+    Q+++K  +  A+    +      E+ P V+A    S  +++
Sbjct: 139 VPYYKIKAVLPDSNITVTSQKQIKDKLNVATPEGNFFASFPNELSPIVVART-LSRKRYV 197

Query: 215 YMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWV-SGK 273
           Y+S   I+KG++E+L+ IDL ++A+ A++DCS+RL  Y++ DVL+A+QR+A +  V   +
Sbjct: 198 YISADSIIKGKIEDLVRIDLGSHAVGASEDCSKRLGDYISMDVLNAVQRTAPRGLVHHTE 257

Query: 274 PYAVNSCMPDLGMLLIDARKLEKYILEAFLWWK---KVINQRDRSIGRSPAIALALYDRY 330
            +  ++C+ D  +LL++ R +++  +++  +W    K+ N RD  +    A+ LA Y+ Y
Sbjct: 258 SFDKDACLLDFDVLLVEPRNIKRNPIDSIAFWTMAVKLANPRDSVML---AMTLAFYNDY 314

Query: 331 LKLSSSWLVTDSTSSVVNKSLAIRYDGPMTACSEFGDGANMEPARGDLWKQHLPP 385
           LKL ++W   ++ + +      + YDGP   C+E G   + E   G++W+Q+L P
Sbjct: 315 LKLPTNWERANADTDI------LYYDGPKNVCTEDG-RQHQEKGFGEIWQQYLGP 362


>gi|116790957|gb|ABK25803.1| unknown [Picea sitchensis]
          Length = 381

 Score =  182 bits (463), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 116/330 (35%), Positives = 176/330 (53%), Gaps = 21/330 (6%)

Query: 32  GPV---LEQAPTPKPIFQFIALGLIVFLGLLQFLPATHFRHPSDPFRIWVPFNSNTSFSK 88
           GP+   +++AP P      IAL +++    LQ LPATH+R P D  R W       S + 
Sbjct: 31  GPLHIDIDEAPKPWRFAPLIALLIMIIFLFLQLLPATHYRDPFDSHRQW------RSITP 84

Query: 89  TRDSGDRNSGSISSISQDDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVP 148
              + +R    I   ++    VHI SW   LDLR LAVLVNST+S SR P  ++F+LF+P
Sbjct: 85  DVLTANRTITHIPMQAKPLEHVHIFSWSDDLDLRPLAVLVNSTISNSRDPGNIYFYLFIP 144

Query: 149 KGSEDMVSFYKLKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVPFVIASVHQ 208
              E+  S     +LF  SN+  +    +++  +  ++G + S  +F   +PF I  ++Q
Sbjct: 145 DDWEEERSRSIFSLLFRKSNIFIYRHSVIRE--KAFASGGEDSKSDFPYFLPFYIPKIYQ 202

Query: 209 SLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQP 268
           +L +FIY  P +IVKG+VEEL  ++L+N  +AA +DCS     Y+N          +S+P
Sbjct: 203 NLRRFIYAVPDIIVKGKVEELFQVNLTNSPVAAVEDCSHNF-EYINA--------KSSRP 253

Query: 269 WVSGKPYAVNSCMPDLGMLLIDARKLEKYILEAFLWWKKVINQRDRSIGRSPAIALALYD 328
           WV+  PYA N+C+ D  +LL++   L K        WKK+ +   R+     AI LAL  
Sbjct: 254 WVAQTPYAKNTCILDHSVLLVNVELLAKENFMEVTKWKKLFHTGGRNDQSDQAIMLALNG 313

Query: 329 RYLKLSSSWLVTDSTSSVVNKSLAI-RYDG 357
            Y KL +SW   +S  S ++  + I  +DG
Sbjct: 314 NYTKLDASWNSRESRFSGIDSDVKIFHFDG 343


>gi|168026280|ref|XP_001765660.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683086|gb|EDQ69499.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 527

 Score =  168 bits (426), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 107/346 (30%), Positives = 177/346 (51%), Gaps = 43/346 (12%)

Query: 35  LEQAPTPKPIFQFIALGLIVFLGLLQFLPATHFRHPSDPFRIWVP--------------F 80
           LE AP P P   ++AL L+     LQ  P+TH+RHP D  + W+P              F
Sbjct: 90  LEPAPQPWPGLTYVALFLLCGFVALQMYPSTHWRHPQDRTKTWIPHDRTETDQASSTDPF 149

Query: 81  NSNTSFSKTRDSGDRNS-GSISSISQDDG---------------MVHIVSWMQCLDLRLL 124
            S+ S  + + + D  S G++S++S D+                +VHI       D R L
Sbjct: 150 YSSLSEEQGQQAQDLASQGTVSNLSNDEKHDEYTLAFLKKASEQVVHIFVSTDGADFRPL 209

Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPKG----SEDMVSFYK-LKVLFPHSNLEFHGQEEVKK 179
           AVLVNST+S + +P+ LHFHL +P      ++ + +F++  K+     N++F   E  K 
Sbjct: 210 AVLVNSTISNAVHPERLHFHLVLPASHHSRAKHLAAFFQDTKIDIVSENIDFKDME--KH 267

Query: 180 VIRTASTGVKYSVQNFEEIVPFVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAI 239
           +    ++  +  +Q+     PF++    + + +FIY+   ++VKG +EELI IDL N A 
Sbjct: 268 ITFRKNSKARPELQSVYNFAPFLLPLHFKDVGRFIYLDADIVVKGNIEELIQIDLGNRAA 327

Query: 240 AAADDCSERLNSYVNPDVLDAIQRSASQP-WVSGKPYAVNSCMPDLGMLLIDARK-LEKY 297
           AA +DCS+   +Y + + L  IQ    +P WV  +P   ++C+ + G+L+ID  + +++ 
Sbjct: 328 AAVEDCSQTFETYFDFNELAKIQARPEKPTWVPTEPIKPDACVFNRGVLVIDTNQWIKQQ 387

Query: 298 ILEAFLWWKKVINQRDRSIGR----SPAIALALYDRYLKLSSSWLV 339
           + EA LWW       +  + +     P   LALY +Y+KL + W V
Sbjct: 388 VTEAILWWMDEFQSAESVLYKYGLSQPPFLLALYGKYMKLDTPWNV 433


>gi|414876209|tpg|DAA53340.1| TPA: hypothetical protein ZEAMMB73_753685 [Zea mays]
          Length = 403

 Score =  165 bits (418), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 89/214 (41%), Positives = 136/214 (63%), Gaps = 1/214 (0%)

Query: 105 QDDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLF 164
            D G V + SW++CLDLR LAVL NSTLS S  P  + FH  +P+G  D +S++KLKVL 
Sbjct: 120 NDVGSVDVFSWIRCLDLRTLAVLTNSTLSSSSDPHNISFHYLIPEGGNDKLSYFKLKVLL 179

Query: 165 PHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVPFVIASVHQSLSKFIYMSPSVIVKG 224
           P S+L    Q+++K  +  A+    +      E+ PF+IA    S  +F+Y++   I+KG
Sbjct: 180 PDSDLTVTSQKQIKDKLNVATPKGNFLWLFHIELSPFLIAKSQLSKKRFVYIAADSIIKG 239

Query: 225 RVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRS-ASQPWVSGKPYAVNSCMPD 283
           +VE+L  ++L  YAI A +DCS+RL+ Y++ DVL AIQR+ A+  WV  +PY  ++C+ D
Sbjct: 240 KVEDLGRMELGGYAIGAVEDCSKRLSDYISMDVLSAIQRAGATFSWVPKEPYNEDACLLD 299

Query: 284 LGMLLIDARKLEKYILEAFLWWKKVINQRDRSIG 317
             +LL++ RKLEK ++ + +WW     Q+ R +G
Sbjct: 300 FDVLLMEPRKLEKTLITSIMWWATGCLQQCRVMG 333


>gi|414876210|tpg|DAA53341.1| TPA: hypothetical protein ZEAMMB73_753685, partial [Zea mays]
          Length = 218

 Score =  160 bits (406), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 86/202 (42%), Positives = 131/202 (64%), Gaps = 1/202 (0%)

Query: 105 QDDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLF 164
            D G V + SW++CLDLR LAVL NSTLS S  P  + FH  +P+G  D +S++KLKVL 
Sbjct: 10  NDVGSVDVFSWIRCLDLRTLAVLTNSTLSSSSDPHNISFHYLIPEGGNDKLSYFKLKVLL 69

Query: 165 PHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVPFVIASVHQSLSKFIYMSPSVIVKG 224
           P S+L    Q+++K  +  A+    +      E+ PF+IA    S  +F+Y++   I+KG
Sbjct: 70  PDSDLTVTSQKQIKDKLNVATPKGNFLWLFHIELSPFLIAKSQLSKKRFVYIAADSIIKG 129

Query: 225 RVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRS-ASQPWVSGKPYAVNSCMPD 283
           +VE+L  ++L  YAI A +DCS+RL+ Y++ DVL AIQR+ A+  WV  +PY  ++C+ D
Sbjct: 130 KVEDLGRMELGGYAIGAVEDCSKRLSDYISMDVLSAIQRAGATFSWVPKEPYNEDACLLD 189

Query: 284 LGMLLIDARKLEKYILEAFLWW 305
             +LL++ RKLEK ++ + +WW
Sbjct: 190 FDVLLMEPRKLEKTLITSIMWW 211


>gi|302767718|ref|XP_002967279.1| hypothetical protein SELMODRAFT_439806 [Selaginella moellendorffii]
 gi|300165270|gb|EFJ31878.1| hypothetical protein SELMODRAFT_439806 [Selaginella moellendorffii]
          Length = 1342

 Score =  158 bits (400), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 115/373 (30%), Positives = 183/373 (49%), Gaps = 53/373 (14%)

Query: 1   MSSSSFLPSRSSASQRRTAQASDNPNTDGTHGPVL---EQAPTPKPIFQ---FIALGLIV 54
            S +  LP R    +RR      N + DG   P+L   E AP PKP        AL  + 
Sbjct: 12  FSPAGALPVRRPLGKRR------NGDGDGEVDPLLPVPELAPAPKPWKGAPIVAALICVA 65

Query: 55  FLGLLQFLPATHFRHPSDPFRIWVPFNSNTSFSKTRDSGDRNSGSISSISQDDGMVHI-- 112
           FL +LQ LPATH+RHP+DP R W+P           D G+ +   +  +   + +  I  
Sbjct: 66  FL-MLQMLPATHYRHPADPTRTWIPIRQEP------DQGE-SGKEVEEVDAREFVTGIKS 117

Query: 113 --VSWMQCL---DLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHS 167
             +    C    DLR L V++NSTL+ +R+   + FH+   + S+      KLK LFP +
Sbjct: 118 TSIDLFVCTDEKDLRPLVVVINSTLANTRHTQRIRFHIITTE-SQREAWLSKLKALFPLA 176

Query: 168 NLE---------FHGQEEV------KKVIRTASTGVKYSVQNFEEIVPFVIASVHQSLSK 212
            ++         FHG E++        V     +  + ++ +    +PF +  +   + +
Sbjct: 177 AIDMVSFLDIVLFHGSEKIDFEEIGNHVFYRKDSKAREALTSPYNFLPFYLPRMFPGMQR 236

Query: 213 FIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQ-RSASQ--PW 269
            IY+   V+  G +EEL   DL ++ +AA +DCS+   SY N D+L  IQ R AS+  PW
Sbjct: 237 IIYLDSDVV--GDIEELFNTDLEDHPVAAVEDCSQIFGSYFNFDLLHRIQSREASESTPW 294

Query: 270 VSGKPYAVNSCMPDLGMLLIDARK-LEKYILEAFLWWKKVINQRDRSIGR----SPAIAL 324
           +  +P+  ++C+ + G+L+ID RK +E+   EA  WW    +Q  + + +     P   L
Sbjct: 295 IPSQPFDPSACIFNRGVLVIDPRKWIEQNSTEAIEWWLDEFHQAQKPLYKYGVSQPPFLL 354

Query: 325 ALYDRYLKLSSSW 337
           ALY+ Y KL ++W
Sbjct: 355 ALYNHYKKLDTAW 367


>gi|302754034|ref|XP_002960441.1| hypothetical protein SELMODRAFT_437525 [Selaginella moellendorffii]
 gi|300171380|gb|EFJ37980.1| hypothetical protein SELMODRAFT_437525 [Selaginella moellendorffii]
          Length = 1342

 Score =  158 bits (399), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 115/373 (30%), Positives = 183/373 (49%), Gaps = 53/373 (14%)

Query: 1   MSSSSFLPSRSSASQRRTAQASDNPNTDGTHGPVL---EQAPTPKPIFQ---FIALGLIV 54
            S +  LP R    +RR      N + DG   P+L   E AP PKP        AL  + 
Sbjct: 12  FSPAGALPVRRPLGKRR------NGDGDGEVDPLLPVPELAPAPKPWKGAPIVAALICVA 65

Query: 55  FLGLLQFLPATHFRHPSDPFRIWVPFNSNTSFSKTRDSGDRNSGSISSISQDDGMVHI-- 112
           FL +LQ LPATH+RHP+DP R W+P        + +D G+ +   +  +   + +  I  
Sbjct: 66  FL-MLQMLPATHYRHPADPTRTWIPIR------QEQDQGE-SGKEVEEVDAREFVTGIKS 117

Query: 113 --VSWMQCL---DLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHS 167
             +    C    DLR L V++NSTL+ +R+   + FH+   + S+      KLK LFP +
Sbjct: 118 TSIDLFVCTDEKDLRPLVVVINSTLANTRHTQRIRFHIITTE-SQREAWLSKLKALFPLA 176

Query: 168 NLE---------FHGQEEV------KKVIRTASTGVKYSVQNFEEIVPFVIASVHQSLSK 212
            ++         FHG E++        V     +  + ++ +    +PF +  +   + +
Sbjct: 177 AIDMVSFLDIVLFHGSEKIDFEEISNHVFYRKDSKAREALTSPYNFLPFYLPRMFPGMQR 236

Query: 213 FIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQ-RSASQ--PW 269
            IY+   V+  G +EEL   DL ++ +AA +DCS+   SY N D+L  IQ R AS+  PW
Sbjct: 237 IIYLDSDVV--GDIEELFNTDLEDHPVAAVEDCSQIFGSYFNFDLLHRIQSREASESTPW 294

Query: 270 VSGKPYAVNSCMPDLGMLLIDARK-LEKYILEAFLWWKKVINQRDRSIGR----SPAIAL 324
           +  +P+   +C+ + G+L+ID RK +E    EA  WW    +Q  + + +     P   L
Sbjct: 295 IPRQPFDPTACIFNRGVLVIDPRKWIEHNSTEAIEWWLDEFHQAQKPLYKYGVSQPPFLL 354

Query: 325 ALYDRYLKLSSSW 337
           ALY+ Y KL ++W
Sbjct: 355 ALYNHYKKLDTAW 367


>gi|168016352|ref|XP_001760713.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162688073|gb|EDQ74452.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1016

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 68/245 (27%), Positives = 127/245 (51%), Gaps = 13/245 (5%)

Query: 79  PFNSNTSFSKTRDSGDRNSGSISSISQDDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYP 138
           P  +  S   T  S  R    I  + ++D  +H+       DLR LAVL+NS+++   +P
Sbjct: 381 PILNAESEEGTNYSLKREDEPIDVVKRED--IHVFVCTDEADLRPLAVLINSSMANCPHP 438

Query: 139 DLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQ----EEVKKVIR-TASTGVKYSVQ 193
           + L +HL +P    +     +LK LFP++ +E   +     EV++ I     TG +  + 
Sbjct: 439 ERLFYHLVMPYSQRNAAK--RLKHLFPNARVEMAEKYIDIREVEEHITFRNDTGARKELV 496

Query: 194 NFEEIVPFVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYV 253
           +    +PF +   +  + + IY+   ++VKG +E L  +DL  +++AA +DCS+R   Y 
Sbjct: 497 SPYNFLPFYLPKTYSEIRRIIYLDSDIVVKGNLEVLNDVDLEGHSVAAIEDCSQRFQVYF 556

Query: 254 NPDVLDAIQRSAS--QP-WVSGKPYAVNSCMPDLGMLLIDARK-LEKYILEAFLWWKKVI 309
           +   LD I +     +P W+  +P+  ++C+ + G+L+ID  + +E+ I +A +WW    
Sbjct: 557 DFAQLDEIHKRQGPDRPKWLPDEPFNKSACVFNRGVLIIDTNQWIEQNITKAIVWWMDEF 616

Query: 310 NQRDR 314
            + D+
Sbjct: 617 RKADK 621


>gi|168001842|ref|XP_001753623.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162695030|gb|EDQ81375.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 565

 Score =  104 bits (259), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 78/292 (26%), Positives = 145/292 (49%), Gaps = 35/292 (11%)

Query: 79  PFNSNTSFSKTRDSGDRNSGSISSISQDDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYP 138
           P  S  S  +  DS ++   ++ + + +D  +H+       DLR LAVL+NS+++   +P
Sbjct: 178 PIRSTESEDENIDSSNKGDETVEASTLED--IHVFVCTDEADLRPLAVLINSSMANCPHP 235

Query: 139 DLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQ----EEVKKVIR-TASTGVKYSVQ 193
           + L +HL +P    +     +LK L P + +E   +     EV++ I     TG +  + 
Sbjct: 236 ERLFYHLVMPHNQRNAAK--RLKHLLPKARIEMAEKYIDIREVEEHITFRNDTGARKELV 293

Query: 194 NFEEIVPF-------------VIAS----VHQSLSKFIYMSPSVIVKGRVEELIGIDLSN 236
           +    +PF             VI S    + Q   + I  +P ++++G +E L  +DL  
Sbjct: 294 SPYNFLPFYLPKTIFKLLRATVICSFCLAIGQRFIQLISSTPLIVLQGNLEVLNDVDLEG 353

Query: 237 YAIAAADDCSERLNSYVNPDVLDAIQRSAS--QP-WVSGKPYAVNSCMPDLGMLLIDARK 293
           +++AA +DCS+R   Y +   LD IQ+     +P W+  +P+  ++C+ + G+L+ID ++
Sbjct: 354 HSVAAIEDCSQRFQVYFDFAQLDEIQKRQGPDRPSWLPDEPFNKSACVFNRGVLVIDTKE 413

Query: 294 -LEKYILEAFLWWKKVINQRDR----SIGRS-PAIALALYDRYLKLSSSWLV 339
            +++ I +A +WW     + D+      G S P   LALY ++  L  +W V
Sbjct: 414 WIDQNITKAIVWWMDEFRKADKKALYKAGMSQPPFLLALYGKHKVLDETWNV 465


>gi|168010261|ref|XP_001757823.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162691099|gb|EDQ77463.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 264

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 57/153 (37%), Positives = 83/153 (54%), Gaps = 6/153 (3%)

Query: 193 QNFEEIVPFVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSY 252
           Q+    VPF++    + + + IY+   V+VKG +EEL+ IDL N AIAA +DCS++L +Y
Sbjct: 18  QSMYSFVPFLLPQYFKDVGRLIYLDADVVVKGNIEELMHIDLENKAIAAVEDCSQKLETY 77

Query: 253 VNPDVLDAIQRSASQP-WVSGKPYAVNSCMPDLGMLLIDARKLEK-YILEAFLWWKKVIN 310
            + D L  IQ    +P WV  +P   N+C  + G+L+ID     K  + +A  WW     
Sbjct: 78  FDLDRLAKIQARPEKPAWVPAEPINPNACGLNEGVLVIDTNPWNKQQVTKAIFWWMDEFR 137

Query: 311 QRDRSIGR----SPAIALALYDRYLKLSSSWLV 339
             D ++ +     P   LALY RY KL S W V
Sbjct: 138 SADSALYKHGFSQPLFLLALYGRYKKLDSPWNV 170


>gi|219363365|ref|NP_001136812.1| uncharacterized protein LOC100216958 [Zea mays]
 gi|194697210|gb|ACF82689.1| unknown [Zea mays]
          Length = 126

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 45/109 (41%), Positives = 63/109 (57%)

Query: 105 QDDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLF 164
            D G V + SW+ CLDLR LAVL NSTLS S  P  + FH  +P+G  D +S++KLKVL 
Sbjct: 10  NDVGSVDVFSWISCLDLRSLAVLTNSTLSSSSDPHNISFHYLIPEGGNDKLSYFKLKVLL 69

Query: 165 PHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVPFVIASVHQSLSKF 213
           P S+L    Q+++K  +  A+    +      E+ PF+IA    S  +F
Sbjct: 70  PDSDLTVTSQKQIKDKLNVATPEGNFLWLFHTELSPFLIAKSQLSKKRF 118


>gi|323452866|gb|EGB08739.1| hypothetical protein AURANDRAFT_5202, partial [Aureococcus
           anophagefferens]
          Length = 153

 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 45/136 (33%), Positives = 68/136 (50%), Gaps = 12/136 (8%)

Query: 211 SKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWV 270
            K +Y+   V+V+G V EL  ID+  +A AA +DCS+++  YVN ++L  +       W 
Sbjct: 19  EKILYLDTDVVVRGDVGELAAIDMEGFAAAAVEDCSQQVAKYVNLELLADV-----DAWG 73

Query: 271 SGKPYAVN--SCMPDLGMLLID-AR----KLEKYILEAFLWWKKVINQRDRSIGRSPAIA 323
            G     +  +C+ + G++L D AR    +L + I E    + K   +  R     P   
Sbjct: 74  LGARVREHGGACVFNRGVVLFDPARWRNLRLTETIEELVAAFTKSSARLWRGGISQPPFL 133

Query: 324 LALYDRYLKLSSSWLV 339
           LAL  RYLKL  SW V
Sbjct: 134 LALAGRYLKLDISWNV 149


>gi|449469050|ref|XP_004152234.1| PREDICTED: probable galacturonosyltransferase-like 7-like [Cucumis
           sativus]
 gi|449515897|ref|XP_004164984.1| PREDICTED: probable galacturonosyltransferase-like 7-like [Cucumis
           sativus]
          Length = 367

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 75/310 (24%), Positives = 118/310 (38%), Gaps = 36/310 (11%)

Query: 49  ALGLIVFLGLLQ-FLPATHFRHPSDPFRIWVPFNSNTS-----FSKTRDSGDRNSGSISS 102
           A+ +IV    LQ F PA   R     F + +P ++ +S     FS  + S  RN+G    
Sbjct: 14  AMVMIVLAPALQSFPPAEAIRSSHLDFYLRLPVDTASSRSLNRFSFRKASTFRNAGQCGG 73

Query: 103 ISQ---------DDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGS-E 152
                       D  +VH+   +    LR     VNS L  S  P+ + FH  V   S E
Sbjct: 74  GDSRFSGKFGVCDPNLVHVAITLDVEYLRGSVAAVNSILRNSLCPESVFFHFLVSDTSLE 133

Query: 153 DMVSFYKLKVLFPHSNLE--FHGQEEVKKVIRTASTGVKYSVQNFEEIVPFVIASVHQS- 209
           D V     +  FP  N +  +   E V+ +I   ST V+ +++         +A + +S 
Sbjct: 134 DFV-----RSTFPQMNFKVYYFDPEIVRNLI---STSVRQALEQPLNYARNYLAGLLESC 185

Query: 210 LSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPW 269
           + K IY+   +IV   + +L   +L  + I A + C    + Y          R  S   
Sbjct: 186 VKKVIYLDSDLIVVDDIRKLWTTNLGEWTIGAPEYCHANFSKYFT-------TRFWSDER 238

Query: 270 VSGKPYAVNSCMPDLGMLLIDARKLEK-YILEAFLWWKKV-INQRDRSIGRSPAIALALY 327
             G       C  + G+++ID  K       E   WW K+  + R   +G  P   L   
Sbjct: 239 FFGTFAGRKPCYFNTGVMVIDLVKWRNGGYTEKIEWWMKLQKSNRIYELGSLPPFLLVFA 298

Query: 328 DRYLKLSSSW 337
                +   W
Sbjct: 299 GNVATIEHRW 308


>gi|323447251|gb|EGB03182.1| hypothetical protein AURANDRAFT_34357 [Aureococcus anophagefferens]
          Length = 185

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 46/181 (25%), Positives = 82/181 (45%), Gaps = 41/181 (22%)

Query: 192 VQNFEEIVPFVIASVHQSL-----SKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCS 246
           V+  E + PF  A+ +         + +Y+    IV+G V EL  +DL     AA +DC+
Sbjct: 9   VKRKELLSPFNFAAFYLPYVLLESRRVLYLDTDAIVEGDVGELAHLDLGGAPAAAVEDCT 68

Query: 247 ERLNSYVNPDVLDA-----------IQRSASQPW----VSGKPYAVNSCMPDLGMLLIDA 291
           +++  Y+N ++L+            + R+A + W     +   Y+  +C+ + G++L D 
Sbjct: 69  QKVFKYINYELLERYDSGGRSKLGPMNRNAPRLWSRFGFTADAYSNETCVFNRGVVLFDC 128

Query: 292 ---RKLE-----KYILEAFL-----WWKKVINQRDRSIGRSPAIALALYDRYLKLSSSWL 338
              R+L      + +++AF+      W+  I+Q        P   LAL  RY KL   W 
Sbjct: 129 PRWRELRLTETIEDLVDAFVASRAKLWRGGISQ--------PPFLLALAGRYFKLDMEWN 180

Query: 339 V 339
           V
Sbjct: 181 V 181


>gi|218193839|gb|EEC76266.1| hypothetical protein OsI_13731 [Oryza sativa Indica Group]
          Length = 83

 Score = 50.1 bits (118), Expect = 0.002,   Method: Composition-based stats.
 Identities = 27/63 (42%), Positives = 35/63 (55%), Gaps = 7/63 (11%)

Query: 321 AIALALYDRYLKLSSSWLVTDSTSSVVNKSLAIRYDGPMTACSEFGDGANMEPARGDLWK 380
           A+ LALYD YLKL S+W   D+ S ++N      Y+GP   CSE G     E   GD W+
Sbjct: 19  AVTLALYDNYLKLPSNWKRADANSDILN------YNGPKNVCSEGGRQLQ-EEGNGDEWQ 71

Query: 381 QHL 383
            +L
Sbjct: 72  LYL 74


>gi|15217851|ref|NP_171772.1| galacturonosyltransferase 5 [Arabidopsis thaliana]
 gi|42571307|ref|NP_973744.1| galacturonosyltransferase 5 [Arabidopsis thaliana]
 gi|75172933|sp|Q9FWY9.1|GATL5_ARATH RecName: Full=Probable galacturonosyltransferase-like 5
 gi|9972380|gb|AAG10630.1|AC022521_8 Unknown protein [Arabidopsis thaliana]
 gi|25083419|gb|AAN72073.1| Unknown protein [Arabidopsis thaliana]
 gi|26452192|dbj|BAC43184.1| unknown protein [Arabidopsis thaliana]
 gi|31711862|gb|AAP68287.1| At1g02720 [Arabidopsis thaliana]
 gi|332189341|gb|AEE27462.1| galacturonosyltransferase 5 [Arabidopsis thaliana]
 gi|332189342|gb|AEE27463.1| galacturonosyltransferase 5 [Arabidopsis thaliana]
          Length = 361

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 78/338 (23%), Positives = 135/338 (39%), Gaps = 36/338 (10%)

Query: 49  ALGLIVFLGLLQ-FLPATHFR--HP-SDPFRIWVPFNSNTSFSKTRDSGD-RNSGSISSI 103
           AL +I+    LQ F PA   R  HP +D F+       N+ +S  R+S   RN+    S 
Sbjct: 14  ALAMILLSPSLQSFSPAAAIRSSHPYADEFKP----QQNSDYSSFRESPMFRNAEQCRSS 69

Query: 104 SQDDG-----MVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFY 158
            +D G     +VH+   +    LR     VNS L  S  P  + FH  V   S+++ S  
Sbjct: 70  GEDSGVCNPNLVHVAITLDIDYLRGSIAAVNSILQHSMCPQSVFFHFLVSSESQNLESL- 128

Query: 159 KLKVLFPH-SNLE--FHGQEEVKKVIRTASTGVKYSV-QNFEEIVPFVIASVHQSLSKFI 214
            ++  FP  +NL+  +   E V+ +I   S+ V+ ++ Q       ++   +   + + I
Sbjct: 129 -IRSTFPKLTNLKIYYFAPETVQSLI---SSSVRQALEQPLNYARNYLADLLEPCVKRVI 184

Query: 215 YMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKP 274
           Y+   ++V   + +L    L    I A + C      Y             S    +G  
Sbjct: 185 YLDSDLVVVDDIVKLWKTGLGQRTIGAPEYCHANFTKYFTGGFW-------SDKRFNGTF 237

Query: 275 YAVNSCMPDLGMLLIDARKLEKY----ILEAFLWWKKVINQRDRSIGRSPAIALALYDRY 330
              N C  + G+++ID +K  ++     +E ++  +K+  +R   +G  P   L      
Sbjct: 238 KGRNPCYFNTGVMVIDLKKWRQFRFTKRIEKWMEIQKI--ERIYELGSLPPFLLVFAGHV 295

Query: 331 LKLSSSWLVTDSTSSVVNKSLAIRYDGPMTACSEFGDG 368
             +S  W         V  S    + GP++     G G
Sbjct: 296 APISHRWNQHGLGGDNVRGSCRDLHSGPVSLLHWSGSG 333


>gi|224089503|ref|XP_002308736.1| glycosyltransferase [Populus trichocarpa]
 gi|222854712|gb|EEE92259.1| glycosyltransferase [Populus trichocarpa]
          Length = 648

 Score = 47.8 bits (112), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 61/280 (21%), Positives = 110/280 (39%), Gaps = 49/280 (17%)

Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKL-----KVLFPHSNL---------- 169
           AV+VNST++ S++P  L FHL   + S   +  + L     K      N+          
Sbjct: 354 AVVVNSTVTNSKHPSKLVFHLVSDRLSYAAMRMWFLVNPPGKATIQVQNIDEFTWLNSSY 413

Query: 170 -----EFHGQEEVKKVIRTAS----TGVKYSVQNFEEIVP---FVIASVHQSLSKFIYMS 217
                + H Q  +    R  S    + +KY    +  I+    F +  +   L+K +++ 
Sbjct: 414 SPVLKQLHSQSMIDYYFRAHSANSDSNLKYRNPKYLSILNHLRFYLPEIFPKLNKVLFLD 473

Query: 218 PSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWVSGKP 274
             ++V+  +  L  +DL      A + C E   R ++Y+N           S P +S   
Sbjct: 474 DDIVVQKDLTGLWSLDLKGKVNGAVETCRESFHRFDTYLN----------FSNPLISNN- 522

Query: 275 YAVNSCMPDLGMLLIDARKLEKY-ILEAFLWWKKVINQRDR-SIGRSPAIALALYDRYLK 332
           +   +C    GM L D  + ++  I + +  W+K+ + R    +G  P   + L+ R   
Sbjct: 523 FDPRACGWAYGMNLFDLEEWKRQNITDVYHSWQKLNHDRQLWKLGTLPPGLITLWKRTHP 582

Query: 333 LSSSWLVT------DSTSSVVNKSLAIRYDGPMTACSEFG 366
           L   W V       + +   + +   I Y+G M    E G
Sbjct: 583 LDRRWHVLGLGYNPNVSQIEIERGAVIHYNGNMKPWLEIG 622


>gi|115475918|ref|NP_001061555.1| Os08g0327100 [Oryza sativa Japonica Group]
 gi|38423965|dbj|BAD01674.1| putative glycosyltransferase [Oryza sativa Japonica Group]
 gi|38637194|dbj|BAD03445.1| putative glycosyltransferase [Oryza sativa Japonica Group]
 gi|113623524|dbj|BAF23469.1| Os08g0327100 [Oryza sativa Japonica Group]
 gi|222640351|gb|EEE68483.1| hypothetical protein OsJ_26894 [Oryza sativa Japonica Group]
          Length = 643

 Score = 47.8 bits (112), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 64/282 (22%), Positives = 113/282 (40%), Gaps = 53/282 (18%)

Query: 125 AVLVNSTLSGSRYPDLLHFHLFV---------------PKGS-----EDMVSFYKLKVLF 164
           AV+VNST+  +++P    FH+                 P G       ++  F  L   +
Sbjct: 349 AVVVNSTVLNAKHPSHHVFHIVTDRLNYAPMRMWFLSNPPGKATIEVRNIEEFTWLNASY 408

Query: 165 PHSNLEFHGQEEVKKVIRT----ASTGVKYSVQNFEEIVP---FVIASVHQSLSKFIYMS 217
                +   Q  +    RT    + + +KY    +  I+    F +  ++ +L K +++ 
Sbjct: 409 SPVLKQLESQSMIDYYFRTHRANSDSNLKYRNPKYLSILNHLRFYLPEIYPNLHKIVFLD 468

Query: 218 PSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWVSGKP 274
             V++K  +  L  ID+    I   + C E   R + Y+N           S P V  K 
Sbjct: 469 DDVVIKKDLTSLWSIDMKGKVIGVVETCGESFHRFDRYLN----------FSNP-VIVKN 517

Query: 275 YAVNSCMPDLGMLLID-ARKLEKYILEAFLWWKKVINQRDR---SIGRSPAIALALYDRY 330
           +  ++C    GM + D A    + I E +  W+K +NQ DR    +G  P   +  +++ 
Sbjct: 518 FDPHACGWAFGMNVFDLAEWRRQNITEIYHSWQK-LNQ-DRLLWKLGTLPPGLITFWNKT 575

Query: 331 LKLSSSWLVT------DSTSSVVNKSLAIRYDGPMTACSEFG 366
           L L+ SW V         +S  + ++  I Y+G M    E G
Sbjct: 576 LPLNRSWHVLGLGYNPHVSSRDIERAAVIHYNGNMKPWLEIG 617


>gi|449458506|ref|XP_004146988.1| PREDICTED: probable galacturonosyltransferase 6-like [Cucumis
           sativus]
          Length = 603

 Score = 47.4 bits (111), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 66/276 (23%), Positives = 114/276 (41%), Gaps = 30/276 (10%)

Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQEEVKKVIRTA 184
           AV+VNST+S +  P+ + FHL     +   +S + L      + +E    E+ K +    
Sbjct: 327 AVVVNSTISSATEPEKIVFHLVTNSLNLPAMSMWFLLNPPGKATIEVLSMEDFKWLSNEY 386

Query: 185 STGVKYSVQNFEE--------IVPFVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSN 236
             G  + +QN  +         + F + ++  SL K I +   V+V+  +  L  + +  
Sbjct: 387 DLG--WKMQNSSDPRFTSELNYLRFYLPNIFPSLDKVILLDHDVVVQKDLSGLWHVGMKG 444

Query: 237 YAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEK 296
               A + C +   S++  D+        S P V  K +   +C    GM L D R+  +
Sbjct: 445 KVNGAVETCQDTEVSFLRMDMFINF----SDP-VINKKFNNKACTWAFGMNLFDLRRWRE 499

Query: 297 YILEAFLWWKKVINQRDRSI---GRSPAIALALYDRYLKLSSSWLVT----DSTS--SVV 347
             L A L+ K +    +R I   G  P   +  Y++   L   W V     DST    ++
Sbjct: 500 ENLTA-LYHKYLRLSNERPILKGGSLPLGWVTFYNQTTALERRWHVLGLGHDSTVLLDII 558

Query: 348 NKSLAIRYDGPMTACSEFGDGANMEPARGDLWKQHL 383
            K+  I YDG      + G G   E     LW++++
Sbjct: 559 RKAAVIHYDGVRKPWLDIGFGEYKE-----LWRKYI 589


>gi|218200956|gb|EEC83383.1| hypothetical protein OsI_28798 [Oryza sativa Indica Group]
          Length = 621

 Score = 47.4 bits (111), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 64/282 (22%), Positives = 113/282 (40%), Gaps = 53/282 (18%)

Query: 125 AVLVNSTLSGSRYPDLLHFHLFV---------------PKGS-----EDMVSFYKLKVLF 164
           AV+VNST+  +++P    FH+                 P G       ++  F  L   +
Sbjct: 327 AVVVNSTVLNAKHPSHHVFHIVTDRLNYAPMRMWFLSNPPGKATIEVRNIEEFTWLNASY 386

Query: 165 PHSNLEFHGQEEVKKVIRT----ASTGVKYSVQNFEEIVP---FVIASVHQSLSKFIYMS 217
                +   Q  +    RT    + + +KY    +  I+    F +  ++ +L K +++ 
Sbjct: 387 SPVLKQLESQSMIDYYFRTHRANSDSNLKYRNPKYLSILNHLRFYLPEIYPNLHKIVFLD 446

Query: 218 PSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWVSGKP 274
             V++K  +  L  ID+    I   + C E   R + Y+N           S P V  K 
Sbjct: 447 DDVVIKKDLTSLWSIDMKGKVIGVVETCGESFHRFDRYLN----------FSNP-VIVKN 495

Query: 275 YAVNSCMPDLGMLLID-ARKLEKYILEAFLWWKKVINQRDR---SIGRSPAIALALYDRY 330
           +  ++C    GM + D A    + I E +  W+K +NQ DR    +G  P   +  +++ 
Sbjct: 496 FDPHACGWAFGMNVFDLAEWRRQNITEIYHSWQK-LNQ-DRLLWKLGTLPPGLITFWNKT 553

Query: 331 LKLSSSWLVT------DSTSSVVNKSLAIRYDGPMTACSEFG 366
           L L+ SW V         +S  + ++  I Y+G M    E G
Sbjct: 554 LPLNRSWHVLGLGYNPHVSSRDIERAAVIHYNGNMKPWLEIG 595


>gi|334182350|ref|NP_001184925.1| alpha-1,4-galacturonosyltransferase [Arabidopsis thaliana]
 gi|332189915|gb|AEE28036.1| alpha-1,4-galacturonosyltransferase [Arabidopsis thaliana]
          Length = 602

 Score = 47.4 bits (111), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 60/284 (21%), Positives = 109/284 (38%), Gaps = 44/284 (15%)

Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQEEVKKVIRTA 184
           +V+VNST+S S+ P+ + FH+     +   +S + L  +   + ++    +++  + R  
Sbjct: 326 SVVVNSTISSSKEPERIVFHVVTDSLNYPAISMWFLLNIQSKATIQILNIDDMDVLPRDY 385

Query: 185 STGVKYSVQNFEEIVP------FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYA 238
              +     N    +       F +  +   L+K + +   V+V+  +  L  ID+    
Sbjct: 386 DQLLMKQNSNDPRFISTLNHARFYLPDIFPGLNKMVLLDHDVVVQRDLSRLWSIDMKGKV 445

Query: 239 IAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLLID-----ARK 293
           + A + C E  +S+ +           S  WV+GK ++  +C    GM LID      RK
Sbjct: 446 VGAVETCLEGESSFRSMSTFINF----SDTWVAGK-FSPRACTWAFGMNLIDLEEWRIRK 500

Query: 294 LEKYILEAF-------LWWKKVINQRDRSIGRSPAIALALYDRYLKLSSSWLV------T 340
           L    ++ F       LW            G  P   L  Y + L L   W V      +
Sbjct: 501 LTSTYIKYFNLGTKRPLW----------KAGSLPIGWLTFYRQTLALDKRWHVMGLGRES 550

Query: 341 DSTSSVVNKSLAIRYDGPMTACSEFGDGANMEPARGDLWKQHLP 384
              +  + ++  I YDG M    + G     +      W  H+P
Sbjct: 551 GVKAVDIEQAAVIHYDGVMKPWLDIG-----KENYKRYWNIHVP 589


>gi|18390688|ref|NP_563771.1| alpha-1,4-galacturonosyltransferase [Arabidopsis thaliana]
 gi|75191689|sp|Q9M9Y5.1|GAUT6_ARATH RecName: Full=Probable galacturonosyltransferase 6
 gi|7523701|gb|AAF63140.1|AC011001_10 Unknown protein [Arabidopsis thaliana]
 gi|15028087|gb|AAK76574.1| unknown protein [Arabidopsis thaliana]
 gi|20259311|gb|AAM14391.1| unknown protein [Arabidopsis thaliana]
 gi|332189914|gb|AEE28035.1| alpha-1,4-galacturonosyltransferase [Arabidopsis thaliana]
          Length = 589

 Score = 47.4 bits (111), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 60/284 (21%), Positives = 109/284 (38%), Gaps = 44/284 (15%)

Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQEEVKKVIRTA 184
           +V+VNST+S S+ P+ + FH+     +   +S + L  +   + ++    +++  + R  
Sbjct: 313 SVVVNSTISSSKEPERIVFHVVTDSLNYPAISMWFLLNIQSKATIQILNIDDMDVLPRDY 372

Query: 185 STGVKYSVQNFEEIVP------FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYA 238
              +     N    +       F +  +   L+K + +   V+V+  +  L  ID+    
Sbjct: 373 DQLLMKQNSNDPRFISTLNHARFYLPDIFPGLNKMVLLDHDVVVQRDLSRLWSIDMKGKV 432

Query: 239 IAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLLID-----ARK 293
           + A + C E  +S+ +           S  WV+GK ++  +C    GM LID      RK
Sbjct: 433 VGAVETCLEGESSFRSMSTFINF----SDTWVAGK-FSPRACTWAFGMNLIDLEEWRIRK 487

Query: 294 LEKYILEAF-------LWWKKVINQRDRSIGRSPAIALALYDRYLKLSSSWLV------T 340
           L    ++ F       LW            G  P   L  Y + L L   W V      +
Sbjct: 488 LTSTYIKYFNLGTKRPLW----------KAGSLPIGWLTFYRQTLALDKRWHVMGLGRES 537

Query: 341 DSTSSVVNKSLAIRYDGPMTACSEFGDGANMEPARGDLWKQHLP 384
              +  + ++  I YDG M    + G     +      W  H+P
Sbjct: 538 GVKAVDIEQAAVIHYDGVMKPWLDIG-----KENYKRYWNIHVP 576


>gi|357111850|ref|XP_003557723.1| PREDICTED: probable galacturonosyltransferase 11-like [Brachypodium
           distachyon]
          Length = 539

 Score = 46.6 bits (109), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 63/275 (22%), Positives = 109/275 (39%), Gaps = 54/275 (19%)

Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQEE-------- 176
           +V+VNST+S + +P  L FH+   +     +S + L   F    +E    +E        
Sbjct: 247 SVVVNSTVSNANHPQQLVFHVVTDRIHYGAMSTWFLMNDFKGCTVEVRCIDEFPWLNAAS 306

Query: 177 ---VKKV------------IRTASTGVKYSVQNFEEIVP---FVIASVHQSLSKFIYMSP 218
              V+++            ++T    +K+    F  ++    F I  +  +L K I++  
Sbjct: 307 SPLVRRLSEMETKGYYYGGLKTPEREIKFHNPKFVSLLNHLRFYIPQILPNLEKVIFLDD 366

Query: 219 SVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWVSGK-- 273
            V+V+  + +L  I+L    I A + C E   R + Y+N           SQP +S K  
Sbjct: 367 DVVVQKDLTQLFSIELHGNVIGAVETCLESFHRYHKYLN----------FSQPIISSKID 416

Query: 274 PYAVNSCMPDLGMLLIDA---RKLEKYILEAFLWWKKVINQRDRSIGRSPAIALALYDRY 330
           P+   +C    GM + D    RK     L  + W ++ ++Q     G  PA  L  Y   
Sbjct: 417 PH---TCGWAFGMNIFDLIAWRKANATALYHY-WEEQNMDQLLWRTGTLPAGLLTFYGLM 472

Query: 331 LKLSSSWLV------TDSTSSVVNKSLAIRYDGPM 359
             L   W V       D    ++  +  + Y+G M
Sbjct: 473 EPLDRRWHVLGLGYDVDIDDRLIESAAVVHYNGNM 507


>gi|356536655|ref|XP_003536852.1| PREDICTED: probable galacturonosyltransferase 6-like [Glycine max]
          Length = 585

 Score = 46.2 bits (108), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 39/169 (23%), Positives = 72/169 (42%), Gaps = 25/169 (14%)

Query: 201 FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE------RLNSYVN 254
           F +  +  +L+K + +   V+V+  + EL  I++    I A   C E      R++ ++N
Sbjct: 395 FYLPDIFPALNKIVLLDHDVVVQQDLSELWNINMKGNVIGAVGTCQEGKIPFYRIDMFIN 454

Query: 255 PDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEKYILEAFLWWKKVINQRDR 314
                      S P + GK +  N+C    GM L D ++  ++ L A   ++  +     
Sbjct: 455 ----------LSDPLI-GKRFDANACTWAFGMNLFDLQQWRRHNLTAV--YQNYVQMGLW 501

Query: 315 SIGRSPAIALALYDRYLKLSSSWLV------TDSTSSVVNKSLAIRYDG 357
           +IG  P   L  Y++   L   W V      +D   + + ++  I YDG
Sbjct: 502 NIGSLPLGWLTFYNKTELLDRQWHVLGLGYSSDVDRNEIEQASVIHYDG 550


>gi|356504981|ref|XP_003521271.1| PREDICTED: probable galacturonosyltransferase 6-like [Glycine max]
          Length = 625

 Score = 45.8 bits (107), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 62/256 (24%), Positives = 111/256 (43%), Gaps = 29/256 (11%)

Query: 120 DLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQE---- 175
           +L   AV+VNST+S ++  + L FH  V   S +  + +   +L P      H Q     
Sbjct: 346 NLLACAVVVNSTVSNAKKQEKLVFH--VVTNSLNFPAIWMWFLLNPPGKATVHIQSIENF 403

Query: 176 ----EVKKVIRTASTGVKYSVQ-NFEEIVPFVIASVHQSLSKFIYMSPSVIVKGRVEELI 230
               +     +  S+  +Y+ + N+   + F +  +  +L+K ++    V+V+  +  L 
Sbjct: 404 EWLPKYNTFNKHNSSDPRYTSELNY---LRFYLPDIFPTLNKILFFDHDVVVQQDLSGLW 460

Query: 231 GIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLLID 290
             ++    IAA   C E   S+   D+        S P+++ K + VN+C    GM L D
Sbjct: 461 NANMKGKVIAAVGTCQEGGTSFHRMDMFINF----SDPFIA-KRFDVNACTWAFGMNLFD 515

Query: 291 ARKLEKYILEAFLWWKKVINQRDR---SIGRSPAIALALYDRYLKLSSSWLVTD-STSSV 346
            ++  ++ L A L+ + +     R   +IG  P   L  Y++   L   W +      SV
Sbjct: 516 LQQWRRHNLTA-LYHRYLQMGSKRPLWNIGSLPLGWLTFYNKTKVLDRRWHILGLGYDSV 574

Query: 347 VNK-----SLAIRYDG 357
           V+K     +  I YDG
Sbjct: 575 VDKNEIERAAIIHYDG 590


>gi|297843072|ref|XP_002889417.1| hypothetical protein ARALYDRAFT_470231 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335259|gb|EFH65676.1| hypothetical protein ARALYDRAFT_470231 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 361

 Score = 45.8 bits (107), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 77/336 (22%), Positives = 128/336 (38%), Gaps = 32/336 (9%)

Query: 49  ALGLIVFLGLLQ-FLPATHFR--HP-SDPFRIWVPFNSNTSFSKTRDSG-DRNSGSISSI 103
           AL +I+    LQ F PA   R  HP +D F+       N+  S  R+S   RN+    S 
Sbjct: 14  ALAMILLSPSLQSFSPAAAIRSSHPYADEFKP----QQNSDHSSFRESPLFRNAEQCRSS 69

Query: 104 SQDDG-----MVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFY 158
            +D G     +VH+   +    LR     VNS L  S  P  + FH  V    +++ S  
Sbjct: 70  GEDSGVCNPNLVHVAITLDIDYLRGSIAAVNSILQHSMCPQSVFFHFLVSSEIQNLESL- 128

Query: 159 KLKVLFPH-SNLE--FHGQEEVKKVIRTASTGVKYSV-QNFEEIVPFVIASVHQSLSKFI 214
            ++  FP  +NL+  +   E V+ +I   S+ V+ ++ Q       ++   +   + + I
Sbjct: 129 -IRSTFPKLTNLKIYYFAPETVQSLI---SSSVRQALEQPLNYARNYLADLLEPCVKRVI 184

Query: 215 YMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKP 274
           Y+   ++V   + +L    L    I A + C      Y             S    +G  
Sbjct: 185 YLDSDLVVVDDIVKLWKTGLGQRTIGAPEYCHANFTKYFTGGFW-------SDKRFNGTF 237

Query: 275 YAVNSCMPDLGMLLIDARKLE--KYILEAFLWWKKVINQRDRSIGRSPAIALALYDRYLK 332
              N C  + G+++ID +K    +Y      W +    +R   +G  P   L        
Sbjct: 238 KGRNPCYFNTGVMVIDLKKWRRFRYTKRIEKWMEIQKMERIYELGSLPPFLLVFAGHVAP 297

Query: 333 LSSSWLVTDSTSSVVNKSLAIRYDGPMTACSEFGDG 368
           +S  W         V  S    + GP++     G G
Sbjct: 298 ISHRWNQHGLGGDNVRGSCRDLHSGPVSLLHWSGSG 333


>gi|224124222|ref|XP_002330135.1| glycosyltransferase [Populus trichocarpa]
 gi|222871269|gb|EEF08400.1| glycosyltransferase [Populus trichocarpa]
          Length = 531

 Score = 45.4 bits (106), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 57/272 (20%), Positives = 101/272 (37%), Gaps = 48/272 (17%)

Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQEE-------- 176
           +V+VNST+  + +P  L FH+     S   +  + L   F  + +E    EE        
Sbjct: 239 SVVVNSTVCNADHPKQLVFHIVTNGISYGSMQAWFLTNDFKGATVEVQNIEEFSWLNASY 298

Query: 177 ---VKKVIRTASTGVKYSVQNFEEIVP---------------FVIASVHQSLSKFIYMSP 218
              +K++I   S    +      ++ P               F I  ++  L K +++  
Sbjct: 299 APVIKQIIHQDSRAYYFGADQDMKVEPKLRNPKYLSLLNHLRFYIPEIYPLLEKIVFLDD 358

Query: 219 SVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWVSGKPY 275
            V+V+  +  L  +DL      A + C E   R   Y+N           S P +S K +
Sbjct: 359 DVVVQKDLTRLFSLDLHGNVNGAVETCLETFHRYYKYIN----------FSNPIISSK-F 407

Query: 276 AVNSCMPDLGMLLID--ARKLEKYILEAFLWWKKVINQRDRSIGRSPAIALALYDRYLKL 333
              +C    GM + D  A + E    +   W ++  +Q    +G  P   LA Y     L
Sbjct: 408 DPQACGWAFGMNIFDLIAWRKENVTAQYHYWQEQNADQTLWKLGTLPPALLAFYGLTEPL 467

Query: 334 SSSWLV------TDSTSSVVNKSLAIRYDGPM 359
              W V       +    +++ +  I ++G M
Sbjct: 468 DRRWHVLGLGYDMNIDDRLIDSAAVIHFNGNM 499


>gi|240254105|ref|NP_564057.4| alpha-1,4-galacturonosyltransferase [Arabidopsis thaliana]
 gi|75164846|sp|Q949Q1.1|GAUTB_ARATH RecName: Full=Probable galacturonosyltransferase 11
 gi|15293067|gb|AAK93644.1| unknown protein [Arabidopsis thaliana]
 gi|17064736|gb|AAL32522.1| Unknown protein [Arabidopsis thaliana]
 gi|332191610|gb|AEE29731.1| alpha-1,4-galacturonosyltransferase [Arabidopsis thaliana]
          Length = 537

 Score = 45.4 bits (106), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 57/273 (20%), Positives = 105/273 (38%), Gaps = 48/273 (17%)

Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQEE-------- 176
           +V+VNST+S + +P  L FH+   + S   +  + L   F  S +E    EE        
Sbjct: 243 SVVVNSTVSNADHPKQLVFHIVTNRVSYKAMQAWFLSNDFKGSAIEIRSVEEFSWLNASY 302

Query: 177 ---VKKVIRTASTGVKYSVQNFEEIVP-----------------FVIASVHQSLSKFIYM 216
              VK+++ T +    +  Q  ++ +                  F I  ++  L K +++
Sbjct: 303 SPVVKQLLDTDARAYYFGEQTSQDTISEPKVRNPKYLSLLNHLRFYIPEIYPQLEKIVFL 362

Query: 217 SPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYA 276
              V+V+  +  L  +DL      A + C E  + Y           + S P +S K + 
Sbjct: 363 DDDVVVQKDLTPLFSLDLHGNVNGAVETCLEAFHRYYK-------YLNFSNPLISSK-FD 414

Query: 277 VNSCMPDLGMLLIDARKLEKYILEA-FLWWKKVINQRDRS---IGRSPAIALALYDRYLK 332
             +C    GM + D        + A + +W+     R+R+   +G  P   L+ Y     
Sbjct: 415 PQACGWAFGMNVFDLIAWRNANVTARYHYWQD--QNRERTLWKLGTLPPGLLSFYGLTEP 472

Query: 333 LSSSWLV------TDSTSSVVNKSLAIRYDGPM 359
           L   W V       +  + ++  +  I Y+G M
Sbjct: 473 LDRRWHVLGLGYDVNIDNRLIETAAVIHYNGNM 505


>gi|218200953|gb|EEC83380.1| hypothetical protein OsI_28795 [Oryza sativa Indica Group]
          Length = 650

 Score = 45.1 bits (105), Expect = 0.059,   Method: Compositional matrix adjust.
 Identities = 62/275 (22%), Positives = 111/275 (40%), Gaps = 53/275 (19%)

Query: 125 AVLVNSTLSGSRYPDLLHFHLFV---------------PKGS-----EDMVSFYKLKVLF 164
           AV+VNST+  +++P    FH+                 P G       ++  F  L   +
Sbjct: 349 AVVVNSTVLNAKHPSHHVFHIVTDRLNYAPMRMWFLSNPPGKATIEVRNIEEFTWLNASY 408

Query: 165 PHSNLEFHGQEEVKKVIRT----ASTGVKYSVQNFEEIVP---FVIASVHQSLSKFIYMS 217
                +   Q  +    RT    + + +KY    +  I+    F +  ++ +L K +++ 
Sbjct: 409 SPVLKQLESQSMIDYYFRTHRANSDSNLKYRNPKYLSILNHLRFYLPEIYPNLHKIVFLD 468

Query: 218 PSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWVSGKP 274
             V++K  +  L  ID+    I   + C E   R + Y+N           S P V  K 
Sbjct: 469 DDVVIKKDLTSLWSIDMKGKVIGVVETCGESFHRFDRYLN----------FSNP-VIVKN 517

Query: 275 YAVNSCMPDLGMLLID-ARKLEKYILEAFLWWKKVINQRDR---SIGRSPAIALALYDRY 330
           +  ++C    GM + D A    + I E +  W+K +NQ DR    +G  P   +  +++ 
Sbjct: 518 FDPHACGWAFGMNVFDLAEWRRQNITEIYHSWQK-LNQ-DRLLWKLGTLPPGLITFWNKT 575

Query: 331 LKLSSSWLVT------DSTSSVVNKSLAIRYDGPM 359
           L L+ SW V         +S  + ++  I Y+G M
Sbjct: 576 LPLNRSWHVLGLGYNPHVSSRDIERAAVIHYNGNM 610


>gi|224101685|ref|XP_002312381.1| glycosyltransferase, CAZy family GT8 [Populus trichocarpa]
 gi|222852201|gb|EEE89748.1| glycosyltransferase, CAZy family GT8 [Populus trichocarpa]
          Length = 346

 Score = 44.7 bits (104), Expect = 0.082,   Method: Compositional matrix adjust.
 Identities = 48/229 (20%), Positives = 82/229 (35%), Gaps = 21/229 (9%)

Query: 105 QDDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDM-----VSFYK 159
            D  ++HI   +    LR     V S L  +  P+ + FH        D+      +F  
Sbjct: 57  HDPSIIHIAMTLDATYLRGSVAGVLSVLQHAACPEHIVFHFIATHRRADLRRTITSTFPY 116

Query: 160 LKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVPFVIASVHQSLSKFIYMSPS 219
           L     H N +    +    + R     + Y+     +++PF +        + IY    
Sbjct: 117 LTFHLYHFNTDLVRGKISSSIRRALDQPLNYARIYLADLLPFTV-------RRIIYFDSD 169

Query: 220 VIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNS 279
           +IV   V +L  I+L  + + A + C    + Y N        R  S P  +       +
Sbjct: 170 LIVVDDVAKLWNINLGAHVLGAPEYCHVNFSYYFN-------SRFWSSPVYATSFTGRRA 222

Query: 280 CMPDLGMLLIDARKLE--KYILEAFLWWKKVINQRDRSIGRSPAIALAL 326
           C  + G+++ID RK    KY  +   W +     R   +G  P   L  
Sbjct: 223 CYFNTGVMVIDLRKWREGKYTEKLEYWMRVQKKNRIYELGSLPPFLLVF 271


>gi|356534929|ref|XP_003536003.1| PREDICTED: probable galacturonosyltransferase-like 7-like [Glycine
           max]
          Length = 359

 Score = 44.7 bits (104), Expect = 0.089,   Method: Compositional matrix adjust.
 Identities = 71/335 (21%), Positives = 127/335 (37%), Gaps = 27/335 (8%)

Query: 47  FIALGLIVFLG--LLQFLPATHFR---HPSDPFRIWVPFNSNTSFSKTRDSGDRNSGSIS 101
           F A  L++ L   L  F PA   R   H     R+  P  S     + R++ D N  + S
Sbjct: 11  FSAAMLVILLSPSLQSFHPAEAIRSSHHLDGLLRLPPPRLSFRPAPRFRNAADANKCASS 70

Query: 102 SISQ---DDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFY 158
           S+S    D  +VH+   +    LR     V+S L  S+ P+ + FH  V + + + +   
Sbjct: 71  SVSTSVCDPSLVHVAITLDVEYLRGSIAAVHSILQHSQCPENIFFHFLVSETNLESL--- 127

Query: 159 KLKVLFPHSNLE--FHGQEEVKKVIRTASTGVKYSV-QNFEEIVPFVIASVHQSLSKFIY 215
            +K  FP  N +  +   E V+ +I   ST V+ ++ Q       ++   +   + + IY
Sbjct: 128 -VKSTFPQLNFKVYYFDPEIVRNLI---STSVRQALEQPLNYARNYLADLLEPCVERVIY 183

Query: 216 MSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPY 275
           +   ++V   + +L    L +  I A + C      Y         +   ++ +   +P 
Sbjct: 184 LDSDLVVVDDIAKLWSTSLGSRTIGAPEYCHANFTKYFTAAFWSDTR--FARAFAGRRP- 240

Query: 276 AVNSCMPDLGMLLIDARKLEK--YILEAFLWWKKVINQRDRSIGRSPAIALALYDRYLKL 333
               C  + G+++ID  +  +  Y      W +   N R   +G  P   L        +
Sbjct: 241 ----CYFNTGVMVIDLVRWRRIGYSKRIERWMEIQKNDRIYELGSLPPFLLVFAGHVAPI 296

Query: 334 SSSWLVTDSTSSVVNKSLAIRYDGPMTACSEFGDG 368
              W         V  S    + GP++     G G
Sbjct: 297 EHRWNQHGLGGDNVKGSCRDLHAGPVSLLHWSGSG 331


>gi|356502770|ref|XP_003520189.1| PREDICTED: probable galacturonosyltransferase 6-like [Glycine max]
          Length = 585

 Score = 44.3 bits (103), Expect = 0.092,   Method: Compositional matrix adjust.
 Identities = 54/252 (21%), Positives = 104/252 (41%), Gaps = 35/252 (13%)

Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKL------KVLFPHSNLEFHGQEEVK 178
           A +VNST+S ++  + L FH+     +   +S + L        +   S   F    +  
Sbjct: 315 AAVVNSTISTAKEQEKLVFHVLTKSLNLPSISMWFLINPPGKATVHILSIDNFEWSSKYN 374

Query: 179 KVIRTASTGVKYSVQ-NFEEIVPFVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNY 237
                 S+  +Y+ + N+   + F +  +  +L+K +     V+V+  + EL  I++   
Sbjct: 375 TYQENNSSDPRYTSELNY---LRFYLPDIFPALNKIVLFDHDVVVQRDLSELWNINMKGK 431

Query: 238 AIAAADDCSE------RLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDA 291
            I A   C E      R++ ++N           S P + GK + VN+C    GM L D 
Sbjct: 432 VIGAIGTCQEGKIPFHRIDMFIN----------LSDPLI-GKRFDVNACTWAFGMNLFDL 480

Query: 292 RKLEKYILEAFLWWKKVINQRDRSIGRSPAIALALYDRYLKLSSSWLV------TDSTSS 345
           ++  ++ L     ++  +     +IG  P   L  Y++   L   W V      ++   +
Sbjct: 481 QQWRRHNLTVV--YQNYLQMGLWNIGSLPLGWLTFYNKTELLDRQWHVLGLGYSSNVDRN 538

Query: 346 VVNKSLAIRYDG 357
            + ++  I YDG
Sbjct: 539 EIEQAAVIHYDG 550


>gi|147771376|emb|CAN62993.1| hypothetical protein VITISV_021618 [Vitis vinifera]
          Length = 367

 Score = 44.3 bits (103), Expect = 0.092,   Method: Compositional matrix adjust.
 Identities = 60/296 (20%), Positives = 112/296 (37%), Gaps = 22/296 (7%)

Query: 80  FNSNTSFSKTRDSG--DRNSGSISSISQDDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRY 137
           F   ++F+   + G  DRN    SS+  D  +VH+   +    LR     V+S L  S+ 
Sbjct: 59  FRKASAFNNAEECGFRDRNFSGKSSVC-DPFLVHVAITLDVHYLRGSMAAVHSILQHSQC 117

Query: 138 PDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLE--FHGQEEVKKVIRTASTGVKYSVQNF 195
           P+ + FH  V +   +++    ++  FP    +  +   E V+ +I   ST V+ ++++ 
Sbjct: 118 PEDIFFHFLVSETHLEIL----VRSTFPQLKFKVYYFNPEIVRNLI---STSVREALEHP 170

Query: 196 EEIVPFVIASVHQ-SLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVN 254
                  +A + +  + + IY+   +IV   + +L    L    I A + C      Y  
Sbjct: 171 LNYARNYLADLLEPCVRRVIYLDSDLIVVDDIYKLWSTSLGTRTIGAPEYCHANFTRYFT 230

Query: 255 PDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEKYILEAFL--WWKKVINQR 312
                   +  S+    G       C  + G+++ID  K  ++     +  W +   N R
Sbjct: 231 -------DKFWSEKRYYGTFDGRKPCYFNTGVIVIDLAKWRRFGFTKRIERWMEVQKNNR 283

Query: 313 DRSIGRSPAIALALYDRYLKLSSSWLVTDSTSSVVNKSLAIRYDGPMTACSEFGDG 368
              +G  P   L        +   W         V  S    + GP++     G G
Sbjct: 284 IYELGSLPPYLLVFAGHVAPIEHRWNQHGLGGDNVKGSCRELHPGPVSLLHWSGSG 339


>gi|449491241|ref|XP_004158838.1| PREDICTED: probable galacturonosyltransferase 4-like [Cucumis
           sativus]
          Length = 641

 Score = 43.9 bits (102), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 43/194 (22%), Positives = 79/194 (40%), Gaps = 27/194 (13%)

Query: 201 FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDV 257
           F +  +   L K +++   ++V+  +  L  IDL      A + C E   R + Y+N   
Sbjct: 450 FYLPQLFPKLKKVLFLDDDIVVQKDLTGLWSIDLKGNVNGAVETCGESFHRFDRYLN--- 506

Query: 258 LDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEKY-ILEAFLWWKKVINQRDR-S 315
                   S P +S K +  ++C    GM + D  + ++  I E +  W+K+   R    
Sbjct: 507 -------FSNPLIS-KSFDPHACGWAYGMNIFDLDEWKRQNITEVYHSWQKLNYDRQLWK 558

Query: 316 IGRSPAIALALYDRYLKLSSSWLV------TDSTSSVVNKSLAIRYDGPMTACSEFGDGA 369
           +G  P   +  + R  +L  SW V      T+     ++++  I Y+G M    E     
Sbjct: 559 LGTLPPGLITFWKRTYQLDKSWHVLGLGYNTNVGQKEIDRAAVIHYNGNMKPWLEIA--- 615

Query: 370 NMEPARGDLWKQHL 383
              P   + W +H+
Sbjct: 616 --IPKYRNYWTKHV 627


>gi|449434420|ref|XP_004134994.1| PREDICTED: probable galacturonosyltransferase 4-like [Cucumis
           sativus]
          Length = 625

 Score = 43.9 bits (102), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 43/194 (22%), Positives = 79/194 (40%), Gaps = 27/194 (13%)

Query: 201 FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDV 257
           F +  +   L K +++   ++V+  +  L  IDL      A + C E   R + Y+N   
Sbjct: 434 FYLPQLFPKLKKVLFLDDDIVVQKDLTGLWSIDLKGNVNGAVETCGESFHRFDRYLN--- 490

Query: 258 LDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEKY-ILEAFLWWKKVINQRDR-S 315
                   S P +S K +  ++C    GM + D  + ++  I E +  W+K+   R    
Sbjct: 491 -------FSNPLIS-KSFDPHACGWAYGMNIFDLDEWKRQNITEVYHSWQKLNYDRQLWK 542

Query: 316 IGRSPAIALALYDRYLKLSSSWLV------TDSTSSVVNKSLAIRYDGPMTACSEFGDGA 369
           +G  P   +  + R  +L  SW V      T+     ++++  I Y+G M    E     
Sbjct: 543 LGTLPPGLITFWKRTYQLDKSWHVLGLGYNTNVGQKEIDRAAVIHYNGNMKPWLEIA--- 599

Query: 370 NMEPARGDLWKQHL 383
              P   + W +H+
Sbjct: 600 --IPKYRNYWTKHV 611


>gi|297833366|ref|XP_002884565.1| hypothetical protein ARALYDRAFT_477926 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297330405|gb|EFH60824.1| hypothetical protein ARALYDRAFT_477926 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 351

 Score = 43.9 bits (102), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 51/244 (20%), Positives = 94/244 (38%), Gaps = 26/244 (10%)

Query: 104 SQDDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFH-LFVPKGSEDMVSFYKLKV 162
           +++   +HI   +    LR     V S L  S  P+ L FH L +P    D+  F  +K 
Sbjct: 60  TREADQIHIAMTLDTNYLRGTMAAVLSLLQHSTCPENLSFHFLSLPHFENDL--FTSIKS 117

Query: 163 LFPHSNLEFHG------QEEVKKVIRTA-STGVKYSVQNFEEIVPFVIASVHQSLSKFIY 215
            FP+ N + +       + ++ K IR A    + Y+     +I+P        S+ + IY
Sbjct: 118 TFPYLNFKIYQFDPNLVRSKISKSIRQALDQPLNYARIYLADIIP-------SSVDRIIY 170

Query: 216 MSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPY 275
           +   ++V   +E+L  +++    +AA + C      Y         +   S P +     
Sbjct: 171 LDSDLVVVDDIEKLWHVEMEGKVVAAPEYCHANFTHYFT-------KTFWSDPVLVKVLE 223

Query: 276 AVNSCMPDLGMLLIDARKLEK--YILEAFLWWKKVINQRDRSIGRSPAIALALYDRYLKL 333
               C  + G++++D  K  K  Y  +   W      +R   +G  P   L        +
Sbjct: 224 GKRPCYFNTGVMVVDVNKWRKGMYTQKVEEWMTIQKQKRIYHLGSLPPFLLIFAGDIKAV 283

Query: 334 SSSW 337
           +  W
Sbjct: 284 NHRW 287


>gi|224100589|ref|XP_002311936.1| glycosyltransferase, CAZy family GT8 [Populus trichocarpa]
 gi|222851756|gb|EEE89303.1| glycosyltransferase, CAZy family GT8 [Populus trichocarpa]
          Length = 347

 Score = 43.5 bits (101), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 53/238 (22%), Positives = 94/238 (39%), Gaps = 27/238 (11%)

Query: 110 VHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNL 169
           +HIV  +    LR     V S L  S  P+ + FH    +   ++  F  +K  FP+ N 
Sbjct: 63  IHIVMTLDANYLRGTMAAVLSILQHSTCPENMEFHFLWSRFEREV--FSSIKSTFPYLNF 120

Query: 170 EFHG------QEEVKKVIRTA-STGVKYSVQNFEEIVPFVIASVHQSLSKFIYMSPSVIV 222
           +F+       + ++ K IR A    + Y+     +I+P         + + IY+   ++V
Sbjct: 121 KFYRFDSNRVRGKISKSIRQALDQPLNYARIYLADIIP-------SDVKRVIYLDSDLVV 173

Query: 223 KGRVEELIGIDLSNYAIAAADDCSERLNSYV-NPDVLDAIQRSASQPWVSGKPYAVNSCM 281
              + +L  +DL    +AA + C     +Y  N   LD        P ++   +    C 
Sbjct: 174 VDDIAKLWEVDLEEKVLAAPEYCHANFTNYFSNLFWLD--------PVLAKTFHGRRPCY 225

Query: 282 PDLGMLLIDARKLEK-YILEAFLWWKKVINQ-RDRSIGRSPAIALALYDRYLKLSSSW 337
            + G++++D  K  +  I +    W  V  Q R   +G  P   L L      +   W
Sbjct: 226 FNTGVMVVDVEKWRQGGITQKVEEWMTVQKQKRIYHLGSLPPFLLVLAGNIKGVDHRW 283


>gi|225437483|ref|XP_002274175.1| PREDICTED: probable galacturonosyltransferase-like 7-like [Vitis
           vinifera]
          Length = 450

 Score = 43.5 bits (101), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 60/296 (20%), Positives = 112/296 (37%), Gaps = 22/296 (7%)

Query: 80  FNSNTSFSKTRDSG--DRNSGSISSISQDDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRY 137
           F   ++F+   + G  DRN    SS+  D  +VH+   +    LR     V+S L  S+ 
Sbjct: 142 FRKASAFNNAEECGFRDRNFSGKSSVC-DPFLVHVAITLDVHYLRGSMAAVHSILQHSQC 200

Query: 138 PDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLE--FHGQEEVKKVIRTASTGVKYSVQNF 195
           P+ + FH  V +   +++    ++  FP    +  +   E V+ +I   ST V+ ++++ 
Sbjct: 201 PEDIFFHFLVSETHLEIL----VRSTFPQLKFKVYYFNPEIVRNLI---STSVREALEHP 253

Query: 196 EEIVPFVIASVHQ-SLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVN 254
                  +A + +  + + IY+   +IV   + +L    L    I A + C      Y  
Sbjct: 254 LNYARNYLADLLEPCVRRVIYLDSDLIVVDDIYKLWSTSLGTRTIGAPEYCHANFTRYFT 313

Query: 255 PDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEKYILEAFL--WWKKVINQR 312
                   +  S+    G       C  + G+++ID  K  ++     +  W +   N R
Sbjct: 314 -------DKFWSEKRYYGTFDGRKPCYFNTGVIVIDLAKWRRFGFTKRIERWMEVQKNNR 366

Query: 313 DRSIGRSPAIALALYDRYLKLSSSWLVTDSTSSVVNKSLAIRYDGPMTACSEFGDG 368
              +G  P   L        +   W         V  S    + GP++     G G
Sbjct: 367 IYELGSLPPYLLVFAGHVAPIEHRWNQHGLGGDNVKGSCRELHPGPVSLLHWSGSG 422


>gi|15230679|ref|NP_187277.1| putative galacturonosyltransferase-like 4 [Arabidopsis thaliana]
 gi|75186391|sp|Q9M8J2.1|GATL4_ARATH RecName: Full=Probable galacturonosyltransferase-like 4; AltName:
           Full=Galactinol synthase 9; Short=AtGolS9; Short=GolS-9
 gi|6862930|gb|AAF30319.1|AC018907_19 putative glycosyl transferase [Arabidopsis thaliana]
 gi|332640845|gb|AEE74366.1| putative galacturonosyltransferase-like 4 [Arabidopsis thaliana]
          Length = 351

 Score = 43.5 bits (101), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 50/231 (21%), Positives = 91/231 (39%), Gaps = 26/231 (11%)

Query: 104 SQDDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFH-LFVPKGSEDMVSFYKLKV 162
           +++   +HI   +    LR     V S L  S  P+ L FH L +P    D+  F  +K 
Sbjct: 60  TREADQIHIAMTLDTNYLRGTMAAVLSLLQHSTCPENLSFHFLSLPHFENDL--FTSIKS 117

Query: 163 LFPHSNLEFHG------QEEVKKVIRTA-STGVKYSVQNFEEIVPFVIASVHQSLSKFIY 215
            FP+ N + +       + ++ K IR A    + Y+     +I+P        S+ + IY
Sbjct: 118 TFPYLNFKIYQFDPNLVRSKISKSIRQALDQPLNYARIYLADIIP-------SSVDRIIY 170

Query: 216 MSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPY 275
           +   ++V   +E+L  +++    +AA + C      Y         +   S P +     
Sbjct: 171 LDSDLVVVDDIEKLWHVEMEGKVVAAPEYCHANFTHYFT-------RTFWSDPVLVKVLE 223

Query: 276 AVNSCMPDLGMLLIDARKLEK--YILEAFLWWKKVINQRDRSIGRSPAIAL 324
               C  + G++++D  K  K  Y  +   W      +R   +G  P   L
Sbjct: 224 GKRPCYFNTGVMVVDVNKWRKGMYTQKVEEWMTIQKQKRIYHLGSLPPFLL 274


>gi|297843472|ref|XP_002889617.1| hypothetical protein ARALYDRAFT_470701 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335459|gb|EFH65876.1| hypothetical protein ARALYDRAFT_470701 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 595

 Score = 43.5 bits (101), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 61/291 (20%), Positives = 112/291 (38%), Gaps = 58/291 (19%)

Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQEEV------- 177
           +V+VNST+S S+ P+ + FH+     +   +S + L  +   + ++    +++       
Sbjct: 319 SVVVNSTISSSKEPERIVFHVVTDSLNYPAISMWFLLNIQSRATIQILNIDDMDVLPPDY 378

Query: 178 -KKVIRTASTGVKYSVQNFEEIVP-----FVIASVHQSLSKFIYMSPSVIVKGRVEELIG 231
            + +++  S   ++       I P     F +  +   L+K +     V+V+  +  L  
Sbjct: 379 DQLLMKQNSNDPRF-------ISPLNHARFYLPDIFPGLNKIVLFDHDVVVQRDLSRLWS 431

Query: 232 IDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLLID- 290
           ID+    + A + C E  +S+ +           S  WV+GK ++  +C    GM LID 
Sbjct: 432 IDMKGKVVGAVETCLEGESSFRSMSTFINF----SDTWVAGK-FSPRACTWAFGMNLIDL 486

Query: 291 ----ARKLEKYILEAF-------LWWKKVINQRDRSIGRSPAIALALYDRYLKLSSSWLV 339
                RKL    ++ F       LW            G  P   L  Y + L L   W V
Sbjct: 487 EEWRIRKLTSTYIKYFNLGTKRPLW----------KAGSLPIGWLTFYRQTLALDKRWHV 536

Query: 340 ------TDSTSSVVNKSLAIRYDGPMTACSEFGDGANMEPARGDLWKQHLP 384
                 +   +  + ++  I YDG M    + G     +      W  H+P
Sbjct: 537 MGLGRESGVKAVDIEQAAVIHYDGVMKPWLDIG-----KENYKRYWNIHVP 582


>gi|357464109|ref|XP_003602336.1| Glycosyltransferase CAZy family GT8 [Medicago truncatula]
 gi|355491384|gb|AES72587.1| Glycosyltransferase CAZy family GT8 [Medicago truncatula]
          Length = 343

 Score = 43.5 bits (101), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 56/268 (20%), Positives = 94/268 (35%), Gaps = 34/268 (12%)

Query: 80  FNSNTSFSKTRDSGDRNSGSISSISQDDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPD 139
           F    +F   R+   R + SI         +HI   +    LR     V S L  +  P+
Sbjct: 34  FREAPAFRNGRECPPRETSSI---------IHIAMTLDATYLRGSTAGVFSVLQHASCPE 84

Query: 140 LLHFHLFVPKGSEDMVSFYKLKV-LFPHSNLE-FHGQEEVKK------VIRTASTGVKYS 191
            + FH FV           ++ +  FP+ N   +H    + +      + R     + Y+
Sbjct: 85  NIAFH-FVTTTHRRRQELRRIIISTFPYLNFHIYHFDSNLVRGKISYSIRRALDQPLNYA 143

Query: 192 VQNFEEIVPFVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNS 251
                ++VP        +  + IY    +IV   V +L  IDL N+ + A + C     +
Sbjct: 144 RMYLADLVP-------ATAQRIIYFDSDLIVVDDVAKLWSIDLGNHVLGAPEYCHANFTT 196

Query: 252 YVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLE--KYILEAFLWWKKVI 309
           Y          R  S P  S       +C  + G+++ID  K    KY  +   W +   
Sbjct: 197 YFT-------HRFWSNPSYSASFKGREACYFNTGVMVIDLWKWREGKYTEKLENWMRIQK 249

Query: 310 NQRDRSIGRSPAIALALYDRYLKLSSSW 337
             R   +G  P   L       ++   W
Sbjct: 250 RSRIYELGSLPPFLLVFAGDVERVEHRW 277


>gi|356500226|ref|XP_003518934.1| PREDICTED: probable galacturonosyltransferase-like 7-like [Glycine
           max]
          Length = 357

 Score = 43.5 bits (101), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 69/333 (20%), Positives = 129/333 (38%), Gaps = 25/333 (7%)

Query: 47  FIALGLIVFLG--LLQFLPATHFR---HPSDPFRIWVPFNSNTSFSKTRDSGD-RNSGSI 100
           F A  L++ L   L  F PA   R   H     R+  P  S    +  R++ D + + S+
Sbjct: 11  FSAAMLVILLSPSLQSFHPAEAIRSSHHLDGLLRLPPPRLSFRPAAPFRNAADGKCASSV 70

Query: 101 SSISQDDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKL 160
            +   D  +VH+   +    LR     V+S L  S+ P+ + FH  V + + + +    +
Sbjct: 71  PTSVCDPSLVHVAITLDVEYLRGSIAAVHSILQHSQCPENIFFHFLVSETNLESL----V 126

Query: 161 KVLFPHSNLE--FHGQEEVKKVIRTASTGVKYSV-QNFEEIVPFVIASVHQSLSKFIYMS 217
           K  FP  N +  +   E V+ +I   ST V+ ++ Q       ++   +   + + IY+ 
Sbjct: 127 KSTFPQLNFKVYYFDPEIVRNLI---STSVRQALEQPLNYARNYLADLLEPCVERVIYLD 183

Query: 218 PSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAV 277
             +++   + +L    L +  I A + C      Y        ++ +++    +G+    
Sbjct: 184 SDLVLVDDIAKLWSTSLGSRTIGAPEYCHANFTKYFTAGFWSDMRFASA---FAGR---- 236

Query: 278 NSCMPDLGMLLIDARKLEK--YILEAFLWWKKVINQRDRSIGRSPAIALALYDRYLKLSS 335
             C  + G+++ID  +  K  Y      W +   N R   +G  P   L    R   +  
Sbjct: 237 RPCYFNTGVMVIDLVRWRKIGYSKRIERWMEIQKNDRIYELGSLPPFLLVFAGRVAPIEH 296

Query: 336 SWLVTDSTSSVVNKSLAIRYDGPMTACSEFGDG 368
            W         V  S    + GP++     G G
Sbjct: 297 RWNQHGLGGDNVKGSCRDLHAGPVSLLHWSGSG 329


>gi|225432474|ref|XP_002277334.1| PREDICTED: probable galacturonosyltransferase-like 4 [Vitis
           vinifera]
          Length = 352

 Score = 43.5 bits (101), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 48/243 (19%), Positives = 93/243 (38%), Gaps = 25/243 (10%)

Query: 104 SQDDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVL 163
           ++D   +H+   +    LR     V S L  S  P+ L FH    +   ++VS   ++  
Sbjct: 62  TRDVDGIHVSMTLDANYLRGTMAAVLSILQHSTCPENLSFHFLSAQHVPEIVS--TIQAT 119

Query: 164 FPHSNLEFHG------QEEVKKVIRTA-STGVKYSVQNFEEIVPFVIASVHQSLSKFIYM 216
           FP+ N   +       + ++ K IR A    + Y+     +I+P        ++ + IY+
Sbjct: 120 FPYLNFRVYRFDSNRVRGKISKSIRRALDQPLNYARIYLADILP-------ANVRRVIYL 172

Query: 217 SPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYA 276
              +++   +  L G+DL +  +AA + C      Y   +         S P ++     
Sbjct: 173 DSDLVMVDDISNLWGVDLGDKVVAAPEYCHANFTKYFTDEFW-------SSPEMAKTFKG 225

Query: 277 VNSCMPDLGMLLIDARKLEK--YILEAFLWWKKVINQRDRSIGRSPAIALALYDRYLKLS 334
            + C  + G++++D  +  K  Y  +   W       R   +G  P   L L      + 
Sbjct: 226 RSPCYFNTGVMVVDVDRWRKGGYTQKVEEWMAVQKQNRIYDLGSLPPFLLVLAGNIKAVD 285

Query: 335 SSW 337
             W
Sbjct: 286 HRW 288


>gi|296080851|emb|CBI18781.3| unnamed protein product [Vitis vinifera]
          Length = 638

 Score = 43.1 bits (100), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 39/177 (22%), Positives = 73/177 (41%), Gaps = 22/177 (12%)

Query: 201 FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDV 257
           F +  +   L+K +++   ++V+  +  L  IDL      A + C E   R + Y+N   
Sbjct: 447 FYLPEIFPKLNKVLFLDDDIVVQKDLTGLWSIDLKGNVNGAVETCGESFHRFDRYLN--- 503

Query: 258 LDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEK-YILEAFLWWKKVINQRDR-S 315
                   S P +S K +  ++C    GM + D  + +K +I E +  W+K+ + R    
Sbjct: 504 -------FSNPLIS-KNFDSHACGWAYGMNIFDLDQWKKQHITEVYHTWQKLNHDRQLWK 555

Query: 316 IGRSPAIALALYDRYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPMTACSEFG 366
           +G  P   +  + R   +  SW V             + ++  I Y+G +    E G
Sbjct: 556 LGTLPPGLITFWKRTFPIDRSWHVLGLGYNPSVNRREIERAAVIHYNGNLKPWLEIG 612


>gi|413920988|gb|AFW60920.1| transferase [Zea mays]
          Length = 648

 Score = 43.1 bits (100), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 42/179 (23%), Positives = 75/179 (41%), Gaps = 26/179 (14%)

Query: 201 FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDV 257
           F +  ++  L K +++   ++VK  +  L  I++      A + C E   R + Y+N   
Sbjct: 457 FYLPEIYPKLDKMVFLDDDIVVKKDLAGLWSINMKGKVNGAVETCGESFHRYDRYLN--- 513

Query: 258 LDAIQRSASQPWVSGKPYAVNSCMPDLGMLLID-ARKLEKYILEAFLWWKKVINQRDRS- 315
                   S P ++ K +  ++C+   GM + D A    + I E +  W+K+    DRS 
Sbjct: 514 -------FSNPIIT-KSFDPHACVWAFGMNVFDLAEWRRQNITEIYHSWQKL--NEDRSL 563

Query: 316 --IGRSPAIALALYDRYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPMTACSEFG 366
             +G  P   +  +++   LS SW V          S  +  +  I Y+G M    E G
Sbjct: 564 WKLGTLPPGLVTFWNKTFPLSRSWHVLGLGYNPHVNSRDIEHAAVIHYNGNMKPWLEIG 622


>gi|168016061|ref|XP_001760568.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162688265|gb|EDQ74643.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 275

 Score = 43.1 bits (100), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 46/234 (19%), Positives = 93/234 (39%), Gaps = 18/234 (7%)

Query: 106 DDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFP 165
           ++ +VHI   +    LR     + S L  +     + FH    K  ++    +    ++ 
Sbjct: 7   NESLVHIAMTLDANYLRGSMAAIYSILLHAECASNVRFHFVATKEKKNKCKSFCRSAMYF 66

Query: 166 HSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVPFVIASVHQSLSKFIYMSPSVIVKGR 225
           +S        E+ K+I ++   +     N+     ++   +   + + IY+   V+V GR
Sbjct: 67  YSC-------ELLKLIYSSDFVITQEPLNYARF--YLAHMIDSCVKRIIYLDLDVLVLGR 117

Query: 226 VEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLG 285
           +EEL   ++ N  +   + C     SY   +    I  S +  + + +P   NS     G
Sbjct: 118 IEELWMTNMGNSTVGTPEYCHANFPSYFTENFW--INSSLASTFANKQPCYFNS-----G 170

Query: 286 MLLIDARKLEKYILEAFL-WWKKVINQRD-RSIGRSPAIALALYDRYLKLSSSW 337
           M+LI+  +  K    + L +W +V  Q+    +G  P + L        + + W
Sbjct: 171 MMLINLERWRKTRCTSTLEYWMEVQKQQHIYELGSLPPLLLTFAGSIQAIDNRW 224


>gi|223974051|gb|ACN31213.1| unknown [Zea mays]
 gi|413920987|gb|AFW60919.1| hypothetical protein ZEAMMB73_694281 [Zea mays]
          Length = 649

 Score = 43.1 bits (100), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 42/179 (23%), Positives = 75/179 (41%), Gaps = 26/179 (14%)

Query: 201 FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDV 257
           F +  ++  L K +++   ++VK  +  L  I++      A + C E   R + Y+N   
Sbjct: 458 FYLPEIYPKLDKMVFLDDDIVVKKDLAGLWSINMKGKVNGAVETCGESFHRYDRYLN--- 514

Query: 258 LDAIQRSASQPWVSGKPYAVNSCMPDLGMLLID-ARKLEKYILEAFLWWKKVINQRDRS- 315
                   S P ++ K +  ++C+   GM + D A    + I E +  W+K+    DRS 
Sbjct: 515 -------FSNPIIT-KSFDPHACVWAFGMNVFDLAEWRRQNITEIYHSWQKL--NEDRSL 564

Query: 316 --IGRSPAIALALYDRYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPMTACSEFG 366
             +G  P   +  +++   LS SW V          S  +  +  I Y+G M    E G
Sbjct: 565 WKLGTLPPGLVTFWNKTFPLSRSWHVLGLGYNPHVNSRDIEHAAVIHYNGNMKPWLEIG 623


>gi|226529842|ref|NP_001148932.1| transferase, transferring glycosyl groups [Zea mays]
 gi|195623384|gb|ACG33522.1| transferase, transferring glycosyl groups [Zea mays]
          Length = 648

 Score = 43.1 bits (100), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 42/179 (23%), Positives = 75/179 (41%), Gaps = 26/179 (14%)

Query: 201 FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDV 257
           F +  ++  L K +++   ++VK  +  L  I++      A + C E   R + Y+N   
Sbjct: 457 FYLPEIYPKLDKMVFLDDDIVVKKDLAGLWSINMKGKVNGAVETCGESFHRYDRYLN--- 513

Query: 258 LDAIQRSASQPWVSGKPYAVNSCMPDLGMLLID-ARKLEKYILEAFLWWKKVINQRDRS- 315
                   S P ++ K +  ++C+   GM + D A    + I E +  W+K+    DRS 
Sbjct: 514 -------FSNPVIT-KSFDPHACVWAFGMNVFDLAEWRRQNITEIYHSWQKL--NEDRSL 563

Query: 316 --IGRSPAIALALYDRYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPMTACSEFG 366
             +G  P   +  +++   LS SW V          S  +  +  I Y+G M    E G
Sbjct: 564 WKLGTLPPGLVTFWNKTFPLSRSWHVLGLGYNPHVNSRDIEHAAVIHYNGNMKPWLEIG 622


>gi|357145596|ref|XP_003573698.1| PREDICTED: probable galacturonosyltransferase 4-like isoform 1
           [Brachypodium distachyon]
          Length = 660

 Score = 43.1 bits (100), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 43/179 (24%), Positives = 73/179 (40%), Gaps = 26/179 (14%)

Query: 201 FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDV 257
           F +  ++  L K +++   V+VK  +  L  ID+      A + C E   R + Y+N   
Sbjct: 469 FYLPEIYPKLDKMVFLDDDVVVKKDLTGLWSIDMKGKVNGAVETCGESFHRFDRYLN--- 525

Query: 258 LDAIQRSASQPWVSGKPYAVNSCMPDLGMLLID-ARKLEKYILEAFLWWKKVINQRDR-- 314
                   S P V  K +  ++C    GM + D A    + I E +  W+K+    DR  
Sbjct: 526 -------FSNP-VIAKNFDPHACGWAFGMNVFDLAEWRRQDITEIYHSWQKL--NEDRLL 575

Query: 315 -SIGRSPAIALALYDRYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPMTACSEFG 366
             +G  P   +  +++   L+ SW V          S  + ++  I Y+G M    E G
Sbjct: 576 WKLGTLPPGLITFWNKTFPLNRSWHVLGLGYNPHVNSRDIERAAVIHYNGNMKPWLEIG 634


>gi|359496846|ref|XP_002272372.2| PREDICTED: uncharacterized protein LOC100258406 [Vitis vinifera]
          Length = 1286

 Score = 43.1 bits (100), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 39/177 (22%), Positives = 73/177 (41%), Gaps = 22/177 (12%)

Query: 201  FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDV 257
            F +  +   L+K +++   ++V+  +  L  IDL      A + C E   R + Y+N   
Sbjct: 1095 FYLPEIFPKLNKVLFLDDDIVVQKDLTGLWSIDLKGNVNGAVETCGESFHRFDRYLN--- 1151

Query: 258  LDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEK-YILEAFLWWKKVINQRDR-S 315
                    S P +S K +  ++C    GM + D  + +K +I E +  W+K+ + R    
Sbjct: 1152 -------FSNPLIS-KNFDSHACGWAYGMNIFDLDQWKKQHITEVYHTWQKLNHDRQLWK 1203

Query: 316  IGRSPAIALALYDRYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPMTACSEFG 366
            +G  P   +  + R   +  SW V             + ++  I Y+G +    E G
Sbjct: 1204 LGTLPPGLITFWKRTFPIDRSWHVLGLGYNPSVNRREIERAAVIHYNGNLKPWLEIG 1260


>gi|168057091|ref|XP_001780550.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162668028|gb|EDQ54644.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 280

 Score = 42.7 bits (99), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 46/219 (21%), Positives = 90/219 (41%), Gaps = 16/219 (7%)

Query: 123 LLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQEEVKKVIR 182
           + ++L ++  SG+     + FH     G    + F    V      L+F      + +++
Sbjct: 15  IYSILRHAECSGN-----IRFHFVATNGKVKYIRFPAFVVAETLPFLQFQTYPFDESLVK 69

Query: 183 T-ASTGVKYSVQNFEEIVPFVIAS-VHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIA 240
           +  S  V+++++       F +A  +   + + IY+   V+V  R+EEL  I++ N  + 
Sbjct: 70  SRISYAVRHALEEPLNYARFYLAHMIDPCVKRIIYLDSDVLVIDRIEELWMINMGNSTVG 129

Query: 241 AADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEKYILE 300
             + C    +SY           S +  + + KP   NS     G++LI+  +  K    
Sbjct: 130 TPEYCHANFHSYFTERFWR--NSSLASIFANKKPCYFNS-----GVMLINLDRWRKEACT 182

Query: 301 AFL-WWKKVINQRD-RSIGRSPAIALALYDRYLKLSSSW 337
           A L +W +V  +R    +G  P + L        + S W
Sbjct: 183 ATLEYWMEVQKERHIYELGSLPPLLLTFAGSIQAIDSRW 221


>gi|255559941|ref|XP_002520989.1| Glycosyltransferase QUASIMODO1, putative [Ricinus communis]
 gi|223539826|gb|EEF41406.1| Glycosyltransferase QUASIMODO1, putative [Ricinus communis]
          Length = 633

 Score = 42.7 bits (99), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 60/265 (22%), Positives = 107/265 (40%), Gaps = 33/265 (12%)

Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQEEVKKVIRTA 184
           AV+VNST+S ++  + + FH+     +   +S + L  L P S        +++ +    
Sbjct: 357 AVVVNSTVSSAKDAESIIFHVVTDSLNLPAISMWFL--LNPPSKATI----QIQSIDNFG 410

Query: 185 STGVKYSV----QNFEE--------IVPFVIASVHQSLSKFIYMSPSVIVKGRVEELIGI 232
               KYS     QN  +         + F +  +   L+K +     V+V+  +  L  +
Sbjct: 411 WLSTKYSSTVKQQNSHDPSYVSALNHLRFYLPDLFPLLNKIVLFDHDVVVQKDLTGLWSL 470

Query: 233 DLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDAR 292
           D++     A + C E   SY   D+        S P+V+ K +  N+C    GM L D +
Sbjct: 471 DMNGKVNGAVETCQESDTSYRQMDMFINF----SDPFVT-KRFDANACTWAFGMNLFDLK 525

Query: 293 KLEKYILEAFLWWKKVINQRDRSI---GRSPAIALALYDRYLKLSSSW------LVTDST 343
           +  +  L A L+ K +     R +   G  P      Y++ + L   W        +D  
Sbjct: 526 EWRRQNLTA-LYHKYLQEGYKRPLWKAGSLPVGWATFYNQTVALDKRWHRLGLGYESDVG 584

Query: 344 SSVVNKSLAIRYDGPMTACSEFGDG 368
              +N++  + YDG M    + G G
Sbjct: 585 QDDINQAAVLHYDGVMKPWMDIGVG 609


>gi|147766034|emb|CAN70213.1| hypothetical protein VITISV_038741 [Vitis vinifera]
          Length = 759

 Score = 42.4 bits (98), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 39/177 (22%), Positives = 73/177 (41%), Gaps = 22/177 (12%)

Query: 201 FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDV 257
           F +  +   L+K +++   ++V+  +  L  IDL      A + C E   R + Y+N   
Sbjct: 568 FYLPEIFPKLNKVLFLDDDIVVQKDLTGLWSIDLKGNVNGAVETCGESFHRFDRYLN--- 624

Query: 258 LDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEK-YILEAFLWWKKVINQRDR-S 315
                   S P +S K +  ++C    GM + D  + +K +I E +  W+K+ + R    
Sbjct: 625 -------FSNPLIS-KNFDSHACGWAYGMNIFDLDQWKKQHITEVYHTWQKLNHDRQLWK 676

Query: 316 IGRSPAIALALYDRYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPMTACSEFG 366
           +G  P   +  + R   +  SW V             + ++  I Y+G +    E G
Sbjct: 677 LGTLPPGLITFWKRTXPIDRSWHVLGLGYNPSVNRREIERAAVIHYNGNLKPWLEIG 733


>gi|255547123|ref|XP_002514619.1| Glycosyltransferase QUASIMODO1, putative [Ricinus communis]
 gi|223546223|gb|EEF47725.1| Glycosyltransferase QUASIMODO1, putative [Ricinus communis]
          Length = 535

 Score = 42.4 bits (98), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 53/269 (19%), Positives = 103/269 (38%), Gaps = 42/269 (15%)

Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQEE-------- 176
           +V+VNS +S + +P  L FH+     S   +  + L   F  + +E    +E        
Sbjct: 243 SVVVNSAISNADHPTQLVFHIVTNGVSYGAMQAWFLSDDFKGATIEVQNVKEFSWLNASY 302

Query: 177 ---VKKVIRTASTGVKYSVQNFEEIVP---------------FVIASVHQSLSKFIYMSP 218
              VK+++   S    +S     ++ P               F I  ++  L K +++  
Sbjct: 303 APVVKQLLAEDSRSYYFSGYQDMKVEPKLRNPKYLSLLNHLRFYIPEIYPQLEKIVFLDD 362

Query: 219 SVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVN 278
            V+V+  + +L  +DL      A + C E  + Y           + S P +S K +   
Sbjct: 363 DVVVQKDLTQLFSLDLHGNVNGAVETCLEAFHRYYK-------YLNFSNPIISSK-FDPQ 414

Query: 279 SCMPDLGMLLID--ARKLEKYILEAFLWWKKVINQRDRSIGRSPAIALALYDRYLKLSSS 336
           +C    GM + D  A +      +   W ++ +++    +G  P   LA Y     L   
Sbjct: 415 ACGWAFGMNVFDLIAWRKANVTAQYHYWQEQNVDRTLWKLGTLPPALLAFYGLTEPLDRR 474

Query: 337 WLV------TDSTSSVVNKSLAIRYDGPM 359
           W V      T+  + ++  +  + ++G M
Sbjct: 475 WHVLGLGYDTNIDNRLIESAAVVHFNGNM 503


>gi|357112099|ref|XP_003557847.1| PREDICTED: probable galacturonosyltransferase-like 4-like
           [Brachypodium distachyon]
          Length = 351

 Score = 42.4 bits (98), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 44/222 (19%), Positives = 87/222 (39%), Gaps = 17/222 (7%)

Query: 121 LRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQEEVKKV 180
           LR     V S L  +  P+ + FH    +   D+V    ++  FP+ +L  + + +  +V
Sbjct: 78  LRGTMAGVLSILQHTACPESVAFHFLATRMDGDLVGM--VRATFPYLDLRVY-RFDPSRV 134

Query: 181 IRTASTGVKYSVQ---NFEEIVPFVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNY 237
               S  +++++    N+  I  ++  ++   + + IY+   VIV   +  L  +DL  +
Sbjct: 135 RGRISRSIRHALDQPLNYARI--YLADTLPPDVRRVIYLDSDVIVVDDIRTLFSVDLGGH 192

Query: 238 AIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEK- 296
            + A + C     +Y      DA     + P +SG       C  + G++++D  +    
Sbjct: 193 VVGAPEYCHANFTNYFT----DAFW---TDPALSGTFRGRRPCYFNTGVMVMDVDRWRNG 245

Query: 297 -YILEAFLWWKKVINQRDRSIGRSPAIALALYDRYLKLSSSW 337
            Y      W      +R   +G  P   L L      +   W
Sbjct: 246 GYTRRVEGWMAVQKQKRIYHLGSLPPFLLVLAGDIQAVDHRW 287


>gi|242078863|ref|XP_002444200.1| hypothetical protein SORBIDRAFT_07g014890 [Sorghum bicolor]
 gi|241940550|gb|EES13695.1| hypothetical protein SORBIDRAFT_07g014890 [Sorghum bicolor]
          Length = 648

 Score = 42.0 bits (97), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 41/179 (22%), Positives = 75/179 (41%), Gaps = 26/179 (14%)

Query: 201 FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDV 257
           F +  ++  L K +++   ++VK  +  L  I++      A + C E   R + Y+N   
Sbjct: 457 FYLPEIYPKLDKMVFLDDDIVVKKDLTGLWSINMKGKVNGAVETCGESFHRYDRYLN--- 513

Query: 258 LDAIQRSASQPWVSGKPYAVNSCMPDLGMLLID-ARKLEKYILEAFLWWKKVINQRDRS- 315
                   S P ++ K +  ++C    GM + D A    + I + +  W+K+    DRS 
Sbjct: 514 -------FSNPIIA-KSFDPHACGWAFGMNVFDLAEWRRQNITQIYHSWQKL--NEDRSL 563

Query: 316 --IGRSPAIALALYDRYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPMTACSEFG 366
             +G  P   +  +++   LS SW V          S  + ++  I Y+G M    E G
Sbjct: 564 WKLGTLPPGLITFWNKTFPLSRSWHVLGLGYNPHVNSRDIERAAVIHYNGNMKPWLEIG 622


>gi|357158970|ref|XP_003578298.1| PREDICTED: probable galacturonosyltransferase 4-like isoform 1
           [Brachypodium distachyon]
          Length = 696

 Score = 41.2 bits (95), Expect = 0.82,   Method: Compositional matrix adjust.
 Identities = 40/179 (22%), Positives = 73/179 (40%), Gaps = 26/179 (14%)

Query: 201 FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDV 257
           F +  +   L+K +++    +V+  +  L  IDL      A + C E   R + Y+N   
Sbjct: 505 FYLPEIFPKLNKVLFLDDDTVVQQDLSALWSIDLKGKVNGAVETCGETFHRFDKYLN--- 561

Query: 258 LDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEKY-ILEAFLWWKKVINQRDR-- 314
                   S P ++   +   +C    GM + D  +  K  I + +  W+K+    DR  
Sbjct: 562 -------FSNPIIANN-FHPRACGWAYGMNMFDLSEWRKQNITDVYHTWQKL--NEDRLL 611

Query: 315 -SIGRSPAIALALYDRYLKLSSSWLV------TDSTSSVVNKSLAIRYDGPMTACSEFG 366
             +G  PA  +  ++R   L SSW +      T+     + ++  I Y+G +    E G
Sbjct: 612 WKLGTLPAGLVTFWNRTFPLDSSWHLLGLGYNTNVNERDIRRASVIHYNGNLKPWLEIG 670


>gi|357158973|ref|XP_003578299.1| PREDICTED: probable galacturonosyltransferase 4-like isoform 2
           [Brachypodium distachyon]
          Length = 690

 Score = 41.2 bits (95), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 40/179 (22%), Positives = 73/179 (40%), Gaps = 26/179 (14%)

Query: 201 FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDV 257
           F +  +   L+K +++    +V+  +  L  IDL      A + C E   R + Y+N   
Sbjct: 499 FYLPEIFPKLNKVLFLDDDTVVQQDLSALWSIDLKGKVNGAVETCGETFHRFDKYLN--- 555

Query: 258 LDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEKY-ILEAFLWWKKVINQRDR-- 314
                   S P ++   +   +C    GM + D  +  K  I + +  W+K+    DR  
Sbjct: 556 -------FSNPIIANN-FHPRACGWAYGMNMFDLSEWRKQNITDVYHTWQKL--NEDRLL 605

Query: 315 -SIGRSPAIALALYDRYLKLSSSWLV------TDSTSSVVNKSLAIRYDGPMTACSEFG 366
             +G  PA  +  ++R   L SSW +      T+     + ++  I Y+G +    E G
Sbjct: 606 WKLGTLPAGLVTFWNRTFPLDSSWHLLGLGYNTNVNERDIRRASVIHYNGNLKPWLEIG 664


>gi|413933399|gb|AFW67950.1| hypothetical protein ZEAMMB73_846223 [Zea mays]
          Length = 399

 Score = 41.2 bits (95), Expect = 0.88,   Method: Compositional matrix adjust.
 Identities = 59/277 (21%), Positives = 109/277 (39%), Gaps = 53/277 (19%)

Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPK---GSEDM----------------VSFYK------ 159
           +V+VNST+  ++ P+   FHL   K   G+ +M                V  +K      
Sbjct: 101 SVVVNSTIMNAKEPEKHVFHLVTDKLNFGAMNMWFLLNPPGDATMHVENVDDFKWLNSSY 160

Query: 160 ---LKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVP---FVIASVHQSLSKF 213
              LK L   +  E++ + +  K +   S+ +KY    +  ++    F +  V+  L+K 
Sbjct: 161 CPVLKQLESAAMKEYYFKADRPKTLSAGSSNLKYRNPKYLSMLNHLRFYLPQVYPKLNKI 220

Query: 214 IYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWV 270
           +++   ++V+  +  L  +DL+     A + C E   R + Y+N           S P +
Sbjct: 221 LFLDDDIVVQRDLTGLWEVDLNGNVNGAVETCGESFHRFDKYLN----------FSNPNI 270

Query: 271 SGKPYAVNSCMPDLGMLLIDARKLEKYILEAFL--WWKKVINQRDRSIGRSPAIALALYD 328
           + + +  N+C    GM + D  + +K  +      W     N+    +G  P   L  Y 
Sbjct: 271 A-QNFDPNACGWAYGMNMFDLEEWKKKDITGIYHKWQNMNENRLLWKLGTLPPGLLTFYK 329

Query: 329 RYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPM 359
               L  SW V           S ++ +  I Y+G M
Sbjct: 330 LTHPLDKSWHVLGLGYNPTVERSEIDNAAVIHYNGNM 366


>gi|326527369|dbj|BAK04626.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 343

 Score = 41.2 bits (95), Expect = 0.99,   Method: Compositional matrix adjust.
 Identities = 43/222 (19%), Positives = 87/222 (39%), Gaps = 17/222 (7%)

Query: 121 LRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQEEVKKV 180
           LR     V S L  +  P+ + FH    +   D+V+   L+V FP+ +L  + + +  +V
Sbjct: 70  LRGTMAGVLSILQHTACPENVVFHFLAARMDGDLVAM--LRVTFPYLDLRVY-RFDPSRV 126

Query: 181 IRTASTGVKYSVQ---NFEEIVPFVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNY 237
               S  +++++    N+  I  ++  ++   + + IY+   VIV   +  L  + L+ +
Sbjct: 127 RGRISRSIRHALDQPLNYARI--YLADTLPPDVRRVIYLDSDVIVVDDIRTLFSVHLAGH 184

Query: 238 AIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEK- 296
            + A + C     +Y               P +SG  +    C  + G++++D  +    
Sbjct: 185 VVGAPEYCHTNFTNYFTDTFW-------MDPALSGTFHGRRPCYFNTGVMVMDVDQWRTG 237

Query: 297 -YILEAFLWWKKVINQRDRSIGRSPAIALALYDRYLKLSSSW 337
            Y      W      +R   +G  P   L L      +   W
Sbjct: 238 GYTRRVEGWMAVQKQKRIYHLGSLPPFLLVLAGDIQAVDHRW 279


>gi|242049946|ref|XP_002462717.1| hypothetical protein SORBIDRAFT_02g030820 [Sorghum bicolor]
 gi|241926094|gb|EER99238.1| hypothetical protein SORBIDRAFT_02g030820 [Sorghum bicolor]
          Length = 683

 Score = 40.8 bits (94), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 60/286 (20%), Positives = 113/286 (39%), Gaps = 57/286 (19%)

Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPK---GSEDM----------------VSFYK------ 159
           +V+VNST+  ++ P+   FHL   K   G+ +M                V  +K      
Sbjct: 385 SVVVNSTIMNAKEPEKHVFHLVTDKLNFGAMNMWFLLNPPGKATIHVENVDEFKWLNSSY 444

Query: 160 ---LKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVP---FVIASVHQSLSKF 213
              L+ L   +  E++ + +    +   S+ +KY    +  ++    F +  V+  L K 
Sbjct: 445 CPVLRQLESAAMKEYYFKADRPTTLSAGSSNLKYRNPKYLSMLNHLRFYLPEVYPKLDKI 504

Query: 214 IYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWV 270
           +++   ++V+  +  L  +DL+     A + C E   R + Y+N           S P +
Sbjct: 505 LFLDDDIVVQKDLTGLWDVDLNGKVNGAVETCGESFHRFDKYLN----------FSNPHI 554

Query: 271 SGKPYAVNSCMPDLGMLLIDARKLEKY-ILEAFLWWKKVINQRDRS---IGRSPAIALAL 326
           + + +  N+C    GM + D R+ +K  I   +  W+ +    DR+   +G  P   L  
Sbjct: 555 A-RNFDPNACGWAYGMNIFDLREWKKKDITGIYHKWQNL--NEDRALWKLGTLPPGLLTF 611

Query: 327 YDRYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPMTACSEFG 366
           Y     L  SW V           S ++ +  + Y+G M    E  
Sbjct: 612 YKLTHPLDKSWHVLGLGYNPSIDRSEIDSAAVVHYNGNMKPWLELA 657


>gi|357118841|ref|XP_003561157.1| PREDICTED: alpha-1,4-galacturonosyltransferase 1-like [Brachypodium
           distachyon]
          Length = 589

 Score = 40.8 bits (94), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 58/277 (20%), Positives = 110/277 (39%), Gaps = 53/277 (19%)

Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPK---GSEDM----------------VSFYK------ 159
           +V+VNST+  ++ P+   FHL   K   G+ +M                V  +K      
Sbjct: 291 SVVVNSTIVNAKEPEKHVFHLVTDKLNFGAMNMWFLLNPPGDATIHVENVDDFKWLNSSY 350

Query: 160 ---LKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVP---FVIASVHQSLSKF 213
              LK L   +  E++ + + +K +   S+ +KY    +  ++    F +  V+  L+K 
Sbjct: 351 CPVLKQLESAAMKEYYFKADRQKTLSAGSSNLKYRNPKYLSMLNHLRFYLPQVYPKLNKI 410

Query: 214 IYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWV 270
           +++   ++V+  +  L  +DL+     A + C E   R + Y+N           S P +
Sbjct: 411 LFLDDDIVVQKDLTGLWEVDLNGNVNGAVETCGESFHRFDKYLN----------FSNPNI 460

Query: 271 SGKPYAVNSCMPDLGMLLIDARKLEKYILEAFL--WWKKVINQRDRSIGRSPAIALALYD 328
           + + +  N+C    GM + D  + +K  +      W     N+    +G  P   +  Y 
Sbjct: 461 A-QNFDPNACGWAYGMNMFDLEEWKKKDITGIYHKWQNMNENRLLWKLGTLPPGLMTFYK 519

Query: 329 RYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPM 359
               L  SW V           S ++ +  I Y+G M
Sbjct: 520 LTHPLDKSWHVLGLGYNPTVEHSEIDSAAVIHYNGNM 556


>gi|293331885|ref|NP_001169332.1| uncharacterized protein LOC100383199 precursor [Zea mays]
 gi|224028751|gb|ACN33451.1| unknown [Zea mays]
 gi|413933401|gb|AFW67952.1| hypothetical protein ZEAMMB73_846223 [Zea mays]
          Length = 590

 Score = 40.8 bits (94), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 59/277 (21%), Positives = 109/277 (39%), Gaps = 53/277 (19%)

Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPK---GSEDM----------------VSFYK------ 159
           +V+VNST+  ++ P+   FHL   K   G+ +M                V  +K      
Sbjct: 292 SVVVNSTIMNAKEPEKHVFHLVTDKLNFGAMNMWFLLNPPGDATMHVENVDDFKWLNSSY 351

Query: 160 ---LKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVP---FVIASVHQSLSKF 213
              LK L   +  E++ + +  K +   S+ +KY    +  ++    F +  V+  L+K 
Sbjct: 352 CPVLKQLESAAMKEYYFKADRPKTLSAGSSNLKYRNPKYLSMLNHLRFYLPQVYPKLNKI 411

Query: 214 IYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWV 270
           +++   ++V+  +  L  +DL+     A + C E   R + Y+N           S P +
Sbjct: 412 LFLDDDIVVQRDLTGLWEVDLNGNVNGAVETCGESFHRFDKYLN----------FSNPNI 461

Query: 271 SGKPYAVNSCMPDLGMLLIDARKLEKYILEAFL--WWKKVINQRDRSIGRSPAIALALYD 328
           + + +  N+C    GM + D  + +K  +      W     N+    +G  P   L  Y 
Sbjct: 462 A-QNFDPNACGWAYGMNMFDLEEWKKKDITGIYHKWQNMNENRLLWKLGTLPPGLLTFYK 520

Query: 329 RYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPM 359
               L  SW V           S ++ +  I Y+G M
Sbjct: 521 LTHPLDKSWHVLGLGYNPTVERSEIDNAAVIHYNGNM 557


>gi|356522095|ref|XP_003529685.1| PREDICTED: probable galacturonosyltransferase 4-like [Glycine max]
          Length = 661

 Score = 40.8 bits (94), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 37/170 (21%), Positives = 72/170 (42%), Gaps = 22/170 (12%)

Query: 201 FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDV 257
           F +  +   L+K +++   ++V+  +  L  IDL      A + C E   R + Y+N   
Sbjct: 470 FYLPEIFPKLNKVLFLDDDIVVQKDLTGLWSIDLKGNVNGAVETCGERFHRFDRYLN--- 526

Query: 258 LDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEKY-ILEAFLWWKKVINQRDR-S 315
                   S P ++ K +   +C    GM + D  + ++  I E +  W+K+ + R    
Sbjct: 527 -------FSNPHIA-KNFDPRACGWAYGMNVFDLVQWKRQNITEVYHNWQKLNHDRQLWK 578

Query: 316 IGRSPAIALALYDRYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPM 359
           +G  P   +  + R  +L+ SW V       +     + ++  I Y+G M
Sbjct: 579 LGTLPPGLITFWKRTFQLNRSWHVLGLGYNPNINQKEIERAAVIHYNGNM 628


>gi|326502964|dbj|BAJ99110.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 591

 Score = 40.8 bits (94), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 58/277 (20%), Positives = 110/277 (39%), Gaps = 53/277 (19%)

Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPK---GSEDM----------------VSFYK------ 159
           +V+VNST+  ++ P+   FHL   K   G+ +M                V  +K      
Sbjct: 293 SVVVNSTIVNAKEPEKHVFHLVTDKLNFGAMNMWFLLNPPGDATIHVENVDDFKWLNSSY 352

Query: 160 ---LKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVP---FVIASVHQSLSKF 213
              LK L   +  E++ + + +K +   S+ +KY    +  ++    F +  V+  L+K 
Sbjct: 353 CPVLKQLESAAMKEYYFKADRQKTLSAGSSNLKYRNPKYLSMLNHLRFYLPQVYPKLNKI 412

Query: 214 IYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWV 270
           +++   ++V+  +  L  +DL+     A + C E   R + Y+N           S P +
Sbjct: 413 LFLDDDIVVQKDLTGLWEVDLNGNVNGAVETCGESFHRFDKYLN----------FSNPNI 462

Query: 271 SGKPYAVNSCMPDLGMLLIDARKLEKYILEAFL--WWKKVINQRDRSIGRSPAIALALYD 328
           S + +  N+C    GM + D  + +K  +      W     N+    +G  P   +  Y 
Sbjct: 463 S-QNFDPNACGWAYGMNMFDLEEWKKKDITGIYHKWQNMNENRLLWKLGTLPPGLMTFYK 521

Query: 329 RYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPM 359
               L  SW V           + ++ +  I Y+G M
Sbjct: 522 LTHPLDKSWHVLGLGYNPTVEHAEIDTAAVIHYNGNM 558


>gi|147846492|emb|CAN79512.1| hypothetical protein VITISV_014158 [Vitis vinifera]
          Length = 648

 Score = 40.8 bits (94), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 48/232 (20%), Positives = 94/232 (40%), Gaps = 50/232 (21%)

Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQEEVKKVIRTA 184
           +V++NST+  +  P+   FH+       D +SF  +K+ F                +  +
Sbjct: 391 SVVINSTMLXASEPEKHVFHIVT-----DKLSFAAMKMWF----------------LVNS 429

Query: 185 STGVKYSVQNFEEI-----------VPFVIASVHQSLSKFIYMSPSVIVKGRVEELIGID 233
              V   V+N ++            + F +  V+  L K +++   ++V+  +  L  +D
Sbjct: 430 PAKVTIQVENIDDFKNPKYLSMLNHLRFYLPEVYPKLEKILFLDDDIVVQKDLTPLWSLD 489

Query: 234 LSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLLID 290
           +     AA + C E   R + Y+N           S P +S + +  N+C    GM + D
Sbjct: 490 MQGMVNAAVETCKESFHRFDKYLN----------FSHPKIS-ENFDPNACGWAFGMNMFD 538

Query: 291 ARKLEKYILEAFLWWKKVINQRDRS---IGRSPAIALALYDRYLKLSSSWLV 339
            ++  K  +     + + +N+ DR+   +G  P   +  Y+    L  SW V
Sbjct: 539 LKEWRKRNMTGIYHYWQDMNE-DRTLWKLGSLPPGLITFYNLTYPLDRSWHV 589


>gi|212721098|ref|NP_001131608.1| uncharacterized protein LOC100192958 [Zea mays]
 gi|194692022|gb|ACF80095.1| unknown [Zea mays]
          Length = 228

 Score = 40.4 bits (93), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 44/213 (20%), Positives = 85/213 (39%), Gaps = 29/213 (13%)

Query: 170 EFHGQEEVKKVIRTASTGVKYSVQNFEEIVP---FVIASVHQSLSKFIYMSPSVIVKGRV 226
           E++ + +    +   S+ +KY    +  ++    F +  V+  L K +++   ++V+  +
Sbjct: 3   EYYFKADRPTTLSAGSSNLKYRNPKYLSMLNHLRFYLPEVYPKLDKILFLDDDIVVQKDL 62

Query: 227 EELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPD 283
             L  +DL+     A + C E   R + Y+N           S P ++ + +  N+C   
Sbjct: 63  TGLWDVDLNGKVNGAVETCGESFHRFDKYLN----------FSNPHIA-RNFDPNACGWA 111

Query: 284 LGMLLIDARKLEKY-ILEAFLWWKKVINQRDR---SIGRSPAIALALYDRYLKLSSSWLV 339
            GM + D R+ +K  I   +  W+ +    DR    +G  P   L  Y     L  SW V
Sbjct: 112 YGMNIFDLREWKKKDITGIYHKWQNM--NEDRVLWKLGTLPPGLLTFYKLTHPLDKSWHV 169

Query: 340 T------DSTSSVVNKSLAIRYDGPMTACSEFG 366
                      S ++ +  + Y+G M    E  
Sbjct: 170 LGLGYNPSIDRSEIDNAAVVHYNGNMKPWLELA 202


>gi|414872193|tpg|DAA50750.1| TPA: hypothetical protein ZEAMMB73_194342 [Zea mays]
          Length = 593

 Score = 40.4 bits (93), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 59/277 (21%), Positives = 109/277 (39%), Gaps = 53/277 (19%)

Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPK---GSEDM----------------VSFYK------ 159
           +V+VNST+  ++ P+   FHL   K   G+ +M                V  +K      
Sbjct: 295 SVVVNSTIMNAKEPEKHVFHLVTDKLNFGAMNMWFLLNPPGDATIHVENVDDFKWLNSSY 354

Query: 160 ---LKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVP---FVIASVHQSLSKF 213
              LK L   +  E++ + +  K +   S+ +KY    +  ++    F +  V+  L+K 
Sbjct: 355 CPVLKQLESAAMKEYYFKADRPKTLSAGSSNLKYRNPKYLSMLNHLRFYLPQVYPKLNKI 414

Query: 214 IYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWV 270
           +++   ++V+  +  L  +DL+     A + C E   R + Y+N           S P +
Sbjct: 415 LFLDDDIVVQRDLTGLWEVDLNGNVNGAVETCGESFHRFDKYLN----------FSNPNI 464

Query: 271 SGKPYAVNSCMPDLGMLLIDARKLEKYILEAFL--WWKKVINQRDRSIGRSPAIALALYD 328
           + + +  N+C    GM + D  + +K  +      W     N+    +G  P   L  Y 
Sbjct: 465 A-QNFDPNACGWAYGMNMFDLEEWKKKDITGIYHKWQNMNENRLLWKLGTLPPGLLTFYK 523

Query: 329 RYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPM 359
               L  SW V           S ++ +  I Y+G M
Sbjct: 524 LTHPLDKSWHVLGLGYNPTIERSEIDNAAVIHYNGNM 560


>gi|212275396|ref|NP_001130678.1| uncharacterized protein LOC100191781 precursor [Zea mays]
 gi|194688930|gb|ACF78549.1| unknown [Zea mays]
 gi|414872194|tpg|DAA50751.1| TPA: hypothetical protein ZEAMMB73_194342 [Zea mays]
          Length = 588

 Score = 40.4 bits (93), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 59/277 (21%), Positives = 109/277 (39%), Gaps = 53/277 (19%)

Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPK---GSEDM----------------VSFYK------ 159
           +V+VNST+  ++ P+   FHL   K   G+ +M                V  +K      
Sbjct: 290 SVVVNSTIMNAKEPEKHVFHLVTDKLNFGAMNMWFLLNPPGDATIHVENVDDFKWLNSSY 349

Query: 160 ---LKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVP---FVIASVHQSLSKF 213
              LK L   +  E++ + +  K +   S+ +KY    +  ++    F +  V+  L+K 
Sbjct: 350 CPVLKQLESAAMKEYYFKADRPKTLSAGSSNLKYRNPKYLSMLNHLRFYLPQVYPKLNKI 409

Query: 214 IYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWV 270
           +++   ++V+  +  L  +DL+     A + C E   R + Y+N           S P +
Sbjct: 410 LFLDDDIVVQRDLTGLWEVDLNGNVNGAVETCGESFHRFDKYLN----------FSNPNI 459

Query: 271 SGKPYAVNSCMPDLGMLLIDARKLEKYILEAFL--WWKKVINQRDRSIGRSPAIALALYD 328
           + + +  N+C    GM + D  + +K  +      W     N+    +G  P   L  Y 
Sbjct: 460 A-QNFDPNACGWAYGMNMFDLEEWKKKDITGIYHKWQNMNENRLLWKLGTLPPGLLTFYK 518

Query: 329 RYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPM 359
               L  SW V           S ++ +  I Y+G M
Sbjct: 519 LTHPLDKSWHVLGLGYNPTIERSEIDNAAVIHYNGNM 555


>gi|413933400|gb|AFW67951.1| hypothetical protein ZEAMMB73_846223 [Zea mays]
          Length = 507

 Score = 40.4 bits (93), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 59/277 (21%), Positives = 109/277 (39%), Gaps = 53/277 (19%)

Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPK---GSEDM----------------VSFYK------ 159
           +V+VNST+  ++ P+   FHL   K   G+ +M                V  +K      
Sbjct: 209 SVVVNSTIMNAKEPEKHVFHLVTDKLNFGAMNMWFLLNPPGDATMHVENVDDFKWLNSSY 268

Query: 160 ---LKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVP---FVIASVHQSLSKF 213
              LK L   +  E++ + +  K +   S+ +KY    +  ++    F +  V+  L+K 
Sbjct: 269 CPVLKQLESAAMKEYYFKADRPKTLSAGSSNLKYRNPKYLSMLNHLRFYLPQVYPKLNKI 328

Query: 214 IYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWV 270
           +++   ++V+  +  L  +DL+     A + C E   R + Y+N           S P +
Sbjct: 329 LFLDDDIVVQRDLTGLWEVDLNGNVNGAVETCGESFHRFDKYLN----------FSNPNI 378

Query: 271 SGKPYAVNSCMPDLGMLLIDARKLEKYILEAFL--WWKKVINQRDRSIGRSPAIALALYD 328
           + + +  N+C    GM + D  + +K  +      W     N+    +G  P   L  Y 
Sbjct: 379 A-QNFDPNACGWAYGMNMFDLEEWKKKDITGIYHKWQNMNENRLLWKLGTLPPGLLTFYK 437

Query: 329 RYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPM 359
               L  SW V           S ++ +  I Y+G M
Sbjct: 438 LTHPLDKSWHVLGLGYNPTVERSEIDNAAVIHYNGNM 474


>gi|414886435|tpg|DAA62449.1| TPA: hypothetical protein ZEAMMB73_004043 [Zea mays]
          Length = 683

 Score = 40.4 bits (93), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 60/286 (20%), Positives = 112/286 (39%), Gaps = 57/286 (19%)

Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPK---GSEDM----------------VSFYK------ 159
           +V+VNST+  ++ P+   FHL   K   G+ +M                V  +K      
Sbjct: 385 SVVVNSTIMNAKEPEKHVFHLVTDKLNFGAMNMWFLLNPPGKATIHVENVDEFKWLNSSY 444

Query: 160 ---LKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVP---FVIASVHQSLSKF 213
              L+ L   +  E++ + +    +   S+ +KY    +  ++    F +  V+  L K 
Sbjct: 445 CPVLRQLESAAMKEYYFKADRPTTLSAGSSNLKYRNPKYLSMLNHLRFYLPEVYPKLDKI 504

Query: 214 IYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWV 270
           +++   ++V+  +  L  +DL+     A + C E   R + Y+N           S P +
Sbjct: 505 LFLDDDIVVQKDLTGLWDVDLNGKVNGAVETCGESFHRFDKYLN----------FSNPHI 554

Query: 271 SGKPYAVNSCMPDLGMLLIDARKLEKY-ILEAFLWWKKVINQRDR---SIGRSPAIALAL 326
           + + +  N+C    GM + D R+ +K  I   +  W+ +    DR    +G  P   L  
Sbjct: 555 A-RNFDPNACGWAYGMNIFDLREWKKKDITGIYHKWQNM--NEDRVLWKLGTLPPGLLTF 611

Query: 327 YDRYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPMTACSEFG 366
           Y     L  SW V           S ++ +  + Y+G M    E  
Sbjct: 612 YKLTHPLDKSWHVLGLGYNPSIDRSEIDNAAVVHYNGNMKPWLELA 657


>gi|297844810|ref|XP_002890286.1| hypothetical protein ARALYDRAFT_312807 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297336128|gb|EFH66545.1| hypothetical protein ARALYDRAFT_312807 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 538

 Score = 40.4 bits (93), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 55/270 (20%), Positives = 101/270 (37%), Gaps = 48/270 (17%)

Query: 137 YPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQEE-----------VKKVIRTAS 185
           +P  L FH+   + S + +  + L   F  S +E    EE           VK+++ T +
Sbjct: 256 HPKQLVFHIVTNRVSYNAMQAWFLSNDFKGSAIEIRSIEEFSWLNASYSPVVKQLLDTDA 315

Query: 186 TGVKYSVQNFEEIVP-----------------FVIASVHQSLSKFIYMSPSVIVKGRVEE 228
               +  Q  ++ +                  F I  ++  L K I++   V+V+  +  
Sbjct: 316 RAYYFGEQTSQDTISEPKVRNPKYLSLLNHLRFYIPEIYPQLEKIIFLDDDVVVQKDLTP 375

Query: 229 LIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLL 288
           L  +DL      A + C E  + Y           + S P +S K +   +C    GM +
Sbjct: 376 LFSLDLHGNVNGAVETCLEAFHRYYK-------YLNFSNPLISSK-FDPQACGWAFGMNV 427

Query: 289 IDARKLEKYILEA-FLWWKKVINQRDRS---IGRSPAIALALYDRYLKLSSSWLV----- 339
            D     K  + A + +W++    R+R+   +G  P   L+ Y     L   W V     
Sbjct: 428 FDLIAWRKANVTARYHYWQE--QNRERTLWKLGTLPPGLLSFYGLTEPLDRRWHVLGLGY 485

Query: 340 -TDSTSSVVNKSLAIRYDGPMTACSEFGDG 368
             +  + ++  +  I Y+G M    + G G
Sbjct: 486 DVNIDNRLIETAAVIHYNGNMKPWLKLGIG 515


>gi|225436890|ref|XP_002271296.1| PREDICTED: probable galacturonosyltransferase-like 4 [Vitis
           vinifera]
          Length = 356

 Score = 40.4 bits (93), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 47/232 (20%), Positives = 91/232 (39%), Gaps = 25/232 (10%)

Query: 104 SQDDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVL 163
           S D   +H+V  +    LR     + S L  S  P+ + FH        D+  F  +   
Sbjct: 66  SSDADRIHVVMTLDANYLRGTIAALLSILQHSTCPENIDFHFLWSHFESDI--FSSINST 123

Query: 164 FPHSNLEFHG------QEEVKKVIRTA-STGVKYSVQNFEEIVPFVIASVHQSLSKFIYM 216
           FP  N + +       + ++ K IR A    + Y+     +I+P         + + IY+
Sbjct: 124 FPFLNFKVYRFDSNRVRGKISKSIRRALDQPLNYARIYLGDILP-------ADVRRVIYL 176

Query: 217 SPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYA 276
              +++   + +L G++L +  +AA + C     +Y          R+ ++ +   KP  
Sbjct: 177 DSDLVMVDDIAKLWGVELGDKVLAAPEYCHANFTNYFTSAFWS--DRALARTFDGRKP-- 232

Query: 277 VNSCMPDLGMLLIDARKLEK--YILEAFLWWKKVINQRDRSIGRSPAIALAL 326
              C  + G++++D  K  +  Y  +   W      +R   +G  P   L L
Sbjct: 233 ---CYFNTGVMVVDVEKWREGGYTKKVEEWMAVQKKKRIYQLGSLPPFLLVL 281


>gi|255566847|ref|XP_002524407.1| transferase, transferring glycosyl groups, putative [Ricinus
           communis]
 gi|223536368|gb|EEF38018.1| transferase, transferring glycosyl groups, putative [Ricinus
           communis]
          Length = 344

 Score = 40.4 bits (93), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 54/246 (21%), Positives = 92/246 (37%), Gaps = 35/246 (14%)

Query: 95  RNSGSISSISQDDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDM 154
           RN       +     +HI   +    LR     V S L  +  P+ + FH        ++
Sbjct: 45  RNGRECLQTTSSSTFIHIAMTLDSTYLRGSVAGVFSVLQHASCPENIIFHFIATHRRAEL 104

Query: 155 VSFYKLKVLFPHSNLE-FHGQEEVKK------VIRTASTGVKYSVQNFEEIVPFVIASVH 207
                + V FP+ N   +H   ++ K      V R     + Y+     +++P       
Sbjct: 105 RR--TITVTFPYLNFHLYHFDSDLVKGKISSSVRRALDQPLNYARFYLADLLP------- 155

Query: 208 QSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQ 267
            S+ + IY    +IV   V +L  I+L  + + A + C     +Y N           S+
Sbjct: 156 ASVPRIIYFDSDLIVVDDVAKLWNINLGVHVLGAPEYCHANFTNYFN-----------SR 204

Query: 268 PWVSGKPYAVN-----SCMPDLGMLLIDARKL-EKYILEAFLWWKKVINQ-RDRSIGRSP 320
            W S + YA +     +C  + G+++ID  K  E    E   +W KV  + R   +G  P
Sbjct: 205 FW-SNEGYAASFRERRACYFNTGVMVIDLMKWREGKYREKLEYWMKVQKKYRIYELGSLP 263

Query: 321 AIALAL 326
              L  
Sbjct: 264 PFLLVF 269


>gi|409894769|gb|AFV46220.1| hypothetical protein, partial [Scutellaria baicalensis]
          Length = 258

 Score = 40.4 bits (93), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 43/173 (24%), Positives = 73/173 (42%), Gaps = 28/173 (16%)

Query: 201 FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDC---SERLNSYVNPDV 257
           F +  ++  L K +++   V+V+  +  L  IDL      A + C     RL+ Y+N   
Sbjct: 67  FYMPEMYPKLHKILFLDDDVVVQKDLTALWKIDLDGKVNGAINTCFGSFHRLSEYLN--- 123

Query: 258 LDAIQRSASQPWVSGKPYAVNSCMPDLGMLL--IDARKLEKYILEAFLWWKKVINQRDRS 315
                   S P +  K +   +C   LGM +  +DA +LEK   E + +W+ +    DR+
Sbjct: 124 -------FSHPLIKEK-FNPRACAWALGMNIFDLDAWRLEK-CTEQYHYWQNL--NEDRT 172

Query: 316 I---GRSPAIALALYDRYLKLSSSWLVTDSTSS------VVNKSLAIRYDGPM 359
           +   G  PA  +  Y     L  +W V    S+       + K+  I + G M
Sbjct: 173 LWKMGTLPAGLVTFYSTTKPLDKAWHVLGLGSNPSISMEEIEKAAVIHFSGDM 225


>gi|357486289|ref|XP_003613432.1| Glycosyltransferase CAZy family GT8 [Medicago truncatula]
 gi|355514767|gb|AES96390.1| Glycosyltransferase CAZy family GT8 [Medicago truncatula]
          Length = 395

 Score = 40.0 bits (92), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 60/321 (18%), Positives = 115/321 (35%), Gaps = 18/321 (5%)

Query: 56  LGLLQFLPATHFRHPSDPFRIWVPFNSNTSFSKTRDSGDRNSGSISSISQDDGMVHIVSW 115
           LG+  F       H   PFR    F+    +        + S ++   S D  +VHI   
Sbjct: 25  LGIRSFPTTNTAEHSPTPFRFSGAFSEAPEYRNGAGCPKQKSKTLFPSSCDPSLVHIAMT 84

Query: 116 MQCLDLRLLAVLVNSTLSGSRYPDLLHFHL----FVPKGSEDMVSFYKLKVLFPHSNLEF 171
           +    LR     V+S L  S  P+ + FH     F P     +     +  +FP  N + 
Sbjct: 85  LDSGYLRGSIAAVHSVLRHSSCPENIFFHFISAEFDPTTPRTLTRL--VASVFPSLNFKV 142

Query: 172 HGQEEVKKVIRTASTGVKYSVQN-FEEIVPFVIASVHQSLSKFIYMSPSVIVKGRVEELI 230
           +   E   VI   S+ ++ +++N       ++   +   + + IY+   ++V   + +L 
Sbjct: 143 YIFRE-DTVINLISSSIRLALENPLNYARNYLGDMLDTCVERVIYLDSDIVVVDDISKLW 201

Query: 231 GIDL-SNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLLI 289
            + + +   I A + C      Y   +  +        P +S    A  +C  + G++++
Sbjct: 202 SVKMDAKKVIGAPEYCHANFTKYFTDEFWN-------DPLLSRVFKARKACYFNTGVMVM 254

Query: 290 DARKLEK--YILEAFLWWKKVINQRDRSIGRSPAIALALYDRYLKLSSSWLVTDSTSSVV 347
           D  K  +  Y  +   W +    +R   +G  P   L        +   W         +
Sbjct: 255 DLMKWREGNYRRKIENWMELQKKRRIYELGSLPPFLLVFAGNVEAIDHRWNQHGLGGDNL 314

Query: 348 NKSLAIRYDGPMTACSEFGDG 368
           N      + GP++     G G
Sbjct: 315 NGVCRSLHPGPVSLLHWSGKG 335


>gi|413933398|gb|AFW67949.1| hypothetical protein ZEAMMB73_846223 [Zea mays]
          Length = 269

 Score = 40.0 bits (92), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 43/209 (20%), Positives = 83/209 (39%), Gaps = 25/209 (11%)

Query: 170 EFHGQEEVKKVIRTASTGVKYSVQNFEEIVP---FVIASVHQSLSKFIYMSPSVIVKGRV 226
           E++ + +  K +   S+ +KY    +  ++    F +  V+  L+K +++   ++V+  +
Sbjct: 44  EYYFKADRPKTLSAGSSNLKYRNPKYLSMLNHLRFYLPQVYPKLNKILFLDDDIVVQRDL 103

Query: 227 EELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPD 283
             L  +DL+     A + C E   R + Y+N           S P ++ + +  N+C   
Sbjct: 104 TGLWEVDLNGNVNGAVETCGESFHRFDKYLN----------FSNPNIA-QNFDPNACGWA 152

Query: 284 LGMLLIDARKLEKYILEAFL--WWKKVINQRDRSIGRSPAIALALYDRYLKLSSSWLVTD 341
            GM + D  + +K  +      W     N+    +G  P   L  Y     L  SW V  
Sbjct: 153 YGMNMFDLEEWKKKDITGIYHKWQNMNENRLLWKLGTLPPGLLTFYKLTHPLDKSWHVLG 212

Query: 342 ------STSSVVNKSLAIRYDGPMTACSE 364
                    S ++ +  I Y+G M    E
Sbjct: 213 LGYNPTVERSEIDNAAVIHYNGNMKPWLE 241


>gi|357151818|ref|XP_003575914.1| PREDICTED: probable galacturonosyltransferase 6-like [Brachypodium
           distachyon]
          Length = 536

 Score = 40.0 bits (92), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 62/286 (21%), Positives = 111/286 (38%), Gaps = 51/286 (17%)

Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPH-----SNLEFHGQEEVKK 179
           AV++NST++ S+ P  + FH+       D +SF  + + F       + ++    +E K 
Sbjct: 261 AVVINSTIAASKDPKRIMFHVVA-----DALSFPAMMMWFLSNPPSPATIQIENLDEFKW 315

Query: 180 V-----IRTASTGVK----YSVQNFEEIVPFVIASVHQSLSKFIYMSPSVIVKGRVEELI 230
           +      R    G++     S  N    + F +  V  SLSK + +   V+V+  +  L 
Sbjct: 316 LPSDFSSRFKQKGIRDPRYTSALNH---LRFYLPQVFPSLSKVLLLDHDVVVQKDLSGLW 372

Query: 231 GIDLSNYAIAAADDCSE-----RLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLG 285
            ID+ +    A + C+      RL ++VN           S P +  K +   +C+   G
Sbjct: 373 EIDMKHKVNGALETCTSGYGYLRLENFVN----------FSDPSIFNK-FNAKACIYAFG 421

Query: 286 MLLIDARKLEKYILEAFL--WWKKVINQRDRSIGRSPAIALALYDRYLKLSSSWLV---- 339
           M + D  +     L A    W++    +R    G  P   L  Y++ + L + W V    
Sbjct: 422 MNIFDLTEWRNKGLTATYDKWFQMGKRRRLWKAGSLPLGQLVFYNQTVPLDNRWHVLGLG 481

Query: 340 --TDSTSSVVNKSLAIRYDGPMTACSEFGDGANMEPARGDLWKQHL 383
             ++     +  +  I Y G +    E        P   D W + L
Sbjct: 482 RDSNMEREEIESAAVIHYSGNLKPWLEISI-----PKYRDYWNRFL 522


>gi|145356270|ref|XP_001422356.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144582597|gb|ABP00673.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 259

 Score = 39.7 bits (91), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 63/266 (23%), Positives = 113/266 (42%), Gaps = 32/266 (12%)

Query: 109 MVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGS----EDMVSFYKLKVLF 164
           +VHI        L  L  +++S LS +  P  + FH+F  + +       ++ Y   + F
Sbjct: 2   LVHIAFACDPTQLFTLGPVISSVLSATASPHRIRFHIFTARDALTDASVQLNCYSRAIPF 61

Query: 165 PHSNLEFHGQEEVKKVIR---TASTGVKYSVQNFEEIVPFVIASVHQSLSKFIYMSPSVI 221
                E H  E  K +IR   T  +  ++ +QN      F  A +   + K +Y+   +I
Sbjct: 62  I---WELH--EFSKDMIRANITVHSRKEWRLQNAFNYARFYFAEILSDVQKVVYLDTDII 116

Query: 222 VKGRVEELIGIDL--SNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNS 279
           VKG +  L   +L  S+ ++ AA   S  L S +N    +A  +S+    +  K ++ N+
Sbjct: 117 VKGDICRLHDANLRSSSTSVIAAVKRSVPLGSLLNFS--NAAVKSSG---LREKMHSFNA 171

Query: 280 CMPDLGMLLIDARKLEKYILEAFL--WWKKVINQRDRSIGRSPAIALALYDRYLKLSSSW 337
                G+LLID     +  + + +  W K     +  S G  P + L   D +  + S W
Sbjct: 172 -----GVLLIDLESWRRKRITSTVETWLKMNSVSKLYSHGSQPPLLLVFGDSFESIPSHW 226

Query: 338 LVTDS------TSSVVNKSLAIRYDG 357
            V          +SV+N++  + + G
Sbjct: 227 NVDGVGYKKGLRASVLNEARVLHWSG 252


>gi|194689812|gb|ACF78990.1| unknown [Zea mays]
 gi|414872195|tpg|DAA50752.1| TPA: hypothetical protein ZEAMMB73_194342 [Zea mays]
          Length = 291

 Score = 39.7 bits (91), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 43/209 (20%), Positives = 83/209 (39%), Gaps = 25/209 (11%)

Query: 170 EFHGQEEVKKVIRTASTGVKYSVQNFEEIVP---FVIASVHQSLSKFIYMSPSVIVKGRV 226
           E++ + +  K +   S+ +KY    +  ++    F +  V+  L+K +++   ++V+  +
Sbjct: 66  EYYFKADRPKTLSAGSSNLKYRNPKYLSMLNHLRFYLPQVYPKLNKILFLDDDIVVQRDL 125

Query: 227 EELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPD 283
             L  +DL+     A + C E   R + Y+N           S P ++ + +  N+C   
Sbjct: 126 TGLWEVDLNGNVNGAVETCGESFHRFDKYLN----------FSNPNIA-QNFDPNACGWA 174

Query: 284 LGMLLIDARKLEKYILEAFL--WWKKVINQRDRSIGRSPAIALALYDRYLKLSSSWLVTD 341
            GM + D  + +K  +      W     N+    +G  P   L  Y     L  SW V  
Sbjct: 175 YGMNMFDLEEWKKKDITGIYHKWQNMNENRLLWKLGTLPPGLLTFYKLTHPLDKSWHVLG 234

Query: 342 ------STSSVVNKSLAIRYDGPMTACSE 364
                    S ++ +  I Y+G M    E
Sbjct: 235 LGYNPTIERSEIDNAAVIHYNGNMKPWLE 263


>gi|242097150|ref|XP_002439065.1| hypothetical protein SORBIDRAFT_10g030870 [Sorghum bicolor]
 gi|241917288|gb|EER90432.1| hypothetical protein SORBIDRAFT_10g030870 [Sorghum bicolor]
          Length = 697

 Score = 39.7 bits (91), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 33/144 (22%), Positives = 63/144 (43%), Gaps = 16/144 (11%)

Query: 201 FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDV 257
           F +  +H  L K +++   V+V+  +  L  IDL      A + C E   R ++Y+N   
Sbjct: 505 FYMPEIHPKLEKILFLDDDVVVQKDLTPLWDIDLKGMVNGAVETCKESFHRFDTYLN--- 561

Query: 258 LDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEKY-ILEAFLWWKKVINQRDR-S 315
                   S P +S + +  ++C    GM + D ++ +K  I   + +W+ +   R    
Sbjct: 562 -------FSHPKIS-ENFDPHACGWAFGMNMFDLKEWKKRNITGIYHYWQDLNEDRKLWK 613

Query: 316 IGRSPAIALALYDRYLKLSSSWLV 339
           +G  P   +  Y+    L+ +W V
Sbjct: 614 LGTLPPGLITFYNLTYPLNRTWHV 637


>gi|356570614|ref|XP_003553480.1| PREDICTED: probable galacturonosyltransferase 6-like [Glycine max]
          Length = 625

 Score = 39.7 bits (91), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 60/256 (23%), Positives = 107/256 (41%), Gaps = 29/256 (11%)

Query: 120 DLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQE---- 175
           +L   AV+VNST+S ++  + L FH  V   S +  + +   +L P      H Q     
Sbjct: 346 NLLACAVVVNSTVSNAKKKEKLVFH--VVTNSLNFPAIWMWFLLNPPGKATVHIQSIENF 403

Query: 176 ----EVKKVIRTASTGVKYSVQ-NFEEIVPFVIASVHQSLSKFIYMSPSVIVKGRVEELI 230
                     +  S+  +Y+ + N+   + F +  +  +L+K +     V+V+  +  L 
Sbjct: 404 EWLPMYNTFNKHNSSDPRYTSELNY---LRFYLPDIFPTLNKILLFDHDVVVQQDLSGLW 460

Query: 231 GIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLLID 290
             +L    IAA   C E   S+   D+L       S P+++ + +  N+C    GM L D
Sbjct: 461 NANLKGKVIAAVGTCQEGGTSFHRMDMLINF----SDPFIAER-FDANACTWAFGMNLFD 515

Query: 291 ARKLEKYILEAFLWWKKVINQRDR---SIGRSPAIALALYDRYLKLSSSW----LVTDS- 342
            ++  ++ L   L+ + +     R   +IG  P   L  Y++   L   W    L  DS 
Sbjct: 516 LQQWRRHNLTT-LYHRYLQMGSKRPLWNIGSLPLGWLTFYNKTKVLDRRWHILGLGYDSG 574

Query: 343 -TSSVVNKSLAIRYDG 357
              + +  +  I YDG
Sbjct: 575 VDKNEIEGAAVIHYDG 590


>gi|218198875|gb|EEC81302.1| hypothetical protein OsI_24438 [Oryza sativa Indica Group]
 gi|222636212|gb|EEE66344.1| hypothetical protein OsJ_22634 [Oryza sativa Japonica Group]
          Length = 588

 Score = 39.7 bits (91), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 59/277 (21%), Positives = 109/277 (39%), Gaps = 53/277 (19%)

Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPK---GSEDM----------------VSFYK------ 159
           +V+VNST+  ++ P+   FHL   K   G+ +M                V  +K      
Sbjct: 290 SVVVNSTIMNAKEPEKHVFHLVTDKLNFGAMNMWFLLNPPGDATIHVENVDDFKWLNSSY 349

Query: 160 ---LKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVP---FVIASVHQSLSKF 213
              LK L   +  E++ + +  K +   S+ +KY    +  ++    F +  V+  L+K 
Sbjct: 350 CPVLKQLESVAMKEYYFKADRPKTLSAGSSNLKYRNPKYLSMLNHLRFYLPQVYPKLNKI 409

Query: 214 IYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWV 270
           +++   ++V+  +  L  +DL+     A + C E   R + Y+N           S P +
Sbjct: 410 LFLDDDIVVQKDLTGLWEVDLNGNVNGAVETCGESFHRFDKYLN----------FSNPNI 459

Query: 271 SGKPYAVNSCMPDLGMLLIDARKLEKYILEAFL--WWKKVINQRDRSIGRSPAIALALYD 328
           + + +  N+C    GM + D  + +K  +      W     N+    +G  P   L  Y 
Sbjct: 460 A-QNFDPNACGWAYGMNMFDLEEWKKKDITGIYHKWQNMNENRLLWKLGTLPPGLLTFYK 518

Query: 329 RYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPM 359
               L  SW V           S ++ +  I Y+G M
Sbjct: 519 LTHPLDKSWHVLGLGYNPSIERSEIDNAAVIHYNGNM 555


>gi|356532892|ref|XP_003535003.1| PREDICTED: probable galacturonosyltransferase 4-like isoform 2
           [Glycine max]
          Length = 663

 Score = 39.7 bits (91), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 47/222 (21%), Positives = 91/222 (40%), Gaps = 39/222 (17%)

Query: 150 GSEDMVSFY-KLKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVPFVIASVHQ 208
           GS+ M+ +Y K   +   SNL+F   + +             S+ N    + F +  +  
Sbjct: 436 GSQSMIDYYFKAHRVTSDSNLKFRNPKYL-------------SILNH---LRFYLPEIFP 479

Query: 209 SLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSA 265
            L+K +++   ++V+  + +L  IDL      A + C E   R + Y+N           
Sbjct: 480 KLNKVLFLDDDIVVQKDLTDLWSIDLKGNVNGAVETCGESFHRFDRYLN----------F 529

Query: 266 SQPWVSGKPYAVNSCMPDLGMLLID-ARKLEKYILEAFLWWKKVINQRDR-SIGRSPAIA 323
           S P ++ K +  ++C    GM + D A    + I E +  W+ + + R    +G  P   
Sbjct: 530 SNPLIA-KNFDPHACGWAYGMNVFDLAEWKRQNITEVYHNWQNLNHDRQLWKLGTLPPGL 588

Query: 324 LALYDRYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPM 359
           +  + R   L+ SW +       +     + +S  + Y+G M
Sbjct: 589 ITFWKRTFPLNRSWHILGLGYNPNVNQRDIEQSAVVHYNGNM 630


>gi|356532890|ref|XP_003535002.1| PREDICTED: probable galacturonosyltransferase 4-like isoform 1
           [Glycine max]
          Length = 657

 Score = 39.3 bits (90), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 47/222 (21%), Positives = 91/222 (40%), Gaps = 39/222 (17%)

Query: 150 GSEDMVSFY-KLKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVPFVIASVHQ 208
           GS+ M+ +Y K   +   SNL+F   + +             S+ N    + F +  +  
Sbjct: 430 GSQSMIDYYFKAHRVTSDSNLKFRNPKYL-------------SILNH---LRFYLPEIFP 473

Query: 209 SLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSA 265
            L+K +++   ++V+  + +L  IDL      A + C E   R + Y+N           
Sbjct: 474 KLNKVLFLDDDIVVQKDLTDLWSIDLKGNVNGAVETCGESFHRFDRYLN----------F 523

Query: 266 SQPWVSGKPYAVNSCMPDLGMLLID-ARKLEKYILEAFLWWKKVINQRDR-SIGRSPAIA 323
           S P ++ K +  ++C    GM + D A    + I E +  W+ + + R    +G  P   
Sbjct: 524 SNPLIA-KNFDPHACGWAYGMNVFDLAEWKRQNITEVYHNWQNLNHDRQLWKLGTLPPGL 582

Query: 324 LALYDRYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPM 359
           +  + R   L+ SW +       +     + +S  + Y+G M
Sbjct: 583 ITFWKRTFPLNRSWHILGLGYNPNVNQRDIEQSAVVHYNGNM 624


>gi|297809943|ref|XP_002872855.1| hypothetical protein ARALYDRAFT_490360 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297318692|gb|EFH49114.1| hypothetical protein ARALYDRAFT_490360 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 346

 Score = 39.3 bits (90), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 70/327 (21%), Positives = 122/327 (37%), Gaps = 29/327 (8%)

Query: 49  ALGLIVFLGLLQ-FLPATHFRHPSDPFRIWVPFNSNTSFSKTRDSGDR--NSGSISSISQ 105
           A+ +IV    LQ F PA   R    P     P   N        +GD   +SG + + S 
Sbjct: 14  AMAVIVLSPSLQSFPPAAAIRSSPSPIFRKAPVVFN--------NGDECLSSGGVCNPS- 64

Query: 106 DDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFP 165
              +VH+   +    LR     VNS L  S  P+ + FH  V     +++    ++ +FP
Sbjct: 65  ---LVHVAITLDVEYLRGSIAAVNSILQHSICPESVFFHFIVVSEETNLLESL-VRSIFP 120

Query: 166 HSNLEFHGQEEVKKVIR-TASTGVKYSV-QNFEEIVPFVIASVHQSLSKFIYMSPSVIVK 223
              L+F+  +   + +R   S+ V+ ++ Q       ++   +   +S+ IY+   ++V 
Sbjct: 121 --GLKFNIYDFAPETVRGLISSSVRQALEQPLNYARNYLADLLEPCVSRVIYLDSDLVVV 178

Query: 224 GRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPD 283
             + +L    L +  I A + C      Y             S+   SG       C  +
Sbjct: 179 DDIAKLWKTSLGSRIIGAPEYCHANFTKYFTGGFW-------SEERFSGAFRGRKPCYFN 231

Query: 284 LGMLLIDARKLEK--YILEAFLWWKKVINQRDRSIGRSPAIALALYDRYLKLSSSWLVTD 341
            G+++ID +K  +  Y      W +    +R   +G  P   L        +S  W    
Sbjct: 232 TGVMVIDLKKWRRGGYTKRIEKWMEIQRTERIYDLGSLPPFLLVFAGHVAPISHRWNQHG 291

Query: 342 STSSVVNKSLAIRYDGPMTACSEFGDG 368
                V  S    + GP++     G G
Sbjct: 292 LGGDNVRGSCRDLHPGPVSLLHWSGSG 318


>gi|224124762|ref|XP_002329942.1| glycosyltransferase, CAZy family GT8 [Populus trichocarpa]
 gi|222871964|gb|EEF09095.1| glycosyltransferase, CAZy family GT8 [Populus trichocarpa]
          Length = 665

 Score = 39.3 bits (90), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 37/177 (20%), Positives = 73/177 (41%), Gaps = 22/177 (12%)

Query: 201 FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDV 257
           F +  +   L+K +++   ++V+  +  L  +DL      A + C E   R + Y+N   
Sbjct: 474 FYLPEIFPKLNKVLFLDDDIVVQKDLTGLWSLDLKGNVNGAVETCGENFHRFDRYLN--- 530

Query: 258 LDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEKY-ILEAFLWWKKVINQRDR-S 315
                   S P +S K +   +C    GM + D ++ ++  I + +  W+K+ + R    
Sbjct: 531 -------FSNPHIS-KNFDPRACGWAYGMNIFDLKEWKRQNITDVYHTWQKLNHDRQLWK 582

Query: 316 IGRSPAIALALYDRYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPMTACSEFG 366
           +G  P   +  + R   L   W V       + +   + ++  I Y+G M    E G
Sbjct: 583 LGTLPPGLITFWKRTHPLDRRWHVLGLGYNPNVSQREIERAAVIHYNGNMKPWLEIG 639


>gi|356520019|ref|XP_003528664.1| PREDICTED: probable galacturonosyltransferase-like 4-like [Glycine
           max]
          Length = 350

 Score = 39.3 bits (90), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 46/224 (20%), Positives = 91/224 (40%), Gaps = 19/224 (8%)

Query: 121 LRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQEEVKKV 180
           LR     V S L  S  P+ L FH F+    +    F  ++  FP+ N++ + + +  +V
Sbjct: 75  LRGTMAAVLSILQHSTCPENLAFH-FLSAHDDAPELFSSIRSTFPYLNMKIY-RFDSNRV 132

Query: 181 IRTASTGVKYSVQ---NFEEIVPFVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNY 237
               S  ++ ++    N+  I  ++  ++ + + + IY    ++V   + +L G+D+   
Sbjct: 133 RGKISKSIRQALDQPLNYARI--YLADTIPEDVKRVIYFDSDLVVVDDIAKLWGVDMEGK 190

Query: 238 AIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLL--IDARKLE 295
            +AA + C      Y   +         S P ++        C  + G+++  +D  + E
Sbjct: 191 LVAAPEYCHANFTLYFTDNFW-------SDPVLAKTFEGRKPCYFNTGVMVMDVDTWRKE 243

Query: 296 KYILEAFLWWKKVINQRDR--SIGRSPAIALALYDRYLKLSSSW 337
           +Y  E    W  V  Q+ R   +G  P   L L      +   W
Sbjct: 244 RYT-EKVEEWMAVQKQQKRIYHLGSLPPFLLVLAGNIKAVDHRW 286


>gi|326437108|gb|EGD82678.1| hypothetical protein PTSG_03339 [Salpingoeca sp. ATCC 50818]
          Length = 336

 Score = 39.3 bits (90), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 17/42 (40%), Positives = 26/42 (61%)

Query: 211 SKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSY 252
           S+FIY+   VIV+G + EL  +D+ +  IA + DCS+    Y
Sbjct: 151 SRFIYLDDDVIVQGDILELWEVDMLSRGIAVSTDCSDTAQQY 192


>gi|291527266|emb|CBK92852.1| Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases
           [Eubacterium rectale M104/1]
          Length = 723

 Score = 39.3 bits (90), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 21/64 (32%), Positives = 35/64 (54%), Gaps = 1/64 (1%)

Query: 209 SLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQP 268
            L K IY+   + V   +EEL  +++ NY +AAA DCS  + ++  P  + A Q S  + 
Sbjct: 440 DLKKIIYLDSDLFVNTDIEELWNLNIDNYCLAAAQDCS-TIRNWGTPYAVAAGQTSRDRY 498

Query: 269 WVSG 272
           + +G
Sbjct: 499 FNAG 502


>gi|288800023|ref|ZP_06405482.1| lipopolysaccharide 1,2-glucosyltransferase/general stress protein
           [Prevotella sp. oral taxon 299 str. F0039]
 gi|288333271|gb|EFC71750.1| lipopolysaccharide 1,2-glucosyltransferase/general stress protein
           [Prevotella sp. oral taxon 299 str. F0039]
          Length = 309

 Score = 39.3 bits (90), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 24/107 (22%), Positives = 51/107 (47%), Gaps = 2/107 (1%)

Query: 143 FHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVPFV 202
           FHL + K  +D       + L  +++++F+   +      T   G   ++  +  +  F+
Sbjct: 33  FHLIIDKSVDDKCKNKLQQCLVGNTSIQFYHFNDSILTKYTIGKGHHLTIATYYRL--FI 90

Query: 203 IASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERL 249
              + Q++S+ IY+   +IV+  + +L   DL+N A+ A  D  E +
Sbjct: 91  CNLLPQNISRIIYLDCDLIVRHPLNKLWNTDLTNKALGAVTDMGEAM 137


>gi|238924869|ref|YP_002938385.1| Glycosyltransferase Family 2 modular protein [Eubacterium rectale
           ATCC 33656]
 gi|238876544|gb|ACR76251.1| Glycosyltransferase Family 2 modular protein [Eubacterium rectale
           ATCC 33656]
          Length = 723

 Score = 39.3 bits (90), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 21/63 (33%), Positives = 35/63 (55%), Gaps = 1/63 (1%)

Query: 210 LSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPW 269
           L K IY+   + V   +EEL  +++ NY +AAA DCS  + ++  P  + A Q S  + +
Sbjct: 441 LKKIIYLDSDLFVNTDIEELWNLNIDNYCLAAAQDCS-TIRNWGTPYAVAAGQTSRDRYF 499

Query: 270 VSG 272
            +G
Sbjct: 500 NAG 502


>gi|296125750|ref|YP_003633002.1| glycosyl transferase [Brachyspira murdochii DSM 12563]
 gi|296017566|gb|ADG70803.1| glycosyl transferase family 8 [Brachyspira murdochii DSM 12563]
          Length = 331

 Score = 38.9 bits (89), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 33/143 (23%), Positives = 67/143 (46%), Gaps = 15/143 (10%)

Query: 122 RLLAVLVNSTLSGSRYPDLLHFHLF----VPKGSEDMVSFYKLKVLFPHSNLEFH--GQE 175
           + +   + S LS S+  + ++FHL       +    ++S  K+K    + +++F+    E
Sbjct: 15  KYMGTTIASILSNSKEDEEIYFHLLDGGITEENKNKLLSLKKIK----NCDMKFYVPDFE 70

Query: 176 EVKKVIRTASTGVKYSVQNFEEIVPFVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLS 235
           + KK     +   ++S   F  +   ++      + K +Y+    IV+  ++EL  ID++
Sbjct: 71  KYKKWYELGNYNGRFSPAMFYRLDIHILMP---DIDKVLYLDCDTIVRSSLKELFEIDIN 127

Query: 236 NYAIAAADDCSERLNSYVNPDVL 258
           NY    AD+   R N+Y N  V+
Sbjct: 128 NYYALVADNA--RNNNYFNSGVI 148


>gi|291526000|emb|CBK91587.1| Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases
           [Eubacterium rectale DSM 17629]
          Length = 723

 Score = 38.9 bits (89), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 16/38 (42%), Positives = 23/38 (60%)

Query: 209 SLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCS 246
            L K IY+   + V   +EEL  +++ NY +AAA DCS
Sbjct: 440 DLKKIIYLDSDLFVNTDIEELWNLNIDNYCLAAAQDCS 477


>gi|356564442|ref|XP_003550463.1| PREDICTED: uncharacterized protein LOC100794732 [Glycine max]
          Length = 1469

 Score = 38.9 bits (89), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 36/170 (21%), Positives = 71/170 (41%), Gaps = 22/170 (12%)

Query: 201  FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDV 257
            F +  +   L+K +++   ++V+  +  L  IDL      A + C E   R + Y+N   
Sbjct: 1278 FYLPEIFPKLNKVLFLDDDIVVQKDLTGLWSIDLKGNVNGAVETCGERFHRFDRYLN--- 1334

Query: 258  LDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEKY-ILEAFLWWKKVINQRDR-S 315
                    S P ++ K +   +C    GM + D  + ++  I + +  W+K+ + R    
Sbjct: 1335 -------FSNPLIA-KNFDPRACGWAYGMNVFDLVQWKRQNITDVYHKWQKMNHDRQLWK 1386

Query: 316  IGRSPAIALALYDRYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPM 359
            +G  P   +  + R  +L  SW V       +     + ++  I Y+G M
Sbjct: 1387 LGTLPPGLITFWKRTFQLHRSWHVLGLGYNPNINQKEIERAAVIHYNGNM 1436


>gi|413935035|gb|AFW69586.1| hypothetical protein ZEAMMB73_786543 [Zea mays]
          Length = 693

 Score = 38.9 bits (89), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 32/144 (22%), Positives = 63/144 (43%), Gaps = 16/144 (11%)

Query: 201 FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDV 257
           F +  +H  L K +++   V+V+  +  L  +DL      A + C E   R ++Y+N   
Sbjct: 501 FYMPEIHPKLEKILFLDDDVVVQKDLTPLWDVDLKGMVNGAVETCKESFHRFDTYLN--- 557

Query: 258 LDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEKY-ILEAFLWWKKVINQRDR-S 315
                   S P +S + +  ++C    GM + D ++ +K  I   + +W+ +   R    
Sbjct: 558 -------FSHPKIS-ENFDPHACGWAFGMNMFDLKEWKKRNITGIYHYWQDLNEDRKLWK 609

Query: 316 IGRSPAIALALYDRYLKLSSSWLV 339
           +G  P   +  Y+    L+ +W V
Sbjct: 610 LGTLPPGLITFYNLTYPLNRTWHV 633


>gi|319639377|ref|ZP_07994128.1| glycosyl transferase family 8 [Neisseria mucosa C102]
 gi|317399561|gb|EFV80231.1| glycosyl transferase family 8 [Neisseria mucosa C102]
          Length = 342

 Score = 38.9 bits (89), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 37/141 (26%), Positives = 62/141 (43%), Gaps = 11/141 (7%)

Query: 111 HIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSE--DMVSFYKLKVLFPHSN 168
           HIV       +R +   + S L  + +P  +HFHL V  GSE  D   F  L+ + P+  
Sbjct: 25  HIVYAADQNYIRHIGTSMLSVLENNHFP--IHFHLLV-SGSESYDFSIFNHLETINPNYA 81

Query: 169 LE-FHGQEEVKKVIRTASTGVKYSVQNFEEIVPFVIASVHQSLSKFIYMSPSVIVKGRVE 227
           +  +H   E    ++T   G       +   +P ++  +  +    +Y+   V+  G + 
Sbjct: 82  VTVYHLNTEYFTTLQT--NGYFTIAMYYRMCIPAILGGISDTA---LYLDTDVLCLGDIS 136

Query: 228 ELIGIDLSNYAIAAADDCSER 248
           EL  IDL+   +AAA D   R
Sbjct: 137 ELFTIDLTGQLVAAAPDTMFR 157


>gi|255567642|ref|XP_002524800.1| transferase, transferring glycosyl groups, putative [Ricinus
           communis]
 gi|223535984|gb|EEF37643.1| transferase, transferring glycosyl groups, putative [Ricinus
           communis]
          Length = 350

 Score = 38.9 bits (89), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 46/226 (20%), Positives = 88/226 (38%), Gaps = 25/226 (11%)

Query: 110 VHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNL 169
           +H+   +    LR     V S L  S  P+ + FH        ++  F  +K  FP+ N 
Sbjct: 66  IHVTMTLDANYLRGTMAAVLSILQHSTCPENVEFHFLWAHFEPEV--FSNIKSTFPYLNF 123

Query: 170 EFHG------QEEVKKVIRTA-STGVKYSVQNFEEIVPFVIASVHQSLSKFIYMSPSVIV 222
           + +       + ++ K IR A    + Y+     +I+P         +++ IY+   ++V
Sbjct: 124 KIYRFDSNRVRGKISKSIRQALDQPLNYARIYLADILPL-------DVNRVIYLDSDLVV 176

Query: 223 KGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMP 282
              + +L  +DL    +AA + C     +Y        ++ + +     GK      C  
Sbjct: 177 VDDIAKLWQVDLEGKVLAAPEYCHANFTNYFTELFWSNLEWAKT---FEGK----RPCYF 229

Query: 283 DLGMLLIDARKLEK--YILEAFLWWKKVINQRDRSIGRSPAIALAL 326
           + G++++D  K     Y  +   W     +QR   +G  P   L L
Sbjct: 230 NTGVMVVDVDKWRTGGYTQKVEEWMMVQKHQRLYDLGSLPPFLLVL 275


>gi|326494626|dbj|BAJ94432.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 538

 Score = 38.9 bits (89), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 63/274 (22%), Positives = 110/274 (40%), Gaps = 53/274 (19%)

Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQEE-------- 176
           +V+VNST+S + +P  L FH+   + +   +S + L   F    +E H  +E        
Sbjct: 247 SVVVNSTVSNANHPQQLVFHVVTDRINFGAMSTWFLINDFKGCTVEVHCIDEFSWFNATA 306

Query: 177 ---VKKV-----------IRTASTGVKYSVQNFEEIVP---FVIASVHQSLSKFIYMSPS 219
              V+++           ++T    +K+    F  ++    F I  +  +L K +++   
Sbjct: 307 SSLVRRLSDMETKGSSGGLKTQEREIKFHNPKFVSLLNHLRFYIPQILPNLEKVVFLDDD 366

Query: 220 VIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWVSGK--P 274
           V+V+  + +L  I+L    I A + C E   R + Y+N           SQP +S K  P
Sbjct: 367 VVVQKDLTQLFSIELHGNVIGAVETCLESFHRYHKYLN----------FSQPIISSKIDP 416

Query: 275 YAVNSCMPDLGMLLIDA---RKLEKYILEAFLWWKKVINQRDRSIGRSPAIALALYDRYL 331
           Y   +C    GM + D    RK     L  + W ++  +Q     G  PA  L  Y    
Sbjct: 417 Y---TCGWAFGMNIFDLIAWRKANATSLYHY-WQEQNADQLLWRTGTLPAGLLTFYGLME 472

Query: 332 KLSSSWLV------TDSTSSVVNKSLAIRYDGPM 359
            L   W +       D    ++  +  + Y+G M
Sbjct: 473 PLDRRWHLLGLGYDVDIDDRLIESAAVVHYNGNM 506


>gi|33152214|ref|NP_873567.1| glycosyl transferase family protein [Haemophilus ducreyi 35000HP]
 gi|6760462|gb|AAF28363.1|AF224467_2 putative glycosyl transferase [Haemophilus ducreyi]
 gi|33148436|gb|AAP95956.1| putative glycosyl transferase [Haemophilus ducreyi 35000HP]
          Length = 269

 Score = 38.9 bits (89), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 38/150 (25%), Positives = 63/150 (42%), Gaps = 37/150 (24%)

Query: 212 KFIYMSPSVIVKGRVEELIGIDLSNYAIAAADD-CSERLNSYVNPDVLDAIQRSASQPWV 270
           K IY+   ++V G + EL   D+SNY +AA  D  SE++  YVN  + +A          
Sbjct: 102 KVIYLDADIVVNGSLTELYQTDISNYFLAAVKDIISEKI--YVNNHIFNA---------- 149

Query: 271 SGKPYAVNSCMPDLGMLLIDARKLEKYILEAFLWWKKVINQRDRSIGRSP-----AIALA 325
                         GMLLI+ +K  ++ +  F      ++  ++ I   P      + L 
Sbjct: 150 --------------GMLLINNKKWREHNITQF-----CLSLSEKYINSLPDADQSILNLI 190

Query: 326 LYDRYLKLSSSWLVTDSTSSVVNKSLAIRY 355
             D++LKL+  +     T  +  K    RY
Sbjct: 191 FKDKWLKLNRGYNYLIGTDYLFFKYGKTRY 220


>gi|86438644|emb|CAJ26362.1| glycosyl transferase-like protein [Brachypodium sylvaticum]
          Length = 689

 Score = 38.9 bits (89), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 39/179 (21%), Positives = 71/179 (39%), Gaps = 26/179 (14%)

Query: 201 FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDV 257
           F +  +   L+K +++    +V+  +  L  IDL      A + C E   R + Y+N   
Sbjct: 498 FYLPEIFPKLNKVLFLDDDTVVQQDLSALWSIDLKGKVNGAVETCGETFHRFDKYLN--- 554

Query: 258 LDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEKY-ILEAFLWWKKVINQRDR-- 314
                   S P V+   +   +C    GM + D  +  K  I + +  W+K+    DR  
Sbjct: 555 -------FSNPIVANN-FHPQACGWAFGMNMFDLSEWRKQNITDVYHTWQKL--NEDRLL 604

Query: 315 -SIGRSPAIALALYDRYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPMTACSEFG 366
             +G  PA  +  ++R   L  SW +       +     + ++  I Y+G +    E G
Sbjct: 605 WKLGTLPAGLVTFWNRTFPLDRSWHLLGLGYNPNVNERDIRRASVIHYNGNLKPWLEIG 663


>gi|357145600|ref|XP_003573699.1| PREDICTED: probable galacturonosyltransferase 4-like isoform 2
           [Brachypodium distachyon]
          Length = 660

 Score = 38.9 bits (89), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 41/177 (23%), Positives = 72/177 (40%), Gaps = 22/177 (12%)

Query: 201 FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDV 257
           F +  ++  L K +++   V+VK  +  L  ID+      A + C E   R + Y+N   
Sbjct: 469 FYLPEIYPKLDKMVFLDDDVVVKKDLTGLWSIDMKGKVNGAVETCGESFHRFDRYLN--- 525

Query: 258 LDAIQRSASQPWVSGKPYAVNSCMPDLGMLLID-ARKLEKYILEAFLWWKKVINQRDR-S 315
                   S P V  K +  ++C    GM + D A    + I E +  W+K+ +      
Sbjct: 526 -------FSNP-VIAKNFDPHACGWAFGMNVFDLAEWRRQDITEIYHSWQKLSSGLLLWK 577

Query: 316 IGRSPAIALALYDRYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPMTACSEFG 366
           +G  P   +  +++   L+ SW V          S  + ++  I Y+G M    E G
Sbjct: 578 LGTLPPGLITFWNKTFPLNRSWHVLGLGYNPHVNSRDIERAAVIHYNGNMKPWLEIG 634


>gi|297606545|ref|NP_001058636.2| Os06g0727300 [Oryza sativa Japonica Group]
 gi|54291141|dbj|BAD61814.1| putative glycosyl transferase protein A [Oryza sativa Japonica
           Group]
 gi|125598568|gb|EAZ38348.1| hypothetical protein OsJ_22722 [Oryza sativa Japonica Group]
 gi|255677417|dbj|BAF20550.2| Os06g0727300 [Oryza sativa Japonica Group]
          Length = 601

 Score = 38.5 bits (88), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 33/146 (22%), Positives = 64/146 (43%), Gaps = 20/146 (13%)

Query: 201 FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDV 257
           F +  +H  L K +++   V+V+  +  L  +DL      A + C E   R N+Y+N   
Sbjct: 409 FYMPEIHPKLDKILFLDDDVVVQKDLTPLWDVDLKGIVNGAVETCKESFHRFNTYLN--- 465

Query: 258 LDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEKY-ILEAFLWWKKVINQRDRSI 316
                   S P +S + +  ++C    GM + D ++ +K  I   + +W+ +    DR +
Sbjct: 466 -------FSHPKIS-ENFDPHACGWAFGMNMFDLKEWKKQNITGIYHYWQDL--NEDRKL 515

Query: 317 GR---SPAIALALYDRYLKLSSSWLV 339
            +    P   +  Y+    L+ +W V
Sbjct: 516 WKLDTLPPGLITFYNLTYPLNRTWHV 541


>gi|224085587|ref|XP_002307628.1| glycosyltransferase, CAZy family GT8 [Populus trichocarpa]
 gi|222857077|gb|EEE94624.1| glycosyltransferase, CAZy family GT8 [Populus trichocarpa]
          Length = 605

 Score = 38.5 bits (88), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 58/269 (21%), Positives = 110/269 (40%), Gaps = 41/269 (15%)

Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKL------KVLFPHSNLEFHG----Q 174
           AV+VNST+S +  P+ + FH+     +   +S + L        +   S ++F G     
Sbjct: 329 AVVVNSTVSSAMEPEKIVFHIVTDSLNLPTISMWFLLNPPGKATIQIQSLVDFKGLSANY 388

Query: 175 EEVKKVIRTASTGVKYSVQNFEEIVPFVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDL 234
               K + +  +    ++ +      F +  V   L+K +     V+V+  +  L  +++
Sbjct: 389 NSTLKQLNSRDSRYTSALNHLR----FYLPDVFPQLNKIVLFDHDVVVQKDLAGLWSLNM 444

Query: 235 SNYAIAAADDCSE------RLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLL 288
               I A D C E      R++ ++N           S P+V  K +   +C    GM L
Sbjct: 445 KGKVIGAVDTCREGEPSFRRMDKFIN----------FSDPFVI-KRFDAKACTWAFGMNL 493

Query: 289 IDARKLEKYILEAFLWWKKVINQRDRSIGRSPAIALA---LYDRYLKLSSSW--LVTDST 343
            D ++  ++ L A L+ K +     R + ++ ++ L     Y+R + L   W  L     
Sbjct: 494 FDLQEWRRHKLTA-LYNKYLQLGHTRQLWKAGSLPLGWATFYNRTVILDRRWHKLGLGHE 552

Query: 344 SSV----VNKSLAIRYDGPMTACSEFGDG 368
           + V    V ++  + YDG M    + G G
Sbjct: 553 AGVGHDGVEQAAVLHYDGVMKPWLDIGIG 581


>gi|115480327|ref|NP_001063757.1| Os09g0531900 [Oryza sativa Japonica Group]
 gi|52075938|dbj|BAD46018.1| glycosyl transferase family 8 protein-like [Oryza sativa Japonica
           Group]
 gi|52077221|dbj|BAD46265.1| glycosyl transferase family 8 protein-like [Oryza sativa Japonica
           Group]
 gi|113631990|dbj|BAF25671.1| Os09g0531900 [Oryza sativa Japonica Group]
 gi|222641970|gb|EEE70102.1| hypothetical protein OsJ_30110 [Oryza sativa Japonica Group]
          Length = 695

 Score = 38.5 bits (88), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 58/285 (20%), Positives = 112/285 (39%), Gaps = 55/285 (19%)

Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPK---GSEDM----------------VSFYK------ 159
           +V+VNST+  ++ P+   FHL   K   G+ +M                V  +K      
Sbjct: 397 SVVVNSTIMNAKEPEKHVFHLVTDKLNFGAMNMWFLLNPPGKATIHVENVDEFKWLNSSY 456

Query: 160 ---LKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVP---FVIASVHQSLSKF 213
              L+ L   +  E++ + +    +   S+ +KY    +  ++    F +  V+  L K 
Sbjct: 457 CPVLRQLESAAMKEYYFKADRPTTLSAGSSNLKYRNPKYLSMLNHLRFYLPQVYPKLDKI 516

Query: 214 IYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWV 270
           +++   ++V+  +  L  +DL+     A + C E   R + Y+N           S P +
Sbjct: 517 LFLDDDIVVQKDLTGLWDVDLNGKVNGAVETCGESFHRFDKYLN----------FSNPHI 566

Query: 271 SGKPYAVNSCMPDLGMLLIDARKLEKYILEAFLWWKKVINQRDR---SIGRSPAIALALY 327
           + + +  N+C    GM + D ++ +K  +       + +N+ DR    +G  P   L  Y
Sbjct: 567 A-RNFDPNACGWAYGMNIFDLKEWKKKDITGIYHKWQSMNE-DRVLWKLGTLPPGLLTFY 624

Query: 328 DRYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPMTACSEFG 366
                L  SW V           S ++ +  + Y+G M    E  
Sbjct: 625 KLTHPLDKSWHVLGLGYNPSIDRSEIDNAAVVHYNGNMKPWLELA 669


>gi|90657589|gb|ABD96888.1| hypothetical protein [Cleome spinosa]
          Length = 351

 Score = 38.5 bits (88), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 50/240 (20%), Positives = 92/240 (38%), Gaps = 28/240 (11%)

Query: 95  RNSGSISSISQDDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFH-LFVPKGSED 153
           RN     +   D   +HI   +    LR     V S L  S  P+ L FH L +P    +
Sbjct: 53  RNGDQCGTGESDQ--IHIAMTLDTNYLRGTMAAVLSLLQHSTCPENLSFHFLSLPHFETE 110

Query: 154 MVSFYKLKVLFPHSNLEFHG------QEEVKKVIRTA-STGVKYSVQNFEEIVPFVIASV 206
           +  F  +K  FP+ + + +       +  + K IR A    + Y+     +I+P      
Sbjct: 111 L--FTSIKSTFPYLSFKIYQFDPNLVRSRISKSIRQALDQPLNYARIYLSDILP------ 162

Query: 207 HQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSAS 266
             ++ + IY+   ++V   +E+L  +++    +AA + C     +Y             S
Sbjct: 163 -SNVDRIIYLDSDLVVVDDIEKLWHVEMEEKVVAAPEYCHANFTNYFTDTFW-------S 214

Query: 267 QPWVSGKPYAVNSCMPDLGMLLIDARKLEK--YILEAFLWWKKVINQRDRSIGRSPAIAL 324
            P ++      + C  + G++++D  K  K  Y      W      +R   +G  P   L
Sbjct: 215 DPELAKVLEGRHPCYFNTGVMVVDVEKWRKGGYTQRVEEWMAVQKQKRIYHLGSLPPFLL 274


>gi|449436453|ref|XP_004136007.1| PREDICTED: probable galacturonosyltransferase-like 7-like [Cucumis
           sativus]
 gi|449505333|ref|XP_004162438.1| PREDICTED: probable galacturonosyltransferase-like 7-like [Cucumis
           sativus]
          Length = 367

 Score = 38.5 bits (88), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 53/267 (19%), Positives = 96/267 (35%), Gaps = 17/267 (6%)

Query: 106 DDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFP 165
           D  +VH+   +    LR     VNS L  S  P+ + FH  V + + + V    ++  FP
Sbjct: 86  DPSLVHVAITLDVEYLRGSIAAVNSILQHSLCPESVFFHFLVSETNLEAV----VRSAFP 141

Query: 166 HSNLE--FHGQEEVKKVIRTASTGVKYSVQNFEEIVPFVIASVHQSLSKFIYMSPSVIVK 223
               +  +     V+ +I T+         N+     ++   +   + + IY+   ++V 
Sbjct: 142 QLKFKVYYFNPAIVQNLISTSVRQALEEPLNYAR--NYLAELLEPCVRRVIYLDSDLVVV 199

Query: 224 GRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPD 283
             + +L   +L +  I A + C      Y        + +  S  ++  KP   NS    
Sbjct: 200 DDISKLWSTNLGSKTIGAPEYCHANFTKYFTSRFW--LDKRFSGTFLGRKPCYFNS---- 253

Query: 284 LGMLLIDARKLEK--YILEAFLWWKKVINQRDRSIGRSPAIALALYDRYLKLSSSWLVTD 341
            G+++ID  K  +  Y      W +   N R   +G  P   L        +   W    
Sbjct: 254 -GVMVIDLAKWRRAGYTKRIERWMEIQKNNRIYELGSLPPFLLVFAGDVSPIEHRWNQHG 312

Query: 342 STSSVVNKSLAIRYDGPMTACSEFGDG 368
                V  S    + GP++     G G
Sbjct: 313 LGGDNVKGSCRNLHAGPVSLLHWSGSG 339


>gi|218202507|gb|EEC84934.1| hypothetical protein OsI_32147 [Oryza sativa Indica Group]
          Length = 695

 Score = 38.5 bits (88), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 58/285 (20%), Positives = 112/285 (39%), Gaps = 55/285 (19%)

Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPK---GSEDM----------------VSFYK------ 159
           +V+VNST+  ++ P+   FHL   K   G+ +M                V  +K      
Sbjct: 397 SVVVNSTIMNAKEPEKHVFHLVTDKLNFGAMNMWFLLNPPGKATIHVENVDEFKWLNSSY 456

Query: 160 ---LKVLFPHSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVP---FVIASVHQSLSKF 213
              L+ L   +  E++ + +    +   S+ +KY    +  ++    F +  V+  L K 
Sbjct: 457 CPVLRQLESAAMKEYYFKADRPTTLSAGSSNLKYRNPKYLSMLNHLRFYLPQVYPKLDKI 516

Query: 214 IYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDVLDAIQRSASQPWV 270
           +++   ++V+  +  L  +DL+     A + C E   R + Y+N           S P +
Sbjct: 517 LFLDDDIVVQKDLTGLWDVDLNGKVNGAVETCGESFHRFDKYLN----------FSNPHI 566

Query: 271 SGKPYAVNSCMPDLGMLLIDARKLEKYILEAFLWWKKVINQRDR---SIGRSPAIALALY 327
           + + +  N+C    GM + D ++ +K  +       + +N+ DR    +G  P   L  Y
Sbjct: 567 A-RNFDPNACGWAYGMNIFDLKEWKKKDITGIYHKWQSMNE-DRVLWKLGTLPPGLLTFY 624

Query: 328 DRYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPMTACSEFG 366
                L  SW V           S ++ +  + Y+G M    E  
Sbjct: 625 KLTHPLDKSWHVLGLGYNPSIDRSEIDNAAVVHYNGNMKPWLELA 669


>gi|338732825|ref|YP_004671298.1| hypothetical protein SNE_A09300 [Simkania negevensis Z]
 gi|336482208|emb|CCB88807.1| hypothetical protein SNE_A09300 [Simkania negevensis Z]
          Length = 699

 Score = 38.1 bits (87), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 50/190 (26%), Positives = 82/190 (43%), Gaps = 34/190 (17%)

Query: 113 VSW-MQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKG-----SEDMVSFYKLKVL-FP 165
           V W ++CL   + A+ + + L+G        FH F  K      S +M S Y L++L FP
Sbjct: 213 VDWKIECLGFLIFAIALGAALNG--------FHSFFLKKISHKISGNMTSRYLLQLLKFP 264

Query: 166 HSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVPFVIA---SVHQSLSKFIY--MSPSV 220
              L F GQ + +++        K+S + +EEIVP  I    S+  +   F Y  +  SV
Sbjct: 265 ---LTFFGQRDPREIADRIKEIGKFSQRYYEEIVPTFIQLTLSIFYAAIMFSYSPLVASV 321

Query: 221 IVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVL------DAIQRSASQ-----PW 269
            + G    L+ +   + +  A  D  ER  +  +   L      DAI+   S+      W
Sbjct: 322 TIFGAFLSLLVLQTVSRSRLALYDTWERARNQSDEASLEQMHSIDAIKAVGSEQFFFHKW 381

Query: 270 VSGKPYAVNS 279
           +    Y +N+
Sbjct: 382 IKHYTYLLNT 391


>gi|423295529|ref|ZP_17273656.1| hypothetical protein HMPREF1070_02321 [Bacteroides ovatus
           CL03T12C18]
 gi|392672453|gb|EIY65921.1| hypothetical protein HMPREF1070_02321 [Bacteroides ovatus
           CL03T12C18]
          Length = 324

 Score = 38.1 bits (87), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 22/36 (61%)

Query: 209 SLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADD 244
           +L K +Y+   +IV GR+E L  IDL  Y I A +D
Sbjct: 105 TLDKILYLDCDIIVNGRIESLWNIDLKYYTIGAVED 140


>gi|156066430|gb|ABU43074.1| homogalacturonan alpha-1,4-galacturonosyltransferase [Gossypium
           barbadense]
          Length = 421

 Score = 37.7 bits (86), Expect = 9.0,   Method: Compositional matrix adjust.
 Identities = 58/276 (21%), Positives = 110/276 (39%), Gaps = 30/276 (10%)

Query: 125 AVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNLEFHGQE-EVKKVIRT 183
           +V+VNST+S ++ P+ + FH+     +   +S + L  L P      H Q  E    + T
Sbjct: 145 SVVVNSTISSAKEPEKIVFHVVTDSLNLPAISMWFL--LNPPGKATIHVQSIENFDWLST 202

Query: 184 ASTGVKYSVQNFE-------EIVPFVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSN 236
                    ++++         + F +  +  +L+K +     V+V+  + E+  ID+  
Sbjct: 203 KYNSTLNEQKSYDPRYSSALNHLRFYLPDIFPALNKIVLFDHDVVVQRDLTEIWSIDMKG 262

Query: 237 YAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEK 296
               A + C E   S+ +  +      + S P+++ + +  N C    GM L D  +  +
Sbjct: 263 KVNGAVETCLESEASFRSIQMF----MNFSDPFLARR-FNANVCTWAFGMNLFDLHEWRR 317

Query: 297 YILEAFLWWKKVINQRDRSI---GRSPAIALALYDRYLKLSSSWLV------TDSTSSVV 347
             L   L+   +     RS+   G  P   +  Y++ + L   W        +D     +
Sbjct: 318 KNL-TMLYRNYLQLGLKRSLWKGGSLPIGWITFYNQTVALEKRWHTLGLGYNSDVPPGDI 376

Query: 348 NKSLAIRYDGPMTACSEFGDGANMEPARGDLWKQHL 383
             +  I YDG M    E G    +   +G  W +HL
Sbjct: 377 ENAAVIHYDGVMKPWLETG----IAKYKG-YWSKHL 407


>gi|15235247|ref|NP_192122.1| putative galacturonosyltransferase-like 6 [Arabidopsis thaliana]
 gi|30679061|ref|NP_849285.1| putative galacturonosyltransferase-like 6 [Arabidopsis thaliana]
 gi|79324977|ref|NP_001031573.1| putative galacturonosyltransferase-like 6 [Arabidopsis thaliana]
 gi|75317781|sp|O04253.1|GATL6_ARATH RecName: Full=Probable galacturonosyltransferase-like 6; AltName:
           Full=Like glycosyl transferase 10
 gi|2104536|gb|AAC78704.1| predicted glycosyl transferase [Arabidopsis thaliana]
 gi|7268597|emb|CAB80706.1| predicted glycosyl transferase [Arabidopsis thaliana]
 gi|24030376|gb|AAN41350.1| putative glycosyl transferase [Arabidopsis thaliana]
 gi|26453088|dbj|BAC43620.1| putative glycosyl transferase [Arabidopsis thaliana]
 gi|110738563|dbj|BAF01207.1| glycosyl transferase like protein [Arabidopsis thaliana]
 gi|332656727|gb|AEE82127.1| putative galacturonosyltransferase-like 6 [Arabidopsis thaliana]
 gi|332656728|gb|AEE82128.1| putative galacturonosyltransferase-like 6 [Arabidopsis thaliana]
 gi|332656729|gb|AEE82129.1| putative galacturonosyltransferase-like 6 [Arabidopsis thaliana]
          Length = 346

 Score = 37.7 bits (86), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 58/285 (20%), Positives = 107/285 (37%), Gaps = 22/285 (7%)

Query: 91  DSGDR--NSGSISSISQDDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVP 148
           ++GD   +SG + + S    +VH+   +    LR     VNS L  S  P+ + FH F+ 
Sbjct: 49  NNGDECLSSGGVCNPS----LVHVAITLDVEYLRGSIAAVNSILQHSVCPESVFFH-FIA 103

Query: 149 KGSEDMVSFYKLKVLFPHSNLEFH--GQEEVKKVIRTASTGVKYSV-QNFEEIVPFVIAS 205
              E  +    ++ +FP      +    E V+ +I   S+ V+ ++ Q       ++   
Sbjct: 104 VSEETNLLESLVRSVFPRLKFNIYDFAPETVRGLI---SSSVRQALEQPLNYARSYLADL 160

Query: 206 VHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSERLNSYVNPDVLDAIQRSA 265
           +   +++ IY+   ++V   + +L    L +  I A + C      Y             
Sbjct: 161 LEPCVNRVIYLDSDLVVVDDIAKLWKTSLGSRIIGAPEYCHANFTKYFTGGFW------- 213

Query: 266 SQPWVSGKPYAVNSCMPDLGMLLIDARKLEK--YILEAFLWWKKVINQRDRSIGRSPAIA 323
           S+   SG       C  + G+++ID +K  +  Y      W +    +R   +G  P   
Sbjct: 214 SEERFSGTFRGRKPCYFNTGVMVIDLKKWRRGGYTKRIEKWMEIQRRERIYELGSLPPFL 273

Query: 324 LALYDRYLKLSSSWLVTDSTSSVVNKSLAIRYDGPMTACSEFGDG 368
           L        +S  W         V  S    + GP++     G G
Sbjct: 274 LVFSGHVAPISHRWNQHGLGGDNVRGSCRDLHPGPVSLLHWSGSG 318


>gi|326532202|dbj|BAK01477.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 697

 Score = 37.7 bits (86), Expect = 9.3,   Method: Compositional matrix adjust.
 Identities = 38/179 (21%), Positives = 71/179 (39%), Gaps = 26/179 (14%)

Query: 201 FVIASVHQSLSKFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCSE---RLNSYVNPDV 257
           F +  +   L+K +++    +V+  +  L  IDL      A + C E   R + Y+N   
Sbjct: 506 FYLPEIFPKLNKVLFLDDDTVVQQDLSALWSIDLKGKVNGAVETCGESFHRFDKYLN--- 562

Query: 258 LDAIQRSASQPWVSGKPYAVNSCMPDLGMLLIDARKLEKY-ILEAFLWWKKVINQRDR-- 314
                   S P ++   +  +SC    GM + D  +  K  I + +  W+ +    DR  
Sbjct: 563 -------FSNPLIASN-FNPHSCGWAYGMNMFDLSEWRKQNITDVYHTWQNL--NEDRLL 612

Query: 315 -SIGRSPAIALALYDRYLKLSSSWLVT------DSTSSVVNKSLAIRYDGPMTACSEFG 366
             +G  PA  +  ++R   L  SW +       +     + ++  I Y+G +    E G
Sbjct: 613 WKLGSLPAGLVTFWNRTFPLDRSWHLLGLGYNPNVNEKEIRRASVIHYNGNLKPWLEIG 671


>gi|387133452|ref|YP_006299424.1| glycosyltransferase family protein [Prevotella intermedia 17]
 gi|386376300|gb|AFJ09183.1| glycosyltransferase family 8 [Prevotella intermedia 17]
          Length = 328

 Score = 37.7 bits (86), Expect = 9.5,   Method: Compositional matrix adjust.
 Identities = 25/99 (25%), Positives = 41/99 (41%), Gaps = 18/99 (18%)

Query: 166 HSNLEFHGQEEVKKVIRTASTGVKYSVQNFEEIVPFVIASVHQSLS-------------- 211
           H  L  +GQ+ +  ++      ++Y   + + +  F IA  H +LS              
Sbjct: 22  HHGLSQNGQQLISNMVLRYGAKIRYYDVDVDRLSSFSIAEYHPNLSIATYFRFFLPSFLD 81

Query: 212 ----KFIYMSPSVIVKGRVEELIGIDLSNYAIAAADDCS 246
               K +Y+   VIV   V EL  IDL+ Y +A   D +
Sbjct: 82  SSIKKVLYLDCDVIVLKDVSELFHIDLAGYGVAGVKDVT 120


>gi|224110116|ref|XP_002315420.1| glycosyltransferase, CAZy family GT8 [Populus trichocarpa]
 gi|222864460|gb|EEF01591.1| glycosyltransferase, CAZy family GT8 [Populus trichocarpa]
          Length = 348

 Score = 37.7 bits (86), Expect = 9.5,   Method: Compositional matrix adjust.
 Identities = 50/238 (21%), Positives = 95/238 (39%), Gaps = 27/238 (11%)

Query: 110 VHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFPHSNL 169
           +HIV  +    LR     + S L  S  P+ + FH    +   ++  F  +K  FP+ N 
Sbjct: 64  IHIVMTLDANYLRGTMAAIFSILRHSTCPENMEFHFLWARFDREV--FSSIKSTFPYLNF 121

Query: 170 EFHG------QEEVKKVIRTA-STGVKYSVQNFEEIVPFVIASVHQSLSKFIYMSPSVIV 222
           +F+       + ++ K IR +    + Y+     +I+P        ++ + IY+   +++
Sbjct: 122 KFYRFDSNRVRGKISKSIRQSLDQPLNYARIYLADIIP-------SNVKRVIYLDSDLLL 174

Query: 223 KGRVEELIGIDLSNYAIAAADDCSERLNSYV-NPDVLDAIQRSASQPWVSGKPYAVNSCM 281
              + +L  +DL +  +AA + C      Y  N   LD        P ++   +    C 
Sbjct: 175 VDDIAKLWEVDLEDRVLAAPEYCHANFTYYFSNLFWLD--------PVLARTFHGRRPCY 226

Query: 282 PDLGMLLIDARKLEKYIL-EAFLWWKKVINQ-RDRSIGRSPAIALALYDRYLKLSSSW 337
            + G++++D  K  +  L +    W  V  Q R   +G  P   L L      +   W
Sbjct: 227 FNTGVMVVDVEKWRQVQLTQKVEGWMTVQKQKRIYHLGSLPPFLLVLAGNIKGVDHRW 284


>gi|12003394|gb|AAG43554.1|AF211536_1 Avr9/Cf-9 rapidly elicited protein 231 [Nicotiana tabacum]
          Length = 353

 Score = 37.7 bits (86), Expect = 9.6,   Method: Compositional matrix adjust.
 Identities = 49/267 (18%), Positives = 103/267 (38%), Gaps = 12/267 (4%)

Query: 106 DDGMVHIVSWMQCLDLRLLAVLVNSTLSGSRYPDLLHFHLFVPKGSEDMVSFYKLKVLFP 165
           D  ++HI   +     R     V+S L  +  P+ ++FH    K  +       +  +FP
Sbjct: 65  DPNLIHIAMTLDSHYFRGSIAAVHSVLKHTSCPENIYFHFVTSKDFDFQQLTQTVMSIFP 124

Query: 166 HSNLEFHGQEEVKKVIRTASTGVKYSVQN-FEEIVPFVIASVHQSLSKFIYMSPSVIVKG 224
             + + +  +E+ +V    S+ ++ ++ N       ++   +   + + IY+   VI+  
Sbjct: 125 SLSFKVYSFDEL-RVKNLISSSIRQALDNPLNYARTYLAEIIEHCVERVIYLDSDVILVD 183

Query: 225 RVEELIGIDLS-NYAIAAADDCSERLNSYVNPDVLDAIQRSASQPWVSGKPYAVNSCMPD 283
            +++L  I L+ +  I A + C     +Y N +     +   S+ +   KP     C  +
Sbjct: 184 DIQKLWSISLTGSRIIGAPEYCHANFRTYFNDNFWSDTK--FSKVFQGKKP-----CYFN 236

Query: 284 LGMLLIDARKLEK--YILEAFLWWKKVINQRDRSIGRSPAIALALYDRYLKLSSSWLVTD 341
            G++++D  K  K  Y  +   W +    +R   +G  P   L        +   W    
Sbjct: 237 TGVMVMDLGKWRKGDYTEKIENWMEIQKKKRIYELGSLPPFMLVFGGEIEGIDHKWNQHG 296

Query: 342 STSSVVNKSLAIRYDGPMTACSEFGDG 368
                +  S    + GP++     G G
Sbjct: 297 LGGDNLVNSCRTLHPGPVSLLHWSGKG 323


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.320    0.134    0.403 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,392,448,583
Number of Sequences: 23463169
Number of extensions: 267765726
Number of successful extensions: 581666
Number of sequences better than 100.0: 180
Number of HSP's better than 100.0 without gapping: 27
Number of HSP's successfully gapped in prelim test: 153
Number of HSP's that attempted gapping in prelim test: 581496
Number of HSP's gapped (non-prelim): 224
length of query: 393
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 249
effective length of database: 8,980,499,031
effective search space: 2236144258719
effective search space used: 2236144258719
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 78 (34.7 bits)